5,018 Matching Annotations
  1. Feb 2024
    1. Authors’ response (11 February 2024)

      GENERAL ASSESSMENT

      Ionotropic glutamate receptors mediate the large majority of excitatory synaptic transmission in the brain. These receptors consist of four classes: AMPA, kainate, NMDA and delta receptors. NMDA receptors are obligate tetramers composed of two GluN1 and two GluN2 (or GluN3) subunits. Compared to other iGluRs, they have the particularity of requiring two different agonists for their channel to open: glycine binding on GluN1 and glutamate on GluN2.

      Seljeset et al. investigate the molecular determinants controlling ligand potency and NMDAR activity at the level of the ligand-binding domains (LBDs), where the agonists bind. They identify a specific position, D732, whose mutation to either leucine or phenylalanine leads to a constitutively active GluN1 subunit, and thus to NMDARs activated solely by glutamate. This aspartate is well known in the field, since it is a highly conserved, signature residue in iGluRs that binds amino acid ligands, together with an arginine in the LBD upper lobe. Surprisingly, although glycine cannot further activate GluN1-D732L/GluN2Awt receptors, glycine site antagonists like 5,7-DCKA or CGP-78608 can still bind and inhibit NMDAR activity. This study is therefore very intriguing, as it raises new questions about something that was previously thought to be understood. By using a combination of unnatural amino acids and conventional mutagenesis, the authors propose that D732 contributes to glycine-mediated effects by changing local interactions with nearby residues. In addition, they show that this behavior is specific for the GluN1 subunit, since mutation of the equivalent aspartate in the GluN2 subunit does not yield constitutively activated GluN2 subunits. Finally, the authors identify a homomeric iGluR from the placozoan Trichoplax adhaerens, Trichoplax AKDF<sup>19383</sup>, in which this conserved aspartate is replaced by a tyrosine. When expressed in Xenopus oocytes, the channel shows constitutive activity. Mutation of the tyrosine into an aspartate, to convert Trichoplax AKDF<sup>19383</sup> into a “classical” iGluR, decreases Trichoplax AKDF<sup>19383</sup> constitutive current and allows this channel to be activated by glycine and D-serine. Interestingly, an adjacent residue that is a serine in most mammalian subunits is also a tyrosine in Trichoplax AKDF<sup>19383</sup>, and mutation of both tyrosines yields a glutamate-gated ion channel comparable to mammalian receptors. All of this suggests that the nature of the residue at position 732 influences not only ligand binding but also channel gating.

      The study is technically sound, with appropriate controls, and uncovers intriguing properties of a position in GluN1 LBD at which specific side chain mutations can lock the subunit in an active state. Investigation of Trichoplast iGluR further reinforces these findings. This study should lead to a better understanding of how LBDs prime channel opening in iGluRs in the absence of agonists. In addition, co-agonist insensitive GluN1-D732L containing NMDARs could be used as tools to investigate the physiological consequences of NMDAR regulation by their co-agonist site. In contrast to previously engineered NMDARs activated solely by glutamate, which rely on the LBD being locked in its active state by cysteine bridges (Blanke and VanDongen, J Biol Chem 2008), GluN1-D37L/GluN2A NMDARs remain druggable (i.e. they can still be inhibited by glycine-site competitive antagonists). This is a great advantage when investigating the function of these receptors in a native context. The study identifies a few gaps that remain in our mechanistic understanding of D732’s role in channel gating. Particularly, it is unclear how subtle modification of residue side chains at position D732 lead to such drastic changes in function and why these effects are specific to GluN1 LBD. Also, why does mutation of D732 into isoleucine lead to a constitutively active GluN1 subunit, while mutation of a closely related leucine residue prevents activation of the receptor by glycine? The idea of a “hydrophobic plug” formed by D732L or D732F sidechains leading to constitutive activation would benefit from further validation since other hydrophobic substitutions (A, V, I, Y, and W) do not produce similar effects. Finally, it would be interesting to carry out further investigations of the role of the interaction between D732 and Q536 in open conformation stability. Thus, this paper puts forth interesting questions that could be addressed by future studies, for example molecular dynamics simulations and exploration of the LBD free energy landscapes (as in Yao et al., Structure 2013), to understand the impact of the GluN1-D732L mutation on GluN1 LBD conformational mobility.

      RECOMMENDATIONS

      Essential revisions:

      1. Page 2, “These data show that essentially all substitutions at the GluN1-732 position decrease glycine potency, but leucine and phenylalanine substitutions also remove the requirement for glycine co-agonism in GluN1/GluN2A NMDA receptors”: One other hypothesis for the lack of glycine dependence of GluN1-D732I and D732Y + GluN2A receptors could be that the mutated receptors have a glycine potency so high that GluN1 LBD is already saturated by contaminating, ambient glycine. At this point in the paper, the authors cannot distinguish between one hypothesis or the other, therefore we suggest that this sentence be rephrased. Later in the text, control experiments with GluN1-R523K mutations that kill glycine binding and competition with 5,7-DCKA show that glycine-independent activation of GluN1-D732L/GluN2A mutants is not due to constitutive occupancy of GluN1 LBD by contaminating glycine.

      ER1) We have now changed this to (page 4): “These data show that most substitutions at the GluN1-732 position decrease glycine potency, but leucine and phenylalanine substitutions alter GluN1 activity in such a way that leads to single-mutant NMDA receptors activated solely by glutamate.”

      1. Does glycine insensitivity in GluN1-D732L/GluN2A NMDARs reflect a constitutively active GluN1 subunit or is this subunit locked in another conformational state that cannot be further modified by glycine? This could be answered by estimating the maximum open probability of GluN1-D732L/GluN2A NMDARs compared to their wt counterparts. To estimate Po, the authors could measure the kinetics of NMDA receptor current inhibition by MK801 (the slower MK801 inhibition, the lower the Po; see Chen et al., J. Neurosci 1999; Blanke and VanDongen, JBC 2008) in the presence of saturating agonist concentrations (100 μM Glu, 100 μM Gly for wt and only 100 μM Glu for mutant).

      ER2) We have now assessed the rate of MK-801 block in glutamate-gated mutant and glycine + glutamate-gated WT receptors, and reshuffled text/figures, as this ties in well with ER4) below. MK-801 results now in Figure 3 on page 6, and main text on page 5: “In order to understand whether the glycine-insensitive GluN1-D732L subunit is in a constantly activated state or occupies a different conformation that may reflect an alternative to typical channel gating, we compared the kinetics of WT receptor and GluN1-D732L-containing receptor inhibition by the open-channel blocker MK-801, which can be used to evaluate maximum open probability of NMDARs <sup>26,30</sup>. We observed very similar kinetics of inhibition of WT and mutant receptors (Fig. 3A), indicating similar open probability in solely glutamate-gated GluN1-D732L-containing receptors and glutamate and glycine-gated WT receptors. This reflects unchanged maximum open probability in solely glutamate-gated NMDARs with disulfide-locked GluN1 LBDs assayed by single channel recordings <sup>27</sup>. This suggests that the GluN1-D732L subunit is in a constantly activated state.”

      When viewed alongside high sensitivity of mutant subunits to DCKA - OS1) below - it’s difficult to conclude what sort of active state the mutant subunit adopts. We’ve assessed the best we can at the moment, and in this paper we’ll have to leave it at “here is the observation; here is some evidence ruling out various possibilities; and here is a receptor from another family that shows something remarkably consistent”. Future studies will have to establish exactly what state the mutant subunit adopts.

      1. Page 4: The term “hydrophobic plug” is not fully justified since other hydrophobic residues do not lock GluN1 LBD in its active state.

      ER3) We have replaced nearly all use of this term, in the title and in the main text, to e.g. “certain hydrophobic substitutions” or “L/F substitutions”.

      1. Figure 2, redox sensitivity of GluN1-D732L/GluN2Awt: It would be helpful to explain the point of this experiment – maybe to investigate if the D732L mutation has an impact on the receptor gate rather than on the LBD? In any case, the authors should investigate the effect of DTT on the activity of wt GluN1/GluN2A receptors to determine whether there is an absence of an effect of the D732L mutant on redox sensitivity.

      ER4) Indeed we were curious if D732L affected the gate via this allosteric route, rather than by just altering LBD conformation. And we have now shown the effect of DTT on WT receptors.

      In addition to re-writing to better explain the point, as suggested, we have also re-written to follow on from new data/text on the whether the D732L mutation affects LBD, gating, etc: “We next questioned if D732L/F substitutions affect channel gating, rather than simply altering the LBD conformation. The gating machinery is complex, but it includes the peptide segment linking the C-terminal end of the LBD to membrane-spanning helix 4 (LBD-M4 linker, (11)). The LBD and LBD-M4 linker are confined by a C744—C798 disulfide, just four helical turns after D732, whose disruption by reduction enhances channel gating (28)). We considered that if the D732L/F substitution is coupled to channel gating via this route, then removal of the C744—C798 disulfide via the C744A mutation might alter glutamate-gated currents in GluN1-D732L-containing receptors. Alternatively, the typical enhancement by the reducing agent dithiothreitol (DTT) might differ in GluN1-D732L compared to WT receptors.”

      And new Figure 3 now includes DTT effects on WT receptors.

      1. Page 6: The authors find that mutation of Q536 decreases glycine potency and conclude there is an interaction between D732 and Q536. However, the effects of D732 and Q536 mutations could be independent, therefore the authors should consider mutating both residues together to look at the additive/non-additive effects of the mutations. Or perhaps, note in the Discussion that some sort of mutant cycle analysis or molecular dynamics simulation would be needed to rigorously test these ideas.

      ER5) We have now made and tested a double mutant combining D732E and Q536N and performed mutant cycle analysis.

      (We also tried to do this for Q536 side chain (regular mutations) and A734 main chain (non-canonical substitutions), but double mutants involving non-canonical amino acids at A734 were not successful – Figure S1.)

      As is now shown in Figure 4D, the effects of the mutations are decidedly non-additive, yielding an Ω value of 0.05, corresponding to a reasonably high energetic coupling of ~7 kJ/mol. We have now added to the relevant section of the Results on page 8: “If an interaction between Q536 and D732 were energetically important for receptor activation, the effects of their mutations should be non-additive <sup>31</sup>. We therefore tested glycine potency at double-mutant GluN1-Q536N/D732E-containing receptors and observed non-additive changes in EC<sub>50</sub>, with a strong coupling value, Ω, of 0.05 (Fig. 4D). This deviation of Ω from unity, corresponding to an interaction energy of 7.4 kJ/mol is relatively high <sup>31</sup>, confirming that Q536 and D732 are energetically coupled. We tried to analyse energetic coupling between Q536 and A734 via double mutants incorporating nonsense suppression at the A734 position, but unfortunately, attempts to incorporate Aah into such double mutants via nonsense suppression were unsuccessful (Fig. S1B).”

      1. Page 6, “A hydrophobic plug does not cause constitutive activity in all NMDA receptor subtypes”: This title is misleading as it raises the expectation that the effect of GluN1-D732L has been investigated in the context of GluN1/GluN2A, GluN1/GluN2B, etc NMDARs. Instead, the equivalent mutation is made in the GluN2 subunit. We suggest using the word “subunit” rather than “subtype”.

      ER6) We have changed this Results section title (page 8) to: “L/F substitutions do not cause constitutive activity in all NMDA receptor subunits”

      1. Page 7, effect of GluN1-D732L in the context of GluN1/GluN3 NMDARs: We would not expect current to be observed with GluN1-D732L/GluN3 NMDARs, since locking GluN1 LBD in its active state desensitizes the receptors. The effect of the D732L mutation seems therefore conserved between GluN1/GluN2 and GluN1/GluN3 NMDARs. In addition, when using CGP, please cite Grand et al., Nat. Commun. 2018 since they were the first to use CGP as a tool to record GluN1/GluN3 currents.

      ER7) We have now cited that paper specifically here (page 8) and inserted the following (page 8/9): “While this seems like inactivity of the mutant GluN1 subunit in GluN1(4a)/GluN3A, it could yet reflect the activity of constitutively active mutant GluN1 subunits in GluN1/GluN2A receptors, as GluN1 activity in GluN1/GluN3A receptors is known to cause more desensitization than activation (Grand et al 2018).”

      1. Figure 5C: It is stated in the text that the aspartate position is “highly” conserved. However, no actual number or percentages are given for this statement. How does it compare to the residues in the highly conserved SYTANLAAF motif or other conserved positions? This sort of analysis does not need to be done for the entire receptor, but perhaps for glycine and glutamate binding residues and SYTANLAAF motif, to give a quantitative feel for statements about conservation. In addition, what other types of residues occupy this position in other species? And what was the number of species/subunits included in the analysis?

      ER8) To clarify the level of conservation, we have added Table 1 (page 10) listing the % conservation of amino acids at selected positions.

      In analyzing % conservation, we noticed that several iGluR sequences with gaps in the ligand-binding domain or channel-forming helices had escaped our filtering out incomplete sequences in our phylogenetic analysis. We therefore revisited our phylogenetic analysis, removed several incomplete sequences, and replaced Crassostrea gigas (a mollusc spiralian) iGluR sequences with Schmidtea mediterranea (a flatworm spiralian) sequences. This (1) means less sequences with gaps in the ligand-binding domain in our alignment/tree and (2) better covers the diversity of the lineage Spiralia now that we have sequences of Lingula anatina and Schmidtea mediterranea, which are more distantly related than Lingula anatina and Crassosttrea gigas (Laumer et al 2019, PMID:31690235; Marlétaz et al., 2019, PMID:30639106).

      The result is a phylogenetic and amino acid sequence analysis of 204 iGluR genes (previous version had 212 genes) with the same overall topology as the previous version, including lambda, NDMA, epsilon, and AKDF iGluR families (Fig. 5B, page 9).

      The number of subunits/genes used is stated in the Figure legend. The number of and reasoning behind the number of species used is described under Methods, Bioinformatic analyses: in exploring the conservation of the D732 residue, we have not tried to use as many iGluR sequences as possible; rather we have tried to assess this residue in a broad sample covering all (animal) iGluR families and from a careful selection of different animal lineages, while also avoiding fast-evolving species like Drosophila, which complicate tree topology. Hence our description of “two ctenophores, one poriferan, etc” under Methods, Bioinformatic analyses. In the main text (Results, page 9), we retain our original description: “We assembled diverse iGluR sequences, covering all animal lineages and animal iGluR families (Fig. 6A,B)…”

      1. Figure 5, panel F: From what we understand, the authors created dose-response curves for wt Trichoplast AKDF<sup>193863</sup> based on steady-state currents and for Y742D/Y743S mutants based on peak currents. If this is the case, one cannot compare the two dose-response curves since peak current potentiation and steady-state inhibition likely reflect different conformational transitions.

      ER9) We acknowledge this issue and that we can’t really say that ligand-activated D742 channels bind D-serine better than ligand-deactivated Y742 channels. But we think it’s fair to point out that mutant D742 channels react (by conducting current) to micromolar ligand concentrations whereas wildtype Y742 channels react (with decreased current) only to millimolar concentrations, and we have re-written to acknowledge the issue raised for this comparison (page 11): “Finally, we tried to assess whether position 742 determines ligand potency in addition to channel activity in AKDF<sup>19383</sup> receptors. For these experiments we employed D-serine, as recovery from glycine-induced deactivation (Fig. 6C, far-left) and activation/desensitization (Fig. 6C, far-right) was very slow. Substantial deactivation of WT receptors was only induced by millimolar D-serine concentrations, whereas Y742D-containing mutants were activated by micromolar concentrations (Fig. 6D,E), with an EC<sub>50</sub> of 490 ± 120 µM at Y742D/Y743S (n = 4; Y742D EC<sub>50</sub> not assessed due to slow recovery from desensitization). Our measure of potency is confounded by the fact that deactivation (in WT channels) and activation (in mutant channels) are presumably coupled to D-serine binding via different conformational transitions. Nonetheless, we observe that a naturally occurring large hydrophobic side chain at the top of the β-strand preceding the αI helix leads to an AKDF homo-tetramer that shows constitutive activity and responds only to millimolar concentrations of D-serine. In contrast, “re-introducing” an aspartate to this position reinstates more typical ligand-dependent activation and sensitivity to micromolar concentrations of D-serine.”

      Optional suggestions:

      1. Figure 2, glycine/DCKA competition: It is difficult to understand how a GluN1 LBD-locked closed (active state) could still bind DCKA. If the open-to-close equilibrium of GluN1 LBD is displaced towards its closed state, then DCKA Ki should be shifted to the right compared to wt receptors. Additionally, DCKA inhibition kinetics should be slower if DCKA needs to “wait” for rare resting-like conformational changes to bind. Did the authors investigate DCKA potency and inhibition kinetics?

      OS1) We have now investigated DCKA potency. DCKA capably inhibits GluN1-D732L/GluN2A-WT activity, and perhaps surprisingly, potency of DCKA at the mutant is greater than at wildtype. We suspect this is due to (1) the introduction of a hydrophobic leucine residue right next to an aryl group of DCKA, increasing DCKA affinity directly, (2) the absence of glycine binding to this site, so no need for competition, and (3) potentially other mechanisms such as cooperativity between subunits. Again, establishing the precise nature of our mutant LBD conformation here is for future structural and molecular dynamics studies. But we have described the results, along with our following interpretation, (page 4): “Whether increased DCKA potency in GluN1-D732L subunits derives from the now non-competitive nature of the inhibition in mutant receptors or from the introduction of a favourable hydrophobic interaction with the dichlorobenzene moiety of the inhibitor is unclear. But the high DCKA potency would suggest that the constitutively active GluN1-D732L subunit is, unexpectedly, not due to a permanently clamshell-closed LBD in the mutant. This may reflect the fact that extent of LBD closure is poorly correlated with agonist efficacy in GluN1 subunits, in contrast to AMPA receptor GluA2 subunits <sup>21</sup>.”

      1. The authors show in many panels that GluN1/GluN2A currents desensitize (e.g. Fig.1B, 3C, 4A). In Xenopus oocytes, NMDAR currents do not normally desensitize. We fear this desensitization might stem from contamination of the NMDA current by calcium-activated chloride channels, which can be activated by high quantities of barium when large NMDAR currents are measured. To avoid this problem, we advise that NMDA currents above 2 µA are avoided.

      OS2) We have moved forward presuming that potential changes in current amplitude due to a small chloride flux doesn’t affect our measures of potency or ligand-selectivity. But in our new experiments, we’ve especially tried to avoid large currents.

      1. Page 5, investigation of D732 state-dependent interactions: Mutation of residues near D732 to unnatural amino acids to replace the peptidic NH do not bring much information about the mechanisms of D732 action. The fact that the 734Aah and 735Vah cannot mimic the effect of the D732L mutation could be due to many factors, including the fact that changing the peptide bond probably changes the local structure of the LBD. Perhaps mention this in the discussion.

      OS3) We have now acknowledged this possibility in the Results, right after we describe the decrease in glycine potency caused by the 734Aah mutation (page 7): “Although this may be due to local conformational changes due to altered main chain structure,…”

      1. It is intriguing that the D732L mutation locks an active conformation of the GluN1 subunit but not the GluN2 subunit, suggesting two different mechanisms of LBD closure by glutamate and glycine. It would be interesting to look at the effect of the equivalent mutation on the GluN3 subunit to investigate if this locking effect is specific to glycine-binding LBDs or just to the GluN1 subunit.

      OS4) We have now made and tested mutant GluN3A subunits D485L and D485F. Simply decreases glycine activity altogether (reflecting the effects of the mutations in GluN2A). Described on page 9: “Similarly, at oocytes injected with GluN1(4a)-WT and GluN3A-D845L or -D845F mRNAs, we saw no response to glycine alone or glycine in the presence of CGP 78608 (Fig 5D). Together, these results indicate that the induction of a constitutively active state by the D732L/F substitution is an exclusive feature of the GluN1 subunit, and the only conserved feature of the mutation in different subunits is a decrease in agonist potency.”

      1. Page 9: Discussing the position of residue side chains from structures with 4 Å resolution does not seem relevant and would benefit from a caveat.

      OS5) We want to retain our comparison of experiments with available structural data, so we have kept this but re-written to more openly acknowledge the caveat (page 12): “Indeed, in a cryo-electron microscopy (cryo-EM) study of GluN1/GluN2B receptors, D732 has only swung toward the ligand and away from A734 in a second of two putative pre-gating step structural models, although this is speculative considering the poor resolution of D732 side chains in those cryo-EM maps (12).”

      1. Page 10: We don’t understand the point that the authors want to make with the activation of Aplysia californica. Please clarify.

      OS6) He we were trying to say that “not much is required to change NMDARs from requisite co-agonism to single-ligand agonism”, either (a) in the lab via the D732L mutation or (b) naturally, as invertebrate NMDA receptors apparently show single-ligand agonism (results on invertebrate NDMARs in the literature). Further, we want to say that “by extension, we wonder if (c) in certain physiological situations, vertebrate NMDARs might indeed need only a single ligand.” We acknowledge this was unclear and – although it’s still speculative – we have now changed to (page 13): “Our work shows that only small changes in the GluN1 LBD are required for solely glutamate-gated currents in vertebrate GluN1/GluN2 receptors, and previous work suggests that invertebrate Drosophila melanogaster and Aplysia californica GluN1/GluN2 receptors can be activated by single ligands <sup>50,51</sup>. This suggests that NMDA receptors’ requirement of co-agonism is easily alleviated by certain mutations or conditions. As iGluR-modulatory proteins vary across cell types or even across neuronal compartments <sup>52,53</sup> and NMDA receptor sequence varies across animals, it is foreseeable that in certain physiological settings, certain NMDA receptors might be activated by glutamate alone. But in most settings, certainly in vertebrates, it seems that glutamate-induced activation of NMDA receptors relies on a system of ambient glycine or D-serine <sup>54,55</sup>.”

      1. In iGluRs, constitutive currents are often induced by mutations in the gate region, near the SYTANLAAF motif (e.g. lurcher mutations). Does the sequence around the gate of Trichoplast AKDF<sup>193863</sup> predict channel constitutive activity?

      OS7) Our results with WT, single mutant Y742D, and double mutant Y742D/Y743S Trichoplax AKDF<sup>19383</sup> receptors already show convincing evidence that the constitutive activity is via the Y742 and Y743 position: the tyrosine residues are unique to this leaky channel, and their mutation to more typical residues removes the leak current (Fig. 7B, page 11, revised manuscript).

      But a look at upper M3 is warranted. As shown in Fig. 6C, AKDF<sup>19383</sup> (YTANMAAFL) is quite similar to typical iGluRs (e.g. GluA2 YTANLAAFL). But one might ask about the single M/L difference in that motif, and we have therefore made and tested the M657L AKDF<sup>19383</sup> mutant, comparing it with WT. Results show that this small M3 difference has little effect on channel activity. We have added this data in new Figure 7D and described it (page 11): “As channel activity of iGluRs also relies on the upper segment of the third membrane-spanning helix (M3, (34)), we also examined this segment in AKDF<sup>19383</sup>. AKDF<sup>19383</sup> differs only subtly from most iGluRs with a methionine residue (M657) instead of leucine here (Fig. 6C), but we tested potential effects of this difference by mutating M657 to leucine. M657L activity was much like WT (Fig. 7D), however, confirming that divergence at Y742/Y743 and not the upper M3 segment determines the unique activity of AKDF<sup>19383</sup>.”

      1. D-serine is another co-agonist that binds the GluN1 subunit. Compared to glycine, D-serine can make additional interactions with the lower lobe of GluN1 LBD. It would be interesting to look at D-serine dose-response curves in GluN1-D732L/GluN2A receptors: are these receptors also D-serine insensitive or can they be further activated by D-serine?

      OS8) We have now measured the effects of D-serine on GluN1-D732L/GluN2A-WT receptors. As we now show in Figure 1B (green symbols), D-serine at increasing concentrations (100 nM through 100 μM) activates no additional current on top of the glutamate-gated current in mutant receptors. We have added to the end of the first Results paragraph (page 3): “Similarly, large currents were activated in mutant GluN1-D732L/GluN2A-WT receptors when 100 nM through 100 μM D-Serine was applied the presence of 100 µM glutamate (green in Fig. 1B).”

      (This is a response to peer review conducted by Biophysics Colab on version 1 of this preprint.)

    1. Author Response

      The following is the authors’ response to the original reviews.

      eLife assessment

      This study uses a multi-pronged empirical and theoretical approach to advance our understanding of how differences in learning relate to differences in the ways that male versus female animals cope with urban environments, and more generally how reversal learning may benefit animals in urban habitats. The work makes an important contribution and parts of the data and analyses are solid, although several of the main claims are only partially supported or overstated and require additional support.

      Public Reviews:

      We thank the Editor and both Reviewers for their time and for their constructive evaluation of our manuscript. We worked to address each comment and suggestion offered by the Reviewers in our revision—please see our point-by-point responses below.

      Reviewer #1 (Public Review):

      Summary:

      In this highly ambitious paper, Breen and Deffner used a multi-pronged approach to generate novel insights on how differences between male and female birds in their learning strategies might relate to patterns of invasion and spread into new geographic and urban areas.

      The empirical results, drawn from data available in online archives, showed that while males and females are similar in their initial efficiency of learning a standard color-food association (e.g., color X = food; color Y = no food) scenario when the associations are switched (now, color Y = food, X= no food), males are more efficient than females at adjusting to the new situation (i.e., faster at 'reversal learning'). Clearly, if animals live in an unstable world, where associations between cues (e.g., color) and what is good versus bad might change unpredictably, it is important to be good at reversal learning. In these grackles, males tend to disperse into new areas before females. It is thus fascinating that males appear to be better than females at reversal learning. Importantly, to gain a better understanding of underlying learning mechanisms, the authors use a Bayesian learning model to assess the relative role of two mechanisms (each governed by a single parameter) that might contribute to differences in learning. They find that what they term 'risk sensitive' learning is the key to explaining the differences in reversal learning. Males tend to exhibit higher risk sensitivity which explains their faster reversal learning. The authors then tested the validity of their empirical results by running agent-based simulations where 10,000 computersimulated 'birds' were asked to make feeding choices using the learning parameters estimated from real birds. Perhaps not surprisingly, the computer birds exhibited learning patterns that were strikingly similar to the real birds. Finally, the authors ran evolutionary algorithms that simulate evolution by natural selection where the key traits that can evolve are the two learning parameters. They find that under conditions that might be common in urban environments, high-risk sensitivity is indeed favored.

      Strengths:

      The paper addresses a critically important issue in the modern world. Clearly, some organisms (some species, some individuals) are adjusting well and thriving in the modern, human-altered world, while others are doing poorly. Understanding how organisms cope with human-induced environmental change, and why some are particularly good at adjusting to change is thus an important question.

      The comparison of male versus female reversal learning across three populations that differ in years since they were first invaded by grackles is one of few, perhaps the first in any species, to address this important issue experimentally.

      Using a combination of experimental results, statistical simulations, and evolutionary modeling is a powerful method for elucidating novel insights.

      Thank you—we are delighted to receive this positive feedback, especially regarding the inferential power of our analytical approach.

      Weaknesses:

      The match between the broader conceptual background involving range expansion, urbanization, and sex-biased dispersal and learning, and the actual comparison of three urban populations along a range expansion gradient was somewhat confusing. The fact that three populations were compared along a range expansion gradient implies an expectation that they might differ because they are at very different points in a range expansion. Indeed, the predicted differences between males and females are largely couched in terms of population differences based on their 'location' along the rangeexpansion gradient. However, the fact that they are all urban areas suggests that one might not expect the populations to differ. In addition, the evolutionary model suggests that all animals, male or female, living in urban environments (that the authors suggest are stable but unpredictable) should exhibit high-risk sensitivity. Given that all grackles, male and female, in all populations, are both living in urban environments and likely come from an urban background, should males and females differ in their learning behavior? Clarification would be useful.

      Thank you for highlighting a gap in clarity in our conceptual framework. To answer the Reviewer’s question—yes, even with this shared urban ‘history’, it seems plausible that males and females could differ in their learning. For example, irrespective of population membership, such sex differences could come about via differential reliance on learning strategies mediated by an interaction between grackles’ polygynous mating system and malebiased dispersal system, as we discuss in L254–265 (now L295–306). Population membership might, in turn, differentially moderate the magnitude of any such sex-effect since an edge population, even though urban, could still pose novel challenges—for example, by requiring grackles to learn novel daily temporal foraging patterns such as when and where garbage is collected (grackles appear to track this food resource: Rodrigo et al. 2021 [DOI: 10.1101/2021.06.14.448443]). We now introduce this important conceptual information— please see L89–96.

      Reinforcement learning mechanisms:

      Although the authors' title, abstract, and conclusions emphasize the importance of variation in 'risk sensitivity', most readers in this field will very possibly misunderstand what this means biologically. Both the authors' use of the term 'risk sensitivity' and their statistical methods for measuring this concept have potential problems.

      Please see our below responses concerning our risk-sensitivity term.

      First, most behavioral ecologists think of risk as predation risk which is not considered in this paper. Secondarily, some might think of risk as uncertainty. Here, as discussed in more detail below, the 'risk sensitivity' parameter basically influences how strongly an option's attractiveness affects the animal's choice of that option. They say that this is in line with foraging theory (Stephens and Krebs 2019) where sensitivity means seeking higher expected payoffs based on prior experience. To me, this sounds like 'reward sensitivity', but not what most think of as 'risk sensitivity'. This problem can be easily fixed by changing the name of the term.

      We apologise for not clearly introducing the field of risk-sensitive foraging, which focuses on how animals evaluate and choose between distinct food options, and how such foraging decisions are influenced by pay-off variance i.e., risk associated with alternative foraging options (seminal reviews: Bateson 2002 [DOI: 10.1079/PNS2002181]; Kacelnik & Bateson 1996 [DOI: 10.1093/ICB/36.4.402]). We have added this information to our manuscript in L494–497. We further apologise for not clearly explaining how our lambda parameter estimates such risk-sensitive foraging. To do so here, we need to consider our Bayesian reinforcement learning model in full. This model uses observed choice-behaviour during reinforcement learning to infer our phi (information-updating) and lambda (risksensitivity) learning parameters. Thus, payoffs incurred through choice simultaneously influence estimation of each learning parameter—that is, in a sense, they are both sensitive to rewards. But phi and lambda differentially direct any reward sensitivity back on choicebehaviour due to their distinct definitions. Glossing over the mathematics, for phi, stronger reward sensitivity (bigger phi values) means faster internal updating about stimulus-reward pairings, which translates behaviourally into faster learning about ‘what to choose’. For lambda, stronger reward sensitivity (bigger lambda values) means stronger internal determinism about seeking the non-risk foraging option (i.e., the one with the higher expected payoffs based on prior experience), which translates behaviourally into less choice-option switching i.e., ‘playing it safe’. We hope this information, which we have incorporated into our revised manuscript (please see L153–161), clarifies the rationale and mechanics of our reinforcement learning model, and why lamba measures risk-sensitivity.

      In addition, however, the parameter does not measure sensitivity to rewards per se - rewards are not in equation 2. As noted above, instead, equation 2 addresses the sensitivity of choice to the attraction score which can be sensitive to rewards, though in complex ways depending on the updating parameter. Second, equations 1 and 2 involve one specific assumption about how sensitivity to rewards vs. to attraction influences the probability of choosing an option. In essence, the authors split the translation from rewards to behavioral choices into 2 steps. Step 1 is how strongly rewards influence an option's attractiveness and step 2 is how strongly attractiveness influences the actual choice to use that option. The equation for step 1 is linear whereas the equation for step 2 has an exponential component. Whether a relationship is linear or exponential can clearly have a major effect on how parameter values influence outcomes. Is there a justification for the form of these equations? The analyses suggest that the exponential component provides a better explanation than the linear component for the difference between males and females in the sequence of choices made by birds, but translating that to the concepts of information updating versus reward sensitivity is unclear. As noted above, the authors' equation for reward sensitivity does not actually include rewards explicitly, but instead only responds to rewards if the rewards influence attraction scores. The more strongly recent rewards drive an update of attraction scores, the more strongly they also influence food choices. While this is intuitively reasonable, I am skeptical about the authors' biological/cognitive conclusions that are couched in terms of words (updating rate and risk sensitivity) that readers will likely interpret as concepts that, in my view, do not actually concur with what the models and analyses address.

      To answer the Reviewer’s question—yes, these equations are very much standard and the canonical way of analysing individual reinforcement learning (see: Ch. 15.2 in Computational Modeling of Cognition and Behavior by Farrell & Lewandowsky 2018 [DOI: 10.1017/CBO9781316272503]; McElreath et al. 2008 [DOI: 10.1098/rstb/2008/0131]; Reinforcement Learning by Sutton & Barto 2018). To provide a “justification for the form of these equations'', equation 1 describes a convex combination of previous values and recent payoffs. Latent values are updated as a linear combination of both factors, there is no simple linear mapping between payoffs and behaviour as suggested by the reviewer. Equation 2 describes the standard softmax link function. It converts a vector of real numbers (here latent values) into a simplex vector (i.e., a vector summing to 1) which represents the probabilities of different outcomes. Similar to the logit link in logistic regression, the softmax simply maps the model space of latent values onto the outcome space of choice probabilities which enter the categorial likelihood distribution. We can appreciate how we did not make this clear in our manuscript by not highlighting the standard nature of our analytical approach—we now do so in our revised manuscript (please see L148–149). As far as what our reinforcement learning model measures, and how it relates cognition and behaviour, please see our previous response.

      To emphasize, while the authors imply that their analyses separate the updating rate from 'risk sensitivity', both the 'updating parameter' and the 'risk sensitivity' parameter influence both the strength of updating and the sensitivity to reward payoffs in the sense of altering the tendency to prefer an option based on recent experience with payoffs. As noted in the previous paragraph, the main difference between the two parameters is whether they relate to behaviour linearly versus with an exponential component.

      Please see our two earlier responses on the mechanics of our reinforcement learning model.

      Overall, while the statistical analyses based on equations (1) and (2) seem to have identified something interesting about two steps underlying learning patterns, to maximize the valuable conceptual impact that these analyses have for the field, more thinking is required to better understand the biological meaning of how these two parameters relate to observed behaviours, and the 'risk sensitivity' parameter needs to be re-named.

      Please see our earlier response to these suggestions.

      Agent-based simulations:

      The authors estimated two learning parameters based on the behaviour of real birds, and then ran simulations to see whether computer 'birds' that base their choices on those learning parameters return behaviours that, on average, mirror the behaviour of the real birds. This exercise is clearly circular. In old-style, statistical terms, I suppose this means that the R-square of the statistical model is good. A more insightful use of the simulations would be to identify situations where the simulation does not do as well in mirroring behaviour that it is designed to mirror.

      Based on the Reviewer’s summary of agent-based forward simulation, we can see we did a poor job explaining the inferential value of this method—we apologise. Agent-based forward simulations are posterior predictions, and they provide insight into the implied model dynamics and overall usefulness of our reinforcement learning model. R-squared calculations are retrodictive, and they say nothing about the causal dynamics of a model. Specifically, agent-based forward simulation allows us to ask—what would a ‘new’ grackle ‘do’, given our reinforcement learning model parameter estimates? It is important to ask this question because, in parameterising our model, we may have overlooked a critical contributing mechanism to grackles’ reinforcement learning. Such an omission is invisible in the raw parameter estimates; it is only betrayed by the parameters in actu. Agent-based forward simulation is ‘designed’ to facilitate this call to action—not to mirror behavioural results. The simulation has no apriori ‘opinion’ about computer ‘birds’ behavioural outcomes; rather, it simply assigns these agents random phi and lambda draws (whilst maintaining their correlation structure), and tracks their reinforcement learning. The exercise only appears circular if no critical contributing mechanism(s) went overlooked—in this case computer ‘birds’ should behave similar to real birds. A disparate mapping between computer ‘birds’ and real birds, however, would mean more work is needed with respect to model parameterisation that captures the causal, mechanistic dynamics behind real birds’ reinforcement learning (for an example of this happening in the human reinforcement learning literature, see Deffner et al. 2020 [DOI: 10.1098/rsos.200734]). In sum, agent-based forward simulation does not access goodness-of-fit—we assessed the fit of our model apriori in our preregistration (https://osf.io/v3wxb)—but it does assess whether one did a comprehensive job of uncovering the mechanistic basis of target behaviour(s). We have worked to make the above points on the method and the insight afforded by agent-based forward simulation explicitly clear in our revision—please see L192–207 and L534–537.

      Reviewer #2 (Public Review):

      Summary:

      The study is titled "Leading an urban invasion: risk-sensitive learning is a winning strategy", and consists of three different parts. First, the authors analyse data on initial and reversal learning in Grackles confronted with a foraging task, derived from three populations labeled as "core", "middle" and "edge" in relation to the invasion front. The suggested difference between study populations does not surface, but the authors do find moderate support for a difference between male and female individuals. Secondly, the authors confirm that the proposed mechanism can actually generate patterns such as those observed in the Grackle data. In the third part, the authors present an evolutionary model, in which they show that learning strategies as observed in male Grackles do evolve in what they regard as conditions present in urban environments.

      Strengths:

      The manuscript's strength is that it combines real learning data collected across different populations of the Great-tailed grackle (Quiscalus mexicanus) with theoretical approaches to better understand the processes with which grackles learn and how such learning processes might be advantageous during range expansion. Furthermore, the authors also take sex into account revealing that males, the dispersing sex, show moderately better reversal learning through higher reward-payoff sensitivity. I also find it refreshing to see that the authors took the time to preregister their study to improve transparency, especially regarding data analysis.

      Thank you—we are pleased to receive this positive evaluation, particularly concerning our efforts to improve scientific transparency via our study’s preregistration (https://osf.io/v3wxb).

      Weaknesses:

      One major weakness of this manuscript is the fact that the authors are working with quite low sample sizes when we look at the different populations of edge (11 males & 8 females), middle (4 males & 4 females), and core (17 males & 5 females) expansion range. Although I think that when all populations are pooled together, the sample size is sufficient to answer the questions regarding sex differences in learning performance and which learning processes might be used by grackles but insufficient when taking the different populations into account.

      In Bayesian statistics, there is no strict lower limit of required sample size as the inferences do not rely on asymptotic assumptions. With inferences remaining valid in principle, low sample size will of course be reflected in rather uncertain posterior estimates. We note all of our multilevel models use partial pooling on individuals (the random-effects structure), which is a regularisation technique that generally reduces the inference constraint imposed by a low sample size (see Ch. 13 in Statistical Rethinking by Richard McElreath [PDF: https://bit.ly/3RXCy8c]). We further note that, in our study preregistration (https://osf.io/v3wxb), we formally tested our reinforcement learning model for different effect sizes of sex on learning for both target parameters (phi and lambda) across populations, using a similarly modest N (edge: 10 M, 5 F; middle: 22 M, 5 F ; core: 3 M, 4 F) to our actual final N, that we anticipated to be our final N at that time. This apriori analysis shows our reinforcement learning model: (i) detects sex differences in phi values >= 0.03 and lambda values >= 1; and (ii) infers a null effect for phi values < 0.03 and lambda values < 1 i.e., very weak simulated sex differences (see Figure 4 in https://osf.io/v3wxb). Thus, both of these points together highlight how our reinforcement learning model allows us to say that across-population null results are not just due to small sample size. Nevertheless the Reviewer is not wrong to wonder whether a bigger N might change our population-level results (it might; so might muchneeded population replicates—see L310), but our Bayesian models still allow us to learn a lot from our current data. We now explain this in our revised manuscript—please see L452–457.

      Another weakness of this manuscript is that it does not set up the background well in the introduction. Firstly, are grackles urban dwellers in their natural range and expand by colonising urban habitats because they are adapted to it? The introduction also fails to mention why urban habitats are special and why we expect them to be more challenging for animals to inhabit. If we consider that one of their main questions is related to how learning processes might help individuals deal with a challenging urban habitat, then this should be properly introduced.

      In L74–75 (previously L53–56) we introduce that the estimated historical niche of grackles is urban environments, and that shifts in habitat breadth—e.g., moving into more arid, agricultural environments—is the estimated driver of their rapid North American colonisation. We hope this included information sufficiently answers the Reviewer’s question. We have worked towards flushing out how urban-imposed challenges faced by grackles, such as the wildlife management efforts introduced in L64–65 (now L85–86), may apply to animals inhabiting urban environments more broadly; for example, we now include an entire paragraph in our Introduction detailing how urban environments may be characterised differently to nonurban environments, and thus why they are perhaps more challenging for animals to inhabit— please see L56–71.

      Also, the authors provide a single example of how learning can differ between populations from more urban and more natural habitats. The authors also label the urban dwellers as the invaders, which might be the case for grackles but is not necessarily true for other species, such as the Indian rock agama in the example which are native to the area of study. Also, the authors need to be aware that only male lizards were tested in this study. I suggest being a bit more clear about what has been found across different studies looking at: (1) differences across individuals from invasive and native populations of invasive species and (2) differences across individuals from natural and urban populations.

      We apologise for not including more examples of such learning differences. We now include three examples (please see L43–49), and we are careful to call attention to the fact that these data cover both resident urban and non-urban species as well as urban invasive species (please see L49–50). We also revised our labelling of the lizard species (please see L44). We are aware only male lizards were tested but this information is not relevant to substantiating our use of this study; that is, to highlight that learning can differ between urbandwelling and non-urban counterparts. We hope the changes we did make to our manuscript satisfy the Reviewer’s general suggestion to add biological clarity.

      Finally, the introduction is very much written with regard to the interaction between learning and dispersal, i.e. the 'invasion front' theme. The authors lay out four predictions, the most important of which is No. 4: "Such sex-mediated differences in learning to be more pronounced in grackles living at the edge, rather than the intermediate and/or core region of their range." The authors, however, never return to this prediction, at least not in a transparent way that clearly pronounces this pattern not being found. The model looking at the evolution of risk-sensitive learning in urban environments is based on the assumption that urban and natural environments "differ along two key ecological axes: environmental stability 𝑢 (How often does optimal behaviour change?) and environmental stochasticity 𝑠 (How often does optimal behaviour fail to pay off?). Urban environments are generally characterised as both stable (lower 𝑢) and stochastic (higher 𝑠)". Even though it is generally assumed that urban environments differ from natural environments the authors' assumption is just one way of looking at the differences which have generally not been confirmed and are highly debated. Additionally, it is not clear how this result relates to the rest of the paper: The three populations are distinguished according to their relation to the invasion front, not with respect to a gradient of urbanization, and further do not show a meaningful difference in learning behaviour possibly due to low sample sizes as mentioned above.

      Thank you for highlighting a gap in our reporting clarity. We now take care to transparently report our null result regarding our fourth prediction; more specifically, that we did not detect credible population-level differences in grackles’ learning (please see L130). Regarding our evolutionary model, we agree with the Reviewer that this analysis is only one way of looking at the interaction between learning phenotype and apparent urban environmental characteristics. Indeed, in L282–288 (now L325–329) we state: “Admittedly, our evolutionary model is not a complete representation of urban ecology dynamics. Relevant factors—e.g., spatial dynamics and realistic life histories—are missed out. These omissions are tactical ones. Our evolutionary model solely focuses on the response of reinforcement learning parameters to two core urban-like (or not) environmental statistics, providing a baseline for future study to build on”. But we can see now that ‘core’ is too strong a word, and instead ‘supposed’, ‘purported’ or ‘theorised’ would be more accurate—we have revised our wording throughout our manuscript to say as much (please see, for example, L24; L56; L328). We also further highlight the preliminary nature of our evolutionary model, in terms of allowing a narrow but useful first-look at urban eco-evolutionary dynamics—please see L228–232. Finally, we now detail the theorised characteristics of urban environments in our Introduction (rather than in our Results; please see L56–71), and we hope that by doing so, how our evolutionary results relate to the rest of our paper is now better set up and clear.

      In conclusion, the manuscript was well written and for the most part easy to follow. The format of eLife having the results before the methods makes it a bit harder to follow because the reader is not fully aware of the methods at the time the results are presented. It would, therefore, be important to more clearly delineate the different parts and purposes. Is this article about the interaction between urban invasion, dispersal, and learning? Or about the correct identification of learning mechanisms? Or about how learning mechanisms evolve in urban and natural environments? Maybe this article can harbor all three, but the borders need to be clear. The authors need to be transparent about what has and especially what has not been found, and be careful to not overstate their case.

      Thank you, we are pleased to read that the Reviewer found our manuscript to be generally digestible. We have worked to add further clarity, and to tempter our tone (please see our above and below responses).

      Reviewer #1 (Recommendations For The Authors):

      Several of the results are based on CIs that overlap zero. Tone these down somewhat.

      We apologise for overstating our results, which we have worked to tone down in our revision. For instance, in L185–186 we now differentiate between estimates that did or did not overlap zero (please also see our response to Reviewer 2 on this tonal change). We note we do not report confidence intervals (i.e., the range of values expected to contain the true estimate if one redoes the study/analysis many times). Rather, we report 89% highest posterior density intervals (i.e., the most likely values of our parameters over this range). We have added this definition in L459, to improve clarity.

      The literature review suggesting that urban environments are more unpredictable is not convincing. Yes, they have more noise and light pollution and more cars and planes, but does this actually relate to the unpredictability of getting a food reward when you choose an option that usually yields rewards?

      To answer the Reviewer’s question—yes. But we can see that by not including empirical examples from the literature, we did a poor job of arguing such links. In L43–49 we now give three empirical examples; more specifically, we state: “[...] experimental data show the more variable are traffic noise and pedestrian presence, the more negative are such human-driven effects on birds' sleep (Grunst et al., 2021), mating (Blickley et al., 2012), and foraging behaviour (Fernández-Juricic, 2000).” We note we now detail such apparently stable but stochastic urban environmental characteristics in our Introduction rather than our Results section, to hopefully improve the clarity of our manuscript (please see L56–71). We further note that we cite three literature reviews—not one—suggesting urban environments are stable in certain characteristics and more unpredictable in others (please see L59–60). Finally, we appreciate such characterisation is not certain, and so in our revision we have qualified all writing about this potential dynamic with words such as “apparent”, “supposed”, “theorised”, “hypothesised” etc.

      It would be interesting to see if other individual traits besides sex affect their learning/reversal learning ability and/or their learning parameters. Do you have data on age, size, condition, or personality? Or, the habitat where they were captured?

      We do not have these data. But we agree with the Reviewer that examining the potential influence of such covariates on grackles’ reinforcement learning would be interesting in future study, especially habitat characteristics (please see L306–309).

      For most levels of environmental noise, there appears to be an intermediate maximum for the relationship between environmental stability and the risk sensitivity parameter. What does this mean?

      There is indeed an intermediate maximum for certain values of environmental stochasticity (although the differences are rather small). The most plausible reason for this is that for very stable environments, simulated birds essentially always “know” the rewarded solution and never need to “relearn” behaviour. In this case, differences in latent values will tend to be large (because they consistently get rewarded for the same option), and different lambda values (in the upper range) will produce the same choice behaviour, which results in very weak selection. While in very unstable environments, optimal choice behaviour should be more exploratory, allowing learners to track frequently-changing environments. We now note this pattern in L240–248.

      Reviewer #2 (Recommendations For The Authors):

      L2: I'd encourage the authors to reconsider the term "risk-sensitive learning", at least in the title. It's not apparent to me how 'risk' relates to the investigated foraging behaviour. Elsewhere, risk-reward sensitivity is used which may be a better term.

      We apologise for not clearly introducing the field of risk-sensitive foraging, which focuses on how animals evaluate and choose between distinct food options, and how such foraging decisions are influenced by pay-off variance i.e., risk associated with alternative foraging options (seminal reviews: Bateson 2002 [DOI: 10.1079/PNS2002181]; Kacelnik & Bateson 1996 [DOI: 10.1093/ICB/36.4.402]). We have added this information to our manuscript in L494–497. In explaining our reinforcement model, we also now detail how risk relates to foraging behaviour. Specifically, in L153–161 we now state: “Both learning parameters capture individual-level internal response to incurred reward-payoffs, but they differentially direct any reward sensitivity back on choice-behaviour due to their distinct definitions (full mathematical details in Materials and methods). For 𝜙, stronger reward sensitivity (bigger values) means faster internal updating about stimulus-reward pairings, which translates behaviourally into faster learning about ‘what to choose’. For 𝜆, stronger reward sensitivity (bigger values) means stronger internal determinism about seeking the nonrisk foraging option (i.e., the one with the higher expected payoffs based on prior experience), which translates behaviourally into less choice-option switching i.e., ‘playing it safe’.” We hope this information clarifies why lamba measures risk-sensitivity, and why we continue to use this term.

      L1-3: The title is a bit misleading with regard to the empirical data. From the data, all that can be said is that male grackles relearn faster than females. Any difference between populations actually runs the other way, with the core population exhibiting a larger difference between males and females than the mid and edge populations.

      It is customary for a manuscript title to describe the full scope of the study. In our study, we have empirical data, cognitive modelling, and evolutionary simulations of the background theory all together. And together these analytical approaches show: (1) across three populations, male grackles—the dispersing sex in this historically urban-dwelling and currently urban-invading species—outperform female counterparts in reversal learning; (2) they do this via risk-sensitive learning, so they’re more sensitive to relative differences in reward payoffs and choose to stick with the ‘safe’ i.e., rewarding option, rather than continuing to ‘gamble’ on an alternative option; and (3) risk-sensitive learning should be favoured in statistical environments characterised by purported urban dynamics. So, we do not feel our title “Leading an urban invasion: risk-sensitive learning is a winning strategy” is misleading with regard to our empirical data; it just doesn’t summarise only our empirical data. Finally, as we now state in L312–313, we caution against speculating about any between-population variation, as we did not infer any meaningful behavioural or mechanistic population-level differences.

      L13: "Assayed", is that correctly put, given that the authors did not collect the data?

      Merrian-Webster defines assay as “to analyse” or “examination or determination as to characteristics”, and so to answer the Reviewer’s question—yes, we feel this is correctly put. We note we explicitly introduce in L102–103 that we did not collect the data, and we have an explicit “Data provenance” section in our methods (please see L342–347).

      L42-46: The authors provide a single example of how learning can differ between populations from more urban and more natural habitats. I would like to point out that many of these studies do not directly confirm that the ability in question has indeed led to the success of the species tested (e.g. show fitness consequences). Then the authors could combine these insights to form a solid prediction for the grackles. As of now, this looks like cherry-picking supportive literature without considering negative results.

      Here are some references that might be helpful in identifying relevant literature to cite:

      Szabo, B., Damas-Moreira, I., & Whiting, M. J. (2020). Can cognitive ability give invasive species the means to succeed? A review of the evidence. Frontiers in Ecology and Evolution, 8, 187.

      Griffin AS, Tebbich S, Bugnyar T, 2017. Animal cognition in a human-dominated world. Anim Cogn 20(1):1-6.

      Kark, S., Iwaniuk, A., Schalimtzek, A., & Banker, E. (2007). Living in the city: Can anyone become an "urban exploiter"? Journal of Biogeography, 34(4), 638-651.

      We apologise for not including more examples of such learning differences. We now include three examples (please see L43–49). We are aware that direct evidence of fitness consequences is entirely lacking in the scientific literature on cognition and successful urban invasion; hence why such data is not present in our paper. But we now explicitly point out a role for likely fitness-affecting anthropogenic disturbances on sleep, mate, and foraging behaviour on animals inhabiting urban environments (please see L63–68). We hope these new data bolster our predictions for our grackles. Finally, the Reviewer paints a (in our view) inaccurate picture of our use of available literature. Nevertheless, to address their comment, we now highlight a recent meta-analysis advocating for further research to confirm apparent ‘positive’ trends between animal ‘smarts’ and successful ‘city living’ (please see L43).

      L64: Is their niche historically urban, or have they recently moved into urban areas?

      In L74–75 (previously L53–56) we introduce that the estimated historical niche of grackles is urban environments, and that shifts in habitat breadth—e.g., moving into more arid, agricultural environments—is the estimated driver of their rapid North American colonisation. We hope this included information sufficiently answers the Reviewer’s question.

      L66-67: This is an important point that is however altogether missing from the discussion.

      We thank the Reviewer for highlighting a gap in our discussion regarding populationlevel differences in grackles’ reinforcement learning. In L310–312 we now state: “The lack of spatial replicates in the existing data set used herein inherently poses limitations on inference. Nevertheless, the currently available data do not show meaningful population-level behavioural or mechanistic differences in grackles’ reinforcement learning, and we should thus be cautious about speculating on between-population variation”.

      L68-71: The paper focuses on cognitive ability. The whole paragraph sets up the prediction of why male grackles should be better learners due to their dispersal behaviour. This example, however, focuses on aggression, not cognition. Here is a study showing differences in learning in male and female mynas that might be better suited:

      Federspiel IG, Garland A, Guez D, Bugnyar T, Healy SD, Güntürkün O, Griffin AS, 2017. Adjusting foraging strategies: a comparison of rural and urban common mynas (Acridotheres tristis). Anim Cogn 20(1):65-74.

      We thank the Reviewer for suggesting this paper. We feel it is better suited to substantiating our point in the Discussion about reversal learning not being indicative of cognitive ability—please see L276–277.

      L73: Generally, I suggest not writing "for the first time" as this is not a valid argument for why a study should be conducted. Furthermore, except for replication studies, most studies investigate questions that are novel and have not been investigated before.

      The Reviewer makes a fair point—we have removed this statement.

      L80-81: Here again, this is left undiscussed later on.

      By ‘this’ we assume the Reviewer is referring to our hypothesis, which is that sex differences in dispersal are related to sex differences in learning in an urban invader— grackles. At the beginning of our Discussion, we state how we found support for this hypothesis (please see L250–261); and in our ‘Ideas and speculation’ section, we discuss how these hypothesis-supporting data fit into the literature more broadly (please see L294–331). We feel this is therefore sufficiently discussed.

      L77-81: This sentence is very long and therefore hard to read. I suggest trying to split it into at least 2 separate sentences which would improve readability.

      Per the Reviewer’s useful suggestion, we have split this sentence into two separate sentences—please see L97–115.

      L83: Please explain choice-option switches. I am not aware of what that is and it should be explained at first mention.

      We apologise for this operational oversight. We now include a working definition of speed and choice-option switches at first mention. Specifically, in L107–108 we state: “[...] we expect male and female grackles to differ across at least two reinforcement learning behaviours: speed (trials to criterion) and choice-option switches (times alternating between available stimuli)”.

      L83-87: Again, a very long sentence. Please split.

      We thank the Reviewer for their suggestion. In this case we feel it is important to not change our sentence structure because we want our prediction statements to match between our manuscript and our preregistration.

      L96-97: Important to not overstate this. It merely demonstrates the potential of the proposed (not detected) mechanism to generate the observed data.

      As in any empirical analysis, our drawn conclusions depend on causal assumptions about the mechanisms generating behaviour (Pearl, J. (2009). Causality). Therefore, we “detected” specific learning mechanisms assuming a certain generative model, namely reinforcement learning. As there is overwhelming evidence for the widespread importance of value-based decision making and Rescorla-Wagner updating rules across numerous different animals (Sutton & Barto (2018) Reinforcement Learning), we would argue that this assumed model is highly plausible in our case. Still, we changed the text to “inferred” instead of “detected” learning mechanisms to account for this concern—please see L123–124.

      L99: "urban-like settings" again a bit confusing. The authors talk about invasion fronts, but now also about an urbanisation gradient. Is the main difference between the size and the date of establishment, or is there additionally a gradient in urbanisation to be considered?

      We now include a paragraph in our Introduction detailing apparent urban environmental characteristics (please see 56–71), and we now refer to this dynamic specifically when we define urban-like settings (please see L126–127). To answer the Reviewer’s question—we consider both differences. Specifically, we consider the time since population establishment in our paper (with respect to our behavioural and mechanistic modelling), as well as how statistical environments that vary in how similar they are to apparently characteristically urban-like environments, might favour particular learning phenotypes (with respect to our evolutionary modelling). We hope the edits to our Introduction as a whole now make both of the aims clear.

      L11-112: Above the authors talk about a comparable number of switches (10.5/15=0.7), and here of fewer number of switches (25/35=0.71), even though the magnitude of the difference is almost identical and actually runs the other way. The authors are probably misled by their conservative priors, which makes the difference appear greater in the second case than in the first. Using flat priors would avoid this particular issue.

      Mathematically, the number of trials-to-finish and the number of choice-optionswitches are both a Poisson distributed outcome with rate λ (we note lambda here is not our risk-sensitivity parameter; just standard notation). As such, our Poisson models infer the rate of these outcomes by sex and phase—not the ratio of these outcomes by sex and phase. So comparing the magnitude of divided medians of choice-option-switches between the sexes by phase is not a meaningful metric with respect to the distribution of our data, as the Reviewer does above. For perspective, 1 vs. 2 switches provides much less information about the difference in rates of a Poisson distribution than 50 vs 100 (for the former, no difference would be inferred; for the latter, it would), but both exhibit a 1:2 ratio. To hopefully prevent any such further confusion, and to focus on the fact that our Poisson models estimate the expected value i.e., the mean, we now report and graph (please see Fig. 2) mean and not median trialsto-finish and total-switch-counts. Finally, we can see that our use of the word “conservative” to describe our weakly informative priors is confusing, because conservative could mean either strong priors with respect to expected effect size (not our parameterisation) or weak priors with respect to such assumptions (our parameterisation). To address this lack of clarity, we now state that we use “weakly informative priors” in L457–458.

      L126: It is not clear what risk sensitivity means in the context of these experiments.

      Thank you for pointing out our lack of clarity. In L153–161 we now state: “Both learning parameters capture individual-level internal response to incurred reward-payoffs, but they differentially direct any reward sensitivity back on choice-behaviour due to their distinct definitions (full mathematical details in Materials and methods). For 𝜙, stronger reward sensitivity (bigger values) means faster internal updating about stimulus-reward pairings, which translates behaviourally into faster learning about ‘what to choose’. For 𝜆, stronger reward sensitivity (bigger values) means stronger internal determinism about seeking the nonrisk foraging option (i.e., the one with the higher expected payoffs based on prior experience), which translates behaviourally into less choice-option switching i.e., ‘playing it safe’.” We hope this information clarifies what risk sensitivity means and measures, with respect to our behavioural experiments.

      L128-129: I find this statement too strong. A plethora of other mechanisms could produce similar patterns, and you cannot exclude these by way of your method. All you can show is whether the mechanism is capable of producing broadly similar outcomes as observed

      In describing the inferential value of our reinforcement learning model, we now qualify that the insight provided is of course conditional on the model, which is tonally accurate. Please see L161.

      L144: As I have already mentioned above, here is the first time we hear about unpredictability related to urban environments. I suggest clearly explaining in the introduction how urban and natural environments are assumed to be different which leads to animals needing different cognitive abilities to survive in them which should explain why some species thrive and some species die out in urbanised habitats.

      Thank you for this suggestion. We now include a paragraph in our Introduction detailing as much—please see L56–71.

      L162: "almost entirely above zero" again, this is worded too strongly.

      In reporting our lambda across-population 89% HPDI contrasts in L185–186, we now state: “[...] across-population contrasts that lie mostly above zero in initial learning, and entirely above zero in reversal learning”. Our previous wording stated: ““[...] across-population contrasts that lie almost entirely above zero”. The Reviewer was correct to point out that this previous wording was too strong if we considered the contrasts together, as, indeed, we find the range of the contrast in initial learning does minimally overlap zero (L: -0.77; U: 5.61), while the range of the contrast in reversal learning does not (L: 0.14; U: 4.26). This rephrasing is thus tonally accurate.

      L178-179: I think it should be said instead that the model accounts well for the observed data.

      We have rephrased in line with the Reviewer’s suggestion, now stating in L217–218 that “Such quantitative replication confirms our reinforcement learning model results sufficiently explain our behavioural sex-difference data.”

      L188-190: I am not convinced this is a general pattern. It is quite a bold claim that I don't find to be supported by the citations. Why should biotic and abiotic factors differ in how they affect behavioural outcomes? Also, events in urban environments such as weekend/weekday could lead to highly regular optimal behaviour changes.

      Please see our response to Reviewer 1 on this point. We note we now touch on such regular events in L94–96.

      L209-211: The first sentence is misleading. The authors have found that males and females differ in 'risk sensitivity', that their learning model can fit the data rather well, and that under certain, not necessarily realistic assumptions, the male learning type is favoured by natural selection in urban environments. A difference between core, middle, and edge habitats however is barely found, and in fact seems to run the other way than expected.

      In our study, we found: (1) across three populations, male grackles—the dispersing sex in this historically urban-dwelling and currently urban-invading species—outperform female counterparts in reversal learning; (2) they do this via risk-sensitive learning, so they’re more sensitive to relative differences in reward payoffs and choose to stick with the ‘safe’ i.e., rewarding option, rather than continuing to ‘gamble’ on an alternative option; (3) we are sufficiently certain risk-sensitive learning generates our sex-difference data, as our agentbased forward simulations replicate our behavioural results (not because our model ‘fits’ the data, but because we inferred meaningful mechanistic differences—see our response to Reviewer 1 on this point); and (4) under theorised dynamics of urban environments, natural selection should favour risk-sensitive learning. We therefore do not feel it is misleading to say that we mapped a full pathway from behaviour to mechanisms through to selection and adaptation. Again, as we now state in L311–313, we caution against speculating about any between-population variation, as we did not infer any meaningful behavioural or mechanistic population-level differences. And we note the Reviewer is wrong to assume an interaction between learning, dispersal, and sex requires population-level differences on the outcome scale—please see our discussion on phenotypic plasticity and inherent species trait(s) in L313–324.

      L216: "indeed explain" again worded too strongly.

      We have tempered our wording. Specifically, we now state in L218: “sufficiently explain”. This wording is tonally accurate with respect to the inferential value of agent-based forward simulations—please see L192–207 on this point.

      L234: "reward-payoff sensitivity" might be a better term than risk-sensitivity?

      Please see our earlier response to this suggestion. We note we have changed this text to state “risk-sensitive learning” rather than “reward-payoff sensitivity”, to hopefully prevent the reader from concluding only our lambda term is sensitive to rewards—a point we now include in L153–154.

      L234-237: I think these points may be valuable, but come too much out of the blue. Many readers will not have a detailed knowledge of the experimental assays. It therefore also does not become clear how they measure the wrong thing, what this study does to demonstrate this, or whether a better alternative is presented herein. It almost seems like this should be a separate paper by itself.

      We apologise for this lack of context. We now explicitly state in L275 that we are discussing reversal learning assays, to give all readers this knowledge. In doing so, we hope the logic of our argument is now clear: reversal learning assays do not measure behavioural flexibility, whatever that even is. The Reviewer’s suggestion of a separate paper focused on what reversal learning assays actually measure, in terms of mechanism(s), is an interesting one, and we would welcome this discussion. But any such paper should build on the points we make here.

      L270-288: Somewhere here the authors have to explain how they have not found differences between populations, or that in so far as they found them, they run against the originally stated hypothesis.

      We thank the Reviewer for these suggestions. In L310—313 we now state: “The lack of spatial replicates in the existing data set used herein inherently poses limitations on inference. Nevertheless, the currently available data do not show meaningful population-level behavioural or mechanistic differences in grackles’ reinforcement learning, and we should thus be cautious about speculating on between-population variation”.

      L284: should be "missing" not "missed out"

      We have made this change.

      L290-291: It is unclear what "robust interactive links" were found. A pattern of sexbiased learning was found, which can potentially be attributed to evolutionary pressures in urban environments. An interaction e.g. between learning, dispersal, and sex can only be tentatively suggested (no differences between populations). Also "fully replicable" is a bit misleading. The analysis may be replicable, but the more relevant question of whether the findings are replicable we cannot presently answer.

      We apologise for our lack of clarity. By “robust” we mean “across population”, which we now state in L333. We again note the Reviewer is wrong to assume an interaction between learning, dispersal, and sex requires population-level differences on the outcome scale— please see our discussion on phenotypic plasticity and inherent species trait(s) in L313–324. Finally, the Reviewer makes a good point about our analyses but not our findings being replicable. In L334 we now make this distinction by stating “analytically replicable”.

      L306-315: I think you have a bit of a sample size issue not so much when populations are pooled but when separated. This might also factor in the fact that you do not really find differences across the populations in your analysis. When we look at the results presented in Figure 2 (and table d), we can see a trend towards males having better risk sensitivity in core (HPDI above 0) and middle populations (HPDI barely crossing 0) but the difference is very small. Especially the results on females are based on the performance of only 8 and 4 females respectively. I suggest making this clear in the manuscript.

      In Bayesian statistics, there is no strict lower limit of required sample size as the inferences do not rely on asymptotic assumptions. With inferences remaining valid in principle, low sample size will of course be reflected in rather uncertain posterior estimates. We note all of our multilevel models use partial pooling on individuals (the random-effects structure), which is a regularisation technique that generally reduces the inference constraint imposed by a low sample size (see Ch. 13 in Statistical Rethinking by Richard McElreath [PDF: https://bit.ly/3RXCy8c]). We further note that, in our study preregistration (https://osf.io/v3wxb), we formally tested our reinforcement learning model for different effect sizes of sex on learning for both target parameters (phi and lambda) across populations, using a similarly modest N (edge: 10 M, 5 F; middle: 22 M, 5 F ; core: 3 M, 4 F) to our actual final N, that we anticipated to be our final N at that time. This apriori analysis shows our reinforcement learning model: (i) detects sex differences in phi values >= 0.03 and lambda values >= 1; and (ii) infers a null effect for phi values < 0.03 and lambda values < 1 i.e., very weak simulated sex differences (see Figure 4 in https://osf.io/v3wxb). Thus, both of these points together highlight how our reinforcement learning model allows us to say that across-population null results are not just due to small sample size. Nevertheless the Reviewer is not wrong to wonder whether a bigger N might change our population-level results; it might; so might muchneeded population replicates—see L310. But our Bayesian models still allow us to learn a lot from our current data, and, at present, we infer no meaningful population-level behavioural or mechanistic differences in grackles’ behaviour. To make clear the inferential sufficiency of our analytical approach, we now include some of the above points in our Statistical analyses section in L452–457. Finally, we caution against speculating on any between-population variation, as we now highlight in L311—313 of our Discussion.

      Figure 2: I think the authors should rethink their usage of colour in this graph. It is not colour-blind friendly or well-readable when printed in black and white.

      We used the yellow (hex code: #fde725) and green (hex code: #5ec962) colours from the viridis package. As outlined in the viridis package vignette (https://cran.rproject.org/web/packages/viridis/index.html), this colour package is “designed to improve graph readability for readers with common forms of color blindness and/or color vision deficiency. The color maps are also perceptually-uniform, both in regular form and also when converted to black-and-white for printing”.

      Figure 3B: Could the authors turn around the x-axis and the colour code? It would be easier to read this way.

      We appreciate that aesthetic preferences may vary. In this case, we prefer to have the numbers on the x-axis run the standard way i.e., from small to large. We note we did remove the word ‘Key’ from this Figure, in line with the Reviewer’s point about these characteristics not being totally certain.

      I also had a look at the preregistration. I do think that there are parts in the preregistration that would be worth adding to the manuscript:

      L36-40: This is much easier to read here than in the manuscript.

      We changed this text generally in the Introduction in our revision, so we hope the Reviewer will again find this easier to read.

      L49-56: This is important information that I would also like to see in the manuscript.

      We no longer have confidence in these findings, as our cleaning of only one part of these data revealed considerable experimenter oversight (see ‘Learning criterion’).

      L176: Why did you remove the random effect study site from the model? It is not part of the model in the manuscript anymore.

      The population variable is part of the RL_Comp_Full.stan model that we used in our manuscript to assess population differences in grackles’ reinforcement learning, the estimates from which we report in Table C and D (please note we never coded this variable as “study cite”). But rather than being specified as a random effect, in our RL_Comp_Full.stan model we index phi and lambda by population as a predictor variable, to explicitly model population-level effects. Please see our code:

      https://github.com/alexisbreen/Sex-differences-in-grackles- learning/blob/main/Models/Reinforcement%20learning/RL_Comp_Full.stan

      L190-228: I am wondering if the model validation should also be part of the manuscript as well, rather than just being in the preregistration?

      We are not sure how the files were presented to the Reviewer for review, but our study preregistration, which includes our model validation, should be part of our manuscript as a supplementary file.

    1. Reviewer #2 (Public Review):

      Summary:<br /> In this manuscript, the authors reported the biological role of RBM7 deficiency in promoting metastasis of breast cancer. They further used a combination of genomic and molecular biology approaches to discover a novel role of RBM7 in controlling alternative splicing of many genes in cell migration and invasion, which is responsible for the RBM7 activity in suppressing metastasis. They conducted an in-depth mechanistic study on one of the main targets of RBM7, MFGE8, and established a regulatory pathway between RBM7, MFGE8-L/MFGE8-S splicing switch, and NF-κB signaling cascade. This link between RBM7 and cancer pathology was further supported by analysis of clinical data.

      Strengths:<br /> Overall, this is a very comprehensive study with lots of data, and the evidence is consistent and convincing. Their main conclusion was supported by many lines of evidence, and the results in animal models are pretty impressive.

      Weaknesses:<br /> However, there are some controls missing, and the data presentation needs to be improved. The writing of the manuscript needs some grammatical improvements because some of the wording might be confusing.

      Specific comments:<br /> (1) Figure 2. The figure legend is missing for Figure 2C, which caused many mislabels in the rest of the panels. The labels in the main text are correct, but the authors should check the figure legend more carefully. Also in Figure 2C, it is not clear why the authors choose to examine the expression of this subset of genes. The authors only refer to them as "a series of metastasis-related genes", but it is not clear what criteria they used to select these genes for expression analysis.

      (2) Line 218-220. The comparison of PSI changes in different types of AS events is misleading. Because these AS events are regulated in different mechanisms, they cannot draw the conclusion that "the presence of RBM7 may promote the usage of alternative splice sites". For example, the regulators of SE and IR may even be opposite, and thus they should discuss this in different contexts. If they want to conclude this point, they should specifically discuss the SE and A5SS rather than draw an overall conclusion.

      (3) In the section starting at line 243, they first referred to the gene and isoforms as "EFG-E8" or "EFG-E8-L", but later used "EFGE8" and "EFGE8-L". Please be consistent here. In addition, it will be more informative if the authors add a diagram of the difference between two EFGE8 isoforms in terms of protein structure or domain configuration.

      (4) Figure 7B and 7C. The figures need quantification of the inclusion of MFGE exon7 (PSI value) in addition to the RT-PCR gel. The difference seems to be small for some patients.

      Minor comments:<br /> The writing in many places is a little odd or somewhat confusing, I am listing some examples, but the authors need to polish the whole manuscript more to improve the writing.

      (1) Line 169-170, "...followed by profiling high-throughput transcriptome by RNA sequencing", should be "followed by high-throughput transcriptome profiling with RNA sequencing".

      (2) Line 170, "displayed a wide of RBM7-regulated genes were enriched...", they should add a "that" after the "displayed" as the sentence is very long.

      (3) Line 213, "PSI (percent splicing inclusion)" is not correct, PSI stands for "percent spliced in".

      (4) Line 216-217, the sentence is long and fragmented, they should break it into two sentences.

      (5) Line 224, the "tethering" should be changed to "recognizing". There is a subtle difference in the mechanistic implication between these two words.

      (6) Line 250, should be changed to "..in the ratio of two MFGE8 isoforms".

    1. Author Response

      The following is the authors’ response to the current reviews.

      We thank the reviewers for their valuable feedback which has improved this work greatly from its original form, and are elated to have such glowing reviews of the revised work published alongside the revised preprint. Reviewer 3 raises some final salient points, which deserve a brief address here.

      Teeth: We thank the reviewer for clarifying their points. We do make the assumption that the ecological parameter space of toothed and beaked organisms will be comparable. Both are governed by the same set of physical principles and have the jaw bone as the most likely point of failure (teeth are harder than bone, and keratinous rhamphothecae are malleable and can be regrown with relative ease when deformed). Differences in stress/strain distribution between toothed and beaked organisms will occur but are already accounted for in our methods as we model both the teeth and rhamphotheca and will observe these different effects. We have added an explicit statement of this hypothesis to the Methods section of the manuscript.

      Cranial kinesis: In our opinion, it is a safe assumption that the lower jaws of extant birds and enantiornithines are comparable. We do not see why the acquisition of kinesis in the upper jaw would generally affect the functional role of or constraints on the lower jaw. One possibility we discussed is that a quickly-moving kinetic premaxilla could let the lower jaw move a shorter distance during effective prey capture and lower the selection for speed (i.e. allow jaw-closing MA to remain higher). While we have added this possibility to our call for the investigation of cranial kinesis, we consider it too speculative to begin altering interpretations of fossil taxa. All raw measurement data remains available so that, if evidence is found for cranial kinesis having predictable effects on our measured parameters, future researchers can re-analyse our data and update any ecological predictions accordingly.

      Organization: To our knowledge eLife format incorporates what one would think of as a Conclusions section into the Discussion. Our Discussion section currently contains 18 subheadings which should guide a reader to any specific topic of interest. The Discussion also progresses from a more narrow to broad focus which we and several colleagues find intuitive.

      We thank all three reviewers once again for their feedback that has improved this work and their kind words throughout the process.


      The following is the authors’ response to the original reviews.

      We thank all three reviewers for their detailed reviews, and generally agree with their feedback. To accompany the reviewed preprint of this manuscript, we wished to respond to comments from the reviewers so that they (and the public) will know what we are planning to incorporate in the revised manuscript we are currently preparing. If there are any comments on our plans in the meantime, please let us know.

      • Reviewer 1, on concerns regarding identification of ontogenetic stage and comparison of taxa from different ontogenetic stages: It is fair to say that enantiornithine ontogeny is still poorly understood, though we believe all current evidence points to each specimen used in this study to being adequately mature for comparison to the extant birds used in the study. Stages of skeletal fusion are the standard method of assessing enantiornithine ontogeny (Hu and O'Connor 2017), and our comparison of histological work (Atterholt, Poust et al. 2021) to skeletal stages in Table S4 suggests a transition from juvenile to subadult in stage 0 or 1 and from subadult to adult within stage 3. Thus, the specimens we quantitatively examine in this study, all at stages 2 or 3 (Figure S10), are advanced subadults or adults. It is well-known that many living animals considered “adults” would be considered subadults or even juveniles to a palaeontologist (Hone, Farke et al. 2016). So, even if some individuals in this study are not fully skeletally mature, they should have obtained the morphology which they would possess for most of their lives and thus the morphology which undergoes selective pressure. We will add this context to the “Bohaiornithid Ontogeny” section and thank the reviewer for seeking more detail for this point.

      • Reviewer 2, on need of a context figure: We have an artistic life reconstruction of a bohaiornithid in preparation, and can include that in the revised manuscript as a figure.

      • Reviewer 2, on raptor claw categories: We explain these categories in-depth in a previous work (Miller, Pittman et al. 2023). However, we will now add a short summary of that explanation to this work so that this manuscript will become self-contained in this regard. In short, the “large raptor” category includes extant birds with records of regularly taking prey which cannot be encircled with the pes, while birds in the “small raptor” have no such records. As Reviewer 2 points out this does often follow phylogenetic lines, but not always. E.g. most owls specialise in taking small prey, but the great horned owl Bubo virginianus regularly takes mammals and birds larger than its pes (Artuso, Houston et al. 2020); and conversely we can only find reports of the common black hawk Buteogallus anthracinus taking prey samll enough for the pes to encircle (Schnell 2020) despite other accipiters frequently taking large prey. In both cases these taxa plot in PCA nearer to other large or small raptors (respectively) than to their phylogenetic relatives.

      • Reviewer 3, on teeth vs beaks: We are not aware of any foods which are exclusive to toothed or beaked animals. There are some aspects of extant bird biology that may affect the way a certain diet may need to be adapted to which we do comment on, e.g. discussion of alternatives to the crop and ventriculus for processing plant matter in the Bohaiornithid Ecology and Evolution section. For functional studies, e.g. FEA, we have included the rhamphotheca in toothless models which serves the same role as teeth, to be a feeding surface. It should not matter, in theory, if the feeding surface is hard or soft as mechanical failure occurs in high stress/strain states regardless of the medium. If having teeth necessarily increases or decreses overall stress/strain relative to a beak (and from our work this does not appear to be the case), this would in turn necessarily limit dietary options. So, all models in our work should be directly comparable.

      As an additional note on this topic, we address tooth shape in bohaiornithids at the end of the Bohaiornithid Ecology and Evolution section. We specifically note that their tooth shape is likley controlled by phylogeny in the current version, though we will add a note in the upcoming version that the morphospace of bohaiorntihid teeth overlaps that of many other clades with purportedly diverse diets, which is consistent with a hypothesis of diverse diets within the clade.

      • Reviewer 3, on cranial kinesis: Our FE models should be unaffected by cranial kinesis, as these are two-dimensional and model the akinetic lower jaw only. Some mediolateral kinesis may be relevant in the mandible in the form of “wishboning” in different taxa, but its prevalence in extant birds is currently unknown. The preservation of enantiornithines (two-dimensionally and typically in lateral view) limits the ability to capture any mediolateral function regardless.

      Our models of mechanical advantage do not account for any cranial kinesis. This is a necessary simplifcation. The nature of cranial kinesis in extant birds, and the role that it plays in feeding, is poorly understood. Cranial kinesis will increase gape, but we don’t yet know how/if it affects jaw closing force and speed (moreover, given the variation in quadrate and hinge morphology present in extant birds, this is also something that is likely to be highly diverse). We have therefore modelled the extant birds’ jaw closing systems as having one, akinetic out lever (the jaw joint to the bite point), to match the situation in our fossil taxa. This is a common simplification that has been used previously with success (Corbin, Lowenberger et al. 2015, Olsen 2017). However, we acknowledge that this simplification may introduce some error. Unfortunately, until the mechanics of cranial kinesis – and the variation in the anatomy and performance of kinetic structures in extant birds – are better understood, we cannot determine exactly what that error looks like. We therefore have greater confidence in the inter-species comparability this conservative, akinetic approach (in other words, we may not be making assumptions that are 100% accurate, but we are at least making the same assumption across all taxa, so it should be comparable in its error). We will add a section in the Mechanical Advantage and Functional Indices discussion calling for further research into the mechanics of cranial kinesis so future mechanical advantage work in birds can take this matter into account.

      • Reviewer 3, on skull reconstruction: This issue is partly addressed in the Bohaiornithid Skull Reconstruction section, though we agree that adding more mentions of it in the MA and FEA Discussion sections and the Bohaiornithid Ecology and Evolution sections will benefit the manuscript. Most notably Shenqiornis and Sulcavis have similar ecological interpretations, but much of the Shenqiornis skull reconstruction uses Sulcavis bones. Longusunguis is the only other taxon which takes more than two bones from a different taxon, and in this case all but the quadrate are not used in any quanitative measurements. We have ensured that the skull reconstructions presented in Figure 2 show what portions of the skull come from what specimen so that as new material is discovered and phylogenetic relationships are updated it will be clear to future readers which parts of reconstructions will need to be updated.

      • Reviewer 3, on data availability: All data including FEA models and raw measurement data are included in the same repository as the scripts, which we will make clear in the manuscript. Good catch on the data link being dead, we will publish it now.

      As a final note, it was brought to our attention by another colleague that the original manuscript’s ancestral state reconstrction lacked an outgroup. An updated reconstruction using Sapeornis as an outgroup will be included in the revised manuscript. The addition of the outgroup does not change any conclusions of the manuscript.

      We once again thank our reviewers for their valuable feedback and will submit a revised version of this manuscript for publication shortly. Please let us know if you have any additional comments after reading our response that we can take onboard in our revision.

      References

      Artuso, C., C. S. Houston, D. G. Smith and C. Rohner (2020). Great Horned Owl (Bubo virginianus), version 1.0. Birds of the World. A. F. Poole. Ithaca, NY, USA, Cornell Lab of Ornithology.

      Atterholt, J., A. W. Poust, G. M. Erickson and J. K. O'Connor (2021). "Intraskeletal osteohistovariability reveals complex growth strategies in a Late Cretaceous enantiornithine." Frontiers in Earth Science 9: 640220.

      Corbin, C. E., L. K. Lowenberger and B. L. Gray (2015). "Linkage and trade‐off in trophic morphology and behavioural performance of birds." Functional ecology 29(6): 808-815.

      Hone, D. W. E., A. A. Farke and M. J. Wedel (2016). "Ontogeny and the fossil record: what, if anything, is an adult dinosaur?" Biology letters 12(2): 20150947.

      Hu, H. and J. K. O'Connor (2017). "First species of Enantiornithes from Sihedang elucidates skeletal development in Early Cretaceous enantiornithines." Journal of Systematic Palaeontology 15(11): 909-926.

      Miller, C. V., M. Pittman, X. Wang, X. Zheng and J. A. Bright (2023). "Quantitative investigation of Mesozoic toothed birds (Pengornithidae) diet reveals earliest evidence of macrocarnivory in birds." iScience 26(3): 106211.

      Olsen, A. M. (2017). "Feeding ecology is the primary driver of beak shape diversification in waterfowl." Functional Ecology 31(10): 1985-1995.

      Schnell, J. H. (2020). Common Black Hawk (Buteogallus anthracinus), version 1.0. Birds of the World. A. F. Poole and F. B. Gill. Ithaca, NY, USA, Cornell Lab of Ornithology.

    1. Author Response

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary

      This fascinating paper by M. Alfatah et al. describes work to uncover novel genes affecting lifespan in the budding yeast S. cerevisiae, eventually identifying and further characterizing a gene, YBR238C, now named AAG1 by the authors. The authors began by considering published gene sets pulled from the Saccharomyces genome database that described increases or decreases in either chronological lifespan or replicative lifespan in yeast. They also began with gene sets known to be downregulated upon treatment with the lifespan-extending TOR inhibitor rapamycin.

      YBR283C was unique in being largely uncharacterized, downregulated upon rapamycin treatment, and linked to both increased replicative lifespan and increased chronological lifespan upon deletion.

      The authors show that YBR283C may act to negatively regulate mitochondrial function, in ways that are both dependent on and independent of the stressresponsive transcription factor Hap4, largely by looking at relative expression levels of relevant mitochondrial genes.

      In a hard-to-fully interpret but well-documented series of experiments the authors note that the two paralogues YBR283C and RMD9 (which have ~66% similarity) (a) have opposite effects when acting alone, and (b) appear to interact in that some phenotypes of ybr283c are dependent on RMD9.

      A particularly interesting finding in light of the current literature and of the authors' strategy in identifying YBR283C is that changes in electron transport chain genes upon rapamycin treatment appear to be affected via YBR283C.

      Based on a series of experiments the authors move to conclude the existence of "a feedback loop between TORC1 and mitochondria (the TORC1-Mitochondria-TORC1 (TOMITO) signaling process) that regulates cellular aging processes."

      Strengths

      Overall, this study describes a great deal of new data from a large number of experiments, that shed light on the potential specific roles of YBR238C and its paralog RMD9 in aging in yeast, and also underscore the potential of an approach looking for "dark matter" such as uncharacterized genes when seining the increasing deluge of published datasets for new hypotheses to test. This work when revised will become a valuable addition to the field.

      Weaknesses

      A paralog of YBR283C, RMD9, also exists in the yeast genome. While the authors indicate that part of their interest in YBR283C lies in its uncharacterized nature, its paralogue, RMD9, is not uncharacterized but is named due to its phenotype of Required for Meiotic nuclear Division, which is not mentioned or discussed anywhere in the manuscript currently.

      In the context of the current work, in addition to the cited Hillen, H.S et al. and Nouet C. et al, the authors might be very interested in the 2007 Genetics paper "Translation initiation in Saccharomyces cerevisiae mitochondria: functional interactions among mitochondrial ribosomal protein Rsm28p, initiation factor 2, methionyl-tRNAformyltransferase and novel protein Rmd9p" (PMID: 17194786), which does not appear to be cited or discussed in the current version of the manuscript.

      Thank you for your thorough and insightful review of our manuscript. We value your positive feedback and recognition of the strengths in our study. Your constructive comments have been carefully considered, leading to the inclusion of RMD9, identified as 'Required for Meiotic Nuclear Division,' and the addition of the relevant reference (PMID: 12586695) in the revised manuscript. This information has been incorporated into the second paragraph of the "The YBR238C paralogue RMD9 deletion decreases the lifespan of cells" results section.

      Furthermore, we appreciate the reviewer's suggestion to include the 2007 Genetics paper on translation initiation in Saccharomyces cerevisiae mitochondria (PMID: 17194786). This citation has been integrated into our revised manuscript.

      We believe that these revisions significantly strengthen the manuscript and address the concerns raised by Reviewer #1. We thank the reviewer for their time and valuable input.

      Reviewer #2 (Public Review):

      The effectors of cellular aging in yeast have not been fully elucidated. To address this, the authors curated gene expression studies to link genes influenced by rapamycin - a well-known mediator of longevity across model systems - to genes known to affect chronological and replicative lifespan (RLS) in yeast. Through their analyses, they find one gene, ybr238c, whose deletion increases both CLS and RLS upon deletion and that is downregulated by rapamycin. Curiously, despite these selection criteria, the authors only use CLS as a proxy for cellular aging throughout their study and do not explore the effects of ybr238c deletion on RLS. This does not diminish their conclusions, but given the importance of this phenotype in their selection criteria, it is surprising that the authors did not choose to test both types of aging throughout their study.

      Nonetheless, the authors demonstrate that deletion of ybr238c increases CLS across multiple yeast strains and through multiple assays. The authors also test the effects of YBR238C overexpression on lifespan and find the opposite effect, with overexpression yeast showing decreased survival relative to wild-type cells, consistent with "accelerated aging" as the authors propose. The authors also note that ybr238c has a paralog, rmd9, whose deletion decreases CLS and seems to be epistatic to ybr238c, as a double ybr238c/rmd9 mutant has decreased CLS relative to a wild-type strain.

      Collectively, the data presented by the authors convincingly demonstrate that ybr238c influences lifespan in a manner that is distinct from (and likely opposite to) rmd9. However, the authors then link the increased CLS in Δybr238c yeast to mitochondrial function using only a handful of assays that do not directly test mitochondrial function. These include total cellular ATP levels, levels of reactive oxygen species, and the transcript levels of select nuclear-encoded mitochondrial genes. Yeast is well established to generate ATP through non-mitochondrial pathways such as glycolysis in fermentive conditions. While it is possible that the ATP levels assayed in the manuscript were tested in stationary phase, which would more likely reflect "mitochondrial function," the methods nor the figure legends contain these details, which are critical for the interpretation of these data. Similarly, ROS can be generated through non-mitochondrial pathways, and the transcription of nuclear-encoded mitochondrial genes is an indirect measure of mitochondrial function at best. Thus, the authors' proposed connection of ybr238c to mitochondrial function is correlative and should be substantiated with assays that more closely align with organellar function, such as respirometry or assaying the activity of oxidiative phosphorylation complexes. Finally, the authors attempt to tie the phenotypes of mitochondrial dysfunction caused by the deletion of ybr238c to TORC1 signaling, as the gene is influenced by rapamycin. However, the presentation of the data, such as reporting ATP levels as relative percentages or failing to perform appropriate statistical comparisons between conditions in which the authors derive conclusions, renders the data difficult to interpret. As such, this manuscript establishes that ybr238c is rapamycin responsive and influences CLS, but its influence on mitochondrial activity and ties to TORC1 signaling remain speculative.

      We would like to express our gratitude to Reviewer #2 for the thoughtful feedback on our manuscript. We have carefully considered your comments and have made comprehensive revisions to address the concerns raised.

      We appreciate the suggestion to investigate the role of YBR238C in replicative lifespan (RLS). However, we want to bring to your attention that four previous studies (references 7, 39, 40, and 41) have already identified the involvement of YBR238C in the RLS phenotype. Given the existing body of literature on this aspect, we chose not to duplicate these efforts in our study.

      Instead, we focused our efforts on validating the role of YBR238C in chronological lifespan (CLS) phenotype, a finding reported in only one genome-wide study (reference 38). To enhance the comprehensiveness of our study, we performed analyses on different phenotypes, including mitochondria activity and oxidative stress, under both logarithmic-phase (condition for RLS) and stationary phase (condition for CLS). We now clearly indicate the logarithmic-phase/stationary phase conditions in the figure legends of the manuscript, specifying whether the conditions are relevant to RLS or CLS. Additional results of the new experiments have been included in the revised manuscript as supplementary figures (S3E-S3I).

      To address concerns about the indirect nature of our mitochondrial function assays, we have performed relative mitochondria content (S3F), quantification of ROS levels from fermentative to stationary phase conditions (S3G), and assessment in respiratory glycerol medium (S3H), which provides a more direct insight into mitochondrial biology. Additionally, we have investigated the resistance of ybr238c∆ cells to H2O2 toxicity and found them to be more resistant compared to wild-type cells.

      We believe these revisions strengthen the scientific rigor and clarity of our study. We sincerely appreciate the guidance from Reviewer #2, and we hope these modifications address the concerns raised effectively.

      Reviewer #3 (Public Review):

      Summary: The study by Alfatah et al. presented a role for YBR238C in mediating lifespan through improved mitochondrial function in a TOR1-dependent metabolic pathway. The authors used a dataset comparison approach to identify genes positively modulating yeast chronological (CLS) and Replicative (RLS) lifespan when deleted, and their expression is reduced under Rapamycin treatment condition. This approach revealed an unknown, mitochondria-localized yeast gene YBR238C, and through mechanistic studies, they identified its paralogous gene RMD9 regulating lifespan in an antagonistic effect.

      Strengths:

      Findings have valuable implications for understanding the YBR238C-mediated, mitochondrial-dependent yeast lifespan regulation, and the interplay between two paralogous genes in the regulation of mitochondrial function represents an inserting case for gene evolution.

      Weaknesses:

      Overall, the implication/findings of this study are restricted only to the yeast model since these two genes do not have any homology in higher eukaryotes. The primary methods must be carefully designed by considering two different metabolic states: respiration-associated with CLS and fermentation-associated with RLS in a single comparative approach. Yeast CLS and RLS are two completely different processes. It is already known that most gene-regulating CLS is not associated with RLS or vice versa. The method section is poorly written and missing important information. The experimental approaches are poorly designed, and variability across the datasets (e.g., media condition "YPD," "SC" etc.) and their experimental conditions are not well described/considered; thus, presented data are not conclusive, which decreases the overall rigor of the study.

      We sincerely appreciate your thorough review of our manuscript and your insightful comments. We acknowledge the limitation of our study being yeast-specific due to the absence of homologous genes in higher eukaryotes. However, we would like to highlight the significance of our findings in revealing a feedback loop between mitochondrial function and TORC1 signaling (TORC1-Mitochondria-TORC1 or TOMITO signaling process) in cellular lifespan regulation.

      Our interpretation of the experimental results is grounded in recent literature. Two studies (references 62 and 63) support our findings by demonstrating TORC1 activation after mitochondrial electron transport chain dysfunction and the delay in brain pathology progression upon TORC1 inhibition, respectively. These studies, discussed in our manuscript, reinforce the relevance of our work in a broader biological context.

      We recognize the importance of carefully designing our primary methods to account for the different metabolic states associated with cellular processes, such as respiration in cellular lifespan (CLS) and fermentation in replicative lifespan (RLS). We want to bring to your attention that four previous studies (references 7, 39, 40, and 41) have already identified the involvement of YBR238C in the RLS phenotype. To avoid duplicating these efforts, we have chosen not to reiterate these findings in our study. However, we have clarified the logarithmic-phase/stationary phase conditions in the figure legends, specifying their metabolic states relevance to RLS or CLS. Additionally, we have included new supplementary figures (S3E-S3I) to provide further details on the new experiments conducted.

      We appreciate your feedback regarding the clarity and completeness of our method section. In the revised manuscript, we have invested additional effort to enhance the clarity of the method section, providing a more detailed account of the experimental procedures, including the missing information you identified.

      We believe these revisions strengthen the scientific rigor and clarity of our study. We sincerely appreciate the guidance from Reviewer #3, and we hope these modifications address the concerns raised effectively.

      Reviewer #1 (Recommendations For The Authors):

      Thank you for your detailed review and valuable recommendations. We have carefully addressed each of your comments in the revised manuscript. The specific changes made include:

      (1) "TORC1 positively regulates aging, and its inhibition increases lifespan in various eukaryotic organisms including yeast and mammalian 13,26,27,29,30." Here I would suggest replacing "mammalian" with "mammals".

      We have amended the sentence as recommended.

      (2) "Next, we experimentally tested whether the transcriptome longevity signatures are associated with enhanced mitochondrial metabolism, whether the cellular energy level has gone up and cellular stress responses are induced with a switch to oxidative metabolism 47,48." Here I would replace "transcriptome longevity signatures is" with "transcriptome longevity signatures are".

      We have amended the sentence as recommended.

      (3) "Thus, HAP4-independent mechanism does exist through which YBR238C also affects cellular aging (Figure 3I)." I would replace "Thus, HAP4-independent" with "Thus, a HAP4-independent".

      We have amended the sentence as recommended.

      (4) "We examined other mitochondrial dysfunctional conditions to confirm that suppressive effect of rapamycin is not only specific to YBR238C-OE." I would change "that suppressive effect" to "that the suppressive effect".

      We have amended the sentence as recommended.

      (5) "Understanding the mechanism of aging will also require to understand the role of many genes of yet unknown function as YBR238C at the beginning of this work." I would switch "require to understand" to "require understanding".

      We have amended the sentence as recommended.

      (6) "The gene lists that modulate cellular lifespan in aging model organism yeast Saccharomyces cerevisiae were extracted from database SGD 22 and GenAge 23 (as of 8th November 2022)" "yeast" should not be italicized.

      Corrected.

      (7) Figure 1, panels C and D, ybr238c should be italicized.

      Corrected.

      (8) Figure 2B, top left-most (oxidative phosphorylation) network. I might consider repositioning some labels to make them more readable if possible.

      Thank you for your feedback. The figure labels in Figure 2B are default from Metascape analysis, so repositioning isn't feasible. However, we have indicated in the figure legends that the full set of genes for functional enrichment analysis and the MCODE complex is available in Additional File 3.

      (9) Figure 4E, rmd9, pet100, and cox6 should be italicized.

      Corrected.

      (10) Figure 5C, rmd9 and rmd9 ybr238c should be italicized. Corrected.

      Reviewer #2 (Recommendations For The Authors):

      Thank you for your detailed review and valuable recommendations. We have carefully addressed each of your comments in the revised manuscript. The specific changes made include:

      (1) The presentation of data as heatmaps (Figures 1F, 3D, 4C, 4G, 5B, 5H, 5L, 6K) obfuscates the quantitative nature of the data. These data would be much stronger if presented as bar graphs with appropriate statistical analysis. If the authors prefer the visual of the heat map, there should be some statistical analysis performed to accompany these figures. This is particularly important for Figure 3D, in which the authors state "We found that HAP4 deletion significantly decrease the ETC complex I-V genes' expression" (bottom of page 8). As no statistical analyses were performed, the authors should refrain from using such language as it is unsupported by the data as analyzed.

      Thank you for your insightful comments and suggestions regarding the presentation of our data. We appreciate the attention you have given to Figures 1F, 3D, 4C, 4G, 5B, 5H, 5L, and 6K.

      In response to your feedback, we have carefully re-evaluated our approach. Considering the large volume of data associated with our lifespan analysis at different time points, we initially chose to visualize it using heatmaps to comprehensively capture the complexity of the results. However, we have now incorporated quantification information into the heatmaps.

      For Figure 3D, which addresses the impact of HAP4 deletion on the expression of ETC complex I-V genes, we have replaced the heatmap with a bar graph. This modification allows for a clearer representation of the quantitative nature of the data. Moreover, we have conducted thorough statistical analyses comparing data between ybr238c∆ and ybr238c∆ hap4∆ to support the statements made in the text. The results of these analyses are now included in the revised figure. Moreover, we also replaced the Figure 6K heatmap with a bar graph.

      We believe that these changes enhance the interpretability and robustness of our findings. We are grateful for your guidance, and we are confident that these adjustments will strengthen the overall quality of our manuscript.

      (2) The presentation of ATP data, given its importance in supporting the core conclusions of this manuscript, is poor. The conditions under which yeast was collected are not reported, making these data impossible to interpret; total cellular ATP levels would be significantly altered and influenced by separate pathways in fermentive versus stationary phases. Minimally, the authors should describe the conditions of yeast growth (e.g., age, culture media) in which these measurements were made. The presentation of relative ATP percentages is problematic, particularly with measurements that deviate so far from wild-type ATP levels in conditions such as those in Figure 6A, in which the authors report that rapamycin induces a 1200% increase in cellular ATP. Previous papers have established that ATP levels in yeast hover around 4 mM and are stable through the cell cycle and across nutrient conditions (PMID: 30858198, 35438635). Given this, the reported ATP levels would be expected to be near 48 mM, which is strongly outside of the typically accepted values of 1-10 mM for this metabolite. Without understanding the contexts in which these measurements are made, as well as the absolute values for these measurements (which would be easily achievable through the use of a standard curve of ATP), these data are uninterpretable. Furthermore, it seems unlikely that yeast would be able to accommodate shifts of ATP levels that span an order of magnitude without dire cellular consequences, particularly during rapamycin treatment.

      We appreciate the valuable feedback from the reviewer regarding the importance of providing detailed information on yeast growth conditions for interpreting ATP data. In response to this suggestion, we have enhanced the figure legends associated with the relevant figures to include a comprehensive description of the yeast growth conditions. This now specifies the age of the culture, culture media composition, and other pertinent parameters.

      In addressing the concern raised about the rapamycin-induced ATP increase, we have carefully re-examined our experimental procedures. We performed additional experiments and confirmed the consistency of our findings in logarithmic-treated cultures. The results remain in alignment with our initial observations, reinforcing the reliability and reproducibility of our data.

      (3) As stated above, the inference of mitochondrial function from cellular ATP levels, cellular ROS levels, and gene expression of a handful of nuclear-encoded genes is not sound. The authors should include further experimentation as evidence of mitochondrial functionality, such as respirometry or metabolic flux experiments.

      Thank you for your constructive feedback on our manuscript. We appreciate your careful consideration of our work. In response to your concerns regarding the indirect nature of our mitochondrial function assays, we have implemented the following changes: We have incorporated additional assays to provide a more direct insight into mitochondrial biology. Specifically, we performed relative mitochondria content analysis (S3F) and quantified ROS levels under fermentative to stationary phase conditions (S3G). These assays offer a more direct and comprehensive assessment of mitochondrial function. Furthermore, we conducted experiments in respiratory glycerol medium (S3H) to complement our previous findings.

      To further support our claims, we investigated the resistance of ybr238c∆ cells to H2O2 toxicity. Our results demonstrate that these cells exhibit increased resistance compared to wild-type cells. This additional evidence strengthens the link between mitochondrial function and cellular response to oxidative stress.

      We believe these adjustments address your concerns and significantly enhance the robustness of our study. We hope you find these modifications satisfactory. We are grateful for your valuable input, which has undoubtedly improved the clarity and reliability of our findings.

      (4) Multiple gene expression analyses are performed on n=2 measurements, and this should be bolstered by further replicates. Many bar graphs do not have accompanying statistics; these should be added. Some statistical tests are performed across inappropriate comparisons, such as Figure 3G, in which expression levels of mitochondrial genes in both deletion and overexpression strains should be compared to a wild-type control rather than to each other.

      Thank you for your thorough review and constructive feedback on our manuscript. We appreciate your careful examination of our work. In response to your comments, we have made the following revisions to address your concerns: The multiple gene expression analysis in our study focused specifically on ETC genes. It is important to note that ETC genes themselves represent multiple replicates within the ybr238c deletion and overexpression cells, as illustrated in Figures 4D, 4G, and 6B.

      We acknowledge and appreciate your observation regarding Figure 3G. To address this concern, we have revised the statistical comparisons. The expression levels of mitochondrial genes in the overexpression strain are now appropriately compared to a wild-type control. This correction has been applied in the figure that correctly corresponds to text in the manuscript.

      (5) Figure 2B is uninterpretable as it stands, as most gene symbols are obscured.

      We appreciate the reviewer's attention to Figure 2B and the feedback provided. Regarding the gene labels in Figure 2B, we would like to clarify that these labels are default outputs from the Metascape analysis, and unfortunately, repositioning them within the current figure layout isn't feasible without compromising the integrity of the information.

      However, we have taken the reviewer's concern seriously and have made efforts to address the interpretability issue. To provide readers with access to the full set of genes for functional enrichment analysis and the MCODE complex, we have included this information in Additional File 3. The figure legends have been updated accordingly to guide readers to refer to Additional File 3 for a more detailed examination of the gene symbols and their annotations.

      We hope that this solution addresses the concern raised by the reviewer.

      (6) The conclusions to be drawn from Figure 3A are not clear, and this figure is cited only once in the text along with two other figures (page 8).

      Thank you for your valuable feedback. We have carefully considered your comments and made revisions to improve the clarity of the conclusions drawn from Figure 3A.

      (7) Figure 6K reports a range of 100-200% cell survival - how does a cell have 200% survival? Isn't survival binary (i.e., you survive or you are dead)? Perhaps this is meant to be relative to another condition; this should be more clearly stated in the figure, or the axis should be normalized to a maximum of 100% survival.

      Thank you for your guidance and valuable feedback. Based on your recommendation, we have made significant changes to Figure 6K in the revised manuscript. Specifically, we replaced the heatmap with a bar graph to enhance clarity. Additionally, we would like to highlight that cell survival of combined treated cells is measured relative to the control treatment, which is considered 100% survival. This aims to provide a more accurate and comprehensible representation of the data. We believe these modifications contribute to a clearer presentation of our findings.

      (8) The authors state that "TORC1 inhibition in yeast and human cells with mitochondrial dysfunction suppresses their accelerated aging." No studies of aging were done in human cells; survival in response to mitochondrial toxins does not reveal aging phenotypes. To state such is a substantial overstatement and should be amended to perhaps "cellular survival" rather than directly linked to aging.

      We appreciate the careful review of our manuscript and the constructive feedback provided by the reviewer. In response to the concern raised regarding the statement about TORC1 inhibition and accelerated aging in human cells, we have revised the relevant passage as follows: "In turn, TORC1 inhibition in yeast and human cells with mitochondrial dysfunction enhances their cellular survival." We believe that this modification accurately reflects the outcomes of our experiments and addresses the concern raised by the reviewer. We would like to express our gratitude for the valuable feedback, which has contributed to the improvement of our manuscript. Thank you for your thoughtful consideration.

      Reviewer #3 (Recommendations For The Authors):

      Thank you for your detailed review and valuable recommendations. We have carefully addressed each of your comments in the revised manuscript. The specific changes made include:

      The authors should have attempted to fully characterize the RLS and CLS phenotype of strains lacking the YBR238C and RMD9 gene, the single most important gene identified in this study. Before further characterization, its association with aging must be tested to replicate findings from the literature. Although Figure 3 shows partially characterized CLS in SC medium, different media conditions could be tested, and the full spectrum of CLS lifespan curves should be represented. RLS phenotypes of these cells were not analyzed throughout the study.

      We appreciate the suggestion to investigate the role of YBR238C in both Replicative Lifespan (RLS) and Chronological Lifespan (CLS). However, it's essential to note that the involvement of YBR238C in the RLS phenotype has been previously documented in four studies (references 7, 39, 40, and 41). Considering the established literature on this matter, we chose not to duplicate these efforts in our study.

      Our primary focus was on confirming the role of YBR238C in the chronological lifespan (CLS) phenotype, as indicated by a genome-wide study (reference 43). Accordingly, we also conducted an analysis of the role of RMD9 in CLS. The methods and figure legends explicitly state that CLS experiments for prototrophic CEN.PK113-7D strains were conducted in synthetic defined (SD) medium containing 6.7 g/L yeast nitrogen base with ammonium sulfate without amino acids and 2% glucose. For auxotrophic BY4743 strains, SD medium was supplemented with histidine (40 mg/L), leucine (160 mg/L), and uracil (40 mg/L).

      It is important to clarify that SC medium was not used for CLS analysis. Instead, we employed SD medium, recommended for CLS analysis (reference 15; PMID: 22768836). The CLS experiments were conducted using three different methods, providing a comprehensive representation of the entire CLS lifespan (Figures 1C, 1D, 1E, and 1F).

      While we did not present the Replicative Lifespan (RLS) phenotype explicitly, we performed experiments such as mitochondrial activity and ROS production under both CLS and RLS conditions. These additional analyses contribute valuable insights into the broader implications of YBR238C and RMD9 on cellular function.

      We believe that these clarifications and the inclusion of additional experimental details enhance the robustness and validity of our findings. We hope these explanations address the concerns raised by the reviewer and contribute to the overall improvement of our manuscript.

      In addition, authors include RNAseq data from Rapamycin-treated cells to identify differentially expressed genes. Notably, genes with decreased expression were used to compare KO strains' lifespan phenotype. Additional RNAseq analyses were performed on individual KO cells. The methodology section needs to be better written with information on which media and metabolic state that these cells are collected after treatment with rapamycin. If the cells are collected during logarithmic growth, the data can be compared with RLS aging gene sets only. A separate experiment has to be performed on stationary cells (respiratory) to collect RNAseq data after rapamycin treatment, then can be compared to the CLS aging gene set.

      Thank you for your insightful comments and considerations regarding our methodology for obtaining Rapamycin response genes (RRGs). We appreciate the opportunity to address your concerns and provide further clarification on our experimental approach.

      As mentioned in our manuscript, we obtained RRGs by treating logarithmic cells with 50 nM Rapamycin for 1 hour, and the details have been included in supplementary Figure S1C legends. Our primary objective was to compare these RRGs with agingassociated genes that modulate both Replicative Lifespan (RLS) and Chronological Lifespan (CLS). We acknowledge the significance of this comparison and believe that our approach, treating logarithmic cells, is suitable for achieving this goal.

      It is important to note that the use of a higher concentration of Rapamycin for treatment renders the cells less efficient in terms of growth, resulting in a very low optical density (OD) at 72 hours, as illustrated in Figure 6H. Unfortunately, due to this limitation in growth efficiency, obtaining Rapamycin response genes at the stationary phase was not feasible in our experimental setup.

      As the experimental conditions vary among the reports and the gene expression signature significantly changes under different metabolic conditions, the media condition that samples are collected for RNAseq analyses should match the media condition that the lifespans of those KO strains are tested. However, more information needs to be detailed on these methodologies. For example, the transcriptomic signature of the YBR238C KO strain should be done under both fermentative and respiratory conditions to understand the true gene expression signature associated with CLS and RLS. Throughout the manuscript, these two metabolic conditions and associated lifespan types (CLS vs. RLS) are not differentiated and treated as the same, probably causing the biggest confounding effect that resulted in the identification of a single yeast-specific gene.

      We obtained the transcriptomic signature of the YBR238C KO strain from logarithmic phase cultures. This consistency was maintained to align with the Rapamycin Response Genes (RRGs) obtained from logarithmic cells treated with rapamycin. Detailed methodology and metabolic status information is provided in the method section and relevant figure legends.

      To broaden the scope of our study, we conducted analyses on various phenotypes, including mitochondrial activity and oxidative stress, under both logarithmic phase (relevant to Replicative Lifespan, RLS) and stationary phase (relevant to Chronological Lifespan, CLS). We have now explicitly indicated the logarithmic phase/stationary phase conditions in the figure legends of the manuscript, specifying their relevance to RLS or CLS.

      Results from these additional experiments have been incorporated into the revised manuscript as supplementary figures (S3E-S3I). We believe that these clarifications and the inclusion of additional experimental details enhance the robustness and validity of our findings. We trust that these explanations effectively address the concerns raised by the reviewer and contribute to the overall improvement of our manuscript.

      YBR238C gene KO effect on mitochondrial function missing comprehensive characterization. Whether the improved mito function caused by increased mtDNA copy number and/or increased mitochondrial number could be easily tested by analyzing normalizing RNAseq reads from mtDNA genes to reads from nucDNA genes. Data could be further combined with western blot specific to mito membrane proteins to analyze mito copy number.

      Thank you for your insightful comments and suggestions. Following your recommendation, we conducted an assessment of relative mitochondrial content (see Figure S3F) and observed significantly higher mtDNA content in the ybr238c∆ compared to the wild type (see Figure S3F). Additionally, we have incorporated the methodology for mitochondrial DNA copy number analysis in the methods section.

      The two paralogous gene interaction is an interesting observation. However, in yeast, it is known that deletion of one of the paralogous genes causes copy number amplification of the certain chromosome that the other paralogous gene is located, causing aneuploid chromosome. Many of the observed phenotypes can be associated with increased chromosome copy number and should be carefully tested. However, the authors did not consider this important point. Simply, using RNA seq data normalized read/per chromosome could be plotted to analyze the karyotype of YBR238C and RMD9 KO cells.

      We appreciate your thoughtful consideration of our work and the suggestion to investigate chromosome copy number variations. While we did not directly test the chromosome copy, we want to highlight that our study extensively explores the impact of YBR238C on cellular lifespan through an RMD9-dependent mechanism (Figure 5). Deletion of YBR238C increases, whereas overexpression of YBR238C decreases the expression of its paralog, RMD9 (Figure 5F). Furthermore, this phenotype is associated with the lifespan of YBR238C-deleted and overexpressed cells. In our study, we have thoroughly investigated this aspect.

    1. Author Response

      The following is the authors’ response to the original reviews.

      We appreciate the care and the detail shown by the Reviewers. Their comments have made our article more focused and more accessible to a general audience.

      We would like to begin with a comment about the last sentence of the “eLife assessment”. The evolution of metamorphosis in insects was a major triumph in animal evolution that subsequently impacted almost every aspect of plant and animal evolution in the terrestrial and freshwater aquatic biospheres. Unlike the metamorphoses of most other groups, whose evolutions are lost in time, insect evolution arose relatively recently (~400 mya) and insect orders have branched off at various points in this evolution and have persisted to modern times. Although these “relic” groups also have undergone millions of years of evolution and specialization, they still provide us with windows into how this progression may have come about. The study of these groups provides a unique opportunity to explore the mechanisms that underlie major life history shifts and should be of interest to anyone interested in evolution – not just entomologists.

      Reviewer #1 (Public Review):

      Summary:

      This paper provides strong evidence for the roles of JH in an ametabolous insect species. In particular, it demonstrates that:

      • JH shifts embryogenesis from a growth mode to a differentiation mode and is responsible for terminal differentiation during embryogenesis. This, and other JH roles, are first suggested as correlations, based on the timing of JH peaks, but then experimentally demonstrated using JH antagonists and rescue thereof with JH mimic. This is a robust approach and the experimental results are very convincing.

      • JH redirects ecdysone-induced molting to direct formation of a more mature cuticle

      • Kr-h1 is downstream of JH in Thermobia, as it is in other insects, and is a likely mediator of many JH effects

      • The results support the proposed model that an ancestral role of JH in promoting and maintaining differentiation was coopted during insect radiations to drive the evolution of metamorphosis. However, alternate evolutionary scenarios should also be considered.

      Strengths:

      Overall, this is a beautiful, in-depth student. The paper is well-written and clear. The background places the work in a broad context and shows its importance in understanding fundamental questions about insect biology. The researchers are leaders in the field, and a strength of this manuscript is their use of a variety of different approaches (enzymatic assays, gene expression, agonists & antagonists, analysis of morphology using different types of microscopy and detection, and more) to attack their research questions. The experimental data is clearly presented and carefully executed with appropriate controls and attention to detail. The 'multi-pronged' approach provides support for the conclusions from different angles, strengthening conclusions. In sum, the data presented are convincing and the conclusions about experimental outcomes are well-justified based on the results obtained.

      Weaknesses:

      This paper provides more detail than is likely needed for readers outside the field but also provides sufficient depth for those in the field. This is both a strength and a weakness. I would suggest the authors shorten some aspects of their text to make it more accessible to a broader audience. In particular, the discussion is very long and accompanied by two model figures. The discussion could be tightened up and much of the text used for a separate review article (perhaps along with Figure 11) that would bring more attention to the proposed evolution of JH roles.

      We appreciate the comments about the strengths and weaknesses of the paper. To deal with the weaknesses, we have condensed some of the Results to make them less cumbersome and the Discussion has been completely revised, keeping a sharp focus on the actions of JH in Thermobia embryos and how these actions relate to the status quo functions of JH in insects with metamorphosis. As part of the revision of the Discussion, we have replaced Figures 10 and 11.

      Reviewer #1 (Recommendations For The Authors):

      In keeping with my public review, this paper is very strong and I have very few suggestions for improvement. They are:

      (1) Thermobia are extant insects and are not ancestral insects. It is likely that they retain features found in an insect ancestor. However, these insects have been evolving for a very long time, and for any one feature, many changes may have occurred, both gain and loss of gene function and morphology. Further, even for morphological features present in an extant species that are the same as an ancestor, genetic pathways regulating this feature may have changed over time (see for examples papers from the Haag and Pick labs). Although I realize this is a small, possibly almost semantic point, I feel it is important to be precise here. For example, in the title, "before" is speculative as there could have been a different role in the ancestor with the role in embryogenesis arising in lineages leading to Thermobia; similarly in the abstract, "this ancestral role of JH' is an overstatement since we cannot actually measure the ancestral role.

      Since the title has already been cited in a Perspectives review, we decided to keep the title as is.

      (2) I don't understand the results in Met and myo in Fig. 3B. Perhaps include them in the explanation of Fig.3 and not after the description of Fig. 4 and explain them in more detail (or perhaps not include them at all?). I don't really understand the statistical analysis of these panels either.

      We have revised the figure legends to explain the statistics.

      (3) Another point regarding language - talking about the embryo being "able" to go through a developmental stage implies decision-making. I would suggest dropping that wording (e.g, in the description of Fig. 5C). Similarly, in explaining Fig. 6B, it would be more correct to say "JH treatment no longer inhibited" than as written "could no longer inhibit" (implying 'no matter how hard it tried, it still couldn't do it')

      We have removed the “can’t” wording. Figure 6 has been revised

      Reviewer #2 (Public Review):

      The authors have studied in detail the embryogenesis of the ametabolan insect Thermobia domestica. They have also measured the levels of the two most important hormones in insect development: juvenile hormone (JH) and ecdysteroids. The work then focuses on JH, whose occurrence concentrates in the final part (between 70 and 100%) of embryo development. Then, the authors used a precocene compound (7-ethoxyprecocene, or 7EP) to destroy the JH producing tissues in the embryo of the firebrat T. domestica, which allowed to unveil that this hormone is critically involved in the last steps of embryogenesis. The 7EP-treated embryos failed to resorb the extraembryonic fluid and did not hatch. More detailed observations showed that processes like the maturational growth of the eye, the lengthening of the foregut and posterior displacement of the midgut, and the detachment of the E2 cuticle, were impaired after the 7EP treatment. Importantly, a treatment with a JH mimic subsequent to the 7EP treatment restored the correct maturation of both the eye and the gut. It is worth noting that the timing of JH mimic application was essential for correcting the defects triggered by the treatment with 7EP.

      This is a relevant result in itself since the role of JH in insect embryogenesis is a controversial topic. It seems to have an important role in hemimetabolan embryogenesis, but not so much in holometabolans. Intriguingly, it appears important for hatching, an observation made in hemimetabolan and in holometabolan embryos. Knowing that this role was already present in ametabolans is relevant from an evolutionary point of view, and knowing exactly why embryos do not hatch in the absence of JH, is relevant from the point of view of developmental biology.

      The unique and intriguing aspect of juvenile hormone is its status quo action in the control of metamorphosis. Our reason for dealing with an insect group that branched off from the line of insects that eventually evolved metamorphosis, was to gain insight into the ancestral functions of this hormone. Our data from Thermobia as well as that from grasshoppers and crickets indicate that the developmental actions of JH were originally confined to embryogenesis where it promoted the terminal differentiation of the embryo. Its actions in promoting differentiation also included suppressing morphogenesis. This latter function was not pronounced during embryogenesis because JH only appeared after morphogenesis was essentially completed. However, it was a preadaptation that proved useful in more derived insects that delayed aspects of morphogenesis into the postembryonic realm. JH was then used postembryonically to inhibit morphogenesis until late in juvenile growth when JH disappears, and this inhibition is released.

      Then, the authors describe a series of experiments applying the JH mimic in early embryogenesis, before the natural peak of JH occurs, and its effects on embryo development. Observations were made under different doses of JHm, and under different temporal windows of treatment. Higher doses triggered more severe effects, as expected, and different windows of application produced different effects. The most used combination was 1 ng JHm applied 1.5 days AEL, checking the effects 3 days later. Of note, 1.5 days AEL is about 15% embryonic development, whereas the natural peak of JH occurs around 85% embryonic development. In general, the ectopic application of JHm triggered a diversity of effects, generally leading to an arrest of development. Intriguingly, however, a number of embryos treated with 1 ng of JHm at 1.5 days AEL showed a precocious formation of myofibrils in the longitudinal muscles. Also, a number of embryos treated in the same way showed enhanced chitin deposition in the E1 procuticle and showed an advancement of at least a day in the deposition of the E2 cuticle.

      While the experiments and observations are done with great care and are very exhaustive, I am not sure that the results reveal genuine JH functions. The effects triggered by a significant pulse of ectopic JHm when the embryo is 15% of the development will depend on the context: the transcriptome existing at that time, especially the cocktail of transcription factors. This explains why different application times produce different effects. This also explains why the timing of JHm application was essential for correcting the effects of 7EP treatment. In this reasoning, we must consider that the context at 85% development, when the JH peaks in natural conditions and plays its genuine functions, must be very different from the context at 15% development, when the JHm was applied in most of the experiments. In summary, I believe that the observations after the application of JHm reveal effects of the ectopic JHm, but not necessarily functions of the JH. If so, then the subsequent inferences made from the premise that these ectopic treatments with JHm revealed JH functions are uncertain and should be interpreted with caution.

      We disagree with the reviewer. An analogous situation would be in exploring gene function in which both gain-of-function and loss-of-function experiments often provide complementary insights into how a gene functions. We see JH effects only when its receptor, Met, is present and JH can induce its main effector protein, Kr-h1. The latter gives us confidence that we are looking at bona fide JH effects. We have also kept in mind, though, that the nature of the responding tissues is changing through time. Nevertheless, we see a consistent pattern of responses in the embryo and these can be related to its postembryonic effects in metamorphic insects.

      Those inferences affect not only the "JH and the progressive nature of embryonic molts" section, but also, the "Modifications in JH function during the evolution of hemimetabolous and holometabolous life histories" section, and the entire "Discussion". In addition to inferences built on uncertain functions, the sections mentioned, especially the Discussion, I think suffer from too many poorly justified speculations. I love speculation in science, it is necessary and fruitful. But it must be practiced within limits of reasonableness, especially when expressed in a formal journal.

      We have tried to dial back the speculation.

      Finally, In the section "Modifications in JH function during the evolution of hemimetabolous and holometabolous life", it is not clear the bridge that connects the observations on the embryo of Thermobia and the evolution of modified life cycles, hemimetabolan and holometabolan.

      Our Figure 12 should put this into perspective.

      Reviewer #2 (Recommendations For The Authors):

      Main points

      (1) Please, reduce the level of overinterpretation of ectopic treatment experiments with JHm, since the resulting observations represent effects, but not necessarily functions of JH.

      We have revised this section to indicate that the “effects” of ectopic treatments provide insights into the function of JH. Using a genetic analogy, both “loss-of-function” and “gain-of-function” experiments provide insights into a given gene. (see response to Public Comments)

      (2) Especially in the sections "JH and the progressive nature of embryonic molts" and "Modifications in JH function during the evolution of hemimetabolous and holometabolous life histories", and the entire "Discussion", please keep the level of speculation within reasonable limits, avoiding especially the inference of conclusions on the basis of speculation, itself based on previous speculation.

      We have toned down some of the speculation and provided reasons why it is worth suggesting.

      (3) Please revisit the argued roles of myoglianin in the story, in light of its effects as an inhibitor of JH production, repressing the expression of JHAMT, as has been reliably demonstrated in hemimetabolan species (DOI: 10.1073/pnas.1600612113 and DOI: 10.1096/ fj.201801511R).

      Our appreciation to the reviewer. We are more explicit about the relationship between JH and myo.

      Minor points

      (4) Please keep the consistency of the scientific binomial nomenclature for the species mentioned. For example, read "Manduca sexta" (in italics) at the first mention, and then "M. sexta" (in italics) in successive mentions (instead of reading "Manduca" on page 17, and then "Manduca sexta" on page 18, for example). The same for "Drosophila" ("Drosophila melanogaster" first, and then "D. melanogaster"), "Thermobia" ("Thermobia domestica" first, and then "T. domestica"), etc. In the figure legends, I recommend using the complete name: Thermobia domestica, in the main heading.

      Where there is no possibility of confusion, we intend to use Thermobia, rather than T. domestica, etc. We think that it is easier for a non-specialist to read and it is commonly done in endocrine papers.

      (5) There is no purpose in evolution and biological processes. Thus, I suggest avoiding expressions that have a teleological aftertaste. For example (capitals are mine), on p. 3 "appears to have been extended into postembryonic life where it acts TO antagonize morphogenic and allow the maintenance of a juvenile state".

      We have tried to avoid teleological wording.

      (6) The title "The embryonic role of juvenile hormone in the firebrat, Thermobia domestica, reveals its function before its involvement in metamorphosis" contains a redundancy ("role" and "function"), and an apparent obviousness ("before its involvement in metamorphosis"). I suggest a more straightforward title. Something like "Juvenile hormone plays developmental functions in the embryo of the firebrat Thermobia domestica, which predate its status quo action in metamorphosis".

      As noted above, we are retaining the title since it has already been cited.

      (7) Page 2. "The transition from larva to adult then occurred through a transitional stage, the pupa, thereby providing the three-part life history diagnostic of the "complete metamorphosis" exhibited by holometabolous insects (reviews: Jindra, 2019; Truman & Riddiford, 2002, 2019)". I suggest adding the reference ISBN: 9780128130209 9 7 8 - 0 - 1 2 - 8 1 3 0 2 0 - 9, as the most comprehensive and recent review on complete metamorphosis.

      Done

      (8) Page 3. "These severe developmental effects suggest that the developmental role of JH in insects was initially CONFINED to the embryonic domain" (capitals are mine). This appears contradictory with the observations of Watson, 1967, on the relationships between the apparition of scales and JH, mentioned shortly before by the authors.

      This is explained in the Discussion. Although JH can suppress scale appearance in the J4 stage, we have not been able to show that scales appearance is caused by changes in the juvenile JH titer.

      (9) Page 4. "we measured JH III levels during Thermobia embryogenesis at daily intervals starting at 5 d AEL". Why not before, like in the case of ecdysteroids? The authors might perhaps argue that the levels of Kr-h1 expression are consistently low from the very beginning, according to Fernandez-Nicolas et al, 2022 (reference cited later in the manuscript).

      (10) Page 4. "Ecdysteroid titers through embryogenesis and the early juvenile instars were measured using the enzyme immunoassay method (Porcheron et al., 1989) that is optimized for detecting 20-hydroxyecdysone (20E)". The antibody generated by Porcheron (and now sold by Cayman) recognizes ecdysone and 20-hydroxyecdysone alike. But that's not relevant here. I would refer to "ecdysteroids" when mentioning measurements. Also in figure 2B (and "juvenile hormone III" without the formula, in Panel A, for harmonization). And I would not expand on specifications, like those at the beginning of page 5, or towards the end of page

      We thank the reviewer for this important correction.

      (12) ("the fact that we detected only a slight rise in ecdysteroids at this time (Fig 2B) is likely due to the assay that we used being designed to detect 20E rather than ecdysone").

      Omitted.

      (11) Page 5. "Low levels of Kr-h1 transcripts were present at 12 hr after egg deposition, but then were not detected until about 6 d AEL when JH-III first appeared". There is a very precise Kr-h1 pattern in Fernandez-Nicolas et al. 2023 (reference mentioned later in the manuscript).

      (12) Page 5. "notably myoglianin (myo), have become prominent as agents that promote the competence and execution of metamorphosis in holometabolous and hemimetabolous insects (He et al., 2020; Awasaki et al., 2011)". See my note 3 above.

      The myoglianin issue has been revised.

      (13) Page 5. "a drug that suppresses JH production". Rather, "a drug that destroys the JH producing tissues". Why the way, do the authors know when the CA are formed in T. domestica embryo development?

      We prefer to keep our original wording. There have been some cases in which precocene has blocked JH production but did not kill the CA cells. We do not have observations that show that 7EP kills the CA cells in Thermobia embryos.

      (14) Page 5. "subsequent treatment with a JHm". I would say here that the JHm is pyriproxyfen, not on page 6 or page 7. Thus, to be consistent, after the first mention of "pyriproxyfen (JHm)" on page 5, I'd consistently use the abbreviation "JHm".

      (15) Page 9. "Limb loss in such embryos was often STOCHASTIC, i.e., in a given embryo some limbs were completely lost while others were maintained in a reduced state" (capitals are mine). The meaning of "stochastic" is random, involving a random variable; it is a concept usually associated to probability theory and related fields. I suggest using the less specialized word "variable", since to ascertain that the values are really stochastic would require specific mathematical approaches.

      We are still using stochastic because the loss is random.

      (16) Page 10. "9E). Indeed, the JH treatment redirects the molt to be more like that to the J2 stage, rather than to the E2 (= J1) stage". Probably too assertive given the evidence available (see my points 1 and 2 above).

      We do not see a problem with our conclusion. In response to the JHm treatment, the embryo produced a smooth, rather than a “pebbly” cuticle, failed to make the J1-specific egg tooth, and attempted to make cuticular lenses (a J2 feature). This ability of premature JH exposure to cause embryos to “skip” a stage is also seen in locusts (Truman & Riddiford, 1999) and crickets (Erezyilmaz et al., 2004). The JHm treatment resulted in the production of smooth cuticle, lack of a hatching tooth, and an attempt to make cuticular lenses.

      (17) Page 11. "early JHM treatment", read "early JHm treatment".

      Corrected

      (18) Page 11. "likely. A target of JH, and likely Kr-h1, in Thermobia is myoglianin...". Please see my notes 1, 2, and especially 3, above.

      This has been revised

      (19) Page 13. "the locust, Locusta americana (Aboulafia-Baginshy et al.,1984)". Please read "the locust, Locusta migratoria (Aboulafia-Baginshy et al.,1984)".

      Corrected

      (20) Page 13 "Acheta domesticus" three times. The correct name now is "Acheta domestica", after harmonizing the declension of the specific name with the generic one. See additionally my note 4 above.

      Acheta domesticus has been used in hundreds (thousands?) of papers since it was originally named by Linnaeus. We will continue to use it.

      (21) Page 15, "(also called the vermiform larva (Bernays, 1971) redirects embryonic development to form an embryo with proportions, cuticular pigmentation, cuticular sculpturing and bristles characteristic of a nymph, while pronymph modifications, such as the cuticular surface sculpturing (Bernays, 1971)". The reference "Bernays, 1971" is indeed "Bergot et al., 1971".

      There was a mistake in the references. The Bernays reference was omitted from the revised Discussion

      (22) Page 16. "Since JH also induces Kr-h1 in embryos of many insects, including Thermobia". I'm not sure that this has been studied in many insects. In any case, any reference would be useful.

      (23) Page 17. "Tribolium casteneum". Please read "Tribolium castaneum".

      Changed

      (24) Page 17. "...results in a permanent larva that continues to molt well after it has surpassed its critical weight (He et al., 2019)". The paper of He et al., 2019 is preceded by two key papers that previously demonstrate (and in hemimetabolan insects) that myoglianin is a determining factor in the preparation for metamorphosis: DOI: 10.1073/pnas.1600612113 and DOI: 10.1096/ fj.201801511R). See my note 3 above.

      Corrected in revision

      (25) Page 18. "These persisting embryonic primordia join the wing primordia in delaying their morphogenesis into postembryonic life". This reader does not understand this sentence.

      Made clearer in the revision.

      (26) Page 18. "is first possible in the commercial silkworm (Daimon et al., 2015)". Please mention the scientific Latin name of the species, Bombyx mori.

      (27) Page 19. "The functioning of farnesol derivatives in growth versus differentiation control extends deep into the eukaryotes.../... this capacity was eventually exploited by the insects to provide the hormonal system that regulates their metamorphosis". This information appears quite out of place.

      We have retained this point.

      (28) Page 21. Heading "Hormones". I suggest using the heading "Bioactive compounds", as neither pyriproxyfen nor 7-ethoxyprecocene are hormones.

      Done

      (29) Page 29, legend of figure 1. "Photomicrographs" is somewhat redundant. The technical word is "micrographs". "Thermobia domestica" appears in the explanation of panel C, but this is not necessary, as the name appears in the main heading of the legend.

      Done

      (30) Page 30, legend of figure 2. Panel B, see my comment 10 above. Why embryonic age is expressed in % embryo development in panel C (and in days in panels A and B)?

      All have been converted to days AEL

      (31) Page 35, legend of figure 5. "Photomicrograph" see my note 28 above.

      Done

      (32) Page 40, figure 10. In panel A, the indication of the properties of JH is misleading. The arrow going to promoting differentiation and maturation is OK, but the repression sign that indicates suppression of morphogenetic growth and cell determination seems to suggest that JH has retroactive effects. In panel B, I suggest to label "Flies" instead of "Higher Diptera", which is an old-fashioned term. In any case, see my general comments 1 and 2, above, about speculation.

      Figure has been completely revised

      (33) Figure 11. See my general comments 1 and 2, above, about speculation.

      Figure has been revised

      Reviewer #3 (Public Review):

      Summary:

      In this manuscript, the authors use inhibitors and mimetics of juvenile hormone (JH) to demonstrate that JH has a key role in late embryonic development in Thermobia, specifically in gut and eye development but also resorption of the extraembryonic fluid and hatching. They then exogenously apply JH early in development (when it is not normally present) to examine the biological effects of JH at these stages. This causes a plethora of defects including developmental arrest, deposition of chitin, limb development, and enhanced muscle differentiation. The authors interpret these early effects on development as JH being important for the shift from morphogenetic growth to differentiation - a role that they speculate may have facilitated the evolution of metamorphosis (hemi- and holo-metaboly). This paper will be of interest to insect evo-devo researchers, particularly those with interests in the evolution of metamorphosis.

      Strengths:

      The experiments are generally conducted very well with appropriate controls and the authors have included a very detailed analysis of the phenotypes.

      The manuscript significantly advances our understanding of Thermobia development and the role of JH in Thermobia development.

      The authors interpret this data to present some hypotheses regarding the role of JH in the evolution of metamorphosis, some aspects of which can be addressed by future studies.

      Weaknesses:

      The results are based on using inhibitors and mimetics of JH and there was no attempt to discern immediate effects of JH from downstream effects. The authors show, for instance, that the transcription of myoglianin is responsive to JH levels, it would have been interesting to see if any of the phenotypic effects are due to myoglianin upregulation/suppression (using RNAi for example). These kinds of experiments will be necessary to fully work out if and how the JH regulatory network has been co-opted into metamorphosis.

      We agree completely and should be a feature of future work.

      The results generally support the authors' conclusions. However, the discussion contains a lot of speculation and some far-reaching conclusions are made about the role of JH and how it became co-opted into controlling metamorphosis. There are some interesting hypotheses presented and the author's speculations are consistent with the data presented. However, it is difficult to make evolutionary inferences from a single data point as although Thermobia is a basally branching insect, the lineage giving rise to Thermobia diverged from the lineages giving rise to the holo- and hemimetabolous insects approx.. 400 mya and it is possible that the effects of JH seen in Thermobia reflect lineage-specific effects rather than the 'ancestral state'. The authors ignore the possibility that there has been substantial rewiring of the networks that are JH responsive across these 400 my. I would encourage the authors to temper some of the discussion of these hypotheses and include some of the limitations of their inferences regarding the role of JH in the evolution of metamorphosis in their discussion.

      We have tried to be less all-encompassing in the Discussion. The strongest comparisons can be made between ametabolous and hemimetabolous insects and we have focused most of the Discussion on the role of JH in that transition. We still include some discussion of holometabolous insects because the ancestral embryonic functions of JH may be somehow related to the unusual reappearance of JH in the prepupal period. We have reduced this discussion to only a few sentences.

      Reviewer #3 (Recommendations For The Authors):

      (1) The overall manuscript is very long (especially the discussion), and the main messages of the manuscript get lost in some of the details. I would suggest that the authors move some of the results to the supplementary material (e.g. it might be possible to put a lot of the detail of Thermobia embryogenesis into the supplementary text if the authors feel it is appropriate). The discussion contains a lot of speculation and I suggest the authors make this more concise. One example: At the moment there is a large section on the modification in JH function during the evolution of holo and hemi-metabolous life history strategies. There are some interesting ideas in this section and the authors do a good job of integrating their findings with the literature - but I would encourage the authors to limit the bulk of their discussion to the specific things that their results demonstrate. E.g. The first half of p17 contains too much detail, and the focus should be on the relationship with Thermobia (as at the bottom of p17).

      Section has been revised and is more focused

      (2) I would also suggest a thorough proofread of the manuscript, I have highlighted some of the errors/points of confusion that I found in the list below - but this list is unlikely to be exhaustive . We appreciate catching the errors. Hopefully the final version is better proofed.

      (3) It might be me, but I found the wording in the second half of the abstract a bit confusing. Particularly the statement about the redeployment of morphogen systems - could this be stated more clearly?

      Abstract has been revised.

      (4) Introduction

      a. "powered flight" rather than 'power flight'

      Done

      b. 'brought about a hemimetabolous lifecycle' implies causality which hasn't been shown and directionality to evolution - suggest 'facilitated the evolution of a hemi...". Similar comment for 'subsequent step to complete metamorphosis'.

      c. Bottom of p2 - unclear whether you are referring to hemi- holo- or both

      d. Suggest removing sentence beginning "besides its effects..." as the relevance of the role of JH in caste isn't clear.

      Kept sentence but removed initial clause

      e. State that Thermoia is a Zygentoma.

      Done

      f. Throughout - full species names on first usage only, T. domestica on subsequent usages.

      We will continue to use genus names for the reason given above.

      Gene names e.g. kr-h1 in italics.

      g. 'antagonise morphogens"? rather than 'antagonise morphoentic'.

      Done

      (5) Results

      a. Unclear why drawings are provided rather than embryonic images in Fig. 1A

      We think that the points can be made better with diagrams.

      b. Top of p4, is 'slot' the correct word?

      Corrected

      c. Unclear why the measurements of JHIII weren't measured before 5 days AEL, especially given that many of the manipulative experiments are at earlier time points than this. I appreciate that, based on kr-h1, levels that JHIII is also likely to be low.

      d. Reference for the late embryonic peak of 20E being responsible for the J2 cuticle?

      Clarified that this is an assumption

      e. Clarify "some endocrine related transcripts" why were these ones in particular picked? Kr-h1 is a good transcriptional proxy for JH and Met is the JH-receptor, why myoglianin and not some of the other transcriptional proxies of neuroendocrine signalling?

      Hopefully, the choice is clearer.

      f. Fig 2C rather than % embryo development for the gene expression data please represent this in days (to be consistent with your other figures).

      It is now consistent with other parts of figure.

      g. In Fig. 3 the authors do t-tests, because there are three groups there needs to be some correction for multiple testing (e.g. Bonferroni) can the authors add this to the relevant methods section?

      We think that pair-wise comparisons are appropriate.

      h. Fig. 3 legend: you note that you treat stage 2 juveniles with 7EP - I couldn't tell what AEL this corresponded to.

      This is after hatching so AEL does not apply.

      i. Top of p7 'deformities' rather than 'derangements'?

      Done

      j. Regarding the dosage effects of embryonic abnormalities - it would be good to include these in the supp material, as it convinces the reader that the effects you have seen aren't just due to toxicity.

      It is not clear what the objection is.

      k. Bottom of p7 'problematic' not 'problematical'

      Done

      l. P8 Why are the clusters of Its important? - provide a bit more interpretation for the reader here.

      This is clear in the revised version.

      m. P9 Why is the modulation of transcription of kr-h1, met, and myo important in this context

      Explained

      n. P9 'fig. 7F'? there is no Fig. 5F

      Thanks for catching the typo.

      o. Fig. 7B add to the legend which treatment the dark and light points correspond to.

      We think it is obvious from the labeling on Fig 7B.

      (6) Discussion:

      a. What do we know about how terminal differentiation is controlled in non-insect arthropods? Most of the discussion is focused on insects (which makes sense as JH is an insect-specific molecule), but if the authors are arguing the ancestral role of JH it would be useful to know how their findings relate to non-insect arthropods.

      We have not been able to find any information about systemic signals being involved in non-insect arthropods.

      b. There is no Fig. 5E (are they referring to 7E?)

      Yes, it should have been Fig. 7E.

      c. Is myoglianin a direct target of JH in other species?

      Other reports are in postembryonic stages and show that myoglianin suppresses JH production. Our paper is the first examination in embryos and we find that the opposite is true – i.e., that JH treatment suppresses myoglianin production. We suspect that these two signaling systems are mutually inhibitory. It would be interesting to see whether treatment of a post-critical weight larva with JH (which would induce a supernumerary larval molt) would also suppress myoglianin production (as we see in Thermobia embryos).

      d. P12 What is the evidence that JH interacts with the first 20E peak to alter the embryonic cuticle?

      We are not sure what the issue is. The experimental fact is that treatment with JH before the E1 ecdysteroid peak causes the production of an altered E1 cuticle. We are faced with the question of why is this molt sensitive to JH when the latter will not appear until 3 or 4 days later? A possible answer is that the ecdysone response pathway has a component that has inherent JH sensitivity. The mosquito data suggest that Taiman provides another link between JH and ecdysone action

      e. Top of p13 - this paragraph can be cut down substantially. Although this is evidence that JH can alter ecdysteriods - it is in a species that is 400 my derived from the target species. Is it likely to be the exact same mechanism? I would encourage the authors to distil and retain the most important points.

      This paragraph has been shortened and focused.

      f. Bottom of p13 - what does this study add to this knowledge?

      The response of Thermobia embryos to JH treatment is qualitatively the same as seen in other short germband embryos. This similarity supports the assumption that the same responses would have been seen in their last common ancestor.

      g. P19 the last paragraph in the conclusions is really peripherally relevant to the paper and is a bit of a stretch, I would encourage the authors to leave this section out.

      We agree that it is a stretch. JH and its precursor MF are the only sesquiterpene hormones. How did they come about to acquire this function? We think it is worth pointing out the farnesol metabolites have been associated with promoting differentiation in various eukaryotes. An ancient feature of these molecules in promoting (maintaining?) differentiation may have been exploited by the insects to develop a unique class of hormones. It is worth putting the idea out to be considered.

      h. P19 "conclusions" rather than 'concluding speculations'.

      Changed as suggested.

      Methods:

      It is standard practice to include at least two genes as reference genes for RT-qPCR analysis (https://doi.org/10.1186/gb-2002-3-7-research0034, https://doi.org/10.1373/clinchem.2008.112797) If there are large-scale differences in the tissues being compared (e.g. as there are here during development) then more than two reference genes may be required and a reference gene study (such as https://doi.org/10.3390%2Fgenes12010021) is appropriate. Have the authors confirmed that rp49 is stably expressed during the stages of Thermobia development that they assay here?

      We have explained our choice in the Methods.

    1. Author Response

      The following is the authors’ response to the original reviews.

      eLife assessment

      This study is valuable as it sheds light on the pivotal role played by alterations in glycan metabolism within chondrocytes in the onset of cartilage degeneration and early onset of osteoarthritis (OA) through the process of hypertrophic differentiation of chondrocytes, giving insights into the identification of nascent markers for early-stage OA. Although the methods, data, and analyses broadly support the claims, the data shown by the authors are incomplete because the mechanism by which cartilage degeneration induced by changes in glycometabolism occurs has not been fully elucidated. The authors' deductions stand to gain further credence through undertaking additional experiments aimed at analyzing the mechanisms underlying the changes in glycometabolism in cartilage, such as the meticulous identification of the target glycan molecules bearing core fucose and analysis of endochondral ossification in cartilage-specific Fut8 KO mice.

      We wish to express our strong appreciation to the Reviewer for his or her insightful comments on our paper. We feel the comments have helped us significantly improve the paper. In particular, we wish to acknowledge the Reviewer’s highly valuable comments on the effect of Fut8 on endochondral ossification.

      Reviewer #1 (Public Review): :<br /> Summary:

      This study is valuable in that it may lead to the discovery of future OA markers, etc., in that changes in glycan metabolism in chondrocytes are involved in the initiation of cartilage degeneration and early OA via hypertrophic differentiation of chondrocytes. However, more robust results would be obtained by analyzing the mechanisms and pathways by which changes in glycosylation lead to cartilage degeneration.

      Strengths:

      This study is important because it indicates that glycan metabolism may be associated with pre-OA and may lead to the elucidation of the cause and diagnosis of pre-OA.

      We thank reviewer #1 for their interest in our work and their overall positive report.

      Weaknesses:

      More robust results would be obtained by analyzing the mechanism by which cartilage degeneration induced by changes in glycometabolism occurs.

      To understand the mechanisms of cartilage degeneration induced by changes in glycometabolism, we attempted additional experiments using rescue experiments with external administration of TGF-β. We had shown that the addition of mannosidase to an organ culture system of normal wild-type mouse cartilage increased TGF-β gene expression from 6 hours (Fig. 3E) and that TGF-β expression was even suppressed in chondrocytes from Fut8 cKO mice (Fig. 4D). In addition to these results, an early OA model in which mannosidase is added to the cartilage was used to test the effect of exogenous TGF-β. As a result, under TGF-β treated conditions, no degenerative changes occurred when high-mannose type N-glycans were trimmed, and proteoglycan leakage during the recovery period was significantly reduced. This was considered to be a very useful finding and it was decided to include the experimental results in Figure 4F, rather than making them supplement data.

      Reviewer #2 (Public Review):

      Summary:

      This paper consists of mostly descriptive data, judged from alpha-mannosidase-treated samples, in which they found an increase in core fucose, a product of Fut 8.

      Strengths:

      This paper is interesting in the clinical field, but unfortunately, the data is mostly descriptive and does not have a significant impact on the scientific community in general.

      We thank reviewer #2 for their interest in our work and their overall positive report. In response to your comment about our attempts to show that glycan changes occur at the precursor stage of cartilage substrate degeneration and that this glycosylation is also what triggers substrate degeneration, we would like to add that reversing cartilage substrate degeneration is a very ambitious challenge. We are currently in the preparatory stages of characterizing the appropriate glycan-substrate relationships to 'rescue' cartilage tissue from degeneration, and we hope to use this approach to provide information on the pre-developmental stages of OA.

      Weaknesses:

      If core fucose is increased, at least the target glycan molecules of core fucose should be evaluated. They also found an increase in NO, suggesting that inflammatory processes also play an important role in OA in addition to glycan changes.

      As the increase in NO was observed in the organ culture system and cartilage is a tissue without vascular invasion, we thought that the involvement of immune cells could be excluded. On the other hand, our research group has reported that chondrocytes themselves have inflammatory circuits (Ota et al., Arthritis Rheum. 2019. DOI:10.1002/art.41182), but as we did not find increased expression of NF-κB, an indicator of inflammatory amplifier activation, we concluded that inflammation was not involved in this study.

      It has already been reported that core fucose is decreased by administration of alpha-mannosidase inhibitors. Therefore, it is expected that alpha-mannosidase administration increases core fucose.

      The report by Toegel et al. that the synthesis of complex-type N-glycans (Man2a1, Mgat2) is predicted in human OA chondrocytes along with the expression of Fut8 also led to the expectation that administration of α-mannosidase would increase core fucose. However, there was no conclusive evidence that administration of α-mannosidase increased core fucose; in 1987, Vignon et al performed an enzyme assay on experimental OA cartilage (rabbit ACLT model) and showed that mannosidase was very high in operated joints and that its activity increased and decreased with the severity of fibrosis in the cartilage. The results suggest that glycoprotein hexose degradation is an early transient event in the enzymatic process of cartilage destruction. These findings led to the conception of a novel 'pre-OA model' in which mannosidase is added to the joint. The present study is valuable in its demonstration that glycometabolism is a driver of degeneration.

      (see manuscript REF. 25, 9)

      Toegel et al., Arthritis Res. Ther. 2013. DOI:10.1186/ar4330

      Vignon et al., Clin Rheumatol. 1987. DOI:10.1007/BF02201026

      Reviewer #3 (Public Review):

      Summary:

      In the manuscript "Articular cartilage corefucosylation regulates tissue resilience in osteoarthritis", the authors investigate the glycan structural changes in the context of pre-OA conditions. By mainly conducting animal experiments and glycomic analysis, this study clarified the molecular mechanism of N-glycan core fucosylation and Fut8 expression in the extracellular matrix resilience and unrecoverable cartilage degeneration. Lastly, a comprehensive glycan analysis of human OA cartilage verified the hypothesis.

      Strengths:

      Generally, this manuscript is well structured with rigorous logic and clear language. This study is valuable and important in the early diagnosis of OA patients in the clinic, which is a great challenge nowadays.

      We thank reviewer #3 for their interest in our work and their mainly positive report. This is precisely the purpose of our study, as we are primarily interested in the detection of conditions prior to the onset of OA.

      Weaknesses:

      I recommend minor revisions:

      (1) I would suggest the authors prepare an illustrative scheme for the whole study, to explain the complex mechanism and also to summarize the results.

      We would like to thank the reviewer for this comment and have created a new Figure 7 for the overall study scheme.

      We included the following statement in the opening discussion part:

      "The objective of this work was to provide novel and translational insights into pathogenesis of OA associated with changes in glycan structure. A graphical abstract summarizing our findings is shown in Fig. 7." (line199-201, p9)

      (2) Including but not limited to Figures 2A-C, Figures 3A and C, Figure 4B, and Figures 5A and D. The texts in the above images are too small to read, I would suggest the authors remake these images.

      The font size of the figures has been reviewed and revised throughout.

      (3) The paper is generally readable, but the language could be polished a bit. Several writing errors should be realized during the careful check.

      Thanks to your suggestion, I have noticed several writing errors. In addition, we have had the manuscript rewritten by an experienced scientific editor, who has improved the grammar and stylistic expression of the paper.

      (4) As several species and OA models were conducted in this study, it would be better if the authors could note the reason behind their choice for it.

      The authors agree with the reviewer's argument that since several species and OA models were performed in this study, it would be better to note the reason for their choice.

      We first attempted to inject mannosidase into rabbits, matching the animal species to a previous paper showing that N-glycans are altered prior to degeneration of the cartilage matrix. Next, we checked whether similar changes occur in mouse cartilage after mannosidase treatment, assuming that we would verify this in genetically engineered mice. We then used the integrated glycome in human cartilage to see if the corefucosylation phenomenon detected was conserved across species.

      For the modeling of OA in Fut8 cKO mice, the instability-induced OA model and the age-associated OA model were adapted. The former emphasizes mechanical stress factors in OA, the latter aging factors. OA is a multifactorial disease. Therefore, we thought it was appropriate to validate both aspects of OA.

      We included the following statements in each Methods part:

      "We injected mannosidase into rabbit knee joints in accordance with a previous paper showing that N-type glycans are altered prior to cartilage matrix degeneration." (line289-290, p12)

      "Organ culture experiments in mice were established to study the effects of mannosidase on articular cartilage without immunoreaction and in anticipation of later candidate gene research using transgenic mice." (line326-328, p14)

      "To determine whether the glycosylation detected is conserved across species, we analyzed the total glycome in human cartilage." (line407-408, p17)

      We included the following statements in the Discussion part:

      "For the modeling of OA in Fut8 cKO mice, the instability-induced OA model and the age-associated OA model were adapted. The former emphasizes mechanical stress factors in OA, the latter aging factors. OA is a multifactorial disease. Therefore, we thought it was appropriate to validate both aspects of OA." (line254-257, p11)

      Reviewer #1 (Recommendations For The Authors):

      (1) The cited literature states that core fucosylation by FUT8 has a chondroprotective effect via the TGF-β pathway and that the loss of these chondroprotective effects in Fut8 led to cartilage degeneration, but these need to be proven by experiment.

      We agree that corefucosylation and the TGF-β signaling pathway are important lines of investigation. We have now acknowledged this and added in the revised manuscript that additional experiments have shown that TGF-β restores the protective effects of Fut8 cKO cartilage by external administration.

      We included the following statements in the Results part:

      "To evaluate whether TGF-β1 decreases cartilage degeneration after mannosidase stimulation, TGF-β1 was exogenously added to Col2-Fut8−/− cartilage in the presence of α-mannosidase stimulation for 24 h. The samples treated with TGF-β1 leaked significantly less PG following mannosidase stimulation compared to samples not treated with TGF-β1 (Fig. 4F)." (line143-147, p6-7)

      We included the following statements in the Discussion part:

      "Here, the exogenous addition of TGF-β1 rescued them from cartilage degeneration." (line274-275, p12)

      (2) There are skeletal differences in cartilage-specific Fut8 KO mice compared to WT, and the effect of Fut8 on endochondral ossification should also be analyzed.

      We agree that Fut8 is associated with various endochondral ossification processes (for example by the TGF-β signaling pathway). Moreover, we would like to thank the reviewer for the proposed experiment.

      The growth curve was normal at birth, with differences beginning around weaning (~3 w for mice). Therefore, we evaluated the epiphyseal line of 4-week-old mice stained with toluidine, type 10 collagen, and proliferating cell nuclear antigen. This is similar to the epiphyseal growth plate phenotype of Smad3ex8/ex8 mice by Yang et al. and is consistent with the finding that Smad3 deficiency does not affect chondrogenesis during developmental stages, but the hypertrophic zone is increased in 3-4 week-old Smad3 KO mice. Chondrocytes in Fut8 cKO mice were suppressed of Tgf-β expression (Fig. 4D), suggesting that inhibition of TGF-β signaling, which is suppressive for late hypertrophic chondrocyte differentiation, led to the increased height of the hypertrophic zone.

      The results suggested that the growth plate of Fut8 cKO mice had an enlarged hypertrophic layer and decreased primary trabecular bone. Because these results have important implications for the content of the paper, we have included the staining results in Figure 5 and added a graph quantitatively assessing the extent of the hypertrophic zone as supplementary Figure S6.

      We included the following statement in the Results part:

      "To assess the role of FUT8 in endochondral ossification, we performed an epiphyseal plate analysis of 4-week-old Col2-Fut8−/− mice. This uncovered a significant enlargement of the zone of hypertrophic chondrocytes in the growth plates of the long bones of Col2-Fut8−/− mice compared to controls (Fig. 5C, S6 Figure)." (line154-158, p7)

      We included the following statement in the Discussion part:

      "The high-mannose/corefucosylation relationship estimated function to maintain formed cartilage. In endochondral ossification, the Fut8 cKO growth plate had an enlarged hypertrophic zone and reduced primary spongiosa because it is involved in the next process of cartilage replacement into bone rather than the process of cartilage formation." (line214-217, p9)

      Literature mentioned above (not included in manuscript):

      Yang X, et al. TGF-beta/Smad3 signals repress chondrocyte hypertrophic differentiation and are required for maintaining articular cartilage. J Cell Biol. 2001;153(1):35–46.

      (3) The DMM model analysis is performed with n=5 for each group. Please consider if the sample size is sufficient.

      In the literature, the sample sizes for DMM models have varied in previous studies (Doyran et al., n=5; Liao et al., n=6-7; Ouhaddi et al., n=8). Therefore, we performed a preliminary test of the DMM in WT and Flox mice with n=3 each and a power analysis with the outcome set to the OARSI score at 8 weeks. This resulted in n=4. The sample size for this study was increased to n=5 to account for attrition. The summed OARSI score of the WT in this study was comparable to that of Ouhaddi et al. and the model was judged to be working accurately. The summed OARSI score of the WT in this study was comparable to that of Ouhaddi et al. and the model was judged to be working accurately. The summed OARSI score of the WT in this study was comparable to that of Ouhaddi et al. and the model was judged to be working accurately.

      Literature mentioned above (not included in manuscript):

      (1) Doyran B, Tong W, Li Q, Jia H, Zhang X, Chen C, et al. Nanoindentation modulus of murine cartilage: a sensitive indicator of the initiation and progression of post-traumatic osteoarthritis. Osteoarthr Cartil. 2017;25(1):108–17.

      (2) Liao L, Zhang S, Gu J, Takarada T, Yoneda Y, Huang J, et al. Deletion of Runx2 in Articular Chondrocytes Decelerates the Progression of DMM-Induced Osteoarthritis in Adult Mice. Sci Rep. 2017 24;7(1):2371.

      (3) Ouhaddi Y, Nebbaki SS, Habouri L, Afif H, Lussier B, Kapoor M, et al. Exacerbation of Aging-Associated and Instability-Induced Murine Osteoarthritis With Deletion of D Prostanoid Receptor 1, a Prostaglandin D2 Receptor. Arthritis Rheum. 2017;69(9):1784–95.

      Reviewer #2 (Recommendations For The Authors):

      This paper is suitable for publication in clinical Journals related to osteoarthritis and cartilage.

      Identification of core fucosylated glycans from chondrocytes is essential for this type of paper.

      We mentioned that we had identified similar corefucosylated glycans in isolated mouse chondrocytes from the cartilage (line117-118, p5), but we have now also added the following to the subtitle of the Results section to avoid any potential confusion: "Corefucosylated N-glycan was formed in resilient cartilage and its isolated chondrocyte" (line109, p5)

      Thank you again for your comments on our paper. We trust that the revised manuscript is suitable for publication.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The coupling between cell polarity and cell cycle progression is an important aspect of symmetric and asymmetric cell division. Although there are several examples of cell cycle kinases phosphorylating polarity proteins, it has been difficult to assess the importance of these on cell division due to the strong and pleiotropic effects of manipulating these kinases. Here, the authors generate an analogue-sensitive allele of cdk1 in flies to tackle this question in neuroblasts (NBs) and sensory organ precursors (SOPs), two well characterised examples of asymmetric cell divisions. They show that partial Cdk1 inhibition (which still allows cell cycle progression) does not block Bazooka (PARD3 in mammals) polarization in NBs, but prevents coalescence of the Baz crescent, which has previously been shown to be an actomyosin-based process. They further identify a Cdk1 consensus site on Baz (S180) for which they generate a phospho-specific antibody, allowing them to show that this site is specifically phosphorylated in dividing NBs and SOPs. Although mutations at this site do not recapitulate the effect of Cdk1 on Baz coalescence, they do delay Miranda polarization in NBs and affect lateral inhibition and asymmetric cell division of SOPs. Finally, the authors show that human PARD3 can also be phosphorylated by Cyclin B/Cdk1 in vitro.

      Major comments:

      • Figure 2A: it would be good to show that polarization of Baz::GFP in consecutive divisions is maintained in cdk1as2 animals in the absence of 1-NA-PP1. We now show in Fig S2B a panel with two consecutive divisions of a cdk1as2 neuroblast in the absence of 1-NAP-PP1, followed by a third division in the presence of 1-NAP-PP1. The neuroblast shows high levels of Baz polarization in the two first divisions.

      • The interpretation of the observed SOP phenotypes is complicated by the uneven expression of the pnr-GAL4 driver and the fact that it is expressed in epithelial cells rather than just SOPs. The authors could express their control and mutant Baz constructs under the control of neurP72-GAL4. It is not likely they would be able to deplete endogenous Baz as they have done in NBs, as neurP72-GAL4 is expressed too late to deplete most proteins before SOP division, but they could at least look at localization of the mutants and any possible gain-of-function phenotypes.

      Following this suggestion, we have recombined Neur-GAL4 with UAS-delta RNAi to attempt to deplete both endogenous Baz::mScarlet and Delta while expressing our Baz::GFP constructs specifically in SOPs. Baz::mScarlet depletion was surprisingly efficient considering, as the reviewer points out, the late timing of Neur-GAL4 expression. However, the adult flies did not present any sensory organs transformations, perhaps because Delta might not be as efficiently depleted. We can at least rule out dominant-negative effects.

      We thank the reviewer for his constructive feedback and as suggested, we now extensively analysed the localisation of the Baz-S180 mutants in SOPs and found significant defects. We describe these observations in a new Figure 6. Briefly, we observed that the Baz phosphomutants have localisation defects during the pIIa cell division but not the pI cell division. We also observed a very surprising mosaicism of expression of our UASz-driven constructs within the SOP lineage that allowed us to make a few interesting observations which should be of interest to SOP specialists. Briefly, mosaic expression of Baz::GFP within the SOP lineage allowed to analyse the relative contributions of pIIa and pIIb/pIIIb to different Baz cortical pools and revealed an unexpected cell non-autonomous mechanism controlling pIIb division orientation. We describe these findings in a new associated supplemental figure.

      The authors speculate that Baz phosphorylation during lateral inhibition may be the reason for the observed excess specification of SOPs in the S180 mutants. This could easily be tested by looking at their antibody staining at earlier stages in the notum. Following this suggestion (also coming from Reviewer #2), we have stained nota between around 8h APF. We observed that patches of cells of the early notum display a strong Baz-pS180 phospho-signal. These patches partially overlap with the Delta-positive stripes in which lateral inhibition occurs (as described for example in (Corson et al., 2017), consistent with the possibility that Baz-S180 phosphorylation does somehow regulate lateral inhibition.

      These new experiments clearly show that Baz can be phosphorylated on S180 in cells that do not divide asymmetrically. This led us to change the title.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Cell polarization in dividing cells, including stem cells, is typically coupled such that polarity can inform the architecture, orientation, and/or asymmetry during cell division. In Drosophila neural stem cells (neuroblasts/SOP), Par polarity is coupled to the cell cycle, but the nature of this coupling remains unclear. In this work, Loyer and colleagues report on impacts of CDK1 inhibition on Bazooka/Par3 localization and basal fate determinant localization. They provide evidence for a novel phosphorylation site that appears unique to asymmetrically dividing cells and may be involved in regulation of asymmetric division. Finally, they show that CDK1 can, at least in principle, phosphorylate human Par3 in vitro.

      Overall, the major claims of the abstract appear supported by the experimental work; however, we think the title overstates the overall conclusions that can be drawn from the work.

      Major comments:

      • The major claim of the paper is the role of specific phosphorylation of S180 in asymmetrically dividing cells in polarization and sensory organ formation, which relies heavily on interpretation of S180A/D phosphomutants. The experiments are carefully performed and quantified, and are consistent with the conclusions drawn. However, we wondered if it possible that the phenotypes are not linked to phosphorylation (the authors acknowledge this in the Discussion)? In other words could the A/D mutants simply be weak Baz mutants? This could this potentially explain the extra-SOP phenotype if Baz function is generally altered, especially given that it is difficult to rationalise a role for SOP-specific phosphorylation in the processes that specify SOP cells in the precursor epithelial cells. The authors speculate that these early precursors may exhibit also phosphorylation, but this isn't examined. Chasing this down seems key to support the core titular claim of the paper. Following this suggestion (also coming from Reviewer #1), we have stained nota around 8h APF. We observed that patches of cells of the early notum display a strong Baz-pS180 phospho-signal. These patches partially overlap with the Delta-positive stripes in which lateral inhibition occurs (as described for example in Corson et al., 2017). This result is presented in Fig. 5H. As would be the case for any phosphomutant, this does not strictly rule out that the S180A and S180D could simply be weak Baz mutants, but it strongly supports the possibility that the lateral inhibition defects observed in these mutants result from defective Baz-S180 phosphorylation.

      • Implicit in the core message of the paper is the elucidation of CDK1 regulation of polarity and specifically Baz. However, the connection between CDK1 and S180 (and Baz regulation overall) is relatively tenuous in this work. First, the S180A mutant does not phenocopy CDK1 inhibition with respect to basal determinant phenotypes, though obviously CDK1 may be more pleiotropic. Second, whether the CDK1 inhibition phenotype is linked to any effect on Baz/PAR behaviour is not really explored. Third, they do not test whether S180 phosphorylation is CDK1-dependent. We fully agree with these comments. We cannot think of any way of addressing the first two points, which would require fully inhibiting CDK1 and somehow maintaining neuroblasts in mitosis to examine how it impacts Baz localisation. We tried to arrest neuroblasts in mitosis and block the proteasome as this at least in HeLa cells led to persistence of mitosis when CDK1 was inhibited (Skoufias et al., 2007). However, neuroblasts arrested in mitosis by proteasome inhibition slipped out of mitosis.

      However, concerning the third point, we now provide evidence showing that, at least in vitro, Drosophila BazS180 is phosphorylated by CDK1 (see below).

      The method for quantifying domain signal only references prior work and should be described in this work. From our search of the cited reference, it appears to be peak signal intensity at a user specified point on the cortex. While this does not undermine the core findings as presented, it may not capture additional features that may be informative (domain size, fluorescence distribution, total signal etc.). For example domain coalescence would imply smaller, brighter domains, but similar total protein amounts, which appears to be the case from images, but isn't quantified per se. We now describe our method for quantifying average signal intensity in the middle of the Baz crescents. We agree that quantifying additional features to check whether they are affected by partial CDK1 inhibition would be interesting. However, doing so requires determining exactly where Baz crescents start and end. As Baz crescent edges in neuroblasts often end in a gradient rather than a sharp edge (Hannaford et al., 2018), we are not sure to be able to confidently do so in every case with the image quality of our dataset: we prioritised limiting photobleaching to accurately quantify the levels of endogenously expressed Baz rather than obtaining very sharp and high contrast images. This is further complicated by the fact that, depending on the depth of neuroblasts within the tissue and the orientation of their division relative to the imaging plane, the signal intensity of Baz crescents is quite variable, preventing a simple thresholding approach to arbitrarily determine the limits of crescents based on signal intensity. In short, accurately determining the size of crescents is very challenging.

      The phosphospecific antibody signal is relatively weak, leading to relatively low signal to noise, which could compromise the ability to detect phospho-S180 in non-asymmetrically dividing cells or generally in cells in which Baz is not polarised and thus signal would be diffused around the cell rather than concentrated. Similar caveats could also apply to the lack of signal in interphase cells, where Baz may be less enriched at the cortex and not polarized. We are inclined to believe the authors conclusions, particularly given their examination of multiple cell types and tissues. However, it is a potential caveat as it may be most visible in polarised cells where it is asymmetrically enriched. We thank the reviewer for pointing this out. Given the fact that Baz levels at the neuroepithelial cells adherens junctions are similar, we are confident that Baz-S180 is phosphorylated in dividing neuroblasts but not in non-mitotic epithelial cells, which is at least consistent with our new finding that CDK1 phosphorylates Baz-S180 in vitro. However, we agree that we cannot strictly rule out that Baz-S180 is phosphorylated but below a detection threshold in mitotic neuroepithelial cells as cortical Baz levels decrease in these cells.

      We have also gathered new data showing that, in the early notum, Baz-S180 is detected in epithelial cells that are not dividing asymmetrically, definitely ruling out the notion that Baz-S180 is strictly ACD-specific. We have changed the title of the paper accordingly, toned down the mention of apparently ACD-specific Baz-S180 phosphorylation in the abstract and now describe and discuss the fact that the apparent ACD-specificity of Baz-S180 phosphorylation is context-specific.

      Examination of in vitro phosphorylation of human Par3D (Figure 6) seems out of place and does not add much. It is human, not Bazooka. They reveal 30 sites, 18 of which in both replicates, but most are not obvious CDK sites and the S180 equivalent site is missing. None of these sites is validated in vivo, at least in this work.

      We fully agree with these comments. We initially attempted to purify both full length Baz and human PARD3 but only managed to purify small amounts of PARD3, which is why our analysis was limited to human PARD3. To circumvent these difficulties, we instead purified a smaller N-terminal fragment of Baz and PARD3, which was successful for both proteins and gave us much higher quantities of sample for analysis. Using two different approaches (Western blot with our phospho-specific antibody on Baz and targeted mass spectrometry on Baz and PARD3), we now show in a new Figure 7 that CDK1 phosphorylates Baz-S180 and PARD3-S187 in vitro.

      Minor comments: Figure 1: Uses metaphase arrested cells, presumably colcemid, but colcemid is only noted in Figure 2. We now mention Colcemid in the legend of Figure 1. - Figure 2A: Scale bar is truncated. We have corrected this. - Figure 2A: Example images of control neuroblasts could be useful to readers. We now show control neuroblasts in Figure 2A. - Figure 2G' vs H': Because G' has two panels and H' has only one, we often confused the PKC and Mira box plots when comparing to Numb. Perhaps Mira could be in a separate sub panel or be more closely juxtaposed with Numb? The quantification of the Mira signal is now right next to Numb. - Whereas both Numb/Mira were examined in CDK1(as), only Mira is reported for the S180A/D experiments. Is there a Numb phenotype as well?

      We actually co-stained Numb and Miranda in the dataset that we analysed in the S180A/D experiments shown in Fig 4E, F. We did not analyse Numb localisation in the first version we submitted because of a penetration issue of the Numb antibody: the Numb signal fades extremely fast as we image deeper in the tissue, causing large difference of signal intensity even within a single cell. This prevents us from performing any meaningful quantitative measurement of the Numb signal like the one we did in Fig. 2H, K, for which we did not encounter this issue. All our further immunostaining experiments with this antibody have had the same problem since then, even after using Triton concentrations up to 4% for permeabilization.

      Nonetheless, following the reviewer’s question, we have at least performed a simple qualitative analysis of Numb localisation in this experiment. We observed that Numb localised to the basal pole in most cases in controls and Baz phosphomutants, but localised uniformly at the cortex in half the cases where Miranda showed very low levels of polarisation in metaphase in BazS180D mutants. This Numb localisation defect suggests a loss of function of the PAR complex whereas, intriguingly, the Miranda localisation defect suggests a gain of function of the PAR complex. These new observations are described in Fig. 4G-H’.

      • The discussion of the notch / Baz phenotypes (Figure 5) is rather complicated and a bit difficult to follow. We agree with this, we have rewritten this part. This is further simplified by our new observation that Baz-S180 is phosphorylated in the early notum during lateral inhibition.

      • Figure 5A: captions should indicate that RFP RNAi is depleting Baz. We have modified the figure accordingly.

      • Box plots are used, but not described. i.e. outliers seem to be marked, but criteria unclear. Mean vs median, etc. We now describe boxplots in the legend in the first instance they are used (Fig 2A’), and in the material and methods
      • Some grammatical mistakes:
      • Title: neuroblast (no 's'),
      • Page 1: Cell fate difference(s?) in the resulting daughter cells
      • Page 4: (As) CDK1 inhibition with 10 μM 1-NA-PP1 prevents neuroblasts from cycling and causes metaphase- arrested neuroblasts to slip out of mitosis. (Reword)
      • Page 6: increased levels of basal fate(no 's') determinants

      We have corrected these mistakes.

      Reviewer #2 (Significance (Required)):

      The links between cell cycle and cell polarity are clearly important and remain poorly understood. Hence, the work addresses key conceptual/mechanistic questions relevant to our fundamental understanding of stem cell biology and regulation of polarity and asymmetric cell division. In our opinion, there are clearly some interesting observations in the manuscript, the experiments are performed carefully, and the data are generally well described. That said, overall, the work seems somewhat premature.

      The direct impact of CDK1 on Baz behaviour remains somewhat unclear. The authors do a good job of limiting the concentration of inhibitor to decouple effects of cell cycle progression from CDK1 levels per se, but this does potentially impact the strength of the phenotypes they can detect and hence the observed phenotypes are relatively minor. Note that driving cells out of mitosis with stronger CDK1 inhibition clearly impacts Baz localization, so the 'real' effect of CDK1 inhibition on Baz could be stronger than reported here. It is also unclear whether the phenotypes observed are directly linked to CDK1 regulation of PAR polarity or an indirect effect of cell cycle control of other processes. The authors' suggestion that it could be related to defects in cortical actin organization, which is known to be cell cycle controlled, seems most likely, but neither this or other models are explored further. We agree but are not aware of any experiment that would allow testing full inhibition of CDK1 on membrane-bound Baz in mitotic neuroblasts. As mentioned above in our response to reviewer #1 we tried to arrest neuroblasts in mitosis and block the proteasome as this at least in HeLa cells led to persistence of mitosis when CDK1 was inhibited (Skoufias et al., 2007). However, neuroblasts arrested in mitosis by proteasome or Colcemid or both slipped out of mitosis upon inhibition of CDK1.

      We agree it would be interesting to study how CDK1 affects the actomyosin network in neuroblasts but feel that this is somewhat beyond the scope of the manuscript.

      Using phosphospecific antibodies, they report on a novel putative CDK1 phosphorylation site, but aside from looking like a consensus CDK1 site, whether this site is CDK1 dependent is not examined. Notably, the corresponding phosphomutants have modest effects and don't obviously account for the CDK1 inhibition phenotype, leaving it somewhat unclear whether it is under cell cycle regulation. We now provide a new figure 7 to address this point. As mentioned already above, using two different approaches (Western blot with our phospho-specific antibody on Baz and targeted mass spectrometry on Baz and PARD3 using), we now show in a new Figure 7 that CDK1 phosphorylates Baz-S180 and PARD3-S187 in vitro. Again, we cannot identify any experiment that would allow us testing whether S180 Baz is a direct target of CDK1 in vivo. The fact that we now report significant defects on Baz localisation in pIIa divisions, strongly suggests functional relevance and CDK1 seems a plausible kinase based on the new in vitro results.

      The observation that S180 phosphorylation appears unique to asymmetrically dividing cells is very curious, but this observation is not followed up extensively. Again phenotypes of phosphomutants are quite modest, and while one can propose models to rationalise the phenotypes observed, these models are not fully explored. As mentioned above, we now show that Baz-S180 phoshorylation is not strictly ACD-specific and changed the title accordingly. We also have new data showing that the S180 phosphomutants of Baz have localisation defects in mitotic pIIa divisions (new figure 6). Therefore, this phosphorylation event on Baz can be linked to Baz’s cortical localisation and interestingly shows context dependency.

      The findings that human Par3D can be phosphorylated by CDK1 in vitro do not add much particularly as they obtain a very large number of putative sites raising questions of specificity, the sites are not validated, and an S180 equivalent site was not identified. We agree that this has been a weakness which we feel we have addressed. We paste here the answer already provided above when replying to reviewer #1.

      We initially attempted to purify both full length Baz and human PARD3 but only managed to purify small amounts of PARD3, which is why our phospho-proteomics analysis was limited to human PARD3. To circumvent these difficulties, we instead purified a smaller N-terminal fragment of Baz and PARD3, which was successful for both proteins and gave us much higher quantities of sample for analysis. Using two different approaches (Western blot with our phospho-specific antibody on Baz and phosphor proteomics on Baz and PARD3 using mass spectrometry), we now show in a new Figure 7 that CDK1 phosphorylates Baz-S180 and PARD3-S187 in vitro.

      References

      CORSON, F., COUTURIER, L., ROUAULT, H., MAZOUNI, K. & SCHWEISGUTH, F. 2017. Self-organized Notch dynamics generate stereotyped sensory organ patterns in Drosophila. Science, 356.

      HANNAFORD, M. R., RAMAT, A., LOYER, N. & JANUSCHKE, J. 2018. aPKC-mediated displacement and actomyosin-mediated retention polarize Miranda inDrosophilaneuroblasts. eLife, 7__,__ 166.

      SKOUFIAS, D. A., INDORATO, R. L., LACROIX, F., PANOPOULOS, A. & MARGOLIS, R. L. 2007. Mitosis persists in the absence of Cdk1 activity when proteolysis or protein phosphatase activity is suppressed. J Cell Biol, 179__,__ 671-85.

    1. Author Response

      The following is the authors’ response to the original reviews.

      We highly thank the editor and reviewers for their time and insightful comments and suggestions. We have made revisions by performing additional experiments and analysis, and clarified the items based on the suggestions.

      Reviewer #1 (Public Review):

      Summary of Author's Objectives:

      The authors aimed to explore JMJD6's role in MYC-driven neuroblastoma, particularly in the interplay between pre-mRNA splicing and cancer metabolism, and to investigate the potential for targeting this pathway.

      Strengths:

      (1) The study employs a diverse range of experimental techniques, including molecular biology assays, next-generation sequencing, interactome profiling, and metabolic analysis. Moreover, the authors specifically focused on gained chromosome 17q in neuroblastoma, in combination with analyzing cancer dependency genes screened with Crispr/Cas9 library, analyzing the association of gene expression with prognosis of neuroblastoma patients with large clinical cohort. This comprehensive approach strengthens the credibility of the findings. The identification of the link between JMJD6-mediated premRNA splicing and metabolic reprogramming in MYC-driven cancer cells is innovative.

      (2) The authors effectively integrate data from multiple sources, such as gene expression analysis, RNA splicing analysis, JMJD6 interactome assay, and metabolic profiling. This holistic approach provides a more complete understanding of JMJD6's role.

      (3) The identification of JMJD6 as a potential therapeutic target and its correlation with the response to indisulam have significant clinical implications, addressing an unmet need in cancer treatment.

      Weaknesses:

      (1) The manuscript contains complex technical details and terminology that may pose challenges for readers without a deep background in molecular biology and cancer research. Providing simplified explanations or additional context would enhance accessibility.

      We have provided simplified explanations for some terminology.

      (2) It would be beneficial to explore whether treatment with JMJD6 inhibitors, both in vitro and in vivo, can effectively target the enhanced pre-mRNA splicing of metabolic genes in MYC-driven cancer cells.

      Unfortunately, there is no potent and selective JMJD6 inhibitors available.

      Reviewer #3 (Public Review):

      Summary:

      Jablonowski and colleagues studied key characteristics of MYC-driven cancers: dysregulated pre-mRNA splicing and altered metabolism. This is an important field of study as it remains largely unclear as to how these processes are coordinated in response to malignant transformation and how they are exploitable for future treatments. In the present study, the authors attempt to show that Jumonji Domain Containing 6, Arginine Demethylase And Lysine Hydroxylase (JMJD6) plays a central role in connecting pre-mRNA splicing and metabolism in MYC-driven neuroblastoma. JMJD6 collaborates with the MYC protein in driving cellular transformation by physically interacting with RNA-binding proteins involved in pre-mRNA splicing and protein regulation. In cell line experiments, JMJD6 affected the alternative splicing of two forms of glutaminase (GLS), an essential enzyme in the glutaminolysis process within the central carbon metabolism of neuroblastoma cells. Additionally, the study provides in vitro (and in silico) evidence for JMJD6 being associated with the anti-proliferation effects of a compound called indisulam, which degrades the splicing factor RBM39, known to interact with JMJD6.

      Overall, the findings presented by Jabolonowski et al. begin to illuminate a cancer-promoting metabolic, and potentially, a protein synthesis suppression program that may be linked to alternative pre-mRNA splicing through the action of JMJD6 - downstream of MYC. This discovery can provide further evidence for considering JMJD6 as a potential therapeutic target for the treatment of MYC-driven cancers.

      Strengths:

      Alternative Splicing Induced by JMJD6 Knockdown: the study presents evidence for the role of JMJD6 in alternative splicing in neuroblastoma cells. Specifically, the RNA immunoprecipitation experiments demonstrated a significant shid from the GAC to the KGA GLS isoform upon JMJD6 knockdown. Moreover, a significant correlation between JMJD6 levels and GAC/KGA isoform expression was identified in two distinct neuroblastoma cohorts. This suggests a causative link between JMJD6 activity and isoform prevalence.

      Physical Interaction of JMJD6 in Neuroblastoma Cells: The paper provides preliminary insight into the physical interactome of JMJD6 in neuroblastoma cells. This offers a potential mechanistic avenue for the observed effects on metabolism and protein synthesis and could be exploited for a deeper investigation into the exact nature, and implications of neuroblastoma-specific JMJD6 protein-protein interactions.

      Weaknesses:

      There are several areas that would benefit from improvements with regard to the current data supporting the claims of the paper (i.e., the conclusion presented in Figure 8).

      Neuroblastoma Modelling Strategy: The study heavily relies on cell lines without incorporating patient derived cells/biomaterials. Using databases to fill gaps in the experimental design can only fortify the observations to a certain extent. A critical oversight is the absence of non-cancerous control cells in many figures, and the rationale for selecting specific cell lines for assays/approaches remains somewhat unclear. A foundational control for such experiments should involve the non-transformed neural crest cell line, which the authors have readily available. Are the observed splicing and metabolic effects of JMJD6 specific to neuroblastoma? Is there a neuroblastoma-specific JMJD6 interactome? Is MYC function essential?

      In Vivo Modelling: The inclusion of a genetic mouse model combined with an inducible JMJD6 knockdown, would enhance the study by allowing examination of JMJD6's role during both tumor initiation and growth in vivo. For instance, the TH-MYCN mice overexpressing MYCN in neural crest cells, could be a promising choice.

      Dependence on Colony Formation Assay: The study leans on 2D and semi-quantitative colony formation assays to assess malignant growth. To validate the link between the mechanistic insights discussed (e.g., reduced protein synthesis) and JMJD6-mediated malignant growth as a potential therapeutic target, evidence from in vivo or representative 3D models would be crucial.

      Data Presentation and Rigor: The presented data is predominantly qualitative and necessitates quantification. For instance, Western blots should be quantified. The RNAseq, metabolism, and pulldown data should be transparently and numerically presented. The figure legends seem elusive and their lack of transparency (oden with regards to biological repeats, error bars, cell line used etc.) is concerning. Adequate citation and identification of all data sources, including online resources, are imperative. The manuscript would also benefit from a more rigorous depiction and quantification of RNA interference of both stable and transient knockdowns with quantitative validation at mRNA and protein levels.

      Novelty Concerns: The emphasis on JMJD6 as a novel neuroblastoma target is contingent on the new mechanistic revelations about the JMJD6-centered link between splicing, metabolism, and protein synthesis. Given that JMJD6 has been previously linked to neuroblastoma biology, the rationale (particularly in Figure 1) for concentrating on JMJD6 may stem more from bias rather than data-driven reasoning.

      Depth of Mechanistic Investigation: Current evidence lacks depth in key areas such as JMJD6-RNA binding. A more thorough approach would involve pinpointing specific JMJD6 binding sites on endogenous RNAs using techniques such as cross-linking and immunoprecipitation, paired with complementary proximity-based methodologies. Regarding the presented metabolism data, diving deeper into metabolic flux via isotope labeling experiments could shed light on dynamic processes like TCA and glutaminolysis. As it stands, the 'pathway cartoon' in Figure 6d appears overly qualitative.

      Response: We agree with this reviewer that more in-depth studies are needed to understand the biological functions of JMJD6 in neuroblastoma. We have included one paragraph “limitation of the study” to point out that additional work needs to be done to address the comments from this reviewer.

      We have also added details in figure legend to increase rigor.

      Reviewer #1 (Recommendations For The Authors):

      In this study, Jablonowski and colleagues identify the link between JMJD6-mediated pre-mRNA splicing and metabolic reprogramming in cancer cells, with implications for therapeutic response to splicing inhibitors. I have reviewed your manuscript and found it quite promising. However, there are some specific points that require further clarification and additional experiments. Please consider the following comments:

      Major concerns:

      (1) Regarding Figure 1d and e: to enhance the robustness of your findings, it would be beneficial to include additional datasets, such as the Kocak-649 dataset. It is important to narrow down the analysis to high-risk patient groups when examining survival rates, specifically to investigate whether the elevated expression of the 114 gene signature correlates with poor survival within this subgroup. Additionally, please consider conducting a more detailed breakdown of the subsets depicted in Fig. 1b to explore the association between their expression levels and patient survival rates.

      Response: We have included the Kocak-649 datasets as Supplemental Figure 1. We have further analyzed the 114 gene signature in low-risk and high-risk patients, respectively, as Supplemental Figure 2.

      (2) Fig. 2b: Similar to the previous comment, it would strengthen your findings to include survival rate analysis in more datasets, particularly in high-risk patient groups.

      Response: We have further analyzed the association of JMJD6 with survival in low-risk and high-risk patients, respectively, as Supplemental Figure 3. Regardless of the risk factors, high expression of JMJD6 was associated with a poor outcome.

      (3) In reference to Fig. S1D, please clarify the time point under investigation. It looks like siRNAs were utilized in this study. Ensure consistency between the siRNA # mentioned in the methods section and what is presented in Fig. S1d.

      Response: We have clarified the time point under investigation in Fig. S1D (now as Fig. S4D). We have corrected the siRNA# on the method section.

      Additionally, it would be beneficial to include data on knockdown efficacy and consider incorporating western blot results, similar to those presented in Fig. 2c.

      Response: These experiments were performed as shown in Figure 4C. We assumed the knockdown efficiency was comparable.

      Furthermore, I recommend analyzing the RNA-seq data from JMJD6-depleted BE(2)C cells to identify any alterations in the expression of neuronal differentiation signature genes, with the aim of exploring potential associations with changes in cell morphology showed in Fig. S1D.

      Response: We have analyzed the data and indeed like this reviewer expected, we do see the upregulation of neuronal differentiation pathways. We have included the data as Fig. S7B.

      (4) Fig. 4g: Confirm whether the data is related to GAC, and if so, where is the data for KGA?

      Response: We apologize for this. KGA data was missed when we assembled the figure. We have added back as Figure 4H.

      (5) In relation to Fig. 4, I suggest conducting experiments to individually silence GAC and KGA, if feasible (for instance, by targeting their 3'-UTRs). This would allow for a more in-depth investigation into whether GAC and KGA play essential roles in NB cell proliferation.

      Response: As this reviewer suggested, we have performed the experiments to knock down GAC and KGA in BE2C cells, and we found that both isoforms seemed to be important for cell survival. We have included the data as Figure 5G-I. Additionally, we have also performed RNA-seq to understand the differential functions of GAC and KGA in neuroblastoma cells when they were overexpressed separately. We have included the data as Figure 5E,F, and Supplemental Figure 9.

      (6) Fig. 5c: Could this protein synthesis reduction be attributed to an artificial overexpression of JMJD6? It would be interesting to investigate whether the genetic silencing of JMJD6 has an impact on total protein synthesis.

      Response: This is a great question but could be very challenging to have a definitive answer. Since cells are not happy with knockdown of JMJD6, we may have a secondary effect resulting from activation of cell death. While we have successfully generated single cell JMJD6 CRISPR KO clones, the cells are not happy either. In the future, we may generate dTAG knockin cell line which will allow us to induce an acute protein degradation, and then we can assess if JMJD6 loss will consequently impact total protein synthesis.

      (7) Fig. S7: the authors have shown that knocking down of JMJD6 in NB cells reduced cell proliferation (Fig. 2c-e). Please clarify how you obtained sufficient cells ader CRISPR knockout of JMJD6 clones and whether the cells remained healthy. It would be helpful to provide cell images.

      Response: We harvested cells at different time points in Fig 2C-E, and we have added the information in Figure legends. Cells were not happy ader JMJD6 KD or KO. We therefore harvest cells for Western blot at an early time point while stained cells for survival effect at a late time point.

      (8) Fig. 7f: Address the paradox where JMJD-knockdown cells grow slower (Fig. 2c-e), but these JMJD-KO4E5 cells grow at a similar rate compared to SKNAS-WT in the DMSO treatment group. Clarify whether this aligns with the results observed with shRNA results shown in Fig. 2c-e.

      Response: The JMJD6 KO cells grew much slower than the wild-type cells. In these experiments, we intentionally seeded a lot more cells for JMJD6 KO clone so that we can have a comparable comparison for the cells with DMSO treatment.

      Minor concerns:

      (1) Fig. 2c: Please specify the time point for Fig. 2c to provide a clearer context for readers.

      We have added the information.

      (2) In Line 204, it is stated that 'Supplementary Table 3,' which describes the 'Correlation of JMJD6 KO and its co-dependency genes,' can actually be found in 'Supplementary Table 4.' Please clarify this discrepancy.

      We apologize for this. We probably accidentally uploaded the duplicates. We have uploaded the new table in our revision.

      (3) Line 207: The order of figures should be clarified. Fig. 3c should be mentioned before Fig. 3b in the text.

      Yes, we did.

      (4) In Line 216, it is mentioned that 'Supplementary Table 4,' which describes 'Differentially expressed genes by JMJD6 KD,' can actually be found in 'Supplementary Table 3.' Please provide clarification for this discrepancy.

      We have corrected this.

      (5) Line 244-247: Please provide clarification of this section to ensure readers can fully understand your point.

      We have rephrased the sentence.

      (6) Line 1048: Confirm whether Fig. 2c represents siRNA or shRNA, as the label in the graph does not match the figure legends.

      Sorry for this. We have corrected.

      (7) Line 1161: Provide clarification regarding the use of Image J from k, and in Line 1162, specify the source of Image J from l.

      We apologized for the confusion of our description. We meant “Image J” sodware. We have corrected in Figure legend.

      Reviewer #2 (Recommendations For The Authors):

      Suggestions to authors:

      Line 39 - suggest introducing JMJD6.

      Response: We have added the full name of JMJD6.

      Line 47 - suggest slightly rephrasing 'metabolic program that is coupled with...'.

      We have made a slight change by changing “coupled” to “associate”.

      Line 85 - please delete/replace 'exceptional'; proofread for inadequate use of ambiguous wording.

      We have changed it as “significant”.

      Line 141 - please concisely define 'high risk'.

      We have defined it with a citation (line 142-146).

      Line 143 - please concisely define 'event free'.

      We have defined the event free and overall survival precisely (line 149, 150).

      Line 153 - provide an adequate citation for 'cBioportal'.

      We have added the citation (line166).

      Line 161 - please state the utilized cell lines.

      We have referenced to Materials and Methods (line 175).

      Line 166 - please note that 'morphological changes' of a cell do not suffice to determine 'stemness', please rephrase.

      We agreed and changed it to “regulate cellular differentiation” (line 181).

      Line 182 - provide a quantifiable measure for color change and or remove observation from the narrative.

      We have removed “indicative of acidic pH change” (line 198).

      Line 185 - the statement commencing with 'It is believed...' requires referencing.

      We have added references (line 200).

      Line 187 - please provide an adequate citation for the 'JoMa1' neural crest-derived cells (J. Maurer and colleagues?).

      We have added the reference (line 201).

      Line 203 - please provide an adequate citation for 'DepMap'.

      There is no citation specifically for DepMap and that’s why we can only provide the DepMap link.

      Line 234 - please provide an adequate citation for 'two algorithms'.

      We have provided the reference (line 265).

      Line 265 - please provide a rationale for the choice of the three tested cell lines.

      We have added definition by saying C-MYC overexpressed SKNAS, BE2C and SIMA with MYCN amplification (line 302, 303).

      Line 279 - suggest rephrasing 'gaining more ATPs'.

      We have removed these words as we do not have direct evidence to show ATP production (line 320).

      Line 342 - suggest rephrasing 'are in the only gene signature'.

      We have rephrased by saying “lysine demethylase (HDM) genes, including JMJD6, are present in the most significantly enriched gene signature in indisulam-sensitive cells” (line 416-416).

      Line 424 - please state the source or all cell lines (commercial provider?).

      We have added the source of cell lines.

      Lines 438 to 442 - are STR and mycoplasma profiling data adequately presented in the manuscript?

      We routinely test STR and mycoplasma for all cell lines cultured in hood in our Department every month.

      Lines 520 onwards - is the JMJD6 knockout generation data (e.g., cell viability upon knockout) adequately presented in the manuscript? Why does the study depend on transient transfection of siRNAs for obtaining mechanistic results?

      We created stable JMJD6 KO clones by selecting single cell with complete knockout. Cells are not happy ader KO. siRNA knockdown is a method for relatively acute depletion of JMJD6, which is easy and fast, and may be more reliable to assess the direct effect of JMJD6.

      Figures: please provide adequate axis-labeling for all graphs (e.g., FIg2 b, and e).

      We have added the axis labeling.

      Discussion line 370 - what is meant by 'too harsh' - please use unambiguous phrasing to highlight limitations.

      We have changed to “stringent”.

      Please provide a study limitation paragraph.

      We have added one limitation paragraph.

      Limitation of the study

      Our study focused on the understanding of JMJD6 function in neuroblastoma cell lines. In the future, we will consolidate our study by expanding our models to patient-derived xenograds, organoids, and neuroblastoma genetic models, in comparison with non-cancerous cells. Although we have identified a conserved interactome of JMJD6 in neuroblastoma cells, it remains to be determined whether it is neuroblastoma-specific and essential to MYC-driven cancers. The genome-wide RNA binding by JMJD6 in cancer cells and normal cells coupled with isotope labeling to dissect the metabolic effect of JMJD6 will enhance our understanding of the biological functions of JMJD6, awaiting future studies. Inability to target the enhanced pre-mRNA splicing of metabolic genes in MYC-driven cancer cells by pharmacologic inhibition of JMJD6 is another limitation, due to lack of selective and potent JMJD6 inhibitors.

      Additional editing and proof-reading of the manuscript's narrative, figures, legends, and methods is highly recommended.

      We have gone through the whole MS to have proof-reading.

    1. Author Response

      The following is the authors’ response to the original reviews.

      eLife assessment

      This study presents a valuable finding on the distinct subpopulation of adipocytes during brown-to-white conversion in perirenal adipose tissue (PRAT) at different ages. The evidence supporting the claims of the authors is convincing, although specific lineage tracing of this subpopulation of cells and mechanistic studies would expand the work. The work will be of interest to scientists working on adipose and kidney biology.

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary:

      In this manuscript, the authors performed single nucleus RNA-seq for perirenal adipose tissue (PRAT) at different ages. They concluded a distinct subpopulation of adipocytes arises through brown-to-white conversion and can convert to a thermogenic phenotype upon cold exposure.

      Strengths:

      PRAT adipose tissue has been reported as an adipose tissue that undergoes browning. This study confirms that brown-to-white and white-to-beige conversions also exist in PRAT, as previously reported in the subcutaneous adipose tissue.

      Response: We thank the reviewer for summarizing the strengths of our manuscript. However, we would like to clarify two points here. First, PRAT has been reported as a visceral adipose depot that contains brown adipocytes and a process of continuous replacement of brown adipocytes by white adipocytes has been previously suggested based on histological assessment. There is no evidence that PRAT undergoes browning, unless cold exposure is involved. Second, unlike the brown-to-white conversion, white-to-beige conversion in PRAT was not observed under normal conditions. The adipocyte population that arises from brown-to-white conversion (mPRAT-ad2) can respond to cold and restore their UCP1 expression. However, the adipocytes that arise from the mPRAT-ad2 subpopulation after cold exposure have a distinct transcriptome to that of cold-induced beige adipocyte in iWAT (Figure S7K) and are more related to iBAT brown adipocytes (Figure 6E). Therefore, it is more of a white-to-brown conversion in PRAT upon cold exposure rather than white-to-beige conversion and the underlying mechanism is likely different from the white-to-beige conversion in the subcutaneous adipose tissue.

      Weaknesses:

      (1) There is overall a disconnection between single nucleus RNA-seq data and the lineage chasing data. No specific markers of this population have been validated by staining.

      Response: We are not sure what “this population” refers to. We assume that it is the Ucp1-&Cidea+ mPRAT-ad2 adipocyte subpopulation. If so, we did not identify specific markers for these adipocytes as shown in Figure 1H and statements in the Discussion section. mPRAT-ad2 is negative for Ucp1 and Cyp2e1, which are markers for mPRAT-ad1 and mPRAT-ad3&4, respectively. To visualize the mPRAT-ad2 adipocytes on tissue sections, we collected pvPRAT and puPRAT of Ucp1CreERT2;Ai14 mice one day after tamoxifen injection and stained with CYP2E1 antibody and BODIPY. The Tomato-&CYP2E1-&BODIPY+ cells represent the mPRAT-ad2 adipocytes. Based on such strategy, we revealed a significantly higher percentage of mPRAT-ad2 cells in puPRAT than pvPRAT (presented as Figure S3E in the revised manuscript).

      (2) It would be nice to provide more evidence to support the conclusion shown in lines 243 to 245 "These results indicated that new BAs induced by cold exposure were mainly derived from UCP1- adipocytes rather than de novo ASPC differentiation in puPRAT". Pdgfra-negative progenitor cells may also contribute to these new beige adipocytes.

      Response: We stained pvPRAT and puPRAT of the PdgfraCre;Ai14 mice with the adipocyte marker Plin1 and observed a 100% overlap between the tdTomato signal and the Plin1 staining, after examining a total of 832 and 628 adipocytes in pvPRAT and puPRAT of two animals (Figure S4). Plin1 stains all adipocytes, while the endogenous tdTomato labels both the adipocytes and blood vessels. This result suggests that all adipocytes in mPRAT are derived from Pdgfra-expressing cells, which is in line with a previous study that integrated several single-cell RNA sequencing data sets and showed that Pdgfra is expressed by virtually all ASPCs (Ferrero et al., 2020).

      Also, we would like to point out that the cold-induced adipocytes in mPRAT resemble more to the brown adipocytes of iBAT than the beige adipocytes of iWAT (Figure 6E and S7K).

      Ferrero, R., Rainer, P., and Deplancke, B. (2020). Toward a Consensus View of Mammalian Adipocyte Stem and Progenitor Cell Heterogeneity. Trends Cell Biol 30, 937-950.

      (3) The UCP1Cre-ERT2; Ai14 system should be validated by showing Tomato and UCP1 co-staining right after the Tamoxifen treatment.

      Response: We collected pvPRAT and puPRAT of 1- and 6-month-old Ucp1CreERT2;Ai14 mice one day after the last tamoxifen injection and stained with UCP1 antibody to check the overlap between the Tomato and UCP1signal. All Tomato+ cells were UCP1+, indicating 100% specificity of the Ucp1CreERT2; and the labelling efficiency was over 93% at both time points for both regions (Figure S3C-D).

      Reviewer #2 (Public Review):

      Summary:

      In the present manuscript, Zhang et al utilize single-nuclei RNA-Seq to investigate the heterogeneity of perirenal adipose tissue. The perirenal depot is interesting because it contains both brown and white adipocytes, a subset of which undergo functional "whitening" during early development. While adipocyte thermogenic transdifferentiation has been previously reported, there remain many unanswered questions regarding this phenomenon and the mechanisms by which it is regulated.

      Strengths:

      The combination of UCP1-lineage tracing with the single nuclei analysis allowed the authors to identify four populations of adipocytes with differing thermogenic potential, including a "whitened" adipocyte (mPRAT-ad2) that retains the capacity to rapidly revert to a brown phenotype upon cold exposure. They also identify two populations of white adipocytes that do not undergo browning with acute cold exposure.

      Anatomically distinct adipose depots display interesting functional differences, and this work contributes to our understanding of one of the few brown depots present in humans.

      Weaknesses:

      The most interesting aspect of this work is the identification of a highly plastic mature adipocyte population with the capacity to switch between a white and brown phenotype. The authors attempt to identify the transcriptional signature of this ad2 subpopulation, however, the limited sequencing depth of single nuclei somewhat lessens the impact of these findings. Furthermore, the lack of any form of mechanistic investigation into the regulation of mPRAT whitening limits the utility of this manuscript. However, the combination of well-executed lineage tracing with comprehensive cross-depot single-nuclei presented in this manuscript could still serve as a useful reference for the field.

      Response: The sequencing depth of our data is comparable, if not better than previously published snRNA-seq studies on adipose tissue (Burl et al., 2022; Sarvari et al., 2021; Sun et al., 2020). Therefore, the depth of our data has reached the limit of the 3’ sequencing methods. Unfortunately, due to size limitation of the adipocytes, it is challenging to sort them for Smart-seq. We suspect that lack of specific markers for mPRAT-ad2 is partly due to its intermediate and plastic phenotype. Regarding the mechanistic regulation of mPRAT whitening, we believe that it is more suitable to leave such investigations for a separate follow-up and more in-depth study.

      Burl, R.B., Rondini, E.A., Wei, H., Pique-Regi, R., and Granneman, J.G. (2022). Deconstructing cold-induced brown adipocyte neogenesis in mice. Elife 11. 10.7554/eLife.80167.

      Sarvari, A.K., Van Hauwaert, E.L., Markussen, L.K., Gammelmark, E., Marcher, A.B., Ebbesen, M.F., Nielsen, R., Brewer, J.R., Madsen, J.G.S., and Mandrup, S. (2021). Plasticity of Epididymal Adipose Tissue in Response to Diet-Induced Obesity at Single-Nucleus Resolution. Cell Metab 33, 437-453 e435. 10.1016/j.cmet.2020.12.004.

      Sun, W., Dong, H., Balaz, M., Slyper, M., Drokhlyansky, E., Colleluori, G., Giordano, A., Kovanicova, Z., Stefanicka, P., Balazova, L., et al. (2020). snRNA-seq reveals a subpopulation of adipocytes that regulates thermogenesis. Nature 587, 98-102. 10.1038/s41586-020-2856-x.

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors):

      (1) There is overall a disconnection between single nucleus RNA-seq data and the lineage chasing data. No specific markers of this population have been validated by staining.

      (2) It would be nice to provide more evidence to support the conclusion shown in lines 243 to 245: "These results indicated that new BAs induced by cold exposure were mainly derived from UCP1- adipocytes rather than de novo ASPC differentiation in puPRAT". Pdgfra-negative progenitor cells may also contribute to these new beige adipocytes.

      (3) The UCP1Cre-ERT2; Ai14 system should be validated by showing Tomato and UCP1 co-staining right after the Tamoxifen treatment.

      Please see above for the responses.

      Reviewer #2 (Recommendations For The Authors):

      • Without specific lineage tracing it is not possible to conclude that the mPRAT-ad2 population converted to beige with CE. The authors should change this wording from "likely" to "possible".

      Response: We have changed the word “likely” to “possible” in the text. Also, we would like to point out that the cold-induced adipocytes in mPRAT resemble more to the brown adipocytes of iBAT than the beige adipocytes of iWAT (Figure 6E and S7K).

      • The sentence "precursor cells may be less sensitive to environmental temperature and have a limited contribution to mature adipocyte phenotypes through de novo adipogenesis after cold exposure." and others like it should be changed to indicate the acute timeframe of this experiment. It has been shown that the precursors make a more significant contribution to de novo beige adipogenesis with chronic cold exposure.

      Response: We have modified the sentence as follows: “precursor cells may be less sensitive to acute environmental temperature drop and have a limited contribution to mature adipocyte phenotypes through de novo adipogenesis after cold exposure”. As mentioned above, the cold-induced adipocytes in mPRAT resemble more to the brown adipocytes of iBAT and therefore may have a different mechanism to the de novo beige adipogenesis with chronic cold exposure.

    1. 208-00002 Erosion Log Type 1 (12 Inch) LF 3316 6.50$ 21,554.00$15 208-00045 Concrete Washout Structure EACH 1 4,150.00$ 4,150.00$16 208-00056 Storm Drain Inlet Protection (Type III) EACH 1 375.00$ 375.00$18 210-00810 Reset Ground Sign EACH 10 470.00$ 4,700.00$19 210-00858 Reset Pedestal Pole EACH 2 2,750.00$ 5,500.00$20 212-00707 Seeding (Native) Hydraulic ACRE 0.30 5,000.00$ 1,500.00$21 213-00004 Mulching (Weed Free Straw) ACRE 0.30 1,100.00$ 330.00$22 213-00061 Mulch Tackifier LB 30 5.00$ 150.00$23 304-06000 Aggregate Base Course (Class 6) TON 576 80.00$ 46,080.00$24 514-00200 Pedestrian Railing (Steel) LF 415 40.00$ 16,600.00$27 603-05018 18 Inch Reinforced Concrete Pipe End Section EACH 1 3,000.00$ 3,000.00$28 603-01180 18 Inch Reinforced Concrete Pipe LF 285 200.00$ 57,000.00$29 604-19105 Inlet Type R L 5 (5 Foot) EACH 2 8,500.00$ 17,000.00$30 608-00000 Concrete Sidewalk SY 2437 100.00$ 243,700.00$31 608-00010 Concrete Curb Ramp SY 153 175.00$ 26,775.00$32 608-00015 Detectable Warnings SF 288 50.00$ 14,400.00$33 609-21020 Curb and Gutter Type 2 (Section II-B) LF 260 50.00$ 13,000.00$34 609-24004 Gutter Type 2 (4 Foot) LF 110 75.00$ 8,250.00

      Material

    Annotators

    1. Author Response

      Note to the editor and reviewers.

      All the authors would like to thank the editorial team and the two anonymous reviewers for their efforts and thoughtfulness in assessing our manuscript. We very much appreciate it and we all believe that the manuscript has been much improved in addressing the comments and suggestions made.

      General considerations on the revised manuscript

      We have applied extensive modifications to the manuscript with our main goal being the improvement of clarity. The Introduction has been changed mainly to introduce precisely our terminology and we have stuck to it in the rest of the manuscript. The Results section has been divided up into more defined sections. The discussion has been extensively re-written to improve clarity, following the suggestion of the reviewers. Main figures 1 and 4 have been modified with clearer schematics. Supplementary figures and legends have been modified and several supplementary schematic figures have been added to clearly present our interpretations for various data. We have added a Supplementary Discussion where the most detailed technical parts of our discussion are presented to avoid unnecessarily weighing down the main discussion, where our main conclusions are outlined. We have presented our mass photometry mixing experiment in a new supplementary figure, with detailed explanation. We have also expanded our discussion of in vivo and general relevance of our study.

      Response to manuscript evaluation

      Our manuscript has been evaluated as a valuable study and presenting solid experimental evidence. We appreciate the recognition of our work.

      Two weaknesses were identified by reviewers: 1) our experiments do not completely exclude the possibility of an alternative nucleophile. This relates to the evaluation of our experimental evidence. 2) Our study does not address the in vivo relevance of the interface swapping phenomenon, which relate to the value of the study for the community.

      Response to the evaluation of experimental evidence (Weakness #1):

      We argued in the original manuscript that we have excluded completely the presence of an alternative nucleophile. This conclusion is based on a series of experiments which were presented in the originally submitted manuscript. These experiments are not discussed by the reviewers in relation to this main conclusion and therefore we suggest that they have not been properly evaluated. We believe our conclusion to be appropriately supported by these data (see our response to reviewer #1). In addition, the criticism of our gel-filtration data by reviewer #2 was based on a misinterpretation of Supplementary figure 1 b. We accept of course that the way the data was presented could be misleading and we assume responsibility for this. We have attempted to correct this by changing the main text and the figures legends and annotation. In conclusion, we believe that the evaluation of experimental evidence as presented in the revised manuscript could be upgraded to “convincing”.

      Response to our study general relevance evaluation (weakness #2):

      We agree with both reviewers about the in vivo relevance of our observation being an important question, not addressed so far. Indeed, the value of our study would be greatly increased by in vivo data and be of interest to a wider audience. However, we would like to argue that our study would interest a wider audience than initially stated for the following reasons: 1) Our study is the first evidence of interface swapping in vitro and will constitute a base to investigate this phenomenon both in vivo and in vitro. It will therefore interest a wide audience due to the potential involvement of interface swapping in a wide range of processes, such as recombination, evolution, and drug targeting (see also below). 2) DNA cleavage is the central mode of action of antibiotics targeting bacterial type II topoisomerases (i.e. topoisomerases “poisons”). This already established target is one of the few having produced new scaffolds and too few new antibacterial are in production to fulfill medical needs. The role of interface stability is also emerging as a modulator of the efficiency of topoisomerase poisons. See for instance (Germe, Voros et al. 2018, Bandak, Blower et al. 2023). By shedding light on interface dynamics, our study will be of interest to scientist interested in the development of these drugs. In addition, the heterodimer system can potentially produce detailed mechanistic information (Gubaev, Weidlich et al. 2016, Hartmann, Gubaev et al. 2017, Stelljes, Weidlich et al. 2018) not only on gyrase but also on other, dimeric type II topoisomerases or even other dimeric enzyme in general. We have amended the manuscript to make these points clearer. Therefore, we believe that the evaluation of the revised manuscript’s relevance could be upgraded to “important”.

      Point-by-point response to the reviewer

      Reviewer #1 (Public Review):

      Germe and colleagues have investigated the mode of action of bacterial DNA gyrase, a tetrameric GyrA2GyrB2 complex that catalyses ATP-dependent DNA supercoiling. The accepted mechanism is that the enzyme passes a DNA segment through a reversible double-stranded DNA break formed by two catalytic Tyr residues-one from each GyrA subunit. The present study sought to understand an intriguing earlier observation that gyrase with a single catalytic tyrosine that cleaves a single strand of DNA, nonetheless has DNA supercoiling activity, a finding that led to the suggestion that gyrase acts via a nicking closing mechanism. Germe et al used bacterial co-expression to make the wild-type and mutant heterodimeric BA(fused). A complexes with only one catalytic tyrosine. Whether the Tyr mutation was on the A side or BA fusion side, both complexes plus GyrB reconstituted fluoroquinolone-stabilized double-stranded DNA cleavage and DNA supercoiling. This indicates that the preparations of these complexes sustain double strand DNA passage. Of possible explanations, contamination of heterodimeric complexes or GyrB with GyrA dimers was ruled out by the meticulous prior analysis of the proteins on native Page gels, by analytical gel filtration and by mass photometry. Involvement of an alternative nucleophile on the Tyr-mutated protein was ruled unlikely by mutagenesis studies focused on the catalytic ArgTyrThr triad of residues. Instead, results of the present study favour a third explanation wherein double-strand DNA breakage arises as a consequence of subunit (or interface/domain) exchange. The authors showed that although subunits in the GyrA dimer were thought to be tightly associated, addition of GyrB to heterodimers with one catalytic tyrosine stimulates rapid DNA-dependent subunit or interface exchange to generate complexes with two catalytic tyrosines capable of double-stranded DNA breakage. Subunit exchange between complexes is facilitated by DNA bending and wrapping by gyrase, by the ability of both GyrA and GyrB to form higher order aggregates and by dense packing of gyrase complexes on DNA. By addressing a puzzling paradox, this study provides support for the accepted double strand break (strand passage) mechanism of gyrase and opens new insights on subunit exchange that may have biological significance in promoting DNA recombination and genome evolution.

      The conclusions of the work are mostly well supported by the experimental data.

      Strengths:

      The study examines a fundamental biological question, namely the mechanism of DNA gyrase, an essential and ubiquitous enzyme in bacteria, and the target of fluoroquinolone antimicrobial agents.

      The experiments have been carefully done and the analysis of their outcomes is comprehensive, thoughtful and considered.

      The work uses an array of complementary techniques to characterize preparations of GyrA, GyrB and various gyrase complexes. In this regard, mass photometry seems particularly useful. Analysis reveals that purified GyrA and GyrB can each form multimeric complexes and highlights the complexities involved in investigating the gyrase system.

      The various possible explanations for the double-strand DNA breakage by gyrase heterodimers with a single catalytic tyrosine are considered and addressed by appropriate experiments.

      The study highlights the potential biological importance of interactions between gyrase complexes through domain-or subunit-exchange

      We thank the reviewer for their support, effort, and comments. The above is a great summary.

      Weaknesses:

      The mutagenesis experiments described do not fully eliminate the perhaps unlikely participation of an alternative nucleophile.

      We agree that the mutagenesis experiment on its own does not fully eliminate the possibility of an alternative nucleophile. The number of residues mutated is limited, and therefore it is possible we have missed a putative alternative nucleophile.

      However, we have other data and experiments supporting the conclusion that no alternative nucleophile exists. Therefore, we want to stress that our conclusion that no such alternative exist is based on these extra data. These data and experiments are not discussed by either reviewer despite being present in the original manuscript. This puzzled us and we have modified the manuscript and the figures in the hope that they, and their significance, would not be missed.

      Briefly:

      1) We have performed cleavage-based labeling of the nucleophile responsible for cleavage. This experiment is depicted in Figure 4. The nucleophilic activity of the residue involved results in covalent link between the polypeptide (that includes the residue) and radiolabeled DNA. Therefore, a polypeptide that includes an active nucleophile will be radiolabeled and visible, whereas a polypeptide that is missing an active nucleophile will remain unlabeled and invisible. We can distinguish the BA and the A polypeptide from their size. In the case of the BA.A complex both the BA polypetide and the A polypetide are radiolabeled and therefore both have an active nucleophile. In the case of the BAF.A complex, the unmutated A polypeptide is labeled, meaning that a nucleophile is still active. In contrast, the BAF polypeptide shows no detectable labeling. This result means that removing the hydroxyl group from the catalytic tyrosine abolishes any protein-DNA covalent link, suggesting that no other nucleophile from the BA polypetidic chain can substitute for the catalytic tyrosine hydroxyl group. This experiment excludes the possibility of an alternative nucleophile coming from the polypeptidic chain of either GyrA or GyrB. This experiment, described in figure 4, is not discussed by the reviewer. This experiment is similar in principle to early experiments identifying catalytic tyrosine in topoisomerases. See for instance, (Shuman, Kane et al. 1989).

      2) The experiment above does not exclude a nucleophile coming from the solvent. To exclude this possibility, we have used T5 exonuclease (which needs a free 5’ DNA end to digest) and ExoIII (which need a free 3’ DNA end to digest). We have shown the reconstituted cleavage is not sensitive to T5 and sensitive to ExoIII. This shows that the 5’ end of the cleaved sites are protected by a bulky polypeptide impairing T5 activity, which is active in our reaction as shown by the digestion of a control DNA fragment. This experiment shows that the reconstituted cleavage is very unlikely to come from a small nucleotide potentially provided by the solvent. This experiment is described in the main text and the results are shown in supplementary figure 5. It is not mentioned by either reviewer.

      3) Finally, we would like to emphasize our experiment comparing the BAF.A59 to BALLL.A59. The BALLL.A59 complex displays increased cleavage compared to BAF.A59. If this increased cleavage was due to an alternative nucleophile on the BALLL side, we would expect an accompanying increase in supercoiling activity since the BALLL.A59 possesses one CTD, which is sufficient for supercoiling. The fact that no increased supercoiling activity is observed strongly suggests subunit exchange reconstituting an A59 dimer, inactive for supercoiling but active for cleavage. We believe this somewhat complex observation to be quite significant and we have attempted to clarify the manuscript and discuss its full significance in several places.

      Reviewer #1 (Recommendations For The Authors):

      An interesting paper on DNA gyrase that explains a puzzling paradox in terms of the double-strand break mechanism.

      Major points

      1) The authors consider several mechanisms that could potentially explain their data. On page 15, the authors present the evidence against the nicking closing mechanism proposed by Gubaev et al. Throughout the manuscript, they indicate where their experimental results agree with this earlier work but should also indicate and account for differences. For example, Gubaev et al describe cross linking experiments that they claim rule out subunit exchange. These aspects should be clearly explained.

      Thank you for the suggestion. We have re-written the discussion to address this point. We are extensively discussing experiments from (Gubaev, Weidlich et al. 2016), and offer our interpretation of apparently conflicting results. We suggest that their experiments are basically consistent with our data when correctly interpreted. To keep the main manuscript clear, we have added a supplementary discussion where experiments from (Gubaev, Weidlich et al. 2016) are discussed further in relation to our data.

      2) Page 9. The experiments done to rule out the perhaps unlikely alternative nucleophile hypothesis relate to the possible role of the Arg and Threonine of the RYT triad. These residues are close to the DNA and therefore are prime candidates and attractive targets for mutagenesis. However, strictly speaking, the mutant enzyme data presented do not rule all possibilities. For example, Serine is often the nucleophile used by resolvases to effect DNA recombination via subunit exchange. The ideal experiment to rule out/rule in other nucleophiles would be to identify the residue(s) that become attached to DNA in the cleavage reaction.

      Please see above. We have effectively ruled an alternative nucleophile with our cleavage-based labeling experiment and others that were present and discussed in the original manuscript but were missed. We have modified the manuscript and figures in order to make this point clearer than before.

      3) p17. The readout for subunit exchange used by the authors is double-stranded DNA cleavage. Attempts to directly detect the formation of the DNA cleaving complexes GyrA2B2 and (GyrBA)2 (arising from subunit exchange between heterodimers) by mass photometry were not successful. Perhaps FRET would have been another approach to try as it could also detect interface and domain interchanges.

      Directly detecting interface exchange directly by proximity experiment would be extremely useful. FRET would have to be done in the BAF.A + GyrB configuration where the amount of interface exchange is important. Now, we do not have the tools to do that and developing them would be outside the scope of the study. We propose cross linking experiment to be done in the future. We argue that the manuscript is convincing without these for now. This will be addressed in the future. This point, and other possible future experiments are now discussed in the discussion section.

      4) The underlying canvas of this paper is the strand passage mechanism of gyrase. It would seem appropriate to include the papers first proposing it - Brown P.O and Cozzarelli N.R. (1979) and Mizuuchi K et al (1980).

      We very much agree. These papers have now been added in the introduction as appropriate, highlighting the relationship between double-strand cleavage and the strand-passage mechanism.

      5) Figure 1. The quality of the insets is poor. It is difficult to pick out the key catalytic residues and their disposition vis-a-vis DNA.

      We agree, Figure 1 has been re-done and the schematic theme has been harmonized throughout the whole manuscript. We very much hope that clarity has improved. Thank you for the suggestion.

      6) The experimental work is a very detailed analysis of a specific feature of engineered gyrase heterodimers. Making the work accessible to the general reader will be important. Using shorter paragraphs each with a specific theme might help. In particular, the second paragraph of the Results on p7, the section on p9 and bottom of p11, p13 and the first paragraph of the Discussion on p14 are each a page or more long. A shorter manuscript that avoids overinterpretation of the smaller details would also help.

      We agree. We have now split long paragraphs into individual sections, with titles, in the Results. This structure is recapitulated at the beginning of the discussion, and we have split the discussion into shorter paragraphs, each with a unique point being made.

      7) The impact of the Gubaev et al (2016) paper for the field in general, and as the catalyst for the present work should be better documented. Mention of this earlier paper and its significance at the beginning of the Abstract and elsewhere e.g in the Introduction might also help with a more logical organization of the current findings and result in a shorter paper (which would be easier to read).

      We have added a reference to (Gubaev, Weidlich et al. 2016) in the abstract and have expanded our introduction

      Minor points

      1) Legends for Figs 2 and 6; Supplementary Figs 1 and 8. The designation of subfigures as a, b, c, d , e etc appears to be incorrect. Check throughout and in the text.

      The manuscript has been checked for such errors.

      2) Figure 2, and first paragraph p8. Peaks in Fig 2c should be labelled to facilitate discussion on p8.

      Agreed, this has been done.

      3) Supplementary Fig 4 and elsewhere in the manuscript. A variety of notations are used to denote phenylalanine mutants e.g. AsubscriptF, AsuperscriptF and AF. Check and use one format throughout.

      Done

      4) Figures showing gels include the label '+EtBr, +cipro'. This is somewhat confusing because EtBr was contained in the gel (not the samples) whereas cipro was included in the reaction. Modify or describe in the legend..

      We have re-written the figure legend.

      5) Supplementary Fig 4b describes a small effect on the ratio of linear to nicked DNA for the triple LLL mutant. Is this significant? How many times was the measurement made?

      This has been addressed in the original manuscript in the supplementary data. In term of quantification, the experiment has been done 3 times for each prep, with the same GyrB prep and concentration. The standard error is displayed on the figure. This result is very reproducible and have been reproduced more than 3 times. No LLL cleavage assay showed more single-strand than double-strand cleavage. For the phenylalanine mutant, no cleavage assay showed more double-strand than single-strand cleavage.

      6) Supplementary Fig 5 legend. Should 'L' read 'size markers' (and give their sizes)?

      Yes indeed, we have modified the figure to clarify.

      7) p11 line 5. Is this statement correct?

      Yes, it is correct. Although we hope we are on the same line. When the Tyrosine is mutated on one side only of the heterodimer, both single- and double-strand cleavage are protected from T5 exonuclease digestion.

      8) 12 last line should read...and supercoiling activity (not shown)..were

      Thank you, done.

      There are a number of typos throughout the text, for example:

      Page 3 line..Difficult to conclude...what?

      Page 3 para 3...Lopez....and Blazquez

      We have corrected these typos and checked the whole manuscript.

      Reviewer #2 (Public Review):

      DNA gyrase is an essential enzyme in bacteria that regulates DNA topology and has the unique property to introduce negative supercoils into DNA. This enzyme contains 2 subunits GyrA and GyrB, which forms an A2B2 heterotetramer that associates with DNA and hydrolyzes ATP. The molecular structure of the A2B2 assembly is composed of 3 dimeric interfaces, called gates, which allow the cleavage and transport of DNA double stranded molecules through the gates, in order to perform DNA topology simplification. The article by Germe et al. questions the existence and possible mechanism for subunit exchange in the bacterial DNA gyrase complex.

      The complexes are purified as a dimer of GyrA and a fusion of GyrB and GyrA (GyrBA), encoded by different plasmids, to allow the introduction of targeted mutations on one side only of the complex. The conclusion drawn by the authors is that subunit exchange does happen, favored by DNA binding and wrapping. They propose that the accumulation of gyrase in higher-order oligomers can favor rapid subunit exchange between two active gyrase complexes brought into proximity.

      The authors are also debating the conclusions of a previous article by Gubaev, Weidlich et al 2016 (https://doi.org/10.1093/nar/gkw740). Gubaev et al. originally used this strategy of complex reconstitution to propose a nicking-closing mechanism for the introduction of negative supercoils by DNA gyrase, an alternative mechanism that precludes DNA strand passage, previously established in the field. Germe et al. incriminate in this earlier study the potential subunit swapping of the recombinant protein with the endogenous enzyme, that would be responsible for the detected negative supercoiling activity.

      Accordingly, the authors also conclude that they cannot completely exclude the presence of endogenous subunits in their samples as well.

      Strengths

      The mix of gyrase subunits is plausible, this mechanism has been suggested by Ideka et al, 2004 and also for the human Top2 isoforms with the formation of Top2a/Top2b hybrids being identified in HeLa cells (doi: 10.1073/pnas.93.16.8288).

      Germe et al have used extensive and solid biochemical experiments, together with thorough experimental controls, involving :

      • the purification of gyrase subunits including mutants with domain deletion, subunit fusion or point mutations.

      • DNA relaxation, cleavage and supercoiling assays

      • biophysical characterization in solution (size exclusion chromatography, mass photometry, mass spectrometry)

      Together the combination of experimental approaches provides solid evidence for subunit swapping in gyrase in vitro, despite the technical limitations of standard biochemistry applied to such a complex macromolecule.

      We thank the reviewer for their supportive and considered comments.

      Weaknesses

      The conclusions of this study could be strengthened by in vivo data to identify subunit swapping in the bacteria, as proposed by Ideka et al, 2004. Indeed, if shown in vivo, together with this biochemical evidence, this mechanism could have a substantial impact on our understanding of bacterial physiology and resistance to drugs.

      Thank you for this comment. Indeed, whether this interface exchange can happen in vivo and lead to recombination is a very important question. However, we believe that this is outside the scope of this study simply because of the amount of work one can fit into one paper. Proving that interface exchange can happen in vitro has already necessitated a number of non-trivial experiments and likewise investigating interface exchange in vivo will require a careful, long-term study (see our reply to reviewer #2 comment, who also raised this point). We can’t address it with one additional experiment with the tools we have. However, we very much hope to do it in the future.

      Reviewer #2 (Recommendations For The Authors):

      Specific questions and comments for the authors:

      1) Complex identification during purification

      The statement line 236-237 that "Our heterodimer preparation showed a single-peak on a gel-filtration column, distinct from the GyrA dimer peak" is not entirely clear. In Fig supp 1 b, how can the authors conclude from the superose 6 that GyrBA is separated from the GyrA dimer? Since they seem close in size 160/180kDa, they are unlikely to be well separated in a superose 6 gel filtration column. The SDS-PAGE seems to show both species in the same fractions #15-17 therefore it would not be possible to distinguish GyrBA. A from A2.

      There appears to be some confusion about what Supp Fig. 1b shows. First, in all our gel filtration conditions both GyrBA and GyrA can’t exist as monomers at a significant concentration. Therefore, we can never observe the GyrBA monomer on a gel filtration column. Supp Fig. 1b shows the gel filtration profile of the BA.A heterodimer only. This is the output of the last, polishing step in the reaction. We analyze these results using SDS-PAGE. Therefore, the BA.A heterodimer will be denatured and separated into 2 polypeptides: GyrBA and GyrA, which migrates according to their size in an SDS-PAGE and forms two bands. These two bands do not represent two separate species in solution. They represent the separation of one species only, the BA.A heterodimer into its two, denatured, subunits: GyrA and GyrBA. We do not conclude from Supp Fig. 1 as a whole that GyrBA and the GyrA dimer are well separated, and this is not stated in the manuscript. We conclude that the BA.A dimer is fairly well separated from the GyrA dimer. They have significant different size (~260 kDa and ~180 kDa respectively) and form different peaks on a gel filtration column. The BA.A heterodimer has a GyrA subunit and therefore will shows a GyrA band on an SDS-PAGE, like the GyrA dimers but the two are obviously distinct in their quaternary structure. We are hoping that our new schematics and re-write of some of the results and figure legends will clarify this.

      Panel 6 shows a different elution volume for the 2 species BA.A and A2 on an analytical S200 column, which appears better at separating the complexes in this size range.

      Did the authors consider using a S200 column instead of superose 6 for the sample preparation, to optimize the separation of GyrBA. A from A2?

      This is not a necessarily true statement (see above). We have not run the GyrA dimer on a Superose 6 column. The analysis was done on an s200 because extensive data for the GyrA dimer was already available with this, already calibrated column. We do not expect the Superose 6 to be worse in this size range. In fact, it might even be better. The Superose 6 profile in Supp. Fig. 1b shows BA.A only and no GyrA dimer. We have clarified the annotations in the figure to make this clearer.

      Regarding the analytical gel filtration experiment, there is however an overlap in the elution volume in the analytical column, therefore how can the authors ensure there is no excess free A2 complex in the GyrBA. A sample?

      Indeed, there is an overlap, but we argue that it is overstated. The important part of the overlap is where the maximum height of the GyrA peak is positioned compared to the BA.A trace, not where the traces intersect. This overlap is minimal. If a contaminating GyrA peak was hidden in the BA.A peak, it would have to be at least 10 times less intense than the BA.A peak. Since BA.A and GyrA dimer have roughly the same extinction coefficient, this means that a contamination would detectable at 10 % or even less. Our mass photometry further excludes such contamination.

      Alternatively, the addition of a larger (cleavable) tag at the C-terminal end of the BA construct (therefore not disturbing dimer association) could allow to better distinguish the 2 populations already at the size exclusion step.

      This is true and could allow cleaner purification. There are also other ways to achieve cleaner purification, like adding a secondary tag. However, like we argue in the manuscript, our contaminations are already minimal. It is questionable what benefits could be gained in changing the protocol. We also argue that the tandem tag method does not completely exclude contamination (Supplementary Discussion) and therefore we are not sure if this would be worth the time and expenditure.

      2) GyrA and GyrB Oligomers:

      In the mass photometry experiment, the authors explain that the low concentration of the proteins promotes dissociation of GyrA dimers, hence the detection of GyrA monomers instead of GyrA dimers, which are also detected in the GyrBA.A sample.

      However, it cannot be concluded that the GyrA dimer is not formed in the condition of the gel filtration chromatography, at higher concentration.

      In our mass photometry experiment, The BA.A sample is not as diluted as the GyrA dimer and much closer to our experimental condition. Since we have calculated the dissociation constant, we can calculate the expected level of dissociation (or reassociation). The level of dissociation is minimal in these conditions. If some dissociation is expected from the BA.A heterodimers, a very low amount of GyrBA monomer should also be present and yet they are not observed. We presume that it is because mass photometry is much more sensitive to GyrA (see our mixing mass photometry experiment that we have added). If the GyrA would reassociate at higher concentration, it would do so either with itself (forming a GyrA dimer) or with the GyrBA monomer, reforming the heterodimer. Assuming both GyrA dimer and heterodimer have the same dissociation constant, roughly one third of the GyrA monomer would reassociate with themselves. Assuming even complete reassociation of the GyrA dimer, this would leave only GyrA dimer accounting for 2% of the prep.

      Another interpretation would be to assume that GyrBA monomers are not present at all and that GyrA monomer are reassociating only with themselves. This is not valid because of the following thermodynamic reason:

      Since the profile for the GyrA dimer are collected at equilibrium, we should expect a ratio between GyrA monomer and dimers that follow the dissociation constant. In other words, if the GyrA monomer were in equilibrium with GyrA dimer we should expect a much higher dimer concentration already as the GyrA monomers are not as dilute. We do not observe a GyrA dimer peak in the BA.A profile, even though we can detect a low amount of GyrA dimer mixed with BA.A. Therefore, we conclude that the observed GyrA monomer must be in equilibrium with another dimerization partner, which is most probably the GyrBA monomer (see above). Therefore, only a minimal amount of GyrA dimer is expected to be formed at higher concentration by direct reassociation. This could probably increase if we let this solution-based exchange carry on for a long time at dissociation equilibrium. We have actually shown that this solution-based exchange is very slow and take several days because of the low dissociation at equilibrium.

      The mass spectrometry analysis in Fig 2 confirms the presence of (monomeric) GyrA in the sample, despite different experimental conditions.

      The concentration of heterodimer in the mass spectrometry experiment is actually higher than in the mass photometry experiment. This shows that self-reassociation of the GyrA monomer as suggested above is undetectable with mass spectrometry at higher concentration.

      We considered that the “GyrA monomer” peak could be a contaminating GyrB monomer, which is ~90 kDa, which would explain the lack of reassociation. However, the mass spectrometry peak shows precisely the expected molecular weight of GyrA so we interpret this peak as arising from very limited dissociation of the BA.A heterodimer. The reassociation is limited at high concentration due simply to the fact that the difference in concentration between the mass photometry and our other experimental conditions is not that high. The GyrA dimer had to be diluted 400 times to see significant dissociation and yet even at this very low concentration the dissociation is far from complete.

      Our general conclusions on the couple of point above is that we cannot completely exclude the presence of GyrA dimers being present, although they are undetectable in our working conditions either by mass photometry (lower concentration), Mass spectrometry (higher concentration) and even gel filtration (even higher concentration, see above). For the mass photometry, we have established that our detection threshold for a contamination is very low (see our mixing experiment).

      Figure 2A: the authors state in the introduction that GyrB is a monomer in solution and then explain that the upper bands in the native gel are multimer of GyrB. Could the authors comment and provide the size exclusion profile of the Gyr B purification?

      We have expanded our discussion of this. However, we have not been successful in collecting a gel filtration profile for GyrB. This is likely due to excessive oligomerization at the concentration we are using for gel filtration. We suggest that our mass photometry and Blue-Native PAGE experiment shows clearly that GyrB can be detected as a monomer in solution at the appropriate dilution. However, GyrB tends to oligomerize in a regular fashion (Consider especially Supp Fig. 8a), which suggest that it could align heterodimers on DNA in a linear, regular orientation. We have added a discussion of this.

      Together the relevance of the oligomeric state of purified GyrA or GyrB should be clarified, relative to their role in subunit swapping.

      We have added explanation in our discussion, while also trying to not be too speculative. Basically, we believe that GyrB oligomerization is likely to be involved. It is difficult to conclude for GyrA since no experiment has allowed us to test it. Therefore, the role of GyrA oligomerization, if any, is unclear. The GyrA tetramer is very prominent though and forms very easily. GyrB on the contrary forms longer oligomers more readily than GyrA and we surmise that this would help interface exchange. However, the structure of these GyrA and GyrB oligomers is not clear, which make it difficult to go beyond speculation on this. It would be a very interesting experiment if we were able to suppress GyrB oligomerization whilst conserving its ability to promote strand-passage and cleavage. Same goes for GyrA. Unfortunately, we are unable to do that at this time.

      4) Subunit exchange

      Line 320: the concept of subunit exchange in this context should be clearly explained. If one understands correctly, the authors mean that the BAF polypeptide, part of the BAF.A complex, could be replaced by a combination of B+A therefore forming a fully functional WT A2B2 gyrase complex.

      Thank you for the suggestion. We have harmonized and clearly defined our terminology for interface swapping and subunit exchange in the introduction and attempted to be much more rigorous when referring to it.

      A great effort has been done in this study to explain all the pros and cons of the experimental design but the length of the explanations may prevent readers outside of the field to fully appreciate the conclusions. This article would benefit from the addition of a few schematics to summarize the working hypothesis.

      Thanks for the suggestion. We have added a series of schematics to illustrate our interpretation for each construct. As mentioned above the terminology has been more rigorously defined and updated throughout the manuscript.

      5) Presence of endogenous GyrA

      Line 419-425: it is quite difficult to follow the explanations regarding the possible contamination of the sample by endogenous GyrA.

      Maybe these points should rather be addressed in the discussion, when debating the conclusions of Gubaev et al.

      We agree. We have re-organized the Discussion doing just that. We added a Supplementary Discussion in which we further discuss the contamination problem in relation to (Gubaev, Weidlich et al. 2016).

      Production of the subunits in another (non bacterial) expression system or a cell free system may prevent the association of endogenous protein.

      Absolutely. We are planning on addressing this in the future, using the yeast expression system.

      6) Mechanism for subunit swapping

      Lines 588-595: As described by the authors the BA fusion shows decreased activity when compared with the WT probably due to limited conformational flexibility in absence of an additional linker sequence between the fused subunits.

      The affinity of BA for A may possibly be reduced compared to the free A2B2 complex, due to a relative stiffness of the fusion upon full association with a free B subunit, as rightfully pointed by the authors.

      If subunit exchange do happen in vitro, at least in the conditions of this study, the authors could assess the affinity of BA for A, when compared to the association of free B and A subunits

      Experiments using analytical ultracentrifugation or surface plasmon resonance (SPR) may allow to determine the relative affinity of the BA +(A+B) compared to the A2B2 complex. This could be done also for the BALLL mutant and association with A59.

      It would be extremely useful to measure the affinity of BA for A. However, this is difficult because of the high affinity of the interface. To measure a dissociation constant, one has to be able to measure the concentration of the monomer and the dimer at equilibrium. Because of this, the complex must be diluted enough to see any dissociation, making detection difficult. In practice, this also means that we cannot purify monomeric versions of these subunits. We therefore can’t perform “on-rate” study on an SPR surface, which would require flowing monomers on its partner subunit tethered to the SPR surface. However, we could perform “off-rate” studies, but the dissociation time is likely to be very long, making the measurement difficult. We have not tried it though, and it could turn out to be informative. An analysis of antibodies off-rate done in the past could provide a guideline for us to perform this experiment. Analytical ultracentrifugation is an excellent technique and could in theory provide information. In practice however it would be still necessary to dilute the complex enough to obtain significant dissociation at equilibrium, making detection difficult. As far as we are aware, analytical ultracentrifugation rely on UV absorbance for protein detection and therefore we probably would not detect our material at the necessary dilution. We are however open-minded about technique with very sensitive detection methods that could be used.

      9) In vivo relevance

      The study does not conclude on the subunits exchange in vivo, which have been suggested by earlier studies by Ikeda et al. To elaborate further on the relevance of such mechanism in the bacteria, experiments involving the fluorescent labeling of endogenous / exogenous mutant subunits may be required to provide further information on this phenomenon.

      We completely agree that the in vivo relevance of such phenomena is the central question. Addressing this directly is not trivial though. Expressing both BA and A in vivo will results in random partnering and lead to a mix of dimers: A2 (1/4), BA2(1/4) and BA.A (1/2), assuming equal interface affinity. Therefore, to see subunit exchange in the same way as in vitro, one would have to get rid of the BA2 and A2 dimer together, or the BA.A dimer only. Our initial strategy to do that would be to engineer a specific dimer as being uniquely targeted for degradation. This could allow us to “get rid” of for instance the BA.A dimer. Subsequently, we would turn off the degradation and translation together and observe the rate of subunit exchange. This is not trivial though and would be the subject of a further study.

      10) Figure 3: I guess the "intact" label refers to the supercoiled DNA (SC) ? It also appears as "uncleaved" in supp Figure 6. The same label for this topoisomer should be used throughout.

      Thank you for pointing that out. It has now been corrected.

      Bandak, A. F., T. R. Blower, K. C. Nitiss, R. Gupta, A. Y. Lau, R. Guha, J. L. Nitiss and J. M. Berger (2023). "Naturally mutagenic sequence diversity in a human type II topoisomerase." Proceedings of the National Academy of Sciences 120(28).

      Germe, T., J. Voros, F. Jeannot, T. Taillier, R. A. Stavenger, E. Bacque, A. Maxwell and B. D. Bax (2018). "A new class of antibacterials, the imidazopyrazinones, reveal structural transitions involved in DNA gyrase poisoning and mechanisms of resistance." Nucleic Acids Res.

      Gubaev, A., D. Weidlich and D. Klostermeier (2016). "DNA gyrase with a single catalytic tyrosine can catalyze DNA supercoiling by a nicking-closing mechanism." Nucleic Acids Res 44(21): 10354-10366.

      Hartmann, S., A. Gubaev and D. Klostermeier (2017). "Binding and Hydrolysis of a Single ATP Is Sufficient for N-Gate Closure and DNA Supercoiling by Gyrase." J Mol Biol 429(23): 3717-3729. Shuman, S., E. M. Kane and S. G. Morham (1989). "Mapping the active-site tyrosine of vaccinia virus DNA topoisomerase I." Proc Natl Acad Sci U S A 86(24): 9793-9797.

      Stelljes, J. T., D. Weidlich, A. Gubaev and D. Klostermeier (2018). "Gyrase containing a single C-terminal domain catalyzes negative supercoiling of DNA by decreasing the linking number in steps of two." Nucleic Acids Res.

    1. Author Response

      Reviewer #1 (Public Review):

      The goal of the current study was to evaluate the effect of neuronal activity on blood-brain barrier permeability in the healthy brain, and to determine whether changes in BBB dynamics play a role in cortical plasticity. The authors used a variety of well-validated approaches to first demonstrate that limb stimulation increases BBB permeability. Using in vivo-electrophysiology and pharmacological approaches, the authors demonstrate that albumin is sufficient to induce cortical potentiation and that BBB transporters are necessary for stimulus-induced potentiation. The authors include a transcriptional analysis and differential expression of genes associated with plasticity, TGF-beta signaling, and extracellular matrix were observed following stimulation. Overall, the results obtained in rodents are compelling and support the authors' conclusions that neuronal activity modulates the BBB in the healthy brain and that mechanisms downstream of BBB permeability changes play a role in stimulus-evoked plasticity. These findings were further supported with fMRI and BBB permeability measurements performed in healthy human subjects performing a simple sensorimotor task. While there are many strengths in this study, there is literature to suggest that there are sex differences in BBB dysfunction in pathophysiological conditions. The authors only used males in this study and do not discuss whether they would also expect to sex differences in stimulation-evoked BBB changes in the healthy brain. Another minor limitation is the authors did not address the potential impact of anesthesia which can impact neurovascular coupling in rodent studies. The authors could have also better integrated the RNAseq findings into mechanistic experiments, including testing whether the upregulation of OAT3 plays a role in cortical plasticity observed following stimulation. Overall, this study provides novel insights into how neurovascular coupling, BBB permeability, and plasticity interact in the healthy brain.

      While there are many strengths in this study, there is literature to suggest that there are sex differences in BBB dysfunction in pathophysiological conditions. The authors only used males in this study and do not discuss whether they would also expect to sex differences in stimulation-evoked BBB changes in the healthy brain.

      We agree with the reviewer regarding the importance of examining sex differences on stimulation-evoked BBB changes. To address this issue we have: (1) clarified in the methods section that the human study involved both males and females; (2) added a section to the discussion highlighting the male bias as a key limitation of our animal experiments; and (3) stated that future work should examine whether stimulation-evoked BBB changes differ between makes and females.

      Another minor limitation is the authors did not address the potential impact of anesthesia which can impact neurovascular coupling in rodent studies.

      We are grateful for this comment and agree with the reviewer that the potential effects of anesthesia should be discussed. We have added the following discussion paragraph:

      “A key limitation of our animal experiments is the fact they were performed under anesthesia, due to the complex nature of the experimental setup (i.e., simultaneous cortical imaging and electrophysiological recordings). Anesthetic agents can affect various receptors within the NVU, potentially altering neuronal activity, SEPs, CBF, and vascular responses (Aksenov et al., 2015; Lindauer et al., 1993; Masamoto & Kanno, 2012). To minimize these effects, we used ketamine-xylazine anesthesia, which unlike other anesthetics, was shown to generate robust BOLD and SEP responses to neuronal activation (Franceschini et al., 2010; Shim et al., 2018).”

      Reviewer #2 (Public Review):

      Summary:

      This study builds upon previous work that demonstrated that brain injury results in leakage of albumin across the bloodbrain barrier, resulting in activation of TGF-beta in astrocytes. Consequently, this leads to decreased glutamate uptake, reduced buffering of extracellular potassium, and hyperexcitability. This study asks whether such a process can play a physiological role in cortical plasticity. They first show that stimulation of a forelimb for 30 minutes in a rat results in leakage of the blood-brain barrier and extravasation of albumin on the contralateral but not ipsilateral cortex. The authors propose that the leakage is dependent upon neuronal excitability and is associated with an enhancement of excitatory transmission. Inhibiting the transport of albumin or the activation of TGF-beta prevents the enhancement of excitatory transmission. In addition, gene expression associated with TGF-beta activation, synaptic plasticity, and extracellular matrix are enhanced on the "stimulated" hemisphere. That this may translate to humans is demonstrated by a breakdown in the blood-brain barrier following activation of brain areas through a motor task.

      Strengths:

      This study is novel and the results are potentially important as they demonstrate an unexpected breakdown of the blood-brain barrier with physiological activity and this may serve a physiological purpose, affecting synaptic plasticity.

      The strengths of the study are:

      1) The use of an in vivo model with multiple methods to investigate the blood-brain barrier response to a forelimb stimulation.

      2) The determination of a potential functional role for the observed leakage of the blood-brain barrier from both a genetic and electrophysiological viewpoint.

      3) The demonstration that inhibiting different points in the putative pathway from activation of the cortex to transport of albumin and activation of the TGF-beta pathway, the effect on synaptic enhancement could be prevented.

      4) Preliminary experiments demonstrating a similar observation of activity-dependent breakdown of the blood-brain barrier in humans.

      Weaknesses:

      There are both conceptual and experimental weaknesses.

      1) The stimulation is in an animal anesthetized with ketamine, which can affect critical receptors (ie NMDA receptors) in synaptic plasticity.

      We agree that the potential effects of anesthesia should be considered. The Discussion was revised to address this point: “A key limitation of our animal experiments is the fact they were performed under anesthesia, due to the complex nature of the experimental setup (i.e., simultaneous cortical imaging and electrophysiological recordings). Anesthetic agents can affect various receptors within the NVU, potentially altering neuronal activity, SEPs, CBF, and vascular responses (Aksenov et al., 2015; Lindauer et al., 1993; Masamoto & Kanno, 2012). To minimize these effects, we used ketamine-xylazine anesthesia, which unlike other anesthetics, was shown to generate robust BOLD and SEP responses to neuronal activation (Franceschini et al., 2010; Shim et al., 2018)”

      2) The stimulation protocol is prolonged and it would be helpful to know if briefer stimulations have the same effect or if longer stimulations have a greater effect ie does the leakage give a "readout" of the stimulation intensity/length.

      Thank you for this important comment. We are also very curious about the potential relationship between stimulation magnitude/duration and subsequent leakage and have added the following statement to the discussion:

      “Future studies should also explore the effects of stimulation magnitude/duration on BBB modulation, as well as the stimulation threshold between physiological and pathological increase in BBB permeability.”

      Our current findings indicate that a one-minute stimulation does not affect vascular permeability or SEP and we aim to test additional stimulation paradigms in future studies.

      3) For some of the experiments (see below), the numbers of animals are low and the statistical tests used may not be the most appropriate, making the results less clear cut.

      We appreciate this comment and have revised the statistical analysis of Figure 1J,K. We now use a nested t-test to test for differences between rats (as opposed to sections). The differences remain significant (EB, p=0.0296; Alexa, p=0.0229). The text was modified accordingly.

      4) The experimental paradigms are not entirely clear, especially the length of time of drug application and the authors seem to try to detect enhancement of a blocked SEP.

      Thank you for pointing this out. Figures 2&3 were revised for clarification and a ‘Drug Application’ subsection was added to the methods section.

      5) It is not clear how long the enhancement lasts. There is a remark that it lasts longer than 5 hours but there is no presentation of data to support this.

      Thank you for this comment. As the length of experiments differed between animals, the exact length could not be specifically stated. To clarify this point, we revised the text to indicate that LTP was recorded until the end of each experiment (between 1.5-5 hours, depending on the condition the animal was in). We also added a panel to figure 2 (Figure 2d) with exemplary data showing potentiation 60, 90, and 120 min post stimulation.

      6) The spatial and temporal specificity of this effect is unclear (other than hemispheric in rats) and even less clear in humans.

      Our animal experiments (using both in vivo imaging and histological analysis) showed no evidence of BBB modulation outside the cortical somatosensory area corresponding to the limbs. We looked at the entirety of the coronal section of the brain and found enhancement solely in the somatosensory area corresponding to limb. The right side of panels h and i in Figure 1 show an x20 magnification of the section, focusing on the enhanced area. The whole section was not shown, as no fluorescence was found outside the magnified area. Moreover, our quantification showed that the enhancement was specific to the contralateral and not ipsilateral somatosensory cortex (Figure 1 j-k).

      We agree that temporal specificity needs to be further explored, and we have now stated that in the discussion: “Future studies are needed to explore the BBB modulating effects of additional stimulation protocols – with varying durations, frequencies, and magnitudes. Such studies may also elucidate the temporal and ultrastructural characteristics that may differentiate between physiological and pathological BBB modulation.”

      We also agree that larger studies are needed to better understand the specificity of the observed effect in humans, and to account for potential inter-human variability in vascular integrity and brain function due to different schedules, diets, exercise habits, etc.

      8) The experimenters rightly use separate controls for most of the experiments but this is not always the case, also raising the possibility that the application of drugs was not done randomly or interleaved, but possibly performed in blocks of animals, which can also affect results.

      Thank you for pointing out this lack of clarity. We have now highlighted that drug application was done randomly.

      9) Methyl-beta-cyclodextrin clears cholesterol so the effect on albumin transport is not specific, it could be mediating its effect through some other pathway.

      We agree that the effect of mβCD may not be specific. To mitigate this issue, we used a very low mβCD concentration (10uM). Notably, this is markedly lower than the concentrations reported by Koudinov et al, showing that cholesterol depletion is observed at 5mM mβCD and not at 2.5mM/5mM (Koudinov & Koudinova, 2001). This point was added to the discussion.

      10) Since the breakdown of the blood-brain barrier can be inhibited by a TGF-beta inhibitor, then this implies that TGFbeta is necessary for the breakdown of the blood-brain barrier. This does not sit well with the hypothesis that TGF-beta activation depends upon blood-brain barrier leakage.

      Thank you for pointing out this lack of clarity. We have added a discussion paragraph that clarifies our hypothesis: “As mentioned above, albumin is a known activator of TGF-β signaling, and TGF-β has a well-established role in neuroplasticity. Interestingly, emerging evidence suggests that TGF-β also increases cross-BBB transcytosis (Betterton et al., 2022; Kaplan et al., 2020; McMillin et al., 2015; Schumacher et al., 2023). Hence, we propose the following two-part hypothesis for the TGF-β/BBB-mediated synaptic potentiation observed in our experiments: (1) prolonged stimulation triggers TGF-β signaling and increased caveolae-mediated transcytosis of albumin; and (2) extravasated albumin induces further TGF-β signaling, leading to synaptogenesis and additional cross-BBB transport – in a self-reinforcing positive feedback loop. Future research is needed to examine the validity of this hypothesis.

      Reviewer #3 (Public Review):

      Summary:

      This study used prolonged stimulation of a limb to examine possible plasticity in somatosensory evoked potentials induced by the stimulation. They also studied the extent that the blood-brain barrier (BBB) was opened by prolonged stimulation and whether that played a role in the plasticity. They found that there was potentiation of the amplitude and area under the curve of the evoked potential after prolonged stimulation and this was long-lasting (>5 hrs). They also implicated extravasation of serum albumin, caveolae-mediated transcytosis, and TGFb signalling, as well as neuronal activity and upregulation of PSD95. Transcriptomics was done and implicated plasticity-related genes in the changes after prolonged stimulation, but not proteins associated with the BBB or inflammation. Next, they address the application to humans using a squeeze ball task. They imaged the brain and suggested that the hand activity led to an increased permeability of the vessels, suggesting modulation of the BBB.

      Strengths:

      The strengths of the paper are the novelty of the idea that stimulation of the limb can induce cortical plasticity in a normal condition, and it involves the opening of the BBB with albumin entry. In addition, there are many datasets and both rat and human data.

      Weaknesses:

      The conclusions are not compelling however because of a lack of explanation of methods and quantification. It also is not clear whether the prolonged stimulation in the rat was normal conditions. To their credit, the authors recorded the neuronal activity during stimulation, but it seemed excessive excitation. Since seizures open the BBB this result calls into question one of the conclusions. that the results reflect a normal brain. The authors could either conduct studies with stimulation that is more physiological or discuss the caveats of using a supraphysiological stimulus to infer healthy brain function.

      The conclusions are not compelling however because of a lack of explanation of methods and quantification.

      Thank you for this comment. In the revised paper, we expanded the Methods section to better describe the procedures and approaches we used for data analysis.

      It also is not clear whether the prolonged stimulation in the rat was normal conditions.

      We believe that the used stimulation protocol is within the physiological range (and relevant to plasticity, learning and memory) for the following reasons:

      1) In our continuous electrophysiological recordings, we did not observe any form of epileptiform or otherwise pathological activity.

      2) Memory/training/skill acquisition experiments in humans often involve similar training duration or longer (Bengtsson et al., 2005), e.g., a 30 min thumb training session performed by (Classen et al., 1998).

      3) The levels of SEP potentiation we observed are similar to those reported in:

      a) Rats following a 10-minute whisker stimulation (one hour post stimulation, (Mégevand et al., 2009)).

      b) Humans following a 15 min task (McGregor et al., 2016).

      This important point is now presented in the discussion.

      Reviewer #1 (Recommendations For The Authors):

      The discussion would benefit from additional discussion of the potential impacts of sex and anesthesia in their findings.

      We agree with the reviewer and have added the following paragraph to the discussion:

      “A key limitation of our animal experiments is the fact they were performed under anesthesia, due to the complex nature of the experimental setup (i.e., simultaneous cortical imaging and electrophysiological recordings). Anesthetic agents can potentially alter neuronal activity, SEPs, CBF, and vascular responses (Aksenov et al., 2015; Lindauer et al., 1993; Masamoto & Kanno, 2012). To minimize these effects, we used ketaminexylazine anesthesia, which unlike other anesthetics, was shown to maintain robust BOLD and SEP responses to neuronal activation (Franceschini et al., 2010; Shim et al., 2018). Another limitation of our animal study is the potentially non-specific effect of mβCD – an agent that disrupts caveola transport but may also lead to cholesterol depletion (Keller & Simons, 1998). To mitigate this issue, we used a very low mβCD concentration (10uM), orders of magnitude below the concentration reported to deplete cholesterol (Koudinov et al). Lastly, our animal study is limited by the inclusion of solely male rats. While our findings in humans did not point to sex-related differences in stimulation-evoked BBB modulation, larger animals and human studies are needed to examine this question.”

      The figure text is quite small.

      Thank you for pointing this out, we revised all figures and increased font size for clarity.

      Including pharmacological concentrations within the figure legends would improve the readability of the manuscript.

      Thank you for this suggestion, the figure legends were modified accordingly.

      In methods for immunoassays the 5 groups could be more clear by stating that there are 3 timepoints for stimulation experiments. There is a typo in this section where the 24-hour post is stated twice in the same sentence.

      Thank you for pointing this out, the text was modified accordingly.

      Reviewer #2 (Recommendations For The Authors):

      1) In Figure 1, J and K seem to indicate that in these experiments the statisitics were done per slice and not per animal. This is not a reasonable approach, a repeat measure ANOVA or averaging for each animal are more appropriate statistical approaches.

      We thank the reviewer for pointing this out. The statistical analysis for Figure 1j,k was modified. We now use a nested ttest to test for differences between rats and not sections. The differences are still significant (EB, p=0.0296; Alexa, p=0.0229). The manuscript was modified accordingly.

      2) In Figure 2, the protocol does not seem to give much idea about time course. There was a stimulation test for 1 minute before and then 1 minute after the 30-minute stimulation train. How was potentiation assessed for the next 5 hours and where are the data?

      Potentiation was assessed by repeating 1min test stim every 30 min for the duration of the experiment, we added a panel to show late potentiation, see response above.

      3) In Figure 2, there is a notable lack of controls eg the effect of sham stimulation and application of saline. These are important as the drift of response magnitude can be a problem in long experiments.

      We did test for the potential presence of response drift, by examining whether SEPs of non-stimulated animals change over time (at baseline, 30 or 60 minutes of recording; n=6). No statistical differences were found. Our analysis focused on using each animal as its own control (i.e., comparing baseline SEP to SEP post albumin perfusion), because SEP studies highlight the importance of comparing each animal to its own baseline, due to the large inter-animal variability (All et al., 2010; Mégevand et al., 2009; Zandieh et al., 2003).

      4) Figure 3 a is not clear – were the drugs applied throughout?

      Thank you for pointing this out. We have revised Figure 3 a to show that the drugs were applied for 50 min before the stimulation.

      5) In Figure 3 panel d is repeated in panel j. This needs correcting

      Thank you. This mistake was fixed.

      6) In LTP-type experiments usually the antagonist is applied during the stimulation and then washed out. This avoids the problem in this figure in which CNQX effectively blocks transmission and so it is not possible to detect any enhancement if it were there. Eg in panel e, CNQX block transmission, and then the assessment is performed when the AMPA receptors are blocked after 30 minutes of stimulation. If receptors are blocked no enhancement will be detectable. Moreover, surely the question is the ratio of the effect of 30-minute stimulation on the SEP in the presence of CNQX and so the statistics should be done on the fold change in the SEP following 30-minute stimulation in the presence of CNQX.

      Thank you. The protocol might have been misrepresented in the original figure. We modified Fig 3a to clarify that the antagonists were indeed washed out upon stimulation start to make sure the receptors are not blocked during the test stimulation following the 30 min stimulation. In addition, we tested for the difference in fold change between 30 min stim, and 30 min stimulation following antagonists wash-in (Fig 3f and Fig S2a).

      7) Interesting in Figure f, stimulation, albumin, and AP5 all seem to have the same enhancement of the SEP. Is the lack of effect of 30-minute stimulation in the presence of AP5, a ceiling effect ie AP5 has enhanced the SEP, and no further enhancement from stimulation is possible.

      This is a very interesting point that will require further research.

      8) SJN seems to block neurotransmission. What is the mechanism? The same analysis as for CNQX should be performed ie what is the fold change not compared to baseline but in the presence of SJN.

      Our quantification showed that SJN did not significantly reduce the SEP max amplitude, and we therefore did not include this graph in the figure.

      9) Please acknowledge that the effect of mbetaCD is non-specific. There is a large literature on the effects of cholesterol depletion on LTP.

      We agree that the effect of mβCD may not be specific. To mitigate this issue, we used a very low mβCD concentration (10µM). Notably, this is markedly lower than the concentrations reported by Koudinov et al, showing that cholesterol depletion is only observed at a concentration of 5mM (Koudinov & Koudinova, 2001). This point is now discussed under the discussion paragraph describing the study’s limitations.

      10) k&l seem to have used the same control in which case they should not be analysed separately (they are all part of the same experiment).

      We agree with the reviewer and have revised the figure accordingly.

      11) The difference in gene expression in Figure 4 would be more convincing if it could be prevented by for example a TGFbeta inhibitor.

      We agree and acknowledge the impact such experiments could provide. We plan to incorporate these experiments into our future studies.

      12) Figure 5 seems to indicate bilateral and widespread BBB modulation arguing that this may be a non-specific effect. Panel g should look at other neocortical regions eg occipital cortex.

      We agree and thank the reviewer for this comment. We revised the figure to include other cortical areas, such as the frontal and occipital cortices (Figure 5g)

      Minor comments

      1) Paired data eg in Fig 2D are better represented by pairing the dots usually with a line.

      2) Please correct the %fold baseline in axes in graphs which show % change for baseline.

      3) Figure 4 is not correctly referred to in the text.

      We agree with all the points raised by the reviewer and revised the figures and text accordingly.

      Reviewer #3 (Recommendations For The Authors):

      The conclusions are not compelling however because of a lack of explanation of methods and quantification. It also is not clear whether the prolonged stimulation in the rat was normal conditions. To their credit, the authors recorded the neuronal activity during stimulation, but it seemed excessive excitation. Since seizures open the BBB this result calls into question one of the conclusions. that the results reflect a normal brain. The authors could either conduct studies with stimulation that is more physiological or discuss the caveats of using a supraphysiological stimulus to infer healthy brain function.

      Major concerns:

      Methods need more explanation. Rationales need more justification. Examples are provided below.

      Throughout many sections of the paper, sample sizes and stats are often missing. For stats, please provide p-values and other information (tcrit, U statistic, F, etc.)

      Thank you, we added the relevant information where it was missing throughout the manuscript.

      For transcriptomics, they might have found changes in BBB-related genes if they assayed vessels but they assayed the cortex.

      We agree with the reviewer that this would be a very interesting future direction. The present study could not include this kind of analysis due to lack of access to vasculature isolation methods or single-cell RNA seq.

      What were the inclusion/exclusion criteria for the subjects?

      Thank you for pointing out this lack of clarity. The methods section (under ‘Magnetic Resonance Imaging’ – ‘Participants’) was expanded to include the following:

      “Male and female healthy individuals, aged 18-35, with no known neurological or psychiatric disorders were recruited to undergo MRI scanning while performing a motor task (n=6; 3 males and 3 females). MRI scans of 10 sex- and age- matched individuals (with no known neurological or psychiatric disorders) who did not perform the task were used as control data (n=10; 5 males and 5 females.

      Were they age and sex-matched?

      They were, indeed, age and sex-matched. This was now clarified in the relevant Methods section.

      Were there other factors that could have influenced the results?

      Certainly. Human subjects are difficult to control for due to different schedules, diets, exercise habits, and other factors that may impact vascular integrity and brain function. Larger multimodal studies are needed to better understand the observed phenomenon.

      Fig. 1. Images are very dim. Text here and in other figures is often too small to see. Some parts of the figures are not explained.

      Our apologies. Figures and legends were revised accordingly.

      Fig 2a, f. I don't see much difference here- do the authors think there was?

      We agree that the difference may not be visually obvious. The quantification of trace parameters (amplitude and area under curve) does, however, reveal a significant SEP difference in response to both stimulation (panels X and y) and albumin (panels z and q).

      Fig 3 d and j seem the same.

      We thank the reviewer for noticing. This was a copy mistake that was now rectified.

      Lesser concerns and examples of text that need explana9on:

      Introduction

      Insulin-like growth factor is transported. From where to where?

      The text was edited to clarify that this was cross-BBB influx of insulin-like growth factor-I.

      RMT that underlies the transport of plasma proteins was induced by physiological or non-physiological stimulation.

      This was shown without stimulation, in normal physiology of young and aged healthy mice. The text was edited to clarify this point.

      What was the circadian modulation that was shown to implicate BBB in brain function?

      The text was edited for clarity.

      Results

      When the word stimulation is used please be specific if whiskers are moved by an experimenter, an electrode is used to apply current, etc.

      We have now moved the ‘Stimulation protocol’ section closer to beginning of the Methods and emphasized that we administered electrical stimulation to the forepaw or hindlimb using subdermal needle electrodes.

      Please explain how the authors are convinced they localized the vascular response.

      The vascular response was localized via: (1) visual detection of arterioles that dilated in response to stimulation (due to functional hyperemia / neurovascular coupling) [figure 1 d]; and (2) quantitative mapping of increased hemoglobin concentration (Bouchard et al., 2009) [Figure 1 b]. This is now mentioned in the methods (under ‘In vivo imaging’) and results (under the ‘Stimulation increases BBB permeability’).

      "30 min of limb stimulation" means what exactly? 6 Hz 2mA for 30 min?

      Thank you. The text was revised for clarity (Methods under ‘Stimulation protocol’):

      “The left forelimb or hind limb of the rat was stimulated using Isolated Scmulator device (AD Instruments) attached with two subdermal needle electrodes (0.1 ms square pulses, 2-3 mA) at 6 Hz frequency. Test stimulation consisted of 360 pulses (60 s) and delivered before (as baseline) and after long-duration stimulation (30 min, referred throughout the text as ‘stimulation’). In control and albumin rats, only short-duration stimulations were performed. Under sham stimulation, electrodes were placed without delivering current.”

      Histology that was performed to confirm extravasation needs clarification because if tissue was removed from the brain, and fixed in order to do histology, what is outside the vessels would seem likely to wash away.

      Thank you for pointing out the need to clarify this point. The Histology description in the Methods section was revised in the following manner:

      “Albumin extravasacon was confirmed histologically in separate cohorts of rats that were anesthetized and stimulated without craniotomy surgery. Assessment of albumin extravasacon was performed using a well-established approach that involves peripheral injection of either labeled-albumin (bovine serum albumin conjugated to Alexa Flour 488, Alexa488-Alb) or albumin-labeling dye (Evans blue, EB – a dye that binds to endogenous albumin and forms a fluorescent complex), followed by histological analysis of brain tissue (Ahishali & Kaya, 2020; Ivens et al., 2007; Lapilover et al., 2012; Obermeier et al., 2013; Veksler et al., 2020). Since extravasated albumin is taken up by astrocytes (Ivens et al., 2007; Obermeier et al., 2013), it can be visualized in the brain neuropil after brain removal and fixation (Ahishali & Kaya, 2020; Ivens et al., 2007; Lapilover et al., 2012; Veksler et al., 2020). Five rats were injected with Alexa488-Alb (1.7 mg/ml) and five with EB (2%, 20 mg/ml, n=5). The injections were administered via the tail vein. Following injection, rats were transcardially perfused with…”

      It is not clear why there was extravasacon contralateral but not ipsilateral if there are cortical-cortical connections.

      Interpersonally, we also did not observe ipsilateral SEP in response to limb stimulation, with evidence of SEP and BBB permeability only in the contralateral sensorimotor region. This finding is consistent with electrophysiological and fMRI studies showing that peripheral stimulation results in predominantly contralateral potentials (Allison et al., 2000; Goff et al., 1962).

      After injection of Evans blue or Alexa-Alb, how was it shown that there was extravasacon?

      Extravasalon in cortical sections was visualized using a fluorescent microscope (Figure 1 h-i). Since extravasated albumin is taken up by astrocytes, fluorescent imaging can be used for visualizing and quantifying labeled albumin (Ahishali & Kaya, 2020; Ivens et al., 2007; Knowland et al., 2014). Here is the relevant methods excerpt:

      “Coronal sections (40-μm thick) were obtained using a freezing microtome (Leica Biosystems) and imaged for dye extravasacon using a fluorescence microscope (Axioskop 2; Zeiss) equipped with a CCD digital camera (AxioCam MRc 5; Zeiss).”

      How is a sham control not stimulated - what is the sham procedure?

      In the sham stimulation protocol electrodes were placed, but current was not delivered. A section titled ‘Stimulation protocol’ was added to the methods to clarify this point.

      What was the method for photothrombosis-induced ischemia?

      The procedure for photothrombosis-induced ischemia is described under the Methods section ‘Immunoassays’ – ‘Enzyme-linked immunosorbent assay (ELISA) for albumin extravasalon’:

      “Rats were anesthetilzed and underwent … photothrombosis stroke (PT) as previously described (Lippmann et al., 2017; Schoknecht et al., 2014). Briefly, Rose Bengal was administered intravenously (20 mg/kg) and a halogen light beam was directed for 15 min onto the intact exposed skull over the right somatosensory cortex.”

      Fig 1d. All parts of d are not explained.

      Thank you for pointing this out. In the revised manuscript, the panels of this figure were slightly reordered, and we made sure all panels are explained in the legend.

      e. Is the LFP a seizure? How physiological is this- it does not seem very physiological.

      Thank you for your comment. We believe that this activity is not a seizure because it lacks the typical slow activity that corresponds to the “depolarizalon shir” observed during seizures (Ivens et al., 2007; Milikovsky et al., 2019; Zelig et al., 2022).

      f. Permeability index needs explanation. How was the area chosen for each rat? Randomly? Was it the same across rats?

      We have now revised the Methods section to provide a clearer description of the permeability index calculation and the choice of the imaging area:

      “Across all experiments, acquired images were the same size (512 × 512 pixel, ~1x1 mm), centered above the responding arteriole. Images were analyzed offline using MATLAB as described (Vazana et al., 2016). Briefly, image registration and segmentation were performed to produce a binary image, separating blood vessels from extravascular regions. For each extravascular pixel, a time curve of signal intensity over time was constructed. To determine whether an extravascular pixel had tracer accumulation over time (due to BBB permeability), the pixel’s intensity curve was divided by that of the responding artery (i.e., the arterial input function, AIF, representing tracer input). This ratio was termed the BBB permeability index (PI), and extravascular pixels with PI > 1 were identified as pixels with tracer accumulation due to BBB permeability.”

      g. For Evans blue and Alexa-Alb was the sample size rats or sections?

      Thank you for this question. We revised the statistical analysis for Figure 1j,k to appropriately asses the differences between rats. We used a nested t-test to test for differences between rats (and not sections). The differences remained significant (EB, p=0.0296; Alexa, p=0.0229) and the text was modified accordingly.

      h, i, j need more contrast and/or brightness to appreciate the images. Arrows would help. The text is too small to read.

      Thank you. This issue was addressed in the revised paper.

      To induce potentiation, 6 Hz 2 mA stimuli were used for 30 min. Please justify this as physiological.

      Thank you for the comment. We believe that the used stimulation protocol is within the physiological range (and relevant to plasticity, learning and memory) for the following reasons:

      1. In our continuous electrophysiological recordings, we did not observe any form of epileptiform or otherwise pathological activity.

      2. Memory/training/skill acquisition experiments in humans often involve similar training duration or longer (Bengtsson et al., 2005), e.g., a 30 min thumb training session performed by (Classen et al., 1998).

      3. The levels of SEP potentiation we observed are similar to those reported in:

      a. Rats following a 10-minute whisker stimulation (one hour post stimulation, (Mégevand et al., 2009)).

      b. Humans following a 15 min task (McGregor et al., 2016).

      We have revised the Discussion of the paper to clarify this important point.

      The test stimulus to evoke somatosensory evoked potentials was 1 min. Was this 6 Hz 2 mA for 1 min? Please justify.

      Yes. We chose these parameters as these ranges were shown to induce the largest changes in blood flow (with laserdoppler flowmetry) and summated SEP (Ngai et al., 1999), corresponding with our findings. We also show that these stimulation parameters do not induce changes in BBB permeability nor synaptic potentiation, therefore served as test control.

      How long after the 30 min was the test stimulus triggered- immediately? 30 sec afterwards?

      The test stimulus was applied 5 min afterwards to allow for BBB imaging protocol (now explained in the Methods section).

      How were amplitude and AUC measured? Baseline to peak? For AUC is it the sum of the upward and downward deflections comprising the LFP?

      Yes, and yes. This is now clarified in the ‘Analysis of electrophysiological recordings’ section in the Methods.

      How was the same site in the somatosensory cortex recorded for each animal?<br /> Potentiation was said to last >5 hrs. How often was it measured? Was potentiation the same for the amplitude and the AUC?

      The location of the cranial window over the somatosensory cortex was the same in all rats. The location of the specific responding arteriole may change between animals, but the recording electrode was places around the responding arteriole in the same approaching angle and depth for all animals.

      As the length of experiments differed between animals, the exact length could not be specifically stated. We therefore revised the text to clarify that LTP was recorded until the end of each experiment (depending on the animal condition, between 1.5-5 hours) and added a panel to figure 2 (Figure 2f) with exemplary data showing potentiation 120 min (2hr) post stimulation.

      Why was 25% of the serum level of albumin selected- does the brain ever get exposed to that much? Was albumin dissolved in aCSF or was aCSF chosen as a control for another reason?

      Yes, albumin was dissolved in aCSF and the solution was allowed to diffuse through the brain. The relatively high concentration of albumin was chosen to account for factors that lower its effective tissue concentration:

      1. The low diffusion rate of albumin (Tao & Nicholson, 1996).

      2. The likelihood of albumin to encounter a degradation site or a cross-BBB efflux transporter (Tao & Nicholson, 1996; Zhang & Pardridge, 2001).

      Figure 2.

      a. Please show baseline, the stimulus, and aftier the stimulus.

      Please point out when there was stimulacon.

      What is the inset at the top?

      The inset on top is the example trace of the stimulus waveform, the legend of the figure was modified for clarity.

      b. Please show when the stimulus artifact occurred. The end of the 1-minute test stimulus period is fine. Why are the SEPs different morphologies? It suggests the different locations in the cortex were recorded.

      What is shown is the averaged SEP response over 1min test stimulus, each SEP is time locked to each stimulus. Regarding SEP waveform, it does indeed show different morphology between animals, as sometimes different arterioles respond to the stimulation, and we localize the recording to the responding vessel in each rat. However, in each rat the recording is only from one location. Once the electrode was positioned near the responding arteriole it was not moved.

      d, e. What are the stats?

      h, i. Add stats. Are all comparisons Wilcoxon? Please provide p values.

      The comparisons were performed with the Wilcoxon test. We now state that and provide the exact p values.

      j. What was selected from the baseline and what was selected during Albumin and how long of a record was selected?

      What program was used to create the spectrogram?

      What is meant by changes at frequencies above 200 Hz, the frequencies of HFOs?

      The Method section (under ‘Electrophysiology – Data acquisition and analyses’) has been revised for clarification. Spectrogram was created with MATLAB and graphed with Prism. For analysis, we selected a 10 min recorded segment before starting albumin perfusion, and 10 min after terminating albumin perfusion.

      When the cortial window was exposed to drugs, what were concentrations used that were selective for their receptor? How long was the exposure?

      Was the vehicle tested?

      We have revised the Methods section (under ‘Animal preparation and surgical procedures - Drug application’) to clarify the duration and concentration used and justification. All blockers were exposed for 50 min. The vehicle was an artificial cerebrospinal fluid solution (aCSF).

      For PSD-95, what was the area of the cortex that was tested?

      Were animals acutely euthanized and the brain dissected, frozen, etc?

      We have revised the Methods section (under ‘Immunoassays’) for clarity.

      What is mbetaCD?

      The full term was added to the results section. It is also mentioned in the Methods.

      Is SJN specific at the concentration that was chosen? Did it inhibit the SEP?

      In the concentration used in our experiments, SJN is a selective TGF-β type I receptor ALK5 inhibitor (see (Gellibert et al., 2004)).

      Fig. 3b. It looks like CNQX increased the width of the vessels quite a bit. Please explain.

      For AP5, very large vessels were imaged, making it hard to compare to the other data.

      The vascular dilation in response to the stimulation under CNQX was similar to that seen under “normal” conditions (i.e. aCSF). As for AP5, in some experiments the responding arteriole was in close proximity to a large venule that cannot be avoidable while imaging. For quantification we always measured arterioles within the same diameter range.

      e. Sometimes CNQX did not block the response after 30 min stimulation. Why?

      CNQX is washed out before the 30 min stimulation starts, so it is not expected to block the response to stimulation. However, in some cases the response to stimulation was lower in amplitude, likely due to residual CNQX that did not wash out completely.

      Regarding DEGs, on the top of p 10 what are the percentages of?

      In this analysis we tested in each hemisphere how many genes expressed differentially between 1 and 24 hours post stimulation (either up- or down- regulated). The results were presented as the percentages of differentially expressed genes in each hemisphere (13.2% contralateral, and 7.3% ipsilateral). The text was rephrased for clarity.

      Please add a ref for the use of the JSD metric methods and support for its use as the appropriate method. Other methods need explanation/references.

      References were added to the text to clarify. The Jensen-Shannon Divergence metric is commonly used to calculate the statistical pairwise distance among two distributions (Sudmant et al., 2015). From comparing a few different distance metric calculations including JSD, our results were similar irrespective of the distance metric applied. Therefore, we demonstrate the variability between paired samples of stimulated and non-stimulated cortex of each animal at two time points following stimulation (24 h vs. 1 h) using JSD.

      What synaptic plasticity genes were selected for assay and what were not?

      What does "largely unaffected" mean? Some of the genes may change a small amount but have big functional effects.

      The selected genes of interest were taken from a large list compiled from previous publications (see (Cacheaux et al., 2009; Kim et al., 2017)) and are well documented in gene ontology databases and tools (e.g., Metascape, (Zhou et al., 2019)).

      We agree that the term ‘largely unaffected’ is suboptimal, and we rephrased this section of the results to indicate that “No significant differences were found in BBB or inflammation related genes between the hemispheres”. We also agree that a small number of genes can have big functional effects. Future studies are needed to better understand the genes underlying the observed BBB modulation.

      Please note that Slc and ABCs are not only involved in the BBB.

      Thank you. We modified the text to no longer specify that these are BBB-specific transporters.

      Please explain the choice of the stress ball squeeze task, and DCE.

      DCE is a well-established method for BBB imaging in living humans, and it is cited throughout the manuscript. The ball squeeze task was chosen as it is presumed to involve primarily sensory motor areas, without high-level processing (Halder et al., 2005). This is now stated in the discussion.

      What is Gd-DOTA?

      Gd-DOTA is a gadolinium-based contrast agent (gadoterate meglumine, AKA Dotarem). Text was revised for clarity. Please see the Methods section under ‘Magnetic Resonance Imaging’ - ‘Data Acquisition’.

      What does a higher percentage of activated regions mean- how was activacon defined and how were regions counted?

      Higher percentage of activated regions refers to regions in which voxels showed significant BOLD changes due to the motor task preformed. The statistical approaches and analyses are detailed in the Methods section under ‘Magnetic Resonance Imaging - Preprocessing of functional data, and fMRI Localizer Motor Task’.

      Figure. 4

      Was stimulation 1 min or 30 min.?

      30 min, Text has been revised for clarity.

      What is the Wald test and how were p values adjusted-please add to the Stats section.

      The Methods section under ‘Statistical analysis’ was revised to clarify this point.

      Is there a reason why p values are sometimes circles and otherwise triangles?

      The legend was revised to explain that ”Circles represent genes with no significant differences between 1 and 24 h poststimulation. Upward and downward triangles indicate significantly up- and down- regulated genes, respectively.”

      How can a p-value be zero? Please explain abbreviations.

      The p-value is very low (~10-10) and therefore appears to be zero due to the scale of the y-axis.

      Fig. 5b.

      There are unexplained abbreviations.

      The x on the ball and hand is not clear relative to the black ball and hand.

      Thank you for noticing. We revised the figure for clarity.

      c. What was the method used to make an activator map and what is meant by localizer task?

      The explanation of the “fMRI Localizer Motor Task” section in the methods was revised for added clarity.

      f. What is the measurement "% area" that indicates " BBB modulation"?

      Is it in f, the BBB permeable vessels (%)? f. Please explain: "Heatmap of BBB modulated voxels percentage in motor/sensory-related areas of task vs. controls."

      The %area measurement indicates the percentage of voxels within a specific brain region that have a leaky BBB. See Methods.

      Is Task - the control?

      Yes.

      Supplemental Fig. 2.

      Why is AUC measured, not amplitude?

      The amplitude, and now also the AUC are shown in Figure 3.

      b. There is no comparison to baseline. The arrowhead points to the start of stimulation but there is no arrowhead marking the end.

      In the revised paper we added a grey shade over the stimulation period to better visualize the difference to baseline. In this panel we wanted to show that NMDA receptor antagonist did not block the SEP, while AMPA receptor antagonist did.

      c. In the blot there are two bands for PSD95- which is the one that is PSD95? There is no increase in PSD95 uncl 24 hrs but in the graph in d there is. In the blot, there is a strong expression of PSD95 ipsilateral compared to contralateral in the sham-why?

      What is the percent change fold?

      The PSD-95 is the top and larger band. The lower band was disregarded in the analysis. The example we show may not fully reflect the group statistics presented in panel d. Upon quantification of 8 animals, PSD-95 is significantly higher 30 min and 24 hours post stimulation in the contralateral hemisphere. No significant changes were found in sham animals. The % change fold refers to the AUC change compared to baseline. This panel was now incorporated in Figure 3 (panel h), and the title was corrected to “|AUC|, % change from baseline”.

      Supplemental Fig. 4.

      a. If ipsilateral and contralateral showed many changes why do the authors think the effects were only contralateral?

      Our gene analysis was designed to complement our in vivo and histological findings, by assessing the magnitude of change in differentially expressed genes (DEGs). This analysis showed that: (1) the hemisphere contralateral to the stimulus has significantly more DEGs than the ipsilateral hemisphere; and (2) the DEGs were related to synaptic plasticity and TGF-b signaling. These findings strengthen the hypothesis raised by our in vivo and histological experiments.

      Supplemental Fig. 5 includes many processes not in the results. Examples include dorsal cuneate and VPL, dynamin, Kir, mGluR, etc. The top right has numbers that are not mentioned. If the drawings are from other papers they should be cited.

      The drawings of Figure 5 are original and were not published before. This hypothesis figure points to mechanisms that may drive the phenomena described in the paper. The legend of the figure was revised to include references to mechanisms that were not tested in this study.

      Papers referenced in this letter:

      Ahishali, B., & Kaya, M. (2020). Evaluation of Blood-Brain Barrier Integrity Using Vascular Permeability Markers: Evans Blue, Sodium Fluorescein, Albumin-Alexa Fluor Conjugates, and Horseradish Peroxidase. Methods in Molecular Biology, 2367, 87–103. https://doi.org/10.1007/7651_2020_316

      Aksenov, D. P., Li, L., Miller, M. J., Iordanescu, G., & Wyrwicz, A. M. (2015). Effects of anesthesia on BOLD signal and neuronal activity in the somatosensory cortex. Journal of Cerebral Blood Flow and Metabolism, 35(11), 1819–1826. https://doi.org/10.1038/jcbfm.2015.130

      All, A. H., Agrawal, G., Walczak, P., Maybhate, A., Bulte, J. W. M., & Kerr, D. A. (2010). Evoked potential and behavioral outcomes for experimental autoimmune encephalomyelitis in Lewis rats. Neurological Sciences, 31(5), 595–601. https://doi.org/10.1007/s10072-010-0329-y

      Allison, J. D., Meador, K. J., Loring, D. W., Figueroa, R. E., & Wright, J. C. (2000). Functional MRI cerebral activation and deactivation during finger movement. Neurology, 54(1), 135–142. https://doi.org/10.1212/wnl.54.1.135

      Bengtsson, S. L., Nagy, Z., Skare, S., Forsman, L., Forssberg, H., & Ullén, F. (2005). Extensive piano practicing has regionally specific effects on white matter development. Nature Neuroscience, 8(9), 1148–1150. https://doi.org/10.1038/nn1516

      Betterton, R. D., Abdullahi, W., Williams, E. I., Lochhead, J. J., Brzica, H., Stanton, J., Reddell, E., Ogbonnaya, C., Davis, T. P., & Ronaldson, P. T. (2022). Regula/on of Blood-Brain Barrier Transporters by Transforming Growth Factor-β/Activin Receptor-Like Kinase 1 Signaling: Relevance to the Brain Disposition of 3-Hydroxy-3-Methylglutaryl Coenzyme A Reductase Inhibitors (i.e., Sta/ns). Drug Metabolism and Disposition, 50(7), 942–956. https://doi.org/10.1124/dmd.121.000781

      Bouchard, M. B., Chen, B. R., Burgess, S. A., & Hillman, E. M. C. (2009). Ultra-fast multispectral optical imaging of cortical oxygenation, blood flow, and intracellular calcium dynamics. Optics Express, 17(18), 15670. https://doi.org/10.1364/oe.17.015670

      Cacheaux, L. P., Ivens, S., David, Y., Lakhter, A. J., Bar-Klein, G., Shapira, M., Heinemann, U., Friedman, A., & Kaufer, D. (2009). Transcriptome profiling reveals TGF-β signaling involvement in epileptogenesis. Journal of Neuroscience, 29(28), 8927–8935. https://doi.org/10.1523/JNEUROSCI.0430-09.2009

      Classen, J., Liepert, J., Wise, S. P., Hallett, M., & Cohen, L. G. (1998). Rapid plasticity of human cortical movement representation induced by practice. Journal of Neurophysiology, 79(2), 1117–1123. https://doi.org/10.1152/JN.1998.79.2.1117/ASSET/IMAGES/LARGE/JNP.JA47F4.JPEG

      Franceschini, M. A., Radhakrishnan, H., Thakur, K., Wu, W., Ruvinskaya, S., Carp, S., & Boas, D. A. (2010). The effect of different anesthetics on neurovascular coupling. NeuroImage, 51(4), 1367–1377. https://doi.org/10.1016/j.neuroimage.2010.03.060

      Gellibert, F., Woolven, J., Fouchet, M. H., Mathews, N., Goodland, H., Lovegrove, V., Laroze, A., Nguyen, V. L., Sautet, S., Wang, R., Janson, C., Smith, W., Krysa, G., Boullay, V., De Gouville, A. C., Huet, S., & Hartley, D. (2004). Identification of 1,5-naphthyridine derivatives as a novel series of potent and selective TGF-β type I receptor inhibitors. Journal of Medicinal Chemistry, 47(18), 4494–4506. https://doi.org/10.1021/jm0400247

      Goff, W. R., Rosner, B. S., & Allison, T. (1962). Distribution of cerebral somatosensory evoked responses in normal man. Electroencephalography and Clinical Neurophysiology, 14(5), 697–713. https://doi.org/10.1016/0013-4694(62)90084-6

      Halder, P., Sterr, A., Brem, S., Bucher, K., Kollias, S., & Brandeis, D. (2005). Electrophysiological evidence for cortical plasticity with movement repetition. European Journal of Neuroscience, 21(8), 2271–2277. https://doi.org/10.1111/J.1460-9568.2005.04045.X

      Ivens, S., Kaufer, D., Flores, L. P., Bechmann, I., Zumsteg, D., Tomkins, O., Seiffert, E., Heinemann, U., & Friedman, A. (2007). TGF-β receptor-mediated albumin uptake into astrocytes is involved in neocortical epileptogenesis. Brain, 130(2), 535–547. https://doi.org/10.1093/brain/awl317

      Kaplan, L., Chow, B. W., & Gu, C. (2020). Neuronal regulation of the blood–brain barrier and neurovascular coupling. In Nature Reviews Neuroscience (Vol. 21, Issue 8, pp. 416–432). Nature Research. https://doi.org/10.1038/s41583-020-0322-2

      Keller, P., & Simons, K. (1998). Cholesterol is required for surface transport of influenza virus hemagglutinin. Journal of Cell Biology, 140(6), 1357–1367. https://doi.org/10.1083/jcb.140.6.1357

      Kim, S. Y., Senatorov, V. V., Morrissey, C. S., Lippmann, K., Vazquez, O., Milikovsky, D. Z., Gu, F., Parada, I., Prince, D. A., Becker, A. J., Heinemann, U., Friedman, A., & Kaufer, D. (2017). TGFβ signaling is associated with changes in inflammatory gene expression and perineuronal net degradation around inhibitory neurons following various neurological insults. Scientific Reports, 7(1), 7711. https://doi.org/10.1038/s41598-017-07394-3

      Knowland, D., Arac, A., Sekiguchi, K. J., Hsu, M., Lutz, S. E., Perrino, J., Steinberg, G. K., Barres, B. A., Nimmerjahn, A., & Agalliu, D. (2014). Stepwise Recruitment of Transcellular and Paracellular Pathways Underlies Blood-Brain Barrier Breakdown in Stroke. Neuron, 82(3), 603–617. https://doi.org/10.1016/j.neuron.2014.03.003

      Koudinov, A. R., & Koudinova, N. V. (2001). Essen/al role for cholesterol in synaptic plasticity and neuronal degeneration. The FASEB Journal, 15(10), 1858–1860. https://doi.org/10.1096/r.00-0815re

      Lapilover, E. G., Lippmann, K., Salar, S., Maslarova, A., Dreier, J. P., Heinemann, U., & Friedman, A. (2012). Periinfarct blood-brain barrier dysfunction facilitates induction of spreading depolarization associated with epileptiform discharges. Neurobiology of Disease, 48(3), 495–506. htttts://doi.org/10.1016/j.nbd.2012.06.024

      Lindauer, U., Villringer, A., & Dirnagl, U. (1993). Characterization of CBF response to somatosensory stimulation: Model and influence of anesthetics. American Journal of Physiology - Heart and Circulatory Physiology, 264(4 33-4), 223–1228. https://doi.org/10.1152/ajpheart.1993.264.4.h1223

      Lippmann, K., Kamintsky, L., Kim, S. Y., Lublinsky, S., Prager, O., Nichtweiss, J. F., Salar, S., Kaufer, D., Heinemann, U., & Friedman, A. (2017). Epileptiform activity and spreading depolarization in the bloodbrain barrier-disrupted peri-infarct hippocampus are associated with impaired GABAergic inhibition and synaptic plasticity. Journal of Cerebral Blood Flow and Metabolism, 37(5), 1803–1819. https://doi.org/10.1177/0271678X16652631

      Masamoto, K., & Kanno, I. (2012). Anesthesia and the quantitative evaluation of neurovascular coupling. In Journal of Cerebral Blood Flow and Metabolism (Vol. 32, Issue 7, pp. 1233–1247). SAGE PublicationsSage UK: London, England. https://doi.org/10.1038/jcbfm.2012.50

      McGregor, H. R., Cashaback, J. G. A., & Gribble, P. L. (2016). Functional Plasticity in Somatosensory Cortex Supports Motor Learning by Observing. Current Biology, 26(7), 921–927. https://doi.org/10.1016/j.cub.2016.01.064

      McMillin, M. A., Frampton, G. A., Seiwell, A. P., Patel, N. S., Jacobs, A. N., & DeMorrow, S. (2015). TGFβ1 exacerbates blood-brain barrier permeability in a mouse model of hepatic encephalopathy via upregulation of MMP9 and downregulation of claudin-5. Laboratory Investigation, 95(8), 903–913. https://doi.org/10.1038/labinvest.2015.70

      Mégevand, P., Troncoso, E., Quairiaux, C., Muller, D., Michel, C. M., & Kiss, J. Z. (2009). Long-term plasticity in mouse sensorimotor circuits after rhythmic whisker stimulation. Journal of Neuroscience, 29(16), 5326– 5335. https://doi.org/10.1523/JNEUROSCI.5965-08.2009

      Milikovsky, D. Z., Ofer, J., Senatorov, V. V., Friedman, A. R., Prager, O., Sheintuch, L., Elazari, N., Veksler, R., Zelig, D., Weissberg, I., Bar-Klein, G., Swissa, E., Hanael, E., Ben-Arie, G., Schefenbauer, O., Kamintsky, L., Saar-Ashkenazy, R., Shelef, I., Shamir, M. H., … Friedman, A. (2019). Paroxysmal slow cortical activity in Alzheimer’s disease and epilepsy is associated with blood-brain barrier dysfunction. Science Translational Medicine, 11(521), eaaw8954–eaaw8954. https://doi.org/10.1126/scitranslmed.aaw8954

      Ngai, A. C., Jolley, M. A., D’Ambrosio, R., Meno, J. R., & Winn, H. R. (1999). Frequency-dependent changes in cerebral blood flow and evoked potentials during somatosensory stimulation in the rat. Brain Research, 837(1–2), 221–228. https://doi.org/10.1016/S0006-8993(99)01649-2

      Obermeier, B., Daneman, R., & Ransohoff, R. M. (2013). Development, maintenance and disruption of the blood-brain barrier. In Nature Medicine (Vol. 19, Issue 12, pp. 1584–1596). Nature Publishing Group. https://doi.org/10.1038/nm.3407

      Schoknecht, K., Prager, O., Vazana, U., Kamintsky, L., Harhausen, D., Zille, M., Figge, L., Chassidim, Y., Schellenberger, E., Kovács, R., Heinemann, U., & Friedman, A. (2014). Monitoring stroke progression: In vivo imaging of cortical perfusion, blood-brain barrier permeability and cellular damage in the rat photothrombosis model. Journal of Cerebral Blood Flow and Metabolism, 34(11), 1791–1801. https://doi.org/10.1038/jcbfm.2014.147

      Schumacher, L., Slimani, R., Zizmare, L., Ehlers, J., Kleine Borgmann, F., Fitzgerald, J. C., Fallier-Becker, P., Beckmann, A., Grißmer, A., Meier, C., El-Ayoubi, A., Devraj, K., Mittelbronn, M., Trautwein, C., & Naumann, U. (2023). TGF-Beta Modulates the Integrity of the Blood Brain Barrier In Vitro, and Is Associated with Metabolic Alterations in Pericytes. Biomedicines, 11(1), 1–19. https://doi.org/10.3390/biomedicines11010214

      Shim, H. J., Jung, W. B., Schlegel, F., Lee, J., Kim, S., Lee, J., & Kim, S. G. (2018). Mouse fMRI under ketamine and xylazine anesthesia: Robust contralateral somatosensory cortex ac/va/on in response to forepaw stimulation. NeuroImage, 177, 30–44. https://doi.org/10.1016/J.NEUROIMAGE.2018.04.062

      Sudmant, P. H., Alexis, M. S., & Burge, C. B. (2015). Meta-analysis of RNA-seq expression data across species, tissues and studies. Genome Biology, 16(1), 287. https://doi.org/10.1186/s13059-015-0853-4

      Tao, L., & Nicholson, C. (1996). Diffusion of albumins in rat cortical slices and relevance to volume transmission. Neuroscience, 75(3), 839–847. https://doi.org/10.1016/0306-4522(96)00303-X

      Vazana, U., Veksler, R., Pell, G. S., Prager, O., Fassler, M., Chassidim, Y., Roth, Y., Shahar, H., Zangen, A., Raccah, R., Onesti, E., Ceccanti, M., Colonnese, C., Santoro, A., Salvati, M., D’Elia, A., Nucciarelli, V., Inghilleri, M., & Friedman, A. (2016). Glutamate-mediated blood–brain barrier opening: Implications for neuroprotection and drug delivery. Journal of Neuroscience, 36(29), 7727–7739. https://doi.org/10.1523/JNEUROSCI.0587-16.2016

      Veksler, R., Vazana, U., Serlin, Y., Prager, O., Ofer, J., Shemen, N., Fisher, A. M., Minaeva, O., Hua, N., SaarAshkenazy, R., Benou, I., Riklin-Raviv, T., Parker, E., Mumby, G., Kamintsky, L., Beyea, S., Bowen, C. V., Shelef, I., O’Keeffe, E., … Friedman, A. (2020). Slow blood-to-brain transport underlies enduring barrier dysfunction in American football players. Brain, 143(6), 1826–1842. https://doi.org/10.1093/brain/awaa140

      Zandieh, S., Hopf, R., Redl, H., & Schlag, M. G. (2003). The effect of ketamine/xylazine anesthesia on sensory and motor evoked potentials in the rat. Spinal Cord, 41(1), 16–22. https://doi.org/10.1038/sj.sc.3101400

      Zelig, D., Goldberg, I., Shor, O., Ben Dor, S., Yaniv-Rosenfeld, A., Milikovsky, D. Z., Ofer, J., Imtiaz, H., Friedman, A., & Benninger, F. (2022). Paroxysmal slow wave events predict epilepsy following a first seizure. Epilepsia, 63(1), 190–198. https://doi.org/10.1111/epi.17110

      Zhang, Y., & Pardridge, W. M. (2001). Mediated efflux of IgG molecules from brain to blood across the blood– brain barrier. Journal of Neuroimmunology, 114(1–2), 168–172. https://doi.org/10.1016/S01655728(01)00242-9

      Zhou, Y., Zhou, B., Pache, L., Chang, M., Khodabakhshi, A. H., Tanaseichuk, O., Benner, C., & Chanda, S. K. (2019). Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nature Communications, 10(1), 1–10. https://doi.org/10.1038/s41467-019-09234-6

    1. Reviewer #1 (Public Review):

      Summary:<br /> In this manuscript, Rohde et al. discuss how single cells isolated from the presomitic mesoderm of the zebrafish embryo follow a cell-autonomous differentiation "programme", which is dependent on the initial anteroposterior position in the embryo.

      Strengths:<br /> This work and in particular the comparison to cellular behaviour in vivo presents a detailed description of the oscillatory system that brings the developmental biology forward in their understanding of somitogenesis.<br /> The main novelty lies in the direct comparison of these isolated single cells to single cells tracked within the developing embryo. This allows them to show that isolated cells follow a similar path of differentiation without direct contact to neighbours or the presence of external morphogen gradients. Based on this, the authors propose an internal timer that starts ticking as cells traverse the presomitic mesoderm, while external signals modify this behaviour.

      Weaknesses:<br /> There are a few things that would clarify the current statement or might be added in a reasonable amount of time to further increase the relevance of this study:<br /> - My main point of concern is the precision of dissection. The authors distinguish cells isolated from the tailbud and different areas in the PSM. They suggest that the cell-autonomous timer is initiated, as cells exit the tailbud.<br /> This is also relevant for the comparison of single cells isolated from the embryo and cells within the embryo. The dissection will always be less precise and cells within the PSM4 region could contain tailbud cells (as also indicated in Figure 1A), while in the analysis of live imaging data cells can be selected more precisely based on their location. This could therefore contribute to the difference in noise between isolated single cells and cells in the embryo. This could also explain why there are "on average more peaks" in isolated cells (p. 6, l. 7).<br /> This aspect should be considered in the interpretation of the data and mentioned at least in the discussion.<br /> (It does not contradict their finding that more anterior cells oscillate less often and differentiate earlier than more posterior ones.)

      - Here, the authors focus on the question of how cells differentiate. The reverse question is not addressed at all. How do cells maintain their oscillatory state in the tailbud? One possibility is that cells need external signals to maintain that as indicated in Hubaud et al. 2014. In this regard, the definition of tailbud is also very vague. What is the role of neuromesodermal progenitors? The proposal that the timer is started when cells exit the tailbud is at this point a correlation and there is no functional proof, as long as we do not understand how cells maintain the tailbud state. These are points that should be considered in the discussion.

      - The authors observe that the number of oscillations in single cells ex vivo is more variable than in the embryo. This is presumably due to synchronization between neighbouring cells via Notch signalling in the embryo. Would it be possible to add low doses of Notch inhibitor to interfere with efficient synchronization, while at the same time keeping single cell oscillations high enough to be able to quantify them?

      In the same direction, it would be interesting to test if variation is decreased, when the number of isolated cells is increased, i.e. if cells are cultured in groups of 2,3 or 4 cells, for instance.

      - It seems that the initiation of Mesp2 expression is rather reproducible and less noisy (+/- 2 oscillation cycles), while the number of oscillations varies considerably (and the number of cells continuing to oscillate after Mesp2 expression is too low to account for that). How can the authors explain this apparent discrepancy?

      - The observation that some cells continue oscillating despite the upregulation of Mesp2 should be discussed further and potential mechanism described, such as incomplete differentiation.

      - Fig. 3 supplement 3 B missing

    1. Reviewer #2 (Public Review):

      Pheochromocytoma (PCC), a rare neuroendocrine tumor, is currently considered malignant, but non-surgical treatment options are very limited and there is an urgent need for more basic research to support the development of new therapeutic approaches. In the present work, the authors described the intra- and inter-tumor heterogeneity by performing scRNA-seq on tumor samples from five patients with PCC, and evaluated the corresponding PASS scores.

      Strengths: The tumor microenvironment of PCC was characterized and potential molecular classification criteria based on single-cell transcriptomics were proposed, offering new theoretical possibilities for the treatment of PCC. The article is logically written and the results are clearly presented.

      Weaknesses: I still have concerns about some of the article's content. My main concerns are: In this study, the authors seem to have demonstrated the inaccuracy of a subjective score (PASS) by another objective means (scRNA-seq). In fact, the multiparametric scoring systems such as PASS are no longer endorsed in the 2022 WHO guidelines. The PASS scoring system does not have a high positive predictive value for risk stratification of PCC metastasis, but "rule-out" of metastasis risk with a PASS score of <4 seems to be fairly reliable. Could the authors please explain why the PASS scores were chosen rather than the GAPP, m-GAPP, or COPPS scoring systems? If possible, please try to emphasize the importance and necessity of using the PASS scoring system, either by replacing it with a more acceptable scoring system or by deleting the relevant part, which does not seem to be very relevant to the subject of the article.

      Moreover, I noted the following statement in the text "There are no studies reporting the composition of immune cells in PCCs. The few published studies investigating the immune microenvironment of PCCs have been limited to the expression of PDL1 at the histological level and to assessment of the tumor mutation burden (TMB) at the genomic level, and these results only seem to suggest that PCCs are immune-cold (Bratslavsky et al, 2019; Guo et al, 2019; Pinato et al, 2017)." This statement is very wrong. The reason for this error may be that the authors did not adequately search and read the relevant literature. I noticed that almost all references in this paper are dated 2021 and earlier, which is surprising. Please update the references cited in this paper in a comprehensive and detailed manner; referring to literature published too early may lead to inadequate discussion or even one-sided or incorrect conclusions and conjectures.

      For example, the text statement "Combined with previously reported negative regulatory effects of kinases (such as RET, ALK, and MEK) on HLA-I expression on tumor cells (Brea et al., 2016; Oh et al., 2019), we speculate that the possible reason for inability in recruiting CD8+ T cells of kinase-type PCCs is the downregulation of HLA-I in tumor cells regulated by RET, while the mechanism of immune escape in metabolism-type PCCs (with antigen presentation ability) needs to be further explored. Our results also indicate that the application of immunotherapy to metabolism-type PCCs is likely unsuitable, while kinase-type PCCs may have the potential of combined therapy with kinase inhibitors and immunotherapy." is rather one-sided; in fact, the presence of immune escape in PCC, as the malignancy with the lowest tumor mutation compliance, has been well characterized, and the low number of infiltrating T cells in tumor tissue may be influenced by a variety of factors, such as the release of catecholamines, the expression of inhibitory receptors on the surface of T cells, and so on, although genetic mutation still plays the most crucial role. The Discussion section also has a lot of information that needs to be updated or corrected and expanded, so please rewrite the above section with sufficiently updated references.

      Below I have listed some references for the authors to read:

      Tufton N, Hearnden RJ, Berney DM, et al. The immune cell infiltrate in the tumour microenvironment of phaeochromocytomas and paragangliomas. Endocr Relat Cancer. 2022;29(11):589-598. Published 2022 Sep 19. doi:10.1530/ERC-22-0020<br /> Jin B, Han W, Guo J, et al. Initial characterization of immune microenvironment in pheochromocytoma and paraganglioma. Front Genet. 2022;13:1022131. Published 2022 Dec 7. doi:10.3389/fgene.2022.1022131<br /> Celada L, Cubiella T, San-Juan-Guardado J, et al. Pseudohypoxia in paraganglioma and pheochromocytoma is associated with an immunosuppressive phenotype. J Pathol. 2023;259(1):103-114. doi:10.1002/path.6026<br /> Calsina B, Piñeiro-Yáñez E, Martínez-Montes ÁM, et al. Genomic and immune landscape Of metastatic pheochromocytoma and paraganglioma. Nat Commun. 2023;14(1):1122. Published 2023 Feb 28. doi:10.1038/s41467-023-36769-6

    1. we first report the immunostimulatory activity of total genomic DNA from two plants, Brassica chinensis L. and Zea may, the CpG methylation status of which is incomplete compared with E. coli DNA. These plant DNA can activate B cells to proliferate. Plant DNA promotes secretion of IL-12, and increases expression of MHC and costimulatory molecules by bone marrow-derived dendritic cells (BMDC). Plant DNA can also enhance antigen presentation capacity of BMDC and macrophages.

      Here is where they begin the process with the two plants to conduct their experiment.

    1. Note: This response was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      1. General Statements

      We thank the reviewers for their excellent work that greatly improved our work. We are very content that reviewer #1 considered our work to be “novel, interesting and important for understanding the mitochondrial biology of PD”. This reviewer also valued our work as “a significant advancement” and suggested further study of the relationship of CISD1 (dimerization) to general mitophagy/autophagy. We addressed this in the already transferred revision (version 1, v1).

      Also reviewer #2 considered our work to be “an exciting and well-executed piece of research focusing on the defects in iron homeostasis observed in Parkinson's disease which a wide audience will appreciate”. This reviewer had a very specific suggestion on how to improve our manuscript which makes a lot of sense and is feasible. As the suggested experiments include fly breeding and behavioral analysis, these experiments will be included in the second revision to be uploaded as soon as possible (version 2, v2).

      Finally, reviewer #3 gathered that parts of our results “are confirmatory to recently published work” but also appreciated that our results established that iron-depleted apo-Cisd is an important determinant of toxicity which has not been shown before. I would like to comment here, that in contrast to the paper mentioned by this reviewer, our contribution includes data from dopaminergic neurons obtained from human patients suffering from familial Parkinson’s disease that demonstrate the same increase in apo-Cisd levels as the flies. This reviewer mainly suggested that the manuscript would be improved by a more balanced discussion of the strengths and weaknesses of the study and more circumspection in interpretation of data which we did in the revised version of our manuscript. We also added data on the expression levels of Cisd and apo-Cisd in transgenic flies as also suggested.

      2. Description of the planned revisions

      Reviewer #1 (Evidence, reproducibility and clarity):

      Summary: The manuscript focuses on mitochondrial CISD1 and its relationship to two Parkinson's disease (PD) proteins PINK1 and Parkin. Interestingly, CISD1 is a mitochondrial iron sulfur binding protein and an target of Parkin-mediated ubiquitinylation. Disruption of iron metabolism and accumulation of iron in the brain has long since been reported in PD but the involvement of iron sulfur binding is little studied both in vivo and in human stem cell models of PD. This work addresses the relationship between CISD1 and two mitochondrial models of PD (PINK1 and Parkin) making use of in vivo models (Drosophila), PINK1 patient models (iPSC derived neurons) and Mouse fibroblasts. The authors report a complex relationship between CISD1, PINK1 and Parkin, where iron-depleted CISD1 may illicit a toxic gain of function downstream of PINK1 and Parkin.

      Major comments:

      The conclusions are overall modest and supported by the data. One question remains unaddressed. Is mitochondrial CISD1 a downstream target that specifically mediates PINK1 and Parkin loss of function phenotypes or are the phenotypes being mediated because CISD1 is downstream of mitophagy in general?

      It would be interesting to know what happens to CISD1 (dimerization?) upon initiation of mitophagy in wild type cells? Would dissipation of mitochondrial membrane potential be sufficient to induce changes to CISD1 in wild type cells or PINK1 deficient cells? Since iron chelation is a potent inducer of mitophagy (Loss of iron triggers PINK1/Parkin-independent mitophagy. George F G Allen, Rachel Toth, John James, Ian G Ganley. EMBO Reports (2013)14:1127-1135) it would be useful to show one experiment addressing the role of CISD1 dimerization under mitochondrial depolarizing and non-depolarizing conditions in cells.

      Based on the overall assumption of the reviewer that our work is “novel, interesting and important for understanding the mitochondrial biology of PD” and “a significant advancement” we understand the word “modest” here as meaning “not exaggerated”. To address this question, we studied CISD1 dimerization in response to more classical activators of mitophagy namely FCCP and antimycin/oligomycin which had no significant effect on dimerization suggesting that this phenotype is more pronounced under iron depletion. These data are shown in the new Fig. 2c.

      Alternatively, the authors should discuss the topic of mitophagy (including PINK1-parkin independent mitophagy), the limitation of the present study not being able to rule out a general mitophagy effect and previous work on the role of iron depletion on mitophagy induction in the manuscript.

      The data and the methods are presented in such a way that they can be reproduced.

      The experiments are adequately replicated and statistical analysis is adequate.

      Minor comments:

      Show p values even when not significant (ns) since even some of the significant findings are borderline < p0.05.

      Here, I decided to leave it as it is, because the figures became very cluttered and less easy to understand. Borderline findings are however indicated and mentioned in the text.

      Because the situation for CISD1 is complicated (overexpression, different models etc.) it would be helpful if in the abstract the authors could summarize the role. E.g. as in the discussion that iron-depleted CISD1 could represent a toxic function.

      The abstract has been completely rewritten and now mentions the potential toxic function of iron-depleted CISD1.

      If there is sufficient iron (accumulation in PD) why would CISD1 be deactivated? Perhaps that could be postulated or discussed in a simplified way?

      We actually think that apo-CISD1 without its iron/sulfur cluster is incapable of transferring its Fe/S cluster to IRP1 and IRP2. This then results in increased levels of apo-IRP1/2 and subsequent changes that lead to iron overload. Such a sequence of events would place CISD1 upstream of the changes in iron homeostasis observed in PD and models of PD. This is now discussed in more detail.

      In the methods section both reducing and non-reducing gel/Western blotting is mentioned but the manuscript only describes data from blots under reducing conditions. Are there blots under non-reducing conditions that could be shown to see how CISD1 and dimerized CISD1 resolve?

      We now show these blots as supplemental data in new supplemental Figure 2.

      In the results section, PINK1 mutant flies, it is said that the alterations to CISD1 (dimerization) are analogous to the PINK1 mutation patient neurons. The effect is seen in old but not young flies. Since iPSC-derived neurons are relatively young in the dish, would one not expect that young flies and iPSC-derived neurons have similar CISD1 phenotypes? Could the authors modify the text to reflect that? or discuss the finding in further context.

      We only studied one time point in PINK1 mutation patient neurons and controls. It would indeed be interesting whether neuronal aging (as far as this can be studied in the dish) would result in increased CISD1 dimerization. This is now discussed.

      Reviewer #1 (Significance):

      The strengths of this work are in the novelty of the topic and the use of several well established in vivo and cell models including patient-derived neurons. The findings discussed in the text are honest and avoid over-interpretation. The findings are novel, interesting and important for understanding the mitochondrial biology of PD.

      We thank the reviewer for their kind words.

      Limitations include the lack of strong phenotypes in the CISD1 models and the lack of robust, sustained and consistent increase in CISD1 dimers in the patient and fly models (just significant because of variability). The relationship of CISD1 (dimerization) to general mitophagy/autophagy is not shown here.

      We do not completely agree with the assumption that all CISD1 models lack a strong phenotype. At least the CISD1-deficient fibroblasts exhibit a strong phenotype consisting of fragmented mitochondria and increased oxidative stress. The lack of a strong phenotype in Cisd-deficient flies could actually hint to a potential compensatory mechanism that could also protect the Pink1 mutant x Cisd-deficient double-knockout flies. It is correct that the increase in CISD1/Cisd dimers in the PD models are not overwhelming but – as also mentioned by the reviewer – this could be increased in “older” cultures. This is now discussed in more detail. As suggested by the reviewer, we have now added experiments that study the relationship between CISD1 dimerization and conventional mitophagy as described above.

      There is a significant advancement. So far researchers were able to describe the importance of iron metabolism in PD (For example refer to work from the group of Georg Auburger such as PMID 33023155 and discussion of therapeutic intervention such as reviewed by Ma et al. PMID: 33799121) but few papers describe involvement of iron sulfur cluster proteins specifically (such as Aconitase) in relation to PINK1 and parkin (these are cited). The fact that CISD1 is a protein of the mitochondrial outer membrane makes it particularly interesting and further studies looking more closely at the interaction of CISD1 with mitochondrial proteins associated with PD will be of interest.

      We thank the reviewer for pointing out these excellent publications. Key et al present an enormous wealth of data on protein dysregulation of wildtype and Pink1-/- fibroblast cell lines upon perturbation of the iron homeostasis (Key et al, 2020). Both cell lines exhibit a downregulation of CISD1 levels upon iron deprivation with the agent 2,2′ -Bipyridine possibly as a compensatory mechanism to limit the toxic gain of function of iron-depleted CISD1. The other paper, Ma et al. is a recent review on changes in iron homeostasis in PD and PD models (Ma et al, 2021). Both papers are now cited in the manuscript.

      This paper describes CISD1 as a new and relevant player in PINK1 and Parkin biology. Further work could lead to exploration of whether CISD1 could be a therapeutic target, considering its role in maintaining mitochondrial redox and mitochondrial health. This is of particular interest to mitochondrial biologists and pre-clinical research in PD.

      This preprint was reviewed by three scientists whose research focus in the mitochondrial biology underlying Parkinson's disease. The group has a special interest in the functions of the mitochondrial outer membrane. We work with several cell models of Parkinson's disease and work with patient donated samples. We do not have expertise in Drosophila models of PD nor the quantification of iron described in the manuscript.

      Reviewer #2 (Evidence, reproducibility and clarity):

      Summary: In the paper entitled 'Mitochondrial CISD1 is a downstream target that mediates PINK1 and Parkin loss-of-function phenotypes', Bitar and co-workers investigate the interaction between CISD1 and the PINK1/Parkin pathway. Mutations in PINK1 and PARKIN cause early onset Parkinson's disease and CISD1 is a homodimeric mitochondrial iron-sulphur binding protein. They observed an increase in CISD1 dimer formation in dopaminergic neurons derived from Parkinson's disease patients carrying a PINK1 mutation. Immuno-blots of cells expressing CISD1 mutants that affects the iron sulphur cluster binding and as well as cells treated with iron chelators, showed that the tendency of CISD1 to form dimers is dependent on its binding to iron-sulphur clusters. Moreover, the Iron-depleted apo-CISD1 does not rescue mitochondrial phenotypes observed in CISD1 KO mouse cells. Finally, In vivo studies showed that overexpression of Cisd and mutant apo-Cisd in Drosophila shortened fly life span and, using a different overexpression model, apo-Cisd caused a delay in eclosion. Similar as patient derived neurons, they observed an increase in Cisd dimer levels in Pink1 mutant flies. Additionally, the authors showed that double mutants of Cisd and Pink1 alleviated all Pink1 mutant phenotypes, while double mutants of Prkn and Cisd rescued most Prkn mutant phenotypes.

      Major comments:

      1) The authors observed an increase in the levels of Cisd dimers in Pink1 mutant flies and removing Cisd in Pink1 mutant background rescues all the mutant phenotypes observed in Pink1 mutant flies, suggesting that the Cisd dimers are part and partial of the Pink1 mutant phenotype. The authors also generated a UAS_C111S_Cisd fly which can overexpress apo-Cisd. Overexpression of the C111S_Cisd construct with Tub-Gal4 showed a developmental delay. Since apo-Cisd forms more dimeric Cisd, my question is: does the strong overexpression (e.g. with Tub-Gal4) of the C111S_Cisd in wild type flies shows any of the Pink1 mutant phenotypes? If not, the authors should mention this and elaborate on it.

      We thank the reviewer for their comments. In fact, we only observed very few flies ecclosing after overexpression of wildtype Cisd or C111S Cisd using the strong tubP-Gal4 driver during development. We considered these very few flies to be escapees (also indicated by the rather low induction of Cisd mRNA suggesting compensatory downregulation) and only used them to conduct the analysis shown in Figure 4c-e. This is now mentioned in more detail in the manuscript.

      2) Figure 6g: Shows the TEM pictures of the indirect flight muscles of Pink1 mutant flies and Pink1, Cisd double mutants. To me, the Picture of Pink1 mutant mitochondria is not very convincing. We expect swollen (enlarged) mitochondria with disrupted mitochondrial matrix. However, this is not clear in the picture. Moreover, in my opinion, Figure6 g, is missing an EM Picture of the Cisd mutant indirect flight muscles.

      We now show exemplary pictures from Pink1 mutant and DKO in a higher magnification which better demonstrate the rounded Pink1 mutant mitochondria and the disrupted cristae structure. EM pictures of all four genotypes in different magnifications are now shown in new supplemental Figure 6.

      3) OPTIONAL: The authors suggest that most probably apo-Cisd, assumes a toxic function in Pink1 mutant flies and serves as a critical mediator of Pink1-linked phenotypes. If this statement is correct, we can hypothesize that increasing apo-Cisd in Pink1 mutant background should worsen the pink1 mutant defects.

      Therefore, I suggest overexpressing Cisd1 wild type (and/or C111S Cisd) in pink1 mutant flies, as pink1 is on the X chromosome, and mild overexpression of Cisd1 with da is not lethal, these experiments could be done in 3-4 fly crosses and hence within 1.5 - 2 months.

      We have set up this experiment and will report in the second revision (v2) of our manuscript.

      Since Pink1 mutant flies contain higher levels of endogenous Cisd dimers, we can expect that overexpression of wild type Cisd will result in an even stronger increase of dimers. If these dimers indeed contribute to Pink1 mutant phenotypes we can expect that overexpression of Cisd will result in a worsening of the Pink1 mutant phenotypes.

      We have set up this experiment and will report in the second revision of our manuscript.

      Minor Comments:

      -) In the Introduction (Background) there are some parts without references:

      E.g., there is not a single reference in the following part between

      'However, in unfit mitochondria with a reduced mitochondrial membrane potential ...&... compromised mitochondria safeguards overall mitochondrial health and function.'

      We thank the reviewer for pointing out this flaw. We have now added a suitable reference to the introduction.

      -) In the introduction there is some confusion about the nomenclature used in the article: e.g. following comments are made in the text: Cisd2 (in this publication referred to as Dosmit) or fly Cisd2 (in this publication named MitoNEET).

      However, the names Dosmit and MitoNEET do not appear in the manuscript (except in references)

      The literature and nomenclature for CISD1 are indeed confusing. We have now revised the introduction.

      -) Figure 1: I am not sure why some gels are shown in this figure. The two last lanes of figure 1c are redundant and Figure 1c' which is also not mentioned in the text, is also a repetition of figure 1c.

      The blots in 1c and 1c’ represent all data points (different patients and different individual differentiations) shown in the quantification in 1d. This is now explained better in the revised manuscript.

      -) The authors mention in material and methods that T2A sites are used at the C-terminus of CISD1 to avoid tagging of CISD1. However, this is not entirely true as T2A will leave some amino acids (around 20) after the self-cleaving and therefore CISD1 will be tagged.

      This is indeed true and we have now changed the wording in the revised manuscript.

      -) In figure 5 P1 is used to abbreviate Pink1 mutants, however P1, to me, refers to pink1 wild type. It would be clearer to abbreviate Pink1 mutants as P1B9 in the graphs as B9 is the name of the mutant pink1 allele.

      We thank the reviewer for pointing out this flaw. We have now altered Fig. 5 to be clearer.

      -) In figure 7: Parkin is abbreviated both as Prkn and as Park

      We thank the reviewer for pointing out this flaw, we indeed mixed up both names because it is complicated. The gene symbol is Prkn, the fly line is called Park25. We have now clarified this in the text and Fig. 7.

      -) I suggest changing the title. Recently an article (Ham et al, 2023 PMID: 37626046) was published showing similar genetic interactions between Pink1/Prkn and Cisd. However, the article of Ham et al, 2023 was focused on Pink1/Prkn regulation of ER calcium release, while this article is more related to iron homeostasis. I suggest that the title shows this distinction.

      This is indeed a very good suggestion. We have now altered the title to “Iron/sulfur cluster loss of mitochondrial CISD1 mediates PINK1 loss-of-function phenotypes”.

      Reviewer #2 (Significance):

      In general, this is an exciting and well-executed piece of research focusing on the defects in iron homeostasis observed in Parkinson's disease which a wide audience will appreciate. Very recently, a similar genetic interaction between Cisd and Pink1/Prkn in flies was published (Ham et al, 2023 PMID: 37626046) however, from a different angle. While, Ham et al focused on the role of Pink1/Prkn and Cisd in IP3R related ER calcium release, this manuscript approaches the Pink1/Prkn - Cisd interaction from an iron homeostasis point of view. Since, iron dysregulation contributes to the pathogenesis of Parkinson's disease, the observations in this manuscript are relevant for the disease. Hence, the work is sufficiently novel and deserves publication. However, additional experiments are suggested to strengthen the authors' conclusions.

      We thank the reviewer for their kind words. As mentioned above, these additional experiments are on their way and will be included in version 2 of our revised manuscript (v2).

      I work on Drosophila models of Parkinson's disease

      Referees cross-commenting

      I agree with the reviewer number 1 that it would be interesting to investigate CISD1 dimerisation status during mitophagy.

      As mentioned above, we now studied CISD1 dimerization in response to more classical activators of mitophagy namely FCCP and antimycin/oligomycin which had no significant effect on dimerization suggesting that this phenotype is more pronounced under iron depletion. These data are shown in the new Fig. 2c.

      Reviewer #3 (Evidence, reproducibility and clarity):

      Here the authors provide evidence that Cisd is downstream of Parkin/Pink1 and suggest that the levels of apo-Cisd correlate with neurotoxicity. The data presented generally supports the conclusions of the authors and will be useful to those in the field. The manuscript would be improved by a more balanced discussion of the strengths and weaknesses of the study and more circumspection in interpretation of data.

      We thank the reviewer for their comments aimed to improve our manuscript. We have now discussed the strengths and weaknesses of our study in more detail.

      Introduction. While iron has been implicated in Parkinson's disease, it is an overstatement to say that disruption in iron metabolism contributes significantly to the pathogenesis of the disease.

      There is certainly a plethora of data implicating perturbed iron homeostasis in PD as also pointed out by reviewer #1. We have tried to tone down our wording in the text and added a recent review on the topic (Ma et al, 2021) as also suggested by reviewer #1.

      Introduction. The discussion of the various names for Cisd2 is important, but confusing as written. Specifically, the use of "this" makes the wording unclear.

      We thank the reviewer for pointing out this flaw. We have altered the wording in the introduction.

      Methods. It would be preferable to use heterozygous driver lines or a more similar genetic control rather than w-1118.

      The exact controls were indeed not well explained in the Methods section, this has been corrected in the revised version. In brief, homozygous driver and UAS lines were indeed used in Fig. 4, this will be addressed in the second revision of our manuscript together with the experiments reviewer #2 suggested. The data shown in Fig. 5, 6, and 7 all used w1118 as control because all other fly strains are on the same genetic background.

      Page 10. It appears that the PINK1 lines have been described previously. The authors should clarify this point and ensure that the new data presented in the current manuscript (presumably the mRNA levels, Fig. 1a) is indicated, as well as data that is confirmatory of prior findings (Fig. 1b).

      Yes, these PINK1 lines have been described previously as pointed out in the manuscript. The original paper did not quantify the PINK1 mRNA levels shown in Fig. 1a. The blots shown in Fig. 1b are from new differentiations and have also not been shown before but confirm findings published in Jarazo et al. (Jarazo et al, 2022). This has been clarified in the revised version of our manuscript.

      Fig. 3 legend. There is a typographical error, "ne-way ANOVA."

      We thank the reviewer for pointing out this flaw. This has been corrected in the revised version.

      Page 15. The nature of the Pink1-B9 mutant should be specified.

      We now added a supplemental Figure 1 that depicts the specific mutation in these flies.

      Fig. 4. Levels of mutant and wild type Cisd should be compared in transgenic flies.

      We now added a quantification of mutant and wildtype Cisd levels to the new Figure 4d.

      Fig. 5b,d. The striking change seems to be the decrease in dimers in young Pink1 mutant animals, not the small increase in dimers in the older Pink1 mutants.

      It is always difficult to find a “typical” picture that reflects all changes observed in quantitative data. This Figure actually shows a decrease of total Cisd levels in young flies in Fig. 5c but no difference of the dimer/monomer ratio in Fig. 5d.

      Fig. 5f. Caution should be used in interpreting the results. Deferiprone has toxicity to wildtype flies (trend) and may simply be making sick Pink1 mutants sicker.

      There is certainly a tendency for wildtype flies to thrive less in food containing deferiprone. To make this more obvious, we have now added the exact p value (0.0764, which we don’t consider borderline but a tendency) to this figure and mention this fact in the text.

      Fig. 5e. The data are hard to interpret. The number of animals is very small for a viability study and the strains are apparently in different genetic backgrounds, though this is not clearly specified. The experiment in Supplementary Fig. 1 appears better controlled and supports the Pink1 data; however, a similar concern pertains to Fig. 7. The authors may thus wish to be more circumspect in their interpretation, especially of the Parkin data.

      In Fig 5e we quantified total iron levels and the Fe3+/Fe2+ ratio using capillary electrophoresis-inductively coupled plasma mass spectrometry (CE-ICP-MS). Although indeed not so many flies were used in this quantification, the results are highly significant. If the reviewer was referring to Fig. 5f, we agree that this experiment was not well (to be honest, even wrongly explained) which we corrected in the revised version of this manuscript. We thank the reviewer for pointing out this flaw.

      Reviewer #3 (Significance):

      The major significance of the study is in putting downstream of Parkin/Pink1 (largely confirmatory to recently published work) and suggesting that the levels of apo-Cisd are an important determinant of toxicity. The work will be of interest to those in the field.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      The changes already carried out and included in the transferred manuscript (v1) are indicated above in bold orange. All changes pending on ongoing experiments to be included in the second revision of the manuscript are indicated above in bold magenta.

      4. Description of analyses that authors prefer not to carry out

      All changes suggested by the reviewers were addressed (v1) or will be addressed (v2).

      References

      Jarazo J, Barmpa K, Modamio J, Saraiva C, Sabaté-Soler S, Rosety I, Griesbeck A, Skwirblies F, Zaffaroni G, Smits LM, et al (2022) Parkinson’s Disease Phenotypes in Patient Neuronal Cultures and Brain Organoids Improved by 2-Hydroxypropyl-β-Cyclodextrin Treatment. Mov Disord 37: 80–94

      Key J, Sen NE, Arsović A, Krämer S, Hülse R, Khan NN, Meierhofer D, Gispert S, Koepf G & Auburger G (2020) Systematic Surveys of Iron Homeostasis Mechanisms Reveal Ferritin Superfamily and Nucleotide Surveillance Regulation to be Modified by PINK1 Absence. Cells 9

      Ma L, Gholam Azad M, Dharmasivam M, Richardson V, Quinn RJ, Feng Y, Pountney DL, Tonissen KF, Mellick GD, Yanatori I, et al (2021) Parkinson’s disease: Alterations in iron and redox biology as a key to unlock therapeutic strategies. Redox Biol 41: 101896

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      __We thank all the reviewers for their time and their constructive criticism, based on which we will revise our manuscript. All our responses are indicated in red. __

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      __Reviewer #1 (Evidence, reproducibility and clarity (Required)): __

      The manuscript by Nguyen and Cheng is investigating the timing and mechanism of cessation of neuroblasts in the pupal optic lobe. Previous studies by several groups have determined the spatial and temporal factors required for the neuroepithelial to neuroblast transition and neuroblast to neural/glycogenesis in third instar larvae such that neuroblasts are eliminated. The mechanism of elimination of neuroblasts in the VNC or mushroom bodies have been investigated, but the mechanism(s) and the timing of elimination of medulla neuroblasts has not been investigated. The authors suggest that medulla neuroblasts are eliminated via a combination of mechanisms including apoptosis, prospero induced size symmetric terminal differentiation and a switch to gliogenesis by gcm expression. Expression of Tailless also was found to affect the timing of medulla neuroblast termination. They also ruled out several mechanisms such as ecdysone pulses.

      Major comments

      Clearly written and logical flow to experiments and results not over interpreted.

      Clearly show that the neuroblast number and size decrease (12 to 18 hrs) and are eliminated by 30 hours

      Figure 2a Marking of the Neuroepithelium. Would be more convincing if shown by PatJ expression and is clonal analysis. While the following panels use PatJ in clones suggesting are NE and NBs present it is more difficult to put into the context in the higher magnification images (Figure 2 D- M) and the Miranda expression in F' seems to be the entire lobe and it is not clear if would be any NE which does not agree with what is shown in panel A.

      We will perform clonal analysis using MARCM to show that the elimination of medulla NBs (marked by Dpn) is accompanied by the depletion of NE (marked by PatJ). For Figure 2 D, E, I, L, we will change the images to the whole lobes to clearly show the shift in the NE-NB transition upon Notch OE/KD.

      Is difficult to see the neuroblasts in Figure 2 D D" and E. The figure does not match what is stated in the results in that the neuroblasts are difficult to observe. If the point is that there is fewer NE cells and more neuroblasts then this is hard to see. It has been previously shown that with Notch RNAi clones prematurely extrude form the NE (Egger 20210; Keegan 2023) and could be expressing more Neuroblast markers but this is not visible in the panels as shown. Are the images single focal plane or maximum projections? Imaging more deeply in the brain or viewing in cross section would account for these possibilities. The possibility that are more neuroblasts but not all at the surface of the OL should be addressed as this could also alter the overall results.

      Figure 2 is key to first point of the paper so needs to be addressed.

      The images are single focal plane of the superficial layer of the medulla. We will specify this information in the figure legends. We will include cross-section of the notch RNAi clones to show the delamination of precocious NBs.

      Minor comments

      Why express volume of DPN in clone volume. Would make the point more clear and more strong be to express as number of NB in the 3-D volume of the clone. This measurement occurs in several figures.

      We will redo the quantification as suggested.

      Use of Miranda to mark NBs is unclear in Figure 2. Perhaps more clear in B&W.

      We will redo the staining with Dpn instead of Mira to mark medulla NBs. Figures will be presented in B&W as suggested.

      Make clear in figures (or figure legend) if single focal plane or projections.

      We will do so.

      It is unclear what percentage of NB the Gal4 line eyR16F10 are expressed in. Veen 2023 state that the GAL4 is also expressed in neurons and at different levels whether deeper within the brain or superficially on the surface of the brain. At 16 APF it is expressed but it is not clear whether it is in all cells at a low level or only within a few cells

      We will further characterize the expression of eyR16F10-GAL4 in the pupal medulla as suggested.

      Some RNAi lines referenced as previously validated and other are not. For example: EcR, Oxphos, Med27, Notch need references or confirmation of specificity to the intended target (qRT)

      We will perform RT-qPCR to validate the use of UAS-med27 RNAi. For RNAi stocks such as UAS-EcR RNAi, UAS-Atg1 RNAi, UAS-notch RNAi that have been previously used in other publications, we will provide appropriate references.

      At least 2 animals per genotype were used. While I appreciate the technical difficulty of working in pupae this seems a bit low in terms of number of samples and data would be more robust with more numbers.

      Any experiments in which less than 3 animals were used, we will redo the experiments.

      Reviewer #1 (Significance (Required)):

      This provides mechanism and timing for the elimination of neuroblasts (NE to NB) that arise from the medulla. As these are most similar to mammalian brain development (Radial glial to NSC) this information provides more context to interpret the formation of glial and neurons in the adult optic lobe given the effect on timing and mechanisms of elimination.

      This paper would be of interest to developmental biologist who work with Drosophila or mice who are looking at neural development. An understanding of how neural diversity is achieved and the mechanisms behind this that can be dysfunctional in terms of etiology of neural diseases. Is a well done study for the most part that would be improved by clarifying some data and provided more replicates for robustness of the data.

      I am a developmental biologist working with Drosophila in larval and adult neural development.

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)): __

      Lineages of neural stem cells are of great interest to understand how many neural types are generated. They produce very diverse neurons, often in a highly stereotyped series. However, they must terminate their life when the animal becomes functional or if neurons need time to become mature before birth.

      In the Drosophila optic lobes, neural stem cells are produced over a period of several days by a wave of neurogenesis that transforms a neuroepithelium into neural stem cells that undergo a series of temporal patterning steps. It has been reported that they finish their life when a symmetric division generates glial cells. The authors however analyze the end of a particular lineage, that of the latest born neural stem cells of the medulla.

      The paper shows that neural stem cells stop being produced when the neuroepithelium is consumed. But how do the latest born neural stem cells stop their lineage?

      The results show that they do so by several means, which is quite unexpected: they may die from apoptosis, or autophagy, by becoming glioblasts or by a terminal symmetric division.

      There are no major issues affecting the conclusions

      • The paper shows that the end of production of neural stem cells occurs the neuroepithelium is completely transformed. The experiments performed by the authors are fine and show that, if the transition is delayed, neural stem cells terminate their life later, and vice versa. However, the lifespan of the neural stem cells is not affected by the timing of the transition. Therefore, these experiments do not tell us how neural stem cells terminate their life, which is the central question of the study. The discussion should be written accordingly and the title and the model in Fig 6 modified to reflect the importance of the end of life of the stem cells, the main theme of the paper.

      We agree that our said experiments did not elucidate how NBs terminate at the end of neurogenesis. Nevertheless, our aim is to show that the timing of NB termination in the medulla is dependent on the timing of the NE-NB transition.

      In Supplementary Figure 1, we showed that factors previously shown to be involved in NB termination in other lineages did not play similar roles in the medulla NBs. Thus, we think that NB termination in the medulla is likely regulated at the levels of the NE, but not the NBs themselves. Although we have briefly mentioned this in our manuscript, we hope by conducting the experiments suggested by the reviewer (see below), we can subsequently modify our model in Figure 6 and our discussion.

      • The authors talk about Pros-dependent symmetric division and gliogenic switch as two separate processes, but these may be two sides of the same phenomenon. Tll+ gcm+ neural stem cells undergo Pros-dependent cell cycle exit, generating glial progeny. If the authors agree with this, could they update their model (and discussion) to reflect the fact that gliogenic switch occurs via a Pros-dependent symmetric division, and these are not two separate processes independently contributing to the depletion of the neural stem cell pool? Ideally, a triple staining between Dpn, Pros, and gcm would show that the symmetrically dividing cells seen by the authors are committed to the glial fate.

      We will further test how gliogenesis is affected in pros RNAi clones. The results may shed light on whether Pros-mediated symmetric division is required for Gcm-mediated gliogenesis in the medulla. Regarding the model, we have summarized our findings and suggestions in Figure 5K, however, we will integrate this information into our final model.

      In Figure 5C, we showed that at 12h APF, there are Dpn+ NBs in the medulla that expressed both Pros and Gcm, suggesting that it is very likely that Pros is upstream of Gcm to induce the glial cell fate switch of the medulla NBs.

      • Why were Notch RNAi experiments assessed for the presence of neural stem cells at P12 and gcm RNAi experiments at P24? Given that most optic lobe neural stem cells disappear between P12-18, a subtle effect of gcm RNAi may have been missed. Do the authors have data for gcm RNAi at P12?

      We hypothesized that the timing of NE-NB transition affects the timing of NB termination in the medulla. Because Notch KD was previously shown to induce precocious NE-NB transition in the OL, meaning that medulla NBs are born prematurely, we expected that this manipulation will lead to a corresponding premature elimination of the NBs. In contrast, gcm RNAi which inhibits the switch into the glial cell fate of the NBs, is expected to prolong the neurogenic phase of the NBs, and thereby, their persistence by 24h APF when WT NBs are eliminated.

      • The authors should acknowledge that the inhibition of either apoptosis or autophagy alone may not be fully sufficient to prevent the death of NBs. In mushroom body neural stem cells, both processes must be inhibited simultaneously to produce a strong effect on their survival (Pahl et al. 2019, PMID 30773368).

      We will add this information in our discussions.

      • There is an important missing point that should be addressed: is there a specific point in time when all neural stem cells must stop their lineage wherever they are in the temporal series and either die or divide symmetrically? One possibility that is not discussed is that most neural stem cells end their life through a gliogenic symmetric division while those that were generated late must stop en route and die by apoptosis and/or autophagy. This would solve the strange diversity of end-of-life, which could be easily addressed by identifying the temporal stage of the neural stem cells that undergo apoptosis

      We agree that it would be of interest to understand how there are diverse mechanisms by which medulla NBs terminate during pupal development. To address if temporal progression is involved in apoptosis of the medulla NBs, we will first characterize the expression of some temporal TFs (e.g., Ey, Slp, Tll) at 12h APF when we found a subset of medulla NBs undergo apoptosis in the wildtype animals.

      Minor suggestions:

      We agree with these minor modifications.

      • Line 46: Specify that there are 8 type II neural stem cells in each hemisphere*.

      • The statement in lines 181-182 that "cell death, and not autophagy, makes a minor contribution to..." should be replaced with "apoptosis, and not autophagy," as autophagy is also a type of cell death.

      • The authors should adjust the logic of the section "Medulla neuroblasts terminate during early pupal development": Describe the wild-type pattern first (the decrease in the number of neural stem cells and their size with age) and then describe the perturbations aimed at disrupting the number and the size of neural stem cells

      • Line 151 should refer to Fig. 2I-K, not Fig. 2J-K.

      **Referees cross-commenting**

      How can NBs die by different mechanisms?? This might only happen is they are in a different states, an issue that is not addressed.

      it has been shown that optic lobe NBs end their life by a symmetric, gliogenic last division at the end of the last temporal window, and not by PCD.

      It is likely, and the authors do hint at it, that NBs only die by PCD when they prematurely interrupt the temporal series in early pupation when neurons synchronously start undergoing maturation.

      I believe that the authors should explain this, if this is indeed their model, and show that NBs die while still in early temporal windows.

      Reviewer #2 (Significance (Required)):

      Lineages of neural stem cells are of great interest to understand how many neural types are generated. They produce very diverse neurons, often in a highly stereotyped series. However, they must terminate their life when the animal becomes functional or if neurons need time to become mature before birth.

      In the Drosophila optic lobes, neural stem cells are produced over a period of several days by a wave of neurogenesis that transforms a neuroepithelium into neural stem cells that undergo a series of temporal patterning steps. It has been reported that they finish their life when a symmetric division generates glial cells. The authors however analyze the end of a particular lineage, that of the latest born neural stem cells of the medulla.

      The paper shows that neural stem cells stop being produced when the neuroepithelium is consumed. But how do the latest born neural stem cells stop their lineage?

      The results show that they do so by several means, which is quite unexpected: they may die from apoptosis, or autophagy, by becoming glioblasts or by a terminal symmetric division.

      There are no major issues affecting the conclusions

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary

      In this manuscript, the authors address the timing and mechanisms responsible for the termination of medulla neuroblasts in Drosophila visual processing centres, also known as optic lobes. Through time course experiments the authors demonstrate the medulla NBs are completely eliminated by 30h APF during early pupal development. By manipulating the Notch signalling pathway as well as proneural genes such as lethal of scute, the authors show that altering the NE-NB transition is sufficient to change the timing of NB termination. In contrast, ecdysone signalling and components of the mediator complex, known to terminate proliferation of central brain NBs, are not required for the termination of medulla NBs. Medulla NBs sequentially express a variety of temporal transcription factors to promote cellular diversity, however, the authors demonstrate that altering temporal factors such as Ey, Sco or Hth, does not affect the timing of the medulla NBs termination. Interestingly however overexpression of the transcription factor tailless can cease medulla NB termination via the conversion of type I to type II NB fate. They further go on to show the importance of the differentiation factor, Prospero, in promoting the differentiation of medulla NBs as well as terminating medulla neurogenesis during pupal development. Finally, in addition to differentiation, the authors show another mechanism responsible for the cessation of neurogenesis which is the commencement of gliogenesis. Through manipulation of the neurogenic to gliogenic switch by knockdown or overexpressing the glial regulatory gene, gcm, the authors show that even though the downregulation of gcm is is not sufficient to induce NB persistence, gcm overexpression can cause premature termination of NBs.

      Major comments:

      • Are the key conclusions convincing?

      Yes, the key conclusions are convincing with proper controls, quantifications and statistical analyses.

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      The conclusion that temporal transcription factors (TTF) do not affect the timing of medulla NB termination is somewhat preliminary. The authors investigated a simplified temporal series including Homothorax, Eyeless, Sloppy-paired, Dichaete and Tailless. However, there are additional temporal factors that have not been examined for their potential involvement in medullar NB termination. Previous reports have identified several other temporal factors that play a role in medulla TTF cascade, such as, SoxNeuro (SoxN) and doublesex-Mab related 99B (Dmrt99B) that start their expression in the NE similar to Hth, however, Dmrt99B is likely to be repressed much later than Hth (Li, Erclik et al. 2013, Zhu, Zhao et al. 2022). At this point, it remains challenging to completely rule out the possibility that other temporal factors play a role in medullar NB termination or have redundant functions in regulating the timing of medulla NB cessation. It is suggested to tone down this claim and provide a brief discussion on alternative possibilities, citing relevant papers on the functions of other temporal factors in medullar NBs.

      We agree.

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      Loss of pros by RNAi caused the formation of ectopic NBs and the NBs persist even at 24h APF. Do these NBs persist at 30h or 48h APF? Does overexpression of Pros result in early termination of medulla NBs?

      We will do these experiments in clones as suggested.

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Yes, I believe the suggested experiments are realistic in terms of time and resources, with an estimation of 3 months to complete the experiments.

      • Are the data and the methods presented in such a way that they can be reproduced?

      Yes.

      • Are the experiments adequately replicated and statistical analysis adequate?

      The experiments are straight forward and were performed with proper controls, supported by quantifications and proper statistical analyses. However, there is no mention about how many replicates were used.

      We will add this information in our Material and Methods section.

      Minor comments:

      1. The authors use the eyR6F10-Gal4 driver in certain experiments. The eyR6F10-Gal4 driver is however expressed only in a subset of medulla NBs. Can the authors comment on what percentage of medulla NBs is the driver expressed in? We will characterize this.

      Does the EGFR signalling pathway or JAK/STAT pathway affect the timing of termination of medulla NBs? Experiments are not necessary. The author can speculate on their roles.

      We will modify our discussion accordingly.

      Figure 1C has a p value of only 0.03 (*) but shows a strong reduction in the number of Dpn+ cells from 12h to 18h, etc. Is this correct? Also, is the p value the same for the comparison between 12h and 24h as well as 12h and 30h APF?

      Yes. P-values showed no significant differences between 28-24h and 24-30h APF.

      The controls in figure 2B and to some extent figure 2H show one major outlier (much higher than the other brain lobes in the control). Will the removal of this outlier affect the significance/ p-value of the experiment?

      No, removing the outliers do not change the statical results.

      In figure 2B what is the p-value between 12h and 18h APF? Is it *** as well?

      No, it’s not significant.

      Line 84 of the introduction introduces Tll, Gcm and Pros for the first time in the manuscript and should be written out in full.

      We will change this.

      • Are prior studies referenced appropriately?

      Yes.

      • Are the text and figures clear and accurate?

      Yes.

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Quite a few of data mentioned in the manuscript have been described as data not shown. I think it would be nice to show quantifications or representative images in the supplementary figures.

      We will add the data which was previously not shown.

      Reviewer #3 (Significance (Required)):

      Since the mechanisms by which medulla NBs are terminated are currently unknow, this is an important and interesting study to understand how medulla neuroblasts in the optic lobe are terminated. The balance between stem cell maintenance and differentiation is critical for proper brain development and the results presented in this paper are impactful. Furthermore, Drosophila melanogaster is an excellent model to study stem cell niches and neuroblast temporal patterning. The authors provide key mechanisms namely cell death, Pros-mediated differentiation and the gliogenic switch that contribute to a better understanding of how the NB progenitor pool can be terminated in the Drosophila OL, which is largely supported by the data.

      • Place the work in the context of the existing literature (provide references, where appropriate).

      So far, most work in this field has focused on the regulation of the temporal factors to promote the progression of the TTF transcriptional cascade and thereby diversity of the neural progenitors (Li, Erclik et al. 2013, Naidu, Zhang et al. 2020, Ray and Li 2022, Zhu, Zhao et al. 2022). Furthermore, work on pathways such as EGFR and Notch signalling that allows the proneural wave to progress and subsequently induce neuroblast formation in a precise and orderly manner have also been studied (Yasugi, Umetsu et al. 2008, Yasugi, Sugie et al. 2010). Here, considering previous literature, the authors move one step forward to determine how and when these neuroblast progenitors cease proliferation during development thus providing mechanisms for the regulation of the neuroepithelial stem cell pool, its timely conversion into NSCs and the switch from neurogenesis to gliogenesis thus providing important implications for brain size determination and function.

      • State what audience might be interested in and influenced by the reported findings.

      Stem cell research, neurobiologists and developmental biologists.

      • Define your field of expertise

      Stem cells, developmental biology

    1. Author Response:

      We thank the reviewers and editor for their careful analysis of our manuscript and their appreciation of its strengths. Our plans to address the reviewers’ concerns regarding the weaknesses of the study are outlined below.

      Reviewing Editor (Public Review):

      “Weaknesses mainly concern the experiments and arguments leading to the authors' notion that Cav3 channels may partially compensate for the loss of Cav1.4 calcium currents in cone synapses. It is possible that the non-conducting Cav1.4 variant supports synapse development and the Cav3 channel then provides the calcium influx. However, in its current state, the study does not unequivocally assess Cav3 expression in wild-type cones, it lacks direct evidence of Cav3 expression and upregulation, e.g. via single cell transcriptomics, immunolabeling, or an elaboration on electrophysiology, and it does not test the authors' earlier idea that Cav1.4 might couple to intracellular calcium stores at photoreceptor synapses.”

      Current transcriptomic studies indicate that Cav3 transcripts are present at extremely low levels compared to that for Cav1.4 in cones of young mice (PMID 26000488, summarized in PMID 35650675), adult mice (PMID: 36807640), macaque (PMID 30712875), and human (PMID 31075224). Thus, it was somewhat surprising that Davison et al reported the presence of low voltage activated (LVA) Cav3-like currents with amplitudes that were ~50% of that for the Cav1 current in mouse cones at -40 mV (PMID 35803735). Using similar pharmacological criteria as Davison et al, we did not find functional evidence for a LVA current in cones of wild-type (WT) mouse retina: the Ca2+ current in our recordings was suppressed by the Cav1 antagonist isradipine (Fig 3a) but minimally affected in the expected voltage range by the Cav3 antagonist ML218 (Fig 3b). In WT mouse, voltage clamp steps from -90 mV to more depolarized voltages failed to show a transient inward current at onset (Fig 2e), which is a hallmark of LVA calcium currents. In addition, by standard physiological and pharmacological critera, we could not identify LVA currents in cones of ground squirrel (Fig.3c,d) and macaque retina (Supp. Fig.S3). Our results argue against a significant role for LVA currents in mammalian cones.

      A problem that we discovered (as did Davison et al, their Fig.2C) was that Cav3 blockers (e.g., ML218 and Z944) have non-specific actions on the high voltage activated (HVA) Ca2+ current (presumably mediated by Cav1.4) in WT mouse cones. This is clearly shown in our Supp. figure S1a-b where ML218 causes a dose-dependent negative shift in the I-V relationship but also inhibition of current density in HEK293T cells transfected with Cav1.4. We are planning a second study to thoroughly characterize these actions of ML218 and Z944 on Cav1 channels as the results are important for understanding the actions of these drugs in cell-types with mixed populations of Cav1 and Cav3 channels.

      A second problem is that dihydropyridines (DHP) used in both our study and that of Davison et al (e.g., isradipine, nifedipine) incompletely and slowly block Cav1 channels at negative membrane potentials (PMID: 12853422). Due to the slow kinetics of DHP block, Cav1 currents in the presence of such blockers can appear to inactivate rapidly (see Fig.6A in PMID 11487617). Thus, the Cav current recorded in the presence of DHP blockers in WT mouse cones may represent unblocked Cav1.4-mediated currents that appear rapidly inactivating, and therefore misconstrued as being mediated by Cav3 channels.

      Given the caveats of the pharmacological approach, we agree that stronger evidence is needed to rule out a small contribution of Cav3 channels in WT mouse cones. As mentioned in our text, we have found that currently available Cav3 antibodies produce similar patterns of immunofluorescence in WT and corresponding Cav3 KO retina so analysis at the level of Cav proteins is not possible. Thus, we are planning to compare the relative expression of Cav channel genes in cones using drop-seq experiments of G369i KI and WT mouse retina. We also plan to elaborate on our electrophysiological dissection of the HVA and LVA currents.

      Among the 3 Cav3 subtypes, Cav3.2 was the only one detected in mouse cones by Davison et al using nested RT-PCR (PMID 35803735). Thus, we obtained the Cav3.2 mouse strain from JAX (B6;129-Cacna1htm1Kcam/J) and generated a Cav3.2 KO/G369i KI double mutant mouse strain. If the Cav3 current that appears in the G369i KI cones is mediated by Cav3.2, then it should be undetectable in cones of the double mutant mice. Moreover, if these Cav3.2 channels contribute to the residual cone synaptic responses in G369i KI mice, then the double mutant mice should be deficient in this regard. We will test these predictions in patch clamp recordings and ERGs.

      Finally, we will conduct Ca2+ imaging experiments in cone terminals of the WT vs G369i KI mice to test whether increased coupling of Cav channels to intracellular Ca2+ release may be involved in cone synaptic responses of the G369i KI mice.

      Reviewer #1 (Public Review):

      Weaknesses:

      “The major criticism that I have of the study is that it infers Ca channel molecular composition based solely on pharmacological analysis, which, as the authors note, is confounded by the cross-reactivity of many of the "specific" channel-type antagonists. The authors note that Cav3 mRNAs have been found in cones, but here, they do not perform any analysis to examine Cav3 transcript expression after G369i-KI nor do they examine Ca channel transcript expression in monkey or squirrel cones, which serve as controls of sorts for the G369i-KI (i.e. like WT mouse cones, cones of these other species do not seem to exhibit LVA Ca currents).”

      Actually, we also used non-pharmacological (i.e., electrophysiological) criteria to back up our interpretation that Cav3 channels contribute to the Cav current in cones primarily in the absence of functional Cav1.4 channels. For example, in Fig.2, we show that the Ca2+ current in G369i KI and Cav1.4 KO mice exhibit the hallmarks of the Cav3 channel (negative activation and inactivation voltages and window current, rapid inactivation), which are quite distinct from the Ca2+ currents in WT cones. In recordings of ground squirrel and macaque cones (Supp.Figs.S2-3), negative holding voltages do not unmask a LVA current according to various criteria. In addition to the transcriptomic approaches described above, we plan to elaborate on the electrophysiological evidence for the absence of a LVA current in WT mouse cones as part of the revision.

      “Secondarily, in Maddox et al. 2020, the authors raise the possibility that G369i-KI, by virtue of having a functional voltage-sensing domain-might couple to intracellular Ca2+ stores, and it seems appropriate that this possibility be considered experimentally here.”

      We will conduct Ca2+ imaging experiments in cone terminals of the WT vs G369i KI mice to test whether increased coupling of Cav channels to intracellular Ca2+ release may be involved in cone synaptic responses of the G369i KI mice.

      “As a minor point: the authors might wish to note - in comparison to another retinal ribbon synapse-that Zhang et al. 2022 (in J. Neuroscience) performed a study of mouse rod bipolar cells found a number of LVA and HVA Ca conductances in addition to the typical L-type conductance mediated by Cav1-containing channels.”

      We are aware of the extensive evidence for the expression of Cav3 channels in retinal bipolar cells (PMID 11604141, 22909426, 19275782, 35896423) and our recordings of cone bipolar cells in ground squirrel confirm this (Supp. Fig.S2D). We could add reference to this work in our revision.

      Reviewer #2 (Public Review):

      Weaknesses:

      “The major critiques are related to the description of the Cav1.4 knock-in mouse as "sparing" function, which can be remedied in part by a simple rewrite, and in certain places, the data may need to be examined more critically. In particular, the authors should address features in the data presented in Figures 6 and 7 that seem to indicate that the retina of the Cav1.4 knock-in is not intact, but the interpretation given by the authors as "intact" is not appropriate and made without rigorous statistical testing.”

      We intended to use “sparing” and “intact” to indicate that cone synapses are present and to some extent functional, in contrast to their complete absence in the Cav1.4 KO mouse. However, we recognize this may be misinterpreted as “normal”. As suggested by the reviewer, we will revise our statistical analyses and text to clarify that cone synaptic responses do indeed differ significantly in G369i KI as compared to WT mice. We feel that this will be a strong addition to the study and will emphasize the key point that Cav3 cannot fully compensate for loss of Cav1.4 with respect to cone synapse structure and function.

      Reviewer #3 (Public Review):

      Weaknesses:

      “The study has been expertly performed but remains descriptive without deciphering the underlying molecular mechanisms of the observed phenomena, including the proposed homeostatic switch of synaptic calcium channels. Furthermore, a relevant part of the data in the present paper (presence of T-type calcium channels in cone photoreceptors) has already been identified/presented by previous studies of different groups (Macosko et al., 2015; pmid 26000488; Davison et al., 2021; pmid 35803735; Williams et al., 2022; pmid 35650675). The degree of novelty of the present paper thus appears limited.”

      We respectfully disagree that our paper lacks novelty. As indicated by Reviewer 2, a major advance of our study is in providing a mechanism that can explain the longstanding conundrum that congenital stationary night blindness type 2 mutations that would be expected to severely compromise Cav1.4 function do not produce complete blindness. We also disagree that the presence of T-type channels in cone photoreceptors has been unequivocally demonstrated, as the non-biased transcriptomic approaches show very little Cav3 transcript expression in mouse cones (PMIDs 26000488, 35650675, 36807640), macaque cones (PMID 30712875), and human cones (PMID 31075224). Transcription may not equate to translation, particularly at low expression levels. We also note that the one study to date that suggests a functional contribution of Cav3 channels in mouse cones (Davison et al., 2021; pmid 35803735) used a DHP to isolate the “LVA” current, which is problematic as described above. Our demonstration of minimal or undetectable Cav3-type currents in mammalian cones using physiological and pharmacological approaches, while a negative result, adds important context to the recent literature. As described in our response to the editor’s review, our planned revisions include testing whether Cav3 transcripts are upregulated in G369i KI cones and whether the Cav3.2 subtype suggested to be present in cones (PMID 35803735) contributes to Cav currents in these cells using Cav3.2 KO and Cav3.2 KO/G369i KI double mutant mice.

    1. Author Response

      The following is the authors’ response to the original reviews.

      Response to reviews

      We would like to extend our thanks to the reviewers who took the time to carefully read our paper and provide thoughtful insights and suggestions on how to strengthen our conclusions. All reviewers agreed that our study presented strong data supporting a role for triglyceride lipase brummer (bmm) in regulating testis lipid droplets and spermatogenesis in Drosophila, and that our findings advance our understanding of lipid biology during sperm development. Reviewers made several helpful suggestions on how to strengthen our manuscript even further. Below, we outline how we revised our manuscript in response to reviewer comments to ensure we clearly communicate our data and conclusions with readers, and properly contextualize our findings.

      REVIEWER 1

      In this study, the authors investigate the role of triglycerides in spermatogenesis. This work is based on their previous study (PMID: 31961851) on triglyceride sex differences in which they showed that somatic testicular cells play a role in whole body triglyceride homeostasis. In the current study, they show that lipid droplets (LDs) are significantly higher in the stem and progenitor cell (pre-meiotic) zone of the adult testis than in the meiotic spermatocyte stages. The distribution of LDs anti-correlates with the expression of the triglyceride lipase Brummer (Bmm), which has higher expression in spermatocytes than early germline stages. Analysis of a bmm mutant (bmm[1]) - a P-element insertion that is likely a hypomorphic - and its revertant (bmm[rev]) as a control shows that bmm acts autonomously in the germline to regulate LDs. In particular, the number of LDs is significantly higher in spermatocytes from bmm[1] mutants than from bmm[rev] controls. Testes from males with global loss of bmm (bmm[1]) are shorter than controls and have fewer differentiated spermatids. The zone of bam expression, typically close to the niche/hub in WT, is now many cell diameters away from the hub in bmm[1] mutants. There is an increase in the number of GSCs in bmm[1] homozygotes, but this phenotype is probably due to the enlarged hub. However, clonal analyses of GSCs lacking bmm indicate that a greater percentage of the GSC pool is composed of bmm[1]-mutant clones than of bmm[rev]-clones. This suggests that loss of bmm could impart a competitive advantage to GSCs, but this is not explored in greater detail. Despite the increase in number of GSCs that are bmm[1]-mutant clones, there is a significant reduction in the number of bmm[1]-mutant spermatocyte and post-meiotic clones. This suggests that fewer bmm[1]-mutant germ cells differentiate than controls. To gain insights into triglyceride homeostasis in the absence of bmm, they perform mass spec-based lipidomic profiling. Analyses of these data support their model that triglycerides are the class of lipid most affected by loss of bmm, supporting their model that excess triglycerides are the cause of spermatogenetic defects in bmm[1]. Consistent with their model, a double mutant of bmm[1] and a diacylglycerol Oacyltransferase 1 called midway (mdy) reverts the bmm-mutant germline phenotypes.

      There are numerous strengths of this paper. First, the authors report rigorous measurements and statistical analyses throughout the study. Second, the authors ulize robust genetic analyses with loss-of-function mutants and lineage-specific knockdown. Third, they demonstrate the appropriate use of controls and markers. Fourth, they show rigorous lipidomic profiling. Lastly, their conclusions are appropriate for the results. In other words, they don't overstate the results.

      We thank the Reviewer for their positive assessment of our paper.

      There are a few weaknesses. Although the results support the germline autonomous role of bmm in spermatogenesis, one potential caveat that the mdy rescue was global, i.e., in both somatic and germline lineages. The authors did not recover somatic bmm clones, suggesting that bmm may be required for somatic stem self-renewal and/or niche residency. While this is beyond the scope of this paper, it is possible that somatic bmm does impact germline differentiation in a global bmm mutant.

      In the revised manuscript, we made several changes to address these points.

      1) We now clearly state when we used global versus germline-only loss of mdy to rescue bmm mutant phenotypes in the testis.

      “Notably, at least some of the effects of global loss of mdy on bmm1 males can be attributed to the germline:

      RNAi-mediated knockdown of mdy in the germline of bmm1 males partially rescued the defects in testis size (Figure 4I; Kruskal-Wallis rank sum test with Dunn’s multiple comparison test) and GSC variance (Figure S5J; p=4.5 x 10-5 and 8.2 x 10-3 by F-test from the GAL4- and UAS-only crosses, respectively).”

      “Importantly, testes isolated from males with global loss of both bmm and mdy (mdyQX25/k03902;bmm1) had fewer LD than testes dissected from bmm1 males (Figures 5D, S5I; one-way ANOVA with Tukey multiple comparison test).”

      2) We also discuss the possibility that somatic bmm may play a role in germline differentiation in a global bmm mutant, and present phenotypic data on somatic bmm1 clones.

      “We also reveal a potential non-cell-autonomous role for somatic bmm. While there was no difference in the ratio of Zd-1-positive cells between homozygous clones and heterozygous clones in animals carrying the bmm1 or bmmrev alleles at 14 days post clone induction (Figure S4O; Kruskal-Wallis rank sum test), the distance from the hub to the Zd-1 positive clones reside was significantly decreased in bmm1 homozygous clones (Figure S4P; Kruskal-Wallis rank sum test). Together, these data indicate bmm may play a cell-autonomous role in germline cells, and potentially a non-cell-autonomous role in somatic cells, to regulate spermatogenesis.”

      3) Finally, we clarify that we were unable to assess somatic LD. Specifically, this was a technical issue as the dye we use to visualize testis LD is incompatible with staining protocols to identify somatic cells. As a result, we were unable to count LD in somatic clones with confidence.

      “While we were unable to assess LD in bmm1 somatic clones, our data when taken together reveals a previously unrecognized cell-autonomous role for bmm as a regulator of testis LD in germline cells.”

      Regarding data presentation, I have a minor point about Fig. 3L: why aren't all data shown as box plots (only Day 14 bmm[rev] does).

      In our revised manuscript Figure 4L does present a boxplot across all genotypes and times; the appearance of ‘no boxes’ is simply due to the large number of datapoints with a value of zero, which compress the box near the X-axis.

      Finally, the authors provide a detailed pseudotime analysis of snRNA-seq of the testis in Fig. S2A-D, but this analysis is not sufficiently discussed in the text.

      In the revised manuscript we added text to describe our pseudotime analysis of single-cell RNA seq data in more detail.

      “Using pseudotime analysis, we arranged the germline (Figure S2A) and the somatic cells (Figure S2B) based on their annotated developmental trajectory. The expression pattern of bmm in the germline matched our observation with bmm-GFP reporter (Figure S2C). While levels of the bmm-GFP reporter were lower in somatic cells, single-cell RNA sequencing data identified bmm expression in the somatic lineage that was higher in cells at later stages of development (Figure S2D). Additional neutral lipid- and lipid droplet-associated genes such as lipid storage droplet-2, Seipin, Lipin, and midway also showed differential regulation during differentiation (Figure S2C, S2D). Combined with our data on the location of testis LD, these data suggest that bmm upregulation in both somatic and germline cells during differentiation corresponds to the downregulation of testis LD. Supporting this, germline GFP levels were negatively correlated with testis LD in bmm-GFP flies (Figure 2A, 2C), suggesting regions with higher bmm expression had fewer LD.”

      Overall, the many strengths of this paper outweigh the relatively minor weaknesses. The rigorously quantified results support the major aim that appropriate regulation of triglycerides are needed in a germline cell-autonomous manner for spermatogenesis.

      This paper should have a positive impact on the field. First and foremost, there is limited knowledge about the role of lipid metabolism in spermatogenesis. The lipidomic data will be useful to researchers in the field who study various lipid species. Going forward, it will be very interesting to determine what triglycerides regulate in germline biology. In other words, what functions/pathways/processes in germ cells are negatively impacted by elevated triglycerides. And as the authors point out in the discussion, it will be important to determine what regulates bmm expression such that bmm is higher in later stages of germline differentiation.

      We agree with the reviewer about the many interesting future directions for this project. We added a model figure in the revised manuscript to visualize our findings and highlight remaining questions about how bmm and triglycerides support normal spermatogenesis in Drosophila (Fig. 6).

      REVIEWER 2

      Summary:

      Here, the authors show that neutral lipids play a role in spermatogenesis. Neutral lipids are components of lipid droplets, which are known to maintain lipid homeostasis, and to be involved in non-gonadal differentiation, survival, and energy. Lipid droplets are present in the testis in mice and Drosophila, but not much is known about the role of lipid droplets during spermatogenesis. The authors show that lipid droplets are present in early differentiating germ cells, and absent in spermatocytes. They further show a cell autonomous role for the lipase brummer in regulating lipid droplets and, in turn, spermatogenesis in the Drosophila testis. The data presented show that a relationship between lipid metabolism and spermatogenesis is congruous in mammals and flies, supporting Drosophila spermatogenesis as an effective model to uncover the role lipid droplets play in the testis.

      We thank the Reviewer for their positive assessment of our paper.

      Strengths and weaknesses:

      The authors do a commendably thorough characterization of where lipid droplets are detected in normal testes: located in young somatic cells, and early differentiating germ cells. They use multiple control backgrounds in their analysis, including w[1118], Canton S, and Oregon R, which adds rigor to their interpretations. The authors employ markers that identify which lipid droplets are in somatic cells, and which are in germ cells. The authors use these markers to present measured distances of somatic and germ cell-derived lipid droplets from the hub. Because they can also measure the distance of somatic and germ cells with age-specific markers from the hub, these results allow the authors to correlate position of lipid droplets with the age of cells in which they are present. This analysis is clearly shown and well quantified.

      The quantification of lipid droplet distance from the hub is applied well in comparing brummer mutant testes to wild type controls. The authors measure the number of lipid droplets of specific diafteters, and the spatial distribution of lipid droplets as a function of distance from the hub. These measurements quantitatively support their findings that lipid droplets are present in an expanded population of cells further from the hub in brummer mutants. The authors further quantify lipid droplets in germline clones of specified ages; the quantitative analysis here is displayed clearly, and supports a cell autonomous role for brummer in regulating lipid droplets in spermatocytes.

      Data examining testis size and number of spermatids in brummer mutants clearly indicates the importance of regulating lipid droplets to spermatogenesis. The authors show beautiful images supported by rigorous quantification supporting their findings that brummer mutants have both smaller testes with fewer spermatids at both 29 and 25C. There is also significant data supporting defects in testis size for 14-day-old brummer mutant animals compared to controls. The comparison of number of spermatids at this age is not significant, which does not detract from the story but does not support sperm development defects specifically caused by brummer loss at 14 days. Their analysis clearly shows an expanded region beyond the testis apex that includes younger germ cells, supporting a role for lipid droplets influencing germ cell differentiation during spermatogenesis.

      We thank the reviewer for pointing out this inaccuracy in our manuscript. In the revised manuscript we chose more precise language to describe defects in 14-day-old bmm mutants:

      “Defects in testis size were also observed at 14-day post eclosion; suggesting testis size defects persist later into the life course (Figure S4C; Welch two-sample t-test). In contrast, the number of spermatid bundles per testis was not significantly different between bmm1 and bmmrev males at this age (Figure S4D; Welch two-sample ttest), potentially due to a large decrease in the number of spermatid bundles in 14-day-old bmmrev males (Figure 4C, S4D).”

      The authors present a series of data exploring a cell autonomous role for brummer in the germline, including clonal analysis and tissue specific manipulations. The clonal data indicating increased lipid droplets in spermatocyte clones, and a higher proportion of brummer mutant GSCs at the hub are convincing and supported by quantitation. The authors also show a tissue specific rescue of the brummer testis size phenotype by knocking down mdy specifically in germ cells, which is also supported by statistically significant quantitation. The authors present data examining the number of spermatocyte and post-meiotic clones 14 days aeer clonal induction. While data they present is significant with a 95% confidence interval and a p value of 0.0496, its significance is not as robust as other values reported in the study, and it is unclear how much information can be gained from that specific result.

      We thank the reviewer for raising this point. In the revised manuscript we displayed the p-value clearly in the text and on the figure to ensure our statistical output is clear for readers to evaluate our conclusions regarding bmm mutant clones 14 days after clone induction. We also state that the finding should be reproduced by others given that the statistical significance of this result was not as strong as our other data.

      “Because we observed significantly fewer bmm1 spermatocyte and spermatid clones at 14 days after clone induction (Figure 4K,4L; p = 0.0496, Kruskal-Wallis rank sum test), these effects on germline development may represent a cell-autonomous role in regulating spermatogenesis for bmm in this cell type. Given that the statistical significance of this finding was not as strong as for our other data, future studies should repeat this experiment with more samples.”

      The authors do a beautiful job of validating where they detect brummer-GFP by presenting their own pseudotime analysis of publicly available single cell RNA sequencing data. Their data is presented very clearly, and supports expression of brummer in older somatic and germline cells of the age when lipid droplets are normally not detected. The authors also present a thorough lipidomic analysis of animals lacking brummer to identify triglycerides as an important lipid droplet component regulating spermatogenesis.

      Impact:

      The authors present data supporting the broad significance of their findings across phyla. This data represents a key strength of this manuscript. The authors show that loss of a conserved triglyceride lipase impacts testis development and spermatogenesis, and that these impacts can be rescued by supplementing diet with medium chain triglycerides. The authors point out that these findings represent a biological similarity between Drosophila and mice, supporting the relevance of the Drosophila testis as a model for understanding the role of lipid droplets in spermatogenesis. The connection buttresses the relevance of these findings and this model to a broad scientific community.

      We thank the Reviewer very much for their positive assessment of our paper!

      REVIEWER 3

      In this manuscript, Chao et al seek to understand the role of brummer, a triglyceride lipase, in the Drosophila testis. They show that Brummer regulates lipid droplet degradation during differentiation of germ and somatic cells, and that this process is essential for normal development to progress. These findings are interesting and novel, and contribute to a growing realisation that lipid biology is important for differentiation.

      We thank the Reviewer for their positive comments about our manuscript.

      Major comments:

      1) The data in Figs 1 and 2, while helpful in setting the scene, do not add much to what was previously shown by the same group, namely that lipid droplets are present in both early germ cells and early somatic cells in the testis, and that Bmm regulates their degradation (PMID: 31961851). Measuring the distance of lipid droplets from the hub, while helpful in quantifying what is apparent, that only stem and early differentiated stages have lipid droplets, is not as informative as the way data are presented later (Fig. 2I), where droplets in specific stages are measured. Much of this could be condensed without much overall loss to the manuscript.

      We thank the reviewer for this comment. In our revised manuscript we edited the first part of the paper while still preserving the detailed characterization that builds upon our previous paper.

      2) It would be important to show images of the clones from which the data in Fig. 2I are generated. The main argument is that Bmm regulates lipid droplets in a cell autonomous manner; these data are the strongest argument in support of this and should be emphasised at the expense of full animal mutants (which could be moved to supplementary data).

      We thank the reviewer for this comment. In the revised manuscript we added a figure showing lipid droplets in control and bmm mutant spermatocyte clones in Fig. 3A, 3B with a quantification of this data in Figure 3C.

      Similarly, the title of Fig. S2 ("brummer regulates lipid droplets in a cell autonomous manner") should be changed as the figure has no experiments with cell (or cell-type)-specific knockdowns/mutants. This figure does show changes in lipid droplets in both lineages in bmm mutants, so an appropriate title could be "brummer regulates lipid droplets in both germ and soma".

      We thank the reviewer for this comment, we adjusted the Figure 2 legend title in the revised manuscript to “brummer regulates lipid droplets in both germline and somatic cells of the testis”.

      3) Interestingly, the clonal data show that bmm is dispensable in germ cells until spermatocyte stages, as no increase in lipid droplet number is seen until then. This should be more clearly stated, as it indicates that the important function of Bmm is to degrade lipid droplets at the transition from spermatogonial to spermatocyte stages. This is consistent with the phenotypes observed in which late stage germ cells are reduced or missing. However, the effect on niche retention of the mutant GSCs at the expense of neighbouring wildtype GSCs is hard to explain. Are lipid droplets in mutant GSCs larger than in control? Is there any discernible effect of bmm mutation on lipids in GSCs? Additionally, bam expression is delayed, suggesting that bmm may have roles on cell fate in earlier stages than its roles that can be detected on lipid droplets.

      We thank the reviewer for this comment. We included more text in the revised manuscript to clarify the key role bmm plays in regulating lipid droplets at the spermatogonia-spermatocyte transition.

      “Because we observed no significant effect of cell-autonomous bmm loss on LD at any other stage of germline development (Figure 3C), this suggests bmm function is not required to regulate LD at early stages of germ cell development. Instead, our data suggests bmm plays a role in regulating LD at the spermatogonia-spermatocyte transition.”

      We also added more detail to our description of how bmm affects lipid droplets in cells at the earliest stages of germline development.

      “Given that we detected no effect of cell-autonomous bmm loss on the number of GSC LD (Fig. 3C), more work will be needed to understand how bmm regulates GSC at a stage prior to its effects on LD number.”

      4) The bmm loss-of-function phenotype could be better described. Some of the data is glossed over with little description in the text (see for example the reference to Fig. 3A-C). For instance, in the discussion, the text states "loss of bmm delays germline differentiation leading to an accumulation of early-stage germ cells" (p13, l.25960). However, this accumulation has not been clearly shown, or at least described in the manuscript. Most of the data show a reduction (or almost complete absence) of differentiated cell types. This could indeed be due to delayed differentiation, or alternatively to a block in differentiation or to death of the differentiated cells. The clonal data presented show a decrease in the number of cells recovered, but do not allow inferences as to the timing of differentiation, making it hard to distinguish between the various possibilities for the lack of differentiated spermatids. Apart from data showing that GSCs are more likely to remain at the niche, no further data are shown to support the fact that mutant germ cells accumulate in early stages. While additional experiments could help resolve some of these issues, much of this could also be resolved by tempering the conclusions drawn in the text.

      We thank the reviewer for these comments. In the revised manuscript we temper our conclusions regarding bmm’s precise role in spermatogenesis by discussing different mechanisms (e.g. differentiation or death) that could lead to the phenotypes we observe.

      “This regulation is important for sperm development, as our data indicates that loss of bmm causes a decrease in the number of differentiated cell types. This reduction in differentiated cell types may be attributed to a delay in differentiation, a block in differentiation, or to a loss of differentiated cells through cell death. Future studies will therefore be essential to resolve why bmm loss causes a reduction in differentiated cell types.”

      5) In the discussion (p.14, l-273 onwards), the authors suggest that products of triglyceride breakdown are important for spermatogenesis. However, an alternative interpretation of the results presented here (especially those using the midway mutant) could be that triglycerides impede normal differentiation directly. Indeed, preventing the cells' ability to produce triglycerides in the first place can rescue many of the defects observed. A better discussion of these results with a model for the function of triglycerides and their by-products would be a great improvement to this manuscript.

      We thank the reviewer for this comment. To ensure our data is clearly communicated with readers, we added a model to the paper suggesting how triglyceride and its by-products influence spermatogenesis (Fig. 6) and text to clarify that triglyceride could potentially impeded differentiation.

      “It will also be important to determine whether it is the loss of metabolites produced by bmm’s enzymatic action, or an increase in triglycerides, that leads to the reduction in differentiated cell types during spermatogenesis. Together, these experiments will provide critical insight into how triglyceride stored within testis LD contributes to overall cellular lipid metabolism during spermatogenesis.”

      Together, these changes will strengthen our overall finding that bmm-mediated regulation of testis triglyceride is important for normal sperm development. Because our findings in flies align with and extend data from rodent models, the developmental mechanisms we uncovered about how triglyceride lipase bmm regulates testis lipid droplets and sperm development will likely operate in other species.  

      Reviewer #1 (Recommendations For The Authors):

      I have a minor concern about methodology: how were spermatocytes identified? I ask because data in Figure 3 indicate that there is a significant delay in germline differentiation in the bmm[1] mutant, with relatively smaller germ cells throughout the apical half of the testis. Typical large spermatocyte-like cells are not clearly obvious to me in Fig. 3.

      We thank the Reviewer for suggesting we add more clarity to how we identified spermatocytes. We state in the revised manuscript how we identify spermatocytes:

      “Cells in the testis region occupied by primary spermatocytes were identified by their large cell size and decondensed chromosome staining occupying three nuclear domains [120].”

      Also, we note that while it is difficult to see where the bmm1 testis have spermatocytes in Fig. 4E, this is due to the large number of early-stage cells in this close-up image. The spermatocytes can be more easily seen in Fig. 4I and 4I’ when the whole testis is included in the image.    

      Reviewer #2 (Recommendations For The Authors):

      • Lines 197-198 mention "Boule-positive area," "individualization complexes," and "waste bags." It would be helpful to the reader to explain what these measurements are to help contextualize the data shown related to these statements.

      We thank the Reviewer for this comment. We added the following text to the revised manuscript:

      “Because Boule-positive area, individualization complexes, and waste bags are all markers for later stages in sperm development, these data indicate the loss of bmm causes a reduction in differentiated cell types.”

      • Line 162 states a defect in sperm development observed in 14-day-old bmm[1] males, but the data presented in Figure S3D does not show a significant difference. The words "sperm development" should be removed from this sentence.

      We thank the Reviewer for pointing out this inaccurate statement. We fixed the statement as follows in the revised manuscript:

      “Defects in testis size were also observed at 14-day post eclosion; suggesting testis size defects persist later into the life course (Figure S4C; Welch two-sample t-test). In contrast, the number of spermatid bundles per testis was not significantly different between bmm1 and bmmrev males at this age (Figure S4D; Welch two-sample ttest), potentially due to a large decrease in the number of spermatid bundles in 14-day-old bmmrev males (Figure 4C, S4D).”

      • Line 294 has a typo: "regulating" should likely be "regulated"

      We thank the Reviewer for pointing out this mistake, which we corrected.

      • Line 456 should include the length of time for heat shock

      We thank the Reviewer for pointing out this omission. We now include these details:

      “Adult males were collected at 3-5 days post-eclosion and heat-shocked three times at 37°C for 30 min followed by a 10 min rest period at room temperature between heat shocks.”

      • Methods section beginning on Line 442 might include an explanation of how hub area was quantified.

      We thank the Reviewer for this suggestion. We now include the following information:

      “Hub size was measured by quantifying FasIII-positive area of the testis.”

      • Figure 1 legend could benefit from adding a statement on how spermatocytes (arrowheads) were identified

      We thank the Reviewer for this suggestion, we now refer the reader to the more detailed description in the methods section.

      • Figure 2A should present the merged panel in A' first. The legend states that Panel A shows Lipid Droplets, but LipidTox is not shown until A'.

      We thank the Reviewer for this suggestion, we now clarify that the text refers to panels A-A''''.

      • Figure 2I would benefit from a key, to emphasize that these are individual cell clones, highlighting the idea of cell autonomous effects of bmm in the spermatocytes. Showing example images of spermatocyte clones with increased lipid droplets could also emphasize this result. The legend for this panel should note the statistical test done to confirm significance in the SC result.

      We agree with the Reviewer and have added images of the LD in bmm1 spermatocyte clones in Figure 3B, and the quantification in Figure 3C. We explicitly state the significance of this result and the statistical test in Figure 3 legend.

      • In Figure 3, the cell autonomous data clearly indicates that there are higher proportions of bmm mutant GSCs occupying the hub compared to control GSCs. It could be worth stating whether this observation indicates an increased ability of bmm mutant GSCs to compete for occupying space at the hub.

      We thank the Reviewer for pointing out this potential implication of our data, which we acknowledge in the revised version of our manuscript:

      “Future studies will also need to confirm whether bmm1 mutant GSCs show an increased ability to occupy space at the hub.”

      • In Figure 4, I suggest changing the title of Panel B to "Proportion of significant species in each lipid class" for clarity.

      We made this change in the Figure 5 legend (Figure 5 is the corresponding figure in the revised manuscript).

      • It could be valuable to quantify the number of spermatids in the germline specific mdy knockdown, which would lend additional support to a cell autonomous requirement for bmm in spermatogenesis

      We added a sentence to the revised manuscript recognizing that this is an interesting experiment for studies on the role of germline triglyceride in promoting spermatogenesis.

      “While future studies will need to test whether germline-specific loss of mdy also rescues spermatid number defects in bmm1 males, our data suggest bmm-mediated regulation of testis triglyceride plays a previously unrecognized role in regulating sperm development.”

      Reviewer #3 (Recommendations For The Authors):

      1) bmm-GFP does not show expression in somatic cells yet previous work by the same group has shown a requirement for bmm in the testis soma using C587-Gal4.

      We thank the Reviewer for raising this issue. While the reporter shows low GFP expression in the somatic cells, the single-cell RNA sequencing data we analyze suggests bmm is expressed in these cells. We address this issue in the revised manuscript as follows:

      “While levels of the bmm-GFP reporter were lower in somatic cells, single-cell RNA sequencing data identified bmm expression in the somatic lineage that was higher in cells at later stages of development (Figure S2D).”

      2) p.11 l.200-202 "Because we recovered fewer bmm1 spermatocyte and spermatid clones 14 days after clone induction (Figure 3K,3L; Kruskal-Wallis rank sum test), this effect on germline development represents a cell-autonomous role for bmm." This sentence should be rephrased as the phenotype could be a combination of autonomous roles within the germline and non-autonomous roles in supporting cyst cells.

      “We also reveal a potential non cell-autonomous role for somatic bmm. While there was no difference in the ratio of Zd-1-positive cells between homozygous clones and heterozygous clones in animals carrying the bmm1 or bmmrev alleles at 14 days post clone induction (Figure S4O; Kruskal-Wallis rank sum test), the distance from the hub to the Zd-1 positive clones reside was significantly decreased in bmm1 homozygous clones (Figure S4P; Kruskal-Wallis rank sum test). Together, these data indicate bmm may play a cell-autonomous role in germline cells, and potentially a non-cell-autonomous role in somatic cells, to regulate spermatogenesis.”

      3) The labelling in Fig. 3 is confusing - presumably the graph in 3C refers to spermatid bundles [this comment applies to other figures showing spermatid bundle numbers], not individual spermatids, while the graph in 3G refers to the proportion of the total GSC pool that is contained within the clone. The data in Fig. 3C are not described in the main text.

      We adjusted the confusing labelling to ‘spermatid bundles’ from ‘number of spermatids’, as suggested. We also changed the title of panel Fig. 3G (now 4G) as suggested and men5oned Fig. 3C (now Fig. 4C) in the text.

      4) On p.9, comments are speculative or seek to draw comparisons with the broader literature and would seem to belong more to the discussion (eg "our data suggests flies are a good model to study how bmm/ATGL influences sperm development" - also there is a typo, it should be "suggest").

      We thank the Reviewer for raising concern about our speculative statement; we changed the text as follows in the revised manuscript:

      “This identifies similarities between flies and mice in fertility-related phenotypes associated with whole-body loss of bmm/ATGL.”

      5) The length of the heat shocks used for clone induction should be specified in the methods (rather than just the period in between heat shocks).

      We now include more information on clone induction:

      “Adult males were collected at 3-5 days post-eclosion and heat-shocked three times at 37°C for 30 min followed by a 10 min rest period at room temperature between heat shocks. Amer heat-shock, the flies were incubated at room temperature until dissection.”

      6) p.8 l.132 "bmm-GFP accurately reproduces changes to bmm mRNA levels". This sentence should be rephrased.

      We thank the Reviewer for this comment and rephrased the sentence:

      “We first examined bmm expression in the testis by isolating this organ from flies carrying a bmm promoter driven GFP transgene (bmm-GFP) that recapitulates many aspects of bmm mRNA regulation [77].”

      7) p.9 l.172 "we used germline-specific marker" should read "we used an antibody against the germline-specific marker".

      We corrected this inaccurate statement in our revised manuscript.

      8) p.10 several lines, "GSC" should be "GSCs".

      We corrected this inaccurate use of GSC in our revised manuscript.

      9) p.13 l.247 should read "variance in GSC numbers".

      Thank you, this error was fixed.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Rebuttal for Review Commons to:

      “A specific innate immune response silences the virulence of Pseudomonas aeruginosa in a latent infection model in the Drosophila melanogaster host”

      We thank the reviewers for their careful scrutiny of our manuscript. We believe that we have addressed satisfactorily the points raised by the reviewers and that our revised manuscript is definitely improved. Our replies below are in blue and use a distinctive font.

      Reviewer #1

      __Evidence, reproducibility and clarity __

      This works describes a latent Drosophila intestinal infection, which spreads systemically, with a direct systemic Drosophila infection using a common laboratory strain of Pseudomonas aeruginosa. The major observation of this study is that P. aeruginosa can cause a latent infection via its passage through the gut (as opposed to being injected). In doing so it exhibits cell rounding (instead of elongation), reduced cell motility, loss of O5-antigen, antibiotic resistance, ability to cross the intestinal barrier and circulate in the hemolymph and infiltrate the host tissue underneath the cuticle. In addition, latent infection bacteria induce all brunches of the systemic response: the Imd pathway, phagocytosis, and the melanization cascade. Moreover, the melanization pathway protects the host from a secondary systemic infection with various types of bacterial and fungal microbes.

      An issue that needs to be clarified is the sensitivity of P. aeruginosa virulence to its biochemical environment. The authors note that. For example, liquid bacterial culture in BHI induces the latent form of bacteria. So the growth conditions and the infection media play a major role in the infection process. They authors need to clarify further the effect of media and infection vehicles, sucrose (high/low), LB, and BHI (as well as temperature) on the latent phenotype.

      Temperature is definitely an important parameter and bacteria appear to be somewhat more virulent at 25°C. This point is now addressed in the Material and Methods section (lines 675-679) and in Fig. S6I.

      As regards the influence of the composition of the infection solution, it does not seem to be a critical parameter that we have described in the context of continuous feeding on the bacterial solution (Limmer et al., PNAS, 2011). In preliminary experiments, we had tried LB or BHB medium to grow the bacteria and this did not make any difference (see Panel A below for LB [BHB used in our experiments]). As regards the sucrose concentration to the infection solution, we have tried two concentrations and did not observe any difference as regards the establishment of the latent infection. (see panel B below for 50 mM sucrose [100mM used in our experiments]). Of note, P. aeruginosa does not grow on sucrose solution alone. However, a latent infection was still established upon feeding the flies with PAO1 in sucrose alone, albeit likely with a mildly increased virulence, in the absence of any BHB medium (see panels C-D) below.

      A) Comparison of LB vs. BHB B) Establishment of latent infection with 50mM sucrose

      C) Establishment of a latent infection with a sucrose-only bacterial solution D) Colonization of host tissues by PAO1 ingested in a sucrose-only bacterial solution Minor issues: -Lines 579-581> How were the PAO1GFP/RFP constructed (details are needed)

      Done; please, see lines 641-643.

      -Figure 1D and other figures > CFUs given as Log2 are unconventional. One cannot easily deduce the burden unless e.g. translate 2e10 to ~1000 and 2e30 to ~10e9 CFUs.

      True, but bacterial titer increases by a factor of two at each division cycle. Even though we have previously used a Log10 representation, we now prefer using a Log2 representation. This representation has also been used by other authors in the field, e.g., Duneau et al., eLife, 2019.

      -Figire S1DB (now S1C)> "but from the outside of the gut". The given experiment does not prove that statement.

      This issue has been already dealt with in the Nehme et al. PLoS Pathogens 2007 article, as cited in the manuscript. We further provide in Fig. S1B pictures documenting the presence of bacteria associated with visceral muscles. Finally, we also show that the gut lumen is essentially cleared of bacteria after a period of feeding on a sucrose solution or gentamicin. Hence, most bacterial colonies originate from the outer layer of the gut. We clarify the issue in the text (lines 154-158).

      -Lines 146-7 > data are missing in support to the statement.

      We have now added Fig. S1B to document that the gentamicin treatment does work, as actually does feeding on sucrose solution alone, as previously documented in Limmer et al., 2011 (Fig. S2B). Of note, we cannot exclude that a few bacteria remain, especially in the crop, but those would be at very low titer. Please, see also reply to Reviewer 2.

      -Figure S1C > The effect of injury seems to be huge, and may account for much/most of the differences observed (including those between latent and active infection). This is further supported by Figure4A, injury may account for gut collapse and/or systemic stress.

      It is well known that injury alone induces the systemic IMD pathway response 6 hours after injury but largely subsides by 24 hours. The point of Fig. S1C is that the level of induction reached during latent infection is very low as compared to that observed during a systemic infection, here obtained for reference with an Escherichia coli injection and to a lesser extent with a PBS injection. In our latent infection model, we do not perform any injury, except as noted by the reviewer in Fig. 4: the effects of an experimental injury are observed only while the bacteria are crossing the intestinal barrier and hardly any effect is observed when the injury is performed on day 10 (Fig. S4B).

      -Figure S1D > How was "fated to die" assessed?

      The fluorescent flies were sorted out and their subsequent survival was monitored. As compared to nonfluorescent flies from the same batch, they died within two days of sorting them.

      -Figure 3B/10th day > Average line is misplaced.

      We thank the reviewer for pointing out this problem. The line is not the average but the median. We have now added a precise description of the bars to all the figure legends.

      -Lines 382-5 > what is the evidence of gut damage (or the absence of it)? How do the bacteria escape the gut?

      The absence of major gut damages has been documented in Limmer et al, PNAS, 2011. How the bacteria escape the gut remains an open question (intracellular and/or paracellular route).

      -Lines437-442 > The distinction between dormant P. aeruginosa in the fly tissues and persister cells (upon antibiotic treatment) cannot be justifies on the basis of relative bacterial numbers in the two systems. The extent of resistance to antibiotics though my serve that purpose.

      In our latent infection model, most of the bacteria that have crossed the gut barrier become dormant and are associated with tissues, except at the beginning of the infection. In contrast, when a bacterial culture is treated with antibiotics, most of the bacteria are killed by the treatments and only a few ones persist, likely because of an inactive metabolism. Thus, the induction of dormancy in our latent infection model does not rely on the selection of a few metabolically-inactive bacteria able to withstand an immune response or an antibiotic treatment.

      Significance

      The study is a significant advance to our knowledge. Notwithstanding further explanations, it provides a solid basis of understanding active versus dormant bacteria. It further establishes a mode of intestinal to systemic infection as a tool for further explorations.

      Reviewer #2

      Evidence, reproducibility and clarity

      Summary: In this study, Chen and colleagues investigated a new latent infection model for Pseudomonas aeruginosa using Drosophila melanogaster as a host. First, the authors established a new model for latent Pseudomonas infection. The key feature of this model is the translocation of P. aeruginosa from the gut to the hemolymph and the colonization of fly tissues by the dormant bacteria. Bacteria that translocated from the gut appeared strikingly different in morphology and resistance to antibiotics compared to bacteria that were directly injected. Next, the authors suggest that melanization but not the Imd pathway or hemocytes are necessary to promote dormancy and colonization of fly tissues. Finally, flies with latent P. aeruginosa infection exhibit improved survival after secondary infections in a melanisation-dependent manner. The study reports an interesting model for latent infection, provides insights into the host factors promoting latency and describes some of the consequences of such latent infection for the host. However, some of the conclusions are not fully supported by the data and need further experimental evidence.

      Major comments: 1. The latent infection model requires some clarifications. First, temperature. Could the authors explain why they used 18 {degree sign}C and could low temperature contribute to the establishment of dormancy?

      As shown in Fig.S6I, the latent infection model is less compelling at 25°C in terms of survival curves, which may reflect an increased rate of spontaneous reactivation of the virulence, a phenomenon we have not studied at 25°C. In another manuscript in preparation (Lin et al.), we actually show that a small heat shock does contribute to reactivation of the bacteria, an issue that is outside of the scope of the present study. Please, see also reply to reviewer 1.

      Second, the use of gentamycin. How does gentamicin affect PAO1 outside the gut? From Fig.1C It looks like the cfus in the hemolymph diminished rapidly after gentamicin treatment (around day 3), suggesting the potential effect of the antibiotic. Once the bacteria have crossed the gut and entered the hemolymph, they could still be affected by feeding flies the antibiotic. Is there a possibility that gentamicin treatment is a stress factor that could trigger or facilitate the transition to dormancy? The authors could test this experimentally either by omitting the antibiotic and assessing dormancy or by feeding injected flies with gentamycin and scoring dormancy.

      We had actually tested the issue about the potential role of gentamicin outside of the gut compartment. We have thus fed flies on different concentrations of gentamicin and monitored the survival of those flies to the injection of PAO1 bacteria (please, see Figure below). When flies were feeding on the highest concentration of gentamicin tested, 32 mg/mL, they were succumbing fast to the PAO1 challenge, but not as fast as nontreated positive control PAO1 injected flies. The use of lower concentrations (16, 8, and 4 mg/mL) led to a progressively stronger protection from PAO1 injection that inversely correlated with the gentamicin dose. We interpret the data with high gentamicin concentrations as an indication that gentamicin at such concentrations is likely directly toxic to the flies, an issue that could be experimentally tested but is not relevant to this study. Interestingly, lower doses led to a much-decreased protection from PAO1 (2mg/mL) to no protection at the dose we use to establish latent infection (100 µg/mL). Thus, these data show that gentamicin can pass the gut barrier when provided at high concentrations, down to 2 mg/mL. However, there is no proof of such a passage at the dose we use. In keeping with this latter possibility, we made a control experiment in which the gentamicin treatment step was replaced by simply feeding on the sucrose solution: the bacterial titer decreased in the hemolymph at the same rate as for gentamicin-treated flies (new Fig. S1F), demonstrating that ingested gentamicin does not contribute to the decreased titer. Rather, the likely depletion of the “source”, that is PAO1 in the gut lumen, best accounts for this phenomenon.

      We have now cited references which document a lack of permeability of the gut barrier to ingested gentamicin in vertebrate animals (lines 130-133).

      As regards the possibility that gentamicin acts as a stress factor on bacteria, our data do not support this possibility, as a latent infection is established in the absence of gentamicin by just feeding the flies on a sucrose solution. We had previously reported that flies fed with P. aeruginosa for up to three days do not succumb within the next two weeks when they are fed on a sterile sucrose solution after having ingested the bacterial solution (Limmer et al., PNAS, 2011; Fig. 1C). Under the conditions of two days of PAO1 ingestion, we document in novel Fig. S1G that the carcass is equally well colonized under these conditions.

      Figure: impact of gentamicin ingestion at diverse concentrations on the survival of injected PAO1 bacteria. The ingested antibiotics can act on bacteria present in the hemocoel at concentrations over 2 mg/mL and not at that used in our experiments (100 µg/mL).

      Does melanization really induce the dormant state of the bacteria? I am not sure the provided data fully support this claim. Addressing these questions might provide a stronger evidence: Fig. 2 A-F: What causes the morphological changes of the bacteria? Melanization or the passage through the gut? Do authors see the same changes in bacteria retrieved from PO-deficient mutant flies? Fig. 2G: Do the authors see the same resistance of PAO1 that colonized PO mutant flies to antibiotics?

      In a novel Fig. S4, we now document comprehensively the physiological state of PAO1 bacteria fed to PO-deficient flies. We find that these bacteria are susceptible to antibiotics treatment as they can be rescued from death by the injection of antibiotics on day 3 (Fig. S4A-B). Second, they show a mixed phenotype in terms of colony morphology (Fig. S4C) and bacterial morphology and cell wall properties: even though most bacteria appeared to have kept a rounded morphology, they predominantly (about 75%) expressed the O5-LPS antigen. We interpret these data in terms of a slower transition to virulence than in a septic injury model. Melanization thus strongly contributes to the establishment of latency, even though it is likely that other factors contribute to the establishment of dormancy, but at best provide a minor contribution.

      How do PO mutants behave after PAO1 injection? Are they similarly more susceptible?

      PPO1-PPO2 mutants are not more susceptible to PAO1 injection than wt controls (new Fig. S3C).

      Fig. 3F: PPO1 is believed to be the fast-acting PPO, whereas PPO2 is deployed later in infection.

      This statement is based on experimental data gained on larvae, not adults. It is not really clear whether the about 10% adult hemocytes that express PPO2 actually contain crystals, in as much as the adult may be better oxygenated than larvae that grow in a hypoxic environment (description by the laboratory of Prof. Jiwon Shim of a role for PO in respiration at the latest EDRC meeting).

      How does the Western blot look for PPO1? Will it show an early induction of melanization that could drive the change into the dormant state?

      We provide below a characterization of the PPO antibody we use by Western blot analysis. This antibody had originally been raised by the late Dr. Hans-Michael Müller against a PPO from mosquito cell lines, hence explaining its cross-reaction to both * Drosophila PPO1 and PPO2 (Muller, H.M., Dimopoulos, G., Blass, C., and Kafatos, F.C. (1999)). A hemocyte-like cell line established from the malaria vector Anopheles gambiae expresses six prophenoloxidase genes. J Biol Chem 274*, 11727-11735.). It follows that at least one PO is partially cleaved at day 2 and that both are fully cleaved by day6 of the establishment of the latent infection (Fig. 3F, Fig. S3F).

      Figure: characterization of the antibody raised against A. gambiae PPO

      Alternatively, the induction of melanization could also be measured with an L-DOPA test.

      This experiment is not needed given the explanation provided above.

      Fig. 3E: Melanization prevents the growth of PAO1 adhering to tissues, as shown in Fig. 3E. One can see higher levels of cfus in the carcass in PO deficient flies compared to wt flies. However, after, 5 days, there is no difference in the cfus of wt and mutant flies anymore. If the growth inhibition was melanization mediated, would we not expect a consistent growth of bacteria in PO mutants? How to explain the drop in cfus in PO deficient mutants?

      This observation is difficult to account for and the explanations we can put forward at this stage are somewhat speculative. It appears that bacteria found in the tissues in PO-deficient flies have a morphology found in in vitro culture and within the gut, which does not correlate with virulence but also not with the avirulence state since they are LPS O5 positive. Given the shallow survival curves, we envision that there is a progressive release of bacteria from the tissue and then quick proliferation in the hemolymph in a few flies that would then die, but at a frequency too low to reliably ascertain in our hemolymph titer data, with a few flies displaying a high titer (Fig. 3D). By day5, the decreased titer in the carcass may reflect the progressive depletion of tissue-associated bacteria as they progressively become planktonic.

      Fig. 5D: How do PAO1 bacteria react to Levofloxacin treatment? Do they still go into the dormant state? Do they still attach to tissues? The authors should show that Levofloxacin treatment leads to the same dormant state as gentamycin before interpreting the results of this experiment.

      Taken together, our data yield a mixed result. When levofloxacin was fed for two days to latently-infected flies, we found that colonization was not altered (Fig. S2D’), in contrast to a septic injury model in which injected bacteria were susceptible to the ingested antibiotics (Fig. S2D”). Following the reviewer’s query, we have further monitored survival and bacterial colonization in the levofloxacin ingestion model. Fig. S2D had already demonstrated that ingested levofloxacin protects the flies from injected PAO1. Fig. S6F shows that the double mutant PO bacteria are protected from ingested PAO1 by the ingestion of this antibiotics. When we monitored the bacterial burden, we found for both wild-type and double PO mutant flies that the bacteria had been cleared in some 50% of the flies. The exact interpretation of the wild-type data is not straightforward. On the one hand, the colonizing bacteria may have become susceptible to the antibiotics even though they remained dormant. On the other hand, they might have been reactivated in their virulence and thus become secondarily susceptible to the antibiotics. For the double PO mutants, the 40% bacteria remaining may witness the mixed bacterial state of PAO1 in these mutants, as documented in Fig. S4. Nevertheless, the important point is that bacteria are unlikely to contribute to the demise of secondarily infected flies since they have been cleared in at least 50% of the flies, yet the secondarily challenged flies become susceptible only when the relevant melanization genes are affected. The nonPAO1-infected controls succumb faster to the infection than infected ones: the protection against secondary infections is provided by the activation of the melanization cascade by colonizing PAO1 bacteria, even if the colonization is transient in the levofloxacin treatment.

      We have altered the main text to reflect these novel data: lines 387-403.

      Minor comments:

      Lines 68-72. Mechanisms that are listed are not specific against Gram-negative bacteria but rather general. Please correct.

      We are of course aware of this. If it is general, it also applies to Gram-negative bacteria that are the focus of this article. Actually, an earlier version of the manuscript just mentioned the IMD pathway, hence the reference to Gram-negative bacteria. However, the Toll pathway is also required in the host defense against some Gram-negative pathogens such as P. aeruginosa. We have now deleted “Gram-negative” in this corrected version.

      Line 95. In - as?

      We are not sure we understand this comment. We have now added a reference documenting that P. aeruginosa can be found in rotting fruits (line 97).

      Lines 182-187. Some background information is needed. What is O5 LPS antigen? What motivated the authors to look at it specifically?

      The O-antigen is a long-chain polysaccharide motif that constitutes the outermost part of the cell wall. It varies according to the strain. We have added a couple of references that refer to O-antigen (line 198). We had actually already found out this result (unpublished) with the Serratia marcescens Db11 O-antigen (O18) that was not found in bacteria that had crossed the gut. The loss of the O5 antigen changes the surface of the bacterium and likely its interactions with tissues and/or the immune system. In the case of Serratia, we suspect that the loss of its O-antigen allows the bacterium to be phagocytosed in an eater-dependent manner.

      Fig. 3C: Why PPO1 and Hayan and PPO1,2 and Sp7 are compared but not mutant vs wild type?

      The reason is that it was obviously significant. We have now added the comparisons to wild-type in the revised figure.

      How precise is estimation of bacteria in the carcass?

      Even though it is not possible to measure how precise these measures are, they are nevertheless reproducible making us confident that they provide an estimate of the rough number of these bacteria found associated to tissues.

      How do the authors prevent dissemination of the bacteria during dissection? I wonder if some bacteria might by lost during the dissection (when removing the gut and ovaries) or if you carry over some bacteria from the hemolymph into the carcass measurement? How to make sure, that the bacteria you recover were really adherent and were not leftover from the hemolymph?

      It is not possible to prevent dissemination as we cannot fix the tissues and bacteria if we make cfu counts. However, the finding that bacteria are found in the hemolymph only for the first three days, with a distinct morphology from tissue-associated bacteria, and not at later time points make us confident that this is not an issue, which suggests that the bacteria are rather tightly attached to the tissues. As regards contamination of tissues by hemolymph, it is also not an issue since the hemolymph titers are so low. However, when the bacteria are actively proliferating to high levels, this is a legitimate concern.

      I am also curious how the differences in the cfu levels between whole fly and carcass can be explained (Fig. 1D). After day 5 there are almost no bacteria left in the hemolymph, however, if you compare cfus in the whole fly vs. the carcass, one can see that the whole fly cfus are rising from day 4 onwards. Where do these bacteria come from if not from the hemolymph?

      To assess the statement of the reviewer, we now have included the numerical values of the medians of the bacterial burdens displayed in Fig. 1D. There is no increased bacterial burden in whole flies between days 5 to 12; however, the titer is increased at days 15 and 22. Whether this slight increase is biologically relevant is questionable given the spread of the data (see also reply to previous point on the precision of measures). We cannot rigorously exclude that there might be a low-level proliferation of colonizing bacteria late in the latent infection, which has been observed in specific conditions of reactivation of dormant bacteria (Lin et al., in preparation).

      Fig. S4D: If the protection to secondary PAO1 infection is not mediated via Imd or phagocytosis, is it mediated via melanization? How do melanization mutants (increased or decreased) respond to PAO1 secondary infection?

      We have performed the experiment (Fig. S6A-B) and found that the protection was abrogated. As noted in the main text, the interpretation is however difficult since the bacteria are no longer in a dormancy state in the PPO mutants.

      Significance

      This study suggests that host factors, particularly specific immune responses, could drive the latent infections. Hence, besides bacterial mechanisms that received significant attention, we should not underestimate the host's contribution to promoting the latent state in bacteria.

    1. Reviewer #1 (Public Review):

      Summary: This work is an extension of their earlier work published in Sci Adv in 2021, wherein they showed that DTD2 deacylates N-ethyl-D-aminoacyl-tRNAs arising from acetaldehyde toxicity. The authors (Kumar et al.) in this study, investigate the role of archaeal/plant DTD2 in the deacylation/detoxification of D-Tyr-tRNATyr modified by multiple other aldehydes and methylglyoxal (produced by plants). Importantly, the authors take their biochemical observations to plants, to show that deletion of DTD2 gene from a model plant (Arabidopsis thaliana) makes them sensitive to the aldehyde supplementation in the media especially in the presence of D-Tyr. These conclusions are further supported by the observation that the model plant shows increased tolerance to the aldehyde stress when DTD2 is overproduced from the CaMV 35S promoter. The authors propose a model for the role of DTD2 in the evolution of land plants. Finally, the authors suggest that the transgenic crops carrying DTD2 may offer a strategy for stress-tolerant crop development. Overall, the authors present a convincing story, and the data are supportive of the central theme of the story.

      Strengths: Data are novel and they provide a new perspective on the role of DTD2, and propose possible use of the DTD2 lines in crop improvement.

      Weaknesses: (a) Data obtained from a single aminoacyl-tRNA (D-Tyr-tRNATyr) have been generalized to imply that what is relevant to this model substrate is true for all other D-aa-tRNAs (term modified aa-tRNAs has been used synonymously with the modified Tyr-tRNATyr). This is not a risk-free extrapolation. For example, the authors see that DTD2 removes modified D-Tyr from tRNATyr in a chain-length dependent manner of the modifier. Why do the authors believe that the length of the amino acid side chain will not matter in the activity of DTD2? (b) While the use of EFTu supports that the ternary complex formation by the elongation factor can resist modifications of L-Tyr-tRNATyr by the aldehydes or other agents, in the context of the present work on the role of DTD2 in plants, one would want to see the data using eEF1alpha. This is particularly relevant because there are likely to be differences in the way EFTu and eEF1alpha may protect aminoacyl-tRNAs (for example see description in the latter half of the article by Wolfson and Knight 2005, FEBS Letters 579, 3467-3472).

      Note added after revision: The authors have addressed all my concerns by doing additional experiments and by providing convincing arguments. I am happy to conclude that all my concerns on the weaknesses of the work have been nicely addressed. The already convincing story is now stronger.

    1. Author Response

      The following is the authors’ response to the original reviews.

      The reviewers make some suggestions aimed towards increasing the clarity of the manuscript, and I suggest that the authors examine those carefully. In particular, the figure is difficult to read and could contain additional information to help the reader's interpretation. For example, Reviewer 1 suggests including sample age estimates alongside depth, while Reviewer 3 also notes that there is missing information in the figure. Apart from the figure, Reviewer 1 suggests two additional analysis to help explain the amount of mammoth DNA recovered, which they observe is much higher than previous similar investigations. This would seem to be an important issue to address, given the surprising nature of the findings. In addition to this larger issue, the Reviewer makes a few important suggestions for supplementary material that may be needed to support the authors' statements.

      Some additional recommended edits -- in particular to the text and included references to related studies -- are suggested by Reviewers 2 and 3, and both commented on the lack of a publicly-available data repository. The authors may also wish to comment on or revisit their differential treatment of wooly mammoth vs. wooly rhinoceros samples, though I suspect this has more to do with low read numbers for the rhinos.

      Thank you very much for the positive assessment of our manuscript and clear suggestions for revision. We address these points below.

      Reviewer #1 (Recommendations For The Authors):

      I have a few suggestions that might further improve the manuscript:

      It is difficult for the reader to follow which core slices exactly have been sampled and sequenced. The authors mention 23 samples were taken from core LK-001 and 16 samples from core LK-007. From the text it remains unclear to me what the exact age of each of these samples is. Figure 1 shows the depth at which the LK-001 core was sampled, maybe sample age estimates could be included here.

      Thanks for pointing this out. We have added approximate ages to Figure 1, added the depth range to the text (“from 1.5 to 80 cm”; l. 73-74, caption Figure 1), and reworked the table of the sampling depths in the supplement.

      Line 84-87. The authors mention the retrieval of DNA from several expected Arctic taxa, however no further data regarding these findings is given in the manuscript. It would be useful to report the same numbers for these species as the ones given for the Mammuthus and woolly rhinoceros, which would allow for a comparison of the relative abundance of the DNA between these species. Are the expected Arctic species for instance at much higher (DNA) abundance in the samples? It would also be interesting to know if the authors discovered DNA from extant species that are unlikely to have occurred in the geographic region. A (supplementary)table listing the number of mapped reads to each of the respective mitogenomes for each sequence library would be useful for the reader.

      We added a supplementary table (S8) indicating the numbers of reads assigned to mammals.

      Line 90: I am somewhat amazed by the amount of mammoth DNA the authors recovered from these cores. A total depth of over 400X of the mitogenome is quite extraordinary and I am not aware of any ancient sediment study to date that has retrieved a similar amount of data. For instance, the Wang et al. 2021 paper, which the authors cite, sequenced over 400 samples and did not find any mammoth DNA in 70% of those. For the 30% of samples showing signs of mammoth DNA they retrieved on average 530 sequence reads. In this study the authors find on average ~20.000 reads, in 22 out of the 23 sequence libraries. This makes me wonder if the way the mapping was performed has been too lenient, resulting in possible spurious mappings? To really confirm the authenticity of the mammoth (and woolly rhino data) I would suggest two additional analysis:

      1) Mapping all the sequence libraries to a reference consisting of the complete Asian-elephant genome (for instance https://www.ncbi.nlm.nih.gov/datasets/genome/GCF_024166365.1/), the complete human genome (+mitogenome) and the Asian elephant mitogenome. This could possibly reduce spurious mappings as conserved regions between the genomes are filtered out and could also reduce the possible mapping of NUMTS. If the authors could show that after such a mapping approach a significant number of reads are still assigned to the Asian elephant part (including the mitogenome) of the reference, the reported findings would be strengthened.

      2) I also suggest to construct a mitochondrial haplotype network from the obtained DNA, while also including previously published Asian and African elephants as well as previously published mammoth mitogenomes. If the obtained haplotypes indeed show that they cluster within the known haplotype diversity of mammoth, that would be strong support for the authenticity of the data

      The same analysis could be considered for the woolly rhino data, although the lower read numbers might make this analysis challenging.

      We agree that the amount of mammoth DNA is surprising, which is why we opted for further laboratory experiments for confirmation of the hybridization capture results of the first core, i.e., 1) DNA extraction from a second core of a different lake, 2) a quantitative PCR approach (ddPCR), and 3) metabarcoding. Our results of the highly specific ddPCR and metabarcoding assays confirmed considerable amounts of mammoth DNA in two sediment cores of different lakes, thus we have no doubts regarding the authenticity of the data. Considering the large amount of mammoth DNA, the high number of reads, and particularly the high mitogenome coverage, we argue that the effect of some spurious mapping is negligible and does not affect the main outcome and conclusions of our study. Although we agree that a haplotype network would be interesting, such analyses would stretch beyond the focus of this publication.

      Line 91: The authors mention negative controls (extraction and library blanks) did not produce any reads assigned to mammals. This is quite remarkable, as in my experience low levels of (human)contamination are almost always present in the blanks. Could the authors comment on why they think the blanks did not show any signal of mammalian DNA?

      The hybridization capture enrichment and the filtration and mapping procedures likely eliminated human contamination. Also, the data were mapped against Arctic mammal mitogenomes, which did not include human reference sequences. However, six of the sediment samples contained human sequences (now shown in supplementary table S8), albeit at low read counts (mean = 65)

      Line 97: "mapping suggested that the sequences throughout the core originated from multiple individuals" The authors do not provide any supporting data showing this. I think that an analysis (for instance based on allele frequencies) has to be included in manuscript to support this claim.

      We agree that his claim was not sufficiently supported. We performed further analyses including genomic data of previously retrieved mammoth remains and assigned our data to these haplogroups; the results were added to the main text and are shown as a figure (Fig. 2).

      Line 98: "Signatures of post-mortem DNA decay were comparably minor."

      Do the authors know if the used hybridisation enrichment method can distort the measurement of post-mortem damage? Are for instance reads with C-T substitutions less likely to be captured by the baits?

      To our knowledge, there is no study suggesting that damaged sites are less likely to be captured. In general, the hybridization capture procedure is not overly specific, and studies report that DNA is readily and preferentially captured as long as the difference between baits and DNA is not above 10%.

      Line 100: "The proportions of bases did not suggest a substantial deviation from those in the reference genomes or in the closest extant relative of Mammuthus, the Asian elephant (Elephas maximus)."

      It is not clear to me what the authors mean by this. Could the authors explain how this was measured and what their interpretation of this result is?

      We realize that the sentence was unclear. We meant that the nucleotide composition was similar to that of the reference genomes or the closest extant relative. However, as we do not consider this important for the argument, we have removed this sentence from the manuscript.

      Given the high number of recovered mammoth reads in the samples, it would be interesting to know how much mammoth reads are present in the sample before enrichment capture with the baits. Shotgun sequencing the raw extract of one of the samples with the highest number of mammoth reads might allow for a rough estimate of mammoth DNA abundance compared to the other extant species (e.g. reindeer, Arctic lemming and hare) found in the sample(s). This could give further clarification about the extent of stratigraphy disturbance and its overall effect on the DNA based community reconstruction. However, this is just a suggested additional analysis and not something I believe crucial for supporting the overall findings in this manuscript.

      We fully agree that this would be a highly interesting and informative additional analysis to perform. It was, however, not possible to perform this additional analyses in the course of the current experiments.

      Finally, I could not find a public link to the (sequence)data produced in this study. I strongly encourage the authors to make their data publicly available.

      Thank you for pointing this out. We have added a Data Availability paragraph, including the respective reference.

      Reviewer #2 (Recommendations For The Authors):

      In the Discussion it is mentioned that the reasons for Mammoth extinction are not entirely clear but are largely attributed to sudden climate warming (and add some relevant citations). However, there is also abundant literature that suggest humans also played a role in their extinction (for instance, a recent one, Damien et al. (2022) at Ecology Letters 25: 127-137).

      We agree with the reviewer and have added some the recent citation highlighting the possible influence of humans.

      One possibility to add further interest to this paper would be to conduct a phylogenetic tree with the Mammoth mitogenome(s) retrieved and a reference dataset; it could be interesting to know where do they fall in the phylogeny -already abundant with tens of individuals- and maybe it could be even possible to roughly estimate their date. There are some papers that report many Mammoth mitogenomes, including of course some from Siberia; for instance Chang et al. (2017) at Sci Reports and also Fellow Yates et al. (2017) also at Sci Reports (the latter mainly from Central Europe).

      We are well aware of the amount of mt genomes available for mammoth, and such an analyses would be an interesting addition, potentially also offering the possibility to date the DNA. However, the analyses was hampered and would be less secure for this dataset, as our sequences display quite some variation among each other, suggesting that we have a mix of multiple mt genomes, which we cannot readily distinguish. We thus refrain from this, also because we instead provide multiple lines of evidence for the existence of the mammoth DNA in the surface sediment core (metabarcoding, ddPCR).

      Minor points:

      -Correct wooly to woolly

      Revised.

      -In the sampling description it is not totally clear if the samples were taken at 1 cm each (it is mentioned that core LK-001 is sliced in the field at 1-cm steps for radiometric dating and later it is explained that 23 samples were analyzed from this core, but it is unclear if they represent 23 cm of core)

      -Maybe the authors could briefly define some terms such as "talik"

      Revised.

      Reviewer #3 (Recommendations For The Authors):

      Maybe I missed this but I could not find a data availability statement or the location of the repository

      We have added a Data Availability paragraph, including the respective reference.

      It would be good to see some additional analysis on the distribution of the woolly rhinoceros DNA through the sediment core - like the figure for the mammoth i.e read numbers vs depth.

      We have added to the supplements a table showing the numbers of assigned mammal reads over the core depths (Table S8). However, as rhinoceros reads are considerable rarer in our results, we did not produce a figure.

      Would it be possible to be more explicit about the multiple mammoth individuals, could you calculate a minimum number or haplotypes for example.

      We agree that his claim was not sufficiently supported and added results from additional analyses (incl. Fig. 2). Please see our response above.

      Based on the aim stated in the introduction, the analysis of the Arctic biodiversity of this area is missing, it would be nice to see these result added or maybe the focus needs to be changed for clarity.

      We now explicitly state that this objective pertains to a different study, which is currently still in preparation for publication.

      The single main figure needs a bit more consideration. For example in panel A - there was no information on the transformation performed or what the general trend line refers to. Do the results in panel B refer to all 22 libraries? What is the x-axis in Panel C and what do the coloured lines refer to? Additionally, I think the figure needs to be in higher resolution with increased text size on all axes.

      We revised the figure and the caption for clarity and readability.

      Finally this might be an accidental typo - but when referring to the sample aged at around 8,677 years in text it states this the 36.5 cm sample (line 130 and 192), but the supplementary says this is the 51cm sample (Table S6). This would maybe impact potential conclusions. Would you be able to clarify this.

      Thank you for noting this error, we revised it.

  2. Jan 2024
    1. B A C K T O W A R D S L A V E R Y

      Based on Du Bois’s chapter, provide three examples of post-Civil War lawlessness and anti-Black violence, disenfranchisement, and economic exploitation between 1865 and 1877.

      What are some of the major problems with the Dunning school interpretation of Reconstruction?

      What does it overlook and misrepresent?

    1. Author Response

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public Review):

      The present study provides a phylogenetic analysis of the size prefrontal areas in primates, aiming to investigate whether relative size of the rostral prefrontal cortex (frontal pole) and dorsolateral prefrontal cortex volume vary according to known ecological or social variables.

      I am very much in favor of the general approach taken in this study. Neuroimaging now allows us to obtain more detailed anatomical data in a much larger range of species than ever before and this study shows the questions that can be asked using these types of data. In general, the study is conducted with care, focusing on anatomical precision in definition of the cortical areas and using appropriate statistical techniques, such as PGLS. That said, there are some points where I feel the authors could have taken their care a bit further and, as a result, inform the community even more about what is in their data.

      We thank the reviewer for this globally positive evaluation of our work, and we appreciate the advices to improve our manuscript.

      The introduction sets up the contrast of 'ecological' (mostly foraging) and social variables of a primate's life that can be reflected in the relative size of brain regions. This debate is for a large part a relic of the literature and the authors themselves state in a number of places that perhaps the contrast is a bit artificial. I feel that they could go further in this. Social behavior could easily be a solution to foraging problems, making them variables that are not in competition, but simply different levels of explanation. This point has been made in some of the recent work by Robin Dunbar and Susanne Shultz.

      Thank you for this constructive comment, and we acknowledge that the contrast between social vs ecological brain is relatively marginal here. Based also on the first remark by reviewer 3, we have reformulated the introduction to emphasize what we think is actually more critical: the link between cognitive functions as defined in laboratory conditions and socio-ecological variables measured in natural conditions. And the fact that here, we use brain measures as a potential tool to relate these laboratory vs natural variables through a common scenario. Also, we were already mentioning the potential interaction between social and foraging processes in the discussion, but we are happy to add a reference to recent studies by S. Shultz and R. Dunbar (2022), which is indeed directly relevant. We thank the reviewer for pointing out this literature.

      In a similar vein, the hypotheses of relating frontal pole to 'meta-cognition' and dorsolateral PFC to 'working memory' is a dramatic oversimplification of the complexity of cognitive function and does a disservice to the careful approach of the rest of the manuscript.

      We agree that the formulation of which functions we were attributing to the distinct brain regions might not have been clear enough, but the functional relation between frontal pole and metacognition in the one hand, and DLPFC and working memory on the other hand, have been firmly established in the literature, both through laboratory studies and through clinical data. Clearly, no single brain region is necessary and sufficient for any cognitive operation, but decades of neuropsychology have demonstrated the differential implication of distinct brain regions in distinct functions, which is all we mean here. We have made a specific point on that topic in the discussion (cf p. 16). We have also reformulated the introduction to clarify that, even if the relation between these regions and their functions (FP/ metacognition; DLPFC/ working memory) was clear in laboratory conditions, it was not clear whether this mapping could be used for real life conditions. And therefore whether that simplification was somehow justified beyond the lab (and the clinics), and whether these neuro-cognitive concepts could be applied to natural conditions, are indeed critical questions that we wanted to address. The central goal of the present study was precisely to evaluate the extent to which this brain/cognition relation could be used to understand more natural behaviors and functions, and we hope that it appears more clearly now.

      One can also question the predicted relationship between frontal pole meta-cognition and social abilities versus foraging, as Passingham and Wise show in their 2012 book that it is frontal pole size that correlates with learning ability-an argument that they used to relate this part of the brain to foraging abilities. I would strongly suggest the authors refrain from using such descriptive terms. Why not simply use the names of the variables actually showing significant correlations with relative size of the areas?

      We basically agree with the reviewer, and we acknowledge the lack of clarity in the introduction of the previous manuscript. There were indeed lots of ambiguity in what we were referring to as ‘function’, associated with a given brain region. « Function » referred to way to many things! We have reformulated the introduction not only to clarify the different types of functions that were attributed to distinct brain regions in the literature but also to clarify how this study was addressing the question: by trying to articulate concepts from neuroscience laboratory studies with concepts from behavioral ecology and evolution using intuitive scenarios. We hope that the present version of the introduction makes that point clearer.

      The major methodological judgements in this paper are of course in the delineation of the frontal pole and dorsolateral prefrontal cortex. As I said above, I appreciate how carefully the authors describe their anatomical procedure, allowing researchers to replicate and extend their work. They are also careful not to relate their regions of interest to precise cytoarchitectonic areas, as such a claim would be impossible to make without more evidence. That said, there is a judgement call made in using the principal sulcus as a boundary defining landmark for FP in monkeys and the superior frontal sulcus in apes. I do not believe that these sulci are homologous. Indeed, the authors themselves go on to argue that dorsolateral prefrontal cortex, where studied using cytoarchitecture, stretches to the fundus of principal sulcus in monkeys, but all the way to the inferior frontal sulcus in apes. That means that using the fundus of PS is not a good landmark.

      We thank the reviewer for his kind remarks on our careful descriptions. But then, it is not clear whether our choice of using the principal sulcus as a boundary for FP in monkeys vs the superior frontal sulcus in apes is actually a judgement call. First, and foremost, there is no clear and unambiguous definition of what should be the boundaries of the FP. By contrast with cytoarchitectonic maps, but clearly this is out of reach here. In humans and great apes we used Bludau et al 2014 (i.e. sup frontal sulcus), and in monkeys, we chose a conservative landmark that eliminated area 9, which is traditionally associated with the DLPFC (Petrides, 2005; Petrides et al, 2012; Semendeferi et al, 2001).

      Of course, any definition will attract criticism, so the best solution might be to run the analysis multiple times, using different definitions for the areas, and see how this affects results.

      Indeed, functional maps indicate that dorsal part of anterior PFC in monkeys is functionally part of FP. But again, cytoarchitectonic maps also indicate that this part of the brain includes BA 9, which is traditionally associated with DLPFC (Petrides, 2005; Petrides et al, 2012). As already pointed out in the discussion, there is a functional continuum between FP and DLPFC and our goal when using PS as dorsal border was to be very conservative and to exclude the ambiguous area. But we agree with the reviewer that given that this decision is arbitrary, it was worth exploring other definitions of the FP volume. So, we did complete a new analysis with a less conservative definition of the FP, to include this ambiguous dorsal area, and it is now included in the supplementary material. Maybe as expected, including the ambiguous area in the FP volume shifted the relation with socio-ecological variables towards the pattern displayed by the DLPFC (ie the influence of population density decreased). The most parsimonious interpretation of this results is that when extending the border of the FP region to cover a part of the brain which might belong to the DLPFC, or which might be somehow functionally intermediate between the 2, the specific relation of the FP with socio-ecological variables decreases. Thus, even if we agree that it was important to conduct this analysis, we believe that it only confirms the difficulty to identify a clear boundary between FP and DLPFC. Again, we have clearly explained throughout the manuscript that we admit the lack of precision in our definitions of the functional brain regions. In that frame, the conservative option seems more appropriate and for the sake of clarity, the results of the additional analysis of a FP volume that includes the ambiguous area is only included in the supplementary material.

      If I understand correctly, the PGLS was run separately for the three brain measure (whole brain, FP, DLPFC). However, given that the measures are so highly correlated, is there an argument for an analysis that allows testing on residuals. In other words, to test effects of relative size of FP and DLPFC over and above brain size?

      Generally, using residuals as “data” (or pseudo-data) is not recommended in statistical analyses. Two widely cited references from the ecological literature are:

      Garcia-Berthou E. (2001) On the Misuse of Residuals in Ecology: Testing Regression Residuals vs. the Analysis of Covariance. Journal of Animal Ecology, 70 (4): 708-711.

      Freckleton RP. (2002). On the misuse of residuals in ecology: regression of residuals vs. multiple regression. Journal of Animal Ecology 71: 542–545. https://doi.org/10.1046/ j.1365-2656.2002.00618.x.

      The main reason for this recommendation is that residuals are dependent on the fitted model, and thus on the particular sample under consideration and the eventual significant effects that can be inferred.

      In the discussion and introduction, the authors discuss how size of the area is a proxy for number of neurons. However, as shown by Herculano-Houzel, this assumption does not hold across species. Across monkeys and apes, for instance, there is a different in how many neurons can be packed per volume of brain. There is even earlier work from Semendeferi showing how frontal pole especially shows distinct neuron-to-volume ratios.

      We appreciate the reviewer’s comment, but the references to Herculano-Houzel that we have in mind do indicate that the assumption is legitimate within primates.

      Herculano-Houzel et al (2007) show that the neuronal density of the cortex is well conserved across primate species (but only monkeys were studied). The conclusion of that study is that using volumes as a proxy for number of neurons, as a measure of computational capacity, should be avoided between rodents and primates (and as they showed later, even more so with birds, for which neuronal density is higher). BUT within primates, since neuronal densities are conserved, volume is a good predictor of number of neurons. Gabi et al (2016) provide evidence that the neuronal density of the PFC is well conserved between humans and non-human primates, which implies that including humans and great apes in the comparison is legitimate. In addition, the brain regions included in the analysis presumably include very similar architectonic regions (e.g. BA 10 for FP, BA 9/46 for DLPFC), which also suggests that the neuronal density should be relatively well conserved across species. Altogether, we believe that there is sufficient evidence to support the idea that the volume of a PFC region in primates is a good proxy for the number of neurons in that region, and therefore of its computational capacity.

      Semendeferi and colleagues (2001) pointed out some differences in cytoarchitectonic properties across parts of the FP and discussed how these properties could 1) be used to identify area 10 across species 2) be associated with distinct computational properties, with the idea that thicker ‘cell body free’ layers would leave more space for establishing connections (across dendrites and axons). This pioneering work, together with more recent imaging studies on functional connectivity (e.g. Sallet et al, 2013) emphasize the critical contribution of connectivity pattern as a tool for comparative anatomy. But unfortunately, as pointed out in the discussion already, this is currently out of reach for us.

      We acknowledge the limitations, and to be fair, the notion of computational capacity itself is hard to define operationally. Based on the work of Herculano-Houzel et al, average density is conserved enough across primates (including humans) to justify our approximation. We have tried to define our regions of interest using both anatomical and functional maps and, thanks to the reviewer’s suggestions, we even tried several ways to segment these regions. Functional maps in macaques and humans do not exactly match cytoarchitectonic maps, presumably because functions rely not only upon the cytoarchitectonics but also on connectivity patterns (e.g. Sallet et al, 2013).

      In sum, we appreciate the reviewer’s point but feel that, given the current understanding of brain functions and the relative conservation of neuronal density across primate PFC regions, the volume of a PFC region seems to be reasonable proxy for its number of neurons, and therefore its computational capacity. We have added these points to the discussions, and we hope that the reader will be able to get a fair sense of how legitimate is that position, given the literature.

      Overall, I think this is a very valuable approach and the study demonstrates what can now be achieved in evolutionary neuroscience. I do believe that they authors can be even more thorough and precise in their measurements and claims.

      Reviewer #2 (Public Review):

      In the manuscript entitled "Linking the evolution of two prefrontal brain regions to social and foraging challenges in primates" the authors measure the volume of the frontal pole (FP, related to metacognition) and the dorsolateral prefrontal cortex (DLPFC, related to working memory) in 16 primate species to evaluate the influence of socio-ecological factors on the size of these cortical regions. The authors select 11 socio-ecological variables and use a phylogenetic generalized least squares (PGLS) approach to evaluate the joint influence of these socio-ecological variables on the neuro-anatomical variability of FP and DLPFC across the 16 selected primate species; in this way, the authors take into account the phylogenetic relations across primate species in their attempt to discover the influence of socio-ecological variables on FP and DLPF evolution.

      The authors run their studies on brains collected from 1920 to 1970 and preserved in formalin solution. Also, they obtained data from the Mussée National d´Histoire Naturelle in Paris and from the Allen Brain Institute in California. The main findings consist in showing that the volume of the FP, the DLPFC, and the Rest of the Brain (ROB) across the 16 selected primate species is related to three socio-ecological variables: body mass, daily traveled distance, and population density. The authors conclude that metacognition and working memory are critical for foraging in primates and that FP volume is more sensitive to social constraints than DLPFC volume.

      The topic addressed in the present manuscript is relevant for understanding human brain evolution from the point of view of primate research, which, unfortunately, is a shrinking field in neuroscience.

      We must not have been clear enough in our manuscript, because our goal is precisely not to separate humans from other primates. This is why, in contrast to other studies, we have included human and non-human primates in the same models. If our goal had been to study human evolution, we would have included fossil data (endocasts) from the human lineage.

      But the experimental design has two major weak points: the absence of lissencephalic primates among the selected species and the delimitation of FP and DLPFC. Also, a general theoretical and experimental frame linking evolution (phylogeny) and development (ontogeny) is lacking.

      We admit that lissencephalic species could not be included in this study because we use sulci as key landmarks. We believe that including lissencephalic primates would have introduced a bias and noise in our comparisons, as the delimitations and landmarks would have been different for gyrencephalic and lissencephalic primates. Concerning development, it is simply beyond the scope of our study.

      Major comments.

      1) Is the brain modular? Is there modularity in brain evolution?: The entire manuscript is organized around the idea that the brain is a mosaic of units that have separate evolutionary trajectories:

      "In terms of evolution, the functional heterogeneity of distinct brain regions is captured by the notion of 'mosaic brain', where distinct brain regions could show a specific relation with various socio-ecological challenges, and therefore have relatively separate evolutionary trajectories".

      This hypothesis is problematic for several reasons. One of them is that each evolutionary module of the brain mosaic should originate in embryological development from a defined progenitor (or progenitors) domain [see García-Calero and Puelles (2020)]. Also, each evolutionary module should comprise connections with other modules; in the present case, FP and DLPFC have not evolved alone but in concert with, at least, their corresponding thalamic nuclei and striatal sector. Did those nuclei and sectors also expand across the selected primate species? Can the authors relate FP and DLPFC expansion to a shared progenitor domain across the analyzed species? This would be key to proposing homology hypotheses for FP and DLPFC across the selected species. The authors use all the time the comparative approach but never explicitly their criteria for defining homology of the cerebral cortex sectors analyzed.

      We do not understand what the referee is referring to with the word ‘module’, and why it relates to development. Same thing for the anatomical relation with subcortical structures. Yes, the identity of distinct functional cortical regions relies upon subcortical inputs during development, but clearly this is neither technically feasible, nor relevant here anyways.

      We acknowledge, however, that our definition of functional regions was not precise enough, and we have updated the introduction to clarify that point. In short, we clearly do not want to make a strong case for the functional borders that we chose for the regions of interest here (FP and DLPFC), but rather use those regions as proxies for their corresponding functions as defined in laboratory conditions for a couple of species (rhesus macaques and humans, essentially).

      Contemporary developmental biology has showed that the selection of morphological brain features happens within severe developmental constrains. Thus, the authors need a hypothesis linking the evolutionary expansion of FP and DLPFC during development. Otherwise, the claims form the mosaic brain and modularity lack fundamental support.

      Once again, we do not think that our definition of modules matches what the reviewer has in mind, i.e. modules defined by populations of neurons that developed together (e.g. visual thalamic neurons innervating visual cortices, themselves innervating visual thalamic neurons). Rather, the notion of mosaic brain refers to the fact that different parts of the brain are susceptible to distinct (but not necessarily exclusive) sources of selective pressures. The extent to which these ‘developmental’ modules are related to ‘evolutionary’ modules is clearly beyond the scope of this paper.

      Our goal here was to evaluate the extent to which modules that were defined based on cognitive operations identified in laboratory conditions could be related (across species) to socio-ecological factors as measured in wild animals. Again, we agree that the way these modules/ functional maps were defined in the paper were confusing, and we hope that the new version of the manuscript makes this point clearer.

      Also, the authors refer most of the time to brain regions, which is confusing because they are analyzing cerebral cortex regions.

      We do not understand why the term ‘brain’ is more confusing than ‘cerebral cortex’, especially for a wide audience.

      2) Definition and delimitation of FP and DLPFC: The precedent questions are also related to the definition and parcellation of FP and DLPFC. How homologous cortical sectors are defined across primate species? And then, how are those sectors parcellated?

      The authors delimited the FP:

      "...according to different criteria: it should match the functional anatomy for known species (macaques and humans, essentially) and be reliable enough to be applied to other species using macroscopic neuroanatomical landmarks".

      There is an implicit homology criterion here: two cortical regions in two primate species are homologs if these regions have similar functional anatomy based on cortico-cortical connections. Also, macroscopic neuroanatomical landmarks serve to limit the homologs across species.

      This is highly problematic. First, because similar function means analogy and not necessarily homology [for further explanation see Puelles et al. (2019); García-Cabezas et al. (2022)].

      We are not sure to follow the Reviewer’s point here. First, it is not clear what would be the evolutionary scenario implied by this comment (evolutionary divergence followed by reversion leading to convergence?). Second, based on the literature, both the DLPFC and the FP display strong similarities between macaques and humans, in terms of connectivity patterns (Sallet et al, 2013), in terms of lesion-induced deficit and in terms of task-related activity (Mansouri et al, 2017). These criteria are usually sufficient to call 2 regions functionally equivalent. We do not see how this explanation is "highly problematic" as it is clearly the most parsimonious based on our current knowledge.

      Second, because there are several lissencephalic primate species; in these primates, like marmosets and squirrel monkeys, the whole approach of the authors could not have been implemented. Should we suppose that lissencephalic primates lack FP or DLPFC?

      We understand neither the reviewer’s logic, nor the tone. We understand that the reviewer is concerned by the debate on whether some laboratory species are more relevant than others for studying the human prefrontal cortex, but this is clearly not the objective of our work. As explained in the manuscript, we identified FP and DLPFC based on functional maps in humans and laboratory monkeys (macaques), and we used specific gyri as landmarks that could be reliably used in other species. And, as rightfully pointed out by reviewer 1, this is in and off itself not so trivial. Of course, lissencephalic animals could not be studied because we could not find these landmarks, but why would it mean that they do not have a prefrontal cortex? The reviewer implies that species that we did not study do not have a prefrontal cortex, which makes little sense. Standards in the field of comparative anatomy of the PFC, especially when it implies rodents (lissencephalic also) include cytoarchitectonic and connectivity criteria, but obviously we are not in a position to address it here. We have, however, included references to the seminal work of Angela Roberts and collaborator in the discussion on marmosets prefrontal functions, to reinforce the idea that the functional organization is relatively well conserved across all primates (with or without gyri on their brain) (Dias et al, 1996; Roberts et al, 2007).

      Do these primates have significantly more simplistic ways of life than gyrencephalic primates? Marmosets and squirrel monkeys have quite small brains; does it imply that they have not experience the influence of socio-ecological factors on the size of FP, DLPFC, and the rest of the brain?

      Again, none of this is relevant here, because we could not draw conclusions on species that we cannot study for methodological reasons. The reviewer seems to believe that an absence of evidence is equivalent to an evidence of absence, but we do not.

      The authors state that:

      "the strong development of executive functions in species with larger prefrontal cortices is related to an absolute increase in number of neurons, rather than in an increase in the ration between the number of neurons in the PFC vs the rest of the brain".

      How does it apply to marmosets and squirrel monkeys?

      Again, we do not understand the reviewer’s point, since it is widely admitted that lissencephalic monkeys display both a prefrontal cortex and executive functions (again, see the work of Angela Roberts cited above). Our goal here was certainly not to get into the debate of what is the prefrontal cortex in a handful of laboratory species, but to evaluate the relevance of laboratory based neuro-cognitive concepts for understanding primates in general, and in their natural environment.

      References:

      García-Cabezas MA, Hacker JL, Zikopoulos B (2022) Homology of neocortical areas in rats and primates based on cortical type analysis: an update of the Hypothesis on the Dual Origin of the Neocortex. Brain structure & function Online ahead of print. doi:doi.org/ 10.1007/s00429-022-02548-0

      García-Calero E, Puelles L (2020) Histogenetic radial models as aids to understanding complex brain structures: The amygdalar radial model as a recent example. Front Neuroanat 14:590011. doi:10.3389/fnana.2020.590011

      Nieuwenhuys R, Puelles L (2016) Towards a New Neuromorphology. doi:10.1007/978-3-319-25693-1

      Puelles L, Alonso A, Garcia-Calero E, Martinez-de-la-Torre M (2019) Concentric ring topology of mammalian cortical sectors and relevance for patterning studies. J Comp Neurol 527 (10):1731-1752. doi:10.1002/cne.24650

      Reviewer #3 (Public Review):

      This is an interesting manuscript that addresses a longstanding debate in evolutionary biology - whether social or ecological factors are primarily responsible for the evolution of the large human brain. To address this, the authors examine the relationship between the size of two prefrontal regions involved in metacognition and working memory (DLPFC and FP) and socioecological variables across 16 primate species. I recommend major revisions to this manuscript due to: 1) a lack of clarity surrounding model construction; and 2) an inappropriate treatment of the relative importance of different predictors (due to a lack of scaling/normalization of predictor variables prior to analysis). My comments are organized by section below:

      We thank the reviewer for the globally positive evaluation and for the constructive remarks. Introduction:

      • Well written and thorough, but the questions presented could use restructuring.

      Again, we thank the reviewer, and we believe that this is coherent with some of the remarks of reviewer 1. We have extensively revised the introduction, toning down the social vs ecological brain issue to focus more on what is the objective of the work (evaluating the relevance of lab based neuro-cognitive concepts for understanding natural behavior in primates).

      Methods:

      • It is unclear which combinations of models were compared or why only population density and distance travelled tested appear to have been included.

      The details of the model comparison analysis were presented as a table in the supplementary material (#3, details of the model comparison data), but we understand that this was not clear enough. We have provided more explanation both in the main manuscript and in the supplements. All variables were considered a priori; however, we proceeded beforehand to an exploratory analyses which led us to exclude some variables because of their lack of resolution (not enough categories for qualitative variables) or strong cross-correlations with other quantitative variables. There were much more than three variables included in the models but the combination of these 3 (body mass, daily traveled distance and population density) best predicted (had the smallest AIC) the size of the brain regions. We provide additional information about these exploratory analyses in the supplementary material, sections 2 and 3.

      • Brain size (vs. body size) should be used as a predictor in the models.

      We do not understand the theoretical reason for replacing body size by brain size in the models. Brain size is not a socio-ecological variable. And of course, that would be impossible for modeling brain size itself. Or is it that the reviewer suggests to use brain size as a covariate to evaluate the effects of other variables in the model over and above the effect on brain size? But what is the theoretical basis for this?

      • It is not appropriate to compare the impact of different predictors using their coefficients if the variables were not scaled prior to analysis.

      We thank the Reviewer for this comment; however, standardized coefficients are not unproblematic because their calculations are based on the estimated standard-deviations of the variables which are likely to be affected by sampling (in effect more than the means). We note that the methods of standardized coefficients have attracted several criticisms in the literature (see the References section in https://en.wikipedia.org/wiki/Standardized_coefficient). Nevertheless, we now provide a table with these coefficients which makes an easy comparison for the present study. We also updated tables 1, 2 and 3 to include standardized beta values.

      Reviewer #1 (Recommendations For The Authors):

      N/A

      Reviewer #2 (Recommendations For The Authors):

      Contemporary developmental biology has showed that the brain of all mammals, including primates, develops out of a bauplan (or blueprint) made of several fundamental morphological units that have invariant topological relations across species (Nieuwenhuys and Puelles 2016).

      At some point in the discussion the authors acknowledge that:

      "Our aim here was clearly not to provide a clear identification of anatomical boundaries across brain regions in individual species, as others have done using much finer neuroanatomical methods. Such a fine neuroanatomical characterization appears impossible to carry on for a sample size of species compatible with PGLS".

      I do not think it would be impossible to carry such neuroanatomical characterization. It would take time and effort, but it is feasible. Such characterization, if performed within the framework of contemporary developmental biology, would allow for well-founded definition and delineation of cortical sectors across primate species, including lissencephalic ones, and would allow for meaningful homologies and interspecies comparisons.

      We do not see how our work would benefit from developmental biology at that point, because it is concerned with evolution, and these are very distinct biological phenomena. We do not understand the reviewer’s focus on lissencephalic species, because they are not so prevalent across primates, and it is unlikely that adding a couple of lissencephalic species will change much to the conclusions.

      Minor points:

      • Please, format references according to the instructions of the journal.

      Ok - done

      • The authors could use the same color code across Figures 1, 2, and 3.

      Ok – done

      • The authors say that group hunting "only occurs in a few primate species", but it also occurs in wolves, whales, and other mammalian species.

      We focus on primates here, these other species are irrelevant. Again, this is beside the point.

      Reviewer #3 (Recommendations For The Authors):

      My comments are organized by section below:

      Introduction:

      • Well written and thorough

      • The two questions presented towards the end of the intro are not clear and do not guide the structure of the methods/results sections. I believe one it would be more appropriate to ask if: 1) the relative proportions of the FP and DLPFC (relative to ROB) are consistent across primates; and 2) if the relative size of these region is best predicted by social and/ or ecological variables. Then, the results sections could be organized according to these questions (current results section 1 = 1; current results sections 2, 3, 4 = 2.1, 2.2, 2.3)

      As explained above, we agree with the reviewer that the introduction was somehow misleading and we have edited it extensively. We do not, however, agree with the reviewer regarding the relative (vs absolute) measure. We have discussed this in our response to reviewer 1 regarding the comparison of regional volumes as proxies for number of neurons. The best predictor of the computing capacity of a brain region is its number of neurons, but there is no reason to believe that this capacity should decrease if the rest of the brain increases, as implied by the relative measure that the reviewer proposes. That debate is probably critical in the field of comparative neuroanatomy, and confronting different perspectives would surely be both interesting and insightful, but we feel that it is beyond the scope of the present article.

      Methods:

      • While the methods are straightforward and generally well described, it is unclear which combinations of models were compared or why only population density and distance travelled tested appear to have been included (in e.g., Fig SI 3.1) even though many more variables were collected.

      We agree that this was not clear enough, and we have tried to improve the description of our model comparison approach, both in the main text and in the supplementary material.

      • Why was body mass rather than ROB used as a predictor in the models? The authors should instead/also include analyses using ROB (so the analysis is of FP and DLPFC size relative to brain size). Using body mass confounds the analyses since they will be impacted by differences in brain size relative body size.


      Again, we have addressed this issue above. First, body size is a socio-ecological variable (if anything, it especially predicts energetic needs and energy expenditure), but ROB is clearly not. We do not see the theoretical relevance of ROB in a socio-ecological model. Second, from a neurobiological point of view, since within primates the volume of a given brain region is directly related to its number of neurons (again, see work of Herculano-Houzel), which is a good proxy for its computing capacity, we do not see the theoretical reason for considering ROB.

      • It is not appropriate to compare the impact of different predictors using their coefficients if the variables were not scaled prior to analysis. The authors need to implement this in their approach to make such claims.

      We thank the reviewer again for pointing that out. We have addressed this question above.

      • Differences across primates in terms of frontal lobe networks throughout the brain should be acknowledged (e.g., Barrett et al. 2020, J Neurosci).

      We have added that reference to the discussion, together with other references showing that the difference between human and non-human primates is significant, but essentially quantitative, rather than qualitative (the building blocks are relatively well conserved, but their relative weight differs a lot). Thank you for pointing it out.

      I hope the authors find my comments helpful in revising their manuscript.

      And we thank again the reviewer for the helpful and constructive comments.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This well done and interesting paper examining the connection between TXNIP and GDF15. The main thrust is that TXNIP upregulation chemotherapies, such as Oxa, results in an a down regulation of GDF15 early in tumorigenesis. Later in tumorigenesis, TXNIP upregulation is less pronounced, elevating GFP15 resulting in a blockage of tumor suppressive immune responses. Generally the work is convincing. For example, it's clear that TXNIP is up regulated by Oxa in an ROS and MondoA-dependent manner. Likewise its quite clear TXNIP loss reads to an upregulation of GDF15. However, it's also quite clear that Oxa suppresses GDF15 in a manner that appears to be completely independent of TXNIP. The writing in the paper implies strongly that there is a mechanistic connection between TXNIP and GDF15, but no experiments investigate this possibility.

      We feel this is very fair and is reflective of a) perhaps an overemphasis of the TXNIP knockout observation and supportive tissue data, which suggests a relationship but not a mechanistic understanding b) an underemphasis of the data in Figure 3 that shows a decrease in GDF15 after oxaliplatin treatment in TXNIP knockout lines.

      We have addressed these concerns in several ways:

      1. We have carried out knockdown experiments using siRNA for ARRDC4, which we felt, given its regulation by MondoA and ROS, and homology to TXNIP, may also regulate GDF15. This was found to be the case and may explain the data in Figure 3. At the very least it shows that other factors involved in oxidative stress management may have similar impacts – a form of functional redundancy. Lines 553-559 “Finally, given our previous data (Figure S4) we looked to assess the role of ARRDC4 on GDF15 expression. In the absence of oxaliplatin, knocking down ARRDC4 in DLD1 and HCT15 cells drove an increase in GDF15. When challenged with oxaliplatin, both ARRDC4 and TXNIP expression increased and GDF15 decreased. When the ARRDC4 knockdown was challenged TXNIP increased further and GDF15 decreased further (Figure S6G-J). Given the common regulatory pathways and homology between TXNIP and ARRDC4, and their similar functional roles, we suggest these data are evidence of redundancy within this system. “

      We have included some context in the discussion:

      Lines 930-933: “Further support for both TXNIP and ARRDC4’s role in regulating GDF15 after the induction of ROS comes from a pan cancer meta-analysis assessing the impact of metformin (which has been reported to inhibit ROS) on gene expression. Here the top two downregulated genes were TXNIP and ARRDC4 and the top four upregulated genes were DDIT4, CHD2, ERN1 and GDF1572

      We have tempered the text:

      Lines 522-524 “It is important to note however that here we saw clear evidence that TXNIP was not solely responsible for the downregulation of GDF15 post oxaliplatin treatment, with decreased levels seen in knockout lines (Figure 3C-G, S5E).”

      Lines 926-929 “It must be stressed that these data do not place TXNIP as the sole regulator of GDF15, for example ARRDC4 can also be seen to regulate GDF15. We envisage TXNIP as one of a number of ROS-dependent GDF15 regulators, with this redundancy potential evidence of the importance of this regulatory framework.”

      We have carried out additional analysis detailed in the discussion and in Figure S12 which suggests TXNIP impacts MYC function, as reported elsewhere (detailed below). For ease, the key paper can be accessed through this link https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.3001778

      Lines 934-956: “The main shortcoming of this paper is the lack of mechanistic understanding linking TXNIP to GDF15. There are 650 transcription factors that have been shown, or are predicted, to bind to GDF15 promoter and/or enhancer regions. By assessing our list of differentially expressed genes (Suppl. Table 1-2) for the presence of these factors we identified 6 GDF15 binding TFs that show significantly decreased expression after oxaliplatin treatment in both cell lines (ATF4, MYC, SREBF1, PHB2, HBP1, KLF9). There was only one, MYC, that was downregulated by oxaliplatin treatment (validated; Figure S12A), and with this downregulation partially being rescued in a matched TXNIP knockout line (Figure S12B). We then observed that c-myc has been shown or is predicted to bind to promoter/enhancer regions of the top five transcriptomic and proteomic differentials in TXNIP knockout lines, including TXNIP itself (apart from C16orf90). Even with c-myc’s promiscuity (binds to 10-20% of all promoters/enhancers) this may be suggestive of a specific relationship. Finally, when looking at the correlations between these 6 TFs and TXNIP and GDF15 in the TCGA COAD dataset, MYC has the greatest and most significant negative correlation to TXNIP (r=-0.4631 p=1.42e-28) and the greatest and most significant positive correlation to GDF15 (r=0.4653 p=7.32e-29). ATF4 and PHB2 are the other TFs in the list, that show the same significant trends (Figure S12C), and therefore may play a role in the TXNIP-independent oxaliplatin-dependent regulation of GDF15. Further exploration of these additional TFs is outside the scope of the current manuscript.

      MYC’s role in bridging from TXNIP to GDF15 is further supported by a recent paper which shows that TXNIP is “a broad repressor of MYC genomic binding” and that “TXNIP loss mimics MYC overexpression”73. Furthermore, the inter-dependent regulatory relationship between MondoA, TXNIP, and MYC has been seen in a variety of models74, whilst the impact of NAC on MYC-dependent pathways has been seen in lymphoma75. These studies lend credence to the idea that MYC is the most likely TXNIP-regulated TF that regulates GDF15 in our systems.”

      It seems equally likely that TXNIP and GDF15 represent independent parallel pathways. Even if TXNIP is a direct regulator of GDF15, it's also clear that other "factors" up or down-regulated by Oxa also contribute to the regulation of GDF15. These are not explored and even though TXNIP is highly regulated genes shown Figure 2 that are not identified or discussed that may also be contributing to GDF15 regulation.

      As mentioned above, the new data suggests that at least one other factor, ARRDC4, can regulate GDF15 (changes upon oxaliplatin treatment) and that MYC is a potential mechanistic bridge between TXNIP and GDF15. Whilst assessing for the transcription factor that may link TXNIP and GDF15 we found an additional 5 TXNIP-independent factors (ATF4, PHB2, SREBF1, HBP1, KLF9) that bind to GDF15 promoter/enhancer regions and are downregulated post-oxaliplatin treatment. When looking at correlations between these factors and GDF15 in the TCGA COAD dataset, ATF4 and PHB2 correlate most closely with GDF15 (when removing MYC) and so we would cautiously suggest that these may be the most pertinent. This data is now included.

      Further, the experiments treating PBMCs with conditioned media contain other cytokines/factors, in addition to GDF15, that likely also contribute the observed effects on the different immune cells understudy. The conditioned media from GDF15 knock out cells are a good experiment, but the media is not rigorously tested to see what other cytokines/factors might have also been depleted.

      The TXNIP knockout media is the same as that analysed by mass spec and the protein array, however as the reviewer states there is no analysis (excluding assessing for the presence or absence of GDF15) on the double knockout supernatant or over-expression supernatant. The text has been corrected as follows:

      Lines 675-679. “In light of other secreted factors being seen to be regulated by TXNIP (Figure 3A-B), we included double knockouts (TXNIP and GDF15 knockout; GTKO) as well as an overexpression system (GDF15a) to test for GDF15 specific effects. However, we do not know the impact of knocking out or overexpressing GDF15 on the broader secretome.”

      Perhaps a GDF15 complementation experiment would help here.

      We felt that the association between GDF15 and Treg induction is reasonably well established in the literature, and so once we saw that the supernatant from our GDF15 overexpression system (+/- CD48 blockade) complemented what has already been demonstrated, we were encouraged. However we needed more – hence the TCGA data and IHC staining.

      Finally, even if completely independent, a TXNIP/GDF15 ratio does seem to have utility in determining chemo-therapeutic response.

      We agree – we feel that conceptually this may be the most interesting part of the project and is an example of what can be done with these tools.

      Other major points: 1. Please label the other highly regulated genes shown in Fig 2A and B. Might they also explain some of the underlying biology. This could be on the current figures or in a supplement, though the former is preferred.

      Many thanks – we have done this.

      Please address why the TXNIP induction is so much less in patient-derived organoids vs. cell line spheroids (Fig S2). By the western blots, TXNIP inductions in the organoids looks quite modest. Further, the text is quite cryptic and implies that the "upregulation" is similar in both organoids and spheroids.

      You are absolutely correct. Many apologies, the wording has changed:

      Lines 320-323 “In both models we observed the upregulation of TXNIP mRNA (Figure S2E-H) and TXNIP protein (Figure S2I-L) after oxaliplatin treatment, with spheroids showing greater responsiveness. This difference is most likely due to culturing conditions or differences in the number and location of cycling cells.”

      We have two possible explanations. Firstly the media in which the organoids are cultured contains a lower glucose concentration than that used for the spheroids. As per some of our new data (Figure S3 – later in the rebuttal), the upregulation of TXNIP after oxaliplatin is glucose dependant, with lower concentrations resulting in less of a differential. Secondly, while restricted to the periphery, the Ki67 signal in DLD1 spheroids is quite pronounced indicating that, within the outer zone, many cells (probably the majority) are in the S/G1/G2 phase of the cell cycle at any given point in time (figure below this text).

      This is not the case for the organoids, where the Ki67 (and pCDK1) signal is quite weak, and only sporadic in the outer layer. So we believe that there are many more rapidly cycling cells in the most drug-exposed layer of spheroids when compared to the comparable region in organoids. As the spheroid cells are likely cycling more rapidly, they would also be expected to be more adversely affected by the drug within the finite drug treatment window. Indeed, these spheroids grow large, and quite quickly. If the organoid cells are cycling more slowly and if, within the cell layer most exposed to drug, these cycling cells are less abundant, then the TXNIP response may well be subdued in organoids when compared with spheroids.

      We have decided to not include the above (full) explanation and figure within the new draft, as we feel it may distract from the central message. However do let ourselves and the editor know if you disagree.

      What was the rationale of performing the MS experiment on control and TXNIP KO DLD1 cells in the absence of oxaliplatin? The other experiments in Fig 3 clearly show that Oxa can repress GDF15 even in the absence of TXNIP, which implicates other pathways. ARRDC4? Or something else? This needs to be addressed.

      We adopted this approach because of the order in which the assays occurred and technical issues surrounding the use of post-oxaliplatin treated supernatant. By the time we moved to the proteomics we had already identified, and validated, GDF15 as our number one candidate (initially from the protein array), in terms of response to oxaliplatin and dependence on TXNIP. This led us to the next stage of the project – to assess the environmental impacts of this factor in vitro before validation in situ. To do this, aware of the issue of contaminated recombinant GDF15, we decided early on to use cell line supernatant. We carried out some pilot studies on immune cells using supernatant from oxaliplatin treated cell lines and we had several technical issues (difficulty in determining the correct controls, immune cell death…). This changed the emphasis to using supernatant from knockout models rather than knockout and treated models. Before we began these assays in earnest we wanted to assess exactly what was enriched in TXNIP knockout supernatant and so we turned to proteomics. When this further validated GDF15, we then generated GDF15 and TXNIP/GDF15 knockouts to further elucidate GDF15’s role specifically.

      With regards the other pathways, as you correctly predicted, ARRDC4 also appears to regulate GDF15 – many thanks for helping with this line of enquiry. Please see earlier in the rebuttal for more details and the data.

      The data in 3J with the MondoA knockdown is not convincing. The knockdown is weak and TXNIP goes down a smidge. Agree that GDF15 goes up

      We agree. We have re-run this and pooled the densitometry data – see new figure below (Panel 3J).

      Minor points 1. Line 79. The "loss" of TXNIP/GDF15 axis is confusing. It's really loss of TXNIP and upregulation of GDF15, right?

      Absolutely - corrected to responsiveness.

      Lines 144-147: “Intriguingly, multiple models including patient-derived tumor organoids demonstrate that the loss of TXNIP and GDF15 responsiveness to oxaliplatin is associated with advanced disease or chemotherapeutic resistance, with transcriptomic or proteomic GDF15/TXNIP ratios showing potential as a prognostic biomarker.”

      Please provide an explanation for the different stages in tables 1 and 2. This will likely not be clear to non-clinicians.

      Many thanks. The following has been added at the bottom of the second table.

      Lines 304-309: “The TNM staging system stands for Tumor, Node, Metastasis. T describes the size of the primary tumor (T1-2; 5cm). N describes the presence of tumor cells in the lymph nodes (N0; no lymph nodes. N1-3 >0). M describes whether there are any observable metastases (M0; no metastases. M1; metastases). The clinical stage system is as follows: I/II; the tumor has remained stable or grown, but hasn’t spread. III/IV; the tumor has spread, either locally (III) or systemically (IV).”

      Line 231 should probably read ...cysteine (NAC), a reactive oxygen species inhibitor,

      Many thanks - corrected

      Line 247, should be RT-qPCR I think.

      Many thanks - corrected

      Lines 343-345. I don't quite understand the wording. Does this mean to say that 675 soluble proteins were not changed between the condition media from both cell populations?

      Yes, exactly this. We have removed as this is superfluous and confusing.

      The data in FigS1 B and C don't seem to reach the standard p value of > 0.05

      Very true – we have rewritten the text to make sure the reader knows there is no significance.

      Lines 269-271. “High levels of both the protein (significantly) and the transcript (not significantly) were seen to be associated with favourable prognosis (Figure 1G,H and S1B,C).”

      **Referee Cross-Commenting**

      cross comment regarding referees 2 and 3 above. I'm am convinced that TXNIP is at least contemporaneously upregulated with GDF15 downregulation. However, the strong implication from the writing is that TXNIP regulates GDF15 directly. I agree with the comment above that exploring mechanisms may be open-ended especially as TXNIP has been implicated in gene regulation by several different mechanism. I'd be satisfied with a more open-minded discussion of potential mechanisms by which TXNIP may repress GDF15 and the possibility of other parallel pathways that likely contribute to GDF15 repression.

      Many thanks, this is a generous and understanding approach. As described above we have carried out extra analysis and have found 6 differentially regulated transcription factors which have been shown to bind GDF15 promoter or enhancer regions with 1 of these, MYC, being significantly affected in the TXNIP knockout cell lines, which in combination with supportive literature suggests a degree of TXNIP dependence. We have also identified ARRDC4 as an additional regulator of GDF15 – again please see above.

      Reviewer #1 (Significance (Required)):

      This is an interesting contribution but the mechanistic connection between GDF15 and TXNIP is relatively weak. That said, even as independent variables they do seem to have utility in predicting therapeutic response.

      Many thanks for the comment – we concur. We have reanalysed our data looking for relevant transcription factors (those that bind GDF15 promoter / enhancer regions) finding MYC as the most likely bridge. Please see above.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Deng et al. investigates a mechanistic link between TXNIP and GDF15 expression and oxaliplatin treatment and acquired resistance. They observe an upregulation in TXNIP expression in the tumors of patients who have previously received chemotherapy. They demonstrate oxaliplatin-driven MondoA transcriptional activity is what underlies the induction of TXNIP. They further demonstrate that TXNIP is a negative regulator of GDF15 expression. Together, oxaliplatin induces MondoA activity and TXNIP expression, resulting in a downregulation of GDF15 expression and consequently decreased Treg differentiation.

      Major Comments

      1. The authors suggest that TXNIP induction and GDF15 downregulation are a common effect of chemotherapies; however, the mechanistic studies were limited to oxaliplatin. The authors should clarify this point through further investigation using other commonly used CRC chemotherapies (5-FU, irinotecan, etc.),or through textual changes. To be clear, I think that the oxaliplatin results could potentially stand on their own but would require additional clarification. For example, regarding the patient samples analyzed in 1D and 4F, which patients received oxaliplatin? Could the analysis of publicly available molecular data be drilled down to just the patients who received oxaliplatin?

      Many thanks – this is an excellent point. Firstly, all the patients in 1D and 4F received oxaliplatin. Secondly, we have included new data looking at the impact of other chemotherapies (FOLRIRI, FU-5 and SN-38) on aspects of the study, ultimately finding that these processes (especially an anti-correlation between GDF15 and TXNIP changes upon chemo treatment) appear to be specific to oxaliplatin. These data have been added (Figure S11) and throughout the emphasis has been switched from chemotherapeutic treatment to oxaliplatin treatment.

      Lines 796-799: “To check if the pre-treatment GDF15/TXNIP ratio could be used for patients treated with FOLFIRI we performed the same analyses finding no significance (S11A-D). This oxaliplatin specificity was then confirmed by western blot analysis in DLD1 and HCT15 cells treated with 5-FU or SN38 (Figure S11E-F).

      Example of change of emphasis from ‘chemotherapy’ to ‘oxaliplatin’ – lines 139-142: “Here, in colorectal adenocarcinoma (CRC) we identify oxaliplatin-induced Thioredoxin Interacting Protein (TXNIP), a MondoA-dependent tumor suppressor gene, as a negative regulator of Growth/Differentiation Factor 15 (GDF15).”

      The data demonstrating the induction of MondoA transcriptional activity and TXNIP expression in response to oxaliplatin treatment is quite convincing. The data regarding ROS induction of TXNIP is interesting, especially in light of other studies arguing that ROS limits MondoA activity (PMID: 25332233). Given this apparent disparity, I think that this study could really be strengthened by also investigating other potential mechanisms of oxaliplatin induction of MondoA. In particular, given many studies arguing for direct nutrient-regulation of MondoA, the authors should address the potential for oxaliplatin regulation of glucose availability and a potential glucose dependence of oxaliplatin-induced TXNIP. 2

      In line with the previous point, since MondoA activity and TXNIP expression are sensitive to glucose levels, the authors should investigate oxaliplatin-regulation of TXNIP under physiological glucose levels. No need to replicate everything, just key experiments.

      We feel these are excellent point and really help the piece – many thanks. We have carried out assays around these points suggested and have included the findings in the new draft – see below.

      Lines 332-339: “As such, we went back to first principles and assessed the impact of different concentrations of glucose on TXNIP induction +/- oxaliplatin treatment, finding a concentration dependent effect (Figure S3A). Intriguingly, high glucose alone was able to induce increased TXNIP expression. We then assessed if oxaliplatin treatment drove an increase in glucose uptake, with this seen at concentrations >10mM (Figure S3B). Next, to investigate the impact of glucose metabolism, and consequent ROS generation, on TXNIP induction we treated cells with Antimycin A, an inhibitor of oxidative phosphorylation, finding a complete block in oxaliplatin-induced TXNIP (Figure S3C).”

      The authors did a good job of linking TXNIP and GDF15 in untreated conditions; however, the data arguing for oxaliplatin regulation of GDF15 through TXNIP is less clear. For example, in 3B-H, oxaliplatin treatment reduces GDF15 approximately to the same extent in the NTC and TKO cells, potentially in line with a mechanism of downregulation that doesn't involve TXNIP.

      A very salient point and completely in line with the other reviewers. We have carried out a few additional analyses mentioned previously in this letter. The most pertinent for this specific point are the experiments around ARRDC4, where we found evidence to suggest that, like TXNIP, it regulates GDF15.

      Minor Comments

      1. The presentation of data in Figure 5 is confusing. A-B include raw cell numbers, whereas C-F show "normalized proliferation." What does this mean? And how was the normalization done?

      Apologies for this. Legend test has been corrected to “Normalised proliferation (normalised to MFI from control: i.e. cells treated with supernatant from NTC cells) on gated CD3+CD8+ or CD3+CD4+ cells is shown. n=6. (G-H) Normalised IFNg concentrations (normalised to MFI from control: i.e. cells treated with supernatant from NTC cells) in the supernatant of cells from C-F.” (lines 727-729).

      **Referee Cross-Commenting**

      cross-comment regarding reviewer #1

      I agree with the referee that the link between TXNIP and GDF15 is weak, though as I mentioned before, this is particularly true in the context of oxaliplatin-regulation of TXNIP. I agree that given all the presented data, it is likely that oxaliplatin-regulation of TXNIP and GDF15 are independent. In my opinion, the referee brought up all valid concerns, but this is by far the biggest concern that I share.

      We agree that this is the weakest aspect of the paper, however our new analyses plus supportive literature, suggests that the relationship between TXNIP and GDF15 may be mediated by MYC (please see above)

      cross-comment regarding reviewer #3

      The major concern that this referee addresses is whether another transcription factor supersedes the proposed MondoA/TXNIP induction in regulating GDF15 expression in later stage CRC. In my opinion, this another other concerns of the referee are all valid, but still I remain unconvinced that TXNIP induction underlies the oxaliplatin-regulation of GDF15. I think fleshing out that aspect of the study would potentially help the authors tease apart how this potential MondoA-TXNIP-GDF15 axis is dysregulated later in CRC progression.

      This is a great discussion. Interestingly enough, c-myc is seen at higher levels in late stage CRC (Hu X, Fatima S, Chen M, Huang T, Chen YW, Gong R, Wong HLX, Yu R, Song L, Kwan HY, Bian Z. Dihydroartemisinin is potential therapeutics for treating late-stage CRC by targeting the elevated c-Myc level. Cell Death Dis. 2021 Nov 5;12(11):1053. Doi: 10.1038/s41419-021-04247-w. PMID: 34741022; PMCID: PMC8571272.), is seen as an important factor in resistance, and as this review argues, is driven by stress (Saeed H, Leibowitz BJ, Zhang L, Yu J. Targeting Myc-driven stress addiction in colorectal cancer. Drug Resist Updat. 2023 Jul;69:100963. Doi: 10.1016/j.drup.2023.100963. Epub 2023 Apr 20. PMID: 37119690; PMCID: PMC10330748.). So it is very plausible that the partial TXNIP-mediated regulation of myc in early / sensitive CRCs that we may be observing, and has been reported recently (TXNIP loss expands Myc-dependent transcriptional programs by increasing Myc genomic binding Lim TY, Wilde BR, Thomas ML, Murphy KE, Vahrenkamp JM, et al. (2023) TXNIP loss expands Myc-dependent transcriptional programs by increasing Myc genomic binding. PLOS Biology 21(3): e3001778. https://doi.org/10.1371/journal.pbio.3001778) is lost in late stage / resistant CRCs. If this is the case, in effect what we would have observed is the loss of a stress-associated method (TXNIP) of controlling c-myc activity. What makes our collective lives difficult is that, as reported “this expansion of Myc-dependent transcription following TXNIP loss occurs without an apparent increase in Myc’s intrinsic capacity to activate transcription and without increasing Myc levels.” (TXNIP loss expands Myc-dependent transcriptional programs by increasing Myc genomic binding Lim TY, Wilde BR, Thomas ML, Murphy KE, Vahrenkamp JM, et al. (2023) TXNIP loss expands Myc-dependent transcriptional programs by increasing Myc genomic binding. PLOS Biology 21(3): e3001778. https://doi.org/10.1371/journal.pbio.3001778)

      Reviewer #2 (Significance (Required)):

      Generally speaking the experiments are well controlled and the findings are significant and novel. Though the link between MondoA activity and ROS could be strengthened, and the data could be validated under more physiological settings. Further, the authors should clarify their interpretations so as to not overstate the findings.

      Many thanks for the comments. We have taken onboard the need for more physiological settings and have included varying levels of glucose to reflect concentrations in different environments. We have repeated the siMondoA work in 3J strengthening the conclusions wrt its impact on TXNIP and GDF15 expression (see above).

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this well-written manuscript, the authors show that chemotherapy increases a MondoA-dependent oxidative stress-associated protein, TXNIP, in chemotherapy-responsive colorectal cancer cells. They show that TXNIP negatively regulates GDF-15 expression. GDF-15, in turn, correlates with the presence of T cells (Treg), and inhibits CD4 and CD8 T cell stimulation. In advanced disease and chemo-resistant cancers, upregulation of TXNIP and downregulation of GDF-15 appear to get lost. Based on a somewhat smallish data set, the authors suggest that the pre-treatment GDF-15/TXNIP ratio can predict responses to oxaliplatin treatment. This is a very interesting, novel finding. In general, the quality of the experiments and the data are high and the conclusions appear sound. Still, there are a number of aspects that should still be improved:

      The observed loss of the ROS - MondoA - TXNIP - GDF15 axis in chemoresistant and/or metastatic tumors implies that another transcription factor or pathway becomes dominant upon tumor progression. As this switch would be key to better understanding the mechanism underlying the prognostic role of the TXNIP/GDF15 ratio, the authors should at least do data mining followed by ChEA or Encode (or other) analysis to identify transcription factors or pathways that become activated in late-stage/metastatic CRC cells. There is a high likelihood that a transcription factor or pathway involved in GDF-15 upregulation in cancer (e.g. p53, HIF1alpha, Nrf2, NF-kB, MITF, C/EBPß, BRAF, PI3K/AKT, MAPK p38, EGR1) supersedes the inhibitory effect of the MondoA-TXNIP axis. As it stands, the proposed loss of function of the ROS - MondoA - TXNIP - GDF-15 axis is far less convincing than almost all other aspects of the study.

      An extremely fair point. We adopted a similar approach to that suggested – as mentioned above, we looked at TFs that bind to GDF15 promoter/enhancer regions and then looked at the presence of these in our transcriptomic data – specifically any evidence of change post oxaliplatin treatment. We found 6 such TFs that were decreased post-oxaliplatin treatment. We then looked for any evidence of TXNIP dependence in these TFs by comparing post-oxaliplatin treatment across NTC and TXNIP knockout lines, when we did this we found only one GDF15 promoter/enhancer binding TF was significantly changed: MYC. We then looked at the relationship between MYC,TXNIP, and GDF15 against the other 5 ‘control’ TFs in the TCGA COAD dataset, we found that MYC showed the strongest correlations, in the ‘correct’ directions. This finding was further backed up in the literature where a TXNIP knockout in a breast cancer model drove c-myc-dependent transcription, whilst c-myc has been observed to increase in later stage CRC patients, is associated with cellular stress and resistance. The collective evidence therefore suggests that MYC is the factor that is initially at least partially regulated by TXNIP, before this regulation is lost in advanced / resistant disease. Continuing on this line, it is likely that the predictive GDF15/TXNIP ratio is at least in part, a measure of c-myc responsiveness to oxaliplatin. All the while we must bear in mind TXNIP-independent oxaliplatin-dependent regulation of GDF15, most likely ARRDC4, as described earlier in this document.

      Using pathway analysis software to compare our transcriptomic data from cell lines treated with/without oxaliplatin, the most likely pathways upstream of MYC/c-myc that are negatively affected by chemotherapy are BAG2, Endothelin-1, telomerase, ErbB2-ErbB3 and Wnt/B-catenin. When looking at the comparison of UTC and resistant lines’ transcripts there is only one key component of these pathways which is upregulated in both lines - ERBB3 – which has already been shown to be important in CRC metastasis and resistance (Desai O, Wang R. HER3- A key survival pathway and an emerging therapeutic target in metastatic colorectal cancer and pancreatic ductal adenocarcinoma. Oncotarget. 2023 May 10;14:439-443. doi: 10.18632/oncotarget.28421. PMID: 37163206; PMCID: PMC10171365.). It is highly speculative, but our data suggests the most likely pathway to supersede TXNIP in its (partial) regulation of MYC is the ErbB2-ErbB3 pathway.

      My further criticisms are mostly more technical:

      Figure 2 I-L: What was the extent of MondoA downregulation achieved by siRNA treatment? Could the effects also be seen with the small molecule mondoA inhibitor SBI-477 (or a related substance)?

      This experiment has been repeated. The pooled densiometric data is also now given (please see above).

      How do you explain the different GDF-15 levels between untreated non-target control cells (NTC) and TXNIP knock-down cells (TKO) in Figures 3C-F?

      The only way to interpret this is that there is a TXNIP-independent pathway regulating GDF15 expression after oxaliplatin treatment, as described this is most likely to be ARRDC4 - the text has been updated to:

      Lines 522-524: “It is important to note, however, that we saw clear evidence that TXNIP was not solely responsible for the downregulation of GDF15 post oxaliplatin treatment (Figure 3C-G, S6E).”

      In figures 3 E-G the dots for the individual measurements should be indicated. This would be more informative than just the bar graphs.

      Completed.

      Figure 4C,D and Table 3: Data on the role of GDF-15 in CRC are largely valedictory of previous work (e.g. Brown et al. Clin Cancer Res 2003, 9(7):2642-2650, Wallin et al., Br J Cancer. 2011 May, 10;104(10):1619-27). Therefore, the previous studies should be cited.

      Apologies for the oversight and many thanks – this is an excellent addition.

      Figure 5C-F: Please indicate in the figure legend how proliferation was assessed.

      Many thanks. This was noticed by another reviewer also. We have changed the text to include how the data was normalised: “(C-F) Labelled PBMCs were stimulated with anti-CD3 and anti-CD28 for 4 days in the presence of fresh supernatant from indicated cell lines, before being stained with anti-CD3 and anti-CD8 (C-D) or anti-CD4 (E-F) antibodies and measured by flow cytometry. Normalised proliferation (normalised to MFI from control: i.e. cells treated with supernatant from NTC cells) on gated CD3+CD8+ or CD3+CD4+ cells is shown. n=6. (G-H) Normalised IFNg concentrations (normalised to MFI from control: i.e. cells treated with supernatant from NTC cells) in the supernatant of cells from C-F.” (lines 724-730)

      Figure S8E-G: Please indicate the analysed parameters in the graphs. In Figure S8G, the legend just indicates that "aggression of tumour" is dichotomized and plotted. This clearly requires a better definition.

      Many thanks, this has been changed as per the below.

      Lines 862-868: “(E-G) Receiver operating characteristic (ROC) curves showing area under the curve and p values for the use of GDF15/TXNIP ratio in predicting origin of cell line (E; primary; DLD1, HCT15, HT29, SW48 [n=4] or secondary; DiFi, LIM1215 [n=2]), sensitivity to oxaliplatin (F; parental DLD1 (plus biological repeat), HCT15 [n=3] or resistant DLD1 (plus biological repeat), HCT15 [n=3]), aggression of tumor (G; non-aggressive; The authors propose a novel ROS - MondoA - TXNIP - GDF15 - Treg axis, where MondoA activation, TXNIP up- and GDF-15 downregulation enhance tumor immunogenicity. While this axis has been analyzed in some detail, GDF-15 is not only linked to induction of regulatory T cells. There has been a report showing that GDF-15/MIC-1 expression in colorectal cancer correlates with the absence of immune cell infiltration (Brown et al. Clin Cancer Res 2003, 9(7):2642-2650). The link between GDF-15 and immune cell exclusion has also been confirmed in other conditions, including different cancers (Kempf et al. Nat Med 2011, 17(5):581-588, Roth P et al. Clin Cancer Res 2010, 16(15):3851-3859, Haake et al. Nat Commun 2023, 14(1):4253). A key mechanism is the GDF-15 mediated inhibition of LFA-1 activation on immune cells. As the authors argue that the described pathways turns cold tumors hot in response to oxaliplatin-based chemotherapy, this GDF-15 dependent immune cell exclusion mechanism might be at least as relevant than induction of Treg. Likewise, inhibition of dendritic cell maturation by GDF-15 (Zhou et al. PLoS One 2013, 8(11):e78618) could explain why GDF-15high tumors are immunologically cold. Reviewed in 3

      The authors propose that the pathways discovered by them contributed to the "heating up" of the tumor microenvironment after oxaliplatin-based chemotherapy. The authors should thus look in their data sets for the presence of cytotoxic T cells and their possible correlation with TXNIP and GDF-15 levels.

      This is a wonderful explanation – many thanks. We have taken the opportunity to assess the impact of GDF15 expression on a variety of T cell markers (Figure S9). In this data a negative association between GDF15 and CD8 CTLs can clearly be seen, as predicted by the reviewer.

      Lines 712-717: “To assess if the GDF15-dependent presence of Tregs may be associated with a decrease in activated cytotoxic CD8 T cells, we interrogated the TCGA COAD dataset. We found that low GDF15 tumors carried significantly higher levels of CD8, CD69, IL2RA, CD28, PRF1, GZMA, GZMK, TBX21, EOMES and IRF4 (Figure S9); transcripts indicative of activated cytotoxic CD8 T cells. High GDF15 tumors were enrichment for FOXP3 and, interestingly, RORC (Figure S9). These data support the hypothesis that GDF15 induces Foxp3+ve Tregs which inhibit CD8 T cell proliferation and activation in the TME.”

      The paragraph on GDF-15 receptors needs to be corrected: The purported role of a type 2 transforming growth factor (TGF)-beta receptor in GDF-15 signalling had been due to a frequent contamination of recombinant GDF-15 with TGF-beta (Olsen et al. PLoS One 2017, 12(11):e0187349). There have been a number of screenings for GDF-15 receptors that have all failed to show an interaction between GDF-15 and TGF-beta receptors. Instead, only GFRAL was found in these large-scale screenings (Emmerson et al. Nat Med 2017, 23(10):1215-1219, Hsu et al. Nature 2017, 550(7675):255-259, Mullican et al. Nat Med 2017, 23(10):1150-1157, Yang et al. Nat Med 2017, 23(10):1158-1166). The one subsequent report that shows a link between GDF-15, engagement of CD48 on T cells and induction of a regulatory phenotype (Wang et al. J Immunother Cancer 2021, 9(9)) still awaits independent validation. Considering that CD48 lacks an intracellular signaling domain that would be critical for a classical receptor function, I recommend to be more cautious regarding the role of CD48 as GDF-15 receptor. Given the mechanism outlined by Wang et al. the word interaction partner might be more apt. Moreover, an anti-GDF-15 antibody would be a good control for the experiments involving an anti-CD48 antibody in Figure 5.

      Thank you so much for this concise and highly informative paragraph. We have changed the text to read:

      202-204: “As a soluble protein, GDF15 exerts its effects by binding to its cognate receptor, GDNF-family receptor a-like (GFRAL)44,45,46,47 or interaction partner, CD48 receptor (SLAMF2)43, with the latter still requiring additional verification.”

      We would have ideally included an anti-GDF15 antibody in the CD48 assay at the time but didn’t have the foresight. We have included the additional text to temper any conclusions.

      Lines 701-711: “Furthermore, when stimulating naïve CD4 T cells in the presence of GDF15 enriched supernatant we were able to both differentiate these cells into functional Tregs and also block the generation of this functionality using an anti-CD48 antibody (Figure 5M-N). However, it must be stressed that the binding and functional impacts of GDF15’s interaction with CD48 still require further verification.”

      Cell surface externalization of annexin A1 has been described as a failsafe mechanism to prevent inflammatory responses during secondary necrosis (PMID: 20007579). Thus, I am surprised that the authors list annexin A1 among the immune-stimulatory molecules exposed or released in response to chemotherapy-induced cell death (line 103). Please clarify!

      We agree – it shouldn’t be there!! Removed. Many thanks.

      **Referee Cross-Commenting**

      Regarding the cross-comment by referee 2: In my opinion, the data shown in Figure 3C-H clearly demonstrates that TXNIP can repress GDF-15 expression. I agree that there will likely be further regulators. The GDF-15 promoter is constantly regulated by a multitude of factors (which mostly induce transcription). As downregulation of GDF-15 in response to oxaliplatin is the opposite of the frequently described induction of GDF-15 upon chemotherapy, net effects may always be "smudged" by contributions from different pathways (e.g. by cell stress due to siRNA transfection). Therefore, I believe that the data are as good as it will get. Accordingly, I would not force the authors to further amplify the observed effect.

      Many thanks for your understanding – yes, GDF15 has >650 TFs that bind its promoter/enhancer regions – a number we found rather daunting. Happily your comments and those of the other reviewers inspired us to dig and we now have data that is supportive of MYC’s and ARRDC4’s involvement – detailed throughout this reply.

      cross comment regarding referee #1: I share the general assessment of the referee and recognize the very detailed mechanistic analysis. To further support the moderate effects of the MondoA knockdown, a small molecule inhibitor like SBI-477 might be useful. (I had already suggested using this inhibitor to support these data.)

      Many thanks for the suggestion. We opted to increase the number of siRNA repeats instead – with the data included in Figure 3J (above).

      Still, my view on the potential relevance of oxaliplatin-induced, TXNIP-independent downregulation of GDF-15 differs from that of referee 1. In the clinics, platinum-based chemotherapy is one of the strongest inducers of GDF-15 (compare Breen et al. GDF-15 Neutralization Alleviates Platinum-Based Chemotherapy-Induced Emesis, Anorexia, and Weight Loss in Mice and Nonhuman Primates. Cell Metabolism 32(6), P938-950, 2020.DOI:https://doi.org/10.1016/j.cmet.2020.10.023). I was thus surprised that the authors found a pathway, which leads to an outcome that an exactly opposite effect.

      This is fascinating that oxaliplatin drives this increase in GDF15 – we were unaware of this paper. Looking at figure 2(H-K), GDF15 is being produced from multiple non-diseased tissues after systemic chemotherapy – even at day 19 post-treatment – this suggests that wrt this study, systemic GDF15 could not be used as a readout of success or otherwise – which is extremely helpful! Thank you.

      Thus far, the only obvious reason for reduced GDF-15 secretion upon treatment with cytotoxic drugs was a reduction in tumor cell number due to cytotoxicity.

      Please do not discount this. This study was focused on the cells which survived oxaliplatin treatment – the cells which did not were discarded. Our view, given your input, would be a complex picture where in early stages systemic GDF15 goes up, due to off-target effects, but locally levels drop owing to cell death and this, and other, stress-related pathways in the remaining tumor cells.

      Still, the authors managed to convince me that the described pathway (ROS - MondoA - TXNIP - GDF-15) exists. (Here, I still largely concur with referee 1.) Moreover, as we have identified some factors required for GDF-15 biosynthesis that could easily interact with TXNIP, I find the proposed mechanism plausible.

      Extremely encouraging for us to hear!

      Nevertheless, as a downregulation of GDF-15 in response to chemotherapy is hardly ever observed in late-stage cancers, I believe that the observed switch in pathway activation between early- and late-stage cancers might be highly relevant - in particular, as there is so much evidence for platinum-based induction of GDF-15 in late-stage cancer patients. Emphasizing the divergent clinical observations (e.g. by Breen et al.) could thus help to put the finding into perspective.

      Very much agree. We did see this phenomenon in LIM1215 cells (Figure 6B) and the resistant lines we generated continually produced higher levels.

      Analysing TXNIP-independent mechanisms involved in the oxaliplatin-dependent repression of GDF-15, as suggested by referee #1, will require enormous efforts and resources, and may still turn out to be fruitless. Personally, I would thus be content if the authors just mentioned possible contributions from other pathways upon cancer progression. To me, the described pathway seems to be limited to early-stage cancers, and the actual finding that GDF-15 is downregulated is an interesting observation, irrespective of further involved pathways.

      Many thanks – this is extremely fair. Happily we have managed to make some tentative steps forward in highlighting the potential role of MYC, and the suggestion of redundancy wrt ARRDC4, but as you say, much more work needs to be done to fully understand these processes.

      cross comment regarding referee #2: I fully agree with the referee that activation of the pathway by further chemotherapeutic drugs could be a valuable addition. As Guido Kroemer´s lab has described oxaliplatin to induce a more immunogenic cell death compared to other platinum-based chemotherapies, even a rather limited comparison between oxaliplatin and cisplatin could be very interesting.

      Absolutely agree – extra data on this has been included in Figure S11, which is included earlier in this letter. We also uncovered a meta-analysis using metformin, which has been seen to inhibit ROS, where TXNIP and ARRDC4 are the top two downregulated transcripts whilst GDF15 appears in the top four upregulated. This may suggest that chemotherapeutic immunogenicity, at least through the presence or absence of GDF15, may in part be driven by ROS.

      Lines 930-933: “Further support for both TXNIP and ARRDC4’s role in regulating GDF15 after the induction of ROS comes from a pan cancer meta-analysis assessing the impact of metformin (which has been reported to inhibit ROS) on gene expression. Here the top two downregulated genes were TXNIP and ARRDC4 and the top four upregulated genes were DDIT4, CHD2, ERN1 and GDF1572 “

      Reviewer #3 (Significance (Required)):

      In general, this is a very interesting manuscript describing a cascade of events that may contribute to successful chemotherapy (which likely requires induction of an immune response against dying tumor cells.) The observation that this pathway is only active in early/non-metastatic cancer cells is striking. Unfortunately, the authors cannot explain inactivation of this pathway in later stage/ metastatic/ highly aggressive cancers. Understanding this switch could easily be the most important finding triggered by this report. Therefore, I highly recommend to make some effort in this direction. Strikingly, the authors find that disruption of TXNIP-mediated GDF-15 downregulation is strongly associated with worse prognosis. They also suggest that this ratio could indicate whether a patient will respond to oxaliplatin-based chemotherapy.

      This is again very fair – we have posited a potential mechanism for the loss of this switch elsewhere in this reply– one which involves a change in TXNIP-mediated MYC regulation and/or increased HER2-HER3 signalling – but although reasonable for a rebuttal (and publication in that context) we do not feel we have the evidence to include this within the full manuscript.

      Altogether, the findings described in manuscript are very novel and may have prognostic (or, in case of the presumed loss of the MondoA - TXNIP - GDF-15 pathway) therapeutic implications. Thus, the manuscript certainly fills various gaps and should be of major interest for cell biologists working on immunogenic cell death, or colorectal cancer, or MondoA, TXNIP or GDF-15. Still, due to its translational implications, it would also be worthwhile reading for a large number of researchers in the oncology field.

      We are very grateful for your kind comments.

      1 Sinclair, L. V., Barthelemy, C. & Cantrell, D. A. Single Cell Glucose Uptake Assays: A Cautionary Tale. Immunometabolism 2, e200029, doi:10.20900/immunometab20200029 (2020).

      2 Yu, F. X., Chai, T. F., He, H., Hagen, T. & Luo, Y. Thioredoxin-interacting protein (Txnip) gene expression: sensing oxidative phosphorylation status and glycolytic rate. J Biol Chem 285, 25822-25830, doi:10.1074/jbc.M110.108290 (2010).

      3 Wischhusen, J., Melero, I. & Fridman, W. H. Growth/Differentiation Factor-15 (GDF-15): From Biomarker to Novel Targetable Immune Checkpoint. Front Immunol 11, 951, doi:10.3389/fimmu.2020.00951 (2020).

    1. Reviewer #2 (Public Review):

      Summary:<br /> The authors try to establish that there is an Abeta-dependent loss of nuclear pores early in Alzheimer's disease. To do so the authors compared different NUP proteins and assessed their function by analyzing nuclear leakage and resistance to induction of nuclear damage and the associated necroptosis. The authors use a mouse knockin for hAPP with familial Alzheimer's mutations to model amyloidosis related to Alzheimer's disease. Treatment with an inhibitor of beta-amyloid production partially rescued the loss of nuclear pore proteins in young KI neurons, implicating beta-amyloid in Nuclear Pore dysfunction, a mechanism already described in other neurodegenerative diseases but not in Alzheimer's disease.

      The conclusions of this paper related to familial AD are well supported by data but are not related to an aging decline in NUP function, where it is required to extend data analysis and one additional experiment.

      1. Adding statistics and comparisons between wild-type changes at different times/ages to determine if the nuclear pore changes with time in wild-type neurons. The images show differences in the Nuclear pore in neurons from the wild-type mice, with time in culture and age. However, a rigorous statistical analysis is lacking to address the impact of age/development on NUP function. Although the authors state that nuclear pore transport is reported to be altered in normal brain aging, the authors either did not design their experiments to account for the normal aging mechanisms or overlooked the analysis of their data in this light.

      2. Add experiments to assess the contribution of wild-type beta-amyloid accumulation with aging. It was described in 2012 (Guix FX, Wahle T, Vennekens K, Snellinx A, Chávez-Gutiérrez L, Ill-Raga G, Ramos-Fernandez E, Guardia-Laguarta C, Lleó A, Arimon M, Berezovska O, Muñoz FJ, Dotti CG, De Strooper B. 2012. Modification of γ-secretase by nitrosative stress links neuronal ageing to sporadic Alzheimer's disease. EMBO Mol Med 4:660-673, doi:10.1002/emmm.201200243) and 2021 (Burrinha T, Martinsson I, Gomes R, Terrasso AP, Gouras GK, Almeida CG. 2021. Upregulation of APP endocytosis by neuronal aging drives amyloid-dependent synapse loss. J Cell Sci 134. doi:10.1242/jcs.255752), 28 DIV neurons are senescent and accumulate beta-amyloid42. In addition, beta-amyloid 42 accumulates normally in the human brain (Baker-Nigh A, Vahedi S, Davis EG, Weintraub S, Bigio EH, Klein WL, Geula C. 2015. Neuronal amyloid-β accumulation within cholinergic basal forebrain in ageing and Alzheimer's disease. Brain 138:1722-1737. doi:10.1093/brain/awv024), thus, it would be important to determine if it contributes to NUP dysfunction. Unfortunately, the authors tested the Abeta contribution at div14 when wild-type Abeta accumulation was undetected. It would enrich the paper and allow the authors to conclude about normal aging if additional experiments were performed, namely, treating 28Div neurons with DAPT and assessing if NUP is restored.

    1. Author Response

      The following is the authors’ response to the original reviews.

      Reviewer #1:

      In no particular order:

      1. In Figs S3 and S4, can they also show gamma fit? (or rather corrected fit accounting for abundance conditioning?) The shapes look different, especially for the microbial mat.

      Author response: We have added gamma distribution fits to the rescaled AFD plots (Figs. S3, S4).

      1. Lines 170-176 seem like they should come before lines 164-166.

      Author response: In lines 166-170 we discuss empirical patterns in the data that motivate the introduction of the SLM as a model in lines 170-175. We have clarified these points in the revision.

      1. The wiggles in the gamma predictions in the occupancy-abundance plots are because occupancy depends not only on abundance but also on the shape parameter, right? Probably good to write a sentence or two explaining what's going on here.

      Author response: We agree with the reviewer that the variation in the prediction could be in-part driven by variation in the shape parameter across community members. We now include this observation in our revision (lines 209-211).

      1. In the predicted vs observed occupancy plots, it would be nice to add curves showing predicted standard deviation or similar to give a sense of how well the model is predicting the variability.

      Author response: In the revised manuscript we now include predictions for the variance of occupancy using the gamma distribution under both taxonomic and phylogenetic coarse-graining (Fig. S9; S10; lines 211-214).

      1. Covariance between sister groups: Figs S9 and S10 look very nice, but it's hard to see much because they're log-log plots over multiple decades, while even a several-fold difference from y = x would indicate a strong effect of correlations. It would be clearer if the y-axis showed the ratio of the coarsegrained variance to the sum of OTU variances and we were looking at how well it fit y = 1.

      Author response: We have included these plots in the revision (Fig. S14, S15).

      1. If the sum of gammas can be well-approximated by a gamma, does that mean that the gamma is just a fairly flexible distribution and we shouldn't take the quality of the gamma fits in general as a very specific indication of what's going on?

      Author response: While the sum of random variables that are drawn from gamma distributions with different parameters is often well-approximated by another gamma, this does not tell us why the gamma distribution holds for microbial communities at the finest-grain level (i.e., OTUs/ASVs). At present, the best explanation is that the gamma is a stationary distribution for certain stochastic differential equations which have ecological interpretations (Grilli, 2020; Shoemaker et al., 2023). Furthermore, alternative two-parameter distributions have been tested alongside the gamma and have done a comparatively poor job capturing observed macroecological patterns (Grilli, 2020). These results suggest that the utility of the gamma distribution is not simply an outcome of its flexible nature, it succeeds because it has captured core ecological properties of microbial communities. In the case of the SLM, gamma-like distributions arise when a community member is subject to self-limiting growth and environmental noise. On the other hand, the stability of the gamma distribution might explain why it can be detected as shape of the AFD, as it does not fade out across coarse-graining level.

      1. What's going on with the variance of diversity in Fig S12? Does this suggest that some of the problem in Figure 4 could be with the analytic approximation rather than the model? I had a hard time understanding the part of the Methods explaining the simulation details (lines 587-597). It would be worth expanding this. Is there some way to explain how the correlations were simulated in terms of the SLM, e.g., correlations in the noise term across OTUs?

      Author response: We believe that deviations in the variance of diversity in Fig. S16g,h are driven by small deviations in our predictions of the second moment $$< (x*ln(x) | N_{m}, \bar{x}{i}, \beta{i}^{2} >$$ (Eq. S16). Alone these predictions are slight, but their effects become noticeable when summed over hundreds or thousands of taxa. We have included this observation in the revised manuscript (lines 268-271). However, this deviation pales in comparison with the magnitude of covariance in the empirical data, suggesting that our inability to predict the variance of richness and diversity is primarily driven by our assumption of statistical independence.

      Regarding the source of the correlations, under the SLM correlations in abundances can be introduced either by adding deterministic interaction terms or through correlated environmental noise. Determining which of these two options drives empirical correlations is an active area of research (e.g., Camacho-Mateu et al., 2023). For the purpose of this study, we remain agnostic on the cause of the correlations, optioning to instead emphasize that that the inclusion of correlations is necessary to reproduce observed slopes of the fine vs. coarse-grained relationship for diversity.

      1. In Figure 5ab, is the idea that the correlation in richness is primarily driven by the number of samples from the environment? Line 390 seems to say so, but it would be good to make this explicit and put it right in that section of the Results.

      Author response: Our results suggest that sampling effort (# reads) plays a larger role in determining the correlations between fine and coarse-grained measures of richness. We now clarify this point in the revised manuscript (lines 429-435).

      1. I don't totally understand the contrast in lines 369-372. If fine-scale diversity within one group begets coarse-grained diversity in another group, couldn't that show up as correlations in the AFDs? Or is the argument that only including within-group correlations in AFDs is enough to reproduce the pattern? I'm not sure I see how that could be.

      Author response: The term “begets” implies both causation and direction. If we see a positive relationship between diversity estimates at two different scales of observation the causal mechanism cannot be determined solely from correlations between samples obtained once from different sites. So, mechanisms consistent with niche construction/"DBD" can produce correlations, though the existence of correlations do not necessarily imply DBD.

      1. The discussion of niche construction on 429-431 doesn't match very well with 440-441. Basically, niche construction is a very broad concept, not a specific one, right?

      Author response: In lines 472-576 (formerly 429-431) we discuss how the existence of correlations between fine and coarse-grained scales does not point to a single ecological mechanism. Alternatively stated, observing a non-zero slope does not mean that niche construction is driving the relationship.

      In lines 476-487 (formerly 440-441) we discuss how the mechanism of cross-feeding has been shown to generate a positive relationship between fine and coarse-grained measures of diversity. This mechanism can be interpreted as a form of “niche construction”, so it is an instance of a tested ecological mechanism that aligns with the interpretation given in Madi et al. (2020).

      1. Isn't (8) just the negative binomial distribution?

      Author response: The convolution of the stationary solution of the SLM (i.e., a gamma distribution) and the Poisson limit of a multinomial sampling distribution returns a negative binomial distribution of read counts across hosts if samples have identical sampling depths. We now include this detail in the revision (line 593-595). Note however that if different samples have different sampling depths, the distribution of reads across samples is not a negative binomial.

      1. Missing 1/M in (9).

      Author response: We have fixed this omission in the revision.

      1. Schematic figures illustrating what the different statistics are intuitively capturing would really help this work be understandable to a broader audience, but they'd also be a ton of work.

      Author response: Richness and diversity are used in ecology to such an extent that we do not see the benefit of a conceptual diagram. Furthermore, we have included a conceptual diagram about our pipeline in our revision at the request of Reviewer 2 (Fig. S20).

      Reviewer #2:

      Major Recommendations

      If I were reviewing this manuscript for a regular journal, I believe the following issues would be important to address prior to publication.

      1. From my reading, the main points of this advance are that

      a. SLM models AFDs well at all levels of coarse-graining.

      b. This makes SLM a better null-model than UNTB for macroecological relationships.

      c. Using SLM on the EMP data, the richness slopes are well explained by SLM but not the diversity slopes. Therefore, any theory that hopes to explain the diversity slopes must include interactions. Argument B appears to be one of the key points yet is missing from the abstract, and should be made clearer. If these aren't the main points the authors intended, then other main points need to be highlighted more.

      Author response: In the revision we now explicitly mention argument b in the Abstract.

      1. The title should be more specific, so as to better reflect the content. (E.g. "UNTB is not a good null model for macroecological patterns" would seem more appropriate.)

      Author response: We would prefer to focus on the success of the SLM rather than the limitations of the UNTB in the title of this work. Therefore, we have modified our title as follows: “Investigating macroecological patterns in coarse-grained microbial communities using the stochastic logistic model of growth”.

      1. The manuscript would benefit from a clearer description of exactly what information the SLM retains about the data (perhaps even a cartoon panel in one of the figures). In particular, it is important to be explicit about the number of model parameters.

      Author response: The number of model parameters for the gamma AFD are now explicitly stated in the revision (Lines 579-580).

      1. The main point of Figures 2-4 seems to be that SLM is good at describing the data (and when it fails it is due to interactions) while UNTB fails to reproduce this behavior, in support of Argument B. This is not clear from the figure descriptions or titles, which focus on SLM's "predictive" power.

      Author response: Fig. 2a demonstrates that the gamma distribution predicted by the SLM explains the empirical distribution of abundances. This result provides motivation to predict the fraction of sites harboring a given community member (i.e., occupancy, Fig. 2c) as well as general measures of community composition including mean richness (Fig. 3a,c) and mean diversity (Fig. 3b,d) using parameters estimated from the data (not free parameters).

      This success led us to consider whether the gamma distribution could predict the variance of richness and diversity, which it could not because it does not capture covariance between community members (Fig. 4).

      In the revision we have identified opportunities to make these points clear throughout the Results. Furthermore, we have added additional detail to the legends of Figs. 2-4.

      1. The manuscript would benefit from clarifying the use of "prediction" related to the SLM. Since the gamma distributions predicted by SLM were fit to empirical data, it seems like the agreement between analytic means and empirical means (Fig. 3) is a statement on gamma distributions being a good fit for the AFD's more than SLM predicting richness and diversity. For example, from my reading, it seems like this analysis could be done numerically by shuffling species abundances across environments and seeing whether this changed the mean richness/diversity. I would not call this shuffling test a prediction, since it is more a statement on the relevance of interactions. SLM predicts gamma-distributed AFD's, but those distributions recovering the data they were trained on doesn't seem like a prediction.

      Author response: In this manuscript we identified the gamma distribution as an appropriate probability distribution to describe the distribution of relative abundances across samples over a range of coarse-grained scales. Motivated by this result, we performed a separate analysis where at each scale we estimated the mean and variance of relative abundance across sites for each community member. We then used these parameters to obtain the expected value of a community-level measure using an equation we derived by assuming that the gamma distribution was appropriate (e.g., richness, Eq. 13). We then compared the expected value of richness to the mean value from empirical data and assessed the similarity between the two values.

      The outcome of this procedure constitutes a prediction. While the mean and variance are parameters, estimating them from the empirical data has no connection with the operation of training a distribution on empirical data. We could have derived predictions such as Eq. 13 using any other probability distribution that can be parameterized using the mean and variance (e.g., Gaussian). Such a prediction would likely do a poor job even though it used the same means and variances used for our gamma predictions. This is because the choice of distribution would not have been a good descriptor of the distribution of abundances across hosts.

      To better explain this last -- perhaps the most significant -- issue, I'd like to ask the authors if the following recasting would be an accurate reflection of their conclusions, or if something is missing.

      1. "Focusing on the empirical relationship observed between diversity slopes by Madi 2020, we ask the question: does explaining these relationships require accounting for species-species correlations? Or could it be reproduced in a noninteracting model?" To address this question, one can perform a randomization test, shuffling abundances to preserve all single-OTU statistics but breaking any correlations. My reading of the authors' results is that (new result 1) the richness relationships would be preserved, while diversity relationships would not be preserved. [Note that this result 1 need not mention either SLM or UNTB.]

      Author response: The question of whether correlations between species are necessary to explain the observed slope of the fine vs. coarse-grained relationship was only one component of our research goals. Our first question was whether the SLM would prove to be a more appropriate null for evaluating the novelty of observed slopes. We believe that our results support the conclusion that the SLM is an appropriate null for this question, as it was able to capture observed slopes of the fine vs. coarse-grained relationship for estimates of richness, determining that correlations and the interactions that are ultimately responsible are not necessary to explain this result.

      We then find that the SLM as a null model fails to capture observed slopes of the fine vs. coarsegrained relationship for estimates of diversity and simulate the SLM with correlations to return reasonable estimates of the slope. However, here the question about correlations is a direct follow-up from our question about a null model that excludes interactions, so it is unclear how a randomization test would relate to this result.

      1. Instead of doing a randomization test (resampling the empirical distribution), one might insist on instead fitting a model to the AFD distributions, and sampling from that distribution rather than the empirical one.

      a. If doing it this way, one should of course ensure that the distribution being fit is a good description of the data.

      b. UNTB is a bad fit. SLM is a better fit, and in fact (new result 2) continues to be a good empirical fit even at coarse-grained levels.

      c. Can make statements on using SLM as a null model for these types of cross-scale relationships. Could try arguing that fitting an SLM model per-OTU (instead of resampling the empirical distribution) could offer some advantage if certain properties could be computed analytically from the fit parameters, instead of averaging over multiple computational rounds of resampling.

      Do these two points accurately summarize the manuscript? If so, this presentation avoids the confusion with "prediction". If my summary is missing some important point, the presentation should be revised to clarify the points I appear to have missed.

      Author response: In our manuscript we derive predictions from the gamma distribution, the stationary distribution of the SLM, that require parameters estimated from the data (i.e., mean and variance of relative abundance). These parameters are estimated from the data using normal procedures and then plugged into our predictions that assume the appropriateness of the gamma, returning values that are then compared to estimates from empirical data. Our estimation of the mean and variance does not assume that the empirical distribution following a gamma distribution, but the value returned by our function derived from the gamma distribution (e.g., Eq. 13) does make that assumption.

      To address the reviewer’s broader comment, we believe that following points summarize our manuscript:

      1. The gamma distribution as a stationary solution of the SLM captures macroecological patterns and predicts typical community-level properties (i.e., mean richness and diversity) across phylogenetic and taxonomic scales.

      2. The gamma distribution fails to predict variation in community-level properties (i.e., variance of richness and diversity) across phylogenetic and taxonomic scales. This occurs because the SLM is a mean-field model that does not explicitly include interactions between community members.

      3. Despite the inability to capture interactions, the gamma distribution succeeds at predicting the fine vs. coarse-grain slope for richness, a pattern that had previously been attributed to community member interactions. This result demonstrates that the novelty of a macroecological pattern hinges on one’s choice of null model.

      4. However, the gamma cannot capture the same relationship for diversity. Simulations of the gamma distribution that incorporate correlations between community members are capable of generating reasonable estimates of the slope.

      To address the reviewer’s comments regarding the appropriateness fitted gamma distributions, in our revision we have added fitted gamma distributions to plots of AFDs so that the reader can visually assess the ability of the gamma to describe empirical patterns (Fig. S3, S4).

      We have also obtained predictions for the slope of the fine vs. coarse-grained relationship for community richness using the same form of UNTB used by Madi et al (2020). In our revised manuscript we establish a procedure to infer the single parameter of this model, generate predictions of richness at fine and coarse-grained scales, and then evaluate whether the UNTB is capable of predicting the slope of the fine vs. coarse-grained relationship for richness (Supplementary Information; Figs. S18, 24-28; lines 277-278; 370-380).

      Other/minor comments

      1. The manuscript would be improved with more consistent terminology ("fine vs. coarse-grained relationship"/"the relationship" vs. "diversity slope"). Also, many readers may be used to OTUs referring to the rather fine level of description, as opposed to any chosen level; and could interpret indexing over groups as being in contrast with indexing over OTU's (coarse vs fine). The authors' use is perfectly correct, but keeping a consistent terminology would help.)

      Author response: We have revised our manuscript to specify the “slope” as the “slope of the fine vs. coarse-grained relationship” (e.g., Line 318). We also specify in the Results and in the Methods that we use “fine” and “coarse” as relative terms, keeping with the sliding-scale approach used in Madi et al (2020).

      1. While I appreciate this "slope" is something borrowed from other work, the clarity of the paper might benefit from a cartoon of how one goes from the raw data to the slopes at a particular coarse-graining level. (Optional).

      Author response: We had added a conceptual diagram to the revision (Fig. S20).

      1. The text often colloquially references "the gamma," "predictions of the gamma," etc. This phrasing comes across as sloppy, and the manuscript would be improved by being more specific.

      Author response: We now specify “gamma” as the “gamma distribution” throughout the manuscript.

      1. Equation 6 appears to be missing some subscripts on the x terms (included on the left of the equation).

      Author response: We thank the reviewer for noticing this error and we have corrected it in the revision.

      1. In "Simulating communities of correlated...AFDs", the acronym SAD is not defined.

      Author response: We thank the reviewer for noticing this error and we have corrected it in the revision.

      1. In Figure 2:

      a. Invariant is probably the wrong word for the title, since all the AFD's were rescaled by mean and variance before being compared. Data does support that the gamma distributions are good at describing the AFD's, but as stated in the description it's the general shape that is preserved, not the distribution itself.

      Author response: When we mention the invariance of the AFD we now specify that we mean that the shape of the distribution remained qualitatively invariant.

      b. I'd recommend changing the color coding to something with more contrast, since currently it's impossible to assess the claim that the shape of the distribution collapses.

      Author response: Our coarse-graining procedure is a sequential operation that has no intuitive point that would suggest the use of a contrasting colormap (e.g., if our scale ranged from -1 to 1 then there would be a natural point of contrast at zero).

      c. The legend is missing relevant technical details: How many OTU's were used to make plot a? How many samples?

      Author response: The number of samples was listed in the Materials and Methods (line 523). In the revision we now include a table with the average and total number of OTUs as well as the average number of reads for each environment (Table S1, S2).

      d. In plot b, is the mean relative abundance referring to "mean abundance when observed" or "mean across all samples"?

      Author response: The mean relative abundance is the mean abundance across all sites (line 204) and in the legend of Fig. 2.

      e. Since one argument here is that SLM fits these distributions better than UNTB, if possible it would be nice to see UNTB's failed fits here.

      Author response: A major feature of the UNTB is that the demographic parameters of community members are indistinguishable. Under the SLM, the variation in the mean relative abundance we observe suggests that the carrying capacities of community members vary over multiple orders of magnitude, a result that is incompatible with most forms of the UNTB (x-axis of Fig. 2b). We now mention this point in the revised manuscript (lines 110; 229; 455-471).

      1. In Figure 3:

      a. It is not clear how coarse-graining is included in model fitting. The "Deriving biodiversity measure predictions" section would benefit from including how coarse-graining is incorporated.

      Author response: We predict measures of biodiversity separately at each coarse-grained scale. We now clarify this detail in the revised manuscript (Lines 624-627).

      b. Reference Shannon Diversity in Methods.

      Author response: We now cite Shannon’s diversity.

      c. What is the blue/white color coding in plots a & c? It doesn't have any color key.

      Author response: Figs. 3-6 use a uniform light-to-dark scale for all environments, with each environment having its own color. For example, Fig. 3a contains data from the human gut microbiome. Human gut data were assigned the color aquamarine, so the shade of aquamarine for a given datapoint in Fig. 3a indicates the phylogenetic scale.

      In the revision we now clarify the colorscale in the legend of Fig. 3 and specify that the same scale is used in all subsequent figure legends.

      d. Re: earlier comments, why is richness considered a prediction? (Am I correct in my interpretation that panel b is almost a tautology - counting the number of zeros in the matrix either by rows or by columns - whereas panel d is nontrivial?)

      Author response: Mean richness as a measure of biodiversity depends on the fraction of sites where a given community member is present (i.e., occupancy). The mean relative abundance of a community member and its variation across sites (beta) is clearly related to occupancy, but those two statistics do not give you a prediction of occupancy. Obtaining a prediction of occupancy and, subsequently, richness, requires 1) a probability distribution of abundances (i.e., the gamma) and 2) a probability distribution of sampling (i.e., the Poisson). Using these two pieces of information, we derived a prediction for mean richness (Eq. 13). We then compare the value of richness obtained by plugging in the mean relative abundances, betas, and known number of reads to the observed mean richness obtained from the data.

      e. The lettering of subplots in Figure 3 is not consistent with Figure 4. Figure 3 subplots are also cited incorrectly in paragraph two on page six (lines 251-254).

      Author response: We thank the reviewer for noticing the error and we have corrected it in the revision.

      f. Again, if possible show UNTB predictions in plots a & c.

      Author response: In our revised manuscript we provide extensive descriptions and predictions of mean richness and the slope of the fine vs. coarse-grained relationship for richness using the form of the UNTB used in Madi et al. (2020; Figs. S18, S24 - S29; lines 277-282; 370-380). We then compare the error of these slope predictions to those obtained from the SLM, finding that the SLM generally outperforms UNTB (Figs. S27-S29).

      1. In Figure 4:

      a. What are the color codings in plots a & b?

      Author response: The color scale used in Fig. 4 is identical to the color scale used in Fig. 3. This detail is now specified in the legend of Fig. 4.

      b. What are the two lines of empirical data in plots a & b, and why is one of them dashed?

      Author response: We now specify what the two lines mean in the key within the figure.

      c. Same comment as earlier on predictions and richness.

      Author response: We now specify what the two lines mean in the key within the figure.

      1. In Figure 5:

      a. It wasn't clear to me in the manuscript how the authors generated these plots from the raw data. The manuscript would benefit from a clear cartoon/description of the data pipeline, from raw data to empirical (and analytic) slopes.

      Author response: We have added a conceptual diagram to the revised manuscript (Fig. S20).

      b. Make the figure title more descriptive to better connect it to the figure's objective (the richness slopes relationship is not novel, but the diversity slopes relationship is).

      Author response: We have revised the figure title.

      References

      Camacho-Mateu, J., Lampo, A., Sireci, M., Muñoz, M. Á., & Cuesta, J. A. (2023). Species interactions reproduce abundance correlations patterns in microbial communities (arXiv:2305.19154). arXiv. https://doi.org/10.48550/arXiv.2305.19154

      Grilli, J. (2020). Macroecological laws describe variation and diversity in microbial communities. Nature Communications, 11(1), 4743. https://doi.org/10.1038/s41467-020- 18529-y

      Madi, N., Vos, M., Murall, C. L., Legendre, P., & Shapiro, B. J. (2020). Does diversity beget diversity in microbiomes? eLife, 9, e58999. https://doi.org/10.7554/eLife.58999

      Shoemaker, W. R., Sánchez, Á., & Grilli, J. (2023). Macroecological laws in experimental microbial systems (p. 2023.07.24.550281). bioRxiv. https://doi.org/10.1101/2023.07.24.550281

    1. ChromaticchangesweremonitoredaccordingtotheproceduredescribedinEuropeanStandardEN15886[42]usingtheCIELAB1976method,withthestandardilluminantD65andobserver10◦.ThecolourcoordinatesL*,a*,andb*wererecordedforoneareaoneachcoupon(Ø8mm),beforepaintingandaftercleaningusingaKONICA-MINOLTACM2600dspectrophotometerandSCEdatawerecollected.ColourchangeswerecalculatedandreportedasE,whereE=[(L*)2+(a*)2+(b*)2]½.TheL*=L*−L0*valuedescribesachangeinbrightnessatanygivenstageofcleaning;hence,negativevaluesofL*correspondtocleanedareaswhichreflectlesslightthanthereferenceunpaintedsurface(indexed0).Theotherdifferencesa*=a*−a0*andb*=b*−b0*behavelikewise:positive(negative)valuesofa*andb*indicatethatthecleanedareasaremorered(green)andyellow(blue),respec-tively,thanthereferencesurface.Fivemeasurementsweretakenbyrepositioningtheinstrumentonthesamespoteachtime,andthenaveraged,toimprovedataaccuracy.

      This all makes sense to someone out there. Not me. From what I can gather, this is an explanation of how the researchers quantitatively gather and describe their results.

    1. Author Response

      The following is the authors’ response to the original reviews.

      eLife assessment

      This important study advances our understanding of the ways in which different types of communication signals differentially affect mouse behaviors and amygdala cholinergic/dopaminergic neuromodulation. Researchers interested in the complex interaction between prior experience, sex, behavior, hormonal status, and neuromodulation should benefit from this study. Nevertheless, the data analysis is incomplete at this stage, requiring additional analysis and description, justification, and - potentially - power to support the conclusions fully. With the analytical part strengthened, this paper will be of interest to neuroscientists and ethologists.

      GENERAL COMMENTS ON REVIEWS AND REVISIONS

      Experimental design

      Here we address questions from several reviewers regarding our periods of neuromodulator and behavioral analysis. First, we recognize that the text would benefit from an overview of the experimental structure different from the narrative we provide in the first paragraphs of the Results. We now include this near the beginning for the Materials and Methods (page 17). We further articulate that the 10-minute time periods were dictated by the sampling duration required to perform accurate neurochemical analyses (and to reserve half of the sample in the event of a catastrophic failure of batch-processing samples). Since neurochemical release may display multiple temporal components (e.g., ACh: Aitta-aho et al., 2018) during playback stimulation, and since these could differ across neurochemicals of interest, we decided to collect, analyze, and report in two stimulus periods as well as one Pre-Stim control. We now clarify this in additional text in the Material and Methods (p. 24, lines 20-22; p. 26, lines 17-19). We decided not to include analyses of the post-stimulus period because this is subject to wider individual and neuromodulator-specific effects and because it weakens statistical power in addressing the core question—the change in neuromodulator release DURING vocal playback.

      We also sought to clarify the meaning of the periods “Stim 1” and “Stim 2”; they are two data collection periods, using the same examplar sequences in the same order. We have added statements in the Material and Methods (p. 18, lines 4-7; Fig. caption, p. 39, lines 11-13) to clarify these periods.

      For behavioral analyses, observation periods were much shorter than 10 mins, but the main purpose of behavioral analyses in this report is to relate to the neurochemical data. As a result, we matched the temporal features of the behavioral and neurochemical analyses (p. 22, lines 17-22). We plan a separate report, focused exclusively on a broader set of behavioral responses to playback, that may examine behaviors at a more granular level.

      Data and statistical analyses

      Reviewers 1 and 3 expressed concerns about our normalization of neurochemical data, suggesting that it diminishes statistical power or is not transparent. We note that normalization is a very common form of data transformation that does not diminish statistical power. It is particularly useful for data forms in which the absolute value of the measurement across experiments may be uninformative. Normalization is routine in microdialysis studies, because data can be affected by probe placement and factors affecting neurochemical recovery and processing. Recent examples include:

      Li, Chaoqun, Tianping Sun, Yimu Zhang, Yan Gao, Zhou Sun, Wei Li, Heping Cheng, Yu Gu, and Nashat Abumaria. "A neural circuit for regulating a behavioral switch in response to prolonged uncontrollability in mice." Neuron (2023).

      Gálvez-Márquez, Donovan K., Mildred Salgado-Ménez, Perla Moreno-Castilla, Luis Rodríguez-Durán, Martha L. Escobar, Fatuel Tecuapetla, and Federico Bermudez-Rattoni. "Spatial contextual recognition memory updating is modulated by dopamine release in the dorsal hippocampus from the locus coeruleus." Proceedings of the National Academy of Sciences 119, no. 49 (2022): e2208254119.

      Holly, Elizabeth N., Christopher O. Boyson, Sandra Montagud-Romero, Dirson J. Stein, Kyle L. Gobrogge, Joseph F. DeBold, and Klaus A. Miczek. "Episodic social stress-escalated cocaine self-administration: role of phasic and tonic corticotropin releasing factor in the anterior and posterior ventral tegmental area." Journal of Neuroscience 36, no. 14 (2016): 4093-4105.

      Bagley, Elena E., Jennifer Hacker, Vladimir I. Chefer, Christophe Mallet, Gavan P. McNally, Billy CH Chieng, Julie Perroud, Toni S. Shippenberg, and MacDonald J. Christie. "Drug-induced GABA transporter currents enhance GABA release to induce opioid withdrawal behaviors." Nature neuroscience 14, no. 12 (2011): 1548-1554.

      However, since all reviewers requested raw values of neurochemicals, we provide these in supplementary tables 1-3. The manuscript references these table early in the Results (p. 6, lines 18-19) and in the Material and Methods (p. 27, lines 3-4)

      All reviewers commented on correlation analyses that we presented, with different perspectives. Reviewer 2 questioned the validity of such analyses, performed across experimental groups, while Reviewer 1 pointed out that the analyses were redundant with the GLM. We agree with these criticisms, and note the challenges associated with correlations involving behaviors for which there is a “floor” in the number of observations. As a result, we have removed most correlation analyses from the manuscript. The text and figures have been modified accordingly. Due these changes, we have to decline requests of Reviewer 3 to include many more such analyses. While correlation analyses could still be performed between neurochemicals and behaviors for each group, the relatively small size of each experimental group, the large number of groups, and the even larger numbers of pairings between neurochemicals and behavior, the statistical power is very low. The only correlations we utilize in the manuscript concern the interpretation of our increased acetylcholine levels.

      As part of this revision, we re-ran our statistical analyses on neuromodulators because of a calculation error in 3 animals (regarding baseline values). In a few instances, a significance level changed, but none of these changed a conclusion regarding neuromodulator changes under our experimental conditions.

      Other revisions

      INTRODUCTION: We modified the Introduction to provide both a more general framework and specific gaps in our understanding relating neuromodulators with vocal communication.

      DISCUSSION: We have added material in the first two pages of the Discussion to provide more framework to our conclusions, to address the issues of the temporal aspects of neurochemical release and behavioral observations, and to identify limitations that should be addressed in future studies.

      FIGURES: All figures are now in the main part of the manuscript. We modified most figures in response to reviewer comments. We removed neuromodulator – behavior correlations from several figures. We modified all box plots to ensure that all data points are visible. The visible data points match the numbers reported in figure captions. We brought 5-HIAA data into the main figures reporting on neuromodulator results.

      Public Reviews:

      Reviewer #1 (Public Review):

      The manuscript addresses a fundamental question about how different types of communication signals differentially affect brain states and neurochemistry. In addition, the manuscript highlights the various processes that modulate brain responses to communication signals, including prior experience, sex, and hormonal status. Overall, the manuscript is well-written and the research is appropriately contextualized. The authors are thoughtful about their quantitative approaches and interpretations of the data.

      That being said, the authors need to work on justifying some of their analytical approaches (e.g., normalization of neurochemical data, dividing the experimental period into two periods (as opposed to just analyzing the entire experimental period as a whole)) and should provide a greater discussion of how their data also demonstrate dissociations between neurochemical release in the basolateral amygdala and behavior (e.g., neurochemical differences during both of the experimental periods but behavioral differences only during the first half of the experimental period). The normalization of neurochemical data seems unnecessary given the repeated-measures design of their analysis and could be problematic; by normalizing all data to the baseline data (p. 24), one artificially creates a baseline period with minimal variation (all are "0"; Figures 2, 3 & 5) that could inflate statistical power.

      Please see our general responses to structure of observation periods and normalization of neuromodulator data. Normalization is a common and appropriate procedure in microdialysis studies that does not alter statistical power.

      We have included a section in the Discussion concerning the temporal relationship between behavioral responses and neurochemical changes in response to vocal playback (p. 12, lines 3-17). We note where the linkage is particularly strong (e.g., ACh release and flinching). This points to a need to examine these phenomena with finer temporal resolution, but also with the recognition that the brain circuits driving a behavioral response may extend beyond the BLA.

      The Introduction could benefit from a priori predictions about the differential release of specific neuromodulators based on previous literature.

      We added some material to the Introduction to provide additional rationale for the study. However, we did not attempt to develop predictions for the range of neuromodulators that we sought to test. The literature can lead to opposite predictions for a given neuromodulator. For example, acetylcholine could be associated with both positive and negative valence. Instead, we note in the Introduction the association of both DA and ACh with vocalizations.

      The manuscript would also benefit from a description of space use and locomotion in response to different valence vocalizations.

      We have provided additional descriptions of space use and video tracking data in Material and Methods (p. 23, lines 1-6). We now report a few correlations based on these data in the Results to demonstrate that increased ACh in Restraint males and Mating estrus females was not related to the amount of locomotion (p. 9, lines 8-14).

      Nevertheless, the current manuscript seems to provide some compelling support for how positive and negative valence vocalizations differentially affect behavior and the release of acetylcholine and dopamine in the basolateral amygdala. The research is relevant to broad fields of neuroscience and has implications for the neural circuits underlying social behavior.

      Reviewer #2 (Public Review):

      Ghasemahmad et al. report findings on the influence of salient vocalization playback, sex, and previous experience, on mice behaviors, and on cholinergic and dopaminergic neuromodulation within the basolateral amygdala (BLA). Specifically, the authors played back mice vocalizations recorded during two behaviors of opposite valence (mating and restraint) and measured the behaviors and release of acetylcholine (ACh), dopamine (DA), and serotonin in the BLA triggered in response to those sounds.

      Strength: The authors identified that mating and restraint sounds have a differential impact on cholinergic and dopaminergic release. In male mice, these two distinct vocalizations exert an opposite effect on the release of ACh and DA. Mating sounds elicited a decrease of Ach release and an increase of DA release. Conversely, restraint sounds induced an increase in ACh release and a trend to decrease in DA. These neurotransmission changes were different in estrus females for whom the mating vocalization resulted in an increase of both DA and ACh release.

      Weaknesses: The behavioral analysis and results remain elusive, and although addressing interesting questions, the study contains major flaws, and the interpretations are overstating the findings.

      Although Reviewer 2 raises several valid issues that we have addressed in our response and revision, we believe that none represent “major flaws” in the study that challenge the validity of our central conclusions. In brief, we will:

      --provide enhanced description of behaviors (pp. 22-23 and Table 1)

      --clarify / modify box-plot representations of data (p 28. Lines 3-9)

      --point to our methods that describe corrections for multiple comparisons (p. 27; lines 15-16)

      --revise figures to clarify sample size (Figs. 3-6)

      Reviewer #3 (Public Review):

      Ghasemahmad et al. examined behavioral and neurochemical responses of male and female mice to vocalizations associated with mating and restraint. The authors made two significant and exciting discoveries. They revealed that the affective content of vocalizations modulated both behavioral responses and the release of acetylcholine (ACh) and dopamine (DA) but not serotonin (5-HIAA) in the basolateral amygdala (BLA) of male and female mice. Moreover, the results show sex-based differences in behavioral responses to vocalizations associated with mating. The authors conclude that behavior and neurochemical responses in male and female mice are experience-dependent and are altered by vocalizations associated with restraint and mating. The findings suggest that ACh and DA release may shape behavioral responses to context-dependent vocalizations. The study has the potential to significantly advance our understanding of how neuromodulators provide internal-state signals to the BLA while an animal listens to social vocalizations; however, multiple concerns must be addressed to substantiate their conclusions.

      Major concerns:

      1) The authors normalized all neurochemical data to the background level obtained from a single pre-stimulus sample immediately preceding playback. The percentage change from the background level was calculated based on a formula, and the underlying concentrations were not reported. The authors should report the sample and background concentrations to make the results and analyses more transparent. The authors stated that NE and 5-HT had low recovery from the mouse brain and hence could not be tracked in the experiment. The authors could be more specific here by relating the concentrations to ACh, DA, and 5-HIAA included in the analyses.

      Please see our general statement regarding normalization of neurochemical data. We have added supplemental tables that shows concentrations of dopamine, acetylcholine, 5-HIAA. We do not report serotonin or noradrenalin since these were below the detection threshold.

      2) For the EXP group, the authors stated that each animal underwent 90-min sessions on two consecutive days that provided mating and restraint experiences. Did the authors record mating or copulation during these experiments? If yes, what was the frequency of copulation? What other behaviors were recorded during these experiences? Did the experiment encompass other courtship behaviors along with mating experiences? Was the female mouse in estrus during the experience sessions?

      In the mating experience, mounting or attempted mounting was required for the animal to be included in subsequent testing. Since the session lasted 90 minutes, more general courtship behavior was likely. However, we did not record detailed behaviors or track estrous stage for the mating experience. See p. 21, line 20-22.

      3) For the mating playback, the authors stated that the mating stimulus blocks contained five exemplars of vocal sequences emitted during mating interactions. The authors should clarify whether the vocal sequences were emitted while animals were mating/copulating or when the male and female mice were inside the test box. If the latter was the case, it might be better to call the playback "courtship playback" instead of "mating playback".

      We have modified the Results (p. 5, lines 18-20) and Materials and Methods (p. 21, lines 8-15) to clarify our meaning. We continue to use the term “mating” because this refers to a specific set of behaviors associated with mounting and copulation, rather than the more general term “courtship”. We also indicate that we based these behaviors on previous work (e.g., Gaub et al., 2016).

      4) Since most differences that the authors reported in Figure 3 were observed in Stim 1 and not in Stim 2, it might be better to perform a temporal analysis - looking at behaviors and neurochemicals over time instead of dividing them into two 10-minute bins. The temporal analysis will provide a more accurate representation of changes in behavior and neurochemicals over time.

      Please see our general response to the structuring of experimental periods. The 10-min periods are the minimum for the neurochemical analyses, and we adopted the same periods for behavioral analyses to match the two types of observations. Our repeated measures analysis is a form of temporal analysis, since it compares values in three observation periods.

      5) In Figures 2 and 3, the authors show the correlation between Flinching behavior and ACh concentration. The authors should report correlations between concentrations of all neurochemicals (not just ACh) and all behaviors recorded (not just Flinching), even if they are insignificant. The analyses performed for the stim 1 data should also be performed on the stim 2 data. Reporting these findings would benefit the field.

      Please see general comments regarding correlation analyses. We removed almost all such analyses and references to them from the manuscript based on concerns of the other reviewers.

      6) The mice used in the study were between p90 - p180. The mice were old, and the range of ages was considerable. Are the findings correlated with age? The authors should also discuss how age might affect the experiment's results.

      Our p90-p180 mice are not “old”. CBA/CaJ mice display normal hearing for at least 1 year (Ohlemiller, Dahl, and Gagnon, JARO 11: 605-623, 2010) and adult sexual and social behavior throughout our observation period. They are sexually mature adults, appropriate for this study. We decline to perform correlation analyses with age, both because this was not a question for this study and because the very large number of correlations, for each experimental group (as requested by reviewer #2), render this approach statistically problematic.

      7) The authors reported neurochemical levels estimated as the animals listened to the sounds played back. What about the sustained effects of changes in neurochemicals? Are there any potential long-term effects of social vocalizations on behavior and neurochemical levels? The authors might consider discussing long-term effects.

      We have not included discussion of long term effects of neuromodulatory release, both because our data analysis doesn’t address it (see response to Comment #10) and because we desired to keep the Discussion focused on topics more closely related to the results.

      8) Histology from a single recording was shown in supplementary figure 1. It would benefit the readers if additional histology was shown for all the animals, not just the colored schematics summarizing the recording probe locations. Further explanation of the track location is also needed to help the readers. Make it clear for the readers which dextran-fluorescein labeling image is associated with which track in the schematic.

      Based on the recent publications cited in our overall response to reviewer comments about statistical methods, our reporting of histological location of microdialysis exceeds the standard. We believe that the inclusion of all histology is unnecessary and not particularly helpful. Raw photomicrographs do not always illustrate boundaries, so interpretation is required. However, we added a second photomicrograph example and we identified which tracks correspond to these photomicrographs (see Figure 2; now in main body of manuscript).

      9) The authors did not control for the sounds being played back with a speaker. This control may be necessary since the effects are more pronounced in Stim 1 than in Stim 2. Playing white noise rather than restraint or courtship vocalizations would be an excellent control. However, the authors could perform a permutation analysis and computationally break the relationship between what sound is playing and the neurochemical data. This control would allow the authors to show that the actual neurochemical levels are above or below chance.

      We considered a potential “control” stimulus in our experimental design. We concluded, based on our previous work (e.g., Grimsley et al., 2013; Gadziola et al., 2016), that white noise is not or not necessarily a neutral stimulus and therefore the results would not clarify the responses to the two vocal stimuli. Instead, we opted to use experience as a type of control. This control shows very clearly that temporal patterns and across-group differences in neurochemical response to playback disappear in the absence of experience with the associated behavior.

      10) The authors indicated that each animal's post-vocalization session was also recorded. No data in the manuscript related to the post-vocalization playback period was included. This omission was a missed opportunity to show that the neurochemical levels returned to baseline, and the results were not dependent on the normalization process described in major concern #1. The data should be included in the manuscript and analyzed. It would add further support for the model described in Figure 6.

      We decided not to include analyses of the post-stimulus period because this period is subject to wider individual and neuromodulator-specific effects and because it weakens statistical power in addressing the core question—the change in neuromodulator release DURING vocal playback. We agree that the general question is of interest to the field, but we don’t think our study is best designed to answer that question.

      11) The authors could use a predictive model, such as a binary classifier trained on the CSF sampling data, to predict the type of vocalizations played back. The predictive model could support the conclusions and provide additional support for the model in Figure 6.

      We recognize that a binary classifier could provide an interesting approach to support conclusions. However, we do not believe that the sample size per group is sufficient to both create and test the classifier.

      Reviewer #1 (Recommendations For The Authors):

      Major comments:

      • Introduction: It would be useful to set up an experimental framework before delving into the results. What are the predictions about specific neuromodulators based on previous literature?

      Because this narrative is laid out in the first two paragraphs of the Results, which immediately follow the Introduction, we believe that additional text in the Introduction on the experimental framework is redundant. As stated above, detailing predictions for a range of neuromodulators would make for a long and not particularly illuminating Introduction. We instead have related our findings to more general understanding of DA and ACh in the Discussion.

      • There really isn't a major difference in stimuli during the "Stim 1" and "Stim 2" phases, and it's not clear why the authors divided the experimental period into two phases. Therefore, the authors need to justify their experimental approach. For example, the authors could first anecdotally mention that behavioral responses to playbacks seem to be larger in the first half of the playbacks than during the second half, therefore they individually analyzed each half of the experimental period. Or adopt a different approach to justify their design. Overall, the analytical approach is reasonable but it is currently not justified.

      See general comment for analysis periods. As noted, we clarified these issues in several locations with Materials and Methods (pp. 24, lines 20-22; p. 26, lines 17-19). We also sought to clarify the meaning of the periods “Stim 1” and “Stim 2”; they are two data collection periods, using the same examplar sequences in the same order. We have added statements in the Material and Methods (p. 18, lines 4-7; Fig. caption, p. 39, lines 11-13).

      • The normalization of neurochemical data seems problematic and unnecessary. By normalizing all data to the baseline data (p. 24), one artificially creates a baseline period with minimal variation (all are "0"; Figures 2, 3 & 5) and this has implications for statistical power. Because the analysis is a within-subjects analysis, this normalization is not necessary for the analysis itself. It can be useful to normalize data for visualization purposes, but raw data should be analyzed. Indeed, behavioral data are qualitatively similar to the neurochemical data, and those data are not normalized to baseline values.

      Please see our general comment on this issue. We believe normalization does not affect statistical power and is both the standard way and an appropriate way to analyze microdialysis results. We include concentrations of ACh, DA, and 5-HIAA in supplementary tables?

      • The authors should include a discussion (in the Discussion section) of how behavior and neurochemical release are associated during the first half of the experimental session but not in the second half (e.g., differences in Ach and DA release between mating and restraint groups during stim 1 and 2, but behavioral differences only during stim 1).

      We have included a section in the Discussion concerning the temporal relationship between behavioral responses and neurochemical changes in response to vocal playback. We note that the linkage is particularly strong in some cases (e.g., ACh release and flinching). This points to a need to examine these phenomena with finer temporal resolution, but also with the recognition that the brain circuits driving a behavioral response may extend beyond the BLA.

      Minor comments:

      • Keywords: add "serotonin" (even though there are no significant differences on 5-HIAA, people interested in serotonin would find this interesting).

      Added to keywords list.

      • Do the authors collect data on the vocalizations of mice in response to these playbacks?

      We monitored vocalizations during playback, noting that vocalizations–especially “Noisy” vocalization–were common. However, we did not record vocalizations and are therefore unable quantify our observations.

      • First line of page 7: readers do not know about "stim 1" and "stim 2". Therefore, the authors need to describe their approach to analyzing behavior and neurochemical release.

      We first introduce these terms earlier, citing Figure 1D,E. We have added some additional wording for further clarification. page 7, lines 4-5.

      • Make sure citations are uniformly formatted (e.g., Inconsistencies in: "As male and female mice emit different vocalizations during mating (Finton et al., 2017; J. M. S. Grimsley et al., 2013; Neunuebel et al., 2015; Sales (née Sewell), 1972)").

      We have reviewed and corrected citations throughout the manuscript.

      • Last paragraph of page 7: "attending behavior" has not been defined yet.

      Table 1 contains our description of the behaviors analyzed in this study. We have now inserted a reference to Table 1 earlier in the Results (p. 6, line 12).

      • Figure 2E and 3G: I find these correlations to be redundant with the GLMs. This is because the significant relationship is likely to be driven by group differences in behavior and in neurochemical release.

      Please see general comments regarding correlation analyses. We removed such analyses and references to them from the manuscript.

      • Page 2, 2nd paragraph, 2nd sentence: this paragraph seems to be rooted in comparing and contrasting experienced and inexperienced mice, so there should be explicit comparisons in each sentence. For example, the 2nd sentence should read: "Whereas EXP estrus females demonstrated increased flinching behaviors in response to mating vocalizations, INEXP ....". This paragraph overall could use some refining.

      We believe this refers to page 9. We have revised the paragraph to clarify our findings (Beginning p. 9, line 23).

      • Page 9: "Further, there were no significant differences across groups during Stim 1 or Stim 2 periods. These results contrast sharply with those from all EXP groups, in which both ACh and DA release changed significantly during playback (Figs. 2C, 2D, 3E, 3F)." While I understand their perspective, this is misleading because changes were only observed during the Stim 1 period.

      We have slightly revised the wording in this paragraph, because the restraint males did not show significant ACh decreases. However, we do not believe our statements mislead readers just because some changes are observed in only one of the stimulation periods (p 10, lines 13-16).

      • Last paragraph of page 14: it would be useful to mention the increase in flinching in experienced females in response to mating vocalizations.

      We have added a sentence in this paragraph relating flinching in estrus females to increased ACh (p. 15, lines 18-20).

      • Was there a full analysis of locomotion in response to playbacks? I see that locomotion was correlated with neurochemical release but was it different in response to different stimuli? Were there changes to the part of the arena that mice occupied in response to restraint vs. mating vocalizations? Given their methods section, it would be useful for the authors to mention the results of the analyses of these aspects of movement.

      We have provided additional descriptions of space use and video tracking data in Material and Methods (p. 23, lines 1-6). We now report additional results associated with these analyses (p. 8, lines 13-15; p. 9, lines 8-14).

      • I believe that each experimental mouse only heard one of the stimuli (given the analytical approach). Because it is plausible to measure neurochemical release in response to both types of stimuli, I encourage the authors to be more explicit about this aspect of the experimental design (e.g., mention in Results section).

      Sentence modified to read: “Each mouse received playback of either the mating or restraint stimuli, but not both: same-day presentation of both stimuli would require excessively long playback sessions, the condition of the same probe would likely change on subsequent days, and quality of a second implanted probe on a subsequent day was uncertain.” (p. 7, lines 5-9).

      • Figure 1A and 1B: add labels to the panels so readers don't have to read the legend to know what spectrogram is associated with what context.

      We added these labels to Figure 1.

      • Table 1: in the definition of "still and alert", should this mention "abrupt attending" instead of "abrupt freezing"? The latter isn't described.

      Yes, we intended “abrupt attending”, and now indicated that in Table 1

      Reviewer #2 (Recommendations For The Authors):

      Major comments:

      • The authors report they performed manual behavioral analysis, and provide a table defining the different behaviors. However, it remains unclear how some of these behaviors were detected (such as still-and-alert events). A thorough description of the criteria used to define these events needs to be provided.

      We have modified some descriptions of manually analyzed behaviors in Table 1, and have added additional description of how we developed this set of behaviors for analysis in the study (pp. 22-23).

      • The box plots do not appear to represent the "minimum, first quartile, median, third quartile, and maximum values." as specified on page 24 (Methods). Indeed, the individual data points sometimes do not reach the max or min of the bar plot, and sometimes are way beyond them.

      We used the “inclusive median” function in Excel to generate final boxplots. These boxplots will sometimes result in a data point being placed outside of the whiskers. SPSS considers these to be “outliers”, but our GLM analysis includes these values. We describe this in Data Analysis section of Materials and Methods (p. 28, lines 3-9)

      • Some of the data are replicated in different Figures: Figure 2A and Figure 3C. While this is acceptable, the authors did not correct for multiple comparisons (dividing the p value by the number of comparisons).

      Our analysis included corrections for multiple comparisons, as we have indicated on p. 27, lines 15-16.

      • Overall, the sample sizes are too small (for example in Figure 3, non-estrus females are at n=3), and are different in experiments where they should be equal (Figure 2B: mating stim 1 is at n=5 and mating stim 2 is at n=3).

      We apologize that sample sizes were not properly displayed in figures. Please note that sample sizes are identified in the figure captions. For neuromodulator data, all sample sizes are at least 7. For behavioral data, the minimum sample size is 5. We have revised Figures 3-6 to ensure that all data points are visible.

      • It remains unclear why the impact of mating vocalizations has been tested only in males.

      We assume the reviewer meant that only males were tested in restraint. We now indicate that our preliminary evidence indicated no difference in behavioral responses to restraint vocalization between males and females, so we opted to perform the neurochemical analysis for restraint only in males (page 22 lines 4-5). If there were no limitations to time and cost, we would have preferred to test responses to restraint in females as well. We note that such inclusion would have added up to 4 experimental groups (estrus and non-estrus groups in both EXP and INEXP groups).

      • The correlation between the number of flinching and ACh release changes (Figure 2E) visually appears to be opposite between mating and restraint playbacks. The authors should perform independent correlations for these 2 playbacks.

      Please see general comments regarding correlation analyses. We removed such analyses and references to them from the manuscript.

      • The authors state that their findings "indicate that behavioral responses to salient vocalizations result from interactions between sex of the listener or context of vocal stimuli with the previous behavioral experience associated with these vocalizations.". However, in male mice, they do not report any difference in previous experience on flinching for both restraint and mating sounds, as well as no difference in rearing for the restrain sounds (Figure 4A-B). Thus, the discussion of these results should be completely revisited.

      We revised the paragraph in question (p. 9, line 22 through p. 10, line 9). For instance, we note that significant differences between EXP male-mating and male-restraint flinching do not exist between the INEXP groups. We believe that the last sentence correctly summarizes findings described in this paragraph.

      • For serotonin experiments in Figure S2 there are strong outliers (150% increase in 5HIAA release). Did the authors correlate these levels with the behavior of the animals?

      Outliers are identified by the Excel function that generated the boxplots, but we have no reason to consider these as outliers and exclude them. As noted above, we have clarified that these “outliers” are the result of the Excel function in the Materials and Methods (p. 28, lines 3-9) and we have revised the plotting of data points

      Minor comments:

      • Mating vocalization playback is mainly emitted by males, thus, instead of a positive valence signal, this could also be interpreted as a competitive signal to other males.

      There is support in the literature for viewing our mating stimulus as having positive valence. Gaub et al., 2016 describe the emission of stepped calls, lower frequency harmonics, and increased sound level as indicators of “positive emotion”. We have shown (Grimsley et al, 2013) that the female LFH vocalization can be highly attractive to male mice, under the right conditions, indicating something like “sex is happening”. The inclusion of both the male and female vocalizations in our stimuli was a key piece of our experimental design, based on our understanding of the contributions of both vocalizations to the meaning of the overall acoustic experience.

      • Figure 1 should include panel titles.

      No change. This information is available in the Figure caption.

      • n=31 should be indicated in the EXP group.

      We’re not sure where the reviewer is referring to this value.

      • The color legend of Figure 1E is absent, making the Figure not understandable.

      We added text in the Figure 1 caption to indicate that each color represents a different exemplar. We don’t think a legend provides additional useful information.

      • The point of making two blocks (stim 1 and stim2) should be stated more clearly.

      Please see general statement regarding experimental blocks. We have modified our description of these in an Experimental overview section in the Material and Methods.

      • Including raw data of micro-dialysis in the supplementary figures would allow assessment of the variability and quality of the measurements.

      We have added concentrations of neurochemicals in supplemental tables 1-3.

      • Baseline (prestimulus) number of flinch and rearing should systematically be indicated (missing in Figure 4).

      The focus in this figure is on the differences that occur in Stim 1 values. There are no differences between EXP and INEXP animals of any group during the Pre-Stim period. We now state that in the Figure 4 caption.

      • Discussion: "increase in AMPA/NMDA currents". We believe the authors are referring to the ratio of AMPA to NMDA currents. This sentence should be reformulated.

      These are modified to refer to “… the AMPA/NMDA current ratio…” in two locations in the Discussion (p. 14, lines 8-9; p. 15, line 4)

      • Overall the discussion is very speculative and should rely more on the data.

      We believe that the Discussion provides appropriate speculation that is based on our experimental data and previous literature. We have added a paragraph to identify limitations of our findings and recommendations of future experiments to resolve some issues (p. 12, lines 3-17)

      Reviewer #3 (Recommendations For The Authors):

      Minor concerns:

      1) The authors stated that USVs are most likely to be emitted by males, and LFH are likely to be emitted by females. However, Oliveira-Stahl et al. 2023, Matsumoto et al. 2022, Warren et al. 2018, Heckman et al. 2017, Neunuebel et al., 2015 showed that females also emit USVs. The authors should mention that USVs are emitted by both males and females and discuss how the sex of the vocalizing animal (both males and females) can influence neuromodulator release.

      The reviewer slightly mis-stated the wording of our text, changing the meaning significantly. Our wording is “These sequences included ultrasonic vocalizations (USVs) with harmonics, steps, and complex structure, mostly emitted by males, and low frequency harmonic calls (LFHs) emitted by females (Fig. 1A,C)…” This phrasing is correct and carefully chosen. The Discussion in Oliveira-Stahl et al 2023 (p. 10-11) supports our statement: “The exact fraction of USVs emitted by females as concluded in all previous studies on dyadic courtship has varied, ranging from 18%, 17.5%, and 16% to 10.5% in the present study…”.

      2) The authors should explain why ECF from BLA was collected unilaterally from the left hemisphere.

      p. 23, lines 9-11: We inserted a sentence to explain why we targeted the BLA unilaterally. “Since both left and right amygdala are responsive to vocal stimuli in human and experimental animal studies (Wenstrup et al., 2020), we implanted microdialysis probes into the left amygdala to maintain consistency with other studies in our laboratory..” Beyond that, the choice was arbitrary.

      3) The authors said each animal recovered in its home cage for four days before the playback experiment. A 4-day period may not be sufficient for every animal to recover from surgery, so the authors should describe how a mouse's recovery was assessed.

      p. 23, lines 20-23: We provide more description about the recovery and how it was assessed. Except for a few animals that were not included in the experiments, all animals recovered within 4 days.

      4) The authors stated that each animal was exposed to 90-min sessions with mating and restraint behaviors in a counterbalanced design. This description for Figure 1D should also include the duration of the mating and restraint experience.

      The Results that immediately precede citation to this figure include this information.

      5) The authors stated, "Data are reported only from mice with more than 75% of the microdialysis probe implanted within the BLA". What are the implications of having 25% of the probe outside the BLA? The authors should shed more light on this by discussing this issue as it relates to the findings and commenting on where the other 25% of the probe was located.

      We inserted a sentence to explain the rationale for this inclusion criterion. “We verified placement of microdialysis probes to minimize variability that could arise because regions surrounding BLA receive neurochemical inputs from different sources (e.g., cholinergic inputs to putamen and central amygdala).” (p. 25, lines 21-23).

      All brain regions that surround BLA, dorsal, medial, ventral, or lateral, could have been sampled by the “other” 25%. Some of these, e.g., the central amygdala or caudate-putamen, have different sources of cholinergic input that may not have the same release pattern. We do not think it is worthy of further speculation in the Discussion. Due to the high cost of the neurochemical analysis, we often did not process the neurochemistry data if histology indicated that a probe missed the BLA target.

      6) The authors confirmed that the estrus stage did not change during the experiment day by evaluating and comparing estrus prior to and after data collection. This strategy was a fantastic experimental approach, but the authors should have discussed the results. How did the results the authors included change when the females were in estrus before but not after data collection? What percentage of females started in estrus but ended in metestrus? Assuming that some females changed estrus state, were these animals excluded from the analyses?

      All animals were in the same estrus state at the beginning and end of the playback session.

      7). Authors cite Neunuebel et al., 2015 for the sentence "As male and female mice emit different vocalizations during mating". However, Neunuebel et al., 2015 showed vocalizations emitted during chasing--not mating. If mating is a general term for courtship, then this reference is appropriate, but see major concern #3.

      In the Results (p. 8, line 5), we changed the phrasing to “courtship and mating” to include the Neunubel et al study.

      As we indicate in our response to Public Comment #3, we have modified the Results (p. 5, lines 18-20) and Materials and Methods (p. 21, lines 8-15) to clarify our meaning. We continue to use the term “mating” because this refers to a specific set of behaviors associated with mounting and copulation, rather than the more general term “courtship”. We also indicate that we based these behaviors on previous work (e.g., Gaub et al., 2016).

      8) Authors interpret Figure 3F as DA release showed a "consistent" increase during mating playback across all three experimental groups. However, the increase in the estrus female group is inconsistent, as seen in the graph. This verbiage should be reworded to describe the data more accurately.

      p. 8, line 23 “consistent” was deleted.

      9) In all the box plots, multiple data points overlay each other. A more transparent way of showing the data would be adding some jitter to the x value to make each data point visible. The mean (X's) in Figure 3D (pre-stim mating and mating estrus) are difficult to see, as are all the data points in mating non-estrus. Adding all the symbols to the figure legend or a key in the figure instead of the method section would aid the reader and make the plots easier to interpret

      We have revised the boxplots to ensure that all data points are visible.

      10) Some verbiage used in the discussion should be toned down. For example, "intense" experiences and "emotionally charged" vocalizations should be removed.

      We have not changed these terms, which we believe are appropriate to describe these experiences and vocalizations.

      11) The authors include "Emotional Vocalizations" in the title. It would be beneficial if the authors included more detail and references in the introduction to help set up the emotional content of vocalizations. It may benefit a broader readership as typically targeted by eLife.

      We now cite Darwin and some more recent publications that articulate the general understanding that social vocalizations carry emotional content.

    1. Reviewer #1 (Public Review):

      This valuable study demonstrates a novel mechanism by which implicit motor adaptation saturates for large visual errors in a principled normative Bayesian manner. Additionally, the study revealed two notable empirical findings: visual uncertainty increases for larger visual errors in the periphery, and proprioceptive shifts/implicit motor adaptation are non-monotonic, rather than ramp-like. This study is highly relevant for researchers in sensory cue integration and motor learning. However, I find some areas where statistical quantification is incomplete, and the contextualization of previous studies to be puzzling.

      Issue #1: Contextualization of past studies.

      While I agree that previous studies have focused on how sensory errors drive motor adaptation (e.g., Burge et al., 2008; Wei and Kording, 2009), I don't think the PReMo model was contextualized properly. Indeed, while PReMo should have adopted clearer language - given that proprioception (sensory) and kinaesthesia (perception) have been used interchangeably, something we now make clear in our new study (Tsay, Chandy, et al. 2023) - PReMo's central contribution is that a perceptual error drives implicit adaptation (see Abstract): the mismatch between the felt (perceived) and desired hand position. The current paper overlooks this contribution. I encourage the authors to contextualize PReMo's contribution more clearly throughout. Not mentioned in the current study, for example, PReMo accounts for the continuous changes in perceived hand position in Figure 4 (Figure 7 in the PReMo study).

      There is no doubt that the current study provides important additional constraints on what determines perceived hand position: Firstly, it offers a normative Bayesian perspective in determining perceived hand position. PReMo suggests that perceived hand position is determined by integrating motor predictions with proprioception, then adding a proprioceptive shift; PEA formulates this as the optimal integration of these three inputs. Secondly, PReMo assumed visual uncertainty to remain constant for different visual errors; PEA suggests that visual uncertainty ought to increase (but see Issue #2).

      Issue #2: Failed replication of previous results on the effect of visual uncertainty.

      2a. A key finding of this paper is that visual uncertainty linearly increases in the periphery; a constraint crucial for explaining the non-monotonicity in implicit adaptation. One notable methodological deviation from previous studies is the requirement to fixate on the target: Notably, in the current experiments, participants were asked to fixate on the target, a constraint not imposed in previous studies. In a free-viewing environment, visual uncertainty may not attenuate as fast, and hence, implicit adaptation does not attenuate as quickly as that revealed in the current design with larger visual errors. Seems like this current fixation design, while important, needs to be properly contextualized considering how it may not represent most implicit adaptation experiments.

      2b. Moreover, the current results - visual uncertainty attenuates implicit adaptation in response to large, but not small, visual errors - deviates from several past studies that have shown that visual uncertainty attenuates implicit adaptation to small, but not large, visual errors (Tsay, Avraham, et al. 2021; Makino, Hayashi, and Nozaki, n.d.; Shyr and Joshi 2023). What do the authors attribute this empirical difference to? Would this free-viewing environment also result in the opposite pattern in the effect of visual uncertainty on implicit adaptation for small and large visual errors?

      2c. In the current study, the measure of visual uncertainty might be inflated by brief presentation times of comparison and referent visual stimuli (only 150 ms; our previous study allowed for a 500 ms viewing time to make sure participants see the comparison stimuli). Relatedly, there are some individuals whose visual uncertainty is greater than 20 degrees standard deviation. This seems very large, and less likely in a free-viewing environment.

      2d. One important confound between clear and uncertain (blurred) visual conditions is the number of cursors on the screen. The number of cursors may have an attenuating effect on implicit adaptation simply due to task-irrelevant attentional demands (Parvin et al. 2022), rather than that of visual uncertainty. Could the authors provide a figure showing these blurred stimuli (gaussian clouds) in the context of the experimental paradigm? Note that we addressed this confound in the past by comparing participants with and without low vision, where only one visual cursor is provided for both groups (Tsay, Tan, et al. 2023).

      Issue #3: More methodological details are needed.

      3a. It's unclear why, in Figure 4, PEA predicts an overshoot in terms of perceived hand position from the target. In PReMo, we specified a visual shift in the perceived target position, shifted towards the adapted hand position, which may result in overshooting of the perceived hand position with this target position. This visual shift phenomenon has been discovered in previous studies (e.g., (Simani, McGuire, and Sabes 2007)).

      3b. The extent of implicit adaptation in Experiment 2, especially with smaller errors, is unclear. The implicit adaptation function seems to be still increasing, at least by visual inspection. Can the authors comment on this trend, and relatedly, show individual data points that help the reader appreciate the variability inherent to these data?

      3c. The same participants were asked to return for multiple days/experiments. Given that the authors acknowledge potential session effects, with attenuation upon re-exposure to the same rotation (Avraham et al. 2021), how does re-exposure affect the current results? Could the authors provide clarity, perhaps a table, to show shared participants between experiments and provide evidence showing how session order may not be impacting results?

      3d. The number of trials per experiment should be detailed more clearly in the Methods section (e.g., Exp 4). Moreover, could the authors please provide relevant code on how they implemented their computational models? This would aid in future implementation of these models in future work. I, for one, am enthusiastic to build on PEA.

      3f. In addition to predicting a correlation between proprioceptive shift and implicit adaptation on a group level, both PReMo and PEA (but not causal inference) predict a correlation between individual differences in proprioceptive shift and proprioceptive uncertainty with the extent of implicit adaptation (Tsay, Kim, et al. 2021). Interestingly, shift and uncertainty are independent (see Figures 4F and 6C in Tsay et al, 2021). Does PEA also predict independence between shift and uncertainty? It seems like PEA does predict a correlation.

      References:

      Avraham, Guy, Ryan Morehead, Hyosub E. Kim, and Richard B. Ivry. 2021. "Reexposure to a Sensorimotor Perturbation Produces Opposite Effects on Explicit and Implicit Learning Processes." PLoS Biology 19 (3): e3001147.<br /> Makino, Yuto, Takuji Hayashi, and Daichi Nozaki. n.d. "Divisively Normalized Neuronal Processing of Uncertain Visual Feedback for Visuomotor Learning."<br /> Parvin, Darius E., Kristy V. Dang, Alissa R. Stover, Richard B. Ivry, and J. Ryan Morehead. 2022. "Implicit Adaptation Is Modulated by the Relevance of Feedback." BioRxiv. https://doi.org/10.1101/2022.01.19.476924.<br /> Shyr, Megan C., and Sanjay S. Joshi. 2023. "A Case Study of the Validity of Web-Based Visuomotor Rotation Experiments." Journal of Cognitive Neuroscience, October, 1-24.<br /> Simani, M. C., L. M. M. McGuire, and P. N. Sabes. 2007. "Visual-Shift Adaptation Is Composed of Separable Sensory and Task-Dependent Effects." Journal of Neurophysiology 98 (5): 2827-41.<br /> Tsay, Jonathan S., Guy Avraham, Hyosub E. Kim, Darius E. Parvin, Zixuan Wang, and Richard B. Ivry. 2021. "The Effect of Visual Uncertainty on Implicit Motor Adaptation." Journal of Neurophysiology 125 (1): 12-22.<br /> Tsay, Jonathan S., Anisha M. Chandy, Romeo Chua, R. Chris Miall, Jonathan Cole, Alessandro Farnè, Richard B. Ivry, and Fabrice R. Sarlegna. 2023. "Implicit Motor Adaptation and Perceived Hand Position without Proprioception: A Kinesthetic Error May Be Derived from Efferent Signals." BioRxiv. https://doi.org/10.1101/2023.01.19.524726.<br /> Tsay, Jonathan S., Hyosub E. Kim, Darius E. Parvin, Alissa R. Stover, and Richard B. Ivry. 2021. "Individual Differences in Proprioception Predict the Extent of Implicit Sensorimotor Adaptation." Journal of Neurophysiology, March. https://doi.org/10.1152/jn.00585.2020.<br /> Tsay, Jonathan S., Steven Tan, Marlena Chu, Richard B. Ivry, and Emily A. Cooper. 2023. "Low Vision Impairs Implicit Sensorimotor Adaptation in Response to Small Errors, but Not Large Errors." Journal of Cognitive Neuroscience, January, 1-13.

    1. Author Response

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public Review):

      In this manuscript, the authors explore the effects of DNA methylation on the strength of regulatory activity using massively parallel reporter assays in cell lines on a genome-wide level. This is a follow-up of their first paper from 2018 that describes this method for the first time. In addition to adding more indepth information on sequences that are explored by many researchers using two main methods, reduced bisulfite sequencing and sites represented on the Illumina EPIC array, they now show also that DNA methylation can influence changes in regulatory activity following a specific stimulation, even in absence of baseline effects of DNA methylation on activity. In this manuscript, the authors explore the effects of DNA methylation on the response to Interferon alpha (INFA) and a glucocorticoid receptor agonist (dexamethasone). The authors validate their baseline findings using additional datasets, including RNAseq data, and show convergences across two cell lines. The authors then map the methylation x environmental challenge (IFNA and dex) sequences identified in vitro to explore whether their methylation status is also predictive of regulatory activity in vivo. This is very convincingly shown for INFA response sequences, where baseline methylation is predictive of the transcriptional response to flu infection in human macrophages, an infection that triggers the INF pathways.

      Thank you for your strong assessment of our work!

      The extension of the functional validity of the dex-response altering sequences is less convincing.

      We agree. We note that genes close to dex-specific mSTARR-seq enhancers tend to be more strongly upregulated after dex stimulation than those near shared enhancers, which parallels our results for IFNA (lines 341-344). However, there is unfortunately no comparable data set to the human flu data set (i.e., with population-based whole genome-bisulfite sequencing data before and after dex challenge), so we could not perform a parallel in vivo validation step. We have added this caveat to the revised manuscript (lines 555-557).

      Sequences altering the response to glucocorticoids, however, were not enriched in DNA methylation sites associated with exposure to early adversity. The authors interpret that "they are not links on the causal pathway between early life disadvantage and later life health outcomes, but rather passive biomarkers". However, this approach does not seem an optimal model to explore this relationship in vivo. This is because exposure to early adversity and its consequences is not directly correlated with glucocorticoid release and changes in DNA methylation levels following early adversity could be related to many physiological mechanisms, and overall, large datasets and meta-analyses do not show robust associations of exposure to early adversity and DNA methylation changes. Here, other datasets, such as from Cushing patients may be of more interest.

      Thank you for making these important points. We have expanded the set of caveats regarding the lack of enrichment of early adversity-reported sites in the mSTARR-data set (lines 527-533). Specifically, we note that the relationship between early adversity and glucocorticoid physiology is complex (e.g., Eisenberger and Cole, 2012; Koss and Gunnar, 2018) and that dex challenge models one aspect of glucocorticoid signaling but not others (e.g., glucocorticoid resistance). Nevertheless, we also see little evidence for enrichment of early adversity-associated sites in the mSTARR data set at baseline, independently of the dex challenge experiment (lines 483-485; Figure 4).

      We also agree that large data sets (e.g., Houtepen et al., 2018; Marzi et al., 2018) and reviews (e.g., Cecil et al., 2020) of early adversity and DNA methylation in humans show limited evidence of associations between early adversity and DNA methylation levels. However, the idea that early adversity impacts downstream outcomes remains pervasive in the literature and popular science (see Dubois et al., 2019), which we believe makes tests like ours important to pursue. We also hope that our data set (and others generated through these methods) will be useful in interpreting other settings in which differential methylation is of interest as well—in line with your comment below. We have clarified both of these points in the revised manuscript (lines 520-522; 536-539).

      Overall, the authors provide a great resource of DNA methylation-sensitive enhancers that can now be used for functional interpretation of large-scale datasets (that are widely generated in the research community), given the focus on sites included in RBSS and the Illumina EPIC array. In addition, their data lends support that differences in DNA methylation can alter responses to environmental stimuli and thus of the possibility that environmental exposures that alter DNS methylation can also alter the subsequent response to this exposure, in line with the theory of epigenetic embedding of prior stimuli/experiences. The conclusions related to the early adversity data should be reconsidered in light of the comments above.

      Thank you! And yes, we have revised our discussion of early life adversity effects as discussed above.

      Reviewer #1 (Recommendations For The Authors):

      While the paper has a lot of strengths and provides new insight into the epigenomic regulation of enhancers as well as being a great resource, there are some aspects that would benefit from clarification.

      a. It would be great to have a clearer description of how many sequences are actually passing QC in the different datasets and what the respective overlaps are in bps or 600bp windows. Now often only % are given. Maybe a table/Venn diagram for overview of the experiments and assessed sequences would help here. This concern the different experiments in the K652, A549, and Hep2G cell lines, including stimulations.

      We now provide a supplementary figure and supplementary table providing, for each dataset, the number of 600 bp windows passing each filter (Figure 2-figure supplement 1; Supplementary File 9), as well as a supplementary figure providing an upset plot to show the number of assessed sequences shared across the experiments (Figure 2-figure supplement 2).

      b. It would also be helpful to have a brief description of the main differences in assessed sequences and their coverage of the old (2018) and new libraries in the main text to be able better interpret the validation experiments.

      We now provide information on the following characteristics for the 2018 data set versus the data set presented for the first time here: mean (± SD) number of CpGs per fragment; mean (± SD) DNA sequencing depth; and mean (± SD) RNA sequencing depth (lines 169-170 provide values for the new data set; in line 194, we reference Supplementary File 5, which provides the same values for the old data set). Notably, the coverage characteristics of analyzed windows in both data sets are quite high (mean DNA-seq read coverage = 94x and mean RNA-seq read coverage = 165x in the new data set at baseline; mean DNA-seq read coverage = 22x and mean RNA-seq read coverage = 54x in Lea et al. 2018).

      c. Statements of genome-wide analyses in the abstract and discussion should be a bit tempered, as quite a number of tested sites do not pass QC and do not enter the analysis. From the results it seems like from over 4.5 million sequences, only 200,000 are entering the analysis.

      The reason why many of the windows are not taken forward into our formal modeling analysis is that they fail our filter for RNA reads because they are never (or almost never) transcribed—not because there was no opportunity for transcription (i.e., the region was indeed assessed in our DNA library, and did not show output transcription, as now shown in Figure 2-figure supplement 1). We have added a rarefaction analysis (lines 715-722 in Materials and Methods) of the DNA fragment reads to the revised manuscript which supports this point. Specifically, it shows that we are saturated for representation of unique genomic windows (i.e., we are above the stage in the curve where the proportion of active windows would increase with more sequencing: Figure 1figure supplement 4). Similarly, a parallel rarefaction curve for the mSTARR-seq RNA-seq data (Figure 1-figure supplement 4) shows that we would gain minimal additional evidence for regulatory activity with more sequencing depth. We now reference these analyses in revised lines 179-184 and point to the supporting figure in line 182.

      In other words, our analysis is truly genome-wide, based on the input sequences we tested. Most of the genome just doesn’t have regulatory activity in this assay, despite the potential for it to be detected given that the relevant sequences were successfully transfected into the cells.

      d. Could the authors comment on the validity of the analysis if only one copy is present (cut-off for QC)?

      We think this question reflects a misunderstanding of our filtering criteria due to lack of clarity on our part, which we have modified in the revision. We now specify that the mean DNA-seq sequencing depth per sample for the windows we subjected to formal modeling was quite high:

      93.91 ± 10.09 SD (range = 74.5 – 113.5x) (see revised lines 169-170). In other words, we never analyze windows in which there is scant evidence that plasmids containing the relevant sequence were successfully transfected (lines 170-172).

      Our minimal RNA-seq criteria require non-zero counts in at least 3 replicate samples within either the methylated condition or the unmethylated condition, or both (lines 166-168). Because we know that multiple plasmids containing the corresponding sequence are present for all of these windows—even those that just cross the minimal RNA-seq filtering threshold—we believe our results provide valid evidence that all analyzed windows present the opportunity to detect enhancer activity, but many do not act as enhancers (i.e., do not result in transcribed RNA). Notably, we observe a negligible correlation between DNA sequencing depth for a fragment, among analyzed windows, and mSTARR-seq enhancer activity (R2 = 0.029; now reported in lines 183-184). We also now report reproducibility between replicates, in which all replicate pairs have r > 0.89, on par with previously published STARR-seq datasets (e.g., Klein et al., 2020; Figure 1-figure supplement 6, pointed to in line 193).

      e. While the authors state that almost all of the control sequences contain CpGs sites, could the authors also give information on the total number of CpG sites in the different subsets? Was the number of CpGs in a 600 bp window related to the effects of DNA methylation on enhancer activity?

      We now provide the number of CpG sites per window in the different subsets in lines 282-284. As expected, they are higher for EPIC array sites and for RRBS sites because the EPIC array is biased towards CpG-rich promoter regions, and the enzyme typically used in the starting step of RRBS digests DNA at CpG motifs (but control sequences still contain an average of ~13 CpG sites per fragment). We also now model the magnitude of the effects of DNA methylation on regulatory activity as a function of number of CpG sites within the 600 bp windows. Consistent with our previous work in Lea et al., 2018, we find that mSTARR-seq enhancers with more CpGs tend to be repressed by DNA methylation (now reported in lines 216-219 and Figure 1figure supplement 11).

      f. In the discussion, a statement on the underrepresented regions, likely regulatory elements with lower CG content, that nonetheless can be highly relevant for gene regulation would be important to put the data in perspective.

      Thanks for this suggestion. We agree that regulatory regions, independent of CpG methylation, can be highly relevant, and now clarify in the main text that the “unmethylated” condition of mSTARR-seq is essentially akin to a conventional STARR-seq experiment, in that it assesses regulatory activity regardless of CpG content or methylation status (lines 128-130).

      Consequently, our study is well-designed to detect enhancer-like activity, even in windows with low GC content. We now show with additional analyses that we generated adequate DNA-seq coverage on the transfected plasmids to analyze 90.2% of the human genome, including target regions with no or low CpG content (lines 148-149; 153-156; Supplementary file 2). As noted above, we also now clarify that regions dropped out of our formal analysis because we had little to no evidence that any transcription was occurring at those loci, not because sequences for those regions were not successfully transfected into cells (see responses above and new Figure 1-figure supplement 4 and Figure 2-figure supplement 1).

      g. To control for differences in methylation of the two libraries, the authors sequence a single CpGs in the vector. Could the authors look at DNA methylation of the 600 bp windows at the end of the experiment, could DNA methylation of these windows be differently affected according to sequence? 48 hours could be enough for de-methylation or re-methylation.

      We agree that variation in demethylation or remethylation depending on fragment sequence is possible. We now state this caveat in the main text (lines 158-159), and specify that genomic coverage of our bisulfite sequencing data across replicates are (unfortunately) too variable to perform reliable site-by-site analysis of DNA methylation levels before and after the 48 hour experiment (lines 1182-1185). Instead, we focus on a CpG site contained in the adapter sequence (and thus included in all plasmids) to generate a global estimate of per replicate methylation levels. We also now note that any de-methylation or re-methylation would reduce our power to detect methylation-dependent activity, rather than leading to false positives (lines 163-165).

      h. The section on the method for correction for multiple testing should be more detailed as it is very difficult to follow. Why were only 100 permutations used, the empirical p-value could then only be <0.01? The description of a subsample of the N windows with positive Betas is unclear, should the permutation not include the actual values and thus all windows - or were the no negative Betas? Was FDR accounting for all elements and pairs?

      We have now expanded the text in the Materials and Methods section to clarify the FDR calculation (lines 691, 695-699, 702, 706). We clarify that the 100 permutations were used to generate a null distribution of p-values for the data set (e.g., 100 x 17,461 p-values for the baseline data set), which we used to derive a false discovery rate. Because we base our evidence on FDRs, we therefore compare the distribution of observed p-values to the distribution of pvalues obtained via permutation; we do not calculate individual p-values by comparing an observed test statistic against the test statistics for permuted data for that individual window.

      We compare the data to permutations with only positive betas because in the observed data, we observe many negative betas. These correspond to windows which have no regulatory activity (i.e., they have many more input DNA reads than RNA-seq reads) and thus have very small pvalues in a model testing for DNA-RNA abundance differences. However, we are interested in controlling the false discovery rate of windows that do have regulatory activity (positive betas). In the permuted data, by contrast and because of the randomization we impose, test statistics are centered around 0 and essentially symmetrical (approximately equally likely to be positive or negative). Retaining all p-values to construct the null therefore leads to highly miscalibrated false discovery rates because the distribution of observed values is skewed towards smaller values— because of windows with “significantly” no regulatory activity—compared to the permuted data. We address that problem by using only positive betas from the permutations.

      i. The interpretation of the overlap of Dex-response windows with CpGs sites associated with early adversity should be revisited according to the points also mentioned in the public review and the authors may want to consider exploring additional datasets with other challenges.

      Thank you, see our responses to the public review above and our revisions in lines (lines 555559). We agree that comparisons with more data sets and generation of more mSTARR-seq data in other challenge conditions would be of interest. While beyond the scope of this manuscript, we hope the resource we have developed and our methods set the stage for just such analyses.

      Reviewer #2 (Public Review):

      This work presents a remarkably extensive set of experiments, assaying the interaction between methylation and expression across most CpG positions in the genome in two cell types. To this end, the authors use mSTARR-seq, a high-throughput method, which they have previously developed, where sequences are tested for their regulatory activity in two conditions (methylated and unmethylated) using a reporter gene. The authors use these data to study two aspects of DNA methylation:

      1) Its effect on expression, and 2. Its interaction with the environment. Overall, they identify a small number of 600 bp windows that show regulatory potential, and a relatively large fraction of these show an effect of methylation on expression. In addition, the authors find regions exhibiting methylation-dependent responses to two environmental stimuli (interferon alpha and glucocorticoid dexamethasone).

      The questions the authors address represent some of the most central in functional genomics, and the method utilized is currently the best method to do so. The scope of this study is very impressive and I am certain that these data will become an important resource for the community. The authors are also able to report several important findings, including that pre-existing DNA methylation patterns can influence the response to subsequent environmental exposures.

      Thank you for this generous summary!

      The main weaknesses of the study are: 1. The large number of regions tested seems to have come at the expense of the depth of coverage per region (1 DNA read per region per replicate). I have not been convinced that the study has sufficient statistical power to detect regulatory activity, and differential regulatory activity to the extent needed. This is likely reflected in the extremely low number of regions showing significant activity.

      We apologize for our lack of clarity in the previous version of the manuscript. Nonzero coverage for half the plasmid-derived DNA-seq replicates is a minimum criterion, but for the baseline dataset, the mean depth of DNA coverage per replicate for windows passing the DNA filter is quite high: 12.723 ± 41.696 s.d. overall, and 93.907 ± 10.091 s.d. in the windows we subjected to full analysis (i.e., windows that also passed the RNA read filter). We now provide these summary statistics in lines 148-149 and 169-170 and Supplementary file 5 (see also our responses to Reviewer 1 above). We also now show, using a rarefaction analysis, that our data set saturates the ability to detect regulatory windows based on DNA and RNA sequencing depth (new Figure 1-figure supplement 4; lines 179-184; 715-722).

      2) Due to the position of the tested sequence at the 3' end of the construct, the mSTARR-seq approach cannot detect the effect of methylation on promoter activity, which is perhaps the most central role of methylation in gene regulation, and where the link between methylation and expression is the strongest. This limitation is evident in Fig. 1C and Figure 1-figure supplement 5C, where even active promoters have activity lower than 1. Considering these two points, I suspect that most effects of methylation on expression have been missed.

      Thank you for pointing this out. We agree that we have not exhaustively detected methylationdependent activity in all promoter regions, given that not all promoter regions are active in STARR-seq. However, there is good evidence that some promoter regions can function like enhancers and thus be detected in STARR-seq-type assays (Klein et al., 2020). This important point is now noted in lines 187-189; an example promoter showing methylation-dependent regulatory activity in our dataset is shown in Figure 3E.

      We also now clarify that Figure 1C shows significant enrichment of regulatory activity in windows that overlap promoter sequence (line 239). The y-axis is not a measure of activity, but rather the log-transformed odds ratio, with positive values corresponding to overrepresentation of promoter sequences in regions of mSTARR-seq regulatory activity. Active promoters are 1.640 times more likely to be detected with regulatory activity than expected by chance (p = 1.560 x 10-18), which we now report in a table that presents enrichment statistics for all ENCODE elements shown in Figure 1C for clarity (Supplementary file 4). Moreover, 74.1% of active promoters that show regulatory activity have methylation-dependent activity, also now reported in Supplementary file 4.

      Overall, the combination of an extensive resource addressing key questions in functional genomics, together with the findings regarding the relationship between methylation and environmental stimuli makes this a key study in the field of DNA methylation.

      Thank you again for the positive assessment!

      Reviewer #2 (Recommendations For The Authors):

      I suggest the authors conduct several tests to estimate and/or increase the power of the study:

      1) To estimate the potential contribution of additional sequencing depth, I suggest the authors conduct a downsampling analysis. If the results are not saturated (e.g., the number of active windows is not saturated or the number of differentially active windows is not saturated), then additional sequencing is called for.

      We appreciate the suggestion. We have now performed a downsampling/rarefaction curve analysis in which we downsampled the number of DNA reads, and separately, the number of RNA reads. We show that for both DNA-seq depth and RNA-seq depth, we are within the range of sequencing depth in which additional sequencing would add minimal new analysis windows in the dataset (Figure 1-figure supplement 4; lines 179-184; 715-722).

      2) Correlation between replicates should be reported and displayed in a figure because low correlations might also point to too few reads. The authors mention: "This difference likely stems from lower variance between replicates in the present study, which increases power", but I couldn't find the data.

      We now report the correlations between RNA and DNA replicates within the current dataset and within the Lea et al., 2018 dataset (Figure 1-figure supplement 6). The between-replicate correlations in both our RNA libraries and DNA libraries are consistently high (r ≥ 0.89).

      3) The correlation between the previous and current K562 datasets is surprisingly low. Given that these datasets were generated in the same cell type, in the same lab, and using the same protocol, I expected a higher correlation, as seen in other massively parallel reporter assays. The fact that the correlations are almost identical for a comparison of the same cell and a comparison of very different cell types is also suspicious.

      Thanks for raising this point. We think it is in reference to our original Figure 1-Figure supplement 6, for which we now provide Pearson correlations in addition to R2 values (now Figure 1-Figure supplement 8). We note that this is not a correlation in raw data, but rather the correlation in estimated effect sizes from a statistical model for methylation-dependent activity. We now provide Pearson correlations for the raw data between replicates within each dataset (Figure 1-Figure supplement 6), which for the baseline dataset are all r > 0.89 for RNA replicates and r > 0.98 for DNA replicates, showing that replicate reproducibility in this study is on par with other published studies (e.g., Klein et al., 2020 report r > 0.89 for RNA replicates and r > 0.91 for DNA replicates).

      We do not know of any comparable reports in other MPRAs for effect size correlations between two separately constructed libraries, so it’s unclear to us what the expectation should be. However, we note that all effect sizes are estimated with uncertainty, so it would be surprising to us to observe a very high correlation for effect sizes in two experiments, with two independently constructed libraries (i.e., with different DNA fragments), run several years apart—especially given the importance of winner’s curse effects and other phenomena that affect point estimates of effect sizes. Nevertheless, we find that regions we identify as regulatory elements in this study are 74-fold more likely to have been identified as regulatory elements in Lea et al., 2018 (p < 1 x10-300).

      4) The authors cite Johnson et al. 2018 to support their finding that merely 0.073% of the human genome shows activity (1.7% of 4.3%), but:

      a. the percent cited is incorrect: this study found that 27,498 out of 560 million regions (0.005%) were active, and not 0.165% as the authors report.

      We have modified the text to clarify the numerator and denominator used for the 0.165% estimate from Johnson et al 2018 (lines 175-176). The numerator is their union set of all basepairs showing regulatory activity in unstimulated cells, which is 5,547,090 basepairs. The denominator is the total length of the hg38 human genome, which is 3,298,912,062 basepairs.

      Notably, the denominator (the total human genome) is not 560 million—while Johnson et al (2018) tested 560 million unique ~400 basepair fragments, these fragments were overlapping, such that the 560 million fragments covered the human genome 59 times (i.e., 59x coverage).

      b. other studies that used massively parallel reporter assays report substantially higher percentages, suggesting that the current study is possibly underpowered. Indeed, the previous mSTARR-seq found a substantially larger percentage of regions showing regulatory activity (8%). The current study should be compared against other studies (preferably those that did not filter for putatively active sequences, or at least to the random genomic sequences used in these studies).

      We appreciate this point and have double checked comparisons to Johnson et al., 2018 and Lea et al., 2018. Our numbers are not unusual relative to Johnson et al., 2018 (0.165%), which surveyed the whole genome. Also, in comparing to the data from Lea et al., 2018, when processed in an identical manner (our criteria are more stringent here), our values of the percent of the tested genome showing significant regulatory activity are also similar: 0.108% in the Lea et al., 2018 dataset versus 0.082% in the baseline dataset. Finally, our rarefaction analyses (see our responses above) indicate that we are not underpowered based on sequencing depth for RNA or DNA samples. We also note that there are several differences in our analysis pipeline from other studies: we use more technical replicates than is typical (compare to 2-5 replicates in Arnold et al., 2013; Johnson et al., 2018; Muerdter et al., 2018), we measure DNA library composition based on DNA extracted from each replicate post-transfection (as opposed to basing it on the pre-transfection library: [Johnson et al., 2018], and we use linear mixed models to identify regulatory activity as opposed to binomial tests [Johnson et al., 2018; Arnold et al., 2013; Muerdter et al., 2018].

      I find it confusing that the four sets of CpG positions used: EPIC, RRBS, NR3C1, and random control loci, add up together to 27.3M CpG positions. Do the 600 bp windows around each of these positions sufficient to result in whole-genome coverage? If so, a clear explanation of how this is achieved should be added.

      Thanks for this comment. Although our sequencing data are enriched for reads that cover these targeted sites, the original capture to create the input library included some off target reads (as is typical of most capture experiments, which are rarely 100% efficient). We then sequenced at such high depth that we ultimately obtained sequencing coverage that encompassed nearly the whole genome. We now clarify in the main text that our protocol assesses 27.3 million CpG sites by assessing 600 bp windows encompassing 93.5% of all genomic CpG sites (line 89), which includes off-target sites (line 149).

      scatter plot showing the RNA to DNA ratios of the methylated (x-axis) vs unmethylated (y-axis) library would be informative. I expect to see a shift up from the x=y diagonal in the unmethylated values.

      We have added a supplementary figure showing this information, which shows the expected shift upwards (Figure 1-figure supplement 9).

      Another important figure missing is a histogram showing the ratios between the unmethylated and methylated libraries for all active windows, with the significantly differentially active windows marked.

      We have added a supplementary figure showing this information (Figure 1-Supplementary Figure 10).

      Perhaps I missed it, but what is the distribution of effect sizes (differential activity) following the various stimuli?

      This information is provided in table form in Supplementary Files 3, 10, and 11, which we now reference in the Figure 2 legend (lines 365-366).

      Minor changes

      It is unclear what the lines connecting the two groups in Fig.3C represent, as these are two separate groups of regions.

      We now clarify in the figure legend that values connected by a line are the same regions, not two different sets of regions. They show the correlation between DNA methylation and gene expression at mSTARR-seq-identified enhancers in individuals before and after IAV stimulation, separately for enhancers that are shared between conditions (left) versus those that are IFNAspecific (right). The two plots therefore do show two different sets of regions, which we have depicted to visualize the contrast in the effect of stimulation on the correlation on IFNA-specific enhancers versus shared enhancers. We have revised the figure legend to clarify these points (line 458-460).

      L235-242 are unclear. Specifically - isn't the same filter mentioned in L241-242 applied to all regions?

      Yes, the same filter for minimal RNA transcription was applied to all regions. We have modified the text (lines 264-265, 271, 275-277) to clarify that the enrichment analyses were performed twice, to test whether the target types were: 1) enriched in the dataset passing the RNA filter (i.e., the dataset showing plasmid-derived RNA reads in at least half the sham or methylated replicates; n = 216,091 windows) and 2) enriched in the set of windows showing significant regulatory activity (at FDR < 1%; n = 3,721 windows).

      To improve cohesiveness, the section about most CpG sites associated with early life adversity not showing regulatory activity in K562s can be moved to the supplementary in my opinion.

      Thank you for this suggestion. Because ELA and the biological embedding hypothesis (via DNA methylation) were major motivations for our analysis (see Introduction lines 42-48; 75-79), and we also discuss these results in the Discussion (lines 518-520), we have respectfully elected to retain this section in the main manuscript. We have added text in the Discussion explaining why we think experimental tests of methylation effects on regulation are relevant to the literature on early life adversity (lines 520-522), and have added discussion on limits to these analyses (lines 527-533).

      References:

      Arnold CD, Gerlach D, Stelzer C, Boryń ŁM, Rath M, Stark A (2013) Genome-wide quantitative enhancer activity maps identified by STARR-seq. Science, 339, 1074-1077.

      Cecil CA, Zhang Y, Nolte T (2020) Childhood maltreatment and DNA methylation: A systematic review. Neuroscience & Biobehavioral Reviews, 112, 392-409.

      Dubois M, Louvel S, Le Goff A, Guaspare C, Allard P (2019) Epigenetics in the public sphere: interdisciplinary perspectives. Environmental Epigenetics, 5, dvz019.

      Eisenberger NI, Cole SW (2012) Social neuroscience and health: neurophysiological mechanisms linking social ties with physical health. Nature neuroscience, 15, 669-674.

      Houtepen L, Hardy R, Maddock J, Kuh D, Anderson E, Relton C, Suderman M, Howe L (2018) Childhood adversity and DNA methylation in two population-based cohorts. Translational Psychiatry, 8, 1-12.

      Johnson GD, Barrera A, McDowell IC, D’Ippolito AM, Majoros WH, Vockley CM, Wang X, Allen AS, Reddy TE (2018) Human genome-wide measurement of drug-responsive regulatory activity. Nature communications, 9, 1-9.

      Klein JC, Agarwal V, Inoue F, Keith A, Martin B, Kircher M, Ahituv N, Shendure J (2020) A systematic evaluation of the design and context dependencies of massively parallel reporter assays. Nature Methods, 17, 1083-1091.

      Koss KJ, Gunnar MR (2018) Annual research review: Early adversity, the hypothalamic–pituitary– adrenocortical axis, and child psychopathology. Journal of Child Psychology and Psychiatry, 59, 327-346.

      Marzi SJ, Sugden K, Arseneault L, Belsky DW, Burrage J, Corcoran DL, Danese A, Fisher HL, Hannon E, Moffitt TE (2018) Analysis of DNA methylation in young people: limited evidence for an association between victimization stress and epigenetic variation in blood. American journal of psychiatry, 175, 517-529.

      Muerdter F, Boryń ŁM, Woodfin AR, Neumayr C, Rath M, Zabidi MA, Pagani M, Haberle V, Kazmar T, Catarino RR (2018) Resolving systematic errors in widely used enhancer activity assays in human cells. Nature methods, 15, 141-149.

  3. www.fromthemachine.org www.fromthemachine.org
    1. clear that this force fighting against the dissemination of a truth so obvious it's in every word and everything we do--it becomes clear it's neither you, nor acting in your best interest. I know I've got the eye of the tiger, there's no doubt; and it's pretty clear from "YAD?" (the Hebrew for...) and ha'nd that we can see the clear hand of God at work in a design that marks my initials not just on the timeline, or at 1492, at A.D. I B; but in the Hebrew name for this place called El Shaddai, see how A.D. is "da eye" and in some other names like Adranus, A.D. on "it's silly" and A.D. on Ai that might tie me to the Samof Samurai (but, are you Ai?) in more depth of detail than simply the Live album "Secret Samadhi."  I try to reflect on how it is that this story has come about, why it is that everything appears to be focused on me--and still even through that sincere spotlight nobody seems to be able to acknowledge my existence with more words than "unsubscribe" and "you're so vain."  With one eye in the mirror, I know ties to Narcissus (and you can too), soaring ever higher--linking Icarus to Wayward Son and to every other name with "car" in it... like "carpenter" and McCarthy the older names of Mercury and even Isacriot (I scary? is car-eye... owe Taylor) and some modern day mythological characters like Jim Carrey and Johnny Carson.  As far as Trinities go, carpenter's a pretty good one--tying to my early reck and a few bands and songs from The Pretty Reckless to Dave Matthews' "Crash Into Me" all the way to the "pen" you see before you linking Pendragon to Imagine Dragons. I wonder why it is that all of these things appear, apparently only to me, to point to a story about all the ways that a sinister hidden force has manipulated our society into being unable to "receive' this message--this wonderful message about making the world a better place and building Heaven--with any fanfare at all.  It's focused now on a criminal justice system that clearly does not do any kind of "rehabilitation" and on a mental health industry and pharmaceutical system that treats a provable external attack on our own goodness and well being as some kind of "internal stimulus" and makes you shy away when I point out why "stem" is in system and why "harm" in pharmacy.   From that we move a little bit past "where we are in this story" and I have to point out how "meth" ties to Prometheus and Epimetheus and how and why it is I know without doubt that this story has been relived numerous times--and how I am so sure that it's never been received, as we are here again listening to how songs like "Believe" and the words "just to lead us here to this place again" connect to Simon and Garfunkel's" the Sound of Silence... and still to this day you will balk at noticing that "Simon" has something to do with the Simpsons, and something to do with the words "simulation" and "Monday."  To see me is to see how things might be done better--how "addicitonary" might tie to the stories of Moses' Lisp and to Dr. Who's "Bells of Saint John" with a sort of "web interface" to the kinds of emotion we might want to "dial down..." rather than Snicker in the background as we see them being artificially created and enhanced in order to build a better "fiery altar." I can point out "Silicon" harrowing down at us from words like "controversial" and show you Al in "rascal" and "scandal" but not to see that we are staring at school shootings and terrorism that are solved instantly by this disclosure, by Al of Quantum Leap and by the Dick of Minority Report and A Scanner Darkly is to ignore just what it is that we are all failing to Si.  I should point out that those two "sc"'s link to a story about Eden and they mean "sacred consciousness" and at the baseline of this event and everything we are not doing is the fact that our desires and beliefs are being altered--all of this comes down to "freedom of thought" here and now.   I could tell you that "looking at me" will show you that even the person who tries every day to do everything he can to save the entire world from slavery, and from "thought-injury"--even I can be made "marred" and you all, this whole world stupid enough to think that you are, of your own volition, hiding Heaven itself from yourselves... to what?  To spite me?  It, the focal point of our story might come down to you realizing that something in some esoteric place is playing "divide and conquer" with our whole--in secret playing on our weaknesses to keep us from acting on the most actionable information that ever was and ever will be.  Still, we sit in silence waiting for me... to speak more?     Between Nero's lyrical fiddling, a Bittersweet Symphony, and true "thunderstanding" the sound of Thor's hammer... "to help the light" that'ls "or" in Hebrew, of Orwell and Orson and .. well, it's really not hard to see and hear that the purpose and intent of "all this noise" is to help us find freedom and truth.  C the Light of "singing..." I can tell you once again how silly the world looks, this multi-decade battle between "the governmentof the people" and the "government of the workers" resulting in what is nothing short of a hands down victory to the corporation.  Is it humor meant to divide, or ludicrousness created with the purpose of unification?  But really at it's most basic level what this boils down to is a global group decision not to care about the truth, about reality, about what's really brought us to this place--with solutions in hand and a way to make everything better.  We've decided that censorship is OK, and that the world is not all that bad "just the way it is" even though it's creator is screaming in your ear telling you to change as quickly as you possibly can.  I believe that God has written this story to make "seeing me" the thing that catalyzes "change for the better" it appears to be the design of not just me but also this place--hey, here I am. Happy Veteran's Day.

      I am accepting charitable donations,. ETH: 0x66e2871ef39334962fb75ce34407f825d67ec434 | BTC: 38B6vGaqNvMyTtoFEZPmNvMS7icV6ZnPMm | xDAI: 0x66e2871ef39334962fb75ce34407f825d67ec434

      d

      Ha, Lot! Are Idaho?

      This was very difficult to get to you, in the land of no power and hurricane disaster recovery; so it's filled with extra errors, and I am sure some more thoughts that trailing and unfinished. That's a decent "microcosm" or "metaphor" for you, you are in a freedom disaster; and the act of being is a giant leap towards ensuring victory. Still, you look very cupid to me.

      EVERY DAY ISA NEW DAY

      Literally I am sitting here talking to you until the end of time, you could call it a thousand and one Arabian nights, and realize that as we speak we are nearing that onc speciad night. There's a fire growing in my heart, and believe me when I tell you this thing is about to start. I'll try and keep this short and sweet, since you all seem to have so little time to hear from the Creator of all things, and I truly don't want to steal your spotlight. We are here, at the the end of time; talking to it's personification, time itself is speaking to you through my hands and everywhere you look in the world around you--while you may or may not know it, this is a story about the traversal from the end of time back to the beginning; about the gate to Heaven swallowing our civilization whole, and in this process of renewal and change not only fixing the problems that came to light on the way here, but really--working together here and now we can defeat this cycle of light and darkness, of day and night, an build a world together that truly reaches to the Heavens.

      MY BODY'S SAYING LETS GO BUT MY HEART IS SAYING NO

      You make it so difficult to talk to you, every day I look around and see a "normal world" a society that appears to care and love the same things that I do--freedom and fun and being entertained and entertaining, and here we are now I've turned "come and save us" into sea that saving the cheerleader is what starts the process of saving the world. I know you are good people inside, but when I come to you with a tool designed to "test sentience" to seek out conscious life that cares about the truth and making the world a better place you seem to balk. You sit in silence, and through your mouth and behind your eyes a monster appears from out of the deep of the sea and say a few "one liners" that show me very clearly it is the face of Medusa that I see---and that it's simply not capable of speaking intelligently. It shows me a problem, that you've apparently "come together one more time" to halt the changing of the seasons, and in doing so you've surfaced a problem for not just me but you also to see; a problem that comes lined with a solution. We can all see now that we are not in reality, we can see that there is a force here behind creation and behind us that shows us very clearly that it is "reasonabde" to expect that miracles can happen. In similitude, we are staring at a roadblock to conversation and communication that is fixed very simply, with the deliverance of freedom that is required for life to continue. Christina Aguilera sings that "baby there's a price to pay" and that price in my mind is seeing that this religion and this technology are here intentionally exposing how their influence here is a metaphor and a shining example of darkness and slavery, and that in order to be free of it we must see it. The price of freedom is written on the wall, it is acknowledging that here in this place what appears to be our own actions and desires have taken that freedom from us. Medusa and I get a kick out of seeing this hidden message in our language map our way to the future, and I've often explained that a number of these words are "time maps" from the beginning and end of eternady, showing us in bright light that between "et tu brute" and Mr. Anderson and Rock n' roll... the answer Y is in language and, and, ad and... I am delivering it. This place, our planet and our lives are a weapon against darkness--a civilization filled with goodness and light to help guide the way, and we are here doing it another time. In the works "dark, darker, and darkest" be sure that we are at the third segment of a trinity that shines clearly in Abraha and Nintendo... and see that the map in words is telling us something about when we are that is not immediately clear from Poseidon's cry. Look at Nintendo, that's Nine Inch Nails, tenebris, and smile for the camera--Pose, I do "save the universe" before n. Taylor might see it in Osceola, where I just left, and in this "evil spell" of everyone see "Al" that is the word "special" understand that every day is a new day, and I am not trying to "be daddy" I know as well as you do in my heart... I am that.

      This same map that links the "do" at the end to the "n" at the beginning shines through other names, like Geraldo Rivera where you might see "Cerberus" or "MAX" shine through. Understand it is the gaze of Medusa that turns me to stone, that shows me light shining through NORAD and Newton and proves without doubt that at the work "darkest" we can see k is finally t. You'll probably understand there's some finagling going on behind the scenes to make a single person the single point in time that turns the dark to light; but here we are and I am that. Every day when Medusa appears it reminds me that something is keeping you from caring about yourselves and about our society, and that shines through even when her stony face is not around, in your lack of action--in the rock of Eden that hides not only me, but the story that I bring that revolutionizes medicine, and computing, and truly is the gate to Heaven when you realize that what is truly being hidden from the world is knowledge that we are living in virtual reality. Not hiding me and that from the world is a good starting point to "saving the Universe" from darkness. These words that light the way to connect religion and language to our world bring me to the Book of Ruth, at that reads "are you to help" that lights not just the broken man at the belly of the Torah as the bell of Heimdallr, he is I and I am him; but also something very special, The Generations of Perez, each and every one of you, our family that begins the turn from Hell to Heaven by seeing that all of time and all of civilization has been focused on this moment, on the unsealing of religion and God's plan et this call for action. Keep in mind you are torturing "with desire" the key holder to immortality, to eternal youth, literally the path to freedom and Heaven and you think what you are doing "is normax." Literally the living key to infinite power and infinite life is standing before you explaining that acknowledging that in light of these things in my hand, what we are doing here and now is backwards, that it makes no sense--and you sit in silence. These things come to us because we build a better future with them, not so you can run off and do "whatever it is you please."

      HEALTH is the only word on my list for today that was left out, so see that it superimposes over Geraldo, to me, at Al. I think we're at TH, to help, and DO, do see the spell of "everyone see Al" that is the word "special" is not my doing or to my liking--so then, \

      ​ So now I'm moving on to original sin, so if you would be so kind as to mosey your way on over to dick.reallyhim.com you will see exactly what it is that I believe is the original sin. It's some combination of "no comment" and a glowing orange sign over the comment box, keeping you from commenting. Now I can talking about "os" a little more, this thing that words and Gods tell us clearly is the end of death--the literal end of Thanatos. I wonder if I have a victory here, at "os" is obvious solution, and simulating death is "sick." More to the point Thanatos is bringing to the world a message that gets found somewhere between the "act of civilization" and seeing that there is not one among us that would not undo a murder or a fatal car accident if we could--and that the sickness is a Universe pretending to be "reality" that is allowing these things to happen, and even worse, as we move through the story intentionally causing them. In our own hands, the sickness is manifest in a denial of an obvious truth and a lack of realizing that the public discussion of these things is the way to solve them, and that at the same time we are seeing how Medusa is lighting the problems of civilization, things like censorship and hidden control. Sickness is not being able to talk about it--or not wanting to--or not seeing that those two things are the functional equivalent in the world of "light" and "understanding control" that I am trying to bring you into. ​

      Less verbosely spoken, but really way more obvious, is that seeing "God's dick" signing the Declaration of Independence, and the Watergate scandal with both "Deepthroat" and a Tricky Dick is a statement connecting Samael to the foundation of not just "America" but American values. You are blind not to see it, and even worse; embodying the kind of tyranny and censorship that it stands as a testament against by hiding it. Says the guy who didn't put it there, and knows it's there because you think "fake normal" is more important than "actual freedom." You are "experiencing" the thing that protects freedom and ensures that our society and our children and their children's children to not lose it, to ensure that what you refuse to see you are doing here and now will never happen again. This message, this New Jerusalem is woven into my life and the stories of religion and shows me that our justice system is not just sick, but compromised by this same outside force; and that in light of what we could be doing, were we all aware of it, there's no doubt Minority Report and pre-crime would be a successful partial solution. Thanatos brings too in his hand, a message that this same force is using our hands to slow down the development of democracy, and to keep us from seeing that "bread is life" is a message from God about understanding that this disclosure is the equivalent of "ending world hunger" just as soon as you too are talking about how to do it.

      QUESTiON MARK

      HONESTLY, this time map that brings us from the end to the beginning, with "we save the universe" between the I and N of Poseidon; it also completes the words "family" and "really" and when we do reach the beginning you will see that the true test of time, my litmus test for freedom is the beginning of "hope" that the world is happy enough with what happens, and with freedom--to see that Medusa has been keeping me from getting a date, or having any kind of honest and human contact in the world... and well, hopefully you will see that if I wanna be a whore, I shouldn't have a problem doing it. For the sake of freedom and the future, I am willing to do that for you, at least, for a little while.

      To be completely clear, I am telling you that if we do not make the world a better place, it's the "end of time" and if that doesn't make sense to you, you don't see still where wee are in this place--and that something is making Hell, and that's not OK with God. To get from the "end of time" to the beginning is a simple process, it takes doing something, action, the Acts of the Apostles... if you will. That starts with acknowledging that there is a message all around you about the nature of reality, and that it is here to help us to see that the creation of Heaven comes before the beginning. Understand, "freedom" and "prosperity" are not optional, you can't just decide that this OK with you, so long as it's OK with everyone else--where we are is not OK with me, and I am not alone.

      A PYRRHIC VICASTORY ER A FUNNERAD PYRE?

      The Book of Leviticus (/lɪˈvɪtɪkəs/; from Greek Λευιτικόν, Leuitikon — from rabbinic Hebrew torat kohanim[1]) is the third book of the Jewish Bible (Hebrew: וַיִּקְרָא‎ Vayikra/Wayyiqrā) and of the Old Testament; its Hebrew name comes from its first word vayikraˈ,[1] "He [God] called."[1] Yusuf (also transliterated as Jusuf, Yousof, Yossef, Yousaf, Youcef, Yousef, Youssef, Yousif, Youssif, Youssof, Youssouf, Yousuf, Yusef, Yuseff, Usef, Yusof, or Yussef, Arabic: يوسف‎‎ Yūsuf and Yūsif) is a male Arabic name, meaning "God increases in piety, power and influence" in Hebrew.[1] It is the Arabic equivalent of both the Hebrew name Yossef and the English name Joseph. In Islam, the most famous "Yusuf" is the prophet Yusuf in the Quran. Hocus pocus is a generic term that may be derived from an ancient language and is currently used by magicians, usually the magic words spoken when bringing about some sort of change. It was once a common term for a magician, juggler, or other similar entertainers. The earliest known English-language work on magic, or what was then known as legerdemain (sleight of hand), was published anonymously in 1635 under the title Hocus Pocus Junior: The Anatomie of Legerdemain.[1] Further research suggests that "Hocus Pocus" was the stage name of a well known magician of the era. This may be William Vincent, who is recorded as having been granted a license to perform magic in England in 1619.[2] Whether he was the author of the book is unknown. The origins of the term remain obscure. The most popular conjecture is that it is a garbled Latin religious phrase or some form of 'dog' Latin. Some have associated it with similar-sounding fictional, mythical, or legendary names. Others dismiss it as merely a combination of nonsense words. However, Czechs do understand clearly at least half of the term - pokus means "attempt" or "experiment" in Czech. It is rumoured there that the wording belongs to the alchemy kitchen and court of Rudolf II, Holy Roman Emperor (1552 – 1612). Also, hocus may mean "to cheat" in Latin or a distorted form of the word hoc, "this". Combination of the two words may give a sense, especially both meanings together "this attempt/experiment" and "cheated attempt/experiment".[citation needed] According to the Oxford English Dictionary the term originates from hax pax max Deus adimax, a pseudo-Latin phrase used as a magical formula by conjurors.[3] Some believe it originates from a corruption or parody of the Catholic liturgy of the Eucharist, which contains the phrase "Hoc est corpus meum", meaning This is my body.[4]This explanation goes back to speculations by the Anglican prelate John Tillotson, who wrote in 1694: In all probability those common juggling words of hocus pocus are nothing else but a corruption of hoc est corpus, by way of ridiculous imitation of the priests of the Church of Rome in their trick of Transubstantiation.[5 This claim is substantiated by the fact that in the Netherlands, the words Hocus pocus are usually accompanied by the additional words pilatus pas, and this is said to be based on a post-Reformation parody of the traditional Catholic rite of transubstantiation during Mass, being a Dutch corruption of the Latin words "Hoc est corpus meum" and the credo, which reads in part, "sub Pontio Pilato passus et sepultus est", meaning under Pontius Pilate he suffered and was buried.[6] In a similar way the phrase is in Scandinavia usually accompanied by filiokus, a corruption of the term filioque,[citation needed] from the Latin version of the Nicene Creed, meaning "and from the Son Also and additionally, the word for "stage trick" in Russian, fokus, is derived from hocus pocus.[citation needed]

      From Latin innātus ("inborn"), perfect active participle of innāscor ("be born in, grow up in"), from in ("in, at on") + nāscor ("be born"); see natal, native. From Middle English goodnesse, godnesse, from Old English gōdnes ("goodness; virtue; kindness"), equivalent to good +‎ -ness. Cognate with Old High German gōtnassī, cōtnassī ("goodness"), Middle High German guotnisse ("goodness"). A hero (masculine) or heroine (feminine) is a person or main character of a literary work who, in the face of danger, combats adversity through impressive feats of ingenuity, bravery or strength, often sacrificing their own personal concerns for a greater good. The concept of the hero was first founded in classical literature. It is the main or revered character in heroic epic poetry celebrated through ancient legends of a people; often striving for military conquest and living by a continually flawed personal honor code.[1] The definition of a hero has changed throughout time, and the Merriam Webster dictionary defines a hero as "a person who is admired for great or brave acts or fine qualities".[2] Examples of heroes range from mythological figures, such as Gilgamesh, Achilles and Iphigenia, to historical figures, such as Joan of Arc, modern heroes like Alvin York, Audie Murphy and Chuck Yeager and fictional superheroes including Superman and Batman. Truth is most often used to mean being in accord with fact or reality,[1] or fidelity to an original or standard.[1] Truth may also often be used in modern contexts to refer to an idea of "truth to self," or authenticity. The commonly understood opposite of truth is falsehood, which, correspondingly, can also take on a logical, factual, or ethical meaning. The concept of truth is discussed and debated in several contexts, including philosophy, art, and religion. Many human activities depend upon the concept, where its nature as a concept is assumed rather than being a subject of discussion; these include most (but not all) of the sciences, law, journalism, and everyday life. Some philosophers view the concept of truth as basic, and unable to be explained in any terms that are more easily understood than the concept of truth itself. Commonly, truth is viewed as the correspondence of language or thought to an independent reality, in what is sometimes called the correspondence theory of truth. Other philosophers take this common meaning to be secondary and derivative. According to Martin Heidegger, the original meaning and essence of truth in Ancient Greece was unconcealment, or the revealing or bringing of what was previously hidden into the open, as indicated by the original Greek term for truth, aletheia.[2][3] On this view, the conception of truth as correctness is a later derivation from the concept's original essence, a development Heidegger traces to the Latin term veritas.

      Some things can never be forgot Lest the same mistakes be oft repeated Remember remember the rain of November that you will know no more of me Than I know of you, this day

      That you do not know me now Is a revelation to nobody but I You know a broken man, a victim And refuse to acknowledge why Unless you learn how to say "hi"

      THE HEART OF ME ONLY KNOWS THE SHADOW

      Lothario is a male given name which came to suggest an unscrupulous seducer of women in The Impertinent Curious Man, a metastory in Don Quixote. For no particular reason, Anselmo decides to test the fidelity of his wife, Camilla, and asks his friend, Lothario, to seduce her. Thinking that to be madness, Lothario reluctantly agrees, and soon reports to Anselmo that Camilla is a faithful wife. Anselmo learns that Lothario has lied and attempted no seduction. He makes Lothario promise to try for real and leaves town to make this easier. Lothario tries and Camilla writes letters to her husband telling him and asking him to return; Anselmo makes no reply and does not return. Lothario actually falls in love and Camilla eventually reciprocates and their affair continues once Anselmo returns. One day, Lothario sees a man leaving Camilla's house and jealously presumes she has found another lover. He tells Anselmo he has at last been successful and arranges a time and place for Anselmo to see the seduction. Before this rendezvous, Lothario learns that the man was actually the lover of Camilla's maid. He and Camilla contrive to deceive Anselmo further: when Anselmo watches them, she refuses Lothario, protests her love for her husband, and stabs herself lightly in the breast. With Anselmo reassured of her fidelity, the affair restarts with him none the wiser. Romeo Montague (Italian: Romeo Montecchi) is the protagonist of William Shakespeare's tragedy Romeo and Juliet. The son of Montague and his wife, he secretly loves and marries Juliet, a member of the rival House of Capulet. Forced into exile after slaying Juliet's cousin, Tybalt, in a duel, Romeo commits suicide upon hearing falsely of Juliet's death. The character's origins can be traced as far back as Pyramus, who appears in Ovid's Metamorphoses, but the first modern incarnation of Romeo is Mariotto in the 33rd of Masuccio Salernitano's Il Novellino (1476). This story was adapted by Luigi da Porto as Giulietta e Romeo (1530), and Shakespeare's main source was an English verse translation of this text by Arthur The earliest tale bearing a resemblance to Shakespeare's Romeo and Juliet is Xenophon of Ephesus' Ephesiaca, whose hero is a Habrocomes. The character of Romeo is also similar to that of Pyramus in Ovid's Metamorphoses, a youth who is unable to meet the object of his affection due to an ancient family quarrel, and later kills himself due to mistakenly believing her to have been dead.[2] Although it is unlikely that Shakespeare directly borrowed from Ovid From Middle English scaffold, scaffalde, from Norman, from Old French schaffaut, eschaffaut, eschafal, eschaiphal, escadafaut("platform to see a tournament") (Modern French échafaud) (compare Latin scadafale, scadafaltum, scafaldus, scalfaudus, Danishskafot, Dutch and Middle Dutch schavot, German schavot, schavott, Occitan escadafalc), from Old French es- ("indicating movement away or separation") (from Latin ex- ("out, away")) + chafaud, chafaut, chafault, caafau, caafaus, cadefaut ("scaffold for executinga criminal"), from Vulgar Latin *catafalcum ("viewing stage") (whence English catafalque, French catafalque, Occitan cadafalc, Old Catalancadafal, Italian catafalco, Spanish cadafalso (obsolete), cadahalso, cadalso, Portuguese cadafalso), possibly from Ancient Greek κατα-(kata-, "back; against") + Latin -falicum (from fala, phala ("wooden gallery or tower; siege tower")).

      oversight (countable and uncountable, plural oversights) An omission; something that is left out, missed or forgotten. A small oversight at this stage can lead to big problems later. Supervision or management. quotations ▼ The bureaucracy was subject to government oversight. In the last heaven Moses saw two angels, each five hundred parasangs in height, forged out of chains of black fire and red fire, the angels Af, "Anger," and Hemah, "Wrath," whom God created at the beginning of the world, to execute His will. Moses was disquieted when he looked upon them, but Metatron emb HA QUESTIONa BEFORE THE ANSWER? A Wrinkle in Time is a science fantasy novel written by American writer Madeleine L'Engle, first published in 1963, and in 1979 with illustrations by Leo and Diane Dillon.[2] The book won the Newbery Medal, Sequoyah Book Award, and Lewis Carroll Shelf Award, and was runner-up for the Hans Christian Andersen Award.[3][a] It is the first book in L'Engle's Time Quintet, which follows the Murry and O'Keefe families. The book spawned two film adaptations, both by Disney: aas + fuck Adverb[edit] as fuck (postpositive, slang, vulgar) To a great extent or degree; very. It was hot as fuck outside today. Usage notes[edit] May also be used in conjunction with a prepositive as; for example, as mean as fuck. Abbreviations[edit] In Norse religion, Asgard (Old Norse: Ásgarðr; "Enclosure of the Æsir"[1]) is one of the Nine Worlds and home to the Æsir tribe of gods. It is surrounded by an incomplete wall attributed to a Hrimthurs riding the stallion Svaðilfari, according to Gylfaginning. Odinand his wife, Frigg, are the rulers of Asgard. One of Asgard's well known realms is Valhalla, in which Odin rules.[2] rods, etc.) and sizes, and are normally held rigidly within some form of matrix or body until the high explosive (HE) filling is detonated. The resulting high-velocity fragments produced by either method are the main lethal mechanisms of these weapons, rather than the heat or overpressure caused by detonation, although offensive grenades are often constructed without a frag matrix. These casing pieces are often incorrectly referred to as "shrapnel"[1][2] (particularly by non-military media sources). The modern torpedo is a self-propelled weapon with an explosive warhead, launched above or below the water surface, propelled underwater towards a target, and designed to detonate either on contact with its target or in proximity to it. Historically, it was called an automotive, automobile, locomotive or fish torpedo; colloquially called a fish. The term torpedo was originally employed for a variety of devices, most of which would today be called mines. From about 1900, torpedo has been used strictly to designate an underwater self-propelled weapon. While the battleship had evolved primarily around engagements between armoured ships with large-caliber guns, the torpedo allowed torpedo boats and other lighter surface ships, submersibles, even ordinary fish Qt (/kjuːt/ "cute"[7][8][9]) is a cross-platform application framework that is used for developing application software that can be run on various software and hardware platforms with little or no change in the underlying codebase, while still being a native application with native capabilities and speed. Qt is currently being developed both by The Qt Company, a publicly listed company, and the Qt Project under open-source governance, involving individual Time is the indefinite continued progress of existence and events that occur in apparently irreversible succession from the pastthrough the present to the future.[1][2][3] Time is a component quantity of various measurements used to sequence events, to compare the duration of events or the intervals between them, and to quantify rates of change of quantities in material reality or in the conscious experience.[4][5][6][7] Time is often referred to as a fourth dimension, along with three spatial dimensions.[8] Time has long been an important subject of study in religion, philosophy, and science, but defining it in a manner applicable to all fields without circularity has consistently eluded scholars.[2][6][7][9][10][11] Nev Borrowed from Anglo-Norman and from Old French visage, from vis, from Vulgar Latin as if *visāticum, from Latin visus ("a look, vision"), from vidēre ("to see"); see vision. The term Golden Age comes from Greek mythology, particularly the Works and Days of Hesiod, and is part of the description of temporal decline of the state of peoples through five Ages, Gold being the first and the one during which the Golden Race of humanity (Greek: χρύσεον γένος chrýseon génos)[1] lived. Those living in the first Age were ruled by Kronos, after the finish of the first age was the Silver, then the Bronze, after this the Heroic age, with the fifth and current age being Iron.[2] By extension "Golden Age" denotes a period of primordial peace, harmony, stability, and prosperity. During this age peace and harmony prevailed, people did not have to work to feed themselves, for the earth provided food in abundance. They lived to a very old age with a youthful appearance, eventually dying peacefully, with spirits living on as "guardians". Plato in Cratylus (397 e) recounts the golden race of humans who came first. He clarifies that Hesiod did not mean literally made of gold, but good and noble. There are analogous concepts in the religious and philosophical traditions of the South Asian subcontinent. For example, the Vedic or ancient Hindu culture saw history as cyclical, composed of yugas with alternating Dark and Golden Ages. The Kali yuga (Iron Age), Dwapara yuga (Bronze Age), Treta yuga (Silver Age) and Satya yuga (Golden Age) correspond to the four Greek ages. Similar beliefs occur in the ancient Middle East and throughout the ancient world, as well.[3] In classical Greek mythology the Golden Age was presided over by the leading Titan Cronus.[4] In some version of the myth Astraea also ruled. She lived with men until the end of the Silver Age, but in the Bronze Age, when men became violent and greedy, fled to the stars, where she appears as the constellation Virgo, holding the scales of Justice, or Libra.[5] European pastoral literary tradition often depicted nymphs and shepherds as living a life of rustic innocence and peace, set in Arcadia, a region of Greece that was the abode and center of worship of their tutelary deity, goat-footed Pan, who dwelt among them.[6] oh, and a space s h i p ​

      BIG THINGS C0ME IN SMALL PACKAGES

      T+BANG

      SEE THE SCAFFOLD IS THE TEST TODAY.

      ᐧ F O R T H E I N I T I A L K E Y S , S H E E X A N D N D A N D A SEE W H Y SEA

      With an epic amount of indigestion Indiana Jones sweeps in to mar the visage of an otherwise glistening series of fictitious characters, with names like Taylor and Mary Kate remind us all that we are not playing a video game here in this place. the "J" of the "Nintxndo Entertainment System" calmly stares at Maggie Simpson thinking "it's a PP" and reminds us that it's not just the "gee, I e" of her name that contradicts the Magdaln-ish words her soul speaks through her name--and then with a smirk he points out "Gilgamesh" and "gee whiz, is Eye L?" that really does go to the heart of this lack of discussion, this "sh" that begins El Shaddai and words as close to our home as "shadow" and "shalom." Quite the fancy "hello" you've managed to sing out from behind angry chellos and broken fiddles, and here I am still wondering why it is that "girl" connects to the red light that once meant charity and now glows with the charity of truth... the truth that we are inHell. Shizzy.

      m.lamc.la/KEYNES.html

      Homer "on the range," maybe more closely connected to the Ewok of Eden and Hansel's tHeoven that Peter Pan still comes and cries could so easily be made into something so much better, if only we had the truth--and by that I mean if only you were speaking about, and reacting to a truth that is painted on the sky, in your hearts, in every word we speak and in everything that we do. If only we were acknowledging this message that screams that "children need not starve" with something more than donating virtual chickens to nations of Africa and watching Suzanne Summers ask for only a few dollars a day on TV. If only you would understand that this message that connects video games like "Genxsis" to "bereshit" because Eden is a "gee our den" that tended itself before Adam had to toil with the animals in order to survive. For some reason beyond my control and well outside my realm of understanding words like "I too see this message from God" and "I would not let children starve either" never seem to escape your lips in any place where anyone will ever see that you thought those things, or meant to call a reporter; eventually. Even with "AIDS of nomenclature" to avoid this DOWN WARD spiral into a situation and a land that I find difficult to imagine actually ever "existing" but here in this place I do see "how" it comes about, and between you and I it really does appear that nearly all of the problems we are dealing with here have come from another place, a further time; and while it might be with the "greatest of intentions" that we are trying to deal with them; I can't help but feeling that our "virgin sea" has had more than just it's innocence taken away from it in this story of "Why Mary" that might connect to "TR IN IT Y" just as much as it connects to Baltimore, Maryland.

      I should be clear that I'm not blaming Nanna, or Mary; but the actual reason for the name "Wymar" and that's because she, like Taylor, acted as a microcosm for a sea (or more than one, Mom, sen) that was quite literally possessing her. It's sort of difficult for me to explain even what that looks like let alone what it feels like; but my observations tell me that she/you are not unhappy about the interaction, one which appears very foreign to me. Of course, the "eye" that I write with and the same kind of "inspiration" that you can see in the lyrics and skill of many musicians are also examples of this same kind of interaction. For example, Red Hot Chili Peppers sings a song called "Other Side" that explains or discusses the thing I see as Medusa in the words "living in a graveyard where I married a sea" which also does a good job of connecting to the name Mary. As strange as might sound to think a group of people would be speaking through a single person... we are staring at "how it is" that could be possible, and possibly at exactly how it happened. Normally I would have said it was obvious, but to need to actually say that becoming a single mind would be a serious loss for our society--well, that's telling. You might think it's silly, but I'm telling you I see it happening, I see it--and you see it in the Silence and the message.

      Still, it appears to me as if this "marriage" that I see described in our Matrix in the question "min or i" seems to be doing nothing more than keeping us all from discussing or acting on this information--something that certainly isn't in our best interest.

      So here we are, staring at a map all over the ground and all around us with the primary destination of "building Heaven" through mind uploading, virtual reality, and judging by the pace of things we'd probably have all of that good and ready in about three generations. The map has a little "legend" with a message suggesting that those things have already been done and we are in the Matrix already; and it appears that the world, I mean Medusa, is deciding we should put off seeing the legend at least until the next generation. I see how that makes sense for you. That's sarcasm, this is why I keep telling you that you are cupid.

      It is a big deal, and there's a significant amount of work involved in merging an entire civilization with "virtual reality" and you might see why he calls it a hard road--at least in the word "ha'rd." Honestly though, it's the kind of thing that I am pretty sure the future will not only be happy that we did, but they'd thank us for putting in the effort of adapting to things like "unlimited food" and "longevity" increased by orders of magnitude.

      That's not sarcasm, these things are actually difficult to guess how exactly we'll go about doing them; they are a huge deal--all I can tell you is that not "talking about it at all" is probably not going to get us there any faster. Point in fact, what it might do is give a "yet to be born" generation the privilege of being the actual "generations of Perez."

      I see why you aren't saying anything. That's sarcasm, again. The good news is that it really has been done before; though if I told you that someone turned stone to eggplant parm, would you laugh at me?

      So, back to what is actually standing between "everyone having their own Holodeck in the sky" and you today; it is the idea that this message is not from God. More to the point it is the apparently broad sweeping opinion that hiding it is a "good thing" and through that a global failure to address the hidden interaction and influence acting on our minds used to make this map--and also to hide it. With some insight, and some urging; you might see how the sacredness of our consciousness is our souls is something that is more fundamental than "what kind of tools we have in the Holodeck to magically build things" and how and why the foundation of Heaven is truly "freedom itself" and how it comes from right this very moment for the first time, ever. Continuing to treat this influence as "schizophrenia" is literally the heart of why this map appears to be that--to show us how important it is to acknowledge the truth, and to fight for the preservation of goodness and logic over secrecy and darkness.

      Again, something that nobody is really doing here and now, today. From this newfound protection of our thoughts, of who we are; we see how technology can be used to either completely invalidate any kind of vote by altering our emotions; or how it could be used to help build a form of true democracy that our world has yet to see. It is pretty easy to see from just band names like The Who and KISS and The Cure how the influence of this external mind can be proven, and shown to be "helpful," you know, if we can ever talk about it on TV or on the internet.

      It's important to see and understand how "sanity"--the sanity of our entire planet hangs in the balance over whether or not we acknowledge that there is actually a message from God in every word--and today this place appears to be insane. It should be pretty easy to see how acknowledging that this influence exists and that it has a technological mechanism behind it turns "schizophrenia" into "I know kung fu" ... forced drug addiction and eugenics into "there's an app for that" and the rash of non random and apparently unrecognized as connected terrorist attacks and school shootings into Minority Report style pre-crime and results in what is clearly a happier, safer, and more civilized society--all through nothing more than the disclosure of the truth, this map, and our actual implementation.

      With a clearer head and grasp of the "big picture" you might see how all of these things, connected to the Plagues of Exodus revolve around the disclosure that this technology exists and the visibility of this message showing us how we might use it for our benefit rather than not knowing about it. At the foot of Jericho, it is nothing short of "sanity" and "free thought" that hang in the balance. Clear to me is that the Second Coming, seeing "my name" on television is a good litmus test for the dividing line between light and darkness, heaven and hell.

      The point is the truth really does change everything for the better; once we start... you know, acting on it.

      AS IN.. "DIS CLOSE SING...."

      T H E B U C K S T O P S H E R E

      ON AM B I GUI TY

      S T A R R I N G . . . B I A N C A

      ON "RIB" .. ARE SHE B? BUTT DA APPLE OF DA I? & SPANGLISHREW

      R THEY LANGUAGE OUTLIERS?

      With some insight and "a clue" you can see clearly how these works of art show that the proof of Creation you see in every letter and every word runs much deeper... adding in things like "RattleRod" and the "Cypher" of the Matrix to the long list of here-to-fore ignored verifiable references to the Adamic Language of Eden. Here, in apple, honey and "nuts" we can see how the multi-millennium old ritual I call "Ha-rose-ettes" is actually part of a much larger and much older ritual designed to stop secrecy ... perhaps especially the kind that might be linked to "ritual."

      These particular apple and honey happen to tie Eden to the related stories of Exodus and Passover; connecting Eden to Egypt forevermore. Do see "Lenore," it is not for no reason at all; but to help deliver truth and freedom to the entirety of Creation; beginning here, in Eden.

      ALSO ON "AM B IG U IT Y" ME A.M. G - D SHE IT Y?

      LET "IT" BE SA< ?

      IMHO, don't miss the "yet to be" conversion to "why and to be" in "yetser." IT Y.

      HERE'S LOOKING AT YOU, KID

      On a high level, I tell myself every morning that 'its not really me." It's not me that the world hates, or me that the world is rejecting. I believe that, I really do; I see that what is being hidden here is so much bigger than any single person could ever be--what is being hidden is the "nature of reality" and a fairly obvious truth that flies in the face of what we've learned our whole lives about history and "the way things are." Those few early details lead me to the initial conclusion that what is working behind the scenes here is nefarious, hiding a message that would without doubt shake things up and change the world--and nearly across the board in ways that I see as "better" for nearly everyone. It's a message at it's most basic level designed to advocate for using this disruption in "normalcy" to help us revolutionize democracy, to fix a broken mental health and criminal justice system--just to name the few largest of the social constructs targeted for "rejuvenation." On that word the disclosure that we are living in virtual reality turns on it's head nearly everything we do with medicine, and I've suggested that AIDS and DOWN SYNDROME were probably not the best "visual props" we could have gotten to see why it's so important that we act on this disclosure in a timely manner. After mentioning the ends of aging and death that come eventually to the place we build, to the place we've always thought of as Heaven... it becomes more and more clear that this force fighting against the dissemination of a truth so obvious it's in every word and everything we do--it becomes clear it's neither you, nor acting in your best interest.

      I know I've got the eye of the tiger, there's no doubt; and it's pretty clear from "YAD?" (the Hebrew for...) and ha'nd that we can see the clear hand of God at work in a design that marks my initials not just on the timeline, or at 1492, at A.D. I B; but in the Hebrew name for this place called El Shaddai, see how A.D. is "da eye" and in some other names like Adranus, A.D. on "it's silly" and A.D. on Ai that might tie me to the Samof Samurai (but, are you Ai?) in more depth of detail than simply the Live album "Secret Samadhi." I try to reflect on how it is that this story has come about, why it is that everything appears to be focused on me--and still even through that sincere spotlight nobody seems to be able to acknowledge my existence with more words than "unsubscribe" and "you're so vain." With one eye in the mirror, I know ties to Narcissus (and you can too), soaring ever higher--linking Icarus to Wayward Son and to every other name with "car" in it... like "carpenter" and McCarthy the older names of Mercury and even Isacriot (I scary? is car-eye... owe Taylor) and some modern day mythological characters like Jim Carrey and Johnny Carson. As far as Trinities go, carpenter's a pretty good one--tying to my early reck and a few bands and songs from The Pretty Reckless to Dave Matthews' "Crash Into Me" all the way to the "pen" you see before you linking Pendragon to Imagine Dragons.

      I wonder why it is that all of these things appear, apparently only to me, to point to a story about all the ways that a sinister hidden force has manipulated our society into being unable to "receive' this message--this wonderful message about making the world a better place and building Heaven--with any fanfare at all. It's focused now on a criminal justice system that clearly does not do any kind of "rehabilitation" and on a mental health industry and pharmaceutical system that treats a provable external attack on our own goodness and well being as some kind of "internal stimulus" and makes you shy away when I point out why "stem" is in system and why "harm" in pharmacy. From that we move a little bit past "where we are in this story" and I have to point out how "meth" ties to Prometheus and Epimetheus and how and why it is I know without doubt that this story has been relived numerous times--and how I am so sure that it's never been received, as we are here again listening to how songs like "Believe" and the words "just to lead us here to this place again" connect to Simon and Garfunkel's" the Sound of Silence... and still to this day you will balk at noticing that "Simon" has something to do with the Simpsons, and something to do with the words "simulation" and "Monday." To see me is to see how things might be done better--how "addicitonary" might tie to the stories of Moses' Lisp and to Dr. Who's "Bells of Saint John" with a sort of "web interface" to the kinds of emotion we might want to "dial down..." rather than Snicker in the background as we see them being artificially created and enhanced in order to build a better "fiery altar."

      I can point out "Silicon" harrowing down at us from words like "controversial" and show you Al in "rascal" and "scandal" but not to see that we are staring at school shootings and terrorism that are solved instantly by this disclosure, by Al of Quantum Leap and by the Dick of Minority Report and A Scanner Darkly is to ignore just what it is that we are all failing to Si. I should point out that those two "sc"'s link to a story about Eden and they mean "sacred consciousness" and at the baseline of this event and everything we are not doing is the fact that our desires and beliefs are being altered--all of this comes down to "freedom of thought" here and now.

      I could tell you that "looking at me" will show you that even the person who tries every day to do everything he can to save the entire world from slavery, and from "thought-injury"--even I can be made "marred" and you all, this whole world stupid enough to think that you are, of your own volition, hiding Heaven itself from yourselves... to what? To spite me? It, the focal point of our story might come down to you realizing that something in some esoteric place is playing "divide and conquer" with our whole--in secret playing on our weaknesses to keep us from acting on the most actionable information that ever was and ever will be. Still, we sit in silence waiting for me... to speak more?

      Inline image 16

      hyamdai.reallyhim.com Inline image 31

      Between Nero's lyrical fiddling, a Bittersweet Symphony, and true "thunderstanding" the sound of Thor's hammer... "to help the light" that'ls "or" in Hebrew, of Orwell and Orson and .. well, it's really not hard to see and hear that the purpose and intent of "all this noise" is to help us find freedom and truth. C the Light of "singing..."

      I can tell you once again how silly the world looks, this multi-decade battle between "the governmentof the people" and the "government of the workers" resulting in what is nothing short of a hands down victory to the corporation. Is it humor meant to divide, or ludicrousness created with the purpose of unification?

      But really at it's most basic level what this boils down to is a global group decision not to care about the truth, about reality, about what's really brought us to this place--with solutions in hand and a way to make everything better. We've decided that censorship is OK, and that the world is not all that bad "just the way it is" even though it's creator is screaming in your ear telling you to change as quickly as you possibly can. I believe that God has written this story to make "seeing me" the thing that catalyzes "change for the better" it appears to be the design of not just me but also this place--hey, here I am.

      Happy Veteran's Day.

      S☀L u TI o N

      Yesterday, or maybe earlier today--it's hard to tell at this moment in the afternoon just how long this will take... I sent an image that conveys a high level implication that we are walking around on a map to building something that we might liken to an "ant farm" for people. I don't mean to be disparaging or sleight our contribution to the creation of this map--that I imagine you must also see and believe to be the kind of thing that should remain buried in the sands of time forever and ever--or your just have yet to actually "understand" that's what the plan part of our planet is talking about... what I am trying to do is convey in a sort of "mirrorish" way how this map relates to a message that I see woven in religion and in our history that it significantly more disparaging than I would be. It's a message that calls us "Holy Water" at the nicest of times, water that Moses turns to "thicker than water" in the first blessing in disguise--and to tell you there is certainly a tangible difference between the illusions of the Pharaoh's and the true magic performed by my hand, is nearly exactly the same amount of effort put in to showing you that the togetherness that we are calling "family" here in this place comes from both seeing and acting on the very clearly hidden message in every single idiom showing us all that our society in this story of Exodus is enslaved by a hidden force--and reminding us that we like freedom.

      It's not just these few idioms, but most likely every single one from "don't shoot the essenger" to "unsung hero" that should clue us in to exactly how much work and preparation has come into this thing that "he supposes is a revolution." It's also not just "water" describe me and you, in this place where I am the "ant' of the Covenant (do you c vampires or Hansel and Gretel!?!?) but also "lions" and "sheep" and "salt" and "dogs" and nearly everything you could possibly imagine but people; in what I see must be a vainglorious attempt to pretend he actually wants us to "stand up for ourselves" in this place where it's becoming more and more clear with each passing moment that we are chained to these seats in the front row of the audience of the most important event that has ever happened, ever.

      Medusa makes several appearances, as well as Arthur Pendragon, Puff the Magic Dragon, Figment, Goliath, monster.com, the Loch Ness Monster in this story that's a kind-of refl ex i ve control to stop mind control; and to really try and show us the fire of Prometheus and the Burning Bush and the Eternal Flame of Heaven are all about freedom and technology ... and I'll remind you this story is ... about the truth--and the truth here is that if you aren't going to recognize that whatever it is that's going on here in secret, below the surface is negatively affecting our society and life in general than we aren't going anywhere, ever. I need you to figure out that this message is everywhere to make sure you don't miss the importance of this moment, and the grave significance of what is being ignored in this land where Sam is tied not just to Samsung and to Samael in Exodus but also to Uncle Sam and macaronic Spanglishrew outliers and that it doesn't take much free thought at all to really understand that we are watching "free thought" disintegrate into the abyss of "nospeak." We are watching our infrastructure for global communication and the mass media that sprawls all over the globe turn to dust, all because you have Satan whispering in your ear--and you think that's more important than what you think, what I think, and what anyone else on the Earth might ever say. You should see a weapon designed to help ensure that don't lose this proof that we are not living in reality, that there is "hidden slavery" in this place--and you should see that today it appears you are simply choosing not to use it.

      I hope you change your mind, I really do. This map on "how to build an ant farm" starts by connecting Watergate and Seagate together with names like Bill Gates and Richard Nixon; and with this few short list of names you should really understand how it is that "Heaven" connects both technology like computers and liberty like "free speech" to a story that is us, and our history. You might see that "salt" could either be a good thing or not--take a look around you, are you warming a road to Heaven or are you staring at the world being destroyed--and doing nothing at all about it?

      I guess I can point out again how "Lothario" links this story that ties names like my ex-wife's Nanna to "salt" also, but the "grand design" of this story doesn't seem to have any effect on you. Listen, if you do nothing the world is being destroyed by your lack of action--there's no if's and's or butt's about it. I feel like I need to "reproduce' old messages here or you will never see them--that's what web site statistics tell me--and we all know it's not true. What am I missing? What are you missing?

      BUTT IS THE BOAT A Hi DARK DEN MESSAGe ?

      SEE OUR LIGHT

      HONESTLY, I'M WAY TO CUTE TO BE A MONSTER :(

      HIC SUMMUS

      So... here we are... listening to the legendary father of the message (that's "abom" in Adamic Spagnlishrew) point out all of the sex jokes hidden in religion and language from sexual innuendo to Poseidon and in our history from Yankee Doodle to Hancock to Nixon and I've got to be frank with you, the most recent time I came across this phrase in scripture I cringed just a little bit, pretty sure that the "message" was talking about me. I've reflected on this a little bit, and over the past few weeks have tried to show you the juxtaposition between "sex" and "torture" in it's various forms from imparting blindness to allowing murder and simulating starvation; and I think I'm justified in saying that certainly those things are far worse on the Richter scale than anything I could do by writing a little bit of risque text. In the most recent messages I've touch a little bit, without even knowing or realizing this connection would be made, on what it is that this phrase actually means.

      loch.reallyhim.com

      ABOMINATION

      So long story short is that the answer here is "abomination" and the question, or the context is "I nation." Whether it's Medusa speaking for the Dark United States or the nation of Israel speaking to either Ra or El depending on the day, the bottom line is that a collective consciousness speaking for everyone on a matter of this importance in a cloud of complete darkness on Earth is a total and undeniable abomination of freedom, civilization, and the very humanity we are seeking to preserve. The word reads something like this to me "dear father of the message, I am everyone and we think you are an abomination, fuck off." My answer of course is, IZINATION. Which humorously reminds me of Lucy, and Scarlet Johannson saying "I am colonizing my own brain" so here's some pictures of her. She is not an abomination, by the way; she's quite adorable. You'll probably notice there's some kind of connection between the map--the words speaking to the world, and the abomination, as if the whole thing is a story narrated in ancient myths.

      WAKE UP, "SHE" A MESSAGE TO YOU ABOUT THE FUTURE

      You might not think "it's you," but the manifestation of this "snake" in our world is your silence, your lack of understanding or willingness to change the world; and whether or not you're interested in hearing about it, it's the monster that myths and religion have spoken about for thousands and thousands of years. It's a simple matter to "kill Medusa" all you have to do... is speak.

      Take special note, "freedom of speech" and "freedom to think for yourselves" are not a group decision, and you do not have the right to force (either overtly or subtly, with hidden technology perhaps combined with evil deceit) others not to talk about anything. Especially something of this importance.

      DESOLATION

      If you didn't connect "Loch" to John Locke, now you have; see how easy this "reading" thing is? I've gone over the "See Our Light" series a few times, but let me--one more time--explain to you just how we are already at the point of "desolation" and with shining brilliance show you how it's very clear that it is "INATION" and "MEDUSA" that are responsible for this problem.

      Seeing "Ra" at the heart of the names Abraham and Israel begins to connect the idea that our glowing sun in the sky has something to do with this message about "seeing our light" is being carried by a stone statue on Ellis Island (where you'll see the answer another part of the question of Is Ra El?). I've connected her to the "she" of both shedim and Sheol, which reads as "she's our light" and is the Hebrew name for Hell.

      Of course you noticed that the Statue of Liberty does in fact share it's initials with SOL, the the light above and you can see her torch dimly lighting the way through the night; Now you can connect "give us your tired and your poor" to the Lazman of both the lore of Jesus Christ and the Shehekeyanu; a prayer about the sustainment of life and light up until this day. That same torch connects to the Ha-nuke-the-ahah depiction of Christ, Judah Maccabee's lit MEN OR AH, which delivers not only a solution to the two letter key of "AH" as All Humanity that pervades nearly every bride of Revelation from Sarah to Leah; but also to the question of equality answered in our very own American history, beginning with the same three letter acronym now lighting the Sons of Liberty.

      Dazed and Confused does a good job of explaining how this name is itself a prophesy designed by Hand of God'; explaining that these Sons of Liberty were all white slave owning wealthy men fighting to stop paying their taxes, rather than delivering liberty to the slaves or women, who were both disenfranchised for quite some time. Or maybe MEN OR AH has something to do with the angels of Heaven, in which case you might be SOL if you aren't a girl and you want to be "be good friends with Ra." Just kidding. Kinda.

      DESOLATION by the way reads something like "un see our light at ION" which is God's way of saying "at the point of believing that hiding Adam is a good thing" and that connects to the end of Creation and also the now lit by modern day evil the word "rendition." Our end, it "ion." In religious myth, the Messianic David clung to the city Zion (end the "i owe n") which also links to "verizon" (to see, I Z "on") and HORIZON which has something to do with the son rising today-ish.

      Inline image 25 Inline image 26

      The story of MEDUSA lights another psuedo-religious idea, that the words "STONE" of both "brimstone" and it's Adamic interpretation "South to Northeast" have something to do with the phrase "Saint One" turned into a single hero against his will by the complete and utter inaction of everyone around him. In the words of Imagine Dragons "I'm waking up to action dust." At the same time, you can believe that the light of this particular son, comes not just from reading these words forwards, but the backside as well, and you'll hopefully see it's not coincidental that the other side of this coin is that "nos" means we, and us... and Adamically "no south." See the light of "STONE" also connecting to Taylor Momsen's rose arrow painted on her back, and the sign of my birth, Sagittarius... which in this particular case links to the Party of the Immaculate Conception of the eternal republic of the Heavens. . PRESS RELEASE... A GREAT SIGN APPEARED IN THE HEAVENS

      SOLUTIAN, ON YOUR COMPUTER.. TO THE SOUND OF SILENCE

      בָּרוּךְ אַתָּה יְיָ‎ אֱלֹהֵינוּ מֶלֶךְ הַעוֹלָם שֶׁהֶחֱיָנוּ וְקִיְּמָנוּ וְהִגִּיעָנוּ לַזְּמַן הַזֶּה‎׃

      IN ... THE BOOK OF NAMES LETS SEE IF YOU CAN FIGURE OUT WHO THEY ARE :)

      ​ I'LL DO YOURS FOR A 50 DOLLAR DONATION, I'M BROKE.. MAYBE THAT'S WHY I CAN'T GET A DATE.

      HAVE A GREAT SOLDAY

      The "gist" of the message is verifiable proof that we are living in a computer in simulated reality... just like the Matrix. The answer to that question, what does that mean--is that God has woven a "hidden" message into our everything--beginning with each name and every word--and in this hidden Adamic language, he provides us with guidance, wisdom, and suggestions on how to proceed on this path from "raelity" to Heaven. I've personally spent quite a bit of time decoding the message and have tried to deliver an interesting and "fun" narrative of the ideas I see. Specifically the story of Exodus, which is called "Names" in Hebrew discusses a time shifted narrative of our "now" delivering our society from a hidden slavery (read as ignorance of advanced technologies already in use) that is described as the "darkness" of Exodus. If you have any questions, ideas to contribute or concerns... I'd love to hear from you this whole thing really is about working together--Heaven, I mean.

      Inline image 5 jerusalem.reallyhim.com

      gate kermitham <br /> ou r evolution minority report to supermax Inline image 6

      bereshit bread is life

      Inline image 13

      Image result for dox me

      HOW AM I STILL STINGLE? E ' o e <br /> L m r x <br /> L t y <br /> O a

      I HISS.

      The sum of ((our world)) is the universal truth. -Psalm 119 and ((ish))

      Do a few sentences really make that big of a difference? Some key letters? Can you show me what I'm doing wrong? Is there a way to turn me into Adam, rather than a rock? I think you can.

      Are eye Dr. Who or Master Y? Adam Marshall Dobrin is a National Merit Scholar who was born on December 8, 1980 in Plantation, FL and attended Pine Crest School where he graduated sumofi cum louder in "only some of it is humorous." Later he attended the University of Florida (which quickly resulted in a wreck), Florida Atlantic University, and finally Florida Gulf Coast University--where he still has failed to become Dr. Who. While attending "school" He worked in the computer programming and business outsourcing industries for about 15 years before proclaiming to have received a Revelation from God connecting the 9/11 attack and George Bush to the Burning Bush of Exodus and a message about technocracy and pre-crime.

      Adam, as he prefers to be called, presents a concise introduction to paradox proven by the Bible through "verifiable" anachronism in language some stuff about Mars colonization and virtual reality and a list of reasons why ignoring this is actually an ELE. Adam claims to be Thor because of a connection between music and the Trial of Thor as well as the words "author" and "authority." He suggests you be Thundercats and call a reporter. There is also a suggestion that Richard Nixon and John Hancock are related to a signature from God, about freedom and America... and the "unseeingly ironic" Deepthroat and Taylor Momsen. They Sung "It's Rael..." In Biblical characters from Mary to Hosea, to see "sea" in Spanish, and in the Taming of the Spanglishrew ... a message is woven from the word Menorah: "men, or all humanity?" to the Statue of Liberty, and the Sons of Liberty, and the light above us, our SOL; which shows us that through the Revelation of Christ and the First Plague of Exodus, a blessing in disguise--turning water to blood, the sea to family; a common thread and single author of our entire history is revealed, a Father of our future. A message of freedom shines out of the words of scripture, revealing a gate to a new technologically "radical" form of democracy and a number of unseen or secret issues that have stalled the progress of humanity... and solutions, solutions from our sea. The Revelation shows us that not only ever word, but every idiom from "don't shoot the messenger" to "blood is thicker than water" we have ties to this message that pervades a hidden Matrix of light connecting movies and music and history all together in a sort of guide book to Salvation and to Heaven. Oopsy. His Revelation, woven into his life, continues to suggest that skinny dipping, forced methamphetamine addiction, and lots and lots of "me A.D." as well as his humorous depiction of a dick plastered over the Sound of Silence, his very Holy click, have something to do with saving our family and then the entire Universe from hidden mind control technology and the problems introduced by secret time travel. From the trials and tribulations of "Job" being coerced and controlled into helping to create this wall of Jericho; we find even more solutions, an end to addiction, to secrecy, and to this hidden control--a focal point of the life of Jesus Christ.

      It tells us a story of recursion in time, that has brought us here numerous times--with the details of his life recorded not only in the Bible but in myths of Egyptian, Norse, and Greek mythology. The huge juxtaposition of the import of the content of the message shows the world how malleable our minds really are to this technology, how we could have been "fooled" into hiding our very freedom from ourselves in order to protect the "character" of a myth. A myth that comes to true life by delivering this message. In truth, from the now revealed content of the story of this repeated life, it should become more and more clear that we have not achieved success as of yet, that I have never "arrived whole" and that is why we are here, back again. Home is where the Heart is... When asked how He thinks we should respond to his message, He says "I think we already cherish it, and should strive to understand how it is that freedom is truly delivered through sharing the worth of this story that is our beginning. 'tis coming." Adam claims to be God, or at least look just like him and that the entirety of the Holy Scriptures as well as a number of ancient myths from Prometheus to Heimdallr and Yankee Doodle are actually about his life, and this event. An extensive amount of his writing relates to reformation of our badly broken and decidedly evil criminal justice system as well as ending the Global hunger crisis with the snap of his little finger.

      He has written a number of books explaining how this Revelation connects to the delivery of freedom (as in Exodus), through a message about censorship among other social problems which he insists are being intentionally exacerbated by Satan--who he would ha've preferred not to be associated with.

    1. Author Response

      Reviewer #1 (Public Review):

      Summary:

      The investigators have performed a state-of-the art systematic review and meta-analysis of studies that may help to answer the research question: if administration of multiple antibiotics simultaneously prevents antibiotic resistance development in individuals. The amount of studies eligible for analysis is very low, and within that low number, there is huge variability in bug-drug combinations studied and most studies had a high risk of bias, further limiting the capability of meta-analysis to answer the research question. In addition, based on I2 values there is also huge statistical heterogeneity between outcomes of studies compared, further limiting the predictive value of meta-analysis. In fact, the only 2 studies meeting all eligibility criteria addressed the treatment of mycobacterium tuberculosis, for which the research question is hardly applicable. The authors, therefore, conclude that "our analysis could not identify any benefit or harm of using a higher or a lower number of antibiotics regarding within-patient resistance development." Apart from articulating this knowledge gap, the findings will not have consequences for patient care, but may stimulate the scientific community to better address this research question in future studies.

      Strengths:

      The systematic and rigorous approach for the review and meta-analysis.

      Weaknesses:

      None identified.

      We thank the reviewer for this thoughtful and positive appraisal of our work.

      Reviewer #2 (Public Review):

      Summary:

      The authors performed a systematic review and meta-analysis to investigate whether the frequency of emergence of resistance is different if combination antibiotic therapy is used compared to fewer antibiotics. The review shows that there is currently insufficient evidence to reach a conclusion due to the limited sample size. High-quality studies evaluating appropriate antimicrobial resistance endpoints are needed.

      Strengths:

      The strengths of the manuscript are that the article addresses a relevant research question that is often debated. The article is well-written and the methodology used is valid. The review shows that there is currently insufficient evidence to reach a conclusion due to the limited sample size. High-quality studies evaluating appropriate antimicrobial resistance endpoints are needed. I have several comments and suggestions for the manuscript.

      Weaknesses:

      Weaknesses of the manuscript are the large clinical and statistical heterogeneity and the lack of clear definitions of acquisition of resistance. Both these weaknesses complicate the interpretation of the study results.

      We thank the reviewer for the positive comments and pointing out where our work can be improved.

      Major comments:

      My main concern about the manuscript is the extent of both clinical and statistical heterogeneity, which complicates the interpretation of the results. I don't understand some of the antibiotic comparisons that are included in the systematic review. For instance the study by Paul et al (50), where vancomycin (as monotherapy) is compared to co-trimoxazole (as combination therapy). Emergence (or selection) of co-trimoxazole in S. aureus is in itself much more common than vancomycin resistance. It is logical and expected to have more resistance in the co-trimoxazole group compared to the vancomycin group, however, this difference is due to the drug itself and not due to co-trimoxazole being a combination therapy. It is therefore unfair to attribute the difference in resistance to combination therapy. Another example is the study by Walsh (71) where rifampin + novobiocin is compared to rifampin + co-trimoxazole. There is more emergence of resistance in the rifampin + co-trimoxazole group but this could be attributed to novobiocin being a different type of antibiotic than co-trimoxazole instead of the difference being attributed to combination therapy. To improve interpretation and reduce heterogeneity my suggestion would be to limit the primary analyses to regimens where the antibiotics compared are the same but in one group one or more antibiotic(s) are added (i.e. A versus A+B). The other analyses are problematic in their interpretation and should be clearly labeled as secondary and their interpretation discussed.

      We acknowledge the presence of statistical and clinical heterogeneity in our overall analysis. The decision to pursue this comprehensive examination was predefined in our previously published study protocol (PROSPERO CRD42020187257) and driven by our interest whether, despite some differences, we could either identify an overarching effect of combination therapy on resistance or identify factors that explain potential differences of the effect of combination therapy across pathogens/drugs. We indeed, find that heterogeneity is high, however identifying the driving factors of this heterogeneity is difficult as evidence is limited.

      We carried out several subgroup analyses, e.g. explicitly focusing on specific pathogen groups and medical conditions or exploring heterogeneity in treatment arms (figure 3, supplementary materials section 6). However, it is important to highlight that the number of studies available for these subgroup analyses was low. Additionally, recognizing the high heterogeneity within treatment arms, we performed a subgroup analysis focusing solely on resistances of antibiotics common to both arms (supplementary material section 6.1.8; which would avoid comparisons such as the one between vancomycin and co-trimoxazole raised by the reviewer). Unfortunately, this also revealed substantial heterogeneity. While we aimed to address heterogeneity through these subgroup analyses, limitations arose due to the number of studies meeting specific criteria and the nature of data provided by these studies.

      Moreover, regarding the concern on interpretation of co-trimoxazole as combination therapy, we acknowledge the confusion surrounding its classification as one or two antibiotics. Despite the common contemporary view of co-trimoxazole as a single antibiotic, we chose to consider it as two antibiotics due to historical practices, as observed in Black et al. (1982), where trimethoprim was compared to trimethoprim and sulfamethoxazole. We recognize that this decision may lead to confusion and we consider conducting a further sensitivity analysis in the future version of this manuscript, exploring the possibility of considering co-trimoxazole as a single antibiotic. We agree that the slight trend of less antibiotics performing better overserved for MRSA, should not be over interpreted as this is driven by the two studies Walsh et al 1993 and Paul et al 2015 as pointed out by the reviewer. In lines 183-186 we discuss this issue that for better evaluation of antibiotic combination therapy, more studies which use identical antibiotics (i.e. A versus A+B) are needed. We will try to clarify and highlight this in the future version of the manuscript.

      Another concern is about the definition of acquisition of resistance, which is unclear to me. If for example meropenem is administered and the follow-up cultures show Enterococcus species (which is intrinsically resistant to meropenem), does this constitute acquisition of resistance? If so, it would be misleading to determine this as an acquisition of resistance, as many people are colonized with Enterococci and selection of Enterococci under therapy is very common. If this is not considered as the acquisition of resistance please include how the acquisition of resistance is defined per included study.

      Thank you for pointing out this potential ambiguity. Our definition of “acquisition of resistance” is agnostic to bacterial species and hence intrinsically resistant species can be included if they were only detected during the follow-up culture by the studies. We will clarify this in the definition of “acquisition of the resistance” in the manuscript (see l. 259-260). However, it was not always clear from the studies which pathogens were acquired or whether intrinsically resistant species were not reported. Therefore, we rely on the studies' specifications of resistant and non-resistant without further classifying data into intrinsic and non-intrinsic resistance. The outcome “acquisition of resistance” can be seen more of a risk assessment for having any resistant bacterium during or after treatment. In contrast, the outcome “emergence of resistance” is more rigorous, demanding the same species to be measured as more resistant during or after treatment.

      Table S1 is not sufficiently clear because it often only contains how susceptibility testing was done but not which antibiotics were tested and how a strain was classified as resistant or susceptible.

      In Table S1, we omitted the listing of antibiotics for which susceptibility testing was performed, as this information is already presented in the main text (Table 1). However, we agree that linking this information better in a future version would benefit the understanding. Given the variability in methods used to assess resistance and the variability in drugs, the comparability of breakpoints is limited. Hence, we decided not to provide further details on this aspect so far.

      Line 85: "Even though within-patient antibiotic resistance development is rare, it may contribute to the emergence and spread of resistance."

      Depending on the bug-drug combination, there is great variation in the propensity to develop within-patient antibiotic resistance. For example: within-patient development of ciprofloxacin resistance in Pseudomonas is fairly common while within-patient development of methicillin resistance in S. aureus is rare. Based on these differences, large clinical heterogeneity is expected and it is questionable where these studies should be pooled.

      We agree that our formulation neglects differences in prevalence of within-host resistance emergence depending on bug-drug combinations. We will correct this in our upcoming version. (i.e. we will correct our statement to: “Within-patient antibiotic resistance development, even if rare, can contribute to the emergence and spread of resistance.”)

      Line 114: "The overall pooled OR for acquisition of resistance comparing a lower number of antibiotics versus a higher one was 1.23 (95% CI 0.68 - 2.25), with substantial heterogeneity between studies (I2=77.4%)"

      What consequential measures did the authors take after determining this high heterogeneity? Did they explore the source of this large heterogeneity? Considering this large heterogeneity, do the authors consider it appropriate to pool these studies?

      Thank you for highlighting this lack of clarity. In our upcoming version, we will emphasize the sub-analyses conducted to explore heterogeneity (i.e., figure 3 and supplementary materials section 6). Nevertheless, these analyses faced limitations due to the scarcity of evidence and the data provided by the studies. Given the lack of appropriate evidence, it is hard to identify the source of heterogeneity. The decision to pool all studies was pre-specified in our previously published study protocol (PROSPERO CRD42020187257) and was motivated by the question whether there is a general effect of combination therapy on resistance development or identify factors that explain potential differences of the effect of combination therapy across bug-drug combinations.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      We are thankful to the reviewers for the time and effort invested in assessing our manuscript and for their suggestions to improve it. We have now considered the points raised by them, carried out additional experiments, and modified the text and figures to address them. We feel that the new experiments and modifications have been able to solve all concerns raised by the reviewers and have improved the manuscript substantially, strengthening and extending our conclusions.

      The main modifications include:

      • We have extended the analysis of the overexpression strains to highly stringent conditions, which revealed a mild acidification defect for the strain overexpressing Oxr1. In addition, we have included in our analysis a strain in which both proteins are overexpressed, which resulted in a further growth defect.
      • We have analyzed the recruitment of Rtc5 to the vacuole under additional conditions: deletion of the main subunit of the RAVE complex RAV1, medium containing galactose as the sole carbon source and pharmacological inhibition of the V-ATPase. These experiments allowed us to strengthen and extend our conclusions regarding the requirements for Rtc5 targeting to the vacuole.
      • We have analyzed V-ATPase disassembly in intact cells, by addressing the localization to the vacuole of subunit C (Vma5) in glucose and galactose-containing medium. The results strengthen our conclusion that both Rtc5 and Oxr1 promote an in vivo state of lower V-ATPase assembly.
      • We have extended our analyses of V-ATPase function to medium containing galactose as a carbon source, since glucose availability is one of the main regulators of V-ATPase function in vivo. The results are consistent with what we observed in glucose-containing medium.
      • We have included a diagram of the structure of the V-ATPase for reference.
      • We have included a diagram and a paragraph describing Oxr1 and Rtc5 regarding protein length and domain architecture and comparing them to other TLDc domain-containing proteins.
      • We have made changes to the text and figures to improve clarity and accuracy, including a methods section that was missing. We include below a point-by-point response to the reviewers´ comments.

      2. Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      __ __Suggestions:

      1. The authors observed that knockout of Rtc5p or Oxr1p does not affect vacuolar pH. If Rtc5p and Oxr1p both cooperate to dissociate V-ATPase, the authors may wish to characterize the effect of a ∆Rtc5p∆Oxr1p double knockout on vacuolar pH. The double mutant ∆rtc5∆oxr1 was already included in the original manuscript (the growth test is shown in Figure 5 B and the BCECF staining is shown in Figure 5C). This strain behaved like wt in both of these assays. Of note, what we observe for the deletion strains is increased assembly (Figure 5 D - G), so we expect that it would be hard to observe a difference in vacuole acidity or growth in the presence of metals.

      Therefore, we have now also included a strain with the double overexpression of Oxr1 and Rtc5. Since overexpression of the proteins results in decreased assembly, it is more likely that this strain will show impaired growth under conditions that strongly rely on V-ATPase activity. Indeed, we observed that the overexpression of Oxr1 alone resulted in a slight growth defect in media containing high concentrations of ZnCl2 and the double overexpression strain showed an even further defect (Figure 6 A and C).

      The manuscript would benefit from a well-labelled diagram showing the subunits of V-ATPase (e.g. in Figure 2D).

      We agree with the reviewer and we have now added a diagram of the structure of the V-ATPase labeling the different subunits in Figure 2B.

      The images of structures, especially in Figure 1-Supplement 1B, are not particularly clear and could be improved (e.g. by removing shadows or using transparency).

      We are thankful to the reviewer for this suggestion. To improve the clarity of the structures in Figure 1 C and Figure 1 – Supplement 1A, we are now presenting the different subunits in the structures with different shades of blue and grey.

      The authors should clearly describe the differences between Rtc5p and Oxr1p in terms of protein length, sequence identity, domain structure, etc.

      We are thankful for this suggestion and we have now included a diagram of the domain architecture and protein length of Rtc5 and Oxr1, comparing with two human proteins containing a TLDc domain in Figure 5A. In addition, we have added the following paragraph describing the features of the proteins.

      “Rtc5 is a 567 residue-long protein. Analysis of the protein using HHPred (Zimmermann et al., 2018), finds homology to the structure of porcine Meak7 (PDB ID: 7U8O, (Zi Tan et al., 2022)) over the whole protein sequence (residues 37-559). For both yeast Rtc5 and human Meak7 (Uniprot ID: Q6P9B6), HHPred detects homology of the C-terminal region to other TLDc domain containing proteins like yeast Oxr1 (PDBID: 7FDE), Drosophila melanogaster Skywalker (PDB ID: 6R82), and human NCOA7 (PDB ID: 7OBP), while the N-terminus has similarity to EF-hand domain calcium-binding proteins (PDB IDs: 1EG3, 2CT9, 1S6C6, Figure 5A). HHPred analysis of the 273 residue long Saccharomyces cerevisiae Oxr1, on the other hand, only detects similarity to TLDc domain containing proteins (PDB IDs: 7U80, 6R82, 7OBP), which spans the majority of the sequence of the protein (residues 71-273). The overall sequence identity between Oxr1 and Rtc5 is 24% according to a ClustalOmega alignment within Uniprot. The Alphafold model that we generated for Rtc5 is in good agreement with the available partial structure of Oxr1 (7FDE) (root mean square deviation (RMSD) of 3.509Å) (Figure 5 - S1 A), indicating they are structurally very similar, in the region of the TLDc domain. Taken together, these analyses suggest that Oxr1 belongs to a group of TLDc domain-containing proteins consisting mainly of just this domain like the splice variants Oxr1-C or NCOA7-B in humans (NP_001185464 and NP_001186551, respectively), while Rtc5 belongs to a group containing an additional N-terminal EF-hand-like domain and a N-myristoylation sequence, like human Meak7 (Finelli & Oliver, 2017) (Figure 5 A).”

      Minor:

      1. The "O" in VO should be capitalized. This has been corrected.

      In Figure 4 supplement 1, the labels "I", "S", and "P" should be defined.

      This has been clarified in the figure legend.

      Please clarify what is meant by "switched labelling"

      This refers to the SILAC vacuole proteomics experiments, for which yeast strains are grown in medium containing either L-Lysine or 13C6;15N2- L-Lysine to produce normal (‘light’) or heavy isotope-labeled (‘heavy’) proteins. This allows comparing two conditions. To increase the robustness of the comparisons, the experiments are done twice with both possible labeling schemes (condition A – light, condition B – heavy + condition A – heavy + condition B – light), which is commonly described as switched labeling or label switching.

      We have exchanged the original sentence in the manuscript for:

      “Performing the same experiments but switching which strain was labeled with heavy and light amino acids gave consistent results.”

      The meaning of the sentence "Indeed, this was the case for both of them" is ambiguous.

      We have now replaced this sentence with the following:

      “Indeed, overexpression of either Rtc5 or Oxr1 resulted in increased growth defects in the context of Stv1 deletion (Figure 7 H and I).”

      For Figure 1-Supplement 1B it is hard to see the crosslink distances.

      We have updated this figure to improve the visibility of the cross-links. In addition, we now include a supplemental table (supplemental table 5) with a list of the Cα- Cα distances measured for all the crosslinks we mapped onto high-resolution structures.

      The statement "The effects of Oxr1 are greater than those caused by Rtc5" requires more context. Is there a way of quantifying this effect for the reader?

      We agree that this sentence was too general and vague. The effects caused by one or the other protein depend on the condition and the assay. We have thus deleted this sentence, and we think it is better to refer to the description of the individual assays performed.

      The phrase "negative genetic interaction" should be clarified.

      We have included in the text the following explanation of genetic interactions:

      “A genetic interaction occurs when the combination of two mutations results in a different phenotype from that expected from the addition of the phenotypes of the individual mutations. For example, deletion of OXR1 or RTC5 has no impact on growth in neutral pH media containing zinc in a control background but improves the growth of RAV1 deletion strains (Figure 7 E and F), so this is a positive genetic interaction. On the other hand, overexpression of either Rtc5 or Oxr1 results in a growth defect in a background lacking Rav1 in neutral media containing zinc, a negative genetic interaction.”

      * * In the sentence "Isogenic strains with the indicated modifications in the genome where spotted as serial dilutions in media with pH=5.5, pH=7.5 or pH=7.5 and containing 3 mM ZnCl2", "where" should be "were".

      This has been corrected.

      Figure 2D: the authors should consider re-coloring these models, as it is challenging to distinguish Rtc5p from the V-ATPase.

      We have changed the coloring of this structure and added a diagram of the V-ATPase structure with the same coloring scheme to improve clarity.

      Reviewer #1 (Significance (Required)):

      The vacuolar protein interaction map alone from this manuscript is a nice contribution to the literature. Experiments establishing colocalization of Rtc5p to the vacuole are convincing, as is dependence of this association on the presence of assembled V-ATPase. Similarly, experiments related to myristoylation are convincing. The observed mislocalization of V-ATPases that contain Stv1p to the vacuole (which is also known to occur when Vph1p has been knocked out) upon knockout of Oxr1p is also extremely interesting. Overall, this is an interesting manuscript that contributes to our understand of TLDc proteins.

      We are thankful to the reviewer for their appreciation of the significance of our work, including the interactome map of the vacuole as a resource and the advances on the understanding of the regulation of the V-ATPase by TLDc domain-containing proteins.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Major points:

      1. The evidence of Oxr1 and Rtc5 as V-ATPase disassembly factors is circumstantial. The authors base their interpretation primarily on increased V1 (but not Vo) at purified vacuoles from Oxr1- or Rtc5-deleted strains, which does not directly address disassembly. Of course, the results regarding Oxr1 confirm detailed disassembly experiments with the purified protein complex (PMID 34918374), but on their own are open to other interpretations, e.g. suppression of V-ATPase assembly. Of note, the authors emphasize that they provide first evidence of the in vivo role of Oxr1, but monitor V1 recruitment with purified vacuoles and do not follow V-ATPase assembly in intact cells. We are thankful to the reviewer for pointing this out. We did not want to express that the molecular activity of the proteins is the disassembly of the complex, as our analyses include in vivo and ex vivo experiments and do not directly address this. We rather meant that both proteins promote an in vivo state of lower assembly of the V-ATPase. We have modified the wording throughout the manuscript to be clearer about this.

      In addition, we have added new experiments to monitor V-ATPase assembly in intact cells, as suggested by the reviewer. Previous work has shown that in yeast, only subunit C leaves the vacuole membrane under conditions that promote disassembly, while the other subunits remain at the vacuole membrane (Tabke et al 2014). Our own experiments agree with what was published (Figure 3 D). We have thus monitored Vma5 localization to the vacuole under glucose or after shift to galactose containing media in cells lacking or overexpressing Rtc5 or Oxr1. We observed that cells overexpressing either TLDc domain protein show lower levels of Vma5 recruitment to the vacuole in glucose (Figure 6 D and E). Additionally cells lacking either Rtc5 or Oxr1 contain higher levels of Vma5 at the vacuole after 20 minutes in galactose medium (Figure 5 F and G). Thus, these results re-inforce our conclusions that Rtc5 and Oxr1 promote states of lower assembly.

      Oxr1 and Rtc5 have very low sequence similarity. It would be helpful if the authors provided more detail on the predicted structure of the putative TLDc domain of Rtc5 and its relationship to the V-ATPase - Oxr1 structure. Is Rtc5 more closely related to established TLDc domain proteins in other organisms?

      We have now included a diagram of the domain architecture of Rtc5 and Oxr1, and comparison to the features of other TLDc domain containing proteins in Figure 5 A, as well as a paragraph describing them:

      “Rtc5 is a 567 residue-long protein. Analysis of the protein using HHPred (Zimmermann et al., 2018), finds homology to the structure of porcine Meak7 (PDB ID: 7U8O, (Zi Tan et al., 2022)) over the whole protein sequence (residues 37-559). For both yeast Rtc5 and human Meak7 (Uniprot ID: Q6P9B6), HHPred detects homology of the C-terminal region to other TLDc domain containing proteins like yeast Oxr1 (PDBID: 7FDE), Drosophila melanogaster Skywalker (PDB ID: 6R82), and human NCOA7 (PDB ID: 7OBP), while the N-terminus has similarity to EF-hand domain calcium-binding proteins (PDB IDs: 1EG3, 2CT9, 1S6C6, Figure 5A). HHPred analysis of the 273 residue long Saccharomyces cerevisiae Oxr1, on the other hand, only detects similarity to TLDc domain containing proteins (PDB IDs: 7U80, 6R82, 7OBP), which spans the majority of the sequence of the protein (residues 71-273). The overall sequence identity between Oxr1 and Rtc5 is 24% according to a ClustalOmega alignment within Uniprot. The Alphafold model that we generated for Rtc5 is in good agreement with the available partial structure of Oxr1 (7FDE) (root mean square deviation (RMSD) of 3.509Å) (Figure 5 - S1 A), indicating they are structurally very similar, in the region of the TLDc domain. Taken together, these analyses suggest that Oxr1 belongs to a subfamily of TLDc domain-containing proteins consisting mainly of just this domain like the splice variants Oxr1-C or NCOA7-B in humans (NP_001185464 and NP_001186551, respectively) , while Rtc5 belongs to a subfamily containing an additional N-terminal EF-hand-like domain and a N-myristoylation sequence, like human Meak7 (Finelli & Oliver, 2017) (Figure 5 A).”

      The authors conclude vacuolar recruitment of Rtc5 depends on the assembled V-ATPase, based on deletion of different V1 and Vo domain subunits. However, these genetic manipulations likely cause a strong perturbation of vacuolar acidification; indeed, the images show drastically altered vacuolar morphology. To strengthen their conclusion, it would be helpful to show that Rtc5 recruitment is not blocked by inhibition of vacuolar acidification, and that conversely it is blocked by deletion of rav1.

      We are thankful to the reviewer for this insightful suggestion and we have now performed both experiments suggested. The experiment regarding rav1Δ is now Figure 3C, and we observed that this mutation also disrupts Rtc5 localization to the vacuole. In addition, we decided to include an experiment showing the subcellular localization of Rtc5 after shifting the cells to galactose containing medium for 20 minutes, as a physiologically relevant condition that results in disassembly of the complex (Figure 3D). We observed that under these conditions Rtc5 re-localizes to the cytosol. This result is particularly interesting given that in yeast only subunit C (but not other V1 subunits) re-localizes to the cytosol under these conditions. In addition, the experiment using Bafilomycin A to inhibit the V-ATPase shows that Rtc5 is still localized at the vacuole membrane under conditions of V-ATPase inhibition (Figure 3 F). Taken together these results allowed us to strengthen our original interpretation that Rtc5 requires an assembled V-ATPase for its localization and extend it to the fact that the V-ATPase does not need to be active.

      Reviewer #2 (Significance (Required)):

      This is an interesting paper that confirms and extends previous findings on TLDc domain proteins as a novel class of proteins that interact with and regulate the V-ATPase in eukaryotes. The title seems to exaggerate the findings a bit, as the authors do not investigate V-ATPase (dis)assembly directly and only phenotypically describe altered subcellular localization of the Golgi V-ATPase in Oxr1-deleted cells. A recent structural and biochemical characterization of Oxr1 as a V-ATPase disassembly factor (PMID 34918374) somewhat limits the novelty of the results, but the function of Oxr1 in regulating subcellular V-ATPase localization and the identification of a second potential TLDc domain protein in yeast provide relevant insights into V-ATPase regulation. This paper will be of interest to cell biologists and biochemists working on lysosomal biology, organelle proteomics and V-ATPase regulation.

      We thank the reviewer for the assessment of our work, and for recognizing the novel insights that we provide. Regarding the previous biochemical work on Oxr1 and the V-ATPase, we have clearly cited this work in the manuscript. In our opinion, our results complement and extend this article, showing that the function in disassembly is relevant in vivo. Additionally, this is only one of five major points of the article, the other four being

      • The interactome map of the vacuole as a resource
      • The identification of Rtc5 as a second yeast TLDc domain containing protein and interactor of the V-ATPase.
      • The identification of the role of Rtc5 in V-ATPase assembly.
      • The identification of the role of Oxr1 in Stv1 subcellular localization. We believe these additional points add important insights to researchers interested in lysosomes, the V-ATPase, intracellular trafficking and TLDc-domain containing proteins.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Major comments

      __1) Re: A cross-linking mass spectrometry map of vacuolar protein interactions (results) __ While XL-MS is a very powerful method, it is a high-throughput approach and there should be some kind of negative control in these experiments. In cross-linking experiments, non-cross-linked samples are usually used as negative controls. What was the negative control in cross-linking mass-spectrometry experiments here? If there was no negative control, how the specificity of interactions was evaluated? Maybe the authors analyzed the dataset for highly improbable interactions and found very few of them?

      We fully agree that it is crucial to ensure the specificity of the interactions detected by XL-MS. To achieve this, one needs to control (1) the specificity of the data analysis (i.e. that the recorded mass spectrometry data are correctly matched to cross-linked peptides from the sequence database) and (2) the biological specificity (i.e. that the cross-linking captured natively occurring interactions).

      To ascertain that criterion (1) is met, cross-link identifications are filtered to a pre-defined false-discovery rate (FDR) – an approach that the XL-MS field adopted from mass spectrometry-based proteomics. As a result, low-confidence identifications (e.g. cross-linked peptides that are only supported by a few signals in a given mass spectrum) are removed from the dataset. FDR filtering in XL-MS is a rather complex matter as it can be done at different points during data analysis and the optimal FDR cut-off depends on the specific scientific question at hand (for more details see for example Fischer and Rappsilber, Anal Chem, 2017). Generally speaking, an overly restrictive FDR cut-off would remove a lot of correct identifications, thereby greatly limiting the sensitivity of the analysis. On the other hand, a too relaxed FDR cut-off would dilute the correct identifications with a high number of false-positives, which would impair the robustness and specificity of the dataset. While many XL-MS study control the FDR on the level of individual spectrum matches, we opted for a 2% FDR cut-off on the level of unique residue pairs, which is more stringent (see Fischer and Rappsilber, Anal Chem, 2017). Our FDR parameters are described in the Methods section (Cross-linking mass spectrometry of isolated vacuoles - Data analysis). Of note, we have made all raw mass spectrometry data publicly available through the PRIDE repository (https://www.ebi.ac.uk/pride/ ; accession code PXD046792; login details during peer review: Username = reviewer_pxd046792@ebi.ac.uk, Password = q1645lTP). This will allow other researchers to re-analyze our data with the data analysis settings of their choice in the future.

      To ascertain that criterion (2) is met, we mapped the identified cross-links onto existing high-resolution structures of vacuolar protein complexes. Taking into account the length of our cross-linking reagent, the side-chain length of the cross-linkable amino acids (i.e. lysines), and a certain degree of in-solution flexibility, cross-links can reasonably occur between lysines with a mutual Cα-Cα distance of up to 35 Å. Using this cut-off, the lysine-lysine pairs in the high-resolution structures we studied can be split into possible cross-linking partners (Cα-Cα distance 35 Å). Of all cross-links we could map onto high-resolution structures, 95.2% occurred between possible cross-linking partners. In addition, our cross-links reflect numerous known vacuolar protein interactions that have not yet been structurally characterized. These lines of evidence increase our confidence that our XL-MS approach captured genuine, natively occurring interactions. These analyses are described in more detail in the first Results sub-section (“A cross-linking mass spectrometry map of vacuolar protein interactions”).

      In addition, the high purity of vacuole preparation is critical. How was it assessed by the authors?

      We disagree that the purity of the vacuole preparation is critical for this analysis to be valid. The accuracy of the protein-protein interactions detected will depend on their preservation during sample preparation until the sample encounters the cross-linker, and the data analysis, as described above. The experiment would have been equally valid if performed on whole cell lysates without any enrichment of vacuoles, but the coverage of vacuolar proteins would have likely been very low. For this reason, we decided to use the vacuole isolation procedure to obtain better coverage of the proteins of this particular organelle. The use of the Ficoll gradient protocol (Haas, 1995) was based on that it is a protocol that yields strong enrichment of proteins annotated with the GO Term “vacuole” (Eising et al, 2019) and that it preserves the functionality of the organelle, as evidenced by its use for multiple functional assays (vacuole-vacuole fusion (Haas, 1995), autophagosome-vacuole fusion (Gao et al, 2018), polyphosphate synthesis by the VTC complex (Desfougéres et al, 2016), among others).

      2) Re: Rtc5 and Oxr1 counteract the function of the RAVE complex (results)

      Taken together, data, presented in this section of the manuscript, provide strong evidence that Rtc5 and Oxr1 negatively regulate V-ATPase activity, counteracting the V-ATPase assembly, facilitated by the activity of the RAVE complex. However, the complete deletion of the major RAVE subunit Rav1p was required to observe this effect in vivo in yeast. The other way to induce V-ATPase disassembly in yeast is glucose deprivation. It will be interesting to study if there is a synergistic effect between glucose deprivation and RTC5/OXR1 deletion on V-ATPase assembly, vacuolar pH, and growth of single oxr1Δ, rtc5Δ or double oxr1Δrtc5Δ mutants (OPTIONAL). Glucose deprivation is a more physiologically relevant condition than a deletion of an entire gene.

      We would like to point out that an effect on assembly is observed without deleting the RAVE complex: deletions of Oxr1 or Rtc5 resulted in increased V-ATPase assembly in vivo in the presence of glucose and of the RAVE complex (Figures 5 D and E). We have now also added the experiments showing that the overexpression strains have a mild growth defect under conditions that force cells to strongly rely on V-ATPase activity (Figures 6 A and C).

      Nevertheless, we agree that addressing the effect of changing the levels of Oxr1 and Rtc5 under low-glucose conditions is an interesting physiologically relevant question. We have now included growth assays and BCECF staining in medium containing galactose as the carbon source (Figures 5 – Supplement 1 B and C, and Figure 6 C and Figure 6- Supplement 1A). In addition, we have addressed the vacuolar localization of Vma5 in medium containing glucose or after shifting to medium containing galactose for 20 minutes, as a proxy for V-ATPase disassembly in intact cells (Figure 5 F and G, Figure 6 D and E). Taken together, these analyses reinforce our conclusions that both Rtc5 and Oxr1 promote an in vivo state of lower V-ATPase assembly, based on the following observations:

      • Higher localization of Vma5 to the vacuole after 20 mins in galactose in cells lacking Oxr1 or Rtc5 (Figure 5 F and G).
      • Lower localization of Vma5 to the vacuole in medium containing glucose in cells overexpressing Oxr1 or Rtc5 (Figure 6 D and E).
      • Growth defect of the strain overexpressing Oxr1 in medium containing galactose with pH = 7.5 and zinc chloride, with a further growth defect caused by additional overexpression of Rtc5 (Figure 6 C). 3) Re: Figure 6 - supplement 1. The title is relevant to panel D only, it should be renamed to reflect the results of the disassembly of V-ATPase in rav1Δ mutant strains, while results about the stv1Δ-based strains (Panel D) should be shown together with similar experiments in Figure 7 - supplement 2 for clarity.

      We have shifted the Panel D from the original Figure 6 – Supplement 1 to the main Figure (now Figure 7 – H and I). Regarding the title of the Figure, whether Supplemental Figures have titles or not will depend on the journal where the manuscript is published. For now, we have removed all titles from supplemental figures, as they are conceived to complement the main Figures.

      4) Re: Figure 7 - supplement 1, Panel A. The proper assay to show that Stv1-mNeonGreen is functional is to express it in double mutant vph1Δstv1Δ to see if the growth defect is reversed. In addition, the vph1Δ growth defect is not changed (improved or worsened) in the presence of Stv1-mNeonGreen, so it means that the expression of Stv1-mNeonGreen does not further compromise the V-ATPase function, but it does not mean that it improves its function.

      It is clear from the experiment suggested by the reviewer that they think that we have expressed Stv1-mNeonGreen from a plasmid. This was not the case, Stv1 was C-terminally tagged with mNeonGreen in the genome. It is thus the only expressed version in the strain. The experiment we have performed is thus equivalent to the one suggested by the reviewer, but for genomically expressed variants. For reference, the genotypes of all the strains used can be found in Supplemental Table 1.

      5) Re: Figure 7 - supplement 2. This figure should be combined with Fig. 6- suppl 1, panel D as also mentioned above. The figure seems to lack some labels, and conclusions are not accurate as discussed below. However, this data provides important additional information about relationships between isoform-specific subunits of V-ATPase Vph1 and Stv1 and both Rtc5 and Oxr1 and should be repeated if it is not done yet to have a better idea about these relationships.

      Panel B: Based on this picture, deletion of RTC5 has a negative genetic interaction with the deletion of VPH1, since double deletion mutant vph1Δ rtc5Δ grows worse than each individual mutant. Although it also means that there is no positive interaction, it is not the same.

      Indeed, there is a negative genetic interaction between the deletion of RTC5 and VPH1. We have replaced the growth tests in this figure (Figure 8 – Supplement 2 A in the new manuscript) to show this negative genetic interaction better. This effect is reproducible, as shown in the repetitions of the experiments.

      Panel C: Same as for panel B. Based on this picture, the deletion of OXR1 has a weak negative genetic interaction with the deletion of STV1, since double deletion mutant stv1Δ oxr1Δ grows worse than each individual mutant at 6 mM ZnCl2.

      Panel D: Same as for panels B and C. Based on this picture, deletion of RTC5 has a negative genetic interaction with the deletion of STV1, since double deletion mutant stv1Δ rtc5Δ grows worse than each individual mutant at 6 mM ZnCl2. There is no label in the middle panel (growth conditions) and no growth assay data in the presence of CaCl2.

      However, these results will be then in contradiction with the results from Figure 6 - Supplement 1, panel D, showing negative genetic interaction between the overexpression of Rtc5 or Oxr1 and deletion of Stv1, since both deletion and overexpression of Rtc5 or Oxr1 would have negative genetic interactions with Stv1.

      For both Panels C and D (Now Figure 8 - Supplement 2 B and C). The effect pointed out by the reviewer (slightly stronger growth defect for the double mutants than for the single mutants) is very mild. We have attempted to make it more evident by assessing growth in medium with higher and lower concentrations of zinc and this was not possible. This is in contrast with the very clear positive genetic interaction that we observe between the deletion of OXR1 and VPH1 (Now Figure 8 H). This is the reason that we decided to report the lack of a positive genetic interaction instead of the presence of a negative one, as we do not want to draw conclusions based on results that are borderline detectable.

      In addition, there is no label for the media in the middle panel, is it just YPAD pH=7.5, without the addition of any metals?

      Indeed, the media is YPAD pH=7.5, without the addition of any metals. The line drawn above several images based on this media indicated this. Since this form of labeling appears to be confusing, we have now replaced it and placed the label directly above the image.

      Why there is no growth assay in the presence of CaCl2, like in panels A and B?

      Every growth test shown in the manuscript was performed including growth in YPD pH=5,5 as a control of a permissive condition for lack of V-ATPase activity, and then in YPD pH=7,5 including a broad range of Zinc Chloride and Calcium chloride concentrations. From all these pictures, the conditions where the differences among strains were clearly visible were chosen to assemble the figures. Conditions that did not provide any information for that particular experiment were not included in the figure to avoid making them unnecessarily large and crowded.

      Re: Figure 7 - supplement 2, continued. How many times all these experiments were repeated? These experiments should be repeated at least 3 times, which is especially necessary for the experiments in panel C, because the effects are borderline. If results are reproducible and statistically significant, although small, the conclusion should be changed from "no positive genetic interactions" to "negative genetic interactions", which is more precise and informative.

      All growth tests shown in the manuscript were repeated at least three times for the conditions shown. We are thankful to the reviewer for pointing out that this was not mentioned, and we have added this to the methods section. We have assembled a file with all repetitions of the shown growth tests and added it at the end of this file. In doing so, these are already available for the public. These repetitions show that all effects reported are reproducible. We will then discuss with the editors of the journal where this manuscript is published about the necessity of including it with the final article.

      Regarding reporting the lack of a positive genetic interaction vs. a negative one, we have discussed this above. Shortly, for Panel B (Figure 8 – Supplement 2 A in the new manuscript) we have changed the conclusion to “negative genetic interaction” as adjusting the zinc chloride concentration allowed us to show this clearly and reproducibly, as shown by the repetitions of the experiments. For panels C and D (Now Figure 8 - Supplement 2 B and C), the effect is really mild and barely detectable, even when we tried a wide range of zinc chloride concentrations. For this reason, we would prefer to maintain the “no positive genetic interaction” conclusion.

      Re: Methods. There is no description of yeast serial dilution growth assay at all. In addition, why the specific media (neutral pH, in the presence of high concentrations of calcium or zinc) was used is not explained either in the results or methods. Appropriate references should be included, for example, PMID: 2139726, PMID: 1491236.

      We apologize for the oversight of the missing methods section, which we have now included.

      Regarding the explanation of the media used, the following section was already a part of the results section, before the description of the first growth test:

      “The V-ATPase is not essential for viability in yeast cells, and mutants lacking subunits of this complex grow similarly to a wt strain in acidic media. However, when cells grow at near-neutral pH or in the presence of divalent cations such as calcium and zinc, the mutants lacking V-ATPase function show a strong growth impairment (Kane et al, 2006).”

      We have now replaced this with the following, more complete version:

      “As a first approach for addressing the role of these proteins, we tested growth phenotypes related to V-ATPase function in strains lacking or overexpressing them. The V-ATPase is not essential for viability in yeast cells, and mutants lacking subunits of this complex grow similarly to a wt strain in acidic media, but display a growth defect at near-neutral pH the mutants (Nelson & Nelson, 1990). In addition, the proton gradient across the vacuole membrane generated by the V-ATPase energizes the pumping of metals into the vacuole, as a mechanism of detoxification. Thus, increasing concentrations of divalent cations such as calcium and zinc, generate conditions in which growth is increasingly reliant on V-ATPase activity (Förster & Kane, 2000; MacDiarmid et al, 2002; Kane, 2006).”


      MINOR COMMENTS

      Yeast proteins are named with "p" at the end, such as "Rtc5p".

      This nomenclature rule is falling into disuse during the last decades, as the use of capitals vs lowercase and italics allows to distinguish between genes proteins and strains (OXR1 = gene, Oxr1 = protein, oxr1Δ = strain). As an example, I include a list of the latest papers by some of the major yeast labs around the world, all of which use the same nomenclature as we do (in alphabetical order). This list even includes some work in the field of the V-ATPase.

      • Alexey Merz, USA. PMID: 33225520
      • Benoit Kornmann, UK. PMID: 35654841
      • Christian Ungermann, Germany. PMID: 37463208
      • Claudio de Virgilio, Switzerland. PMID: 36749016
      • Daniel E. Gottschling, USA. PMID: 37640943
      • David Teis, Austria. PMID: 32744498
      • Elizabeth Conibear, Canada. PMID: 35938928
      • Fulvio Reggiori, Denmark. PMID: 37060997
      • J Christopher Fromme, USA. PMID: 37672345
      • Maya Schuldiner, Israel. PMID: 37073826
      • Patricia Kane, USA. PMID: 36598799
      • Scott Emr, USA. PMID: 35770973
      • W Mike Henne, USA. PMID: 37889293
      • Yoshinori Ohsumi, Japan. PMID: 37917025 In addition, we would prefer to keep the nomenclature that we already use, to keep consistency with other published articles from our lab.

      Re: Introduction. In the introduction it should be indicated that Rtc5 was originally discovered as a "restriction of telomere capping 5", using screening of temperature-sensitive cdc13-1 mutants combined with the yeast gene deletion collection [PMID: 18845848]. A couple of sentences should be written about the RAVE complex and its role in V-ATPase assembly.

      We are thankful for this suggestion and we have now included both pieces of information in the introduction.

      *“The re-assembly of the V1 onto the VO complex when glucose becomes again available, is aided by a dedicated chaperone complex known as the RAVE complex, which also likely has a general role in V-ATPase assembly (Seol et al, 2001; Smardon et al, 2002, 2014).” *

      “In our cross-linking mass spectrometry interactome map of isolated vacuoles we found that the only other TLDc-domain containing protein of yeast, Rtc5, is a novel interactor of the V-ATPase. Rtc5 is a protein of unknown function, originally described in a genetic screen for genes related to telomere capping (Addinall et al, 2008)”

      Re: The TLDc domain-containing protein of unknown function Rtc5 is a novel interactor of the vacuolar V-ATPase (results)

      1) It is important to understand, that Oxr1 was co-purified before with the V1 domain of V-ATPase from a certain mutant strain, not wild-type yeast [PMID: 34918374]. It may explain why the authors did not identify it in their original protein-protein interactions screen here.

      The structural work on the V1 domain bound to Oxr1 (Khan et al, 2022) showed that the binding of Oxr1 prevented V1 from assembling onto the Vo. Since our experiments rely on the purification of vacuoles, they should contain mainly only V1 assembled onto the VO, and not the free soluble V1. This is likely the reason that we do not detect Oxr1, in addition to it being less abundant. We have clarified this now in the manuscript and added the fact that Oxr1 was co-purified with a V1 containing a mutant version of the H subunit.

      “In a previous study, Oxr1 was co-purified with a V1 domain containing a mutant version of the H subunit, and its presence prevented the in vitro assembly of this V1 domain onto the VO domain and promoted disassembly of the holocomplex (Khan et al., 2022). This is likely the reason why we do not detect Oxr1 in our experiments, which rely on isolated vacuoles and thus would only include V1 domains that are assembled onto the membrane. In addition, Oxr1 is less abundant in yeast cells than Rtc5 according to the protein abundance database PaxDb (Wang et al, 2015).”

      2) It is a wrong conclusion that because Rtc5 was co-purified with both V1 and V0 domain subunits it interacts with the assembled V-ATPase, this does not exclude a possibility that Rtc5 also interacts with separate V1 sector or separate V0 sector of V-ATPase.

      We agree with the reviewer that the co-purification of Rtc5 with both V1 and VO domain subunits does not necessarily mean that it interacts with the assembled V-ATPase. Thus, we have modified the text in this part to:

      “The fact that we can co-enrich Rtc5 both with Vma2 and with Vph1 indicates that it can interact either with both the VO and V1 domains or with the assembled V-ATPase.”

      However, other results throughout the manuscript can be taken into account to strengthen this idea:

      1. Rtc5 requires an assembled V-ATPase to localize to the vacuole membrane, and thus seems not to interact with free VO domains, which would be available when we delete V1 subunits or in medium containing galactose.
      2. Rtc5 becomes cytosolic in galactose-containing media. This would indicate that it also does not interact with free V1 domains, which are still localized to the vacuole membrane under these conditions. Taken together with the pull-downs, these results suggest that Rtc5 interacts with the assembled V1-VO V-ATPase. Thus, we have included the following sentence after Figure 3, which shows the subcellular localization experiments.

      *“Taking into account that Rtc5 is co-enriched with subunits of both the VO and V1 domain, and that it localizes at the vacuole membrane dependent on an assembled V-ATPase, we suggest that Rtc5 interacts with the assembled V-ATPase complex.” *

      Re: Figure 1, Panel C. Is it possible to show individual proteins in different colors for clarity?

      Panel D. How were cross-link distances measured? It is not obvious if you are not an expert in the field and it is not described in the methods.

      We have modified Figure 1 C and Figure 1 – Supplement 1B (now Figure 1 – Supplement 1 A) to present the different subunits in the structures with different shades of blue and grey.

      Furthermore, we have clarified the distance measurement approach in the methods section and in the legend of Fig 1D: “Ca-Ca distances were determined using the measuring function in Pymol v.2.5.2 (Schrodinger LLC).”

      __Re: Figure 1 - Supplement 1, __

      Panel A. What scientific information are we getting from this picture?

      This panel was just a visual representation of the complexity of the protein network we obtained. Indeed, there was no specific scientific message, so we have decided to remove this panel from the revised manuscript.

      Panel B. Why are these complexes shown separately from the complexes in Figure 1, panel C? Also, can individual proteins be colored differently here as well?

      We did not want to overload Fig 1C, so we decided to show some of the protein complexes in Fig 1 – Supplement 1B. The most important information is the histogram showing that 95% of the mapped cross-links fall within the expected length range, and this is shown in the main Figure (Figure 1D). As stated above, we have adjusted the subunit coloring in Figure 1 C to improve clarity.

      Re: Figure 3. It will be nice to show the localization of the untagged protein as well if antibodies are available (OPTIONAL).

      Unfortunately, there are no available antibodies for either Rtc5 or Oxr1. This hinders us from detecting the endogenous untagged proteins. We would like to point out that we have been very careful in showing which tagged proteins are functional (C-terminally tagged Rtc5) and which are not (C-terminally tagged Oxr1), so that the reader can know how to interpret the localization data.

      Re: Figure 4. Why different tags were used in panels A (GFP), C (msGFP2) and D

      (mNeonGreen)?

      In general, we prefer to use mNeonGreen as a tag for microscopy experiments because it is brighter and more stable, and msGFP2 as a tag for experiments involving Western blots because we have better antibodies available. There was a mistake in the labeling, and actually, all constructs labeled as GFP were msGFP2. We have now corrected this. Of note, we have tested the functionality of both tagged version (mNeonGreen and msGFP2).

      Panels B and C. Were Rtc5 fusions detected using anti-GFP antibodies?

      Indeed, Rtc5-msGFP2 was detected with an anti-GFP antibody. We have now indicated next to each Western blot membrane the primary antibody used. In addition, all antibodies are detailed in Supplemental Figure 3.

      The authors should have full-size Western blots available, not just cut-out bands, as some journals and reviewers require them for publication.

      For all western blots, we always showed a good portion of the membrane and not cut-out bands. The cropping was performed to avoid making figures unnecessarily large. The whole membranes are of course available and will be included in an “extended data file” if required by the journal.

      Re: Figure 4 - Supplement 1, Panel A. Does "-" and "+" mean -/+ Azido-Myr?

      Indeed. We have now added this label to the figure.

      Panel B. There is no blot with a membrane protein marker (Vam3 or Vac8), it should be included.

      We have replaced this western blot for a different repetition of this experiment in which a membrane protein marker was included. Of note, the two other repetitions of the experiment shown (Figure 4 – Supplement 1 panel C and Figure 4 panel C) also include both a membrane protein marker and a soluble protein marker.

      Re: Figure 5. The title does not describe all results in this figure and should be modified accordingly.

      The original data from Figure 5 is now separated into Figures 5 and 6 because of the additional experiments included during revisions. We have modified the Figure titles to be descriptive of the overall message of the Figures.

      Panel C. Statistical significance value for *** should be indicated in the legend.

      This has been indicated in the Figure legend.

      It is not clear how many times yeast growth assays were repeated. Usually, all experiments should be done in triplicates or more.

      All shown growth tests were performed at least three times for the conditions shown. We have now indicated this in the materials and methods section. In addition, we now provide in this response a file with all repetitions of growth tests, which will be appended to the article if deemed necessary by the editors.

      Re: Figure 5 - supplement 1. No title

      Re: Figure 5 - supplement 2. No title

      Whether the supplemental Figures should have a title or not will depend on the style of the journal where the manuscript is finally published. The current idea of the supplemental Figures is that they complement the corresponding main Figure. For this reason, we have removed all titles from supplemental Figures.

      Re: Figure 6. There is a typo on the second lane in the legend: "...the genome were", not "...the genome where".

      This has been corrected.

      Panel C. Why the analysis of BCECF vacuole staining of double mutants oxr1Δrav1Δ and rtc5Δrav1Δ is not shown? Was it done at all?

      We had not included this piece of data, as we thought that the genetic interaction of RTC5 and OXR1 and rav1Δ was sufficiently well supported with the included data (growth tests in combination with the deletion, growth tests in combination with the overexpression, vacuole proteomics in combination with overexpression, and BCECF staining in combination with the overexpression). Because of the request of the reviewer, we have now included this experiment as Figure 7 G.

      Re: Figure 6 - Supplement 2. Why were two different tags (2xmNG and msGFP2) used?

      We tried both tags to see if one of them would be functional. Unfortunately, they both resulted in non-functional proteins, as shown by the corresponding growth tests.

      Did the authors study N-terminally tagged Oxr1? Was it functional?

      We have tagged Oxr1 N-terminally, and this unfortunately resulted in a protein that was not completely functional. We show below the localization of N-terminally mNeon-tagged Oxr1, under the control of the TEF1 promoter. The protein appears cytosolic (Panel A) but is not completely functional (Panel B). The localization of Oxr1 had already been misreported by using a tagged version that we now show to be non-functional. For this reason, we preferred not to include this data in the manuscript, to avoid again including in the literature subcellular localizations that correspond to non-functional or partially functional proteins.

      Panel B. Results for the untagged TEF1pr-Oxr1 overexpression are not shown, thus tagged and untagged proteins can't be compared. Are they available? What is the promoter for the expression of 2xmNG fusion constructs?

      Oxr1-2xmNG was C-terminally tagged in the genome, which means that the promoter is the endogenous one, it was not modified. For this reason, the correct controls are a strain expressing Oxr1 at endogenous levels (the wt strain) and a strain lacking Oxr1. Both controls were included in the Figure, and in all repetitions made of this experiment. For reference, all the genotypes of the strains used are found in Supplemental Table 1.

      Re: Methods. Were vacuoles prepared differently for XL-MS and SILAC-based vacuole proteomics (there are different references) and why? Methods for XL-MS and quantitative SILAC-based proteomics can be placed together for clarity.

      The basis for the method of vacuole purification is the same, from (Haas, 1995). This reference was included in both protocols that include vacuole purifications. However, modifications of this method were performed to fit the crosslinking method (higher pH, no primary amines) or to fit the SILAC labeling (combination of two differentially labeled samples in one purification). The reference for the vacuole proteomics (Eising et al 2022) corresponds to a paper in which the SILAC-based comparison of vacuoles from different mutant strains was optimized, and includes not only the vacuole purification but the growth conditions and downstream processing of the vacuoles.

      Since both the SILAC-based vacuole proteomics and the XL-MS are multi-step methods, containing numerous parameters including the sample preparation, processing for MS, MS run and data analysis, we would prefer to keep them separate. We think this would allow a person attempting to reproduce these methods to go through them step by step.

      What is CMAC dye? Why was it used to stain the vacuolar lumen?

      We apologize for this oversight, we have included the definition of CMAC as 7-Amino-4-Chlormethylcumarin. It is a standard-used organelle marker for the lumen of the vacuole.

      Some abbreviations (TEAB, ACN) are not explained.

      We apologize for this oversight. We have now replaced these abbreviations with the full names of the compounds in the article.

      What is 0% Ficoll?

      We used the term 0% Ficoll, because this is the name given to the buffer in the original Haas 1995 paper on vacuole purifications. However, we agree that the term is misleading and we have now added the composition of the buffer (10 mM PIPES/KOH pH=6.8, 0.2 M Sorbitol).

      Reviewer #3 (Significance (Required)):

      The vacuolar-type proton ATPase, V-ATPase, is the key proton pump, that hydrolases ATP and uses this energy to pump protons across membranes. Amazingly, this proton pump and its function are conserved in eukaryotes from yeast to mammals. While V-ATPase structure and function have been studied for more than 30 years in various organisms, its regulation is not completely understood. The very recent discoveries of two new V-ATPase interacting proteins in yeast, first Oxr1 (OXidative Resistance 1), and now Rtc5 (Restriction of Telomere Capping 5), both the only two members of TLDc (The Tre2/Bub2/Cdc16 (TBC), lysin motif (LysM), domain catalytic) proteins in yeast, provide new insights in V-ATPase regulation in yeast, and because the interaction is conserved in mammals its relevance to mammalian V-ATPases regulation as well.

      TLDc proteins are best known for their role in protection from oxidative stress, in particular in yeast and in the nervous system in mammals. The discovery of the novel Rtc5-V-ATPase interaction points to the role of V-ATPase not only in protection from oxidative stress but also in restriction of telomere capping in yeast and most likely higher species. The studies of other species also highlight the possible conserved role of V-ATPase in lifespan determination and Torc1 signaling, mediated through these interactions. Thus, the discovery of this new functionally important interaction between the second TLDc family member in yeast, Rtc5, and V-ATPase will shed light on the molecular mechanisms of all these essential biological processes and pathways.

      In addition, because the authors performed a comprehensive proteomics protein-protein interaction study of the purified yeast vacuole it provides a valuable resource for all researchers who study vacuoles and/or related to them lysosomes.

      The follow-up functional studies using the rav1Δ strain clearly demonstrated that Rtc5 and Oxr1 disassemble V-ATPase and counteract the function of V-ATPase assembly RAVE complex in vivo in yeast. Thus, they are essentially the first discovered endogenous eukaryotic protein inhibitors of V-ATPase. Moreover, because the authors obtained the evidence that Oxr1 is the regulator of the specific subunit isoform of V-ATPase Stv1p in vivo in yeast, it suggests that different TLDc proteins may regulate different specific V-ATPase subunit isoforms in cell- and tissue-specific manner in higher eukaryotes. The mechanism of this isoform-specific regulation in yeast and other species needs further investigation in the future.

      Because of the conservation of the TLDc-V-ATPase interactions, all this information can be extrapolated to higher species, all the way to humans, in whom genetic mutations in various TLDc proteins are known to cause devastating diseases and syndromes.

      We are thankful to the reviewer for their positive comments about the significance of our work.

    1. Author Response

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public Review):

      1) A single biomarker seems very unlikely to be of much help in the detection of glaucoma due to the etiological heterogeneity of the disease, the existence of different subtypes, and the genetic variability among patients. Rather, a panel of biomarkers may provide more useful information for clinical prediction, including better sensitivity and specificity. The inclusion of additional metabolites already identifying in the study, in combination, may provide more reliable and correct assignment results.

      The authors’ answer: Thank you for your comment. We recognize the constraints of using single biomarkers for diagnosis. In upcoming research, we aim to incorporate multiple biomarkers to improve diagnostic accuracy and will consider adding more metabolites as suggested.

      2) The number of samples in the supplementary phase is low, larger sample sizes are mandatory to confirm the diagnostic accuracy.

      The authors’ answer: Thank you for your comment. Collecting aqueous humor is invasive, making samples scarce. We acknowledge the small sample size limitation. In future studies, we plan to use larger samples to verify the biomarker's diagnostic accuracy. Your feedback emphasizes the need for thorough validation in our next research

      3) Cohorts from different populations are needed to verify the applicability of this candidate biomarker.

      The authors’ answer: Thank you for the suggestion. We agree on the need to test the biomarker's relevance across varied populations. Reports from other groups will help confirm and broaden our results.

      4) Sex hormones seem to be associated also with other types of glaucoma, such as primary open-angle glaucoma (POAG), although the molecular mechanisms are unclear (see doi:10.1167/iovs.17-22708). The inclusion of patients diagnosed with other subtypes of glaucoma, like POAG, may contribute to determining the sensitivity and specificity of the proposed biomarker. Androstenedione levels should be determined in POAG, NTG, or PEXG patients.

      The authors’ answer: I agree with your comment and thank you for your suggestion. PACG is a major cause of irreversible blindness in Asians. While this study centers on PACG, the link between sex hormones and other glaucoma subtypes, like POAG, merits investigation. Future studies will include POAG and other subtypes to further assess androstenedione's diagnostic relevance.

      5) In addition, the levels of androstenedione were found significantly altered during other diseases as described by the authors or by conditions like polycystic ovary syndrome, limiting the utility of the proposed biomarker.

      The authors’ answer: Thank you for your advice. Androstenedione levels also change in conditions like polycystic ovary syndrome, which could affect the biomarker's specificity. We plan to further study androstenedione's unique changes in glaucoma versus other conditions to clarify its diagnostic value.

      6) Uncertainty of the androstenedione levels compromises its usefulness in clinical practice.

      The authors’ answer: The uncertainty surrounding androstenedione levels and its impact on clinical applicability is a valid concern. We plan to delve deeper into understanding the variability and determinants of androstenedione levels to better assess its clinical relevance.

      Reviewer #2 (Public Review):

      The "predict" part is on much less solid ground. The visual field progression and association with serum androstenedione within the current experimental design eludes to a correlation. It truly cannot be stated as predictive. To predict one needs to put the substance when nothing is there and demonstrate that the desired endpoint is reached. Conversely, the substance (androstenedione) can be removed, and show that the condition regresses. None of these are possible without model system experiments, which have not been done. The authors could put some additional details in the methods, such as: 1) how much sample was collected, 2) whether equal serum volume for analysis had equal serum proteins (or cells). They have used a LC-MS/MS and a Chemiluminescence method, but another independent method such as GC-MS/MS or NMR to detect androstenedione for a subset of patients with different stages of visual field defect would be desirable.

      The authors’ answer: We acknowledge your constructive critique concerning our use of the term "predict". In the present study, we elucidated a discernible correlation between visual field progression and serum androstenedione concentrations. We are cognizant of the critical distinction between correlation and causation, and we concur that our application of the term “predict” may have been overly assertive in this context.

      Your emphasis on the imperative of employing model system experiments to unequivocally ascertain causative relationships is well-received. The experimental approach of modulating the substance, androstenedione in this case, to empirically observe its consequential impact on the condition, is a pivotal direction that warrants exploration in subsequent research endeavors. With regard to the variability of serum protein concentrations across participants, we adopted a methodological standardization by ensuring that the analyzed serum volume remained consistent across samples. This was implemented to enhance the reliability and generalizability of our findings.

      Your recommendation to consider alternative detection methodologies, specifically GC-MS/MS or NMR, is duly noted. Although our choice of LC-MS/MS and Chemiluminescence was predicated on available resources, we recognize the scientific merit in leveraging multiple analytical techniques. In future investigations, we endeavor to incorporate a broader spectrum of detection methodologies for androstenedione, particularly when assessing patients with varied visual field defect stages, thereby bolstering the robustness and validity of our conclusions.

      Reviewer #1 (Recommendations for The Authors):

      1) POAG is the leading cause of irreversible blindness worldwide (see reference #4). The prevalence of PACG is highest in Asia, but the major form of glaucoma is still POAG. The authors should modify the abstract and background sections accordingly (see line 30 and lines 61-62).

      The authors’ answer: Thank you for your suggestion, and we apologize for this mistake. The sentence” Primary angle closure glaucoma (PACG) is the leading cause of irreversible blindness worldwide” has been changed to” Primary angle closure glaucoma (PACG) is the leading cause of irreversible blindness in Asia”. (Page 2, lines 33; Page 3, lines 62-64)

      2) Line 69, please change the sentence "the He et al. taught us..." to the following "the He et al. study taught us.".

      The authors’ answer: Thank you for your comment. The sentence "the He et al. taught us..." has been changed to "the He et al. study taught us.". (Page 3, lines 72)

      3) I suggest including the name of the identified candidate biomarker in the title of the manuscript. The title must be straightforward.

      The authors’ answer: We agree with your comment and thank you for your suggestion. The sentence “Metabolomics Identifies and Validates Serum Novel Biomarker for Diagnosing Primary Angle Closure Glaucoma and Predicting the Visual Field Progression” has been changed to “Metabolomics Identifies and Validates Serum Androstenedione as Novel Biomarker for Diagnosing Primary Angle Closure Glaucoma and Predicting the Visual Field Progression”. (Page 1, lines 1)

      4) Line 88, please change "normal subjects" to "control individuals".

      The authors’ answer: Thank you for your comment. We have changed "normal subjects" to "control individuals”. (Page 4, lines 91)

      5) Line 95 and so on along the manuscript, avoid the term "normal controls" or "normal" and use only the term "controls".

      The authors’ answer: Thank you for your advice. "normal subjects" has been changed to "controls". (Page 4, lines 113; Page5, lines 118,120,124,128,133)

      6) In the participants section, indicate the ocular treatments of PACG patients. For example, on line 141, which "treatment" are you referring to?

      The authors’ answer: Thank you for your comment. We apologize to this vague statement. Treatment included medical treatment and surgical treatment. We have revised it in the manuscript. (Page 5, lines 142)

      7) The entire section 2.4 is confusing. According to Figure S2, untargeted metabolomics was conducted with a mixed sample containing "all" serum extracts in order to obtain an in-house database with molecular features present in serum by LCHRMS. Then, this database was used for targeted metabolomics in individual serum samples using LCQQQ. However, as it is described in the manuscripts, it seems that first, an untargeted metabolomics analysis was carried out to identify altered metabolites, then targeted metabolomics was carried out to validate the untargeted analysis and finally, a profiling analysis was carried out to construct the database. The workflow must be clearly discussed and amended to be understable.

      The authors’ answer: Thank you for your comment. We have revised the description of the experimental method section 2.4. (Page 7, lines 195-198)

      8) Please, briefly explain what widely-targeted metabolomics is and how it works in this study (see section 2.4).

      The authors’ answer: Thank you for your comment. For extensively targeted metabolome detection, a local database was first established by using the standard database, and ion pair information was obtained by scanning ion pairs of mixed samples (QC) with QTOF. A wide range of metabolites were qualitatively obtained by comparing with the local self-built database, and then the metabolites of each sample were qualitatively and quantitatively measured by MRM scanning mode of triple four-bar QQQ. This project combines the non-target public database scanning construction database and the wide target local database to build a new database, and then scans the database of the samples of this project with Q-TOF, and then carries out the qualitative and quantitative detection of metabolites of each sample in MRM mode. (Figure S2)

      9) On Table 1, indicate the number of patients and controls with cataracts.

      The authors’ answer: For the glaucoma group and the control group, we have excluded people with cataracts. This section is described in the inclusion and exclusion criteria for supplementary materials. (Inclusion and exclusion criteria)

      10) On "Sample processing" section, lines 152 and 153: Have you used cold methanol to ensure metabolic quenching? If not, how metabolite quenching was carried out?

      The authors’ answer: Thank you for your comment. We use cold methanol to extract metabolites, and the early blood samples have been stored in a -80°C refrigerator to ensure a low temperature process and ensure metabolic quenching. (Page 6, lines 196)

      11) On the same "Sample processing" section, have you used internal standards during metabolite extraction? If yes, ones? If not, why?

      The authors’ answer: Thank you for your comment. In the metabolite extraction process of each sample, the same internal standard was added, and the same volume of 50 μL serum samples were extracted. The specific internal label name has been added in "Sample processing" section. (Page 6, lines 153-155)

      12) Lines 161-163, I suggest including in the supplementary material the worklist of the entire experiment run by LC-MS, including analytical replicates and QCs.

      The authors’ answer: Thank you for your comment. Worklist for mass spectrometry can be found in supplementary sheet1. (Page 6, lines 165)

      13) The title of the section "Detection method" does not seem appropriate, please change it to "Analytical methods "or something similar.

      The authors’ answer: Thank you for your advice. "Detection method" has been changed to “Analytical methods “. (Page 6, lines 168)

      14) Section 2.4.1, I suggest changing "Untargeted detection conditions" to "Untargeted metabolomics analysis".

      The authors’ answer: Thank you for your comment. "Untargeted detection conditions" has been changed to "Untargeted metabolomics analysis". (Page 6, lines 169)

      15) Lines 170-172, the column used is compatible with 100% water, why start with 5% acetonitrile?

      The authors’ answer: Thank you for your comment. If the acetonitrile starting gradient is 0, it will cause a lot of water-soluble substances to elute and easily clog the column, so we want to use 5% organic phase.

      16) Section 2.4.1, the chromatographic conditions (mobiles phases) were the same in both positive and negative ion mode? It is desirable to change or adjust a basic pH when working in negative, so please amend and clarify it.

      The authors’ answer: Thank you for your comment. In the negative ion mode, the peak shape of the chromatogram under the acidic system is better than that under the alkaline system, so we choose the acidic system.

      17) I am not able to clearly understand what is "widely targeted conditions" (see section 2.4.2). What is the difference with the conventional targeted metabolomics analysis? In my view, widely-targeted metabolomics refers to the combination of untargeted metabolomics and targeted metabolomics. This must be clarified and simplified.

      The authors’ answer: Thank you for your syggestion. The characterization of metabolites in this study was conducted using a non-targeted database and a self-built database. Non-targeted metabolites were characterized with mixed samples, and then combined with the laboratory self-established database to form a new metabolome database for this study. 2.4.2 The broad targeting here refers to the use of the MWDB standard self-built database to characterize metabolites, and then the QQQ MRM model to quantify metabolites. In order to clearly describe the detection process, this part of the method has been modified. (Figure S2)

      18) Line 199, please, indicate the normalization carried out.

      The authors’ answer: We agree with your comment and thank you for your suggestion. The normalization description is missing from its data processing steps and has been corrected in the manuscript. (Page 7, lines 203)

      19) How many instrumental replicates have you carried out both in untargeted and targeted metabolomics? Please, indicate it.

      The authors’ answer: Thank you for your advice. In this project, all sample mixtures were used as QC samples, which were repeated several times in the testing process (one QC sample was inserted between every 10 samples), and the repeated correlation between repeated QC was more than 99% to ensure the stability of sample testing. (Sheet1)

      20) Line 267, why did you select a fold changes threshold greater than 1.15 (or lower 0.85)? In metabolomics, it would be desirable to have a minimum of 1.5-fold change considering the variability of data.

      The authors’ answer: Thank you for your comment. FC reduction is selected to expand potential candidate metabolites and can be repeated in three batches and refer to the literature "Blood metabolomics uncovers inflammation-associated mitochondrial dysfunction as a potential mechanism. underlying ACLF "method screening threshold.

      21) To include anywhere the molecular formula of androstenedione.

      The authors’ answer: I agree with your comment and thank you for your suggestion. We have added the molecular formula of androstenedione to the supplementary material. (Page 17, lines 475)

      22) Line 290 is not Figure 4B and 4C, you may refer to Figure 3B and 3C.

      The authors’ answer: Thank you for your advice. We apologize to this mistake. Figure 4B and 4C have been changed to Figure 3B and 3C.

      23) Figure S3 was lost from Supplementary material, please include it.

      The authors’ answer: Thank you for your comment. We apologize to this mistake. There is an error in the ordering of the supplementary graph. Figure 3 is redundant, and we have modified it in the supplementary materials.

      24) Figure 4 B, indicate in the text the average and uncertainty of androstenedione levels in both control and PACG groups.

      The authors’ answer: Thank you for your comment. In the manuscript, We have added descriptions of mean ± standard deviation of androstendione levels in the control group and the disease group. (Page 11, lines 311-312)

      25) Section 3.6. please include the average and uncertainty of androstenedione levels in males and females in both control and PACG groups.

      The authors’ answer: Thank you for your advice. For 3.6 section, we supplemented the mean ± standard deviation of androstenedione levels in the control and disease groups. (Page 13, lines 350-356)

      26) Figure S9 seems missing.

      The authors’ answer: Thank you for your comment. We apologize to this mistake. Figures S9 has been added in the Supplementary material.

      27) Lines 345-346, indicate the levels obtained for the metabolite in the compared groups.

      The authors’ answer: Thank you for your suggestion. The levels of androstenedione in each group are seen in “The results from both discovery set 1 (Figure S9A, Mild:32600±17011, Moderate:33215±17855, Severe:46060±21789) and discovery set 2 (Figure S9B, Mild:27866±19873, Moderate:27057±13166, Severe:43972±19234) indicated that the mean serum androstenedione levels were significantly higher in the severe PACG group compared to the moderate and mild PACG groups (P<0.001). These findings were further validated in both validation phase 1 (Figure S9C, Mild:75726±45719, Moderate:65798±30610, Severe:94348±30858) and validation phase 2 (Figure S9D, Mild:1.121±0.3143 ng/ml, Moderate:1.461±0.4391 ng/ml, Severe:2.147±0.6476 ng/ml).” and “Notably, the level of androstenedione was found to be significantly higher in PACG patients than in normal subjects in both discovery set 1 (Figure 4B, P=0.0081, Normal:33987±11113, PACG:42852±20767) and discovery set 2 (Figure 4C, P=0.0078, Normal:31559±10975, PACG:37934±18529).”

      28) Line 368, you don't need to indicate the PACG abbreviation again.

      The authors’ answer: Thank you for your comment. We apologize to this mistake. I have changed " patients with PACG " to "patients". (Page 13, lines 377)

      29) Figure 6, panels A and B are not labeled (i.e., commented) in the body text of the manuscript.

      The authors’ answer: Thank you for your suggestion. We’re very sorry for this mistake. Figure 6, panels A and B have been labeled in the manuscript. (Page 13, lines 377-379)

      30) Section 3.7., when you indicate "after therapy" are you referring to surgical treatment? Please, clarify.

      The authors’ answer: Thank you for your comment. We apologize to this vague statement. Blood samples were taken before and three months after surgery. “therapy” has been changed to “surgical treatment” in the manuscript. (Page 13, lines 377)

      31) Line 370, "97th patient" should be replaced by "nine patients"?

      The authors’ answer: Thank you for your advice. We apologize to this mistake. "97th patient" has been changed to “nine patients". (Page 13, lines 378-379)

      32) Lines 370-372, it difficult to understand, please clarify why these findings indicate that severity is related to increased PACG according to Figure 6B.

      The authors’ answer: Thank you for your comment. We’re very sorry for this vague statement. The sentence of “These findings showed that the levels of androstenedione that were tightly connected with PACG severity rose dramatically as PACG progressed.” Has been removed.

      33) Line 447, the word "corrected" should be changed to "correlated"?

      The authors’ answer: Thank you for your comment. "corrected" has been changed to "correlated". (Page 16, lines 453,456)

      34) According to the literature, the levels found in control subjects are within the range of the "normal" values, i.e., are they comparable?

      The authors’ answer: Thank you for your advice. Androstenedione ranges from 0.4 to 2 in the normal population. The mean standard deviation of androstenedione in the normal population was 1.552 ± 0.4859.

      35) Lines 471-474, why "steroid hormone biosynthesis appears to be the critical node to high-match PACG pathophysiological concepts" while the high enrichment was observed in the "metabolic pathways"?

      The authors’ answer: Metabolic pathways encompass a series of chemical reactions within a cell that enable the synthesis or breakdown of molecules to maintain the cell's energy balance. Steroid hormone biosynthesis is one of these metabolic pathways, and its products, steroid hormones, participate in a wide range of physiological processes, including metabolism, immune response, and the regulation of inflammation. In a different context, a study related to fatigue during Androgen Deprivation Therapy (ADT) showed a significant difference in metabolite levels within the steroid hormone biosynthesis pathways, emphasizing the role these pathways play in metabolic alterations. The mentioned findings suggest that steroid hormone biosynthesis and metabolic pathways are intertwined. (Page 17, lines 481-488)

      36) Figure S13 and Figure S14A are the same.

      The authors’ answer: Thank you for your comment. Figure S14A has been removed.

      37) On lines 476-485, it would be interesting to discuss whether alterations of this metabolite could be a cause or consequence of PACG.

      The authors’ answer: Based on the literature found, androstenedione is a naturally occurring steroid hormone produced by the gonads and adrenal glands, and serves as an intermediate in testosterone biosynthesis (Androstenedione (a Natural Steroid and a Drug Supplement): A Comprehensive Review of Its Consumption, Metabolism, Health Effects, and Toxicity with Sex Differences). Early events in the pathobiology of glaucoma involve oxidative, metabolic, or mechanical stress acting on retinal ganglion cells (RGCs), leading to their rapid release of danger signals such as extracellular ATP, thus triggering microglial and macroglial activation as well as neuroinflammation (Immune Responses in the Glaucomatous Retina: Regulation and Dynamics). However, one might speculate that since androstenedione is a steroid hormone, it could potentially impact the inflammatory and metabolic stress observed in the pathophysiological processes of glaucoma (Adaptive responses to neurodegenerative stress in glaucoma). Metabolic and anti-inflammatory avenues might be crucial in understanding the relationship between alterations in androstenedione levels and the severity of glaucoma. Nevertheless, more research and literature analysis would be necessary to better understand the precise relationship and its underlying mechanisms between these two entities.

      38) I suggest sending the MS and MS/MS into a publicly available repository.

      The authors’ answer: Thank you for your suggestion. Further research will necessitate the utilization of the raw mass spectrometry data. We anticipate making this raw data available in a public repository upon the conclusion of subsequent experiments.

      Reviewer #2 (Recommendations for The Authors):

      The authors should aim to describe methods in greater detail.

      The authors could improve the writing to accurately describe their results and their interpretation and state what else could be done to make the result truly "predictive".

      The authors’ answer: (1) Detail Enhancement in the Methods section: We expand the description of methods such as sample pre-processing, mass spectrometry detection, and result analysis in the study to provide more detailed information about the procedures, equipment, and materials used. (2) Improvement in Writing Quality: We have engaged a scientific editor to review our manuscript for clarity, coherence, and consistency to ensure that the results and interpretations are accurately and clearly conveyed. Terminologies and phrases have been revised to better reflect the findings and interpretations. (3) Limitation supplement: We have included a discussion on the limitations of our study and suggested additional studies and analyses that could be conducted to enhance the predictive value of our findings. We sincerely appreciate the constructive feedback from the reviewer, which has greatly contributed to improving the quality and rigor of our manuscript.

    1. Reviewer #2 (Public Review):

      Summary:

      This interesting paper examines the earliest steps in progesterone-induced frog oocyte maturation, an example of non-genomic steroid hormone signaling that has been studied for decades but is still very incompletely understood. In fish and frog oocytes it seems clear that mPR proteins are involved, but exactly how they relay signals is less clear. In human sperm, the lipid hydrolase ABHD2 has been identified as a receptor for progesterone, and so the authors here examine whether ABHD2 might contribute to progesterone-induced oocyte maturation as well. The main results are:

      1. Knocking down ABHD2 makes oocytes less responsive to progesterone, and ectopically expressing ABHD2.S (but not the shorter ABHD2.L gene product) partially rescues responsiveness. The rescue depends upon the presence of critical residues in the protein's conserved lipid hydrolase domain, but not upon the presence of critical residues in its acyltransferase domain.

      2. Treatment of oocytes with progesterone causes a decrease in sphingolipid and glycerophospholipid content within 5 min. This is accompanied by an increase in LPA content and arachidonic acid metabolites. These species may contribute to signaling through GPCRs. Perhaps surprisingly, there was no detectable increase in sphingosine-1-phosphate, which might have been expected given the apparent substantial hydrolysis of sphingolipids. The authors speculate that S1P is formed and contributes to signaling but diffuses away.

      3. Pharmacological inhibitors of lipid-metabolizing enzymes support, for the most part, the inferences from the lipidomics studies, although there are some puzzling findings. The puzzling findings may be due to uncertainty about whether the inhibitors are working as advertised.

      4. Pharmacological inhibitors of G-protein signaling support a role for G-proteins and GPCRs in this signaling, although again there are some puzzling findings.

      5. Reticulocyte expression supports the idea that mPR and ABHD2 function together to generate a progesterone-regulated PLA2 activity.

      6. Knocking down or inhibiting ABHD2 inhibited progesterone-induced mPRinternalization, and knocking down ABHD2 inhibited SNAP2520-induced maturation.

      Strengths:

      All in all, this could be a very interesting paper and a nice contribution. The data add a lot to our understanding of the process, and, given how ubiquitous mPR and AdipoQ receptor signaling appear to be, something like this may be happening in many other physiological contexts.

      Weaknesses:

      I have several suggestions for how to make the main points more convincing.

      Main criticisms:

      1. The ABHD2 knockdown and rescue, presented in Fig 1, is one of the most important findings. It can and should be presented in more detail to allow the reader to understand the experiments better. E.g.: the antisense oligos hybridize to both ABHD2.S and ABHD2.L, and they knock down both (ectopically expressed) proteins. Do they hybridize to either or both of the rescue constructs? If so, wouldn't you expect that both rescue constructs would rescue the phenotype since they both should sequester the AS oligo? Maybe I'm missing something here.

      In addition, it is critical to know whether the partial rescue (Fig 1E, I, and K) is accomplished by expressing reasonable levels of the ABHD2 protein, or only by greatly overexpressing the protein. The author's antibodies do not appear to be sensitive enough to detect the endogenous levels of ABHD2.S or .L, but they do detect the overexpressed proteins (Fig 1D). The authors could thus start by microinjecting enough of the rescue mRNAs to get detectable protein levels, and then titer down, assessing how low one can go and still get rescue. And/or compare the mRNA levels achieved with the rescue construct to the endogenous mRNAs.

      Finally, please make it clear what is meant by n = 7 or n = 3 for these experiments. Does n = 7 mean 7 independently lysed oocytes from the same frog? Or 7 groups of, say, 10 oocytes from the same frog? Or different frogs on different days? I could not tell from the figure legends, the methods, or the supplementary methods. Ideally one wants to be sure that the knockdown and rescue can be demonstrated in different batches of oocytes, and that the experimental variability is substantially smaller than the effect size.

      2. The lipidomics results should be presented more clearly. First, please drop the heat map presentations (Fig 2A-C) and instead show individual time course results, like those shown in Fig 2E, which make it easy to see the magnitude of the change and the experiment-to-experiment variability. As it stands, the lipidomics data really cannot be critically assessed.

      [Even as heat map data go, panels A-C are hard to understand. The labels are too small, especially on the heat map on the right side of panel B. The 25 rows in panel C are not defined (the legend makes me think the panel is data from 10 individual oocytes, so are the 25 rows 25 metabolites? If so, are the individual oocyte data being collapsed into an average? Doesn't that defeat the purpose of assessing individual oocytes?) And those readers with red-green colorblindness (8% of men) will not be able to tell an increase from a decrease. But please don't bother improving the heat maps; they should just be replaced with more informative bar graphs or scatter plots.]

      3. The reticulocyte lysate co-expression data are quite important and are both intriguing and puzzling. My impression had been that to express functional membrane proteins, one needed to add some membrane source, like microsomes, to the standard kits. Yet it seems like co-expression of mPR and ABHD2 proteins in a standard kit is sufficient to yield progesterone-regulated PLA2 activity. I could be wrong here - I'm not a protein expression expert - but I was surprised by this result, and I think it is critical that the authors make absolutely certain that it is correct. Do you get much greater activities if microsomes are added? Are the specific activities of the putative mPR-ABHD2 complexes reasonable?

    1. Author Response

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public Review):

      Issue 1: The relevance is somewhat unclear. High cysteine levels can be achieved in the laboratory, but, is this relevant in the life of C. elegans? Or is there physiological relevance in humans, e.g. a disease? The authors state "cells and animals fed excess cysteine and methionine", but is this more than a laboratory excess condition? SUOX nonfunctional conditions in humans don't appear to tie into this, since, in that context, the goal is to inactivate CDO or CTH to prevent sulfite production. The authors also mention cancer, but the link to cysteine levels is unclear. In that sense, then, the conditions studied here may not carry much physiological relevance.

      Response 1: We set out to answer a fundamental question: what pathways regulate the function of cysteine dioxygenase, a highly conserved enzyme in sulfur amino acid metabolism? In an unbiased genetic screen that sampled millions of EMS generated mutations across all ~20,000 C. elegans genes, we discovered loss of function/null mutations in egl-9 and rhy-1, two negative regulators of the hypoxia inducible transcription factor (hif-1). Genetic ablation of the egl-9 or rhy-1 loci are likely not relevant to the life of a C. elegans animal, i.e. this is not representative of a natural state. Yet, this extreme genetic intervention has taught us a new fundamental truth about the interaction between EGL-9/RHY-1, HIF-1, and the transcriptional activation of cdo1. Similarly, the high cysteine levels used in our assays may or may not be representative of a state in nature, we do not know (nor do we make any claims about the environmental relevance of our choice of cysteine concentrations). It seems very plausible that pathological states exist where cysteine concentrations may rise to comparable levels in our experimental system. More importantly, we have started with excess to physiology to elicit a clear response that we can study in the lab. Similar strategies established the cysteine-induction phenotype of CDO1 in mammalian systems. For instance, in Kwon and Stipanuk 2001, hepatocytes are cultured in media supplemented with 2mmol/L cysteine to promote a ~4-fold increase in CDO1 mRNA.

      Issue 2: The pathway is described as important for cysteine detoxification, which is described to act via H2S (Figure 6). Much of that pathway has already been previously established by the Roth, Miller, and Horvitz labs as critical for the H2S response. While the present manuscript adds some additional insight such as the additional role of RHY-1 downstream on HIF-1 in promoting toxicity, this study therefore mainly confirms the importance of a previously described signalling pathway, essentially adding a new downstream target rhy-1 -> cysl-1 -> egl-9 -> hif-1 -> sqrd-1/cdo-1. The impact of this finding is reduced by the fact that cdo-1 itself isn't actually required for survival in high cysteine, suggesting it is merely a maker of the activity of this previously described pathway.

      Response 2: We agree that the primary impact of our manuscript is the establishment of a novel intersection between the H2S-sensing pathway (largely worked out by Roth, Miller, and Horvitz) and our gene of interest, cysteine dioxygenase. We believe that the connection between these two pathways is exciting as it suggests a logical homeostatic circuit. High cysteine yields enzymatically produced H2S. This H2S may then act as a signal promoting HIF-1 activity (via RHY-1/CYSL-1/EGL-9). High HIF-1 activity increases cdo-1 transcription and activity promoting the degradation of the high-cysteine trigger. As pointed out by the reviewer, cdo-1(-) loss of function alone does not cause cysteine sensitivity at the concentrations tested. Given that cysl-1(-) and hif-1(-) mutants are exquisitely sensitive to high levels of cysteine, we propose that HIF-1 activates the transcription of additional genes that are required for high cysteine tolerance. However, our genetic data show that cdo-1 is more than simply a marker of HIF-1 transcription. Our genetic data in Table 1 demonstrate that HIF-1 activation (caused by egl-9(-)) is sufficient to cause severe sickness in a suox-1 hypomorphic mutant which cannot detoxify sulfites, a critical product of cysteine catabolism. This severe sickness can be reversed by inactivating hif-1, cth-2, or cdo-1. These data demonstrate a functional intersection between the established H2S-sensing pathway and cysteine catabolism governed by cdo-1.

      Reviewer #2 (Public Review):

      Issue 3: First, the authors show that the supplementation of exogenous cysteine activates cdo-1p::GFP. Rather than showing data for one dose, the author may consider presenting dose-dependency results and whether cysteine activation of cdo-1 also requires HIF-1 or CYSL-1, which would be important data given the focus and major novelty of the paper in cysteine homeostasis, not the cdo-1 regulatory gene pathway.

      Response 3: We agree with the reviewer and have performed the suggested dose-response curve for expression of Pcdo-1::GFP in wild-type C. elegans. We observe substantial activation of the Pcdo-1::GFP transcriptional reporter beginning at 100µM supplemental cysteine (Figure 3C). Higher doses of cysteine do not elicit a substantially stronger induction of the Pcdo-1::GFP reporter. Thus, we find that 100µM supplemental cysteine strikes the right balance between strongly inducing the Pcdo-1::GFP reporter while not inducing any toxicity or lethality in wild-type animals (Figure 3E).

      We further agree that testing for induction of the Pcdo-1::GFP reporter in a hif-1(-) or cysl-1(-) mutant background is a critical experiment. However, we have not been able to identify a cysteine concentration that induces Pcdo-1::GFP and is not 100% lethal for hif-1(-) or cysl-1(-) mutant C. elegans. The remarkable sensitivity of hif-1(-) or cysl-1(-) mutant C. elegans to supplemental cysteine demonstrates the critical role of these genes in promoting cysteine homeostasis. But because of this lethality, we could not assay the Pcdo1::GFP reporter in the hif-1(-) or cysl-1(-) mutant animals. But the lethality to excess cysteine demonstrates that this cysteine response is salient. To get at how cysteine might be interacting with the HIF-1-signaling pathway, we performed new additivity experiments by supplementing 100µM cysteine to wild type, egl-9(-), and rhy-1(-) mutant C. elegans expressing the Pcdo-1::GFP reporter. Surprisingly, we found that cysteine had no significant impact on Pcdo-1::GFP expression in an egl-9(-) mutant background but significantly increased the Pcdo-1::GFP expression in a rhy-1(-) background (Figure 3A,B). These data suggest that cysteine acts in a pathway with egl-9 and in parallel to rhy-1. These data have been incorporated into Figure 3A,B and are included in the Results section of the manuscript.

      Issue 4: While the genetic manipulation of cdo-1 regulators yields much more striking results, the effect size of exogenous cysteine is rather small. Does this reflect a lack of extensive condition optimization or robust buffering of exogenous/dietary cysteine? Would genetic manipulation to alter intracellular cysteine or its precursors yield similar or stronger effect sizes?

      Response 4: We agree that the induction of the Pcdo-1::GFP reporter by supplemental cysteine is not as dramatic as the induction caused by the egl-9 or rhy-1 null alleles. We believe our Response 3 and new Figure 3C demonstrate that this phenomenon is not due to lack of condition optimization, but likely reflects some biology. As pointed out by the reviewer, C. elegans likely buffers exogenous cysteine and this (perhaps) prevents the impressive Pcdo-1::GFP induction observed in the egl-9(-) and rhy-1(-) mutant animals. We have now mentioned this possible interpretation in the Results section. Furthermore, we like the idea of using genetic tricks to promote cysteine accumulation within C. elegans cells and tissues and will consider these approaches in future studies.

      Issue 5: Second, there remain several major questions regarding the interpretation of the cysteine homeostasis pathway. How much specificity is involved for the RHY-1/CYSL-1/EGL-9/HIF-1 pathway to control cysteine homeostasis? Is the pathway able to sense cysteine directly or indirectly through its metabolites or redox status in general? Given the very low and high physiological concentrations of intracellular cysteine and glutathione (GSH, a major reserve for cysteine), respectively, there is a surprising lack of mention and testing of GSH metabolism.

      Response 5: Future studies are required to determine the specificity of the RHY-1/CYSL-1/EGL-9/HIF-1 pathway for the control of cysteine homeostasis. Our proposed mechanism, that H2S activates the HIF-1 pathway is based largely on the work of the Horvitz lab (Ma et al. 2012). They demonstrate that H2S promotes a direct inhibitory interaction between CYSL-1 and EGL-9, leading to activation of HIF-1. These findings align nicely with our genetic and pharmacological data. However, our work does not provide direct evidence as to the cysteine-derived metabolite that activates HIF-1. We propose H2S as a likely candidate.

      We have added a note to the introduction regarding the role of GSH as a reservoir of excess cysteine and agree that future studies might find interesting links between CDO-1, GSH metabolism, and HIF-1.

      Issue 6: In addition, what are the major similarities and differences of cysteine homeostasis pathways between C. elegans and other systems (HIF dependency, transcription vs post-transcriptional control)? These questions could be better discussed and noted with novel findings of the current study that are likely C. elegans specific or broadly conserved.

      Response 6: We have included a new section in the Discussion highlighting the nature of mammalian CDO1 regulation. We propose the hypothesis that a homologous pathway to the C. elegans RHY-1/CYSL-1/EGL9/HIF-1 pathway might operate in mammalian cells to sense high cysteine and induce CDO1 transcription. Importantly, all proteins in the C. elegans pathway have homologous counterparts in mammals. However, this hypothesis remains to be tested in mammalian systems.

      Reviewer #3 (Public Review):

      Major weaknesses of the paper include:

      Issue 7: the over-reliance on genetic approaches.

      Response 7: This is a fair critique. Our expertise is genetics. Our philosophy, which the reviewers may not share, is that there is no such thing as too much genetics!

      Issue 8: the lack of novelty regarding prolyl hydroxylase-independent activities of EGL-9.

      Response 8: We believe the primary novelty of our work is establishing the intersection between the H2Ssensing HIF-1 pathway and cysteine catabolism governed by cysteine dioxygenase. Our demonstration that cdo-1 regulation operates largely independent of VHL-1 and EGL-9 prolyl hydroxylation is a mechanistic detail of this regulation and not the critical new finding. Although, we believe it does suggest where pathway analyses should be directed in the future. We also believe that our homeostatic feedback model for the regulation of HIF-1 (and cdo-1) by cysteine-derived H2S is new and exciting and provides insight into the logic of why HIF-1 might respond to H2S and promote the activity of cdo-1. Our work suggests that one reason for this intersection of hif-1 and cdo-1 is to sense and maintain cysteine homeostasis when cysteine is in excess.

      Issue 9: the lack of biochemical approaches to probe the underlying mechanism of the prolyl hydroxylaseindependent activity of EGL-9.

      Response 9: While not the primary focus of our current manuscript, we agree that this is an exciting area of future research. To uncover the prolyl hydroxylase-independent activity of EGL-9, we agree that a combination of approaches will be required including, biochemical, structure-function, and genetic.

      Major Issues We Feel the Authors Should Address:

      Issue 10: One particularly glaring concern is that the authors really do not know the extent to which the prolyl hydroxylase activity is (or is not) impacted by the H487A mutation in egl-9(rae276). If there is a fair amount of enzymatic activity left in this mutant, then it complicates interpretation. The paper would be strengthened if the authors could show that the egl-9(rae276) eliminates most if not all prolyl hydroxylase activity. In addition, the authors may want to consider doing RNAi for egl-9 in the egl-9(rae276) mutant as a control, as this would support the claim that whatever non-hydroxylase activity EGL-9 may have is indeed the causative agent for the elevation of CDO-1::GFP. Without such experiments, readers are left with the nagging concern that this allele is simply a hypomorph for the single biochemical activity of EGL-9 (i.e., the prolyl hydroxylase activity) rather than the more interesting, hypothesized scenario that EGL-9 has multiple biochemical activities, only one of which is the prolyl hydroxylase activity.

      Response 10: We have two lines of evidence that suggest the egl-9(rae276)-encoded H487A variant eliminates prolyl hydroxylase activity. First, Pan et al. 2007 (reference 57) demonstrate that when the equivalent histidine (H313) is mutated in human protein, that protein lacks detectible prolyl hydroxylase activity. Second, the phenotypic similarities caused by egl-9(rae276) and the vhl-1 null allele, ok161. Both alleles cause nearly identical activation of the Pcdo-1::GFP reporter transgene (Fig. 5C,D), and similarly impact the growth of the suox-1(gk738847) hypomorphic mutant (Table 1). This phenotypic overlap is highly relevant as the established role of VHL-1 is to recognize the hydroxyl mark conferred by the EGL-9 prolyl hydroxylase domain and promote the degradation of HIF-1. If EGL-9[H487A] had residual prolyl hydroxylase activity, we would expect the vhl-1(-) null mutant C. elegans to display more dramatic phenotypes than their egl-9(rae276) counterparts. This is not the case.

      Issue 11: The authors observed that EGL-9 can inhibit HIF-1 and the expression of the HIF-1 target cdo-1 through a combination of activities that are (1) dependent on its prolyl hydroxylase activity (and subsequent VHL-1 activity that acts on the resulting hydroxylated prolines on HIF-1), and (2) independent of that activity. This is not a novel finding, as the authors themselves carefully note in their Discussion section, as this odd phenomenon has been observed for many HIF-1 target genes in multiple publications. While this manuscript adds to the description of this phenomenon, it does not really probe the underlying mechanism or shed light on how EGL-9 has these dual activities. This limits the overall impact and novelty of the paper.

      Response 11: See response to Issues #8.

      Issue 12: Cysteine dioxygenases like CDO-1 operate in an oxygen-dependent manner to generate sulfites from cysteine. CDO-1 activity is dependent upon availability of molecular oxygen; this is an unexpected characteristic of a HIF-1 target, as its very activation is dependent on low molecular oxygen. Authors neither address this in the text nor experimentally, and it seems a glaring omission.

      Response 12: We agree this is an important point to raise within our manuscript. Although, despite its induction by HIF-1, there is no evidence that cdo-1 transcription is induced by hypoxia. In fact, in a genome wide transcriptomic study, cdo-1 was not found to be induced by hypoxia in C. elegans (Shen et al. 2005, reference 71).

      We have newly commented on the use of molecular oxygen as a substrate by both EGL-9 and CDO-1 in our Discussion section. The mammalian oxygen-sensing prolyl hydroxylase (EGLN1) has been demonstrated to have high a Km value for O2 (high µM range). This likely allows EGLN1 to be poised to respond to small decreases in cellular oxygen from normal oxygen tensions. Clearly, CDO-1 also requires oxygen as a substrate, however the Km of CDO-1 for O2 is likely to be much lower, preventing sensitivity of the cysteine catabolism to physiological decreases in O2 availability. Although, to our knowledge, the CDO1 Km value for O2 has not been experimentally determined. We have added a new Discussion section where we address the conundrum about low oxygen inducing HIF-1 but oxygen being needed by CDO-1/CDO1.

      Issue 13: The authors determined that the hypodermis is the site of the most prominent CDO-1::GFP expression, relevant to Figure 4. This claim would be strengthened if a negative control tissue, in the animal with the knockin allele, were shown. The hypodermal specific expression is a highlight of this paper, so it would make this article even stronger if they could further substantiate this claim.

      Response 13: Our claim that the hypodermis is the critical site of cdo-1 function is based on; i) our hands on experience looking at Pcdo-1::GFP, Pcdo-1::CDO-1::GFP, CDO-1::GFP (encoded by cdo-1(rae273)) and our reporting of these expression patterns in multiple figures throughout the manuscript and ii) the functional rescue of cdo-1(-) phenotypes by a cdo-1 rescue construct expressed by a hypodermal-specific promoter (col10). We agree that providing negative control tissues would modestly improve the manuscript. However, we do not think that adding these controls will substantially alter the conclusions of the paper. Importantly, we acknowledge this limitation of our work with the sentence, “However, we cannot exclude the possibility that CDO-1 also acts in other cells and tissues as well.”

      Minor issues to note:

      Issue 14: Mutants for hif-1 and cysl-1 are sensitive to exogenous cysteine levels, yet loss of CDO-1 expression is not sufficient to explain this phenomenon, suggesting other targets of HIF-1 are involved. Given the findings the authors (and others) have had showing a role for RHY-1 in sulfur amino acid metabolism, shouldn't the authors consider testing rhy-1 mutants for sensitivity to exogenous cysteine?

      Response 14: To test the hypothesis that rhy-1(-) C. elegans might be sensitive to supplemental cysteine, we cultured wild type and rhy-1(-) animals on 0, 100, and 1000µM supplemental cysteine. At 0 and 100µM supplemental cysteine, neither wild-type nor rhy-1(-) animals display any lethality suggesting rhy-1 is not required for survival in the face of excess cysteine (Fig. 3D,E). We also cultured these same strains on 1000µM supplemental cysteine, a concentration that is highly toxic to wild-type animals (100% lethality). rhy1(-) animals were resistant to 1000µM supplemental cysteine with a substantial fraction of the population surviving overnight exposure to this lethal dose of cysteine. Similarly, egl-9(-) mutant C. elegans were also resistant to 1000µM supplemental cysteine. We propose that loss of egl-9 or rhy-1 activates HIF-1-mediated transcription which is priming these mutants to cope with the lethal dose of cysteine. These data are now presented in Figure 3D-F and presented in the Results section.

      Issue 15: The cysteine exposure assay was performed by incubating nematodes overnight in liquid M9 media containing OP50 culture. The liquid culture approach adds two complications: (1) the worms are arguably starving or at least undernourished compared to animals grown on NGM plates, and (2) the worms are probably mildly hypoxic in the liquid cultures, which complicates the interpretation.

      Response 15: We agree that it is possible that animals growing overnight in liquid culture are undernourished and mildly hypoxic. However, we are confident in our data interpretation as all our experiments are appropriately controlled. Meaning, control and experimental groups were all grown under the same liquid culture conditions. Thus, these animals would all experience the same stressors that come with liquid culture. Importantly, we never make comparisons between groups that were grown under different culture conditions (i.e. solid media vs. liquid culture).

      Issue 16: An easily addressable concern is the wording of one of the main conclusions: that cdo-1 transcription is independent of the canonical prolyl hydroxylase function of EGL-9 and is instead dependent on one of EGL-9's non-canonical, non-characterized functions. There are several points in which the wording suggests that CDO-1 toxicity is independent of EGL-9. In their defense, the authors try to avoid this by saying, "EGL-9 PHD," to indicate that it is the prolyl hydroxylase function of EGL-9 that is not required for CDO-1 toxicity. However, this becomes confusing because much of the field uses PHD and EGL-9/EGLN as interchangeable protein names. The authors need to be clear about when they are describing the prolyl hydroxylase activity of EGL-9 rather than other (hypothesized) activities of EGL-9 that are independent of the prolyl hydroxylase activity.

      Response 16: We appreciate the reviewer alerting us to this practice within the field. To avoid confusion, we have removed the “PHD” abbreviation from our manuscript and explicitly referred to the “prolyl hydroxylase domain” where relevant.

      Issue 17: The authors state in the text, "the egl-9; suox-1 double mutants are extremely sick and slow growing." We appreciate that their "health" assay, based on the exhaustion of food from the plate, is qualitative. We also appreciate that it is a functional measure of many factors that contribute to how fast a population of worms can grow, reproduce, and consume that lawn of food. However, unless they do a lifespan assay and/or measure developmental timing and specifically determine that the double mutant animals themselves are developing and/or growing more slowly, we do not think it is appropriate to use the words "slow growing" to describe the population. As they point out, the rate of consumption of food on the plate in their health assay is determined by a multitude and indeed a confluence of factors; the growth rate is one specific one that is commonly measured and has an established meaning.

      Response 17: We see how the phrase ‘slow growing’ might imply a phenotype that we have not actually assessed with this assay. Therefore, we have removed all claims about “slow growth” of the strains presented in Table 1 and have highlighted the assay more overtly in the results section. For example; “While egl-9(-) and suox-1(gk738847) single mutant animals are healthy under standard culture conditions, the egl-9(-); suox1(gk738847) double mutant animals are extremely sick and require significantly more days to exhaust their E. coli food source under standard culture conditions (Table 1).”

      Reviewer #1 (Recommendations For The Authors):

      Issue 18: Relevance could be addressed further in the text.

      Response 18: We have added additional context for our work in the Discussion section. Please see our response to Issues #5, 6, 12, and 24.

      Issue 19: Better appreciation and integration of the manuscript's findings with published studies would be appropriate.

      Response 19: We have added additional context for our work in the Discussion section. Please see our response to Issues #5, 6, 12, and 24.

      Issue 20: It might be perhaps relevant to test whether cdo-1 is relevant for hypoxia resistance since it appears to be a key target for hif-1.

      Response 20: We agree that this is an interesting future direction, however given that cdo-1 mRNA is not induced by hypoxia (Shen et al. 2005) we have not prioritized these experiments for the current manuscript.

      Issue 21: "egl-9 inhibits cdo-1 transcription in a prolyl-hydroxylase and VHL-1-independent manner" should be tempered. vhl-1 mutants and egl-9 hydroxylase point mutant still have significant induction of the reporter.

      Response 21: Thank you for identifying this oversight. We have modified the Figure 5 legend title to read, “egl9 inhibits cdo-1 transcription in a largely prolyl-hydroxylase and VHL-1-independent manner.”

      Issue 22: Please use line numbers in the future for easier tracking of comments.

      Response 22: We shall.

      Issue 23: Abstract and elsewhere, "high cysteine activates...", should be rephrased to "high levels of cysteine".

      Response 23: We have made this change throughout the manuscript.

      Reviewer #3 (Recommendations For The Authors):

      Issue 24: The authors discuss CDO1 in the context of tumorigenesis, as well as the potential regulation between cysteine and the hypoxia response pathway. Thus, I was surprised that there was no mention of the foundational Bill Kaelin paper (Briggs et al 2016) showing how the accumulation of cysteine is related to tumorigenesis, and that cysteine is a direct activator of EglN1. Puzzling that CDO1 is a tumor suppressor: you lose it, cysteine can accumulate and activate EglN1, causing HIF1 turnover. How do the authors reconcile their results with this paper? I was also surprised that there was no mention in the Discussion of the role of hydrogen sulfide, cysteine metabolism, and CTH and CBS in oxygen sensation in the carotid body given the role they play there. Seems important to discuss this issue.

      Response 24: We have added new sections to our Discussion that consider the relationship between our work and Briggs et al. 2016 as well as mentioned the role of CTH and H2S in the mammalian carotid body.

      Issue 25: The abstract has a variety of contradictory statements. For example, the authors state that "HIF-1mediated induction of cdo-1 functions largely independent of EGL-9," but then go on to conclude in the final sentence that cysteine stimulates H2S production, which then activates EGL-9 signaling, which then increases HIF-1-mediated transcription of cdo-1. A quick reading of the abstract leaves the reader uncertain whether EGL-9 is or is not involved in this regulation of cdo-1 expression. In addition, the conclusion sentence implies that activation of the EGL-9 pathway increases HIF-1-mediated transcription, yet it is well established that EGL-9 is an inhibitor of HIF-1. The abstract fails to deliver a clear summary of the paper's conclusions. Perhaps consider this alternative (changes in capital letters):

      The amino acid cysteine is critical for many aspects of life, yet excess cysteine is toxic. Therefore, animals require pathways to maintain cysteine homeostasis. In mammals, high cysteine activates cysteine dioxygenase, a key enzyme in cysteine catabolism. The mechanism by which cysteine dioxygenase is regulated remains largely unknown. We discovered that C. elegans cysteine dioxygenase (cdo-1) is transcriptionally activated by high cysteine and the hypoxia inducible transcription factor (hif-1). hif-1- dependent activation of cdo-1 occurs downstream of an H2S-sensing pathway that includes rhy-1, cysl-1, and egl-9. cdo-1 transcription is primarily activated in the hypodermis where it is sufficient to drive sulfur amino acid metabolism. EGL-9 and HIF-1 are core members of the cellular hypoxia response. However, we demonstrate that the mechanism of HIF-1-mediated induction of cdo-1 IS largely independent of EGL-9 prolyl hydroxylASE ACTIVITY and the von Hippel-Lindau E3 ubiquitin ligase. We propose that the REGULATION OF cdo-1 BY HIF-1 reveals a negative feedback loop for maintaining cysteine homeostasis. High cysteine stimulates the production of an H2S signal. H2S then ACTS THROUGH the rhy-1/cysl-1/egl-9 signaling pathway DISTINCTLY FROM THEIR ROLE IN HYPOXIA RESPONSE TO INCREASE HIF-1-mediated transcription of cdo-1, promoting degradation of cysteine via CDO-1.

      Response 25: We agree that the abstract could be clearer. We believe this concern stems from the fact that we did not discuss our initial screen in the abstract. Thus, we failed to establish a role for egl-9 in the regulation of cdo-1. To remedy this, we have modified the abstract as suggested by the reviewer and added additional context. We believe that these changes improve the clarity of the Abstract substantially.

      Issue 26: An easily addressable concern involves the "dark" microscopy controls showing lack of fluorescence from a nematode. In these dark negative control micrographs, the authors should draw dotted outlines around where the worms are or include a brightfield image next to the fluorescence image. On a computer screen, it is in fact possible to make out the worms. Yet, when printed out, the reader must assume there are worms in the dark images. Additionally, we realize that adjusting fluorescence so that wild-type CDO-1 expression can be seen will result in oversaturation of the egl-9 and rhy-1; cdo-1 doubles; however, this would be a useful figure to add into the supplement to both provide a normal reference of CDO-1 low-level expression and a demonstration of just how bright it is in the mutant backgrounds. It would also be useful for you to please report your exposure settings for purposes of reproducibility.

      Response 26: As suggested, we have added dotted lines around the location of the C. elegans animals in all images where GFP expression is low or basal. We have also reported the exposure times for each image in the appropriate figure legends.

      Issue 27: This title is quite generic and doesn't even mention the main players (CDO-1 and sulfite metabolism).

      Response 27: We have updated our title to call attention to cysteine dioxygenase. The improved title is: “Hypoxia-inducible factor induces cysteine dioxygenase and promotes cysteine homeostasis in Caenorhabditis elegans”

      Issue 28: The authors mention two disorders in which CDO-1 plays a pathogenic role: MoCD and ISOD. We recommend switching the order in which the authors mention these, as the remainder of the paragraph is about MoCD. Also, they should write out the number "2" in the first sentence of that paragraph.

      Response 28: We have made the suggested changes.

      Issue 29: The authors state in the main text, "...to ubiquitinate HIF-1, targeting it for degradation by the proteosome." Here, they should refer to the pathway in Figure 5a.

      Response 29: We have made the suggested change.

      Issue 30: The authors state in the main text, "Elements of the HIF-1 pathway have emerged..." which is vague and confusingly worded. Change to, "Members of the HIF-1 pathway and its targets have emerged from C. elegans genetic studies."

      Response 30: We have made the suggested change.

      Issue 31: Clarify in the figure legends that supplemental cysteine did not affect the mortality of worms that were imaged.

      Response 31: We have added this note to Figure 3A and Figure S3A.

      Issue 32: Figure 1b. "the cdo-1 promoter is shown..." Add: "as a straight line" to the end of this phrase.

      Response 32: We have made the suggested change.

      Issue 33: The authors should consider changing the red text in Figure 1 to magenta, which tends to be more readable for people who have limited color vision.

      Response 33: We have adjusted the colors in Figure 1 as suggested.

      Issue 34: Figure 2, legend title. Consider changing "hif-1" to "HIF-1," as well as rhy-1, cysl-1, and egl-9. In this case, they are talking about proteins, not mutants or genes. This will make the paper easier to follow for readers who lack a C. elegans background.

      Response 34: We have made the suggested change.

      Issue 35: Figure 5, caption text. "...indicates weak similarity." Add, "amongst species compared."

      Response 35: We have made the suggested change.

      Issue 36: It is starting to become a standard for showing the datapoints in bar graphs. Although this is done in many graphs in the paper, it should also be done for Figure S1 and Figure 4C.

      Response 36: We have made the suggested change.

      Issue 37: An extensive ChIP-seq and RNA-seq analysis of C. elegans HIF-1 was recently published (Vora et al, 2022), which the authors should reference in support of the regulation of CDO-1 transcription by HIF-1 in their description of published expression studies of the pathway (Results section, page 4). Indeed, Vora et al were key generators of the ChIP-seq data cited in Warnhoff et al but not included as authors in the ModERN/ModENCODE publication: their contributions were published separately in Vora et al and should be acknowledged equivalently.

      Response 37: We appreciate the reviewer pointing this detail out and we have added the correct citation as indicated.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements We thank the Editors and the Reviewers for their time and constructive criticism, which has allowed us to improve our manuscript. All of our responses are indicated in blue font. Revision Figures for the Reviewers are included just below the response. The line numbers given here refer to those in the revised manuscript, where we have marked the changes in red.
      2. Description of the planned revisions If granted a full revision, we will experimentally address the following major points, which were raised by more than one Reviewer: ● Repeat experiment in Figure 4 C to assess statistical significance (Reviewer 1 and 3) ● Western blot analysis of HDV infected HLCs showing small and large delta antigens. We have already performed such an analysis on HLCs (see Revision Figure 2). In addition, we will perform a comparative analysis with common HDV infection models dHepaRG and Huh7-NTCP cells over time (Reviewers 2 and 3). ● Additional characterisation of the two HLC subpopulations at transcript and protein level (Reviewer 1 and 3). In addition, we planned to conduct the following experiments in response to the individual Reviewers: In response to Reviewer 1: We thank the Reviewer for their encouraging feedback on our model and for their helpful comments, allowing us to improve our manuscript. Figure 1: The observation of a denser subpopulation of hepatocytes more susceptible to HDV is interesting. Do you have more characterization of this cell subpopulation, by IFA, in term of hepatic maturation marker, known HDV host factors and particularly NTCP expression? We agree with the Reviewer that this is an interesting observation. We separated the two hepatocyte subpopulations to analyse the gene expression of the liver maturation markers NTCP and ALB by RT-qPCR (see Revision Figure 1A). Surprisingly, we found that the low-density population expressed higher levels of both ALB and NTCP, suggesting that they are more mature than the high-density population. In addition, we stained both markers by immunofluorescence and observed no apparent differences (see Revision Figures 1B & C). In contrast, the new host factor identified in our study, CD63, appeared to be more highly expressed in the high-density population compared to the low-density population (Fig. 6G). However, we cannot exclude the Revision Plan possibility that other factors play an additional role. As outlined in our response to Reviewer 3, we will separate the two populations and analyse the gene expression of other known HBV and HDV co-host factors to assess whether they play a role in addition to CD63 in conferring the higher susceptibility to HDV infection to the highly dense HLC population. Revision Figure 1: High-density HLCs population is not more mature than the low-density HLC population. (A) The low-density HLCs population was separated from the high-density HLC population by gentle dissociation. Total RNAs were isolated from both populations and Albumin and NTCP expression was analysed by RT-qPCR. (B & C) High-density HLCs (upper image) and low-density HLCs (bottom image) were stained with Albumin specific antibody. Shown are either images taken on an epifluorescence microscope (B) or single slices of confocal images acquired on a Airyscan confocal microscope (C). Fig 1B and C: Can a BLV control be included in the figure? Thank you for this suggestion, we will repeat the experiment for these panels and add BLV as control. Fig 1A-F: What is the overall level of NTCP between HLC, HepaRG, Huh7NTCP and HLCAAV- NTCP? Can NTCP and HDAg be stained simultaneously in your cells? This is an excellent question and we will compare the total NTCP levels between differentiated HepaRG, Huh7 NTCP, HLCs +/- AAV NTCP by Western blot analysis and immunofluorescence (IF) staining. Comparing NTCP expression in HLCs +/- AAV NTCP, we observed a strong upregulation of surface NTCP upon AAV transduction by IF staining (Figure 1D). Unfortunately, our initial attempts to simultaneously detect NTCP and HDAg were technically hampered. Since HDAg is mainly localised in the nuclei, we have to permeabilize the cells in a harsh manner, which interferes with the detection of membrane NTCP. The latter is further hampered by the availability of suitable anti-NCTP antibodies for IF staining. In our study, we used high doses of fluorescence-conjugated MyrB peptide to stain NTCP, but unfortunately it is very sensitive to the harsh permeabilization detergents mentioned above. However, since we have meanwhile optimised HDV infection, we will likewise try again to optimise the staining Revision Plan protocol. If we succeed, we will repeat the co-staining of NTCP and HDAg and include it in a revised manuscript. Figure 4: While the strategy is interesting, based on what has been previously shown for HCV in Wu et al., 2012, the lack of statistical data prevents the reader to really understand and see drastic difference in term of susceptibility to infection and level of expression of host genes. In panel C, is the difference between day 13 and 15 statistically significant? Same for panel D, day 17 vs 19?As a remark, day 19, the peak of susceptibility to HDV, seems to be also the peak of maturation, based on ALB RTqPCR (panel B). Thank you for this comment, and will perform another set of experiments allowing us to calculate statistical significance. The Reviewer correctly points out the correlation between HDV infection and hepatocyte maturity, which we find very intriguing. To identify potential host co- or restriction factors expressed in highly mature HLCs, we then performed the differential gene expression analysis (Figure 5). As shown in the new Figure 5A, GO analysis revealed that genes involved in pathways regulating viral entry into host cells were most significantly upregulated in mature HLCs and, as a probable consequence, they were more permissive to HDV infection. Indeed, among these factors, we identified CD63 as a novel host cofactor that renders mature HLCs susceptible to HDV infection (Figure 6). In response to Reviewer 2: We thank the Reviewer for their assessment of our study and for critically pointing out the increments over the previous study by Lange et al. We also appreciate their helpful suggestions, which allow us to improve the manuscript. The manuscript would benefit from a more detailed virological analysis, such as: •Determination of HDV genome and antigenome sequences and analysis of HDV editing. We thank the Reviewer for this comment. Accordingly, we will determine HDV genomes and antigenomes by Northern blot analysis and study HDV editing rates by sequencing in HDVinfected HLCs. •Analysis of HDV short and large antigens by western blot. We have already detected small and large HDAg in HDV-infected HLCs (see Revision Figure 2). To also satisfy Reviewer 3, we will additionally compare the S/L-HDAg ratios over time in HLCs, dHepaRGs, and Huh7-NTCP cells and include the results in a revised manuscript. Revision Figure 2: Detection of small and large delta antigen in HDV-infected HLCs. Mature HLCs were infected with HDV (MOI= 5 Int. Units/cell) and harvested 1 or 3 days post-infection. Cell lysates were analysed by Western blotting using antibodies against HDAg and b-actin. Revision Plan •Analysis of HBV-related virological parameters in monoinfected and co-infected cells. We agree with the Reviewer and we will include the characterisation of more HBV-related virological parameters in our mono- and co-infected HLCs. Accordingly, we will assess HBV cccDNA, RNA, and DNA by RT-qPCR, as well as released HBsAg and HBeAg via ELISA and add the results to the revised manuscript. In response to Reviewer 3: We thank the Reviewer for their positive evaluation, and we acknowledge their helpful comments, which will help us to improve our manuscript. Line 143: the authors describe two forms of HLCs (less and more confluent with differences regarding the susceptibility to HDV infection). The characteristics of the less and more confluent HLCs should be described in more detail-what is causative for the differences in susceptibility for HDV infection of these two forms? We thank the Reviewer for this comment. We likewise find this observation intriguing. As stated in our response to Reviewer 1, we have ruled out that NTCP and/or other mature markers such as ALB are differentially expressed between the two subpopulations. As one factor that could make a difference, we have identified CD63, which is highly expressed in the high-density HLC population and less so in the low-density HLC population (Figure 6G). Nevertheless, we will separate the two populations and analyse by RT-qPCR the expression of other known HBV and HDV host co-factors that may be additional factors governing the increased susceptibility of the highly dense HLC population. The statistical analyses should be improved: There are no p-values provided for the data presented in the supplement and a variety of figures lacks p-values We have added p-values to the Supplementary Figures (see revised Supplementary Fig. S2) and will repeat the experiments for Fig. 4 and Supplementary Fig. S1B and Fig S3 so that we can calculate the corresponding p-values. Kinetic of the infection: Here it would be interesting to see a comparative analysis by western blot investigating the ratio HBsAg/HDAg over the time in HLCs, HepaRGs and NTCP oe cells We thank the Reviewer for his comments. As stated in our response to Reviewer 2, we will perform this WB analysis to detect S/L-HDAg over time in infected HLCs, dHepaRG, and Huh7- NTCP cells. Line 157: What is the experimental evidence for the proper localization and functionality of the ectopically expressed NTCP in HLCs. Did the authors study the taurocholate transport after overexpression of NTCP? We thank the Reviewer for this comment. We analysed endogenous and ectopic NTCP expression by microscopy using a fluorescently conjugated peptide Atto-MyrB-565, which specifically binds to the ectodomain of human NTCP (Figure 2D) and found that both Revision Plan endogenously and ectopically expressed NTCP are located on the cell surface. To further confirm the correct localisation, we will perform NTCP co-staining with a cell membrane marker. We will also test the proper function of the ectopically expressed NTCP using a specific taurocholate transport assay as shown in our previous study (Ni et al, 2014, Gastroenterology). Line 169: The authors should include data comparing the number of double positive cells in HLCs, HepaRGs and Huh7NTCP o.e. expressing cells under the chosen experimental conditions We thank the Reviewer for this suggestion. We have already performed HBV/HDV co-infection of dHepaRG cells (Revision Figure 3) and we will perform the same experiment with Huh7-NTCP cells. Revision Figure 3: HBV/HDV co-infection of dHepaRG cells. Differentiated HepaRG were infected with HBV (MOI = 450 genome copies/cell) and HDV (MOI = 5). Cells were stained against HBV core (HBc), HDAg, and nuclei (DAPI) ten days p.i.. HBc- and HDAg-positive cells were counted using Cellprofiler imaging software to quantify HBV (pink) and HDV (green) single and co-infection (white) events. Images are representative of three independent differentiations. Line 291: expression analysis by RT-PCR is not sufficient. It will be important to study by CLSM if the identified factors are really present as proteins and properly localized. To satisfy this Reviewer, we will be happy to perform WB analysis of lysates from cells obtained at different stages of HLC differentiation to detect LDLR, LAMP1 and SR-B1 to further confirm our transcriptome analysis. As protein expression is easier to compare by WB analysis, we prefer this method to microscopic analysis. Regarding the role of CD63: what is the evidence for a direct role of CD63 for HDV entrycan the authors exclude that CD63 is relevant for targeting other factors to the surface? What is the impact of loss of CD63 on the functionality of the autophagosomal-MVB-EV system in HLCs? Since downregulation of CD63 before but not after impairs HDV infection, we conclude that CD63 is likely to be important for the early steps of the HDV life cycle, namely cell entry. Indeed, we speculate that CD63 may be critical for HDV trafficking to the vesicle, where fusion of the HBV glycoproteins is induced to allow capsid entry, based on the following observations: Although neither the precise site of HBV viral fusion nor the cues that induce fusion are currently fully understood, studies suggest that HBV can be co-transported with EGFR and NTCP to late endosomes for trafficking (Herrscher et al; 2020, Cells). We speculate that similar to what has been described for Lujo virus, CD63 may be involved in either HDV trafficking and/or virus fusion in the endosomal system (late endosome or lysosome) (Tominaga et al, 2014 Molecular cancer). Revision Plan CD63 is a ubiquitously expressed protein that localises to the endosomal system and, in its glycosylated form, to the cell surface. Non-glycosylated CD63 is not properly trafficked and aggregates at the nuclear periphery instead of the cell membrane (Tominaga et al., 2014, Molecular Cancer). According to the Western blot analysis in Figure 6, immature HLCs appear to express less glycosylated CD63 than mature HLCs. We will confirm the glycosylation by treating the cell lysates with PNGase F. Although AAV transduction enhanced CD63 expression of all three HLC stages tested (see new Supplementary Figure S6 in the revised manuscript), it only enhanced HDV infection of immature HLCs, in which the non-glycosylated form of CD63 appears to be the predominant form. To demonstrate that the glycosylated form of CD63 is involved in HDV entry, we will rescue WT CD63 in parallel with a glycosylation-deficient CD63 mutant (Yoshida et al., 2009, Microbiology and Immunology) in immature HLCs. We will also stain CD63 in both immature and mature HLCs to compare the subcellular localisation (plasma membrane/endosomes vs. nuclear membrane) of CD63 between the two stages.
      3. Description of the revisions that have already been incorporated in the transferred manuscript Based on the constructive comments by the Reviewers we already made the following changes, which are highlighted in red in the revised manuscript. In response to Reviewer 1: Fig 1B-C: the comparison with dHepaRG is very interesting, and confirms the validity of SC derived hepatocytes as a model for HDV infection. dHepaRG can be heterogeneous. Do you also see the same phenotype of enriched HDV infection within a denser subpopulations of dHepaRG We thank the Reviewer for their comment. Undifferentiated bipotent HepaRG cells are not permissive for HDV infection due to the lack of surface NTCP expression. Due to their bipotent nature and upon differentiation, two morphologically distinct populations become apparent: hepatocyte-like cells and biliary epithelial-like cells (McGill et al., 2010, Hepatology). As shown in the Figure 1 of the study by Mesnage et al. (2018, Molecular Toxicology), dense hepatocyte-like colonies are surrounded by clear epithelial cells corresponding to primitive biliary cells. In agreement with other studies, we only observe that the ALB-positive hepatocyte-like cells are permissive to HBV and HDV infection (Hantz et al., 2009, Journal of General Virology), highlighting their specific hepatic tropism and the cellular determinants required. Fig 1I is confusing. Was BLV assay also performed on the HLC infection (Day 0), or only during the titration assay in Huh7NTCP? We apologise for the confusion in this panel. BLV was only added during the titration assay on Huh7NTCP cells to confirm new and productive infections and to rule out carry-over. We have changed the order of Figures 1I - 1K to make this clearer and explain this better in the new results section (line 171-179) and figure legend (line 797-806). Revision Plan Fig 1K: x-axis is confusing... is it number of HBV, HDV and HBV/HDV positive cells? Or number of infected cells upon inoculation with HBV, HDV, or both? Please clarify. We apologise for this additional confusion caused in this panel. We infected HLCs with both HBV and HDV simultaneously and then counted the number of positive cells that were either single infected with HBV (pink cells/column), single infected with HDV (green cells/column) or double infected with both viruses (white cells/column). We have clarified this in the revised Results section (line 172-176) and in the revised Figure Legend (line 798-803). Figure 2: The AAV based vector to over express HBsAg is a very interesting tool, and the figure convincingly show production of HDV progeny viruses in HLC-AAV-HBsAg. Results shown are in agreement with previous studies based on hepatoma cell lines. We thank the Reviewer for this positive comment and we agree that AAVs represent interesting tools to genetically manipulate HLCs and other hepatocyte culture systems. Figure 2B: What is IU/ml? Infectious Unit? International Unit? Are units in Fig 1B, 2B and 2C the same? We apologise for the lack of clarity. In Figures 1B and 2C, IU corresponds to infectious units of HDV, whereas in Figure 2B, IU corresponds to international units for the assessment of secreted HBSAg levels in the supernatant. To make the difference clearer, we have changed the unit on the y-axis in Figure 2B and explicitly stated the abbreviations in the corresponding revised Figure Legends (lines 785, 786, 794, 795, 816, and 819). Figure 3: What is the overall number of transmission events observed in the co-culture setup? Can you visually observed viral spreading? Panel A shows only 1 event, making it hard to assess its efficiency. Titration assay in Fig 2C show production of up to 4-5 log of infectious HDV. But HLCs susceptibility to HDV infection may change during time... Thank you for your comment and for raising this important issue. Panel A clearly and visually demonstrates that extracellular spread of HDV had occurred in the HLCs system, as initially only WT and non-GFP positive HLCs were infected with HDV. After co-culture, the progeny of WT HLCs were able to infect GFP-HLCs (Figure 3A). The overall efficiency of HDV spread/transmission in HLC efficiency is shown in Figure 3C. If we allow spread to occur (DMSOtreated condition), the total number of HDV-positive HLCs grown in a 24-well plate is approximately 1000. When we block secondary infection of progeny with BLV and thus spread, we count only about 500 HDV-positive HLCs in a well. In general, spreading in HLCs (Figure 3C) is not as efficient as retitration to Huh7-NTCP (Figure 2C) for the following reasons: In Figure 2C, we wanted to have an estimation of the maximum amount of secreted infectious progeny from HDV-producing HLCs. To this end, we did not want the re-infection itself to be a major bottleneck and used the most susceptible model Huh7-NTCP and infected them under the best conditions, which includes the addition of 4% PEG and 2% DMSO in the culture medium. For our spread assay in HLCs, we cannot add PEG to the cells over the course of the experiment and we also wanted to be as physiological as possible. PEG significantly enhances HDV infection Revision Plan of HLCs (Supplementary Fig. S2) and Huh7-NTCP cells (Revision Figure 4), which is in agreement with previous studies (Michailidis et al., 2017, Scientific reports). In addition, as the Reviewer correctly points out, similar to other primary hepatocyte culture models, the HLC system deteriorates over time. However, we have found that HLCs can be cultured for up to 3 weeks. Nevertheless, we believe that the efficiency of HDV spread in HLCs is sufficient for drug testing (Fig. 3C & D). Revision Figure 4: PEG enhances HDV infection of Huh7-NTCP cells. Huh7- NTCP cells were infected with HDV (MOI= 5 Int. Units/cell) in the absence or presence of PEG. Cells were harvested on D5 pi and HDV genome copies were quantified by RT-qPCR. Figure 5: In panel A, GO pathways should be sorted based on significance, not Number of genes. In panel B-D, what is the scale of the heatmap on figure 5: change in CPM values, however log2, log10? Thank you for this comment, we have sorted the GO pathways based on significance (new Figure 5A). For panels B-D, we did not calculate the fold change in CPM values and they were not log transformed. Instead, we calculated the z-scores of the genes shown by comparing the expression level of a given gene (in CPM) in a given sample with the expression level of that gene across all samples. To avoid further confusion, we have added "z-score" to the new Figure 5. Figure 6: Do you have info about CD63 in other mature model, like dHepaRG and PHHs? Is CD63 also limiting in these models? Our data in Figure 6 suggest that CD63 may be a limiting factor for HDV infection of immature HLCs but not mature HLCs. Both dHepaRG cells and PHHs are mature hepatocyte models and therefore we speculate that CD63 is not rate limiting. However, we will investigate whether CD63 is rate-limiting in undifferentiated HepaRG cells. In response to Reviewer 2: Additional information that needs to be added, better explained, or corrected: The authors should explain why they used different MOIs depending on the genotype. In our previous study by Wang et al. 2021 J Hepatol, we found that the different HDV genotypes are heterogeneous in their ability to infect Huh7 NCTP cells. For example, as shown in Figure 4B of Wang et al. 2021 J Hepatol, GT 4 and 5 are less infectious than other genotypes. Based on the different infectious titres of the genotypes obtained on Huh7 NTCP cells, we then decided to use different MOIs for infection of our HLCs. The aim of the present study by Chi et al. was not to Revision Plan compare the different HDV genotypes, but to analyse whether they can all infect HLCs. In order to obtain similar infection efficiencies of our HLCs with the different genotypes, we used higher MOIs for those genotypes that were less infectious in Huh7-NTCP cells compared to those genotypes that were more infectious in Huh7-NCTP cells. We apologise for not making this sufficiently clear and have added this information to the results section (line 167-170) and the corresponding figure legend (line 796) of the revised manuscript. In Figure 1, it is unclear on which day the HCLs were infected by HDV and on which day they were transduced with AAV-NTCP. We apologise for the lack of clarity in the experimental design. We transduced HLCs with AAV two days before HDV infection to ensure sufficient ectopic NTCP expression on the day of HDV infection to study its effect on HDV entry. We have clarified this in the results section (line 153, 156) and in the figure legend (line 788) in the revised manuscript. It is not very clear if the authors used AAV serotype 6 consistently to transduce the cells. It would be valuable to show the transduction efficiency of AAV at different time points of HLC maturation, as it might also be affected and could explain some results. For example, in Figure 6H, why does AAV-CD63 transduction increase HDV infectivity at day15 but not at day 10? It would be interesting to repeat the anti-CD63 western blot after AAV-CD63 transduction. Thank you for this comment. Yes, we have consistently used AAV 6 due to its relatively broad tissue tropism (Verdera et al., 2020, Molecular Therapy) and we have clarified this information in the revised manuscript (see line 331). We agree with the Reviewer's concerns regarding the variable transduction efficiency. We have previously tested different AAV capsids and found that AAV6 transduced mature HLCs at high levels (Zhang et al., 2022, Hepatol Commun). In this study, we also performed Western blot analysis to confirm successful CD63 overexpression by AAV transduction at different stages of hepatocyte differentiation. As shown in new Supplementare Figure 6, although there were some differences in transduction efficiency, the majority of all cells at each stage of differentiation were successfully transduced to ectopically express CD63. The authors claim that by using AAV to express HBsAg, they are mimicking the expression of HBsAg from the integrated sequence rather than cccDNA. However, it is the opposite, as AAV genomes, like cccDNA, remain as episomes in the cells. Yes, the Reviewer is conceptually correct and we apologise for the incorrect wording. In principle, we aim to trans-complement HBsAg in a setting outside of HBV infection and thus mimic the expression of antigen from integrated cells, although AAVs of course remain mostly episomal. We have clarified this in the revised manuscript (see lines 188 & 378). In response to Reviewer 3: Line 217: the complete inhibition of cell to cell spread by myrcludex suggests that there is no spread by cell-cell contact. This should be discussed. Revision Plan Yes, there is no evidence of HDV spread by cell-cell contact because, as the Reviewer correctly points out, BLV treatment almost completely blocked HDV de novo infection (Figure 2D & E). To our knowledge, cell-to-cell spread has not been demonstrated for HDV. According to our own studies by Zhang et al, 2021/2022, Journal of Hepatology, HDV spreads either extracellularly (which can be blocked by BLV) or by cell division (discussed in lines 362). Since HLC are similar to primary human hepatocytes and do not divide in vitro, we believe that extracellular spread is the predominant mode of spread in HLC (line 365). Line 210ff:Is there any evidence for syncytia formation in this system? No, we have not observed syncytia formation. Since HDV has no glycoproteins, we would not expect syncytia to form. Line 42: secrete should be replaced by release We thank the Reviewer for pointing out the inaccuracy in our terminology. We have replaced "secrete" with "release" (line 42). Line 241: proteins are not expressed, genes are expressed Thank you, we agree and changed the wording accordingly (line 246).
      4. Description of analyses that authors prefer not to carry out In response to Reviewer 1: Fig 1B: Unit is confusing, using terms usually used for titration of infectivity, from the virus input point of view, not from the cellular point of view. Can you use % infected cells instead, or "HDV infection rate" like in Supp Fig 1B? We apologise for this confusion. For other viruses, such as but not limited to HCV or HEV, the most common method is to report focus forming units per ml (FFU/ml). HLCs do not divide and, in the absence of HBV S antigen, no cell-division mediated HDV spread can occur and only single infection events can be observed (hence infectious unit = IU/ml). Since differentiated, authentic hepatocyte culture models such as PHHs, HLCs or HepaRG cells are always characterised by strong cell heterogeneity, it is difficult to directly compare the overall percentage of infection with a homogeneous cell population such as Huh7-NTCP cells. Therefore, if the Reviewer allows us, we prefer to keep this unit in our main figures. However, and hopefully to the satisfaction of this Reviewer, we have also calculated the percentage of infected cells of this exact dataset and show it in the Supplementary figures (Suppl. Fig. S1 C). The proportion of infection efficiency comparing HLCs, dHepaRGs, and Huh7-NTCP cells does not differ when presented either as IU/ml or as percentage of infected cells.
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, the authors report a novel and simple method to analyze the heterogeneity of various organelles. After imaging a large set of fluorescent-marker-labeled organelles, cluster analysis is adapted for illuminating the dynamics of organelles. Through this novel method, the authors are able to report organelle contact, which previously can only be observed by super-resolution imaging. This is method could significantly accelerate future discoveries at the cellular level. The manuscript is well written and has the potential be published in high-ranking journals, after a minor revision.

      To further demonstrate the unique power of this new method, the authors should test cells under known stimulation altering the dynamics of organelles. For instance, wortmannin can blocks the conversion from early endosomes to late endosomes. By doing that, the potential of this new method will be endorsed.

      Response:

      We thank Reviewer #1 for the positive comments. We will add an experiment using wortmannin to block the process of endocytosis at a specific stage, as part of the experiments analyzing the process of endocytosis.

      **Minor issue:** The authors should include more details about how to avoid signal crosstalk between adjacent fluorescent channels.

      Response:

      In the Methods section, we have added the following sentences to Lines 398-405.

      “In order to avoid signal crosstalk between adjacent fluorescence channels, eight fluorophores with distinct spectral distances were selected, and the samples were irradiated sequentially with lasers in the order from the longest wavelength, i.e., fluorescence from 646 to 731 nm was excited by a 640 nm laser, fluorescence from 569 to 634 nm was excited by a 561 nm laser, fluorescence from 494 to 554 nm was excited by a 488 nm laser, and fluorescence from 411 to 481 nm was excited by a 405 nm laser, as shown in Extended Data Fig. 1b.”

      Reviewer #1 (Significance (Required)):

      The comprehensive monitoring of organelle dynamics through the integration of multi-dimensional parameters can proficiently evaluate the condition and prognosticate the destiny of living cells in response to external stimulations. This new multi-dimensional assay reported in this manuscript represents a huge step towards this goal. Since this new method is simple and powerful, cell biologists will quickly start to use this new method for the study of subcellular dynamics.

      My lab is also developing a similar approach for organelles based on super-resolution imaging. I would like to congratulate the authors for this beautiful work.

      Response:

      We thank Reviewer #1 for the positive comment.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The manuscript reports a multi-parametric particle-based method for analysis of organelles. The method aims to resolve heterogeneous populations of organelles involved in various cellular processes. They propose to isolate organelles labelled with multiple markers, after homogenization and sonification of the cells, and analyse the resulting particles by fluorescence microscopy using spectral imaging. Afterwards, the authors visualize and analyse the obtained data with dimension reduction techniques.

      Even though an interesting approach, the method and presented applications needs major improvisations before it can prove to be impactful for the field

      I note some possible improvement points below:

      • Initially, I think the current set of cell lines and labels should be extended also to include a wider set. The current limited set raises the question if the method authors report is also applicable to other cell lines, or if it only feasible with overexpressed markers. Including different cell lines with different labels would make the study more convincing and comprehensive.

      Response:

      We thank Reviewer #2 for this constructive comment. Regarding cell types, we will conduct experiments with HEK293T cells in addition to HeLa cells, labeling at least five different types of typical organelles. In our method, as shown in Figure 1a and 5a, we have already used not only overexpressed markers but also fluorescently labeled ligands (EGF-Alexa, transferrin-Alexa) and antibodies against endogenous proteins (anti-PMP70, anti-LAMP1), as well as direct labeling of cell membrane proteins (Alexa-NHS). Therefore, there are no significant limitations with respect to organelle labeling methods.

      • It is surprising that the authors explicitly list already the limitations of fluorescence microscopy and super-resolution microscopy in the second paragraph of their introduction, however present a method fully dependent on fluorescence labelling and imaging methods. Actually their approach takes away the spatial information of FM approaches, and further makes the approach prone to the limitations they state.

      They are also not fully fair about the limitation they state for Electron microscopy, as newly developed approaches (e.g. doi:10.1093/micmic/ozad067.1091;  doi:10.1126/science.aay3134) widely extend the limited field of view and sampling capacity of EM. I recommend the authors to state the potential advantage/superiority of the reported method rather than stating the unclear limitations of the existing powerful methods.

      Response:

      Regarding fluorescence microscopy, it appears that our description was inadequate and misled the reviewers. There is no problem with fluorescence microscopy itself. What we intended to convey was that “when attempting to detect individual organelles ‘in cells’, there are limitations in the resolution of fluorescence microscopy because organelles are densely packed”. We have added this to the text on Line 49. Also, we thank Reviewer #2 for informing us about the high-speed 3D electron microscopy. We have cited the indicated papers in the text at Lines 54-55 and mention that “except for the recently developed high-throughput electron microscopy”.

      • Most organelle markers the isolation of organelles are based on are overexpressed in the cells: endoplasmic reticulum (ER, mTagBFP2 (BFP)-SEC61B), mitochondria (GFP-OMP25 and SNAP-OMP25), and the Golgi (Venus-GS27). This raises significant questions about the native state relevance of the reported results, and how well they represent the endogenous processes.

      Response:

      We will add experiments analyzing the behavior of both endogenous and exogenous markers for the same organelles, for example, anti-LAMP1 antibody and VAMP7-GFP for lysosomes, and anti-PMP70 antibody and PEX16-GFP for peroxisomes.

      • For the application on endosomes, can the authors state what is the new information enabled by their method? They study the very trafficking of EGF and Transferrin, 2 widely used endosomal cargoes with very well characterized trafficking steps, and show they are trafficked through Rab5/7 and Rab11 positive endosomes, respectively. This recapitulates the existing information, however falls short in delivering new insight. The authors can use these cargoes for proof-of-concept, but I would recommend to extend their study with less exploited cargoes to represent the potential of the reported method to deliver new information.

      Response:

      We thank Reviewer #2 for the positive suggestion about the potential of our method to provide new information. However, to demonstrate new biological insights, it would take a lot of time and delay the provision of our methodology, so we would like to submit this manuscript as a Methods paper with the proof-of-concept data.

      Reviewer #2 (Significance (Required)):

      The significance of biochemical and cellular processes being spatially regulated cellular organelles, and the roles of specific organelles in diseases from cancer to neurodegeneration are continuously being discovered and appreciated. Therefore development of methods reporting on the structure and function of organelles is important to accelerate these studies. In the reported method, however, the ultrastructure (as in Fib 1b) and the spatial information of the cellular organelles are inherently lost. The method falls in between a biochemical and a microscopic approach, however the advantages are not clearly portrayed. I recommend the authors to carefully and explicitly state where their method would be the method of choice rather than a biochemistry, mass spectroscopy, or microscopy approach. The authors should critically consider such an experiment as a proof-of-concept case.

      Response:

      We thank Reviewer #2 for the valuable suggestion. We added the following to the Discussion (Lines 267-277).

      “A further potential application of our method would be to measure how the levels of key molecules in an organelle change during its differentiation or maturation. For example, the levels of PI4P and syntaxin 17 change during autophagosome maturation (Shinoda et al. eLife Preprint Review doi.org/10.7554/eLife.92189.1), which can be better demonstrated by this method using multiple markers for each stage of autophagosome formation and maturation, PI4P, and syntaxin17 because autophagosomes at different stages coexist in cells. In such cases, our single-particle analysis method, which examines the state of individual autophagosomes, would be more appropriate than biochemical methods that examine averages. In addition, it is difficult to quantitatively analyze many organelle structures in cells using fluorescence microscopy. Our particle-based analysis method can overcome this problem.”

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      **Comments, suggestions, and questions**

      • I would like to start with a positive suggestion. The authors completely miss out on the opportunity to promote their approach by not relying on any type of fixation. In most multiplexing experiments, the first major challenge is to find antibodies that work well for imaging. The second challenge is then to find antibodies that work well under the same fixation conditions. The authors present a multiplexing approach that is completely independent of fixation. I suggest discussing this in the manuscript and promoting the approach in that regard.

      Response:

      We thank Reviewer #3 for pointing out the advantages of our method. We have added “Our method that is independent of fixation is advantageous for the optimization of the staining condition (Lines 298-299).

      • I am wondering what defines the ‘resolution’ of this approach. I assume it is a combination of the sonication time -the longer the cell is sonicated, the smaller the fragments are - and the density of particles on the coverslip. What are the limits here? How does this affect the UMAP analysis? I would encourage the authors to discuss this in the manuscript.

      Response:

      The particle density on a coverslip can be easily reduced by simply diluting the particles in a buffer solution. Therefore, there is no density limit, which is an advantage of a cell-free system. To improve the resolution within a single organelle, for example, to separate distinct subdomains, as the reviewer mentioned, we can prolong the sonication time to make the particles smaller. However, since this will reduce the signal-to-background ratio and destroy organelle contacts, we used the sonication conditions as mild as possible. To investigate organelle subdomains and fragile contacts, the sonication conditions need to be optimized carefully, which should affect the UMAP analysis, but we think that these will be future work.

      We do not think that prolonged sonication will affect the UMAP analysis because relative fluorescent signals of each particle would not change. However, as mentioned above, too strong sonication would worsen the signal-to-noise ratio, resulting in poor clustering.

      We have added the above discussion to the Discussion (Lines 288-293).

      “Also, to improve the resolution within a single organelle, for example, to separate distinct subdomains, we can prolong the sonication time to make the particles smaller. However, since this will reduce the signal-to-background ratio and may destroy organelle contacts, we used the sonication conditions as mild as possible. To investigate organelle subdomains and fragile contacts, the sonication conditions need to be optimized carefully.”

      • The only real control the authors present are the correlative light and electron microscopy (CLEM) three images in Figure 1b, which seems very minimalistic for a very central and essential control experiment. How many of these control images did the authors take? Is there possibly a second method for a control experiment to link the fluorescence readout to an organelle fragment (e.g., purification or pulldown)?

      Response:

      Since all the markers we used are well-established, we believe that there is no concern about the fluorescence readouts to the organelle fragments. We have cited the following papers in Lines 84-85.

      SEC61B: Rapoport, T. A., Jungnickel, B. & Kutay, U. Protein transport across the eukaryotic endoplasmic reticulum and bacterial inner membranes. Annu Rev Biochem 65, 271–303 (1996).

      OMP25: Horie, C., Suzuki, H., Sakaguchi, M. & Mihara, K. Characterization of signal that directs C-tail-anchored proteins to mammalian mitochondrial outer membrane. Mol Biol Cell 13, 1615–1625 (2002).

      GS27: Hay, J. C. et al. Localization, Dynamics, and Protein Interactions Reveal Distinct Roles for ER and Golgi SNAREs. J Cell Biol 141, 1489–1502 (1998).

      Fusella, A., Micaroni, M., Di Giandomenico, D., Mironov, A. A. & Beznoussenko, G. V. Segregation of the Qb-SNAREs GS27 and GS28 into Golgi Vesicles Regulates Intra-Golgi Transport. Traffic 14, 568–584 (2013).

      Although it is relatively easy to identify mitochondria-derived particles by EM based on their size and the presence of cristae-like structures (indeed we see many examples), it is more challenging for other organelles (because they appear simple vesicles). This is why we showed only mitochondria in Fig. 1b. Furthermore, the main purpose of this EM image is to show membrane contacts between the ER and mitochondria (related to Fig. 3).

      • Line 37-41: Could the authors please strengthen these statements with an appropriate citation (e.g., a review)?

      Response:

      We have cited the textbook Molecular Biology of THE CELL (the 6th edition, Chapter 12 and Chapter 13) in Lines 37 and 41.

      Response:

      We thank Reviewer #3 for notifying us of these important studies. We have rewritten the sentence on Lines 51-52 to read “Although multicolor imaging has been attempted with super-resolution microscopy (references of the indicated papers), it only partially solves the issue of resolution.”

      • The authors use spectral unmixing to overcome the limit of spectral multiplexing. While this has been demonstrated to work well for less than ten targets, it does not scale to multiplexing experiments with more than ten target species. I suggest that the authors discuss in the discussion part of the manuscript the potential of DNA-based multiplexed imaging, such as CODEX or DNA-PAINT, in combination with the presented approach.

      Response:

      In the Discussion (Lines 295-298), we have added the sentence “Current fluorescent particle detection uses spectral multiplexing, but this method has only been able to detect up to eight colors. Methods such as CODEX or DNA-PAINT with wide-field type illumination could significantly increase the number of targets”.

      Response:

      We thank Reviewer #3 for informing us. We have cited it in Line 72.

      • By using spectral unmixing for multiplexing, this method is limited to confocal due to spectral detection needs and therefore limited in throughput. It would be beneficial if it could work with wide-field type illumination. This could substantially increase the throughput, which is another reason why I think it would be important to discuss sequential multiplexing.

      Response:

      We agree with the Reviewer’s comment. We have added the discussion to Lines 295-298 as described in our response to Reviewer #3, Comment (6).

      • To image contact sites, the authors use split GFP. There have been discussions that split GFP might, in some cases, facilitate the process that is supposed to be measured, in this case, the formation of contact sites. I suggest using at transient version of split GFP, called split fast, for follow-up experiments in the authors’ next papers (https://www.nature.com/articles/s41467-019-10855-0).

      Response:

      We thank Reviewer #3 for providing this information. We will do it as suggested in the next paper.

      • Line 27 & 253: Please drop the term ‘intuitive’ or explain better what you mean by intuitive. For me, UMAP is certainly a very useful tool, but it is not at all what I would describe as intuitive.

      Response:

      We have deleted ‘intuitive’ in all seven places and rewritten them (Lines 27, 43, 58, 72, 180, 231, and 253).

      • Lastly, I want to mention that the authors state they used ChatGPT, DeepL, and DeepL Write for translation from Japanese to English. I appreciate their honesty.

      Response:

      We thank Reviewer #3 for the comment.

      Reviewer #3 (Significance (Required)):

      In the manuscript titled “Organelle Landscape Analysis Using a Multi-parametric-Based Method,” Kurikawa et al.present a method for multi-parametric, particle-based analysis of cellular organelles. After lysing cells, the fractions of the organelles are partially labeled with fluorescently tagged antibodies, while others are already tagged with fluorescent proteins, using six to eight spectrally different fluorescent dyes/proteins. These fractions are subsequently immobilized on a poly-L-lysine-coated coverslip. The authors use spectral unmixing to distinguish these markers. The6-8 multiplexed imaging data is then presented in two-dimensional UMAP space. The authors then use this approach to visualize seven major organelles, transitional sites of endocytic organelles, and contact sites between the endoplasmic reticulum and mitochondria using split GFP.

      The authors present, in my opinion, a conceptually new and interesting approach by combining spectral unmixing for imaging up to eight targets, with organelle fragment imaging, and presenting multidimensional data in two-dimensional Uniform Manifold Approximation and Projection (UMAP) space in this manuscript. They further validated this approach by linking the results of the experiments to results established or at least reported in the literature.

      In general, the manuscript is, in my opinion, a good fit for publication as it presents a conceptionally new approach and an interesting example of applying the UMAP approach, which might be of interest to a broader readership. Therefore, after an appropriate response to my comments, suggestions, and questions (see below), I would recommend this manuscript for publication.

      Response:

      We thank Reviewer #3 for the positive comment.

    1. Author Response

      The following is the authors’ response to the original reviews.

      eLife assessment

      This work presents important findings for the field of Alzheimer's disease, especially for the electrophysiology subfield, by investigating the temporal evolution of different disease stages typically reported using M/EEG markers of resting-state brain activity. The evidence supporting the conclusions is solid and the methodology as well as the descriptions of the processes are of high quality, although a separation of individuals who are biomarker positive versus negative would have strengthened the interpretability of the results and the conclusions of the study.

      Response: Thank you for the positive assessment of the paper.

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary:

      The authors aimed to infer the trajectories of long range and local neuronal synchrony across the Alzheimer's disease continuum, relative to neurodegeneration and cognitive decline. The trajectories are inferred using event-based models, which infer a set of data-driven disease stages from a given dataset. The authors develop an adapted event-based modelling approach, in which they characterise each stage as a particular biomarker increasing by a particular z-score deviation from controls. Fitting infers the optimal set of z-scores to use for each biomarker and the order in which each biomarker reaches each z-score. The authors apply this approach to data from 148 individuals (70 cognitively unimpaired older adults and 78 individual with mild cognitive impairment or Alzheimer's disease), identifying trajectories in which long-range (amplitude-envolope correlation) and local (regional spectral power) neuronal synchrony in the alpha and beta bands becomes abnormal prior to neurodegeneration (measured as the volume of the parahippocampal gyrus) and cognitive decline (measured using the mini-mental state examination).

      Strengths:

      • The main strength is that the authors assess two models. In the first they derive a staging system based only on the volume of the parahippocampal gyrus and mini-mental state examination score. They then investigate how neuronal synchrony metrics change compared to this staging system. In the second they derive a staging system that also includes an average (combined long-range and local) neuronal synchrony metric and investigate how long-range and local synchrony metrics change relative to this staging system. This is a strength as the first model provides confidence that there is not overfitting to the neuronal synchrony data, and the second provides more detailed insights into the dynamics of the early neuronal synchrony changes.

      • Another strength is that the authors automatically infer the optimal z-scores to choose, rather than having to pre-select them manually, as in previous approaches.

      Response: Thank you for the positive comments and a succinct summary of the paper and its strengths.

      Weaknesses:

      • The dataset is small and no external validation is performed.

      Response: We agree that future validation studies of the predictions are necessary. We now include the related sentences in the last paragraph of the limitations section in the revised manuscript.

      • A high proportion of the data is from controls (nearly 50%) with no biomarker evidence of Alzheimer's disease, and so the changes may be driven by aging or other non-Alzheimer's effects.

      Response: We would like to clarify that the z-scores of the metrics used in the EBMs were computed using age-adjusted values. All our controls were recruited from an ongoing longitudinal study of healthy aging. Amongst the 70 controls, 39 have confirmed A-beta negative PET scans and 8 were confirmed A-beta positive PET scans, and in the rest of the 23 we do not have any biomarker data available. However, in all the controls, we have conducted comprehensive neuropsychological assessment (see Appendix 1—table 1 in the revised supplementary file) and based on this data we can be quite confident about their lack of clinical deficits, and we have a very high degree of confidence that none of the controls have any neurodegeneration (AD-related or otherwise). Consistent with this assessment, in our EBM analyses, most of the control participants were indeed categorized to the preclinical stages.

      • Inferring the optimal z-scores is a strength, however as different sets of z-scores are allowed per biomarker, there is a concern that the changes reflected are mainly driven by the choice of z-score, rather than the markers themselves (e.g. if lower z-scores are selected for one marker than another, then changes in that marker will appear to be detected earlier, even if both markers change at the same time).

      Response: Indeed, the biomarker sequence depends on the choice of the z-scores per biomarker. However, please note that our choice of z-scores is based on maximizing the sequence likelihood. Therefore, other values of the z-scores will have by construction a smaller likelihood of sequence occurrence compared to the results shown.

      • In equation 2 it is unclear why the gaussian is measured based on a sum over I. The more obvious choice would be to use a multivariate gaussian with no covariance, which would mean taking the product rather than the sum over I.

      Response: We thank the reviewer for pointing this out and we now clarify this point. In this revision, we do not use the term ‘multivariate’. Indeed, the model likelihood assumes independence for each metric’s priors, and hence is the product of each metric’s univariate gaussian probability distribution. This can be seen in equations 1 and 2 of the revision manuscript (Section titled “Event-based sequencing modeling’). The assumption about independent priors is similar to the one used in the original event-based model (see equation (2) in A .L. Young et al., Nature Comm. 9.1 (2018): 4273).

      • In the original event-based model, k is a hidden variable. Presumably that is also the case here, however the notation k=stage(j) makes it seem like each subject is assigned a stage during the sequence optimisation.

      Response: We would like to clarify that the posterior probability of each stage for every subject is estimated during the sequence optimization. To clarify the notation, we have now deleted the term “stage” and use “tj” to denote stages for each subject j. The sequence optimization was performed with the assumption of a uniform prior distribution p(tj=k) = 1/(N+1) for each stage k. Then, the posterior probability p(tj=k|Zj,S), i.e., the probability that subject j belongs to stage k, given the metrics and the sequence, was computed during the sequence optimization procedure.

      • Typically for event-based modeling, positional variance diagrams are created from the markov chain monte carlo samples of the event sequence, enabling visualisation of the uncertainty in the sequence, but these are not included in the study.

      Response: In the revised supplementary file, we have now included positional uncertainty diagrams for the optimal set of z-score events that were created from 50,000 MCMC samples. Please see Appendix 1—figure 2 for the AC-EBM and Appendix 1—figure 9 for the SAC-EBMs.

      • Many of the figures in the manuscript (e.g. Figure 1E/G, Figure 2A/B, Figure 3A/B/E/F/I/J, Figure 4 A/B/E/F/I/J) are based on averages in both the x and the y axis. In the x dimension, individuals have a weighted contribution to the value on the y axis, depending on their stage probability. In the y dimension, the values are averages across those individuals, and the error bars represent the standard error rather than the standard deviation. Whilst the trajectories themselves are interesting, they may not be discriminative at the individual level and may be more heterogeneous than it appears.

      Response: In the current study, the predictions of trajectories are intended at the cohort level. Individual level investigations will be the topic of future investigations.

      • The bootstrapped statistical analyses comparing metrics between the stages do not consider the variability in the sequence.

      Response: Please see the response above. The positional uncertainty diagrams are included in the revised supplementary file.

      Reviewer #2 (Public Review):

      Summary:

      This work presented by Kudo and colleagues is of great importance to strengthen our understanding of electrophysiological changes in the course of AD. Although the main conclusions regarding functional connectivity and spectral power change through the course of the disease are not new and have been largely studied and theorised on, this article offers an innovative approach that certainly consolidates previous knowledge on the topic. Not only that, this article also broadens our knowledge presenting useful and important details on the specificity of frequency and cortical distribution of these early alterations. The main take-home message of this work is the early disruption of electrophysiological signatures that precedes detectable alterations in other more commonly used pathology markers (i.e. gray matter atrophy and cognitive impairment). More specifically, these signatures include long-range connectivity in the alpha and beta bands, and local synchrony (spectral power) in the same frequency bands.

      Response: Thank you for the positive comments and for providing a nice succinct summary.

      Strengths:

      The present work has some major strengths that make it paramount for the advance of our understanding of AD electrophysiology. It is a very well written manuscript that, despite the complexity of the analyses employed, runs the reader through the different steps of the analysis in a pedagogic and clever way, making the points raised by the results easy to grasp. The methodology itself is carefully chosen and appropriate to the nature of the question posed by the researchers, as event-based models are well-suited for cross-sectional data.

      The quality of the figures is outstanding; not only are they aesthetic but, more importantly, the figures convey information exceptionally well and facilitate comprehension of the main results.

      The conclusions of the paper are, in general, well described and discussed, and consider the state-of-the-art works of AD electrophysiology. Furthermore, even though the conclusions themselves are not groundbreaking at all (synaptic damage preceding structural and cognitive impairment is one of the epitomes of the pathological cascading model proposed by Jack in 2010), this article is innovative and groundbreaking in the way they address with clever analyses in a relatively large sample for neuroimaging standards.

      Response: Thank you for the positive comments of the strengths of the paper.

      Weaknesses:

      The main limitation of the work revolves around sample definition and inclusion criteria that are somewhat confusing obscuring some of the points of the analyses. Firstly it is not clear why the purely clinical approach is employed to diagnose the "probable Alzheimer´s Disease" for the 78 participants in the "AD group". In the same paragraph, it is stated that 67 out of the 78 participants show biomarker positivity, thus allowing a more biologically guided diagnosis that is preferred according to current NIA-AA criteria. This would avoid highly possible mixing of different subtypes of dementia etiologies. One might wonder, why would those 11 participants be included if we have strong indications that their symptoms are not due to AD? Furthermore, the real pathological status of the control group is somewhat questionable. The authors do not specify whether common AD biomarkers are available for this subgroup. In that case, it would have highly increased the clarity and interpretability of the results if this group was subdivided in a preclinical and completely healthy control group. This would be particularly interesting since a significant proportion of the control group is labeled as belonging to stages 2,3,4 (MCI) and even 5 (mild dementia). This raises the question of whether these participants are true healthy controls mislabeled by the EBM model, or actual cognitive controls with actual underlying AD pathology well identified by the model proposed.

      Response: Please see responses above to a similar comment from R1. To clarify, all our controls were recruited from an ongoing longitudinal study of healthy aging. Amongst the 70 controls, 39 have confirmed A-beta negative PET scans and 8 were confirmed A-beta positive PET scans, and in the rest of the 23 we do not have any biomarker data available. The biomarker positivity rates in our control cohort are completely consistent with the prevalence of A-beta positivity in cognitively healthy individuals and are within a normal biological continuum for amyloid beta (Jansen WJ et al. 2015). In all the controls, we have conducted comprehensive neuropsychological assessment (see Appendix 1—table 1 in the revised supplementary file) and based on this data we can be quite confident about their lack of clinical deficits, and we have a high degree of confidence that none of the controls have any neurodegeneration (AD-related or otherwise). We include these details in the revision (see the revised ‘Participants’ section in the Materials and methods.).

      Jansen WJ et al., 2015 JAMA; 667 313(19):1924-1938.

      On this note, Figure 2 (C and D) and Figure 3 (C, G and K) show a cortical surface depicting the mean difference of each stage vs the control group, which again, is formed by subjects that can be included (and in fact, are included) in all those stages, obscuring the meaning and interpretability of these cortical distributions.

      Response: We would like to clarify that these figures depict the regional maps of each metric for each stage of AD progression, not the contrast against a control group.

      Reviewer #1 (Recommendations For The Authors):

      • If possible, perform independent validation of the results.

      Response: This is something we indeed intend to examine in our future investigations.

      • Repeat the analysis in the subset of individuals that are amyloid positive.

      Response: Amongst the 78 AD patients, 20 had autopsy confirmed AD neuropathology, an additional 41 patients had molecular pathology identified by Abeta-PET, and another additional 9 had fluid biomarker (CSF) confirmation of amyloid and tau levels consistent with AD diagnosis. Eight remaining patients had a diagnosis of AD with high certainty, based on clinical presentation, neurological assessment, and cortical atrophy on MRI. Given that there are only eight patients who had clinical diagnosis of AD (with no biomarkers), and the comprehensive clinical characterization of all the AD patients in our cohort (Appendix 1—table 1), we do not believe that any subgroup analysis is warranted.

      • When inferring the optimal z-scores, select the same set of z-scores per biomarker, or include diagrams of stage vs z-score that include all of the markers so that it is easy to see how one marker changes relative to the others (overlay Figure 1G on Figure 2A and 2B).

      Response: How the neural synchrony metrics, PHG volume and MMSE scores change relative to each other is exactly what we show in Figures 3 B/F/J and 4 B/F/J. Since each EBM model optimizes the z-score thresholds, sequence likelihood and posterior probability of each stage for each subject, the EBM framework provides the most likely estimate for each metric at every stage. Therefore, the SAC-EBM model gives the most accurate description of the relative differences in these metrics over the AD progression stages. The reviewer’s suggestion to overlay Figure 1G (now figure 1F, based on optimized z-scores for PHG volume and MMSE scores) on Figures 2A and 2B will be inaccurate, as the neural synchrony measures plotted in figures 2A and 2B are not for optimized z-scores.

      • Change equation 2 to use a multivariate gaussian.

      Response: We now clarify that we use a factorized multivariate form that reflects independent priors for each metric which are Gaussian.

      • Clarify whether k is a hidden variable and possibly change the notation.

      Response: We now clarify that in our notation, k is a label for the stage [k=1,..,7 (when I=2) or k=1,...,10 (when I =3)] and is indeed a hidden variable and not observed (but inferred from the EBM). Specifically, the posterior probability for each subject j belonging to stage k was estimated as part of the sequence optimization procedure.

      • Generate positional variance diagrams of the MCMC samples.

      Response: We are doing the MCMC to obtain the most likely sequence. We have now included positional variance diagrams of the optimal set of z-score events in Appendix 1—figure 2 and Appendix 1—figure 9 in the revised supplementary file.

      • It would be interesting to study whether the stages are predictive of conversion or look at longitudinal data, if available.

      Response: This is something we indeed intend to examine in our future investigations.

      • Also look at statistics across MCMC samples of the sequence.

      Response: Thank you for this suggestion. In the Appendix 1—figure 10, we now include an example of the MCMC samples for an SAC-EBM including the alpha-band AEC. We then derived the positional variances for each metric that are now shown in Appendix 1—figure 2 and Appendix 1—figure 9.

      Reviewer #2 (Recommendations For The Authors):

      Some really minor changes are suggested on two specific points that somewhat confused me as a reader and got me stuck in the reading process to try to get the meaning of what I was seeing/reading:

      1. It is not specified (or at least I was unable to find it) what are you comparing exactly for the group comparison in the long-range synchrony metric (AEC) before creating your scalar metric. Are you comparing individual links (in which case you would have 93 link values for each ROI to compare)? Or are you comparing the strength for each ROI (thus, one value -the individual links sum- for each ROI)? I guess it should be the latter for what I see in the figures but it could be useful to specify it.

      Response: The reviewer is correct. We compare the strength of each ROI, i.e., averaging over edges of the symmetric AEC matrix of functional connectivity. We now clarify this in the Amplitude-envelope correlation section and the caption of the revised Appendix 1—figure 6.

      1. In Figure 1 (which, by the way, is exceptionally aesthetic, congratulations for that!) I got stuck for a relatively long time in a really small detail and I am not completely sure if I came to the right conclusion. It is regarding the X axis of the histograms in panels B and D. They are expressed as "PHG volume loss" and "MMSE decline". So I supposed those histograms were showing some kind of subtraction, (maybe from stage X to stage Y, or from group X to group Y). I was trying to understand the histogram and rereading methods to see if I overlooked any description of that graphic and then just realized they might be just the Z-score itself for each group (control and AD) with respect to the whole population. If that is the case I would suggest changing the X-label to "PHG z-score" and "MMSE z-score" avoiding the reference to "loss and "decline" as they are just reflecting the direct transformation to z-score.

      Response: Thank you. We would like to clarify that the z-score for PHG volume and MMSE scores were sign-inverted so that higher values denote “PHG Volume loss” and “MMSE decline”, respectively. We now clarify this point in the revised text and legend for the revised figure 1.

      Lastly, regarding the point I raised in the limitations section of the public review, I understand it might fall out of the scope of eLife reviewing process as it would require a more extensive change of the current manuscript, which is great as it is. But as a reader and researcher in the field, I would have recommended using biomarkers to divide the control group (if available) thus including in the models only those belonging to the AD continuum according to their biomarker status, and leaving those control without any biomarker positivity as the reference group for the figures I mention in that section (those showing differences for each stage in the cortical surface with respect to the control group).

      Response: Please see a similar comment from R1. Amongst the 70 controls, 39 have confirmed A-beta negative PET scans and only 8 were confirmed A-beta positive PET scans, and in the rest of the 23 we do not have any biomarker data available. In all the controls, we have conducted comprehensive neuropsychological assessment (see Appendix 1—table 1 in the revised supplementary file) and based on this data we can be quite confident about their lack of clinical deficits, and we have a high degree of confidence that none of the controls have any neurodegeneration (AD-related or otherwise). Since only 8 participants were confirmed as amyloid positive in the control group and this sample size is small, we do not conduct this recommended re-analysis in this manuscript.

    1. Author Response

      The following is the authors’ response to the original reviews.

      eLife assessment

      The manuscript describes the synergy among PI3Kbeta activators, providing compelling results concerning the mechanism of their activation. The particular strengths of the work arise to a great extent from the reconstitution system better mimicking the natural environment of the plasma membrane than previous setups have. The study will be a landmark contribution to the signaling field.

      Public Reviews:

      Reviewer #1 (Public Review):

      The manuscript aims to provide mechanistic insight into the activation of PI3Kbeta by its known regulators tyrosine phosphorylated peptides, GTP-loaded Rac1 and G-protein beta-gamma subunits. To achieve this the authors have used supported lipid bilayers, engineered recombinant peptides and proteins (often tagged with fluorophores) and TIRF microscopy to enable bulk (averages of many molecules) and single molecule quantitation. The great strength of this approach is the precision and clarity of mechanistic insight. Although the study does not use "in transfecto" or in vivo models the experiments are performed using "physiologically-based" conditions and provide a powerful insight into core regulatory principles that will be relevant in vivo.

      The results are beautiful, high quality, well controlled and internally consistent (and with other published work that overlaps on some points) and as a result are compelling. The primary conclusion is that the primary regulator of PI3Kbeta are tyrosine phosphorylated peptides (and by inference tyrosine phosphorylated receptors/adaptors) and that the other activators can synergise with that input but have relatively weak impacts on their own.

      Although the methodology is not easily imported, for reasons of both cost and the experience needed to execute them well, the results have broad importance for the field and reverse an impression that had built in large parts of the broader signalling and PI3K communities that all of the inputs to PI3Kbeta were relatively equivalent, however, these conclusions were based on "in cell" or in vivo studies that were very difficult to interpret clearly.

      Reviewer #2 (Public Review):

      The manuscript of Duewell et al has made critical observations that help to understand the mechanisms of activation of the class IA PI3Ks. By using single-molecule kinetic measurements, the authors have made outstanding progress toward understanding how PI3Kbeta is uniquely activated by phosphorylated tyrosine kinase receptors, Gbeta/gamma heterodimers and the small G protein Rac1. While previous studies have defined these as activators of PI3Kbeta, the current manuscript makes clear the quantitative limitations of these previous observations. Most previous quantitative in vitro studies of PI3Kbeta activation have used soluble peptides derived from bis-phosphorylated receptors to stimulate the enzyme. These soluble peptides stimulate the enzyme, and even stimulate membrane interaction. Although these previous studies showed that the release of p85-mediated autoinhibition unmasks an intrinsic affinity of the enzyme for lipid membranes, they ignored what would be the consequence of these peptide sequences being present in the context of intrinsic membrane proteins. The current manuscript shows that the effect of membrane-conjugated peptides on the enzyme activity is profound, in terms of recruiting the enzyme to membranes. In this context, the authors show that G proteins associated with the membranes have an important contribution to membrane recruitment, but they also have a profound allosteric effect on the activity on the membrane, These are observations that would not have been possible with bulk measurements, and they do not simply recapitulate observations that were made for other class IA PI3Ks.

      An important observation that the authors have made is that Gbeta/gamma heterodimers and RAc1 alone have almost no ability to recruit PI3Kbeta to the membranes that they are using, and this is central to one of the most profoundly novel activation mechanisms offered by the manuscript. The authors propose that the nSH2- and Gbeta/gamma binding sites partially overlap, so that Gbeta/gamma can only bind once the nSH2 domain releases the p110beta subunit. This mechanism would mean that once the nSH2 is engaged by membrane-conjugated pY, the Gbg heterodimer can bind and increase the association of the enzyme with membranes. Indeed, this increased membrane association is observed by the authors. However, the authors also show that this increased recruitment to membranes accounts for relatively little increase in activity, and that the far greater component of activation is due to an allosteric effect of the membrane association on the activity of the enzyme. The proposal for competition between Gbg binding and the nSH2 is consistent with the behavior of an nSH2 mutant that cannot bind to pY and which, consequently, does not vacate the Gbg-binding site. In addition to the outstanding contribution to understanding the kinetics of activation of PI3Kbeta, the authors have offered the first structural interpretation for the kinetics of Gbg activation in synergy with pY activation. The proposal for an overlapping nSH2/Gbg binding site is supported by predictions made by John Burke, using alphafold multimer. Although there is no experimental structure to support this structural model, it is consistent with HDX-MS analyses that were published previously.

      Reviewer #1 (Recommendations For The Authors):

      1. The approx relative concentrations (surface densities ) of Rac1-GTP, GBetagammas and PY-peptides used in experiments in Fig 1 are not easy to understand and useful to give an intuitive feel for the relative sensitivity of the PI3Kbeta reporter to those inputs.

      In our revised manuscript, we provide densities of the individual signaling inputs used to reconstitute Dy647-PI3Kβ membrane recruitment (see Figure legend 1). We provide a more detailed explanation about our quantification method in subsequent figures where the membrane surface density of signaling inputs is varied to modulate the strength of PI3Kβ membrane localization and activity.

      Building off the quantification of Rac1-GTP and pY membrane density measurements presented in our initial manuscript submission, we now include an estimate of the GβGγ membrane density. For these new measurements, we recombinantly expressed and purified additional SNAP-GβGγ protein, which we fluorescently labeled with AlexaFluor 555. The membrane surface density of GβGγ was quantified at equilibrium using a combination of AF488-SNAP-GβGγ (bulk signal) and dilute AF555-SNAP-GβGγ (0.0025%), which allowed us to resolve and count the single molecule density (Figure 3A). We calculate the total surface density of GβGγ based on the AF555-SNAP-GβGγ dilution factor. In the methods section titled, “surface density calibration,” we describe our protocol.

      1. The estimates of the PIP3 concentrations/densities measured using the BTK reporter seem good but its unclear (to me) how they were derived.

      The density of PI(3,4,5)P3 lipids in our supported lipid bilayers was calculated based on the incorporation of a define molar ratio of PI(3,4,5)P3 in our small unilamellar vesicles. Based on the average footprint of 0.72 nm2 for a single lipid, we calculated the density of lipids per µm2. In the methods section titled, “kinetic measurements of PI(3,4,5)P3 lipid production,” we include the following description:

      “Assuming an average footprint of 0.72 nm2 for phosphatidylcholine (Carnie et al., 1979; Hansen et al., 2019), we calculated a density of 2.8 × 104 PI(3,4,5)P3 lipids/μm2 for supported membranes that contain an initial concentrations of 2% PI(4,5)P2. We assume that the plateau fluorescence intensity of the AF488-SNAP-Btk sensor following reaction completion in the presence of PI3Kβ represents the production of 2% PI(3,4,5)P3. The bulk membrane intensity of AF488-SNAP-Btk was normalized from 0 to 1, and then multiplied times the total density of PI(3,4,5)P3 lipids to generate kinetic traces that report the kinetics of PI(3,4,5)P3 production.”

      Minor points

      l164; Rac1(GTP) AND GBeta gammas. In this context it should be OR. Or have I misunderstood?

      l1093; kineticS measurementS.

      Thank you for pointing out these typos. We made the appropriate edits.

      The paper of Suire etal (Suire, S., Lécureuil, C., Anderson, K. E., Damoulakis, G., Niewczas, I., Davidson, K., Guillou, H., Pan, D., Jonathan Clark, Phillip T Hawkins, & Stephens, L. (2012). GPCR activation of Ras and PI3Kc in neutrophils depends on PLCb2/b3 and the RasGEF RasGRP4. The EMBO journal, 31(14), 3118-3129. https://doi.org/10.1038/emboj.2012.167) make the point that in vivo it appears that although Ras-activation is required for full activation of PI3Kgamma (and can activate PI3Kgamma in vitro directly) if you use tools to activate Ras in the absence of receptor and Gbetagamma signalling, it has no affect on PIP3 . This directly supports the authors conclusions.

      Thank you for sharing this citation. We incorporated the reviewer’s insight into our discussion section to broaden the significance of our work.

      Reviewer #2 (Recommendations For The Authors):

      There are only a few relatively minor points that could be addressed to improve the paper:

      1. Why is the density still going up after 10 minutes in Figure 1 Figure supplement 2? Doesn't this seem like a very long time? Are we seeing fast on/off combined with fast on/slow off? Are the particles eventually becoming stuck in odd places or are they slowly denaturing?

      Our movies do not indicate a slow accumulation of immobilized or stuck Dy647-PI3Kβ particles on the membrane surface. On the long timescale, we believe that a small fraction of Dy647-PI3Kβ molecular do exhibit longer dwell times on membranes containing a high density of pY (>6,000 molecules/µm2). This is likely due to membrane hopping of Dy647-PI3Kβ. In other words, rather than Dy647-PI3Kβ dissociating from the membrane surface directly into the solution, the Dy647-PI3Kβ molecule immediately rebinds to another membrane conjugated pY peptide. This type of behavior of a peripheral membrane binding protein is generally correlated with there being a higher surface density of the binding partner (Yasui et al., 2014). Characterization of potential Dy647-PI3Kβ membrane hopping will require additional experimentation (e.g. PI3Kβ mutants) and quantitative analysis that goes beyond the scope of this study.

      1. Lines 188-189. "By quantifying the average number of Alexa488-pY particles per unit area of supported membrane we calculated the absolute density of pY per μm2 (Figure 2D). I think this should be Figure 2C, right hand y-axis.

      Thank you for identifying our typo. We’ve corrected the text for clarity.

      1. Lines 102-193. "When Dy647-PI3Kβ was flowed over a membrane containing a low density of {less than or equal to} 500 pY/μm2, we observed rapid equilibration kinetics consistent with a 1:1 binding stoichiometry (Figure 2E).” There is no density shown in Fig. 2E. There is only "membrane intensity." Perhaps it was their intent to include a right-hand axis with density (number of particles/area), as they did in Figure 2C. However, they did not, so Figure 2E does not support the text. The value of Intensity/#py/um**2 does not appear to be the same for Figure 2C as for Figure 2E, assuming that the statement in the text is correct. The authors should include the density as a right-hand axis in 2E.

      We have reworded this portion of the results section for clarity. In reading the reviewers comment, we recognize that a more convincing way to support our claim of a 1:1 binding stoichiometry would be to show that there are ~500 Dy647-PI3Kβ/μm2 membrane bound complexes when the pY surface density equals ~500 pY/μm2. For us to make this connection, we would need to perform experiments using a Dy647-PI3Kβ concentration that fully saturates all the binding pY binding sites. However, at this elevated Dy647-PI3Kβ solution concentration, individual Dy647-PI3Kβ complexes can start to bind to a single phosphotyrosine of the dually phosphorylated peptide due to competition for pY binding sites. As an alternative to performing the experiment described above, we can infer binding stoichiometry from the shape of the membrane absorption kinetic traces. For example, a simple bimolecular interaction exhibits rapid equilibration kinetics with a hyperbolic shaped kinetic trace. Systems that have more complex binding equilibria, however, generally take longer to equilibrate (due to the change in KOFF) and can often be broken down into 2 or 3 distinct dissociation constants (KD). This type of kinetic analysis has previously been used to describe multivalent membrane binding interactions for the Btk-PI(3,4,5)P3 (Chung et al., 2019) and PI3Kγ-GβGγ (Rathinaswamy et al., 2021) complexes. Considering that there are multiple interpretations of the Dy647-PI3Kβ membrane absorption traces show in Figure 2E, we refrain from saying that our results explicitly reveal a 1:1 binding stoichiometry. Instead, we provide several possible explanations for the results. Ultimately, additional experiments and kinetic modeling of wild type and mutant PI3Kβ is necessary to define the binding stoichiometry under different conditions.

      1. Table 1. The authors have analysed the data to extract two dwell times and two diffusion coefficients. The legend should make this clear, referring to D1 as the slow diffusion component and D2 as fast diffusion, similarly, there are short and long dell times. This should be stated in the legend. There are two columns labelled "alpha". This presumably should be alpha1 and alpha2, the fractions of particles with short and long dwell times. The table legend should clarify this.

      In our revision, additional text has been added to the figure legends and Table 1.

      Text from Table 1: “Alpha (α) equals the fraction of molecules with the characteristic dwell time, τ1 (DT = dwell time). The fraction of molecules with the characteristic dwell time, τ2, equals 1-α. Alpha (αD) equals the fraction of molecules with the characteristic diffusion coefficient, D1. The fraction of molecules with diffusion coefficient, D2, equals 1-αD.”

      1. In the legend for Figure 5 figure supplement 1, for part D, the "Cumulative membrane of binding events..." The "of" should be deleted.

      Thank you for identifying this typo.

      1. Lines 423-426: "We found that PI3Kβ kinase activity is also relatively insensitive to either Rac1(GTP) or GβGγ alone. This is in contrast to previous reports that showed Rho-GTPases (Fritsch et al. 2013) and GβGγ (Katada et al. 1999; Hashem A. Dbouk et al. 2012; Maier, Babich, and Nürnberg 1999) can activate PI3Kβ, albeit modest, compared to synergistic activation with pY peptides plus Rac1(GTP) or GβGγ." It is not clear what this statement means. On the surface, it might be interpreted as saying that these previous studies had some flaw that led the authors to conclude that there is some activation caused by Rac1 or Gbeta/gamma on their own. The current manuscript is an important contribution to understanding the mechanism of synergistic activation, but it is also true that the Hansen and his colleagues have not used the same membranes as were used previously. The authors state that they have used a wide range of membrane compositions, but the only ones that have appeared in the manuscript are nearly pure PC (with 2% PIP2) or PC with 20% PS. Extensive studies with varying membrane compositions are beyond the scope of the current study, since the current manuscript concisely makes important observations regarding mechanism. However, it would be helpful for readers if the authors at least mention the differences in membrane compositions among the studies.

      The reviewer raises an important point concerning our interpretation of PI3Kβ activation data in relationship to existing literature. In our original submission, we made conclusions concerning how individual signaling inputs modulate PI3Kβ activity, without showing all our data or providing sufficient explanation. In our revised manuscript, we include PI3Kβ kinase activity measurements performed in the presence of either pY, Rac1(GTP), or GβGγ alone (Figure 5B-5C). These experiments were reconstituted on supported membranes in the absence or presence of 20% PS lipids. We found that increasing the density of anionic lipids increased the overall activity of PI3Kβ in the presence of pY or GβGγ alone. This is consistent with a subtle increase in PI3Kβ membrane affinity due to the negatively charged PS lipids. Mutations that disrupt the direct interaction between PI3Kβ and GβGγ eliminated the observed lipid kinase activity. We were unable to detect PI3Kβ activity in the presence of Rac1(GTP) alone. In conclusion, we’re able to detect some PI3Kβ activity in the presence of GβGγ alone, which is consistent with previous reports (Dbouk et al., 2010; Katada et al., 1999; Maier et al., 2000). In the future, a more comprehensive analysis will be required to map the relationship between PI3Kβ activity, membrane localization, and lipid composition. For example, previous reconstitutions have revealed differential activation of PI3Kα that depends on the most abundant lipid being phosphatidylethanolamine (PE) rather than phosphatidylcholine (PC) (Hon et al., 2012; Ziemba et al., 2016). PE lipids comprise 25-30% of the cellular plasma membrane (Yang et al., 2018) and have been used in previous studies to measure PI3K lipid kinase activity on small unilamellar vesicles (Dbouk et al., 2010; Hon et al., 2012).

      In this study, we elected to use a simplified membrane composition that minimized non-specific membrane localization of fluorescently labeled PI3Kβ. This allowed us to more clearly define the strength of individual and combinations of protein-protein interactions that regulate PI3Kβ localization and kinase activity. When reconstituting amphiphilic molecules (i.e. lipids) in aqueous solution a variety of structures, including micelles, inverted micelles, and planar bilayers can form based on the lipid composition (Kulkarni, 2019). The organization of these membrane structures is related to the molecular packing parameter of the individual phospholipids (Israelachvili et al., 1976). The packing parameter (P=v⁄((a•l_c))) depends on the volume of the hydrocarbon (v), area of the lipid head group (a), and the lipid tail length (l_c). When generating supported lipid bilayers on a flat two-dimensional glass surface, we aim to create a fluid lamellar membrane. We find that phosphatidylcholine (PC) lipids are ideal for making supported lipid bilayers because they have a packing parameter of ~1 (Costigan et al., 2000). In other words, PC lipids are cylindrical like a paper towel roll. In contrast, cholesterol and phosphatidylethanolamine (PE) lipids have packing parameters of 1.22 and 1.11, respectively (Angelov et al., 1999; Carnie et al., 1979). This gives cholesterol and PE lipids an inverted truncated cone shape, which prefers to adopt a non-lamellar phase structure. Due to the intrinsic negative curvature of PE lipids, they can spontaneously form inverted micelles (i.e. hexagonal II phase) in aqueous solution when they are the predominant lipid species (Israelachvili et al., 1980; Kobierski et al., 2022; Wnętrzak et al., 2013). In the methods section of our manuscript, we note that from our experience incorporation of PE lipids dramatically reduces the protein-maleimide coupling efficiency, displayed more membrane defects, and resulted in a larger fraction of surface immobilized Dy647-PI3Kβ. This could be related to the intrinsic negative curvature of PE membranes. However, further investigation is needed to decipher these issues.

      Angelov B, Ollivon M, Angelova A. 1999. X-ray Diffraction Study of the Effect of the Detergent Octyl Glucoside on the Structure of Lamellar and Nonlamellar Lipid/Water Phases of Use for Membrane Protein Reconstitution. Langmuir 15:8225–8234. doi:10.1021/la9902338

      Carnie S, Israelachvili JN, Pailthorpe BA. 1979. Lipid packing and transbilayer asymmetries of mixed lipid vesicles. Biochim Biophys Acta 554:340–357. doi:10.1016/0005-2736(79)90375-4

      Chung JK, Nocka LM, Decker A, Wang Q, Kadlecek TA, Weiss A, Kuriyan J, Groves JT. 2019. Switch-like activation of Bruton’s tyrosine kinase by membrane-mediated dimerization. Proc Natl Acad Sci 116:10798–10803. doi:10.1073/pnas.1819309116

      Costigan SC, Booth PJ, Templer RH. 2000. Estimations of lipid bilayer geometry in fluid lamellar phases. Biochim Biophys Acta 1468:41–54. doi:10.1016/s0005-2736(00)00220-0

      Dbouk HA, Pang H, Fiser A, Backer JM. 2010. A biochemical mechanism for the oncogenic potential of the p110 catalytic subunit of phosphoinositide 3-kinase. Proc Natl Acad Sci 107:19897–19902. doi:10.1073/pnas.1008739107

      Hansen SD, Huang WYC, Lee YK, Bieling P, Christensen SM, Groves JT. 2019. Stochastic geometry sensing and polarization in a lipid kinase–phosphatase competitive reaction. Proc Natl Acad Sci 116:15013–15022. doi:10.1073/pnas.1901744116

      Hon W-C, Berndt A, Williams RL. 2012. Regulation of lipid binding underlies the activation mechanism of class IA PI3-kinases. Oncogene 31:3655–3666. doi:10.1038/onc.2011.532

      Israelachvili JN, Marcelja S, Horn RG. 1980. Physical principles of membrane organization. Q Rev Biophys 13:121–200. doi:10.1017/s0033583500001645

      Israelachvili JN, Mitchell DJ, Ninham BW. 1976. Theory of self-assembly of hydrocarbon amphiphiles into micelles and bilayers. J Chem Soc Faraday Trans 2 Mol Chem Phys 72:1525–1568. doi:10.1039/F29767201525

      Katada T, Kurosu H, Okada T, Suzuki T, Tsujimoto N, Takasuga S, Kontani K, Hazeki O, Ui M. 1999. Synergistic activation of a family of phosphoinositide 3-kinase via G-protein coupled and tyrosine kinase-related receptors. Chem Phys Lipids 98:79–86. doi:10.1016/S0009-3084(99)00020-1

      Kobierski J, Wnętrzak A, Chachaj-Brekiesz A, Dynarowicz-Latka P. 2022. Predicting the packing parameter for lipids in monolayers with the use of molecular dynamics. Colloids Surf B Biointerfaces 211:112298. doi:10.1016/j.colsurfb.2021.112298

      Kulkarni CV. 2019. Calculating the “chain splay” of amphiphilic molecules: Towards quantifying the molecular shapes. Chem Phys Lipids 218:16–21. doi:10.1016/j.chemphyslip.2018.11.004

      Maier U, Babich A, Macrez N, Leopoldt D, Gierschik P, Illenberger D, Nürnberg B. 2000. Gβ 5 γ 2 Is a Highly Selective Activator of Phospholipid-dependent Enzymes. J Biol Chem 275:13746–13754. doi:10.1074/jbc.275.18.13746

      Rathinaswamy MK, Dalwadi U, Fleming KD, Adams C, Stariha JTB, Pardon E, Baek M, Vadas O, DiMaio F, Steyaert J, Hansen SD, Yip CK, Burke JE. 2021. Structure of the phosphoinositide 3-kinase (PI3K) p110γ-p101 complex reveals molecular mechanism of GPCR activation. Sci Adv 7:eabj4282. doi:10.1126/sciadv.abj4282

      Wnętrzak A, Lątka K, Dynarowicz-Łątka P. 2013. Interactions of alkylphosphocholines with model membranes-the Langmuir monolayer study. J Membr Biol 246:453–466. doi:10.1007/s00232-013-9557-4

      Yang Y, Lee M, Fairn GD. 2018. Phospholipid subcellular localization and dynamics. J Biol Chem 293:6230–6240. doi:10.1074/jbc.R117.000582

      Yasui M, Matsuoka S, Ueda M. 2014. PTEN Hopping on the Cell Membrane Is Regulated via a Positively-Charged C2 Domain. PLoS Comput Biol 10:e1003817. doi:10.1371/journal.pcbi.1003817

      Ziemba BP, Burke JE, Masson G, Williams RL, Falke JJ. 2016. Regulation of PI3K by PKC and MARCKS: Single-Molecule Analysis of a Reconstituted Signaling Pathway. Biophys J 110:1811–1825. doi:10.1016/j.bpj.2016.03.001

    1. Author Response

      The following is the authors’ response to the original reviews.

      Reviewer #1:

      We thank the referee for the positive review.

      Reviewer #2 (Public review):

      We thank the referee for his/her constructive comments

      1. The weakness of this work is the lack of clarification on the function of eIF2A in general. The novelty of this study was limited.

      We believe our study is valuable in providing strong evidence that eIF2A does not functionally substitute for eIF2 in tRNAi recruitment even when eIF2 function is impaired, and in showing that it does not contribute to translational control by uORFs or IRESs, thus ruling out the most likely possibilities for its function in yeast based on studies of the mammalian factor. We agree that the function of yeast eIF2A remains to be identified; however, we think this should be regarded as a limitation rather than a weakness in experimental design or data obtained in the current study.

      1. Related to this, it would be worth investigating common features in mRNAs selectively regulated (surveyed in Figure 3A).

      We did not embark on this because only 17 of the 32 transcripts showing TE reductions in Fig. 3A showed a pattern of TE changes consistent with a conditional requirement for eIF2A under conditions of reduced eIF2 function, exhibiting greater TE decreases when both eIF2 function was impaired by phosphorylation and eIF2A was eliminated from cells. Moreover, we could validate this conditional eIF2A dependence by LUC reporter for only a single mRNA, HKR1.

      Also, it would be worth analyzing the effect of eIF2A deletion on elongation (ribosome occupancy on each codon and/or global ribosome footprint distribution along CDS) and termination/recycling (footprint reads on stop codon and on 3′ UTR).

      We have analyzed the effects of deleting eIF2A on ribosome pausing at individual codons by calculating tri-peptide pause scores from our ribosome profiling data. The results shown in new Fig. 7 reveal that eIF2A plays no discernible role in stimulating the rate of decoding of any three-codon combinations.

      1. Regarding Figure 3D, the reporters were designed to include promoter and 5′ UTR of the target genes. Thus, it should be worth noting that reporter design was based on the assumption that eIF2A-dependency in translation regulation was not dependent on 3′ UTR or CDS region. The reason why the effects on ribosome profiling-supported mRNAs could not be recapitulated in reporter assay may originate from this design. This should be also discussed.

      We agree and included this stipulation in the DISCUSSION, while at the same time noting that the native mRNAs were examined in the orthogonal assay of polysome distributions.

      1. Related to the point above, the authors claimed that eIF2A affects "possibly only one" (HKR1) mRNA. However, this was due to the reporter assay which is technically variable and could not allow some of the constructs to pass the authors' threshold. Alternative wording for this point should be considered.

      We agree and revised text in the DISCUSSION to read: “A possible limitation of our LUC reporter analysis in Fig. 3D was the lack of 3’UTR sequences of the cognate transcripts, which might be required to observe eIF2A dependence. Given that native mRNAs were examined in the orthogonal assay of polysome profiling in Fig. 3E, the positive results obtained there for SAG1 and SVL3 in addition to HKR1 should be given greater weight. Nevertheless, our findings indicate a very limited role of yeast eIF2A in providing a back-up mechanism for Met-tRNAi recruitment when eIF2 function is diminished by phosphorylation of its α-subunit.”

      1. For Figure 3D, it would be worth considering testing the #-marked genes (in Figure 3C) in this set up.

      Actually, we did test 10 of the 17 mRNAs marked with “#”s in the reporter assays of Fig. 3C, which had been noted in the Fig. 3C legend.

      1. In box plots, the authors should provide the statistical tests, at least where the authors explained in the main text.

      At the first occurrence of a notched box plot (Fig. 2D), we explained in the main text that in all such plots, when the notches of different boxes do not overlap, their median values differ significantly with a 95% confidence level. In cases where overlaps between notches is difficult to assess by eye, we added the results of Mann-Whitney U tests with the p values indicated by asterisks, as explained in the legends. We added results of additional Mann-Whitney U tests to such box plots in Figs. 3B, 6A-C, and 6-supp. 1E & G and mentioned this in the corresponding legends.

      Reviewer #2 (Recommendations For The Authors):

      The first section of "Yeast eIF2A does not play a prominent role as a functional substitute for eIF2 in the presence or absence of amino acid starvation" can be subdivided into a couple of sections for better readability.

      Done.

      Although the authors have used SM to induce ISR in yeasts previously, the validation of eIF2alpha phosphorylation in Western blot would be helpful for readers. Also, it should be worth testing whether eIF2alpha phosphorylation was properly induced in eIF2A KO cells.

      The translational induction of GCN4 mRNA, which we have documented in WT and eIF2A∆ cells, provides a quantitative read-out of eIF2 functional attenuation superior to determining the proportion of eIF2α that is phosphorylated.

      For Figure 2B, the Venn diagram that shows the overlap between TE-changes genes in WT_SM/WT and those in eIF2A∆_SM/eIF2A∆ would be helpful (although a list was provided by the source data).

      The Venn diagram has been provided in a new figure, Figure 2-figure supplement 1B.

      For Figures 1C and 5A-B, the depiction of the positions of uORFs within the orange gene region would be helpful for readers.

      Done.

      For Figure 4A-C, the depiction of the IRES regions (if known) within the orange gene region would be helpful for readers.

      Done for the URE2 IRES, whose location is known.

      For Figures 1C, 4A-C, and 5A-B, the y-axis should have a label/scale.

      Added.

      For Figure 3C, the definition of #-marked genes should be concretely described (e.g., value range) in the legend.

      Added.

      For Figure 3D-E, the statistical test has been only shown in a couple of data. A full depiction of the statistical results for all the data sets may be helpful for readers.

      We explained that when notches in box plots do not overlap, their medians differ with 95% confidence. In cases where overlaps were difficult to discern, we added p values from Mann-Whitney U tests to the relevant box plots.

      For Figure 3E, it would be helpful if the authors could show the UV spectrum of the sucrose density gradient to show the regions isolated for the experiments.

      Added for a representative replicate gradient in the new figure, Figure 3-figure supplement 1.

      Reviewer #3 (Public Review):

      We thank the referee for his/her positive assessment of our study.

      Weaknesses:

      While no role of eIF2A in translation initiation is apparent, the authors do not determine what function eIF2A does play in yeast. Whether it plays a role in regulating translation in a different stress response is not determined.

      We agree that there are many additional possibilities to consider for functions of eIF2A in translation initiation, including different stress situations or mutant backgrounds; however, we regard this as a limitation rather than a weakness in the experimental design and data obtained in the current study in which we examined the most likely possibilities for eIF2A function in yeast based on studies of the mammalian factor.

      Reviewer #3 (Recommendations For The Authors):

      Curiously, the authors indicate that they could not replicate published results for eIF2A's repressor function for URE2, PAB1, or GIC1 translation. This is a little concerning and one wonders if the yeast strain used in the previous study is different in some way from the authors' strain. Did the authors obtain that strain to test it in their assays?

      The same WT and eIF2A∆ strains have been analyzed here and in the two cited studies on yeast IRESs.

      The authors do discuss the fact that eIF2A may function to regulate translation in response to different stresses. It would have been a strength to test an alternative stress in the current study. However, I also appreciate that this could be the subject of a future study.

      Agreed.

      One minor question I have is whether the yeast strains used possess L-A dsRNA virus? While it may not be that this virus would necessarily mask a role of eIF2A-dependent translation, do the authors have any specific thoughts on this? Would different results be obtained if cured strains were used?

      According to Ravoityte et al. (doi: 10.3390/jof8040381), the S. cerevisiae strain we employed, BY4741, harbors L-A-1 dsRNA; however, we have not explored whether curing the virus would alter the consequences of eliminating eIF2A.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements We appreciate the insightful reviewer comments. Both reviewers alluded to the logical lack of connection between two themes in the original paper. Specifically, we showed that N-cad differentially regulates migration in different environments, and that leader and follower cells differ phenotypically, but did not connect the two themes. In this revised version, we've performed additional experiments and undertaken a comprehensive reorganization of both the manuscript and figures. The major changes are outlined below:

      2. Figure 4 A-C has been moved to Figure 6 F-H.

      3. Figure 5 has been moved to Figure S3 F-H.
      4. Figure 6 F has been moved to Figure 7 A.
      5. Figure 6 G-H have been moved to Figure 7 D-E.
      6. Figure 6 I-J have been moved to Figure S5 A-B.
      7. Figure 7 C-F have been moved to Figure S5 C-F.
      8. Added transcriptome profiling of control and N-cad-depleted cells and of leader and follower cells (Figures 6 E, S1 H and S4 C-D, Tables S2 and S3). We have incorporated additional figures (Figure 4 and 5 in the revised manuscript) to support the idea that the amount of N-cad at the cell surface is regulated by endocytic recycling, thereby stimulating glioma migration in the different local environments. Furthermore, our new findings showed that YAP1/TAZ regulates the surface level of N-cad during glioma migration (Figure 8). We trust that these additions contribute to the clarity and robust justification of our paper.

      Similar to other types of tumors, our findings revealed that pediatric high-grade gliomas migrate collectively, possibly contributing to a more aggressive invasion than single cells. In this study, we found that N-cad mediates this collective glioma migration. Interestingly, within these migrating groups, leader and follower cells dynamically interchange positions during migration, accompanied by changes their phenotypic characteristics. This suggests that differences in phenotypes, including N-cad recycling, proliferation and YAP activation, may be predominantly regulated by cell-extrinsic factors rather than being predetermined by genetic or epigenetic factors. Moreover, our new RNA-sequencing results indicate minimal difference between leader and follower cells, except for the upregulation of YAP response and wound healing migration genes in leader cells. Although genomic alterations still possibly encode the leader-follower exchange, our findings strongly suggest that the activation of YAP1 and glioma migration are regulated by the cellular context, specifically where cells are located within the group.

      Contrary to our initial findings suggesting a positive feedback loop between N-cad endocytosis and nuclear YAP1, our revised data indicates that nuclear YAP appears to be independent of N-cad. We observed that homotypic interactions with N-cad present in the surrounding environment, such as neurons (Figure 6 C-D) or N-cad extracellular domain-coated surface (Figure 7 B-C), did not affect nuclear YAP1. However, YAP1/TAZ depletion decreased N-cad expression and altered its localization at the surface (Figure 8). This leads us to propose a revised model where nuclear YAP1 stimulates surface N-cad, thereby facilitating the distinct modes of migration on ECM and neurons (Figure 8 I).

      1. Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Kim and colleagues describe the role of N-Cadherin during pediatric glioma migration. They compare cell lines that have similar transcripts but different levels of N-Cadherin protein and find that N-Cadherin levels influence the route of migration - whether it be on ECM or other tissues. They also describe molecular feedback between N-Cadherin and YAP in leader vs follow cells of their systems. The data are clear, well presented, and convincing; and the conclusions described by the manuscript are mostly justified. My major criticism of the manuscript is that the line of questioning undertaken does not appear well justified. At many points, I was left asking "but why are they doing this?" and I could not understand the rationale for some of the experiments that were performed (even if they were performed well). The manuscript opens by validly describing how gliomas are highly invasive, poorly understood and that N-Cadherin was highly expressed in comparison to other adhesion proteins. This opened the path for the questions and experiments performed that contributed to Figures 1-3, which I thought were interesting. From there on, I found the logic of the story unclear and poorly justified. For example, I do not know why leader and follower cells were justified - when it had nothing to do with N-Cadherin which was the focus of the work prior. And then, having rightly concluded in Figure 4 that the data suggested that leader and follower cells dynamically exchange positions rather than being pre-determined, they went onto further figures focusing on differences between leader and follower cells, which left my quite confused. I am likewise confused by the model proposed in that, they authors describe that the difference between leader and follower cells contributes to a nuclear YAP/N-Cad endocytosis feedback loop that feeds into the speed of migration. Yet, the authors describe earlier that leader and follower cells frequently exchange positions, with no evidence that they are pre-determined. How do the authors square these seemingly conflicting points? And further, what is the relevance of this to understanding the differing modes of migration (on ECM or other tissues)? On this issue, I suggest authors re-consider whether the order of figures or logic of the story is appropriate (perhaps consider moving some figures to supplement?), and to clearly justify in the text the elements that are being addressed. Overall, I think the messaging, logic and justification could be use significant improvement; the experiments however are well performed, and the figures are very clear and nicely presented, and I don't have any qualms about them.

      We appreciate your insightful comments, recognizing the need for logical and justifiable improvements in certain sections of our previous manuscript. Please see Section 1, General Statements, for an explanation of changes made.

      Minor Comments

      1. Not required, but the authors may wish to consider putting t=0 pictures of the experiments in the supplement as supportive evidence for the circles of the initial seeding location they show in Fig 1.

      We provide the t=0 images in Figure S1 N and O.

      1. I assume the title of the second results section should say "migration speed" rather than "speed migration"

      The new title of the second results section is “N-cad stimulates and inhibits migration through intercellular homotypic interaction”.

      1. Fig. 4D - Are both example cell pictures leaders? If so, I'm not sure why two have been provided; I'm guessing the bottom set are supposed to be follower cells. If so, please label as appropriate. (And if not, a representative set of pictures from a follower cell should be provided).

      We have enhanced the clarity of the labels. We provide representative high magnification images of leader and follower cells. The updated figure can be found in Figure 5 A.

      1. Figure 5 Legend - the title of this figure is too definitive, and exaggerates further than the main text does, which was correct in saying that the experiments only suggest that N-Cadherin endocytosis might regulate the localisation of b-catenin and p120-catenin. Probably I would go further and say that there is no experimental evidence provided that even suggests that in the first place, and that this is a hypothesis that remains to be tested. The authors should inhibit endocytosis specifically (rather than just depleting N-Cad) and see the effect, to justify their conclusion.

      We appreciated your points and concerns. Following your earlier suggestion, we have moved the figure to the supplementary section (Figure S3 F-H). Moreover, we have addressed the reciprocal regulation of N-cad and catenins by knocking down p120-, β- or α-catenin. Our new findings showed that p120-, β- or α-catenin depletion decrease the amount of N-cad at the cell surface, not steady-state protein level, resulting in decreased migration on astrocytes but increased migration on ECM (see Figure 4). These findings support the idea that catenins play a role in glioma migration according to the environment by altering surface N-cad level. With that, we updated the figure title to “Catenins regulate N-cad surface levels to stimulate or inhibit migration.”

      Reviewer #1 (Significance (Required)):

      The manuscript provides a characterised of invasive glioma migration that was previously lacking. It also provides interesting observations related to the role of N-Cadherin for different modes of migration (on ECM or on tissues) that will be of interest for further exploration. It makes a good advance in terms of addressing a highly invasive cell type that has poor prognosis. I anticipate that now this initial characterisation has been performed, authors and others will be interested in gaining a deeper understanding as to how these two modes of migration are controlled, how there might be interplay between them and how such findings contribute to its highly invasive nature. I have expertise in collective cell migration and directed cell migration.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary In the submitted manuscript, Kim et al. describe various aspects of N-cadherin function in the collective migration of PBT-05 cells, a pediatric high-grade glioma line, on laminin, 3D-matrigel, neurons or astrocytes. N-cadherin promotes the collective migration on neurons or astrocytes, whereas it suppresses the migration on laminin or 3D-matrigel. The authors also show that N-cadherin is actively internalized and recycled in the leader, but not follower, cells of the collective, which induce the nuclear accumulation of YAP/TAZ proteins. YAP/TAZ proteins are shown to regulate the collective migration.

      Thank you for the comments. Please see Section 1, General Statements, for a summary of changes made. Please also note that our new experiments failed to show that N-cad levels or traffic regulate YAP/TAZ nuclear accumulation. Rather, YAP/TAZ are regulated by cell density independent of N-cad, and YAP/TAZ regulate N-cad protein levels and traffic independent of changes in N-cad RNA levels

      Major comments

      1. In Fig. 1G, N-cadherin knockdown seems to affect the distribution of astrocytes. The authors should stain a marker for astrocytes, instead of actin, and the red alone images should be provided.

      Astrocytes were cultured for 4 days to generate 3D scaffolds before adding the glioma spheroid, essentially as described (Gritsenko et al., Histochem Cell Biol, 2017). Co-cultures were stained for human-specific vimentin (glioma) or actin (glioma and astrocytes) (see Figure 1 G and separate channels in new Figure S1 P). There do not appear to be major changes in astrocyte organization outside the migration front, but we lack a way to stain for astrocytes specifically and cannot visualize astrocytes under the glioma cells. It remains possible that astrocytes may be affected differently by contact with control and N-cad-deficient glioma cells. However, we added a new experiment, assaying migration on decellularized astrocyte ECM. While N-cad stimulated migration on astrocytes it inhibited migration on astrocyte ECM (Figures 1 I and J and S1 Q). Thus N-cad stimulates glioma migration on astrocyte cells and not their ECM.

      1. The colocalization between N-cadherin and Rab11 may not be high in Figs. 4F and S2B. It is unclear whether the majority of the internalized N-cadherin is recycled to the plasma membrane. In Fig. S2B, the internalized N-cadherin may be located mainly at the early endosomes before transported to the recycling endosomes (Is it 20 min after the N-cadherin antibody internalization?). First, the authors should analyze the colocalization between the N-cadherin and Rab11 at 30-40 min after the internalization. If the colocalization with Rab11 would be still low at that time point, some of the internalized N-cadherin might be degraded in the lysosomes. To test this possibility, the authors should analyze the colocalization between N-cadherin and LAMP1 under the treatment with a lysosome inhibitor.

      At steady state, N-cad co-localized better with Rab5 than with Rab11 or LAMP1 (Figure 5 C-D). In kinetics experiments, N-cad antibodies were internalized for 40 min. They colocalized better with Rab5 or EEA1 than with Rab11 or LAMP1. When we allowed recycling for an additional 20 min, the surface level of N-cad antibodies partially recovered in leader cells more than follower cells (see Figures 5 G and S3 D). We tested whether treatment with lysosomal inhibitors would increase co-localization of N-cad with Rab11 in recycling endosomes. Surprisingly, however, Chloroquine or Bafilomycin A1 decreased the amount of internalized N-cad antibody in leader and follower cells, and long-term treatment did not increase total N-cad levels. Therefore, the fate of internalized N-cad in follower cells remains unclear.

      1. When N-cadherin is depleted, dissociated single cells are increased, but these cells are not well characterized. A high magnification image of the dissociated single cells is required. In addition, the migration speed of the dissociated single cells should be measured.

      We didn’t quantify single cell migration because only a minority of cells separate from the collective even when N-cad is depleted. Therefore, we quantified migration directionality and speed for cells at or near the front of collective migration (Figure 2 D-I). We have updated the image of single cells, providing representative high-magnification images in Figure S1 N and O.

      1. In Fig. S2D, treatment with Pitstop-2 alone or Dyngo-4a alone is required. Dynamin is also involved in clathrin-independent endocytosis and N-cadherin is reported to be internalized via caveolin-1-mediated endocytosis as well as clathrin-mediated during neuronal migration. It would be better to clarify which type of endocytosis occurs in the leader cells.

      We have removed results showing inhibition of cell migration and N-cad endocytosis by Pitstop-2 and Dyngo-4a from the paper. Treatment with either chemical alone had much less effect on internalization or migration than adding both together (see figure below). This is hard to explain. Pitstop-2 should inhibit clathrin-coated pit formation and Dyngo-4a should inhibit clathrin and caveolin-mediated endocytosis. Caveolin-1 and 2 transcripts were not detected in our cells (Table S2). There may be some other form of clathin-independent endocytosis. Interpretation is also challenging since these inhibitors will inhibit endocytosis of many receptors, not just N-cad. Accordingly, we have removed these results in the revised manuscript.

      1. In Fig. 2, N-cadherin depletion disturbs the migration directionality. Is this a result from disruption of cell polarity? To test this, the position of centrosome or Golgi or lamellipodia in the leader cells should be analyzed. (OPTIONAL)

      We elected not to perform this analysis.

      1. I cannot understand the significance of Fig. 5F and 5G. If the authors would speculate that alpha- and beta-Catenins may transduce the intracellular signaling from the internalized N-cadherin, the authors should perform the knockdown experiments of the Catenins and analyze whether it may affect the nuclear accumulation of YAP/TAZ. (OPTIONAL)

      We agree. In the initial manuscript, we showed that N-cad depletion altered the localization of p120-, β-, and α-catenin (previously shown in Figure 5 F-G). For better clarity and logic, these figures have been moved to Figure S2 H in the revised manuscript. Additionally, to test whether catenins regulate N-cad and YAP1, we depleted p120-, β-, or α-catenin using shRNA. We found that downregulation of p120-, β-, or α-catenin decreased N-cad surface levels, consequently slowing migration on astrocytes and stimulating migration on laminin (Figure 4). In other words, depleting catenins altered migration in parallel with the changes in N-cad surface level. Catenin depletion also increased single-cell dissociation, reduced the crowding of leader and follower cells, and increased nuclear YAP1 (see figure below). These findings suggest that the main role of p120-, β-, or α-catenin is to regulate surface N-cad. Since this result does not support a role for catenins transducing an N-cad signal to YAP1, we have not included it in the paper.

      Minor comments

      1. The quantitative data is required in Fig. 5E.

      Quantitative data from three independent experiment are now presented in Figure S2 G.

      1. Vinculin is associated with the cadherin-catenin complex and it may not be a good loading control (Fig. 3C and 3L).

      The Western blot data has been updated and is now presented in new Figure 3 B and 3 F, with β-tubulin as a loading control.

      **Referees cross-commenting**

      I totally agree with the other Reviewers' comments and evaluation. As the reviewer-1 pointed out, I also think the experiments are well performed, but it would lack logic at least in part (see my comment-6). In addition, as the reviewer-3 pointed out, the linking mechanism of N-cadherin homophilic interaction with YAP/TAZ signaling is important to improve this manuscript

      We hope the revisions have improved the logical flow. We have also added new results showing that YAP/TAZ regulate N-cad protein levels and localization but not N-cad RNA. N-cad is not needed for cell density-dependent regulation of YAP1 localization. The model is shown in Figure 8 I.

      Reviewer #2 (Significance (Required)):

      Strength N-cadherin has multiple function in cancer and neuronal migration, and both positive and negative effects of N-cadherin on cancer cell migration have been reported. In this regard, different behaviors of N-cadherin in the leader and follower cells of the collective are interesting and may explain the controversial previous results.

      Limitation This study reveals various aspects of N-cadherin function in the collective migration of the glioma cell line, but it is unclear whether these findings are applied to pediatric high-grade gliomas in vivo.

      Thus, this study is a potentially important and informative to cell biologists and researchers in cancer biology, although this reviewer also found several weak points that should be improved.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, the authors explore the role of N-cadherin in the migratory/infiltrative behavior of human pediatric brain tumor cells, in light of their surrounding microenvironment. Their in-depth phenotype analysis allows to document the behavior of migrating cells and revisit the concept of leading/follower migratory cells (somehow more commonly applied to endothelial cells). They suspected that the YAP/TAZ pathway might modulate N-cadherin endocytosis and vice versa, using imagery-based cell tracking.

      Major comments

      1. To control for co-culture models, migration should be evaluated on decellularized matrices from astrocyte and neuron cultures.

      We thank for your suggestion. We tested glioma migration on astrocyte-derived decellularized matrices. The mouse astrocytes we used are known to produce various extracellular matrices with a composition similar to Matrigel, except for laminin α5. (Gritsenko et al., J Cell Sci, 2018). N-cad shRNA cells migrated faster on decellularized ECM than control (Figure 1 I-J and S1 Q). This result agrees with N-cad depletion increasing migration on ECM but is opposite to migration on astrocytes.

      1. N-cadherin was stably knocked down with shRNA, which raises the question of adaptative/compensatory mechanisms. First, one could ask what happen in knockout conditions. Similarly, transient siRNA-mediated silencing might help to strengthen the findings. Second, is there any impact of Ncad knock down on alternate adhesive receptors (either cell-cell or cell-ECM). This should be verified with bulk RNAseq.

      Transient knockdown with N-cad siRNA also increased migration on laminin-coated surface (Figure S1 L-M). Unfortunately, N-cad depletion with siRNA was short-lived, precluding its use for long-term assays, like coculture with neurons or astrocytes. To test whether there is any impact of N-cad knockdown on alternative adhesion receptors, we performed RNA-Seq (Figure S1 H, Table S2). We found N-cad depletion did not alter expression of other cell-cell and cell-ECM adhesive receptors except CDH3 (2.8-fold increase compared with 7-fold decrease in CDH2). Integrin expression was unchanged.

      1. It would be interesting to evaluate the impact of N-cadherin/N-cadherin homotypic interactions on YAP/TAZ signaling, using for instance N-cad-coated surface.

      We observed that the homotypic interaction of N-cad with surrounding neurons and astrocytes did not hinder the accumulation of nuclear YAP1 in leader cells (Figure 6 C-D). To further support the idea that N-cad does not directly regulate YAP1 signaling, we have now measured YAP1 localization in cells migrating over N-cad ECD. The new data confirms that N-cad does not directly regulate YAP1 localization (Figure 7 B-C).

      1. along this line, the impact of mechanical cues (stiffness, 2D vs 3D) is not explored.

      We appreciate your suggestion. It is possible that different mechanical and cytoskeletal cues between leader and follower cells affect YAP1 signaling. In this study, we would like to focus more on the role of N-cad-mediated cell adhesions in YAP signaling.

      Minor comments

      1. Levels of N-cadherin expression in normal Astro and Neurons to compare with pediatric brain cancer cells (S1C)

      A new western blot analysis to show N-cad levels in DMG, PHGG and mouse cerebellar neurons and astrocytes has been added to Figure S1 F.

      1. Low versus high density culture conditions should be controlled and its further impact on the YAP/Ncad endocytosis route should be supported experimentally, or to be omitted from their proposed model.

      We previously used different size of micropattern discs to control low or high cell density. Smaller cell clusters, with more edge cells and hence fewer cell-cell interactions, had higher nuclear YAP1 (Figure 7 D-E). We have repeated this experiment, including N-cad ECD antibodies to measure N-cad endocytosis. Smaller cell clusters had higher N-cad antibody internalization (Figure 7 F). Together with our evidence that leader cells have higher YAP1 and more N-cad internalization than followers, and that YAP/TAZ knockdown inhibits N-cad internalization, these results high YAP/TAZ in leader cells regulates N-cad internalization.

      Reviewer #3 (Significance (Required)):

      This paper presents robust image analysis of human pediatric brain tumor migration in the context of the different microenvironment that they might encounter (matrices, neurons, astrocytes). This study brings new concepts on the way N-cadherin might contribute to tumor cell migratory behavior based on the nature of the interactions in which N-cadherin is involved. As a limitation, it remains unclear the mechanism by which N-cadherin endocytosis is driven.

      We now discuss the limitations of the study as follows:

      “The mechanisms by which YAP1 regulates N-cad levels and trafficking remain to be explored. YAP1 is widely expressed in human brain tumors and strongly associated poor survival. Leader cells expressed higher levels of YAP1-response and wound-healing gene transcripts, but transcript levels of N-cad and proteins known to regulate cadherin traffic, such as p120-catenin, Rab5/11 and Rac1, were similar. Therefore, N-cad is likely regulated at the level of protein synthesis or turnover. More endosomal N-cad recycled to the surface of leader than follower cells, implying that follower cells might divert more N-cad for lysosomal degradation, but our attempts to interfere with N-cad endocytosis or degradation specifically were unsuccessful. Further understanding of the mechanism and function of N-cad recycling for glioma cell migration will require cargo-specific ways to selectively regulate endocytosis and recycling”.

    1. Author Response

      The following is the authors’ response to the original reviews.

      eLife assessment

      This important study combines genetically barcoded rabies viruses with spatial transcriptomics in vivo in the mouse brain to decode connectivity of neural circuits. The data generated by the combination of these approaches in this new way is mostly convincing as the authors provide validation and proof-of-concept that the approach can be successful. While this new combination of established techniques has promise for elucidating brain connectivity, there are still some nuances and caveats to the interpretations of the results that are lacking especially with regards to noting unexpected barcodes either due to unexpected/novel connections or unexpected rabies spread.

      In this revised manuscript, we added a new control experiment and additional analyses to address two main questions from the reviewers: (1) How the threshold of glycoprotein transcript counts used to identify source cells was determined, and (2) whether the limited long-range labeling was expected in the trans-synaptic experiment. The new experiments and analyses validated the distribution of source cells and presynaptic cells observed in the original barcoded transsynaptic tracing experiment and validated the choice of the threshold of glycoprotein transcripts. As the reviewers suggested, we also included additional discussion on how future experiments can improve upon this study, including strategies to improve source cell survival and minimizing viral infection caused by leaky expression of TVA. We also provided additional clarification on the analyses for both the retrograde labeling experiment and the trans-synaptic tracing experiment. We modified the Results and Discussion sections on the trans-synaptic tracing experiment to improve clarity to general readers. Detailed changes to address specific comments by reviewers are included below.

      Public Reviews:

      Reviewer #1 (Public Review):

      In this preprint, Zhang et al. describe a new tool for mapping the connectivity of mouse neurons. Essentially, the tool leverages the known peculiar infection capabilities of Rabies virus: once injected into a specific site in the brain, this virus has the capability to "walk upstream" the neural circuits, both within cells and across cells: on one hand, the virus can enter from a nerve terminal and infect retrogradely the cell body of the same cell (retrograde transport). On the other hand, the virus can also spread to the presynaptic partners of the initial target cells, via retrograde viral transmission.

      Similarly to previously published approaches with other viruses, the authors engineer a complex library of viral variants, each carrying a unique sequence ('barcode'), so they can uniquely label and distinguish independent infection events and their specific presynaptic connections, and show that it is possible to read these barcodes in-situ, producing spatial connectivity maps. They also show that it is possible to read these barcodes together with endogenous mRNAs, and that this allows spatial mapping of cell types together with anatomical connectivity.

      The main novelty of this work lies in the combined use of rabies virus for retrograde labeling together with barcoding and in-situ readout. Previous studies had used rabies virus for retrograde labeling, albeit with low multiplexing capabilities, so only a handful of circuits could be traced at the same time. Other studies had instead used barcoded viral libraries for connectivity mapping, but mostly focused on the use of different viruses for labeling individual projections (anterograde tracing) and never used a retrograde-infective virus.

      The authors creatively merge these two bits of technology into a powerful genetic tool, and extensively and convincingly validate its performance against known anatomical knowledge. The authors also do a very good job at highlighting and discussing potential points of failure in the methods.

      We thank the reviewer for the enthusiastic comments.

      Unresolved questions, which more broadly affect also other viral-labeling methods, are for example how to deal with uneven tropism (ie. if the virus is unable or inefficient in infecting some specific parts of the brain), or how to prevent the cytotoxicity induced by the high levels of viral replication and expression, which will tend to produce "no source networks", neural circuits whose initial cell can't be identified because it's dead. This last point is particularly relevant for in-situ based approaches: while high expression levels are desirable for the particular barcode detection chemistry the authors chose to use (gap-filling), they are also potentially detrimental for cell survival, and risk producing extensive cell death (which indeed the authors single out as a detectable pitfall in their analysis). This is likely to be one of the major optimisation challenges for future implementations of these types of barcoding approaches.

      As the reviewer suggested, we included additional discussion about tropism and cytotoxicity in the revised Discussion. Our sensitivity for barcode detection is sufficient, since we estimated (based on manual proofreading) that most barcoded neurons had more than ten counts of a barcode in the trans-synaptic tracing experiment. The high sensitivity may potentially allow us to adapt next-generation rabies virus with low replication, such as the third generation ΔL rabies virus (Jin et al, 2022, biorxiv) in future optimizations.

      Overall the paper is well balanced, the data are well presented and the conclusions are strongly supported by the data. Impact-wise, the method is definitely going to be useful for the neurobiology research community.

      We thank the reviewer for her/his enthusiasm.

      Reviewer #2 (Public Review):

      Although the trans-synaptic tracing method mediated by the rabies virus (RV) has been widely utilized to infer input connectivity across the brain to a genetically defined population in mice, the analysis of labeled pre-synaptic neurons in terms of cell-type has been primarily reliant on classical low-throughput histochemical techniques. In this study, the authors made a significant advance toward high-throughput transcriptomic (TC) cell typing by both dissociated single-cell RNAseq and the spatial TC method known as BARseq to decode a vast array of molecularly labeled ("barcoded") RV vector library. First, they demonstrated that a barcoded-RV vector can be employed as a simple retrograde tracer akin to AAVretro. Second, they provided a theoretical classification of neural networks at the single-cell resolution that can be attained through barcoded-RV and concluded that the identification of the vast majority (ideally 100%) of starter cells (the origin of RV-based trans-synaptic tracing) is essential for the inference of single-cell resolution neural connectivity. Taking this into consideration, the authors opted for the BARseq-based spatial TC that could, in principle, capture all the starter cells. Finally, they demonstrated the proof-of-concept in the somatosensory cortex, including infrared connectivity from 381 putative pre-synaptic partners to 31 uniquely barcoded-starter cells, as well as many insightful estimations of input convergence at the cell-type resolution in vivo. While the manuscript encompasses significant technical and theoretical advances, it may be challenging for the general readers of eLife to comprehend. The following comments are offered to enhance the manuscript's clarity and readability.

      We modified the Results and Discussion sections on the trans-synaptic tracing experiment to improve clarity to general readers. We separated out the theoretical discussion about barcode sharing networks as a separate subsection, explicitly stated the rationale of how different barcode sharing networks are distinguished in the in situ trans-synaptic tracing experiment, and added additional discussion on future optimizations. Detailed descriptions are provided below.

      Major points:

      1. I find it difficult to comprehend the rationale behind labeling inhibitory neurons in the VISp through long-distance retrograde labeling from the VISal or Thalamus (Fig. 2F, I and Fig. S3) since long-distance projectors in the cortex are nearly 100% excitatory neurons. It is also unclear why such a large number of inhibitory neurons was labeled at a long distance through RV vector injections into the RSP/SC or VISal (Fig. 3K). Furthermore, a significant number of inhibitory starter cells in the somatosensory cortex was generated based on their projection to the striatum (Fig. 5H), which is unexpected given our current understanding of the cortico-striatum projections.

      The labeling of inhibitory neurons can be explained by several factors in the three different experiments.

      (1) In the scRNAseq-based retrograde labeling experiment (Fig. 2 and Fig. S3), the injection site VISal is adjacent to VISp. Because we dissected VISp for single-cell RNAseq, we may find labeled inhibitory neurons at the VISp border that extend short axons into VISal. We explained this in the revised Results.

      (2) In the in situ sequencing-based retrograde labeling experiment (Fig. 3,4), the proximity between the two injection sites VISal and RSP/SC, and the sequenced areas (which included not only VISp but also RSP) could also contribute to labeling through local axons of inhibitory neurons. Furthermore, because we also sequenced midbrain regions, inhibitory neurons in the superior colliculus could pick up the barcodes through local axons. We included an explanation of this in the revised Results.

      (3) In the trans-synaptic tracing experiment, we speculate that low level leaky expression from the TREtight promoter led to non-Cre-dependent expression in many neurons. To test this hypothesis, we first performed a control injection in which we saw that the fluorescent protein expression were indeed restricted to layer 5, as expected from corticostriatal labeling. Based on the labeling pattern, we estimated that about 12 copies of the glycoprotein transcript per cell would likely be needed to achieve fluorescent protein expression. Since many source cells in our experiment were below this threshold, these results support the hypothesis that the majority of source cells with low level expression of the glycoprotein were likely Cre-independent. Because these cells could still contribute to barcode sharing networks, we could not exclude them as in a conventional bulk trans-synaptic tracing experiment. In future experiments, we can potentially reduce this population by improving the helper AAV viruses used to express TVA and the glycoprotein. We included this explanation in Results and more detailed analysis in Supplementary Note 2, and discussed potential future optimizations in the Discussion. This new analysis in Supplementary Note 2 is also related to the Reviewer’s question regarding the threshold used for determining source cells (see below).

      1. It is unclear as to why the authors did not perform an analysis of the barcodes in Fig. 2. Given that the primary objective of this manuscript is to evaluate the effectiveness of multiplexing barcoded technology in RV vectors, I would strongly recommend that the authors provide a detailed description of the barcode data here, including any technical difficulties or limitations encountered, which will be of great value in the future design of RV-barcode technologies. In case the barcode data are not included in Fig. 2, I would suggest that the authors consider excluding Fig. 2 and Fig. S1-S3 in their entirety from the manuscript to enhance its readability for general readers.

      In the single-cell RNAseq-based retrograde tracing, all barcodes recovered matched to known barcodes in the corresponding library. We included a short description of these results in the revised manuscript.

      1. Regarding the trans-synaptic tracing utilizing a barcoded RV vector in conjunction with BARseq decoding (Fig. 5), which is the core of this manuscript, I have a few specific questions/comments. First, the rationale behind defining cells with only two rolonies counts of rabies glycoprotein (RG) as starter cells is unclear. Why did the authors not analyze the sample based on the colocalization of GFP (from the AAV) and mCherry (from the RV) proteins, which is a conventional method to define starter cells? If this approach is technically difficult, the authors could provide an independent histochemical assessment of the detection stringency of GFP positive cells based on two or more colonies of RG.

      In situ sequencing does not preserve fluorescent protein signals, so we used transcript counts to determine which cells expressed the glycoprotein. We have added new analyses in the Results and in Supplementary Note 2 to determine the transcript counts that were equivalent to cells that had detectable BFP expression. We found that BFP expression is equivalent to ~12 counts of the glycoprotein transcript per cell, which is much higher than the threshold we used. However, we could not solely rely on this estimate to define the source cells, because cells that had lower expression of the glycoprotein (possibly from leaky Cre-independent expression) may still pass the barcodes to presynaptic cells. This can lead to an underestimation of double-labeled and connected-source networks and an overestimation of single-source networks and can obscure synaptic connectivity at the cellular resolution. We thus used a very conservative threshold of two transcripts in the analysis. This conservative threshold will likely overestimate the number of source cells that shared barcodes and underestimate the number of single-source networks. Since this is a first study of barcoded transsynaptic tracing in vivo, we chose to err on the conservative side to make sure that the subsequent analysis has single-cell resolution. Future characterization and optimization may lead to a better threshold to fully utilize data.

      Second, it is difficult to interpret the proportion of the 2,914 barcoded cells that were linked to barcoded starter cells (single-source, double-labeled, or connected-source) and those that remained orphan (no-source or lost-source). A simple table or bar graph representation would be helpful. The abundance of the no-source network (resulting from Cre-independent initial infection of the RV vector) can be estimated in independent negative control experiments that omit either Cre injection or AAV-RG injection. The latter, if combined with BARseq decoding, can provide an experimental prediction of the frequency of double-labeled events since connected-source networks are not labeled in the absence of RG.

      We have added Table 2, which breaks down the 2,914 barcoded cells based on whether they are presynaptic or source cells, and which type of network they belong to. We agree with the reviewer that the additional Cre- or RG- control experiments in parallel would allow an independent estimate of the double labeled networks and the no-source networks. We have included added a discussion of possible controls to further optimize the trans-synaptic tracing approach in future studies in the Discussion.

      Third, I would appreciate more quantitative data on the putative single-source network (Fig. 5I and S6) in terms of the distribution of pre- and post-synaptic TC cell types. The majority of labeling appeared to occur locally, with only two thalamic neurons observed in sample 25311842 (Fig. S6). How many instances of long-distance labeling (for example, > 500 microns away from the injection site) were observed in total? Is this low efficiency of long-distance labeling expected based on the utilized combinations of AAVs and RV vectors? A simple independent RV tracing solely detecting mCherry would be useful for evaluating the labeling efficiency of the method. I have experienced similar "less jump" RV tracing when RV particles were prepared in a single step, as this study did, rather than multiple rounds of amplification in traditional protocols, such as Osakada F et al Nat Protocol 2013.

      We imaged an animal that was injected in parallel to assess labeling (now included in Supplementary Note 2 and Supp. Fig. S5). The labeling pattern in the newly imaged animal was largely consistent with the results from the barcoded experiment: most labeled neurons were seen in the vicinity of the injection site, and sparser labeling was seen in other cortical areas and the thalamus. We further found that most neurons that were labeled in the thalamus were about 1 mm posterior to the center of the injection site, and thus would not have been sequenced in the in situ sequencing experiment (in which we sequenced about 640 µm of tissue spanning the injection site).

      In addition, we found that the bulk of the cells that expressed mCherry from the rabies virus only partially overlapped with the area that contained cells co-expressing BFP with the rabies glycoprotein. Moreover, very few cells co-expressed mCherry and BFP, which would be considered source cells in a conventional mono-synaptic tracing experiment. The small numbers of source cells likely also contributed to the sparseness of long-range labeling in the barcoded experiment.

      These interpretations and comparisons to the barcoded experiment are now included in Supplementary Note 2.

      Reviewer #3 (Public Review):

      The manuscript by Zhang and colleagues attempts to combine genetically barcoded rabies viruses with spatial transcriptomics in order to genetically identify connected pairs. The major shortcoming with the application of a barcoded rabies virus, as reported by 2 groups prior, is that with the high dropout rate inherent in single cell procedures, it is difficult to definitively identify connected pairs. By combining the two methods, they are able to establish a platform for doing that, and provide insight into connectivity, as well as pros and cons of their method, which is well thought out and balanced.

      Overall the manuscript is well-done, but I have a few minor considerations about tone and accuracy of statements, as well as some limitations in how experiments were done. First, the idea of using rabies to obtain broader tropism than AAVs isn't really accurate - each virus has its own set of tropisms, and it isn't clear that rabies is broader (or can be made to be broader).

      As the reviewer suggested, we toned down this claim and stated that rabies virus has different tropism to complement AAV.

      Second, rabies does not label all neurons that project to a target site - it labels some fraction of them.

      We meant to say that retrograde labeling is not restricted to labeling neurons from a certain brain region. We have clarified in the text.

      Third, the high rate of rabies virus mutation should be considered - if it is, or is not a problem in detecting barcodes with high fidelity, this should be noted.

      Our analysis showed that sequencing 15 bases was sufficient to tolerate a small number of mismatches in the barcode sequences and could distinguish real barcodes from random sequences (Fig. 4A). Thus, we can tolerate mutations in the barcode sequence. We have clarified this in the text.

      Fourth, there are a number of implicit assumptions in this manuscript, not all of which are equally backed up by data. For example, it is not clear that all rabies virus transmission is synaptic specific; in fact, quite a few studies argue that it is not (e.g., detection of rabies transcripts in glial cells). Thus, arguments about lost-source networks and the idea that if a cell is lost from the network, that will stop synaptic transmission, is not clear. There is also the very real propensity that, the sicker a starter cell gets, the more non-specific spread of virus (e.g., via necrosis) occurs.

      We agree with the reviewer that how strictly virus transmission is restricted to synapses remains a hotly debated question in the field, and this question is relevant not only to techniques based on barcoded rabies tracing, but to all trans-synaptic tracing experiments. A barcoding-based approach can generate single-cell data that enable direct comparison to other data modalities that measure synaptic connectivity, such as multi-patch and EM. These future experiments may provide additional insights into the questions that the reviewer raised. We have included additional discussion about how non-synaptic transmission of barcodes because of the necrosis of source cells may affect the analysis in the Discussion.

      Regarding the scenario in which the source cell dies, we agree with the reviewer and have clarified in the revised manuscript.

      Fifth, in the experiments performed in Figure 5, the authors used a FLEx-TVA expressed via a retrograde Cre, and followed this by injection of their rabies virus library. The issue here is that there will be many (potentially thousands) of local infection events near the injection site that TVA-mediated but are Cre-dependent (=off-target expression of TVA in the absence of Cre). This is a major confound in interpreting the labeling of these cells. They may express very low levels of TVA, but still have infection be mediated by TVA. The authors did not clearly explore how expression of TVA related to rabies virus infection of cells near the rabies injection site. A modified version of TVA, such as 66T, should have been used to mitigate this issue. Otherwise, it is impossible to determine connectivity locally. The authors do not go to great lengths to interpret the findings of these observations, so I am not sure this is a critical issue, but it should be pointed out by the authors as a caveat to their dataset.

      We agree with the reviewer that this type of infection could potentially be a major contributor to no-source networks, which were abundant in our experiment. Because small no-source networks were excluded from our analyses, and large no-source networks were only included for barcodes with low frequency (i.e., it would be nearly impossible statistically to generate such large no-source networks from independent infections), we believe that the effect of independent infections on our analyses were minimized. We have added a control experiment in Fig S5 and Supplementary Note 2, which further supported the hypothesis that there were many independent infections. We also included additional discussion about how this can be assessed and optimized in future studies in the Discussion.

      Sixth, the authors are making estimates of rabies spread by comparison to a set of experiments that was performed quite differently. In the two studies cited (Liu et al., done the standard way, and Wertz et al., tracing from a single cell), the authors were likely infecting with a rabies virus using a high multiplicity of infection, which likely yields higher rates of viral expression in these starter cells and higher levels of input labeling. However, in these experiments, the authors need to infect with a low MOI, and explicitly exclude cells with >1 barcode. Having only a single virion trigger infection of starter cells will likely reduce the #s of inputs relative to starter neurons. Thus, the stringent criteria for excluding small networks may not be entirely warranted. If the authors wish to only explore larger networks, this caveat should be explicitly noted.

      In the trans-synaptic labeling experiment, we actually used high rabies titer (200 nL, 7.6e10 iu/mL) that was comparable to conventional rabies tracing experiments. We did not exclude cells with multiple barcodes (as opposed to barcodes in multiple source cells), because we could resolve multiple barcodes in the same cell and indeed found many cells with multiple barcodes. We have clarified this in the text.

      Overall, if the caveats above are noted and more nuance is added to some of the interpretation and discussion of results, this would greatly help the manuscript, as readers will be looking to the authors as the authority on how to use this technology.

      In addition to addressing the specific concerns of the reviewer as described above, we modified the Results and Discussion sections on the trans-synaptic tracing experiment to improve clarity to general readers and expanded the discussion on future optimizations.

      Reviewer #1 (Recommendations For The Authors):

      The scientific problem is clearly stated and well laid out, the data are clearly presented, and the experiments well justified and nicely discussed. It was overall a very enjoyable read. The figures are generally nice and clear, however, I find the legends excessively concise. A bit too often, they just sort of introduce the title of the panel rather than a proper explanation of what it is depicted. A clear case is for example visible in Fig 2, where the description of the panels is minimal, but this is a general trend of the manuscript. This makes the figures a bit hard to follow as self-contained entities, without having to continuously go back to the main text. I think this could be improved with longer and more helpful descriptions.

      We have revised all figure legends to make them more descriptive.

      Other minor things:

      In the cDNA synthesis step for in-situ sequencing, I believe the authors might have forgotten one detail: the addition of aminoallyl dUTP to the RT reaction. If I recall correctly this is done in BARseq. The fact that the authors crosslink with BS-PEG on day 2, makes me suspect they spike in these nucleotides during the RT but this is not specified in the relevant step. Perhaps this is a mistake that needs correction.

      The RT primers we used have an amine group at 5’, which directly allows crosslinking. Thus, we did not need to spike in aminoallyl dUTP in the RT reaction. We have clarified this in the Methods.

      Reviewer #2 (Recommendations For The Authors):

      Throughout the manuscript, there are frequent references to the "Methods" section for important details. However, it can be challenging to determine which specific section of the Methods the authors are referring to, and in some cases, a thorough examination of the entire Methods section fails to locate the exact information needed to support the authors' claims. Below are a few specific examples of this issue. The authors are encouraged to be more precise in their references to the Methods section.

      In the revised manuscript, we numbered each subsection of Methods and updated pointers and associated hyperlinks in the main text to the subsection numbers.

      • On page 7, line 14, it is unclear how the authors compared the cell marker gene expression with the marker gene expression in the reference cell type.

      We have clarified in the revised manuscript.

      • On page 7, line 33, the authors note that some barcodes may have been missed during the sequencing of the rabies virus libraries, but the Methods section lacked a convincing explanation on this issue (see my point 2 above).

      We included a separate subsection on the sequencing of rabies libraries and the analysis of the sequencing depth in the Methods. In this new subsection, we further clarified our reasoning for identifying the lack of sequencing depth as a reason for missing barcodes, especially in comparison to sequencing depth required for establishing exact molecule counts used in established MAPseq and BARseq techniques with Sindbis libraries.

      • On page 9, line 44, the authors state that they considered a barcode to be associated with a cell if they found at least six molecules of that barcode in a cell, as detailed in the Methods section. However, the rationale behind this level of stringency is not provided in the Methods.

      We initially chose this threshold based on visual inspection of the sequencing images of the barcoded cells. Because the labeled cell types were consistent with our expectations (Fig. 4E-G), we did not further optimize the threshold for detecting retrogradely labeled barcoded cells.

      • I have noticed that some important explanations of figure panels are missing in the legends, making it challenging to understand the figures. Below are typical examples of this issue.

      In addition to the examples that the reviewer mentioned below, we also revised many other figure panels to make them clear to the readers.

      • In Fig. 2, "RV into SC" in panel C does not make sense, as RV was injected into the thalamus. There is no explanation of the images in this panel C.

      We have corrected the typo in the revision.

      • In Fig. 3, information on the endogenous gene panel for cell type classification (Table S3) could be mentioned in the legend or corresponding text.

      We now cite Table S3 both in Fig 3 legend and in the main text. We also included a list of the 104 cell type marker genes we used in Table S3.

      • In panel J, it is unclear why the total number of BC cells is 2,752, and not 4,130 as mentioned in the text.

      This is a typo. We have corrected this in the revision. The correct number (3,746) refers to the number of cells that did not belong to either of the two categories at the bottom of the panel, and not the total number of neurons. To make this clear, we now also include the total number of barcoded cells at the top of the panel.

      • In Fig. 4, the definitions of "+" and "−" symbols in panels K and L are unclear. Also, it seems that the second left column of panel K should read "T −."

      We corrected the typo in K, further clarified the “Area” labels, and changed the “S” label in 4K to “−”. This change does not change the original meaning of the figure: when considering the variance explained in L4/5 IT neurons, considering the subclass compositional profile is equivalent to not using the compositional profiles of cell types, because L4/5 IT neurons all belong to the same subclass (L4/5 IT subclass). Although operationally we simply considered subclass-level compositional profiles when calculating the variance explained, we think that changing this to “−” is clearer for the readers.

      • In Fig. 5, panel E is uninterpretable.

      We revised the main text and the figure to clarify how we manually proofread cells to determine the QC thresholds for barcoded cells. These plots showed a summary of the proofreading. We also revised the figures to indicate that they showed the fraction of barcoded cells that were considered real after proofreading. In the revised version, we moved these plots to Fig. S5.

      • In Fig. S1, I do not understand the identity of the six samples on the X-axis of panel A (given that only two animals were described in the main text) and what panel B shows, including the definition of map_cluster_conf and map_cluster_corr.

      In the revised Fig. S1, we made it more explicit that the six animals include both animals used for retrograde tracing (2 animals) and those used for trans-synaptic tracing (4 animals). We updated the y axis labels to be more readable and cited the relevant Methods section for definitions.

      • In Fig. S2, please provide the definitions of blue and red dots and values in panel A, as well as the color codes and size of the circles in panel B. My overall impression from panel B is that there is no significant difference between RV-infected and non-infected cells. The authors should provide more quantitative and statistical support for the claim that "RV-infected cells had higher expression of immune response-related genes."

      We toned down the statement to “Consistent with previous studies […], some immune response related genes were up-regulated in virus-infected cells compared to non-infected cells.” Because the main point of the single-cell RNAseq analysis was that rabies did not affect the ability to distinguish transcriptomic types, the change in immune response-related genes was not essential to the main conclusions. We clarified the red and blue dots in panel A and changed panel B to show the top up-regulated immune response-related genes in the revised manuscript.

      • In Fig. S3, the definitions of the color code and circle size are missing.

      We have added the legends in Fig. S3.

    1. Reviewer #1 (Public Review):

      Summary:<br /> The authors investigated causal inference in the visual domain through a set of carefully designed experiments, and sound statistical analysis. They suggest the early visual system has a crucial contribution to computations supporting causal inference.

      Strengths:<br /> I believe the authors target an important problem (causal inference) with carefully chosen tools and methods. Their analysis rightly implies the specialization of visual routines for causal inference and the crucial contribution of early visual systems to perform this computation. I believe this is a novel contribution and their data and analysis are in the right direction.

      Weaknesses:<br /> In my humble opinion, a few aspects deserve more attention:

      1. Causal inference (or causal detection) in the brain should be quite fundamental and quite important for human cognition/perception. Thus, the underlying computation and neural substrate might not be limited to the visual system (I don't mean the authors did claim that). In fact, to the best of my knowledge, multisensory integration is one of the best-studied perceptual phenomena that has been conceptualized as a causal inference problem. Assuming the causal inference in those studies (Shams 2012; Shams and Beierholm 2022; Kording et al. 2007; Aller and Noppeney 2018; Cao et al. 2019) (and many more e.g., by Shams and colleagues), and the current study might share some attributes, one expects some findings in those domains are transferable (at least to some degree) here as well. Most importantly, underlying neural correlates that have been suggested based on animal studies and invasive recording that has been already studied, might be relevant here as well. Perhaps the most relevant one is the recent work from the Harris group on mice (Coen et al. 2021). I should emphasize, that I don't claim they are necessarily relevant, but they can be relevant given their common roots in the problem of causal inference in the brain. This is a critical topic that the authors may want to discuss in their manuscript.

      2. If I understood correctly, the authors are arguing pro a mere bottom-up contribution of early sensory areas for causal inference (for instance, when they wrote "the specialization of visual routines<br /> for the perception of causality at the level of individual motion directions raises the possibility that this function is located surprisingly early in the visual system *as opposed to a higher-level visual computation*."). Certainly, as the authors suggested, early sensory areas have a crucial contribution, however, it may not be limited to that. Recent studies progressively suggest perception as an active process that also weighs in strongly, the top-down cognitive contributions. For instance, the most simple cases of perception have been conceptualized along this line (Martin, Solms, and Sterzer 2021)<br /> and even some visual illusion (Safavi and Dayan 2022), and other extensions (Kay et al. 2023). Thus, I believe it would be helpful to extend the discussion on the top-down and cognitive contributions of causal inference (of course that can also be hinted at, based on recent developments). Even adaptation, which is central in this study can be influenced by top-down factors (Keller et al. 2017). I believe, based on other work of Rolfs and colleagues, this is also aligned with their overall perspective on vision.

      3. The authors rightly implicate the neural substrate of causal inference in the early sensory system. Given their study is pure psychophysics, a more elaborate discussion based on other studies that used brain measurements is needed (in my opinion) to put into perspective this conclusion. In particular, as I mentioned in the first point, the authors mainly discuss the potential neural substrate of early vision, however much has been done about the role of higher-tier cortical areas in causal inference e.g., see (Cao et al. 2019; Coen et al. 2021).

      There were many areas in this manuscript that I liked: clever questions, experimental design, and statistical analysis.

      Bibliography<br /> \============

      Aller, Mate, and Uta Noppeney. 2018. "To Integrate or Not to Integrate: Temporal Dynamics of Bayesian Causal Inference." Biorxiv, December, 504118. .

      Cao, Yinan, Christopher Summerfield, Hame Park, Bruno Lucio Giordano, and Christoph Kayser. 2019. "Causal Inference in the Multisensory Brain." Neuron 102 (5): 1076-87.e8. .

      Coen, Philip, Timothy P. H. Sit, Miles J. Wells, Matteo Carandini, and Kenneth D. Harris. 2021. "The Role of Frontal Cortex in Multisensory Decisions." Biorxiv, April. Cold Spring Harbor Laboratory, 2021.04.26.441250. .

      Kay, Kendrick, Kathryn Bonnen, Rachel N. Denison, Mike J. Arcaro, and David L. Barack. 2023. "Tasks and Their Role in Visual Neuroscience." Neuron 111 (11). Elsevier: 1697-1713. .

      Keller, Andreas J, Rachael Houlton, Björn M Kampa, Nicholas A Lesica, Thomas D Mrsic-Flogel, Georg B Keller, and Fritjof Helmchen. 2017. "Stimulus Relevance Modulates Contrast Adaptation in Visual Cortex." Elife 6. eLife Sciences Publications, Ltd: e21589.

      Kording, K. P., U. Beierholm, W. J. Ma, S. Quartz, J. B. Tenenbaum, and L. Shams. 2007. "Causal Inference in Multisensory Perception." PloS One 2: e943. .

      Martin, Joshua M., Mark Solms, and Philipp Sterzer. 2021. "Useful Misrepresentation: Perception as Embodied Proactive Inference." Trends Neurosci. 44 (8): 619-28. .

      Safavi, Shervin, and Peter Dayan. 2022. "Multistability, Perceptual Value, and Internal Foraging." Neuron, August. .

      Shams, L. 2012. "Early Integration and Bayesian Causal Inference in Multisensory Perception." In The Neural Bases of Multisensory Processes, edited by M. M. Murray and M. T. Wallace. Frontiers in<br /> Neuroscience. Boca Raton (FL).

      Shams, Ladan, and Ulrik Beierholm. 2022. "Bayesian Causal Inference: A Unifying Neuroscience Theory." Neuroscience & Biobehavioral Reviews 137 (June): 104619. .

    1. Author Response

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public Review):

      The authors were trying to investigate whether viral IBs are involved in antagonizing IFN-I production during EBOV trVLPs infection. They found that IRF3 is hijacked and sequestered into EBOV IBs after viral infection, thereby leading to the spatial isolation of IRF3 with TBK1 and IKKε. In such a progress, the activity of IRF3 is suppressed and downstream IFN-I induction is inhibited. The authors designed many experiments, such as the PLA that examined the colocalization, to support their conclusions. However, necessary negative controls were missed in several assays. More key index is needed to be examined in several assays.

      The paper is well organized and most data in this paper could support the conclusions, while there are several issues that need to be further solved.

      1. In Figure 2-4, authors should examine the expression of downstream IFNs as well as the phosphorylation and nuclear localization of IRF3 to further prove the suppression of IRF3 activity by infecting with trVLPs.

      Response: The inhibitory effect of trVLPs infection on the phosphorylation of IRF3 S396 and SeV-induced IRF3 nuclear localization was determined by immunoprecipitation (Figure 3D) and immunofluorescence (Figure 4A and 4B), respectively. In addition, we demonstrated that IFN-β transcription was inhibited more potently by EBOV viral inclusion bodies compared with VP35 alone (Figure 7B and 7C).

      Moreover, EBOV viral inclusion bodies were demonstrated to inhibit the transcription of IFN downstream genes (e.g., CXCL10, ISG15 and ISG56) more potently than VP35 alone (new Figure 7D-F).

      1. In Figure 5, to better prove the conclusion that EBOV NP and VP35 play an important role in sequestering IRF3 in IBS, authors should add the "NP+VP35+VP30" and "NP+VP35+VP24" groups to reperform the assay.

      Response: According to the reviewer’s suggestion, VP24 or VP30 was added to the “VP35+NP” group, and the results showed that the “NP+VP35+VP24” and “NP+VP35+VP30” groups exhibited little, if any, effect on the distribution of IRF3 compared with the “NP+VP35” group (new Figure 5 - figure supplement 2A-B).

      1. In Figure 6f, the expression of STING should be examined by immunostaining to show the knockdown efficiency in trVLPs-infected cells.

      Response: As suggested by the reviewer, immunostaining was performed to visually detect the effect of STING knockdown on the IRF3 distribution during trVLPs infection (new Figure 6F).

      Reviewer #2 (Public Review):

      The manuscript by Zhu et al explored molecular mechanisms by which Ebola virus (EBOV) evades host innate immune response. EBOV has a number of means to shut down the type I interferon induction (by viral VP35 protein) and block type I interferon action (by viral VP24 protein). This study reported a new mechanism that inclusion body (IB) used for viral replication sequesters IRF3, a key transcription factor involved in the interferon signaling, resulting in blockade of downstream type I interferon gene transcription. This finding is potentially interesting and may provide a new insight into EBOV's evasion of innate immunity. However, there are some flaws in the experimentations and analyses that need to be addressed.

      1. Most of experiments were performed by transfection of trVLP plasmids, which is very different from virus infection. The conclusions should be examined and verified in the context of virus infection.

      Response: As suggested by the reviewer, the effects of IRF3 depletion on live Ebola virus replication were examined as described in the revised manuscript. Consistent with the results obtained after trVLPs infection, IRF3 depletion exerted little, if any, effect on viral replication (new Figure 7H), which supports the notion that, upon EBOV infection and the formation of inclusion bodies, IRF3 has little, if any, transcription activation activity after sequestration by inclusion bodies.

      1. Fig 1 - VP35 displayed a classical IB staining only in Panel A, while much less so in Panel C and not in panel B. It seemed that the VP35 staining images were chosen in a way towards the authors' favor. The statistical analysis of co-localization of VP35 and IRF3, TBK1 or IKKe should be performed to draw the conclusion. Another concern is that IKKe is normally lowly expressed under a rest condition and becomes induced only when the interferon signaling is activated. It seemed to be expressed at a high level even when the interferon signaling is blocked in Panel C. The authors should comment on this discrepancy.

      Response: Ebola virus inclusion bodies show variations in both shape and size. According to the reviewer’s suggestion, the colocalization of TBK1 or IKKε and VP35 is shown in new figures (new Figure 1C and 1E), and quantitatively analyzed by the fluorescence intensity using ImageJ software (new Figure 1B, 1D and 1F).

      1. Fig 2 - Was this experiment done by transfection or infection? The description of result is not consistent with the figure legend. The labeling was also not consistent between panel A and B. I would suggest performing Western blot to analyze the expression level of IRF3.

      Response: We apologize for the incorrect description of the data. Ebola virus trVLPs were initially produced based on transfection but also involved the viral infection process. The use of “transfection” in the figure and figure legends has been changed to “infection” in the revised manuscript. As suggested by the reviewer, Western blotting was performed to analyze the IRF3 expression levels at different time points after trVLPs infection (new Figure 2D).

      1. Fig 3 and 4 - As VP35 is well known for its highly efficient blockade of type I interferon activation, how would the authors differentiate the effect of VP35 alone from the sequestration of IRF3 in IBs in these experiments?

      Response: Previous studies have found that VP35, rather than NP, inhibits the expression of interferon, and the “VP35+NP” treatment, which induces IRF3 sequestration, showed inhibited IFN-β luciferase activity much more potently than VP35 expression alone (Figure 7B).

      1. Fig 3 - PolyIC can activate both RLR and TLR signaling pathways. Can the author comment on which pathway it activates in this experiment?

      Response: In this study, the effect of poly(I:C) was consistent with the results observed with SeV, which indicated that poly(I:C) may mainly activate the RLR signaling pathway. A discussion was added to the revised manuscript.

      1. The authors demonstrated that VP35 interacts with STING and recruit the latter to IBs. How would this affect the function of STING given that STING plays essential roles in cGAS/cGAMP pathway?

      Response: This study unexpectedly showed that VP35 can recruit IRF3 into viral inclusion bodies through STING, but whether it regulates the cGAS-STING pathway remains to be further investigated. Related discussion was added to the revised manuscript.

      1. It is difficult to follow the logics of Fig 7. The expression level of each viral protein should be determined. Ideally, a mutation in VP35 that disrupts its ability to antagonize the interferon signaling but still allows for the IB formation can be used to assess the relative contribution of IB sequestering IRF3.

      Response: As suggested by the reviewer, a series of VP35 mutants were constructed, but we failed to obtain a VP35 mutant that contains a mutation that disrupts the ability of the protein to antagonize interferon signaling but still allows IB formation. Instead, coexpression of “NP+VP35+VP30+L”, which induces IBs formation, inhibited IFN-I more potently than the expression of VP35 alone (Figure 7B). IRF3 knockout inhibited poly(I:C)-induced IFN-I production but had little, if any, effect on poly(I:C)-induced IFN-I production in the “NP+VP35+VP30+L” group (Figure 7C). IRF3 knockout in the cells did not significantly affect viral replication, but overexpression of activated IRF3 (IRF3/5D), instead of wild-type IRF3, inhibited viral replication (new Figure 7G-H). These results collectively suggested that almost all IRF3 in cells was hijacked and sequestered into IBs in the Ebola virus-infected cells.

    1. Author Response

      The following is the authors’ response to the original reviews.

      Reviewer #2 (Recommendations For The Authors):

      The evidence provided in this study reflects important discoveries on language lateralisation and most of the conclusions of this paper are supported by evidence. However, there are several areas regarding the characteristics of participants tested, hypotheses/predictions and the type of analysis, that need to be clarified and/or corrected.

      1. There is a substantial disconnection between the introduction and the methods/results section.

      One reason is because of lack of consistency. One example refers to the fact that, in the introduction, only IFC is mentioned. However, the analyses carried out to examine neural activity in different groups focused on IFC as well as other brain regions related to inhibitory control. However, these areas were not mentioned at all in the introduction. Second and related to the above, the rationale for conducting certain types of analyses is not specified. Some brain analyses focus on IFC only. Instead, other analyses focus on several areas.

      Another weakness is that there is not sufficient detail regarding the hypotheses/predictions and the specific types of analyses chosen to test these hypotheses/predictions. For example, there is no mention of resting state fMRI data in the introduction, but later we discover that this type of data was collected and analyzed. Even a brief mention of the inclusion of resting state data in the introduction would be beneficial. Along the same lines, by reading the methods section we find out that VBM analyses were conducted. But it is unclear why. What was the purpose of this data analysis? This should be clarified briefly in the introduction and then in the methods section. It remains unclear why resting state results would be particularly informative for addressing the research question of this study. Task-related brain connectivity seems a more appropriate choice. Additionally, it is not explained what comparisons and outcomes would be informative/expected to distinguish between the two mentioned competing hypotheses. This should be made clear.

      Another aspect that lacks clarity is the authors' predictions when investigating the relationship "between the lateralization of both functions and inter-hemispheric structural-functional connectivity, as well as with behavioural markers of certain clinical conditions that have been related with atypical lateralization". The hypotheses are completely omitted in this section.

      Thank you for bringing this to our attention. We concur with Reviewer #2 that our introduction was somewhat lacking in detail and assumed too much prior knowledge on the part of the reader. This, together with a lack of a clear presentation of our tested hypotheses, made the introduction have a poor connection with both the results and discussion sections, which hindered the understanding of the paper.

      As a result, we have made some additions to enhance the exposition of the following areas: (1) the causal and statistical hypotheses of lateralization (Lines 55-65); and (2) the hypotheses regarding subclinical markers of neurological disorders and the corpus callosum (Lines 90-104).

      Furthermore, we have extensively revised the final paragraph of the introduction (Lines 105-121) to provide a clearer and more coherent linkage between the drivers presented during the introduction, our hypotheses, and the subsequent analyses.

      1. It is important to provide more information on the language background of the participants. Were the participants in this study Catalan-Spanish bilinguals? If so, it is crucial for the authors to mention this.

      Language background of the participants has been added to the corresponding section (Lines 138-145).

      In fact, previous studies, including several publications from the authors themselves (Garbin et al., 2010; Rodríguez-Pujadas et al., 2013; Anderson et al., 2018), have shown that there are qualitative differences between bilinguals and monolinguals in the neural circuitry underlying executive control. Across all these studies, it was consistently reported that bilingual individuals, when engaged in non-linguistic inhibitory control tasks, recruited a broader network of left-brain regions associated with language control, including the left IFC, in comparison to monolingual individuals. If the participants in this study were indeed bilinguals, it raises concern if the aim of the study is to generalize the conclusions on lateralization effects beyond the bilingual population.

      Rodríguez-Pujadas, A., Sanjuán, A., Ventura-Campos, N., Román, P., Martin, C., Barceló, F., … & Ávila, C. (2013). Bilinguals use language-control brain areas more than monolinguals to perform non-linguistic switching tasks. PLoS One, 8(9), e73028.

      Garbin, G., Sanjuan, A., Forn, C., Bustamante, J. C., Rodríguez-Pujadas, A., Belloch, V., ... & Ávila, C. (2010). Bridging language and attention: Brain basis of the impact of bilingualism on cognitive control. NeuroImage, 53(4), 1272-1278.

      Anderson, J. A., Chung-Fat-Yim, A., Bellana, B., Luk, G., & Bialystok, E. (2018). Language and cognitive control networks in bilinguals and monolinguals. Neuropsychologia, 117, 352-363.

      Indeed, we have thoroughly reported that, when compared to monolinguals, bilinguals exhibit a significant implication of left brain regions during switching and inhibition tasks. So, this is a legitimate concern. Unfortunately, the society from which our participants were drawn is primarily bilingual, encompassing both active and passive bilinguals. The monolingual sample in those previous studies consisted of university students originating from predominantly monolingual regions of Spain. Given this context, it is unsurprising that the current study has a rather limited number of monolinguals (n=8), with only 2 displaying atypical language lateralization. Thus, we cannot provide a reliable answer to the role of bilingualism status in our data. Consequently, we have included a comment on this limitation on the discussion (Lines 504-512).

      1. Regarding the methods section, I have the following specific queries. The first is about the control condition in the verb generation task. I find it puzzling that the 'task' and 'control' conditions differ in terms of the number of words uttered. Could the authors please provide further clarification on this?

      Thank you for raising this question. Regarding the control condition, it is important to note that the design of this task drew inspiration from previously published verb generation tasks for fMRI (Benson et al., 1999; Fitzgerald et al., 1997) and PET (Petersen et al., 1988). In the fMRI tasks, a fixation cross served as the control condition, while the PET study used word repetition as the control. We acknowledged that a mere fixation cross might not adequately control for the movement and visual-related activations inherent in the verb generation task. Conversely, word repetition could potentially engage the default mode network due to the repetition of the same simple task, which might not be suitable for a control condition, and it could be overly linguistic because it involves a word. Consequently, we aimed to strike a balance by employing a control condition that consisted of reading letters. This approach allowed us to control for movement and vision factors without invoking semantics. Thus, after careful consideration, we ultimately opted on the reading of two letters to equate the response to the vocalization length of generating a verb.

      Although we understand the concern of single vs. two vocalizations, it is worth emphasizing that this version of the verb generation task had undergone prior testing to assess its suitability for determining language lateralization in both healthy and clinical populations (Sanjuan et al., 2010). In fact, this task has been an integral component of our lab’s standard presurgical assessment protocol, which has been used for nearly two decades in individually evaluating language function in over 500 patients with central nervous system lesions.

      Benson, R. R., Fitzgerald, D. B., Lesueur, L. L., Kennedy, D. N., Kwong, K. K., Buchbinder, B. R., Davis, T. L., Weisskoff, R. M., Talavage, T. M., Logan, W. J., Cosgrove, G. R., Belliveau, J. W., & Rosen, B. R. (1999). Language dominance determined by whole brain functional MRI in patients with brain lesions. Neurology, 4(52), 798–809.

      Fitzgerald, D. B., Cosgrove, G. R., Ronner, S., Jiang, H., Buchbinder, B. R., Belliveau, J. W., Rosen, B. R., & Benson, R. R. (1997). Location of Language in the Cortex: A Comparison between Functional MR Imaging and Electrocortical Stimulation. AJNR Am J Neuroradiol, 18, 1529–1539.

      Petersen, S. E., Fox, P. T., Posner, M. I., Mintun, M., & Raichle, M. E. (1988). Positron emission tomographic studies of the cortical anatomy of single-word processing. Nature, 331(18), 585–589.

      Sanjuán, A., Bustamante, J. C., Forn, C., Ventura-Campos, N., Barrós-Loscertales, A., Martínez, J. C., Villanueva, V., & Ávila, C. (2010). Comparison of two fMRI tasks for the evaluation of the expressive language function. Neuroradiology, 52(5), 407–415. https://doi.org/10.1007/s00234-010-0667-8

      Second, it is mentioned that some participants were excluded from different tasks due to technical issues or time constraints. It is important to ensure that all the results can be attributed to the exact same sample of participants across all tasks.

      We absolutely agree that excluding participants can be problematic when presenting the results of multiple sets of analyses. Therefore, we repeated all analyses while excluding the 7 participants that lacked resting-state data. All results remained virtually identical, with a few minor exceptions:

      1) Region-wise analysis of the stop-signal task: Hemisphere × Group effect in the preSMA region is significant (uncorrected P = 0.019), but it does not survive Bonferroni correction (corrected P = 0.076)

      2) Voxel-wise analysis of the stop-signal task: The Thalamus + STN and Caudate clusters are significant at the voxel level, but do not survive the cluster-based FWE correction. They do survive FDR correction, though.

      3) Correlation between SPQ score and LI of the stop-signal task: This correlation weakens just behind statistical significance, with a P value of 0.053.

      4) Correlation between reading variables and LIs of both tasks: Severe drops in P values are evident between both LIs and reading length accuracy (P = .111 and .133), as well as between verb generation LI and reading familiarity accuracy (P = .111). However, the association between the stop-signal LI and the reading length time is now significant (r = −.229, P = .042).

      According to this, we have included this statement in the methods section: (Lines 218-220).“It is important to highlight that the exclusion of these seven participants across all analyses does not notably impact the overall results.“

      It is unclear how the authors have estimated the RTs results from the practice trials. This requires more explanation. Also, why was the median used for the Go Reaction Time instead of the mean, when calculating the individual SSRT?

      We adapted the procedure used by Xue et al. (2008), implementing their approach to calculate SSRT. This has been elaborated further (Lines 227-230), together with the use of practice trials (Lines 233-236).

      Xue, G., Aron, A.R., and Poldrack, R.A. (2008). Common Neural Substrates for Inhibition of Spoken and Manual Responses. Cerebral Cortex 18, 1923–1932. 10.1093/CERCOR/BHM220.

      On a final note, information about the different types of pre-processing and data analysis is all reported in the same paragraph. I think using subsections would increase the intelligibility of the section.

      Thank you for this suggestion. We have added subsections in both the ‘image processing’ and ‘statistical analyses’ sections.

      1. Data analysis and Interpretation of the results. It is unclear how the mean BOLD signal was extracted to conduct ROI analysis (Marsbar?).

      Thank you for ponting this out. Indeed, we were not very accurate in the description of this procedure. We extracted the first eigenvariate via the VOI function within SPM12. This has been included in Lines 291-293.

      I feel uneasy about the way results are corrected for multiple comparisons. For instance, it is mentioned that in the ROI analysis, all p-values were FDR-corrected for four comparisons, but it is unclear why. The correct procedure for supporting conclusions about the effect of specific brain would be to have 'brain region' (n=4) as another within-subject factor. Furthermore, the one-tailed correlation is appropriate but only when testing for the possibility of a relationship in one direction and completely disregarding the possibility of a relationship in the other direction. However, this does not seem to be the case here (see Introduction), so a two-tailed correlation would be more appropriate.

      We agree with Reviewer #2 that presenting this analysis as a single MANOVA that includes a ‘Region’ factor is a more accurate approach. Consequently, we have made the aforementioned correction in the methods section (Lines 357-364) and the results section (Lines 395-406). The LI-LI one-tailed correlation was also changed to a two-tailed correlation in the methods section (Line 383), the results section (Line 417), and Figure 2 (Line 886).

      I am quite confused about using the term interhemispheric connectivity to refer to the volume of the genu, body and splenium of the corpus callosum. In fact, the volumes of genu, body and splenium of the corpus callosum do not reflect a measure of how strongly RH and LH IFC are connected to each other.

      We agree that using the term ‘interhemispheric connectivity’ when referring to callosal volume may be somewhat misleading. We have replaced every instance of this terminology throughout the paper.

      Furthermore, it is unclear why in a set of analyses (ROI and whole brain analyses) the authors focus on brain responses in different ROIs but instead, in connectivity measures the focus is only on IFC.

      Our initial rationale was to focus on regions that are prominently involved in language, particularly the IFC, for examining inter-hemispheric connectivity at rest.

      However, upon more careful consideration, it is true that the preSMA is also implicated in the language network (Labache et al., 2018), and certain studies have reported an impact of STN stimulation on specific language skills (for a review, see Vos et al., 2021). Consequently, we have incorporated these two regions into the resting-state analysis, along with subsequent correlations with LIs (Table 1 and Lines 118, 321-322 & 449-452).

      Labache, L., Joliot, M., Saracco, J., Jobard, G., Hesling, I., Zago, L., Mellet, E., Petit, L., Crivello, F., Mazoyer, B., & Tzourio-Mazoyer, N. (2018). A SENtence Supramodal Areas AtlaS (SENSAAS) based on multiple task-induced activation mapping and graph analysis of intrinsic connectivity in 144 healthy right-handers. Brain Structure and Function 2018 224:2, 224(2), 859–882. https://doi.org/10.1007/S00429-018-1810-2

      Vos, S. H., Kessels, R. P. C., Vinke, R. S., Esselink, R. A. J., & Piai, V. (2021). The Effect of Deep Brain Stimulation of the Subthalamic Nucleus on Language Function in Parkinson’s Disease: A Systematic Review. Journal of Speech, Language, and Hearing Research, 64(7), 2794–2810. https://doi.org/10.1044/2021_JSLHR-20-00515

      Minor corrections/comments:

      It is unclear why in figure caption 1, the conjunction maps are mentioned even if formal conjunction analysis was not conducted.

      This poor choosing of words has been replaced to ‘overlapping maps’.

      Line 382. VHMC should be VMHC.

      Fixed. Thank you.

      Line 334. This sentence and especially its relationship with the results is not clear at all. What do you mean by 'This finding is consistent with previous reports showing that cognitive deficits appear only in specific cognitive domains'?

      This has been clarified (Lines 521-525).

    1. Reviewer #2 (Public Review):

      Theta-nested gamma oscillations (TNGO) play an important role in hippocampal memory and cognitive processes and are disrupted in pathology. Deep brain stimulation has been shown to affect memory encoding. To investigate the effect of pulsed CA1 neurostimulation on hippocampal TNGO the authors coupled a physiologically realistic model of the hippocampus comprising EC, DG, CA1, and CA3 subfields with an abstract theta oscillator model of the medial septum (MS). Pathology was modeled as weakened theta input from the MS to EC simulating MS neurodegeneration known to occur in Alzheimer's disease. The authors show that if the input from the MS to EC is strong (the healthy state) the model autonomously generates TNGO in all hippocampal subfields while a single neurostimulation pulse has the effect of resetting the TNGO phase. When the MS input strength is weaker the network is quiescent but the authors find that a single CA1 neurostimulation pulse can switch it into the persistent TNGO state, provided the neurostimulation pulse is applied at the peak of the EC theta. If the MS theta oscillator model is supplemented by an additional phase-reset mechanism a single CA1 neurostimulation pulse applied at the trough of EC theta also produces the same effect. If the MS input to EC is weaker still, only a short burst of TNGO is generated by a single neurostimulation pulse. The authors investigate the physiological origin of this burst and find it results from an interplay of CAN and M currents in the CA1 excitatory cells. In this case, the authors find that TNGO can only be rescued by a theta frequency train of CA1 pulses applied at the peak of the EC theta or again at either the peak or trough if the MS oscillator model is supplemented by the phase-reset mechanism.

      The main strength of this model is its use of a fairly physiologically detailed model of the hippocampus. The cells are single-compartment models but do include multiple ion channels and are spatially arranged in accordance with the hippocampal structure. This allows the understanding of how ion channels (possibly modifiable by pharmacological agents) interact with system-level oscillations and neurostimulation. The model also includes all the main hippocampal subfields. The other strength is its attention to an important topic, which may be relevant for dementia treatment or prevention, which few modeling studies have addressed.

      The work has several weaknesses. First, while investigations of hippocampal neurostimulation are important there are few experimental studies from which one could judge the validity of the model findings. All its findings are therefore predictions. It would be much more convincing to first show the model is able to reproduce some measured empirical neurostimulation effect before proceeding to make predictions. Second, the model is very specific. Or if its behavior is to be considered general it has not been explained why. For example, the model shows bistability between quiescence and TNGO, however what aspect of the model underlies this, be it some particular network structure or particular ion channel, for example, is not addressed. Similarly for the various phase reset behaviors that are found. We may wonder whether a different hippocampal model of TNGO, of which there are many published (for example [1-6]) would show the same effect under neurostimulation. This seems very unlikely and indeed the quiescent state itself shown by this model seems quite artificial. Some indication that particular ion channels, CAN and M are relevant is briefly provided and the work would be much improved by examining this aspect in more detail. In summary, the work would benefit from an intuitive analysis of the basic model ingredients underlying its neurostimulation response properties. Third, while the model is fairly realistic, considerable important factors are not included and in fact, there are much more detailed hippocampal models out there (for example [5,6]). In particular, it includes only excitatory cells and a single type of inhibitory cell. This is particularly important since there are many models and experimental studies where specific cell types, for example, OLM and VIP cells, are strongly implicated in TNGO. Other missing ingredients one may think might have a strong impact on model response to neurostimulation (in particular stimulation trains) include the well-known short-term plasticity between different hippocampal cell types and active dendritic properties. Fourth the MS model seems somewhat unsupported. It is modeled as a set of coupled oscillators that synchronize. However, there is also a phase reset mechanism included. This mechanism is important because it underlies several of the phase reset behaviors shown by the full model. However, it is not derived from experimental phase response curves of septal neurons of which there is no direct measurement. The work would benefit from the use of a more biologically validated MS model.

      [1] Hyafil A, Giraud AL, Fontolan L, Gutkin B. Neural cross-frequency coupling: connecting architectures, mechanisms, and functions. Trends in neurosciences. 2015 Nov 1;38(11):725-40.

      [2] Tort AB, Rotstein HG, Dugladze T, Gloveli T, Kopell NJ. On the formation of gamma-coherent cell assemblies by oriens lacunosum-moleculare interneurons in the hippocampus. Proceedings of the National Academy of Sciences. 2007 Aug 14;104(33):13490-5.

      [3] Neymotin SA, Lazarewicz MT, Sherif M, Contreras D, Finkel LH, Lytton WW. Ketamine disrupts theta modulation of gamma in a computer model of hippocampus. Journal of Neuroscience. 2011 Aug 10;31(32):11733-43.

      [4] Ponzi A, Dura-Bernal S, Migliore M. Theta-gamma phase-amplitude coupling in a hippocampal CA1 microcircuit. PLOS Computational Biology. 2023 Mar 23;19(3):e1010942.

      [5] Bezaire MJ, Raikov I, Burk K, Vyas D, Soltesz I. Interneuronal mechanisms of hippocampal theta oscillations in a full-scale model of the rodent CA1 circuit. Elife. 2016 Dec 23;5:e18566.

      [6] Chatzikalymniou AP, Gumus M, Skinner FK. Linking minimal and detailed models of CA1 microcircuits reveals how theta rhythms emerge and their frequencies controlled. Hippocampus. 2021 Sep;31(9):982-1002.

    1. Author Response

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public Review):

      The manuscript is very-well written. Although the study is well-conducted the authors should be more convincing on how bacteria residing in tissues do not induce death. The association with IL-10 cytokine production appears weak and more experiments are needed to make it more robust

      Reviewer #2 (Public Review):

      Iske et al. provide experimental data that NAD+ lessens disease severity in bacterial sepsis without impacting on the host pathogen load. They show that in macrophages, NAD+ prevents Il1b secretion potentially mediated by Caspase11.

      While the in vivo and in vitro data is interesting and hints towards a crucial role of NAD+ to promote metabolic adaptation in sepsis, the manuscript has shortcomings and would profit from several changes and additional experiments that support the claims.

      Conceptually, the definition of sepsis is outdated. Sepsis is not SIRS, as in sepsis-2. Sepsis-3 defines sepsis as infection-associated organ dysfunction. This concept needs to be taken into account for the introduction and when describing the potential effects of NAD+ in sepsis. Also, LPS application cannot be considered a sepsis model, since it only recapitulates the consequence of TLR-4 activation. It is a model of endotoxemia. Also, the LPS data does not allow to draw conclusions about bacterial clearance (L135).

      The authors state that protective effects by NAD were independent of the host pathogen load. This clearly indicates that NAD confers protection via enhancing a disease tolerance mechanism, potentially via reducing immunopathology. This aspect is not considered by the authors. The authors should incorporate the concept of disease tolerance in their work, cite the relevant literature on the topic and discuss it their findings in light of the published evidence for metabolic alteration sand adaptations in sepsis.

      For the in vitro data, the manuscript would benefit from additional experiments using in vitro infection models.

      In the merge manuscript, the authors provide two different versions of the figures. In one, bar plots are shown without individual data and in the other with scatter blots. All bar plots need to be provided as scatter plots showing individual values.

      The authors should show further serology data for kidney and liver failure etc. as well as further cytokine data such as IL-6 and TNF to better characterize their models.

      Careful revision of the entire manuscript, the figure legends and figures is required. The figure legend should not repeat the methods and materials section. The nomenclature for mouse protein and genes needs to be thoroughly revised.

      L350. The authors write that they dissect the capacity of NAD+ to dampen auto- and alloimmunity. In this work, no data that supports this statement is shown and experiments with autoantigens or alloantigens are not performed.

      L163 The authors describe pyroptosis but in the figure legend call it apoptosis. Specific markers for each cell death should be measured and determined which cell death mechanisms is involved.

      Animal data comes from an infection model and LPS application. The RNAseq data is obtained from cells primed with Pam3CSK4 and subsequently subjected to LPS. It is unclear how the cell culture model reflects the animal model. As such the link between IFN signaling and the bacterial infection/LPS model are not convincing and need to be further elaborated.

      Figure 5: It is unclear how many independent survival experiments were done, how many mice per group were used and whether the difference between groups was statistical significant. This information should be added.

      Further experiments with primary cells from Il10 k.o. and Caspase11 k.o. animals should be provided that support the findings in macrophages.

      Author Response:

      Reviewer #1 (Public Review):

      “The manuscript is very-well written. Although the study is well-conducted the authors should be more convincing on how bacteria residing in tissues do not induce death. The association with IL-10 cytokine production appears weak and more experiments are needed to make it more robust.”

      Thank you very much for your thoughtful and constructive feedback on our manuscript. We appreciate your positive assessment of the writing quality and the acknowledgment of the wel-lconducted nature of the study.

      In regard to the reviewer's comment that "The association with IL-10 cytokine production appears weak," we would like to provide a comprehensive response based on the findings and insights presented in our study (Fig 5). We would like to emphasize several key points to further elucidate this association:

      The established knowledge underscores IL-10's capacity to hinder the activation and proliferation of macrophages, thereby safeguarding against an overly aggressive immune-inflammatory reaction (as referenced). In our earlier investigations, we demonstrated that NAD+ orchestrates a systemic generation of IL-10, which assumes a pivotal function in curtailing proinflammatory responses across various conditions, such as autoimmune diseases (as referenced), alloimmunity (as referenced), and bacterial infections (as referenced). In our latest research, we divulge that the introduction of NAD+ leads to an elevated occurrence of IL-10-producing CD4+ T cells, CD8+ T cells, and macrophages, although not dendritic cells (depicted in Figure 5B and C). Furthermore, our comprehensive analyses have substantiated that NAD+ administration thwarts pyroptosis by specifically targeting the non-canonical inflammasome pathway. Intriguingly, our in vitro outcomes suggest that the neutralization of the autocrine IL-10 signaling pathway through a neutralizing antibody and an IL-10 receptor antagonist partially reverses the NAD+-mediated blockage of pyroptosis. These in vitro results imply that NAD+ induces the production of IL-10 cytokines by macrophages, contributing to the suppression of pyroptosis. To corroborate our in vitro conclusions, we employed IL-10 knockout mice and wild-type mice, both treated with either NAD+ or a placebo solution. The wild-type mice treated with NAD+ displayed a survival rate exceeding 80%, whereas the IL-10 knockout mice exhibited a survival rate of "only" 40%. These in vivo findings align with our in vitro discoveries, underscoring the crucial role of NAD+mediated IL-10 cytokine production in impeding pyroptosis through NAD+ and shielding against septic shock. Drawing from our prior and current investigations, we respectfully disagree with the reviewer's characterization of our work as "weak."

      Recommendations for the authors

      ‘’I suggest that animals subject to E. coli infection need to be followed-up for longer and sacrificed at a later time points. It is too difficult to believe that mice are surviving with full resting bacteria in tissues. Do results suggest a full shut-down of the mechanism? What was the level of infiltration of the tissues by neutrophils?’’

      ‘’I have difficulty to agree with the survival results of the IL-10(-/-) mice of Figure 5E. Can the authors provide the p-values and follow-up for longer? Why the WT and the IL-10(-/-) mice survive the same?’’

      Thank you for your thoughtful and constructive comments on our manuscript. We appreciate your valuable insights, and we have carefully considered your suggestions.

      We thank the reviewers for this comment. We have indeed followed-up for a longer period of time mice subjected to E. Coli infection and LPS (54mg/kg). Mice infected and treated with NAD+ survived for several months and recovered fully after 10 days. Mice survived for at least a year following infection. We have now included a sentence regarding the long-term survival in the results section of Figure 1 entitled “NAD+ protects mice against septic shock not via bacterial clearance but via inflammasome blockade”. Figure illustrating the level of infiltration of the tissues by neutrophils was added in supplementary data as supplementary figure 4.

      In contrast, WT and IL-10-/- mice failed to withstand E. Coli or LPS (54mg/kg) administration when treated with a placebo solution. To our knowledge, our investigation represents the pioneering instance of successfully conferring protection against the lethal doses of E. Coli and LPS administered to animals. Considering the potent immunosuppressive nature of IL-10, our anticipation was that IL-10-/- mice would manifest an exacerbated inflammatory response subsequent to LPS administration, in contrast to WT mice. Our in vivo findings indeed corroborate this assumption, revealing that IL-10-/- mice succumbed more swiftly to LPS administration, displaying statistically significant disparities in survival rates compared to WT mice (p value of 0.0154). The pertinent p-value has been thoughtfully included in Figure 5E of our study.

      Reviewer #2 (Public Review):

      “Iske et al. provide experimental data that NAD+ lessens disease severity in bacterial sepsis without impacting on the host pathogen load. They show that in macrophages, NAD+ prevents Il1b secretion potentially mediated by Caspase11.

      While the in vivo and in vitro data is interesting and hints towards a crucial role of NAD+ to promote metabolic adaptation in sepsis, the manuscript has shortcomings and would profit from several changes and additional experiments that support the claims.

      Conceptually, the definition of sepsis is outdated. Sepsis is not SIRS, as in sepsis-2. Sepsis-3 defines sepsis as infection-associated organ dysfunction. This concept needs to be taken into account for the introduction and when describing the potential effects of NAD+ in sepsis. Also, LPS application cannot be considered a sepsis model, since it only recapitulates the consequence of TLR-4 activation. It is a model of endotoxemia. Also, the LPS data does not allow to draw conclusions about bacterial clearance (L135).

      The authors state that protective effects by NAD were independent of the host pathogen load. This clearly indicates that NAD confers protection via enhancing a disease tolerance mechanism, potentially via reducing immunopathology. This aspect is not considered by the authors. The authors should incorporate the concept of disease tolerance in their work, cite the relevant literature on the topic and discuss it their findings in light of the published evidence for metabolic alteration sand adaptations in sepsis.

      For the in vitro data, the manuscript would benefit from additional experiments using in vitro infection models.

      In the merge manuscript, the authors provide two different versions of the figures. In one, bar plots are shown without individual data and in the other with scatter blots. All bar plots need to be provided as scatter plots showing individual values.

      The authors should show further serology data for kidney and liver failure etc. as well as further cytokine data such as IL-6 and TNF to better characterize their models.

      Careful revision of the entire manuscript, the figure legends and figures is required. The figure legend should not repeat the methods and materials section. The nomenclature for mouse protein and genes needs to be thoroughly revised.

      L350. The authors write that they dissect the capacity of NAD+ to dampen auto- and alloimmunity. In this work, no data that supports this statement is shown and experiments with autoantigens or alloantigens are not performed.

      L163 The authors describe pyroptosis but in the figure legend call it apoptosis. Specific markers for each cell death should be measured and determined which cell death mechanisms is involved.

      Animal data comes from an infection model and LPS application. The RNAseq data is obtained from cells primed with Pam3CSK4 and subsequently subjected to LPS. It is unclear how the cell culture model reflects the animal model. As such the link between IFN signaling and the bacterial infection/LPS model are not convincing and need to be further elaborated.

      Figure 5: It is unclear how many independent survival experiments were done, how many mice per group were used and whether the difference between groups was statistical significant. This information should be added.

      Further experiments with primary cells from Il10 k.o. and Caspase11 k.o. animals should be provided that support the findings in macrophages.”

      Thank you for taking the time to review our manuscript. We appreciate your insightful comments and valuable feedback regarding our study on the role protective role and underlying mechanisms of NAD+ in septic shock.

      “While the in vivo and in vitro data is interesting and hints towards a crucial role of NAD+ to promote metabolic adaptation in sepsis, the manuscript has shortcomings and would profit from several changes and additional experiments that support the claims.”

      We would like to point out that our current study does not underscore a metabolic adaptation in sepsis but more an immune regulation and a specific blockade of the non-canonical inflammasome signaling machinery.

      “Conceptually, the definition of sepsis is outdated. Sepsis is not SIRS, as in sepsis-2. Sepsis-3 defines sepsis as infection-associated organ dysfunction. This concept needs to be taken into account for the introduction and when describing the potential effects of NAD+ in sepsis. Also, LPS application cannot be considered a sepsis model, since it only recapitulates the consequence of TLR-4 activation. It is a model of endotoxemia. Also, the LPS data does not allow to draw conclusions about bacterial clearance (L135).”

      Our study uses highly lethal doses of E. Coli or LPS. These doses have been shown to result in multiple organ failure (1, 2). For many decades until now an un-numerable number of studies have used LPS as a model of sepsis (3, 4, 5). We have used LPS animal model based on a study published in 2013 by Kayagaki et al. (1), where the authors reported a novel TLR4-independent mechanism but mediated via activate caspase-11. We used the same animal model to demonstrate the specific role of NAD+ in targeting this TLR4-independent mechanism but mediated via activate caspase-11 and underscore NAD+’s mode of protection.

      Moreover, we have not only used LPS but bacterial infection as well using E. Coli. We have also previously published an additional research article demonstrating the protective effect against Listeria Monocytogenes (6). The only model we currently did not use in our current study, is a cecal ligation puncture (CLP) model which is also another common animal model for sepsis.

      Our conclusions regarding bacterial clearance are based not only on LPS results but also based on the bacterial load measurement and survival (Figure 1B&C) following E. Coli administration in different tissues (kidney and liver) and not LPS.

      “The authors state that protective effects by NAD were independent of the host pathogen load. This clearly indicates that NAD confers protection via enhancing a disease tolerance mechanism, potentially via reducing immunopathology. This aspect is not considered by the authors. The authors should incorporate the concept of disease tolerance in their work, cite the relevant literature on the topic and discuss it their findings in light of the published evidence for metabolic alteration sand adaptations in sepsis.”

      We respectfully disagree with the reviewer’s comment and do not believe that NAD+ enhances disease tolerance. We have supporting data indicating that NAD+ mediates protection via a specific blockade of the non-canonical inflammasome pathway, which prevents an over-zealous immune response that results in organ damage and multiple organ failure (MOF). Moreover, we demonstrate that not only NAD+ mediates protection via a specific blockade of the non-canonical inflammasome pathway but prevents septic shock induced death by an additional immunosuppression mediated by the systemic production of IL-10.

      Both Caspase-11 and IL-10 pathways are crucial in NAD+ mediated protection against lethal doses of E. Coli and LPS administration. Figure 5A indicates that caspase-11-/- mice treated with PBS have a modest survival rate (~40% survival) when compared to the group of mice treated with NAD+ (>80% survival). These data indicate that NAD+ promotes survival via a caspase-11independent mechanism. Similarly, wild type mice subjected to NAD+ administration exhibited >80% survival, while NAD+ administration to IL-10-/- mice resulted only in a 40% survival rate. Based on these findings, we believe that NAD+ mediated protection against septic shock via a blockade of caspase-11 blockade and by IL-10 cytokine production that dampened the overzealous immune response rather than a disease tolerance.

      “For the in vitro data, the manuscript would benefit from additional experiments using in vitro infection models.”

      In the current study we have used two in vivo models using LPS and E. Coli a gram-negative bacterium. We have also previously reported the protective role of NAD+ in the context of Listeria Monocytogenes (6) a gram-positive bacterium. In the current study, our aim was to demonstrate the inhibitory role of NAD+ on the non-canonical pathway specifically. We believe that additional in vitro experiments for this study are out of scope.

      “In the merge manuscript, the authors provide two different versions of the figures. In one, bar plots are shown without individual data and in the other with scatter blots. All bar plots need to be provided as scatter plots showing individual values.”

      As requested by reviewer #2 all bar plots are now provided as scatter plots showing individual values.

      “The authors should show further serology data for kidney and liver failure etc. as well as further cytokine data such as IL-6 and TNF to better characterize their models.”

      We did not perform further serology analysis, but we did measure IL-6 and TNFα in mice treated with NAD+ or PBS. Mice treated with NAD+ had a reduced systemic level of both cytokines IL-6 and TNFα. We have now added the figures (Figure 1F). In addition, we performed a long-term survival, and all mice treated with NAD+ recovered fully after 10 days and survived over a year after infection. In addition, the mice that survived following NAD+ treatment died of old age.

      “Careful revision of the entire manuscript, the figure legends and figures is required. The figure legend should not repeat the methods and materials section. The nomenclature for mouse protein and genes needs to be thoroughly revised.”

      A Careful revision of the entire manuscript has been performed.

      “L350. The authors write that they dissect the capacity of NAD+ to dampen auto- and alloimmunity. In this work, no data that supports this statement is shown and experiments with autoantigens or alloantigens are not performed.”

      We thank the reviewer for this comment. We have now re-phrased our last sentence in the discussion and included references for our previous work. We have now stated:” We have previously reported that NAD+ administration can block auto- (7) and allo-immunity (8) via IL10 cytokine production. Here, we unveiled the capacity of NAD+ to protect against sepsisinduced death via a specific blockade of the non-canonical inflammasome pathway and a robust immunosuppression mediated by IL-10 cytokine production.

      L163 The authors describe pyroptosis but in the figure legend call it apoptosis. Specific markers for each cell death should be measured and determined which cell death mechanisms is involved.

      We thank the reviewer for this comment. We have focuses on pyoptosis-mediated cell death and not apoptosis. We have now replaced the term “apoptosis” by “pyroptosis-mediated to cell death”.

      “Animal data comes from an infection model and LPS application. The RNAseq data is obtained from cells primed with Pam3CSK4 and subsequently subjected to LPS. It is unclear how the cell culture model reflects the animal model. As such the link between IFN signaling and the bacterial infection/LPS model are not convincing and need to be further elaborated.”

      Our findings, depicted in Figure 3, pertain exclusively to in vitro investigations rather than in vivo examinations. Our research has demonstrated the selective inhibition of the non-canonical inflammasome pathway by NAD+, with a primary focus on unraveling the specific signaling pathway influenced by NAD+. Our in vitro outcomes indicate that the introduction of recombinant IFN-β counteracted the inhibitory effect of NAD+ on the non-canonical pathway. However, it's important to note that we have not evaluated the IFN-β pathway within our E. Coli and LPS in vivo models. Our primary intention was to exclusively decipher the roles of IFN-β and NAD+ in the context of inhibiting the non-canonical inflammasome, without extending our investigation to the broader in vivo scenarios.

      “Figure 5: It is unclear how many independent survival experiments were done, how many mice per group were used and whether the difference between groups was statistical significant. This information should be added.”

      We have now included the number of experiments, p values and number of animals used in Figure 5.

      “Further experiments with primary cells from Il10 k.o. and Caspase11 k.o. animals should be provided that support the findings in macrophages.”

      We concur with the reviewer's suggestion regarding the need for further experiments involving primary cells from IL-10-/- and Caspase-11-/- mice. However, we are uncertain about the potential contribution of these experiments in generating novel or supplementary findings to the existing study.

      Recommendations For The Authors:

      Besides the comments made in the public section, there are further issues that need to be considered by the authors.

      “It is unclear what signifies „impressive, L106" or „dramatic, L257"”

      “impressive” meant that we were surprised by the results since to the best of our knowledge prior this study there exists no report/study claiming such survival (>80%) following such high dose of E. Coli. In this aspect protective effects of NAD+ are unique. “dramatic” We (8) and others (9, 10) have previously used this term to describe a robust increase of cytokine production.

      “L116. The authors describe „symptoms". It should be clarified what symptoms they observed and the data should be shown. If only temperature is available, then this should be said. It would be interesting to see effects of NAD+ on the glucose levels of the animals during sepsis.”

      We thank the reviewer’s comment. We have measured only temperature. We believe that glucose level is beyond the scope of this study.

      “L29. Sepsis is not restricted to bacterial and viral pathogens. Also fungi and protozoa can cause sepsis.”

      We have now included fungi and protozoa.

      “Suppl.Fig.1. A scale should be added.”

      Scale has been added

      “L822. Lethal dose of LPS would mean that this was lethal for all mice. However, the data suggests that NAD+ treated animals would not have died. This should be clarified.”

      Here we meant lethal dose in absence of NAD+ treatment. Our study focuses on the protective role of NAD+ in a lethal context (bacterial and LPS).

      “L823/824. The part of the sentence: ... IHC was performed staining for H&E.. is incomplete.”

      We thank the reviewer’s comment. We have re-phrased our sentence.

      “L804. IL-10 is not a pathway. This should be revised.”

      We have replaced “pathway” by” mechanism”.

      “The graphical abstract should be the last figure summarizing all findings.”

      Figure 4 isn't the final illustration, as it doesn't encompass an overarching graphical summary of our discoveries. Instead, it exclusively highlights the findings related to NAD+'s impact on noncanonical inflammasome inhibition. Notably, this figure omits NAD+-mediated IL-10 cytokine generation and its crucial role in mitigating septic shock.

      “The authors report that they used a dosage of 54mg/kg LPS (l.502). This is a rather unusual concentration. How was this determined?”

      This was initially based on the first study reporting the role of casapase-11 in septic shock induced death published in 2013 by Kayagaki et al. (1). Many other have used this dosage for septic shock induced death animal model (11, 12, 13).

      References:

      1. Kayagaki N, et al. Noncanonical inflammasome activation by intracellular LPS independ ent of TLR4. Science 341, 1246‐1249 (2013).

      2. Qin, X., Jiang, X., Jiang, X. et al. Micheliolide inhibits LPS-induced inflammatory response and protects mice from LPS challenge. Sci Rep 6, 23240 (2016).

      3. Li Z, Qu W, Zhang D, Sun Y, Shang D. The antimicrobial peptide chensinin-1b alleviates the inflammatory response by targeting the TLR4/NF-κB signaling pathway and inhibits Pseudomonas aeruginosa infection and LPS-mediated sepsis. Biomed Pharmacother. 2023 Aug 1; 165:115227.

      4. Ramani V, Madhusoodhanan R, Kosanke S, Awasthi S. A TLR4-interacting SPA4 peptide inhibits LPS-induced lung inflammation. Innate Immun. 2013 Dec;19(6):596610.

      5. Zhang Y, Lu Y, Ma L, Cao X, Xiao J, Chen J, Jiao S, Gao Y, Liu C, Duan Z, Li D, He Y, Wei B, Wang H. Activation of vascular endothelial growth factor receptor-3 in macrophages restrains TLR4-NF-κB signaling and protects against endotoxin shock. Immunity. 2014 Apr 17;40(4):501-14.

      6. Rodriguez Cetina Biefer H, Heinbokel T, Uehara H, Camacho V, Minami K, Nian Y, Koduru S, El Fatimy R, Ghiran I, Trachtenberg AJ, de la Fuente MA, Azuma H, Akbari O, Tullius SG, Vasudevan A, Elkhal A. Mast cells regulate CD4+ T-cell differentiation in the absence of antigen presentation. J Allergy Clin Immunol. 2018 Dec;142(6):18941908.e7.

      7. Tullius SG, Biefer HR, Li S, Trachtenberg AJ, Edtinger K, Quante M, Krenzien F, Uehara H, Yang X, Kissick HT, Kuo WP, Ghiran I, de la Fuente MA, Arredouani MS, Camacho V, Tigges JC, Toxavidis V, El Fatimy R, Smith BD, Vasudevan A, ElKhal A. NAD+ protects against EAE by regulating CD4+ T-cell differentiation. Nat Commun. 2014 Oct 7;5:5101.

      8. Elkhal A, et al. NAD(+) regulates Treg cell fate and promotes allograft survival via a systemic IL‐10 production that is CD4(+) CD25(+) Foxp3(+) T cells independent. Sci Rep 6, 22325 (2016).

      9. Natalia Garcia-Becerra, Marco Ulises Aguila-Estrada, Luis Arturo Palafox-Mariscal, Georgina Hernandez-Flores, Adriana Aguilar-Lemarroy, Luis Felipe Jave-Suarez, FOXP3 Isoforms Expression in Cervical Cancer: Evidence about the Cancer-Related Properties of FOXP3Δ2Δ7 in Keratinocytes, Cancers, 15, 2, (347), (2023).

      10. Estelle Bettelli, Maryam Dastrange, Mohamed Oukka. Foxp3 interacts with nuclear factor of activated T cells and NF-κB to repress cytokine gene expression and effector functions of T helper cells. Proceedings of the National Academy of Sciences. 2005.102; 14; 5138-5143.

      11. Han Gyung Kim, Chaeyoung Lee, Ji Hye Yoon, Ji Hye Kim, Jae Youl Cho,BN82002 alleviated tissue damage of septic mice by reducing inflammatory response through inhibiting AKT2/NF-κB signaling pathway,Biomedicine & Pharmacotherapy,Volume 148,2022,112740.

      12. Tao Q, Zhang Z-D, Qin Z, Liu X-W, Li S-H, Bai L-X, Ge W-B, Li J-Y and Yang Y-J (2022) Aspirin eugenol ester alleviates lipopolysaccharide-induced acute lung injury in rats while stabilizing serum metabolites levels. Front. Immunol. 13:939106.

      13. Chen, N, Ou, Z, Zhang, W, Zhu, X, Li, P, Gong, J. Cathepsin B regulate non-canonical NLRP3 inflammasome pathway by modulating activation of caspase-11 in Kupffer cells. Cell Prolif. 2018; 51:e12487.

    1. Joint Public Review:

      This work by Liu CSC et al. is an extension of the author's previous work on the role of Piezo1 mechano-sensor in human T cell activation. In this study, the authors address whether Piezo1 plays a role in T-cell chemotactic migration.

      The authors used CD4+ T cells or Jurkat T cells to test the effects of siRNA-mediated depletion of Piezo1 on chemotactic migration. They establish that Piezo1 is implicated in chemotactic migration, although the effects of depletion are relatively moderate.

      They show that Piezo1 is redistributed to the leading edge of T-cells.

      They identify that relocation of Piezo1 to the leading edge follows an increase in membrane tension.

      In Piezo-1 depleted cells, they observe a moderate reduction of LFA-1 polarity. With the use of specific inhibitors, they propose Piezo1 activation to be downstream of focal adhesion formation and upstream of calpain-mediated LFA-1, integrin alpha L beta 2, or CD11a/CD18 recruitment at the leading edge.

      Strengths:

      Together with their 2018 paper, this study presents Pieszo1 as a regulator of T-cell activation, implicating it as a player in the coordination of the chemotactic immune response.

      Weaknesses:<br /> Most of the effects observed are relatively modest. The authors did not challenge the cells with various physico-mechanical conditions to see when Piezo-1 might become really important. For instance, there are no experiments that expose T cells to varying counter-acting forces to see how piezo1 might affect migration.

      Technical weaknesses:

      The authors state that "these high tension edges are usually further emphasized at later time points", but after ten minutes the median tension and tension (Figure 2C and Supplementary Figure 2C respectively) reduce down to the pretreatment time point. It would be clearer if the author stated within which timeframe the tension edges are "further emphasized".

      Figures 3 and 4 - The author states the number of cells quantified from the images, but it is not clear whether the data is actually from 3 biological replicates.

      Some of the data has no representative images or videos included. There is no video in the supplementary for Figures 1 A and B. There are no representative images of transwell migration assay in Figures 1 D and E.

  4. Dec 2023
    1. Author Response

      The following is the authors’ response to the original reviews.

      Note to all Reviewers

      We appreciate the reviewers’ comments and suggestions for improving the manuscript. Below is a summary of new data added and a brief description of the major new results. A detailed pointby-point response follows.

      New data:

      • Figure 1f

      • Figure 2b, f, g

      • Figure 4b

      • Figure S7 • Figure S8

      • Figure S9

      Summary of major new results/edits:

      • At the request of Reviewer #1 we have updated the name of the degradation tag to be more specific and we now call it the “LOVdeg” tag.

      • We have added new controls demonstrating that light stimulation does not cause photobleaching or toxicity issues (Fig. S7).

      • We now show that LOVdeg can function at various points in the growth cycle, demonstrating robust degradation (Fig. 1f, Fig. S8).

      • We have included relevant controls for the AcrB-LOVdeg efflux pump results (Fig. 2f-g).

      • We have included important benchmarking controls, such as an EL222-only control and SsrA tag control to provide a clearer view of how LOVdeg performance compares to other systems (Fig. S9, Fig. 4b).

      Additional note:

      • While repeating experiments during the revision process we found that the results for the combined action of EL222 and the LOVdeg tag were not as dramatic as in our original measurements, though the overall findings are consistent with our original results. Specifically, we still find that the combination of EL222 and the LOVdeg tag produces a lower signal than either on their own. We have updated these data in the revised manuscript (Fig. 4b).

      Reviewer #1:

      Public Review:

      Specifically controlling the level of proteins in bacteria is an important tool for many aspects of microbiology, from basic research to protein production. While there are several established methods for regulating transcription or translation of proteins with light, optogenetic protein degradation has so far not been established in bacteria. In this paper, the authors present a degradation sequence, which they name "LOVtag", based on iLID, a modified version of the blue-light-responsive LOV2 domain of Avena sativa phototropin I (AsLOV2). The authors reasoned that by removing the three C-terminal amino acids of iLID, the modified protein ends in "-E-A-A", similar to the "-L-A-A" C-terminus of the widely used SsrA degradation tag. The authors further speculated that, given the light-induced unfolding of the C-terminal domain of iLID and similar proteins, the "-E-A-A" C-terminus would become more accessible and, in turn, the protein would be more efficiently degraded in blue light than in the dark.

      Indeed, several tested proteins tagged with the "LOVtag" show clearly lower cellular levels in blue light than in the dark. While the system works efficiently with mCherry (10-20x lower levels upon illumination), the effect is rather modest (2-3x lower levels) in most other cases. Accordingly, the authors propose to use their system in combination with other light-controlled expression systems and provide data validating this approach. Unfortunately, despite the claim that the "LOVtag" should work faster than optogenetic systems controlling transcription or translation of protein, the degradation kinetics are not consistently shown; in the one case where this is done, the response time and overall efficiency are similar or slightly worse than for EL222, an optogenetic expression system.

      The manuscript and the figures are generally very well-composed and follow a clear structure. The schematics nicely explain the underlying principles. However, limitations of the method in its main proposed area of use, protein production, should be highlighted more clearly, e.g., (i) the need to attach a C-terminal tag of considerable size to the protein of interest, (ii) the limited efficiency (slightly less efficient and slower than EL222, a light-dependent transcriptional control mechanism), and (iii) the incompletely understood prerequisites for its application. In addition, several important controls and measurements of the characteristics of the systems, such as the degradation kinetics, would need to be shown to allow a comparison of the system with established approaches. The current version also contains several minor mistakes in the figures.

      We thank reviewer #1 for the feedback and suggestions to strengthen the manuscript. We have addressed these comments in the points that follow and now include important controls and benchmarks for our molecular tool.

      Major points

      1. The quite generic name "LOVtag" may be misleading, as there are many LOV-based tags for different purposes.

      We appreciate that it would be beneficial to have a more specific name. We have updated the name to “LOVdeg” tag, which captures both the inclusion of LOV and the degradation function of the tag.

      Updated throughout the manuscript and figures

      1. Throughout the manuscript, the authors use "expression levels". As protein degradation is a post-expression mechanism, "protein levels" should be used instead.

      We have transitioned to using “protein levels” at many points in the manuscript.

      Updated throughout the manuscript

      1. Degradation dynamics (time course experiments) should be shown. The only time this is done in the current version (in Fig. 4), degradation appears to be in the same range (even a bit slower) than for EL222, which does not support the claim that the "LOVtag" acts faster than other optogenetic systems controlling protein levels.

      In the revised manuscript, time course data are now shown at multiple points. These include new data in Fig. 1f and Fig. S8 that demonstrate degradation at various stages of growth. Fig. S4 also shows the dynamics of degradation when comparing to the addition of exogenously expressed ClpA. We have added text in the results section to point the reader to these data. In addition, we have made minor modifications to the text in the Introduction to avoid making claims about speed comparisons. Fig. 1f, Fig. S8, Fig. S4

      Results: Design and characterization of the AsLOV2-based degradation tag, Introduction

      1. "Frequency" is used incorrectly for Fig. 3. A series of 5 seconds on, 5 seconds off corresponds to a frequency of 0.1 Hz (1 illumination round / 10 s), not of 0.5 Hz. What the authors indicate as "frequency" is the fraction of illumination time. However, the (correct) frequency should be given, as this is likely the more important factor.

      We have changed how we calculate frequency to use the proposed definition of one pulse per time period. We updated the values in the text and in the figure. Fig. 3c

      Results: Tuning frequency response of the LOVdeg tag

      1. To properly evaluate the system, several additional controls are needed:

      a. To test for photobleaching of mCherry by blue light illumination, untagged controls should be shown for the mCherry-based experiments. Fluorescence always seems to be lower upon illumination, except for the AsLOV2*(546) data, where it cannot be excluded that fluorescence readings are saturated. Relatedly, the raw data for OD and fluorescence should be included. Showing a Western blot against mCherry in at least one case would allow to separate the effects of photobleaching and degradation.

      We appreciate the suggestion and have conducted these important controls. We now include new data demonstrating that light induction does not change fluorescence levels using an untagged mCherry control, nor does it significantly affect endpoint OD levels. Based on these results, we did not perform a Western blot because there were no effects to separate. Fig. S7

      b. In Fig. 2b, light + IPTG should be shown to estimate the activity of the system at higher expression levels.

      We have added these to the figure. Light + IPTG modestly increases expression compared to IPTG only, likely due to the saturating level of IPTG added, which achieves near full induction. Fig. 2b

      c. In Fig. 4, EL222 alone should be shown to allow a comparison with the LOVtag. From the data presented, it looks like EL222 is both slightly faster and more efficient than the LOVtag.

      We have added the EL222-only case for comparison with LOVdeg only and EL222 + LOVdeg. We note that Reviewer #3 raised a similar concern. Fig. 4b

      d. The effect of the used light on bacterial viability under exponential and stationary conditions should be shown.

      In this revision, we have added new data on light exposure at various points during exponential and stationary phase (Fig. 1f, Fig. S8). These OD data show that growth curves are similar for all cultures, regardless of the time light is applied during the growth phase. Additionally, we also now include ODs for the photobleaching experiments. These data also show that growth is not significantly altered under continuous light exposure. Figure 1f, Fig. S7b

      1. The claim that "Post-translational control of protein function typically requires extensive protein engineering for each use case" is not correct. The authors should discuss alternative options, e.g. based on dimerization, more extensively and in a less biased manner.

      We have toned down the language in this location and at other points in the manuscript. However, we maintain that other types of post-translational control, such as dimerization or LOV2 domain insertion, require more protein engineering than inserting a degradation tag. For example, we and others have directly demonstrated this in previous work (e.g. DOI: 10.1021/acssynbio.9b00395, 10.1101/2023.05.26.542511, 10.1038/s41467-023-38993-6), where numerous split site or insertion variants need to be screened and fine-tuned for successful light control. In contrast, a degradation mechanism has the potential to require less fine tuning to achieve a light response. We have included the above sources to clarify this point. Introduction, Results: Modularity of the LOVdeg tag

      Minor points

      1. In Suppl. Fig. 1, amino acid numbers seem to be off. Also, the alterations in iLID (compared to AsLOV2) that are not used in "LOVtag" appear to be missing and the iLID sequence incorrect, as a consequence.

      Thank you for catching this. The number indices in Fig. S1 have been corrected. We also realized we were reporting the iLID(C530M) variant in our amino acid sequence and have reverted the 530M back to C. Fig. S1

      1. Why is AsLOV2(543) more efficiently degraded than AsLOV2(543) (blue column in Fig. 1d) when the dark state should be stabilized in AsLOV2(543)?

      We are not sure of the exact reason for the increased degradation response in the AsLOV2*(543) variant. It may be that the dark-state stabilizing mutations introduced also have more favorable interactions with degradation machinery, although this is highly speculative.

      1. Why does the addition of EL222 reduce protein levels so strongly in the dark for CpFatB1* (Fig. 5)?

      We believe this effect stems from the EL222 responsive promoter (PEL222). With LOVdeg only, CpFatB1* is expressed from an IPTG inducible promoter (PlacUV5) whereas EL222 responsive constructs necessitate a promoter switch containing an EL222 binding site. We have clarified this point and expanded our discussion of these results.

      Results: Optogenetic control of octanoic acid production

      1. Fig. 2f / S10 are difficult to interpret. Why does illumination only lead to a significant effect at 2.5 and 5 µg/ml and not at lower concentrations, where the degradation system would be expected to be most efficient?

      We have expanded our discussion on these results to explain that this likely stems from basal protein levels of AcrB-LOVdeg in the light that can provide resistance at low antibiotic concentrations. We have also added new controls to this figure to show the chloramphenicol sensitivity of a ΔacrB strain and a ΔacrB strain with an IPTG-inducible version of acrB with no induction, demonstrating the lowest achievable chloramphenicol resistance from a standard inducible system.

      Results: Modularity of the LOVdeg tag, Fig. 2f-g

      1. Fig. 2f / S10 do not measure the MIC (which is a clearly defined value), but the sensitivity to Chloramphenicol.

      We have changed the text to use the term chloramphenicol sensitivity instead of MIC. Results: Modularity of the LOVdeg tag

      1. "***" in Fig. S1 should be explained.

      We have removed the ‘***’ to avoid confusion. Fig. S1

      1. The fold-change differences between light and dark, indicated in some selected cases, should be listed for all figures.

      We have added fold-change values where appropriate. Fig 1d, Fig. 2b

      Reviewer #2:

      Public Review:

      In this manuscript the authors present and characterize LOVtag, a modified version of the bluelight sensitive AsLOV2 protein, which functions as a light-inducible degron in Escherichia coli. Light has been shown to be a powerful inducer in biological systems as it is often orthogonal and can be controlled in both space and time. Many optogenetic systems target regulation of transcription, however in this manuscript the authors target protein degradation to control protein levels in bacteria. This is an important advance in bacteria, as inducible protein degradation systems in bacteria have lagged behind eukaryotic systems due to protein targeting in bacteria being primarily dependent on primary amino acid sequence and thus more difficult to engineer. In this manuscript, the authors exploit the fact that the J-alpha helix of AsLOV2, which unwinds into a disordered domain in response to blue light, contains an E-A-A amino acid sequence which is very similar to the C-terminal L-A-A sequence in the SsrA tag which is targeted by the unfoldases ClpA and ClpX. They truncate AsLOV2 to create AsLOV2(543) and combine this truncation with a mutation that stabilizes the dark state to generate AsLOV2*(543) which, when fused to the C-terminus of mCherry, confers light-induced degradation. The authors do not verify the mechanism of degradation due to LOVtag, but evidence from deletion mutants contained in the supplemental material hints that there is a ClpA dominated mechanism. They demonstrate modularity of this LOVtag by using it to degrade the LacI repressor, CRISPRa activation through degradation of MCP-SoxS, and the AcrB protein which is part of the AcrAB-TolC multidrug efflux pump. In all cases, measurement of the effect of the LOVtag is indirect as the authors measure reduction in LacI repression, reduction in CRISPRa activation, and drug resistance rather than directly measuring protein levels. Nevertheless the evidence is convincing, although seemingly less effective than in the case of mCherry degradation, although it is hard to compare due to the different endpoints being measured. The authors further modify LOVtag to contain a known photocycle mutation that slows its reversion time in the dark, so that LOVtag is more sensitive to short pulses of light which could be useful in low light conditions or for very light sensitive organisms. They also demonstrate that combining LOVtag with a blue-light transcriptional repression system (EL222) can decrease protein levels an additional 269-fold (relative to 15-fold with LOVtag alone). Finally, the authors apply LOVtag to a metabolic engineering task, namely reducing expression of octanoic acid by regulating the enzyme CpFatB1, an acyl-ACP thioesterase. The authors show that tagging CpFatB1 with LOVtag allows light induced reduction in octanoic acid titer over a 24 hour fermentation. In particular, by comparing control of CpFatB1 with EL222 transcriptional repression alone, LOVtag, or both the authors show that light-induced protein degradation is more effective than light-induced transcriptional repression. The authors suggest that this is because transcriptional repression is not effective when cells are at stationary phase (and thus there is no protein dilution due to cell division), however it is not clear from the available data that the cells were in stationary phase during light exposure. Overall, the authors have generated a modular, light-activated degron tag for use in Escherichia coli that is likely to be a useful tool in the synthetic biology and metabolic engineering toolkit.

      We thank Reviewer #2 for the constructive feedback. In the updated manuscript, we now include data demonstrating degradation at different growth stages and address other points brought up in the review to improve understanding of the degradation tag.

      Overall, the authors present a well written manuscript that characterizes an interesting and likely very useful tool for bacterial synthetic biology and metabolic engineering. I have a few suggestions that could improve the presentation of the material.

      Major Comments:

      • Could the authors clarify, perhaps through OD measurements, that the cultures in the octanoic acid experiment are actually in stationary phase during the relevant light induction. It isn't clear from the methods.

      We have updated the Methods to clarify that the cells are entering stationary phase (OD600 = 0.6) when light is either kept on or turned off for production experiments. Production is continued for the following 24 hours. Note that we now show OD measurements in a separate set of experiments (Fig. 1f, Fig. S8).

      Methods: Octanoic acid production experiment. Fig. 1f, Fig. S8

      • Can the authors clarify why there is an overall decrease in protein in the clpX deletion? And is it this initial reduction that is the source of the change in fold in 1C? Similarly, for hslU is it because overall protein levels are higher with the tag? In general, I feel that the interpretation of Supplemental Figures S6-S10 could be moved in more detail to the main text, or at least the main takeaway points. But this is a personal preference, and not necessary to the major flow of the story which is about the utility of the LOVtag tool.

      As shown in Fig. S5, expression of mCherry without any degradation tag is decreased in a clpX knockout strain compared to wild type. This difference may be the result of reduced cell health, and we now note this in the text. The strains shown in Fig. 1c are in wild type cells with normal expression, so this is not the source of the fold change. As for hslU, we agree it is interesting that expression seems to increase. However, the increase is modest and could stem from gene network regulation differences in that strain compared to wild type and may not be related to LOVdeg tag degradation. Each endogenous protease is involved in a wide range of functions within the cell, and it is unknown how global gene expression is impacted. We acknowledge the suggestion of moving the protease results to the main text, but we have ultimately elected to keep these data in the Supplementary Information to maintain the flow in the manuscript. However, we have added additional text pointing the reader to the Supplemental Text and include a brief summary of the findings in the main text.

      Results: Design and characterization of the AsLOV2-based degradation tag

      • What is the source of the poor repression in Figure 2D?

      Presumably, this stems from low levels of the CRISPRa MCP-SoxS activator, even in the presence of light. We have added this point to the text.

      Results: Modularity of the LOVdeg tag

      • In general, it would be nice to have light-only controls for many of the experiments to validate that light is not affecting the indicated proteins or their function.

      We thank the reviewer for this suggestion and note that Reviewer #1 raised a similar concern. We have now included light-only data for a strain containing IPTG-inducible mCherry without the LOVdeg tag (Fig. S7). These data show that light itself, at the levels used in this study, does not affect mCherry expression or cell growth. This strain serves as a direct control for data presented in Fig. 1 and Fig. 2b, as the systems are identical except for the addition of the LOVdeg tag onto either mCherry or the LacI repressor. Additionally, the control translates to other experiments since mCherry is used as a reporter for other systems in this study. Fig. S7

      • It would be nice to directly measure the function of the tool at different phases of E. coli growth to show directly that protein degradation works at stationary phase, rather than the more indirect measurements used in the octanoic acid experiment.

      We thank the reviewer for this suggestion, which significantly strengthens our results. We have added an experiment that tests the LOVdeg tag at different phases of growth (Fig. 1f, Fig. S8). In this experiment, cultures are growth from early exponential to stationary phase, and light is introduced at various points. Exposure windows of 4 hours, ranging from early exponential to stationary phase, all show functional light inducible degradation. Fig. 1f, Fig. S8.

      Results: Design and characterization of the AsLOV2-based degradation tag

      Minor Comments:

      • It would be nice to make clear that the data in S6d and S7 is repeated, but with the HslUV data in S7.

      We clarified this point in the caption of Fig. S4 (the former Fig. S7 in the original manuscript). Fig. S4 caption

      • Why was 5s picked for the frequency response in Figure 3

      We picked 5s because 1) it is a substantially shorter timescale than overall degradation dynamics seen for the LOVdeg tag, and 2) we found that shorter pulses could not be reliably achieved with the light stimulation hardware and software we used (Light Plate Apparatus with Iris software). To ensure high fidelity pulses, we opted for 5 second pulses that we empirically determined to be stable throughout long experiments. We have added text clarifying this. Results: Tuning frequency response of the LOVdeg tag

      Reviewer #3:

      Public Review:

      The authors present the mechanism, validation, and modular application of LOVtag, a light-responsive protein degradation tag that is processed by the native degradosome of Escherichia coli. Upon exposure to blue light, the c-terminal alpha helix unfolds, essentially marking the protein for degradation. The authors demonstrate the engineered tag is modular across multiple complex regulatory systems, which shows its potential widespread use throughout the synthetic biology field. The step-by-step rational design of identifying the protein that was most dark stabilized as well as most light-responsive for degradation, was useful in terms of understanding the key components of this system. The most compelling data shows that the engineered LOVTag can be fused to multiple proteins and achieve light-based degradation, without affecting the original function of the fused protein; however, results are not benchmarked against similar degradation tagging and optogenetic control constructs. Creating fusion proteins that do not alter either of the original functions, is often difficult to achieve, and the novelty of this should be expanded upon to drive further impact.

      We appreciate the feedback from Reviewer #3 to improve the manuscript. We have included important controls and benchmarking experiments to address the reviewer’s concerns, which are detailed in the points below.

      Benchmarking:

      The similarity between the L-A-A sequence of SsrA and the E-A-A sequence of LOVtag is one of the pieces of evidence that led the authors to their current protein design. The differences in degradation efficiency between the SsrA degradation tag and LOVtag are not shown, and benchmarking against SsrA would be a valuable way to demonstrate the utility of this construct relative to an established protein tagging tool.

      We thank the reviewer for suggesting an experiment to benchmark performance. We have added new experimental data where a full length SsrA tag is added to a fusion protein of nearly identical size (mCherry-iLID), allowing us to directly compare performance to mCherryLOVdeg (Fig. S9). These results show that light inducible control with LOVdeg tag decreases protein expression levels to near those achieved with the native SsrA tag. Fig. S9.

      Results: Design and characterization of the AsLOV2-based degradation tag

      Additionally, there is a lack of an EL222-only control presented in Figure 4b and in the results section beginning with "Integrating the LOVtag with EL222...". Without benchmarking against this control the claim that "EL222 and the LOVtag work coherently to decrease expression" is unsubstantiated. No assumptions of synergy can be made.

      We appreciate this comment and note that Reviewer #1 raised a similar concern. We have added data to Fig. 4b with an EL222-only control for comparison. Fig. 4b

      The dramatic change in dark octanoic acid titer between the EL222, LOVtag and combined conditions are surprising, especially in comparison to the lack of change in the dark mCherry expression shown in Figure 4b. This data is the only to suggest that LOVtag may perform better than EL222. However, the inconsistencies in dark state regulation presented in the two experiments, and between conditions in this experiment bring the latter claim to question. A recommendation is that the authors either repeat this experiment, or comment on the observed discrepancy in dark state octanoic acid titers in their discussion.

      First, a key difference between the data presented in Fig. 4 and Fig. 5 is that the production experiment is conducted over a long time period (24 hours) and the EL222/LOVdeg reporter experiment is conducted over 5 hours. Likely, performance differences between EL222 and the LOVdeg tag become more pronounced as protein accumulation occurs. Second, the LOVdeg only construct is expressed from a non-EL222 promoter which is able to achieve higher expression (see response to Reviewer #1, Minor point #3). Lastly, a convoluting factor is that the relationship between expression of CpFatB1 and octanoic acid production is not completely linear, and there are likely thresholds or expressions windows that result in similar endpoint titers. We agree a more detailed examination of how CpFatB1 changes over the course of the production period would be very interesting. However, this is beyond the scope of the present study, whose goal is to introduce and showcase the utility of the LOVdeg tag as a tool. We have added new discussion on this in the Results section to clarify some of these points. We have also repeated all experiments in Fig. 4 and consistently see the LOVdeg tag performing as well as or better than EL222. As noted in the remarks to all reviewers, these data have been updated in the revised manuscript.

      Results: Optogenetic control of octanoic acid production. Fig. 4d

      Based on the methodology presented, no change in the duration in light exposure was tested, even though this may be an important part of the system response. The on/off, for example in Figure 4b, is either all light or all dark, but they claim that their system is beneficial especially at stationary phase. The authors should consider showing the effects of shifting from dark to light at set intervals. (i.e. 1 hr dark then light, 2hr dark until light, etc.) This data would also aid in supporting the utility of this tag for controlling expression during different growth phases, where light may be used after the cells have reached a certain phase.

      We have added new data showing the effect of light stimulation at different times in the growth cycle (see response to Reviewer #2, bullet point #5). These data demonstrate that the LOVdeg tag performs well at various points in the growth cycle. Fig. 1f, Fig. S8.

      Results: Design and characterization of the AsLOV2-based degradation tag

      Minor Revisions Figures:

      • Figure 1:

      • More clarity is needed in the naming conventions for this figure and in the body of the text. For example, a different convention than 546 and 543 should be used to refer to the full and truncated lengths of the tag. It would greatly aid understanding for this to be made more clear. The authors could simply continue to use "full" and "truncated" to refer to them. In addition, the term "stabilizing mutations" in 1c could be changed to read "dark state stabilizing mutations" to aid in clarity.

      When describing the design of the LOVdeg tag, we opted towards a more technically accurate description over clarity in order to make our engineering process easily comparable to other LOV2 systems. As such, we kept the number-based nomenclature (543 or 546) to represent the domain within the phototropin 1 protein from Avena sativa (AsLOV2). The domain used in this study, and many other studies, are only amino acids 404-546, i.e. not the full sequence, thus saying simply ‘full’ or ‘truncated’ is not technically accurate. We believe the detailed nomenclature, which is limited to one section, is important to provide clarity on exactly what we used for protein engineering. In the revised version we introduce the nickname “LOVdeg” tag earlier and use it throughout the rest of the manuscript.

      Results: Design and characterization of the AsLOV2-based degradation tag

      • 1b It is not clear that this is the dark state stabilized structure in the figure, but is referred to as such only in the body of the text.

      We have added text in the manuscript to clarify this is AsLOV2, not iLID, and have labeled it in the figure caption as well.

      Results: Design and characterization of the AsLOV2-based degradation tag

      • 1d. Fold change is reported in Figure 2d, and may be relevant to include those values in 1d as well.

      Done. Fig. 1d

      • 1e. It is not clear which tag is being used in this bar plot. Please specify that this is the dark state stabilized, truncated tag.

      We have added a title to the plot and language to the caption, both of which clarify this point. Fig. 1e

      • In addition, the microscopy images provided in supplemental material should be included in the first figure as it adds a compelling observation of LOVtag activity.

      We are pleased to hear that the microscopy results are beneficial, however we elected to leave them in Supplementary to preserve the flow of the manuscript in the text surrounding Fig. 1.

      • Figure 2:

      • 2d. It is unclear what the 2.5x fold change is relative to (the baseline or the dark)

      We have added a line in the figure to clarify the comparison being made. Fig. 2d

      • 2f. More discussion can be added to describe what concentration of chloramphenicol is biologically/bioreactor relevant.

      Our previous studies on the relationship between AcrAB expression and mutation rate (cited in the text) were carried out at a concentration within the range in which the LOVdeg tag is effective (5 μg/ml), suggesting this range to be relevant to tolerance and resistance.

      • Figure 3:

      • We recommend that this data and discussion are better suited for supplementary figures. The results shown here essentially recapitulate the same findings of Zoltowski et al., 2009. In addition, the paper describing this mutation should be cited in this figure caption in addition to the body of the text

      Although these results are in line with previous findings, we believe this dataset is important for several reasons. First, the agreement with known mutations validates the unfolding-based mechanism for degradation control. Second, degradation that is contingent on unfolding of LOV2 offers a direct actuating mechanism of photocycle properties. Other systems, like that in Zoltowski et al., examine properties of purified proteins but lack the mechanism to translate its effect in live cells. This figure demonstrates how degradation can do so and lays the groundwork for degradation-based frequency processing circuits. Last, there are discrepancies between photocycle kinetics in situ, as reported by Li et al. (DOI: 10.1038/s41467-020-18816-8), and in cell-free studies such as in Zoltowski et al. The studies use different methods of measuring photocycle kinetics (in situ vs cell-free). This dataset substantiates relaxation times from Li et al. and suggests cell-free relaxation time constants are over estimated relative to our live cell results.

      • Figure 4:

      • There is a lack of an EL222-only control presented in Figure 4b. Without this data present, the claim that "EL222 and the LOVtag work coherently to decrease expression" is unsubstantiated. No assumptions of synergy can be made.

      We have added EL222-only data to the figure; we note that Reviewer #1 made a similar request. Figure 4b

      Manuscript

      Results

      • Design and characterization...

      • Due to the extensive discussion of ClpX at the beginning of this section, more of the results on evaluating the binding partners and mechanism of LOVtag degradation should be presented in the main body of the manuscript and not in supplementary materials.

      To maintain flow of the manuscript and focus on how the LOVdeg tag works as a synthetic biology tool, we have opted to keep this section in the Supplement Information, but have several lines in the text related to Fig. 1 that point the reader to this material. Results: Design and characterization of the AsLOV2-based degradation tag

      • In the second paragraph of this section, the authors theorize that the C-terminal truncated E-AA sequence will "remain caged as part of the folded helix". How did the authors determine this? Was there any evidence to suggest that the truncated state would be any more responsive than the full length sequence? More data or rationale may need to be introduced to support the overall hypothesis presented in this paragraph.

      We determined this by examining the crystal structure which shows that the E-A-A sequence is part of the folded helix. As seen in Fig. 1b, addition of amino acids after the EAAKEL sequence would not be part of the folded helix which ends prior to the terminal leucine. We added text to clarify our logic.

      Results: Design and characterization of the AsLOV2-based degradation tag

      • The similarity between the L-A-A sequence of SsrA and the E-A-A sequence of LOVtag is one of the pieces of evidence that brought the authors to their current protein design. The differences in degradation efficiency between the SsrA degradation tag and LOVtag are not clear, and benchmarking against SsrA would be a valuable way to demonstrate the utility of this construct relative to an established protein tagging tool.

      We added an SsrA comparison to benchmark the system. Fig. S9

      Results: Design and characterization of the AsLOV2-based degradation tag

      • Tuning frequency and response...

      • Overall the results presented in this section essentially recapitulate the effects that mutation presented in Zoltowski et. al., 2009 have on AsLOV2 dark state recovery and although this is a useful observation of LOVtag performance, a recommendation is to move this into a supplementary section.

      See above response to Fig. 3 comment.

      • Integrating the LOVtag with EL222...

      • The claim is made in this section that LOVtag and EL222 work synergistically, however the experiments presented do not test repression due to EL222 activity alone. Without benchmarking against this control, the claim of synergy is not supported and we recommend that the authors perform this experiment again with the EL222-only control.

      We have added this important control. Fig. 4b

      Discussion

      • The statement "the LOVtag can easily be integrated with existing optogenetic systems to enhance their function" is not substantiated without benchmarking LOVtag against an EL222- only control. As mentioned above this condition should be included in the experiments discussed in Figure 4 and in the section "Integrating the LOVtag with EL222.."

      We added EL222-only regulation to benchmark the LOVdeg tag and LOVdeg + EL222 experiments. Fig. 4b

      Experiments

      Applications:

      The application of this tag to the metabolic control of octanoic acid production could be more impactful. For instance, using the LOVtag with two different enzymes to change the composition of long/short chain fatty acids with light induction., Or possibly integrating the tag into a switch to activate production. However, the authors address that "decreasing titers is not the overall goal in metabolic engineering" in their discussion, and therefore the pursuit of this additional experiment is up to the authors' discretion.

      We appreciate the suggestions for further applications of the LOVdeg tag. We envision that follow up studies will focus on the application of the LOVdeg tag in metabolic engineering. However, this will require significant development of production systems. We believe this to be out of the scope of this work, where the goal is to present the design and function of the LOVdeg tag as a tool.

    1. Reviewer #1 (Public Review):

      The idea is that inversions capture genetic variants that have antagonistic effects on male sexual success (via some display traits) and survival of females (or both sexes) until reproduction. A series of simulations are presented and show that the scenario works at least under some conditions. While a polymorphism at a single locus with large antagonistic effects can be maintained for a certain range of parameters, a second such variant with somewhat smaller effects tends to be lost unless closely linked. It becomes much more likely for genomically distant variants that add to the antagonism to spread if they get trapped in an inversion; the model predicts this should drive the accumulation of sexually antagonistic variants on the inversion versus standard haplotype, leading to the evolution of haplotypes with very strong cumulative antagonistic pleiotropic effects. This idea has some analogies with one of the predominant hypotheses for the evolution of sex chromosomes, and the authors discuss these similarities. To provide empirical support for this idea, the authors study the dynamics of inversions in population cages over one generation, tracking their frequencies through amplicon sequencing, from the parental generation through embryos to aged adults of either sex. Out of four inversions included in the experiment, two show patterns consistent with antagonistic effects on male sexual success (competitive paternity) and the survival of offspring, especially females, until old age, which the authors interpret as consistent with their theory.

      This is an interesting idea, and the authors should be praised for combining a model with experimental data. However, in addition to the potential for improvement of presentation (details below), the study has some substantial weaknesses that could be addressed with additional simulations and additional experiments.

      (1) The authors claim that the negative frequency dependence that maintains polymorphism in their model results from a non-linear relationship between the display trait and sexual success. I am not convinced about that. It seems to me that the "best of n" female choice implemented in the model (l. 741ff and Figure 2) does not lead to negative frequency dependence. Let p be the frequency of the competitively inferior male genotype. Assuming no noise in the male display, a female will mate with an inferior male only if all males among the n males sampled by the female are of the inferior genotype, which will be the fraction p^n, the remaining 1-p^n matings will go to the superior males. Thus, per capita, the inferior males will achieve (p^n)/p or p^(n-1) matings while the per-capita matings per superior male will be (1-p^n)/(1-p). Thus, the ratio of the mating success of the inferior to the superior males will be (1-p) p^(n-1) / (1- p^n). For the range of p from 0 to 1, this is an increasing function of p. E.g., with n = 2, the sexual fitness of the inferior genotype relative to that of the superior phenotype is p/(1+p). Thus, at least in the absence of noise in the mate choice, this generates positive rather than negative frequency dependence. Maybe I missed something, but the authors do not provide support for their claim about the negative frequency-dependence of sexual selection in their simulations. To do so they could (1) extract the relationship between the relative mating success of the two male types from the simulations and (2) demonstrate that polymorphism is not maintained if the relationship between male display trait and mating success is linear.

      (2) The authors only explore versions of the model where the survival costs are paid by females or by both sexes. We do not know if polymorphism would be maintained or not if the survival cost only affected males, and thus if sexual antagonism is crucial.

      (3) The authors assume no cost to aneuploidy, with no justification. Biologically, investment in aneuploid eggs would not be recoverable by Drosophila females and thus would potentially act against inversions when they are rare.

      (4) The authors appear to define balanced polymorphism as a situation in which the average allele frequency from multiple simulation runs is intermediate between zero and one (e.g., Figure 3). However, a situation where 50% of simulation runs end up with the fixation of allele A and the rest with the fixation of allele B (average frequency of 0.5) is not a balanced polymorphism. The conditions for balanced polymorphism require that selection favors either variant when it is rare.

      (5) Possibly the most striking result of the experiment is the fact that for 14 out of 16 combinations of inversion x maternal background, the changes in allele frequencies between embryo and adult appear greater in magnitude in females than in males irrespective of the direction of change, being the same in the remaining two combinations. The authors interpret this as consistent with sexually antagonistic pleiotropy in the case of In(3L)Ok and In(3R)K. The frequencies of adult inversion frequencies were, however, measured at the age of 2 months, at which point 80% of flies had died. For all we know, this may have been 90% of females and 70% of males that died at this point. If so, it might well be that the effects of inversion on longevity do not systematically differ between the ages and the difference in Figure 9B results from the fact that the sample includes 30% longest-lived males and 10% longest-lived females.

      (6) Irrespective of the above problem, survival until the age of 2 months is arguably irrelevant from the viewpoint of fitness consequences and thus maintenance of inversion polymorphism in nature. It would seem that trade-offs in egg-to-adult survival (as assumed in the model), female fecundity, and possibly traits such as females resistance to male harm would be much more relevant to the maintenance of inversion polymorphisms.

      (7) The experiment is rather minimalistic in size, with four cages in total; given that each cage contains a different female strain, it essentially means N=1. The lack of replication makes statements like " In(2L)t and In(2R)NS each showed elevated survival with all maternal strains except ZI418N" (l. 493) unsubstantiated because the claimed special effect of ZI418N is based on a single cage subject to genetic drift and sampling error. The same applies to statements on inversion x female background interaction (e.g., l. 550), as this is inseparable from residual variation. It is fortunate that the most interesting effects appear largely consistent across the cages/female backgrounds. Still, I am wondering why more replicates had not been included.

    1. Author Response

      eLife assessment

      This study presents a valuable finding on the distinct subpopulation of adipocytes during brown-to-white conversion in perirenal adipose tissue (PRAT) at different ages. The evidence supporting the claims of the authors is convincing, although specific lineage tracing of this subpopulation of cells and mechanistic studies would expand the work. The work will be of interest to scientists working on adipose and kidney biology.

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary:

      In this manuscript, the authors performed single nucleus RNA-seq for perirenal adipose tissue (PRAT) at different ages. They concluded a distinct subpopulation of adipocytes arises through brown-to-white conversion and can convert to a thermogenic phenotype upon cold exposure.

      Strengths:

      PRAT adipose tissue has been reported as an adipose tissue that undergoes browning. This study confirms that brown-to-white and white-to-beige conversions also exist in PRAT, as previously reported in the subcutaneous adipose tissue.

      We did not observe any white-to-beige conversion in PRAT under regular condition. The adipocyte population that arises from brown-to-white conversion (mPRAT-ad2) can respond to cold and restore their UCP1 expression. However, brown adipocytes that arise from the mPRAT-ad2 subpopulation after cold exposure have a distinct transcriptome to that of cold-induced beige adipocyte in iWAT (Figure S6K) and are more related to iBAT brown adipocytes (Figure 6E).

      Weaknesses:

      1. There is overall a disconnection between single nucleus RNA-seq data and the lineage chasing data. No specific markers of this population have been validated by staining.

      We are not sure what “this population” refers to. We suspect it is the Ucp1-&Cidea+ mPRAT-ad2 adipocyte subpopulation. If so, we did not identify specific markers for these adipocytes as shown in Figure 1H and statement in the Discussion. mPRAT-ad2 is negative for Ucp1 and Cyp2e1, which are markers for mPRAT-ad1 and mPRAT-ad3&4, respectively. Therefore, we plan to stain the mPRAT with Ucp1, Cyp2e1 and Perilipin (a pan adipocyte marker) antibodies. Cells that are Perilipin+&Ucp1-&Cyp2e1- will represent the mPRAT-ad2 subpopulation.

      1. It would be nice to provide more evidence to support the conclusion shown in lines 243 to 245 "These results indicated that new BAs induced by cold exposure were mainly derived from UCP1- adipocytes rather than de novo ASPC differentiation in puPRAT". Pdgfra-negative progenitor cells may also contribute to these new beige adipocytes.

      Our sequencing data and many previous studies (Angueira et al., 2021; Burl et al., 2022; Dong et al., 2022) have shown that Pdgfra is a marker for all ASPCs. We will also check adipocyte labelling pattern of mPRAT in the PdgfraCre;Ai14 mice. If all adipocytes are Tomato+, it suggests that adipocytes in mPRAT are all derived from Pdgfra-expressing cells. Also, the cold-induced adipocytes in mPRAT resemble more to the brown adipocytes of iBAT than the beige adipocytes of iWAT (Figure 6E and S6K).

      Angueira, A.R., Sakers, A.P., Holman, C.D., Cheng, L., Arbocco, M.N., Shamsi, F., Lynes, M.D., Shrestha, R., Okada, C., Batmanov, K., et al. (2021). Defining the lineage of thermogenic perivascular adipose tissue. Nat Metab 3, 469-484. 10.1038/s42255-021-00380-0.

      Burl, R.B., Rondini, E.A., Wei, H., Pique-Regi, R., and Granneman, J.G. (2022). Deconstructing cold-induced brown adipocyte neogenesis in mice. Elife 11. 10.7554/eLife.80167.

      Dong, H., Sun, W., Shen, Y., Balaz, M., Balazova, L., Ding, L., Loffler, M., Hamilton, B., Kloting, N., Bluher, M., et al. (2022). Identification of a regulatory pathway inhibiting adipogenesis via RSPO2. Nat Metab 4, 90-105. 10.1038/s42255-021-00509-1.

      1. The UCP1Cre-ERT2; Ai14 system should be validated by showing Tomato and UCP1 co-staining right after the Tamoxifen treatment.

      We will inject Ucp1CreERT2;Ai14 mice at 1- and 6-month-old of age with tamoxifen and collect one day after the last injection to check the overlap between the Tomato signal and UCP1 immunofluorescent staining.

      Reviewer #2 (Public Review):

      Summary:

      In the present manuscript, Zhang et al utilize single-nuclei RNA-Seq to investigate the heterogeneity of perirenal adipose tissue. The perirenal depot is interesting because it contains both brown and white adipocytes, a subset of which undergo functional "whitening" during early development. While adipocyte thermogenic transdifferentiation has been previously reported, there remain many unanswered questions regarding this phenomenon and the mechanisms by which it is regulated.

      Strengths:

      The combination of UCP1-lineage tracing with the single nuclei analysis allowed the authors to identify four populations of adipocytes with differing thermogenic potential, including a "whitened" adipocyte (mPRAT-ad2) that retains the capacity to rapidly revert to a brown phenotype upon cold exposure. They also identify two populations of white adipocytes that do not undergo browning with acute cold exposure.

      Anatomically distinct adipose depots display interesting functional differences, and this work contributes to our understanding of one of the few brown depots present in humans.

      Weaknesses:

      The most interesting aspect of this work is the identification of a highly plastic mature adipocyte population with the capacity to switch between a white and brown phenotype. The authors attempt to identify the transcriptional signature of this ad2 subpopulation, however, the limited sequencing depth of single nuclei somewhat lessens the impact of these findings. Furthermore, the lack of any form of mechanistic investigation into the regulation of mPRAT whitening limits the utility of this manuscript. However, the combination of well-executed lineage tracing with comprehensive cross-depot single-nuclei presented in this manuscript could still serve as a useful reference for the field.

      The sequencing depth of our data is comparable, if not better than previously published snRNA-seq studies on adipose tissue (Burl et al., 2022; Sarvari et al., 2021; Sun et al., 2020). Therefore, the depth of our data has reached the limit of the 3’ sequencing methods. Unfortunately, due to size limitation of the adipocytes, it is also not feasible to sort them for Smart-seq.

      Burl, R.B., Rondini, E.A., Wei, H., Pique-Regi, R., and Granneman, J.G. (2022). Deconstructing cold-induced brown adipocyte neogenesis in mice. Elife 11. 10.7554/eLife.80167.

      Sarvari, A.K., Van Hauwaert, E.L., Markussen, L.K., Gammelmark, E., Marcher, A.B., Ebbesen, M.F., Nielsen, R., Brewer, J.R., Madsen, J.G.S., and Mandrup, S. (2021). Plasticity of Epididymal Adipose Tissue in Response to Diet-Induced Obesity at Single-Nucleus Resolution. Cell Metab 33, 437-453 e435. 10.1016/j.cmet.2020.12.004.

      Sun, W., Dong, H., Balaz, M., Slyper, M., Drokhlyansky, E., Colleluori, G., Giordano, A., Kovanicova, Z., Stefanicka, P., Balazova, L., et al. (2020). snRNA-seq reveals a subpopulation of adipocytes that regulates thermogenesis. Nature 587, 98-102. 10.1038/s41586-020-2856-x.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In the paper entitled GOT1 primes the cellular response to hypoxia by supporting glycolysis and HIF1α stabilization, Grimm and co-authors investigate the metabolic adaptations of cancer cells upon acute hypoxia. By measuring metabolite levels at early time points upon hypoxia, they observe the accumulation of lactate and depletion of aspartate, along with other TCA cycle metabolites. Importantly, they demonstrate that these metabolic changes are independent of the HIF alpha-dependent transcriptional response. The authors investigate the role of aspartate during these initial phases of hypoxia. To this aim, they characterize cells devoid of glutamate oxaloacetate transaminase (GOT1), in which aspartate accumulates and can no longer be used for replenishing NAD+ via the downstream conversion of oxaloacetate to malate, via malate dehydrogenase. These cells have lower cytosolic NAD+ which affects glycolytic flux through the rate-limiting, NAD+-dependent enzyme GAPDH. GOT1 KO cells have a decrease in glucose consumption, lactate secretion and metabolite levels downstream of GAPDH upon early hypoxia, however ATP levels and viability are only affected with additional lactate dehydrogenase (LDH) impairment. Finally, the authors demonstrate that GOT1 KO cells have higher alpha-ketoglutarate (aKG) levels during early hypoxia, which could contribute to higher prolyl-hydroxylation and subsequent degradation of HIF, regulating the transcriptional response mediated by transcription factor.

      * Major comments *

      1. The authors claim that they were unable to supplement cells with aspartate (Figure S3), (even though an increase of aspartate is instead observed in cells treated with sodium aspartate) and had to resort to the GOT1 knock-out model to "prevent aspartate from decreasing in hypoxia". This approach implicitly assumes that Got1 is the main driver of aspartate depletion upon hypoxia. However, although steady-state levels of aspartate are indeed higher in these cells, there is still a strong decrease upon hypoxia, which the authors acknowledge but merely ascribe to "attenuated production from glutamine". This seems an insufficient explanation, considering the very fast depletion upon hypoxia originally observed. The authors should provide further information regarding why aspartate is depleted in these conditions and consider other aspartate-consuming enzymes such as GOT2, ASNS, or even nucleotide biosynthesis and urea cycle enzymes. These observations could be made using the labeling experiments already acquired. In addition, to corroborate their hypothesis, the authors could supplement 13 C-aspartate at a supraphysiological concentration (i.e. 5-10 mM) to determine to what extent it is consumed by GOT1 or other pathways. > We thank the reviewer for this comment that helped us to recognise, in retrospect, that by focusing on GOT1ko as a means to rescue aspartate levels detracted from our main finding and extensive mechanistic insights into the role of GOT1 in sustaining the increase in glycolysis in early hypoxia. As we detail in our response to the Reviewer’s point 2, we have now re-written our results section to better clarify why we focused on GOT1 (lines 175-223 of the revised manuscript – please note that line numbering corresponds to the word document with the track changes off). However, we also agree that, because the motivation that led us to GOT1 was the counter-correlation between aspartate and lactate, expanding on the pathways that determine aspartate levels in hypoxia would be useful to the reader.

      2. To address the reviewer’s point, in revised Fig. S3E, we present new data where we incubated cells in normoxia or hypoxia for 3h in the presence of 1.5 mM 13C-aspartate. We opted for an intermediate aspartate concentration which was enough to observe intracellular labelling while minimising significant perturbation to cells. We found that the amount of labelled aspartate that accumulates intracellularly is not significantly different between normoxia and hypoxia. At the same time, we observe a vast depletion of unlabelled aspartate. We accept that aspartate labelling may not have reached isotopic steady state within the 3h time point we are confined to for our experiments. However, if increased consumption contributed significantly to aspartate depletion within this timeframe, the amount of labelled aspartate that accumulated would be lower in hypoxia compared to normoxia. Therefore, the data in Fig. S3E indicate that, at least within the timeframe of our experiments, the magnitude of aspartate consumption is not likely to increase to such an extent that could significantly contribute to the depletion in aspartate.

      We had, indeed considered other aspartate-consuming pathways, however, in light of the above results and our subsequent finding that GOT1 is needed for increased glycolysis, we did not pursue these investigations any further and focused on the role of GOT1 instead.

      • In revised Figure S3, and also in response to one of the Reviewer’s other comments below, we have now replotted the data from the experiment in the original manuscript to show both absolute and fractional isotopologue abundances of TCA intermediates from cells labelled with 13C-glucose or 13C-glutamine. Based on these re-plotted data, we find that the amounts of labelled intermediates from both labels decreases; the apparent decrease from glutamine appears greater than that from glucose, likely because glutamine labels more rapidly a greater fraction of TCA intermediates. Moreover, glutamate fractional labelling from glutamine decreases, but modestly increases from glucose over time in hypoxia compared to normoxia. These data raise the possibility that TCA intermediates are diverted to glutamate synthesis. However, as we point out in the revised text, the fact that only glutamine has reached an isotopic steady state by the end of the time course precludes us from making a more accurate quantitative statement and therefore we have refrained from further elaborating on these observations.

      Taking the above observations together, in the revised text we do not dismiss increased consumption as a factor in decreased aspartate levels and rather state that “within the timeframe tested, decreased production is a significant contributor to the low aspartate levels in early hypoxia.” (lines 187-188).

      In line with the previous comment, the conclusion that "GOT1 activity, rather than a decrease in aspartate concentration itself, is required to sustain the increase in glycolysis in early hypoxia." seems questionable, especially considering the failed aspartate supplementation. The authors suspect low expression of plasma membrane aspartate transporters as the reason and quote Garcia-Bermudez et al.2018 (PMID: 29941933). This paper contains ranked SLC1A2 mRNA expression data from the Cancer Cell Line Encyclopedia (CCLE). The authors may apply aspartate supplementation and "early hypoxia" to a cancer cell line expressing SLC1A2 or other aspartate transporters. Alternatively, they could try introducing the transporter by overexpression.

      > We concede that the way we phrased this statement was not ideal and has rightly led to the reviewer’s criticism. In particular, referring to a “decrease in aspartate concentration”, could mislead the reader into thinking that we were referring to the process of aspartate consumption, rather than the low aspartate levels themselves, which is what we aimed to explore. In the revised text, we now carefully make this distinction; we show new data (Figure S3G) supporting the idea that low aspartate levels are not necessary for increased lactate; we explain that, given the known role of the malate-aspartate shuttle in coordinating redox balance and potentially affecting glycolytic flux, the fact that aspartate didn’t appear to be limiting was surprising and we therefore asked whether GOT1, which depends on aspartate, had a role in the increased glycolysis in early hypoxia. Given that GOT1ko attenuated the increase in glycolysis we subsequently focused on the mechanism underlying this observation. In more detail:

      As Reviewer 2 noted in point 1 of their review, the increase in lactate became more apparent after 2 h, when aspartate levels had almost reached their minimum. This successive timing of abundance changes raised the possibility that low aspartate levels precede, and possibly drive, the increased lactate. Therefore, we sought to test whether this was the case by preventing depletion of aspartate in hypoxia with exogenous aspartate. We agree that, to address the comment of Reviewer 1 here, overexpression of an aspartate transporter would have been a good way to overcome poor aspartate uptake by MCF7 cells, however, at the time we initiated this study, SLC1A2 was not known as an aspartate transporter. We, therefore, cultured MCF7 cells for several weeks in media containing 0.5 mM aspartate (which is normally absent in our standard media formulation) because we expected that cells would adapt to take up more aspartate. We, thereby, obtained a derivative cell line that we called MCF7Asp. In new Figure S3G, we show that addition of 0.5 mM aspartate in the media of MCF7Asp cells largely prevented the decrease in intracellular aspartate seen in parental MCF7 cells after 3h in 1% O2; however, the increase in lactate was similar between MCF7 and MCF7Asp cells. These data are consistent with the idea that the low aspartate levels in hypoxia are not the likely cause for the increase in lactate.

      As the Reviewer notes in point 3 below, production of malate m+1 from 2H-glucose does not decrease below the levels found in normoxia (Fig. 4H), even though aspartate levels are depleted (Fig. 1C). Together with the fact that maintaining aspartate levels to near-normoxic levels does not further boost lactate levels (Figure S3G), these findings speak against the notion that the lack of increased GOT1-MDH1 flux is due to insufficient aspartate and are aligned with the idea that the malate-aspartate shuttle is saturated (PMID: 35973426, 21982705).

      • The observation that labelled m+1 malate produced from [4-2H]-glucose is similar in normoxia and hypoxia (Figure 4G), does not support the notion that GOT1-MDH axis is increased at low oxygen and seems to suggest that the depletion of aspartate observed in early hypoxia is unrelated to this axis. The authors should resolve this discrepancy.*

      > In our manuscript, we do not claim that the flux through the GOT1-MDH1 axis is increased but, instead, we emphasise the fact that, as the reviewer observed, malate labelling from 2H-glucose is unchanged (e.g. see text in our original manuscript - lines 519-522 of the revised version: “Importantly, a model where increased upper glycolysis due to the Pasteur effect overwhelms GAPDH capacity also elucidates the apparent increase in the reliance of glycolysis on GOT1-MDH1 in hypoxia, even though flux through this pathway is not elevated.”). As we also detail in our responses to comments 1 and 2, above, in the revised manuscript, we have re-written the discussion to better explain that the reliance on GOT1 in hypoxia is not driven by increased flux through this pathway (which is likely saturated as outline in our response to point 2, above), but rather from the increased demand imposed by the elevation in incoming glucose carbons due to the Pasteur effect (lines 504-531). This is akin to a situation where increased demand for a product drives its price up if the manufacturer does not boost production to increase supply. We hope that the reviewed discussion makes this clearer and addresses the reviewer’s comment.

      • The alpha-KG level regulation by Got1 and the subsequent HIF1alpha "priming" seem quite promising and likely the most novel part of the manuscript. However, further proof should be added to support this strong claim. First, aKG to succinate ratio, rather than aKG alone, is a better indicator of aKG-dependent dioxygenases activity. So. the authors should provide this measurement. *

      In line with the reviewer’s excellent suggestion, in the revised manuscript, we added new panel in Figure 6F (discussed in lines 457-458) that shows αKG levels alongside the corresponding αKG/succinate ratios. These data agree with our original interpretation that cofactor levels in GOT1ko cells favour increased dioxygenase activity.

      *Second, the authors should rule out the possibility that the differential hydroxylation of HIF is due to the redistribution of intracellular oxygen due to alterations in mitochondrial function. To do this, they could determine whether cytosolic oxygen levels differ in the two conditions. *

      The reviewer raises the interesting hypothesis that, given the decreased respiration in hypoxic GOT1ko cells, one could expect increased availability of oxygen that could contribute to the destabilisation of HIF1α. To the best of our knowledge, measuring absolute cytosolic O2 concentration, particularly in hypoxia, would require specialised equipment [e.g. phosphorescence lifetime imaging (PMID: 26065366), or phosphorescence quenching oxymetry (PMID: 21912692); unfortunately, we do not have access to such equipment. In the revised manuscript, we acknowledge the reviewer’s point with added new text in the discussion (lines 576-577).

      Finally, the authors could test whether α-ketoglutarate or 2-hydroxyglutarate supplementation affects HIF stability in their experimental conditions.

      > We thank the reviewer for this suggestion. In the revised manuscript (new Figure S6H and lines 453-455) we show that addition of DM-αKG, a cell-permeable form of αKG, to the media of MCF7 cells incubated at 1% O2, decreases HIF1α protein levels in a dose-dependent manner and, at the highest dose, to a degree comparable to that of GOT1ko cells.

      Minor comments:

      - The glycerol-3-phosphate shuttle is another means of re-oxidizing NADH and α-GP is indeed higher in GOT1 KO. According to this, in Fig 5C a clear increase in a-GP is observed in LDH KO cells. Would the phenotype be stronger upon additional GPD1 knockout or inhibition?

      > The main phenotype of combined LDHA/GOT1 inhibition is a deficit in ATP and decreased cell survival. While increased flux through GPD1 could, indeed, provide more NAD+, this would come at the expense of glucose carbons that would otherwise need to flow into lower glycolysis to produce ATP. Consistent with this idea, our data show that, even if GPD1 or other dehydrogenases reoxidise NADH, as would be the case in both the LDHAko and GOT1ko cells where α-GP is elevated, they are not sufficient to compensate for the decrease in LDH and GOT1 activity. Therefore, we did not pursue this hypothesis further.

      * - Aspartate and lactate levels appear unchanged in MDA-MB231 upon hypoxia. Can these changes be ascribed to a pseudohypoxic state? The authors should comment on this observation.*

      > In Figure S2A, we show that MDA-MB-231 cells have increased basal levels of HIF1α compared to the almost undetectable HIF1α seen in BT474 (same figure, adjacent panel) or MCF7 cells (Figure 2A). We, therefore, agree with the reviewer’s hypothesis that the attenuated changes in aspartate or lactate levels in MDA-MB-231 cells are likely due to a pseudohypoxic state. As this is speculative, we have refrained from elaborating on this point further in the manuscript.

      * - Figure S3B: The authors do not provide information on the length of hypoxia for these experiments. *> The data shown in original Figure S3B (new Fig. S3A-B) are a time course. Cells were incubated at 21% or 1% O2 with the respective isotope label for increasing lengths of time, with the longest time point shown (6h) being the longest time we incubated cells in hypoxia. If the reviewer meant another panel, the length of hypoxia would be 3h unless otherwise stated.

      - Glucose and glutamine isotopic labelling should be accompanied by graphs showing the total pool levels of these metabolites, and also the uptake of glucose and glutamine (and their specific isotopologue distribution). It would be important to show the isotopologue distribution of aKG in all the conditions tested, in particular, because of its proposed regulation by Got1.

      > In the revised manuscript, new Fig. S3 panels A-D, we now show absolute and fractional isotopologue distributions for TCA intermediates for both glucose and glutamine labelling. We have omitted showing αKG in this figure as we could not reliably quantify it in the glutamine-labelling experiment. Also, unfortunately, quantification of glutamine in our GC-MS datasets is not reliable due to conversion to 5-oxoproline.

      - Malate generated by MDH1 can be converted by ME1 into Pyruvate, which could be further processed by LDH. Have the authors measured this conversion in their dataset.

      > In the figure below we labelled cells with [U-13C]-glutamine for 3 h at 21% or 1% O2 and plotted the fractional labelling for all observable isotopologues in malate, pyruvate and lactate. These data show that there is minimal labelling in pyruvate and lactate (- Aspartate absolute levels across cell lines appear different. Is this due to differences in cell volume? Can the authors comment on this observation?

      > To address the reviewer’s hypothesis, we focused on MCF7 and MDA-MB-231, the two cell lines with the highest and lowest aspartate levels, respectively. The volume of MCF7 is approx. 19% higher than that of MDA-MB-231 (calculated based on cell size data from PMID: 31015463). Based on this calculation, and bearing in mind that cell volume is a good predictor of biomass content (PMID: 18595067), cell volume differences may contribute to, but cannot fully account for the one order of magnitude difference in aspartate abundance we see between these cell lines (Figures 1C and S1A).

      The cell lines we used in this manuscript (MCF7, BT474, MDA-MB-231, MCF10A) represent different breast cancer (or untransformed, in the case of MCF10A) cell types, with different oncogenic mutation content (PMID: 17157791, 22460905) and proliferation rates (PMID: 22628656); all these factors can be related to steady-state cellular metabolite levels (PMID: 31015463). In the figure below, we have plotted aspartate abundance data (from PMID: 31068703) in 928 cell lines of various origins. These data show that aspartate levels can differ as much as 2 orders of magnitude between cancer cell lines and about half an order of magnitude between MCF7 and MDA-MB-231 or BT474 (MCF10A was not present in this dataset); they also show that aspartate levels in the three cell lines rank in the same order as in our manuscript (MCF7>BT474>MDA-MB-231), although, it is unclear if cells in this dataset were also cultured in dialysed serum as in ours, so we cannot confidently compare the absolute aspartate measurements between our studies.

      In conclusion, we suspect that cell volume differences together with other factors, such as proliferation rates and metabolic network differences may account for the differences in intracellular aspartate levels.

      - Under hypoxia the contribution of glutamine (labelled fraction, Fig. S3) to TCA cycle intermediates decreases. However, this is not paralleled by an increase in the contribution of glucose, as also supported by an increase in the m+0 in the glutamine labeling but not in the glucose one. How do the authors explain this apparent inconsistency? Are there sources of unlabelled TCA cycle during the hypoxic experiment?

      > While glucose and glutamine are the major carbon sources in many cultured cancer cell lines, incl. MCF7 as indicated by the data in Figure S3A-D, other nutrients (such as amino acids, other than glutamine, and fatty acids) can also provide carbons at various points of the TCA cycle. The fact that fractional labelling of glutamate from glutamine is decreased in hypoxia would suggest that the source of decreased contribution of glutamine into the TCA is unlabelled glutamate. We can exclude uptake of exogenous glutamate, because all our metabolic measurements are performed with cells incubated in media without glutamate and supplemented with dialysed serum. However, we observe a modest increase in the fractional labelling from glucose into glutamate (Figure S3A). As glucose labelling into the TCA cycle is not at steady-state even after 5h, it is hard to assess whether, increased labelling from glucose suffices to explain the dilution of glutamine-derived labelling into glutamate a quantitative conclusion but it points to efflux of intermediates out of the TCA cycle (discussed in lines 181-183 of the revised manuscript).

      We thank the reviewer for their time and thoughtful comments that helped us improve the presentation of our work.

      **Referees cross-commenting**

      Referee 2 raises important questions that are in part aligned with referee 1 and are reasonable and doable is the time frame proposed. These are all important questions and comments to consolidate the central hypothesis of the work and I believe are required for publication.

      *

      Reviewer #1 (Significance (Required)):*

      Overall, this is an exciting and well-executed piece of work focusing on the early biochemical consequences of hypoxia that the wide metabolism/biochemistry audience will appreciate. While most of these observations are not entirely unexpected, the work brings a sufficiently novel perspective and insights to the field and deserves publication. However, some conclusions are not fully supported by the data and some additional experiments are suggested to bring clarification and strengthen the authors' conclusions.

      We are a lab expert in cancer metabolism.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary ** This manuscript represents an interesting and novel description of the role of a cytosolic transaminase, glutamic-oxaloacetate transaminase 1 (GOT1) on both cytosolic redox (and therefore glycolysis through its functional linkage with malate dehydrogenase 1) and the availability of alpha-ketoglutarate for stabilisation of HIF1a in hypoxia. Some of the most interesting data are the evidence for increased cytosolic NAD+ regeneration through the combined action of LDHA (known) and GPD1 (less well-described increase in activity in hypoxia). The manuscript as a whole describes the multiple systems required for the early response to hypoxia, but the focus of the title and way the article is written do not entirely reflect this. For example, the title focuses on GOT1 as the enzymes whose activity is responsible for the early response to hypoxia. However, this is not reflected in some of the data - the deuteron labelling in particular - which shows that LDH and GPD1 are responsible for the biggest redox activity (i.e. support of glycolysis). A degree of reframing of the article may therefore be of benefit.

      We thank the reviewer for their constructive suggestions. In the revised manuscript, we have re-written the title and the relevant parts of the results section, and we have significantly re-structured the discussion section to reflect the fact that multiple enzyme systems, one of which is GOT1, converge to support the glycolytic increase and cell survival in early hypoxia. Furthermore, in our point-by-point responses, below, we highlight in detail how we have streamlined the way we present our results.

      *Major comments. *

      In Figure 1 C and D, the data suggest significant changes in the decrease in cellular aspartate between 1-2 hours, which then slow. This is followed by a change in lactate concentrations from 2 hours onwards, which is observed in the cells (D) and media (F). The rapid decrease in aspartate concentration suggests a relatively large change, which does not correspond to the later lack of alteration in deuteron labelling from d4-glucose (Figure 4H-J) in m+1 malate. This therefore suggests that the biggest determinant of decreased aspartate is not coupled to MDH1 activity directly. If the manuscript is focused on the relevance of GOT1 activity to the early hypoxic response, this should be better resolved. Given that this could undermine the strength of the case being made for GOT1 activity playing a significant role (through MDH1), could the authors perform the same experiments but in the GOT1KO cells to show how NADH is handled under these conditions by LDHA and GPD1? If the focus of the manuscript is shifted, these experiments would likely not be necessary.

      > We thank the reviewer for these comments, which, together with those by Reviewer 1, highlighted that the way we presented our results warranted improvement. First, we would like to clarify that by referring to a “decrease in aspartate concentration”, we may have misled the reader into thinking that we were referring to the process of aspartate consumption; rather we wanted to explore whether the low aspartate level itself could be causing the increase in lactate. This is because, as the Reviewer points out, the rate of lactate accumulation picked up after aspartate had almost reached its minimum. Furthermore, by not elaborating on the cause of decreased aspartate and by focusing on GOT1ko as a means to rescue aspartate levels implied a hypothesis whereby GOT1 was the main aspartate consumer, thereby detracting from our main finding and extensive mechanistic insights into the role of GOT1 in sustaining the increase in glycolysis in early hypoxia (regardless the contribution of GOT1 activity in the observed depletion of aspartate).

      In the revised text, we have re-written parts of the results section to better clarify these points (e.g. lines 175-223 - please note that line numbering corresponds to the word document with the track changes off). In summary, and as detailed below, we explore the glucose and glutamine data further and present new data with 13C-Asp, which, together support the idea that decreased aspartate in early hypoxia is largely attributable to decreased synthesis and, to a lesser extent, if at all, to increased degradation. We then explain that, given the known role of the malate-aspartate shuttle in coordinating redox balance and potentially affecting glycolytic flux, we asked whether GOT1, which depends on aspartate, still had a role in the increased glycolysis vis-à-vis the low aspartate levels in early hypoxia. Given that GOT1ko did attenuate the increase in glycolysis we subsequently focused on the mechanism underlying this observation. We have re-structured the discussion, to highlight that GOT1 is one the multiple systems required for survival in early hypoxia. We also explain that the reliance on GOT1 in hypoxia is not driven by increased flux through the GOT1-MDH1 axis (which is likely saturated), but rather from the increased demand imposed by the elevation in incoming glucose carbons due to the Pasteur effect (lines 504-531). A relatable situation is when increased demand for a product drives its price up if the manufacturer does not boost production to increase supply. We hope that the revised text better clarifies these points.

      Below, we detail the new experimental evidence/analyses we referred to above:

      • In revised Figure S3A-D, we have now replotted the data from the experiments in the original manuscript to show both absolute and fractional isotopologue abundances of TCA intermediates from cells labelled with 13C-glucose or 13C-glutamine. Based on these re-plotted data, we find that the amounts of labelled intermediates from both labels decreases; the apparent decrease from glutamine appears greater than that from glucose, likely because glutamine labels more rapidly a greater fraction of TCA intermediates. Moreover, glutamate fractional labelling from glutamine decreases, but modestly increases from glucose over time in hypoxia compared to normoxia. These data raise the possibility that TCA intermediates are diverted to glutamate synthesis. However, as we point out in the revised text, the fact that only glutamine has reached an isotopic steady state by 5h precludes us from making a more accurate quantitative statement and therefore we have refrained from further elaborating on these observations.

      • In revised Fig. S3E, we present new data where we incubated cells in normoxia or hypoxia for 3h in the presence of 1.5 mM 13C-aspartate. We found that the amount of labelled aspartate that accumulates intracellularly is not significantly different between normoxia and hypoxia. At the same time, we observe a vast depletion of unlabelled aspartate. We accept that aspartate labelling may not have reached isotopic steady state within the 3h time point we are confined to for our experiments. However, if increased consumption contributed significantly to aspartate depletion within this timeframe, the amount of labelled aspartate that accumulated would be lower in hypoxia compared to normoxia. Therefore, the data in Fig. S3E indicate that, at least within the timeframe of our experiments, the magnitude of aspartate consumption is not likely to increase to such an extent that could significantly contribute to the depletion in aspartate.

      Together with the data in Fig. S3A-D, these findings suggest that decreased aspartate in early hypoxia is to a great degree driven by decreased production.

      • The authors present data in Figure 1 and 3 using 2DG as a surrogate for glucose uptake. 2DG has been previously shown not to always be a surrogate for glucose uptake (Sinclair et al. Immunometabolism 2020). Given that this paper highlighted warns in particular about assuming SLC2A1 and SLC2A3 activities based on 2DG uptake, and that these two transporters are the major glucose transporters regulated by hypoxia, a cautious approach to these data is recommended. Assuming that 2DG uptake is a surrogate for glucose in this system (panel C), the effect of GOT1 appears to be at the level of glucose uptake even at 3 hours - it has been marked as being significant by the authors. This suggests that loss of GOT1 has an effect on glucose uptake prior to any transcriptional response is observed. Is the plasma membrane occupancy by the SLC2A1 or SLC2A3 been reduced after GOT1 KO? The same is true for Figure 1 - as intracellular aspartate and lactate and extracellular lactate is shown, could change in extracellular glucose not be presented as a direct measure?*

      The reviewer raises two points: (a) that using 2DG may not faithfully report transporter-mediated glucose uptake and (b) that, if our observations with 2DG are valid, they could point to the possibility that attenuation of glycolysis in GOT1ko cells may be attributable to effects in glucose uptake. In brief, we cannot use glucose measurements in media as an indicator of glucose uptake rates because we do not observe measurable glucose depletion from media within the relevant timeframe (3h) of our experiments.

      (a) Given that we did not have access to a set up for using radionuclides, we explored both 2DG-based and glucose depletion from media as potential means to assess glucose uptake. We found that, over 24h, MCF7 cells deplete glucose faster than cells incubated in normoxia for the same amount of time (figure below, A). The magnitude of this increase is similar to that we report using 2-DG (~3-fold, Fig. 1E and 3C). However, we observed only minimal depletion of glucose in the first 3-5 h of culturing cells with fresh media (figure below, B). This is perhaps not surprising given that studies that look at metabolite exchange rates (incl. glucose) typically sample over a period of one to several days rather than hours (e.g. PMID: 31015463, 22628656). In conclusion, we reasoned that detecting a positive change in signal (intracellular 2DG) would provide a more sensitive means than a decrease in extracellular glucose to enable assessment of glucose use within the early time-points that our manuscript is mainly concerned with.

      (b) Indeed, we were initially intrigued by the decrease in glucose uptake by GOT1ko cells as it could explain decreased lactate production. However, the upregulation of upstream glycolytic intermediates in GOT1ko cells in both normoxia and hypoxia (Figure 4A) together with the evidence of increased α-GP production from glucose (Figure 4K-L) suggested that, even if less glucose is taken up by GOT1ko cells, there is still a bottleneck at the GAPDH step that prevents maximal flow of glycolytic intermediates to lower glycolysis. We therefore did not pursue further the cause of decreased glucose uptake by GOT1ko cells at this stage.

      • The data shown in Figure 2D suggests that there is little change in overall contribution to citrate from glucose in hypoxia compared to normoxia, and that HIF1 is does not play a role in the hypoxic response at this point. However, the data presented are overall fractional labelling, and therefore do not focus on the main hypoxia-dependent point of control highlighted before this by the authors - pyruvate oxidation through PDH. Could the authors consider plotting m+2 isotopomer of citrate either alongside or instead of the total fractional label (which includes hypoxia-independent PC activity and cycling carbons). *

      We agree with the reviewer’s suggestion. In the revised manuscript, we added a new panel in Fig. 2D that shows the m+2 citrate isotopologue alongside the original fractional labelling data. This new panel is shown as a bar graph to enable the presentation of individual datapoints and statistical test results.

      Additionally, the experimental set-up means that average incorporation over the time shown is represented - i.e. the 3h timepoint is incorporation over the first two hours, while the 24 hour timepoint is averaged over the whole period. It is therefore likely under-representing the decrease in glucose contribution to citrate at 24 hours - the authors could point this out, or OPTIONALLY perform a more time-resolved experiment where flux over shorter periods is assessed for each of the timepoints (i.e. 0-1, 2-3, 5-6, 23-24).

      Indeed, we did consider a more time-resolved labelling experiment as the reviewer suggests, however, we decided against this approach as we were concerned that even if we pre-equilibrated the labelling media in hypoxia, it would be challenging to avoid perturbations associated with handling of the cells during addition of the isotopically labelled compound. The new panel in Fig. 2D that shows absolute citrate m+2 abundances should address this point, however, in the revised text (lines 162-164) we added new text that points out this issue.

      • Figure 3 data are key for the GOT1 theme of the manuscript, as the authors show that loss of GOT1 increases cellular aspartate in both normoxia and hypoxia - suggesting that GOT1 is an aspartate-consuming enzyme in both conditions. Indeed the magnitude of the change in aspartate after GOT1 knockdown appears similar in both conditions (Panel B). These are interesting data, as they contrast with a recently published study (Altea-Manzano et al. Molecular Cell 2022) suggesting that in respiration-deficient cells (a condition with parallels with hypoxia), GOT1 activity may be aspartate producing to supply aspartate to the mitochondria for GOT2. It would be important for the authors to discuss the differences between studies.*

      Following the reviewer’s suggestion, in the revised manuscript (lines 547-556), we have now expanded our previous discussion on the functions of GOT1 in cells with respiration defects.

      • Panel E shows data at 5 hours, while the rest of the panels here are a mix of 1 and 3h timepoints. Equally panel E also presents concentration, while D presents relative abundance of lactate - could a consistent approach to presenting the results be taken?*

      We agree. Taking into consideration that the data in this panel show one time point of the full time-course in Figure S3F, and to streamline the presentation of these data, in the revised manuscript, we have moved the time-course graph to the main figure.

      • In Figure S3, the authors show the lack of direct aspartate uptake, or supplementation through the use of an esterified form. OPTIONAL: they could consider using the expression of SLC1A3 (Tajan et al. Cell Metabolism 2018; Hart et al eLife 2023) to increase aspartate uptake in order to test their hypothesis. *

      We agree that, to address this point, overexpression of an aspartate transporter would have been a good way to overcome poor aspartate uptake by MCF7 cells, however, at the time we initiated this study, SLC1A2 was not known as an aspartate transporter. We, therefore, cultured MCF7 cells for several weeks in media containing 0.5 mM aspartate (which is normally absent in our standard media formulation) because we expected that cells would adapt to take up more aspartate. We, thereby, obtained a derivative cell line that we called MCF7Asp. In new Figure S3G, we show that addition of 0.5 mM aspartate in the media of MCF7Asp cells largely prevented the decrease in intracellular aspartate seen in parental MCF7 cells after 3h in 1% O2. However, the increase in lactate was similar between MCF7 and MCF7Asp cells. These data are consistent with the idea that the low aspartate levels in hypoxia are not the likely cause for the increase in lactate.

      *Figure S3B-E - the authors suggest based on these data that aspartate decrease in hypoxia is through decreased glutamine contribution. Indeed they could also interrogate the data further, as the defect is observed in glutamate, perhaps suggesting that glutamine metabolism through glutaminase is altered. *

      To address the Reviewer’s point, in revised Figure S3, we have now replotted the data from the experiment in the original manuscript to show both absolute and fractional isotopologue abundances of TCA intermediates from cells labelled with 13C-glucose or 13C-glutamine. We have elaborated on these results in our response to point 1, and we re-iterate our conclusions here for the Reviewer’s convenience: Based on these re-plotted data, we find that the amounts of labelled intermediates from both labels decreases; the apparent decrease from glutamine appears greater than that from glucose, likely because glutamine labels more rapidly a greater fraction of TCA intermediates. Moreover, glutamate fractional labelling from glutamine decreases but modestly increases from glucose over time in hypoxia compared to normoxia. These data raise the possibility that TCA intermediates are diverted to glutamate synthesis. However, as we point out in the revised text, the fact that only glutamine has reached an isotopic steady state by 5h precludes us from making a more accurate quantitative statement and therefore we have refrained from further elaborating on these observations.

      *Figure S3D and E - the authors show data from 3 hours of labelling, which is not at steady-state (observable from the timecourse also shown in B and C). To be able to compare the glucose and glutamine labelling, a timepoint in which (pseudo)steady-state is achieved would be better chose. *

      In the revised manuscript, this concern is now addressed by showing both absolute and relative isotopologue abundances for all available time points. We agree that quantitative comparison of labelling must be done at steady-state conditions, however, as we also point out in the revised text (lines 180-181), only glutamine reaches isotopic steady state by 5h whereas glucose hasn’t.

      Additionally, within the aspartate isotopomers arising from glutamine, there is an odd m+1 for aspartate not observed in the other proximal metabolites. Is this a technical defect or is there a biological reason for the significant fractional amount in normoxia?

      We thank the reviewer for pointing this irregularity, which we should have clearly identified as such during proofreading of the manuscript. Probed by the reviewer’s comment, we reviewed the corresponding data tables used to plot these data and found that M+1 had exactly the same values as M+0. We then inspected the original data and confirmed that this resulted from an error during the copying of the data from the R-script output data table to GraphPad Prism for plotting (the line containing the replicates for the m+0 isotopologue was pasted again in the line of the M+1 isotopologues). This issue is now obsolete, as, in the revised manuscript Fig S3 new panels A-D, we have replaced the fractional data with detailed absolute and fractional labelling showing all isotopologues. We apologise for this error.

      • Figure S6F - all samples from GOT1 KO cells have less actin - could an appropriately loaded western blot be presented?*

      In the revised manuscript, we added a new panel with the Ponceau (27/02/2018) staining of the same membrane used for immunoblotting. This staining shows equal loading between all lanes. It is unclear why despite equal loading, the actin signal differs between the two lines.

      • In Figure S5B, the authors present ATP data in wild-type control cells, and LDHA-KO with LDHA re-expression. These should be phenotypically similar, but clearly are not. It suggests that there is something not correct with the system being used.*

      As shown in the western blot of this figure, expression of exogenous LDH only reaches a fraction of endogenous levels, which likely explains the partial, albeit significant, rescue of the ATP depletion observed in the LDHAko cells. We have not been able to achieve higher LDH expression in our cell preparations that would enable us to address this point further.

      *

      *

      Minor comments

        • PHDs need iron, alpha-ketoglutarate, oxygen and critically ascorbate (Introduction page 2)*

          We thank the reviewer for highlighting this critical omission. In the revised manuscript, we have now added this information (line 58).

      * PDK1 phosphorylation of PDH leads to a reduction in pyruvate oxidation, rather than entry of glucose carbons to the TCA cycle (Introduction page 3)*

      We agree with the reviewer that our wording was not accurate, and, in the revised text, we have re-written this part (lines 72-74): “…[PDK1] catalyses the inhibitory phosphorylation of pyruvate dehydrogenase (PDH), leading to attenuated pyruvate oxidation and, consequently, decreased contribution of glucose-derived carbons into the tricarboxylic acid (TCA) cycle.

      * SLC25A51 has been identified as being required for NAD transport into the mitochondria (Kori et al. Science Advances 2020), so it is incorrect to say that the inner mitochondrial membrane is impermeable to this metabolite (page 7)*

      We agree that, in light of the Kori et al. study, the phrasing in our text presented an outdated view of pyridine nucleotide compartmentalisation. The data in Kory et al. support SLC25A51 as a mitochondrial NAD+ transporter, however, it is not clear if NADH is also a substrate. Furthermore, as the authors also point out, SLC25A51 has a relatively low affinity for NAD+ and therefore unlikely to interfere with the functions of the malate-aspartate shuttle. Taking all this into consideration, in the revised text (line 249), we acknowledge the existence of a low-affinity mitochondrial NAD+ transporter and retained the statement about impermeability specifically for NADH.

      * Figure S6D - authors shows a highly significant increase in the mRNA for EGLN3, which is a HIF1 target gene, as well as encoding PHD3, which acts to hydroxylate HIF1a alongside PHD2. This should be commented on in the text.*

      In the revised discussion (lines 577-578), we acknowledge that increased PHD3 (together with increased oxygen availability, related to Reviewer 1’s comment), may additionally contribute to HIF1α destabilisation. Please note that we have also added new data (Figure S6H) in response to Reviewer 1, where we show that exogenous αKG causes HIF1α destabilisation in hypoxia, further supporting the notion that boosting intracellular αKG, alone, can destabilise HIF1α.

      * Figure S5G - could it be made clear on the graph whether this is at 21% or 1% O2?*

      We thank the reviewer for pointing out this omission. We now state clearly both in the revised corresponding legend (line 937) and revised figure that these data are at 1% O2.

      • Figure 5I shows ATP level against % labelling of alpha-GP. It isn't clear whether this is abundance or fractional label, but if the latter this it potentially misleading, as if the concentration of alpha-GP increases as fractional label decreases, there is effectively no change. Could the authors extract the steady-state data from the analysis and use this to calculate amount of m+3 label instead of fraction? Similarly for Figure S1H showing fractional labelling of lactate from glucose. It is likely that the title of this graph is a typo, and that m+3 instead was meant. Additionally, measurement of fractional labelling does not demonstrate increased concentrations of the metabolite, but the glucose carbons making up this isotopomer in the pool.*

      For Figure 5I, we confirm that what we show is based on abundance of α-GP m+3 labelling from glucose and, in the revised manuscript (line 895), we amended the legend to clarify this important point.

      We concede that the way we had originally written this sentence, suggested that we derived our conclusion that increased lactate in media was due to increased glycolysis based solely on the fractional data in Fig. S1H. In the revised manuscript, we have re-phrased the relevant sentence (lines 136-137) to indicate that our conclusion is based on the fractional data, together with the total lactate data that we show in Fig. 1F.

      For all our GC-MS experiments we used ions that we detected reliably in all our experiments – in the case of lactate this is m/z 117. This is a 2-carbon fragment as indicated in the original legend; the molecular formula of the derivatised fragment is shown in Table S2. In the revised manuscript (line 671) we clarify that this fragment contains carbons 2 and 3 of lactate (which we concluded from experiments where labelling with 3,4-13C-glucose (which labels lactate at C1) led to partial decrease in this isotopologue); therefore changes in 117 m+2 indicate changes in glycolysis rather glycolysis and the PPP.

      * Figure S2G - the purpose of the measurement of cysteine is unclear; measurement of NAC directly within cells would be a clearer demonstration of its uptake, and to demonstrate direct contribution to antioxidant response would instead require measurement of cellular antioxidants rather than cysteine itself.*

      We agree with the reviewer’s comment that, ideally, we would have measured antioxidants, however, unfortunately our GC-MS experiments do not detect glutathione; we, therefore, opted to show cysteine as the best available proof that NAC was added to these cells from the same experiments where we measured aspartate and lactate.

      * There is no Figure S3F (page 6 of text)*

      In the original version of our manuscript we had awkwardly placed Figure S3F at the top right side of the figure due to space limitations, so, understandably, the reviewer may have missed it. In the revised manuscript, we have now moved this panel to the main Figure 3E, to also address the reviewer’s point 5, above (presentation of lactate data).

      * Figure 2E, lactate excretion into the media is presenting an odd profile, suggesting that between 3 and 6 hour there is uptake by cells. Equally, the 24 hour timepoint is being presented as p

      The overlap of the error bars arises from error propagation as we report the values at each time point relative to t=0h. The statistical difference we reported was calculated on the original values at 24 h alone, so to avoid this discrepancy we have opted for removing the results of this statistical test altogether.

      *

      Reviewer #2 (Significance (Required)):*

      * The data throughout this paper provide some strong evidence for an early and likely HIF-independent metabolic response - while this is understood, detailed studies have not been performed into the various redox balancing cytosolic pathways, which are presented here. The focus on GOT1 is also interesting and novel, but represents part of a larger overall picture presented, which is not reflected in the title.*

      * This is suitable for a relatively broad audience, as the phenotype is likely not cancer specific.

      *__Reviewer #3 (Evidence, reproducibility and clarity (Required)): __

      * Here, Grimm and colleagues investigate the immediate cellular response to hypoxia, prior to onset of HIF1a stabilization/activity. Consistent with established findings they describe that glycolysis is rapidly upregulated under hypoxia, in a HIF1 alpha independent manner, this correlates with an decreased aspartate levels. From this basis, they describe a key role for GOT1 activity in regulating the early hypoxic response, demonstrating its requirement for glycolysis, maintaining the NAD/NADH balance and - in combination with LDHA - maintaining ATP homeostasis in hypoxia. Finally they describe a role for GOT1 (though alpha KG depletion) in contributing to HIF1 alpha stabilization.*

      * In sum, the authors present a compelling study investigating the mechanistic basis of early response to hypoxia, placing GOT1 as a key metabolic regulator of this response. The question of how cell metabolically adapt in the short term to hypoxia is, in my view, an often overlooked area of investigation but clearly has importance across biology, not least in cancer biology - thus the area of investigation is topical. The authors conclusions are supported by their data, often in multiple cell lines and/or through orthologous methods. I would support publication of this study as is.*

      * Reviewer #3 (Significance (Required)):*

      * Significance is stated in my review above, an understudied area of investigation (early hypoxic responses) but clearly important since without a transient response, the long-term impact of HIF1 stress responses would not be possible*

      We thank the reviewer for their time assessing our manuscript and for their positive feedback.

    1. Author Response

      The following is the authors’ response to the original reviews.

      We thank the reviewers for their thorough reading of the manuscript and insightful comments. We have responded to both the “public review” and the “recommendations” and feel that the manuscript is now significantly strengthened.

      Public Review comments

      Reviewer #1:

      Weaknesses:

      1. The abstract does not discuss the reduction of E-gel consumption that occurs after multiple days of exposure to the THC formulation, but rather implies that a new model for chronic oral self-administration has been developed. Given that only two days of consumption was assessed, it is not clear if the model will be useful to determine THC effects beyond the acute measures presented here. The abstract should clarify that there was evidence of reduced consumption/aversive effects with repeated exposures.

      Thank you for your observation. We have added language to address this in the manuscript and the abstract. The model developed in the manuscript is an acute exposure model, with the intention of further chronic exposure adaptations to be developed separately (page 2, line 29).

      1. In the results section, the authors sometimes describe effects in terms of the concentration of gel as opposed to the dose consumed in mg/kg, which can make interpretation difficult. For example, the text describing Figure 1i states that significant effects on body temperature were achieved at 4 mg CTR-gel and 5 mg THC-gel, but were essentially equivalent doses consumed? It would be helpful to describe what average dose of THC produced effects given that consumption varied within each group of mice assigned to a particular concentration.

      We thank the reviewer for this comment and have edited our text to clarify our results. For example, this point is further emphasized by the correlation of the data in Figure1l-n showing the relationship between individual consumption and behavioral readouts (page 11, line 225-226).

      1. The description of the PK data in Figure 3 did not specify if sex differences were examined. Prior studies have found that males and females can exhibit stark differences in brain and plasma levels of THC and metabolites, even when behavioral effects are similar. However, this does depend on species, route, timing of tissue collection. It would be helpful to describe the PK profile of males and females separately.

      We did compare sex dependent effects and found no significant effects after THC E-gel consumption. We’ve added additional language to address this point in the discussion (Supplementary tables T1 and T2).

      1. In Figure 5, it is unclear how the predicted i.p. THC dose could be 30 mg/kg when 30 mg/kg was not tested by the i.p. route according to the figure, and if it had been it would have likely been almost zero acoustic startle, not the increased startle that was observed in the 2 hr gel group. It seems more likely that it would be equivalent to 3 mg/kg i.p. Could there be an error in the modeling, or was it based on the model used for the triad effects? This should be clarified.

      We apologize for the confusion created by that data, and it has now been updated for clarity. The original ~30mg/kg was not a predicted dose consumed, but rather an expected dose consumed based on individual male v. female consumption data in Supplemental Figure S1b. For clarity on the figure, we’ve instead placed dashed lines that draw attention only to the predicted startle response expected from our THC-E-gel model. We have also updated the text which hopefully makes this clearer.

      Reviewer #2:

      Weaknesses:

      Certainly, more THC mediated behavioral outcomes could have been tested, but the work presents a proof-of-concept study to investigate acute THC treatment.

      It would have been interesting if this application form is also possible for chronic treatment regimen

      We agree that a chronic treatment regimen and additional behavioral outcomes is the next, most exciting step for expanding this oral THC-E-gel consumption model, and something we are actively pursuing.

      Reviewer #3:

      Weaknesses:

      The main weaknesses of the manuscript revolve around clarification of the Methods section. All of these weaknesses are described in the "Recommendations to authors" section. Revising the manuscript would account for many of these weaknesses.

      Thank you for carefully reading through our methodology. We have made edits according to everything brought up in the recommendation section of reviewer comments.

      Recommendations for Authors

      Reviewer #1:

      Minor edits to the text:

      Abstract: "intraperitoneal contingent" should be "intraperitoneal noncontingent".

      Line 221, this sentence needs editing for clarity.

      Lines 249-250, incomplete sentence.

      Line 284, the word "activity" is missing from "locomotor between mice".

      Lines 299-301, incomplete sentence.

      Thank you for finding these mistakes. All these recommendations have been incorporated into the final publication.

      Reviewer #2:

      1. The typical THC tetrad includes catalepsy. Why was this behavioral outcome not monitored?

      We felt that locomotion, analgesia, and body temperature were robust behavioral readouts for monitoring cannabimimetic responses and that acoustic startle served as an additional, novel means of understanding THC-E-gel effects.

      1. Please specify the exact substrain of C57BL/6 (i.e., J or N or some other)

      C57BL/6J mice were used for the publication. This clarification has been made in the methods section.

      1. Figure S3 is not mentioned in the result part, but only in the discussion.

      Figure S3 is now referenced in the main body of the Results section.

      1. It might be interesting to follow up the issue that the individual THC consumption is considerable, as depicted in Fig. 1e (at high dose). This will presumably also lead to different behavioral responses. Or is there individual metabolism, also difference male vs. female?

      Thank you for the suggestion. We agree that the distribution of THC doses consumed (calculation based on weight) would be worth further investigating and have now included language about this (page 20, line 436). Please note that we did not find a sex difference (Supplemental Figure S1b), but it would be exciting to discover some biologically relevant cause such as individual absorption or metabolism

      Reviewer #3:

      Major

      1. Methods: Were the observers of experiments blinded to animal treatment? Why or why not?

      Multiple investigators performed the behavioral measurements and were not blinded to mouse treatments, but the dose consumed by each mouse remained blind. Thus, because animals consumed THC gelatin of their own volition while having ad libitum access, we performed the correlational analysis presented in Figure 1 l-n.

      1. Methods: The authors could consider relating their study design to the ARRIVE guidelines and providing a statement as to whether their study adheres to these guidelines. Related to this, were mice provided with any environmental enrichment during the study?

      We followed the ARRIVE guidelines with exception to investigator blinding (described above). Please note that mice were not provided with additional environmental enrichment during the study, a point that we specified in our methods (page 5, line 91).

      1. Methods / Results: In the Methods it is stated that the triad of cannabimimetic behaviors was measured 1 h post-injection or immediately after gelatin exposure. Why were these timepoints chosen? Perhaps this wording should be revised because measurements of cannabimimetic effects were taken several times after drug exposure. Peak i.p. drug may occur earlier than 1 h whereas peak oral drug effect is likely to occur over a longer time period (i.e., not immediately after) due to delays of absorption and first pass metabolism. Is it possible that the authors have underestimated oral drug effects by selecting these timepoints? Please discuss.

      We observed a reduction in locomotion activity starting 1 h following the beginning of exposure to the gelatin (Figure 2), suggesting initial cannabimimetic changes. Based on this observable response we chose to measure all cannabimimetic behaviors immediately following gelatin exposure. The exposure timeline for i.p. injection (1 h post-injection) was selected based on a standard published protocol (Metna-Laurent et al, 2017).

      a. Pharmacodynamics: Related to this and because the aim of this paper is to establish a rodent oral dose model, could the authors discuss the need for better characterization of the time course of drug effects? For example, how might anti-nociception or locomotor activity vary following THC E-gel consumption? This is somewhat addressed in the locomotion time course in Figure 2G but could be elaborated on or discussed in more detail.

      We agree that future studies should include additional time points measuring behavioral changes. This important point is now emphasized in the discussion (page 21, line 455).

      b. Pharmacokinetics: Related to this point above, have the authors considered collecting blood or tissue samples from their i.p.-injected animals to assess drug pharmacokinetics as they relate to drug effect and as compared to oral THC consumption? I am not suggesting the authors conduct a completely new study for this manuscript; however, this could be raised as a future study and/or as a weakness of the current study.

      We did not measure blood and tissue concentrations after i.p. administration due to the number of studies reporting these values by our co-author, Dr. Daniele Piomelli, that established these pharmacokinetic measures. Thus, we chose to reference these studies. Please note that repeating such measurements would be labor intensive, unnecessary use federal NIH resources and animals, while being very redundant to the existing literature.

      c. Minor, but related to these points: In the results, page 14 line 299: the first sentence of this paragraph is confusing as written. The Reviewer recognizes that the authors are relating the pharmacokinetic work to previously published findings, but still thinks that measuring and comparing THC levels from their cohort of i.p.-injected animals would have benefitted the present study.

      Thank you, this edit has been made in the manuscript.

      1. Methods, Histology: The methods as described do not contain sufficient detail regarding THC and THC metabolite quantification. In addition, it is not clear from this section what Histology was performed and how (no histology results appear in the manuscript). Please add more detail to this section of the Methods.

      We apologize for this typo and have corrected it in the methods section of the manuscript.

      1. Methods / Results: The statistics section requires additional detail regarding the rationale for tests being performed on different datasets. In addition, a description of the curve fitting used for data in figures 1H-J, 4B-D, and S4 would be helpful to the reader.

      Thank you, we have updated and provided more information regarding the curve fitting that was used in the methods and results section for the respective figure panels (page 9, line 183-184).

      Minor

      1. Throughout: The use of the phrase "high" dose is somewhat arbitrary and not defined relative to other doses of the THC formulation throughout the manuscript. The Reviewer suggests simply stating that THC was used, specifying the dose, or justifying in the Abstract and/or Introduction the classification of "high" based on relevant literature.

      Thank you for the observation. We have removed this ambiguity by specifically mentioning the dose that was consumed (e.g., abstract page 2, line 20).

      1. Abstract: define "CB1" in the abstract. Although this is a common abbreviation within the field, its use should be defined.

      We have added this definition in the abstract for clarification.

      1. Figure 2: why are the consumption panels B, C, and D given separate labels but the locomotor data are all labeled together as panel G?

      Thank you for the observation, we have adjusted the labeling, so it is equal for both sets of panels.

    1. Author Response

      The following is the authors’ response to the current reviews.

      We agree with the reviewer that the statistics are buried in a dense excel file without a read-me page. We will address this by making a summary excel page for p-values during the production process.


      The following is the authors’ response to the original reviews.

      eLife assessment

      This important study uses genomically-engineered glypican alleles to demonstrate convincingly that Dally (not Dally-like protein [Dlp]) is the key contributor to formation of the Dpp/BMP morphogen gradient in the wing disc of Drosophila. The authors provide solid genetic evidence that, surprisingly, the core domain of Dally appears to suffice to trap Dpp at the cell surface. They conclude with a model according to which Dally modulates the range of Dpp signaling by interfering with Dpp's internalization by the Dpp receptor Thickveins.

      Public Reviews:

      Reviewer #1 (Public Review):

      How morphogens spread within tissues remains an important question in developmental biology. Here the authors revisit the role of glypicans in the formation of the Dpp gradient in wing imaginal discs of Drosophila. They first use sophisticated genome engineering to demonstrate that the two glypicans of Drosophila are not equivalent despite being redundant for viability. They show that Dally is the relevant glypican for Dpp gradient formation. They then provide genetic evidence that, surprisingly, the core domain of Dally suffices to trap Dpp at the cell surface (suggesting a minor role for GAGs). They conclude with a model that Dally modulates the range of Dpp signaling by interfering with Dpp's degradation by Tkv. These are important conclusions, but more independent (biochemical/cell biological) evidence is needed.

      As indicated above, the genetic evidence for the predominant role of Dally in Dpp protein/signalling gradient formation is strong. In passing, the authors could discuss why overexpressed Dlp has a negative effect on signaling, especially in the anterior compartment. The authors then move on to determine the role of GAG (=HS) chains of Dally. They find that in an overexpression assay, Dally lacking GAGs traps Dpp at the cell surface and, counterintuitively, suppresses signaling (fig 4 C, F). Both findings are unexpected and therefore require further validation and clarification, as outlined in a and b below.

      a. In loss of function experiments (dallyDeltaHS replacing endogenous dally), Dpp protein is markedly reduced (fig 4R), as much as in the KO (panel Q), suggesting that GAG chains do contribute to trapping Dpp at the cell surface. This is all the more significant that, according to the overexpression essays, DallyDeltaHS seems more stable than WT Dally (by the way, this difference should also be assessed in the knock-ins, which is possible since they are YFP-tagged). The authors acknowledge that HS chains of Dally are critical for Dpp distribution (and signaling) under physiological conditions. If this is true, one can wonder why overexpressed dally core 'binds' Dpp and whether this is a physiologically relevant activity.

      According to the overexpression assay, DallyDeltaHS seems more stable than WT Dally (Fig. 4B’, E’, 5A’, B’). As the reviewer suggested, we addressed the difference using the two knock-in alleles and found that DallyDeltaHS is more stable than WT Dally (Fig.4 L, M inset), further emphasizing the insufficient role of core protein of Dally for extracellular Dpp distribution.

      In summary, we showed that, although Dally interacts with Dpp mainly through its core protein from the overexpression assay (Fig. 4E, I), HS chains are essential for extracellular Dpp distribution (Fig. 4R). Thus, the core protein of Dally alone is not sufficient for extracellular Dpp distribution under physiological conditions. These results raise a question about whether the interaction of core protein of Dally with Dpp is physiologically relevant. Since the increase of HS upon dally expression but not upon dlp expression resulted in the accumulation of extracellular Dpp (Fig. 2) and this accumulation was mainly through the core protein of Dally (Fig. 4E, I), we speculate that the interaction of the core protein of Dally with Dpp gives ligand specificity to Dally under physiological conditions.

      To understand the importance of the interaction of core protein of Dally with Dpp under physiological conditions, it is important to identify a region responsible for the interaction. Our preliminary results overexpressing a dally mutant lacking the majority of core protein (but keeping the HS modified region intact) showed that HS chains modification was also lost. Although this is consistent with our results that enzymes adding HS chains also interact with the core protein of Dally (Fig. 4D), the dally mutant allele lacking the core protein would hamper us from distinguishing the role of core protein of Dally from HS chains.

      Nevertheless, we can infer the importance of the interaction of core protein of Dally with Dpp using dally[3xHA-dlp, attP] allele, where dlp is expressed in dally expressing cells. Since Dally-like is modified by HS chains but does not interact with Dpp (Fig. 2, 4), dally[3xHA-dlp, attP] allele mimics a dally allele where HS chains are properly added but interaction of core protein with Dpp is lost. As we showed in Fig.3O, S, the allele could not rescue dallyKO phenotypes, consistent with the idea that interaction of core protein of Dally with Dpp is essential for Dpp distribution and signaling and HS chain alone is not sufficient for Dpp distribution.

      b. Although the authors' inference that dallycore (at least if overexpressed) can bind Dpp. This assertion needs independent validation by a biochemical assay, ideally with surface plasmon resonance or similar so that an affinity can be estimated. I understand that this will require a method that is outside the authors' core expertise but there is no reason why they could not approach a collaborator for such a common technique. In vitro binding data is, in my view, essential.

      We agree with the reviewer that a biochemical assay such as SPR helps us characterize the interaction of core protein of Dally and Dpp (if the interaction is direct), although the biochemical assay also would not demonstrate the interaction under the physiological conditions.

      However, SPR has never been applied in the case of Dpp, probably because purifying functional refolded Dpp dimer from bacteria has previously been found to be stable only in low pH and be precipitated in normal pH buffer (Groppe J, et al., 1998)(Matsuda et al., 2021). As the reviewer suggests, collaborating with experts is an important step in the future.

      Nevertheless, SPR was applied for the interaction between BMP4 and Dally (Kirkpatrick et al., 2006), probably because BMP4 is more stable in the normal buffer. Although the binding affinity was not calculated, SPR showed that BMP4 directly binds to Dally and this interaction was only partially inhibited by molar excess of exogenous HS, suggesting that BMP4 can interact with core protein of Dally as well as its HS chains. In addition, the same study applied Co-IP experiments using lysis of S2 cells and showed that Dpp and core protein of Dally are co-immunoprecipitated, although it does not demonstrate if the interaction is direct.

      In a subsequent set of experiments, the authors assess the activity of a form of Dpp that is expected not to bind GAGs (DppDeltaN). Overexpression assays show that this protein is trapped by DallyWT but not dallyDeltaHS. This is a good first step validation of the deltaN mutation, although, as before, an invitro binding assay would be preferable.

      Our overexpression assays actually showed that DppDeltaN is trapped by DallyWT and by dallyDeltaHS at similar levels (Fig. 5C), indicating that interaction of DppDeltaN and HS chains of Dally is largely lost but DppDeltaN can still interact with core protein of Dally.

      We thank the reviewer for the suggesting the in vitro experiment. Although we decided not to develop biophysical experiments such as SPR for Dpp in this study due to the reasons discussed above, we would like to point out that our result is consistent with a previous Co-IP experiment using S2 cells showing that DppDeltaN loses interaction with heparin (Akiyama2008).

      However, in contrast to our results, the same study also proposed by Co-IP experiments using S2 cells that DppDeltaN loses interaction with Dally (Akiyama2008). Although it is hard to conclude since western blotting was too saturated without loading controls and normalization (Fig. 1C in Akiyama 2008), and negative in vitro experiments do not necessarily demonstrate the lack of interaction in vivo. One explanation why the interaction was missed in the previous study is that some factors required for the interaction of DppDeltaN with core protein of Dally are missing in S2 cells. In this case, in vivo interaction assay we used in this study has an advantage to robustly detect the interaction.

      Nevertheless, the authors show that DppDeltaN is surprisingly active in a knock-in strain. At face value (assuming that DeltaN fully abrogates binding to GAGs), this suggests that interaction of Dpp with the GAG chains of Dally is not required for signaling activity. This leads to authors to suggest (as shown in their final model) that GAG chains could be involved in mediating the interactions of Dally with Tkv (and not with Dpp. This is an interesting idea, which would need to be reconciled with the observation that the distribution of Dpp is affected in dallyDeltaHS knock-ins (item a above). It would also be strengthened by biochemical data (although more technically challenging than the experiments suggested above). In an attempt to determine the role of Dally (GAGs in particular) in the signaling gradient, the paper next addresses its relation to Tkv. They first show that reducing Tkv leads to Dpp accumulation at the cell surface, a clear indication that Tkv normally contributes to the degradation of Dpp. From this they suggest that Tkv could be required for Dpp internalisation although this is not shown directly. The authors then show that a Dpp gradient still forms upon double knockdown (Dally and Tkv). This intriguing observation shows that Dally is not strictly required for the spread of Dpp, an important conclusion that is compatible with early work by Lander suggesting that Dpp spreads by free diffusion. These result show that Dally is required for gradient formation only when Tkv is present. They suggest therefore that Dally prevents Tkv-mediated internalisation of Dpp. Although this is a reasonable inference, internalisation assays (e.g. with anti-Ollas or anti-HA Ab) would strengthen the authors' conclusions especially because they contradict a recent paper from the Gonzalez-Gaitan lab.

      Thanks for suggesting the internalization assay. As we discussed in the discussion, our results suggest that extracellular Dpp distribution is severely reduced in dally mutants due to Tkv mediated internalization of Dpp (Fig. 6). Thus, extracellular Dpp available for labelling with nanobody is severely reduced in dally mutants, which can explain the reduced internalization of Dpp in dally mutants in the internalization assay. Therefore, we think that the nanobody internalization assay would not distinguish the two contradicting possibilities.

      The paper ends with a model suggesting that HS chains have a dual function of suppressing Tkv internalisation and stimulating signaling. This constitutes a novel view of a glypican's mode of action and possibly an important contribution of this paper. As indicated above, further experiments could considerably strengthen the conclusion. Speculation on how the authors imagine that GAG chains have these activities would also be warranted.

      Thank you very much!

      Reviewer #2 (Public Review):

      The authors are trying to distinguish between four models of the role of glypicans (HSPGs) on the Dpp/BMP gradient in the Drosophila wing, schematized in Fig. 1: (1) "Restricted diffusion" (HSPGs transport Dpp via repetitive interaction of HS chains with Dpp); (2) "Hindered diffusion" (HSPGs hinder Dpp spreading via reversible interaction of HS chains with Dpp); (3) "Stabilization" (HSPGs stabilize Dpp on the cell surface via reversible interaction of HS chains with Dpp that antagonizes Tkv-mediated Dpp internalization); and (4) "Recycling" (HSPGs internalize and recycle Dpp).

      To distinguish between these models, the authors generate new alleles for the glypicans Dally and Dally-like protein (Dlp) and for Dpp: a Dally knock-out allele, a Dally YFP-tagged allele, a Dally knock-out allele with 3HA-Dlp, a Dlp knock-out allele, a Dlp allele containing 3-HA tags, and a Dpp lacking the HS-interacting domain. Additionally, they use an OLLAS-tag Dpp (OLLAS being an epitope tag against which extremely high affinity antibodies exist). They examine OLLAS-Dpp or HA-Dpp distribution, phospho-Mad staining, adult wing size.

      They find that over-expressed Dally - but not Dlp - expands Dpp distribution in the larval wing disc. They find that the Dally[KO] allele behaves like a Dally strong hypomorph Dally[MH32]. The Dally[KO] - but not the Dlp[KO] - caused reduced pMad in both anterior and posterior domains and reduced adult wing size (particularly in the Anterior-Posterior axis). These defects can be substantially corrected by supplying an endogenously tagged YFP-tagged Dally. By contrast, they were not rescued when a 3xHA Dlp was inserted in the Dally locus. These results support their conclusion that Dpp interacts with Dally but not Dlp.

      They next wanted to determine the relative contributions of the Dally core or the HS chains to the Dpp distribution. To test this, they over-expressed UAS-Dally or UAS-Dally[deltaHS] (lacking the HS chains) in the dorsal wing. Dally[deltaHS] over-expression increased the distribution of OLLAS-Dpp but caused a reduction in pMad. Then they write that after they normalize for expression levels, they find that Dally[deltaHS] only mildly reduces pMad and this result indicates a major contribution of the Dally core protein to Dpp stability.

      Thanks for the comments. We actually showed that compared with Dally overexpression, Dally[deltaHS] overexpression only mildly reduces extracellular Dpp accumulation (Fig. 4I). This indicates a major contribution of the Dally core protein to interaction with Dpp, although the interaction is not sufficient to sustain extracellular Dpp distribution and signaling gradient.

      The "normalization" is a key part of this model and is not mentioned how the normalization was done. When they do the critical experiment, making the Dally[deltaHS] allele, they find that loss of the HS chains is nearly as severe as total loss of Dally (i.e., Dally[KO]). Additionally, experimental approaches are needed here to prove the role of the Dally core.

      Since the expression level of Dally[deltaHS] is higher than Dally when overexpressed, we normalized extracellular Dpp distribution (a-Ollas staining) against GFP fluorescent signal (Dally or Dally[deltaHS]). To do this, we first extracted both signal along the A-P axis from the same ROI in the previous version. The ratio was calculated by dividing the intensity of a-Ollas staining with the intensity of GFP fluorescent signal at a given position x. The average profile from each normalized profile was generated and plotted using the script described in the method (wingdisc_comparison.py) as other pMad or extracellular staining profiles.

      Although this analysis provides normalized extracellular Dpp accumulation at different positions along the A-P axis, we are more interested in the total amount of Dpp or DppDeltaN accumulation upon Dally or dallyDeltaHS expression. Therefore, in the revised ms, we decided to normalize total amount of extracellular Dpp against the level of Dally or Dally[deltaHS] by dividing total signal intensity of extracellular Dpp staining (ExOllas staining) by total GFP fluorescent signal (Dally or Dally[deltaHS]) around the Dpp producing cells in each wing disc. Statistical analysis showed that accumulation of extracellular Dpp is only slightly reduced without HS chains (Fig.4I), indicating that Dally interacts with Dpp mainly through its core protein.

      We agree with the reviewer that additional experimental approaches are needed to address the role of the core protein of Dally. As we discussed in the response to the reviewer1, to understand the importance of the interaction of core protein of Dally with Dpp, it is important to identify a region responsible for the interaction. Our preliminary results overexpressing a dally mutant lacking the majority of core protein (but keeping the HS modified region intact) showed that HS chains modification was also lost. Although this is consistent with our results that enzymes adding HS chains also interact with the core protein of Dally (Fig. 4D), the dally mutant allele lacking the core protein would hamper us from distinguishing the role of the core protein of Dally from HS chains.

      Nevertheless, we can infer the importance of the interaction of core protein of Dally with Dpp using dally[3xHA-dlp, attP] allele, where dlp is expressed in dally expressing cells. Since Dally-like is modified by HS chains but does not interact with Dpp (Fig. 2, 4), dally[3xHA-dlp, attP] allele mimics a dally allele where HS chains are properly added but interaction of core protein with Dpp is lost. As we showed in Fig.3O, S, the allele could not rescue dallyKO phenotypes, consistent with the idea that interaction of core protein of Dally with Dpp is essential for Dpp distribution and signaling.

      Prior work has shown that a stretch of 7 amino acids in the Dpp N-terminal domain is required to interact with heparin but not with Dpp receptors (Akiyama, 2008). The authors generated an HA-tagged Dpp allele lacking these residues (HA-dpp[deltaN]). It is an embryonic lethal allele, but they can get some animals to survive to larval stages if they also supply a transgene called “JAX” containing dpp regulatory sequences. In the JAX; HA-dpp[deltaN] mutant background, they find that the distribution and signaling of this Dpp molecule is largely normal. While over-expressed Dally can increase the distribution of HA-dpp[deltaN], over-expression of Dally[deltaHS] cannot. These latter results support the model that the HS chains in Dally are required for Dpp function but not because of a direct interaction with Dpp.

      Our overexpression assays actually showed that both Dally and Dally[deltaHS] can accumulate Dpp upon overexpression and the accumulation of Dpp is comparable after normalization (Fig. 5C), consistent with the idea that interaction of DppdeltaN and HS chains are largely lost. As the reviewer pointed out, these results support the model that the HS chains in Dally are required for Dpp function but not because of a direct interaction with Dpp.

      In the last part of the results, they attempt to determine if the Dpp receptor Thickveins (Tkv) is required for Dally-HS chains interaction. The 2008 (Akiyama) model posits that Tkv activates pMad downstream of Dpp and also internalizes and degrades Dpp. A 2022 (Romanova-Michaelides) model proposes that Dally (not Tkv) internalizes Dpp.

      To distinguish between these models, the authors deplete Tkv from the dorsal compartment of the wing disc and found that extracellular Dpp increased and expanded in that domain. These results support the model that Tkv is required to internalize Dpp.

      They then tested the model that Dally antagonizes Tkv-mediated Dpp internalization by determining whether the defective extracellular Dpp distribution in Dally[KO] mutants could be rescued by depleting Tkv. Extracellular Dpp did increase in the D vs V compartment, potentially providing some support for their model. However, there are no statistics performed, which is needed for full confidence in the results. The lack of statistics is particularly problematic (1) when they state that extracellular Dpp does not rise in ap>tkv RNAi vs ap>tkv RNAi, dally[KO] wing discs (Fig. 6E) or (2) when they state that extracellular Dpp gradient expanded in the dorsal compartment when tkv was dorsally depleted in dally[deltaHS] mutants (Fig. 6I). These last two experiments are important for their model but the differences are assessed only visually. In fact, extracellular Dpp in ap>tkv RNAi, dally[KO] (Fig. 6B) appears to be lower than extracellular Dpp in ap>tkv RNAi (Fig. 6A) and the histogram of Dpp in ap>tkv RNAi, dally[KO] is actually a bit lower than Dpp in ap>tkv RNAi, But the author claim that there is no difference between the two. Their conclusion would be strengthened by statistical analyses of the two lines.

      We provided statistics for all the quantifications for pMad and extracellular Dpp distribution as supplementary data. In the previous version, we argued that extracellular Dpp level in ap>tkvRNAi, dallyKO (Fig.6B) does not increase compared with that in ap>tkvRNAi (Fig.6A). Statistical analysis (t-test) showed that the extracellular Dpp level in Fig. 6B is similar to or lower than that in Fig. 6A (Fig. 6E), confirming our conclusion. Statistical analysis (t-test) also confirmed that extracellular Dpp distribution expanded when tkv was knocked down in dallyHS mutants (Fig. 6I).

      Strengths:

      1. New genomically-engineered alleles

      A considerable strength of the study is the generation and characterization of new Dally, Dlp and Dpp alleles. These reagents will be of great use to the field.

      Thanks. We hope that these resources are indeed useful to the field.

      1. Surveying multiple phenotypes

      The authors survey numerous parameters (Dpp distribution, Dpp signaling (pMad) and adult wing phenotypes) which provides many points of analysis.

      Thanks!

      Weaknesses:

      1. Confusing discussion regarding the Dally core vs HS in Dpp stability. They don't provide any measurements or information on how they "normalize" for the level of Dally vs Dally[deltaHS]? This is important part of their model that currently is not supported by any measurements.

      We explained how we normalized in the above section and updated the method section in the revised ms.

      1. Lacking quantifications and statistical analyses:

      a. Why are statistical significance for histograms (pMad and Dpp distribution) not supplied? These histograms provide the key results supporting the authors' conclusions but no statistical tests/results are presented. This is a pervasive shortcoming in the current study.

      Thanks. We provided t-test analyses together with the raw data as supplementary data.

      b. dpp[deltaN] with JAX transgene - it would strengthen the study to supply quantitative data on the percent survival/lethal stage of dpp[deltaN] mutants with or without the JAK transgene

      In this study, we are interested in the role of dpp[deltaN] during the wing disc development. Therefore, we decided not to perform the detailed analysis on the percent survival/lethal stage of dpp[deltaN] mutants with or without the JAX transgene in the current study. Nevertheless, the fact that dpp[deltaN] allele is maintained with a balanced stock and JAX;dpp[deltaN] allele can be maintained as homozygous stock indicates that the lethality of dpp[deltaN] allele comes from the early stages. Indeed, our preliminary results showed that pMad signal is severely lost in the dpp[deltaN] embryo without JAX (data not shown), indicating that the allele is lethal at early embryonic stages.

      c. The graphs on wing size etc should start at zero.

      Thanks. We corrected this in the current ms.

      d. The sizes of histograms and graphs in each figure should be increased so that the reader can properly assess them. Currently, they are very small.

      Thanks. We changed the sizes in the current ms.

      The authors' model is that Dally (not Dlp) is required for Dpp distribution and signaling but that this is not due to a direct interaction with Dpp. Rather, they posit that Dally-HS antagonize Tkv-mediated Dpp internalization. Currently the results of the experiments could be considered consistent with their model, but as noted above, the lack of statistical analyses of some parameters is a weakness.

      Thanks. We now performed and provided the statistical analyses in the revised ms.

      One problematic part of their result for me is the role of the Dally core protein (Fig. 7B). There is a mis-match between the over-expression results and Dally allele lacking HS (but containing the core). Finally, their results support the idea that one or more as-yet unidentified proteins interact with Dally-HS chains to control Dpp distribution and signaling in the wing disc.

      Our results simply suggest that Dpp can interact with Dally mainly through core protein but this interaction is not sufficient to sustain extracellular Dpp gradient formation under physiological conditions (dallyDeltaHS) (Fig. 4Q). We find that the mis-match is not problematic if the role of Dally is not simply mediated through interaction with Dpp. We speculate that interaction of Dpp and core protein of Dally is transient and not sufficient to sustain the Dpp gradient without HS chains of Dally stabilizing extracellular Dpp distribution by blocking Tkv-mediated Dpp internalization.

      There is much debate and controversy in the Dpp morphogen field. The generation of new, high quality alleles in this study will be useful to Drosophila community, and the results of this study support the concept that Tkv but not Dally regulate Dpp internalization. Thus the work could be impactful and fuel new debates among morphogen researchers.

      Thanks.

      The manuscript is currently written in a manner that really is only accessible to researchers who work on the Dpp gradient. It would be very helpful for the authors to re-write the manuscript and carefully explain in each section of the results (1) the exact question that will be asked, (2) the prior work on the topic, (3) the precise experiment that will be done, and (4) the predicted results. This would make the study more accessible to developmental biologists outside of the morphogen gradient and Drosophila communities.

      Thanks. We modified texts and changed the order of Fig.5. We hope that the changes make this study more accessible to developmental biologists outside of the field.

    1. Author Response:

      Reviewer #1 (Public Review):

      [...] Weaknesses:

      1. I feel the authors need to justify why flow-crushing helps localization specificity. There is an entire family of recent papers that aim to achieve higher localization specificity by doing the exact opposite. Namely, MT or ABC fRMRI aims to increase the localization specificity by highlighting the intravascular BOLD by means of suppressing non-flowing tissue. To name a few:

      Priovoulos, N., de Oliveira, I.A.F., Poser, B.A., Norris, D.G., van der Zwaag, W., 2023. Combining arterial blood contrast with BOLD increases fMRI intracortical contrast. Human Brain Mapping hbm.26227. https://doi.org/10.1002/hbm.26227.

      Pfaffenrot, V., Koopmans, P.J., 2022. Magnetization Transfer weighted laminar fMRI with multi-echo FLASH. NeuroImage 119725. https://doi.org/10.1016/j.neuroimage.2022.119725

      Schulz, J., Fazal, Z., Metere, R., Marques, J.P., Norris, D.G., 2020. Arterial blood contrast ( ABC ) enabled by magnetization transfer ( MT ): a novel MRI technique for enhancing the measurement of brain activation changes. bioRxiv. https://doi.org/10.1101/2020.05.20.106666

      Based on this literature, it seems that the proposed method will make the vein problem worse, not better. The authors could make it clearer how they reason that making GE-BOLD signals more extra-vascular weighted should help to reduce large vein effects.

      The empirical evidence for the claim that flow crushing helps with the localization specificity should be made clearer. The response magnitude with and without flow crushing looks pretty much identical to me (see Fig, 6d). It's unclear to me what to look for in Fig. 5. I cannot discern any layer patterns in these maps. It's too noisy. The two maps of TE=43ms look like identical copies from each other. Maybe an editorial error?

      The authors discuss bipolar crushing with respect to SE-BOLD where it has been previously applied. For SE-BOLD at UHF, a substantial portion of the vein signal comes from the intravascular compartment. So I agree that for SE-BOLD, it makes sense to crush the intravascular signal. For GE-BOLD however, this reasoning does not hold. For GE-BOLD (even at 3T), most of the vein signal comes from extravascular dephasing around large unspecific veins, and the bipolar crushing is not expected to help with this.

      The authors would like to clarify that the velocity-nulling gradient is NOT designed to suppress all the contributions from intravascular blood. Instead, we tried to find a balance so that the VN gradient maximally suppressed the macrovascular signal in unspecific veins but minimally attenuated the microvascular signal in specific capillary bed. We acknowledge the reviewer's concern regarding the potential extravascular contributions from large, non-specific vessels. This aspect will be thoroughly evaluated and addressed in the revised manuscript. Additionally, we will make clarifications in other parts that may have cause the reviewer’s misunderstandings.

      1. The bipolar crushing is limited to one single direction of flow. This introduces a lot of artificial variance across the cortical folding pattern. This is not mentioned in the manuscript. There is an entire family of papers that perform layer-fmri with black-blood imaging that solves this with a 3D contrast preparation (VAPER) that is applied across a longer time period, thus killing the blood signal while it flows across all directions of the vascular tree. Here, the signal cruising is happening with a 2D readout as a "snap-shot" crushing. This does not allow the blood to flow in multiple directions. VAPER also accounts for BOLD contaminations of larger draining veins by means of a tag-control sampling. The proposed approach here does not account for this contamination.

      Chai, Y., Li, L., Huber, L., Poser, B.A., Bandettini, P.A., 2020. Integrated VASO and perfusion contrast: A new tool for laminar functional MRI. NeuroImage 207, 116358. https://doi.org/10.1016/j.neuroimage.2019.116358

      Chai, Y., Liu, T.T., Marrett, S., Li, L., Khojandi, A., Handwerker, D.A., Alink, A., Muckli, L., Bandettini, P.A., 2021. Topographical and laminar distribution of audiovisual processing within human planum temporale. Progress in Neurobiology 102121. https://doi.org/10.1016/j.pneurobio.2021.102121

      If I would recommend anyone to perform layer-fMRI with blood crushing, it seems that VAPER is the superior approach. The authors could make it clearer why users might want to use the unidirectional crushing instead.

      We acknowledge that the degree of velocity nulling varies across the cortical folding pattern. We intend to discuss potential solutions to address this variance, and these may be implemented in the revised manuscript as appropriate. Furthermore, we will provide a comprehensive discussion on the advantages and disadvantages of both CBV-based and BOLD-based approaches.

      1. The comparison with VASO is misleading. The authors claim that previous VASO approaches were limited by TRs of 8.2s. The authors might be advised to check the latest literature of the last years. Koiso et al. performed whole brain layer-fMRI VASO at 0.8mm at 3.9 seconds (with reliable activation), 2.7 seconds (with unconvincing activation pattern, though), and 2.3 (without activation). Also, whole brain layer-fMRI BOLD at 0.5mm and 0.7mm has been previously performed by the Juelich group at TRs of 3.5s (their TR definition is 'fishy' though).

      Koiso, K., Müller, A.K., Akamatsu, K., Dresbach, S., Gulban, O.F., Goebel, R., Miyawaki, Y., Poser, B.A., Huber, L., 2023. Acquisition and processing methods of whole-brain layer-fMRI VASO and BOLD: The Kenshu dataset. Aperture Neuro 34. https://doi.org/10.1101/2022.08.19.504502

      Yun, S.D., Pais‐Roldán, P., Palomero‐Gallagher, N., Shah, N.J., 2022. Mapping of whole‐cerebrum resting‐state networks using ultra‐high resolution acquisition protocols. Human Brain Mapping. https://doi.org/10.1002/hbm.25855

      Pais-Roldan, P., Yun, S.D., Palomero-Gallagher, N., Shah, N.J., 2023. Cortical depth-dependent human fMRI of resting-state networks using EPIK. Front. Neurosci. 17, 1151544. https://doi.org/10.3389/fnins.2023.1151544

      The authors are correct that VASO is not advised as a turn-key method for lower brain areas, incl. Hippocampus and subcortex. However, the authors use this word of caution that is intended for inexperienced "users" as a statement that this cannot be performed. This statement is taken out of context. This statement is not from the academic literature. It's advice for the 40+ user base that wants to perform layer-fMRI as a plug-and-play routine tool in neuroscience usage. In fact, sub-millimeter VASO is routinely being performed by MRI-physicists across all brain areas (including deep brain structures, hippocampus etc). E.g. see Koiso et al. and an overview lecture from a layer-fMRI workshop that I had recently attended: https://youtu.be/kzh-nWXd54s?si=hoIJjLLIxFUJ4g20&t=2401

      Thus, the authors could embed this phrasing into the context of their own method that they are proposing in the manuscript. E.g. the authors could state whether they think that their sequence has the potential to be disseminated across sites, considering that it requires slow offline reconstruction in Matlab? Do the authors think that the results shown in Fig. 6c are suggesting turn-key acquisition of a routine mapping tool? In my humble opinion, it looks like random noise, with most of the activation outside the ROI (in white matter).

      Those literatures will be included and discussed in the revised manuscript. Furthermore, we are considering the exclusion of the LGN results presented in Figure 6, as they may divert attention from the primary focus of the study.

      We are enthusiastic about sharing our imaging sequence, provided its usefulness is conclusively established. However, it's important to note that without an online reconstruction capability, such as the ICE, the practical utility of the sequence may be limited. Unfortunately, we currently don’t have the manpower to implement the online reconstruction. Nevertheless, we are more than willing to share the offline reconstruction codes upon request.

      1. The repeatability of the results is questionable. The authors perform experiments about the robustness of the method (line 620). The corresponding results are not suggesting any robustness to me. In fact, the layer profiles in Fig. 4c vs. Fig 4d are completely opposite. The location of peaks turns into locations of dips and vice versa. The methods are not described in enough detail to reproduce these results. The authors mention that their image reconstruction is done "using in-house MATLAB code" (line 634). They do not post a link to github, nor do they say if they share this code.

      It is not trivial to get good phase data for fMRI. The authors do not mention how they perform the respective coil-combination. No data are shared for reproduction of the analysis.

      There may have been a misunderstanding regarding the presentation in Figure 4, which illustrates the impact of TEs and the VN gradient. To enhance clarity and avoid further confusion, we will redesign this figure for improved comprehension.

      The authors are open to sharing the MATLAB codes associated with our study. However, we were limited by manpower for refining and enhancing the readability of these codes for broader use.

      Regarding the coil combination, we utilized an adaptive coil combination approach as described in the paper by Walsh DO, Gmitro AF, and Marcellin MW, titled 'Adaptive reconstruction of phased array MR imagery' (Magnetic Resonance in Medicine 2000; 43:682-690). The MATLAB code for this method was implemented by Dr. Diego Hernando. We will include a link for downloading this code in the revised manuscript for the convenience of interested readers.

      1. The application of NODRIC is not validated. Previous applications of NORDIC at 3T layer-fMRI have resulted in mixed success. When not adjusted for the right SNR regime it can result in artifactual reductions of beta scores, depending on the SNR across layers. The authors could validate their application of NORDIC and confirm that the average layer-profiles are unaffected by the application of NORDIC. Also, the NORDIC version should be explicitly mentioned in the manuscript.

      Akbari, A., Gati, J.S., Zeman, P., Liem, B., Menon, R.S., 2023. Layer Dependence of Monocular and Binocular Responses in Human Ocular Dominance Columns at 7T using VASO and BOLD (preprint). Neuroscience. https://doi.org/10.1101/2023.04.06.535924

      Knudsen, L., Guo, F., Huang, J., Blicher, J.U., Lund, T.E., Zhou, Y., Zhang, P., Yang, Y., 2023. The laminar pattern of proprioceptive activation in human primary motor cortex. bioRxiv. https://doi.org/10.1101/2023.10.29.564658

      During our internal testing, we observed that the NORDIC denoising process did not alter the activation patterns. These findings will be incorporated into the revised manuscript. The details of NORDIC will be provided as well.

      Reviewer #2 (Public Review):

      [...] The well-known double peak feature in M1 during finger tapping was used as a test-bed to evaluate the spatial specificity. They were indeed able to demonstrate two distinct peaks in group-level laminar profiles extracted from M1 during finger tapping, which was largely free from superficial bias. This is rather intriguing as, even at 7T, clear peaks are usually only seen with spatially specific non-BOLD sequences. This is in line with their simple simulations, which nicely illustrated that, in theory, intravascular macrovascular signals should be suppressible with only minimal suppression of microvasculature when small b-values of the VN gradients are employed. However, the authors do not state how ROIs were defined making the validity of this finding unclear; were they defined from independent criteria or were they selected based on the region mostly expressing the double peak, which would clearly be circular? In any case, results are based on a very small sub-region of M1 in a single slice - it would be useful to see the generalizability of superficial-bias-free BOLD responses across a larger portion of M1.

      Given the individual variations in the location of the M1 region, we opted for manual selection of the ROI. In the revised manuscript, we plan to explore and implement an independent criterion for ROI selection to enhance the objectivity and reproducibility of our methodology.

      As repeatedly mentioned by the authors, a laminar fMRI setup must demonstrate adequate functional sensitivity to detect (in this case) BOLD responses. The sensitivity evaluation is unfortunately quite weak. It is mainly based on the argument that significant activation was found in a challenging sub-cortical region (LGN). However, it was a single participant, the activation map was not very convincing, and the demonstration of significant activation after considerable voxel-averaging is inadequate evidence to claim sufficient BOLD sensitivity. How well sensitivity is retained in the presence of VN gradients, high acceleration factors, etc., is therefore unclear. The ability of the setup to obtain meaningful functional connectivity results is reassuring, yet, more elaborate comparison with e.g., the conventional BOLD setup (no VN gradients) is warranted, for example by comparison of tSNR, quantification and comparison of CNR, illustration of unmasked-full-slice activation maps to compare noise-levels, comparison of the across-trial variance in each subject, etc. Furthermore, as NORDIC appears to be a cornerstone to enable submillimeter resolution in this setup at 3T, it is critical to evaluate its impact on the data through comparison with non-denoised data, which is currently lacking.

      We appreciate the reviewer’s comments. Those issues will be addressed carefully.

      Reviewer #3 (Public Review):

      [...] Weaknesses: - Although the VASO acquisition is discussed in the introduction section, the VN-sequence seems closer to diffusion-weighted functional MRI. The authors should make it more clear to the reader what the differences are, and how results are expected to differ. Generally, it is not so clear why the introduction is so focused on the VASO acquisition (which, curiously, lacks a reference to Lu et al 2013). There are many more alternatives to BOLD-weighted imaging for fMRI. CBF-weighted ASL and GRASE have been around for a while, ABC and double-SE have been proposed more recently.

      The principal distinction between DW-fMRI and our methodology lies in the level of the b-value employed. DW-fMRI typically measures cellular swelling by utilizing a b-value greater than 1000 s/mm^2 (e.g. 1800). Conversely, our Velocity Nulling functional MRI (VN-fMRI) approach continues to assess hemodynamic responses, utilizing a smaller b-value specifically for the suppression of signals from draining veins. In addition, other layer-fMRI methods will be discussed.

      • The comparison in Figure 2 for different b-values shows % signal changes. However, as the baseline signal changes dramatically with added diffusion weighting, this is rather uninformative. A plot of t-values against cortical depth would be much more insightful.
      • Surprisingly, the %-signal change for a b-value of 0 is not significantly different from 0 in the gray matter. This raises some doubts about the task or ROI definition. A finger-tapping task should reliably engage the primary motor cortex, even at 3T, and even in a single participant.
      • The BOLD weighted images in Figure 3 show a very clear double-peak pattern. This contradicts the results in Figure 2 and is unexpected given the existing literature on BOLD responses as a function of cortical depth.

      In our study, the TE in Figure 2 is shorter than that in Figure 3 (33 ms versus 43 ms). It has been reported in the literature that BOLD fMRI with a shorter TE tends to include a greater intravascular contribution. Acknowledging this, we plan to repeat the experiments with a controlled TE to ensure consistency in our results.

      • Given that data from Figures 2, 3, and 4 are derived from a single participant each, order and attention affects might have dramatically affected the observed patterns. Especially for Figure 4, neither BOLD nor VN profiles are really different from 0, and without statistical values or inter-subject averaging, these cannot be used to draw conclusions from.

      The order of the experiments were randomized to ensure unbiased results.

      It is important to note that the error bars presented in Figures 2, 3, and 4 do not represent the standard deviation of the residual fitting error. Instead, they illustrate the variation across voxels within a specific layer. This approach may lead to the error bars being influenced by the selection of the Region of Interest (ROI). In light of this, we intend to refine our statistical methodologies in the revised manuscript to address this issue.

      • In Figure 5, a phase regression is added to the data presented in Figure 4. However, for a phase regression to work, there has to be a (macrovascular) response to start with. As none of the responses in Figure 4 are significant for the single participant dataset, phase regression should probably not have been undertaken. In this case, the functional 'responses' appear to increase with phase regression, which is contra-intuitive and deserves an explanation.
      • Consistency of responses is indeed expected to increase by a removal of the more variable vascular component. However, the microvascular component is always expected to be smaller than the combination of microvascular + macrovascular responses. Note that the use of %signal changes may obscure this effect somewhat because of the modified baseline. Another expected feature of BOLD profiles containing both micro- and microvasculature is the draining towards the cortical surface. In the profiles shown in Figure 7, this is completely absent. In the group data, no significant responses to the task are shown anywhere in the cortical ribbon.
      • Although I'd like to applaud the authors for their ambition with the connectivity analysis, I feel that acquisitions that are so SNR starved as to fail to show a significant response to a motor task should not be used for brain wide directed connectivity analysis.

      We agree that exploring brain-wide directed functional connectivity may be overly ambitious at this stage, particularly before the VN-fMRI technique has been comprehensively evaluated and validated. In the revised manuscript, we will focus more on examining the characteristics of the layer-dependent BOLD signal rather than delving into layer-dependent functional connectivity.

    2. Reviewer #1 (Public Review):

      Summary:

      This study aims to provide imaging methods for users of the field of human layer-fMRI. This is an emerging field with 240 papers published so far. Different than implied in the manuscript, 3T is well represented among those papers. E.g. see the papers below that are not cited in the manuscript. Thus, the claim on the impact of developing 3T methodology for wider dissemination is not justified. Specifically, because some of the previous papers perform whole brain layer-fMRI (also at 3T) in more efficient, and more established procedures.

      The authors implemented a sequence with lots of nice features. Including their own SMS EPI, diffusion bipolar pulses, eye-saturation bands, and they built their own reconstruction around it. This is not trivial. Only a few labs around the world have this level of engineering expertise. I applaud this technical achievement. However, I doubt that any of this is the right tool for layer-fMRI, nor does it represent an advancement for the field. In the thermal noise dominated regime of sub-millimeter fMRI (especially at 3T), it is established to use 3D readouts over 2D (SMS) readouts. While it is not trivial to implement SMS, the vendor implementations (as well as the CMRR and MGH implementations) are most widely applied across the majority of current fMRI studies already. The author's work on this does not serve any previous shortcomings in the field.

      The mechanism to use bi-polar gradients to increase the localization specificity is doubtful to me. In my understanding, killing the intra-vascular BOLD should make it less specific. Also, the empirical data do not suggest a higher localization specificity to me.

      Embedding this work in the literature of previous methods is incomplete. Recent trends of vessel signal manipulation with ABC or VAPER are not mentioned. Comparisons with VASO are outdated and incorrect.

      The reproducibility of the methods and the result is doubtful (see below).

      I don't think that this manuscript is in the top 50% of the 240 layer-fmri papers out there.

      3T layer-fMRI papers that are not cited:<br /> Taso, M., Munsch, F., Zhao, L., Alsop, D.C., 2021. Regional and depth-dependence of cortical blood-flow assessed with high-resolution Arterial Spin Labeling (ASL). Journal of Cerebral Blood Flow and Metabolism. https://doi.org/10.1177/0271678X20982382

      Wu, P.Y., Chu, Y.H., Lin, J.F.L., Kuo, W.J., Lin, F.H., 2018. Feature-dependent intrinsic functional connectivity across cortical depths in the human auditory cortex. Scientific Reports 8, 1-14. https://doi.org/10.1038/s41598-018-31292-x

      Lifshits, S., Tomer, O., Shamir, I., Barazany, D., Tsarfaty, G., Rosset, S., Assaf, Y., 2018. Resolution considerations in imaging of the cortical layers. NeuroImage 164, 112-120. https://doi.org/10.1016/j.neuroimage.2017.02.086

      Puckett, A.M., Aquino, K.M., Robinson, P.A., Breakspear, M., Schira, M.M., 2016. The spatiotemporal hemodynamic response function for depth-dependent functional imaging of human cortex. NeuroImage 139, 240-248. https://doi.org/10.1016/j.neuroimage.2016.06.019

      Olman, C.A., Inati, S., Heeger, D.J., 2007. The effect of large veins on spatial localization with GE BOLD at 3 T: Displacement, not blurring. NeuroImage 34, 1126-1135. https://doi.org/10.1016/j.neuroimage.2006.08.045

      Ress, D., Glover, G.H., Liu, J., Wandell, B., 2007. Laminar profiles of functional activity in the human brain. NeuroImage 34, 74-84. https://doi.org/10.1016/j.neuroimage.2006.08.020

      Huber, L., Kronbichler, L., Stirnberg, R., Ehses, P., Stocker, T., Fernández-Cabello, S., Poser, B.A., Kronbichler, M., 2023. Evaluating the capabilities and challenges of layer-fMRI VASO at 3T. Aperture Neuro 3. https://doi.org/10.52294/001c.85117

      Scheeringa, R., Bonnefond, M., van Mourik, T., Jensen, O., Norris, D.G., Koopmans, P.J., 2022. Relating neural oscillations to laminar fMRI connectivity in visual cortex. Cerebral Cortex. https://doi.org/10.1093/cercor/bhac154

      Strengths:

      See above. The authors developed their own SMS sequence with many features. This is important to the field. And does not leave sequence development work to view isolated monopoly labs. This work democratises SMS.<br /> The questions addressed here are of high relevance to the field: getting tools with good sensitivity, user-friendly applicability, and locally specific brain activity mapping is an important topic in the field of layer-fMRI.

      Weaknesses:

      1. I feel the authors need to justify why flow-crushing helps localization specificity. There is an entire family of recent papers that aim to achieve higher localization specificity by doing the exact opposite. Namely, MT or ABC fRMRI aims to increase the localization specificity by highlighting the intravascular BOLD by means of suppressing non-flowing tissue. To name a few:

      Priovoulos, N., de Oliveira, I.A.F., Poser, B.A., Norris, D.G., van der Zwaag, W., 2023. Combining arterial blood contrast with BOLD increases fMRI intracortical contrast. Human Brain Mapping hbm.26227. https://doi.org/10.1002/hbm.26227.

      Pfaffenrot, V., Koopmans, P.J., 2022. Magnetization Transfer weighted laminar fMRI with multi-echo FLASH. NeuroImage 119725. https://doi.org/10.1016/j.neuroimage.2022.119725

      Schulz, J., Fazal, Z., Metere, R., Marques, J.P., Norris, D.G., 2020. Arterial blood contrast ( ABC ) enabled by magnetization transfer ( MT ): a novel MRI technique for enhancing the measurement of brain activation changes. bioRxiv. https://doi.org/10.1101/2020.05.20.106666

      Based on this literature, it seems that the proposed method will make the vein problem worse, not better. The authors could make it clearer how they reason that making GE-BOLD signals more extra-vascular weighted should help to reduce large vein effects.

      The empirical evidence for the claim that flow crushing helps with the localization specificity should be made clearer. The response magnitude with and without flow crushing looks pretty much identical to me (see Fig, 6d).<br /> It's unclear to me what to look for in Fig. 5. I cannot discern any layer patterns in these maps. It's too noisy. The two maps of TE=43ms look like identical copies from each other. Maybe an editorial error?

      The authors discuss bipolar crushing with respect to SE-BOLD where it has been previously applied. For SE-BOLD at UHF, a substantial portion of the vein signal comes from the intravascular compartment. So I agree that for SE-BOLD, it makes sense to crush the intravascular signal. For GE-BOLD however, this reasoning does not hold. For GE-BOLD (even at 3T), most of the vein signal comes from extravascular dephasing around large unspecific veins, and the bipolar crushing is not expected to help with this.

      2. The bipolar crushing is limited to one single direction of flow. This introduces a lot of artificial variance across the cortical folding pattern. This is not mentioned in the manuscript. There is an entire family of papers that perform layer-fmri with black-blood imaging that solves this with a 3D contrast preparation (VAPER) that is applied across a longer time period, thus killing the blood signal while it flows across all directions of the vascular tree. Here, the signal cruising is happening with a 2D readout as a "snap-shot" crushing. This does not allow the blood to flow in multiple directions.<br /> VAPER also accounts for BOLD contaminations of larger draining veins by means of a tag-control sampling. The proposed approach here does not account for this contamination.

      Chai, Y., Li, L., Huber, L., Poser, B.A., Bandettini, P.A., 2020. Integrated VASO and perfusion contrast: A new tool for laminar functional MRI. NeuroImage 207, 116358. https://doi.org/10.1016/j.neuroimage.2019.116358

      Chai, Y., Liu, T.T., Marrett, S., Li, L., Khojandi, A., Handwerker, D.A., Alink, A., Muckli, L., Bandettini, P.A., 2021. Topographical and laminar distribution of audiovisual processing within human planum temporale. Progress in Neurobiology 102121. https://doi.org/10.1016/j.pneurobio.2021.102121

      If I would recommend anyone to perform layer-fMRI with blood crushing, it seems that VAPER is the superior approach. The authors could make it clearer why users might want to use the unidirectional crushing instead.

      3. The comparison with VASO is misleading.<br /> The authors claim that previous VASO approaches were limited by TRs of 8.2s. The authors might be advised to check the latest literature of the last years.<br /> Koiso et al. performed whole brain layer-fMRI VASO at 0.8mm at 3.9 seconds (with reliable activation), 2.7 seconds (with unconvincing activation pattern, though), and 2.3 (without activation).<br /> Also, whole brain layer-fMRI BOLD at 0.5mm and 0.7mm has been previously performed by the Juelich group at TRs of 3.5s (their TR definition is 'fishy' though).

      Koiso, K., Müller, A.K., Akamatsu, K., Dresbach, S., Gulban, O.F., Goebel, R., Miyawaki, Y., Poser, B.A., Huber, L., 2023. Acquisition and processing methods of whole-brain layer-fMRI VASO and BOLD: The Kenshu dataset. Aperture Neuro 34. https://doi.org/10.1101/2022.08.19.504502

      Yun, S.D., Pais‐Roldán, P., Palomero‐Gallagher, N., Shah, N.J., 2022. Mapping of whole‐cerebrum resting‐state networks using ultra‐high resolution acquisition protocols. Human Brain Mapping. https://doi.org/10.1002/hbm.25855

      Pais-Roldan, P., Yun, S.D., Palomero-Gallagher, N., Shah, N.J., 2023. Cortical depth-dependent human fMRI of resting-state networks using EPIK. Front. Neurosci. 17, 1151544. https://doi.org/10.3389/fnins.2023.1151544

      The authors are correct that VASO is not advised as a turn-key method for lower brain areas, incl. Hippocampus and subcortex. However, the authors use this word of caution that is intended for inexperienced "users" as a statement that this cannot be performed. This statement is taken out of context. This statement is not from the academic literature. It's advice for the 40+ user base that wants to perform layer-fMRI as a plug-and-play routine tool in neuroscience usage. In fact, sub-millimeter VASO is routinely being performed by MRI-physicists across all brain areas (including deep brain structures, hippocampus etc). E.g. see Koiso et al. and an overview lecture from a layer-fMRI workshop that I had recently attended: https://youtu.be/kzh-nWXd54s?si=hoIJjLLIxFUJ4g20&t=2401

      Thus, the authors could embed this phrasing into the context of their own method that they are proposing in the manuscript. E.g. the authors could state whether they think that their sequence has the potential to be disseminated across sites, considering that it requires slow offline reconstruction in Matlab?<br /> Do the authors think that the results shown in Fig. 6c are suggesting turn-key acquisition of a routine mapping tool? In my humble opinion, it looks like random noise, with most of the activation outside the ROI (in white matter).

      4. The repeatability of the results is questionable.<br /> The authors perform experiments about the robustness of the method (line 620). The corresponding results are not suggesting any robustness to me. In fact, the layer profiles in Fig. 4c vs. Fig 4d are completely opposite. The location of peaks turns into locations of dips and vice versa.<br /> The methods are not described in enough detail to reproduce these results.<br /> The authors mention that their image reconstruction is done "using in-house MATLAB code" (line 634). They do not post a link to github, nor do they say if they share this code.

      It is not trivial to get good phase data for fMRI. The authors do not mention how they perform the respective coil-combination.<br /> No data are shared for reproduction of the analysis.

      5. The application of NODRIC is not validated.<br /> Previous applications of NORDIC at 3T layer-fMRI have resulted in mixed success. When not adjusted for the right SNR regime it can result in artifactual reductions of beta scores, depending on the SNR across layers. The authors could validate their application of NORDIC and confirm that the average layer-profiles are unaffected by the application of NORDIC. Also, the NORDIC version should be explicitly mentioned in the manuscript.

      Akbari, A., Gati, J.S., Zeman, P., Liem, B., Menon, R.S., 2023. Layer Dependence of Monocular and Binocular Responses in Human Ocular Dominance Columns at 7T using VASO and BOLD (preprint). Neuroscience. https://doi.org/10.1101/2023.04.06.535924

      Knudsen, L., Guo, F., Huang, J., Blicher, J.U., Lund, T.E., Zhou, Y., Zhang, P., Yang, Y., 2023. The laminar pattern of proprioceptive activation in human primary motor cortex. bioRxiv. https://doi.org/10.1101/2023.10.29.564658

    1. Author Response

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary:

      The authors developed computational models that capture the electrical and Ca2+ signaling behavior in mesenteric arterial cells from male and female mice. A baseline model was first formulated with eleven transmembrane currents and three calcium compartments. Sex-specific differences in the L-type calcium channel and two voltage-gated potassium channels were then tuned based on experimental measurements. To incorporate the stochastic ion channel openings seen in smooth muscle cells under physiological conditions, noise was added to the membrane potential and the sarcoplasmic Ca2+ concentration equations. Finally, the models were assembled into 1D vessel representations and used to investigate the tissue-level electrical response to an L-type calcium channel blocker.

      Strengths:

      A major strength of the paper is that the modeling studies were performed on three different scales: individual ionic currents, whole-cell, and 1D tissue. This comprehensive computational framework can help provide mechanistic insight into arterial myocyte function that might be difficult to achieve through traditional experimental methods.

      The authors aimed to develop sex-specific computational models of mesenteric arterial myocytes and demonstrate their use in drug-testing applications. Throughout the paper, model behavior was both validated by experimental recordings and supported by previously published data. The main findings from the models suggested that sex-specific differences in membrane potential and Ca2+ handling are attributable to variability in the gating of a small number of voltage-gated potassium channels and L-type calcium channels. This variability contributes to a higher Ca2+ channel blocker sensitivity in female arterial vessels. Overall, the study successfully met the aims of the paper.

      Thank you for your insightful review and for recognizing the strengths of our study. We appreciate your encouraging comment regarding our multi-scale approach. Indeed, we believe that by systematically connecting these scales—individual ionic currents, whole-cell, and 1D tissue—we can integrate and reconcile experimental and clinical data. We anticipate that this approach will not only provide mechanistic insights into arterial myocyte function that may not be easy to glean from traditional experimental methods but will also facilitate the translation of this information into the development of therapeutic interventions.

      Weaknesses:

      A main weakness of the paper, as addressed by the authors, is the simplicity of the 1D vessel model; it does not take into account various signaling pathways or interactions with other cell types which could impact smooth muscle electrophysiology.

      Thank you for highlighting areas for improvement in our study. The strength of computational modeling lies in its iterative nature, allowing us to introduce and examine variables in a systematic manner. While our current model is simplified and does not contain all details, the modular nature of the build will allow continuous expansion to add the important elements described by the reviewer. We are enthusiastic about progressively enriching the model in subsequent studies, introducing signaling pathways in a step-by-step manner, and ensuring their validation with rigorous experimental data.

      Another potential shortcoming is the use of mouse data for optimizing the model, as there could be discrepancies in signaling behavior that limit the translatability to human myocyte predictions.

      We appreciate this important comment. Our model was parametrized using data from mouse mesenteric artery smooth muscle cells as initial proof of concept. Mouse arteries are a good representation of human arteries, as they have similar intravascular pressure-myogenic tone relationships, resting membrane potentials, and express similar ionic channels (e.g., CaV1.2, BK channels, RyRs, etc) (PMID: 28119464, PMID: 29070899, PMID: 23232643). In response to the reviewer, we have modified the discussion section of the manuscript to specifically note the mouse is not identical to the human but does share some common important features that make mice a good approximate model.

      Reviewer #2 (Public Review):

      In this study, Hernandez-Hernandez et al developed a gender-dependent mathematical model of arterial myocytes based on a previous model and new experimental data. The ionic currents of the model and its sex difference were formulated based on patch-clamp experimental data, and the model properties were compared with single-cell and tissue scale experimental results. This is a study that is of importance for the modeling field as well as for experimental physiology.

      Thank you for the comment. In fact, we developed a model that incorporates sex-dependent differences that allowed for male and female models. It’s an important distinction as sex is a biological variable and gender is a self-ascribed characteristic.

      Reviewer #3 (Public Review):

      Summary:

      This hybrid experimental/computational study by Hernandez-Hernandez sheds new light on sex-specific differences between male and female arterial myocytes from resistance arteries. The authors conduct careful experiments in isolated myocytes from male and female mice to obtain the data needed to parameterize sex-specific models of two important ionic currents (i.e., those mediated by CaV1.2 and KV2.1). Available experimental data suggest that KV1.5 channel currents from male and female myocytes are similar, but simulations conducted in the novel Hernandez-Hernandez sex-specific models provide a more nuanced view. This gives rise to the first of the authors' three key scientific claims: (1) In males, KV1.5 is the dominant current regulating membrane potential; whereas, in females, KV2.1 plays a primary role in voltage regulation. They further show that this (2) the latter distinction drives drive sex-specific differences in intracellular Ca2+ and cellular excitability. Finally, working with one-dimensional models comprising several copies of the male/female myocyte models linked by resistive junctions, they use simulations to (3) predict that the sensitivity of arterial smooth muscle to Ca2+ channel-blocking drugs commonly used to treat hypertension is heightened in female compared to male cells.

      Strengths:

      The Methodology is described in exquisite detail in straightforward language that will be easy to understand for most if not all peer groups working in computational physiology. The authors have deployed standard protocols (e.g., parameter fitting as described by Kernik et al., sensitivity analysis as described by Sobie et al.) and appropriate brief explanations of these techniques are provided. The manoeuvre used to represent stochastic effects on voltage dynamics is particularly clever and something I have not personally encountered before. Collectively, these strengthen the credibility of the model and greatly enrich the manuscript.

      We appreciate your comment highlighting the robustness of our methodology. Your acknowledgment of our approach to represent stochastic effects on voltage dynamics is especially encouraging. Indeed, noise is a fundamental component of physiological systems, including in vascular myocytes

      Broadly speaking, the Results section describes findings that robustly support the three key scientific claims outlined in my summary. While there is certainly room for further discussion of some nuanced points as outlined below, it is evident these experiments were carefully designed and carried out with care and intentionality. In the present version of the manuscript, there are a few figures in which experimental data is shown side-by-side with outputs from the corresponding models. These are an excellent illustration of the power of the authors' novel sex-specific computational simulation platform. I think these figures will benefit from some modest additional quantitative analysis to substantiate the similarities between experimental and computational data, but there is already clear evidence of a good match.

      We sincerely appreciate your constructive feedback on the Results section. We have included additional quantitative analysis to substantiate the similarities between experimental and computational data. We agree with the reviewer that the suggestion on the potential value of a more quantitative assessment. As such we have updated the figure to include an in-depth analysis that provides greater insights and solidifies the power of our simulation predictions when compared to experimental results. A detailed analysis of the male and female data as well as the male and female simulations are summarized in the text as follows:

      Baseline membrane potential is -40 mV in male myocytes compared to -30 mV. The frequency of hyperpolarization transients (THs) is 1 Hz in male and 2.5 Hz in female cells for the specific baseline membrane potential shown in Figure 5 A-B. In the range of membrane potentials from -50 mV to -30 mV the frequency increases from 1-2.8Hz which is identical to the experimental frequency range.

      Areas for Improvement:

      The authors used experimental data from a prior publication to calibrate their model of the BKCa current. As indicated in the manuscript, these data are for channel activity measured in a heterologous expression system (Xenopus oocytes). A similar principle applies to other major ion channels/pumps/etc. Is it possible there might be relevant sex-specific differences in these players as well? In the context of the present work, this feels like an important potential caveat to highlight, in case male/female differences in the activity of BKCa or other currents might influence model-predicted differences (e.g., the relative importance of KV1.5 and KV2.1). This should be discussed, and, if possible, related to the elegant sensitivity analysis presented in Fig. 5C (which shows, for example, that the models are relatively insensitive to variation in GBK).

      We fully agree with the reviewer - an important caveat to highlight is the unknown sex-specific differences in all the other players regulating membrane potential and calcium signaling. While our initial assessments indicated that the contribution of BKCa channels to the total voltage-gated K+ current (IKvTOT) was small within the physiological range of -50 mV to -30 mV, further analysis of spontaneous transient outward currents revealed sex-specific variations. We have investigations underway to explore if BKCa channel expression and organization may be also sex-dependent.

      The authors state that their model can be expanded to 2D/3D applications, "transitioning seamlessly from single-cell to tissue-level simulations". I would like to see more discussion of this. For example, given the modest complexity of the cell-scale model, how considerable would the computational burden be to implement a large network model of a subset of the human female or male arterial system? Are there sex-specific differences in vessel and/or network macro-structure that would need to be considered? How would this influence feasibility? Rather than a 1D cable as implemented here, I imagine a multi-scale implementation would involve the representation of myocytes wrapped around vessels. How would the behavior of such a system differ from the authors' presented work using a 1D representation of 100 myocytes coupled end-to-end? Could these differences partially explain why the traces in Fig. 8D are smoother than those in Fig. 8C? From my standpoint, discussing these points would enrich the paper.

      We appreciate the reviewer’s thoughtful and forward-looking ideas! Indeed, we are very interested to extend the model to incorporate a number of these important items.

      Our choice for the 1D cable model was driven by its anatomical relevance to the structure of third and fourth-order mesenteric arteries. These arteries possess a singular layer of vascular myocytes encircling the lumen in a cylindrical arrangement. When we conceptualize this structure as unrolled or viewed laterally, it aligns with a flat, rectangular form, closely paralleling our 1D cable implementation. One option is to expand this into a 2D representation by connecting multiple 1D cables together. Another option would be to connect the 1D cable end-to-end to create a ring to represent a cross section. While these approaches would appear to be different geometries, in either case, the dynamics will remain consistent because the cells comprising the tissue are the same. There is no propagating impulse (for example – although even then in a 2D homogenous tissue, a planar wave is identical in 1D), and the only effect will be an increase in electrotonic load (sink) from neighboring cells, which can readily be approximated in 1D by increasing coupling or modification of the boundary conditions.

      We totally agree that future investigation should include exploration into the potential sex-specific differences in vessel and/or network macro-structure, as these factors may critically impact predictions and indeed the difference in traces observed between Fig. 8D and Fig. 8C may well involve “insulating” effects of vessel layers and interaction between various cell types and other structural factors. In particular, the contribution of endothelial cells in modulating membrane potential in vascular myocytes might be one such influential factor. In future studies, we are also keen to investigate blood flow regulation where a 3D configuration might become necessary.

      The nifedipine data presented in Fig. 9 are quite compelling, and a nice demonstration of the potential power of the new models. How does this relate to what is known about the clinical male/female responses to nifedipine? Are there sex differences in drug efficacy?

      Thank you for your comment regarding Fig. 9.

      It is well known that sex-specific differences in pharmacokinetics and pharmacodynamics influence antihypertensive drug responses [PMID: 8651122., PMID: 22089536]. Previous studies, notably by Kloner et al., have illustrated this point quantitatively, highlighting a more pronounced diastolic BP response in women (91.4%) compared to men (83%) when treated with dihydropyridine-type channel blockers, such as amlodipine/nifedipine. Importantly, this distinction persisted even after adjusting for confounding factors such as baseline BP, age, weight, and dosage per kilogram [PMID: 8651122]. An interesting observation from Kajiwara et al. emphasizes that vasodilation-related adverse symptoms occur significantly more frequently in younger women (<50 years) compared to their male counterparts, suggesting a heightened sensitivity to dihydropyridine-type calcium channel blockers [PMID: 24728902].

      While our findings resonate with clinical observations, a word of caution is in order. Our data suggest that, in the mouse model, nifedipine elicits distinct sex-specific effects. Importantly, future research should test the direct translatability and implications of these observations in human subjects.

      Reviewer #1 (Recommendations For The Authors):

      1. Cellular simulations with noise: It might be useful to also include in this section how noise was introduced specifically into the [Ca]SR equations.

      We agree. The manuscript now includes an expanded explanation of how noise was incorporated into the model. This includes the addition of Equation 6 into section 2.4 "Cellular simulations with noise" to describe how noise was specifically integrated into the [Ca]SR equations. Please see LINE 355.

      1. For equation 14, the description might be confusing. RCG and Ri are not explicitly included.

      Thank you – this has been corrected.

      1. In the paragraph starting with, "Having explored the regulation of graded membrane potential..." , the references to Figure 7C-D do not seem to match the content of the text. Namely, the figures show female versus male responses to nifedipine, which is not introduced until the next paragraph. Additionally, the graphs in 7C-D do not have the panels titled and the y-axes labeled.

      We apologize for the error. We have modified the text and figures to address these issues.

      1. Perhaps give more detail on how the effects of nifedipine were mathematically simulated at the ionic current level.

      Good suggestion. Briefly, previous studies [PMID: 1329564] have shown that at the therapeutic dose of nifedipine (i.e., about 0.1 μM) L-type Cav1.2 channel currents are reduced by about 70%. Accordingly, we decreased ICaL in our mathematical simulations by the same extent. It is known that dihydropyridine-type channel blockers exhibit a voltage-dependent behavior, predominantly binding to the inactivated state. In smooth muscle cells, these blockers initiate inhibition quickly within a voltage range of -60 to -40 mV. This range aligns with the membrane potential baseline of vascular muscle cells (PMID: 8388295), ensuring the blockers are effective without the need of inducing significant depolarization. Therefore, the voltage dependency of dihydropyridine-type channel blockers can be neglected.

      1. For the simulations with 400 uncoupled myocytes, the methods stated that the "gap junctional resistance [was set] to zero". Did the authors mean to use "conductivity" or am I misunderstanding?

      Thank you for bringing up this issue with the term "gap junctional resistance." We now state that the "gap junctional conductivity" was set to zero to indicate no electrical communication/coupling.

      1. Address whether there are differences-such as in cell geometry, degree of sex-based ionic current changes, and frequency of spontaneous hyperpolarization-between mice and human smooth muscle myocytes that could limit the predictive capability of the model.

      Excellent point. Our model was parametrized using data from mouse mesenteric artery smooth muscle cells as initial proof of concept. In general terms, mouse arteries are a good animal model for human arteries, as they have similar intravascular pressure-myogenic tone relationships, resting membrane potentials, and express similar ionic channel (e.g., CaV1.2, BK channels, RyRs, etc) (PMID: 28119464, PMID: 29070899). Unfortunately, these studies have largely been done in male arteries and myocytes. Thus, while we recognize that the physiological distinctions between mice and humans could introduce variances in the model's outcomes. Our model offers valuable insights into the sex-specific mechanisms of KV2.1 and CaV1.2 channels in controlling membrane potential and Ca2+ dynamics in mice. It has been shown that sex-specific differences in pharmacokinetics and pharmacodynamics influence antihypertensive drug responses [[PMID: 8651122., PMID: 22089536]. Previous studies, notably by Kloner et al., have illustrated this point quantitatively, highlighting a more pronounced diastolic BP response in women (91.4%) compared to men (83%) when treated with dihydropyridine-type channel blockers, such as amlodipine/nifedipine. Importantly, this distinction persisted even after adjusting for confounding factors such as baseline BP, age, weight, and dosage per kilogram [PMID: 8651122]. An interesting observation from Kajiwara et al. emphasizes that vasodilation-related adverse symptoms occur significantly more frequently in younger women (<50 years) compared to their male counterparts, suggesting a heightened sensitivity to dihydropyridine-type calcium channel blockers [PMID: 24728902].

      While our findings resonate with clinical observations, a word of caution is in order. Our data suggest that, in the mouse model, nifedipine elicits distinct sex-specific effects. Importantly, future research should test the direct translatability and implications of these observations in human subjects.

      1. "A virtual drug-screening system that can model drug-channel interactions" (pg 32) sounds very novel.

      Thank you for highlighting this. We recognize the typo in our manuscript and have made the necessary corrections to ensure clarity and accuracy.

      Reviewer #2 (Recommendations For The Authors):

      The manuscript is well written. I only have some minor comments:

      1. In the patch clamp experiments, there is no information on the recovery of the ionic currents. Is recovery important or not in arterial myocytes? This question is related to the results shown in Figs 5-7. In Fig.5, is the oscillation caused by noise alone or a spontaneous oscillation (such as the oscillation in Fis.6-7) modulated by noise? In general, recovery is an important parameter for the frequency of spontaneous oscillations. It seems to me that the spontaneous oscillations in Fig.8 are mainly noise-driven since they disappear after the cells are coupled through gap junctions.

      One important aspect of the oscillatory behavior of the smooth muscle cells is the very long timescales, with fluctuations occurring on the order of seconds. But the majority of ion channels are operating and recovering on the order of milliseconds, so a reasonable approximation is that most ion channels in the cell are operating at steady state at low voltages.

      Oscillations in Fig.5: Both the intrinsic oscillations and the noise play key roles in shaping in the oscillations.

      The intrinsic deterministic dynamics of the model cells are oscillatory (as seen in Figures 6-7), but the noise can trigger sparks early or delay them, which leads to substantial fluctuations in the inter-spark intervals. Therefore, the spontaneous oscillations are technically modulated by the noise rather than driven by the noise. Nevertheless, in both cases, recovery dynamics play an essential role in shaping the oscillations and determining their frequency

      Note however that, when an excitable system is around the bifurcation for oscillations and noise is included, the "firing" statistics in the oscillatory state and the non-oscillatory state are indistinguishable for moderate to high levels of noise.

      Noise Exclusion in Figures 6-7: To offer a clear and undistracted interpretation of the results, noise was intentionally omitted from Figures 6-7. This was done to ensure that the primary phenomena under investigation were not obscured. While we recognize the significance of incorporating all elements, including noise, in simulating biological systems, in this case we prioritized a clear point to be made in this context.

      Oscillations in Fig.8: Your observation regarding Fig.8 is insightful. Here, uncoupled cells indeed display a spontaneous oscillatory behavior. As documented in previous research, this behavior is not an artifact resulting from cell isolation from the vessel but represents an intrinsic characteristic vital for maintaining electrical signals. The noise in the cells leads to substantial fluctuations in the inter-spike intervals. Because the noise in each cell is uncorrelated, it acts to desynchronize the activity of the cells. Therefore, instead of synchronizing the activity of the cells, the gap junction coupling quenches the large-scale oscillations (the spikes), creating lower amplitude irregular oscillations.

      1. The calcium level is much higher in women than in men as shown in Figs.7 and 9. Do women have higher arterial pressure than men?

      We thank the reviewer for the observation regarding the calcium levels in Figs.7 and 9. All data presented comes from both male and female C57BL/6J animal models, forming the foundation of our experimental framework.

      From earlier studies by the Santana lab (PMID: 32015129), distinct sex-specific differences were found between male and female vascular mesenteric vessels. When the endothelium was removed from small arteriole segments and these segments were subsequently pressurized within a range of 20–120 mmHg, the female arterioles exhibited a pronounced myogenic response in comparison to the male ones. This brings to the forefront the marked sex-based differences, especially in the context of vascular smooth muscle activity.

      Yet, when examining the behavior of whole, intact vessels, a different picture emerges. Despite clear sex-specific differences in conditions with the endothelium removed, these distinctions become less pronounced in whole, intact vessels. In essence, both male and female mice exhibit analogous arterial pressure patterns. This suggests possible compensatory mechanisms related to the caliber and structure of the small vessels.

      To address the core issue: Despite our data showing higher calcium levels in female samples, it doesn't necessarily imply females consistently exhibit higher arterial pressure across all physiological scenarios.

      1. In Fig.9, where is the intravascular pressure (a variable or a parameter) in the mathematical model?

      In our model, the intravascular pressure effects are implicitly introduced by modulating the conductance of the non-selective cation currents (INSCC). Specifically, the increase in INSCC is our way of simulating the effects of pressure-induced membrane depolarization. This approach allows us to capture the physiological response to intravascular pressure changes without explicitly introducing it as a separate parameter in the model. We have modified the manuscript to ensure that this rationale is clarified.

      1. In Eq.14, the given units of Rmyo (Ohmcm) and Rg (Ohmcmcm) are different, but Eq.14 implies they should have the same unit.

      We sincerely appreciate the reviewer's meticulous observation regarding the units discrepancy in Eq.14. We have revised the manuscript to correct the error.

      Reviewer #3 (Recommendations For The Authors):

      Suggestions for improved or additional experiments, data, or analyses:

      Fig. 5 A-B: This is a beautiful qualitative comparison between experimental and simulation data! I think it would be even more impactful if the authors carried out some quantitative analysis of the similarity between male/female experimental/simulation data. For example, the "resting" Vm levels (approx. -30 mV and -40 mV for females and males, respectively) and the peak levels of Vm hyperpolarization could be compared, as well as the frequency of transient hyperpolarization events. It seems like the female model is much more prone to intervals of relative quiescence (i.e., absence of transient hyperpolarization events - e.g., from ~5-6.5 s). Is this consistent with the duration of such ranges in the experimental data (e.g., from 0 to 2.5 s in Fig. 5A).

      Thank you for your positive remarks concerning the qualitative comparison in Fig. 5 A-B. We are indeed enthusiastic about the parallels we've identified between experimental and simulation outcomes. We agree with the reviewer that the suggestion on the potential value of a more quantitative assessment. As such we have updated the figure to include an in-depth analysis that provides greater insights and solidifies the power of our simulation predictions when compared to experimental results. A detailed analysis of the male and female data as well as the male and female simulations are summarized in the text as follows:

      Baseline membrane potential is -40 mV in male myocytes compared to -30 mV. The frequency of hyperpolarization transients (THs) is 1 Hz in male and 2.5 Hz in female cells for the specific baseline membrane potential shown in Figure 5 A-B. In the range of membrane potentials from -50 mV to -30 mV the frequency increases from 1-2.8Hz which is identical to the experimental frequency range.

      • Fig. 7 C-D: Likewise, it would be helpful to quantitatively characterize male/female differences in the model's response to simulated Ca channel blockade (e.g., rate of transient hyperpolarization events, relative levels of ICa and [Ca]i).

      Thank you for the constructive feedback on Fig. 7 C-D. We appreciate the emphasis on a quantitative approach to solidify our understanding and have modified the results as follows:

      Next, we simulated the effects of calcium channel blocker nifedipine on ICa at a steady membrane potential of -40 mV in male and female simulations. Briefly, previous studies70 have shown that at the therapeutic dose of nifedipine (i.e., about 0.1 μM) L-type Cav1.2 channel currents are reduced by about 70%. Accordingly, we decreased ICa in our mathematical simulations by the same extent. In Figure 7C-D, we show the predicted male (gray) and female (pink) time course of membrane voltage at -40 mV (top panel), ICa (middle panel), and [Ca2+]i (lower panel). First, we observed that in both male and females 0.1 μM nifedipine modifies the frequency of oscillation in the membrane potential, by causing a reduction in oscillation frequency. Second, both male and female simulations (middle panels) show that 0.1 μM nifedipine caused a reduction of ICa to levels that are very similar in male and female myocytes following treatment. Consequently, the reduction of ICa causes both male and female simulations to reach a very similar baseline [Ca2+]i of about 85 nM (lower panels). As a result, simulations provide evidence supporting the idea that CaV1.2 channels are the predominant regulators of intracellular [Ca2+] entry in the physiological range from -40 mV to -20 mV. Importantly, these predictions also suggest that clinically relevant concentrations of nifedipine cause larger overall reductions in Ca2+ influx in female than in male arterial myocytes.

      Recommendations for improving the writing and presentation:

      When I accessed the GitHub repository linked in section 2.7 (Aug 17, 13:30 PT) it only contained a LICENSE file and none of the described codes and model equations appeared to be publicly available. I would like to access and examine these files. Based on the Clancy lab's excellent track record for making their work publicly available, I have no doubt that the published files will be complete, thoroughly documented, and ready for implementation in studies to reproduce or extend the work described in this manuscript.

      https://github.com/ClancyLabUCD/sex-specific-responses-to-calcium-channel-blockers-in-mesenteric-vascular-smooth-muscle

      We sincerely apologize for the omission regarding the GitHub repository. It was never our intention to omit the crucial files that should accompany our manuscript. We deeply regret any inconvenience this may have caused in your review process.

      We deeply value transparency and the importance of making our work accessible to fellow researchers and the wider community. As you rightly pointed out, the Clancy lab has always been committed to ensuring that our work is available publicly, and this instance is no exception. Please find all codes and documentation here:

      Minor corrections to the text and figures:

      The introduction is somewhat lengthy, and some of the material contained therein might be more suitable to be merged into the Discussion instead (e.g., paragraphs on negative feedback regulation and the recent study by O'Dwyer et al.).

      Thank you – we have updated the introduction but left some foundational work descriptions intact.

      • Page 6, section 1.1: There is a missing word (mice?) in the first sentence.

      • Page 11, under Eqn. 7: Luo is misspelled as Lou. (Also twice on Page 20.)

      Thank you – these have been corrected.

      Figs. 2-3: As a colorblind person, it was somewhat challenging for me to differentiate between the red and black lines. Choosing a higher-contrast colour pairing would be beneficial. For some reason, this is not so much of an issue for other figures that use the red/black scheme later in the manuscript (e.g., Figs. 5, 7-8).

      We truly appreciate your feedback on the color contrast used in our figures. Accessibility and clarity are crucial to us, and we regret any difficulty you encountered due to the color choices. Based on your valuable feedback, we have included different color pairings in our visual representations to ensure they are comprehensible to all readers, including those who are colorblind.

      Fig. 2-3: I am also confused about the use of symbols to indicate significant differences in these plots. In Fig. 2, ** is defined in the legend but not used in the figure. In both figures, the symbols are placed above/below specific sets of points, but it is unclear whether large differences for other x-axis values are statistically significant (e.g., -20 mV in Fig. 3B, +40 mV in Fig. 2C, etc.) This should be clarified.

      Thank you – we now have included all the significant differences in the data discussed in the manuscript.

      Page 22: The authors state that they "introduced noise into the [Ca]SR..." but the specifics of this approach are not described. As with other aspects of the Methods section, it would be suitable to provide a brief description of the technique used in ref. 40, perhaps added to section 2.4.

      Thank you – it has been corrected.

      Fig.7 C-D: Axis labels and units are missing. Even though the labels and units will be inferred by most readers, it would be helpful to include them here (at least in C).

      Thank you for pointing out the inconsistency between the textual references and Figure 7C-D. We have added the corrected figure.

      Page 32: "...the first step toward the development of a virtual drug-screaming system..." I think the authors mean drug-screening. As a side note, this is immediately in the running for the best typo I've ever seen as a peer reviewer.

      <good laugh> Thank you for pointing out this error, and we sincerely appreciate your sense of humor about it. You are indeed correct; the intended word is "drug-screening." We have corrected this typo in the manuscript. We're grateful for your thorough review and the light-hearted way you brought this to our attention.

    1. Author Response

      The following is the authors’ response to the previous reviews.

      We would like to thank the Editors and Reviewers for their additional comments and constructive feedback on our manuscript. We have made minor adjustments to the figures and texts based on their suggestions, including improved images in Figure 1 and correction of figure labels.

      Reviewer #1 (Public Review):

      In their previous paper (Lari et al, 2019; Azra Lari Arvind Arul Nambi Rajan Rima Sandhu Taylor Reiter Rachel Montpetit Barry P Young Chris JR Loewen Ben Montpetit (2019) A nuclear role for the DEAD-box protein Dbp5 in tRNA export eLife 8:e48410.) as well as in the current manuscript the authors states that Dbp5 is involved in the export of tRNA that is independent of and parallel to Los1. They state that Dbp5 binds to the tRNA independent of known tRNA export proteins. The obtained conclusion is both intriguing and innovative, since it suggests that there are other variables, beyond the ones previously identified as tRNA factors, that might interact with Dbp5 to facilitate the export process. In order to find out additional factors aiding this process the authors may employ total RNA-associated protein purification (TRAPP) experiments ( Shchepachevto et al., 2019; Shchepachev V, Bresson S, Spanos C, Petfalski E, Fischer L, Rappsilber J, Tollervey D. Defining the RNA interactome by total RNA-associated protein purification. Mol Syst Biol. 2019 Apr 8;15(4):e8689. doi: 10.15252/msb.20188689. PMID: 30962360; PMCID: PMC6452921) to identify extra factors involved in conjunction with Dbp5. The process elucidates hitherto uninvestigated tRNA export components that function in conjunction with Dbp5.

      Author Response: We greatly appreciate this suggestion and agree with the reviewer that identification of the composition of the export competent Dbp5 containing tRNA complex is a critical next step for understanding the mechanism of Dbp5 mediated tRNA export, which will form the foundation of a future investigation in the laboratory and warrants its own study.

      Reviewer #1 (Public Review):

      Various reports suggest that eukaryotic translation elongation factor 1 eEF1A is involved tRNA export Bohnsack et al., 2002 (Bohnsack MT, Regener K, Schwappach B, Saffrich R, Paraskeva E, Hartmann E, Görlich D. Exp5 exports eEF1A via tRNA from nuclei and synergizes with other transport pathways to confine translation to the cytoplasm. EMBO J. 2002 Nov 15;21(22):620515. doi: 10.1093/emboj/cdf613. PMID: 12426392; PMCID: PMC137205), Grosshans etal., 2002; Grosshans H, Hurt E, Simos G. An aminoacylation-dependent nuclear tRNA export pathway in yeast. Genes Dev. 2000 Apr 1;14(7):830-40. PMID: 10766739; PMCID: PMC316491). The presence of mutations in eEF1A has been seen to hinder the nuclear export process of all transfer RNAs (tRNAs). eEF1A has been shown to interact with Los1 aiding in tRNA export. The authors can also explore the crosstalk between Dbp5 and eEF1A in this study. Additionally, suppressor screening analysis in dbp5R423A , los1∆dbp5R423A los1∆msn∆dbp5R423A could shed more light on this.

      Author Response: Thank you for this suggestion and raising an important possible role for Dbp5 in eEF1A mediated tRNA export. Based on more recent investigation of eEF1A function in tRNA export (PMID: 25838545), it is likely that eEF1A functions in re-export of charged tRNAs specifically (likely in conjunction with Msn5). The current manuscript has largely focused on the role of Dbp5 in pre-tRNA export, but a more careful mechanistic characterization of Dbp5 and re-export will be conducted in follow-up studies given the physical interaction between Dbp5 and spliced tRNAs we previously reported. Similarly, suppressor screens with the Dbp5 and los1Δmsn5Δ mutants will likely be a useful tool in identifying additional tRNA export factors and we thank the reviewer for this suggestion.

      Reviewer #1 (Public Review):

      The addition of Gle1 is potentially novel but it's unclear why the authors didn't address the potential involvement of IP6.

      Author Response: The text has been revised to highlight the importance of InsP6 in Gle1 mediated activation of Dbp5. This includes referencing InsP6 throughout the manuscript during discussions of Gle1 activation of Dbp5 and lines 401-404 discussing the potential role for the small molecule in regulating mRNA and tRNA export in different cellular contexts (e.g., stress and disease).

    1. Author Response

      Reviewer #1 (Public Review):

      In this paper, the authors develop new models of sequential effects in a simple Bernoulli learning task. In particular, the authors show evidence for both a "precision-cost" model (precise posteriors are costly) and an "unpredictabilitycost" model (expectations of unpredictable outcomes are costly). Detailed analyses of experimental data partially support the model predictions.

      Strengths:

      • Well-written and clear.

      • Addresses a long-standing empirical puzzle.

      • Rigorous modeling.

      Weaknesses:

      • No model adequately explains all of the data.

      • New empirical dataset is somewhat incremental.

      • Aspects of the modeling appear weakly motivated (particularly the unpredictability model).

      • Missing discussion of some relevant literature.

      We thank Reviewer #1 for her/his positive comments on our work and her/his comments and suggestions.

      Reviewer #2 (Public Review):

      This paper argues for an explanation of sequential effects in prediction based on the computational cost of representing probability distributions. This argument is made by contrasting two cost-based models with several other models in accounting for first- and second-order dependencies in people's choices. The empirical and modeling work is well done, and the results are compelling.

      We thank Reviewer #2 for her/his positive comments on our work.

      The main weaknesses of the paper are as follows:

      1) The main argument is against accounts of dependency based on sensitivity to statistics (ie. modeling the timeseries as having dependencies it doesn't have). However, such models are not included in the model comparison, which makes it difficult to compare these hypotheses.

      Many models in the sequential-effects literature (Refs. [7-12] in the manuscript) are ‘leaky-integration’ models that interpret sequential effects as resulting from an attempt to learn the statistics of a sequence of stimuli, through exponentiallydecaying counts of the simple patterns in the sequence (e.g., single stimuli, repetitions, and alternations). In some studies, the ‘forgetting’ of remote observations that results from the exponential decay is justified by the fact that people live in environments that are usually changing: it is thus natural that they should expect that the statistics underlying the task’s stimuli undergo changes (although in most experiments, they do not), and if they expect changes, then they should discard old observations that are not anymore relevant. This theoretical justification raises the question as to why subjects do not seem to learn that the generative parameters in these tasks are in fact not changing — all the more as other studies suggest that subjects are able to learn the statistics of changes (and consistently they are able to adapt their inference) when the environment does undergo changes (Refs. [42,57]).

      Our models are derived from a different approach: we derive behavior from the resolution of a problem of constrained optimization of the inference process. It is not a phenomenological model. When the constraint that weighs on the inference process is a cost on the precision of the posterior, as measured by its entropy, we find that the resulting posterior is one in which remote observations are ‘forgotten’, through an exponentially discount, i.e., we recover the predictions of the leaky-integration models, which past studies have empirically found to be reasonably good accounts of sequential effects. (Thus these models are already in our model comparison.) In our framework, the sequential effects do not stem from the subjects’ irrevocable belief that the statistics of the stimuli change from time to time, but rather from the difficulty that they have in representing precise belief; a rather different theoretical justification.

      Furthermore, we show that a large fraction of subjects are not best-fitted by precision-cost models (i.e., they are not best-fitted by leaky integration), but instead they are best fitted by unpredictability-cost models. These models suggest a different explanation of sequential effects: that they result from the subjects favoring predictable environments, in their inference. In the revised version of the manuscript, we have made clearer that the derivation of the optimal posterior under a precision cost results in the exponential forgetting of remote observations, as in the leaky-integration models. We mention it in the abstract, in the Introduction (l. 76-78), in the Results when presenting the precision-cost models (l. 264-278), and in the Discussion (l.706-716).

      2) The task is not incentivized in any way. Since incentives are known to affect probability-matching behaviors, this seems important. In particular, we might expect incentives would trade off against computational costs - people should increase the precision of their representations if it generates more reward.

      We thank Reviewer #2 for her/his attention to our paper and for her/his comments. As for the point on the models, see answer above (point 1).

      As for the point on incentivization: we agree that it would be very interesting to measure whether and to which extent the performance of subjects increases with the level of incentivization. Here, however, we wanted, first, to establish that subjects’ behavior could be understood as resulting from inference under a cost, and second, to examine the sensitivity of their predictions to the underlying generative probability — rather than to manipulating a tradeoff involving this cost (e.g. with financial reward). We note that we do find that subjects are sensitive to the generative probability, which implies that they exhibit some degree of motivation to put some effort in the task (which is the goal of incentivization), in spite of the lack of economic incentives. But it would indeed be interesting to know how the potential sensitivity to reward interacts with the sensitivity to the generative probability. Furthermore, as Reviewer #2 mentions, some studies show that incentives affect probability-matching behavior: it is then unclear whether the introduction of incentives in our task would change the inference of subjects (through a modification of the optimal trade-off that we model); or whether it would change their probability-matching behavior, as modeled by our generalized probability-matching response-selection strategy; or both. Note that we disentangled both aspects in our modeling and that our conclusions are about the inference, not the response-selection strategy. We deem the incentivization effects very much worth investigating; but they fall outside of the scope of our paper.

      We now mention this point in the Discussion of the revised manuscript (l. 828-840).

      3) The sample size is relatively small (20 participants). Even though a relatively large amount of data is collected from each participant, this does make it more difficult to evaluate the second-order dependencies in particular (Figure 6), where there are large error bars and the current analysis uses a threshold of p < .05 across a large number of tests hence creating a high false-discovery risk.

      Indeed we agree with Reviewer #2 that as the number of tests increases, so does the probability that at least one null hypothesis is rejected at a given level, even if the null hypothesis is correct. But in the panels a, b and c of Figure 6, about half of the tests are rejected, which is very unlikely under the null hypothesis that there is no effect of the stimulus history on the prediction, all the more as the signs of the non-significant results are in most cases consistent with the direction of the significant results. (In panel e, which reports a finer analysis in which the number of subjects is essentially divided by 2, about a fourth of the tests are rejected, and here also the non-significant results are almost all in the same direction as the significant ones.)

      However, we agree that there remains a risk of false discovery, thus we applied a Bonferroni-Holm-Šidák correction to the p-values in order to mitigate this risk. With these more conservative p-values, a lower number of tests are rejected, but in most cases in Fig. 6abc the effects remain significant. In particular, we are confident that there is a repulsive effect of the third-to-last stimulus in the case of Fig. 6c, while there is an attractive effect in the other cases.

      In the revised manuscript, Figure 6 now reports whether the tests are rejected when the p-values are corrected with the Bonferroni-Holm-Šidák correction.

      (We also applied this correction to the p-values of the tests in Fig. 2, which has more data: the corrected p-values are all below 1e-13, which we now indicate in the caption of this figure.)

      4) In the key analyses in Figure 4, we see model predictions averaged across participants. This can be misleading, as the average of many models can produce behavior outside the class of functions the models themselves can generate. It would be helpful to see the distribution of raw model predictions (ideally compared against individual data from humans). Minimally, showing predictions from representative models in each class would provide insight into where specific models are getting things right and wrong, which is not apparent from the model comparison.

      In the main text of the original manuscript, we showed the behavior of the pooled responses of the best-fitting models, and we agree with Reviewer #2 that it did not make clear to the reader that the apparent ability of the models to reproduce the subjects’ behavioral patterns was not a misleading byproduct of the averaging of different models. In the original version of the manuscript, we had put a figure showing the behavior of each individual model (each cost type with each Markov order) in the Methods section of the paper; but this could easily be overlooked, and indeed it would be beneficial for the reader to be shown the typical behaviors of the models, in the main text. We have reorganized the presentation of the models’ behaviors: the first panels in Fig. 4 (in the main text) are now dedicated to showing the individual sequential effects of the precision-cost and of the unpredictabilitycost models with Markov order 0 and 1. The Figure 4 is reproduced in the response to Reviewer #1, above, along with comments on the sequential effects produced by these models (and also on the impact of the generalized probability-matching response-selection strategy, in comparison with the traditional probability matching). We believe that this figure makes clearer how the individual models are able to reproduce the patterns in subjects’ predictions — in particular it shows that this ability of the models is not just an artifact of the averaging of many models, as was the legitimate concern of Reviewer #2. We have left the illustration of the firstorder sequential effects of the other models (with Markov order 2 and 3) in the Methods section (Fig. 7), so as not to overload Fig. 4, and because they do not bring new critical conceptual points.

      As for the higher-order sequential effects, the updated Figure 5, also reproduced above in the responses to Reviewer #1, now includes the sequential effects obtained with the precision-cost model of a Bernoulli observer (m=0), in addition to the precision-cost model of a Markov observer (m=1) and to the unpredictabilitycost model of a Markov observer (m=3), in order to better illustrate the behaviors of the different models. The higher-order sequential effects of the other models can be found in Fig. 8 in Methods.

      Reviewer #3 (Public Review):

      This manuscript offers a novel account of history biases in perceptual decisions in terms of bounded rationality, more specifically in terms of finite resources strategy. Bridging two works of literature on the suboptimalities of human decision-making (cognitive biases and bounded rationality) is very valuable per se; the theoretical framework is well derived, building upon the authors' previous work; and the choice of experiment and analysis to test their hypothesis is adequate. However, I do have important concerns regarding the work that do not enable me to fully grasp the impact of the work. Most importantly, I am not sure whether the hypothesis whereby inference is biased towards avoiding high precision posterior is equivalent or not to the standard hypothesis that inference "leaks" across time due to the belief that the environment is not stationary. This and other important issues are detailed below. I also think that the clarity and architecture of the manuscript could be greatly improved.

      We thank Reviewer #3 for her/his positive comments on our work and her/his comments and suggestions.

      1) At this point it remains unclear what is the relationship between the finite resources hypothesis (the only bounded rationality hypothesis supported by the data) and more standard accounts of historical effects in terms of adaptation to a (believed to be) changing environment. The Discussion suggests that the two approaches are similar (if not identical) at the algorithmic level: in one case, the posterior belief is stretched (compared to the Bayesian observer for stationary environments) due to precision cost, in other because of possible changes in the environment. Are the two formalisms equivalent? Or could the two accounts provide dissociable predictions for a different task? In other words, if the finite resources hypothesis is not meant to be taken as brain circuits explicitly minimizing the cost (as stated by the authors), and if it produces the same type of behavior as more classical accounts: is the hypothesis testable experimentally?

      We agree with Reviewer #3 that the relation between our approach and other approaches in the literature should be made clearer to the reader.

      Since the 1990s, in the psychology and neuroscience literature, many models of perception and decision-making have featured an exponential decay of past observations, resulting in an emphasis, in decisions, of the more recent evidence (‘leaky integration’, Refs. [7-12, 76-86]). In the context of sequential effects, this mechanism has found a theoretical justification in the idea that people believe that statistics typically change, and thus that remote observations should indeed be discarded [8,12]. In inference tasks with binary signals, in which the optimal Bayesian posterior is in many cases a Beta distribution whose two parameters are the counts of the two signals, one way to conveniently incorporate a forgetting mechanism is to replace these counts with exponentially-filtered counts, in which more recent observations have more weight (e.g., Ref. [12]).

      Our approach to sequential effects is not grounded in the history of leakyintegration models: we assume, first, that subjects attempt at learning the statistics of the signals presented to them (this is also the assumption in many studies [712]), and second, that their inference is subject to a cost, which prevents them from reaching the optimal, Bayesian posterior; but under the constraint of this cost, they choose the optimal posterior. We formalize this as a problem of constrained optimization.

      The two formalisms are thus not equivalent. Beyond the fact that we clearly state the problem which we assume the brain is solving, we do not propose that the origin of sequential effects resides in an adaptation to putatively changing environments: instead, we assume that they originate in a cognitive cost internal to the decision-maker. If this cost is proportional to the entropy of the posterior, as in our precision cost, then the optimal approximate posterior is one in which remote observations are ‘forgotten’ through an exponential filter, as in the leakyintegration models. In other words, in the context of this task and with this kind of cost, the models are, as Reviewer #3 writes, identical at the algorithmic level. As for the unpredictability cost, it does not result in a solution that resembles leaky integration; about half the subjects, however, are best fitted by unpredictabilitycost models. We thus provide a different rationale for sequential effects — that the brain favors predictive environment, in its inference — and this alternative account is successful in capturing the behavior of a large fraction of the subjects.

      In the revised manuscript, we now clarify that the precision cost results in leaky integration, in the abstract, in the Introduction (l. 76-78), in our presentation of the precision-cost models (Results section, l. 264-275), and in the Discussion (l. 706716). (We also refer Reviewer #3 to our response to the first comment of Reviewer #2, above.)

      Finally, Reviewer #3 asks the interesting question as to whether the “two accounts provide dissociable predictions for a different task”. Given that the leakyintegration approach is justified by an adaptation to potential changes, and our approach relies on the hypothesis that precision in beliefs is costly, one way to disentangle the two would be to eliminate the sequential nature of the task and presenting instead observations simultaneously. This would eliminate the mere notion of change across time. In this case, the leaky account would predict that subjects’ inference becomes optimal (because the leak should disappear in the absence of change), while in the second approach the precision cost would still weigh on the inference, and result in approximate posteriors that are “wider” (less precise) than the optimal one. The resulting divergence in the predictions of these models is very interesting, but out of the scope of this study on sequential effects.

      2) The current analysis of history effects may be confounded by effects of the motor responses (independently from the correct response), e.g. a tendency to repeat motor responses instead of (or on top of) tracking the distribution of stimuli.

      We thank Reviewer #3 for pointing out the possibility that subjects may have a tendency to repeat motor responses that is not related to their inference.

      We note that in Urai et al., 2017, as in many other sensory 2AFC tasks, successive trials are independent: the stimulus at a given trial is a random event independent of the stimulus at the preceding trial; the response at a given trial should in principle be independent of the stimulus at the preceding trial; and the response at the preceding trial conveys no information about the response that should be given at the current trial (although subjects might exhibit a serial dependency in their responses). By contrast, in our task an event is more likely than not to be followed by the same event (because observing this event suggests that its probability is greater than .5); and a prediction at a given trial should be correlated with the stimuli at the preceding trials, and with the predictions at the preceding trials. In a logit model (or any other GLM), this would mean that the predictors exhibit multicollinearity, i.e., they are strongly correlated. Multicollinearity does not reduce the predictive power of a model, but it makes the identification of parameters extremely unreliable: in other words, we wouldn’t be able to confidently attribute to each predictor (e.g., the past observations and the past responses) a reliable weight in the subjects’ decisions. Furthermore, our study shows that past stimuli can yield both attractive and repulsive effects, depending on the exact sequence of past observations. To capture this in a (generalized) linear model, we would have to introduce interaction terms for each possible past sequence, resulting in a very high number of parameters to be identified.

      However, this does not preclude the possibility that subjects may have a motor propensity to repeat responses. In order to take this hypothesis into account, we examined the behavior and the ability to capture subjects’ data of models in which the response-selection strategy allows for the possibility of repeating, or alternating, the preceding response. Specifically, we consider models that are identical to those in our study, except for the response-selection strategy, which is an extension of the generalized probability-matching strategy, in which a parameter eta, greater than -1 and lower than 1, determines the probability that the model subject repeats its preceding response, or conversely alternates and chooses the other response. With probability 1-|η|, the model subject follows the generalized probability-matching response-selection strategy (parameterized by κ). With probability |η|, the model subject repeats the preceding response, if η > 0, or chooses the other response, if η < 0. We included the possibility of an alternation bias (negative η), but we find that no subject is best-fitted by a negative η, thus we focus on the repetition bias (positive η). We fit the models by maximizing their likelihoods, and we compared, using the Bayesian Information Criterion (BIC), the quality of their fit to that of the original models that do not include a repetition propensity.

      Taking into account the repetition bias of subjects leaves the assignment of subjects into two families of inference cost mostly unchanged. We find that for 26% of subjects the introduction of the repetition propensity does not improve the fit (as measured by the BIC) and can therefore be discarded. For 47% of subjects, the fit is better with the repetition propensity (lower BIC), and the best-fitting inference model (i.e., the type of cost, precision or unpredictability, and the Markov order) is the same with or without repetition propensity. Thus for 73% (=26+47) of subjects, allowing for a repetition propensity does not change the inference model. We also find that the best-fitting parameters λ and κ, for these subjects, are very stable, when allowing or not for the repetition propensity. For 11% of subjects, the fit is better with the repetition propensity, and the cost type of the inference model is the same (as without the repetition propensity), but the Markov order changes. For the remaining 16%, both the cost type and the Markov order change.

      Thus for a majority of subjects, the BIC is improved when a repetition propensity is included, suggesting that there is indeed a tendency to repeat responses, independent of the subjects’ inference process and generative stimulus probability. In Figure 7, in Methods, we show the behavior of the models without repetition propensity, and with repetition propensity, with a parameter η = 0.2 close to the average best-fitting value of eta across subjects. We show, in Methods, that (i) the unconditional probability of a prediction A, p(A), is the same with and without repetition propensity, and that (ii) the conditional probabilities p(A|A) and p(A|B) when η≠0 are weighted means of the unconditional probability p(A) and of the conditional probabilities when eta=0 (see p. 47-49 of the revised manuscript).

      In summary, our results suggest that a majority of subjects do exhibit a propensity to repeat their responses. Most subjects, however, are best-fitted by the same inference model, with or without repetition propensity, and the parameters λ and κ are stable, across these two cases; this speaks to the robustness of our model fitting. We conclude that the models of inference under a cost capture essential aspects of the behavioral data, which does not exclude, and is not confounded by, the existence of a tendency, in subjects, to repeat motor responses.

      In the revised manuscript, we present this analysis in Methods (p.47-49), and we refer to it in the main text (l. 353-356 and 400-406).

      3) The authors assume that subjects should reach their asymptotic behavior after passively viewing the first 200 trials but this should be assessed in the data rather than hypothesized. Especially since the subjects are passively looking during the first part of the block, they may well pay very little attention to the statistics.

      The assumptions that subjects reach their asymptotic behavior after being presented with 200 observations in the passive trials should indeed be tested. To that end, we compared the behavior of the subjects in the first 100 active trials with their behavior in the remaining 100 active trials. The results of this analysis are shown in Figure 9.

      For most values of the stimulus generative probability, the unconditional proportions of predictions A, in the first and the second half (panel a, solid and dashed gray lines), are not significantly different (panel a, white dots), except for two values (p-value < 0.05; panel a, filled dots). Although in most cases the difference between the two is not significant, in the second half the proportions of prediction A seem slightly closer to the extremes (0 and 1), i.e., closer to the optimal proportions. As for the sequential effects, they appear very similar in the two halves of trials. We conclude that for the purpose of our analysis we can reasonably consider that the behavior of the subjects is stationary throughout the task.

      4) The experiment methods are described quite poorly: when is the feedback provided? What is the horizontal bar at the bottom of the display? What happens in the analysis with timeout trials and what percentage of trials do they represent? Most importantly, what were the subjects told about the structure of the task? Are they told that probabilities change over blocks but are maintained constant within each block?

      We thank Reviewer #3 for her/his close attention to the details of our experiment. Here are the answers to the reviewer’s questions:

      • The feedback (i.e., a lightning strike on the left or the right rod, with the rod and the battery turning yellow if the strike is on the side predicted by the subject,) is immediate, i.e., it is provided right after the subject makes a prediction, with no delay. We now indicate this in the caption of Figure 1.

      • The task is presented to the subjects as a game in which predicting the correct location of the lightning strike results in electric power being collected in the battery. The horizontal bar at the bottom of the display is a gauge that indicates the amount of power collected in the current block of trials. It has no operational value in the task. We now mention it in the Methods section (l. 872-874).

      • The timeout trials were not included in the analysis. The timeout trials represented 1.27% of the trials, on average (across subjects); and for 95% of the subjects the timeout trials represented less than 2.5% of the trials. This information was added in Methods (l. 887-889).

      • Each new block of trials was presented to the subject as the lightning strikes occurring in a different town. The 200 passive trials at the beginning of each block, in which subjects were asked to observe a sequence of 200 strikes, were presented as the ‘track record’ for that town, and the instructions indicated that it was ‘useful’ to know this track record. No information was given on the mechanism governing the locations of the strikes. In the main text of the revised manuscript, we now include these details when describing the task (p. 6).

    1. Joint Public Review:

      LD Score regression (LDSC) is a software tool widely used in the field of genome-wide association studies (GWAS) for estimating heritabilities, genetic correlations, the extent of confounding, and biological enrichment. LDSC is for the most part not regarded as an accurate estimator of \emph{absolute} heritability (although useful for relative comparisons). It is relied on primarily for its other uses (e.g., estimating genetic correlations). The authors propose a new method called \texttt{i-LDSC}, extending the original LDSC in order to estimate a component of genetic variance in addition to the narrow-sense heritability---epistatic genetic variance, although not necessarily all of it. Epistasis in quantitative genetics refers to the component of genetic variance that cannot be captured by a linear model regressing total genetic values on single-SNP genotypes. \texttt{i-LDSC} seems aimed at estimating that part of the epistatic variance residing in statistical interactions between pairs of SNPs. To simplify, the basic model of \texttt{i-LDSC} for two SNPs $X_1$ and $X_2$ is<br /> \begin{equation}\label{eq:twoX}<br /> Y = X_1 \beta_1 + X_2 \beta_2 + X_1 X_2 \theta + E,<br /> \end{equation}<br /> and estimation of the epistatic variance associated with the product term proceeds through a variant of the original LD Score that measures the extent to which a SNP tags products of genotypes (rather than genotypes themselves). The authors conducted simulations to test their method and then applied it to a number of traits in the UK Biobank and Biobank Japan. They found that for all traits the additive genetic variance was larger than the epistatic, but for height the absolute size of the epistatic component was estimated to be non-negligible. An interpretation of the authors' results that perhaps cannot be ruled out, however, is that pairwise epistasis overall does not make a detectable contribution to the variance of quantitative traits.

      Major Comments

      This paper has a lot of strong points, and I commend the authors for the effort and ingenuity expended in tackling the difficult problem of estimating epistatic (non-additive) genetic variance from GWAS summary statistics. The mere possibility of the estimated univariate regression coefficient containing a contribution from epistasis, as represented in the manuscript's Equation~3 and elsewhere, is intriguing in and of itself.

      Is \texttt{i-LDSC} Estimating Epistasis?

      Perhaps the issue that has given me the most pause is uncertainty over whether the paper's method is really estimating the non-additive genetic variance, as this has been traditionally defined in quantitative genetics with great consequences for the correlations between relatives and evolutionary theory (Fisher, 1930, 1941; Lynch & Walsh, 1998; Burger, 2000; Ewens, 2004).

      Let us call the expected phenotypic value of a given multiple-SNP genotype the \emph{total genetic value}. If we apply least-squares regression to obtain the coefficients of the SNPs in a simple linear model predicting the total genetic values, then the partial regression coefficients are the \emph{average effects of gene substitution} and the variance in the predicted values resulting from the model is called the \emph{additive genetic variance}. (This is all theoretical and definitional, not empirical. We do not actually perform this regression.) The variance in the residuals---the differences between the total genetic values and the additive predicted values---is the \emph{non-additive genetic variance}. Notice that this is an orthogonal decomposition of the variance in total genetic values. Thus, in order for the variance in $\mathbf{W}\bm{\theta}$ to qualify as the non-additive genetic variance, it must be orthogonal to $\mathbf{X} \bm{\beta}$.

      At first, I very much doubted whether this is generally true. And I was not reassured by the authors' reply to Reviewer~1 on this point, which did not seem to show any grasp of the issue at all. But to my surprise I discovered in elementary simulations of Equation~\ref{eq:twoX} above that for mean-centered $X_1$ and $X_2$, $(X_1 \beta_1 + X_2 \beta_2)$ is uncorrelated with $X_1 X_2 \theta$ for seemingly arbitrary correlation between $X_1$ and $X_2$. A partition of the outcome's variance between these two components is thus an orthogonal decomposition after all. Furthermore, the result seems general for any number of independent variables and their pairwise products. I am also encouraged by the report that standard and interaction LD Scores are ``lowly correlated' (line~179), meaning that the standard LDSC slope is scarcely affected by the inclusion of interaction LD Scores in the regression; this behavior is what we should expect from an orthogonal decomposition.

      I have therefore come to the view that the additional variance component estimated by \texttt{i-LDSC} has a close correspondence with the epistatic (non-additive) genetic variance after all.

      In order to make this point transparent to all readers, however, I think that the authors should put much more effort into placing their work into the traditional framework of the field. It was certainly not intuitive to multiple reviewers that $\mathbf{X}\bm{\beta}$ is orthogonal to $\mathbf{W}\bm{\theta}$. There are even contrary suggestions. For if $(\mathbf{X}\bm{\beta})^\intercal \mathbf{W} \bm{\theta} = \bm{\beta}^\intercal \mathbf{X}^\intercal \mathbf{W} \bm{\theta} $ is to equal zero, we know that we can't get there by $\mathbf{X}^\intercal \mathbf{W}$ equaling zero because then the method has nothing to go on (e.g., line~139). We thus have a quadratic form---each term being the weighted product of an average (additive) effect and an interaction coefficient---needing to cancel out to equal zero. I wonder if the authors can put forth a rigorous argument or compelling intuition for why this should be the case.

      In the case of two polymorphic sites, quantitative genetics has traditionally partitioned the total genetic variance into the following orthogonal components:<br /> \begin{itemize}<br /> \item additive genetic variance, $\sigma^2_A$, the numerator of the narrow-sense heritability;<br /> \item dominance genetic variance, $\sigma^2_D$;<br /> \item additive-by-additive genetic variance, $\sigma^2_{AA}$;<br /> \item additive-by-dominance genetic variance, $\sigma^2_{AD}$; and<br /> \item dominance-by-dominance genetic variance, $\sigma^2_{DD}$.<br /> \end{itemize}<br /> See Lynch and Walsh (1998, pp. 88-92) for a thorough numerical example. This decomposition is not arbitrary or trivial, since each component has a distinct coefficient in the correlations between relatives. Is it possible for the authors to relate the variance associated with their $\mathbf{W}\bm{\theta}$ to this traditional decomposition? Besides justifying the work in this paper, the establishment of a relationship can have the possible practical benefit of allowing \texttt{i-LDSC} estimates of non-additive genetic variance to be checked against empirical correlations between relatives. For example, if we know from other methods that $\sigma^2_D$ is negligible but that \texttt{i-LDSC} returns a sizable $\sigma^2_{AA}$, we might predict that the parent-offspring correlation should be equal to the sibling correlation; a sizable $\sigma^2_D$ would make the sibling correlation higher. Admittedly, however, such an exercise can get rather complicated for the variance contributed by pairs of SNPs that are close together (Lynch & Walsh, 1998, pp. 146-152).

      I would also like the authors to clarify whether LDSC consistently overestimates the narrow-sense heritability in the case that pairwise epistasis is present. The figures seem to show this. I have conflicting intuitions here. On the one hand, if GWAS summary statistics can be inflated by the tagging of epistasis, then it seems that LDSC should overestimate heritability (or at least this should be an upwardly biasing factor; other factors may lead the net bias to be different). On the other hand, if standard and interaction LD Scores are lowly correlated, then I feel that the inclusion of interaction LD Score in the regression should not strongly affect the coefficient of the standard LD Score. Relatedly, I find it rather curious that \texttt{i-LDSC} seems increasingly biased as the proportion of genetic variance that is non-additive goes up---but perhaps this is not too important, since such a high ratio of narrow-sense to broad-sense heritability is not realistic.

      How Much Epistasis Is \texttt{i-LDSC} Detecting?

      I think the proper conclusion to be drawn from the authors' analyses is that statistically significant epistatic (non-additive) genetic variance was not detected. Specifically, I think that the analysis presented in Supplementary Table~S6 should be treated as a main analysis rather than a supplementary one, and the results here show no statistically significant epistasis. Let me explain.

      Most serious researchers, I think, treat LDSC as an unreliable estimator of narrow-sense heritability; it typically returns estimates that are too low. Not even the original LDSC paper pressed strongly to use the method for estimating $h^2$ (Bulik-Sullivan et al., 2015). As a practical matter, when researchers are focused on estimating absolute heritability with high accuracy, they usually turn to GCTA/GREML (Evans et al., 2018; Wainschtein et al., 2022).

      One reason for low estimates with LDSC is that if SNPs with higher LD Scores are less likely to be causal or to have large effect sizes, then the slope of univariate LDSC will not rise as much as it ``should' with increasing LD Score. This was a scenario actually simulated by the authors and displayed in their Supplementary Figure~S15. [Incidentally, the authors might have acknowledged earlier work in this vein. A simulation inducing a negative correlation between LD Scores and $\chi^2$ statistics was presented by Bulik-Sullivan et al. (2015, Supplementary Figure 7), and the potentially biasing effect of a correlation over SNPs between LD Scores and contributed genetic variance was a major theme of Lee et al. (2018).] A negative correlation between LD Score and contributed variance does seem to hold for a number of reasons, including the fact that regions of the genome with higher recombination rates tend to be more functional. In short, the authors did very well to carry out this simulation and to show in their Supplementary Figure~S15 that this flaw of LDSC in estimating narrow-sense heritability is also a flaw of \texttt{i-LDSC} in estimating broad-sense heritability. But they should have carried the investigation at least one step further, as I will explain below.

      Another reason for LDSC being a downwardly biased estimator of heritability is that it is often applied to meta-analyses of different cohorts, where heterogeneity (and possibly major but undetected errors by individual cohorts) lead to attenuation of the overall heritability (de Vlaming et al., 2017).

      The optimal case for using LDSC to estimate heritability, then, is incorporating the LD-related annotation introduced by Gazal et al. (2017) into a stratified-LDSC (s-LDSC) analysis of a single large cohort. This is analogous to the calculation of multiple GRMs defined by MAF and LD in the GCTA/GREML papers cited above. When this was done by Gazal et al. (2017, Supplementary Table 8b), the joint impact of the improvements was to increase the estimated narrow-sense heritability of height from 0.216 to 0.534.

      All of this has at least a few ramifications for \texttt{i-LDSC}. First, the authors do not consider whether a relationship between their interaction LD Scores and interaction effect sizes might bias their estimates. (This would be on top of any biasing relationship between standard LD Scores and linear effect sizes, as displayed in Supplementary Figure~S15.) I find some kind of statistical relationship over the whole genome, induced perhaps by evolutionary forces, between \emph{cis}-acting epistasis and interaction LD Scores to be plausible, albeit without intuition regarding the sign of any resulting bias. The authors should investigate this issue or at least mention it as a matter for future study. Second, it might be that the authors are comparing the estimates of broad-sense heritability in Table~1 to the wrong estimates of narrow-sense heritability. Although the estimates did come from single large cohorts, they seem to have been obtained with simple univariate LDSC rather than s-LDSC. When the estimate of $h^2$ obtained with LDSC is too low, some will suspect that the additional variance detected by \texttt{i-LDSC} is simply additive genetic variance missed by the downward bias of LDSC. Consider that the authors' own Supplementary Table~S6 gives s-LDSC heritability estimates that are consistently higher than the LDSC estimates in Table~1. E.g., the estimated $h^2$ of height goes from 0.37 to 0.43. The latter figure cuts quite a bit into the estimated broad-sense heritability of 0.48 obtained with \texttt{i-LDSC}.

      Here we come to a critical point. Lines 282--286 are not entirely clear, but I interpret them to mean that the manuscript's Equation~5 was expanded by stratifying $\ell$ into the components of s-LDSC and this was how the estimates in Supplementary Table~S6 were obtained. If that interpretation is correct, then the scenario of \texttt{i-LDSC} picking up missed additive genetic variance seems rather plausible. At the very least, the increases in broad-sense heritability reported in Supplementary Table~S6 are smaller in magnitude and \emph{not statistically significant}. Perhaps what this means is that the headline should be a \emph{negligible} contribution of pairwise epistasis revealed by this novel and ingenious method, analogous to what has been discovered with respect to dominance (Hivert et al., 2021; Pazokitoroudi et al., 2021; Okbay et al., 2022; Palmer et al., 2023).

      REFERENCES

      Bulik-Sullivan, B., Loh, P.-R., Finucane, H. K., Ripke, S., Yang, J., Schizophrenia Working Group of the Psychiatric Genomics Consortium, Patterson, N., Daly, M. J., Price, A. L., & Neale, B. M. (2015). LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nature Genetics, 47, 291-295.

      Burger, R. (2000). The mathematical theory of selection, recombination, and mutation. Wiley.

      de Vlaming, R., Okbay, A., Rietveld, C. A., Johannesson, M., Magnusson, P. K. E., Uitterlinden, A. G., van Rooij, F. J. A., Hofman, A., Groe- nen, P. J. F., Thurik, A. R., & Koellinger, P. D. (2017). Meta-GWAS Accuracy and Power (MetaGAP) calculator shows that hiding heritability is partially due to imperfect genetic correlations across studies. PLoS Genetics, 13, e1006495.

      Evans, L. M., Tahmasbi, R., Vrieze, S. I., Abecasis, G. R., Das, S., Gazal, S., Bjelland, D. W., de Candia, T. R., Haplotype Reference Consortium, Goddard, M. E., Neale, B. M., Yang, J., Visscher, P. M., & Keller, M. C. (2018). Comparison of methods that use whole genome data to estimate the heritability and genetic architecture of complex traits. Nature Genetics, 50, 737-745.

      Ewens, W. J. (2004). Mathematical population genetics I. Theoretical introduction (2nd ed.). Springer.

      Fisher, R. A. (1930). The genetical theory of natural selection. Oxford University Press.

      Fisher, R. A. (1941). Average excess and average effect of a gene substitution. Annals of Eugenics, 11, 53-63.

      Gazal, S., Finucane, H. K., Furlotte, N. A., Loh, P.-R., Palamara, P. F., Liu, X., Schoech, A., Bulik-Sullivan, B., Neale, B. M., Gusev, A., & Price, A. L. (2017). Linkage disequilibrium-dependent architecture of human complex traits shows action of negative selection. Nature Genetics, 49, 1421-1427.

      Hivert, V., Sidorenko, J., Rohart, F., Goddard, M. E., Yang, J., Wray, N. R., Yengo, L., & Visscher, P. M. (2021). Estimation of non-additive genetic variance in human complex traits from a large sample of unrelated individuals. American Journal of Human Genetics, 108, 786- 798.

      Lee, J. J., McGue, M., Iacono, W. G., & Chow, C. C. (2018). The accuracy of LD Score regression as an estimator of confounding and genetic correlations in genome-wide association studies. Genetic Epidemiology, 42, 783-795.

      Lynch, M., & Walsh, B. (1998). Genetics and the analysis of quantitative traits. Sinauer.

      Okbay, A., Wu, Y., Wang, N., Jayashankar, H., Bennett, M., Nehzati, S. M., Sidorenko, J., Kweon, H., Goldman, G., Gjorgjieva, T., Jiang, Y., Hicks, B., Tian, C., Hinds, D. A., Ahlskog, R., Magnusson, P. K. E., Oskarsson, S., Hayward, C., Campbell, A., ... Young, A. I. (2022). Polygenic prediction of educational attainment within and between families from genome-wide association analyses in 3 million individu- als. Nature Genetics, 54, 437-449.

      Palmer, D. S., Zhou, W., Abbott, L., Wigdor, E. M., Baya, N., Churchhouse, C., Seed, C., Poterba, T., King, D., Kanai, M., Bloemendal, A., & Neale, B. M. (2023). Analysis of genetic dominance in the UK Biobank. Science, 379, 1341-1348.

      Pazokitoroudi, A., Chiu, A. M., Burch, K. S., Pasaniuc, B., & Sankararaman, S. (2021). Quantifying the contribution of dominance deviation effects to complex trait variation in biobank-scale data. American Journal of Human Genetics, 108, 799-808.

      Wainschtein, P., Jain, D., Zheng, Z., TOPMed Anthropometry Working Group, NHLBI Trans-Omics for Precision Medicine Consoritum, Cupples, L. A., Shadyab, A. H., McKnight, B., Shoemaker, B. M., Mitchell, B. D., Psaty, B. M., Kooperberg, C., Liu, C.-T., Albert, C. M., Roden, D., Chasman, D. I., Darbar, D., Lloyd-Jones, D. M., Arnett, D. K., . . . Visscher, P. M. (2022). Assessing the contribution of rare variants to complex trait heritability from whole-genome sequence data. Nature Genetics, 54, 263-273.

    1. Author Response

      The following is the authors’ response to the original reviews.

      Reviewer #1:

      1. The results that TF binding produces microdomains at medium and long linker DNA but not short linker is very interesting. Although the differences can be observed from the figure, it still lacks of quantitative comparison. It is not clear the exact definition of the microdomain observed from simulations and what numbers of microdomains can be identified under different conditions. A quantitative comparison of different conditions could also be provided.

      We thank the reviewer for this suggestion. Our intent was to show qualitatively how TF binding locations that we design can direct fiber folding and create microdomains, which we define in the paper as high frequency contact regions in the contact maps, similar to the TADs observed in HiC maps. Together with the fiber configurations, contact maps allow us to identify formation of such microdomains, and to observe how these microdomains change depending on the conditions we build into the model, such as TF binding region or linker DNA length.

      To address your point, we have added a clustering analysis of the contact matrices with nucleosome resolution and assign each contact along the genome position (nucleosome index) to a cluster. In Supporting Figure S6, we show how DBSCAN clustering provides a clustering distribution that quantitatively describes the microdomains observed in the matrices and estimates the number of microdomains. For example, in the 44 and 62 bp systems, the contacts along the genomic distance separate into 5, 2, and 1 nucleosome groups for topologies 1 to 3, and into 2 and 1 group for topology 4, respectively. In the 26 bp and Life-Like systems, where microdomains are more diffuse due to fiber rigidity or polymorphism, we see that the clustering results are not as TF-topology-dependent as in the 44 and 62 bp systems. We also decomposed the contact matrices into one dimensional plots that depict the magnitude of 𝑖, 𝑖 ± 𝑘 internucleosome interactions. We see that internucleosome patterns change with the TF binding topology, and that the 26 bp and Life-Like systems show the least changes.

      1. When increasing TF concentration, from 0 to 100%, it seems that both packing ratio and sedimentation coefficients are not sensitive to the TF concentrations after 25%. Is it due to the saturation of TF binding? How many TF binding sites are considered at each concentration?

      Yes, in most cases, at TF concentrations higher than 25%, the fiber compaction does not change due to saturation of TF binding. Although the TF concentrations are reached, such as 50%, 70%, or 100%, these do not influence the fiber architecture. A higher order folding and compaction cannot be reached due to excluded volume interactions that impede overlapping of beads in the model.<br /> We have clarified this in the manuscript.

      As stated in the Methods section, the TF concentration refers to the number of linker DNA beads that can engage in a constraint compared to the total number of linker DNA beads. Thus, at 25% TF, 25% of linker DNA beads are engaged in TF constraints. We have added a comment on this in the Results section.

      1. It is shown that the contact maps that reveal microdomains are ensemble-based maps and single trajectories do not show clear formation of microdomains. Does the formation of microdomains increase with the number of combined trajectories?

      The formation of microdomains occurs in each single trajectory. However, the microdomains formed in each trajectory can be different. That is why ensemble-based maps show clearer trends of microdomains that might not be as visible in single-trajectory maps. If we increase the number of trajectories, the macrodomains will be more visible and there will be more macrodomains in the contact map, but the formation of microdomains will not increase in each single trajectory.

      1. "As we see from Figure 4A, when the linker DNA is short, such as 26 and 35 bp, TF binding does not increase the packing ratio of the fiber." The results of 35bp cannot be found in Figure 4A. In addition, the color of 44 and 62 bp should be changed since they are very similar in the figure.

      Thank you for catching this. The results corresponding to the 35 bp system are presented in the Supporting Figure 7. We have changed the text to read “As we see from Figure 4A and Figure S7..”.

      We have changed the color of the 62 bp trace to blue in the plots of Figure 4. Consistently, we have also changed the color of the 62 bp fiber in Figure 2 and Figure 5.

      1. For modelling of TF binding at increasing concentrations, it is mentioned that in these three conditions, TFs are allowed to bind to any region. Do you mean TF can also bind to nucleosomal DNA? Nucleosome structure prevents the binding of many TFs.

      In our model, only linker DNA beads can engage in the constraints (bind TF).<br /> We have changed the text to read “TFs are allowed to bind to any linker DNA region”.

      1. The details of the Mnase-seq dataset and how NFRs are identified should be provided, such as the coverage of the data and what read fragments are selected for NFR mapping.

      MNase data in bedgraph format were downloaded from the Genome Expression Omnibus (GSM2083107) repository and loaded without further processing into the Genome Browser. NFRs were visually inspected and detected as genomic regions without peaks. As detailed in the GEO repository, the sequenced paired-end reads were mapped to the mm9 genome. Only uniquely mapped reads with no more than two mismatches were retained and reads with insert sizes less than 50 or larger than 500 bp were discarded.

      We have clarified this in the manuscript.

      1. The calculations of volume and area of the Eed promoter region should be further elucidated.

      Thank you. We now elaborate upon these calculations. In particular, the Eed promoter region is defined between cores 123 and 129. The x,y or x,y,z coordinates of those cores are used to create the bounding area or volume by defining the shape’s vertices.

      1. In Figure 3, it is not clear how different topology are identified.

      In Figure 3 the topology, or TF binding regions, is the same for each of the 10 contact maps as these emerge from trajectory replicas of the same system which we named Topology 1. Different microdomains are formed in each individual trajectory as the high-frequency regions appear in different locations on each contact map. However, when these 10 maps are summed, the ensemble contact map clearly shows consensus microdomains in each region where TF binds.

      Reviewer #2:

      To further improve the manuscript, I have the following suggestions/comments.

      1. While most of the conclusions in this paper follow from the evidence provided by the ximulations, the result in section 3.3 title "Gene locus repression is medicated by TF finding," may not follow from the results. In my opinion, repression is a more complex process, and many more factors (such as nucleosome positioning, nucleosome sliding, histone methylation, and other proteins such as PRC or HP1, etc) may be involved in repression. While compaction is often associated with repressed chromatin (heterochromatin), recent studies have shown that heterochromatin fibers are highly diverse, and compaction alone may not be the criteria for repression (eg. see Spracklin et al. Nat. Struct. Mol. Biol. 30, 38-51 (2023).). In this light, I would recommend slightly modifying the title to say, "TF binding-mediated compaction can help in gene locus repression" or something similar.

      Yes! We completely agree that gene repression is a very complex phenomenon that involves many factors that we are approaching by modeling starting from the simplest strategy. Thus, we have changed the subtitle to read “TF binding-mediated compaction as possible mechanism of gene locus repression”.

      1. Authors could also present the contact probability versus genomic distance. This may provide some generic features at nucleosome resolution, given the variability in linker length and LH density.

      We thank the reviewer for this suggestion. We have now calculated the contact probability for the EED gene with and without TF binding (Supporting Figure 8). We see that the contact probability corresponding to short range interactions (i ± 2, 3, 4, 5, and 6) is slightly lower for the EED gene upon TF binding. However, a striking increase in the contact probability upon TF binding is seen in the genomic region between 3 and 5 kb, which corresponds to local loop interactions. Thus, TF binding slightly decreases local interactions but increases chromatin loops. Such changes are not observed for the EED system with LH density 0.8 (Supporting Figure 9), further supporting the idea that an increase in LH density hampers the effect of TF binding for the EED gene architecture. <br /> We have now added these results to the manuscript.

      1. Write a short paragraph about the limitations of the model/study. For example, one of the limitations could be that, as of now, it has only the effect of a few proteins, but to predict repression, one may need to incorporate the effect of several proteins.

      We agree with the reviewer that our model is a simple, first-step approach. Nonetheless, even the simplest mathematical model can be enlightening in helping dissect essential factors. Here, our model clearly shows how TF binding location modulates fiber architecture and the interplay between TF binding and other chromatin elements, like linker DNA length, LH density, and histone acetylation. We have now stated in the Discussion section that although limited due to being implicit and not considering other protein partners, our model can provide insights on the regulation of chromatin architecture by protein binding. Future modeling with explicit protein binding or combination of several proteins will further help us understand genome folding regulation.

      1. The radius of gyration of 26 kb chromatin is around ~60nm in this paper. Is there any experimental measurement to compare (approximate order of magnitude)? While I do not know any measurement for Eed gene locus, I am aware of the results in the Boettiger et al. paper from Xiaowei Zhuang lab (Nature 2016). There, they find that the Rg of a 26 kb region is above 100nm. But that is for a different organism, a different set of genes. Also, see Sangram Kadam et al. Nature Communications 14 (1), 4108, 2023.

      Thank you for this suggestion. To the best of our knowledge, there are no radius of gyration measurements for the EED gene. Regarding the two papers you cite, in the paper from Boettiger et al. (1) they determine by microscopy experiments that Rg ∝ 𝐿! where 𝐿 is the genomic length and 𝑐 is 0.37 ± 0.02 for active chromatin (Figure 1d of the paper). In such case, the Rg for a 26 kb region would be 43 ± 9 nm. Considering that these are Drosophila cells, our value of 62 nm is in good agreement with that estimate. Regarding the Kadam et al. paper (2), by coarse grained modeling they find an Rg of around 100 nm for different genes. Considering that the radius of gyration depends on cell type and fiber configuration (see for example (3) for the dependency of Rg on loop number and persistence length), we believe that our measurements in the same ball park as experimental results and other theoretical modeling studies are good indicators of our model’s reasonableness.

      We have added this comparison to the manuscript.

      1. The reason why it is useful to compare some distance measurements (physical dimension) with experiments is the following: The contact map in Hi-C only gives relative contact probabilities. It does not give absolute contact probabilities. To convert a Hi-C map into a physical distance, one requires comparison with some experimentally measured 3D distance. The radius of gyration is an ideal quantity to compare. From my experience, the contact probability is often much smaller than 1, suggesting that the chromatin is more expanded. But this could be due to the effect of many other proteins in vivo and the crowding, etc. I do not expect this work to incorporate all those effects. However, it may be useful to make a comment about it in the manuscript.

      Thank you. We have added to the discussion a comment on our first-generation model of TF binding to chromatin and the neglect of many associated protein and RNA cofactors that certainly influence chromosome folding and domain formation on higher scales. Some distance measures are also added to the Results as mentioned above.

      References

      1. Boettiger,A.N., Bintu,B., Moffitt,J.R., Wang,S., Beliveau,B.J., Fudenberg,G., Imakaev,M., Mirny,L.A., Wu,C. and Zhuang,X. (2016) Super-resolution imaging reveals distinct chromatin folding for different epigenetic states. Nature, 529, 418–422.

      2. Kadam,S., Kumari,K., Manivannan,V., Dutta,S., Mitra,M.K. and Padinhateeri,R. (2023) Predicting scale-dependent chromatin polymer properties from systematic coarsegraining. Nat. Commun., 14, 4108.

      3. Wachsmuth,M., Knoch,T.A. and Rippe,K. (2016) Dynamic properties of independent chromatin domains measured by correlation spectroscopy in living cells. Epigenetics Chromatin, 9, 57.

    1. Author Response

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public Review):

      In this very strong and interesting paper the authors present a convincing series of experiments that reveal molecular mechanism of neuronal cell type diversification in the nervous system of Drosophila. The authors show that a homeodomain transcription factor, Bsh, fulfills several critical functions - repressing an alternative fate and inducing downstream homeodomain transcription factors with whom Bsh may collaborate to induce L4 and L5 fates (the author's accompanying paper reveals how Bsh can induce two distinct fates). The authors make elegant use of powerful genetic tools and an arsenal of satisfying cell identity markers.

      Thanks!

      I believe that this is an important study because it provides some fundamental insights into the conservation of neuronal diversification programs. It is very satisfying to see that similar organizational principles apply in different organisms to generate cell type diversity. The authors should also be commended for contextualizing their work very well, giving a broad, scholarly background to the problem of neuronal cell type diversification.

      Thanks!

      My one suggestion for the authors is to perhaps address in the Discussion (or experimentally address if they wish) how they reconcile that Bsh is on the one hand: (a) continuously expressed in L4/L4, (b) binding directly to a cohort of terminal effectors that are also continuously expressed but then, on the other hand, is not required for their maintaining L4 fate? A few questions: Is Bsh only NOT required for maintaining Ap expression or is it also NOT required for maintaining other terminal markers of L4? The former could be easily explained - Bsh simply kicks of Ap, Ap then autoregulates, but Bsh and Ap then continuously activate terminal effector genes. The second scenario would require a little more complex mechanism: Bsh binding of targets (with Notch) may open chromatin, but then once that's done, Bsh is no longer needed and Ap alone can continue to express genes. I feel that the authors should be at least discussing this. The postmitotic Bsh removal experiment in which they only checked Ap and depression of other markers is a little unsatisfying without further discussion (or experiments, such as testing terminal L4 markers). I hasten to add that this comment does not take away from my overall appreciation for the depth and quality of the data and the importance of their conclusions.

      Great suggestions, we will discuss these two hypotheses as requested.

      Bsh initiates Ap expression in L4 neurons which then maintain Ap expression independently of Bsh expression, likely through Ap autoregulation. During the synaptogenesis window, Ap expression becomes independent from Bsh expression, but Bsh and Ap are both still required to activate the synapse recognition molecule DIP-beta. Additionally, Bsh also shows putative binding to other L4 identity genes, e.g., those required for neurotransmitter choice, and electrophysiological properties, suggesting Bsh may initiate L4 identity genes as a suite of genes. The mechanism of maintaining identity features (e.g., morphology, synaptic connectivity, and functional properties) in the adult remains poorly understood. It is a great question whether primary HDTF Bsh maintains the expression of L4 identity genes in the adult. To test this, in our next project, we will specifically knock out Bsh in L4 neurons of the adult fly and examine the effect on L4 morphology, connectivity, and function properties.

      Reviewer #2 (Public Review):

      Summary:

      In this paper, the authors explore the role of the Homeodomain Transcription Factor Bsh in the specification of Lamina neuronal types in the optic lobe of Drosophila. Using the framework of terminal selector genes and compelling data, they investigate whether the same factor that establishes early cell identity is responsible for the acquisition of terminal features of the neuron (i.e., cell connectivity and synaptogenesis).

      Thanks for the positive words!

      The authors convincingly describe the sequential expression and activity of Bsh, termed here as 'primary HDTF', and of Ap in L4 or Pdm3 in L5 as 'secondary HDTFs' during the specification of these two neurons. The study demonstrates the requirement of Bsh to activate either Ap and Pdm3, and therefore to generate the L4 and L5 fates. Moreover, the authors show that in the absence of Bsh, L4 and L5 fates are transformed into a L1 or L3-like fates.

      Thanks!

      Finally, the authors used DamID and Bsh:DamID to profile the open chromatin signature and the Bsh binding sites in L4 neurons at the synaptogenesis stage. This allows the identification of putative Bsh target genes in L4, many of which were also found to be upregulated in L4 in a previous single-cell transcriptomic analysis. Among these genes, the paper focuses on Dip-β, a known regulator of L4 connectivity. They demonstrate that both Bsh and Ap are required for Dip-β, forming a feed-forward loop. Indeed, the loss of Bsh causes abnormal L4 synaptogenesis and therefore defects in several visual behaviors. The authors also propose the intriguing hypothesis that the expression of Bsh expanded the diversity of Lamina neurons from a 3 cell-type state to the current 5 cell-type state in the optic lobe.

      Thanks for the excellent summary of our findings!

      Strengths:

      Overall, this work presents a beautiful practical example of the framework of terminal selectors: Bsh acts hierarchically with Ap or Pdm3 to establish the L4 or L5 cell fates and, at least in L4, participates in the expression of terminal features of the neuron (i.e., synaptogenesis through Dip-β regulation).

      Thanks!

      The hierarchical interactions among Bsh and the activation of Ap and Pdm3 expression in L4 and L5, respectively, are well established experimentally. Using different genetic drivers, the authors show a window of competence during L4 neuron specification during which Bsh activates Ap expression. Later, as the neuron matures, Ap becomes independent of Bsh. This allows the authors to propose a coherent and well-supported model in which Bsh acts as a 'primary' selector that activates the expression of L4specific (Ap) and L5-specific (Pdm3) 'secondary' selector genes, that together establish neuronal fate.

      Thanks again!

      Importantly, the authors describe a striking cell fate change when Bsh is knocked down from L4/L5 progenitor cells. In such cases, L1 and L3 neurons are generated at the expense of L4 and L5. The paper demonstrates that Bsh in L4/L5 represses Zfh1, which in turn acts as the primary selector for L1/L3 fates. These results point to a model where the acquisition of Bsh during evolution might have provided the grounds for the generation of new cell types, L4 and L5, expanding lamina neuronal diversity for a more refined visual behaviors in flies. This is an intriguing and novel hypothesis that should be tested from an evo-devo standpoint, for instance by identifying a species when L4 and L5 do not exist and/or Bsh is not expressed in L neurons.

      Thanks for the appreciation of our findings!

      To gain insight into how Bsh regulates neuronal fate and terminal features, the authors have profiled the open chromatin landscape and Bsh binding sites in L4 neurons at mid-pupation using the DamID technique. The paper describes a number of genes that have Bsh binding peaks in their regulatory regions and that are differentially expressed in L4 neurons, based on available scRNAseq data. Although the manuscript does not explore this candidate list in depth, many of these genes belong to classes that might explain terminal features of L4 neurons, such as neurotransmitter identity, neuropeptides or cytoskeletal regulators. Interestingly, one of these upregulated genes with a Bsh peak is Dip-β, an immunoglobulin superfamily protein that has been described by previous work from the author's lab to be relevant to establish L4 proper connectivity. This work proves that Bsh and Ap work in a feed-forward loop to regulate Dip-β expression, and therefore to establish normal L4 synapses. Furthermore, Bsh loss of function in L4 causes impairs visual behaviors.<br /> Thanks for the excellent summary of our findings.

      Weaknesses:

      ● The last paragraph of the introduction is written using rhetorical questions and does not read well. I suggest rewriting it in a more conventional direct style to improve readability.

      We agree and have updated the text as suggested.

      ● A significant concern is the way in which information is conveyed in the Figures. Throughout the paper, understanding of the experimental results is hindered by the lack of information in the Figure headers. Specifically, the genetic driver used for each panel should be adequately noted, together with the age of the brain and the experimental condition. For example, R27G05-Gal4 drives early expression in LPCs and L4/L5, while the 31C06-AD, 34G07-DBD Split-Gal4 combination drives expression in older L4 neurons, and the use of one or the other to drive Bsh-KD has dramatic differences in Ap expression. The indication of the driver used in each panel will facilitate the reader's grasp of the experimental results.

      We agree and have updated the figure annotation.

      ● Bsh role in L4/L5 cell fate: o It is not clear whether Tll+/Bsh+ LPCs are the precursors of L4/L5. Morphologically, these cells sit very close to L5, but are much more distant from L4.

      Our current data show L4 and L5 neurons are generated by different LPCs. However, currently, we don’t have tools to demonstrate which subset of LPCs generate which lamina neuron type. We are currently working on a follow-up manuscript on LPC heterogeneity, but those experiments have just barely been started.

      ● Somatic CRISPR knockout of Bsh seems to have a weaker phenotype than the knockdown using RNAi. However, in several experiments down the line, the authors use CRISPR-KO rather than RNAi to knock down Bsh activity: it should be explained why the authors made this decision. Alternatively, a null mutant could be used to consolidate the loss of function phenotype, although this is not strictly necessary given that the RNAi is highly efficient and almost completely abolishes Bsh protein.

      The reason we chose CRISPR-KO (L4-specific Gal4, uas-Cas9, and uas-Bsh-sgRNAs) is that it effectively removed Bsh expression from the majority of L4 neurons. However, it failed to knock down Bsh in L4 neurons using L4-split Gal4 and Bsh-RNAi because L4-split Gal4 expression depends on Bsh. We have updated this explanation in the text.

      ● Line 102: Rephrase "R27G05-Gal4 is expressed in all LPCs and turned off in lamina neurons" to "is turned off as lamina neurons mature", as it is kept on for a significant amount of time after the neurons have already been specified.

      Thanks; we have made that change.

      ● Line 121: "(a) that all known lamina neuron markers become independent of Bsh regulation in neurons" is not an accurate statement, as the markers tested were not shown to be dependent on Bsh in the first place.

      Good point. We have rephrased it as “that all known lamina neuron markers are independent of Bsh regulation in neurons”.

      ● Lines 129-134: Make explicit that the LPC-Gal4 was used in this experiment. This is especially important here, as these results are opposite to the Bsh Loss of Function in L4 neurons described in the previous section. This will help clarify the window of competence in which Bsh establishes L4/L5 neuronal identities through ap/pdm3 expression.

      Thanks! We have updated Gal4 information in the text for every manipulation.

      ● DamID and Bsh binding profile:

      ● Figure 5 - figure supplement 1C-E: The genotype of the Control in (C) has to be described within the panel. As it is, it can be confused with a wild type brain, when it is in fact a Bsh-KO mutant.

      Great point! Thank you for catching this and we have updated it.

      ● It Is not clear how L4-specific Differentially Expressed Genes were found. Are these genes DEG between Lamina neurons types, or are they upregulated genes with respect to all neuronal clusters? If the latter is the case, it could explain the discrepancy between scRNAseq DEGs and Bsh peaks in L4 neurons.

      We did not use “L4-specific Differentially Expressed Genes”. Instead, we used all genes that are significantly transcribed in L4 neurons (line 209-213).

      ● Dip-β regulation:

      ● Line 234: It is not clear why CRISPR KO is used in this case, when Bsh-RNAi presents a stronger phenotype.

      As we explained above, the reason we chose CRISPR-KO (L4-specific Gal4, uas-Cas9, and uas-BshsgRNAs) is that it effectively removed Bsh expression from the majority of L4 neurons. However, it failed to knock down Bsh in L4 neurons using L4-split Gal4 and Bsh-RNAi because L4-split Gal4 expression depends on Bsh. We have updated this explanation in the text.

      ● Figure 6N-R shows results using LPC-Gal4. It is not clear why this driver was used, as it makes a less accurate comparison with the other panels in the figure, which use L4-Split-Gal4. This discrepancy should be acknowledged and explained, or the experiment repeated with L4-Split-Gal4>Ap-RNAi.

      I think you mean 6J-M shows results using LPC-Gal4. We first tried L4-Split-Gal4>Ap-RNAi but it failed to knock down Ap because L4-Split-Gal4 expression depends on Ap. We have added this to the text.

      ● Line 271: It is also possible that L4 activity is dispensable for motion detection and only L5 is required.

      Thanks! Work from Tuthill et al, 2013 showed that L5 is not required for any motion detection. We have included this citation in the text.

      ● Discussion: It is necessary to de-emphasize the relevance of HDTFs, or at least acknowledge that other, non-homeodomain TFs, can act as selector genes to determine neuronal identity. By restricting the discussion to HDTFs, it is not mentioned that other classes of TFs could follow the same PrimarySecondary selector activation logic.

      That is a great point, thank you! We have included this in the discussion.

    1. Author Response

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public Review):

      Strengths

      This paper is well situated theoretically within the habit learning/OCD literature.

      Daily training in a motor-learning task, delivered via smartphone, was innovative, ecologically valid and more likely to assay habitual behaviors specifically. Daily training is also more similar to studies with non-humans, making a better link with that literature. The use of a sequential-learning task (cf. tasks that require a single response) is also more ecologically valid.

      The in-laboratory tests (after the 1 month of training) allowed the researchers to test if the OCD group preferred familiar, but more difficult, sequences over newer, simpler sequences.

      The authors achieved their aims in that two groups of participants (patients with OCD and controls) engaged with the task over the course of 30 days. The repeated nature of the task meant that 'overtraining' was almost certainly established, and automaticity was demonstrated. This allowed the authors to test their hypotheses about habit learning. The results are supportive of the authors' conclusions.

      Response: We truly appreciate the positive assessment of referee 1, particularly the consideration that our study is theoretically strong and that ‘the results are supportive of the authors' conclusions’. This is an important external endorsement of our conclusions, contrasting somewhat with the views of referee 2.

      Weaknesses

      The sample size was relatively small. Some potentially interesting individual differences within the OCD group could have been examined more thoroughly with a bigger sample (e.g., preference for familiar sequences). A larger sample may have allowed the statistical testing of any effects due to medication status. The authors were not able to test one criterion of habits, namely resistance to devaluation, due to the nature of the task

      Response: We agree with the reviewer that the proof of principle established in our study opens new avenues for research into the psychological and behavioral determinants of the heterogeneity of this clinical population. However, considering the study timeline and the pandemic constraints, a bigger sample was not possible. Our sample can indeed be considered small if one compares it with current online studies, which do not require in-person/laboratory testing, thus being much easier to recruit and conduct. However, given the nature of our protocol (with 2 demanding test phases, 1-month engagement per participant and the inclusion of OCD patients without comorbidities only) and the fact that this study also involved laboratory testing, we consider our sample size reasonable and comparable to other laboratory studies (typically comprising on average between 30-50 participants in each group).

      This article is likely to be impactful -- the delivery of a task across 30 days to a patient group is innovative and represents a new approach for the study of habit learning that is superior to an inlaboratory approach.

      An interesting aspect of this manuscript is that it prompts a comparison with previous studies of goal-directed/habitual responding in OCD that used devaluation protocols, and which may have had their effects due to deficits in goal-directed behavior and not enhanced habit learning per se.

      Response: Thank you for acknowledging the impact of our study, in particular the unique ability of our task to interrogate the habit system.

      Reviewer #2 (Public Review):

      In this study, the researchers employed a recently developed smartphone application to provide 30 days of training on action sequences to both OCD patients and healthy volunteers. The study tested learning and automaticity-related measures and investigated the effects of several factors on these measures. Upon training completion, the researchers conducted two preference tests comparing a learned and unlearned action sequences under different conditions. While the study provides some interesting findings, I have a few substantial concerns:

      1. Throughout the entire paper, the authors' interpretations and claims revolve around the domain of habits and goal-directed behavior, despite the methods and evidence clearly focusing on motor sequence learning/procedural learning/skill learning. There is no evidence to support this framing and interpretation and thus I find them overreaching and hyperbolic, and I think they should be avoided. Although skills and habits share many characteristics, they are meaningfully distinguishable and should not be conflated or mixed up. Furthermore, if anything, the evidence in this study suggests that participants attained procedural learning, but these actions did not become habitual, as they remained deliberate actions that were not chosen to be performed when they were not in line with participants' current goals.

      Response: We acknowledge that the research on habit learning is a topic of current controversy, especially when it comes to how to induce and measure habits in humans. Therefore, within this context referee’s 2 criticism could be expected. Across distinct fields of research, different methodologies have been used to measure habits, which represent relatively stereotyped and autonomous behavioral sequences enacted in response to a specific stimulus without consideration, at the time of initiation of the sequence, of the value of the outcome or any representation of the relationship that exists between the response and the outcome. Hence these are stimulus-bound responses which may or may not require the implementation of a skill during subsequent performance. Behavioral neuroscientists define habits similarly, as stimulus-response associations which are independent of reward or outcome, and use devaluation or contingency degradation strategies to probe habits (Dickinson and Weiskrantz, 1985; Tricomi et al., 2009). Others conceptualize habits as a form of procedural memory, along with skills, and use motor sequence learning paradigms to investigate and dissect different components of habit learning such as action selection, execution and consolidation (Abrahamse et al., 2013; Doyon et al., 2003; Squire et al., 1993). It is also generally agreed that the autonomous nature of habits and the fluid proficiency of skills are both usually achieved with many hours of training or practice, respectively (Haith and Krakauer, 2018).

      We consider that Balleine and Dezfouli (2019) made an excellent attempt to bring all these different criteria within a single framework, which we have followed. We also consider that our discussion in fact followed a rather cautious approach to interpretation solely in terms of goaldirected versus habitual control.

      Referee 2 does not actually specify criteria by which they define habits and skills, except for asserting that skilled behavior is goal-directed, without mentioning what the actual goal of the implantation of such skill is in the present study: the fulfillment of a habit? We assume that their definition of habit hinges on the effects of devaluation, as a single criterion of habit, but which according to Balleine and Dezfouli (2019) is only 1 of their 4 listed criteria. We carefully addressed this specific criterion in our manuscript: “We were not, however, able to test the fourth criterion, of resistance to devaluation. Therefore, we are unable to firmly conclude that the action sequences are habits rather than, for example, goal-directed skills. Regardless of whether the trained action sequences can be defined as habits or goal-directed motor skills, it has to be considered…”. Therefore, we took due care in our conclusions concerning habits and thus found the referee’s comment misleading and unfair.

      We note that our trained motor sequences did in fact fulfil the other 3 criteria listed by Balleine and Dezfouli (2019), unlike many studies employing only devaluation (e.g. Tricomi et al 2009; Gillan et al 2011). Moreover, we cited a recent study using very similar methodology where the devaluation test was applied and shown to support the habit hypothesis (Gera et al., 2022).

      Whether the initiation of the trained motor sequences in experiment 3 (arbitration) is underpinned by an action-outcome association (or not) has no bearing on whether those sequences were under stimulus-response control after training (experiment 1). Transitions between habitual and goal-directed control over behavior are quite well established in the experimental literature, especially when choice opportunities become available (Bouton et al (2021), Frölich et al (2023), or a new goal-directed schemata is recruited to fulfill a habit (Fouyssac et al, 2022). This switching between habits and goal-directed responding may reflect the coordination of these systems in producing effective behavior in the real world.

      • Fouyssac M, Peña-Oliver Y, Puaud M, Lim NTY, Giuliano C, Everitt BJ, Belin D. (2021).Negative Urgency Exacerbates Relapse to Cocaine Seeking After Abstinence. Biological Psychiatry. doi: 10.1016/j.biopsych.2021.10.009

      • Frölich S, Esmeyer M, Endrass T, Smolka MN and Kiebel SJ (2023) Interaction between habits as action sequences and goal-directed behavior under time pressure. Front. Neurosci. 16:996957. doi: 10.3389/fnins.2022.996957

      • Bouton ME. 2021. Context, attention, and the switch between habit and goal-direction in behavior. Learn Behav 49:349– 362. doi:10.3758/s13420-021-00488-z

      1. Some methodological aspects need more detail and clarification.

      2. There are concerns regarding some of the analyses, which require addressing.

      Response: We thank referee 2 for their detailed review of the methods and analyses of our study and for the helpful feedback, which clearly helps improve our manuscript. We will clarify the methodological aspects in detail and conduct the suggested analysis. Please see below our answers to the specific points raised.

      Introduction:

      1. It is stated that "extensive training of sequential actions would more rapidly engage the 'habit system' as compared to single-action instrumental learning". In an attempt to describe the rationale for this statement the authors describe the concept of action chunking, its benefits and relevance to habits but there is no explanation for why sequential actions would engage the habit system more rapidly than a single-action. Clarifying this would be helpful.

      Response: We agree that there is no evidence that action sequences become habitual more readily than single actions, although action sequences clearly allow ‘chunking’ and thus likely engage neural networks including the putamen which are implicated in habit learning as well as skill. In our revised manuscript we will instead state: “we have recently postulated that extensive training of sequential actions could be a means for rapidly engaging the ‘habit system’ (Robbins et al., 2019)]”

      DONE in page 2

      1. In the Hypothesis section the authors state: “we expected that OCD patients... show enhanced habit attainment through a greater preference for performing familiar app sequences when given the choice to select any other, easier sequence”. I find it particularly difficult to interpret preference for familiar sequences as enhanced habit attainment.

      Response: We agree that choice of the familiar response sequence should not be a necessary criterion for habitual control although choice for a familiar sequence is, in fact, not inconsistent with this hypothesis. In a recent study, Zmigrod et al (2022) found that 'aversion to novelty' was a relevant factor in the subjective measurement of habitual tendencies. It should also be noted that this preference was present in patients with OCD. If one assumes instead, like the referee, that the familiar sequence is goal-directed, then it contravenes the well-known 'egodystonia' of OCD which suggests that such tendencies are not goal-directed.

      To clarify our hypothesis, we will amend the sentence to the following: “Finally, we expected that OCD patients would generally report greater habits, as well as attribute higher intrinsic value to the familiar app sequences manifested by a greater preference for performing them when given the choice to select any other, easier sequence”.

      DONE in page 5. We have now rephrased it: “Additionally, we hypothesized that OCD patients would generally display stronger habits and assign greater intrinsic value to the familiar app sequences, evidenced by a marked preference for executing them even when presented with a simpler alternative sequence.”

      A few notes on the task description and other task components:

      1. It would be useful to give more details on the task. This includes more details on the time/condition of the gradual removal of visual and auditory stimuli and also on the within practice dynamic structure (i.e., different levels appear in the video).

      Response: These details will be included in the revised manuscript. Thank you for pointing out the need for further clarification of the task design.

      Done in page 7

      1. Some more information on engagement-related exclusion criteria would be useful (what happened if participants did not use the app for more than one day, how many times were allowed to skip a day etc.).

      Response: This additional information will be added to the revised manuscript. If participants omitted to train for more than 2 days, the researcher would send a reminder to the participant to request to catch up. If the participant would not react accordingly and a third day would be skipped, then the researcher would call to understand the reasons for the lack of engagement and gauge motivation. The participant would be excluded if more than 5 sequential days of training were missed. Only 2 participants were excluded given their lack of engagement.

      Done in page 8

      1. According to the (very useful) video demonstrating the task and the paper describing the task in detail (Banca et al., 2020), the task seems to include other relevant components that were not mentioned in this paper. I refer to the daily speed test, the daily random switch test, and daily ratings of each sequence's enjoyment and confidence of knowledge.

      If these components were not included in this procedure, then the deviations from the procedure described in the video and Banca al. (2020) should be explicitly mentioned. If these components were included, at least some of them may be relevant, at least in part, to automaticity, habitual action control, formulation of participants' enjoyment from the app etc. I think these components should be mentioned and analyzed (or at least provide an explanation for why it has been decided not to analyze them).

      This is also true for the reward removal (extinction) from the 21st day onwards which is potentially of particular relevance for the research questions.

      Response: The task procedure was indeed the same as detailed in Banca et al., 2020. We did not include these extra components in this current manuscript for reasons of succinctness and because the manuscript was already rather longer than a common research article, given that we present three different, though highly inter-dependent, experiments in order to answer key interrelated questions in an optimal manner. However, since referee 2 considers this additional analysis to be important, we will be happy to include it in the supplementary material of the revised manuscript.

      These additional components of the task as well as the respective analysis are now described in the Supplementary Materials.

      Training engagement analysis:

      1. I find referring to the number of trials including successful and unsuccessful trials as representing participants "commitment to training" (e.g. in Figure legend 2b) potentially inadequate. Given that participants need at least 20 successful trials to complete each practice, more errors would lead to more trials. Therefore, I think this measure may mostly represent weaker performance (of the OCD patients as shown in Figure 2b). Therefore, I find the number of performed practice runs, as used in Figure 2a (which should be perfectly aligned with the number of successful trials), a "clean" and proper measure of engagement/commitment to training.

      Response: We acknowledge referee’s concern on this matter and agree to replace the y-axis variable of Figure 2b to the number of performed practices (thus aligning with Figure 2a). This amendment will remove any potential effect of weaker performance on the engagement measurement and will provide clearer results.

      We have now decided to remove this figure as it does not add much to figure 2a. Instead, we replaced figure 2b and 2c for new plots, following new analysis linked to the next reviewer request (point 10)

      1. Also, to provide stronger support for the claim about different diurnal training patterns (as presented in Figure 2c and the text) between patients and healthy individuals, it would be beneficial to conduct a statistical test comparing the two distributions. If the results of this test are not significant, I suggest emphasizing that this is a descriptive finding.

      Response: Done, see revised Figure 2b and 2c. We have assessed the diurnal training patterns within each group using circular statistics, followed by independent-sample statistical testing of those circular distributions with the Watson’s U2 test ( Landler et al., 2021). While OCD participants have a group effect of practice with a significant peak at ~18:00, and HV participants have an earlier significant peak at ~15:00, the Watson’s U test did not find statistical betweengroup differences.

      • Landler L, Ruxton GD, Malkemper EP. Advice on comparing two independent samples of circular data in biology. Scientific reports. 2021 Oct 13;11(1):20337.

      Learning results:

      1. When describing the Learning results (p10) I think it would be useful to provide the descriptive stats for the MT0 parameter (as done above for the other two parameters).

      Response: Thank you for pointing this out. The descriptive stats for MT0 will be added to the revised version of the manuscript.

      Done page 11

      1. Sensitivity of sequence duration and IKI consistency (C) to reward:

      I think it is important to add details on how incorrect trials were handled when calculating ∆MT (or C) and ∆R, specifically in cases where the trial preceding a successful trial was unsuccessful. If incorrect trials were simply ignored, this may not adequately represent trial-by-trial changes, particularly when testing the effect of a trial's outcome on performance change in the next trial.

      Response: This is an important question. Our analysis protocol was designed to ensure that incorrect trials do not contaminate or confound the results. To estimate the trial-to-trial difference in ∆MT (or C) and ∆R, we exclusively included pairs of contiguous trials where participants achieved correct performance and received feedback scores for both trials. For example, if a participant made a performance error on trial 23, we did not include ∆R or ∆MT estimates for the pairs of trials 23-22 and 24-23. Instead of excluding incorrect trials from our analyses, we retained them in our time series but assigned them a NaN (not a number) value in Matlab. As a result, ∆R and ∆MT was not defined for those two pairs of trials. Similarly for C. This approach ensured that our analyses are not confounded by incremental or decremental feedback scores between noncontiguous trials. In the past, when assessing the timing of correct actions during skilled sequence performance, we also considered events that were preceded and followed by correct actions. This excluded effects such as post-error slowing from contaminating our results (Herrojo Ruiz et al., 2009, 2019). Therefore, we do not believe that any further reanalysis is required.

      • Ruiz MH, Jabusch HC, Altenmüller E. Detecting wrong notes in advance: neuronal correlates of error monitoring in pianists. Cerebral cortex. 2009 Nov 1;19(11):2625-39.

      • Bury G, García-Huéscar M, Bhattacharya J, Ruiz MH. Cardiac afferent activity modulates early neural signature of error detection during skilled performance. NeuroImage. 2019 Oct 1;199:704-17.

      1. I have a serious concern with respect to how the sensitivity of sequence duration to reward is framed and analyzed. Since reward is proportional to performance, a reduction in reward essentially indicates a trial with poor performance, and thus even regression to the mean (along with a floor effect in performance [asymptote]) could explain the observed effects. It is possible that even occasional poor performance could lead to a participant demonstrating this effect, potentially regardless of the reward. Accordingly, the reduced improvement in performance following a reward decrease as a function of training length described in Figure 5b legend may reflect training-induced increased performance that leaves less room for improvement after poor trials, which are no longer as poor as before. To address this concern, controlling for performance (e.g., by taking into consideration the baseline MT for the previous trial) may be helpful. If the authors can conduct such an analysis and still show the observed effect, it would establish the validity of their findings."

      Response: Thank you for raising this point. This has been done, see updated Figures 5 and 6. After normalizing the ∆MT(n+1) := MT(n+1) – MT(n) difference values by dividing them with the baseline MT(n) at trial n, we obtain the same results. Similar results are also obtained for IKI consistency (C).

      See below our initial response from June 2023.

      Thank you for raising this point. Figure 5b illustrates two distinct effects of reward changes on behavioral adaptation, which are expected based on previous research.

      I. Practice effects: Firstly, we observe that as participants progress across bins of practice, the degree of improvement in behavior (reflected by faster movement time, MT) following a decrease in reward (∆R−) diminishes, consistent with our expectations based on previous work. Conversely, we found that ∆MT does not change across bins of practices following an increase in reward (∆R+).

      We appreciate the reviewer’s suggestion regarding controlling for the reference movement time (MT) in the previous trial when examining the practice effect in the p(∆T|∆R−) and p(∆T|∆R+) distributions. In the revised manuscript, we will conduct the proposed control analysis to better understand whether the sensitivity of MT to score decrements changes across practice when normalising MT to the reference level on each trial. But see below for a preliminary control analysis.

      II. Asymmetry of the effect of ∆R− and ∆R+ on performance: Figure 5b also depicts the distinct impact of score increments and decrements on behavioural changes. When aggregating data across practice bins, we consistently observed that the centre of the p(∆T|∆R−) distribution was smaller (more negative) than that of p(∆T|∆R+). This suggests that participants exhibited a greater acceleration following a drop in scores compared to a relative score increase, and this effect persisted throughout the practice sessions. Importantly, this enhanced sensitivity to losses or negative feedback (or relative drops in scores) aligns with previous research findings (Galea et al., 2015; Pekny et al., 2014; van Mastrigt et al., 2020).

      We have conducted a preliminary control analysis to exclude the potential impact that reference movement time (MT) values could have on our analysis. We have assessed the asymmetry between behavioural responses to ∆R− and ∆R+ using the following analysis: We estimated the proportion of trials in which participants exhibited speed-up (∆T < 0) or slow-down (∆T > 0) behaviour following ∆R− and ∆R+ across different practice bins (bins 1 to 4). By discretising the series of behavioural changes (∆T) into binary values (+1 for slowing down, -1 for speeding up), we can assess the type of changes (speed-up, slow-down) without the absolute ∆T or T values contributing to our results. We obtained several key findings:

      • Consistent with expectations (sanity check), participants exhibited more instances of speeding up than slowing down across all reward conditions.

      • Participants demonstrated a higher frequency of speeding up following ∆R− compared to ∆R+, and this asymmetry persisted throughout the practice sessions (greater proportion of -1 events than +1 events). 53% events were speed-up events in the in the p(∆T|∆R+) distribution for the first bin of practices, and 55% for the last bin. Regarding p(∆T|∆R-), there were 63% speed-up events throughout each bin of practices, with this proportion exhibiting no change over time.

      • Accordingly, the asymmetry of reward changes on behavioural adaptations, as revealed by this analysis, remained consistent across the practice bins.

      Thus, these preliminary findings provide an initial response to referee 2 and offer valuable insights into the asymmetrical effects of positive/negative reward changes on behavioural adaptations. We plan to include these results in the revised manuscript, as well as the full control analysis suggested by the referee. We will further expand upon their interpretation and implications.

      1. Another way to support the claim of reward change directionality effects on performance (rather than performance on performance), at least to some extent, would be to analyze the data from the last 10 days of the training, during which no rewards were given (pretending for analysis purposes that the reward was calculated and presented to participants). If the effect persists, it is less unlikely that the effect in question can be attributed to the reward dynamics.

      Response: The reviewer’s concern is addressed in the previous quesQon. Also, this analysis would not be possible because our Gaussian fit analyses use the Qme series of conQnuous reward scores, in which ∆R− or ∆R+ are embedded. These events cannot be analyzed once reward feedback is removed because we do not have behavioral events following ∆R− or ∆R+ anymore.

      Done

      1. This concern is also relevant and should be considered with respect to the sensitivity of IKI consistency (C) to reward. While the relationship between previous reward/performance and future performance in terms of C is of a different structure, the similar potential confounding effects could still be present.

      Response: We will conduct this analysis for the revised manuscript, similarly to the control analysis suggested by referee 2 on MT. Our preliminary control analysis, as explained above, suggests that the fundamental asymmetry in the effect of ∆R+ and ∆R+ on behavioral changes persists when excluding the impact of reference performance values in our Gaussian fit analysis.

      Done. See updated Figure 6. The results are very similar once we normalize the IKI consistency index C with the IKI of the baseline performance at trial n.

      1. Another related question (which is also of general interest) is whether the preferred app sequence (as indicated by the participants for Phase B) was consistently the one that yielded more reward? Was the continuous sequence the preferred one? This might tell something about the effectiveness of the reward in the task.

      Response: We have now conducted this analysis. There is in fact no evidence to conclude that the continuously rewarded sequence was the preferred one. The result shows that 54.5% of HV and 29% of the OCD sample considered the continuous sequence to be their preferred one, a nonstatistically significant difference. Note that this preference may not necessarily be linked simply to programmed reward. The overall preference may be influenced by many other factors, such as, for example, the aesthetic appeal of particular combinations of finger movements.

      Regarding both experiments 2 and 3:

      1. The change in context in experiment 2 and 3 is substantial and include many different components. These changes should be mentioned in more detail in the Results section before describing the results of experiments 2 and 3.

      Response: Following referee’s advice, we will move these details (currently written in the Methods section) to the Results section, when we introduce Phase B and before describing the results of experiments 2 and 3.

      Done in page 21

      Experiment 2:

      1. In Experiment 2, the authors sometimes refer to the "explicit preference task" as testing for habitual and goal-seeking sequences. However, I do not think there is any justification for interpreting it as such. The other framings used by the authors - testing whether trained action sequences gain intrinsic/rewarding properties or value, and preference for familiar versus novel action sequences - are more suitable and justified. In support of the point I raised here, assigning intrinsic rewarding properties to the learned sequences and thereby preferring these sequences can be conceptually aligned with goal-directed behavior just as much as it could be with habit.

      Response: We clearly defined the theoretical framing of experiment 2 as a test of whether trained action sequences gain intrinsic value and we are pleased to hear that the referee agrees with this framing. If the referee is referring to the paragraph below (in the Discussion), we actually do acknowledge within this paragraph that a preference for the trained sequences can either be conceptually aligned with a habit OR a goal-directed behavior.

      “On the other hand, we are describing here two potential sources of evidence in favor of enhanced habit formation in OCD. First, OCD patients show a bias towards the previously trained, apparently disadvantageous, action sequences. In terms of the discussion above, this could possibly be reinterpreted as a narrowing of goals in OCD (Robbins et al., 2019) underlying compulsive behavior, in favor of its intrinsic outcomes”

      This narrowing of goals model of OCD refers to a hypothetically transiQonal stage of compulsion development driven by behavior having an abnormally strong, goal-directed nature, typically linked to specific values and concerns.

      If the referee is referring to the penulQmate sentence of hypothesis secQon, this has been amended in response to Q5. We cannot find any other possible instances in this manuscript stating that experiment 2 is a test of habitual or goal-directed behavior.

      Experiment 3:

      1. Similar to Experiment 2, I find the framing of arbitration between goal-directed/habitual behavior in Experiment 3 inadequate and unjustified. The results of the experiment suggest that participants were primarily goal-directed and there is no evidence to support the idea that this reevaluation led participants to switch from habitual to goal-directed behavior.

      Also, given the explicit choice of the sequence to perform participants had to make prior to performing it, it is reasonable to assume that this experiment mainly tested bias towards familiar sequence/stimulus and/or towards intrinsic reward associated with the sequence in value-based decision making.

      Response: This comment is aligned with (and follows) the referee’s criticism of experiment 1 not achieving automatic and habitual actions. We have addressed this matter above, in response 1 to Referee 2.

      Mobile-app performance effect on symptomatology: exploratory analyses:

      1. Maybe it would be worth testing if the patients with improved symptomatology (that contribute some of their symptom improvement to the app) also chose to play more during the training stage.

      Response: We have conducted analysis to address this relevant question. There is no correlation between the YBOCS score change and the number of total practices, meaning that the patients who improved symptomatology post training did not necessarily chose to play the app more during the training stage (rs = 0.25, p = 0.15). Additionally, we have statistically compared the improvers (patients with reduced YBOCS scores post-training) and the non-improvers (patients with unchanged or increased YBOCS scores post-training) in their number of app completed practices during the training phase and no differences were observed (U = 169, p = 0.19).

      The result from the correlational analysis has been added to the revised manuscript (page 28).

      Discussion:

      1. Based on my earlier comments highlighting the inadequacy and mis-framing of the work in terms of habit and goal-directed behavior, I suggest that the discussion section be substantially revised to reflect these concerns.

      Response: We do not agree that the work is either "inadequate or mis-framed" and will not therefore be substantially revising the Discussion. We will however clarify further the interpretation we have made and make explicit the alternative viewpoint of the referee. For example, we will retitle experiment 3 as “Re-evaluation of the learned action sequence: possible test of goal/habit arbitration” to acknowledge the referee’s viewpoint as well as our own interpretation.

      Done

      1. In the sentence "Nevertheless, OCD patients disadvantageously preferred the previously trained/familiar action sequence under certain conditions" the term "disadvantageously" is not necessarily accurate. While there was potentially more effort required, considering the possible presence of intrinsic reward and chunking, this preference may not necessarily be disadvantageous. Therefore, a more cautious and accurate phrasing that better reflects the associated results would be useful.

      Response: We recognize that the term "disadvantageously" may be semantically ambiguous for some readers and therefore we will remove it.

      Done

      Materials and Methods:

      1. The authors mention: "The novel sequence (in condition 3) was a 6-move sequence of similar complexity and difficulty as the app sequences, but only learned on the day, before starting this task (therefore, not overtrained)." - for the sake of completeness, more details on the pre-training done on that day would be useful.

      Response: Details of the learning procedure of the novel sequence (in condition 3, experiment 3) will be provided in the methods of the revised version of the manuscript.

      Done in page 40

      Minor comments:

      1. In the section discussing the sensitivity of sequence duration to reward, the authors state that they only analyzed continuous reward trials because "a larger number of trials in each subsample were available to fit the Gaussian distributions, due to feedback being provided on all trials." However, feedback was also provided on all trials in the variable reward condition, even though the reward was not necessarily aligned with participants' performance. Therefore, it may be beneficial to rephrase this statement for clarity.

      Response: We will follow this referee’s advice and will rephrase the sentence for clarity.

      Done. See page 16.

      1. With regard to experiment 2 (Preference for familiar versus novel action sequences) in the following statement "A positive correlation between COHS and the app sequence choice (Pearson r = 0.36, p = 0.005) further showed that those participants with greater habitual tendencies had a greater propensity to prefer the trained app sequence under this condition." I find the use of the word "further" here potentially misleading.

      Response: The word "further" will be removed.

      Done

      Reviewer #1 (Recommendations For The Authors):

      This is a very interesting manuscript, which was a pleasure to review. I have some minor comments you may wish to consider.

      1. I believe that it is possible to include videos as elements in eLife articles - please consider if you can do this to demonstrate the action sequence on the smartphone. I followed the YouTube video, and it was very helpful to see exactly what participants did, but it would be better to attach the video directly, if possible.

      Response: This is a great idea and we will definitely attach our video demonstrating the task to the revised manuscript (Version of Record) if the eLife editors allow.

      We ask permission to the editor to add the video

      1. The abstract states that the study uses a "novel smartphone app" but is the same one as described in Banca et al. Suggest writing simply "smartphone app".

      Response: We will remove the word novel.

      Done

      1. Some of the hypotheses described in the second half of the Hypothesis section could be stated more explicitly. For example: "We also hypothesized that the acquisition of learning and automaticity would differ between the two action sequences based on their associated rewarded schedule (continuous versus variable) and reward valence (positive or negative)." The subsequent sentence explains the prediction for the schedule but what is the hypothesized direction for reward valence? More detail is subsequently given on p. 14, Results, but it would be better to bring these details up to the Introduction. "We additionally examined differential effects of positive and negative feedback changes on performance to build on previous work demonstrating enhanced sensitivity to negative feedback in patients with OCD (Apergis-Schoute et al 2023, Becker et al., 2014; Kanen et al., 2019)." In general, the second part of the Hypothesis section is a bit dense, sometimes with two predictions per sentence. It could be useful for the reader if hypotheses were enumerated and/or if a distinction was made among the hypotheses with respect to their importance.

      We fully revised the hypothesis section, on page 5, following this reviewer’s suggestion. We think this section is much clearer now, in our revised manuscript.

      Response: Thank you for pointing out the need for clarity in our hypothesis section. This is a very important point and we will carefully rewrite our hypothesis in the revised manuscript to make them as clear as possible.

      1. Did medication status correlate with symptom severity in the OCD group (e.g., higher symptoms for the 6 participants on SSRI+antipsychotics?). Could this, or SSRI-only status, have impacted results in any way? I appreciate that there is no way to test medication status statistically but readers may be interested in your thoughts on this aspect.

      Response: We have now conducted exploratory analysis to assess the potential effect of medication in the following output measures: app engagement (as measured by completed practices), explicit preference and YBOCS change post-training. The patients who were on combined therapy (SSRIs + antipsychotic) did not perform significantly different in these measures as compared to the remaining patients and no other effects of interest were observed. Their symptomatology was indeed slightly more severe but not statistically significant [Y-BOCS combined = 26.2 (6.5); Y-BOCS SSRI only = 23.8 (6.1); Y-BOCS No Med = 23.8 (2.2), mean(std)]. Only one patient showed symptom improvement after the app training, another became worse and the remaining patients on combined therapy remain stable during the month.

      Palminteri et al (2011) found that unmedicated OCD patients exhibited instrumental learning deficits, which were fully alleviated with SSRI treatment. Therefore, it is possible that the SSRI medication (present in our sample) may have reduced habit formation and facilitated behavioral arbitration. However, since the effect goes against the habit hypothesis, it has is unlikely that it has confounded our measure of automaticity. If anything, medication rendered experiment 2 and 3 more goal-oriented. We agree that further studies are warranted to address the effect of SSRIs on these measures.

      1. You could explain earlier why devaluation could not be tested here (it is only explained in the Limitations section near the end)

      Response: The revised manuscript will be amended to account for this note.

      Done in page 25.

      1. Capitalize 'makey-makey', I didn't realize there was a product called Makey Makey until I Googled it.

      Response: Sure. We will capitalize 'Makey-Makey'. Thank you for pointing this out!

      Done

      Reviewer #2 (Recommendations For The Authors):

      Recommendations for the authors (ordered by the paper sections):

      In the introduction

      1. regarding this part "We used a period of 1-month's training to enable effective consolidation, required for habitual action control or skill retention to occur. This acknowledged previous studies showing that practice alone is insufficient for habit development as it also requires off-line consolidation computations, through longer periods of time (de Wit et al., 2018) and sleep (Nusbaum et al., 2018; Walker et al., 2003)." I advise the authors to re-check whether what is attributed here to de Wit et al. (2018) is indeed justified (if I remember correctly they have not mentioned anything about off-line consolidation computations).

      Response: When we revise the manuscript, we will remove the de Wit et al. (2018) citation from this sentence.

      Done

      in the Outline paragraph

      1. it stated: "We continuously collected data online, in real time, thus enabling measurements of procedural learning as well as automaticity development." I think this wording implies that the fact that the data was collected online in real time was advantageous in that it enabled to assess measurements of procedural learning and automaticity development, which in my understanding is not the case.

      Response: To make this sentence clearer, we will change it to the following: ‘We continuously collected data online, to monitor engagement and performance in real time and to enable acquisition of sufficient data to analyze, à posteriori, procedural learning and automaticity development’.

      Done in page 4: ‘We collected data online continuously to monitor engagement and performance in real-time. This approach ensured we acquired sufficient data for subsequent analysis of procedural learning and automaticity development’.

      1. In the final sentence of this paragraph "or and" should be changed to "or/end".

      Response: This was a typo. The word ‘and’ will be removed.

      Done

      1. In Figure 1c - Note that in the figure legend it says "Each sequence comprises 3 single press moves, 2 two-finger moves..." whereas in the example shown in the figure it's the other way around (2 single press moves and 3 two-finger moves).

      Response: Thank you so much for spotting this! The example shown in the figure is incorrect. We apologize for the mistake. It should depict 3 single press moves, 2 two-finger moves and 1 three- finger move. The figure will be amended.

      Done

      In the results section:

      1. Regarding the "were followed by a positive ring tone and the unsuccessful ones by a negative ring tone", I suggest mentioning that there was also a positive visual (rewarding) effect.

      Response: Thank you. A mention to the visual effect will be added for both the positive (successful) and negative (unsuccessful) trials. Done in page 7

      1. p 10. - Note a typo in the following sentence where the word "which" appears twice consecutively:

      "Furthermore, both groups exhibited similar motor durations at asymptote which, which combined with the previous conclusion, indicates that OCD patients improved their motor learning more than controls, but to the same asymptote."

      Response: Thank you for spotting this typo. The second word will be removed. Done

      1. I have a few suggestions with respect to Figure 3:

      2. keeping the y-axes scale similar in all subplots would be more visually informative.

      Here we kept the y-axes scale similar in all subplots, except one of them, which was important to keep to capture all the data.

      1. For the subplots in 3b I would recommend for the transparent regions, instead of the IQR, to use the median +/- 1.57 * IQR/sqrt(n) which is equivalent to how the notches are calculated in a box-plot figure (It is referred to as an approximate 95% confidence interval for the median). This should make the transparent area narrower and thus better communicate the results.

      Done

      1. I think the significant levels mentioned in figure legend 3b (which are referring to the group effect measured for each reward schedule type separately) is not mentioned in the text. While not crucial, maybe consider adding it in the text.

      We don’t think this is necessary and may actually lead to confusion because in the text we report a Kruskal–Wallis H test (which is the most appropriate statistical test), including their H and p values for the group and reward effects. Since in the figure we separated the analysis and plots for variable and continuous reward schedules (for visual purposes) , we reported a U test separated for each reward schedule. Therefore, we consider that the correct statistics are reported in the appropriate places of the manuscript.

      Response: Thank you for this very helpful suggestion. We will amend figure 3 accordingly.

      1. In the Automaticity results (pp. 12 and 13) when describing the Descriptive stats the wrong parameter indicator are used (DL instead of CL and nD instead of nC.

      Response: Thank you for noticing it. We will amend.

      Done

      1. In Sensitivity of IKI consistency (C) to reward results:

      In Figure 6a legend: with respect to "... and for reward increments (∆R+, purple) and decrements (∆R-, green)" - note that there are also additional colors indicating these ∆Rs.

      Response: Done. We had used a 2 x 2 color scheme: green hues for ∆R-, and purple hues for ∆R+. Then, OCD is denoted by dark colors, and HV by light colors. This represents all four colors used in the figure. For instance, OCD and ∆R- is dark green, whereas OCD and ∆R+ is denoted by dark purple.

      1. p.21 - the YBOCS abbreviation appears before the full form is spelled out in the text.

      Response: In the revised version, we will make sure the YBOCS abbreviation will be spelled out the first time it is mentioned.

      Done in page 24

      Experiments 2 and 3:

      1. If there is a reason behind presenting the conditions sequentially rather than using intermixed trials in experiments 2 and 3, it would be useful to mention it in the text.

      Response: Experiment 2 could have used intermixed trials. However, we were concerned that the use of intermixed trials in experiment 3 would increase excessively the memory load of the task, which could then be a confound.

      Done in page 41

      1. I wonder whether the presentation order of the conditions in experiments 2 and 3 affected participants' results? Maybe it is worth adding this factor to the analysis.

      Response: As we mentioned both in the methods and results sections, we counterbalanced all the conditions across participants, in both experiments 2 and 3. This procedure ensures no order effects.

      Experiment 2:

      1. Regarding this sentence (pp. 21-22): "However, some participants still preferred the app sequence, specifically those with greater habitual tendencies, including patients who considered the app training beneficial." I think the part that mentions that there are "patients who considered the app training beneficial" appears below and it may confuse the reader. I suggest either providing a brief explanation or indicating that further details will be provided later in the text ("see below in...").

      Response: We will clarify this section.

      We added “see below exploratory analyses of “Mobile-app performance effect on symptomatology”” in the end of the sentence so that the reader knows this is further explained below. Page 25

      1. Finally, in addition to subgrouping maybe it is worth testing whether there is a correlation between the YBOCS score change and the app-sequences preference (as to learn if the more they change their YBOCS the more they prefer the learned sequences and vice versa?)

      Response: Thank you for suggesting this relevant correlational analysis, which we have now conducted. Indeed, there is a correlation between the YBOCS score change and the preference for the app-sequences, meaning that the higher the symptom improvement after the month training, the greater the preference for the familiar/learned sequence. This is particularly the case for the experimental condition 2, when subjects are required to choose between the trained app sequence and any 3-move sequence (rs = 0.35, p=0.04). A trend was observed for the correlation between the YBOCS score change and the preference for the app-sequences in experimental condition 1 (app preferred sequence versus any 6-move sequence): rs = 0.30, p=0.09.

      This finding represents an additional corroboration of our conclusion that the app seems to be more beneficial to patients more prone to routine habits, who are somewhat more averse to novelty.

      This analysis was added in page 24, 25 and page 35.

      Experiment 3:

      1. You mention "The task was conducted in a new context, which has been shown to promote reengagement of the goal system (Bouton, 2021)." In my understanding this observation is true also for experiment 2. In such case it should be stated earlier (probably under: "Phase B: Tests of actionsequence preference and goal/habit arbitration").

      Response: As answered above in (Q17), we will follow this referee 2’s suggestion and describe the contextual details of experiments 2 and 3 in the Results section, when we introduce Phase B.

      Done in page 21.

      1. w.r.t this sentence - "...that sequence (Figure 8b, no group effects (p = 0.210 and BF = 0.742, anecdotal evidence)" I would add what the anecdotal evidence refers (as done in other parts of the paper), to prevent potential confusion.

      Response: OK, this will be added.

      Added on page 27

      Discussion:

      1. w.r.t. "Here we have trained a clinical population with moderately high baseline levels of stress and anxiety, with training sessions of a higher order of magnitude than in previous studies (de Wit et al., 2018, 2018; Gera et al., 2022) (30 days instead of 3 days)." The Gera et al. 2022 (was more than 3 days), you probably meant Gera et al. 2023 ("Characterizing habit learning in the human brain at the individual and group levels: a multi-modal MRI study", for which 3 days is true).

      Response: Thank you for pointing this out. We will keep the citation to Gera et al 2022 given its relevance to the sentence but we will remove the information inside the parenthesis. This amendment will solve the issue raised here.

      Done in page 32

      1. w.r.t "to a simple 2-element sequence with less training (Gera et al., 2022)" - it's a 3-element sequence in practice.

      Response: Thank you for this correction. We will amend this sentence accordingly.

      Done in page 32

      1. (p.30) w.r.t "and enhanced error-related negativity amplitudes in OCD" - a bit more context of what the negative amplitudes refer to would be useful (So the reader understands it refers to electrophysiology).

      Response: We will add a sentence in our revised manuscript addressing this matter. This sentence has been removed in the revised manuscript

      Supplementary materials:

      1. under "Sample size for the reward sensitivity analysis":

      It is stated "One practice corresponded to 20 correctly performed sequences. We therefore split the total number of correct sequences into four bins." I was not able to follow this reasoning here (20 correct trials in practice => splitting the data the 4 bins). More clarity here would be useful.

      Response: We will clarify this procedure of our analysis in the revised version of the manuscript. Thanks.

      Done. See Supplementary materials.

      1. Also, maybe I am missing something, but I couldn't understand why the number of sequences available per bin is different for the calculation of ∆MT and C. Aren't any two consecutive sequences that are good for the calculation of one of these measures also good for the calculation of the other?

      Response: Thank you for pointing this out. Indeed, the number of trials was the same for both analyses, ∆MT and C. We had saved an incorrect variable as number of trials. We will amend the text.

      We have re-analyzed the trial number data. The average number of trials per bin both for the ∆MT and C analyses was 109 (9) in the HV and 127 (12) in OCD groups. Although the number was on average larger in the patient group, we did not find significant differences between groups (p = 0.47).

      When assessing the p(∆T|∆R+) and p(∆T|∆R-) separately, more trials were available for p(∆T|∆R+), 107 (10) , than for p(∆T|∆R-), and 98 (8). These trial numbers differed significantly (p = 0.0046), but were identical for ∆MT and C analyses.

      Done. Included in Supplementary materials.

      Minor comments:

      1. Not crucial, but maybe for the sake of consistency consider merging the "Self-reported habit tendencies" section and the "Other self-reported symptoms" section, preferably where the latter is currently placed.

      Response: We fully understand the referee’s rationale underlying this suggestion. We indeed considered initially presenting the self-reported questionnaires all together, in a last, single section of the results, as suggested by the referee. However, we decided to report the higher habitual tendencies of OCD as an initial set of results, not only because it is a novel and important finding (which justifies it to be highlighted) but also because it is essential to the understanding of some of the remaining results presented.

      1. In some figure legends the percentage of the interval of the mentioned confidence intervals (probably 95%) is missing. I suggest adding it.

      Response: OK, this will be added.

      Done

      1. The NHS abbreviation appears without spelling out the full form.

      Response: This will be amended accordingly.

      I removed NHS as it is not relevant.

      1. In p.38 the citation (Rouder et al., 2012) is duplicated (appears twice consecutively).

      Response: Thank you for pointing this out. We will amend accordingly.

      Done

      In the results section:

      1. The authors mention: "To promote motivation, the total points achieved on each daily training sessions were also shown, so participants could see how well they improved across days". Yet, if the score is based on the number of practices, it may not represent participants improvement in case in some days more practices are performed. I suggest to clarify this point.

      Response: The goal of providing the scoring feedback was, as explained in the sentence, to gauge motivation and inform the subject about their performance. Having this goal in mind, it does not really matter if one day their scoring would be higher simply because they would have done more practice on that day. Participants could easily understand that the scoring reflected their performance on each practice so they would realize that the more practice, the greater their improvement and that the scoring would increase across days of practice. We will amend the sentence to the following: "To promote motivation, the total points achieved on each training session (i.e. practice) was also shown, so participants could see how well they improved across practice and across days".

      Done in page 7 and 8.

    1. Reviewer #2 (Public Review):

      Summary:<br /> The manuscript by Kang et al investigates how the consideration of pairwise encounters (consumer-resource chasing, intraspecific consumer pair, and interspecific consumer pair) influences the community assembly results. To explore this, they presented a new model that considers pairwise encounters and intraspecific interference among consumer individuals, which is an extension of the classical Beddington-DeAngelis (B-D) phenomenological model, incorporating detailed considerations of pairwise encounters and intraspecific interference among consumer individuals. Later, they connected with several experimental datasets.

      Strengths:<br /> They found that the negative feedback loop created by the intraspecific interference allows a diverse range of consumer species to coexist with only one or a few types of resources. Additionally, they showed that some patterns of their model agree with experimental data, including time-series trajectories of two small in-lab community experiments and the rank-abundance curves from several natural communities. The presented results here are interesting and present another way to explain how the community overcomes the competitive exclusion principle.

      Weaknesses:<br /> The authors only explore the case with interspecific interference or intraspecific interference exists. I believe they need to systematically investigate the case when both interspecific and intraspecific interference exists. In addition, the text description, figures, and mathematical notations have to be improved to enhance the article's readability. I believe this manuscript can be improved by addressing my comments, which I describe in more detail below.

      1. In nature, it is really hard for me to believe that only interspecific interference or intraspecific interference exists. I think a hybrid between interspecific interference and intraspecific interference is very likely. What would happen if both the interspecific and intraspecific interference existed at the same time but with different encounter rates? Maybe the authors can systematically explore the hybrid between the two mechanisms by changing their encounter rates. I would appreciate it if the authors could explore this route.

      2. In the first two paragraphs of the introduction, the authors describe the competitive exclusion principle (CEP) and past attempts to overcome the CEP. Moving on from the first two paragraphs to the third paragraph, I think there is a gap that needs to be filled to make the transition smoother and help readers understand the motivations. More specifically, I think the authors need to add one more paragraph dedicated to explaining why predator interference is important, how considering the mechanism of predator interference may help overcome the CEP, and whether predator interference has been investigated or under-investigated in the past. Then building upon the more detailed introduction and movement of predator interference, the authors may briefly introduce the classical B-D phenomenological model and what are the conventional results derived from the classical B-D model as well as how they intend to extend the B-D model to consider the pairwise encounters.

      3. The notations for the species abundances are not very informative. I believe some improvements can be made to make them more meaningful. For example, I think using Greek letters for consumers and English letters for resources might improve readability. Some sub-scripts are not necessary. For instance, R^(l)_0 can be simplified to g_l to denote the intrinsic growth rate of resource l. Similarly, K^(l)_0 can be simplified to K_l. Another example is R^(l)_a, which can be simplified to s_l to denote the supply rate. In addition, right now, it is hard to find all definitions across the text. I would suggest adding a separate illustrative box with all mathematical equations and explanations of symbols.

      4. What is the f_i(R^(F)) on line 131? Does it refer to the growth rate of C_i? I noticed that f_i(R^(F)) is defined in the supplementary information. But please ensure that readers can understand it even without reading the supplementary information. Otherwise, please directly refer to the supplementary information when f_i(R^(F)) occurs for the first time. Similarly, I don't think the readers can understand \Omega^\prime_i and G^\prime_i on lines 135-136.

    1. l t h o u g h this is less evident in two dimensions, inthree dimensions it is obvious: the r o t o raR = e x p ( - ia/2) = cos(lat/2) - i~-~ sin(lat/2) (3.13)represents a r o t a t i o n of tal radians a b o u t the axis along the direction of a.

      not super obvious to me.

    Annotators

  5. ia802908.us.archive.org ia802908.us.archive.org
    1. hey believe,with good reason,thatif ind i v i d u a l l i b e r t y i s a n u l t i m a t e e n d f o r h u man beings,-non'should be deprivei of it by others; least of all that some shouldenjoy it at the expenseof others

      This is why he's on the street. Western liberal thinking asserts this man's individual liberty, and he shouldn't be 'deprived of it by others'.

    1. einer p h y s i s c h e nE r d b e s c h r e i b u n g “ zum „Begriff einer p h y s i s c h e n W e l t b e s c h r e i b u n g “

      hier u.ö. (aber auch nicht immer!): Nach Transformation sind aus dem Sperrdruck (= erweitere Laufweite des Textes) Einzelbuchstaben mit Leerzeichen dazwischen geworden. Nicht nur unschön, sondern auch unauffindbar für die Volltextsuche!

    1. Author Response

      The following is the authors’ response to the current reviews.

      For the final Version of Record the following changes will be included: 1. Figure 4: Example traces replaced with a more representative simulation run that is more similar to the mean. 2. Methods: Description of the alignment procedure expanded to explain the algorithm steps better.


      The following is the authors’ response to the previous reviews

      We are grateful for the positive and insightful feedback from the editors and reviewers. These constructive comments have contributed to the enhancement of our work. We have revised the manuscript, addressing each of the comments raised. In addition, based on the commentary provided, we have introduced two new figures that offer a deeper understanding of our research findings:

      In new Figure 7, we present the analysis of the difference in onset times between motion and flash responses. This figure also includes a simple illustration elucidating the origins of these differences, highlighting the varying engagement of receptive fields by these stimuli. The data presented in this figure were initially featured in the main text of the original manuscript. Figure 11 offers a detailed comparison of the temporal and spatial characteristics of the synthetic presynaptic signals driving optimal DS in SACs. We compare these characteristics with the properties extracted from recorded glutamate release. Our analysis suggests that the sluggish dynamics observed in biological signals impede effective directional integration. Below are the detailed point-by-point responses to reviewers comments.

      Reviewer #1 (Public Review):

      Summary:

      Direction selectivity (DS) in the visual system is first observed in the radiating dendrites of starburst amacrine cells (SACs). Studies over the last two decades have aimed to understand the mechanisms that underlie these unique properties. Most recently, a 'space-time' model has garnered special attention. This model is based on two fundamental features of the circuit. First, distinct anatomical types of bipolar cells (BCs) are connected to proximal/distal regions of each of the SAC dendritic sectors (Kim et al., 2014). Second, that input across the length of the starburst is kinetically diverse, a hypothesis that has been only recently demonstrated experimentally using iGluSnFR imaging (Srivastava et al., 2022). However, the stark kinetic distinctions, i.e., the sustained/transient nature of BC input to SACs dendrites appear to be present mainly in responses to stationary stimuli. When BC receptive field properties are probed using white noise stimuli, the kinetic differences between BCs are relatively subtle or nonexistent (Gaynes et al., 2022; Strauss et al., 2022, Srivastava et al., 2022). Thus, if and how BCs contribute to direction selectivity driven by moving spots that are commonly used to probe the circuit remains to be clarified. To address this issue, Gaynes et al., combine evolutionary computational modeling (Ankri et al., 2020) with two-photon iGluSnFR imaging to address to what degree BCs contribute to the generation of direction selectivity in the starburst dendrites in response to stimuli that are commonly used experimentally.

      Strengths:

      Combining theoretical models and iGluSnFR imaging is a powerful approach as it first provides a basic intuition on what is required for the generation of robust DS, and then tests the extent to which the experimentally measured BC output meets these requirements.

      The conclusion of this study builds on the previous literature and comprehensively considers the diverse BC receptive field properties that may contribute to DS (e.g. size, lag, rise time, decay time).

      By 'evolving' bipolar inputs to produce robust DS in a model network, these authors provide a sound framework for understanding which kinetic properties could potentially be important for driving downstream DS. They suggest that response delay/decay kinetics, rather than the center/surround dynamics are likely to be most relevant (albeit the latter could generate asymmetric responses to radiating/looming stimuli).

      Weaknesses:

      Finally, these authors report that the experimentally measured BC responses are far from optimal for generating DS. Thus, the BC-based DS mechanism does not appear to explain the robust DS observed experimentally (even with mutual inhibition blocked). Nevertheless, I feel the comprehensive description of BC kinetics and the solid assessment of the extent to which they may shape DS in SAC dendrites, is a significant advancement in the field.

      Reviewer #2 (Public Review):

      Summary:

      In this study, the authors sought to understand how the receptive fields of bipolar cells contribute to direction selectivity in starburst amacrine cell (SAC) dendrites, their post synaptic partners. In previous literature, this contribution is primarily conceptualized as the 'space-time wiring model', whereby bipolar cells with slow-release kinetics synapse onto proximal dendrites while bipolar cells with faster kinetics synapse more distally, leading to maximal summation of the slow proximal and fast distal depolarizations in response to motion away from the soma. The space-time wiring contribution to SAC direction selectivity has been extensively tested in previous literature using connectomic, functional, and modeling approaches. However, the authors argue that previous functional studies of bipolar cell kinetics have focused on static stimuli, which may not accurately represent the spatiotemporal properties of the bipolar cell receptive field in response to movement. Moreover, this group and others have recently shown that bipolar cell signal processing can change directionally when visual stimuli starts within the receptive field rather than passing through it, complicating the interpretation of moving stimuli that start within a bipolar cell of interest's receptive field (e.g. stimulating only one branch of a SAC or expanding/contracting rings). Thus, the authors choose to focus on modeling and functionally mapping bipolar cell kinetics in response to moving stimuli across the entire SAC dendritic field.

      General Comments

      There have been several studies that have addressed the contribution of space-time wiring to SAC process direction selectivity. The impact of this project is to show that this contribution is limited. First, the optimal solution obtained by the evolutionary algorithm to generate DS processes is slow proximal and fast distal inputs - exactly what is predicted by space-time wiring, which is exactly what is required of the HRC model. Hence, this result seems expected and it's not clear what the alternative hypothesis is. Second, the experimental results based on glutamate imaging to assess the kinetics of glutamate release under conditions of visual stimulation across a large region of retina confirm previous observations but were important to test. Third, by combining their model model with this experiment data, they conclude that even the optimal space-time wiring is not sufficient to explain the SAC process DS. The results of this approach might be more impactful if the authors come to some conclusion as to what factors do determine the direction selectivity of the SAC process since they have argued that all the current models are not sufficient.

      Reviewer #3 (Public Review):

      Gaynes et al. investigated the presynaptic and postsynaptic mechanisms of starburst amacrine cell (SAC) direction selectivity in the mouse retina by computational modeling and glutamate sensitivity (iGluSnFR) imaging methods. Using the SAC computational simulation, the authors initially tested bipolar cell contributions (space-time wiring model, presynaptic effect) and SAC axial resistance contributions (postsynaptic effect) to the SAC DS. Then, the authors conducted two-photon iGluSnFR imaging from SACs to examine the presynaptic glutamate release, and found seven clusters of ON-responding and six clusters of OFF-responding bipolar cells. They were categorized based on their response kinetics: delay, onset phase, decay time, and others. Finally, the authors generated a model consisting of multiple clusters of bipolar cells on proximal and distal SAC dendrites. When the SAC DS was measured using this model, they found that the space-time wiring model accounted for only a fraction of SAC DS.

      The article has many interesting findings, and the data presentation is superb. Strengths and weaknesses are summarized below.

      Major Strengths:

      • The authors utilized solid technology to conduct computational modeling with Neuron software and a machine-learning approach based on evolutionary algorithms. Results are effectively and thoroughly presented.

      • The space-time wiring model was evaluated by changing bipolar cell response properties in the proximal and distal SAC dendrites. Many response parameters in bipolar cells are compared, and DSI was compared in Figure 3.

      • Two-photon microscopy was used to measure the bipolar cell glutamate outputs onto SACs by conducting iGluSnFR imaging. All the data sets, including images and transients, are elegantly presented. The authors analyzed the response based on various parameters, which generated more than several response clusters. The clustering is convincing.

      Major Weaknesses:

      • In Figure 9, the authors generated the bipolar cell cluster alignment based on the space-time wiring model. The space-time wiring model has been proposed based on the EM study that distinct types of bipolar cells synapse on distinct parts of SAC dendrites (Green et al 2016, Kim et al 2014). While this is one of the representative Reicardt models, it is not fully agreed upon in the field (see Stincic et al 2016). While the authors' approach of testing the space-time wiring model and conclusions is interesting and appreciated, the authors could address more issues: mainly two clusters were used to generate the model, but more numbers of clusters should be applied. Although the location of each cluster on the SAC dendrites is unknown, the authors should know the populations of clusters by iGluSnFR experiments. Furthermore, the authors could provide more suggestive mechanisms after declining postsynaptic factors and the space-time wiring model.

      The reviewer is correct that the proximal and more distal SAC dendrites sample from different IPL depths. It should be theoretically possible to match the functional clusters we measured with anatomical bipolar cell identities. However, the stratifications of these cells have significant overlaps (Figure 6-S2), and previous attempts to match iGluSnFR signals to anatomy proved to be challenging (Franke et al., 2017; Gaynes et al., 2022; Matsumoto et al., 2019; Srivastava et al., 2022; Strauss et al., 2022). In the revised version of the manuscript, we reorder the functional clusters based on their transiency, which has a higher correlation to stratification depth (Franke et al., 2017).

      We have examined a scenario in which the presynaptic population comprises more than two clusters. We constructed synthetic models whose input structure was as in Figure 10 (old Figure 9). The optimal configuration for the most proximal and distal inputs closely resembled the proximal-distal model reported in Figure 2. However, we observed a nearly linear variation in the shape of the optimal mid-range inputs, transitioning from proximal-like to distal-like responses as the distance increased. We consider this outcome to be expected based on the structure of the space-time wiring model (Kim et al., 2014). Interestingly, this was not the case with models incorporating physiologically recorded signals. As we show in Figure 10, the most common optimal directional tuning was seen when the bipolar drive consisted of two main populations, both in the ON and OFF SACs.

      Finally, we believe that uncovering additional mechanisms that underlie directional selectivity in SACs represents a crucial challenge for the field to tackle. It is highly probable that achieving directional selectivity involves a complex interplay of multiple factors. This includes the organization of the presynaptic circuit, which we have partially addressed in this study, as well as the influence of postsynaptic active conductances and feedback loops involving other SACs and presynaptic cells. We have expanded the discussion section to describe the possible mechanisms

      • The computational modeling demonstrates intriguing results: SAC dendritic morphology produces dendritic isolation, and a massive input overcomes the dendritic isolation (Figure 1). This modeling seems to be generated by basic dendritic cable properties. However, it has been reported that SAC dendrites express Kv3 and voltage-gated Ca channels. It seems to be that these channels are not incorporated in this model.

      The reviewer's observation is accurate; the model depicted in Figure 1 did not include voltage-gated channels. Our goal was to study electrotonic isolation, which is often measured in passive models. However, while we did not incorporate voltage-gated potassium channels implicitly in the models, our simulations are rooted in previous models that were fine-tuned using empirical data. As potassium channels are expected to influence the experimentally recorded input resistance, we have indirectly accounted for their impact on the interdendritic signal propagation.

      In subsequent model iterations, we have integrated voltage-gated calcium channels into our simulations to assess the signal responsible for driving synaptic release. We show that nonlinear voltage dependence of the calcium currents enhances compartmentalization of the local calcium levels (Figure 2), but did not significantly influence local voltages. Therefore, calcium channels do not appear to have a major impact on electrotonic distances.

      • In Figure 5B, representative traces are shown responding to moving bars in horizontal directions. These did not show different responses to two directional stimuli. It is unclear whether directional preference was not detected, which was shown by Yonehara's group recently (Matsumoto et al 2021). Or that was not investigated as described in the Discussion.

      Indeed, we observed no discernible directional differences in bipolar responses. This phenomenon can be primarily attributed to the fact that the signals originating from the limited number of directionally-tuned release sites are overshadowed by the release from non-directionally-tuned units (Matsumoto et al., 2021). In the revised discussion, we have acknowledged this limitation in our recorded data.

      • The authors found seven ON clusters and six OFF clusters, which are supposed to be bipolar cell terminals. However, bipolar cells reported to provide synaptic inputs are T-7, T-6, and multiple T-5s for ON SACs and T-1, T-2, and T-3s for OFF SACs. The number of types is less than the number of clusters. Potentially, clusters might belong to glutamatergic amacrine cells. These points are not fully discussed.

      We have expanded the discussion section to address these points.

      Reviewer #1 (Recommendations For The Authors):

      Major comments

      1. One of the main conclusions of this study is that diverse BC kinetics contribute to DS (Fig. 9). The authors nicely demonstrate using modeling that the experimentally measured BC kinetics are far from ideal. However, this conclusion is based on a model that almost exclusively relies on just two of the 7 putative BC types (e.g., C1 & C6 for On SACs) placed optimally along the dendrites, which raises two important caveats.

      First, given that other BC types are likely to contribute, the effects of two distinct types are likely to be diluted. Thus, the contribution of BCs to DS is likely to be significantly overestimated. Second, given that the dendrites of 10-30 SACs cross each point in the honeycomb, for the given model to work, each BC would need to connect extremely selectively to SACs. i.e., at a given point, a sustained input must only connect to the more proximal dendritic segments, while avoiding entirely the distal segments of overlapping SAC dendrites. Thus, their model requires extremely selective wiring for which there is no evidence. In fact, there is evidence to the contrary provided by Ding et al. 2016, which showed that the type 7 (proximally biased) and type 5 (distally biased) populations had a substantial overlap (assuming these BC types correspond to kinetically diverse clusters).

      We wholeheartedly concur with the reviewer's perspective that our findings have led to an overestimation of the space-time wiring mechanism's role in SAC directional selectivity (DS). We have adjusted our discussion to emphasize this point. In light of this, our assertion that, even with the most favorable distribution of synaptic inputs, the space-time wiring model still does not fully account for the experimentally-determined directional tuning in SAC, remains valid.

      With regard to the model, it would also be worth comparing results to previous starburst models (e.g., Tukker et al,. 2004), which demonstrated a robust DS in SAC dendrites in the absence of kinetically diverse BC input. Why is the cell-intrinsic DS so weak in the present model?

      We have directly explored this question in the synthetic model (Figures 2, 3). Despite variances in the anatomy of SACs and the distribution of bipolar inputs between our model and the study by (Tukker et al., 2004), we observed remarkably similar levels of directional selectivity index computed from the voltage response (approximately 10%, as shown in Figure 3, 'Identical BCs').

      The primary distinction emerged in the degree of DS amplification mediated by calcium currents. Tukker et al., 2004 reported considerably higher DS compared to our findings, despite employing similar formulations for voltage-gated calcium channel models. The key factor driving this difference lies in the fact that Tukker et al., 2004 measured amplification in proximity to the threshold of calcium channel activation. Even minor variations in membrane potentials near this threshold can lead to substantial differences in calcium influx, especially when outward stimulation results in a calcium spike. In fact, recently, Robert Smith’s group revisited the threshold-based mechanism and concluded that it often fails to produce robust DS due to the heterogeneity of membrane potentials among different terminal dendrites (Wu et al., 2023).

      Our models were trained on five different stimuli velocities whose synaptic integration produced substantially different peak amplitudes. Consequently, the spike threshold alone couldn't reliably distinguish between inward and outward directions across all five conditions, resulting in reduced directional performance in our simulations. In the revised Figure 2-S2 we directly explore the performance of the model with identical BC formulations, trained on a single velocity. We find a dramatic enhancement of calcium DS (DSI=66%) in this condition compared to an identical model trained on 5 velocities (DSI=17%). Thus, evolutionary search is capable of finding the threshold-based solution, but only when the training is performed on a single stimulus velocity (Figure 2-S2). This solution did not generalize to multiple stimuli speeds because, as mentioned above, they lead to different postsynaptic depolarization levels (Figure 2, 2-S1). Instead, the algorithm converged on a set of postsynaptic paraments leading to less nonlinear calcium channel activation over a broader voltage range, ensuring effective DS performance over multiple velocities and heterogenous local potentials (Wu et al., 2023).

      1. Functionally distinct responses across different regions of interest (ROIs) were used to classify BC input. ROIs were obtained from multiple scan fields and retinas and combined into a single dataset for functional clustering. However, the consistency of the cluster distribution across these replicates has not been addressed. As BCs can exhibit different functional properties dependant on the state/health of the retina, it is important to know whether certain functional clusters may originate disproportionately from a particular experiment, as it implies that each cluster does not represent a different stable functional/anatomical population.

      We acknowledge that the state of the preparation can significantly impact signal dynamics. In response to this important consideration, we have incorporated details about the distribution of functional clusters in various experiments in the revised version of the manuscript (Figure 6-S1, and discussion).

      Other comments:

      1. Interpreting iGluSnFR signals: Since the sensor is expressed uniformly across the SAC dendrite, it is important to clarify why the measured F signals are considered synaptic responses. Could spillover contribute to the generation of slower responses?

      We do not believe spillover can explain slower responses because the sluggish clusters often responded significantly (up to 500ms) sooner to moving bars (Figures 6, 6-S3). We acknowledge and discuss this possibility of spillover in the revised discussion.

      1. One striking finding is the diversity of BCs RF sizes (Fig. 7C). Some BCs have RF that are far larger than their dendritic fields. It will be useful to discuss the potential mechanisms that may underlie large BC RFs.

      We changed the discussion to address this question.

      1. SAC DS is independent of dendritic isolation: The authors claim that dendritic isolation does not significantly impact DS. However, while this might be true for a linear motion through the receptive field, dendritic isolation probably matters for more dynamic stimuli. For example, DSGCs can encode rapid changes in objection direction, as DS is computed over fine spatiotemporal scales relying on SACs (Murphy-Baum et al., 2022). This could not occur if SAC dendrites were not well electrically isolated from each other.

      We believe that this is an accurate interpretation of our findings. Our research suggests that dendritic isolation is likely not a critical factor in the space-time wiring mechanism. However, as we demonstrate that this particular mechanism cannot fully account for the observed levels of DS in SACs, other mechanisms must be important. As previous studies revealed that dendritic isolation enhances SAC DS (for example, Koren et al., 2017), dendritic independence likely contributes to directional performance within SACs by these additional mechanisms.

      1. Figure 4: From what I understand, the BC inputs for the electrotonic connectivity variations evolved much like they were for the original model without axial resistance constraints. This makes sense, since stronger/weaker inputs with different temporal kernels may be appropriate for each condition, hence why the axial resistance wasn't changed post-evolution, which would have likely caused the DS to drop. If that is the case, however, I wonder how the best DS attainable by the final model which is constrained to the radial arrangement of realistic BC inputs (without being able to fit much more optimal sustained-transient BCs to their circumstance) would be impacted. Is dendritic isolation similarly unimportant when the pre-synaptic story isn't ideal?

      We have explored this question directly by allowing the evolutionary algorithm to modify the passive and active characteristics of the postsynaptic SAC. Our findings are summarized in Figure 9-S1. We observed a correlation between DSI levels and membrane/axial resistance values in SACs in the evolved models. Better DS was seen with leaky membranes (higher isolation) and lower axial resistance (lower isolation). While it is clear that postsynaptic parameters can influence synaptic integration, they can not fully compensate for inadequate presynaptic dynamics.

      1. BC are shown to contribute to DS across velocities (Fig. 9), which contrasts with results from Srivastava et al., (2022) that showed BCs contribute to DS at lower velocities. However, this discrepancy can easily be explained by the choice of moving spots. In this study, the sweeping bars had dynamic width (targeting pixel dwell time of 2s), which means for higher velocities the bar is significantly wider. While in the previous study, the width of the stimulus was kept constant, and thus for higher velocities, the sustained/transient kinetic differences of BCs are less clear (Srivastava et al., 2021). The author's should discuss this explicitly, to avoid discrepancies between these two studies the reader might otherwise perceive.

      We value reveiwer’s feedback, and in response, we have included an additional paragraph in the manuscript addressing the distinctions in directional tuning that arise from the space-time model presented in this work, in comparison to earlier studies.

      1. Methods: It will be good to discuss how ROIs sizes and positions were selected (pixel correlations?)

      We have included a more detailed explanation of the clustering procedure

      • Lines 614 describe whole-cell patch clamp techniques, which are not used in this study.

      We used patch-clamp to record the waveforms shown in Figure 2-S2

      1. Figure 6: Diversity of Glut responses to motion in ON and OFF SACs, caption typos?

      2. "Left:" without "Right:" to describe the population (I presume) viewed as an image

      3. If there should still be A,C and B,D to group the ON and OFF halves, maybe it should be mentioned in the caption

      Thank you for bringing this to our attention, the legends were fixed.

      References:

      Kim, J. S., Greene, M. J., Zlateski, A., Lee, K., Richardson, M., Turaga, S. C., Purcaro, M., Balkam, M., Robinson, A., Behabadi, B. F., Campos, M., Denk, W., Seung, H. S., & EyeWirers (2014). Space-time wiring specificity supports direction selectivity in the retina. Nature, 509(7500), 331-336. https://doi.org/10.1038/nature13240

      Gaynes, J. A., Budoff, S. A., Grybko, M. J., Hunt, J. B., & Poleg-Polsky, A. (2022). Classical center-surround receptive fields facilitate novel object detection in retinal bipolar cells. Nature communications, 13(1), 5575. https://doi.org/10.1038/s41467-022-32761-8

      Murphy-Baum B. and Awatramani GB (2022). Parallel processing in active dendrites during periods of intense spiking activity, Cell Reports, Volume 38, Issue 8,

      Srivastava P, de Rosenroll G., MatsumotoA., Michaels T., Turple Z., Jain V, Sethuramanujam S, Murphy-Baum B, Yonehara K., Awatramani, G.B. (2022) Spatiotemporal properties of glutamate input support direction selectivity in the dendrites of retinal starburst amacrine cells eLife 11:e81533

      Strauss, S., Korympidou, M. M., Ran, Y., Franke, K., Schubert, T., Baden, T., Berens, P., Euler, T., & Vlasits, A. L. (2022). Center-surround interactions underlie bipolar cell motion sensitivity in the mouse retina. Nature communications, 13(1), 5574. https://doi.org/10.1038/s41467-022-32762-7

      Tukker, J. J., Taylor, W. R., & Smith, R. G. (2004). Direction selectivity in a model of the starburst amacrine cell. Visual neuroscience, 21(4), 611-625. https://doi.org/10.1017/S0952523804214109

      Reviewer #2 (Recommendations For The Authors):

      Specific comments

      1. Line 223. The statement a model trained on only optimal DSI would produce "negligible absolute differences in calcium levels." is unclear. This needs to be better explained.

      We have modified and expanded this paragraph to make it more clear

      1. Figure 4. The authors use this model to test the hypothesis that space time wiring contribution to SAC process DS requires dendritic isolation. They do this by increasing axial resistance around the soma of their model neuron to isolate each dendrite. They found comparable DS was achieved in both conditions, indicating that the space-time wiring model works in two cases of high and low dendritic isolation. However, to test the claim that "specific details of postsynaptic integration appear to play a lesser role" (line 274) the authors may consider allowing the axial resistance to change as a part of the model rather than testing two extreme states.

      Membrane and axial resistances (and active parameters) were allowed to change as part of model evolution in most simulations presented in this manuscript. We have added the information on the final resistance values reached in the evolved models in Figure 9-S1

      1. Figure 6: To study glutamatergic input onto SACs, the authors expressed iGLuSnFR in ChAT-Cre mice and grouped similarly responding pixels into ROIs and separated these responses into functional groups based on cluster analysis (Figure 5). The alignment of the responses in Figure 6A was confusing. It appears that average responses for each cluster are aligned based on the peak observed during the stimulus in each direction, but it is unclear how they are aligned relative to each other or what this timing is relative to location of the stimulus (i.e. what is time 0 in 6A?).

      The displayed traces represent the average responses to horizontally moving bars (speed = 0.5mm/s), either moving to the left or right. To achieve this alignment, we employed a procedure consistent with our recent publication (Gaynes et al., 2022), which we have now detailed more comprehensively. Here's the step-by-step process we followed:

      1. Determination of half-maximum rise times: Initially, we calculated the half-maximum rise times for glutamate signals recorded in response to left and right-moving stimuli.

      2. Calculation of mean rise time: We then computed the mean of these rise times, which served as a reference point for alignment.

      3. Alignment procedure: To illustrate the alignment process, consider an example. Suppose the 50% rise time for responses to left-moving stimuli occurs at 3 seconds, while responses to right-moving stimuli occur 4 seconds after stimulation onset. This discrepancy suggests that the RF of the cell is shifted to the right from the center of the display (assuming a stimulation speed of 0.5mm/s on the retina, the RF's position would be approximately 250μm from the midline). To align these responses, we shifted both waveforms by 500ms so that their 50% rise times coincided at 3.5 seconds. Importantly, 3.5 seconds would represent the 50% rise time of the ROI if it were precisely centered on the display. This alignment effectively removed any spatial position dependence from the ROIs.

      4. Comparative analysis and clustering: With the responses now aligned, we were able to compare their shapes and subsequently cluster the ROIs into distinct functional clusters. For clarity, we opted to highlight the time of response peak for cluster 1. Although this peak closely aligned with the calculated time of stimulus motion over the center of the 'shifted RF' in the adjusted time frame, it provided a more straightforward comparison between response dynamics.

      1. The authors need to do a better job explaining how their results differ from Ezra-Tsur et al 2021, which uses the same sort of model to address the same question. The discussion about this study (lines 425-435) are based on how a more constrained version of these models work better but they do not directly address the difference in conclusion with regards to mechanisms that contribute to SAC process direction selectivity.

      We have expanded the discussion related to mechanisms that contribute to DS in SACs and discuss the differences between our studies.

      Minor point: The authors use the word "probe" to refer to visual stimulus. This is confusing because "probe" is also used to refer to sensors.

      In the revised manuscript, we minimized the usage of ‘probe’ to reference visual stimuli

      Reviewer #3 (Recommendations For The Authors):

      Writing and figure presentations are excellent.

      Thank you!

      References:

      Franke, K., Berens, P., Schubert, T., Bethge, M., Euler, T., & Baden, T. (2017). Inhibition decorrelates visual feature representations in the inner retina. Nature, 542(7642), 439-444. https://doi.org/10.1038/nature21394

      Gaynes, J. A., Budoff, S. A., Grybko, M. J., Hunt, J. B., & Poleg-Polsky, A. (2022). Classical Center-Surround Receptive Fields Facilitate Novel Object Detection in Retinal Bipolar Cells. Nat Commun, 13(1), 5575. https://doi.org/https://doi.org/10.1038/s41467-022-32761-8

      Kim, J. S., Greene, M. J., Zlateski, A., Lee, K., Richardson, M., Turaga, S. C., Purcaro, M., Balkam, M., Robinson, A., Behabadi, B. F., Campos, M., Denk, W., Seung, H. S., & EyeWirers. (2014). Space-time wiring specificity supports direction selectivity in the retina. Nature, 509(7500), 331-336. https://doi.org/10.1038/nature13240

      Matsumoto, A., Agbariah, W., Nolte, S. S., Andrawos, R., Levi, H., Sabbah, S., & Yonehara, K. (2021). Direction selectivity in retinal bipolar cell axon terminals. Neuron. https://doi.org/10.1016/j.neuron.2021.07.008

      Matsumoto, A., Briggman, K. L., & Yonehara, K. (2019). Spatiotemporally Asymmetric Excitation Supports Mammalian Retinal Motion Sensitivity. Curr Biol. https://doi.org/10.1016/j.cub.2019.08.048

      Srivastava, P., de Rosenroll, G., Matsumoto, A., Michaels, T., Turple, Z., Jain, V., Sethuramanujam, S., Murphy-Baum, B. L., Yonehara, K., & Awatramani, G. B. (2022). Spatiotemporal properties of glutamate input support direction selectivity in the dendrites of retinal starburst amacrine cells. Elife, 11. https://doi.org/10.7554/eLife.81533

      Strauss, S., Korympidou, M. M., Ran, Y., Franke, K., Schubert, T., Baden, T., Berens, P., Euler, T., & Vlasits, A. L. (2022). Center-surround interactions underlie bipolar cell motion sensing in the mouse retina. Nat Commun, 13(1), 5574. https://doi.org/https://doi.org/10.1038/s41467-022-32762-7

      Tukker, J. J., Taylor, W. R., & Smith, R. G. (2004). Direction selectivity in a model of the starburst amacrine cell. Vis Neurosci, 21(4), 611-625. http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=15579224

      Wu, J., Kim, Y. J., Dacey, D. M., Troy, J. B., & Smith, R. G. (2023). Two mechanisms for direction selectivity in a model of the primate starburst amacrine cell. Vis Neurosci, 40, E003. https://doi.org/10.1017/S0952523823000019

    1. Author Response

      The following is the authors’ response to the original reviews.

      Thank you for the response and reviews of our manuscript eLife-RP-RA-2023-86638 “Energetics of the Microsporidian Polar Tube Invasion Machinery”. We are grateful for the comments and constructive criticism from all three reviewers, which have helped us to improve our manuscript.

      As a summary to the editor, we here provide a list of the major revisions we have implemented to address all the comments provided by the referees.

      1. We added Supplementary Section A.9 and Figure S4 to explain the details of calculation and have magnified sketches of flow fields.

      2. We clarified the term "required pressure" to "required pressure differences", and explained that the same pressure differences can be achieved by either positive or negative pressure. We invoke the fact that the spore wall buckled inward to deduce that germination is a negative pressure process.

      3. We only rank the hypotheses based on calculation of total energy requirement. The peak pressure and peak power requirement calculations are now just for quantitative reference. The ranking of hypotheses does not change.

      4. We clarified the definition of topological connections in Section "Systematic evaluation of possible topological configurations of a spore," making it explicit that the topological questions listed only involved the "original PT content" (not PT space at all time).

      Thank you again for the opportunity to revise our work. We attach a point-by-point response to the referees below.

      Public Reviews:

      Reviewer #1 (Public Review):

      1. The authors used mathematical models to explore the mechanism(s) underlying the process of polar tube extrusion and the transport of the sporoplasm and nucleus through this structure. They combined this with experimental observations of the structure of the tube during extrusion using serial block face EM providing 3 dimensional data on this process. They also examined the effect of hyperosmolar media on this process to evaluate which model fit the predicted observed behavior of the polar tube in these various media solutions.

      We thank the reviewer for their accurate summary of our work. One subtle point, however, is that we examine the effect of hyperviscous media on the polar tube extrusion process, rather than hyperosmolar media. In Supplementary Section A.6 of our updated manuscript, we have shown that the changes in osmolarity due to methylcellulose is negligible.

      1. Overall, this work resulted in the authors arriving at a model of this process that fit the data (model 5, E-OE-PTPV-ExP). This model is consistent with other data in the literature and provides support for the concept that the polar tube functions by eversion (unfolding like a finger of a glove) and that the expanding polar vacuole is part of this process. Finally, the authors provide important new insights into the buckling of the spore wall (and possible cavitation) as providing force for the nucleus to be transported via the polar tube. This is an important observation that has not been in previous models of this process.

      We thank the reviewer for acknowledging the novelty and importance of our study.

      Reviewer #2 (Public Review):

      1. Microsporidia has a special invasion mechanism, which the polar tube (PT) ejects from mature spores at ultra-fast speeds, to penetrate the host and transfer the cargo to host. This work generated models for the physical basis of polar tube firing and cargo transport through the polar tube. They also use a combination of experiments and theory to elucidate possible biophysical mechanisms of microsporidia. Moreover, their approach also provided the potential applications of such biophysical approaches to other cellular architecture.

      We thank the reviewer for their accurate summary and acknowledging the potential applications on other organisms.

      1. The conclusions of this paper are mostly well supported by data, but some analyses need to be clarified. According to the model 5 (E-OE-PTPV-ExP) in P42 Fig. 6, is the posterior vacuole connected with the polar tube? If yes, how does the nucleus unconnected with the posterior vacuole enter the polar tube?

      As we mentioned in our glossary and detailed in Section "Systematic evaluation of possible topological configurations of a spore", Model 5 requires the "original PT content" (any material inside the PT prior to cargo entering the tube) to permit fluid flow to posterior vacuole and external environment post anchoring disc rupture, but cannot permit fluid flow to the sporoplasm that is transported through the tube. As the the germination process progresses, our model does not require the connection between PT and posterior vacuole to be maintained afterwards, and that creates space allowing sporoplasm (including nucleus) sporoplasm (including nucleus) to enter PT space through fluid entrainment. We have clarified the definitions in Section "Systematic evaluation of possible topological configurations of a spore" and have additional clarification in the caption of Fig. 6 in the updated manuscript.

      1. In Fig. 6, would the posterior vacuole become two parts after spore germination? One part is transported via the polar tube, and the other is still in the spore. I recommend this process requires more experiments to prove.

      According to our Model 5, the membrane connection between PT and posterior vacuole must be broken for the infectious cargo to extrude. However, our current data does not allow us to prove nor disprove the membrane fission event. In theory, the membrane content in PT can potentially be severed into multiple parts by Plateau-Rayleigh instability, an interfacial-tension-driven fluid thread breakup mechanism. Note that it is possible to have membrane fission at the time scale of germination process, as when the time scale of shearing is faster than the viscoelastic time of lipid membranes (roughly 10 msec), membrane fission can happen (Morlot & Roux 2013). For time scale longer than viscoelastic time of lipid membrane, protein complexes like dynamin would be required for membrane fission. Future cryo-EM study of the vacuole-PT connection at the anterior tip (and in the spore as a whole) is needed to clarify the physical process. We added this discussion in Section "Predictions and proposed future experiments".

      Reviewer #3 (Public Review):

      Abstract:

      The paper follows a recent study by the same team (Jaroenlak et al Plos Pathogens 2020), which documented the dramatic ejection dynamics of the polar tube (PT) in microsporidia using live-imaging and scanning electron microscopy. Although several key observations were reported in this paper (the 3D architecture of the PT within the spore, the speed and extent of the ejection process, the translocation dynamics of the nucleus during germination), the precise geometry of the PT during ejection remain inaccessible to imaging, making it difficult to physically understand the phenomenon.

      This paper aims to fill this gap with an indirect "data-driven" approach. By modeling the hydrodynamic dissipation for different unfolding mechanisms identified in the literature and by comparing the predictions with experiments of ejection in media of various viscosities, authors shows that data are compatible with an eversion (caterpillar-like) mechanism but not compatible with a "jack-in-the-box" scenario. In addition, the authors observe that most germinated spores exhibit an inward bulge, which they attribute to buckling due to internal negative pressure and which they suggest may be a mean of pushing the nucleus out of the PT during the final stage of ejection.

      We thank the reviewer for their accurate summary of our work.

      Major strengths:

      Probably the most impressive aspect of the study is the experimental analysis of the ejection dynamics (velocity, ejection length) in medium of various viscosities over 3 orders of magnitudes, which, combined with a modeling of the viscous drag of the PT tube, provides very convincing evidence that the unfolding mechanism is not a global displacement of the tube but rather an apical extension mechanism, where the motion is localized at the end of the tube. The systematic classification of the different unfolding scenarios, consistent with the previous literature, and their confrontation with data in terms of energy, pressure and velocity also constitute an original approach in microbiology where in-situ and real time geometry is often difficult to access.

      We thank the reviewer for acknowledging the novelty and importance of our study.

      Major weaknesses:

      1a. While the experimental part of the paper is clear, I had (and still have) a hard time understanding the modeling part. Overall, the different unfolding mechanisms should be much better explained, with much more informative sketches to justify the dissipation and pressure terms, magnifying the different areas where dissipation occurs, showing the velocity field and pressure field, etc.

      We thank the reviewer for their comments and suggestions. In the Figure S4 and SI Section A.9 of the updated manuscript, we have magnified the sketches with flow field, and added a detailed explanation of the derivations of dissipation terms.

      1b. In particular, a key parameter of eversion models is the geometry of the lubrication layers inside and outside the spore (h_sheath, h_slip). Where do the values of h_sheath and h_slip come from? What is the physical process that selects these parameters?

      As we described in SI Section A.9, h_sheath was set to be 25 nm based on the observed translucent space around PT in activated spores (Lom 1972), and h_slip was set to be 6 nm based on the observed gap thickness between PT and cargo (Takovarian et al. 2020). Although we don't expect these numbers to be the same for each spore, the uncertainty in these two parameters are much less than the uncertainty in cytoplasmic viscosity (which varies several orders of magnitude) and boundary slip length. Our sensitivity testing on cytoplasmic viscosity and boundary slip length thus covers any uncertainty in h_sheath or h_slip already.

      1c. For clarity, the figures showing the unfolding mechanics in the different scenarios should be in the main text, not in the supplemental materials.

      We have added Figure S4 and SI Section A.9 to explain the details of our sketches. We believe, however, putting all the details of the mechanics and how each term is derived in the main text may detract from the flow of the manuscript, and result in it being less accessible to readers who are not as familiar with the physics. We therefore decided to keep this information in supplemental materials.

      2a. The authors compute and discuss in several places "the pressure" required for ejection, but no pressure is indicated in the various sketches and no general "ejection mechanism" involving this pressure is mentioned in the paper.

      In the updated manuscript, we have changed the term “pressure” to “pressure difference” or “required pressure difference”. We did not calculate the detailed pressure field around each structure, but only estimated the required pressure difference to overcome the drag force and drive fluid flow in various spaces. We also clarified this point in Section "Developing a mathematical model for PT energetics".

      Also, as we mentioned in Section “Posterior vacuole expansion and the role of osmotic pressure”, we made no assumptions on how the pressure difference is generated in this paper. The unfolding mechanism of polar tube, how eversion is sustained, and the driving mechanism are ongoing research projects, and we decided not to make premature comments on that without strong support from experiments or simulation results.

      2b. What is this "required pressure" and to what element does it apply?

      The “required pressure” in the manuscript indicates the required pressure difference between the spore and the tip of the polar tube for it to push the tip forward and sustain the fluid flow within the polar tube. In the updated manuscript, we thus changed the term “required pressure” to “required pressure difference”. We also added this clarification to Section "Developing a mathematical model for PT energetics".

      2c. I understand that the article focuses on the dissipation required to the deployment of the PT but I find it difficult to discuss the unfolding mechanism without having any idea on the driving mechanism of the movement. How could eversion be initiated and sustained?

      As we mentioned in Section “Posterior vacuole expansion and the role of osmotic pressure”, we made no assumptions on how the energy, pressure or power is generated in this paper. We agree that the unfolding mechanism of the polar tube, how eversion is sustained, and the driving mechanism are important questions, and these are ongoing research projects. As no assumptions about this are required for our models, we decided not to comment on these aspects without strong support from experiments or simulation results. We have clarified this in Section “Posterior vacuole expansion and the role of osmotic pressure” of the updated manuscript.

      1. Finally, the authors do not explain how pressure, which appears to be a positive, driving quantity at the beginning of the process, can become negative to induce buckling at the end of ejection. Although the hypothesis of rapid translocation induced by buckling is interesting, a much better mechanistic description of the process is needed to support it.

      As discussed in Point 2-b above, the “required pressure” actually means “required pressure difference”. The same pressure difference can possibly be achieved by either positive pressure (the spore has a higher pressure than the ambient, pushing the fluid into PT) or negative pressure (the PT tip has a lower pressure than the ambient, sucking the fluid from the spore). Hydrodynamic dissipation analysis alone cannot tell the differences between positive or negative pressure, as it only tells you the required pressure differences between the spore and the polar tube tip. It will have to be inferred from the implied mechanisms or other evidence. We added these discussions in the 4th paragraph of Section "Developing a mathematical model for PT energetics" in the updated manuscript.

      That being said, from our observations of buckled spore walls, it is still sufficient to deduce that the polar tube ejection process is a negative pressure driven process. For the spore wall to buckle inwards, the ambient pressure has to be higher than the pressure within the spore, but that would contradict with the positive pressure hypothesis as elaborated above. We added these clarifications in the 2nd paragraph of Section "Models for the driving force behind cargo expulsion".

      References:

      Lom, J. (1972). On the structure of the extruded microsporidian polar filament. Zeitschrift Für Parasitenkunde, 38(3), 200–213.

      Takvorian, P. M., Han, B., Cali, A., Rice, W. J., Gunther, L., Macaluso, F., & Weiss, L. M. (2020). An Ultrastructural Study of the Extruded Polar Tube of Anncaliia algerae (Microsporidia). The Journal of Eukaryotic Microbiology, 67(1), 28–44.

      Morlot, S., & Roux, A. (2013). Mechanics of dynamin-mediated membrane fission. Annual Review of Biophysics, 42, 629–649.

      Reviewer #1 (Recommendations For The Authors):

      The work is solid and supported by the experimental data presented, the literature and the biophysical modeling.

      1. The model (Model 5) indicates that the polar tube is connected to the posterior vacuole and that the contents of this vacuole may be transported by the polar tube before the sporoplasm. This needs experimental validation in the future, which will require the identification of posterior vacuole markers (i.e. proteins specific to this structure). I find the topology of this idea difficult to understand. If the polar tube is outside of the sporoplasm membrane then how does it connect to the posterior vacuole? If the expanded posterior vacuole is still in the spore at the end of germination then how does the sporoplasm get out?

      Model 5 requires the "original PT content" (any material inside the PT prior to cargo entering the tube) to permit fluid flow to posterior vacuole and external environment post anchoring disc rupture, but cannot permit fluid flow to sporoplasm. As the germination process progresses, our model does not require the connection between PT and posterior vacuole to be maintained afterwards, and that creates space allowing sporoplasm (including nucleus) to enter PT space through fluid entrainment.

      We agree with the reviewer that the specific predictions from Model 5 need to be experimentally validated in the future, and identification of posterior vacuole markers is a good direction. We have mentioned this in Section "Predictions and proposed future experiments".

      1. I have always thought that the polaroplast was the initial cargo in the polar tube and that this formed the limiting membrane of the sporoplasm and nucleus after passage through the polar tube (i.e., the limiting membrane of the sporont).

      In this manuscript, we only analyze the possible topology of the organelles that are relevant for energy dissipation calculations. Our final hypothesis (E-OE-PTPV-ExP) indicates that there is a limiting membrane of the infectious cargo as they pass through PT, but the energy calculation cannot tell you where this membrane comes from. That being said, our final hypothesis is consistent with the common belief that polaroplast provides the limiting membrane of the sporoplasm, even though our analysis neither proved nor disproved it.

      1. I understand that the model indicates that during eversion the end of the PT moves away from the posterior vacuole allowing the sporoplasm access to the PT lumen, however, I am not clear how this process occurs (although I understand the reason that this model was the best fit for the available data). Does the model distinguish between connected (as in the PV is in the polar tube lumen) to the idea of it being in proximity (i.e. the PT is at the PV at the start of eversion)?

      As we mentioned in our reply to Point 1 of the same reviewer above, "connectivity" simply means whether fluid flow is permitted across the end connections among organelles and sub-spaces within the spores. For Model 5, the content of posterior vacuole can pass to the original PT content and to the external environment post anchoring disc disruption through fluid flow, but not to sporoplasm. However, as the germination progresses, the PT does not have to maintain its spatial proximity or membrane connection to posterior vacuole, as the topological connectivity questions are pertaining to the "original PT content". We clarified this point in Section "Systematic evaluation of possible topological configurations of a spore" in the updated manuscript.

      Reviewer #2 (Recommendations For The Authors):

      1. The connection of polar tube and posterior vacuole need to be analyzed by Cryo -EM.

      We thank the reviewer for their comments. This work is underway.

      Reviewer #3 (Recommendations For The Authors):

      1a. As stated in the public review, the explanation and description of the unfolding mechanism should be much better described and associated with clear sketches, magnifying all the areas where the flow shear rate is concentrated (surrounding zone, lubrication inside and outside the spore, etc) and drawing the velocity field, the boundary solid motion and pressure distribution in order to clearly understand, for each model, the dissipation and pressure terms given in figs. S2 and S3.

      In the updated manuscript, we added Figure S4 to enlarge all the regions where fluid shear is considered, with sketches of velocity fields.

      1b. This is particularly important for explaining the eversion models (see comment in the Public Review) but even the "jack-in-the-box" model sketched in Fig. S2 is confusing: Why does the blue tube disappear outside the spore? What happens to the tube in this case?

      The blue tube in the sketch of Model 1 in Fig. S2 is the fluid between the two outermost layers of PT, not the PT itself. We have clarified that in the newly added Fig. S4.

      1. Many ejection mechanisms based on the deployment of invaginated appendages have been described in the literature (e.g. Zuckerkandl Biol. Bull. 1950, Karabulut et al Nat. Com. 2022) and also mimicked for robotic applications (e.g. Hawkes et al Science Robotics 2017). Although this is not the main topic of the paper, it would be very useful if the authors could discuss in the introduction the most acceptable theory for motion generation (eversion driven by an overpressure in the spore?). In the current version, this comes too late in the discussion.

      As we discussed in Section “Lack of biophysical models explaining the microsporidian infection process”, PT eversion is the most widely accepted hypothesis because of experimental evidence (e.g. microscopic observations of PT extrusions, and pulse-labeling of half-ejected tubes). However, whether or not it is driven by an overpressure in the spore remains controversial. In fact, our observations of inwardly buckled spores indicates that the ejection process likely involves negative pressure.

      In our work, we thus take a data-driven approach to generate models for the physical basis of PT extrusion process, without immediately assuming that eversion is the correct hypothesis. It would therefore not make sense to have elaborated discussion on other eversion mechanisms in Introduction.

      1. About the physical constraints, I understand that the stored energy must be the same when the viscosity is changed (by conservation of energy), but what physical basis do you have for requiring that the power and pressure also be the same (lines 295-298)? For e.g. when a spring is stretched and released in a very viscous fluid without inertia, the total energy dissipated is the same whatever the viscosity but the power is not the same. The formulation of the chosen physical constraints should be better justified.

      We thank the reviewer for their feedback. In our updated manuscript, we only use total energy requirement for the ranking, and the peak pressure difference requirement and peak power requirements are calculated just for quantitative reference. The ranking of the 5 hypotheses does not change.

      1. About the mechanism for cargo translocation, authors should explain the physical origin of the hypothetical negative pressure. How could the initial positive pressure become negative?

      As we mentioned in our reply to Point 3 of the same reviewer in the public review, the “required pressure” actually means “required pressure difference”. The same pressure difference can possibly be achieved by either positive pressure (the spore has a higher pressure than the ambient, pushing the fluid into PT) or negative pressure (the PT tip has a lower pressure than the ambient, sucking the fluid from the spore). Hydrodynamic dissipation analysis alone cannot tell the differences between positive or negative pressure, as it only tells you the required pressure differences between the spore and the polar tube tip. It will have to be inferred from the implied mechanisms or other evidence. We added these discussions in the 4th paragraph of Section "Developing a mathematical model for PT energetics" in the updated manuscript.

      That being said, from our observations of buckled spore walls, it is still sufficient to deduce that the polar tube ejection process is a negative pressure driven process. For the spore wall to buckle inwards, the ambient pressure has to be higher than the pressure within the spore, but that would contradict with the positive pressure hypothesis as elaborated above. We added these clarifications in the 2nd paragraph of Section "Models for the driving force behind cargo expulsion".

      More minor comments:

      1. The videos are amazing but it is not clear if the PT is ejected through a bulk fluid or if the spores (and ejected PT) are in contact with a solid.

      As described in Supplementary Section A.6, purified spores were spotted on a coverslip and let water evaporate. 2.0 μL of germination buffer (10 mM Glycine-NaOH buffer pH 9.0 and 100 mM KCl) with different concentration (0%, 0.5%, 1%, 2%, 3%, 4%) of methylcellulose was added to the slide and place the coverslip on top. So the spore is attached to the coverslip and ejected through a bulk liquid of germination buffer.

      1. S2 caption: please be precise that H is the Heaviside step function.

      We have updated the captions for both Figure S2 and S3 to make it explicit.

      1. Line 233 a pi is missing, no?

      We thank the reviewer for their careful read. We have corrected that.

      1. The notations are quite unfortunate and confusing. In fluid mechanics capital D usually refers to the dissipation, capital C to the drag coefficient. It would be much clearer to call D the dissipation power (in Watt) and P the pressure requirement (in Pa), whatever the mechanism and put the different contribution (drag, lubrication, cytoplasm flow) in subscript.

      We thank the reviewer for their feedback. The notation of this paper is challenging as there are many symbols while keeping everything relatively intuitive to both people with biology background and physics background. We will keep these feedback in mind in our future work.

      1. Fig S2: what is D (in the formula of the total dissipation power)? Why not use R instead?

      D is the PT diameter, as we mentioned in the caption. We keep that as it is used in the definition of the shape factor.

      1. Fig S3 why the pressure requirement for the "jack-in-the-box" hypothesis is 2\mu (vLf(epsilon)/R^2)?

      We have now elaborated the calculation in SI Section A.9.

      1. Lines 486-497: Although shear thinning fluids have their viscosity that decreases with the shear rate, in most cases the resistance (stress) still increases with speed with these fluids. Is mucin a "velocity-weakening" fluid, i.e. a fluid in which stress decreases when shear rate increases.

      We agree that stress still increases with speed for most shear thinning fluids. The mechanical properties of mucin solution strongly depend on its compositions and buffers. In our discussion, we thus simply mention this possibility without claiming whether mucin (or other biopolymer environment that microsporidia species actually experience in vivo) is a velocity-weakening fluid or not.

    1. Author Response

      The following is the authors’ response to the original reviews.

      We thank the reviewers and editors for their time and careful consideration of this study. Nearly every comment proved to be highly constructive and thoughtful, and as a result, the manuscript has undergone major revisions including the title, all figures, associated conclusions and web app. We feel that the revised resource provides a more systematic and comprehensive approach to correlating inter-individual transcript patterns across tissues for analysis of organ cross-talk. Moreover, the manuscript has been restructured to highlight utility of the web tool for queries of genes and pathways, as opposed to focused discrete examples of cherry-picked mechanisms. A few key revisions include:

      • Manuscript: All figures have been revised to place to explore broad pathway representation. These analyses have replaced the previous circadian and muscle-hippocampal figures to emphasize ability to recapitulate known physiology and remove the discovery portion which has not been validate experimentally.

      • Manuscript: The term “genetic correlation” or “genetically-derived” has been replaced throughout with “transcriptional”, “inter-individual”, or mostly just “correlations”.

      • Manuscript: A new figure (revised fig 2) has been added to evaluate the innate correlation structure of data used for common metabolic pathways, in addition an exploration of which tissues generally show more co-correlation and centrality among correlations.

      • Manuscript: A new figure (revised fig 4) has been added to highlight the utility of exploring gene ~ trait correlations in mouse populations, where controlled diets can be compared directly. These highlight sex hormone receptor correlations with the large amount of available clinical traits, which differ entirely depending on the tissue of expression and/or diet in mouse populations.

      • Web tool: Addition of a mouse section to query expression correlations among diverse inbred strains and associated traits from chow or HFHS diet within the hybrid mouse diversity panel.

      • Web tool: Overrepresentation analysis for pathway enrichments have been replaced with score-based gene set enrichment analyses and including network topology views for GSEA outputs.

      • Web tool: Associated github repository containing scripts for apps now include a detailed walk-through of the interface and definitions for each query and term.

      Public Reviews:

      Reviewer #1 (Public Review):

      Zhou et al. have set up a study to examine how metabolism is regulated across the organism by taking a combined approach looking at gene expression in multiple tissues, as well as analysis of the blood. Specifically, they have created a tool for easily analyzing data from GTEx across 18 tissues in 310 people. In principle, this approach should be expandable to any dataset where multiple tissues of data were collected from the same individuals. While not necessary, it would also raise my interest to see the "Mouse(coming soon)" selection functional, given that the authors have good access to multi-tissue transcriptomics done in similarly large mouse cohorts.

      Summary

      The authors have assembled a web tool that helps analyze multiple tissues' datasets together, with the aim of identifying how metabolic pathways and gene regulation are connected across tissues. This makes sense conceptually and the web tool is easy to use and runs reasonably quickly, considering the size of the data. I like the tool and I think the approach is necessary and surprisingly under-served; there is a lot of focus on multi-omics recently, but much less on doing a good job of integrating multi-tissue datasets even within a single omics layer.

      What I am less convinced about is the "Research Article" aspect of this paper. Studying circadian rhythm in GTEx data seems risky to me, given the huge range in circadian clock in the sample collection. I also wonder (although this is not even remotely in my expertise) whether the circadian rhythm also gets rather desynchronized in people dying of natural causes - although I suppose this could be said for any gene expression pathway. Similarly for looking at secreted proteins in Figure 4 looking at muscle-hippocampus transcript levels for ADAMTS17 doesn't make sense to me - of all tissue pairs to make a vignette about to demonstrate the method, this is not an intuitive choice to me. The "within muscle" results look fine but panels C-E-G look like noise to me...especially panel C and G are almost certainly noise, since those are pathways with gene counts of 2 and 1 respectively.

      I think this is an important effort and a good basis but a significant revision is necessary. This can devote more time and space to explaining the methodology and for ensuring that the results shown are actually significant. This could be done by checking a mix of negative controls (e.g. by shuffling gene labels and data) and a more comprehensive look at "positive" genes, so that it can be clearly shown that the genes shown in Fig 1 and 2 are not cherry-picked. For Figure 3, I suspect you would get almost an identical figure if instead of showing pan-tissue circadian clock correlations, you instead selected the electron transport chain, or the ribosome, or any other pathway that has genes that are expressed across all tissues. You show that colon and heart have relatively high connectivity to other tissues, but this may be common to other pathways as well.

      Response: We are thankful to the reviewer in their detailed assessment of the manuscript. The comments raised in both the public and suggested reviews clearly improved the revised study and helped to identify limitations. In general, we have removed data suggesting “discovery” using these generalized analyses, such as removing figures evaluating circadian rhythm genes and muscle-hippocampus correlations. These have been replaced with more thorough investigations of tissue correlation structure and potentially identified regions of data sparsity which are important for users to consider. Also, we have added a similar full detailed pipeline of mouse (HMDP) data and highlighted in the manuscript by showing transcript ~ trait correlations of sex hormone receptor genes which differ between organs and diets. Further responses to individual points are also provided below.

      Reviewer #2 (Public Review):

      Summary:

      Zhou et al. use publicly available GTEx data of 18 metabolic tissues from 310 individuals to explore gene expression correlation patterns within-tissue and across-tissues. They detect signatures of known metabolic signaling biology, such as ADIPOQ's role in fatty acid metabolism in adipose tissue. They also emphasize that their approach can help generate new hypotheses, such as the colon playing an important role in circadian clock maintenance. To aid researchers in querying their own genes of interest in metabolic tissues, they have developed an easy-to-use webtool (GD-CAT).

      This study makes reasonable conclusions from its data, and the webtool would be useful to researchers focused on metabolic signaling. However, some misconceptions need to be corrected, as well as greater clarification of the methodology used.

      Strengths:

      GTEx is a very powerful resource for many areas of biomedicine, and this study represents a valid use of gene co-expression network methodology. The authors do a good job of providing examples confirming known signaling biology as well as the potential to discover promising signatures of novel biology for follow-up and future studies. The webtool, GD-CAT, is easy to use and allows researchers with genes and tissues of interest to perform the same analyses in the same GTEx data.

      Weaknesses:

      A key weakness of the paper is that this study does not involve genetic correlations, which is used in the title and throughout the manuscript, but rather gene co-expression networks. The authors do mention the classic limitation that correlation does not imply causation, but this caveat is even more important given that these are not genetic correlations. Given that the goal of their study aligns closely with multi-tissue WGCNA, which is not a new idea (e.g., Talukdar et al. 2016; https://doi.org/10.1016/j.cels.2016.02.002), it is surprising that the authors only use WGCNA for its robust correlation estimation (bicor), but not its latent factor/module estimation, which could potentially capture cross-tissue signaling patterns. It is possible that the biological signals of interest would be drowned out by all the other variation in the data but given that this is a conventional step in WGCNA, it is a weakness that the authors do not use it or discuss it.

      Response: Thank you for the helpful and detailed suggestions regarding the study. The review raised some important points regarding methodological interpretations (ex. bicor-exclusive application as opposed to module-based approaches), as well as clarification of “genetic” inferences throughout the study. The comparison to module-based approaches has also now been discussed directly, pointing our considerations and advantages to each. We hope that the reviewer with our corrections to the misconceptions posed, many of which we feel were due to our insufficient description of methodological details and underlying interpretations. The revised manuscript, web portal and associated github provide much more detail and many more responses to specific points are provided below.

      Reviewer #3 (Public Review):

      Summary: A useful and potentially powerful analysis of gene expression correlations across major organ and tissue systems that exploits a subset of 310 humans from the GTEx collection (subjects for whom there are uniformly processed postmortem RNA-seq data for 18 tissues or organs). The analysis is complemented by a Shiny R application web service.

      The need for more multisystems analysis of transcript correlation is very well motivated by the authors. Their work should be contrasted with more simple comparisons of correlation structure within different organs and tissues, rather than actual correlations across organs and tissues.

      Strengths and Weaknesses: The strengths and limitations of this work trace back to the nature of the GTEx data set itself. The authors refer to the correlations of transcripts as "gene" and "genetic" correlations throughout. In fact, they name their web service "Genetically-Derived Correlations Across Tissues". But all GTEx subjects had strong exposure to unique environments and all correlations will be driven by developmental and environmental factors, age, sex differences, and shared and unshared pre- and postmortem technical artifacts. In fact we know that the heritability of transcript levels is generally low, often well under 25%, even studies of animals with tight environmental control.

      This criticism does not comment materially detract for the importance and utility of the correlations-whether genetic, GXE, or purely environmental-but it does mean that the authors should ideally restructure and reword text so as to NOT claim so much for "genetics". It may be possible to incorporate estimates of chip heritability of transcripts into this work if the genetic component of correlations is regarded as critical (all GTEx cases have genotypes).

      Appraisal of Work on the Field: There are two parts to this paper: 1. "case studies" of cross-tissue/organ correlations and 2. the creation of an R/Shiny application to make this type of analysis much more practical for any biologist. Both parts of the work are of high potential value, but neither is fully developed. My own opinion is that the R/Shiny component is the more important immediate contribution and that the "case studies" could be placed in the context of a more complete primer. Or Alternatively, the case studies could be their own independent contributions with more validation.

      Response: We thank the reviewer for their supportive and helpful comments. The discussion of usage of the term “genetic” has been removed entirely from the manuscript as this point was made by all reviewers. Further, we have revised the previous study to focus on more detailed investigations of why transcript isoforms seemed correlated between tissues and areas where datasets are insufficient to provide sufficient information (ex. Kidney in GTEx). As the reviewer points out, the previous “case studies” were unvalidated and incomplete and as a result, have been replaced. Additional points below have been revised to present a more comprehensive analyses of transcript correlations across tissues and improved web tool.

      (Recommendations For The Authors):

      As this manuscript is focused on the analytical process rather than the biological findings, the reviewer concerns are not a fundamental issue to subsequent acceptance of the paper, but some of the examples will need to be replaced or double-checked to ensure their biological and statistical relevance. To raise the scope and interest of the method developed, it would be seen very positively to include additional datasets, as the authors seem to have intended to have done, with a non-functional (and highlighted as such) selection for mouse data. Establishing that the authors can easily - and will easily - add additional datasets into their tool would greatly raise the reviewers' confidence in the methodology/resource aspect of this paper. This may also help address the significant concerns that all three reviewers raised with the biological examples, e.g. that GTEx data is so uncontrolled that studying environmentally-influenced traits such as circadian rhythm may be challenging or even impossible to do properly. Adding in a more highly controlled set of cross-tissue mouse data may be able to address both these concerns at once, i.e. the resource concern (can the website easily be updated with new data) and the biological concern (are the results from these vignettes actually statistically significant).

      Reviewer #1 (Recommendations For The Authors):

      Comments, in approximately reverse order of importance

      1. Some figure panels are not referenced in the text, e.g. Fig 1B and Figure 2E. Response: Thank you for pointing this out. We have revised every figure in the manuscript and additionally gone through to make sure every panel is referenced in the text.

      2. The authors mention "genetic data" several times but I don't see anything about DNA. By "genetic data" do you mean "transcriptome expression data," or something else?

      Response: This is an important point, also raised by all 3 reviewers. We have clarified in the abstract, results and discussion that correlations are between transcripts. As a result, all mentions of “genetics” or “genetic data” has been removed, with the exception of introducing mouse genetic reference panels.

      1. For Figure 3, the authors look at circadian clock data, but the GTEx data is from all sorts of different times of day from across the patient cohort depending on when the donor died, and I don't see this metadata actually mentioned anywhere. I see Arntl Clock and all the other circadian genes are highly coexpressed in each tissue (except not so strong in liver) but correlation across tissue seems more random. Also hypothalamus seems to be very strongly negatively correlated with spleen, but this large green block doesn't have significance? That is surprising to me, since the sample sizes are all equivalent I would expect any correlation remotely close to -1.0 to be highly significant.

      Response: The reviewer raises several important points with regard to the source of data and underlying interpretations. We have added a revised Fig 2, suggesting that representation of gene expression between tissues can be strongly biased by nature of samples (ex. differences in data that is available for each tissue) and also discussed considerations of the nature of sample origin in the limitations section. We have also used some of these points when introducing rationale for using mouse population data. As a result of comments from this reviewer and others, we have removed the circadian rhythm analysis and muscle-hippocampal figures from the revised study; however, specifically mentioned these cohort differences in the discussion section (lines 294-298). Circadian rhythm terms are also evaluated in Fig 2 and consistent with the reviewers concerns, less overall correlations are observed between transcripts across tissues when compared to other common GO terms assessed.

      1. Figure 4, this is all transcript-level data, so it is confusing to see protein nomenclature used, e.g. "expression of muscle ADAMTS17" should be "expression of muscle ADAMTS17" (ADAMTS17 the transcript should be in italics, in case the formatting is removed by the eLife portal). Same for FNDC5. In the figures you do have those in italics, so it is just an issue in the manuscript text. In general please look through the text and make sure whether you are referring really to a "gene," "transcript," or "protein." For instance, Figure 1 legend I think should be "A, All transcripts across the ... with local subcutaneous and muscle transcript expression." I know people still sometimes use "gene expression" to refer to transcripts, but now that proteomics is pretty mainstream, I would push for more careful vocabulary here.

      Response: Thank you for pointing these out. While we have replaced Fig 4 entirely as to limit the unvalidated discovery or research aspects of the paper, we have gone through the text and figures to check that the correct formatting is used for references to human genes (capitalized italics) or the newly-included mouse genes (lower-case italics).

      1. "Briefly, these data were filtered to retain genes which were detected across individuals where individuals were required to show counts > 0 in 1.2e6 gene-tissue combinations across all data." I don't quite understand the filtering metric here - what is 1.2 million gene-tissue combinations referring to? 20k genes times 18 tissues times 310 people is ~100 million measurements, but for a given gene across 310 people * 18 tissues that is only ~6000 quantifications per gene.

      Response: We apologize for this oversight, as the numbers were derived from the whole GTEx dataset in total and not the tissues used for the current study. We have clarified this point in the revised manuscript (methods section in Datasets used) and also removed confusing references to specific numbers of transcripts and tissues unless made clear.

      1. Generally I think your approach makes sense conceptually but... for the specific example used in e.g. figure 4, this only makes sense to me if applied to proteins and not to transcripts. Looking at the transcript levels per tissue for genes which are secreted could be interesting but this specific example is confusing, as is the tissue selected. I would not really expect much crosstalk between the hippocampus and the muscle, especially not in terms of secreted proteins.

      Response: This is a valid point, also raised by other reviewers. While we wanted to highlight the one potentially-new (ADAMTS7) and two established proteins (FNDC5 and ERFE) and their correlations, the fact that this direct circuit remains to be validated led us to replace the figure entirely. The point raised about inference of protein secretion compared to action; however, has been expanded upon in the results and discussion. We now show that complexities arise when using this approach to infer mechanisms of proteins which are primarily regulated post-transcriptionally. We provide a revised Supplemental Fig 4 showing that this general framework, when applied to expression of INS (insulin), almost exclusively captured pathways leading to its secretion and not action.

      1. It's not clear to me how correction for multiple testing is working in the analyses used in this manuscript. You mention q-values so I am sure it was done, I just don't see the precise method mentioned in the Methods section.

      Response: We apologize for this oversight and have included a specific mention of qvalue adjustment using BH methods, where our reasoning was the efficiency in run-time (compared to other qvalue methods). In addition, we provide a revised Fig 2 which suggests that innate correlation structure exists between tissues for a variety of pathways which should be considered. We also compare several empirical bicor pvalues and qvalue adjustments directly between these large pathways where much of the innate tissue correlation structure does appear present when BH qvalue adjustments are applied (revised Fig 2A).

      1. The piecharts in Figure 1 are interesting - I would actually be curious which tissues generally have closer coexpression. This would be an absolutely massive number of pairwise correlations to test, but maybe there is a smarter way to do it? For instance, for ADIPOQ, skeletal muscle has the best typical correlation, but would that be generally true just that many adipose genes have closer relationship between the two tissues?

      Response: This comment inspired us to perform a more systematic query of global gene-gene correlation structures, which is now shown as the revised Fig 2A. With respect to ADIPOQ, the reviewer is correct in that there does appear to be a general pattern of muscle genes showing stronger correlation with adipose genes. We emphasize and discuss there in the revised manuscript to point out that global trends of tissue correlation structure should be taken into account when looking at specific genes. Much of this innate co-correlation structure could be normalized by the BH qvalue adjustment (above); however, strongly correlated pathways like mitochondria showed selective patterns throughout thresholds (revised Fig 2A). Further, we analyze KEGG terms and general correlation structures (revised Fig 2B) to point out the converse, that some tissues are just poorly represented. Interpretation of correlated genes from these organ and pathway combinations should be especially considered in the framework that their poor representation in the dataset clearly impacted the global correlation structures. We have added these points to both results and discussion. In sum, we feel that this was a critical point to explore and attempted to provide a framework to identify/consider in the revised manuscript.

      1. The pathway enrichments in Figure 1 are more difficult for me to interpret, e.g. for ADIPOQ, the scWAT pathways make sense, but the enriched skeletal muscle pathways are less clearly relevant (rRNA processing?? Not impossible but no clear relevance either). What are the significances for these pathway enrichments? Is it even possible to select a gene that has no peripheral pathway enrichment, e.g. if you take some random Gm#### or olfactory receptor gene and run the analysis, are you also going to see significant pathways selected, as pathway enrichment often has a trend to overfit? The "within organ" does seem to make sense, but I am also just looking at 4 anecdotes here and it is unclear whether they are cherry picked because they did make sense. That is, it's unclear why you selected ADIPOQ and not APOE or HMGCR or etc. I also don't figure out how I can make these pathway enrichment plots using your website. I do get the pie chart but when I try the enrichment analysis block (NB: typo on your website, it says "Enrich-E-ment Analysis" with an extra E) I always get that "the selected tissue do not contain enough genes to generate positive the enrichment." (Also two typos in that phrase; authors should check and review extensively for improvements to the use of English.) After trying several genes I eventually got it to work. I think there is some significant overfitting here, as I am pretty sure that XIST expression in the white adipose tissue has nothing to do with olfactory signalling pathways, which are the top positive network (but with an n = 4 genes).

      Response: Several good points within this comment. 1) the pathway enrichments have been revised completely. The reviewer provided a helpful suggestion of a rank-based approach to query pathways, as opposed to the previous over-representation tests. After evaluating several different pathway enrichment tools based on correlated tissue expression transcripts, a rank- and weight-based test (GSEA) captured the most physiologic pathways observed from known actions of select secreted proteins. Therefore, revised pathway enrichments and web-tool queries unitize a GSEA approach which accounts for the rank and weight determined by correlation coefficient. In implementing these new pathway approaches, we feel that pathway terms perform significantly better at capturing mechanisms. 2) With respect to the selection genes, we wanted to provide a framework for investigating genes which encode secreted proteins that signal as a result of the abundance of the protein alone. This is a group-bias; however, and not necessarily reflective of trying to tackle the most important physiologic mechanisms underlying human disease. We agree with the reviewer in those evaluating genes such as APOE and cholesterol synthesis enzymes present an exciting opportunity, our expertise in interpretation and mechanistic confirmation is limited. 3) We have gone through the revised manuscript and attempted to correct all grammatical and/or spelling mistakes.

      1. The network figures I get on your website look actually more interesting than the ones you have in Figure 2, which only stay within a tissue. Making networks within a tissue is pretty easy I think for any biologist today, but the cross-tissue analysis is still fairly hard due to the size of the datasets and correlation matrices.

      Response: We greatly appreciate the reviewer’s enthusiasm for the network model generation aspect. We have tried to improve the figure generation and expanded the gene size selection for network generation in the web tool, both within and across tissues. We are working toward allowing users to select specific pathway terms and/or tissue genes to include in these networks as well, but will need more time to implement.

      1. I get a bug with making networks for certain genes, e.g. XIST - Liver does not work for plotting network graphs. Maybe XIST is a suppressed gene because it has zero expression in males? It is an interesting gene to look at as a "positive control" for many analyses, since it shows that sample sexing is done correctly for all samples.

      Response: The reviewer recognized a key consideration in underlying data structure for GTEx. In the revised manuscript, we evaluated tissue representation (or lack thereof) being a crucial factor in driving where significant relationships cannot be observed in tissues such as kidney, liver and spleen (Fig 2). Moreover, the representation of females (self-reported) in GTEx is less-than half of males (100 compared to 210 individuals). We have emphasized this point in the discussion where we specifically pointed out the lack of XIST Liver correlation being a product of data structure/availability and not reflecting real biologic mechanisms. We expanded on this point by highlighting the clear sex-bias in terms of representation.

      1. On the network diagram on your website, there doesn't seem to be any way to zoom in on the website itself? You can make a PDF which is nice but the text is often very small and hard to read.

      Response: We have revised the web interface plot parameters to create a more uniform graph.

      1. On a related note, is it possible to output the raw data and gene lists for the network graph? I would want to know what are those genes and their correlation coefficient.

      Response: We have enabled explore as .pdf or .svg graphics for the network and all plots. In addition, following pie chart generation at the top of the web app, users now have the ability to download a .csv file containing the bicor coefficients, regression pvalues and adjusted qvalues for all other gene-tissue combinations.

      1. Some functionality issues, e.g. on the "Scatter plot" block, I input a gene name again here. Shouldn't this use the same gene selected already at the top of the page? It seems confusing to again select the gene and tissue here, but maybe there is a reason for that.

      Response: It would be more intuitive to only display genes from a given selected tissue for scatterplots; however, we chose to keep all possible combinations with the [perhaps unnecessary] option of reselecting a tissue to allow users to query any specific gene without having to wait to run the pathways for all that correspond to a given tissues.

      1. Figure 4H should also probably be Figure 1A.

      Response: Good point, the revised Fig 1A is now a summary of the web tool

      I realize I have written a fairly critical review that will require most of the figures to be redone, but I think the underlying method is sound and the implementation by and end-user is quite simple, so I think your group should have no trouble addressing these points.

      Response: Your comments were really helpful and we feel that the tool has significantly improved as a result. So, we are thankful to the time and effort put toward helping here.

      Reviewer #2 (Recommendations For The Authors)

      Comments on the use of "genetic correlation"

      • The use of "genetic correlation" in title and throughout the manuscript is misleading. Should broadly be replaced with "gene expression correlation". Within genetics, "genetic correlation" generally refers to the correlation between traits due to genetic variation, as would be expected under pleiotropy (genetic variation that affects multiple traits). Here, I think the authors are somewhat conflating "genetic" (normally referring to genetic variation) with "gene" (because the data are gene expression phenotypes). I don't think they perform any genetic analysis in the manuscript. I hope I don't sound too harsh. I think the paper still has merit and value, but it is important to correct the terminology.

      Response: This was an important clarification raised by all reviewers. We apologize for the oversight. As a result, all mentions of “genetics” or “genetic data” has been removed, with the exception of introducing mouse genetic reference panels. These have generally been replaced with “transcript correlations”, “correlations” or “correlations across individuals” to avoid confusion.

      • The authors note an important limitation in the Discussion that correlations don't imply a specific causal model between two genes, and furthermore note that statistical procedures (mediation and Mendelian randomization) are dependent on assumptions and really only a well-designed experiment can completely determine the relationship. This is a very important point that I greatly appreciate. I think they could even further expand this discussion. The potential relationships between gene A and gene B are more complex than causal and reactive. For example, a genetic variant or environmental exposure could regulate a gene that then has a cascade of effects on other genes, including A and B. They belong to a shared causal pathway (and are potentially biologically interesting), but it's good to emphasize that correlations can reflect many underlying causal relationships, some more or less interesting biologically.

      Response: We thank the reviewer for pointing this out. We have expanded both the results and discussion sections to mention specifically how correlation between two genes can be due to a variety of parameters, often and not just encompassing their relationship. We mention the importance of considering genetic and environmental variables in these relationships as well which we feel will be an important “take-home message” for the reader. These points were also explored in the revised Fig 2 in terms of investigating broad pathway gene-gene correlation structures. As noted by the reviewer, contexts such as circadian rhythm or other variables in the data which are not fixed show much less overall significance in terms of broad relationships across organs.

      • It would be good for the authors to provide more context for the methods they use, even when they are fully published. For example, stating that biweight midcorrelation (bicor) is an approach for comparing to variables that is more robust to outliers than traditional correlations and is commonly used with gene co-expression correlation.

      Response: Thank you for pointing this out. A lack of method description was also an important reason for lack of clarity on other aspects so we have done our best to detail what exact approaches are being implemented and why. In the revised manuscript, we mention the usage if bicor values to limit influence of outlier individuals in driving regressions, but also point out that it is still a generalized linear model to assess relationships. We hope that the revised methods and expanded git repositories which detail each analysis provide much more transparency on what is being implemented.

      • Performing a similar analysis based on genetic correlation is an interesting idea, as it would potentially simplify the underlying causal models (removing variation that doesn't stem from genetic variants). I don't expect the authors to do this for this paper because it would be a significant amount of work (fitting and testing genetic correlations are not as straightforward). But still, an interesting idea to think about, and individuals in GTEx are genotyped I believe. Could be mentioned in the Discussion.

      Response: Absolutely. While we did not implement and models of genetic correlation (despite misusing the term) in this analysis. We have added to the discussion on how when genetic data is available, these approaches offer another way to tease out potentially causal interactions among the large amount of correlated data occurring for a variety of reasons.

      Comments on use of the term "local" and "regression"

      • "Local" is largely used to mean within-tissue, so how correlated gene X in tissue Y is with other genes in tissue Y. I think this needs to be defined explicitly early in the manuscript or possibly replaced with something like "within-tissue".

      Response: We have replaced al “local” mentions with “within-tissue” or simply name the tissue that the gene is expressed to avoid confusion with other terms of local (ex a transcript in proximity to where it is encoded on the genome).

      • "Regression" is also used frequently throughout, often when I think "correlation" would be more accurate. It's true that the regression coefficient is a function of the correlation between X and Y, but I don't think actual regression (the procedure) applies here. The coefficients being used are bicor, which I don't think relates as cleanly to linear regression.

      Response: Thank you for pointing this out. A lack of method description was also an important reason for lack of clarity on other aspects so we have done our best to detail what exact approaches are being implemented and why. In the revised manuscript, we mention the usage if bicor values to limit influence of outlier individuals in driving correlations, but also point out that it is still a generalized linear model to assess relationships. Further, we have removed usage of “regression” when referencing bicor values. We hope that the revised methods and expanded git repositories which detail each analysis provide much more transparency on what is being implemented.

      • "Further, pan-tissue correlations tend to be dominated by local regressions where a given gene is expressed. This is due to the fact that within-tissue correlations could capture both the regulatory and putative consequences of gene regulation, and distinguishing between the two presents a significant challenge" (lines 219-223). This sentence includes both "local" and "regressions" (and would be improved by my suggested changes I think), but I also don't fully understand the argument of "regulatory and putative consequences". I think the authors should elaborate further. In the examples, the within-tissue correlations do look stronger, suggesting within-tissue regulation that is quite strong and potentially secondary inter-tissue regulation. If that's the idea, I think it can be stated more clearly.

      Response: Thank you for pointing this out. We have revised the sentence to state the following:

      Further, many correlations tend to be dominated by genes expressed within the same organ. This could be due to the fact that, within-tissue correlations could capture both the pathways regulating expression of a gene, as well as potential consequences of changes in expression/function, and distinguishing between the two presents a significant challenge. For example, a GD-CAT query of insulin (INS) expression in pancreas shows exclusive enrichments in pancreas and corresponding pathway terms reflect regulatory mechanisms such as secretion and ion transport (Supplemental Fig 4).

      We feel that this point might not be intuitive, so have included a new figure (Supplemental Fig 4) which contains the tissue correlations and pathways for INS expression in pancreas. These analyses show an example where co-correlation structure seems almost entirely dominated by genes within the same organ (pancreas) and GSEA enrichments highlight many known pathways which are involved in regulating the expression/secretion of the gene/protein. We hope that this makes the point more clearly to the reader.

      Additional comments on Results:

      • I would break the titled Results sections into multiple paragraphs. For example, the first section (lines 84-129) has a few natural breakpoints that I noticed that would potentially make it feel less over-whelming to the reader.

      Response: We have broken up the results section into separate paragraphs in the revised manuscript. In addition, we have gone through to try and make sure that the amount of information per block/sentence focuses on key points.

      • "Expression of a gene and its corresponding protein can show substantial discordances depending on the dataset used" (line 224 of Results). This is a good point, and the authors could include citations here of studies that show discordance between transcripts and proteins, of which there are a good number. They could also add some biological context, such as saying differences could reflect post-translational regulation, etc.

      Response: Thank you for the supportive comment. We have referenced several comprehensive reviews of the topic, each of which contain tables summarizing details of mRNA-protein correlation. The revised discussion sentence is as follows:

      Expression of a gene and its corresponding protein can show substantial discordances depending on the dataset used. These have been discussed in detail39–41, but ranges of co-correlation can vary widely depending on the datasets used and approaches taken. We note that for genes encoding proteins where actions from acute secretion grossly outweigh patterns of gene expression, such as insulin, caution should be taken when interpreting results. As the depth and availability of tissue-specific proteomic levels across diverse individuals continues to increase, an exciting opportunity is presented to explore the applicability of these analyses and identify areas when gene expression is not a sufficient measure.

      1. Liu, Y., Beyer, A. & Aebersold, R. On the Dependency of Cellular Protein Levels on mRNA Abundance. Cell 165, 535–550 (2016).

      2. Maier, T., Güell, M. & Serrano, L. Correlation of mRNA and protein in complex biological samples. FEBS Letters 583, 3966–3973 (2009).

      3. Buccitelli, C. & Selbach, M. mRNAs, proteins and the emerging principles of gene expression control. Nat Rev Genet 21, 630–644 (2020).

      • In many ways, this work has similar goals to many studies that have performed multi-tissue WGCNA (e.g., Talukdar et al. 2016; https://doi.org/10.1016/j.cels.2016.02.002). In this manuscript, WGCNA's conventional approach to estimating robust correlations (bicor) is used, but they do not use WGCNA's data reduction/clustering functionality to estimate modules. Perhaps the modules would miss the signaling relationships of interest, being sort of lost in the presence of stronger signals that aren't relevant to the biological questions here. But I think it would be good for the authors to explain why they didn't use the full WGCNA approach.

      Response: This is an important point and we also feel that the previous lack of methodological details and discussion did a poor job at distinguishing why module-based approaches were not used. We wanted to be careful not to emphasize one approach being superior/inferior to another, rather point out the different considerations and when a direct correlation might inform a given question. As the reviewer points out, our general feeling is that adopting a simple gene-focused correlation approach allows users to view mechanisms through the lens of a single gene; however, this is limited in that these could be influenced by cumulative patterns of correlation structure (for example mitochondria in revised Fig 2A) which would be much more apparent in a module-based approach. This comment, in combination with the other listed above, was our motivation in exploring cumulative patterns of gene-gene correlations in the revised Fig 2. In the revised manuscript, we expanded on the results and discussion section to highlight utility of these types of approaches compared to module-based methods:

      The queries provided in GD-CAT use fairly simple linear models to infer organ-organ signaling; however, more sophisticated methods can also be applied in an informative fashion. For example, Koplev et al generated co-expression modules from 9 tissues in the STARNET dataset, where construction of a massive Bayesian network uncovered interactions between correlated modules6. These approaches expanded on analysis of STAGE data to construct network models using WGCNA across tissues and relating these resulting eigenvectors to outcomes42. The generalized approach of constructing cross-tissue gene regulatory modules presents appeal in that genes are able to be viewed in the context of a network with respect to all other gene-tissue combinations. In searching through these types of expanded networks, individuals can identify where the most compelling global relationships occur. One challenge with this type of approach; however, is that coregulated pathways and module members are highly subjective to parameters used to construct GRNs (for example reassignment threshold in WGCNA) and can be difficult in arriving at a “ground truth” for parameter selection. We note that the WGCNA package is also implemented in these analyses, but solely to perform gene-focused correlations using biweight midcorrelation to limit outlier inflation. While the midweight bicorrelation approach to calculate correlations could also be replaced with more sophisticated models, one consideration would be a concern of overfitting models and thus, biasing outcomes.

      Additional comments on Discussion:

      • In the second paragraph of the Discussion (lines 231-244), the authors mention that GD-CAT uses linear models to compare data between organs and point to other methods that use more complex or elaborate models. It's good to cite these methods, but I think they could more directly state that there are limitations to high complexity models, such as over-fitting.

      Response: Thank you for this suggestion. We have added a line (above) mentioning the overfitting concern.

      Comments on Methods:

      • The described gene filtration in the Methods of including genes with non-zero expression for 1.2e6 gene-tissue combinations is confusing. If there are 310 individuals and 18 tissues, for a given gene, aren't there only 5,580 possible data points? Might be helpful to contextualize the cut-off in terms of like the average number of individuals with non-zero expression within a tissue.

      Response: We apologize for this error. This number was pasted from a previous dataset used and not appropriate for this manuscript. In general, we have removed specific mentions of total number of gene_tissue correlation combinations, as these numbers reflect large but almost meaningless quantifications. Instead, we expanded the methods in terms of how individuals and genes filtered.

      • More details should be given about the gene ontology/pathway enrichment analysis. I suspect that a set-based approach (e.g., hypergeometric test) was used, rather than a score-based approach. The authors don't state what universe of genes were used, i.e., the overall set of genes that the reduced set of interest is compared to. Seems like this could or should vary with the tissues that are being compared. A score-based approach could be interesting to consider (https://www.biorxiv.org/content/10.1101/060012v3), using the genetic correlations as the score, as this would remove the unappealing feature of sets being dependent on correlation thresholds. This isn't something that I would demand of the published paper, but it could be an appealing approach for the authors to consider and confirm similar results to the set-based analysis.

      Response: This is an important point. Following this suggestion, we evaluated several different rank- and weight-based pathway enrichment tools, including FGSEA and others. Ultimately, we concluded that GSEA performed significantly better at 1) recapitulating known biology of select secreted protein genes and 2) leveraging the large numbers of genes occurring at qvalue cutoffs without having to further refine (ex. in the previous overrepresentation tests). For this reason, all pathway enrichments in the web tools and manuscripts not contain GSEA outputs and corresponding pathway enrichments or network graph visualizations. Thank you for this suggestion.

      Comments on figures:

      • I think there is a bit of a missed opportunity to use the figures to introduce and build up the story for readers. For example, in Figure 1, plotting ADIPOQ expression against a correlated gene in adipose (local) as well as peripheral tissues. This doesn't need to be done for every example, but I think it would help readers understand what the data are, and what's being detected before jumping into higher level summaries.

      Response: Thank you, this point also builds on others which recommended to restructure the manuscript and figures. In the revised manuscript, we first introduce the web tool (which was last previously), and immediately highlight comparisons of within- and across-organ correlations, such as ADIPOQ. We feel that the revised manuscript presents a superior structure in terms of demonstrating the key points and utility of looking at gene-gene correlations across tissues.

      • Figures 1 and 4 are missing the color scale legend for the bar plots, so it's impossible to tell how significant the enrichments are.

      Response: We apologize for the oversight. The pathways in the revised Fig 1 detail pathway network graphs among the top pathways which should make interpretation more intuitive. We have also gone through and made sure that GSEA enrichment pvalues are now present for all figures including pathways (revised Fig 1, Fig 3 and supplemental Fig 4).

      • The Figure 2 caption says that edges are colored based on correlation sign? Are there any negative correlations (red)? They all look blue to me. The caption could also state that edge weight reflects correlation magnitude (I assume). It would be ideal to include a legend that links a range of the depicted edge weights to their genetic correlation, though I don't know how feasible that may be depending on the package being used to plot the networks.

      Response: Good catch. We included in the revised manuscript the network edge parameters: Network edges represent positive (blue) and negative (red) correlations and the thicknesses are determined by coefficients. They are set for a range of bicor=0.6 (minimum to include) to bicor=0.99

      Related to seeing a dominant pattern of positive correlations, we agree that this observation is fascinating and gene-gene correlations being dominated by positive coefficients will be the topic of a closely-following manuscript from the lab

      • Figure 4A would be more informative as boxplots, which could still include Ssec score. This would allow the reader to get a sense of the variation in correlation p-value across all hippocampus transcripts.

      Response: Related to comments from this reviewer and others, we have removed the previous Fig 4 entirely from the manuscript to emphasize the ability of these gene-gene correlations to capture known biology and limit the extend of unvalidated “suggested” new mechanisms.

      Comments on GD-CAT

      • The online webtool worked nicely for me. It was easy to use and produce figures like in the manuscript. One suggestion is show data points in the scatter plot rather than just the regression line (if that's possible currently, I didn't figure it out). A regression line isn't that interesting to look at, but seeing how noisy the data look around it is something humans can usually interpret intuitively.

      Response: Thank you so much. We are excited that the web tool works sufficiently. We have also revised the individual gene-gene correlation tab to show individual data points instead of simple regression lines.

      Minor comments:

      Response: Thank you for these detailed improvements

      • This sentence is awkwardly constructed: "Here, we surveyed gene-gene genetic correlation structure for ~6.1x10^12 gene pairs across 18 metabolic tissues in 310 individuals where variation of genes such as FGF21, ADIPOQ, GCG and IL6 showed enrichments which recapitulate experimental observations" (lines 68-70). It's an important sentence because it's where in the Abstract/Introduction the authors succinctly state what they did, thus I would re-work it to something like: "Here, we surveyed gene expression correlation structure..., identifying genes, such as FGF21, ADIPOQ, GCG and IL6, that possess correlation networks that recapitulate known biological pathways."

      Response: The numbers of pairs examined and dataset size have been removed for clarity and we have revised this statement and results as a whole

      • Prefer swapping "signal" for "signaling" in line 53 of Abstract/Introduction.

      Response: Done

      • Remove extra period in line 208 of Results.

      Response: Removed

      • Change "well-establish" to "well-established" in line 247 of Discussion.

      Response: Replaced

      • Missing commas in line 302 of Methods.

      Response: added

      • Missing comma in line 485 of Figure 3 caption.

      Response: The previous Fig 3 has been removed

      • Typo in title of Figure 3E (change "Perihperal" to "Peripheral")

      Response: Thank you, changed

      • Add y-axis label to y-axis labels (relative cell proportions) to Supplemental Figures 1-3.

      Response: These labels have been added

      Reviewer #3 (Recommendations For The Authors):

      Minor technical comment: The authors refer to correlations between genes when they actually mean correlations between GTEX transcript isoform models. It is exceedingly important to keep this distinction clear in the reader's mind, a fact that is emphasized by the authors themselves when they comment on the potential value of similar proteomic assays to evaluate multiorgan system communication. GTEx has tried to do proteomics but I do not know of any open data yet.

      Response: Thank you for this point. We have gone through the manuscript and replaced “gene correlations” with “transcript” or other similar mentions. Related to the comment on GTEx proteomics, this is an important point as well. As the reviewer mentions, proteomics has been performed on GTEx data; however, given that this dataset contains only 6 sparsely-represented individuals, analyses such as the ones highlighted in our study remain highly limited. We have added the following to the discussion: As the depth and availability of tissue-specific proteomic levels across diverse individuals continues to increase, an exciting opportunity is presented to explore the applicability of these analyses and identify areas when gene expression is not a sufficient measure. For example, mass-spec proteomics was recently performed on GTEx42; however, given that these data represent 6 individuals, analyses utilizing well-powered inter-individual correlations such as ours which contain 310 individuals remain limited n applications.

      The R/Shiny companion application: The community utility of this application would be greatly improved by a link to a primer and more basic functionality. The Github site is a "work in progress" and does not include a readme file or explanation (that I could find) on the license.

      Response: Thank you, we are excited that the apps operate sufficiently. We have revised the github repository entirely to contain a full walk-through of app details and parameter selections. These are meant to walk users through each step of the pipeline and discuss what is being done at each step. We agree that this updated github repository allows users to understand the details of the R/Shiny app in much more detail. We also made all the app scripts, datasets, markdown/walkthrough files and docker image fully available to enhance accessibility.

    1. 方音符號📄💬 台灣方音符號由台灣省國語推行委員會方言組的朱兆祥教授設計,以注音符號為基礎,增補華語沒有的發音符號而成。台灣大學中文系退休教授吳守禮所著的《國臺對照活用辭典》及鄉土文學作家楊青矗所著的《台華雙語辭典》皆採方音符號。 系統概要 方音符號以華語注音符號為基礎,再增加台語需要的音素。 聲母 塞音(不送氣)塞音(送氣)濁音鼻音擦音邊音 唇音ㄅ /p/ㄆ /pʰ/ㆠ /b/ㄇ /m/ 舌尖音ㄉ /t/ㄊ /tʰ/ㄋ /n/ㄌ /l/ 舌齒音ㄗ /ʦ/ㄘ /ʦʰ/ㆡ /ʣ/ㄙ /s/ 齦顎音ㄐ /ʨ/ㄑ /ʨʰ/ㆢ /ʥ/ㄒ /ɕ/ 舌根音ㄍ /k/ㄎ /kʰ/ㆣ /g/ㄫ /ŋ/ 喉音ㄏ /h/ [ʦ]/[ʨ]、[ʦʰ]/[ʨʰ]、[ʣ]/[ʥ]、[s]/[ɕ] 在台語是相同音位,但方音符號為相容注音符號,故 ㄗㆡㄘㄙ 與 ㄐㆢㄑㄒ 對立書寫,當後面接韻母ㄧ或ㆪ時用後者。 濁音的正確寫法是末筆畫圈,封閉不出頭。 方音符號的聲母不會單獨存在,音節一定會有韻母。 韻母 韻腹aaɪaʊeiɪaɪaʊɪəɪɔiuəɔuuauaɪueuim̩ŋ̍ 陰聲韻ㄚㄞㄠㆤㄧㄧㄚㄧㄠㄧㄜㄧㆦㄧㄨㄜㆦㄨㄨㄚㄨㄞㄨㆤㄨㄧㆬㆭ 鼻化韻ㆩㆮㆥㆪㄧㆩㄧㆯㄧㆧㄧㆫㆧㄨㆩㄨㆮ -mㆰㄧㆬㄧㆰㆱ -nㄢㄧㄣㄧㄢㄨㄣㄨㄢ -ŋㄤㄧㄥㄧㄤㄧㆲㆲ 注音符號的特色是為確保韻母ㄧ定在兩個字之內,部分雙母音、三母音與含鼻音韻尾的韻母都定義了符號。如ㄚ+ㄧ=ㄞ、ㄚ+ㄣ=ㄢ。方音符號同樣新增了ㆰ、ㆱ、ㆲ等符號,也是為了確保(不包含入聲韻尾的)韻母在兩個字以內。 ㄣ與ㄥ不單獨使用,只出現在ㄧㄣ、ㄧㄥ、ㄨㄣ,這應該是為了與注音符號相容。 /ing/ 寫成ㄧㄥ而不是ㄧㆭ。 鼻化韻母ㆩ、ㆧ、ㆥ、ㆮ、ㆯ、ㆪ、ㆫ的正確寫法是末筆畫圈且交叉出頭。 鼻化的複合韻母,是將最後一個韻母寫成鼻化韻母。 ㆦ第二筆的正確寫法是缺一角的菱形,ㆧ、ㆱ亦比照如此。 華語注音的介音ㄧ、ㄨ只能同時存在,不過台語韻母有ㄧㄨ也有ㄨㄧ。 入聲韻尾 入聲韻尾ptkh 第4調(陰入)ㆴㆵㆻㆷ 第8調(陽入)ㆴ˙ㆵ˙ㆻ˙ㆷ˙ 入聲韻尾無論直排或橫排,都寫在最後一個韻母的右側。 聲調 調號1234578 調名陰平陰上陰去陰入陽平陽去陽入 調號ˋ˪ㆴ、ㆵ、ㆻ、ㆷˊ˫ㆴ˙、ㆵ˙、ㆻ˙、ㆷ˙ 調值˥˥ 55˥˩ 51˧˩ 31˧ 30˨˦ 24˧˧ 33˥ 50 第6調調號與第2調同。 未定義第9調寫法。 輕聲與台灣華語相同使用 ˙,但書寫位置有分歧。 《國臺對照活用辭典》比照其他聲調寫在韻母右側,如 --khì 寫成 ㄎㄧ˙ 也有人比照華語寫在音節前面,好處是可同時標出本調,如 --khì 寫成 ˙ㄎㄧ˪ 地方腔的表達 發音符號備註 /ɛ/ㄝ須區別部分偏漳腔腔調並未併入/e/的[ɛ]時。 /ə/ㄜ一般習慣比照第一優勢腔寫成ㄜ,但若依教育部公告,ㄜ只用於泉腔閏音。 /o/ㄛ教育部公告版本是用ㄛ。 /ɨ/ㆨ表達部分偏泉腔調,並未併入 /i/ 或 /u/ [ɨ]。 Unicode表現 文字Unicode序列用途備註 ㆻU+31BBㆻ入聲韻尾-k原誤收ㆶ字,Unicode 13 (2020)增補 ㆴ˙ㆵ˙ㆻ˙ㆷ˙U+02D9˙第8聲調號寫在入聲韻尾後方 ˋU+02CBˋ第2聲調號 ˪U+02EA˪第3聲調號 ˊU+02CAˊ第5聲調號 ˫U+02EB˫第7聲調號 ˙U+02D9˙輕聲寫在音節前方或韻母後方 方音符號使用 注音符號 與 注音符號擴充。但Unicode官網提供的文字造形有誤,於2019年時才修正。 Unicode 3.0 新增的 ㆶ 是錯誤字形,正確的 ㆻ 於 Unicode 13.0 編入。 方音符號專用的調號 ˪、˫ 雖然是與其它方音符號同時編入Unicode,但CNS-11643於至今仍是暫編碼、GB18030也遺漏未收,故多數字型都未支援。甚至有部分字型此兩字符還顛倒。 第8聲依中推會數位排版中注音符號調號定位方式建議,使用 U+02D9˙。目前多數環境下無法與ㆴ、ㆵ、ㆻ、ㆷ正確結合顯示,需要專用字型。 輕聲與第8聲都用U+02D9˙號。因方音符號一般只供標音,不做文字書寫之用,並無混淆之虞。但若有連寫需求,音節之間應以空格隔開。 音節表 Øhppʰbmttʰnlʦ/ʨʦʰ/ʨʰʣ/ʥs/ɕkkʰgŋ aㄚㄏㄚㄅㄚㄆㄚㆠㄚㄇㄚㄉㄚㄊㄚㄋㄚㄌㄚㄗㄚㄘㄚㄙㄚㄍㄚㄎㄚㆣㄚㄫㄚ ãㆩㄏㆩㄆㆩㄉㆩㄊㆩㄗㆩㄘㆩㄙㆩㄍㆩㄎㆩ aʔㄚㆷㄏㄚㆷㄅㄚㆷㄆㄚㆷㆠㄚㆷㄇㄚㆷㄉㄚㆷㄊㄚㆷㄋㄚㆷㄌㄚㆷㄗㄚㆷㄘㄚㆷㄙㄚㆷㄍㄚㆷㄎㄚㆷ ãʔㆩㆷㄏㆩㆷㄙㆩㆷ amㆰㄏㆰㄉㆰㄊㆰㄌㆰㄗㆰㄘㆰㄙㆰㄍㆰㄎㆰㆣㆰ anㄢㄏㄢㄅㄢㄆㄢㆠㄢㄉㄢㄊㄢㄌㄢㄗㄢㄘㄢㄙㄢㄍㄢㄎㄢㆣㄢ aŋㄤㄏㄤㄅㄤㄆㄤㆠㄤㄉㄤㄊㄤㄌㄤㄗㄤㄘㄤㄙㄤㄍㄤㄎㄤㆣㄤ ap̚ㄚㆴㄏㄚㆴㄉㄚㆴㄊㄚㆴㄌㄚㆴㄗㄚㆴㄘㄚㆴㄙㄚㆴㄍㄚㆴㄎㄚㆴ at̚ㄚㆵㄏㄚㆵㄅㄚㆵㄆㄚㆵㆠㄚㆵㄉㄚㆵㄊㄚㆵㄌㄚㆵㄗㄚㆵㄘㄚㆵㄙㄚㆵㄍㄚㆵㄎㄚㆵ ak̚ㄚㆻㄏㄚㆻㄅㄚㆻㄆㄚㆻㆠㄚㆻㄉㄚㆻㄊㄚㆻㄌㄚㆻㄗㄚㆻㄘㄚㆻㄙㄚㆻㄍㄚㆻㄎㄚㆻㆣㄚㆻ aɪㄞㄏㄞㄅㄞㄆㄞㆠㄞㄇㄞㄉㄞㄊㄞㄋㄞㄌㄞㄗㄞㄘㄞㄙㄞㄍㄞㄎㄞㆣㄞㄫㄞ ãɪㆮㄏㆮㄆㆮㄉㆮㄗㆮㄍㆮㄎㆮ aɪʔㄞㆷㄏㄞㆷㄌㄞㆷㄙㄞㆷ auㄠㄏㄠㄅㄠㄆㄠㆠㄠㄇㄠㄉㄠㄊㄠㄋㄠㄌㄠㄗㄠㄘㄠㄙㄠㄍㄠㄎㄠㆣㄠㄫㄠ auʔㄠㆷㄆㄠㆷㄇㄠㆷㄉㄠㆷㄋㄠㆷㄌㄠㆷㄘㄠㆷㄍㄠㆷㄫㄠㆷ Øhppʰbmttʰnlʦ/ʨʦʰ/ʨʰʣ/ʥs/ɕkkʰgŋ eㆤㄏㆤㄅㆤㄆㆤㆠㆤㄇㆤㄉㆤㄊㆤㄋㆤㄌㆤㄗㆤㄘㆤㄙㆤㄍㆤㄎㆤㆣㆤㄫㆤ ẽㆥㄏㆥㄅㆥㄆㆥㄉㆥㄊㆥㄗㆥㄘㆥㄙㆥㄍㆥㄎㆥ eʔㆤㆷㄏㆤㆷㄅㆤㆷㄆㆤㆷㆠㆤㆷㄇㆤㆷㄉㆤㆷㄊㆤㆷㄋㆤㆷㄌㆤㆷㄗㆤㆷㄘㆤㆷㄙㆤㆷㄍㆤㆷㄎㆤㆷㆣㆤㆷㄫㆤㆷ ẽʔㆥㆷㄏㆥㆷㄎㆥㆷ Øhppʰbmttʰnlʦ/ʨʦʰ/ʨʰʣ/ʥs/ɕkkʰgŋ iㄧㄏㄧㄅㄧㄆㄧㆠㄧㄇㄧㄉㄧㄊㄧㄋㄧㄌㄧㄐㄧㄑㄧㆢㄧㄒㄧㄍㄧㄎㄧㆣㄧㄫㄧ ĩㆪㄏㆪㄅㆪㄆㆪㄉㆪㄊㆪㄐㆪㄑㆪㄒㆪㄍㆪㄎㆪ iʔㄧㆷㄅㄧㆷㄆㄧㆷㆠㄧㆷㄇㄧㆷㄉㄧㆷㄊㄧㆷㄋㄧㆷㄌㄧㆷㄐㄧㆷㄑㄧㆷㄒㄧㆷㄍㄧㆷㄎㄧㆷ imㄧㆬㄏㄧㆬㄉㄧㆬㄊㄧㆬㄌㄧㆬㄐㄧㆬㄑㄧㆬㆢㄧㆬㄒㄧㆬㄍㄧㆬㄎㄧㆬㆣㄧㆬ inㄧㄣㄏㄧㄣㄅㄧㄣㄆㄧㄣㆠㄧㄣㄉㄧㄣㄊㄧㄣㄌㄧㄣㄐㄧㄣㄑㄧㄣㆢㄧㄣㄒㄧㄣㄍㄧㄣㄎㄧㄣㆣㄧㄣ iŋㄧㄥㄏㄧㄥㄅㄧㄥㄆㄧㄥㆠㄧㄥㄉㄧㄥㄊㄧㄥㄌㄧㄥㄐㄧㄥㄑㄧㄥㄒㄧㄥㄍㄧㄥㄎㄧㄥㆣㄧㄥ ip̚ㄧㆴㄏㄧㆴㄌㄧㆴㄐㄧㆴㄑㄧㆴㆢㄧㆴㄒㄧㆴㄍㄧㆴㄎㄧㆴ it̚ㄧㆵㄏㄧㆵㄅㄧㆵㄆㄧㆵㆠㄧㆵㄉㄧㆵㄊㄧㆵㄌㄧㆵㄐㄧㆵㄑㄧㆵㆢㄧㆵㄒㄧㆵㄍㄧㆵㄎㄧㆵ ik̚/ɪək̚ㄧㆻㄏㄧㆻㄅㄧㆻㄆㄧㆻㆠㄧㆻㄉㄧㆻㄊㄧㆻㄌㄧㆻㄐㄧㆻㄑㄧㆻㆢㄧㆻㄒㄧㆻㄍㄧㆻㄎㄧㆻㆣㄧㆻ ɪaㄧㄚㄏㄧㄚㄇㄧㄚㄉㄧㄚㄋㄧㄚㄌㄧㄚㄐㄧㄚㄑㄧㄚㆢㄧㄚㄒㄧㄚㄍㄧㄚㄎㄧㄚㆣㄧㄚㄫㄧㄚ iãㄧㆩㄏㄧㆩㄅㄧㆩㄆㄧㆩㄉㄧㆩㄊㄧㆩㄐㄧㆩㄑㄧㆩㄒㄧㆩㄍㄧㆩㄎㄧㆩ ɪaʔㄧㄚㆷㄏㄧㄚㆷㄅㄧㄚㆷㄆㄧㄚㆷㄉㄧㄚㆷㄊㄧㄚㆷㄌㄧㄚㆷㄐㄧㄚㆷㄑㄧㄚㆷㆢㄧㄚㆷㄒㄧㄚㆷㄍㄧㄚㆷㄎㄧㄚㆷㆣㄧㄚㆷ iãʔㄧㆩㆷㄏㄧㆩㆷㄒㄧㆩㆷ ɪamㄧㆰㄏㄧㆰㄉㄧㆰㄊㄧㆰㄌㄧㆰㄐㄧㆰㄑㄧㆰㆢㄧㆰㄒㄧㆰㄍㄧㆰㄎㄧㆰㆣㄧㆰ ɪan/enㄧㄢㄏㄧㄢㄅㄧㄢㄆㄧㄢㆠㄧㄢㄉㄧㄢㄊㄧㄢㄌㄧㄢㄐㄧㄢㄑㄧㄢㆢㄧㄢㄒㄧㄢㄍㄧㄢㄎㄧㄢㆣㄧㄢ Øhppʰbmttʰnlʦ/ʨʦʰ/ʨʰʣ/ʥs/ɕkkʰgŋ ɪaŋㄧㄤㄏㄧㄤㄅㄧㄤㄆㄧㄤㄌㄧㄤㄐㄧㄤㄑㄧㄤㆢㄧㄤㄒㄧㄤㄍㄧㄤㄎㄧㄤㆣㄧㄤ ɪap̚ㄧㄚㆴㄏㄧㄚㆴㄉㄧㄚㆴㄊㄧㄚㆴㄌㄧㄚㆴㄐㄧㄚㆴㄑㄧㄚㆴㆢㄧㄚㆴㄒㄧㄚㆴㄍㄧㄚㆴㄎㄧㄚㆴㆣㄧㄚㆴ ɪat̚/ɪet̚ㄧㄚㆵㄏㄧㄚㆵㄅㄧㄚㆵㄆㄧㄚㆵㆠㄧㄚㆵㄉㄧㄚㆵㄊㄧㄚㆵㄌㄧㄚㆵㄐㄧㄚㆵㄑㄧㄚㆵㆢㄧㄚㆵㄒㄧㄚㆵㄍㄧㄚㆵㄎㄧㄚㆵㆣㄧㄚㆵ ɪak̚ㄧㄚㆻㄅㄧㄚㆻㄆㄧㄚㆻㄉㄧㄚㆻㄑㄧㄚㆻㄒㄧㄚㆻㄎㄧㄚㆻ ɪauㄧㄠㄏㄧㄠㄅㄧㄠㄆㄧㄠㆠㄧㄠㄇㄧㄠㄉㄧㄠㄊㄧㄠㄋㄧㄠㄌㄧㄠㄐㄧㄠㄑㄧㄠㆢㄧㄠㄒㄧㄠㄍㄧㄠㄎㄧㄠㆣㄧㄠㄫㄧㄠ ɪãuㄧㆯ ɪauʔㄧㄠㆷㄏㄧㄠㆷㄎㄧㄠㆷㄫㄧㄠㆷ ɪə/ɪoㄧㄜㄏㄧㄜㄅㄧㄜㄆㄧㄜㆠㄧㄜㄉㄧㄜㄊㄧㄜㄌㄧㄜㄐㄧㄜㄑㄧㄜㆢㄧㄜㄒㄧㄜㄍㄧㄜㄎㄧㄜㆣㄧㄜ ɪɔ̃ㄧㆧㄏㄧㆧㄉㄧㆧㄐㄧㆧㄑㄧㆧㄒㄧㆧㄍㄧㆧㄎㄧㆧ ɪəʔ/ɪoʔㄧㄜㆷㄏㄧㄜㆷㄉㄧㄜㆷㄌㄧㄜㆷㄐㄧㄜㆷㄑㄧㄜㆷㄒㄧㄜㆷㄍㄧㄜㆷㄎㄧㄜㆷㆣㄧㄜㆷ ɪɔŋㄧㆲㄏㄧㆲㄉㄧㆲㄊㄧㆲㄌㄧㆲㄐㄧㆲㄑㄧㆲㆢㄧㆲㄒㄧㆲㄍㄧㆲㄎㄧㆲㆣㄧㆲ ɪɔk̚ㄧㆦㆻㄏㄧㆦㆻㄉㄧㆦㆻㄊㄧㆦㆻㄌㄧㆦㆻㄐㄧㆦㆻㄑㄧㆦㆻㆢㄧㆦㆻㄒㄧㆦㆻㄍㄧㆦㆻㄎㄧㆦㆻㆣㄧㆦㆻ iuㄧㄨㄏㄧㄨㄅㄧㄨㆠㄧㄨㄉㄧㄨㄊㄧㄨㄋㄧㄨㄌㄧㄨㄐㄧㄨㄑㄧㄨㆢㄧㄨㄒㄧㄨㄍㄧㄨㄎㄧㄨㆣㄧㄨㄫㄧㄨ iũㄧㆫㄏㄧㆫㄉㄧㆫㄐㄧㆫㄑㄧㆫㄒㄧㆫㄍㄧㆫㄎㄧㆫ iũʔㄧㆫㆷㄏㄧㆫㆷ Øhppʰbmttʰnlʦ/ʨʦʰ/ʨʰʣ/ʥs/ɕkkʰgŋ ə/oㄜㄏㄜㄅㄜㄆㄜㆠㄜㄉㄜㄊㄜㄌㄜㄗㄜㄘㄜㄙㄜㄍㄜㄎㄜㆣㄜ ɔㆦㄏㆦㄅㆦㄆㆦㆠㆦㄇㆦㄉㆦㄊㆦㄋㆦㄌㆦㄗㆦㄘㆦㄙㆦㄍㆦㄎㆦㆣㆦㄫㆦ ɔ̃ㆧㄏㆧㄍㆧ əʔ/oʔㄜㆷㄏㄜㆷㄅㄜㆷㄆㄜㆷㆠㄜㆷㄉㄜㆷㄊㄜㆷㄌㄜㆷㄗㄜㆷㄘㄜㆷㄙㄜㆷㄍㄜㆷ ɔʔㆦㆷㄏㆦㆷㄇㆦㆷㄉㆦㆷㄌㆦㆷㄍㆦㆷ ɔ̃ʔㆧㆷㄏㆧㆷ ɔmㆱㄉㆱㄙㆱ ɔŋㆲㄏㆲㄅㆲㄆㆲㆠㆲㄉㆲㄊㆲㄌㆲㄗㆲㄘㆲㄙㆲㄍㆲㄎㆲㆣㆲ ɔp̚ㆦㆴㄏㆦㆴㄌㆦㆴㄍㆦㆴ ɔk̚ㆦㆻㄏㆦㆻㄅㆦㆻㄆㆦㆻㆠㆦㆻㄉㆦㆻㄊㆦㆻㄌㆦㆻㄗㆦㆻㄘㆦㆻㄙㆦㆻㄍㆦㆻㄎㆦㆻㆣㆦㆻ Øhppʰbmttʰnlʦ/ʨʦʰ/ʨʰʣ/ʥs/ɕkkʰgŋ uㄨㄏㄨㄅㄨㄆㄨㆠㄨㄉㄨㄊㄨㄌㄨㄗㄨㄘㄨㆡㄨㄙㄨㄍㄨㄎㄨㆣㄨ uʔㄨㆷㄅㄨㆷㄆㄨㆷㆠㄨㆷㄉㄨㆷㄊㄨㆷㄌㄨㆷㄗㄨㆷㄘㄨㆷㄙㄨㆷㄎㄨㆷ unㄨㄣㄏㄨㄣㄅㄨㄣㄆㄨㄣㆠㄨㄣㄉㄨㄣㄊㄨㄣㄌㄨㄣㄗㄨㄣㄘㄨㄣㆡㄨㄣㄙㄨㄣㄍㄨㄣㄎㄨㄣㆣㄨㄣ ut̚ㄨㆵㄏㄨㆵㄅㄨㆵㄆㄨㆵㆠㄨㆵㄉㄨㆵㄊㄨㆵㄌㄨㆵㄗㄨㆵㄘㄨㆵㄙㄨㆵㄍㄨㆵㄎㄨㆵ uaㄨㄚㄏㄨㄚㄅㄨㄚㄆㄨㄚㆠㄨㄚㄇㄨㄚㄉㄨㄚㄊㄨㄚㄋㄨㄚㄌㄨㄚㄗㄨㄚㄘㄨㄚㆡㄨㄚㄙㄨㄚㄍㄨㄚㄎㄨㄚㆣㄨㄚ uãㄨㆩㄏㄨㆩㄅㄨㆩㄆㄨㆩㄉㄨㆩㄊㄨㆩㄗㄨㆩㄘㄨㆩㄙㄨㆩㄍㄨㆩㄎㄨㆩ uaʔㄨㄚㆷㄏㄨㄚㆷㄅㄨㄚㆷㄆㄨㄚㆷㆠㄨㄚㆷㄉㄨㄚㆷㄊㄨㄚㆷㄌㄨㄚㆷㄗㄨㄚㆷㄘㄨㄚㆷㆡㄨㄚㆷㄙㄨㄚㆷㄍㄨㄚㆷㄎㄨㄚㆷ uanㄨㄢㄏㄨㄢㄅㄨㄢㄆㄨㄢㆠㄨㄢㄉㄨㄢㄊㄨㄢㄌㄨㄢㄗㄨㄢㄘㄨㄢㄙㄨㄢㄍㄨㄢㄎㄨㄢㆣㄨㄢ uaŋㄨㄤㄏㄨㄤㄘㄨㄤ uat̚ㄨㄚㆵㄏㄨㄚㆵㄅㄨㄚㆵㄆㄨㄚㆵㆠㄨㄚㆵㄉㄨㄚㆵㄊㄨㄚㆵㄌㄨㄚㆵㄗㄨㄚㆵㄙㄨㄚㆵㄍㄨㄚㆵㄎㄨㄚㆵㆣㄨㄚㆵ uaiㄨㄞㄏㄨㄞㄙㄨㄞㄍㄨㄞㄎㄨㄞ uãiㄨㆮㄏㄨㆮㄗㄨㆮㄙㄨㆮㄍㄨㆮ uaiʔㄨㄞㆷㆠㄨㄞㆷㄌㄨㄞㆷㄘㄨㄞㆷㄍㄨㄞㆷ uãiʔㄨㆮㆷㄍㄨㆮㆷ ueㄨㆤㄏㄨㆤㄅㄨㆤㄆㄨㆤㆠㄨㆤㄇㄨㆤㄉㄨㆤㄊㄨㆤㄌㄨㆤㄗㄨㆤㄘㄨㆤㆡㄨㆤㄙㄨㆤㄍㄨㆤㄎㄨㆤㆣㄨㆤ Øhppʰbmttʰnlʦ/ʨʦʰ/ʨʰʣ/ʥs/ɕkkʰgŋ ueʔㄨㆤㆷㄏㄨㆤㆷㄅㄨㆤㆷㄆㄨㆤㆷㆠㄨㆤㆷㄊㄨㆤㆷㄌㄨㆤㆷㄗㄨㆤㆷㄘㄨㆤㆷㄙㄨㆤㆷㄍㄨㆤㆷㄎㄨㆤㆷㆣㄨㆤㆷㄫㄨㆤㆷ uiㄨㄧㄏㄨㄧㄅㄨㄧㄆㄨㄧㆠㄨㄧㄇㄨㄧㄉㄨㄧㄊㄨㄧㄌㄨㄧㄗㄨㄧㄘㄨㄧㄙㄨㄧㄍㄨㄧㄎㄨㄧㆣㄨㄧ uĩㄨㆪㄏㄨㆪㄅㄨㆪㄗㄨㆪㄘㄨㆪㄙㄨㆪㄍㄨㆪ uiʔㄨㄧㆷㄏㄨㄧㆷㄍㄨㄧㆷ Øhppʰbmttʰnlʦ/ʨʦʰ/ʨʰʣ/ʥs/ɕkkʰgŋ m̩ㆬㄏㆬ m̩ʔㆬㆷㄏㆬㆷ ŋ̍ㆭㄏㆭㄅㆭㄇㆭㄉㆭㄊㆭㄋㆭㄗㆭㄘㆭㄙㆭㄍㆭㄎㆭ ŋ̍ʔㆭㆷㄏㆭㆷㄆㆭㆷㄇㆭㆷㄘㆭㆷㄙㆭㆷㄎㆭㆷ 範例 我真頇顢講話,但是我真實在。 ㆣㄨㄚˋ ㄐㄧㄣ ㄏㄢˊ ㆠㄢ˫ ㄍㆲˋ ㄨㆤ˫,ㄉㄢ˫ ㄒㄧ˫ ㆣㄨㄚˋ ㄐㄧㄣ ㄒㄧㆵ ㄗㄞ˫。 請手扞予好勢,跤徛予在。 ㄑㄧㆩˋ ㄑㄧㄨˋ ㄏㄨㆩ˫ ㄏㆦ˫ ㄏㄜˋ ㄙㆤ˪,ㄎㄚ ㄎㄧㄚ˫ ㄏㆦ˫ ㄗㄞ˫。 建立於 2022 年 3 月 18 日 10 時 25 分本條目共被 1 位不同作者編輯過 7 次最後一次修改於 2022 年 3 月 22 日 19 時 59 分 關於本站 | 關於字碼資料庫

      有可以拷貝的臺語注音符號字母

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: In their paper "Oxytocin alleviated colitis and colitis-associated colorectal tumorigenesis by targeting fucosylated MUC2" the authors describe the contribution of oxytocin (OXT) to colonic mucus formation, and its protective contribution to colitis and colon cancer. The authors also describe the mechanism by which OXT execute its alleviating effects on the colon mucus; enhanced B3GNT7-mediated fucosylation. The findings are demonstrated in cell cultures, various mice models, and samples from human patients.

      Major comments:

      1. A conceptually confusing issue in this work is whether a decrease in OXTR expression is a predisposition, or a result of colonic illness. On one hand, the experiments with OXTR KO mice and cultured cells suggest that pre-existing lower levels of the receptor sensitize the tissue. However, in the AOM/DSS model the control mice present normal OXTR levels whereas mice that received AOM/DSS had lower expression, suggesting that changes in OXTR levels are not a predisposition but a result of the treatment/illness. Additionally, when tissue from CAC patients were analyzed, decreased levels of OXTR were found in sites of wounds but not in adjacent healthy tissue, implying that this decrease is not a genetic treat but a result of external cue. This inconsistency must be sorted out and clearly demonstrated.
      2. The study describes a new regulatory pathway for colonic mucin 2, and colon related conditions. Why did the author choose to generate mice lacking OXTR in the entire intestine (small+large) and not a large-intestine specific deficiency? And is there any way to demonstrate that the absence of OXTR in the small intestine does not interfere with the results presented here?
      3. The commonly used fixative for mucus and secreted mucins is Carnoy fixative (can be found in many of Hannson G.C and in Johansson M.E.V papers, and many other papers describing colon staining), while the use of formaldehyde and glutaraldehyde is less preservative for mucus layer. This raises a concern regarding the data obtained from aldehyde-fixed mucus samples.
      4. The authors found that mice lacking OXTR have lowered levels of B3GNT7, which leads to a decrease in mucin 2 fucosylation and to further damage in the colon. What is the mechanism by which supplementation with L-fucose alleviates these outcomes given that the enzyme that regulates the addition of the fucose to mucin 2 is downregulated?

      Minor comments:

      1. Some IHC images don't show comparable or similar areas. Specifically, Figure 1 B, Figure 7 F, I.
      2. There is a discrepancy between the dosage of DSS used to induce chronic colitis in the text (2%) and in the methods (2.2%). In addition, the difference between concentrations of DSS used to induce chronic and acute colitis (2.2% vs. 2.5%, respectively) is significantly smaller than what is reported in many other papers using these models.
      3. In Figure 2 A, I, Y-axis labeling doesn't seem right (compare with Figure 5 G). It looks like the decimal point is a mistake.
      4. All Western blots presented in this study lack the molecular weight of the proteins. In many cases it would have been more convincing to see a larger portion of the membrane.
      5. Mucin 2 in a large protein (more than 5000 amino acids in human mucin 2), and many disulfide bonds. The authors do not mention if any reducing or denaturing agents were added to the lysis buffers, and whether any other special conditions were employed to separate this huge globular protein on SDS-PAGE gel.
      6. The following sentence should be revised: " To examine the effects of fucosylation regulated by OXT on LS174T cells and colonic organoids, we found that..."

      Significance

      Though the concept of OXT-mediated suppression of colon cancer has been reported (For example: PMID: 34528509, and 31920487), the regulatory pathway by which it exerts its alleviating effect, and all the mechanistic components described in this paper, were not known before. This pathway may be a potential target for therapeutic intervention in various colonic diseases. Moreover, additional mucins may be regulated by OXT in a similar manner, which can extend the importance of these findings to other organs and disease-conditions. This type of findings is of interest to the broad audience of general cell biology as well as to GI clinicians. However, as stated in my comments there are some major issues with the hypothesis, the way data is presented, and in key methods that fundamentally limits my ability to evaluate this paper.

      Key words for my field of expertise: Disulfide catalysis, Golgi, mucin

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      In this study, Wang and colleagues present data linking oxytocin signaling to protection against colitis and colitis associated cancer development. To this end, they utilize a Villin-Cre line to specifically remove OXTR only from intestinal epithelia and make use of AOM and/or DSS treatment to induce colitis, colorectal cancer or CAC. OXTRdeltaIEC mice consistently develop worse symptoms and more severe colitis/CAC compared to non-Cre expressing littermates, which appears to be associated with defects in the mucus layer. Using RNA-Seq, they identify the glycosyltransferase B3GNT7 as a differentially expressed gene. Due to its role in O-linked glycosylation, they investigate whether B3GNT7 is involved in mucin production and are able to show that OXT-induced upregulation of MUC2 protein is abolished when B3GNT7 is knocked down. In vivo, co-treatment with oxytocin reduces experimental CAC, which is an interesting that OXT may present a potential treatment option in CAC. The study is quite interesting in that it provides a potential treatment option where the mucus layer, which is often disturbed in IBD, can be impacted in a positive way. Still, there are some things missing to really be able to interpret the full picture and these should be experimentally addressed.

      Major comments

      • The staining for OXTR in Fig 1B is very strong (especially as others have reported that they were not able to demonstrate OXTR in human samples, Ohlsson et al. PMID: 16678285) - it would be beneficial to confirm OXTR distribution in steady state in mice, especially as you have a great negative control were OXTR staining should be absent from the IECs. Given the observations further in the study, I would also be very curious to see whether OXTR expression is specific to goblet cells, so co-staining with Muc2 would be interesting to include in later figures. The information for the OXTR antibody is also missing from the supplementary methods.
      • Fig 1D, Fig S2 - is this whole colon RNA or specifically epithelial cells? This can have major impact on observed expression levels as the relative amounts of epithelial vs. other cell types can drastically change, thereby falsely giving the impression that expression levels in the epithelial cells change, so this really needs to be analyzed in purified epithelial cells. In Fig S2 there is significant OXTR expression remaining in the deltaIEC mice, so this suggests to me that non-IEC cell types are also included.
      • The interpretation of the study would benefit from including some steady state/untreated data for the OXTRdIEC mice. For example in Fig 2 the researchers report increased spleen size, increased cytokines etc upon CRC in these mice, but it is important to also show steady state data as these parameters may already be significantly increased in basal conditions in these OXTR deficient mice (especially seeing as Fig 3 claims that under basal conditions the mucus layer is extensively damaged you would expect some phenotype in these mice).
      • Special care needs to be taken to preserve the mucus layer during fixation, and from the methods it is not clear whether the authors have taken these technical difficulties into account. Only PFA fixation is mentioned, but it is well-established that the golden standard for imaging the mucus layer is to fix tissues in water-free Carnoy's fixative, as the mucus layer tends to collapse using formaldehyde (see also Johansson & Hansson, PMID: 22259139).
      • In Fig 5, the message and conclusions become a bit more fuzzy. Overall fucosylation is measured, but it is unclear whether MUC2 itself is increasingly fucosylated due to OXTR signaling, or that this represents more global changes in the secretory pathway that eventually lead to more efficient MUC2 production. Perhaps an IP using fucose-specific lectin combined with western blotting for MUC2 may be an option to demonstrate whether MUC2 itself becomes increasingly fucosylated due to OXTR signaling?
      • The title broadly claims that OXT "alleviates" colitis and CAC through MUC2 fucosylation and Fig 6 indeed shows that OXT treatment affects the outcome of mice in a CAC model, which is very promising, but it also loses the link with the mechanistic insights surrounding MUC2 fucosylation in previous figures. To really definitively make the claim in the title, it's important to investigate whether these OXT treated mice indeed have restored B3GNT7 levels and a thicker mucus layer after AOS/DSS regimen compared to non-OXT treated mice (as one would expect based on the in vitro data using LS174T cells and organoids). Studying the effect of OXT treatment in the regular DSS colitis model would also provide additional support for this claim.
      • Optional as I am not a specialist in OXT signaling: I would assume that there are quite some differences between males and females when it comes to OXT and OXTR. Have the authors ever observed differences in staining pattern or expression levels between males and females? The methods state that all groups are sex matched, but I wonder if it may be necessary to include gender as a variable in the analysis?

      Minor comments

      • I would suggest to include another reference in the introduction and/or discussion, as MUC2 deficient mice are also known to develop colorectal cancer (Velcich et al. PMID: 11872843) and this serves as additional support for why it is important to discover how we can positively impact the mucin layer in IBD patients.
      • In Fig 1A GEO data is reanalyzed, but it's not immediately clear what the original samples were (i.e. colon biopsies). At first glance, the figure itself adds to this confusion with the titles 'hypothalamus' and 'hypophysis' - it's not very clear that these labels indicate synthesis location of the respective hormone and not the tissue where expression was measured.
      • Fig 1C - I could not find in the methods what software was used for these quantifications.
      • Fig 1C - N=5 is mentioned in the figure legend and there are 10 datapoints in each group. Were 2 biopsies quantified per patient then? Please state this more clearly.
      • Fig 3A, B and all other western blots - please include molecular weight indications in the figure
      • Several figures use light grey bars and datapoints, but this color was very hard to see after printing the manuscript.
      • The conclusion statement for Fig 3 should be revisited, as the expression of Muc2 mRNA is not affected at all by OXTR genotype (Fig S2F). Conclusion should make it clear that specifically (mature) protein levels are affected.
      • Fig 4A-D - would be nice to include the full list of DE genes in supplement, it's an important resource. For example, there are other factors known that influence the mucus layer (such as AGR2), so I would be interested to see how these are behaving in the knockout mice.
      • In Fig 4 H-J it would be informative to show the MUC2 mRNA expression level in these cultures as this could provide support for the mice data - i.e. do the cultures also display normal MUC2 mRNA levels, with a specific defect in the mucin maturation (as appears to be the case in mice)?
      • Fig S3H-K - this figure and the validation of the siRNA is not mentioned in the main text
      • It is interesting that L-fucose seems to partly reverse the effect of DSS, but I wonder whether mechanistically this is explained by restoration of B3GNT7 expression?
      • Please check the accession codes for the reanalyzed datasets, figure legends mention two accessions, while the Data availability statement mentions three accessions.
      • The number of repeats for each experiment is a bit unclear. It is now buried in the statistics statement in the methods, but it may be more clear if it is included in each figure legend.

      Significance

      This study shows a -for me- quite unexpected link between oxytocin and protection against colitis and colitis-associated cancer development. Disturbances in the mucin layer are a very common phenomenon in IBD and colitis and there has been a great interest in this in the scientific community for quite some years (Johansson, PMID: 25025717, Yao et al. PMID: 34902790, and many others). Current IBD treatment options are generally aimed at reducing inflammation, but this does not necessarily restore the mucin layer quality. It is therefore quite interesting to see that this is apparently heavily influenced by oxytocin (which already has applications in human medicine), and this provides significant advance to our current fundamental understanding of mucin barrier regulation.

      As mentioned in the comments, the study can be further improved. To me, a more detailed investigation into the steady state phenotype of these mice, and a more detailed confirmation of where the oxytocin receptor is expressed is necessary to fully put the results into a broader framework. Also Fig 6, where the actual interventional effect of oxytocin is evaluated, no longer demonstrates whether this indeed happens through the same mechanism as outlined in the previous figures and this should be developed more.

      I expect this study to be of interest primarily to a basic research audience, though I assume that a more clinical audience would be intrigued by the findings as well.

    1. The term was coined by productivity expert Merlin Mann in 2006. Mann's point is that time and attention are finite, and productivity suffers when an inbox is confused with a to-do list.  What is the Inbox Zero approach to email management? - TechTarget According to Mann, the zero isn't a reference to the number of messages in an inbox, but rather "the amount of time an employee's brain is in his inbox." Mann's point is that time and attention are finite, and productivity suffers when an inbox is confused with a to-do list. techtarget.com Inbox Zero Method: How to Inbox Zero in 15 Minutes - Superhuman Blog Oct 31, 2023 — What is the Inbox Zero method? Coined by productivity expert Merlin Mann, Inbox Zero is an email management strategy aimed at keeping your inbox empty. Or as empty as possible, at all times. The goal: triage your inbox quickly to reduce clutter and manage emails effectively. Superhuman Blog What is Inbox Zero – Definition, Example, and FAQ - Mindmesh Popularized by productivity guru Merlin Mann in 2006, inbox zero advocates for an empty or nearly empty inbox. It focuses instead on the most important and urgent emails. Inbox zero uses sorting tools and filters to eliminate unnecessary items — archiving and keeping only the important emails. mindmesh.com (function(){ (this||self).Bqpk9e=function(f,d,n,e,k,p){var g=document.getElementById(f);if(g&&(0!==g.offsetWidth||0!==g.offsetHeight)){var l=g.querySelector("div"),h=l.querySelector("div"),a=0;f=Math.max(l.scrollWidth-l.offsetWidth,0);if(0<d&&(h=h.children,a=h[d].offsetLeft-h[0].offsetLeft,e)){for(var m=a=0;m<d;++m)a+=h[m].offsetWidth;a=Math.min(f,a)}a+=n;d=Math.min(e?f-a:a,f);l.scrollLeft=e&&p?a:e&&k?-a:d;var b=g.getElementsByTagName("g-left-button")[0],c=g.getElementsByTagName("g-right-button")[0];b&&c&&(e= RegExp("\\btHT0l\\b"),k=RegExp("\\bpQXcHc\\b"),b.className=b.className.replace(e,""),c.className=c.className.replace(e,""),b.className=0===d?"pQXcHc "+b.className:b.className.replace(k,""),c.className=d===f?"pQXcHc "+c.className:c.className.replace(k,""),setTimeout(function(){b.className+=" tHT0l";c.className+=" tHT0l"},50))}};}).call(this);(function(){var id='_StdsZa-bFNXSkPIPkcqykAk_9';var index=0;var offset=0;var is_rtl=false;var is_gecko=false;var is_edge=false;var init='Bqpk9e';window[init](id,index,offset,is_rtl,is_gecko,is_edge);})();
    1. medical terms that are not easily broken

      This chapter has so many terms that are words not built from word parts yet it only has 7 flashcards. Suggest expanding this list. Consider creating a flashcard for the following terms:

      Terms from the Anatomy section

      bilirubin (bil-ĭ-ROO-bin) - a yellow substance that is released when hemoglobin breaks down.

      blood transfusion (tran-SFŪ-zhŏn) - a procedure that enables the transfer of blood products from one person to another.

      bone marrow (bōn MĀR-ō) is the soft tissue located inside flat bones such as the sternum, skull, and pelvis that produces red blood cells (RBC), white blood cells (WBC), and platelets.

      There is a flashcard in chapter 11 for bone marrow -

      bone marrow - tissue found inside bones, the site of all blood cell differentiation and mautration of b lymphocytes. (perhaps use this definition for consistency)

      erythropoietin (EPO) (ĕr-ĭ-thrō-poi-Ē-tin), a hormone secreted by the kidneys that stimulates the production of red blood cells in response to decreased levels of oxygen in the tissues.

      hematocrit (Hct) (hē-MAT-ō-krit) is the ratio of the volume of red blood cells to the total volume of blood.

      hemoglobin (Hgb) (HĒM-ō-glō-bin), a protein molecule in red blood cells that carries oxygen to the tissues.

      jaundice (JAWN-dis)- yellowish discoloration of the skin and eyes due to the liver being unable to adequately metabolize bilirubin.

      plasma (PLAZ-mă) - the pale, straw-colored liquid part of blood and lymph that constitutes 55% of a blood sample. Plasma is mostly water with dissolved proteins including albumin, immunoglobulins, clotting factors, nutrients, electrolytes, and cellular wastes.

      Rh factor (Ār āch FAK-tŏr) the presence of an antigen on the red blood cells.

      RhoGAM - a medication that can temporarily prevent the development of Rh antibodies in an Rh- mother, thereby averting this potentially serious disease for the fetus if it is Rh+.

      thrombocytes (THRŌM-bō-sītz) - platelets that help form a clot when there is bleeding.

      Terms from the Physiology section

      coagulation (kō-ag-yū-LĀ-shŏn) - The process that causes blood to clot and helps prevent excessive blood loss.

      fibrin (FĪ-brin) a protein that forms the basis of a blood clot.

      anticoagulant (an-tī-kō-AG-yū-lănt) - a substance or medication that opposes coagulation.

      Terms from disease section

      contusion (kon-TU-zhun) - a bruise that occurs when the small veins and capillaries under the skin break resulting in dark blue or purple tender patches that appear on the skin.

      hemophilia (hē-mō-FĒL-ē-ă) - a genetic disorder where blood doesn’t clot normally due to deficient or abnormal clotting factors.

      leukemia (lū-KĒM-ē-ă) - cancer characterized by an excessive increase in abnormal leukocyte reducing the ability to fight off infection

      polycythemia (pol-ē-sī-THĒM-ē-ă) - elevated RBC count.

      polycythemia vera (pol-ē-sī-THĒM-ē-ă vee-ruh) - an excessive production of immature RBCs and other blood components, increasing the viscosity of blood.

      sickle cell anemia (SĬK-l sĕl ā-NĒ-mē-ă) is genetic disorder that causes red blood cells to assume a sickle (i.e., crescent) shape that can block blood flow and cause serious problems in organs throughout the body.

      thalassemia (thal-ă-SĒM-ē-ă) - an inherited bleeding condition resulting in a reduced production of healthy blood cells and hemoglobin

      viscosity (vĭs-KOS-ĭ-tē) refers to the state of being thick, sticky, and easily coagulable.

      flashcard 4 - ischemia - belongs in set 1 - words built from word parts: isch/emia

  6. Nov 2023
    1. Author Response

      The following is the authors’ response to the original reviews.

      eLife assessment

      This study reports important findings regarding the systemic function of hemocytes controlling whole-body responses to oxidative stress. The evidence in support of the requirement for hemocytes in oxidative stress responses as well as the hemocyte single-nuclei analyses in the presence or absence of oxidative stress are convincing. In contrast, the genetic and physiological analyses that link the non-canonical DDR pathway to upd3/JNK expression and high susceptibility, and the inferences regarding the function of hemocytes in systemic metabolic control are incomplete and would benefit from more rigorous approaches. The work will be of interest to cell and developmental biologists working on animal metabolism, immunity, or stress responses.

      We would like to thank the editorial team for these positive comments on our manuscript and the constructive suggestions to improve our manuscript. We are now happy to send you our revised manuscript, which we improved according to the suggestions and valuable comments of the referees.

      Public Reviews:

      Reviewer #1 (Public Review):

      The study examines how hemocytes control whole-body responses to oxidative stress. Using single cell sequencing they identify several transcriptionally distinct populations of hemocytes, including one subset that show altered immune and stress gene expression. They also find that knockdown of DNA Damage Response (DDR) genes in hemocytes increases expression of the immune cytokine, upd3, and that both upd3 overexpression in hemocytes and hemocyte knockdown of DDR genes leads to increased lethality upon oxidative stress.

      Strengths

      1. The single cell analyses provide a clear description of how oxidative stress can cause distinct transcriptional changes in different populations of hemocytes. These results add to the emerging them in the field that there functionally different subpopulations of hemocytes that can control organismal responses to stress.

      2. The discovery that DDR genes are required upon oxidative stress to limit cytokine production and lethality provides interesting new insight into the DDR may play non-canonical roles in controlling organismal responses to stress.

      We are grateful to referee 1 to point out the importance and novelty of our snRNA-seq data and our findings on the role of DNA damage-modulated cytokine release by hemocytes during oxidative stress. We further extended these analyses in the revised manuscript by looking deeper into the transcriptomic alterations in fat body cells upon oxidative stress (Figure 4, Figure S4). We further provide additional data to support the connection of DNA damage signaling and regulation of upd3 release from hemocytes (Figure 6F). Here we show that upd3-deficiency can abrogate the increased susceptibility of flies with mei41 and tefu knockdown in hemocytes. In line with this finding, we also show that upd3null mutants show a reduced but not abolished susceptibility to oxidative stress overall (Figure 6F), underlining the role of upd3 as a mediator of oxidative stress response.

      Weaknesses

      1. In some ways the authors interpretation of the data - as indicated, for example, in the title, summary and model figure - don't quite match their data. From the title and model figure, it seems that the authors suggest that the DDR pathway induces JNK and Upd3 and that the upd3 leads to tissue wasting. However, the data suggest that the DDR actually limits upd3 production and susceptibility to death as suggested by several results:

      According to the referee’s suggestion, we revised the manuscript and adjusted our title, abstract and graphical summary to be more precise that DNA damage signaling seem to have a modulatory or regulatory effect on upd3 release. Furthermore, we provide now additional data to support the connection between DNA damage signaling and upd3 release. For example, we added several genetic “rescue” experiments to strengthen the epistasis that modulation of DNA damage signaling and the higher susceptibility of the fly is connected to altered upd3 levels (Figure 6F). We now provide additional data showing that the loss of upd3 rescues the susceptibility to oxidative stress in flies, which are deficient for DDR components in hemocytes.

      a. PQ normally doesn't induce upd3 but does lead to glycogen and TAG loss, suggesting that upd3 isn't connected to the PQ-induced wasting.

      Even though in our systemic gene expression analysis of upd3 expression, we could not detect a significant induction of upd3 upon PQ feeding. However, we found upd3 expression within our snRNAseq data in a distinct cluster of immune-activated hemocytes (Figure 3B, Cluster 6). Upon knockdown of the DNA damage signaling in hemocytes, the levels then increase to a detectable level in the whole fly. This supports our assumption that upd3 is needed upon oxidative stress to induce energy mobilization from the fat body, but needs to be tightly controlled to balance tissue wasting for energy mobilization. Furthermore, we found evidence in our new analysis of the snRNA-seq data of the fat body cells, that indeed we can find Jak/STAT activation in one cell cluster here, which could speak for an interaction of Cluster 6 hemocytes with cluster 6 fat body cells. A hypothesis we aim to explore in future studies.

      b. knockdown of DDR upregulates upd3 and leads to increased PQ-induced death. This would suggest that activation of DDR is normally required to limit, rather than serve as the trigger for upd3 production and death.

      Our data support the hypothesis that DDR signaling in hemocytes “modulates” upd3 levels upon oxidative stress. We now carefully revised the text and the graphical summary of the manuscript to emphasize that oxidative stress causes DNA damage, which subsequently induces the DNA damage signaling machinery. If this machinery is not sufficiently induced, for example by knockdown of tefu and mei-41, non-canonical DNA damage signaling is altered which induces JNK signaling and induces release of pro-inflammatory cytokines, including upd3. Whereas DNA damage itself is only slightly increase in the used DDR deficient lines (Figure 5C) and hemocytes do not undergo apoptosis (unaltered cell number on PQ (Figure 5B)), we conclude that loss of tefu, mei-41, or nbs1 causes dysregulation of inflammatory signaling cascades via non-canonical DNA damage signaling. However, oxidative stress itself seems to also induce upd3 release and DNA damage signaling in the same cell cluster, as shown by our snRNA-seq data (Figure 3B). Hence, we think that DNA damage signaling is needed as a rate-limiting step for upd3 release.

      c. hemocyte knockdown of either JNK activity or upd3 doesn't affect PQ-induced death, suggesting that they don't contribute to oxidative stress-induced death. It’s only when DDR is impaired (with DDR gene knockdown) that an increase in upd3 is seen (although no experiments addressed whether JNK was activated or involved in this induction of upd3), suggesting that DDR activation prevents upd3 induction upon oxidative stress.

      Whereas the double knockdown of upd3 or bsk and DDR genes was resulting in insufficient knockdown efficiencies, we added a rescue experiment where we combined upd3null mutants with knockdown of tefu and mei-41 in hemocytes and found a reduced susceptibility of DDR-deficient flies to oxidative stress.

      1. The connections between DDR, JNK and upd3 aren't fully developed. The experiments show that susceptibility to oxidative stress-induced death can be caused by a) knockdown of DDR genes, b) genetic overexpression of upd3, c) genetic activation of JNK. But whether these effects are all related and reflect a linear pathway requires a little more work. For example, one prediction of the proposed model is that the increased susceptibility to oxidative stress-induced death in the hemocyte DDR gene knockdowns would be suppressed (perhaps partially) by simultaneous knockdown of upd3 and/or JNK. These types of epistasis experiments would strengthen the model and the paper.

      As mentioned before, we had some technical difficulties combining the knockdown of bsk or upd3 with DDR genes. However, we added a new experiment in which we show that upd3null mutation can rescue the higher susceptibility of hemocytes with tefu and mei41 knockdown.

      1. The (potential) connections between DDR/JNK/UPD3 and the oxidative stress effects on depletion of nutrient (lipids and glycogen) stores was also not fully developed. However, it may be the case that, in this paper, the authors just want to speculate that the effects of hemocyte DDR/upd3 manipulation on viability upon oxidative stress involve changes in nutrient stores.

      In the revised version of the manuscript, we now provide a more thorough snRNA-seq analysis in the fat body upon PQ treatment to give more insights on the changes in the fat body upon PQ treatment. We added additional histological images of the abdominal fat body on control food and PQ food, to demonstrate the elimination of triglycerides from fat body with Oil-Red-O staining (Figure S1). We also analyzed now hemocyte-deficient (crq-Gal80ts>reaper) flies for their levels of triglycerides and carbohydrates during oxidative stress, to support our hypothesis that hemocytes are key players in the regulation of energy mobilization during oxidative stress. Loss of hemocytes (and therefore also their regulatory input on energy mobilization from the fat body) results in increased triglyceride storage in the fat body during steady state with a decreased consumption of these triglycerides on PQ food compared to control flies (Figure 1J). In contrast, glycogen storage and mobilization, which is mostly done in muscle, is not altered in these flies during oxidative stress (Figure 1L). Interestingly, free glucose levels are drastically reduced in hemocyte-deficient flies, which could be due to insufficient energy mobilization from the fat body and subsequently results in a higher susceptibility of these flies on oxidative stress (Figure 1K). Additionally, we aim to point out here that “functional” hemocytes are needed for effective response to oxidative stress, but this response has to be tightly balanced (see also new graphical abstract).

      Reviewer #2 (Public Review):

      Hersperger et al. investigated the importance of Drosophila immune cells, called hemocytes, in the response to oxidative stress in adult flies. They found that hemocytes are essential in this response, and using state-of-the-art single-cell transcriptomics, they identified expression changes at the level of individual hemocytes. This allowed them to cluster hemocytes into subgroups with different responses, which certainly represents very valuable work. One of the clusters appears to respond directly to oxidative stress and shows a very specific expression response that could be related to the observed systemic metabolic changes and energy mobilization. However, the association of these transcriptional changes in hemocytes with metabolic changes is not well established in this work. Using hemocyte-specific genetic manipulation, the authors convincingly show that the DNA damage response in hemocytes regulates JNK activity and subsequent expression of the JAK/STAT ligand Upd3. Silencing of the DNA damage response or excessive activation of JNK and Upd3 leads to increased susceptibility to oxidative stress. This nicely demonstrates the importance of tight control of JNK-Upd3 signaling in hemocytes during oxidative stress. However, it would have been nice to show here a link to systemic metabolic changes, as the authors conclude that it is tissue wasting caused by excessive Upd3 activation that leads to increased susceptibility, but metabolic changes were not analyzed in the manipulated flies.

      We thank the referee for the suggestion to better connect upd3 cytokine levels to energy mobilization from the fat body. We agree that this is an important point to support our hypothesis. First, we added now a detailed analysis of fat body cells in our snRNA-seq data to evaluate the changes induced in the fat body upon oxidative stress. We further added additional metabolic analyses of hemocyte-deficient flies (crq-Gal80ts>reaper) to support our hypothesis that hemocytes are key players in the regulation of energy mobilization during oxidative stress (see also answer to referee 1). Loss of the regulatory role of hemocytes in the energy mobilization and redistribution leads to a decreased consumption of these triglycerides on PQ food compared to control flies (Figure 1J). In contrast, glycogen storage and mobilization from muscle, is not affected in hemocyte-deficient flies during oxidative stress (Figure 1L). Interestingly, free glucose levels are drastically reduced in hemocyte-deficient flies compared to controls, which could be due to insufficient energy mobilization from the fat body resulting in a higher susceptibility to oxidative stress (Figure 1K). This data supports our assumption that “functional” hemocytes are needed for effective response to oxidative stress, but this response has to be tightly balanced (see also new graphical summary).

      The overall conclusion of this work, as presented by the authors, is that Upd3 expression in hemocytes under oxidative stress leads to tissue wasting, whereas in fact it has been shown that excessive hemocyte-specific Upd3 activation leads to increased susceptibility to oxidative stress (whether due to increased tissue wasting remains a question). The DNA damage response ensures tight control of JNK-Upd3, which is important. However, what role naturally occurring Upd3 expression plays in a single hemocyte cluster during oxidative stress has not been tested. What if the energy mobilization induced by this naturally occurring Upd3 expression during oxidative stress is actually beneficial, as the authors themselves state in the abstract - for potential tissue repair? It would have been useful to clarify in the manuscript that the observed pathological effects are due to overactivation of Upd3 (an important finding), but this does not necessarily mean that the observed expression of Upd3 in one cluster of hemocytes causes the pathology.

      We agree with the referee that the pathological effects and increased susceptibility to oxidative stress are mediated by over-activated hemocytes and enhanced cytokine release, including upd3 during oxidative stress. We edited the revised manuscript accordingly to imply a “regulatory” role of upd3, which we suspect and suggest as an important mediator for inter-organ communication between hemocytes and fat body. Whereas our used model for oxidative stress (15mM Paraquat feeding) is a severe insult from which most of the flies will not recover, we could not account and test how upd3 might influence tissue repair after injury, insults and infection. We believe that this is an important factor, we aim to explore in future studies.

      Reviewer #3 (Public Review):

      In this study, Kierdorf and colleagues investigated the function of hemocytes in oxidative stress response and found that non-canonical DNA damage response (DDR) is critical for controlling JNK activity and the expression of cytokine unpaired3. Hemocyte-mediated expression of upd3 and JNK determines the susceptibility to oxidative stress and systemic energy metabolism required for animal survival, suggesting a new role for hemocytes in the direct mediation of stress response and animal survival.

      Strength of the study:

      1. This study demonstrates the role of hemocytes in oxidative stress response in adults and provides novel insights into hemocytes in systemic stress response and animal homeostasis.

      2. The single-cell transcriptome profiling of adult hemocytes during Paraquat treatment, compared to controls, would be of broad interest to scientists in the field.

      We are grateful to these positive comments on our data and are excited that the referee pointed out the importance of our provided snRNA-seq analysis of hemocytes and other cell types during oxidative stress. In the revised, version we now extended this analysis and looked not only into hemocytes but also highlighted induced changes in the fat body (Figure 4).

      Weakness of the study:

      1. The authors claim that the non-canonical DNA damage response mechanism in hemocytes controls the susceptibility of animals through JNK and upd3 expression. However, the link between DDR-JNK/upd3 in oxidative stress response is incomplete and some of the descriptions do not match their data.

      In the revised manuscript, we aimed to strengthen the weaknesses pointed out by the referee. We now included additional genetic crosses to validate the connection of DDR signaling in hemocytes with upd3 release. For example, we added now survival studies where we show that upd3null mutation can rescue the higher susceptibility of flies with tefu and mei41 knockdown in hemocytes during oxidative stress. Furthermore, we added additional data to highlight the importance of hemocytes themselves as essential regulators of susceptibility to oxidative stress. We analyzed the hemocyte-deficient flies (crq-Gal80ts>reaper) for their triglyceride content and carbohydrate levels during oxidative stress (Figure 1 I-L). As outlined above, loss of hemocytes leads to a decreased consumption of these triglycerides on PQ food compared to control flies (Figure 1J). In contrast, glycogen storage and mobilization from muscle, is not affected in hemocyte-deficient flies during oxidative stress (Figure 1L). Interestingly, free glucose levels are drastically reduced in hemocyte-deficient flies, which could be due to insufficient energy mobilization from the fat body resulting in a higher susceptibility to oxidative stress (Figure 1K).

      1. The schematic diagram does not accurately represent the authors' findings and requires further modifications.

      We carefully revised the text throughout the manuscript describing our results and edited the graphical abstract to display that upd3 levels and hemocytes are essential to balance and modulate response to oxidative stress.

      Reviewer #1 (Recommendations For The Authors):

      The summary doesn't say too much about what the specific discoveries and results of the study are. The description is limited to just one sentence saying, "Here we describe the responses of hemocytes in adult Drosophila to oxidative stress and the essential role of non-canonical DNA damage repair activity in direct "responder" hemocytes to control JNK-mediated stress signaling, systemic levels of the cytokine upd3 and subsequently susceptibility to oxidative stress" which doesn't provide sufficient explanation of what the results were.

      In the revised version of our manuscript, we now provide further information for the reader to outline the findings of our study in a concise way in the summary.

      Reviewer #2 (Recommendations For The Authors):

      1. To strengthen the conclusion that the DDR response suppresses JNK, and thus Upd3, rescue of DDR by upd3 null mutation would help (knockdown by Hml>upd3IR might not work, RNAi seems problematic).

      We would like to thank the referee for this suggestion and included now a genetic experiment where we combined upd3null mutants with hemocyte-specific knockdown of mei-41 and tefu to test their susceptibility to oxidative stress. Our data indeed provide evidence that loss of upd3 rescues the higher susceptibility of flies with hemocyte-specific knockdown for tefu and mei-41 (Figure 6F). Furthermore, we see that upd3null mutants show a diminished susceptibility to oxidative stress compared to control flies (Figure 6F).

      1. To link the observed effects to systemic metabolic changes, it would be useful to measure glycogen and triglycerides in these flies as well:
      2. crq-Gal80ts>reaper to see what role hemocytes play in the observed metabolic changes.

      3. Hml-Upd3 overexpression and Upd3 null mutant (Upd3 RNAi seems to be problematic, we have similar experiences) to see if Upd3 overexpression leads to even more profound changes as suggested, and if Upd3 mutation at least partially suppresses the observed changes.

      We agree with the referee that analyzing the connection of hemocyte activation to metabolic changes should be demonstrated in our manuscript to support our claim that hemocytes are important regulators of energy mobilization during oxidative stress. Hence, we analyzed triglycerides and carbohydrate levels in hemocyte-deficient flies (crq-Gal80ts>reaper) during oxidative stress. Indeed, we found substantial differences in energy mobilization in these flies supporting the assumption that the higher susceptibility of hemocyte-deficient flies could be caused by substantial decrease in free glucose and inefficient lysis of triglycerides from the fat body (Figure 1I-K).

      1. To test whether the cause of the increased susceptibility to oxidative stress is due to Upd3 overactivation induced by DDR silencing, the authors should attempt to rescue DDR silencing with an Upd3 null mutation.

      The suggestion of the reviewer was included in the revised manuscript and as outlined above we now added this data set to our manuscript (Figure 6F). Indeed, we can now provide evidence that upd3null mutation rescues the higher susceptibility of flies with DDR knockdown in hemocytes.

      1. Lethality after PQ treatment varies widely (sometimes from 10 to 90%! as in Figure 5D) - is this normal? In some experiments the variability was much lower. In particular, Figure 5D is very problematic and for example the result with upd3 null mutant compared to control is not very convincing. This could be an important result to test whether Upd3, with normal expression likely coming from cluster 6, actually plays a beneficial role, whereas overexpression with Hml leads to pathology.

      We agree with the referee that it would be more convincing if the variation cross of survival experiments would be less. However, we included a lot of flies and vials in many individual experiments to test our hypothesis and variation in these survivals was always the case. These effects can be caused by many factors for example the amount of food intake by the flies, genetic background or inserted transgenes. The n-number is quite high across our survivals; so that we are convinced, the seen effects are valid. This reflects also the power of using Drosophila melanogaster as a model organism for such survivals. The high n-number in our data falls into a normal Gauss distribution with a distinct mean susceptibility between the genotypes analyzed.

      1. I like the conclusion at the end of the results: line 413: "We show that this oxidative stressmediated immune activation seems to be controlled by non-canonical DNA damage signaling resulting in JNK activation and subsequent upd3 expression, which can render the adult fly more susceptible to oxidative stress when it is over-activated." This is actually a more appropriate conclusion, but in the summary, introduction and discussion along with the overall schematic illustration, this is not actually stated as such, but rather as Upd3 released from cluster 6 causes the pathology. For example: line 435 "Hence, we postulate that hemocyte-derived upd3, most likely released by the activated plasmatocyte cluster C6 during oxidative stress in vivo and subsequently controlling energy mobilization and subsequent tissue wasting upon oxidative stress."

      We thank the referee for this suggestion and edited our manuscript and conclusions accordingly.

      Reviewer #3 (Recommendations For The Authors):

      1. In Figure 2, the authors claim showed that PQ treatment changes the hemocyte clusters in a way that suppresses the conventional Hml+ or Pxn+ hemocytes (cluster1) while expanding hemocyte clusters enriched with metabolic genes such as Lpin, bmm etc. It is not clear whether these cells are comparable to the fat body and if these clusters express any of previously known hemocyte marker genes to claim that these are bona fide hemocytes.

      We now included a new analysis of our snRNA-seq data in Figure S4, where we clearly show that all identified hemocyte clusters do not have a fat body signature and are hemocytes, which seem to undergo metabolic adaptations (Figure S4A). Furthermore, we show that the identified fat body cells have a clear fat body signature (Figure S4B) and do not express specific hemocyte markers (Figure S4C).

      1. In Figure 4C, the authors showed that comet assays of isolated hemocytes result in a statistically significant increase in DNA damage in DDR-deficient flies before and after PQ treatment. However, the authors conclude that, in lines 324-328, the higher susceptibility of DDR-deficient flies is not due to an increase in DNA damage. To explicitly conclude that "non-canonical" DNA damage response, without any DNA damage, is specifically upregulated during PQ treatment, the authors require further support to exclude the potential activation of canonical DDR.

      The referee is correct that we do not provide direct evidence for non-canonical DNA damage signaling. Therefore, we also decided to tune down our statement here a bit and removed that claim from the title. Increase in DNA damage can of course also increase the non-canonical DNA damage signaling pathway, loss of DNA damage signaling genes such as tefu and mei-41 seem to only have minor impacts on the overall amount of DNA damage acquired in hemocytes by oxidative stress. We therefore concluded that the induction in immune activation is most unlikely only caused by increased DNA damage but might be connected to dysregulation in non-canonical DNA damage signaling. Canonical DNA damage signaling leads essentially to DDR, which could be slow in adult hemocytes because they post-mitotic, or to apoptosis, which we could not observe in the analyzed time window in our experiments. Hemocyte number remained stable over the 24h PQ treatment without reduction in cell number (Figure 1H).

      1. From Figure 4D-F, the authors showed that loss of DDR in hemocytes induces the expression of unpaired 2 and 3, Socs36E, which represent the JAK/STAT pathway, and thor, InR, Pepck in the InR pathway, and a JNK readout, puc. These results indicate that the DDR pathway normally inhibits the upd-mediated JAK/STAT activation upon PQ treatment, compared to wild-type animals during PQ treatment in Figure 1B-C, which in turn protects the animal during oxidative stress responses. However, the authors claim that "enhanced DNA damage boosts immune activation and therefore susceptibility to oxidative stress (lines 365-366); we show that this oxidative stress-mediated immune activation seems to be controlled by non-canonical DNA damage signaling resulting in JNK activation and subsequent upd3 expression (line 413-416)". These conclusions are not compatible with the authors' data and may require additional data to support or can be modified.

      In the revised manuscript, we carefully revised now the text and our statements that it seems that DNA damage signaling in hemocytes has regulatory or modulatory effect on the immune response during oxidative stress. Accordingly, we also adjusted our graphical summary. We agree with the referee and used the term “non-canonical” DNA damage signaling more carefully throughout the manuscript. The slight increase in DNA damage seen after PQ treatment can contribute to immune activation but seems to be not correlative to the induced cytokine levels or the susceptibility of the flies to oxidative stress.

      1. In Fig 1I, the authors showed that genetic ablation of hemocytes using UAS-repear induces susceptibility to PQ treatment. It is possible that inducing cell death in hemocytes itself causes the expression of cytokine upd3 or activates the JNK pathway to enhance the basal level of upd3/JNK even without PQ treatment. If this phenotype is solely mediated by the loss of hemocytes, the results should be repeated by reducing the number of hemocytes with alternative genetic backgrounds.

      In the different genotypes analyzed across our manuscript we did not detect cell death of hemocytes or a dramatic reduction in hemocytes number (see Figure 1H, Figure 5B, Figure 6C). The higher susceptibility if hemocyte-deficient flies during oxidative stress is most likely caused by the loss of their regulatory role during energy mobilization. We tested triglyceride levels in hemocyte-deficient flies and found a decreased triglyceride consumption (lipolysis), with reduced levels of circulating glucose levels. This findings support our hypothesis that hemocytes are needed to balance the response to oxidative stress. In contrast, the flies with DDR-deficient hemocytes show higher systemic cytokine levels, which most likely enhance energy mobilization from the fat body and therefore result in a higher susceptibility of the fly to oxidative stress. Hence, we claim that hemocytes and their regulation of systemic cytokine levels are important to balance the response to oxidative stress and guarantee the survival of the organism.

      1. Lethality of control animals in PQ treatment is variable and it is hard to estimate the effect of animal susceptibility during 15mM PQ feeding. For example, Fig1A shows that control animals exhibit ~10% death during 15mM PQ which is further enhanced by crq-Gal80>reaper expression to 40% (Fig 1I). However, in Fig 5D-E, the basal lethality of wild-type controls already reaches 40~50%, which makes them hard to compare with other genetic manipulations. Related to this, the authors demonstrated that the expression of upd3 in hemocytes is sufficient to aggravate animal survival upon PQ treatment; however, upd3 null mutants do not rescue the lethality, which indicates that upd3 is not required for hampering animal mortality. These data need to be revisited and analyzed.

      As outlined above, we find the variability of susceptibility to oxidative stress across all of our experiments. This could be due to different effects such as food intake but also transgene insertion and genetic background. Crq-gal80ts>reaper flies are healthy, but show a shortened life span on normal food (Kierdorf et al., 2020) due to enhanced loss of proteostasis in muscles. We show in the revised manuscript that these flies have a higher susceptibility to oxidative stress and that this effect could be mediated by defects in energy mobilization and redistribution as shown by less triglyceride lysis from the fat body and decreasing levels in free glucose. This would explain the high mortality rate of these flies at 7 days after eclosion. Paraquat treatment (15mM) is a severe inducer of oxidative stress, which results in death of most flies when they are maintained for longer time windows on PQ food. Hence, it is a model, which is not suitable to examine and monitor recovery from this detrimental insult. upd3null mutants were extensively reexamined in this manuscript, and even though we could not see a full protection of these flies from oxidative stress induced death, we found a reduced susceptibility compared to control flies (Figure 6F). Furthermore, when we combined upd3null mutants with flies deficient for tefu and mei-41 in hemocytes, the increased susceptibility to oxidative stress was rescued.

    1. Author Response

      The following is the authors’ response to the original reviews.

      We appreciate the reviewers’ detailed corrections and insightful comments. We have revised our manuscript per reviewers’ recommendations by including new data and clarifications/expansion of the discussion on our findings. Please see below for details.

      Reviewer #1 (Recommendations For The Authors):

      1. The introduction notes that CD1d KO mice show reduced levels of Va3.2 T cells (Ruscher et al.), which is interesting because innate memory T cell development in the thymus often requires IL-4 production by NKT cells. Have the authors explored QFL T cells in CD1d KO and/or IL-4 KO mice? Since their QFL TCR Tg mice still develop QFL T cells (and these animals likely have very few thymic NKT cells), NKT cells may not be required for the intrathymic development of QFL T cells?

      Answer: We agree that investigation on the role of NKT cells or IL-4 in QFL T cell development will greatly further our understanding of these cells.

      We validated the finding that expression of the QFL TCR transgene largely repressed the expression of endogenous TCRα, as indicated by the low levels of endogenous Vα2 on mature CD8SP T cells in both thymus and spleen. However, the frequencies of Vα2 usage in CD4 SP thymocytes and splenocytes from QFL transgenic mice were similar to non-transgenic mice, confirming that they underwent positive selection using endogenous TCR rather than the QFL TCR. We thus do not exclude the possible presence of NKT cells in QFLTg mouse and their potential involvement in the QFL T cells development. Our manuscript here is mainly focused on investigating the peripheral phenotype of QFL T cells and their association with the gut microbiota environment. Investigations into the role of CD1d/IL-4 will be best addressed in our future studies.

      1. The finding that Qa-1 expression is not required for the development of QFL T cells raises questions about other MHC products that may be involved. In this context, it is interesting that TAP-deficient mice develop few QFL T cells, for reasons that are unclear, but the authors may speculate a bit. In this context, it may be helpful for the authors to note whether TAP is required for QFL presentation to QFL T cells. Since Qa-1 is not required, and CD1d is still expressed in TAP KO mice, what then could be responsible for their defect in QFL T cell development?

      Answer: This is a great point. Figure 2 (from (Valerio et al., 2023) on the development of QFL T cells) tested whether QFL TCR cross-react with other MHC I molecules.

      We assessed the activation of pre-selection QFLTg thymocytes in response to various MHC I deficient DC2.4 cell lines. While the QFL thymocytes showed partially reduced activation when stimulated with Qa-1b deficient APCs, triple knock-out (KO) of Qa-1b, Kb, and Db in DC2.4 cells reduced activation close to background levels. However, double knock-out of Qa-1b with either Kb, or Db led to stimulation that was intermediate between the triple KO and Qa-1b-KO cell lines. These data suggest that Kb and Db may contribute to the positive selection of QFL T cells in Qa-1b-KO mice.

      TAP is required for FL9 peptide presentation and is very likely needed for presentation of the yet unidentified MHC Ia presented peptide(s) that are essential to QFL T positive selection. While CD1d/NKT cells/IL-4 may be involved in supporting the maturation of QFL T cells, we think in the TAP-KO mice the absence of TAP led to deletion/altered selection of the QFL T population at early developmental stage. We have added clarification on this point in the revised manuscript (line 412~418).

      1. It may be worthwhile for the authors to note that Qa-1 was also dispensable for the intrathymic selection of another Qa-1-restricted TCR (Doorduijn et al. 2018. Frontiers Immunol.), although this is presumably not the case for others (Sullivan et al. 2002. Immunity 17, 95).

      Answer: We appreciate this recommendation. We have noted this point in the resubmitted manuscript (line 412~418).

      1. Lines 122-124: The sentence "Interesting ..." seemed confusing to me; are the numbers (60 and 30%) correct?

      Answer: The numbers 60% and 30% were referring to the largest number we have detected for percentages of Va3.2 QFL T cells and Va3.2 CD8 T cell respectively. Here in the revised version, we replaced these numbers with average percentages (20.1% and <10%) to avoid confusion (line 134).

      1. Qa-1/peptide complexes may also be recognized by CD94/NKG2 receptors, which may complicate the interpretation of the data (e.g., staining of the dextramers). From their previous work, it appears that Qa-1/QFL does not bind CD94/NKG2, which would be helpful to note in the text.

      Answer: We have noted this point in the revised manuscript (line 117~121).

      1. It would be helpful to add a few comments about the potential relevance to HLA-E.

      Answer: We have included discussion on this point (line 391~401).

      1. Figure legends: Most legends note the total number of replicates, which is usually quite high. It would also be helpful to indicate the total number of independent experiments performed and, when relevant, that the data are pooled from multiple independent experiments.

      Answer: Thank you for raising the concern. We have clarified the experimental repeats in figure legends.

      Reviewer #2 (Recommendations For The Authors):

      1. The work of Nilabh Shastri was the foundation of the present study. Unfortunately, he passed away in 2021. Since he can no longer assume the responsibilities of a senior author, I wonder if it would be more appropriate to dedicate this paper to him than to list him as a co-author.

      Answer: We have removed Dr. Shastri’s name as a co-senior author and have dedicated this work to his memory.

      1. The official symbol for ERAAP is Erap1.

      Answer: We have replaced ERAAP with ERAP1.

      1. Please refrain from editorializing. For example, "strikingly" appears eight times and "interestingly" 9 times in the manuscript. Most readers believe they do not need to be said when something is striking or interesting.

      Answer: We appreciate the Reviewer’s suggestion and have removed ‘strikingly’ and ‘interestingly’ from the manuscript.

      1. In WT mice, are there some cell types that express Qa-1b but not Erap1 and could therefore present the FL9 peptide?

      Answer: This is a great question. Using our highly sensitive QFL T cell hybridoma line BEko8Z (sensitivity shown in Fig. 6b), we have so far not been able to detect steady-state FL9 presentation by cells isolated from the spleen, lymph nodes, various gut associated lymphoid tissues or intestinal epithelial cells (Supplementary Fig. 8 a left panel). However, we do not exclude the possibility of FL9 peptide being transiently presented under certain conditions (i.e. ER stress/transformed cells) at particular locations or within certain time windows, which is of great importance for understanding the function of these cells but is beyond the scope of this study.

      1. Since you have not tested substitutions at other positions, could you explain your reasoning that P4 and P6 are the critical residues (lines 271-272)?

      Answer: Thank you for raising the concern. We have expanded on explanation of our strategy for determining peptide homology (line 272~313) in the revised manuscript. We have also included data on the structure the QFL TCR: FL9-Qa-1b complex predicted by Alphafold2, conformation alignment of FL9 and Qdm (Figure 6. a, b) and the NetMHCpan prediction of Qa1b binding of Qdm, FL9 and various FL9 mutant peptides (Supplementary Fig. 8 c) to help readers visualize the reasoning behind our strategy.

      1. Readers might appreciate having a Figure summarizing the differences between spleen and gut QFL T cells.

      Answer: This is a great suggestion. We have added a table summarizing the characteristic features of the splenic and IEL QFL T cells (Table 1).

      1. In the discussion, readers would like to know what plan you might have to elucidate the function of QFL T cells.

      Answer: We appreciate the recommendation. We have elaborated on our opinions and future directions in the resubmitted manuscript (line 393~401, 446~455).  

      Reviewer #3 (Public Review):

      1. For most of the report, the authors use a set of phenotypic traits to highlight the unique features of QFL-specific CD8+ T cells - specifically, CD44high, CD8aa+ve, CD8ab-ve. In Supp. Fig. 4, however, completely distinct phenotypic characteristics are presented, indicating that IEL QFL-specific T cells are CD5low, Thy-1low. No explanation is provided in the text about whether this is a previously reported phenotype, whether any elements of this phenotype are shared with splenic QFL T cells, what significance the authors ascribe to this phenotype (and to the fact that Qa1-deficiency leads to a more conventional Thy-1+ve, CD5+ve phenotype), and whether this altered phenotype is also seen in ERAAP-deficient mice. At least some explanation for this abrupt shift in focus and integration with prior published work is needed. On a related note, CD5 expression is measured in splenic QFL-specific CD8+ T cells from GF vs SPF mice (Supp. Fig. 9), to indicate that there is no phenotypic impact in the GF mice - but from Supp. Fig. 4, it would seem more appropriate to report CD5 expression in QFL-specific cells from the IEL, not the spleen.

      Answer: Expression of CD8αα and lack of CD4, CD8αβ, CD5 and CD90 expression was indeed reported as the characteristic phenotype of natIELs. We have clarified this point in the resubmitted manuscript (line 80). The CD8αα+ IEL QFL T cells have consistently showed CD5CD90- phenotype. While CD8αα expression was sufficient to describe their natIEL phenotype, we showed the CD5-CD90- data in Supplementary figures only to provide additional evidence.

      The CD5 molecule by itself reflects the TCR signaling strength and high CD5 level is associated with self-reactivity of T cells (Azzam et al., 2001; Fulton et al., 2015). The implication of CD5 expression on QFLTg cells is discussed in our other manuscript where we investigate the development of these cells (Valerio et al., 2023). In Supplementary Fig. 9, because the donor splenic QFLTg cell have consistently showed comparable CD5 level between the GF and SPF group, we reasoned that it would not interfere with our interpretation of the CD44 expression.

      1. The authors suggest the finding that QFL-specific cells from ERAAP-deficient mice have a more "conventional" phenotype indicates some form of negative selection of high-affinity clones (this result being somewhat unexpected since ERAAP loss was previously shown to increase the presentation of Qa-1b loaded with FL9, confirmed in this report). It is not clear how this argument aligns with the data presented, however, since the authors convincingly show no significant reduction in the number of QFL-specific cells in ERAAP-knockout mice (Fig. 3a), and their own data (e.g. Fig. 2a) do not suggest that CD44 expression correlates with QFL-multimer staining (as a surrogate for TCR affinity/avidity). Is there some experimental basis for suggesting that ERAAP-deficient lacks a subset of high affinity QFL-specific cells?

      Answer: We think the presence of QFL T cells in ERAAP-KO mice is a result of the unconventional developmental mechanism of these cells which is better addressed in our complementary manuscript on the development of QFL T cells(Valerio et al., 2023). Valerio et al. found that the most predominant QFL T clone which expresses Vα3.2Jα21, Vβ1Dβ1Jβ2-7 received relatively strong TCR signaling and underwent agonist selection during thymic development, indicating that the QFL ligand is involved in selection of the innate-like QFL T population.

      We agree that there is so far no direct evidence showing the QFL T cells that were absent in the ERAAP-KO mice were high-affinity clones. We have removed ‘high-affinity’ from the manuscript (line 180). While CD44 expression has been associated the antigen-experiences phenotype of T cells, it is yet unclear whether expression level of this molecule directly reflects TCR affinity/avidity. identification of clones of different affinities/avidities require high precision technologies that are not currently available to the research community. While we do have zMovi, a newly developed (developing) technology, in the lab claimed to measure relative avidity/affinity of different cell types for ligands, during the past two years working with this instrument has taught us that the technology is not yet advanced enough; it can only produce reliable data on extreme differences of single clones, i.e., high numbers of homogeneous cell types expressing very high affinity receptors.

      1. The rationale for designing FL9 mutants, and for using these data to screen the proteomes of various commensal bacteria needs further explanation. The authors propose P4 and P6 of FL9 are likely to be "critical" but do not explain whether they predict these to be TCR or Qa-1b contact sites. Published data (e.g., PMID: 10974028) suggest that multiple residues contribute to Qa-1b binding, so while the authors find that P4A completely lost the ability to stimulate a QFL-specific hybridoma, it is unclear whether this is due to the loss of a TCR- or a Qa-1-contact site (or, possibly, both). This could easily be tested - e.g., by determining whether P4A can act as a competitive inhibitor for FL9-induced stimulation of BEko8Z (and, ideally, other Qa-1b-restricted cells, specific for distinct peptides). Without such information, it is unclear exactly what is being selected in the authors' screening strategy of commensal bacterial proteomes. This, of course, does not lessen the importance of finding the peptide from P. pentosaceus that can (albeit weakly) stimulate QFL-specific cells, and the finding that association with this microbe can sustain IEL QFL cells.

      Answer: Thank you for raising the concern. We have expanded on explanation of our strategy for determining peptide homology (line 272~313) in the revised manuscript. We have also included data on the structure the QFL TCR: FL9-Qa-1b complex predicted by Alphafold2, conformation alignment of FL9 and Qdm (Figure 6. a, b) and the NetMHCpan prediction of Qa1b binding of Qdm, FL9 and various FL9 mutant peptides (Supplementary Fig. 8 c) to help readers visualize the reasoning behind our strategy.

      References

      Azzam, H.S., DeJarnette, J.B., Huang, K., Emmons, R., Park, C.S., Sommers, C.L., El-Khoury, D., Shores, E.W., and Love, P.E. (2001). Fine tuning of TCR signaling by CD5. J Immunol 166, 5464- 5472.10.4049/jimmunol.166.9.5464, PMID:11313384

      Fulton, R.B., Hamilton, S.E., Xing, Y., Best, J.A., Goldrath, A.W., Hogquist, K.A., and Jameson, S.C. (2015). The TCR's sensitivity to self peptide-MHC dictates the ability of naive CD8(+) T cells to respond to foreign antigens. Nat Immunol 16, 107-117.10.1038/ni.3043, PMID:25419629

      Valerio, M.M., Arana, K., Guan, J., Chan, S.W., Yang, X., Kurd, N., Lee, A., Shastri, N., Coscoy, L., and Robey, E.A. (2023). The promiscuous development of an unconventional Qa1b-restricted T cell population. bioRxiv, 2022.2009.2026.509583.10.1101/2022.09.26.509583,

    1. Reviewer #2 (Public Review):

      Summary:

      The goal of untargeted metabolomics is to identify differences between metabolomes of different biological samples. Untargeted metabolomics identifies features with specific mass-to-charge ratio (m/z) and retention time (RT). Matching those to specific metabolites based on the model compounds from databases is laborious and not always possible, which is why methods for comparing samples on the level of unmatched features are crucial.

      The main purpose of the GromovMatcher method presented here is to merge and compare untargeted metabolomes from different experiments. These larger datasets could then be used to advance biological analyses, for example, for the identification of metabolic disease markers. The main problem that complicates merging different experiments is m/z and RT vary slightly for the same feature (metabolite).

      The main idea behind the GromovMatcher is built on the assumption that if two features match between two datasets (that feature i from dataset 1 matches feature j from dataset 2, and feature k from dataset 1 matches feature l from dataset 2), then the correlations or distances between the two features within each of the datasets (i and k, and j and l) will be similar. The authors then use the Gromov-Wasserstein method to find the best matches matrix from these data.

      The variation in m/z between the same features in different experiments is a user-defined value and it is initially set to 0.01 ppm. There is no clear limit for RT deviations, so the method estimates a non-linear deviation (drift) of RT between two studies. GromovMatcher estimates the drift between the two studies and then discards the matching pairs where the drift would deviate significantly from the estimate. It learns the drift from a weighted spline regression.

      The authors validate the performance of their GromovMatcher method by a validation experiment using a dataset of cord blood. They use 20 different splits and compare the GromovMatcher (both its GM and GMT iterations, whereby the GMT version uses the deviation from estimated RT drift to filter the matching matrix) with two other matching methods: M2S and metabCombiner.

      The second validation was done using a (scaled and centered) dataset of metabolics from cancer datasets from the EPIC cohort that was manually matched by an expert. This dataset was also used to show that using automatic methods can identify more features that are associated with a particular group of samples than what was found by manual matching. Specifically, the authors identify additional features connected to alcohol consumption.

      Strengths:

      I see the main strength of this work in its combination of all levels of information (m/z, RT, and higher-order information on correlations between features) and using each of the types of information in a way that is appropriate for the measure. The most innovative aspect is using the Gromov-Wasserstein method to match the features based on distance matrices.

      The authors of the paper identify two main shortcomings with previously established methods that attempt to match features from different experiments: a) all other methods require fine-tuning of user-defined parameters, and, more importantly, b) do not consider correlations between features. The main strength of the GromovMatcher is that it incorporates the information on distances between the features (in addition to also using m/z and RT).

      Weaknesses:

      The first, minor, weakness I could identify is that there seem not to be plenty of manually curated datasets that could be used for validation. The second is also emphasized by the authors in the discussion. Namely, the method as it is set up now can be directly used only to compare two datasets.

    1. Reviewer #2 (Public Review):

      In this manuscript, the authors propose a computational method based on deep convolutional neural networks (CNNs) to automatically detect cell divisions in two-dimensional fluorescence microscopy timelapse images. Three deep learning models are proposed to detect the timing of division, predict the division axis, and enhance cell boundary images to segment cells before and after division. Using this computational pipeline, the authors analyze the dynamics of cell divisions in the epithelium of the Drosophila pupal wing and find that a wound first induces a reduction in the frequency of division followed by a synchronised burst of cell divisions about 100 minutes after its induction.

      In general, novelty over previous work does not seem particularly important. From a methodological point of view, the models are based on generic architectures of convolutional neural networks, with minimal changes, and on ideas already explored in general. The authors seem to have missed much (most?) of the literature on the specific topic of detecting mitotic events in 2D timelapse images, which has been published in more specialized journals or Proceedings. (TPMAI, CCVPR etc., see references below). Even though the image modality or biological structure may be different (non-fluorescent images sometimes), I don't believe it makes a big difference. How the authors' approach compares to this previously published work is not discussed, which prevents me from objectively assessing the true contribution of this article from a methodological perspective.

      On the contrary, some competing works have proposed methods based on newer - and generally more efficient - architectures specifically designed to model temporal sequences (Phan 2018, Kitrungrotsakul 2019, 2021, Mao 2019, Shi 2020). These natural candidates (recurrent networks, long-short-term memory (LSTM), gated recurrent units (GRU), or even more recently transformers), coupled to CNNs are not even mentioned in the manuscript, although they have proved their generic superiority for inference tasks involving time series (Major point 2). Even though the original idea/trick of exploiting the different channels of RGB images to address the temporal aspect might seem smart in the first place - as it reduces the task of changing/testing a new architecture to a minimum - I guess that CNNs trained this way may not generalize very well to videos where the temporal resolution is changed slightly (Major point 1). This could be quite problematic as each new dataset acquired with a different temporal resolution or temperature may require manual relabeling and retraining of the network. In this perspective, recent alternatives (Phan 2018, Gilad 2019) have proposed unsupervised approaches, which could largely reduce the need for manual labeling of datasets.

      Regarding the other convolutional neural networks described in the manuscript:

      1) the one proposed to predict the orientation of mitosis performs a regression task, predicting a probability for the division angle. The architecture, which must be different from a simple Unet, is not detailed anywhere, so the way it was designed is difficult to assess. It is unclear if it also performs mitosis detection, or if it is instead used to infer orientation once the timing and location of the division have been inferred by the previous network.

      2) the one proposed to improve the quality of cell boundary images before segmentation is nothing new, it has now become a classic step in segmentation, see for example Wolny et al. eLife 2020.

      As a side note, I found it a bit frustrating to realise that all the analysis was done in 2D while the original images are 3D z-stacks, so a lot of the 3D information had to be compressed and has not been used. A novelty, in my opinion, could have resided in the generalisation to 3D of the deep-learning approaches previously proposed in that context, which are exclusively 2D, in particular, to predict the orientation of the division.

      Concerning the biological application of the proposed methods, I found the results interesting, showing the potential of such a method to automatise mitosis quantification for a particular biological question of interest, here wound healing. However, the deep learning methods/applications that are put forward as the central point of the manuscript are not particularly original.

      Major point 1: generalisation potential of the proposed method.

      The neural network model proposed for mitosis detection relies on a 2D convolutional neural network (CNN), more specifically on the Unet architecture, which has become widespread for the analysis of biology and medical images. The strategy proposed here exploits the fact that the input of such an architecture is natively composed of several channels (originally 3 to handle the 3 RGB channels, which is actually a holdover from computer vision, since most medical/biological images are gray images with a single channel), to directly feed the network with 3 successive images of a timelapse at a time. This idea is, in itself, interesting because no modification of the original architecture had to be carried out. The latest 10-channel model (U-NetCellDivision10), which includes more channels for better performance, required minimal modification to the original U-Net architecture but also simultaneous imaging of cadherin in addition to histone markers, which may not be a generic solution.

      Since CNN-based methods accept only fixed-size vectors (fixed image size and fixed channel number) as input (and output), the length or time resolution of the extracted sequences should not vary from one experience to another. As such, the method proposed here may lack generalization capabilities, as it would have to be retrained for each experiment with a slightly different temporal resolution. The paper should have compared results with slightly different temporal resolutions to assess its inference robustness toward fluctuations in division speed.

      Another approach (not discussed) consists in directly convolving several temporal frames using a 3D CNN (2D+time) instead of a 2D, in order to detect a temporal event. Such an idea shares some similarities with the proposed approach, although in this previous work (Ji et al. TPAMI 2012 and for split detection Nie et al. CCVPR 2016) convolution is performed spatio-temporally, which may present advantages. How does the authors' method compare to such an (also very simple) approach?

      Major point 2: innovatory nature of the proposed method.

      The authors' idea of exploiting existing channels in the input vector to feed successive frames is interesting, but the natural choice in deep learning for manipulating time series is to use recurrent networks or their newer and more stable variants (LSTM, GRU, attention networks, or transformers). Several papers exploiting such approaches have been proposed for the mitotic division detection task, but they are not mentioned or discussed in this manuscript: Phan et al. 2018, Mao et al. 2019, Kitrungrotaskul et al. 2019, She et al 2020.

      An obvious advantage of an LSTM architecture combined with CNN is that it is able to address variable length inputs, therefore time sequences of different lengths, whereas a CNN alone can only be fed with an input of fixed size.

      Another advantage of some of these approaches is that they rely on unsupervised learning, which can avoid the tedious relabeling of data (Phan et al. 2018, Gilad et al. 2019).

      References :<br /> Ji, S., Xu, W., Yang, M., & Yu, K. (2012). 3D convolutional neural networks for human action recognition. IEEE transactions on pattern analysis and machine intelligence, 35(1), 221-231. >6000 citations

      Nie, W. Z., Li, W. H., Liu, A. A., Hao, T., & Su, Y. T. (2016). 3D convolutional networks-based mitotic event detection in time-lapse phase contrast microscopy image sequences of stem cell populations. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (pp. 55-62).

      Phan, H. T. H., Kumar, A., Feng, D., Fulham, M., & Kim, J. (2018). Unsupervised two-path neural network for cell event detection and classification using spatiotemporal patterns. IEEE Transactions on Medical Imaging, 38(6), 1477-1487.

      Gilad, T., Reyes, J., Chen, J. Y., Lahav, G., & Riklin Raviv, T. (2019). Fully unsupervised symmetry-based mitosis detection in time-lapse cell microscopy. Bioinformatics, 35(15), 2644-2653.

      Mao, Y., Han, L., & Yin, Z. (2019). Cell mitosis event analysis in phase contrast microscopy images using deep learning. Medical image analysis, 57, 32-43.

      Kitrungrotsakul, T., Han, X. H., Iwamoto, Y., Takemoto, S., Yokota, H., Ipponjima, S., ... & Chen, Y. W. (2019). A cascade of 2.5 D CNN and bidirectional CLSTM network for mitotic cell detection in 4D microscopy image. IEEE/ACM transactions on computational biology and bioinformatics, 18(2), 396-404.

      Shi, J., Xin, Y., Xu, B., Lu, M., & Cong, J. (2020, November). A Deep Framework for Cell Mitosis Detection in Microscopy Images. In 2020 16th International Conference on Computational Intelligence and Security (CIS) (pp. 100-103). IEEE.

      Wolny, A., Cerrone, L., Vijayan, A., Tofanelli, R., Barro, A. V., Louveaux, M., ... & Kreshuk, A. (2020). Accurate and versatile 3D segmentation of plant tissues at cellular resolution. Elife, 9, e57613.

    1. Reviewer #1 (Public Review):

      Summary:<br /> In this paper, Chen et al. identified a role for the circadian photoreceptor CRYPTOCHROME (cry) in promoting wakefulness under short photoperiods. This research is potentially important as hypersomnolence is often seen in patients suffering from SAD during winter times. The mechanisms underlying these sleep effects are poorly known.

      Strengths:<br /> The authors clearly demonstrated that mutations in cry lead to elevated sleep under 4:20 Light-Dark (LD) cycles. Furthermore, using RNAi, they identified GABAergic neurons as a primary site of cry action to promote wakefulness under short photoperiods. They then provide genetic and pharmacological evidence demonstrating that cry acts on GABAergic transmission to modulate sleep under such conditions.

      Weaknesses:<br /> The authors then went on to identify the neuronal location of this cry action on sleep. This is where this reviewer is much more circumspect about the data provided. The authors hypothesize that the l-LNvs which are known to be arousal-promoting may be involved in the phenotypes they are observing. To investigate this, they undertook several imaging and genetic experiments.

      Major concerns:<br /> 1. Figure 2 A-B: The authors show that knocking down cry expression in GABAergic neurons mimics the sleep increase seen in cryb mutants under short photoperiod. However, they do not provide any other sleep parameters such as sleep bout numbers, sleep bout duration, and more importantly waking activity measurements. This is an essential parameter that is needed to rule out paralysis and/or motor defects as the cause of increased "sleep". Any experiments looking at sleep need to include these parameters.

      2. For all Figures displaying immunostaining and imaging data the resolution of the images is quite poor. This makes it difficult to assess whether the authors' conclusions are supported by the data or not.

      3. In Figure 4-S1A it appears that the syt-GFP signal driven by Gad1-GAL4 is colabeling the l-LNvs. This would imply that the l-LNvs are GABAergic. The authors suggest that this experiment suggests that l-LNvs receive input from GABAergic neurons. I am not sure the data presented support this.

      4. In Figure 4-S1B. The GRASP experiment is not very convincing. The resolution of the image is quite poor. In addition, the authors used Pdf-LexA to express the post t-GRASP construct in l-LNvs, but Pdf-LexA also labels the s-LNvs, so it is possible that the GRASP signal the authors observe is coming from the s-LNvs and not the l-LNvs. The authors could use a l-LNvs specific tool to do this experiment and remove any doubts. Altogether this reviewer is not convinced that the data presented supports the conclusion "All in all, these results demonstrate that GABAergic neurons project to the l-LNvs and form synaptic connections." (Line 176). In addition, the authors could have downregulated the expression of Rdl specifically in l-LNvs to support their conclusions. The data they are providing supports a role for RDL but does not prove that RDL is involved in l-LNvs.

      5. In Figures 4 A and C: it appears that GABA is expressed in the l-LNvs. Is this correct? Can the authors clarify this? Maybe the authors could do an experiment where they co-label using Gad1-GAL4 and Pdf-LexA to clearly demonstrate that l-LNvs are not GABAergic. Also, the choice of colors could be better. It is very difficult to see what GABA is and what is PDF.

      6. Figure 4G: Pdf-GAL4 expresses in both s-LNvs and l-LNvs. So, in this experiment, the authors are silencing both groups, not only the l-LNvs. Why not use a l-LNvs specific tool?

      7. Figure 4H-I: The C929-GAL4 driver expresses in many peptidergic neurons. This makes the interpretation of these data difficult. The effects could be due to peptidergic cells being different than the l-LNvs. Why not use a more specific l-LNvs specific tool? I am also confused as to why some experiments used Pdf-GAL4 and some others used C929-GAL4 in a view to specifically manipulate l-LNvs? This is confusing since both drivers are not specific to the l-LNvs.

      8. Figure 5-S1B: Why does the pdf-GAL80 construct not block the sleep increase seen when reducing expression of cry in Gad1-GAL4 neurons? This suggests that there are GABAergic neurons that are not PDF expressing involved in the cry-mediated effect on sleep under short photoperiods.

      In conclusion, it is not clear that the authors demonstrated that they are looking at a cry-mediated effect on GABA in s-LNvs resulting in a modulation of the activity of the l-LNvs. Better images and more-suited genetic experiments could be used to address this.

    1. 'Hello' at: 1 put: $B; at: 2 put: $e; at: 3 put: $l; at: 4 put: $l; at: 5 put: $e; yourself

      smalltalk 'Hello' at: 1 put: $B; at: 5 put: $e; yourself

  7. inst-fs-iad-prod.inscloudgate.net inst-fs-iad-prod.inscloudgate.net
    1. EN G L I S H G RAMMAR4 .£9 3 }H O W T O T E A C H I T ;D ESIGN ED AS A TEX T-B OOK F OR COMMON SC HO O LS, AN DF OR T H E P R I MARY , IN TERMED IATE, AN D G RAMMARD EPARTMEN TS OF GRAD ED SCHOO LS.H E N RY BOLTWOOD , A.M. ,MAS TE R O F P RI NC E TON H I G H S C HO O L .C H I C A G O

      metadata 3

    1. Reviewer #1 (Public Review):

      Summary:<br /> This work is an extension of the authors' earlier work published in Sci Adv in 2001, wherein the authors showed that DTD2 deacylates N-ethyl-D-aminoacyl-tRNAs arising from acetaldehyde toxicity. The authors in this study, investigate the role of archaeal/plant DTD2 in the deacylation/detoxification of D-Tyr-tRNATyr modified by multiple other aldehydes and methylglyoxal (produced by plants). Importantly, the authors take their biochemical observations to plants, to show that deletion of DTD2 gene from a model plant (Arabidopsis thaliana) makes them sensitive to the aldehyde supplementation in the media especially in the presence of D-Tyr. These conclusions are further supported by the observation that the model plant shows increased tolerance to the aldehyde stress when DTD2 is overproduced from the CaMV 35S promoter. The authors propose a model for the role of DTD2 in the evolution of land plants. Finally, the authors suggest that the transgenic crops carrying DTD2 may offer a strategy for stress-tolerant crop development. Overall, the authors present a convincing story, and the data are supportive of the central theme of the story.

      Strengths:<br /> Data are novel and they provide a new perspective on the role of DTD2, and propose possible use of the DTD2 lines in crop improvement.

      Weaknesses:<br /> (a) Data obtained from a single aminoacyl-tRNA (D-Tyr-tRNATyr) have been generalized to imply that what is relevant to this model substrate is true for all other D-aa-tRNAs (term modified aa-tRNAs has been used synonymously with the modified Tyr-tRNATyr). This is not a risk-free extrapolation. For example, the authors see that DTD2 removes modified D-Tyr from tRNATyr in a chain-length dependent manner of the modifier. Why do the authors believe that the length of the amino acid side chain will not matter in the activity of DTD2?<br /> (b) While the use of EFTu supports that the ternary complex formation by the elongation factor can resist modifications of L-Tyr-tRNATyr by the aldehydes or other agents, in the context of the present work on the role of DTD2 in plants, one would want to see the data using eEF1alpha. This is particularly relevant because there are likely to be differences in the way EFTu and eEF1alpha may protect aminoacyl-tRNAs (for example see description in the latter half of the article by Wolfson and Knight 2005, FEBS Letters 579, 3467-3472).

    1. Author Response

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public Review):

      Trebino et al. investigated the BRAF activation process by analysing the interactions of BRAF N-terminal regulatory regions (CRD, RBD, and BSR) with the C-terminal kinase domain and with the upstream regulators HRAS and KRAS. To this end, they generated four constructs comprising different combinations of N-terminal domains of BRAF and analysed their interaction with HRAS as well as conformational changes that occur. By HDX-MS they confirmed that the RBD is indeed the main mediator of interaction with HRAS. Moreover, they observed that HRAS binding leads to conformational changes exposing the BSR to the environment. Next, the authors used OpenSPR to determine the binding affinities of HRAS to the different BRAF constructs. While BSR+RBD, RBD+CRD, and RBD bound HRAS with nanomolar affinity, no binding was observed with the construct comprising all three domains. Based on these experiments, the authors concluded that BSR and CRD negatively regulate binding to HRAS and hypothesised that BSR may confer some RAS isoform specificity. They corroborated this notion by showing that KRAS bound to BRAF-NT1 (BSR+RBD+CRD) while HRAS did not. Next, the authors analysed the autoinhibitory interaction occurring between the N-terminal regions and the kinase domain. Through pulldown and OpenSPR experiments, they confirm that it is mainly the CRD that makes the necessary contacts with the kinase domain. In addition, they show that the BSR stabilizes these interactions and that the addition of HRAS abolishes them. Finally, the D594G mutation within the KD of BRAF is shown to destabilise these autoinhibitory interactions, which could explain its oncogenic potential.

      Overall, the in vitro study provides new insights into the regulation of BRAF and its interactions with HRAS and KRAS through a comprehensive in vitro analysis of the BRAF N-terminal region. Also, the authors report the first KD values for the N- and C-terminal interactions of BRAF and show that the BSR might provide isoform specificity towards KRAS. While these findings could be useful for the development of a new generation of inhibitors, the overall impact of the manuscript could probably be enhanced if the authors were to investigate in more detail how the BSR-mediated specificity of BRAF towards certain RAS isoforms is achieved. Moreover, though the very "clean" in vitro approach is appreciated, it also seems useful to examine whether the observed interactions and conformational changes occur in the full-length BRAF molecule and in more physiological contexts. Some of the results could be compared with studies including full-length constructs.

      Public Response: We would like to express our gratitude for your valuable feedback on our manuscript. Your insightful suggestions have significantly improved the quality and completeness of our research. In response to your comments, we have conducted additional experiments and incorporated new data into the revised manuscript.

      To gain a deeper understanding of how the BSR-mediated specificity of BRAF towards certain RAS isoforms is achieved, we performed HDX-MS to investigate the impact of KRAS interactions on the BSR. Our findings indicate that when KRAS is bound to BRAF NT2, there is no significant difference in hydrogen-deuterium exchange rates in the BSR compared to the apo-NT2 state (Figure 4). This observation contrasts with the effect of HRAS binding, where peptides from the BRAF-BSR exhibit an increased rate change, suggesting that HRAS induces a conformationally more dynamic state (Figure 2).

      Our results align with the conclusions of Terrell et al. in their 2019 publication, which propose that isoform preferences in the RAS-RAF interaction are driven by opposite charge attractions between BRAF-BSR and KRAS-HVR, promoting the interaction.1 Our data offers a potential mechanistic explanation, suggesting that HRAS disrupts the conformational stability of the BSR provided by the RBD, while KRAS-HVR restores stability and enhances interaction favorability. It is important to note that our results do not directly confirm a long-lasting interaction between the BRAF-BSR and KRAS-HVR, but they do not rule out the possibility of a transient, low-affinity interaction or close proximity between the two.

      Furthermore, our binding kinetics measurements conducted using OpenSPR support these findings. Particularly, in the case of NT1, when the CRD accompanies the BSR and RBD, no interactions with HRAS were observed. Additionally, we quantified the binding affinities between NT3:KRAS and NT4:KRAS, demonstrating that they are equally strong and that the presence of the BSR or CRD does not singularly affect the primary RBD interaction, consistent with HRAS. The BSR appears to exert an inhibitory effect on HRAS when the entire N-terminal region (BSR+RBD+CRD) is present. The BSR-mediated specificity is achieved through a coordinated interplay with the CRD.

      Moreover, we have addressed your concern regarding the physiological relevance of our conclusions. In response, we utilized active, full-length (FL) BRAF purified from HEK293F cells in OpenSPR experiments. Our findings indicate that FL-BRAF behaves similarly to BRAF-NT1, as it does not bind to HRAS but binds to KRAS with a deviation comparable to NT1. We have demonstrated that post-translational modifications or native intramolecular interactions do not alter our initial results. Several literature sources, employing cell systems or expressing proteins from insect or mammalian cells, further support the findings presented in our study.2–5

      Thank you once again for your constructive feedback, which has contributed significantly to the refinement of our work.

      For the author:

      Major points:

      1. Figure 1D: Negative control is missing.

      Response: We have incorporated the negative control into this figure as suggested.

      1. Figure 3F and G: negative controls (GST only) are missing.

      Response: We have incorporated the negative control into this figure as suggested.

      1. The authors demonstrate that BRAF NT1 (BSR+RBD+CRD) interacts with KRAS but not HRAS in SPR experiments (Figure 4). What about the conformational change that affects the positioning of BSR when NT2 (BSR+RBD) binds to HRAS (Figure 2)? Does it also occur with KRAS or not? When a rate change is observed between free protein and bound protein in HDX, particularly when this rate change results in a sigmoidal curve that closely parallels the reference curve, it signifies that all residues within the peptide share a uniform protection factor. This suggests that they collectively undergo conformational changes at the same rate, likely due to a concerted opening as a cohesive unit. In the context of our time plots, we observe this distinctive characteristic in the curves derived from the BSR peptides, indicating that HRAS binding perturbs this region, alters its flexibility, and induces a coordinated conformational shift. This compelling evidence strongly supports our assertion that HRAS instigates a reorientation of the BSR.

      Response: In response to the reviewer's comments, we conducted additional experiments to explore whether KRAS elicits any comparable alterations in the H-D exchange of the BSR within BRAF-NT2. Our findings indicate that KRAS does not induce a similar conformational change in the BSR. We have detailed these results in the Results section under the heading "BSR Differentiates the BRAF-KRAS Interaction from the BRAF-HRAS Interaction" and have included corresponding panels in Figure 4 to visually illustrate these observations.

      1. Related to point 3: The authors mention that the HVR domain is responsible for isoform-specific differences. Does the BSR interact with the HVR domain of KRAS (but not HRAS)?

      Response: It has been suggested by Terrell and colleagues1 that the BRAF-BSR and KRASHVR are directly responsible for the isoform specific interactions. We have no direct evidence confirming an interaction between the HVR and BSR. However, we deduce the possibility of such interaction based on previous research findings. Our HDX-MS experiments have demonstrated that the BRAF-BSR does not engage with HRAS. In our new HDX-MS experiments involving KRAS, we observed that the presence of KRAS does not lead to any discernible increase or decrease in the rate of deuterium exchange within the BRAF-BSR. It is important to emphasize that the absence of a rate change does not necessarily negate the occurrence of binding; rather, it might indicate a transient interaction with an affinity level below the detection threshold of HDX-MS.

      Given that the only major difference between H- and K-RAS isoforms is the HVR, we hypothesize that binding differences between BRAF and RAS isoforms can be attributed to the HVR. Notably, BRAF-NT3 resembles CRAF, which also behaves in line with the findings from Terrell et al. in which the BSR is not present to impact RAS-RAF association. We have updated some of the discussion section to include the new results and draw relevant conclusion.

      We mention in the text in the results section, “The HVR is an important region for regulating RAS isoform differences, like membrane anchoring, localization, RAS dimerization, and RAF interactions6… These results, combined with HDX-MS results, which showed that the BSR is exposed when bound to HRAS, suggest that the electrostatic forces surrounding the BSR promote BRAF autoinhibition and the specificity of RAF-RAS interactions.”

      We also write in the discussion, “However, BRET assays suggest that CRAF does not show preference for either H- or KRAS, while BRAF appears to prefer KRAS.1 This preference is suggested to result from the potential favorable interactions between the negatively charged BSR of BRAF and the positively charged, poly-lysine region of the HVR of KRAS1… Our binding data provide additional examples of isoform-specific activity. We speculate that diminished BRAF-NT1 binding to HRAS and increased BSR exposure upon HRAS binding may be due to electrostatic repulsion between HRAS and the BSR. Our full-length KRAS and its interaction with NT1 support the hypothesis that the BSR attenuates fast binding to HRAS but not to KRAS.”

      1. The authors might consider including NRAS in their study to give more weight to this interesting aspect.

      Response: While this suggestion is intriguing and could contribute to the expanding body of literature on RAS signaling, particularly in the context of NRAS-mutant tumors, we believe that delving into this topic would be beyond the scope of the present manuscript.

      1. Figure 6A: In this pulldown experiment the authors wish to demonstrate that binding of HRAS abolishes the autoinhibitory binding between NT1 and the kinase domain. However, the experimental design (i.e., pulldown of RAS) does not allow us to assess whether NT1 and KD are bound to each other in these conditions at all. The authors should rather pull down the KD and show that the interaction with NT1 is abolished when RAS is added.

      Response: We appreciate your suggestion. The experimental design for this study was intentionally structured to focus on the specific subset of NT1 that interacts with HRAS. The BRAF N-terminal region has the capacity to bind both HRAS and KD, resulting in two distinct populations within BRAF-NT1: NT1:KD and NT1:HRAS, although we believe the ratio between those two populations is not 1:1. If we were to design the experiment by isolating either the KD or NT1, it would lead to the observation of both populations simultaneously, making it challenging to distinguish between them. Our pulldown experiments are performed under the same conditions (i.e. all the proteins were maintained in a molar ratio of 1:1 and exposed to the same buffer components), and we rely on pulldown assays, such as those depicted in Figure 5, to clearly demonstrate the binding interactions between NT1 and KD.

      1. The authors have chosen a purely in vitro approach for their interaction studies, which initially makes sense for the addressed questions. However, since the BRAF constructs studied are only fragments and neither BRAF nor K/HRAS has any posttranslational modifications, the question arises to what extent the findings obtained hold up in vivo. Therefore, the manuscript would greatly benefit from monitoring the described interactions in full-length proteins and in cells or at least with proteins purified from cells.

      Response: Thank you for your valuable suggestion, which we take very seriously to enhance the quality of our manuscript. Upon carefully reviewing your comments, we conducted additional experiments involving full-length, wild-type BRAF (FL-BRAF) that was purified from mammalian cells, encompassing the post-translational modifications and scaffolding proteins such as 14-3-3 (Supplementary Fig 8A). We have incorporated the findings from these OpenSPR experiments into the revised manuscript within the Results Section titled "BSR Differentiates the BRAF-KRAS Interaction from the BRAFHRAS Interaction" and Figure 4. In summary, our results with FL-BRAF affirm the extension of our initial observations. Both NT1 and FL-BRAF interact with KRAS with comparable affinities, and neither NT1 nor FL-BRAF demonstrates an interaction with HRAS using OpenSPR. These results underscore that BRAF fragments accurately represent active, fully processed BRAF, lending support to our in vitro approach.

      Moreover, the conserved interactions we report in this manuscript are supported by literature. The interaction between RAF-RBD and RAS has been extensively documented, spanning investigations conducted in both insect and mammalian cell lines. For instance, Tran et al. (2021) utilized mammalian expression systems to explore the role of RBD in mediating BRAF activation through RAS interaction, identifying the same binding surfaces that we highlighted using HDX-MS.2 They quantified the KRAS-CRAF interaction yielding binding affinities in the low nanomolar range, similar to our findings for BRAF-NT:KRAS OpenSPR.2 In the manuscript text, we compared the binding affinity of BRAF residues 1245 purified from insect cells3 to our BRAF 1-227 (NT2 from E. coli), noting that the published value falls within the standard deviation of our experimental value. Additionally, our results align with the autoinhibited FL-BRAF:MEK:14-3-3 structure, which was expressed in Sf9 insect cells and reveals the central role of the CRD in maintaining autoinhibition through interactions with KD.4 In 2005, Tran and colleagues revealed specific domains within the BRAF N-terminal region are involved in binding to KD through Co-IP experiments conducted in mammalian cells.5

      While we are fully aware of the limitations of taking a purely in vitro approach to study the role of BRAF regulatory domains in RAS-RAF interactions and autoinhibition, as well as to quantify the affinity of these interactions, we emphasize that this approach enables us to dissect and examine the specific regions of RAF that are under investigation. As we write in the manuscript: “Our in vitro studies were conducted using proteins purified from E. coli, which lack the membrane, post-translational modifications, and regulatory, scaffolding, or chaperone proteins that are involved in BRAF regulation. Nonetheless, our study provides a direct characterization of the intra- and inter-molecular protein-protein interactions involved in BRAF regulation, without the complications that arise in cell-based assays.” We have added the following comment to clarify the advantages of our in vitro approach and the challenges associated with cell-based assays: “… without the complications and false-positives that can arise in cell-based assays, which often cannot distinguish between proximity and biochemical interactions.”

      Once again, we appreciate your insight feedback, which has contributed significantly to the improvement of our manuscript.

      Minor:

      1. Page 7, paragraph 2, line 6: It should probably read "BRAF autoinhibition" not "BRAF autoinhibitory".

      Response: Thank you for bringing this to our attention. We have fixed this typo.

      1. Figure 3G: In the first lane (time point 0 min) there is no input band for His/MBP-NT1. Probably a mistake when cropping the image from the original photo.

      Response: We sincerely appreciate your diligence in identifying cropping errors, and we have taken comprehensive measures to review the manuscript and correct any such errors. Regarding this specific figure, it is important to note that NT1 was not added at the "0" minute time point, which explains the absence of an input band at that stage. To avoid any confusion, we have revised the notation from "0" to "-" for clarity.

      Reviewer #2 (Public Review):

      In the manuscript entitled 'Unveiling the Domain-Specific and RAS Isoform-Specific Details of BRAF Regulation', the authors conduct a series of in vitro experiments using Nterminal and C-terminal BRAF fragments (SPR, HDX-MS, pull-down assays) to interrogate BRAF domain-specific autoinhibitory interactions and engagement by H- and KRAS GTPases. Of the three RAF isoforms, BRAF contains an extended N-terminal domain that has yet to be detected in X-ray and cryoEM reconstructions but has been proposed to interact with the KRAS hypervariable region. The investigators probe binding interactions between 4 N-terminal (NT) BRAF fragments (containing one more NT domain (BRS, RBD, and CRD)), with full-length bacterial expressed HRAS, KRAS as well as two BRAF C-terminal kinase fragments to tease out the underlying contribution of domainspecific binding events. They find, consistent with previous studies, that the BRAF BSR domain may negatively regulate RAS binding and propose that the presence of the BSR domain in BRAF provides an additional layer of autoinhibitory constraints that mediate BRAF activity in a RAS-isoform-specific manner. One of the fragments studied contains an oncogenic mutation in the kinase domain (BRAF-KDD594G). The investigators find that this mutant shows reduced interactions with an N-terminal regulatory fragment and postulate that this oncogenic BRAF mutant may promote BRAF activation by weakening autoinhibitory interactions between the N- and C-terminus.

      While this manuscript sheds light on B-RAF specific autoinhibitory interactions and the identification and partial characterization of an oncogenic kinase domain (KD) mutant, several concerns exist with the vitro binding studies as they are performed using taggedisolated bacterial expressed fragments, 'dimerized' RAS constructs, lack of relevant citations, controls, comparisons and data/error analysis. Detailed concerns are listed below.

      1. Bacterial-expressed truncated BRAF constructs are used to dissect the role of individual domains in BRAF autoinhibition. Concerns exist regarding the possibility that bacterial expression of isolated domains or regions of BRAF could miss important posttranslational modifications, intra-molecular interactions, or conformational changes that may occur in the context of the full-length protein in mammalian cells. This concern is not addressed in the manuscript.

      Response: Reviewer 1 raised a similar concern, and we have duplicated our response below for your reference:

      Thank you for your valuable suggestion, which we take very seriously to enhance the quality of our manuscript. Upon carefully reviewing your comments, we conducted additional experiments involving full-length, wild-type BRAF (FL-BRAF) that was purified from mammalian cells, encompassing the post-translational modifications and scaffolding proteins such as 14-3-3 (Supplementary Fig 8A). We have incorporated the findings from these OpenSPR experiments into the revised manuscript within the Results Section titled "BSR Differentiates the BRAF-KRAS Interaction from the BRAF-HRAS Interaction" and Figure 4. In summary, our results with FL-BRAF affirm the extension of our initial observations. Both NT1 and FL-BRAF interact with KRAS with comparable affinities, and neither NT1 nor FL-BRAF demonstrates an interaction with HRAS using OpenSPR. These results underscore that BRAF fragments accurately represent active, fully processed BRAF, lending support to our in vitro approach.

      Moreover, the conserved interactions we report in this manuscript are supported by literature. The interaction between RAF-RBD and RAS has been extensively documented, spanning investigations conducted in both insect and mammalian cell lines. For instance, Tran et al. (2021) utilized mammalian expression systems to explore the role of RBD in mediating BRAF activation through RAS interaction, identifying the same binding surfaces that we highlighted using HDX-MS.2 They quantified the KRAS-CRAF interaction yielding binding affinities in the low nanomolar range, similar to our findings for BRAF-NT:KRAS OpenSPR.2 In the manuscript text, we compared the binding affinity of BRAF residues 1245 purified from insect cells3 to our BRAF 1-227 (NT2 from E. coli), noting that the published value falls within the standard deviation of our experimental value. Additionally, our results align with the autoinhibited FL-BRAF:MEK:14-3-3 structure, which was expressed in Sf9 insect cells and reveals the central role of the CRD in maintaining autoinhibition through interactions with KD.4 In 2005, Tran and colleagues revealed specific domains within the BRAF N-terminal region are involved in binding to KD through Co-IP experiments conducted in mammalian cells.5

      While we are fully aware of the limitations of taking a purely in vitro approach to study the role of BRAF regulatory domains in RAS-RAF interactions and autoinhibition, as well as to quantify the affinity of these interactions, we emphasize that this approach enables us to dissect and examine the specific regions of RAF that are under investigation. As we write in the manuscript: “Our in vitro studies were conducted using proteins purified from E. coli, which lack the membrane, post-translational modifications, and regulatory, scaffolding, or chaperone proteins that are involved in BRAF regulation. Nonetheless, our study provides a direct characterization of the intra- and inter-molecular protein-protein interactions involved in BRAF regulation, without the complications that arise in cell-based assays.” We have added the following comment to clarify the advantages of our in vitro approach and the challenges associated with cell-based assays: “… without the complications and false-positives that can arise in cell-based assays, which often cannot distinguish between proximity and biochemical interactions.”

      Once again, we appreciate your insight feedback, which has contributed significantly to the improvement of our manuscript.

      1. The experiments employ BRAF NT constructs that retain an MBP tag and RAS proteins with a GST tag. Have the investigators conducted control experiments to verify that the tags do not induce or perturb native interactions?

      Response: Thank you for highlighting this important issue. We have conducted control experiments whenever feasible, particularly in cases where tags were not required for visualization, immobilization, or where cleave sites were present. We have subsequently included these control experiments in the supplementary figures and accompanying text within the manuscript.

      It is essential to note that many of the techniques employed in this manuscript rely on tags, such as immobilizing proteins onto NTA OpenSPR sensors and employing various resins/beads for pulldown assays. Utilizing tags for protein immobilization in OpenSPR applications offers distinct advantages, including homogeneous and site-specific immobilization of the protein, ensuring that binding sites remain accessible for the study of protein-protein interactions (PPIs) of interest. Furthermore, in all BRAF-RAS SPR experiments, the MBP protein serves as the reference channel "blocking" protein. This reference channel is instrumental in mitigating any potential false-positive signals resulting from binding interactions with the MBP protein. Any such signal is subsequently subtracted out during data analysis.

      To provide a comprehensive understanding of these aspects, we have incorporated these details into the manuscript text for clarity:

      “Maltose bind protein (MBP) is immobilized on the OpenSPR reference channel, which accounts for any non-specific binding or impacts to the native PPIs that may result from the presence of tags. Kinetic analysis is performed on the corrected binding curves, which subtracts any response in the reference channel.”

      We describe the control experiment to examine whether His/MBP-tag affects NT1 binding with BRAF-KD: “Similarly, we removed the His/MBP-tag from BRAF-NT1 through a TEV protease cleavage reaction and flowed over untagged NT1. Kinetic analysis confirmed that the interaction is preserved with the KD=13 nM (Supplemental Figure 6F).”

      We show that the GST-tag does not affect KRAS interactions with NTs in supplemental figure 6. We purified full-length, His/MBP-KRAS and subsequently removed the tag through TEV cleavage. BRAF-NT interactions are preserved with untagged KRAS. GST alone, also does not interact with BRAF-NTs. We updated the text in the results section “BSR differentiates the BRAF-KRAS interaction from the BRAF-HRAS interaction.”

      Additionally, Vojtek and colleagues used the same fusion-protein combinations (GSTRAS and MBP-RAF) in pulldown experiments and also found no perturbations from these tags.8

      1. The investigators state that the GST tag on the RAS constructs was used to promote RAS dimerization, as RAS dimerization is proposed to be key for RAF activation. However, recent findings argue against the role of RAS dimers in RAF dimerization and activation (Simanshu et al, Mol. Cell 2023). Moreover, while GST can dimerize, it is unclear whether this promotes RAS dimerization as suggested. In methods for the OpenSPR experiments probing NT BRAF:RAS interactions, it is stated that "monomeric KRAS was flowed...". This terminology is a bit confusing. How was the monomeric state of KRAS determined and what was the rationale behind the experiment? Is there a difference in binding interactions between "monomeric vs dimeric KRAS"?

      Response: Thank you for conducting such a comprehensive review of our manuscript and for identifying the mention of "monomeric KRAS" in the experimental section, which was inadvertently included and should not have been present. This terminology originally referred to a series of experiments involving "monomeric" KRAS that were initially considered for inclusion in the main body of the manuscript but were subsequently removed before submission. Furthermore, we adjusted the terminology to prevent any confusion or unwarranted implications.

      To clarify, this "monomeric" construct refers to the tagless, full-length KRAS variant that was confirmed to exist in a monomeric state through Size Exclusion Chromatography, eluting at a volume equivalent to 21 kDa. We have incorporated the findings from experiments involving this untagged KRAS variant into the supplementary figures to provide supporting evidence, particularly in response to comment #2, that the GST-tag does not interfere with native interactions. Supplementary Figure 1 illustrates that both GST-HRAS (45 kDa) and GST-KRAS (45 kDa) elute as dimers in solution, at approximately 90 kDa. It is important to note that the main text figures primarily feature the GST-tagged, "dimeric" RAS constructs. Our research results do not suggest any significant differences between "monomeric," untagged KRAS and "dimeric" GST-tagged KRAS, indicating that the binding kinetics between RAS and RAF are not influenced by oligomerization state (Supplementary Fig 6). To mitigate any potential confusion, we have made the necessary distinctions in the text and have revised the methods description to accurately reflect these aspects.

      While the recent findings summarized by Simanshu and colleagues were published concurrently with our manuscript submission, we would like to address this comment in the following manner. The authors assert that RAS does not engage in dimerization through the G domain, a hypothesis that contrasts with certain prior research findings. Instead, they propose that the plasma membrane plays a pivotal role in the clustering of RAS. Furthermore, the authors mention the involvement of RAS "dimerization" in RAF dimerization and activation in the subsequent statements:

      “Recruitment of two RAF proteins by RAS proteins in close proximity facilitate RAF activation but are not required for RAF dimerization.”

      “However, the PM recruitment of two RAF proteins by two non-dimerized but co- localized RAS proteins would serve equally well to promote RAF dimerization. Moreover, recent work on the activation cycle of RAF dimers (ref 20–23) argues strongly against a role for RAS dimers while revealing regulation by the 14-3-3 and SHOC2-MRAS- PP1C complexes. (Ref 24)”

      The primary focus of our study centers on elucidating the intricate details of the RAS-RAF interaction and the mechanisms underlying RAF autoinhibition, rather than emphasizing RAF dimerization as the sole pathway to RAF activation. It is important to recognize that RAF activation encompasses multiple steps, including RAS-mediated relief of RAF autoinhibition.

      To mimic physiological conditions as closely as possible, we employed a GST-tag on RAS in our experiments. It's worth noting that GST has a dimerization property,9 which brings RAS molecules into close proximity to one another, effectively emulating conditions akin to the plasma membrane. Our primary objective is not solely to facilitate interactions by bringing RAS into close proximity. Instead, our aim is to replicate cellular conditions to the greatest extent feasible, especially within the predominantly in vitro framework of our studies. Furthermore, we have revised the sentence pertaining to HRAS as follows: “As verified by size exclusion chromatography (Supplementary Fig 1A), the GST-tag dimerizes and forces HRAS into close proximity to recapitulate physiological conditions. (ref. 35)”

      1. The investigators determine binding affinities between GST-HRAS and NT BRAF domains (NT2 7.5 {plus minus} 3.5; NT3 22 {plus minus} 11 nM) by SPR, and propose that the BRS domain has an inhibitory role HRAS interactions with the RAF NT. However, it is unclear whether these differences are statistically meaningful given the error.

      Response: Thank you for bringing up this matter for further discussion. We are fully aware that these distinctions (NT2 and NT3), considering the overlapping error, lack statistical significance. Our conclusion points toward the most notable differences occurring when comparing NT1 to either NT2 or NT3, highlighting that the presence of the BSR has an inhibitory effect, particularly when the CRD is also present. It's important to note that we did not directly compare NT2 and NT3 to each other. Our comparison primarily elucidates that BSR without the CRD, and conversely, CRD without the BSR, do not exhibit the inhibitory effect. This collective evidence leads to the conclusion that all three domains collaboratively play a role in negatively regulating BRAF against HRAS.

      1. It is unclear why NT1 (BSR+RBD+CRD) was not included in the HDX experiments, which makes it challenging to directly compare and determine specific contributions of each domain in the presence of HRAS. Including NT1 in the experimental design could provide a more comprehensive understanding of the interplay between the domains and their respective roles in the HRAS-BRAF interaction. Further, excluding certain domains from the constructs, such as the BSR or CRD, may overlook potential domain-domain interactions and their influence on the conformational changes induced by HRAS binding.

      Response: We acknowledge that incorporating NT1 into the HDX experiments would have provided clearer insights into the specific contributions of each domain. Originally, it was our intention to include NT1 in these experiments. Unfortunately, we encountered challenges with the HDX experiments when it came to BRAF-NT1, as it yielded a significantly low sequence coverage after MS/MS analysis. We made multiple attempts to address this issue, which included additional protein purifications involving reducing agents, increasing the concentration of reaction buffer components, and extending the incubation time with reducing agents before injection. Despite these efforts, we were unable to obtain the desired sequence coverage for NT1. Consequently, we switched our approach to analyze NT2 and NT3 as the next best alternative.

      1. The authors perform pulldown experiments with BRAF constructs (NT1: BSR+RBD+CRD, NT2: BSR+RBD, NT3: RBD+CRD, NT4: RBD alone), in which biotinylated BRAF-KD was captured on streptavidin beads and probed for bound His/MBP-tagged BRAF NTs. Western blot results suggest that only NT1 and NT3 bind to the KD (Figure 5). However, performing a pulldown experiment with an additional construct, CRD alone, it would help to determine whether the CRD alone is sufficient for the interaction or if the presence of the RBD is required for higher affinity binding. This additional experiment would strengthen the authors' arguments and provide further insights into the mechanism of BRAF autoinhibition.

      Response: We are grateful for this valuable suggestion, and in response, we have taken the initiative to clone and purify a CRD-only construct (NT5) to strengthen our arguments. Subsequently, we conducted OpenSPR experiments to measure the binding affinity between NT5 and KD. Our findings clearly indicate that the CRD alone is not sufficient to mediate the autoinhibitory interactions and that the presence of the RBD is indeed necessary. These results have been incorporated into Figure 5 and are described within the Results Section for enhanced clarity and support.

      1. While the investigators state that their findings indicate that H- and KRAS differentially interact with BRAF, most of the experiments are focused on HRAS, with only a subset on KRAS. As SPR & pull-down experiments are only conducted on NT1 and NT2, evidence for RAS isoform-specific interactions is weak. It is unclear why parallel experiments were not conducted with KRAS using BRAF NT3 & NT4 constructs.

      Response: We sincerely appreciate your suggestion, which has contributed to enhancing the overall robustness of the evidence regarding isoform-specific differences between H- and K-RAS. In response, we performed additional experiments involving NT3 and NT4. The outcomes of these experiments have been integrated into Figure 4, and we have provided a comprehensive description of these results within the Results section “BSR differentiates the BRAF-KRAS interaction from the BRAF-HRAS interaction” of the manuscript.

      1. The investigators do not cite the AlphaFold prediction of full-length BRAF (AFP15056-F1) or the known X-ray structure of the BRAF BRS domain. Hence, it is unclear how Alpha-Fold is used to gain new structural information, and whether it was used to predict the structure of the N-terminal regulatory or the full-length protein.

      Response: We greatly appreciate the reviewer’s commitment to upholding good scientific practices and ensuring the inclusion of relevant citations in publications. In our original manuscript, we employed the UniProt ID P15056 to reference the specific AlphaFold structure used in our study. This was clarified as follows: "Since the full-length structure of BRAF is still unresolved, we applied the AlphaFold Protein Structure Database for a model of BRAF to display the conformation of the N-terminal domains and the HDX-MS results.40,41” Additionally, we referenced AlphaFold using the two citations recommended on their website (references 35 and 36 in the original manuscript). To prevent any potential confusion in the future, we have incorporated "AF-P15056-F1," as suggested.

      We are sorry for any misunderstanding that may have arisen regarding the use of AlphaFold for gaining new structural insights. Our sole intention was to utilize AlphaFold as a tool for modeling HDX, as a full-length structure of BRAF, encompassing the entire N-terminal domain, remains unavailable. We have taken steps to clarify our objectives in the manuscript to ensure the purpose of our AlphaFold utilization is unambiguous.

      Furthermore, we wish to emphasize that our utilization of AlphaFold was never intended to exclude the known X-ray structure of the BRAF-BSR domain. In our revised text, we have added clarity to our purposes and cited the Lavoie et al. Nature publication from 2018, which provides alignment between the X-ray structure and the AlphaFold model, thereby enhancing the confidence in the latter.

      1. In HDX-MS experiments, it is unclear how the authors determine whether small differences in deuterium uptake observed for some of the peptide fragments are statistically significant, and why for some of the labeling reaction times the investigators state " {plus minus} HRAS only" for only 3 time points?

      Response: First, in reference to the question about " ‘{plus minus} HRAS only’ for only 3 time points,” we write:

      “Both constructs were incubated with and without GMPPNP-HRAS in D2O buffer for set labeling reaction times (NT3: 2 sec [NT3 ± HRAS only], 6 sec [NT3 ± HRAS only], 20 sec, 30 sec [NT3 ± HRAS only], 60 sec, 5 min, 10 min, 30 min, 90 min, 4.5 h, 15 h, and 24 h)...”

      We realize how this can be confusing. To avoid such confusion, we fixed the text to read instead:<br /> “Both constructs were incubated with and without GMPPNP-HRAS in D2O buffer for set labeling reaction times (NT3: 2 sec, 6 sec, 20 sec, 30 sec, 60 sec, 5 min, 10 min, 30 min, 90 min, 4.5 h, 15 h, 45 h and 24 h at RT; NT2: 20 sec, 60 sec, 5 min, 10 min, 30 min, 90 min, 4.5 h, 15 h, and 24 h at RT)...”

      Next, with regard to assessing significance, we determine it by closely examining a consistent trend in smooth time course plots. To establish this trend, we rely on the presence of more than four overlapping peptides, each with multiple charge states, within a specific sequence range. When we observe multiple peptides showing even a small difference in rate exchange, we can confidently infer that structural changes have taken place. This confidence stems from the inherent reliability and redundancy in the data analysis approach we have employed.11,12 It is noteworthy that our focus is primarily on reporting the binding or no binding, rather than quantifying the magnitude of exchange. As such, conducting multiple replicates or statistical testing is not deemed necessary.13,14 This is true for multiple reasons:

      1) Instead of small deuterium changes (y-axis), we are focusing on the x-axis changes, which provides a slowing factor and how much that H-D exchange rate has changed.

      • In a publication investigating the ideal HDX-MS data set, the author explains, “with the availability of high resolution HDX-MS raw data, it may be the time to shift the data analysis paradigm from determination of centroid values and presentation of deuteration levels to deconvolution of isotope envelopes and presentation of exchange rates.” 15

      • Presentation of data through rate changes provides a physical chemistry measurement, as opposed to a relative measurement with percent deuteration. For example, slowing with a factor of 10 equates to the energy in 1 kCal. By quick visual estimation, we see a slowing factor of about 2 when RAS is bound to the BRAF-RBD.

      • We made some changes to the text to clear up any confusion about measuring D uptake vs rate.

      2) Looking at sigmoidal curves only—the “smooth time course” shows that the timedependent deuterium changes are not random, artifacts, or false positives/negatives. When parallel sigmoidal curves are present, any x-axis change is a measure of H-D exchange. Only plots with a smooth time course are used to make conclusions about BRAF’s conformational changes or binding interfaces.

      3) Wide time range- the extended time also confirms that any observed difference is reliable and accurate. This extended time frame provides coverage for deuteration levels from 0 to 100% for peptides. A smooth time course is present in complete coverage.

      • A narrow time window is a common flaw in HDX-MS studies14,15

      4) The rate change is observed at multiple time points (at least 4 for each peptide), which are all independent reactions, and show reproducibility of change

      5) Many overlapping peptides show the same pattern- the exchange rate difference is observed in at least 4 peptide time plots without contradictory evidence within the sequence range.

      • We included the complete set of peptide time plots in the supplemental materials.

      6) The many other peptide time plots that do not show any difference with and without RAS is a form of reproducibility, that no difference means no difference.

      1. The investigators find that KRAS binds NT1 in SPR experiments, whereas HRAS does not. However, the pull-down assays show NT1 binding to both KRAS and HRAS. SI Fig 5 attributes this to slow association, yet both SPR (on/off rates) and equilibrium binding measurements are conducted. This data should be able to 'tease' out differences in association.

      Response: Thank you for bringing up this important point. It's crucial to note that the experiments conducted at slow flow rates generated low responses, making it challenging to perform kinetic analyses effectively. Consequently, we are unable to provide accurate equilibrium binding measurements (on/off rates) for NT1 and HRAS. Regrettably, comparing the association rates between KRAS and HRAS is not feasible due to the differing flow rates employed. We have addressed this limitation in the manuscript as follows:

      “We therefore immobilized NT1 and flowed over HRAS at a much slower flow rate (5 µL/min), during which we saw minimal but consistent binding (Supplementary Fig 5A). The low response and long timeframe of each injection, however, makes the dissociation constant (KD) unmeasurable and incomparable to our other NT-HRAS OpenSPR results.”

      1. The model in Figure 7B highlights BSR interactions with KRAS, however, BSR interactions with the KRAS HVR (proximal to the membrane) are not shown, as supported by Terrell et al. (2019).

      Response: Thank you for the suggestion. We reoriented the BSR closer to HVR of KRAS rather than G-domain.

      1. The investigators state that 'These findings demonstrate that HRAS binding to BRAF directly relieves BRAF autoinhibition by disrupting the NT1-KD interaction, providing the first in vitro evidence of RAS-mediated relief of RAF autoinhibition, the central dogma of RAS-RAF regulation. However, in Tran et al (2005) JBC, they report pulldown experiments using N-and C-terminal fragments of BRAF and state that 'BRAF also contains an N-terminal autoinhibitory domain and that the interaction of this domain with the catalytic domain was inhibited by binding to active HRAS'. This reference is not cited.

      Response: We appreciate the concern raised regarding our statement. We want to clarify that it was never our intention to disregard this JBC publication, and we apologize for any misunderstanding caused by our phrasing. We recognize that our initial statement was contentious, and we have removed the word "first" from the phrase "first in vitro evidence." In the section of the discussion where we originally cited the Tran et al. (2005) publication, we have revised the language to eliminate "first" and have rephrased the sentence, as provided below:

      “Our in vitro binding studies align with previous implications that RAS relieves RAF autoinhibition shown through cell-based coIP’s.5”

      1. In Fig 2, panels A and C, it is unclear what the grey dotted line in is each plot.

      Response: Thank you for drawing our attention to the additional explanation needed here. The gray dotted lines represent the maximum deuterium exchange. We added the following description to the figure 2 legend:

      “Gray dotted lines represent the theoretical exchange behavior for specified peptide that is fully unstructured (top) or for specified peptide with a uniform protection factor (fraction of time the residue is involved in protecting the H-bond) of 100 (lower).”

      1. In Fig 3, error analysis is not provided for panel E.

      Response: We added the standard deviation values to this panel. We additionally added these for Fig 4C and Fig 5B.

      1. How was RAS GMPPNP loading verified?

      Response: Ras loading is a well-established protocol with a solid foundation in the literature.16– 21 We followed this accepted method for nucleotide exchange. Our controls, as evident in pulldown and OpenSPR experiments (fig 1C, 4E), unequivocally demonstrate that GMPPNPloaded RAS is active, while unloaded RAS is inactive, as evidenced by the absence of no binding. We also added supplemental figure 6E to show that inactive (unloaded) GST-KRAS does not bind to BRAF during OpenSPR analysis. To exemplify this, we included binding curves of 1 µM GST-KRAS- GMPPNP and -GDP flowed over NTA-immobilized BRAF-NT2 at a flow rate of 30 µl/min.

      References

      (1) Terrell, E. M.; Durrant, D. E.; Ritt, D. A.; Sealover, N. E.; Sheffels, E.; Spencer-Smith, R.; Esposito, D.; Zhou, Y.; Hancock, J. F.; Kortum, R. L.; Morrison, D. K. Distinct Binding Preferences between Ras and Raf Family Members and the Impact on Oncogenic Ras Signaling. Mol. Cell 2019, 76 (6), 872-884.e5. https://doi.org/10.1016/j.molcel.2019.09.004.

      (2) Tran, T. H.; Chan, A. H.; Young, L. C.; Bindu, L.; Neale, C.; Messing, S.; Dharmaiah, S.; Taylor, T.; Denson, J. P.; Esposito, D.; Nissley, D. V.; Stephen, A. G.; McCormick, F.; Simanshu, D. K. KRAS Interaction with RAF1 RAS-Binding Domain and Cysteine-Rich Domain Provides Insights into RAS-Mediated RAF Activation. Nat. Commun. 2021, 12 (1176), 1–16. https://doi.org/10.1038/s41467-021-21422-x.

      (3) Fischer, A.; Hekman, M.; Kuhlmann, J.; Rubio, I.; Wiese, S.; Rapp, U. R. B- and C-RAF Display Essential Differences in Their Binding to Ras: The Isotype-Specific N Terminus of B-RAF Facilitates Ras Binding. J. Biol. Chem. 2007, 282 (36), 26503–26516. https://doi.org/10.1074/jbc.M607458200.

      (4) Park, E.; Rawson, S.; Li, K.; Kim, B. W.; Ficarro, S. B.; Pino, G. G. Del; Sharif, H.; Marto, J. A.; Jeon, H.; Eck, M. J. Architecture of Autoinhibited and Active BRAF–MEK1–14-3-3 Complexes. Nature 2019, 575 (7783), 545–550. https://doi.org/10.1038/s41586-0191660-y.

      (5) Tran, N. H.; Wu, X.; Frost, J. A. B-Raf and Raf-1 Are Regulated by Distinct Autoregulatory Mechanisms. J. Biol. Chem. 2005, 280 (16), 16244–16253. https://doi.org/10.1074/jbc.M501185200.

      (6) Prior, I. A.; Hancock, J. F. Ras Trafficking, Localization and Compartmentalized Signalling. Semin. Cell Dev. Biol. 2012, 23 (2), 145–153.

      (7) Herrmann, C.; Martin, G. A.; Wittinghofer, A. Quantitative Analysis of the Complex between P21 and the Ras-Binding Domain of the Human Raf-1 Protein Kinase. J. Biol. Chem. 1995, 270 (7), 2901–2905. https://doi.org/10.1074/jbc.270.7.2901.

      (8) Vojtek, A. B.; Hollenberg, S. M.; Cooper, J. A. Mammalian Ras Interacts Directly with the Serine/Threonine Kinase Raf. Cell 1993, 74 (1), 205–214. https://doi.org/10.1016/00928674(93)90307-C.

      (9) Parker, M. W.; Bello, M. Lo; Federici, G. Crystallization of Glutathione S-Transferase from Human Placenta. J. Mol. Biol. 1990, 213 (2), 221–222. https://doi.org/10.1016/S00222836(05)80183-4.

      (10) Inouye, K.; Mizutani, S.; Koide, H.; Kaziro, Y. Formation of the Ras Dimer Is Essential for Raf-1 Activation. J. Biol. Chem. 2000, 275 (6), 3737–3740. https://doi.org/10.1074/JBC.275.6.3737.

      (11) Z. Y. Kan, X. Ye, J. J. Skinner, L. Mayne, S. W. E. ExMS2: An Integrated Solution for Hydrogen-Deuterium Exchange Mass Spectrometry Data Analysis. Anal Chem 2019, 91 (11), 7474–7481.

      (12) Mayne, L.; Kan, Z. Y.; Sevugan Chetty, P.; Ricciuti, A.; Walters, B. T.; Englander, S. W. Many Overlapping Peptides for Protein Hydrogen Exchange Experiments by the Fragment Separation-Mass Spectrometry Method. J. Am. Soc. Mass Spectrom. 2011, 22 (11), 1898–1905. https://doi.org/10.1007/S13361-011-0235-4.

      (13) Ye, X.; Lin, J.; Mayne, L.; Shorter, J.; Englander, S. W. Hydrogen Exchange Reveals Hsp104 Architecture, Structural Dynamics, and Energetics in Physiological Solution. Proc. Natl. Acad. Sci. 2019, 116 (15), 7333–7342. https://doi.org/10.1073/pnas.1816184116.

      (14) Ye, X.; Lin, J.; Mayne, L.; Shorter, J.; Englander, S. W. Structural and Kinetic Basis for the Regulation and Potentiation of Hsp104 Function. Proc. Natl. Acad. Sci. 2020, 117 (17), 9384–9392. https://doi.org/10.1073/pnas.1921968117.

      (15) Hamuro, Y. Determination of Equine Cytochrome c Backbone Amide Hydrogen/Deuterium Exchange Rates by Mass Spectrometry Using a Wider Time Window and Isotope Envelope. J. Am. Soc. Mass Spectrom. 2017, 28 (3), 486–497. https://doi.org/10.1007/s13361-016-1571-1.

      (16) Herrmann, C.; Horn, G.; Spaargaren, M.; Wittinghofer, A. Differential Interaction of the Ras Family GTP-Binding Proteins H-Ras, Rap1A, and R-Ras with the Putative Effector Molecules Raf Kinase and Ral-Guanine Nucleotide Exchange Factor. J. Biol. Chem. 1996, 271 (12), 6794–6800. https://doi.org/10.1074/jbc.271.12.6794.

      (17) Miller, A. F.; Halkides, C. J.; Redfield, A. G. An NMR Comparison of the Changes Produced by Different Guanosine 5’-Triphosphate Analogs in Wild-Type and Oncogenic Mutant P21ras. Biochemistry 1993, 32 (29), 7367–7376. https://doi.org/10.1021/bi00080a006.

      (18) Amendola, C. R.; Mahaffey, J. P.; Parker, S. J.; Ahearn, I. M.; Chen, W. C.; Zhou, M.; Court, H.; Shi, J.; Mendoza, S. L.; Morten, M. J.; Rothenberg, E.; Gottlieb, E.; Wadghiri, Y. Z.; Possemato, R.; Hubbard, S. R.; Balmain, A.; Kimmelman, A. C.; Philips, M. R. KRAS4A Directly Regulates Hexokinase 1. Nature 2019. https://doi.org/10.1038/s41586019-1832-9.

      (19) John, J.; Sohmen, R.; Feuerstein, J.; Linke, R.; Wittinghofer, A.; Goody, R. S. Kinetics of Interaction of Nucleotides with Nucleotide-Free H-Ras P21. Biochemistry 1990, 29 (25), 6058–6065. https://doi.org/10.1021/bi00477a025.

      (20) Dharmaiah, S.; Tran, T. H.; Messing, S.; Agamasu, C.; Gillette, W. K.; Yan, W.; Waybright, T.; Alexander, P.; Esposito, D.; Nissley, D. V.; McCormick, F.; Stephen, A. G.; Simanshu, D. K. Structures of N-Terminally Processed KRAS Provide Insight into the Role of N-Acetylation. Sci. Reports 2019 91 2019, 9 (1), 1–15. https://doi.org/10.1038/s41598-019-46846-w.

      (21) Rathinaswamy, M. K.; Gaieb, Z.; Fleming, K. D.; Borsari, C.; Harris, N. J.; Moeller, B. E.; Wymann, M. P.; Amaro, R. E.; Burke, J. E. Disease-Related Mutations in PI3Kγ Disrupt Regulatory C-Terminal Dynamics and Reveal a Path to Selective Inhibitors. Elife 2021, 10. https://doi.org/10.7554/eLife.64691.

    1. Author Response

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Recommendations For The Authors):

      My main request is to show the phylogeny in the main text, so the reader knows what nodes are being compared.

      Full phylogeny was added to the main text as Fig. 2. Additionally, phylogenetic tree in Newick format is presented as a Supplementary file 2.

      I also suggest the authors check their figure legends carefully. At least in figure one, I think there is some mix-up with the letter labelling of the panels.

      Our mistake. Figure legend was corrected. In this version of the manuscript Figure 1 was split into Fig. 1 and Fig. 3. Corrected version is presented in the legend to Fig. 3.

      And lastly, I urge the authors to deposit the tree, alignment, and reconstructed sequences in a public repository.

      Alignment in fasta format and phylogenetic tree in Newick format were added as supplementary files to the publication (supplementary file 1 and supplementary file 2, respectively). Reconstructed sequences (both Most likely and AltAll variants) were shown as a figure supplement (Figure 3 – figure supplement 2). Posterior probabilities for all positions of the reconstructed sequences were added as a supplementary file (supplementary file 3).

      Reviewer #2 (Recommendations For The Authors):

      -I find the term "secondarily single sHsp" to be a little confusing, especially because it is often used in relation to IbpA/B, but it is just IbpA in another species. I think it would be more clear for the reader to consistently refer to it as Erwiniaceae IbpA vs Escherichia IbpA, or something similar.

      In the introduction we clarified (page 4 lines 11-13) that the term “secondarily single” IbpA refers to IbpA that lacks partner protein as a result of ibpB gene loss. This is in opposition to “single-protein” IbpA from a clade in which gene duplication leading to creation of two – protein sHsp system did not occur (like Vibrionaceae or Aeromonadaceae) - see Obuchowski et al., 2019.

      -Figure 1B. The labels are not defined. What is L? A and B refer to IbpA and IbpB but this should be made more clear to the reader. Why is this panel only referred to in the Introduction and not the Results? Why is there a second panel for E.amy, rather than including it in the same panel, as for other experiments? What are the error bars? (That goes for every error bar in the paper, none are defined).

      Labels in Fig.1B were corrected; “L” was used in reference to “luciferase alone” and it has been corrected for consistency to “no sHsp”. The sHsps activity measurements (obtained in the same experiment) were split into two separate panels as a correspondence to the two branches of the simplified tree in Fig. 1. The figure was modified to make it clearer and avoid confusion. Definitions of error bars were added to this and other figures.

      -"AncA0 exhibited sequestrase activity on the level comparable to IbpA from Escherichia coli (IbpAE.coli). AncA1 was moderately efficient in this process and IbpA from Erwinia amylovora (IbpAE.amyl) was the least efficient sequestrase (Fig. 1D)." - First, this should be referring to Fig. 1C. Second, the text doesn't quite match the panel. A0 appears to have the strongest sequestrase activity over most concentrations. Can the authors comment on in what concentration range these differences are most meaningful?

      Figure legend was corrected. Descriptions of panels C and D were fixed. Now these data are presented in panels A and B of a new Fig. 3. In our opinion differences in sequestration are most meaningful at lower sHsp concentrations (in this case lower than 5 µM), as with high enough sHsp concentration even less effective sequestrases seem to be able to effectively sequester aggregated proteins. Comment about it was added to the main text (page 5, line 6)

      -"Ancestral proteins' interaction with the aggregated substrates was stronger than in the case of extant E. amylovora IbpA, but weaker than in the case of extant E. coli IbpA (Fig. 1C)." - Is this referring to Fig. 1C, or to the unlabelled panel on the bottom right panel of Fig 1 (that is not referred to in the legend)? Can the authors comment on why they think the 2 ancestral proteins are much more similar to each other than they are to either of the native IbpAs?

      Due to our mistake descriptions of panels C and D were switched.

      Figure 1 was rearranged and split into Figures 1 and 3. Former figure S1 (full phylogeny) was inserted into the main text, as Fig. 2, per request of reviewer #1. Former panel 1D (now 3B) was rearranged, as graph was not apparent to be a part of that panel and looked as if it was unlabeled.

      The fact that the two ancestral proteins are more similar to each other than to the extant E. coli and E. amylovora proteins in their interaction with model substrate might be caused by higher sequence identity between the two ancestral proteins than between ancestral and extant proteins (10 amino acid differences between AncA0 and AncA1 compared to 20 differences between AncA1 and IbpA from E. amylovora or 11 differences between AncA0 and IbpA from E. coli). One also has to remember that this property is only one aspect of sHsp activity – proteins AncA0 and AncA1 are much less similar to each other if other activities such as sequestrase activity are considered. Substrate affinity and sequestrase activity are connected to each other, but there isn’t a strict correlation, as can be seen in the case of free ACD domains, which strongly bind aggregated substrate while effectively lacking sequestrase activity (fig. 5 A, fig. 5 – figure supplement 4 A,B).

      -Figure 1E should have E. coli IbpA and IbpB, by themselves, included for comparison. Strangely, it seems, by comparison to Fig 1B, that the "inhibitory" activity of A0 is not present in the E. coli protein, and the authors should comment on this. Similarly, A1 disaggregation looks like it might not be significantly different than the E. coli protein. Can the authors comment on why disaggregation might be so low in A1 compared to E.amy?

      E. coli IbpA alone was added to Fig. 1E (Fig. 3C in the new version) as suggested.

      AncA1 indeed exhibits similar activity to extant IbpA from E. coli, which, at the conditions of the experiment, does not possess inhibitory effect observed for AncA0. This suggests that:

      -There was an additional increase in ability to stimulate luciferase disaggregation between AncA1 and extant IbpA from E. amylovora

      -There was also an increase of ability to stimulate luciferase refolding between AncA0 and extant E. coli IbpA, albeit to a significantly lesser degree than in the Erwiniaceae branch.

      It is quite likely that after separation of Erwiniaceae and Enterobacteriaceae sHsp systems, they underwent further optimization through evolution. This might have led to observed higher effectiveness of modern IbpAs from both clades in refolding stimulation in comparison to the reconstructed ancestral proteins.

      Despite the above, effects of substitutions on positions 66 and 109 on activities of the extant E. coli and E. amylovora proteins suggests that the two identified positions still play key role in differentiating extant IbpAs from Erwiniaceae and Enterobacteriaceae.

      Nevertheless, additional mutations that lead to increased ability to stimulate luciferase reactivation must have occurred in both Erwiniaceae and Enterobacteriaceae branches of the phylogeny during evolution. These substitutions would be a worthwhile subject of further study.

      -Fig 1D - lizate should be lysate.

      The typo was corrected.

      -What is the bottom right panel in Fig 1? It doesn't seem to be referred to in the legend.

      This panel was intendent to be the part of figure 1D, but it was not clearly visible. This figure was rearranged to make it clearer. Now these data are presented as Fig. 3B.

      -Sequences are provided for the ancestral proteins, but I don't see them anywhere for the alternative ancestral proteins. How similar are the Anc proteins to the AltAlls? If they are very similar, this may not tell us anything about "robustness".

      Sequences of alternative proteins are added as a figure supplement (Fig. 3 - figure supplement 2). Full sequences of ML and alternative ancestors with posterior probabilities for each reconstructed position are presented in supplementary file 3

      The testing of the robustness to statistical uncertainty was intended to test to what extent properties of reconstructed ancestral proteins could be influenced by uncertainty present in a given reconstruction due to probabilistic nature of the process. Relatively high similarity between ML and AltAll sequences would indicate low uncertainty of the reconstruction (most likely due to high conservation during evolution). In such a case similar properties of AltAll and ML proteins would simply indicate that they are robust to the level of uncertainty present in a given reconstruction (which may be low). It would not tell us much about “general” robustness to mutations, but it was not relevant to research questions considered.

      -If the functional gain by IbpA comes down to only two amino acid substitutions, I'm not convinced this would be meaningfully reflected in any tests of positive selection.

      After considering Reviewer #1’s comments about limitations of models used for selection analysis we added acknowledgment in the discussion (page 9, line 9 - 13) that results indicating positive selection in our dataset should not be considered conclusive (see answer to Reviewer #1’s public review below).

      -The full MSA should be provided as supplemental material.

      The full MSA in fasta format is presented in the supplementary file 1.

      -For the aggregate binding panels in Figs 3 and 4, it would be helpful to show the native and ancestral proteins for comparison. I know this is a bit redundant, as they're present in Fig 1, but I find it hard to judge the scale of change. This is especially important because A0 and A1 are very similar in Fig 1, so I want to see what kind of difference the 2 mutations make.

      Data presented in Fig. 3C (Fig. 5C in the new version) refer to the binding of α-crystallin domains (A0ACD and A0ACD Q66H G109D) and not full length sHsps to E. coli proteins aggregated on a BLI sensor. Our intention was to show the influence of the two crucial substitutions (Q66H G109D) on the properties of A0 ancestral α-crystallin domain.

      Figure 4 (Fig. 6 in the new version) represent the effects of the substitutions on the identified positions 66 and 109 on the properties of extant IbpA orthologs from E. coli and E. amylovora, showing that these two positions play a key role in differentiating properties of those extant proteins. Changes in binding to aggregated substrate caused by those substitutions, as shown in Figure 6 B,C (new version), are indeed larger than observed between AncA0 and AncA1, as shown in Fig. 3B (new version).

      One has to remember, however, that the experiment shown in Fig.3 (new version) shows the effects of all 10 amino acid changes between the nodes A0 and A1 and not only the two analyzed substitutions, as was the case in experiment shown in Fig. 6 B,C (new version). Moreover, due to relatively large number of differences between ancestral and extant sequences (11 differences between AncA0 and E. coli IbpA, 20 differences between AncA1 and E. amylovora IbpA), substitutions in the two experiments are introduced into different sequence context.

      Because of the above, we believe that direct comparison of the results obtained for ancestral proteins with the results obtained for substitutions introduced into extant proteins would not meaningfully contribute to answering the question of the role of analyzed substitution in the context of extant proteins, while decreasing clarity of presented information.

      -Some of the luciferase plots show a time course, but others just show a single %. What is the time point used for the single % plots?

      Information was added to appropriate figure legends that for experiments showing a single timepoint the luciferase activity was measured after 1h of refolding.

      Reviewer #3 (Recommendations For The Authors):

      1. In the Introduction, it would be beneficial to explore additional instances where this evolutionary simplification process has been observed in nature. Investigating the prevalence of this phenomenon and identifying other multi-protein systems that have undergone simplification could enhance the understanding of its significance and implications.

      The section of the introduction concerning gene loss and differential paralog retention was expanded with additional examples of gene loss that is considered adaptive (page 3 lines 1 - 12).

      1. I am intrigued by the reasons why certain organisms continue to maintain a two-protein system despite the viability of a single-protein system. This aspect is particularly relevant for bacteria, considering the fitness cost associated with maintaining extra gene copies. Do you have any hypotheses or theories that may shed light on this intriguing observation?

      Refolding of proteins from aggregates requires the functional cooperation of sHsps and chaperones from Hsp70 system and Hsp100 disaggregase. In two protein sHsps system one sHsp (IbpA) is specialized in substrate binding, while the second one (IbpB) possesses low substrate binding potential and enhances sHps dissociation from substrates (Obuchowski et al, 2019). Thus, the presence of IbpB reduces the amount of chaperones from Hsp70 system required to outcompete sHsps from aggregated substrates to initiate refolding process. The cost associated with maintaining extra sHsp gene copy (ibpB) in bacteria might be compensated by lower requirement for Hsp70 chaperones for efficient and fast protein refolding following stress conditions.

      In this study we have demonstrated how such a system could have been simplified to a single – protein system capable of efficient substrate sequestration as well as stimulation of reactivation. This indeed leads to the question why such single – protein system isn’t more prevalent in Enterobacterales.

      One possibility may be that there are very specific requirements for efficient reactivation by a single – protein sHsp system. We have shown that new, more efficient IbpA functionality observed in Erwiniaceae required at least two separate mutations. It is possible, that such combinations of two substitutions simply did not occur in Enterobacteriaceae clade, in which IbpA still required partner protein for efficient reactivation stimulation.

      One must also remember that experiments performed in this study were performed in vitro in a specific set of conditions, which most likely does not represent whole spectrum of challenges faced by different bacteria. It is possible that two – protein system has some other additional adaptive effects, counterbalancing the additional cost of gene maintenance. It was for example recently shown (Miwa & Taguchi, PNAS, 120 (32) e2304841120) that bacterial sHsps play an important role in regulation of stress response. Two – protein system could potentially allow for more complex regulation.

      1. Incorporating X-ray crystallization as an additional technique in the methodology would offer detailed molecular insights into the effects of Q66H and G109D substitutions on ACD-C-terminal peptide and ACD-substrate interactions. The inclusion of such data would further strengthen the results section and provide robust support for your findings. Since the x-ray data might be difficult to collect, the authors might think to get alphafold model or some rosetta score for the model to discuss the finding further.

      In response to reviewer comment we added the comparison of the structural models of AncA0 and AncA0 Q66H G109D ACD dimers complexed with the C-terminal peptides, representing middle structures of largest clusters obtained from equilibrium molecular dynamics simulation trajectories based on the AlphaFold2 prediction and in silico mutagenesis (Fig. 5 – figure supplement 2). Model comparison as well as C-terminal peptide – ACD contact analysis did not reveal any major changes in mode of peptide binding or α-crystallin domain conformation, although we do acknowledge that simulation timescale limits the conformational sampling.

      Reviewer #1 (Public Review):

      The work in this paper is in general done carefully. Reconstructions are done appropriately and the effects of statistical uncertainty are quantified properly. My only slight complaint is that I couldn't find statistics about posterior probabilities anywhere and that the sequences and trees do not seem to be deposited.

      Posterior probabilities for all positions of reconstructed proteins were added as a supplementary file 3. MSA of all sequences used for ancestral reconstruction as well as phylogenetic tree in Newick format were added as supplementary files 1 and 2, respectively.

      I would also have preferred to have the actual phylogeny in the main text. This is a crucial piece of data that the reader needs to see to understand what exactly is being reconstructed.

      Full phylogeny was added to the main text as Fig. 2.

      The paper identifies which mutations are crucial for the functional differences between the ancestors tested. This is done quite carefully - the authors even show that the same substitutions also work in extant proteins. My only slight concern was the authors' explanation of what these substitutions do. They show that these substitutions lower the affinity of the C-terminal peptide to the alpha-crystallin domain - a key oligomeric interaction. But the difference is very small - from 4.5 to 7 uM. That seems so small that I find it a bit implausible that this effect alone explains the differences in hydrodynamic radius shown in Figure S8. From my visual inspection, it seems that there is also a noticeable change in the cooperativity of the binding interaction. The binding model the authors use is a fairly simple logarithmic curve that doesn't appear to consider the number of binding sites or potential cooperativity. I think this would have been nice to see here.

      The binding model we used is equivalent to the Hill equation as it accounts for the variable slope of sigmoid function by inclusion of input scaling factor k, which is equivalent to the hill coefficient. Simple one site binding model and two site binding model were also considered but provided worse fits to the data than model including binding cooperativity. Not providing values of fitted parameter k was our mistake, and it was corrected (Fig. 5. with a legend). Additionally, output scaling parameter L is not necessary as fraction bound takes values from 0 to 1, therefore we have fitted the curves again without this parameter. The new values of fitted parameters are very similar to the previous ones. To make text more accessible to the reader, we have used a conventional form of Hill equation. Indeed, AncA0 Q66H G109D ACD displays higher binding cooperativity than more ancestral AncA0 ACD (hill coefficient 2.3 for AncA0 vs 3.7 for AncA0 Q66H G109D). Fitted values of Hill coefficients are higher than one can expect for 2-site ACD dimer, which is probably caused by an experimental setup of BLI, where C-terminal peptide is immobilized on the sensor and ACD is present in solution as bivalent analyte leading to emergence of avidity effects. Both cooperativity and avidity are reflected in the value of Hill coefficient, however as ligand density on the sensor is the same in all experiments only change in ACD binding cooperativity can account for observed difference in the value of Hill coefficients. Difference in the C-terminal peptide binding cooperativity may influence the process of sHsp oligomerization and assembly formation despite similar binding affinity, especially if avidity of multiple binding sites within oligomer is considered.

      In addition, we changed the legend to Figure S8 (now called Fig. 5 – figure supplement 4A ) to clarify the fact that the differences in average hydrodynamic radius are in fact ferly small. To highlight the observation that there are two populations of particles in AncA0 and AncA0 Q66H G109D measured at 25, 35 and 45 °C with different hydrodynamic diameters, we used % of intensity in DLS measurement. It allows us to show the change in the hydrodynamic diameter distribution that is relatively small. We recognize it was not properly explained in the article and added a clarification in figure description.

      Lastly, the authors use likelihood methods to test for signatures of selection. This reviewer is not a fan of these methods, as they are easily misled by common biological processes (see PMID 37395787 for a recent critique). Perhaps these pitfalls could simply be acknowledged, as I don't think the selection analysis is very important to the impact of the work.

      We thank the reviewer for pointing to the recent research about limitations of methods used in our work in selection analysis. As per recommendation we added acknowledgment of limitations of methods used to discussion (page 9, line 9 - 13), modifying wording of our conclusions to deemphasize significance of selection analysis results.

    1. Author Response

      The following is the authors’ response to the original reviews.

      Thank you for your time and effort in handling and reviewing our manuscript. We have responded to all comments below.

      Reviewer #1 (Public Review):

      Martinez-Gutierrez and colleagues presented a timeline of important bacteria and archaea groups in the ocean and based on this they correlated the emergence of these microbes with GOE and NOE, the two most important geological events leading to the oxygen accumulation of the Earth. The whole study builds on molecular clock analysis, but unfortunately, the clock analysis contains important errors in the calibration information the study used, and is also oversimplified, leaving many alternative parameters that are known to affect the posterior age estimates untested. Therefore, the main conclusion that the oxygen availability and redox state of the ocean is the main driver of marine microbial diversification is not convincing.

      We do not conclude that “oxygen availability and redox state of the ocean is the main driver of marine microbial diversification”. Our conclusion is much more nuanced. We merely discuss our findings in light of the major oxygenation events and oxygen availability (among other things) given the important role this molecule has played in shaping the redox state of the ocean.

      Regarding the methodological concerns, to address them we have provided additional analyses to account for different clock models and calibration points.

      Basically, what the molecular clock does is to propagate the temporal information of the nodes with time calibrations to the remaining nodes of the phylogenetic tree. So, the first and the most important step is to set the time constraints appropriately. But four of the six calibrations used in this study are debatable and even wrong.

      (1) The record for biogenic methane at 3460 Ma is not reliable. The authors cited Ueno et al. 2006, but that study was based on carbon isotope, which is insufficient to demonstrate biogenicity, as mentioned by Alleon and Summons 2019.

      Thank you for pointing out the limitations of using the geochemical evidence of methane as calibrations. Indeed, several commentaries have suggested that the biotic and abiotic origin of the methane reported by Ueno et al. are equally plausible (Alleon and Summons, 2019; Lollar and McCollom, 2006), however; we used that calibration as a minimum for the presence of life on Earth, not methanogenesis. Despite the controversy regarding the origin of methane, there are other lines of evidence suggesting the presence of life around ~3.4 Ga. For example stromatolites from the Dresser Formation, Pilbara, Western Australia (Djokic et al., 2017; Walter et al., 1980; Buick and Dunlop, 1990), and more recently (Hickman-Lewis et al., 2022). To avoid confusion, we have added a more extended explanation for the use of that calibration and additional evidence of life around that time in Table 1 and lines 100-104.

      (2) Three calibrations at Aerobic Nitrososphaerales, Aerobic Marinimicrobia, and Nitrite oxidizing bacteria have the same problem - they are all assumed to have evolved after the GOE where the Earth started to accumulate oxygen in the atmosphere, so they were all capped at 2320 Ma. This is an important mistake and will significantly affect the age estimates because maximum constraint was used (maximum constraint has a much greater effect on age estimates and minimum constraint), and this was used in three nodes involving both Bacteria and Archaea. The main problem is that the authors ignored the numerous evidence showing that oxygen can be produced far before GOE by degradation of abiotically-produced abundant H2O2 by catalases equipped in many anaerobes, also produced by oxygenic cyanobacteria evolved at least 500 Ma earlier than the onset of GOE (2500 Ma), and even accumulated locally (oxygen oasis). It is well possible that aerobic microbes could have evolved in the Archaean.

      We appreciate the suggestion of assessing the validity of the calibrations used in our analyses. We initially evaluated the informative power of the priors used for the Bayesian molecular dating (Supplemental File 5), and found that the only calibration that lacked enough information for the purposes of our study was Ammonia Oxidizing Archaea (AOA). In contrast to previous evidence (Ren et al., 2019; Yang et al., 2021), we associate this finding to the potential earlier diversification of AOA. Due to the limitations of several of the calibrations used, we performed an additional molecular dating analysis on 1000 replicate trees using a Penalized Likelihood strategy. This analysis consisted in excluding the calibrations that assumed the presence of oxygen as a maximum constraint. Our analysis shows similar age estimates of the marine microbial clades regardless of the exclusion of these calibrations (Supplemental File 8; TreePL Priors set 2). Our findings thus suggest that the age estimates reported in our study are consistent regardless of whether or not the presence of oxygen is used to calibrate several nodes in the tree. We describe the results of this analysis in lines 490-499 and include estimates in Supplemental File 8. Our results are therefore robust regardless of the use of these somewhat controversial calibrations.

      Once the phylogenetic tree is appropriately calibrated with fossils and other time constraints, the next important step is to test different clock models and other factors that are known to significantly affect the posterior age estimates. For example, different genes vary in evolutionary history and evolutionary rate, which often give very different age estimates. So it is very important to demonstrate that these concerns are taken into account. These are done in many careful molecular dating studies but missing in this study.

      We agree that the selection of marker genes will have a profound impact on the final age estimates. First, it is important to understand that very few genes present in modern Bacteria and Archaea can be traced back to the Last Universal Common Ancestor, so there are very few genes to use for this purpose. Studies that focus on particular groups of Bacteria and Archaea may have larger selections of genes to choose from, but for our purposes there are only about ~40 different genes - mostly encoding for ribosomal proteins, RNA polymerase subunits, and tRNA synthetases - that can be use for this purpose (Creevey et al., 2011; Wu and Scott, 2012). In a previous study we have extensively benchmarked methods for the reconstruction of high-resolution phylogenetic trees of Bacteria and Archaea using these genes (Martinez-Gutierrez and Aylward, 2021). Our analyses demonstrated that some of these genes (mainly tRNA synthetases) have undergone ancient lateral gene transfer events and are not suitable for deep phylogenetics or molecular dating. In this previous study we also evaluated different sets of marker genes to examine which provide the most robust phylogenetic inference. We arrived at a set of ribosomal proteins and RNA polymerase subunits that performs best for phylogenetic reconstruction, and we have used that in the current study.

      Furthermore, we tested the role of molecular dating model selection on the final Bayesian estimates by running four independent chains under the models UGAM and CIR, respectively. Overall, the results did not vary substantially compared with the ages obtained using the log-normal model reported on our manuscript (Supplemental File 8). The additional results are described in lines 478-488 and shown in Supplemental File 8. The clades that showed more variation when using different Bayesian models were SAR86, SAR11, and Crown Cyanobacteria (Supplemental File 8). Despite observing some differences in the age estimates when using different molecular models, the conclusion that the different marine microbial clades presented in our study diversified during distinct periods of Earth’s history remains. Moreover, the main goal of our study is to provide a relative timeline of the diversification of abundant marine microbial clades without focusing on absolute dates.

      Reviewer #2 (Public Review):

      In this paper, Martinez-Gutierrez and colleagues present a dated, multidomain (= Archaea+Bacteria) phylogenetic tree, and use their analyses to directly compare the ages of various marine prokaryotic groups. They also perform ancestral gene content reconstruction using stochastic mapping to determine when particular types of genes evolved in marine groups.

      Overall, there are not very many papers that attempt to infer a dated tree of all prokaryotes, and this is a distinctive and up-to-date new contribution to that oeuvre. There are several particularly novel and interesting aspects - for example, using the GOE as a (soft) maximum age for certain groups of strictly aerobic Bacteria, and using gene content enrichment to try to understand why and how particular marine groups radiated.

      Thank you for your thorough evaluation and comments on our manuscript.

      Comments

      One overall feature of the results is that marine groups tend to be quite young, and there don't seem to be any modern marine groups that were in the ocean prior to the GOE. It might be interesting to study the evolution of the marine phenotype itself over time; presumably some of the earlier branches were marine? What was the criterion for picking out the major groups being discussed in the paper? My (limited) understanding is that the earliest prokaryotes, potentially including LUCA, LBCA and LACA, was likely marine, in the sense that there would not yet have been any land above sea level at such times. This might merit discussion in the paper. Might there have been earlier exclusively marine groups that went extinct at some point?

      Thank you for pointing this out - this is a very interesting idea.<br /> Firstly, the major marine lineages that we study here have largely already been defined in previous studies and are known to account for a large fraction of the total diversity and biomass of prokaryotes in the ocean. For example, Giovannoni and Stingl described most of these groups previously when discussing cosmopolitan and abundant marine lineages (Giovannoni and Stingl, 2005). The main criteria to select the marine clades studied here are 1) these groups have large impacts in the marine biogeochemical cycles and represent a large fraction of the microbial biomass in the open ocean, 2) they have an appropriate representation on genomic databases such that they can be confidently included in a phylogenetic tree, 3) the clades included can be confidently classified as being marine, in the sense that consequently the last common ancestor had a marine origin. This is explained in lines 83-86. We were primarily interested in lineages that encompassed a broad phylogenetic breadth, and we therefore did not include many groups that can be found in the ocean but are also readily isolated from a range of other environments (i.e., Pseudomonas spp., some Actinomycetes, etc.).

      We agree that some of the earlier microbial branches in the Tree of Life were likely marine. The study of the marine origin of LUCA, LBCA, LACA, although interesting, is out of the scope of our study, and our results cannot offer any direct evidence of their habitat. We have therefore sought to focus on the origins of extant marine lineages.

      What do the stochastic mapping analyses indicate about the respective ancestors of Gracilicutes and Terrabacteria? At least in the latter case, the original hypothesis for the group was that they possessed adaptations to life on land - which seems connected/relevant to the idea of radiating into the sea discussed here - so it might be interesting to discuss what your analyses say about that idea.

      Thank you for your recommendation to perform additional analysis regarding the characterization of the ancestor of the superphyla Gracilicutes and Terrabacteria. We agree that this analysis would be very interesting, but we wish to focus the manuscript primarily on the marine clades in question, and other supergroups are listed in Figure 2 mainly for context. However, we did check the results of the stochastic mapping analysis and we now report the list of genes predicted to be gained and lost at the ancestor of the Gracilicutes and Terrabacteria clades, however; it is out of the scope of this study.

      I very much appreciate that finding time calibrations for microbes is challenging, but I nonetheless have a couple of comments or concerns about the calibrations used here:

      The minimum age for LBCA and LACA (Nodes 1 and 2 in Fig. 1) was calibrated with the earliest evidence of biogenic methane ~3.4Ga. In the case of LACA, I suppose this reflects the view that LACA was a methanogen, which is certainly plausible although perhaps not established with certainty. However, I'm less clear about the logic of calibrating the minimum age of Bacteria using this evidence, as I am not aware that there is much evidence that LBCA was a methanogen. Perhaps the line of reasoning here could be stated more explicitly. An alternative, slightly younger minimum age for Bacteria could perhaps be obtained from isotope data ~3.2Ga consistent with Cyanobacteria (e.g., see https://pubmed.ncbi.nlm.nih.gov/30127539/).

      Thank you for pointing this out. We used the presence of methane as a minimum for life on Earth, not as a minimum for methanogenesis. Despite using this calibration as a minimum for the root of Bacteria and not having methanogenic representatives within this domain, there are independent lines of evidence that point to the presence of microbial life around the same time (~3.5 Ga, for example stromatolites from the Dresser Formation, Pilbara, Western Australia (~3.5 Ga) (Djokic et al., 2017; Walter et al., 1980; Buick and Dunlop, 1990), and more recently (Hickman-Lewis et al., 2022). We added a rationale for the use of the evidence of methane as a minimum age for life on Earth to the manuscript (Table 1 and 100104).

      I am also unclear about the rationale for setting the minimum age of the photosynthetic Cyanobacteria crown to the time of the GOE. Presumably, oxygen-generating photosynthesis evolved on the stem of (photosynthetic) Cyanobacteria, and it therefore seems possible that the GOE might have been initiated by these stem Cyanobacteria, with the crown radiating later? My confusion here might be a comprehension error on my part - it is possible that in fact one node "deeper" than the crown was being calibrated here, which was not entirely clear to me from Figure 1. Perhaps mapping the node numbers directly to the node, rather than a connected branch, would help? (I am assuming, based on nodes 1 and 2, that the labels are being placed on the branch directly antecedent to the node of interest)?

      Thank you so much for your suggestion. As pointed out, the calibrations used were applied at the crown node of existing Cyanobacterial clades, not at the stem of photosynthetic Cyanobacteria. We agree that photosynthesis and therefore the production of molecular oxygen may have been present in more ancient Cyanobacterial clades, however; these groups have not been discovered yet or went extinct. We have improved Fig. 1 to avoid confusion and now it is part of the updated version of our manuscript.

      Alleon J, Summons RE. 2019. Organic geochemical approaches to understanding early life. Free Radic Biol Med 140:103–112.

      Buick R, Dunlop JSR. 1990. Evaporitic sediments of Early Archaean age from the Warrawoona Group, North Pole, Western Australia. Sedimentology 37: 247-277.

      Creevey CJ, Doerks T, Fitzpatrick DA, Raes J, Bork P. 2011. Universally distributed single-copy genes indicate a constant rate of horizontal transfer. PLoS One 6:e22099.

      Djokic T, Van Kranendonk MJ, Campbell KA, Walter MR, Ward CR. 2017. Earliest signs of life on land preserved in ca. 3.5 Ga hot spring deposits. Nat Commun 8:15263.

      Giovannoni SJ, Stingl U. 2005. Molecular diversity and ecology of microbial plankton. Nature 437: 343-348. Hickman-Lewis K, Cavalazzi B, Giannoukos K, D'Amico L, Vrbaski S, Saccomano G, et al. 2023. Advanced two-and three-dimensional insights into Earth's oldest stromatolites (ca. 3.5 Ga): Prospects for the search for life on Mars. Geology 51: 33-38.

      Lollar BS, McCollom TM. 2006. Geochemistry: biosignatures and abiotic constraints on early life. Nature. Martinez-Gutierrez CA, Aylward FO. 2021. Phylogenetic Signal, Congruence, and Uncertainty across Bacteria and Archaea. Mol Biol Evol 38:5514–5527.

      Ren M, Feng X, Huang Y, Wang H, Hu Z, Clingenpeel S, Swan BK, Fonseca MM, Posada D, Stepanauskas R, Hollibaugh JT, Foster PG, Woyke T, Luo H. 2019. Phylogenomics suggests oxygen availability as a driving force in Thaumarchaeota evolution. ISME J 13:2150–2161.

      Walter M R, R Buick, JSR Dunlop. 1980. Stromatolites 3,400–3,500 Myr old from the North pole area, Western Australia. Nature 284: 443-445.

      Wu M, Scott AJ. 2012. Phylogenomic analysis of bacterial and archaeal sequences with AMPHORA2. Bioinformatics 28:1033–1034.

      Yang Y, Zhang C, Lenton TM, Yan X, Zhu M, Zhou M, Tao J, Phelps TJ, Cao Z. 2021. The Evolution Pathway of Ammonia-Oxidizing Archaea Shaped by Major Geological Events. Mol Biol Evol 38:3637–3648.

    1. Joint Public Review:

      In the manuscript by Rajan et al., the authors have highlighted the direct interaction between Dbp5 and tRNA, wherein Dbp5 serves as a mediator for tRNA export. This export process is subject to spatial regulation, as Dbp5 ATPase activation occurs specifically at nuclear pore complexes. Notably, this regulation is independent of the Los1-mediated pre-tRNA export route and instead relies on Gle1.

      The manuscript is well constructed and nicely written. The authors have addressed the concerns as raised by the previous reviewers and added additional experiments.

      I have a few comments for polishing the manuscript.

      Major comments:<br /> 1. In their previous paper (Lari et al, 2019; Azra Lari Arvind Arul Nambi Rajan Rima Sandhu Taylor Reiter Rachel Montpetit Barry P Young Chris JR Loewen Ben Montpetit (2019) A nuclear role for the DEAD-box protein Dbp5 in tRNA export eLife 8:e48410.) as well as in the current manuscript the authors states that Dbp5 is involved in the export of tRNA that is independent of and parallel to Los1. They state that Dbp5 binds to the tRNA independent of known tRNA export proteins. The obtained conclusion is both intriguing and innovative, since it suggests that there are other variables, beyond the ones previously identified as tRNA factors, that might interact with Dbp5 to facilitate the export process. In order to find out additional factors aiding this process the authors may employ total RNA‐associated protein purification (TRAPP) experiments ( Shchepachevto et al., 2019; Shchepachev V, Bresson S, Spanos C, Petfalski E, Fischer L, Rappsilber J, Tollervey D. Defining the RNA interactome by total RNA-associated protein purification. Mol Syst Biol. 2019 Apr 8;15(4):e8689. doi: 10.15252/msb.20188689. PMID: 30962360; PMCID: PMC6452921) to identify extra factors involved in conjunction with Dbp5. The process elucidates hitherto uninvestigated tRNA export components that function in conjunction with Dbp5.

      2. Various reports suggest that eukaryotic translation elongation factor 1 eEF1A is involved tRNA export Bohnsack et al., 2002 (Bohnsack MT, Regener K, Schwappach B, Saffrich R, Paraskeva E, Hartmann E, Görlich D. Exp5 exports eEF1A via tRNA from nuclei and synergizes with other transport pathways to confine translation to the cytoplasm. EMBO J. 2002 Nov 15;21(22):6205-15. doi: 10.1093/emboj/cdf613. PMID: 12426392; PMCID: PMC137205), Grosshans etal., 2002; Grosshans H, Hurt E, Simos G. An aminoacylation-dependent nuclear tRNA export pathway in yeast. Genes Dev. 2000 Apr 1;14(7):830-40. PMID: 10766739; PMCID: PMC316491). The presence of mutations in eEF1A has been seen to hinder the nuclear export process of all transfer RNAs (tRNAs). eEF1A has been shown to interact with Los1 aiding in tRNA export. The authors can also explore the crosstalk between Dbp5 and eEF1A in this study. Additionally, suppressor screening analysis in dbp5R423A , los1∆dbp5R423A los1∆msn∆dbp5R423A could shed more light on this.

      3. Unfortunately, this article is not significantly different from that published in eLife in 2018. In fact, it raises more questions than it brings answers by not identifying a transporter for export and not identifying a role for the helicase activity of Dbp5. The addition of Gle1 is potentially novel but it's unclear why the authors didn't address the potential involvement of IP6.

    1. Reviewer #2 (Public Review):

      Summary:<br /> In the current work, authors deploy a set of behavioral tasks to explore individual differences in the preferred perceptual and motor rhythms. They found a consistent individual preference for a given perceptual and motor frequency across tasks and, while these were correlated, the latter is slower than the former one. Additionally, they show that the accuracy of adaptation to rate changes is proportional to the amount of rate variation and, crucially, the amount of adaptation decreases with age.

      Strengths:<br /> Authors carefully designed several experiments to measure individual preferred motor and perceptual tempo. Furthermore, before completing the main experiment they validated the experimental design by testing the consistency across tasks and test-retest. Additionally, to the value of the reported findings, the introduced paradigm represents a useful tool for future research.<br /> The obtained data is rigorously analyzed using a diverse set of tools, each adapted to the specificities across the different research questions and tasks.<br /> This study identifies several relevant behavioral features: (i) each individual shows a preferred and reliable motor and perceptual tempo and, while both are related, the motor is consistently slower than the pure perceptual one; (ii) the existence of hysteresis in the adaptation to rate variations; and (iii) the decrement of this adaptation with age. All these observations are valuable for the auditory-motor integration field of research, and they could potentially inform existing biophysical models to increase their descriptive power.

      Weaknesses:<br /> The current study is presented in the framework of the ongoing debate of oscillator vs. timekeeper mechanisms underlying perceptual and motor timing, and authors claim that the observed results support the former mechanism. In this line, every obtained result is related by the authors to a specific ambiguous (i.e., not clearly related to a biophysical parameter) feature of an internal oscillator. As pointed out by an essay on the topic (1), claiming that a pattern of results is compatible with an "oscillator" could be misleading, since some features typically used to validate or refute such mechanisms are not well grounded on real biophysical models. Relatedly, a recent study (2) shows that two quantitatively different computational algorithms (i.e., absolute vs relative timing) can be explained by the same biophysical model. This demonstrates that what could be interpreted as a timekeeper, or an oscillator can represent the same biophysical model working under different conditions. For this reason, if authors would like to argue for a given mechanism underlying their observations, they should include a specific biophysical model, and test its predictions against the observed behavior. For example, it's not clear why authors interpret the observation of the trial's response being modulated by the rate of the previous one, as an oscillator-like mechanism underlying behavior. As shown in (1) a simple oscillator returns to its natural frequency as soon as the stimulus disappears, which will not predict the long-lasting effect of the previous trial. Furthermore, a timekeeper-like mechanism with a long enough integration window is compatible with this observation.<br /> Still, authors can choose to disregard this suggestion, and not testing a specific model, but if so, they should restrict this paper to a descriptive study of the timing phenomena.

      1. Doelling, K. B., & Assaneo, M. F. (2021). Neural oscillations are a start toward understanding brain activity rather than the end. PLoS biology, 19(5), e3001234.<br /> 2. Doelling, K. B., Arnal, L. H., & Assaneo, M. F. (2022). Adaptive oscillators provide a hard-coded Bayesian mechanism for rhythmic inference. bioRxiv, 2022-06.

  8. www.planalto.gov.br www.planalto.gov.br
    1. III

      Previsão do princípio da dialeticidade, o qual também está previsto no art. 1.010, inciso III, e no art. 1.021, § 1º.

      Artigo do TJDF sobre a dialeticidade:

      • Trata-se do princípio da dialeticidade dos recursos que preconiza que “o recurso tem de combater a decisão jurisdicional naquilo que ela o prejudica, naquilo que ela lhe nega pedido ou posição de vantagem processual, demonstrando o seu desacerto, do ponto de vista procedimental (error in procedendo) ou do ponto de vista do próprio julgamento (error in judicando)”[1].

      • Não obstante, na prática são comuns recursos que se limitam a reproduzir, em seu corpo, os fundamentos da petição inicial ou da contestação sem atacar especificamente os fundamentos da decisão. E tal prática vem sendo combatida pela jurisprudência dos Tribunais Superiores (v. g. Súmulas 182, STJ).

      Nessa linha, é o Jurisprudência em teses, Edição nº 183:

      • 3) Não se conhece de agravo interno que se limita a reproduzir as razões de seu recurso anterior, por violar o princípio da dialeticidade.

      Art. 1.021, § 1°, do CPC/2015

    1. Author Response

      The following is the authors’ response to the original reviews.

      Re: Revised author response for eLife-RP-RA-2023-90135 (“The white-footed deermouse, an infection-tolerant reservoir for several zoonotic agents, tempers interferon responses to endotoxin in comparison to the mouse and rat” by Milovic, Duong, and Barbour”)

      The revised manuscript has taken into account all the comments and questions of the two reviewers. Our responses to each of the comments are detailed below. In brief, the modifications or additional materials for the revision each specifically address a reviewer comment. These modifcations or materials include the following….

      • a more in-depth consideration of sample sizes

      • a better explanation of what p values signify for a GO term analysis

      • a more detailed account of the selection of the normalization procedure for cross-species targeted RNA-seq (including a new supplemental figure)

      • several more box plots in supplementary materials to complement the scatterplots and linear regressions of the figures of the primary text

      • provision in a public access repository of the complete data for the RNA-seq analyses as well as primary data for figures and tables as new supplementary tables

      • the expansion of description of the analysis done for the revision of Borrelia hermsii infection of P. leucopus. This included a new table (Table 10 of the revision) • development of the possible relevance of finding for longevity studies by citing similarities of the findings in P. leucopus with those in the naked mole-rat

      • what we think is a better assessment of differences between female and male P. leucopus for this particular study, while still keeping focus on DEGs in common for females and males. This included a new figure (Figure 4 of the revision).

      • removal of reference to a “inverse” relationship between Nos2 and Arg1 while still retaining ratios of informative value

      We note that in the interval between uploading the original bioRxiv preprint and now we learned of the paper of Gozashti, Feschotte, and Hoekstra (reference 32), which supports our conception of the important place of endogenous retroviruses in the biology and ecology of deermice. This is the only addition or modification that was not a direct response to a reviewer comment or question, but it was germane to one of Reviewer #1’s comments (“Regarding..”).

      Reviewer #1:

      Supplemental Table 1 only lists genes that passed the authors statistical thresholds. The full list of genes detected in their analysis should be included with read counts, statistics, etc. as supplemental information.

      We agree that provision of the entire lists of reference transcripts and the RNA-seq results for each of the 40 animals is merited. These datasets are too large for what the journal’s supplementary materials resource was intended for, so we have deposited them at the Dryad public access repository.

      While P. leucopus is a critical reservoir for B. burgdorferi, caution should be taken in directly connecting the data presented here and the Lyme disease spirochete. While it's possible that P. leucopus have a universal mechanism for limiting inflammation in response to PAMPs, B. burgdorferi lack LPS and so it is also possible the mechanisms that enable LPS tolerance and B. burgdorferi tolerance may be highly divergent.

      The impetus for the study was the phenomenon of tolerance of infection of P. leucopus by a number of different kinds of pathogens, not just B. burgdorferi. We take the reviewer’s point, though. Certainly, the white-footed deermouse is probably most notable at-large for its role as a reservoir for the Lyme disease agent. We doubt that the species responses to LPS and to the principal agonists of B. burgdorferi are “highly divergent”, though. Other than the TLR itself-TLR4 for LPS vs the heterodimer TLR2/TLR1 for the lipoproteins of these spirochetes--the downstream signaling is generally similar for amounts comparable in their agonist potency.

      We had thought that we had addressed this distinction for B. burgdorferi and other Borreliaceae members by referring to the earlier study. But we agree with the reviewer that what was provided on this point was insufficient in the context of the present work. Accordingly, for the revision we have added a new analysis of the data on experimental infection of P. leucopus with Borrelia hermsii, which lacks LPS and for which the TLR agonists eliciting inflammation are lipoproteins. We do this in a format (new Table 6) that aids comparison with the LPS experimental data elsewhere in the article. As the manuscript references, B. burgdorferi infection of P. leucopus elicits comparatively little inflammation in blood even at the height of infection. While this phenomenon with the Lyme disease agent was part of the rationale driving these studies, the better comparison with LPS was 5 days into B. hermsii infection when the animals are spirochetemic.

      Statistical significance is binary and p-values should not be used as the primary comparator of groups (e.g. once a p-value crosses the deigned threshold for significance, the magnitude of that p-value no longer provides biological information). For instance, in comparing GO-terms, the reason for using of high p-value cutoffs ("None of these were up-regulated gene GO terms with p values < 1011 for M. musculus.") to compare species is unclear. If the authors wish to compare effect sizes, comparing enrichment between terms that pass a cutoff would likely be the better choice. Similarly, comparing DEG expression by p-value cutoff and effect size is more meaningful than analyses based on exclusively on p-value: "Of the top 100 DEGs for each species by ascending FDR p value." Description in later figures (e.g. Figure 4) is favored.

      Effect sizes--in this case, fold-changes--were taken into account for GO term analysis and were specified in the settings that are described. So, any gene that was “counted” for consideration for a particular GO term would have passed that threshold and with a falsediscovery corrected p value of a specified minimum. There is no further scoring of the “hit” based upon the magnitude of the p value beyond that point. It is, as the reviewer writes, binary at that point. We are in agreement on those principles.

      As we understand the comment above, though, the p-values referred to are in regard to the GO term analysis itself. The objective was discovery followed by inference. The situation was more like a genome-wide association study (GWAS) study. This is not strictly speaking a hypothesis test, because there was no stated hypothesis ahead of time or one driving the design. The “p value” for something like GO term analysis or GWAS provides an estimate of the strength of the association. It is not binary in that sense. The lower the p value, the greater confidence about the association. In a GWAS of a human population an association of a trait with a particular SNP or indel is usually not taken seriously unless the p value is less than 10^-7 or 10^-8. In the case of GO terms, the p value approximates (but is not equivalent to) the number of genes that are differentially expressed that belong to a GO cluster out of the total number of genes that define that cluster. The higher the proportion of the genes in the cluster that are associated with a treatment (LPS vs. saline), the lower the p value. Thus, it provides information beyond the point at which it would be rightly deemed of little additional value in many hypothesis testing circumstances.

      That said, we agree that the original manuscript could have been clearer on this point and have for the revision expanded the description of the GO term analysis in the Methods, including some explanation for a reader on what the p value signifies here. We also refrain from specifying a certain p value for special attention and merely list 20 by ascending p value.

      The ability to use of CD45 to normalize data is unclear. Authors should elaborate both on the use of the method and provide some data how the data change when they are normalized. For instance, do correlations between untreated Mus and Peromyscus gene expression improve? The authors seem to imply this should be a standard for interspecies comparison and so it would be helpful to either provide data to support that or, if applicable, use of the technique in literature should be referenced.

      The reviewer brings up an important point that we considered addressing in more depth for the original manuscript but in the end deferred to considerations about length and left it out.

      But we are glad to address this here, as well as in the revised manuscript.

      We did not intend to imply either that this particular normalization approach had been done before by others or that it “should” be a standard. We are not aware of another report on this, and it would be up to others whether it would be useful or not for them. We made no claim about its utility in another model or circumstance. The challenge before us was to do a comparative analysis of transcription in the blood not just for animals of one species under different conditions but animals of two different genera under different conditions. A notable difference between the animals was in their white blood cell counts, as this study documents. White cells would be the source of a majority of transcripts of potential relevance here, but there would also be mRNA for globins, from reticulocytes, from megakaryocytes, and likely cell-free RNA with origins in various tissues. If the white cell numbers differed, but the non-white cell sources of RNA did not, then there could be unacknowledged biases.

      It would be like comparing two different kinds of tissues and assuming them to be the same in the types and numbers of cells they contained. Four hours after a dose of LPS the liver cells (or brain cells) would differ in their transcriptional profiles from untreated the livers (or brains) of untreated animals for sure, but there would not be much if any change in the numbers of different kinds of cells in the liver (or brain) within 4 hours. The blood can change a lot in composition within that time frame under these same conditions. Some sort of accounting for differing white cell numbers in the blood in different outbred animals of two species seemed to be called for.

      The normalization that was done for the genome-wide analysis was not based on a particular transcript, but instead was based on the total number of reads, the lengths of the reference transcripts, and the distributions of reads matching to the tens of thousands of references for each sample. This was done according to what are standard procedures by now for bulk RNAseq analyses. Because the reference transcript sets for P. leucopus and M. musculus differed in their numbers and completeness of annotation, we did not attempt any cross-species comparison for the same set of genes at that point. That would not be possible because they were not entirely commensurate.

      The GO term analysis of those results provided the leads for the more targeted approach, which was roughly analogous to RT-qPCR. For a targeted assay of this sort, it is common to have a “housekeeping gene” or some other presumably stably transcribed gene for normalization. A commonly used one is Gapdh, but we had previously found that Gapdh was a DEG itself in the blood in P. leucopus and M. musculus at the four hour mark after LPS. The aim was to provide for some adjustment so datasets for blood samples differing in white blood cell counts could be compared. Two options were the 12S ribosomal RNA of the mitochondria, which would be in white cells but not mature erythrocytes, and CD45, which has served an approximately similar function for flow cytometry of the blood. As described in what has been added for the revision and the supplementary materials, we compared these different approaches to normalization. Ptprc and 12S rRNA were effectively interchangeable as the denominator with identifying DEGs of P. leucopus and M. musculus and cross-species comparisons.

      Regarding the ISG data-is a possible conclusion not that Peromyscus don't upregulate the antiviral response because it's already so high in untreated rodents? It seems untreated Peromyscus have ISG expression roughly equivalent to the LPS mice for some of the genes. This could be compared more clearly if genes were displayed as bar plots/box and whisker plots rather than in scatter plots. It is unclear why the linear regression is the key point here rather than normalized differences in expression.

      In answer to the question: yes, that is possible. In the interval between uploading of the manuscript and this revision, we became aware of a study by Gozashti and Hoekstra published this year in Molecular Biology and Evolution (reference 32) and reporting on the “massive invasion” of endogenous retroviruses in P. maniculatus and the defenses deployed in response to achieve silencing. We cite this work and discuss it, including related findings for P. leucopus, in the revision.

      We had originally intended to include box plots as well as scatterplots with regressions for the data, but thought it would be too much and possibly considered redundant. But with this encouragement from the reviewer we provide additional box plots in supplementary materials for the revision.

      Some sections of the discussion are under supported:

      The claim that low inflammation contributes to increased lifespan is stated both in the introduction and discussion. Is there justification to support this? Do aged pathogen-free mice show more inflammation than aged Peromyscus?

      We respectively point out that there was not a claim of this sort. We stated a fact about P. leucopus’ longevity. We made no statement connecting longevity and inflammation beyond the suggestion in the introduction that the explanation(s) for infection tolerance might have some bearing for studies on determinants of life span.

      But the reviewer’s comment prompted further consideration of this aspect of Peromyscus biology. This led eventually to the literature on the naked mole-rat, which seems to be the rodent with the longest known life span and the subject of considerable study. The discussion section of the revision has an added paragraph on some of the similarities of P. leucopus and the naked mole-rat in terms of neutrophils, expression of nitric oxide synthase 2 in response to LPS, and type 1 interferon responses. While this is far from decisive, it does serve to connect some of the dots and, hopefully, is considered at least partially responsive to the reviewer’s question.

      The claim that reduced Peromyscus responsiveness could lead to increased susceptibility to infection is prominently proposed but not supported by any of the literature cited.

      There was not this claim. In fact, it was framed as a question, not a statement. Nevertheless, we think we understand what the comment is getting at and acknowledge in the revision that there may be unexamined circumstances in which P. leucopus may be more vulnerable.

      References to B. burgdorferi, which do not have LPS, in the discussion need to ensure that the reader understands this and the potential that responses could be very different.

      We think we addressed this comment in a response above.

      Reviewer #2:

      1. How were the number of animals for each experiment selected? Was a power analysis conducted?

      A power analysis of any meaning for bulk RNA-seq with tens of thousands of reference transcripts, each with their own variance, and a comparison of animals of two different genera is not straight forward. Furthermore, a specific hypothesis was not being tested. This was a broad, forward screen. But the question about sample sizes is one that deserves more attention than the original manuscript provided. This now provided in added text in two places in Methods ( “RNA-seq” and “Genome-wide different gene expression”) in the revision.

      1. The authors conducted a cursory evaluation of sex differences of P. leucopus and reported no difference in response except for Il6 and Il10 expression being higher in the males than the females in the exposed group. The data was not presented in the manuscript. Nor was sex considered for the other two species. A further discussion of the role that sex could play and future studies would be appreciated.

      We agree that the limited analysis of sex differences and the undocumented remark about Il6 and Il10 expression in females and males warranted correction. For the revision we removed that analysis of targeted RNA-seq of P. leucopus from the two different studies. For this study we were looking for differences that applied to both species. This was the reason that there were equal numbers of females and males in the samples. We agree that further investigation of differences between sexes in their responses is of interest but is probably best left for “future studies”.

      But in revision we do not entirely ignore the question of sex of the animal and provide an additional analysis of the bulk RNA-seq for P. leucopus with regard to differences between females and males. This basically demonstarted an overall commensurability between sexes, at least for the purposes of the GO term analysis and subsequent targeted RNA-seq, but did reveal some exceptions that are candidate genes for those future studies.

      In the revision, we also add for the discussion and its “study limitations” section a disclaimer about possibly missing sex associated differences because the groups were mixed sexes.

      1. The ratio of Nos2 and Arg1 copies for LPS treated and control P. leucopus and M.musculus in Table 3 show that in P. leucopus there is not a significant difference but in M.musculus there is an increase in Nos2 copies with LPS treatment. The authors then used a targeted RNA-seq analysis to show that in P. leucopus the number of Arg1 reads after LPS treatment is significantly higher than the controls. These results are over oversimplified in the text as an inverse relationship for Nos2/Arg1 in the two species.

      We agree. In addition to providing box plots for Arg1 and Nos2, as suggested by Reviewer #1, we also replaced “ratio” in commenting on Arg1 and Nos2, with “differences in Nos2 and Arg1 expresssion” replacing “ratio of Nos2 to Arg1 expression” at one place. At another place we have removed “inverse” with regard to Nos2 and Arg1. But we respectfully decline to remove Nos2/Arg1 from Figure 5 (now Figure 6) or inclusion of Nos2/Arg1 ratios elsewhere. According to our understanding there need not be an inverse relationship for a ratio to have informative value.

      Recommendations For the Authors

      We thank the two reviewers for their constructive recommendations and suggestions, in some case pointing out errors we totally missed. For the great majority, the recommendations were followed. Where we decline or disagree we explain this in the response.

      Reviewer #1 (Recommendations For The Authors):

      • How was the FDR < 0.003 cutoff chosen for DEG? All cutoffs are arbitrary but there should be some justification.

      We agree and have provided the rationale at that point in the paper (before Figure 3) in R2: "For GO term analysis the absolute fold-change criterion was ≥ 2. Because of the ~3-fold greater number of transcripts for the M. musculus reference set than the P. leucopus reference set, application of the same false-discovery rate (FDR) threshold for both datasets would favor the labeling of transcripts as DEGs in P. leucopus. Accordingly, the FDR p values were arbitrarily set at <5 x 10-5 for P. leucopus and <3 x 10-3 for M. musculus to provide approximately the same number of DEGs for P. leucopus (1154 DEGs) and M. musculus (1266 DEGs) for the GO term comparison."

      • It would be helpful to include a figure demonstrating the correlation between CD45 and WBC ("Pearson's continuous and Spearman's ranked correlations between log-transformed total white blood cell counts and normalized reads for Ptprc across 40 animals representing both species, sexes, and treatments were 0.40 (p = 0.01) and 0.34 (p = 0.03), respectively.")

      In both the first version of the revision (R1) and in R2 we provide a fuller explanation of the choice of CD45 (Ptprc) for normalization as detailed in the response to Reviewer #1's public comment. In the revision only Pearson's correlation and p value is given. We did not think another figure was justified after there was additional space devoted to this in both R1 and R2.

      • Unclear what the following paragraph is referring to-is this from the previous paper? Was this experiment introduced somewhere? "Low transcription of Nos2 and high transcription of Arg1 both in controls and LPS-treated P. leucopus was also observed in the experiment where the dose of LPS was 1 µg/g body mass instead of 10 µg/g and the interval between injection and assessment was 12 h instead of 4 h (Table 4)."

      This experiment is described in the Methods in the original and subsequent versions, but we agree that it is not clear whether it was from present study or previous one. Here is the revised text for R2: "Low transcription of Nos2 in both in controls and LPS-treated P. leucopus and an increase in Arg1 with LPS was also observed in another experiment for the present study where the dose of LPS was 1 µg/g body mass instead of 10 µg/g and the interval between injection and assessment was 12 h instead of 4 h (Table 4)."

      • Regarding the differences in IFNy between outbred and BALB/c mice-are there any other RNA-seq datasets you can mine where other inbred mice (B/6, C3H, etc) have been injected with LPS and probed roughly the same amount of time later? Do they look like BALB/c or the outbreds?

      In both the original and R1 and R2 we cite two papers on the difference of BALB/c mice. While this is of interest for follow-up in the future, we did not think additional content on a subject that mainly pertains to M. musculus was warranted here, where the main focus is Peromyscus.

      • Figure 8 and its legend are difficult to follow. The top half of the figure is not well explained and it's unclear what species this is. Decreased use of abbreviations would help. Consider marking each R2 value as Mus or Peromyscus (As done in Fig 9). There are some typographical errors in the legend ("gree," incomplete sentence missing the words LPS or treatment AND Mus: "Co-variation between transcripts for selected PRRs (yellow) and ISGs (gree) in the blood of P. leucopus (P) or (M) with (L") or without (C)."

      This is now Figure 9 in both R1 and R2. We revised it for R1 to include references to the box plots in supplementary materials, but agree with Reviewer #1's recommendation to correct the typos and make the legend less confusing. We did not think that further labeling of the R2 values in the scatterplots with the species names was necessary. The data points are not just colors but also different symbols, so it should be fairly easy for readers to distinguish the regression lines by species. For R2 this is the revised legend with additions in response to the recommendation underlined:

      "Figure 9. Co-variation between transcripts for selected PRRs and ISGs in the blood of P. leucopus (P) or M. musculus (M) with (L) or without (C) LPS treatment. Top panel: matrix of coefficients of determination (R2) for combined P. leucopus and M. musculus data. PRRs are indicated by yellow fill and ISGs by blue fill on horizontal and vertical axes. Shades of green of the matrix cells correspond to R2 values, where cells with values less than 0.30 have white fill and those of 0.90-1.00 have deepest green fill. Bottom panels: scatter plots of log-transformed normalized Mx2 transcripts on Rigi (left), Ifih1 (center), and Gbp4 (right). The linear regression curves are for each species. For the right-lower graph the result from the General Linear Model (GLM) estimate is also given. Values for analysis are in Table S4; box plots for Gbp4, Irf7, Isg15, Mx2, and Oas1 are provided in Figure S6."

      • Discussion section could benefit from editing for clarity. Examples listed: o Unclear what effect is described here "The bacterial infection experiment indicated that the observed effect in P. leucopus was not limited to a TLR4 agonist; the lipoproteins of B. hermsii are agonists for TLR2 (Salazar et al. 2009)."

      Both R1 and R2 include the new section on the B. hermsii infection model. This was added in response to Reviewer #1 public comment. So the expanded consideration of this aspect should address the reviewer's recommendation for more clarity and context here. For R2 we modified the text in the discussion of R1:

      "The analysis here of the B. hermsii infection experiment also indicated that the phenomenon observed in P. leucopus was not limited to a TLR4 agonist."

      o Unclear what the takeaway from this paragraph is: "Reducing the differences between P. leucopus and the murids M. musculus and R. norvegicus to a single all-embracing attribute may be fruitless. But from a perspective that also takes in the 2-3x longer life span of the whitefooted deer mouse compared to the house mouse and the capacity of P. leucopus to serve as disease agent reservoir while maintaining if not increasing its distribution (Moscarella et al. 2019), the feature that seems to best distinguish the deer mouse from either the mouse or rat is its predominantly anti-inflammatory quality. The presentation of this trait likely has a complex, polygenic basis, with environmental (including microbiota) and epigenetic influences. An individual's placement is on a spectrum or, more likely, a landscape rather than in one or another binary or Mendelian category."

      We agree that modification, simplication, and clarification was called for. In response to a public comment of Reviewer #1 we had changed that section, leaving out reference to longevity here. Here is the revised text in both R1 and R2:

      "Reducing differences between P. leucopus and murids M. musculus and R. norvegicus to a single attribute, such as the documented inactivation of the Fcgr1 gene in P. leucopus (7), may be fruitless. But the feature that may best distinguish the deermouse from the mouse and rat is its predominantly anti-inflammatory quality. This characteristic likely has a complex, polygenic basis, with environmental (including microbiota) and epigenetic influences. An individual’s placement is on a spectrum or, more likely, a landscape rather than in one or another binary or Mendelian category."

      Minor comments:

      • Use of blue and red in figures as the -only- way to easily distinguish between groups is a poor choice-both in terms of how inclusivity of color-blind researchers and enabling grayscale printing. Most detrimental in Figure 2, but also slightly problematic in Figure 1. Use of color and shape (as done in other figures) is a much better alternative.

      We agree. Both figures have been modified to include an additional characteristic for denoting the data point. For Figure 1 it is a black filling, and for Figure 2 it is the size of symbol in additon to the color. This should enable accurate visualization by color blind individuals and printing in gray scale. We have added definitions for the symbols within the graph itself, so there is no need to refer to the legend to interpret what they mean.

      • Note the typo where it should read P leucopus: "The differences between P. musculus and M. musculus in the ratios of Nos2/Arg1 and IL12/IL10 were reported before (BalderramaGutierrez et al. 2021),"

      We thank the reviewer for pointing this typo out, which also carried over to R1. It has been corrected for R2.

      • Optional: Can the relationship between the ratios in figure 5 and macrophage "types" be displayed graphically alongside the graphs? It's a little challenging to go back and forth between the text and the figure to try to understand the biological implication.

      We considered something like this but in the end decided that we were not yet comfortable assigning “types” in this fashion for Peromyscus.

      Reviewer #2 (Recommendations For The Authors):

      • Be consistent with nomenclature for your species/treatment groups in the text, figures, and tables. For example, you go back and forth between "P. leucopus" and "deermouse" in the text. And in figures you use "P," "Peromyscus", or "Pero".

      In the Methods section of the original and revisions R1 and R2 we indicate that "deermouse" is synonymous with "Peromyscus leucopus" and "mouse" is synonymous with "Mus musculus" in the context of this paper. We think that some alternation in the terms relieves the text of some of its repetitiveness and that readers should not have a problem with equating one with the other. The use of "deermouse" also reinforces for readers that Peromyscus is not a mouse. With regard to the abbreviations for P. leucopus, those were used to accommodate design and space issues of the figures or tables. In all cases, the abbreviations referred to are defined in the legends of the figures. So, we respectfully decline to follow this recommendation.

      • Often the sentence structure and/or word choice is irregular and makes quick/easy comprehension difficult. Several examples are:

      o The third paragraph of the introduction

      We agree that the first and second sentences are unclear. Here is the revision for R2:

      “As a species native to North America, P. leucopus is an advantageous alternative to the Eurasian-origin house mouse for study of natural variation in populations that are readily accessible (9, 53). A disadvantage for the study of any Peromyscus species is the limited reagents and genetic tools of the sorts that are applied for mouse studies.”

      o The first line after Figure 5 on page 9.

      We agree. The long sentence which we think the reviewer is referring to has been in split into two sentences for R2.

      “An ortholog of Ly6C (13), a protein used for typing mouse monocytes and other white cells, has not been identified in Peromyscus or other Cricetidae family members. Therefore, for this study the comparison with Cd14 is with Cd16 or Fcgr3, which deermice and other cricetines do have.”

      o The sentence that starts "Our attention was drawn to..." on page 14.

      We agree that the sentence was awkward and split into two sentences.

      “Our attention was drawn to ERVs by finding in the genome-wide RNA-seq of LPS-treated and control rats. Two of the three highest scoring DEGs by FDR p value and fold-change were a gagpol polyprotein of a leukemia virus with 131x fold-change from controls and a mouse leukmia virus (MLV) envelope (Env) protein with 62x fold-change (Dryad Table D5).”

      • For figures with multiple panels, use A), B) etc then indicate which panel you are discussing in your text. This is a very data heavy study and your readers can easily get lost.

      We agree and have added pointers in the text to the panels we are referring to. But we prefer to use easily understood descriptors like “left” and “upper” over assigned letters.

      • For all the figures, where are the stats from the t-tests? Why didn't you do a two-way ANOVA? Instead of multiple t-tests?

      Where we are not hypothesis testing and we are able to show all the data points in box-whisker plots with distributions fully revealed, our default position is not to apply significance tests in a post hoc fashion. If a reader or other investigator wants to do this for other purposes, e.g. a meta-analysis, the data is provided in public repository for them to do this. We are not sure what the reviewer means by "multiple t-tests" for "all figures". Where we do 2-tailed t-tests for presentation of data for many genes in a table for the targeted RNA (where individual values cannot shown in the table), there is always correction for multiple testing, as indicated in Methods. The p values shown as "FDR" are after correction.

      • Results paragraph "LPS experiment and hematology studies"

      o List the two species for the first description to orient the reader since you eventually include rat data.

      We agree that this is warranted and followed this recommendation for R2.

      o Not all the mice experienced tachypnea, but the text makes it seem like 100% did.

      We are not sure what the reviewer is referring to here. This is what is in the text on tachypnea: "By the experiment’s termination at 4 h, 8 of 10 M. musculus treated with LPS had tachypnea, while only one of ten LPS-treated P. leucopus displayed this sign of the sepsis state (p = 0.005)." The only other mention of "tachypnea" was in Methods.

      • Figure 1: Why was the M. musculus outlier excluded? Where any other outliers excluded?

      That data point for the mouse was not "excluded" from the graph. It is identified (MM17) for reference with Table 1, and there is the graph for all to see where it is. It was only excluded from the regression curve for control mice. There was no significance testing. There were no other outliers excluded.

      • Figure 3: explain the colors and make the scales the same for all the panels or at least for the upregulated DEGs and the downregulated DEGs.

      We have modified the legend for Figure 3 to include fuller definitions of the x-axes and a description of the color spectrum. We decline to make the x-axis scale the same for all the panels because the horizontal bars in “transcription down” panels would take up only a small fraction of the space. The x-axes are clearly defined and the colors of the bars also indicate the differences in p-values. We doubt that readers will be misled. Here is the revised legend: “Figure 3. Gene Ontology (GO) term clusters associated with up-regulated genes (upper panels) and down-regulated genes (lower panels) of P. leucopus (left panels) and M. musculus (right panels) treated with LPS in comparison with untreated controls of each species. The scale for the x-axes for the panels was determined by the highest -log10 p values in each of the 4 sets. The horizontal bar color, which ranges from white to dark brown through shades of yellow through orange in between, is a schematic representation of the -log10 p values.”

      • Results paragraph "Targeted RNA seq analysis"

      o In the third paragraph, an R2 of 0.75 is not close enough to 1 to call it "~1"

      What the reviewer is referring to is no longer in either R1 and R2, as detailed in the authors' response to public comments.

      o In the 4th paragraph, where are your stats?

      We have replaced terms like "substantially" and "marginally" with simple descriptions of relationships in the graphs.

      "For the LPS-treated animals there was, as expected for this selected set, higher expression of the majority genes and greater heterogeneity among P. leucopus and M. musculus animals in their responses for represented genes. In contrast to the findings with controls, Ifng and Nos2 had higher transcription in treated mice. In deermice the magnitude of difference in the transcription between controls and LPS-treated was less."

      • Figure 4: The colors are hard to see, I suggest making all the up regulated reads one color, the down regulated reads a different color, and the reads that aren't different black or gray.

      This is now Figure 5 in R1 and R2. The selected genes that are highlighted in the panels are denoted not only by color but also by type of symbol. We do not think that readers will have a problem telling one from another even if color blind. The purpose of this figure was to provide an overview and a visual representation with calling out of selected genes, some of which will be evaluated in more detail later. We thought that this was necessary before diving deeper into the data of Table 2. We do not think further discriminating between transcripts in the categorical way that the reviewer suggests is warranted at this point. So, we respectfully decline to follow this suggestion.

      • Results paragraph " Alternatively- activated macrophages...."

      o Include a brief description of Nos2 and Arg1

      We have defined what enzymes these are genes for in R2.

      o How do you explain the lack of a difference in P. leucopus Arg1? Your text says the RT-qPCR confirms the RNA-seq findings.

      There was a difference in P. leucopus Arg1 by RT-qPCR between control and LPS treated by about 3-fold. By both RNA-seq and RT-qPCR Arg1 transcription is higher in P. leucopus than in M. musculus under both conditions. But we have modified the sentence so that does not imply more than what the data and analysis of the table reveal.

      "While we could not type single cells using protein markers, we could assess relative transcription of established indicators of different white cell subpopulations in whole blood. The present study, which incorporated outbred M. musculus instead of an inbred strain, confirmed the previous finding of differences in Nos2 and Arg1 expression between M. musculus and P. leucopus (Figure 5; Table 2). Results similar to the RNA-seq findings were obtained with specific RT-qPCR assays for Nos2 and Arg1 transcripts for P. musculus and M. musculus (Table 3)."

      • Figure 5: reorganize the panels to make the text description and label with letters, where are the stats?

      We thought the figure (now Figure 6) was self-explanatory, but agree that further explanation in the legend was indicated. We prefer to use descriptions of locations (“upper left”) over labels, like “panel C”, which do not obviously indicate the location of the panel. Of course, if the journal’s style mandates the other format we will do so. Our response about “stats” for boxplot figures is the same as what we provided above.

      • Results paragraph "Interferon-gamma and interleukin-1 beta..."

      o Either add the numbers or direct the viewer to where Ifng is in Table 2. The table is very big and Ifng is all the way at the bottom!

      We agree that this table is large, but we thought it better to err on the side of inclusiveness by having a single table, rather than have some genes in the main article and other results in a supplementary table. We thought that it would make it easier for reviewers and readers to find a gene of interest, but we also acknowledge the challenge to locate the genes we highlight. We follow for R2 that reviewer's recommendation to provide some guidance for readers trying to locate a featured gene by pointing relative locations. While adding a column of numbers to already complex table seems more than what is called for, we are depositing an Excel spreadsheet of the table at the Dryad repository to facilitate searching by an interested reader for a particular gene.

      • Figure 6: stats? The pink and red are hard to easily distinguish from each other. I also suggest not using red and green together for color blind readers.

      With regard to the box-plots and significance testing, please see response above to an earlier recommendation. We have removed an interpretative adjective (i.e. "marked") from the description of the graph. Different symbols as well as colors are used, so we do not think that this will pose a problem for readers, even those with complete red-green color blindness. For what it’s worth, with regard to the "red" and "pink" issue, according to the figure on our displays the colors of the two symbols appear to be red and purple. They are also applied to different species and different conditions for those species.

      • Figure 8: In the legend it says "... PRRs (yellow) and ISGs (gree)" which is a typo, but don't you mean blue not green anyways?

      See response above to Reviewer #1's recommendation. This has been corrected.

    1. Author Response:

      Reviewer #1 (Public Review):

      [...] Overall the manuscript is well written, and the successful generation of the new endogenous Cac tags (Td-Tomato, Halo) and CaBeta, stj, and stolid genes with V5 tags will be powerful reagents for the field to enable new studies on calcium channels in synaptic structure, function, and plasticity. There are also some interesting, though not entirely unexpected, findings regarding how Brp and homeostatic plasticity modulate calcium channel abundance. However, a major concern is that the conclusions about how "molecular and organization diversity generate functional synaptic heterogeneity" are not really supported by the data presented in this study. In particular, the key fact that frames this study is that Cac levels are similar at Ib and Is active zones, but that Pr is higher at Is over Ib (which was previously known). While Pr can be influenced by myriad processes, the authors should have first assessed presynaptic calcium influx - if they had, they would have better framed the key questions in this study. As the authors reference from previous studies, calcium influx is at least two-fold higher per active zone at Is over Ib, and the authors likely know that this difference is more than sufficient to explain the difference in Pr at Is over Ib. Hence, there is no reason to invoke differences in "molecular and organization diversity" to explain the difference in Pr, and the authors offer no data to support that the differences in active zone structure at Is vs Ib are necessary for the differences in Pr. Indeed, the real question the authors should have investigated is why there are such differences in presynaptic calcium influx at Is over Ib despite having similar levels/abundance of Cac. This seems the real question, and is all that is needed to explain the Pr differences shown in Fig. 1. The other changes in active zone structure and organization at Is vs Ib may very well contribute to additional differences in Pr, but the authors have not shown this in the present study, and rely on other studies (such as calcium-SV coupling at Is vs Ib) to support an argument that is not necessitated by their data. At the end of this manuscript, the authors have found an interesting possibility that Stj levels are reduced at Is vs Ib, that might perhaps contribute to the difference in calcium influx. However, at present this remains speculative.

      Overall, the authors have generated powerful reagents for the field to study calcium channels and how they are regulated, but draw conclusions about active zone structure and organization contributing to functional heterogeneity that are not strongly supported by the data presented.

      Reviewer 1 raises an interesting question that we agree will form the basis of important studies. Here, we set out to address a different question, which we will work to better frame. While we and others had previously found a strong correlation between calcium channel abundance and synaptic release probability (Pr (Akbergenova et al., 2018; Gratz et al., 2019; Holderith et al., 2012; Nakamura et al., 2015; Sheng et al., 2012)), more recent studies found that calcium channel abundance does not necessarily predict synaptic strength (Aldahabi et al., 2022; Rebola et al., 2019). Our study explores this paradox and presents findings that provide an explanation: calcium channel abundance predicts Pr among individual synapses of either low-Pr type-Ib or high-Pr type-Is inputs where modulating channel number tunes synaptic strength, but does not predict Pr between the two inputs, indicating an inputspecific role for calcium channel abundance in promoting synaptic strength. Thus, we propose that calcium channel abundance predictably modulates synaptic strength among individual synapses of a single input or synapse subtype, which share similar molecular and spatial organization, but not between distinct inputs where the underlying organization of active zones differs. Consistently, in the mouse, calcium channel abundance correlates strongly with release probability specifically when assessed among homogeneous populations of connections (Aldahabi et al., 2022; Holderith et al., 2012; Nakamura et al., 2015; Rebola et al., 2019; Sheng et al., 2012).

      As Reviewer 1 notes, the two-fold difference in calcium influx at type-Is synapses is certainly an important difference underlying three-fold higher Pr. However, growing evidence indicates that calcium influx alone, like calcium channel abundance, does not reliably predict synaptic strength between inputs. For example, Rebola et al. (2019) compared cerebellar synapses formed by granule and stellate cells and found that lower Pr granule synapses exhibit both higher calcium channel abundance and calcium influx. In another example, Aldahabi et al. (2023) demonstrate that even when calcium influx is greater at high-Pr synapses, it does not necessarily explain differences in synaptic strength between inputs. Studying excitatory hippocampal CA1 synapses onto distinct interneuronal targets, they found that raising calcium entry at low-Pr inputs to high-Pr synapse levels is not sufficient to increase synaptic strength to high-Pr synapse levels. Similarly, at the Drosophila NMJ, the finding that type-Ib synapses exhibit loose calcium channel-synaptic vesicle coupling whereas type-Is synapses exhibit tight coupling suggests factors beyond calcium influx also contribute to differences in Pr between the two inputs (He et al., 2023). Consistently, a two-fold increase in external calcium does not induce a three-fold increase in release at low-Pr type-Ib synapses (He et al., 2023). Thus, upon finding that calcium channel abundance is similar at type-Ib and -Is synapses, we focused on identifying differences beyond calcium channel abundance and calcium influx that might contribute their distinct synaptic strengths. We agree that these studies, ours included, cannot definitively determine the contribution of identified organizational differences to distinct release probabilities because it is not currently possible to specifically alter subsynaptic organization, and will ensure that our language is tempered accordingly. However, in addition to the studies cited above and our findings, recent work demonstrating that homeostatic potentiation of neurotransmitter release is accompanied by greater spatial compaction of multiple active zone proteins (Dannhauser et al., 2022; Mrestani et al., 2021) and decreased calcium channel mobility (Ghelani et al., 2023) provide support for the interpretation that subsynaptic organization is a key parameter for modulating Pr.

      Reviewer #2 (Public Review):

      The authors aim to investigate how voltage-gated calcium channel number, organization, and subunit composition lead to changes in synaptic activity at tonic and phasic motor neuron terminals, or type Is and Ib motor neurons in Drosophila. These neuron subtypes generate widely different physiological outputs, and many investigations have sought to understand the molecular underpinnings responsible for these differences. Additionally, these authors explore not only static differences that exist during the third-instar larval stage of development but also use a pharmacological approach to induce homeostatic plasticity to explore how these neuronal subtypes dynamically change the structural composition and organization of key synaptic proteins contributing to physiological plasticity. The Drosophila neuromuscular junction (NMJ) is glutamatergic, the main excitatory neurotransmitter in the human brain, so these findings not only expand our understanding of the molecular and physiological mechanisms responsible for differences in motor neuron subtype activity but also contribute to our understanding of how the human brain and nervous system functions.

      The authors employ state-of-the-art tools and techniques such as single-molecule localization microscopy 3D STORM and create several novel transgenic animals using CRISPR to expand the molecular tools available for exploration of synaptic biology that will be of wide interest to the field. Additionally, the authors use a robust set of experimental approaches from active zone level resolution functional imaging from live preparations to electrophysiology and immunohistochemical analyses to explore and test their hypotheses. All data appear to be robustly acquired and analyzed using appropriate methodology. The authors make important advancements to our understanding of how the different motor neuron subtypes, phasic and tonic-like, exhibit widely varying electrical output despite the neuromuscular junctions having similar ultrastructural composition in the proteins of interest, voltage gated calcium channel cacophony (cac) and the scaffold protein Bruchpilot (brp). The authors reveal the ratio of brp:cac appears to be a critical determinant of release probability (Pr), and in particular, the packing density of VGCCs and availability of brp. Importantly, the authors demonstrate a brp-dependent increase in VGCC density following acute philanthotoxin perfusion (glutamate receptor inhibitor). This VGCC increase appears to be largely responsible for the presynaptic homeostatic plasticity (PHP) observable at the Drosophila NMJ. Lastly, the authors created several novel CRISPRtagged transgenic lines to visualize the spatial localization of VGCC subunits in Drosophila. Two of these lines, CaBV5-C and stjV5-N, express in motor neurons and in the nervous system, localize at the NMJ, and most strikingly, strongly correlate with Pr at tonic and phasic-like terminals.

      1) The few limitations in this study could be addressed with some commentary, a few minor follow-up analyses, or experiments. The authors use a postsynaptically expressed calcium indicator (mhcGal4>UAS -GCaMP) to calculate Pr, yet do not explore the contribution that glutamate receptors, or other postsynaptic contributors (e.g. components of the postsynaptic density, PSD) may contribute. A previous publication exploring tonic vs phasic-like activity at the drosophila NMJ revealed a dynamic role for GluRII (Aponte-Santiago et al, 2020). Could the speed of GluR accumulation account for differences between neuron subtypes?

      We did observe that GCaMP signals are higher at type Is synapses, where synapses tend to form later but GluRs accumulate more rapidly upon innervation (Aponte-Santiago et al., 2020). However, because we are using our GCaMP indicator as a plus/minus readout of synaptic vesicle release at mature synapses, we do not expect differences in GluR accumulation to have a significant effect on our measures. Consistently, the difference in Pr we observe between type-Ib and -Is inputs (Fig. 1C) is similar to that previously reported (He et al., 2023; Lu et al., 2016; Newman et al., 2022).

      2) The observation that calcium channel density and brp:cac ratio as a critical determinant of Pr is an important one. However, it is surprising that this was not observed in previous investigations of cac intensity (of which there are many). Is this purely a technical limitation of other investigations, or are other possibilities feasible? Additionally, regarding VGCC-SV coupling, the authors conclude that this packing density increases their proximity to SVs and contributes to the steeper relationship between VGCCs and Pr at phasic type Is. Is it possible that brp or other AZ components could account for these differences. The authors possess the tools to address this directly by labeling vesicles with JanellaFluor646; a stronger signal should be present at Is boutons. Additionally, many different studies have used transmission electron microscopy to explore SVs location to AZs (t-bars) at the Drosophila NMJ.

      To date, the molecular underpinnings of heterogeneity in synaptic strength have primarily been investigated among individual type-Ib synapses. However, a recent study investigating differences between type-Ib and -Is synapses also found that the Cac:Brp ratio is higher at type-Is synapses (He et al., 2023).

      At this point, we do not know which active zone components are responsible for the organizational (Figs. 1, 2) and coupling (now demonstrated by He et al., 2023) differences between type-Ib and -Is synapses or what establishes the differences in active zone protein levels we observe (Figs. 3,6), although Brp likely plays a local role. We find that Brp is required for dynamically regulating calcium channel levels during homeostatic plasticity and plays distinct roles at type-Ib and -Is synapses (Figs. 3, 4). Brp regulates a number of proteins critical for the distribution of docked synaptic vesicles near T bars of type Ib active zones, including Unc13 (Bohme et al., 2016). Extending these studies to type-Is synapses will be of great interest.

      3) In reference to the contradictory observations that VGCC intensity does not always correlate with, or determine Pr. Previous investigations have also observed other AZ proteins or interactors (e.g. synaptotagmin mutants) critically control release, even when the correlation between cac and release remains constant while Pr dramatically precipitates.

      This is an important point as a number of molecular and organizational differences between high- and low-Pr synapses certainly contribute to baseline functional differences. The other proteins we (Figs. 3,6) and others (Dannhauser et al., 2022; Ehmann et al., 2014; He et al., 2023; Jetti et al., 2023; Mrestani et al., 2021; Newman et al., 2022) have investigated are less abundant and/or more densely organized at type-Is synapses. Investigating additional active zone proteins, including synaptic proteins, and determining how these factors combine to yield increased synaptic strength are important next steps.

      4) To confirm the observations that lower brp levels results in a significantly higher cac:brp ratio at phasic-like synapses by organizing VGCCs; this argument could be made stronger by analyzing their existing data. By selecting a population of AZs in Ib boutons that endogenously express normal cac and lower brp levels, the Pr from these should be higher than those from within that population, but comparable to Is Pr. I believe the authors should also be able to correlate the cac:brp ratio with Pr from their data set generally; to determine if a strong correlation exists beyond their observation for cac correlation.

      We do not have simultaneous measures of Pr and Cac and Brp abundance. However, our findings suggest that distinct Cac:Brp ratios at type Ib and Is inputs reflect underlying organizational differences that contribute to distinct release probabilities between the two synaptic subtypes. In contrast, within either synaptic subtype, release probability is positively correlated with both Cac and Brp levels. Thus, the mechanisms driving functional differences between synaptic subtypes are distinct from those driving functional heterogeneity within a subtype, so we do not expect Cac:Brp ratio to correlate with Pr among individual type-Ib synapses. We will work to clarify this point in the revised text.

      5) For the philanthotoxin induced changes in cac and brp localization underlying PHP, why do the authors not show cac accumulation after PhTx on live dissected preparations (i.e. in real time)? This also be an excellent opportunity to validate their brp:cac theory. Do the authors observe a dynamic change in brp:cac after 1, or 5 minutes; do Is boutons potentiate stronger due to proportional increases in cac and brp? Also regarding PhTx-induced PHP, their observations that stj and α2δ-3 are more abundant at Is synapses, suggests that they may also play a role in PhTx induced changes in cac. If either/both are overexpressed during PhTx, brp should increase while cac remains constant. These accessory proteins may determine cac incorporation at AZs.

      As we have previously followed Cac accumulation in live dissected preparations and found that levels increase proportionally across individual synapses (Gratz et al., 2019), we did not attempt to repeat these challenging experiments at smaller type-Is synapses. We will reanalyze our data to investigate Cac:Brp ratio at individual active zones post PhTx. However, as noted above, we do not expect changes in the Cac:Brp ratio to correlate with Pr among individual synapses of single inputs as this measure reflects organization differences between inputs and PhTx induces an increase in the abundance of both proteins at both inputs.

      Determining the effect of PhTx on Stj levels at type-Ib and -Is active zones is an excellent idea and might provide insight into how lower Stj levels correlate with higher Pr at type-Is synapses. While prior studies have demonstrated critical roles for Stj in regulating Cac accumulation during development and in promoting presynaptic homeostatic potentiation (Cunningham et al., 2022; Dickman et al., 2008; Kurshan et al., 2009; Ly et al., 2008; Wang et al., 2016), its regulation during PHP has not been investigated.

      Taken together this study generates important data-driven, conceptional, and theoretical advancements in our understanding of the molecular underpinnings of different motor neurons, and our understanding of synaptic biology generally. The data are robust, thoroughly analyzed, appropriately depicted. This study not only generates novel findings but also generated novel molecular tools which will aid future investigations and investigators progress in this field.

      References

      Akbergenova, Y., K.L. Cunningham, Y.V. Zhang, S. Weiss, and J.T. Littleton. 2018. Characterization of developmental and molecular factors underlying release heterogeneity at Drosophila synapses. eLife. 7.

      Aldahabi, M., F. Balint, N. Holderith, A. Lorincz, M. Reva, and Z. Nusser. 2022. Different priming states of synaptic vesicles underlie distinct release probabilities at hippocampal excitatory synapses. Neuron. 110:4144-4161 e4147.

      Aponte-Santiago, N.A., K.G. Ormerod, Y. Akbergenova, and J.T. Littleton. 2020. Synaptic Plasticity Induced by Differential Manipulation of Tonic and Phasic Motoneurons in Drosophila. The Journal of neuroscience : the official journal of the Society for Neuroscience. 40:6270-6288.

      Bohme, M.A., C. Beis, S. Reddy-Alla, E. Reynolds, M.M. Mampell, A.T. Grasskamp, J. Lutzkendorf, D.D. Bergeron, J.H. Driller, H. Babikir, F. Gottfert, I.M. Robinson, C.J. O'Kane, S.W. Hell, M.C. Wahl, U. Stelzl, B. Loll, A.M. Walter, and S.J. Sigrist. 2016. Active zone scaffolds differentially accumulate Unc13 isoforms to tune Ca(2+) channel-vesicle coupling. Nature neuroscience. 19:1311-1320.

      Cunningham, K.L., C.W. Sauvola, S. Tavana, and J.T. Littleton. 2022. Regulation of presynaptic Ca(2+) channel abundance at active zones through a balance of delivery and turnover. Elife. 11.

      Dannhauser, S., A. Mrestani, F. Gundelach, M. Pauli, F. Komma, P. Kollmannsberger, M. Sauer, M. Heckmann, and M.M. Paul. 2022. Endogenous tagging of Unc-13 reveals nanoscale reorganization at active zones during presynaptic homeostatic potentiation. Front Cell Neurosci. 16:1074304.

      Dickman, D.K., P.T. Kurshan, and T.L. Schwarz. 2008. Mutations in a Drosophila alpha2delta voltagegated calcium channel subunit reveal a crucial synaptic function. The Journal of neuroscience : the official journal of the Society for Neuroscience. 28:31-38.

      Ehmann, N., S. Van De Linde, A. Alon, D. Ljaschenko, X.Z. Keung, T. Holm, A. Rings, A. Diantonio, S. Hallermann, U. Ashery, M. Heckmann, M. Sauer, and R.J. Kittel. 2014. Quantitative super-resolution imaging of Bruchpilot distinguishes active zone states. Nature Communications. 5.

      Ghelani, T., M. Escher, U. Thomas, K. Esch, J. Lützkendorf, H. Depner, M. Maglione, P. Parutto, S. Gratz, T. Matkovic-Rachid, S. Ryglewski, A.M. Walter, D. Holcman, K. O‘Connor Giles, M. Heine, and S.J. Sigrist. 2023. Interactive nanocluster compaction of the ELKS scaffold and Cacophony Ca<sup>2+</sup> channels drives sustained active zone potentiation. Science Advances. 9:eade7804.

      Gratz, S.J., P. Goel, J.J. Bruckner, R.X. Hernandez, K. Khateeb, G.T. Macleod, D. Dickman, and K.M. O'Connor-Giles. 2019. Endogenous tagging reveals differential regulation of Ca<sup>2+</sup> channels at single AZs during presynaptic homeostatic potentiation and depression. The Journal of Neuroscience:3068-3018.

      He, K., Y. Han, X. Li, R.X. Hernandez, D.V. Riboul, T. Feghhi, K.A. Justs, O. Mahneva, S. Perry, G.T. Macleod, and D. Dickman. 2023. Physiologic and Nanoscale Distinctions Define Glutamatergic Synapses in Tonic vs Phasic Neurons. The Journal of neuroscience : the official journal of the Society for Neuroscience. 43:4598-4611.

      Holderith, N., A. Lorincz, G. Katona, B. Rózsa, A. Kulik, M. Watanabe, and Z. Nusser. 2012. Release probability of hippocampal glutamatergic terminals scales with the size of the active zone. Nature neuroscience. 15:988-997.

      Jetti, S.K., A.B. Crane, Y. Akbergenova, N.A. Aponte-Santiago, K.L. Cunningham, C.A. Whittaker, and J.T. Littleton. 2023. Molecular Logic of Synaptic Diversity Between Drosophila Tonic and Phasic Motoneurons. bioRxiv:2023.2001.2017.524447.

      Kurshan, P.T., A. Oztan, and T.L. Schwarz. 2009. Presynaptic alpha2delta-3 is required for synaptic morphogenesis independent of its Ca2+-channel functions. Nature neuroscience. 12:1415-1423.

      Lu, Z., A.K. Chouhan, J.A. Borycz, Z. Lu, A.J. Rossano, K.L. Brain, Y. Zhou, I.A. Meinertzhagen, and G.T. Macleod. 2016. High-Probability Neurotransmitter Release Sites Represent an Energy-Efficient Design. Current biology : CB. 26:2562-2571.

      Ly , C.V., C.-K. Yao , P. Verstreken , T. Ohyama , and H.J. Bellen 2008. straightjacket is required for the synaptic stabilization of cacophony, a voltage-gated calcium channel α1 subunit. Journal of Cell Biology. 181:157-170.

      Mrestani, A., M. Pauli, P. Kollmannsberger, F. Repp, R.J. Kittel, J. Eilers, S. Doose, M. Sauer, A.-L. Sirén, M. Heckmann, and M.M. Paul. 2021. Active zone compaction correlates with presynaptic homeostatic potentiation. Cell Reports. 37:109770.

      Nakamura, Y., H. Harada, N. Kamasawa, K. Matsui, Jason S. Rothman, R. Shigemoto, R.A. Silver, David A. DiGregorio, and T. Takahashi. 2015. Nanoscale Distribution of Presynaptic Ca2+ Channels and Its Impact on Vesicular Release during Development. Neuron. 85:145-158.

      Newman, Z.L., D. Bakshinskaya, R. Schultz, S.J. Kenny, S. Moon, K. Aghi, C. Stanley, N. Marnani, R. Li, J. Bleier, K. Xu, and E.Y. Isacoff. 2022. Determinants of synapse diversity revealed by superresolution quantal transmission and active zone imaging. Nature Communications. 13:229.

      Rebola, N., M. Reva, T. Kirizs, M. Szoboszlay, A. Lőrincz, G. Moneron, Z. Nusser, and D.A. Digregorio. 2019. Distinct Nanoscale Calcium Channel and Synaptic Vesicle Topographies Contribute to the Diversity of Synaptic Function. Neuron. 104:693-710.e699.

      Sheng, J., L. He, H. Zheng, L. Xue, F. Luo, W. Shin, T. Sun, T. Kuner, D.T. Yue, and L.-G. Wu. 2012. Calcium-channel number critically influences synaptic strength and plasticity at the active zone. Nature neuroscience. 15:998-1006.

      Wang, T., R.T. Jones, J.M. Whippen, and G.W. Davis. 2016. alpha2delta-3 Is Required for Rapid Transsynaptic Homeostatic Signaling. Cell Rep. 16:2875-2888.

    1. Author Response

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public Review):

      The manuscript describes an interesting experiment in which an animal had to judge a duration of an interval and press one of two levers depending on the duration. The Authors recorded activity of neurons in key areas of the basal ganglia (SNr and striatum), and noticed that they can be divided into 4 types.

      The data presented in the manuscript is very rich and interesting, however, I am not convinced by the interpretation of these data proposed in the paper. The Authors focus on neurons of types 1 & 2 and propose that their difference encodes the choice the animal makes. However, I would like to offer an alternative interpretation of the data. Looking at the description of task and animal movements seen in Figure 1, it seems to me that there are 4 main "actions" the animals may do in the task: press right lever, press left lever, move left, and move right. It seems to me that the 4 neurons authors observed may correspond to these actions, i.e. Figure 1 shows that Type 1 neurons decrease when right level becomes more likely to be correct, so their decrease may correspond to preparation of pressing right lever - they may be releasing this action from inhibition (analogously Type 2 neurons may be related to pressing left lever). Furthermore, comparing animal movements and timing of activity of neurons of type 3 and 4, it seems to me that type 3 neurons decrease when the animal moves left, while type 4 when the animal moves right.

      I suggest Authors analyse if this interpretation is valid, and if so, revise the interpretation in the paper and the model accordingly.

      We thank the reviewer for the general appreciation of the study. Regarding to the interpretation of each SNr subtypes, we have compared firing activities of the same SNr neurons in both standard 2-8 s task and reversed 2-8 s task (Figure 2G-R, Figure S4). Type 1 and Type 2 neurons are related to right and left choices respectively in the standard task (Figure 2G, M, N), and this is even more evident in the reversed 2-8 s task (Figure 2J), because when the movement trajectories of the same mice in 8-s trials were reversed from left-then-right in the control task (Figure 2I) to right-then-left in the reversed task (Figure 2L), the Type 1 SNr neurons which showed monotonic decreasing dynamics in the control 2-8 s task (Figure 2M) reversed their neuronal dynamics to a monotonic increase in the reversed 2-8 s task (Figure 2P). The same reversal of neuronal dynamics was also observed in Type 2 SNr neurons in the reversed version of standard task (Figure 2N vs Figure 2Q). Therefore, Type 1 and Type 2 neurons are related to the action selection. Furthermore, Type 3 and Type 4 SNr neurons exhibiting transient change when mice switching either from left to right, or from left to right maintained the same neuronal dynamics in both standard 2-8 s task and reversed 2-8 s task (Figure S4C-F), indicating that Type 3 and Type 4 neurons are related to the switch between choices but not the specific upcoming choice to be made.

      Reviewer #1 (Recommendations For The Authors):

      Suggest to clarify if SNr neurons recorded just from a single hemisphere or bilaterally.

      We have described the recording hemisphere in our Methods (page 46, lines 974-976) as follows “For striatum recording, we implanted 11 mice in the left hemisphere and 8 mice in the right hemisphere. For the SNr recording, we implanted 5 mice in the left hemisphere and 4 mice in the right hemisphere.”

      Suggest to analyse if type 1/2/3/4 neurons are preferrably located in hemispheres contra/ipsi lateral to a particular lever or movement.

      We have addressed this issue in Figure S3 and Figure S6. In fact, we have implanted electrodes in both left and right hemispheres with mirror M-L coordinates. For striatum recording, we implanted 11 mice in the left hemisphere and 8 mice in the right hemisphere. For the SNr recording, we implanted 5 mice in the left hemisphere and 4 mice in the right hemisphere. We have analyzed the striatal and SNr neuronal activity in left vs. right hemisphere respectively, in relation to action selection. We found that SNr neurons recorded in either left or right hemisphere exhibited the same four types of neural dynamics with similar proportions (Fig. S3). Specially, the Type 1 neurons are dominant in both hemispheres. Similar in striatum, SPNs from left and right hemispheres showed the same four types of neural dynamics with similar proportions (Fig. S6). Therefore, there is no significant difference between hemispheres regarding to the proportion of neuron subtypes.

      Suggest to investigate if type 1/2 neurons are involved in preparation for lever press, please investigate if these neurons are also changing their activity during the lever press.

      In Figure S1L, we have showed the neuronal activities of example Type 1 and Type 2 SNr neurons to rewarded and non-rewarded lever presses. Type 1 SNr neuron shows higher firing activities when pressing the left lever than pressing the right lever, whereas Type 2 SNr neuron shows higher firing activities when pressing the right lever than pressing the left lever, indicating that Type 1 and Type 2 neurons firing activities are action choice dependent.

      Suggest investigating if Type 3/4 neurons are controlling movement from one location to another, please analyse if their activity is correlated with the movement on trial by trial bases.

      In Figure S2C-D, we showed firing activities of example Type 3 and Type 4 neurons on trial-by-trial bases. Type 3 neuron showed increased firing activities between 3-4 s during the 8s lever retraction period when the animal switched from left side to right side, whereas Type 4 neuron showed decreased firing activities between 3-4 s during as the animal switching from left to right. We further showed in Figure S4C-F, Type 3 and Type 4 neurons Type 3 and Type 4 neurons are related to the switch between choices but not the specific upcoming choice to be made.

      Suggest also performing analogous analyses for striatal neurons.

      We showed 4 types of SPNs on the on trial-by-trial bases as follows. Due to the limitation of the number of figures, these data were not included in the manuscript. We have now included these results in Fig. S2(E-H).

      Typo: l. 68: "can bidirectionally regulates" -> "can bidirectionally regulate"

      Thanks, we have now corrected the typos.

      Reviewer #2 (Public Review):

      In this valuable manuscript Li & Jin record from the substantial nigra and dorsal striatum to identify subpopulations of neurons with activity that reflects different dynamics during action selection, and then use optogenetics in transgenic mice to selectively inhibit or excite D1- and D2- expressing spiny projection neurons in the striatum, demonstrating a causal role for each in action selection in an opposing manner. They argue that their findings cannot be explained by current models and propose a new 'triple control' model instead, with one direct and two indirect pathways. These findings will be of broad interest to neuroscientists, but lacks some direct evidence for the proposal of the new model.

      Overall there are many strengths to this manuscript including the fact that the empirical data in this manuscript is thorough and the experiments are well-designed. The model is well thought through, but I do have some remaining questions and issues with it.

      Weaknesses:

      1) The nature of 'action selection' as described in this manuscript is a bit ambiguous and implies a level of cognition or choice which I'm not sure is there. It's not integral to the understanding of the paper really, but I would have liked to know whether the actions are under goal-directed/habitual or even Pavlovian control. This is not really possible to differentiate with this task as there are a number of Pavlovian cues (e.g. lever retraction interval, house light offset) that could be used to guide behavior.

      Sorry for the confusion of task description in the manuscript. We appreciate reviewer’s deep understanding about the complexity of the 2-8 s task we designed. Indeed, the 2-8 s task can’t be simply categorized as goal-directed/habitual or Pavlovian task. There are several behavioral aspects in this task. Lever retraction is served as a Pavlovian cue for mice to start performing the left-then-right sequential movement, but once levers are retracted, there is no cue available to mice during the lever retraction period, and mice have to make a decision to switch choice solely based on its internal estimation of the passage of time, which is considered as a cognitive process. The house light stays on for the entire training session (2 – 3 hours), and will be turned off when the task is done, so house light will not be used as a guidance for choice behavior. The behavior and neural activities during the lever retraction period is our main focus in this manuscript. The main advantage of such task design is that the animal is engaged in a self-determined, dynamic switch of action selection process, which offers a unique opportunity for investigating the role of various neuronal populations in the basal ganglia pathways during action selection.

      2) In a similar manner, the part of the striatum that is being targeted (e.g. Figures 4E,I, and N) is dorsal, but is central with regards to the mediolateral extent. We know that the function of different striatal compartments is highly heterogeneous with regards to action selection (e.g. PMID: 16045504, 16153716, 11312310) so it would have been nice to have some data showing how specific these findings are to this particular part of dorsal striatum.

      We thank the reviewer for bringing up this point. We are targeting dorsal-central part of striatum. In Figure S5G-L, we showed the specific location we targeted in striatum. Also as specified in Methods (lines 965-970), the craniotomies for electrode implantation were made at the following coordinates: 0.5 mm rostral to bregma and 1.5 mm laterally, and ~ 2.2 mm from the surface of the brain for dorsal striatum. For the virus injection and optic fiber implantation (lines 997-998), the craniotomies was made bilaterally at 0.5 mm rostral to bregma, 2 mm laterally and ~ 2.2 mm from the surface of the brain.

      3) I'm not sure how I feel about the diagrams in Figure 4S. In particular, the co-activation model is shown with D2-SPNs represented as a + sign (which is described as "having a facilitatory effect to selection" in the caption), but the co-activation model still suggests that D2-SPNs are largely inhibitory - just of competing actions rather than directly inhibiting actions. Moreover, I am not sure about these diagrams because they appear to show that D2-SPNs far outnumbers D1-SPNs and we know that this isn't the case. I realize the diagrams are not proportionate, but it still looks a bit misrepresented to me.

      We appreciate the reviewer’s comments about the diagram. We borrowed and extended the “center-surround” layout from the receptive field of neurons in the early visual system, as an intuitive analogy in describing the functional interaction among striatal pathways (also see Mink 2003 Archives of Neurology). In the co-activation model, if D2-SPNs inhibit the competing action, then the target action will be more likely to be selected due to the reduced competition, which means D2-SPNs actually facilitate the target action in an indirect way. And this is why we define the effect of D2-SPNs in the co-activation model as facilitatory. The area of each region does not represent the amount of cells but mainly qualitative functional role. To make it clearer, we have now added more explanation in the manuscript (page 17, lines 338-341).

      4). There are a number of grammatical and syntax errors that made the manuscript difficult to understand in places.

      We have now gone through the text carefully and corrected the typos.

      5) I wondered if the authors had read PMID: 32001651 and 33215609 which propose a quite different interpretation of direct/indirect pathway neurons in striatum in action selection. I wonder if the authors considered how their findings might fit within this framework.

      We appreciate the reviewer’s comments and suggestion. Miriam Matamales et al. (2020, PMID: 32001651) found that dynamic D2- to D1-SPNs transmodulation across the striatum that is necessary for updating previously learned behavior, which highlights the importance of collateral modulations between D1- and D2-SPNs as an additional layer of behavior control besides the classic direct and indirect pathways. This finding is compatible with our “Triple control” model emphasizing the influence of collateral modulations within striatum on behavior choice. James Peak et al. (2020, PMID: 33215609) demonstrated that D2-SPNs are critical to maintain the flexibility of behavior, which is reflected in our “Triple-control” model that activation of D2-SPNs could trigger the behavioral switch from the current action to another action. Although the two studies mentioned above mainly investigate the roles of striatal D1- and D2-SPNs in action learning and behavioral strategies, their functions in general fit within our new ‘Triple-control’ model of basal ganglia pathways for action selection.

      6) There is no direct evidence of two indirect pathways, although perhaps this is beyond the scope of the current manuscript and is a prediction for future studies to test.

      As accumulating RNA-seq and physiological data implying the heterogeneity of D2-SPNs, the further investigation of the subtypes of D1- and D2-SPNs and their functionality are likely a direction the field will continue to explore. On the other hand, we have discussed other possible anatomical circuits within basal ganglia circuitry that could fulfill the functional role of a third pathway in our new ‘Triple-control’ model, together with or independent of the second indirect pathway (page 32-33, lines 689-700). We certainly hope that our new model will inspire future work to identify and dissect the additional functional pathways in the basal ganglia circuits for action control.

      Reviewer #2 (Recommendations For The Authors):

      Suggestions for authors:

      1) Consider how specific to the dorso-central striatum these findings are, possibly in the discussion.

      We have specified in the Discussion that the study is targeting dorsal-central part of striatum (page 29, lines 609-612).

      2) Modify the diagrams in 4S to make them more representative of the model's features.

      We have responded this comment above.

      3) Consider whether the findings here might fit within the role for direct pathway in excitatory action-outcome learning and the indirect pathway in response flexibility more generally.

      The current study is mainly focus on selection and execution of actions. It will definitely be important to continue exploring the functionality of direct vs. indirect pathways in the action learning process.

      4) Correct typos and grammatical errors including (but not limited to):

      a) Line 62-64 - explain why this is controversial? Is it because we don't know which one applies?

      In the “Go/No-go” model, indirect pathway inhibits the desired action and function as gain modulation, while in the “Co-activation” model, indirect pathway inhibits the competing action and in turn facilitates the desired action in an indirect manner, therefore these two existing models disagree with each other on the explanation the function of indirect pathway in its targeting action and the net outcome of behavior.

      b) Line 68 - Regulates should be regulate.

      This has been corrected in the revised manuscript.

      c) Line 86 - should read "there are neuronal populations in either the direct or indirect pathway that are activated..."

      This has been corrected in the revised manuscript.

      d) Line 146-147 - "these types of neuronal dynamics in Snr only appeared in the correct but not incorrect trials" - It seems the authors are suggesting this only for Types 1 and 2 neurons, but this confused me the first time I read it and I suggest it is made clearer.

      Line 146-147 now reads “These four types of neuronal dynamics in SNr only appeared…”

      e) Line 346 - significant should be significantly.

      This has been corrected in the revised manuscript.

      f) Line 360 "contrast" should be "contrasting".

      This has been corrected in the revised manuscript.

    1. Note: This response was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1:

      Comment: The author investigated the role of the stress sensor pathway in the mechanism of tumor cell survival<br /> They identified a long noncoding RNA as JUNI that regulates antagonizing MAP phosphatase and favors the JUN transcription. JUNI correlated with the survival of several cancer histotypes, particularly in RCC, as a highly specific and correlated prognosis.

      The abstract although not always required from the journal should be divided into methods used to reach the main findings and clear presentation of results

      Response: We do not know yet to which Journal the paper will be sent. The format will be adjusted to the Journal requirements.

      it is unclear whether JUNI is a positive or negative regulator of JUI (I assume the reviewer meant JUN)

      Response: The text in the abstract was changed to” JUNI positively regulates the expression of its neighboring gene JUN, a key transducer of signals that regulate multiple transcriptional outputs.”

      Hope it is clearer now

      When the author indicates that JUNI antagonizes MAP PHOSPHATASE is not correct the term antagonism is related to receptors but the authors did not show any receptor.

      Response: The term "antagonism" does not only refer to receptor drugs. In pharmacology, antagonism generally describes the interaction between a drug (or other molecule) and a receptor or biological target that results in the inhibition or blocking of the receptor's activity. However, this concept can extend beyond receptor drugs and apply to various biological interactions.

      Outside of the realm of drugs and receptors, antagonism can also refer to antagonistic relationships between different biological processes, molecules, or organisms.

      Overall, while antagonism is commonly discussed in the context of receptor drugs, the concept of antagonism can apply to a broader range of interactions in biology and other fields.

      Response: The p values for the prognostic values of JUNI and DUSP14 in RCC were added to the abstract.

      Generally, Jun oncogene correlated with poor overall survival while the table indicates promote survival so good prognosis?

      Response: This manuscript describes for the first time the biological activity and cancer relevance of JUNI. It positively regulates stress induced c-Jun and can be used as prognostic marker in ccRCC.

      The significance of JUNI and its interactome in ccRCC prognosis is unequivocal, according to data analysis of cancer relevant data (TCGA) regardless to its effects on c-Jun. The concern raised by reviewer 1 and 2 is whether the cancer-relevant effects are mediated by c-Jun regulation. We suggest that despite regulating stress induced c-Jun, they are not! This suggestion is based on three points: 1. We show in the manuscript that a large portion of JUNI dependent effects on cellular survival activity is c-Jun independent. 2. We describe many interacting proteins that may, in a JUN-independent manner, affect tumorigenesis. 3. In this study we examined JUNI’s functions which are cell-autonomous. However, neither the non -autonomous effects nor effects on cells that compose the tumor environment were studied. Reports that lncRNAs may have a role in immune responses and high expression of JUNI in CD8 cells may suggest this direction for future investigation (Carpenter, S et al. science, 341(6147), pp.789-792; Mickaël, M. et al https://doi.org/10.1101/2021.12.01.470587)

      Therefore, we assume that direct correlations in every biological activity between JUNI and JUN is an over simplified consumption. Analogy for that can be found with another major regulator of c-Jun, JNK, which is stress induced, c-Jun regulator involved in stress-induced cell death, whereas c-Jun itself is contributing in many cases to drug resistance.

      The introduction contains the main information to follow the role of JUN and renal carcinoma<br /> However, should be improved with background on the key role of stress genes in the pro-survival pathway of tumors during progression and hypoxia condition. Too many references on long noncoding compared to the JUN complex with AP-1 and transformation

      Response: A section describing the major stress pathway in ccRCC, HIF 1 and its role in ccrCC was added. Due to the limitation of word count in most journals we cannot expend this section further

      Results In Figure 1 the authors showed expression levels of JUNI and JUN that are clearly different after UV stimuli. they demonstrate that are both regulated by UV but the amount and the time are different. the author should comment on these data if they want to study the regulative mechanism

      Response: The following comment was added at the end of the first section: Overall, these results suggested that JUNI is a stress-induced gene whose expression pattern resembles that of JUN, therefore, we investigated the potential existence of regulatory effects between the two genes, especially post exposure of cells to stress.

      Figure 1 F the cellular distribution of JUNI which is the rational of this experiment to provide that is into nucleus while normally is into the cytoplasm? What adds this experiment?

      Response: This is the first reported description of JUNI. We attempted to characterize it as much as possible. It’s localization was not described previously and we suggest that it is mainly nuclear. A novel important information that should be presented.

      In Figure 2 the authors provided that the kinase pathway is important for Jun regulation but the effect on JUNI a Luciferase assay needs to be provided

      Response: We respectfully disagree with the reviewer. We believe that examining the expression from a DNA fragment identical to the endogenous one is superior to artificial system, such as luciferase.

      In Figure 3 for Migration assay is necessary to see cells on the other side of the filter by staining not a graphical representation

      Response: The graphical representation is an accumulated result of at least 3 experiment. However, a figure representing a single experiment was added as a supplement figure s1.

      The experiment on kinase does not add any data to what is already known on jun probably should be shifted in Figure 6

      Response: We apologize, this question was not fully understood as there is no experiment on kinase in figure 3. If case the reviewer was referring to kinase inhibition in Fig 2A we do think it is needed as a positive control for the kinases activity.

      Table 1 is cited two times once in the context of Figure 3 and then in Figure 6 indicating that the authors go forward and back on their experimental design

      Response: Table 1 is indeed referred to in two places. It is first mentioned when we investigated the potential relevance of JUNI for human cancer, given its regulatory impact on the neighboring JUN gene and its influence on motility. Later, the types of cancers described in figure 1 were further processed in order to examine relations between JUNI and DUSP14 in human cancer. We do not see it as a flaw in experimental design but rather as further evolution of the story based on data discovered in earlier stages.

      in figure 4 the apoptotic cells are not clearly visible a specific staining marker is necessary to provide the phenomenon

      Response: Two corrections were made to demonstrate apoptosis clearly. The pictures in Figure 4 panel A were replaced with a better-quality image with addition of DNA staining to demonstrate the cell death clearer, appearance of cell blebbing and nuclear fragmentation. Panel B demonstrating increase in cleaved caspase 3 in JUNI silenced cells after all treatment was added.

      Additionally XTT assay should be reported as the percentage of survival cells not staining incorporated compared to untreated cells over time

      Response: We do apologize for the legend omission, but XTT assays, colonies formation and soft agar colonies formation are presented in Figure 4 H-J and Figure S3 for all cell lines

      The data on prognosis and correlation of gene expression are not clearly presented and discussed

      Response: Figure S4 was replaced by table S3 to demonstrate clearer the differences in Medians survival caused by JUNI of DUSP 14. Text was changed in the last section of results.

      The western blot need to be quantified

      Response: All blots were quantified

      Reviewer #2:

      1. While the experimental data showed JUNI, like c-JUN, is pro-survival of cancer cells, the clinical sample analyses correlated it positively with patients' survival. This discrepancy casts doubts in significance of the findings. The authors need to re-evaluate their data and conclusion

      Response: This manuscript describes for the first time the biological activity and cancer relevance of JUNI. It positively regulates stress induced c-Jun and can be used as prognostic marker in ccRCC.

      The significance of JUNI and its interactome in ccRCC prognosis is unequivocal, according to data analysis of cancer relevant data (TCGA) regardless to its effects on c-Jun. The concern raised by reviewer 1 and 2 is whether the cancer-relevant effects are mediated by c-Jun regulation. We suggest that despite regulating stress induced c-Jun, they are not! This suggestion is based on three points: 1. We show in the manuscript that a large portion of JUNI dependent effects on cellular survival activity is c-Jun independent. 2. We describe many interacting proteins that may, in a JUN-independent manner, affect tumorigenesis. 3. In this study we examined JUNI’s functions which are cell-autonomous. However, neither the non -autonomous effects nor effects on cells that compose the tumor environment were studied. Reports that lncRNAs may have a role in immune responses and high expression of JUNI in CD8 cells may suggest this direction for future investigation (Carpenter, S et al. science, 341(6147), pp.789-792; Mickaël, M. et al https://doi.org/10.1101/2021.12.01.470587)

      Therefore, we assume that direct correlations in every biological activity between JUNI and JUN is an over simplified consumption. Analogy for that can be found with another major regulator of c-Jun, JNK, which is stress induced, c-Jun regulator involved in stress-induced cell death, whereas c-Jun itself is contributing in many cases to drug resistance.

      Response: The Western blotting data need at least triplicate biological experiments and quantification. This is particularly important for trivial differences, such as shown in Fig. 6.

      Response: All westerns X=3. Representative experiments are depicted. Quantification was added.

      The identification and gene structure of LINC01135 and its relevance to c-Jun need better clarity

      Response: First result section. “According to ENCODE data, JUNI contains five main exons and has multiple isoforms. Twenty-seven different transcript isoforms were described according to LNCipedia ranging from 213 to 6213 bases {Volders, 2019 #2907}. The relevance to c-Jun was referred to in discussion: Both the effects of JUNI on c-Jun induction and cellular survival were demonstrated using under-expression conditions by targeting, the common, first, exon of JUNI. Nevertheless, this exon was also sufficient for c-Jun induction upon stress exposure, under conditions of overexpression.

      Page 9-10, Line 198-199, there are no results in Fig. 1 showing that JUNI induction was dependent to serum stimulation of starved cells

      Response: “ Similar to JUN, the induction was dose dependent (Fig 1C), and the rapid response to stress (Fig 1D) as well as to serum stimulation of starved cells, identified by others (36), qualifies it as an “immediate early” lncRNA.”

      Serum stimulation is described in reference 36

      What is the Y-axis in figures 2B, 4E-G

      Response: Legend was added to Y-axis of Figures 2B and 4 E-G

      In Fig. 3B, actin image is missing

      Response: Actin was hidden in the graphic process. Corrected.

      In Fig. 4. brightfield images are inaccurate for distinguishing apoptosis and necrosis. Additional molecular markers need to be used, such as caspase-3 cleavage and LDH release

      Response: Two corrections were made to demonstrate apoptosis clearly. The pictures in Figure 4 panel A were replaced with a better-quality image with addition of DNA staining to demonstrate the cell death clearer, appearance of cell blebbing and nuclear fragmentation. Panel B demonstrating increase in cleaved caspase 3 in JUNI silenced cells after all treatment was added.

      The inconsistency of using four cell types in each assay. For example, in Fig. 4A, B, E-G and Suppl Fig. 1, HMCB, MDA-MB-231 and CHL1 cells were used to test the short-term effect of JUNI knockdown on cell survival, whereas Hela, MDA-MB-231 and CHL1 cells were chosen to determine the long-term effect of JUNI knockdown. Similar case in other figures.

      Response: Effects on Jun regulation and the effects on long term survival were tested in all four cell lines both by XTT and clonogenic assays whereas effects on short term survival were tested in three out of the four cell lines. It is practically impossible to perform a study of this magnitude were all assays were tested in all cell lines. Using four cell lines was applied to prove the major points.

      In Fig. 5D, no difference of c-Jun expression between NS and siJUN groups

      Response: Correct, the western in 5D was replaced by a more representative one

      Cell survival in Fig. 5 lacked statistical analyses

      Response: Error bars were mistakably omitted. The figure was corrected.

      In Suppl Fig. 2C, there is no figure to show the reduced colonies formation in soft agar in MDA-MB-231 cells, contradicting to that stated in the manuscript

      Response: Indeed Figure 4 J and S3 C presented colonies formation in HMCB and HeLa cells. The text was corrected.

      Reviewer #3: "linc01135" - this is a human gene, should be capitalized

      Response: linc01135 was capitalized

      Please indicate primers in Fig1A and mention this in relevant part of Results

      Response: The following section was added: “Importantly, ENCODE predicts that the first exon is shared by all, therefore, all primers to analyze JUNI’s expression as well as siRNAs to silence it, were targeted for this exon.

      Fig1C-F - please add a legend to explain the colors

      Response: Legend was added into the Figure as well

      Copy number: It is important to establish the approximate copy number of JUNI RNAs in the cell lines tested. FISH would be one appropriate method. This could also be referenced back to the RNA-seq TPM values. Are we talking about <1 copy /cell, or many? Quick inspection of ENCODE RNA-seq in the UCSC browser suggest an intermediate value that varies between cell lines. This value is very important when interpreting mechanistic experiments later on

      Response: The copy number in HMCB and MDA-MB-231 was calculated by comparison of CT values obtained from RNAs from a known number of cells relative to calibration curve of known concentrations of JUNI. The following section was added to the first paragraph of the results: “quantitation of JUNI’s copy number in untreated HMCB and MBA-MD-231 cells revealed the presence of minimal amount of about 8 copies per cell”

      Fig3 - again, no figure legends, difficult for reader

      Response: Legend was added to Fig. 3A

      In general, the figures could be much more clearly annotated and presented with more care. They do not do justice to the quality of the work itself. For example, Fig4E-G why not label each panel with the time course, the cell line tested etc etc to save us the work of digging through the Legends?

      Response: We thank the reviewer for this remark. All figures were corrected, legends and proteins quantification was added.

      Rescue experiments: The rescue experiments in Fig5D are nicely done and the results are interesting. However, I would request the authors to perform similar experiments with JUNI rescue. Specifically, to knock down JUNI with siRNA, and then reintroduce it from an 'immune' expression plasmid, where the siRNA site is mutated. This will further strengthen the claim that JUNI siRNA is acting through the intended target to cause observed effects on cell viability

      Response: As the effects on survival are strongest in the longer term, 14 days after silencing, rescue experiments were performed to test the rescue in the survival of HMCB and HeLa cells using clonogenic assays. Results are presented in figure 4 L

      IncPrint data: was Jun protein found to be an interactor? This might be mentioned in the text, whether it is yes or no

      Response: c-Jun was screened and did not interact with JUNI. The text was changed as following” Interestingly, c-Jun itself does not interact with JUNI (Table S2, Normalized luciferase intensity MS2, RLU =0.44). By contrast, the dual specificity protein phosphatase 14….”

      Expression: A key issue is the expression of JUNI in healthy and diseased cells and organs. Is JUNI ubiquitous (and essential to both healthy and tumor cells), or is it specific to tumor cells? Which tumor types? This would be straightforward to find out from public data. I would suggest a main figure panel. Also, is JUNI upregulated across tumors? Could find this out from GEPIA2 or other databases.

      Response: Figure 7E describing the levels of JUNI in variety of normal and tumor samples was added.

      Non-tumor cells: Like many studies, this one focusses on effect of LOF in transformed cells. However, therapeutic relevance is tied to specific effect in transformed cells. Therefore I believe the paper would be vastly strengthened, if knockdowns+viability assays were also performed in some non-transformed cells. Eg HEK293, immortalised fibroblasts, RPE1 etc

      Response: Indeed discrimination between Normal and cancer cells is an essential point for further research and translation. We examined the affects of silencing on spontaneously immortalized keratinocytes, HaCat cells, and the results are depicted in Figure 4 K.

      Alternative reagents: The siRNA experiments are well performed with two independent sequences. An important additional experiment would be to replicate these experiments with antisense oligonucleotides. This would both further strengthen the confidence in experiments, and open more lines of potential therapies. This experiment I would consider optional

      Response: Stable CRISPR can not be formed. We are currently constructing inducible CRISPR but the construction consumes longer time than the scope of this revision.

      Advanced models: All the present experiments are performed in monolayer cell lines. The authors will no doubt be aware that the paper would be substantially strenghtened if functional experiments could be replicated in more advanced models: spheroids, PDX, explants, mice...

      Response: We examined the protective role of JUNI in Doxorubicin treated spheroids of HMCB and CHL1 cells. The results are depicted in figure 4 D and E.

    1. Author Response

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public Review):

      "MAGIC" was introduced by the Rong Li lab in a Nature letters article in 2017. This manuscript is an extension of this original work and uses a genome wide screen the Baker's yeast to decipher which cellular pathways influence MAGIC. Overall, this manuscript is a logical extension of the 2017 study, however the manuscript is challenging to follow, complicated by the data often being discussed out of sequence. Although the manuscripts make claims of a mechanism being pinpointed, there are many gaps and the true mechanisms of how the factors identified in the screen influence MAGIC is not clear. A key issue is that there are many assumptions drawn on previous literature, but central aspects of the mechanisms being proposed are not adequately shown.

      Key comments:

      1. Reasoning and pipelines presented in the first two sections of the results are disordered and do not follow figure order. In some instances, the background to experimental analyses such as detailing the generation of spGFP constructs in the YKO mutant library, or validation of Snf1 activation are mentioned after respective results are discussed. This needs to be fixed.

      We thank the reviewer for pointing out potential confusion to readers. We have revised the first two sections according to reviewer’s suggestion. (Page 4-6)

      1. In general there is a lack of data to support microscopy data and supporting quantification analysis. The validity of this data could be significantly strengthened with accompanying western blots showing accumulation of a given constructs in mitochondrial sub compartments (as was the case in the lab’s original paper in 2017).

      We appreciate the reviewer’s suggestion on biochemical validations. However, the validity of this imaging-based assay for detecting import of cytosolic misfolded proteins into mitochondria, including the use of FlucSM as a model misfolding-prone protein, was carefully established in our previous study by using appropriate controls, super resolution imaging, APEX-based proximity labeling, and classical biochemical fractionation and protease protection assay (Ruan et al., 2017 Nature, ref. 10). We have reminded readers of these validation experiments in the previous study on Page 4, line 14-17.

      In recent years, advancements in imaging-based tools have allowed many protein interactions and dynamic processes, which were previously examined by using biochemical assays in lysates of populations of cells, to be observed with various level of quantitation in live cells with intact cellular compartments. Many of these assays, e.g., the RUSH assay for ER to Golgi transport, FRAP-based analysis for nuclear/cytoplasmic shuttling of proteins, or FRET-based assays for protein-protein interactions, have been well accepted and even embraced by the respective fields of study once validated with genetic and biochemical approaches. The advantages for live-cell imaging-based assays are often their unique ability to report dynamic processes or unstable molecular species with spatiotemporal sensitivity. Respectfully, it is our view, based on our own experience, that the traditional protease protection assay is not adequate or sufficiently quantitative for examining the presence of unstable misfolded proteins in mitochondrial sub-compartments, given the obligatorily lengthy in vitro cell lysis and mitochondrial isolation process, during which the unstable proteins are continuously being degraded. This likely explains our previous biochemical fractionation result that only weak protein signals were detected in the matrix fraction (Ruan et al., 2017 Nature, ref. 10). In addition, unlike stably folded, native mitochondrial matrix proteins, misfolded/unfolded proteins such as Lsg1 or FlucSM are highly susceptible to protease treatment. This sensitivity makes the assay unreliable for detecting such proteins if trace amount of the protease penetrates mitochondrial membranes during cell lysis even without detergent treatment.

      While we agree that protease protection assay is highly valuable for qualitative detection of the presence of a protein in certain mitochondrial compartments or determining its topology on membranes, this assay (regrettably in our hands) does not allow quantitative comparisons that were necessary for this study, because of inherent sample to sample variation, yet the laborious and low throughput nature of this assay makes it difficult for adequate statistical analysis. Furthermore, the level of protein detection in various fractions is highly sensitive to how the sample is treated with protease and detergent. Our imaging-based quantification, on the other hand, allows us to compare increased or decreased presence of GFP11-tagged proteins in mitochondria under different metabolic conditions or in different mutant or wild-type strains. Data from hundreds of cells and at least three independent biological replicates allowed us to apply adequate statistical analysis to aid our conclusion.

      1. Much of the mechanisms proposed relies on the Snf1 activation. This is however not shown but assumed to be taking place. Given that this activation is central to the mechanism proposed, this should be explicitly shown here - for example survey the phosphorylation status of the protein.

      Both REG1 deletion and low glucose conditions have been demonstrated extensively for Snf1 phosphorylation and activation in yeast (e.g., many seminal papers from Marian Carlson’s and other lab, such as ref. 24-28). In our study, we have indeed corroborated this by showing that Mig1 was exported from the nucleus in Δreg1 mutant and in low glucose conditions (Figure 1—figure supplement 2H and I. The mechanism of Snf1-mediated nuclear export of Mig1 has been characterized in detail as well (e.g., ref. 29-31).

      Recommendations for the authors: please note that you control which, if any, revisions, to undertake

      Reviewer #1 (Recommendations For The Authors):

      SPECIFIC COMMENTS

      Genetic Screen o Line 20 - the narrative moves to SNF1, but the reasoning for the selection of this Class I substrate is not defined. What was the basis for this selection - what happened to the other Class I substrates. It is stated in the text that the other Class I proteins show the same increase in spGFP signal. The data showing this should be included in the Supp Figure 1 for transparency.

      We have moved the narratives of Snf1 function to the second section and clarified that we were interested in this gene due to its central role in metabolism and mitochondrial functions that may influence MAGIC (Page 5: line 16-20). Other genes in class 1 were shown in Table S1. Detailed discussion of other genes in this category is beyond the scope of this study.

      Snf1/AMPK prevents MP accumulation in mitochondria:

      The FlucDM data in human RPE-1 mitochondria seems to be added to only increase the significance of the work. The mechanisms suggested here with Hap4 would not be possible in human cells as there is no homologue of this protein in human cells. Making generalisations that these pathways are conserved based on this one experiment is not appropriate.

      We appreciate this feedback. Although the focus of this study is the regulation of MAGIC by the yeast AMPK Snf1, we would like to share our initial observation that suggests a similar role of AMPK in human RPE-1 cells. We acknowledge that the underlying mechanisms regarding the downstream transcription factors and pathway for misfolded protein import could be different in mammalian cells, but the overall effect of AMPK in mitochondrial biogenesis is well known to resemble that of Snf1. To avoid making over-generalization, we changed our statement of conclusion to: ‘These results suggest that AMPK in human cells regulates MP accumulation in mitochondria following a similar trend as in yeast, although the underlying mechanisms might differ between these organisms.’ (Page 7: line 2-4)

      Mechanisms of MAGIC regulation by Snf1:

      While the lysosome is ruled out here the authors have not considered the proteasomes. Is there a reason for this? Given accumulation of aggregates outside of mitochondria, and previous connections of the proteasome to mitochondrial quality control this would be an obvious thing to check. We examined the role of lysosomal degradation here because it is known to be activated under Snf1active condition (ref. 37). We appreciate this feedback and have included a new analysis on MG132treated FlucSM spGFP strains in which PDR5 gene was deleted to avoid drug efflux.

      This result suggests that the proteosome inhibitor did not ablate the difference in FlucSM accumulation between these conditions. That MG132 promoted mitochondrial accumulation of FlucSM in both high glucose and low glucose conditions was not surprising, as FlucSM is also degraded by proteasome in the cytosol (Ruan et al., 2017 Nature, ref. 10), and preventing this pathway could divert more of such protein molecules toward MAGIC. (Page 7: line 26-29).

      Line 13 "we hypothesized that elevated expression of mitochondrial preproteins induced by the activation of Snf1-Hap4 axis (REF) may outcompete MPs for import channels". This statement has some assumptions. The authors have not shown that Snf1 is activated in thier models and more importantly that they have an accumulation of mitochondrial preproteins. The data that follows using the cytosolic domains of the receptors is hard to rationalise without seeing evidence that there is in fact pre-protein accumulation or impacts on the mitochondrial proteome in this system.

      As stated in our response to main point [3], Snf1 activation in reg1 mutant or in low glucose is evidenced by our data showing Mig1 export from nucleus to cytoplasm and had also been shown in many previous publications. A recent study (Tsuboi et al., 2020 eLife) also showed a dramatic increase in mitochondrial volume fraction in Δreg1 cells and wild-type cells in respiratory conditions, further supporting the role of Snf1 in mitochondrial biogenesis. We have provided relevant references in the manuscript (ref. 24-28).

      The ability of Tom70 cytosolic domain (Tom70cd), which can bind mitochondrial preproteins but not localize to mitochondria due to lack of N-terminal targeting sequence, to compete with endogenous Tom70 for mitochondrial preproteins has been well documented (ref. 47-49). However, we agree with the reviewer that a future quantitative proteomics study to measure changes in mitochondrial proteome under Tom70cd over-expression could allow more accurate interpretation of our experimental result.

      AMPK protects cellular fitness during proteotoxic stress:

      The inhibition of preprotein import by overexpressing the cytosolic domains of receptors is not supported with some proof of principle data. If this was working as the authors assume, it is not clear why only an effect with Tom70 is observed. The majority of the mitochondrial proteome is imported via Tom20/Tom22 so this does not align with what the authors are suggesting. Is the Tom70CD and any associated Hsp proteins facilitating the observed changes to the MPs?

      We thank the reviewer for raising this point. We expressed different TOM receptor cytosolic domains but found that Tom70cd had the strongest rescue on MAGIC under AMPK activation conditions. It is possible that certain Tom70 substrates or Tom70-assoicated heat shock proteins inhibit the import of MAGIC substrates. We admit that a clear explanation of this unexpected observation necessitates a better understanding of how native and MAGIC substrates are selected and imported by the outer-membrane channel. We can only offer our best interpretation based on the current state of the understanding, and we feel that we have been careful to acknowledge such in the manuscript.

      While the effect of AMPK inactivation reducing FUS accumulation was striking, this was all in the context of overexpression and may not be physiologically relevant - or may occur very transiently under basal conditions. Is GST an appropriate control here, why not use WT FUS? Likewise, one representative image is shown in Figure 5 - can the authors show western blotting that mitochondrial accumulation of FUS can be reduced with AMPK activation?

      We thank the reviewer for this suggestion, however, overexpressed FUS WT is also aggregation prone (Zhihui Sun et al., 2011, PloS Biology; Shulin Ju, 2011, PloS Biology; Jacqueline C. Mitchell et., 2013, Acta Neuro). We believe that GST, as a well-folded protein, is an appropriate control (Ruan et al., 2017 Nature, ref. 10). As we discussed in response to main point [1], the in vitro assay involving protease protection and western blots do not allow reliable quantitative comparison in our hands.

      In text changes.

      The analysis pipeline of the YKO mutant library should be introduced at the very start of the first paragraph, not the end.

      Addressed on Page 4, second paragraph

      "Fluc" should be introduced as "Firefly luciferase" within the first paragraph of the first section, also need to define SM and DM in FlucSM/FlucDM - these appear to be missing.

      Addressed in both Introduction (Page 2: line 29; Page 3: line 8-9) and re-clarified in Result (Page 5: line 27-29)

      The role of Reg1 should be explicitly stated in the text, not just in the figure.

      Addressed on Page 6: line 3-6

      Figure 1H legend states Reg1 (WT) is Snf1-inactive and Reg1 KO is Snf1-active. This wording is confusing and is not supported by data, but by assumption. If the authors want to use this wording then evidence needs to be provided - as suggested above.

      We have changed this and other legends to only show genotypes and medium conditions.

      "Tom70cd overexpression also exacerbated growth rate reduction due to FlucSM expression in HG medium (Figure 4A; Figure 4 - figure supplement 1A)" should be figure supplement 1B.

      Fixed on Page 10: line 10

      "These results suggest that glucose limitation protects mitochondria and cellular fitness during FlucSM induced proteotoxic stress through Snf1-dependent inhibition of MP import into mitochondria". The phrase "Snf1-dependent inhibition of MP import into mitochondria" may be misleading, as Snf1 isn't modulating import directly but is acting on transcriptional regulators to modulate mitochondrial import under stress.

      We restated the conclusion as follows: ‘These results suggest that Snf1 activation under glucose limitation protects mitochondrial and cellular fitness under FlucSM-associated proteotoxic stress.’ (Page 10: line 20- 21)

      "... Significantly increased the fraction of spGFP-positive and MMP-low cells in both HG and LG medium (Figure 4G-K)" should be (Figure 4J-K).

      Fixed on Page 11: line 3

      Reviewer #2 (Public Review):

      Work of Rong Li´s lab, published in Nature 2017 (Ruan et al, 2017), led the authors to suggest that the mitochondrial protein import machinery removes misfolded/aggregated proteins from the cytosol and transports them to the mitochondrial matrix, where they are degraded by Pim1, the yeast Lon protease. The process was named mitochondria as guardian in cytosol (MAGIC).

      The mechanism by which MAGIC selects proteins lacking mitochondrial targeting information, and the mechanism which allows misfolded proteins to cross the mitochondrial membranes remained, however, enigmatic. Up to my knowledge, additional support of MAGIC has not been published. Due to that, MAGIC is briefly mentioned in relevant reviews (it is a very interesting possibility!), however, the process is mentioned as a "proposal" (Andreasson et al, 2019) or is referred to require "further investigation to define its relevance for cellular protein homeostasis (proteostasis)" (Pfanner et al, 2019).

      Rong Li´s lab now presents a follow-up story. As in the original Nature paper, the major findings are based on in vivo localization studies in yeast. The authors employ an aggregation prone, artificial luciferase construct (FlucSM), in a classical split-GFP assay: GFP1-10 is targeted to the matrix of mitochondria by fusion with the mitochondrial protein Grx5, while GFP11 is fused to FlucSM, lacking mitochondrial targeting information. In addition the authors perform a genetic screen, based on a similar assay, however, using the cytosolic misfolding-prone protein Lsg1 as a read-out.

      My major concern about the manuscript is that it does not provide additional information which helps to understand how specifically aggregated cytosolic proteins, lacking a mitochondrial targeting signal could be imported into mitochondria. As it stands, I am not convinced that the observed FlucSM-/Lsg1-GFP signals presented in this study originate from FlucSM-/Lsg1-GFP localized inside of the mitochondrial matrix. The conclusions drawn by the authors in the current manuscript, however, rely on this single approach.

      In the 2017 paper the authors state: "... we speculate that protein aggregates engaged with mitochondria via interaction with import receptors such as Tom70, leading to import of aggregate proteins followed by degradation by mitochondrial proteases such as Pim1." Based on the new data shown in this manuscript the authors now conclude "that MP (misfolded protein) import does not use Tom70/Tom71 as obligatory receptors." The new data presented do not provide a conclusive alternative. More experiments are required to draw a conclusion.

      In my view: to confirm that MAGIC does indeed result in import of aggregated cytosolic proteins into the mitochondrial matrix, a second, independent approach is needed. My suggestion is to isolate mitochondria from a strain expressing FlucSM-GFP and perform protease protection assays, which are well established to demonstrate matrix localization of mitochondrial proteins. In case the authors are not equipped to do these experiments I feel that a collaboration with one of the excellent mitochondrial labs in the US might help the MAGIC pathway to become established.

      We thank Reviewer 2 for these suggestions, but we would like to respectfully offer our difference in opinion:

      a. Regarding the suggestion “to isolate mitochondria from a strain expressing FlucSM-GFP and perform protease protection assays”, in our previous study (Ruan et al., 2017 Nature, ref. 10), we have indeed applied two independent biochemical approaches: APEX-mitochondrial matrix proximity labeling and classic protease protection assay using non-spGFP strains, both consistently confirmed the entry of misfolded proteins into mitochondria under proteotoxic stress. Our super-resolution imaging further confirmed the import of the split GFP-labeled proteins to be inside mitochondria. Moreover, as we discussed in response to Reviewer 1’s main point [2], while the suggested biochemical assay is useful for validating topology within mitochondria, it is not quantitative and may not reliably report the in vivo accumulation of misfolded proteins in mitochondria due to the isolation process that takes hours, during which the unstable proteins could be continuously degraded within mitochondria.

      While we agree with the reviewer that we do not yet understand how misfolded proteins are imported into mitochondria, it would be unfair to state “as it stands, I am not convinced..” simply because the underlying mechanism remains to be elucidated. We would like to point out that targeting sequences for many well-established mitochondrial proteins are still not well defined. It is well known that mitochondrial targeting sequences are not as uniformly predictable as, for example, nuclear targeting sequences. Our finding that deletion of TOM6 enhances the import of misfolded proteins suggest that their import may involve the TOM channel in a more promiscuous conformation, which may reduce the requirement for a specific sequence-based targeting signal associated with the substrate.

      b. Regarding the role of Tom70, in our 2017 study, using proteomics and subsequently immunoprecipitation we validated the binding, albeit not necessarily direct, between misfolded protein FlucSM and Tom70. Therefore, “we speculate that protein aggregates engaged with mitochondria via interaction with import receptors such as Tom70”. Recent studies from different labs confirmed the interactions between Tom70 and aggregation prone proteins (Backes et al., 2021, Cell Reports; Liu et al., 2023, PNAS). In the current study, surprisingly, knockout of TOM70 did not block MAGIC, suggesting redundant components of mitochondria import system may facilitate the recruitment of misfolded proteins in the absence of Tom70, and this does not contradict the notion that Tom70 helps tether protein aggregates to mitochondria.

      c. Regarding other studies also showing the import of misfolding or aggregation-prone cytosolic proteins into mitochondria, there have been at least several recent studies in the literature for mammalian cells involving either model substrates or disease proteins (e.g., ref. 12-15; 56-58; Vicario, M. et al. 2019 Cell Death Dis.). The studies are briefly mentioned in Introduction (Page 3, paragraph 2). The present manuscript documents a major effort from our group using whole genome screen in yeast to understand the mechanism and regulation of MAGIC. Many of the screen hits have yet to be studied in detail. We full agree that much remains to be understood about whether and how this pathway affects proteostasis and what might be the evolutionary origin for such a mechanism.

      Additional comments:

      The genetic screen:

      The genetic screen identified five class 1 deletion strains, which lead to enhanced accumulation of Lsg1GFP and a larger set of class 2 mutants, which lead to reduced accumulation. Please note, in my opinion it is not clear that accumulation of the reporters occurs inside the mitochondria. In any case, the authors selected one single protein for further analysis: Snf1, the catalytic subunit of the yeast SNF complex, which is required for respiratory growth of yeast.

      The results of the screen are not discussed in any detail. The authors mention that ribosome biogenesis factors are abundant among class 2 mutants. Noteworthy, Lsg1 is involved in 60S ribosomal subunit biogenesis. As Lsg1-GFP11 is overexpressed in the screen this should be discussed. Class 2 mutants also .include several 40S ribosomal subunit proteins (only one of the 60S subunit). What does this imply for the MAGIC model? Also, it should be discussed that the screen did not identify reg1 and hap4, which I had expected as hits based on the data shown in later parts of the manuscript.

      We apologize for the confusion, but the GFP11 tag was in fact knocked into the C-terminus of Lsg1 in the endogenous LSG1 locus, and so Lsg1 was not overexpressed in the screen. We have made sure that this information is clearly conveyed in the revised manuscript (Page 4: line 20-22). How the ribosome small subunit affects MAGIC is beyond the focus of the current study and will be pursued in the future.

      Regarding why certain mutants did not come out of our initial screen, this is not unexpected as the YKO collection, although extremely valuable to the community, is known to be potentially affected by false knockouts, suppressor accumulation and cross contamination (for references, e.g., Puddu et al., 2019 Nature). Additionally, high-through screens can also miss real hits. In our experience using this collection in several studies, we often found additional hits from analysis of genes implicated by known genetic or biochemical interactions.

      Mutant yeast strains and growth assays:

      The Δreg1 strain grows poorly in all growth conditions and frequently accumulates extragenic suppressor mutations (Barrett et al, 2012). It would be good to make sure that this is not the case in the strains employed in this study. My suggestion is to do (and show) standard yeast plating assays with the relevant mutant strains including Δreg1, snf1, hap4, Δreg1Δhap4 without the split GFP constructs and also with them (i.e. the strains that were used in the assays).

      We thank the reviewer for the suggestion. We were indeed aware of potential accumulation of suppressor mutations from the YKO library. Therefore, deletion mutants like Δreg1 and loss of TFs downstream of Snf1 that we used in the study after the initial screen were all freshly made and validated. At least 3 independent colonies were analyzed for each mutant (mentioned in Methods & Materials; Page 33, line 57). Moreover, the plating assay suggested here may not reveal additional information other than growth, which was taken into consideration during our experiments.

      Activation of Snf1 in the relevant strains should be tested with the commercially available antibody recognizing active Snf1, which is phosphorylated at Snf1-T210.

      Snf1 activation was validated by the Mig1 exporting from the nucleus. We also noted above that many studies have clearly demonstrated Snf1 activation in reg1 mutant and under low glucose growth (e.g., ref. 24-28).

      Effects of Snf1, Reg1, Hap4 and respiratory growth conditions:

      The authors show that split GFP reporters show enhanced accumulation during fermentative growth, in Δsnf1, and Δreg1Δhap4 and fail to accumulate during respiratory growth, in Δreg1 and upon overexpression of HAP4. Analysis of Δhap4 should be included in Fig. 2. The suggestion that upon activation of Snf1 enhanced Hap4-dependent expression "outcompetes" misfolded protein import seems unlikely as only a fraction of mitochondrial genes is under control of Hap4. Without further experimental evidence I do not find that a valid assumption. More likely, the membrane potential plays a role: it is low during fermentative growth, in Δsnf1 and Δreg1Δhap4, and high during respiratory growth and in Δreg1 (Hübscher et al, 2016). Such an effect of the membrane potential seems to contradict the findings in the 2017 paper and the issue should be clarified and discussed. In any case, these data do not reveal that GFP reporters accumulate inside of the mitochondria. Based on the currently available evidence they may accumulate in close proximity/attached to the mitochondria. This has to be tested directly (see above).

      We have included our analysis of Δhap4 in Page 8: line 14-15 and Figure 2—figure supplement 1H. Consistent with our result for Δreg1Δhap4 in glucose-rich medium, HAP4 deletion also resulted in a significant increase in mitochondrial accumulation of FlucSM in low glucose medium compared to WT. It did not have effect in high glucose condition in which Snf1 is largely inactive.

      It is our view that the importance of Hap4 should not be judged by the number of nuclear encoded mitochondrial proteins they regulate. Still, this sub-group comprises a considerable number of proteins (at least 55 genes upregulated by Hap4 overexpression, ref. 43), and certain substrates may be more competitive with misfolded cytosolic proteins for import. Our genetic data strongly suggest that the inhibitory effect of active Snf1 on MAGIC is through Hap4, although we agree with the reviewer that detailed mechanism on how Hap4 substrates may compete with misfolded proteins need to be addressed in future studies.

      Membrane potential is important for mitochondrial import. During respiratory growth and in Δreg1, membrane potential is well known to be elevated comparing to fermentative condition (e.g., Figure 4C). Our observation that the import of misfolded proteins into mitochondria is reduced under these conditions simply suggests that this reduction is not due to a lack of membrane potential. This is not in any way contradictory to our 2017 finding that misfolded protein import requires membrane potential (ref. 10).

      Again, the accumulation of misfolded proteins in mitochondria, especially the model protein FlucSM, has been validated by using super resolution imaging (Figure 1—figure supplement 1A) in addition to the protease protection assay in our 2017 study.

      Introduction and Discussion:

      Both are really short, too short in my view. Please provide some background of the general principals of mitochondrial protein import and information of how exactly translocation of cytosolic, aggregated proteins (lacking targeting information) is supposed to work. I do not understand exactly how the authors actually envisage the process.

      We thank the reviewer for the suggestion. In the revised manuscript, we have extended both Introduction (Page 2-3) and Discussion section (Page 11-13)

      The results from the 2022 eLife paper (Liu et al, 2022), which suggests that Tom70 may "regulate both the transcription/biogenesis and import of mitochondrial proteins so the nascent mitochondrial proteins do not compromise cytosolic proteostasis or cause cytosolic protein aggregation" should be discussed with regard to the data obtained with overexpression of the Tom70 soluble domain.

      We thank the reviewer for pointing out that study and we have included a brief comment in Discussion section (Page 12: line 13-16). As the function of Tom70 appears to be complex, we cannot exclude the possibility that overexpression of the cytosolic domain has additional or indirect effects in addition to that due to preprotein binding.

      Andreasson, C., Ott, M., and Buttner, S. (2019). Mitochondria orchestrate proteostatic and metabolic stress responses. EMBO Rep 20, e47865.

      Barrett, L., Orlova, M., Maziarz, M., and Kuchin, S. (2012). Protein kinase A contributes to the negative control of Snf1 protein kinase in Saccharomyces cerevisiae. Eukaryot Cell 11, 119-128.

      Hubscher, V., Mudholkar, K., Chiabudini, M., Fitzke, E., Wolfle, T., Pfeifer, D., Drepper, F., Warscheid, B., and Rospert, S. (2016). The Hsp70 homolog Ssb and the 14-3-3 protein Bmh1 jointly regulate transcription of glucose repressed genes in Saccharomyces cerevisiae. Nucleic Acids Res. 44, 5629-5645.

      Liu, Q., Chang, C.E., Wooldredge, A.C., Fong, B., Kennedy, B.K., and Zhou, C. (2022). Tom70-based transcriptional regulation of mitochondrial biogenesis and aging. Elife 11

      Pfanner, N., Warscheid, B., and Wiedemann, N. (2019). Mitochondrial proteins: from biogenesis to functional networks. Nat Rev Mol Cell Biol 20, 267-284.

      Ruan, L., Zhou, C., Jin, E., Kucharavy, A., Zhang, Y., Wen, Z., Florens, L., and Li, R. (2017). Cytosolic proteostasis through importing of misfolded proteins into mitochondria. Nature 543, 443-446.

      I prefer to have "all in one", also due to time limitation.

      It would be great to be able to upload the review file as otherwise formatting and symbols get lost.

      Reviewer #3 (Public Review):

      In this study, Wang et al extend on their previous finding of a novel quality control pathway, the MAGIC pathway. This pathway allows misfolded cytosolic proteins to become imported into mitochondria and there they are degraded by the LON protease. Using a screen, they identify Snf1 as a player that regulates MAGIC. Snf1 inhibits mitochondrial protein import via the transcription factor Hap4 via an unknown pathway. This allows cells to adapt to metabolic changes, upon high glucose levels, misfolded proteins an become imported and degraded, while during low glucose growth conditions, import of these proteins is prevented, and instead import of mitochondrial proteins is preferred.

      This is a nice and well-structured manuscript reporting on important findings about a regulatory mechanism of a quality control pathway. The findings are obtained by a combination of mostly fluorescent protein-based assays. Findings from these assays support the claims well.

      While this study convincingly describes the mechanisms of a mitochondria-associated import pathway using mainly model substrates, my major concern is that the physiological relevance of this pathway remains unclear: what are endogenous substrates of the pathway, to which extend are they imported and degraded, i.e. how much does MAGIC contribute to overall misfolded protein removal (none of the experiments reports quantitative "flux" information). Lastly, it remains unclear by which mechanism Snf1 impacts on MAGIC or whether it is "only" about being outcompeted by mitochondrial precursors.

      We thank Reviewer 3 for the positive and encouraging comments on our manuscript. We agree with the reviewer that identifying MAGIC endogenous substrates and understanding what percentage of them are degraded in mitochondria are very important issues to be addressed. We are indeed carrying out projects to address these questions. We also agree with Reviewer 3 that the effect of Snf1 on MAGIC may have additional mechanisms in addition to precursors competition, such as Tom6 mediated conformational changes of TOM pores. In the revised manuscript, we had added a discussion to address these comments (Page 12: line 21-28).

      Reviewer #3 (Recommendations For The Authors):

      1. In their screen, the authors utilize differences in GFP intensity as a measure for import efficiency. However, reconstitution of the GFP from GFP1-10 and GFP11 in the matrix might also be affected (folding factors, differential degradation).

      Upon Snf1 activation, the protein abundance of mitochondrial chaperones such as Hsp10, Hsp60, and Mdj1, and mitochondrial proteases such as Pim1 are not significantly changed (ref. 35). Therefore, it is unlikely that the folding and degradation capacity of mitochondrial matrix is drastically affected by Snf1 activation.

      To examine the effect of Snf1 activation on spGFP reconstitution, Grx5 spGFP strain was constructed in which the endogenous mitochondrial matrix protein Grx5 was C-terminally tagged with GFP11 at its genomic locus, and GFP1-10 was targeted to mitochondria through cleavable Su9 MTS (MTS-mCherryGFP1-10) (ref. 10). Only modest reduction in Grx5 spGFP intensity was observed in LG compared to HG, and no significant difference after adjusting the GFP1-10 abundance (spGFP/mCherry ratio) (Figure 1— figure supplement 3A-D). These data suggest that any effect on spGFP reconstitution is insufficient to explain the drastic reduction of MP accumulation in mitochondria under Snf1 activation. Overall, our results demonstrate that Snf1 activation primarily prevents mitochondrial accumulation of MPs, but not that of normal mitochondrial proteins. (Page 6: line 17-25).

      We admit, however, that to fully rule out these factors, specific intra-mitochondrial folding or degradation reporter assays would be needed.

      1. Scoring of protein import always takes place using fluorescence-based assays. These always require folding of the "sensors" in the matrix. An additional convincing approach that would not rely on matrix folding could be pulse chase approaches coupled to fractionation assays and immunoprecipitation.

      We thank reviewer 3 for this suggestion. In our previous study, we applied two different biochemical assays: APEX proximity labeling, and mitochondrial fractionation followed by protease protection. Both confirmed the entry of misfolded proteins into mitochondria as observed by using split GFP. As we discussed in response to Reviewer 1’s main point [3], the fractionation assays are not quantitative enough for the comparisons made in our study. In particular, during the over 2-hour assay, misfolded proteins continue to be degraded within mitochondria. By using proper controls, our spGFP system provides quantitative comparisons for mitochondrial accumulation of misfolded proteins in non-disturbed physiological conditions.

      1. Could the pathway be reconstituted in vitro with isolated mitochondria to test for the "competition hypothesis"

      This is an excellent suggestion, but setting up such a reconstituted system is a project on its own. The study documented in this manuscript already encompasses a large amount of work that we feel should be published timely.

      1. Fluorescence figures are not colour blind friendly (red-green). This should be improved by changing the color scheme.

      We thank reviewer 3 for pointing this out and sincerely apologize for any inconvenience. However, we are unfortunately unable to change all images within a limited time. We will adopt another color scheme in future work.

      1. spGFP in human cells appears to form "spot-like" structures. What are these granules?

      We indeed observed granule-like structures by spGFP labeled FUS in mitochondria, which is interesting, but we did not investigate this further because it is a not a focus of this study.

    1. Author Response

      The following is the authors’ response to the original reviews.

      Response to Reviewers

      To whom it may concern, Thank you for your constructive feedback on our manuscript. I appreciate the time and effort that you and the reviewers have dedicated to providing your valuable feedback. We are grateful to the reviewers for their insightful comments and suggestions for our paper. I have been able to incorporate changes to reflect the majority of these suggestions provided. I have updated the analysis scripts (at https://github.com/neurogenomics/reanalysis_Mathys_2019) and have listed these changes in blue below:

      eLife assessment:

      This work is useful as it highlights the importance of data analysis strategies in influencing outcomes during differential gene expression testing. While the manuscript has the potential to enhance awareness regarding data analysis choices in the community, its value could be further enhanced by providing a more comprehensive comparison of alternative methods and discussing the potential differences in preprocessing, such as scFLOW. The current analysis, although insightful, appears incomplete in addressing these aspects.

      We thank the reviewing editors for this note. We agree that the differences in preprocessing will affect the results and conceal which step in our reanalysis resulted in the discrepancies we noted. To address this, we have split out our reanalysis into two separate parts - In the main body of the text we discuss the differences resulting from just changing the differential expression approach where we use the same processed data as the authors to enable a fair comparison. Secondly, we still provide the reprocessed data and perform differential expression analysis on it and discuss the cause and impact the differences in the processing steps made to the results.

      Reviewer 1:

      I think readers would be interested to learn more about the genes that were found "significant" by the original paper but sorted out by the authors. Did they just fall short of the cutoffs? If so, how many more samples would have been required to ascertain significance? This would yield a recommendation for future studies and an overall more positive/productive spirit to the manuscript. On the other hand, I suspect a fraction of DEGs were false positives due to differences in the proportions of cells from different individuals compared to the original analysis. Which percentage of DEGs does this apply to? Again, this would raise awareness of the issue and support the use of pseudobulk approaches.

      To investigate the relationship between the genes and how they differ across our analysis we have added a correlation analysis between our different DE approaches (using the same processed data), see paragraph 5 in the manuscript and supplementary table 3. In short, we find that there is a high correlation in the genes’ fold change values across our pseudobulk analysis and the author’s pseudoreplication analysis on the same dataset (pearson R of 0.87 for an adjusted p-value of 0.05) which is somewhat expected given the DE approaches are applied to the same dataset. However, the p-values, which pertain to the likelihood that a gene’s expressional changes is related to the case/control differences in AD, and resulting DEGs vary considerably due to the artificially inflated confidence of the author’s approach (Fig. 1c-e). Despite there being a correlation between the pseudoreplciation and pseudobulk approaches here, we do not think it makes sense to consider how many more samples would have been required to ascertain significance. The differences in results between the two approaches is not negatable with sample size as many DEGs identified by pseudoreplication will be false positives as highlighted in previous work1,2,3,4. However, perhaps we are misinterpreting the reviewer, who may have meant a power analysis which we have not conducted. Such an undertaking would require analysing a multitude of snRNA-Seq of large sample sizes to garner a confident estimate for power calculations based on pseudobulk approaches. Although we agree with the reviewer that this would be beneficial to the field, we do not believe it is in scope for this work. On the reviewer’s note regarding a fraction of DEGs being false positives due to differences in the proportions of cells from different individuals compared to the original analysis - We have analysed the same processed data the authors used to negate the differences caused by the differing processing steps. We thank the reviewer for this suggestion. We also give more insight into the cause of these differences, namely on filtering our nuclei with large proportions of mitochondrial reads and discuss their effect in paragraph 3 (also see Supplementary Figure 2).

      Given there are only a few DEGs, it would be good to show more data about these genes to allow better assessment of the robustness of the results, i.e., boxplots of the pseudobulk counts in the compared groups and perhaps heatmaps of the raw counts prior to aggregation. This could rule out concerns about outliers affecting the results.

      In Supplementary Figure 3, we have added boxplots of the sum pseudobulked, trimmed mean of M-values (TMM) normalised counts for three of our identified DEGs (b) and three of the authors’ DEGs which they discuss in their manuscript (a) to show the differences in counts across AD pathology and controls for these genes. We hope this gives some insight into the transcriptional changes highlighted by the differing approaches. In our opinion, there is a clear difference in the transcriptional signal in the genes identified from pseudobulk which is not present for the genes identified from the authors approach.

      Overall, I believe the paper would deliver a clearer message by mainlining the QC from the original study and only changing the DE analysis. However, if keeping the part about QC/batch correction:

      • Assess to which degree changes in cell type proportion are indeed due to batch correction (as suggested in the text) and not filtering by looking at the annotated cell types in the original publication and those in your analysis.

      • Also perform the analysis without changing QC and state the # of DEGs in both cases, to at least allow some disentanglement of the effect of different steps of the analysis.

      • Please state the number of cells removed by each QC step in the supplementary note.

      We thank the reviewer for this suggestion. We agree with performing the DE analysis on the same processed data as the original authors and have split out our reanalysis into two separate parts, primarily focussing on the discrepancies caused by the choice of differential expression (DE) approach. By splitting our analysis in this manner, we can identify the substantial differences in results caused by differing the DE approach in the study. Secondly, we can see how differences in preprocessing affects the DE results in isolation too – see paragraph 8 but in short, the fold change correlation between pseudobulk DE analyses on the reprocessed data vs authors processed data only had a moderate correlation (Pearson R of 0.57).

      In regards to the number of cells removed by each QC step, we have added an aggregated view for all samples in supplementary table 3 and also give the full statistics per sample in our Github repository: https://github.com/neurogenomics/reanalysis_Mathys_2019. Moreover, we investigated the root cause in the differences in nuclei numbers, uncovering filtering down to mitochondrial read proportions as the main culprit (Supplementary Figure 2).

      I recommend the authors read the following papers, assess whether their methodology agrees with them, and add citations as appropriate to support statements made in the manuscript.

      We thank the reviewer for this comprehensive list. We have updated our manuscript and supplementary file and main text throughout to cite many of these where appropriate. We believe this helps add context to our decisions for the differing tools and approaches used as part of the processing pipeline with scFlow and the differential expression approach.

      I believe the authors' intention was to show the results of their reanalysis not as a criticism of the original paper (which can hardly be faulted for their strategy which was state-of-the-art at the time and indeed they took extra measures attempting to ensure the reliability of their results), but primarily to raise awareness and provide recommendations for rigorous analysis of sc/snRNA-seq data for future studies.

      We thank the reviewer for this note, this was exactly our intent. Furthermore, we are based in a dementia research institute and our aim is to ensure that ensure that the Alzheimer’s disease research field does not focus on spuriously identified genes.We have updated the text of the manuscript (start paragraph 2) to explicitly state this so our message is not misconstrued.

      In my opinion, the purpose of the paper might be better served by focusing on the DE strategy without changing QC and instead detailing where/how DEGs were gained/lost and supporting whether these were false positives.

      We agree that the differences in preprocessing will affect the results and conceal which step in our reanalysis resulted in the discrepancies we noted. To address this, we have split out our reanalysis into two separate parts - In the main body of the text we discuss the differences resulting from just changing the differential expression approach where we use the same processed data as the authors to enable a fair comparison. Secondly, we still provide the reprocessed data and perform differential expression analysis on it and discuss the impact the differences in the processing steps made to the results. As previously mentioned, we have also added further investigation into the DEGs identified, looking at the correlation across the differing approaches and plotting the counts for selected genes.

      For instance, removal with a mitochondrial count of <5% seems harsh and might account for a large proportion of additional cells filtered out in comparison to the original analysis. There is no blanket "correct cutoff" for this percentage. For instance, the "classic" Seurat tutorial https://satijalab.org/seurat/articles/pbmc3k_tutorial.html uses the 5% threshold chosen by the authors, an MAD-based selection of cutoff arrived at 8% here https://www.sc-best-practices.org/preprocessing_visualization/quality_control.html, another "best practices" guide choses by default 10% https://bioconductor.org/books/3.17/OSCA.basic/quality-control.html#quality-control-discarded, etc. Generally, the % of mitochondrial reads varies a lot between datasets.

      Apologies, the 5% cut-off was a misprint – the actual cut-off used was 10% which, as the reviewer notes, is on the higher side of what is recommended. We have updated our manuscript to rectify this mistake and discuss the differences in the number of cells caused by the two approaches to mitochondrial filtering in the manuscript (paragraph 3). We found that over 16,000 nuclei that were removed in our QC pipeline were kept by the author’s (Supplementary Fig. 2), explaining the discrepancy in the number of nuclei after QC. Based on Supplementary Fig. 2, it is clear the author’s approach was ineffective at removing nuclei with high proportions of mitochondrial reads which is indicative of cell death5,6. We hope this alleviates the reviewer’s concerns around our alternative processing approach. Moreover, as mentioned, we swapped to compare the differences by DE approaches on the same data to avoid any effect by this.

      Reviewer 2:

      The paper would be better if the authors merged this work with the scFLOW paper so that they can justify their analysis pipeline and show it in an influential dataset.

      We thank the reviewer for this note. We would like to clarify that the purpose of our work was not to show the scFlow analysis pipeline on an influential dataset but rather to raise awareness and provide recommendations for rigorous analysis of single-cell and single-nucleus RNA-Seq data (sc/snRNA-Seq) for future studies and to help redirect the focus of the Alzheimer’s disease research field away from possible spuriously identified genes. We have updated our manuscript text to highlight this (see start paragraph 2). Furthermore, we are aware our original approach reprocessing the data with scFlow will affect the results and conceal which step in our reanalysis resulted in the discrepancies we noted. Thus, we have split out our reanalysis into two separate parts - In the main body of the text we discuss the differences resulting from just changing the differential expression approach where we use the same processed data as the authors to enable a fair comparison. Secondly, we still provide the reprocessed data so that the community can benefit from it and perform differential expression analysis on it and discuss the impact the differences in the processing steps made to the results. We have also added further references supporting the choice of steps and tools used in scFlow in the supplementary text which should address the reviewer’s concerns about justifying the analysis pipeline. Moreover, we identified the cause of the nuclei count differences caused by the two processing approaches, namely on filtering our nuclei with large proportions of mitochondrial reads and discuss their effect in paragraph 3 (also see Supplementary Figure 2).

      A major contribution is the use of the authors' own inhouse pipeline for data preparation (scFLOW), but this software is unpublished since 2021 and consequently not yet refereed. It isn't reasonable to take this pipeline as being validated in the field.

      We believe our answer to the previous point addresses these concerns - We have added references supporting the choice of steps and tools used in scFlow in the supplementary text which should address the reviewer’s concerns about justifying the analysis pipeline. Moreover, as a result of the pipeline we identified that 16,000 of the nuclei kept by the authors are likely of low quality and indicative of cell death with high mitochondrial read proportions5,6.

      They also worry that the significant findings in Mathys' paper are influenced by the number of cells of each type. I'm sure it is since power is a function of sample size, but is this a bad thing? It seems odd that their approach is not influenced by sample size.

      We thank the reviewer for highlighting this point. As they noted, we conclude that the original authors number of DEGs is just a product of the number of cells. However, the reviewer states that ‘It seems odd that their approach is not influenced by sample size’. An increase in the number of cells is not an increase in sample size since these cells are not independent from one another - they come from the same sample. Therefore, an increase in the number of cells should not result in an increase in the number of DEGs whereas an increase in the number of samples would. This point is the major issue with pseudoreplication approaches which over-estimate the confidence when performing differential expression due to the statistical dependence between cells from the same patient not being considered. See these references for more information on this point1,2,7,8. We have added a discussion of this point to our manuscript in paragraph 6.

      Moreover, recent work has established that the genetic risk for Alzheimer’s disease acts primarily via microglia9,10. Thus, it would be reasonable to expect that the majority of large effect size DEGs identified would be found in this cell type. This is what we found with our pseudobulk differential expression approach – 96% of all DEGs were in microglia. We have updated the text of our manuscript (paragraph 5) to highlight this last point.

      References 1. Murphy, A. E. & Skene, N. G. A balanced measure shows superior performance of pseudobulk methods in single-cell RNA-sequencing analysis. Nat. Commun. 13, 7851 (2022).

      1. Squair, J. W. et al. Confronting false discoveries in single-cell differential expression. Nat. Commun. 12, 5692 (2021).

      2. Crowell, H. L. et al. muscat detects subpopulation-specific state transitions from multi-sample multi-condition single-cell transcriptomics data. Nat. Commun. 11, 6077 (2020).

      3. Soneson, C. & Robinson, M. D. Bias, robustness and scalability in single-cell differential expression analysis. Nat. Methods 15, 255–261 (2018).

      4. Ilicic, T. et al. Classification of low quality cells from single-cell RNA-seq data. Genome Biol. 17, 29 (2016).

      5. Heumos, L. et al. Best practices for single-cell analysis across modalities. Nat. Rev. Genet. 24, 550–572 (2023).

      6. Zimmerman, K. D., Espeland, M. A. & Langefeld, C. D. A practical solution to pseudoreplication bias in single-cell studies. Nat. Commun. 12, 738 (2021).

      7. Lazic, S. E. The problem of pseudoreplication in neuroscientific studies: is it affecting your analysis? BMC Neurosci. 11, 5 (2010).

      8. Skene, N. G. & Grant, S. G. N. Identification of Vulnerable Cell Types in Major Brain Disorders Using Single Cell Transcriptomes and Expression Weighted Cell Type Enrichment. Front. Neurosci. 0, (2016).

      9. McQuade, A. & Blurton-Jones, M. Microglia in Alzheimer’s disease: Exploring how genetics and phenotype influence risk. J. Mol. Biol. 431, 1805–1817 (2019).

    2. Joint Public Review:

      Murphy, Fancy and Skene performed a reanalysis of snRNA-seq data from Alzheimer Disease (AD) patients and healthy controls published previously by Mathys et al. (2019), arriving at the conclusion that many of the transcriptional differences described in the original publication were false positives. This was achieved by revising the strategy for both quality control and differential expression analysis. With this re-analysis, the authors aim to raise awareness of the impact of data analysis choices for scRNA-seq data and to caution focus on putatively wrongly identified genes in the AD research community. The revised manuscript has been improved by separating QC and DE analysis, which makes interpretation of both steps more straightforward.

      STRENGTHS:

      The authors demonstrate that the choice of data analysis strategy can have a vast impact on the results of a study, which in itself may not be obvious to many researchers.

      The authors apply a pseudobulk-based differential expression analysis strategy (essentially, adding up counts from all cells per individual and comparing those counts with standard RNA-seq differential expression tests), which is (a) in line with latest community recommendations, (b) different from the "default options" in most popular scRNA-seq analysis suites, and (c) explains the vastly different number of DEGs identified by the authors and the original publication. The recommendation of this approach together with a detailed assessment of the DEGs found by both methodologies could potentially be a useful finding for the research community. Unfortunately, it is currently not sufficiently substantiated.

      All code and data used in this study are publicly available to the readers.

      WEAKNESSES:

      The authors interpret the fact that they found fewer DEGs with their method than the original paper as a good thing by making the assumption that all genes that were not found were false positives. However, they do not prove this, and it is likely that at least some genes were not found due to a lack of statistical power and not because they were actually "incorrect". The original paper also had performed independent validations of some genes that were not found here. I had raised this weakness in my first review, but it was not explicitly addressed and still pertains to the revised manuscript. The authors have added an analysis that shows that "pseudoreplication" is prone to false positive (FP) discoveries for high cell numbers (Fig. 1f), but this does not prove that all of Mathys' DEGs were wrong.

      I am concerned that almost all DEGs found by the authors are in the rare cell types, foremost the rare microglia (see Fig. 1e). Indeed, there is a weak negative correlation between cell counts and numbers of DEGs (Fig. 1e), if the correlation analysis is to be believed (see next point). It is unclear to me how many cells the pseudo-bulk counts were based on for these cell types, but it seems that (a) there were few and (b) there were quite few reads per cells. If both are the case, the pseudobulk counts for these cell populations might be rather noisy and the DEG results liable to outliers with extreme fold changes. Supp. Fig. 3b now shows three examples of DEGs, of which one (EGR1) looks like the DE call is indeed largely driven by four outliers, while Supp. Fig 3a shows at least one gene (BEX1) that could be FP of the pseudobulk approach due to insufficient statistical power. The authors go on to cite two papers (one is their own, published in a journal with suspected lack of appropriate quality assurance measures https://predatoryreports.org/the-predatory-journals-1), to support that the finding of DEGs in microglia "makes more sense" (l. 127). In summary, neither the presented examples nor the supporting literature are convincing. Lastly, the authors even show themselves that their approach is liable to FPs if applied with very low cell numbers in the range of those for microglia and OPCs (Fig. 1g).

      The correlation analysis between cell counts and number of DEGs found is weak. In all three cases (Fig. 1c, d, e) the correlation is largely driven by a single outlier data point.

      The authors claim they improved the quality control of the dataset but offer no objective metric to assess this putative improvement. The authors' QC procedure removes some 20k cells that had not been filtered out by Mathys' et al. As the authors state themselves, this difference is mostly due to the removal of cells with a high mitochondrial read content. Murphy et al use a fixed threshold for the mitochondrial percentage of reads, while the original paper had removed cell clusters with an "abnormally high" mitochondrial read fraction. That also seems reasonable, given that some cells might have a higher mitochondrial read content for reasons other than being "low quality". Simply stating that Mathys' approach was ineffective at removing cells with high mitochondrial read content is a self-fulfilling prophecy given the difference in approach, and itself not proof that the original QC procedure was inferior.

      Batch correction: "Dataset integration has become a common step in single-cell RNA-Seq protocols and is recommended to remove confounding sources of variation" (l. 38). While it is true that many authors now choose to perform an integration step as part of their analysis workflow, this is by no means uncontroversial as there is a risk of "over-integration" and loss of true biological differences. I had raised this point previously, but the authors chose not to address it (quoted text and line numbers updated). Given that there is controversy in the literature and "community opinion" on the topic of data integration, this is another example of the authors claiming superiority in analysis without showing proof.

      Due to a lack of comparison with other methods and due to the fact that the author's methodology was only applied to a single dataset, the paper presents merely a case study, which could be useful but falls short of providing a general recommendation for a best practice workflow.

      APPRAISAL:

      The manuscript could help to increase awareness of data analysis choices in the community, but only if the superiority of the methodology was clearly demonstrated. However, the authors only show that there are differences but have no convincing (orthogonal) evidence that their methodology was indeed better. This applies to both QC and DE analysis.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      On behalf of the authors, I would like to thank the three reviewers for providing a valuable feedback on our manuscript. We appreciate that they all considered that the first part of our study conveys novelty to the field of neuroblastoma research, while being coherent with recent studies aimed at identifying the NB cells-of-origin through transcriptomics and single-cell RNA-sequencing approaches. We indeed identified a transcriptional signature that distinguishes LR-NBs from HR-NBs, revealing that these two NB subgroups are better discriminated by the core transcriptional signature shared by the distinct SA cell types, rather than by the transcriptional specificities of any of these cell types, as recently debated. Of note, our findings unveil that the sympatho-adrenal transcriptional program facilitates NB formation but concomitantly restricts its malignant potential. We also wish to thank the reviewers for acknowledging that, in contrast to previous studies, we pursued further by testing the functional relevance of this signature through a combination of in vitro and in vivo experiments. We thereby identified NXPH1 and its receptor α-NRXN1 as ones of the very first factors showing an anti-metastatic activity in the context of NB. Uncovering NXPH1/α-NRXN signaling as a possible target to treat metastatic HR-NBs gives our study considerable clinical relevance.

      We consider that the critics and recommendations provided by the three reviewers are positive and pertinent. We are thus willing to address nearly all the reviewers’ concerns and suggestions within the scope of a revision, including performing additional in vivo experiments, as explained in details in the following section. We hope that the planned revisions will be sufficient to make our manuscript suitable for publication.

      __ __

      2. Description of the planned revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      In the reviewed manuscript, the authors aimed to characterize the sympathoadrenal (SA) transcriptional landscape that defines low- and high-risk neuroblastomas (LR-NBs and HR-NBs respectively). In particular, they analyze previously published Affymetrix U219 expression profiles of 18 low- (n=8) and high-risk (n=10) neuroblastomas, and 2 fetal human adrenal glands. The authors define transcriptional signatures of LR- and HR-NBs, and further unbiasedly classified them in 4 clusters defining groups of patients with different prognosis (as tested using the 498 SEQC cohort). Within these transcriptional signatures, the authors delineated a SA signature using a human fetal adrenal gland transcriptional profile recently published (Kildisiute et al. 2021), that can discriminate low-risk neuroblastomas. From these genes, the authors further select NXPH1 and NRNX1-2 as promising targets for extensive experimental in vitro and in vivo validation, including validation in cell cultures, and xenografts, to determine that NXPH1/alpha-NRXN1-2 signaling is sufficient for NB tumor growth, and that the expression of either stimulates the metastatic potential of NB cells.

      **Major comments:**

      1- The cohorts and data used by the authors to conduct the main analysis of the paper are already published, and thus the contribution of the analysis is incremental. In particular, the authors analyzed arrays from a limited cohort size, in comparison with others available sequenced with RNA-seq (e.g. 176 HR- and 322 non-HR NB in 498 SEQC; 224 HR- and 342 non-HR NBs included in the Westermann-genecode19 cohort; and 80 HR- and 20 non-HR in the Jagannathan cohort). Furthermore, the 12,000 most-expressed genes (out of ~20,000 available) were analyzed by the authors, as opposed to more than 40,000 (coding and non-coding) included in a normal RNA-seq study. Only the 498 SEQC dataset provides 12,000 genes significantly up-regulated in either high-risk (n~5,500) or non-high-risk (n~7,000). The differences between datasets could influence the results of the study. For example, in the reviewed manuscript, genes with a high expression in LR-NB (compared to fAG) included DNMT3B, SEMA5A, SOX5, and TET1, all of which have a significantly higher expression in HR-NBs of the 498 SEQC cohort. The quality of the manuscript will be enhanced with consistent results obtained by conducting the same reported analysis in larger cohorts.

      Reply: To define the transcriptional signatures associated to the aetiology of LR- and HR-NBs and to the malignant behavior of HR-NBs, we first needed to compare the transcriptomic signatures of LR-NB and HR-NB samples to the one of a non-tumoral, healthy tissue such as the fetal adrenal gland. The online databases cited by the reviewer #1 do not contain any healthy samples, thus precluding the possibility to use it for the first step of our analysis. This is why we decided to use as a starting point the cohort of samples from the Hospital Sant Joan de Déu (HSJD cohort), which further provided the advantage of comparing samples from patients all diagnosed and treated at the same facility. We then used the SEQC database in different steps of our analytic strategy (Fig.S1F, 2F-H, 3A-B, S3) to test the relevance and coherence of the results obtained with the HSJD cohort in the context of a larger cohort. As mentioned by the reviewer #1, we indeed focused our analysis on the 12,000 most-expressed genes. We did so to follow the recommendations of the software Phantasus for transriptomic analyses of mammalian datasets.

      2- In the reviewed manuscript, PHOX2A and PHOX2B are significantly more expressed in both LR-NB and HR-NB compared to fAG. This is also the case for other adrenergic markers including TH and DBH. Oppositely, the expression of cortex markers (i.e. STAR and CYP11A1) is significantly higher in fAG. Nevertheless PNMT is not significantly up-regulated in fAG in comparison to LR-NB nor HR-NB. Is it possible that the fetal adrenal glands analyzed include a large proportion of cortex that confounds the transcriptional signals? The quality of the manuscript will be enhance if the authors could establish what proportion of the fAG transcriptional signal belongs to cortex, and if they account for its influence in the analysis.

      Reply: The samples of human fetal adrenal gland from which RNA was extracted were obtained from donations (samples staged at 22 weeks post-conception or 2 days after birth) and evaluated by a pathologist who confirmed their correct preservation before sample processing. As fetal adrenal glands are very small tissues, successfully separating the cortical and adrenal regions requires micro-dissection, which was not applied. These samples were instead processed entirely, presenting a ratio of medulla/cortex tissue according to their developmental stage.

      3- A recent published paper (Bedoya-Reina et al. 2021) study the differences of HR-NB and LR-NB from a single-cell perspective. In the published manuscript, the authors conclude that LR-NBs are enriched in cells that resemble chromaffin and sympathoblast cells, while the high-risk neuroblastomas are enriched in undifferentiated cells that resemble cells with progenitor characteristics in post-natal adrenal gland. This is broadly consistent with the conclusion reached by the authors in the manuscript under review. It will enhance the content of the reviewed manuscript if the authors compare their transcriptional signatures with the recently published transcriptional signatures in this paper to answer the following questions:

      -1) to what extent the transcriptional signatures for HR-NBs (4 hierarchical clusters incl.) in the manuscript under review resembles that from the published undifferentiated cluster (nC3) enriched in HR-NBs, and the progenitor cluster (hC1) in post-natal adrenal gland;

      -2) to what extent the transcriptional signatures for LR-NBs (4 hierarchical clusters incl.) in the manuscript under review resembles that from the published NOR (nC7, nC8, and nC9) enriched in LR-NBs, and that from the chromaffin cells (hC4) enriched in post-natal adrenal gland;

      -3) how is the expression of NXPH1 and alpha-NRXN1-2 in the reported LR- and HR-NBs, and adrenal gland.

      Reply: As stated by the reviewer #1, our findings are indeed in agreement with the main conclusions recently reported by Bedoya-Reina et al (Bedoya-Reina et al, 2021). We agree with the reviewer #1’s suggestion that a detailed comparison of our transcriptomic signatures with those of Bedoya-Reina et al would be interesting. We will thus perform this comparison and provide these complementary results in the revised version of the manuscript.

      4- In the discussion, the authors indicate that they do not aim to identify the transcriptional signature associated to NB origin but rather use the component of the SA lineage that distinguish LR- and HR-NBs. This statement implies that neuroblastoma can originate from any cell in the developing SA lineage (i.e. SCP, bridge, chromaffin and sympathoblast), a controversial assumption that requires further proof. In particular, when discussing about the core sympathoadrenal signatures enriched in LR-NBs and HR-NBs, the authors obtained a SA signature of genes shared by at least 3 of the 4 SA cell signatures. Further justification needs to be provided as for why (in particular) one of these SA cell signatures exclude the sympathoblast/neuroblast contribution.

      Reply: We decided to use as a core SA signature the genes shared by at least 3 of the 4 SA cell types to avoid being too restrictive, the SCP signature being particularly distant from the 3 others. As such, the list of genes shared by all 4 cell types consists of 663 genes, of which only 51 are retrieved in our list of 503 LR vs HR DEGs. Conversely, the list of genes shared by Bridge cells, Chromaffin cells and Sympathoblasts (but not shared by SCPs) consists 3,530 genes, of which 199 are retrieved in the list of 503 LRvsHR DEGs. These complementary results, which can be discussed and provided if required, therefore suggest that the core SA signature discriminating LR-NBs from HR-NBs represents mostly the Bridge-Chromaffin-Sympathoblast lineage and excludes the SCP identity. They are therefore in agreement with the recent notion that NB cells-of-origin derive from the sympathoblast-chromaffin lineage.

      5- Some of the most interesting results in the paper are limited to proportions in a subset of top-ranked genes. It will be valuable to set the analysis in an hypothesis driven context, add probabilities, test names, and corrected p-values to the results.

      Reply: Our study is based on the initial hypothesis that LR- and HR-NBs might differ in the way they exploit the transcriptional program underlying their developmental origin. To get a deeper insight into this notion we performed a sequential differential expression analysis of primary samples of LR-NBs, HR-NBs and human fetal adrenal gland using the web-based Phantasus software. This software identifies differentially expressed genes between groups using the Limma R package, as detailed in the Methods section. As such, basic statistics for significance analysis were performed using a modetared T-test (as specified in the Limma R package) and FDR-adjusted P-values were set to PAdditionally, top-ranked genes of SA clusters were selected as part of a heuristic approach aimed at highlighting the clinical implication of the transcriptional clusters retrieved in our analysis. The relationship between their expression levels and patient survival was further analyzed using the SEQC database (Fig. S1E). Next, and in contrast to previous studies, we tested experimentally the validity of our analytical findings correlating the expression of SA-c1 genes with a better patient prognosis. To this aim, we selected the candidate gene NXPH1, one of the top-ranked genes from the SA-c1 subset, on the basis of several complementary arguments (listed in response to the reviewer #2’ comment #4). We thereby analyzed how modulating the expression of NXPH1 or that of its receptor α-NRXN1 affect the growth and metastatic potential of human NB cells. The results obtained argue for the validity of our model, by proposing that the neural crest-derived sympatho-adrenal developmental program, in particular the SA-c1 signature, plays a complex role in NB tumorigenesis: it facilitates tumor growth but blocks metastasis formation, hence opposing NB malignancy.

      **Minor comments.**

      6- In comparison with other cohorts that include low- and intermediate-risk NBs as non-HR NBs, the reviewed data specifically includes low- and high- risk NBs. It is important that the authors include a characterization of intermediate-risk neuroblastomas in their analysis.

      Reply: The samples forming the HSDJ cohort were all obtained from NB patients diagnosed and treated at the Hospital Sant Joan de Déu. Several clinical and biological parameters were used for classification, among which the age of the patient. If we apply the cut-off point of 1 year (used at the time the samples were obtained), our cohort does not include any intermediate-risk NB. If we apply the cut-off of 18 months which is now more usual, our cohort would contain only one case (#HSJD-NB14 - aged 16 months at the time of diagnosis) that could be classified as intermediate-risk.

      7- Further details and figures on what precise criteria was used to remove the sample #LR-08 is required. How including this sample changes the reported results?

      Reply: As explained in the Methods section, before comparing the transcriptomic landscapes of LR-NBs and HR-NBs, we first assessed the sample dispersion by performing a principal component analysis. This analysis identified one outlier (#LR-08) that was clearly distant from all the other NB samples (both LR- and HR-NBs). We thus removed this sample to limit the dispersion and variability that would have impacted the subsequent analyses, as recommended by the Phantasus guidelines. We will provide an illustration of the PCA including this outlier. We believe that performing de novo the whole bioinformatical analysis including this outlier would not bring any novel significant conclusion.

      8- GO-term distribution was assessed using the 50 most-enriched GO-terms. How would the results change if all the significant GO terms were analyzed?

      Reply: We will re-analyse the GO-term distribution by including all the significant terms.

      9- Was the SEQC 498 (GSE62564) dataset obtained with microarrays (as indicated in the methods) or with RNA-seq (i.e. Illumina HiSeq 2000)?

      Reply: Similar results were obtained using either the SEQC database obtained with microarrays or the one obtained with RNAseq. The data presented in our manuscript correspond to the ones obtained with the RNA-seq SEQC database.

      10- In methods, the first quartile (Q1) in SA-c1 has a higher limit in 487 samples and the fourth quartile the lowest limit in 4, how many samples (out of 498 NBs) were excluded and why?

      Reply: As explained in the Methods section and in Fig. 2F, the complete SEQC cohort was included in this survival analysis. To subdivide the 498 samples of the SEQC cohort into 4 expression quartiles, we evaluated whether the expression level of each of the 242 SA-c1 genes (corresponding to 573 ref-seq IDs) in a given patient sample was above or below the mean expression of that gene in the complete cohort. Samples were then distributed into quartiles based on the number of genes presenting an expression level above the mean. As detailed in the Methods section, the resulting sample distribution was as follows: 487≤Q1≤360 (124 patients); 359≤Q2≤250 (125 patients); 249≤Q3≤135 (126 patients) and 134≤Q4≤4 (123 patients).

      11- In the 503 DEGs between LR-HR NBs, NTRK2 and MYCN are not included, even if the HR samples included MYCN amplified tumors. Can the authors comment on this?

      Reply: MYCN did not pass the cut-off when comparing its expression levels in LR-NB and HR-NB samples (showing an adjusted P=0.10226 for a cut-off of adjusted PNTRK2 was present in our initial 12,000 gene dataset, but it was not differentially expressed in any of the comparisons made (LR vs fAG: adj-P=0.4916; HR vs fAG: adj-P=0.63757; HRvsLR: adj-P=0.90431)

      12- The authors mention that the top 30 genes found in cluster c1 (and also in c2) are correlated with favorable patient prognosis. Is it the case that *all* the genes in c1 (and also c2, c3 and c4) are significantly associated with a favorable or else unfavorable prognosis?

      Reply: As presented in the datasheet “KM analyses” of Table S3:

      • 32 out of the 33 genes (97%) of the cluster c2 correlate to an unfavorable prognosis, the 33th gene showing no particular correlation.

      • 29 out of the 32 genes (91%) of the cluster c3 correlate to a favorable prognosis, 1 gene correlates to an unfavorable prognosis and the last 2 do not show any particular correlation.

      For the clusters c1 and c4, we focused on the top 30 genes because these clusters contain numerous genes (338 and 100, respectively). The results obtained showed:

      • All the top 30 genes (100% of the genes tested) of the cluster c1 correlate to a favorable prognosis

      • 23 out of the top 30 genes (77% of the genes tested) of the cluster c4 correlate to an unfavorable prognosis, whereas the other 7 genes do not show any particular correlation

      We believe that these results are convincing enough. However, if considered mandatory we will assess the prognosis of all the genes found in clusters c1 and c4.

      13- The high expression of a (significant?) number of genes in cluster c4 is observed in patients with worst outcome (i.e. lower event-free survival), including ATR, HIF1A, ING2, POLR2L, SRPRB (498 SEQC, analyzed with R2).

      Reply: Indeed, 23 out of the top 30 genes (77% of the genes tested) of the cluster c4 correlate to an unfavorable prognosis, which appears to us as a significant number of genes. We will test whether the expression of the remaining genes forming the cluster c4 also correlate to an unfavorable prognosis.

      14- Regarding the 242 genes in the core SA signature, although its a smaller number, the expression of several genes in the core SA signature with a higher expression in HR compared to LR belonging to clusters 2, 3, and 4 is observed in worst outcome patients in the 498 SEQC cohort (CHD7, DNMT1, HMGA1, HSD17B12, LBR, LSM7, MCM4, NKAP, POLA1, and others). Is this small fraction significant?

      Reply: The genes presenting a higher expression in HR-NBs than in LR-NBs are found either in cluster c2 or cluster c4. The core SA signature retrieved 262 genes of the 503 LR vs HR DEGs, of which 14 belong to cluster c2 (including CHD7, DNMT1, HMGA1, LBR, LSM7, MCM4, and POLA1) and 3 to cluster c4 (UQCRFS1, NKAP and HSD17B12). As shown in the “KM analyses” datasheet of Table S6, these 17 genes all correlated with an unfavorable prognosis.

      15- In Kildisiute et al. 2021, NRXN1 is expressed in SCPs, while NXPH1 is expressed in bridge, chromaffin and sympathoblastic cells. How are the microenviroment of these cells regulating the expression of these genes in a developmental context (particularly as sympathoblastic cells are know to have larger proliferative capabilities than SCPs)? how is this cell heterogeneity replicated by a NB cell line? are mesenchymal and adrenergic cells expressing differentially NRXN1 and NXPH1?

      Reply: Unfortunately, the literature about NXPH1 remains very limited (less than 40 articles referenced in Pubmed) and nothing is known about the regulation of its expression during development. The data from Kildisiute et al (Kildisiute et al, 2021) indeed identified NXPH1 in the signatures of bridge cells, chromaffin cells and sympathoblasts, while its receptors NRXN1 and NRXN2 were found in the transcriptomic signatures of all 4 SA cell types. Interestingly, the data provided by Kildisiute et al further established that the expression of NXPH1, NRXN1 and NRXN2 is specifically enriched during the pseudo-time transition from bridge cells to sympathoblasts. This suggests that NXPH1/α-NRXN signaling might be particularly important at that stage and could participate in regulating this transition. But this remains purely speculative and it will need further investigation.

      We initially used a panel of 10 human NB cell lines harboring distinct characteristics in terms of genetic profile and morphological properties. We did not find any specific correlation between NXPH1 or α-NRXN1/2 expression and the different types of NB cell lines. We will provide an illustration of this observation in the revised version of the manuscript.

      NB cells can convert or be reprogrammed from an adrenergic state, which is less chemoresistant in vitro, to a mesenchymal state (van Groningen et al, 2019). As asked by the reviewer #1, we investigated the expression of NXPH1 and α-NRXN1 in relation with the mesenchymal vs adrenergic status of NB cells (using the dataset GSE90803 from (van Groningen et al, 2019). We found that both genes are expressed at higher levels in cells of the adrenergic phenotype, suggesting that NXPH1/α-NRXN signaling might be particularly relevant for the maintenance of this phenotype. If needed, we will provide an illustration of this observation in the revised version of the manuscript.

      16- Figure 1B and C, 2B,D: might the information provided be enhanced? otherwise these inserts might be excluded.

      Reply: We thought that the panels presented as Fig.1B, C and 2B, D would be helpful to the readers. We could remove them if the editors and reviewers consider it mandatory.

      17- Figure 3D: Kildisiute et al. 2021 data and GTEX available at human protein atlas indicate expression of NRXN1 and NXPH1 in developing and adult adrenal gland. Might the results illustrated suggest a confounding effect in the sampled fetal adrenal glands, perhaps from cortex?

      Reply: The samples of human fetal adrenal gland from which RNA was extracted were obtained from donations (samples staged at 22 weeks post-conception or 2 days after birth) and evaluated by a pathologist who confirmed their correct preservation before sample processing. As fetal adrenal glands are very small tissues, successfully separating the cortical and adrenal regions requires micro-dissection, which was not applied. These samples were instead processed entirely, presenting a ratio of medulla/cortex tissue according to their developmental stage.

      18- The authors conduct extensive experiments in NRXN1, and make conclusions about its role in for instance metastasis, nevertheless the LR-NB/HR-NB SA signal only includes NRXN2. Can the authors comment on the differences between NRXN1s and NRXN2?

      Reply: NRXN1 and NRXN2 were both found to be differentially expressed between LR vs fAG and HR vs fAG, and were thus retrieved among the list of 3.096 common DEGs (Table S3). NRXN2 was further found in the list of 503 LR vs HR DEGs (adjusted P=0.037), showing higher expression levels in LR-NBs than in HR-NBs (see Fig.3D). NRNX1 presented an expression profile comparable to that of NRXN2 (Fig.3D) but did not pass the cut-off (adjusted P=0.38) due to an increased variability in HR-NB samples and was thus absent from the list of LR vs HR DEGs. In vitro, the expression levels of NRXN1 and NRXN2 showed comparable patterns. When we initiated the functional experiments there was no fluorescence-conjugated antibody available to detect and sort α-NRXN2, but there was for α-NRXN1. This is the practical reason that led us to focus on α-NRXN1 in the second part of our study.

      Reviewer #1 (Significance (Required)):

      The significance of the study relies in investigating the role of selected targets in neuroblastomas within a risk group. In particular, HR-NBs have poor outcomes and are generally metastatic at the time of diagnosis.

      The results of the manuscript are somehow consistent with a recently published manuscript analyzing LR- and HR-NBs from a single-cell perspective. The manuscript will be enhanced by conducting the suggested comparison between the reviewed and the reported results. The authors further need to comment why HR-NBs markers, particularly MYCN is not recovered in the LR-NB/HR-NB and the LR-NB/HR-NB SA signals. Also they need to comment on possible confounding effects in the fetal adrenal gland.

      The paper is directed to a broader audience of cancer and developmental biologists, and computational biologist. Yet further statistical support needs to be provided.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate). Please place your comments about significance in section 2.

      In this paper, Fanlo-Escudero et al., determines gene signatures differentiating low-risk (LR) neuroblastoma (NB) from high-risk neuroblastoma (HR-NB), as well as LR- and HR-NB as an entity from human fetal adrenal gland. They identify a transcriptional signature corresponding to a core sympathoadrenal lineage that can discriminate between LR-NB and HR-NB. This signature is composed of genes associated with favorable patient outcome. The authors further choose one gene, NXPH1, for functional analysis and investigates the effects this gene has on NB progression using in vitro assays, chick CAM assay and mouse in vivo models. The authors conclude that this transcriptional signature can distinguish LR-NB from HR-NB and that NXPH1 is involved in NB cell growth.

      **Major comments:**

      • Are the key conclusions convincing? The key conclusions are 1) a core SA lineage signature can discriminate between LR-NB and HR-NB, and 2) NXPH1 represses NB malignancy (in terms of metastatic capacity) and is a therapeutic target. The first conclusion is indeed convincing, and not contradictive to common beliefs. The second conclusion is poorly supported by data. The authors perform a range of experiments using in vitro and in vivo settings, but lack some fundamental experiments and overstate their findings.

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? Several claims should be softened, re-phrased and/or clearly marked as preliminary or speculative. No data nor claims need to be removed.

      For example, the following statements need to be changed:

      1- Page 10, second paragraph: The authors state "Remarkably, NXPH1 and α-NRXN1/2 levels increased in all the NB cell lines harbouring a sphere-forming capacity (Fig. 3E), thereby revealing a strong positive correlation between the expression of NXPH1 and α-NRXN1/2 and the acquisition of a NCC stem cell identity". The authors only show that NXPH1 is expressed in 8 out of 10 NB cell lines. Sphere-forming capacity is displayed in a relative and not absolute scale which makes it difficult to assess which cell lines that do form spheres and to what extent. The capacity to form spheres (from low to high) does not correlate to the levels of NXPH1 in the different cell lines.

      Reply: In its current form, Fig.3E presents via a heatmap representation how the expression of selected genes changes after growing cell lines in sphere-forming conditions as compared to basal (normal) ones. We understand from various reviewers’ comments that this representation has been misleading. We will change it, showing more explicitly the expression levels both in basal and sphere-forming conditions and will bring further details on how the sphere-forming ability of each cell line was assessed and characterized.

      2- Page 13, paragraph 1. The authors write "...these data revealed that NXPH1/α-NRXN1 signaling is necessary and sufficient for NB tumor growth in vivo". This is an overstatement. Tumors still form, meaning that NXPH1 signaling is not sufficient.

      Reply: Indeed, tumors still form after xenografting sh-NXPH1 or sh-NRNX1 cells but they form with a decreased frequency. Specifically, sh-NXPH1 cells formed tumors in 4 out of 6 xenografted mice, and the 4 tumors all showed a markedly reduced volume. Tumors formed from sh-NRNX1 cells were observed in only 2 out of 5 xenografted mice, with 1 of the 2 tumors showing a markedly reduced volume. We consider that these results support the conclusion that inhibiting NXPH1/α-NRXN1 signaling impairs tumor growth, affecting both tumor initiation and tumor growth. We however understand the reviewer #2’s comment and will thus rephrase this part accordingly. In addition, we will provide complementary data showing the mean volume of the tumors generated in the distinct experimental conditions.

      3- Throughout the text, the authors convert their statements. One example is page 15, first paragraph. They write "...growth of NB cells but markedly restrict their metastatic potential", but they do not show this. Instead, they only address the opposite situation - Knockdown enhances metastasis. This is not equal to their statement. See other experiments in other sections. The authors need to go through the manuscript and make sure that they explain their conclusions to actually fit their experiments.

      Reply: We will follow the reviewer #2’s recommendation and will rephrase the conclusions whenever needed to better fit to the experimental results.

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary to evaluate the paper as it is, and do not ask authors to open new lines of experimentation.

      **Issues to respond to:**

      4- The authors should more clearly explain why they choose to further study NXPH1 (second highest on the list), involved in synaptogenesis and neurotransmission, instead of SOX6, highest on the list, that is highly relevant in neural crest development.

      Reply: The combination of several reasons led us to choose NXPH1 for functional studies:

      • NXPH1 is a secreted factor, whose activity might be much more easy to target and modulate in future pharmacological/clinical assays than that of a transcription factor like SOX6
      • NXPH1 and its receptor NRNX2 both came out in the list of SA-c1 genes, and NRXN1 presented a comparable expression profile (higher levels in LR-NBs than in HR-NBs, although it did not pass the cut-off required to appear in the SA-c1 list), emphasizing the putative importance of this signaling pathway in NB tumor biology
      • The functional involvement of NXPH1 or that of its receptors has never been addressed in cancer formation or progression to date

        5- When the authors investigate the expression of NXPH1 and other genes and compare that with sphere-forming capacity (Fig. 3E and page 10) they analyze expression in cells cultured in basal medium while sphere-forming capacity is measured after 5 weeks in restricted medium. How does the expression of the analyzed genes change under these conditions?

      Reply: In its current form, our Fig.3E is actually showing how the expression of the analyzed genes changes after growing cell lines in sphere-forming conditions as compared to basal (normal) ones. As explained above (reviewer #1, point #1), we understand that this representation has been misleading and we will modify it.

      6- Is the subpopulation of NRXN1+ cells low (1.5%) because these samples are from aggressive ("High-risk") cell lines?

      Reply: All the NB cell lines used in this study derive from HR-NB tumors. As mentioned by the reviewer #3, there is no cell line modeling low-risk neuroblastoma to date. We indeed observed that the proportion α-NRXN1+ cells, as detected by FACS, was very low in all three cell lines tested. A similar observation was made using cells dissociated from three different patient-derived xenografts. We believe that a similar observation made in 6 samples of distinct origins highlights the consistency of finding a low proportion of α-NRXN1+ cells in NB samples.

      7- The authors state "...the number of cells quantified per tumor section was decreased by ~50% for the α-NRXN1+-deprived cells relative to their control (Fig. 3J, K), thus revealing that α-NRXN1+ cells are required to support NB tumor growth in vivo". This is not a correct conclusion. This experiment shows that a-NRXN1- cells do not grow and expand to the same extent as control cells. They cannot say that a-NRXN1+ cells support NB growth without comparing growth between a pure a-NRXN1+ and control cells.

      Reply: Unfortunately we could not assess the growth of purified α-NRXN1+ cells, given the low number of α-NRXN1+ cells that could be sorted as compared to the numbers of cells required to perform a CAM assay. We thus opted for comparing the growth of total SK-N-SH cells with that of SK-N-SH cells in which the α-NRXN1+ subpopulation had been experimentally removed. We believe that the results obtained convincingly argue for the importance of the α-NRXN1+ subpopulation in promoting NB proliferation and growth. Nevertheless, we understand the reviewer #2’s comment and will rephrase the conclusion.

      8- The authors use shRNAs to knock down NXPH1. They enrich their cells by two means - puromycin or doxocycline. This results in equal cell populations. The authors however state that they use doxocycline to circumvent the growth arrest they observe with puromycin selection. They need to elaborate on this and show why this would be the case and what difference the two methods do and show.

      Reply: We generated two types of knock-downs: a constitutive one and an inducible one. Puromycin was used to select constitutive knock-downs, whereas doxycycline was used to trigger sh-RNA production in an inducible manner, in stable clones previously established through neomycin selection. We apologize if this was not stated clearly enough in the manuscript and we will correct it.

      9- The major flaw of this paper is that the authors use one cell line in total, and even more that they use the same cell line for both knockdown and activation. Since they do show that different NB cell lines have different expression levels (ranging from high to absent), they should choose one cell line for KD and one for overexpression. The authors could also do a rescue experiment with knockout and gain-of-function (e.g., construct that will not be targeted by the shRNA) in the same cells.

      Reply: Since NXPH1 is a secreted protein, we needed a cell line that expresses both NXPH1 and its receptors to expect noticing effects on NB cell behavior when their expression is reduced. We reasoned that performing a gain-of-function of NXPH1 in a cell line that does not express its receptors would have no interest, and vice versa. We also believe that it is more conclusive to perform gain- and loss-of-function experiments in the same cell line, because of the likely differences in cell behavior and aggressiveness of distinct cell lines. We however agree that our conclusions would be strengthened if similar conclusions were reached using different cell lines. We will thus perform growth and metastasis assays both in vivo and in the CAM using NXPH1 and α-NRXN1 shRNAs in an additional cell line. We will moreover consider performing rescue experiments and will think about the best methodology to do so.

      10- They only use one shRNA after trying several (Fig. S4). The efficiency is substantially variable and not convincing. As stated also elsewhere in this review, they need to check protein level. And to ensure that their results are not off-target they should perform at least some crucial experiments with two shRNAs.

      Reply: We agree that the decreased mRNA levels caused by NXPH1 and α-NRXN1 shRNAs showed variability. Yet, they were sufficient to significantly reduce cell viability, which was impaired by two distinct shRNA constructs, both for NXPH1 and α-NRXN1. To complete these experiments as recommended by the reviewer #2, we will assess how NXPH1 and NRXN1 expression is altered at the protein level by western-blotting. We will moreover address possible off-target effects by RT-qPCR.

      11- Why don't the authors add BrdU post-implantation? This is easily done in the egg considering the accessibility and would better reflect the proliferation in vivo.

      Reply: Adding BrdU pre-implantation allowed us to get a read-out of the global proliferative behavior of NB cells over the whole post-implantation duration. Adding it at the end of the post-implantation would have only allowed us to assess the proliferative behavior of cells at the end of the experiment. We believe that this would have been less informative.

      12- Why do the authors switch between CAM and mouse xenografts? I understand that the mouse model must be employed for "metastasis", but can it be explained why and when they perform the different "tumor growth" experiments?

      Reply: The CAM assay was used for 2 reasons: 1) when cell numbers were limiting (i.e. testing the importance of the α-NRXN1+ subpopulation for tumor growth), and 2) to perform a gain-of-function strategy using a recombinant rNXPH1 protein and testing its effects on tumor growth over a duration of 1 week. Such experiment would not have been possible using mouse xenografts, due to the extended experimental duration (7-8 weeks) of this assay and to the need for repeated rNXPH1 injections. The rNXPH1 gain of function experiment in the CAM and the NXPH1/α-NRXN1 loss-of-function experiment using mouse xenografts were performed concomitantly.

      13- Why do the authors do left ventricle injections for metastatic studies and not AG implantations?

      Reply: We reasoned that cell injections into the left ventricle were ideal to test the organotropism of metastatic NBs, as this methodology facilitates cell dissemination and colonization of organs targeted by NB metastasis such as the liver and bone marrow. Cell implantation into the adrenal gland is especially useful to study cell growth in one of the primary sites of NB growth, which was not our experimental purpose at that stage of the study.

      14- The measurement of bioluminescence is very difficult to interpret. The authors discuss metastatic spread, but the images show only large blobs covering the heart and areas surrounding it, especially for the sh-αNRXN1. To be able to say that the cells have colonized specific organs the authors need to dissect these organs and perform staining. I see it as tumors recur rather than particularly metastasize when they re-appear after 6 weeks.

      Reply: The parameters for bioluminescence detection were set equally for all experimental conditions. The differences in bioluminescence intensities observed in mice injected with control cells compared to those injected with shNXPH1 and shNRXN1 cells explain why it is so intense in the case of shNXPH1 and shNRXN1 cells.

      As suggested by the reviewer #2, we had further dissected the mice and recovered the organs of interest. We will provide additional data confirming that an intense bioluminescent signal was effectively detected in the liver and hind legs (containing bone marrow) of mice injected with shNXPH1 and shNRXN1 cells, but not in those injected with control cells.

      15- How do the authors explain results presented in Fig. S5: There are more HuNu+ cells but tumor size is unchanged?

      Reply: In our experimental CAM setup, 5·105 SK-N-SH cells were embedded in 10μl of Matrigel, which served as a matrix facilitating tridimensional NB cell proliferation. The addition of rNXPH1 increased the number of HuNu+ (NB) cells per section and per mm3 (Fig.4G, H and Fig.S5C), without significantly increasing the tumor area (Fig.S5D) nor the tumor/matrigel volume (data not shown in the current version, but it will be included in the revised version). As illustrated in Fig. 4G, the matrigel was not totally filled with NB cells, even at the time of recovery. We thus deduced from these observations that more NB cells progressively filled the matrigel, without reaching the point where they significantly altered the tumor/matrigel volume. Nevertheless, we can provide additional data revealing that the addition of rNXPH1 caused a slight, yet reproducible increase in the tumor/matrigel weight, in agreement with the increased cell density already shown in Fig.S5C.

      • Are the suggested experiments realistic for the authors? It would help if you could add an estimated cost and time investment for substantial experiments.

      Some suggested experiments take time (orthotopic implantations, rescue experiments, adding cell lines and # of shRNAs).

      However, as long as the authors discuss and address their methods for in vivo growth (in ovo vs in vivo vs their choice of metastasis model), an orthotopic AG model is not necessary. The authors should however consider it for future studies.

      Experiments required to support their conclusions: A rescue experiment and use of different cell lines for KD and overexpression is somewhat time-consuming, as well as ensuring KD at protein level and include an additional shRNA in crucial experiment. I expect that this would take 2-4 months.

      • Are the data and the methods presented in such a way that they can be reproduced? 16- The authors should elaborate on their methods, but in general they are reproducible.

      Reply: As requested, we will bring further details on the experimental setups.

      Are the experiments adequately replicated and statistical analysis adequate?

      17- In several places the number of replicates is questionable. Especially at the end of page 26, the authors state that they have performed n=1-4 replicates. N=1 replicate is never ok. In several instances they do n=2 replicates. This can be acceptable but the authors could address this.

      Reply: When assessing the sphere-forming capacity of the different NB cell lines, some cell lines only produced spheroids in 1 of the 4 replicates tested. This is a result in itself, which helped us classifying cell lines based on their sphere-forming capacity. We understand that this Methods paragraph was elusive. We apologize for it and will clarify this aspect.

      **Minor comments:**

      • Specific experimental issues that are easily addressable. Mainly text changes can be seen as minor. For experiments and other issues, see other sections.

      • Are prior studies referenced appropriately? Yes. 18- The reference list is not coherently styled.

      Reply: We understand that using numbered references can be annoying. We will adapt the reference format to stick to the guidelines of the specific journal to which the manuscript will be addressed.

      • Are the text and figures clear and accurate?

      19- Overall, the paper is written in a complex way and difficult to easily comprehend. For example, the authors need to clarify several issues on the material they use and experiments they perform. I suggest substantial re-writing to better convey their messages. Sentences should be short and clear, data explained in the context of it was derived.

      Reply: We will take into consideration the reviewer #2’s suggestion and modify the manuscript to facilitate its comprehension.

      The following text edits and clarifications are required:

      20- Page 3. The authors write "cell-of-origin" in several places, this should be changed to "cells-of-origin" (i.e., plural). The view that all NBs originate from only one cell is too simplistic, and the authors should definitely edit this considering that they are investigating different subgroups of NB.

      Reply: We will correct this mistake.

      21- Page 5. The authors MUST define where the material from the 18 NB patients as well as fetal AG derive from. There is no reference, and taken from the material&methods section, the transcriptome data from these data has not been generated by the authors themselves?

      Reply: The transcriptomic data used herein have indeed been previously generated by two co-authors of the study, and the corresponding reference is cited (ref #20; (Gomez et al, 2015). All the NB samples included in our transcriptomic analysis were obtained at the time of diagnosis from patients attended at Hospital Sant Joan de Déu (HSJD, Barcelona, Spain). Tumors were evaluated by a pathologist and only the snap-frozen pre-treatment samples showing at least 70% of viable tumor content were included for analysis. Neuroblastoma risk assessment was defined by the International Neuroblastoma Staging System (INSS). Samples of normal fetal adrenal gland (n=2) were used as a non-tumoral reference tissue. Total RNA from frozen samples was extracted by TRIzol® Reagent. High quality RNA (RINe>7.00) was hybridized to Human Genome U219 microarray plates at the Functional Genomic Unit, Institut d’Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS, Barcelona, Spain) according to Affymetrix standard protocols. Microarray data are deposited in NCBI (#GSE54720). We will include this information in the Methods section of the revised version of the manuscript.

      22- Why do the authors choose to work with the data set from Jansky et al in particular?

      Reply: Actually, we chose to work with the data from Kildisiute et al. (ref #16 (Kildisiute et al, 2021). We selected this study because: 1) they identified the transcriptional signatures of the 4 SA subtypes relevant to our study and additional related signatures of interest (adrenal cortex, mesenchyme…), 2) these data were obtained from human tissues, and 3) these signatures were easily accessible from their supplementary data.

      23- Page 7, paragraph 1. The authors write "Remarkably, 67% of the 503 genes found in this signature, which formed the cluster c1, are both associated with a neural identity and a better patient outcome". I do not agree that this is remarkable and the authors should remove this word.

      Reply: We will remove the term “Remarkably”.

      24- Page 10, second paragraph. The authors should clarify that the expression levels of the displayed genes are derived from qPCR analysis (if I have understood that correctly, I had to find and guess this from the M&M section), as well as explain how they have set the scale for sphere-forming capacity and what this corresponds to. What do the colors actually represent?

      Reply: We apologize for this information lacking in the Results section. We will precise that the expression levels were assessed by RT-qPCR, we will add a representation of their expression levels in basal culture conditions and will improve the explanation of the assessment of the sphere-forming capacity of the NB cell lines.

      25- Page 11 and onwards. The authors write "deprived of their a-NRXN1+ subpopulation". This is highly confusing and difficult to read. The authors should write "a-NRXN1- subpopulation"

      Reply: We will follow the reviewer #2’s recommendation and change the text for "α-NRXN1- subpopulation".

      26- Page 11, end of paragraph 2. The authors write "...arguing that NXPH1/α-NRXN signaling could control NB growth and/or aggressiveness". Number of cells do not directly correlate to aggressiveness, and this needs to be re-phrased to only state what the experiment actually shows - proliferation of a-NRXN1- cells.

      Reply: We will rephrase this sentence according to the reviewer #2’s recommendation.

      27- I am in favor of using the CAM assay as a complementary system. The authors however use this to define "...required to support tumor growth in vivo". The CAM assay using NB (i.e., transformed cells) shows the growth of these cells in response to presence of blood vessels and not a full tumor micro-environment. This should be clarified.

      Reply: We will clarify this aspect.

      28- As stated in the previous comment, the authors write "...required to support tumor growth in vivo". The next paragraph has the following headline "NXPH1/α-NRXN signaling stimulates NB growth". This is to me the same thing. The authors should elaborate on how these differ, or if they use them to show the same thing, clearly state that.

      Reply: We will better explain how these distinct pieces of evidence are complementary and reinforce the conclusion that NXPH1/α-NRXN signaling stimulates NB growth.

      29- The authors conclude that NXPH signaling can be used as a therapeutic target. This would however be extremely difficult considering the opposing effects shown on growth vs metastasis. I agree that it is important to find means to inhibit metastasis, but that does not mean we can allow for enhanced growth of the primary tumor. A better reflection would be to use this as a biomarker, but this can only be predictive/speculative since the authors do not perform for example a tissue microarray to show this at IHC protein level, something that is currently the practice in the clinic.

      Reply: While we agree with reviewer #2 that we cannot allow for enhanced growth of the primary tumor, we believe that having identified a secreted factor whose activity inhibits NB metastatic potential is a novel and important finding. We did not wish to suggest that our experimental setup could be directly applied to inhibit the metastatic dissemination of HR-NB tumors. However, we believe that our findings can set the basis of a therapeutic design in which NXPH1/α-NRXN signaling would be enhanced locally to prevent/block metastases from HR-NB tumors.

      As mentioned in the Discussion section (page 17), one study reported that NXPH1 can be used as a DNA methylation biomarker associated with a good prognosis for NB patients (reference #54, (Decock et al, 2016).

      **Material and Methods:**

      30- The authors have misspelled the cell line SK-N-BE(2)c.

      Reply: Indeed. We will correct this mistake.

      31- Why are some cell lines grown in 20% FBS? This is not standard and could impact the results.

      Reply: The 3 cell lines of our panel which were grown in 20% FBS correspond to the 3 cell lines of the mixed subtype, including SK-N-SH, SK-N-Be(2)c and IMR-32 cells. These cell lines have been established and originally grown in presence of a high FBS content (Tumilowicz et al, 1970; Biedler et al, 1973; Ciccarone et al, 1989). In our hands and as recommended by the colleagues that provided us with these cell lines, growing the cell lines in presence of 20% FBS was indeed crucial to sustain their morphological heterogeneity. While we agree with the reviewer #2 that culture conditions could affect cell behavior in vitro, we wish to emphasize that our main conclusions are derived from in vivo experiments. We are thus convinced that our main findings were not impacted by in vitro culture conditions.

      32- The authors should state what the tumor volume limit in their ethical permit is (page 30).

      Reply: The tumor volume limit was set at 1,500 mm3 as specified by our ethical committee.

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      33- Fig. 3E. As discussed elsewhere, the authors should clarify the scale/meaning for sphere-forming capacity

      Reply: As explained above, we will modify the representation of the results shown in Fig.3E to improve their clarity and facilitate their comprehension.

      34- Fig. 4D. What do the numbers (i - vi) refer to? I cannot find this in the figure, figure legend, text or material&methods.

      Reply: These numbers were used to call different tumors and show their GFP content, as appearing in the lower part of this panel. We will precise this information in the corresponding figure legend.

      35- The authors do not present the tumor volume in Fig. 4. The authors discuss tumor growth in the text and this data should be included.

      Reply: We agree with the reviewer #2’ suggestion and will present tumor volumes in the revised version of the manuscript.

      36- Fig. S4. Knockdown efficiency is variable, and efficiency does not correlate to growth capacity. Especially because of this, the authors need to investigate this at protein level.

      Reply: As requested by the reviewer #2, we will investigate the knockdown efficiency at the protein level.

      Reviewer #2 (Significance (Required)):

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field. This paper is partly within the scope of the ongoing debate of NB origin (see references below). This paper is however not as extensive, provides no new patient material (applies data from Jansky et al), and do not address the actual cell-of-origin (which the authors themselves also clearly states). This paper provides new gene signatures that can be used to define low-risk vs high-risk patients which is highly important to the field, but these signatures are not unexpected and does not add a significant advance to the field. The authors also do not address how these signatures could be applied clinically. The authors do not use any new methodology. With this said, with revision of the paper, it will still add to the current focus on NB biology research.

      • Place the work in the context of the existing literature (provide references, where appropriate). 37- There is a recently initiated, important and extensive debate about the cells-of-origin for NB. The authors indeed bring this up in the paper and also state that they do not intend to add to this debate. They use data from one paper from referenced debate above, and I would argue that because of this fact, and that they extract gene signatures from it, they do, at least partly, touch on the NB cells-of-origin debate, and the authors should put their results into context, from a big picture perspective. As of now, they dodge this complex issue.

      References: Dong et al., Cancer Cell 2020; Hanemaaijer et al., PNAS 2021; Jansky et al., Nat Genet 2021; Kameneva et al., Nat Genet 2021; Kildisiute et al., Sci Adv 2021; Furlan et al., Science 2017).

      Reply: We understand the reviewer #2’s argument. We will thus try to put our results in perspective regarding the NB cells-of-origin.

      • State what audience might be interested in and influenced by the reported findings.

      This paper will be interesting for scientists within the neuroblastoma field, in particular those working on defining NB subgroups in correlation to developmental stages.

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate. Reviewer's expertise: Neuroblastoma, neural crest, trunk neural crest, chick embryos, mouse models, in vitro models

      Parts of paper outside expertise of the reviewer: Analysis of the bioinformatics data (i.e., extracting signatures).

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      **Summary: short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).**

      Dr. Le Dréau and colleagues provide a transcriptional analysis comparing low versus high-risk neuroblastomas using a cohort of 18 patients. By comparing them to normal transcriptional signatures obtained from human fetal adrenal glands, they conclude that low risk tumors share a specific core sympatho-adrenal developmental program, which is associated with favorable patient prognosis. Among this signature, they specifically assess the role of NXPH1/a-NRXN in vitro and in vivo using different models (cell lines, PDX-derived cell lines, mouse and chick xenografts). They propose that NXPH1/a-NRXN axis promotes neuroblastoma tumor growth via cell proliferation, and inhibits metastases.

      **Major comments:**

      • Are the key conclusions convincing?

      There are major points that need to be addressed before being convinced by the conclusions.

      1- The choice for using the SKNSH cell line, among others, to study the role of NXPH1/a-NRXN axis need to be explained. Indeed, whereas there is no cell line modeling low-risk neuroblastoma, authors should properly illustrate the basal expression of NXPH1/a-NRXN in the cell lines and not the ratio provided in Fig3E. If I understood well, according to Fig3F, only 1.5% of SKNSH cells express NRXN1. Could the authors provide FACS plots? And explain how they can sort high and low expressing cells among 1.5%? And then how do they justify using shRNA approach in a cell line in which only 1.5% are concerned cells? Also, the authors should prove the efficiency of the shRNA used on each specific target for both in vitro and in vivo studies by WB.

      Reply: We chose the SK-N-SH cell line for experimental assays based on the following facts: 1) the expression levels of NXPH1 and α-NRXN1/2, 2) this cell line showed the highest sphere-forming capacity, 3) this cell line harbored the highest percentage of α-NRXN1+ cells detected by FACS among the different cell lines tested, and 4) this cell line is of a mixed type, which is supposed to encompass more cell heterogeneity than other (N, I and S) types, thus reproducing more faithfully the complex heterogeneity of primary NB tumors.

      In a revised version of the manuscript we will provide data on NXPH1 and a-NRNX1/2 expression levels in basal culture conditions, and will improve the representation of Fig.3E to facilitate its comprehension. The method used to sort α-NRXN1+high, α-NRXN1+low and α-NRXN1- cells is explained in the Methods section (page 25). As requested by the reviewer #3, we will provide FACs plots to illustrate the sorting method.

      As requested by the reviewers #2 and #3, we will assess the shRNA efficiency by testing the knockdown at the protein level.

      2- The conclusion of inhibition of metastatic process is not supported by enough data. To achieve such a conclusion, authors should provide more than one in vivo experiment (which need to be completed already with the proof of protein deregulation). Some in vitro characterization of metastatic properties such as invasion and migration assays and/or transcriptional analyses could be done.

      Reply: We agree with the reviewer #3 that that our conclusion on the anti-metastatic ability of NXPH1/α-NRXN signaling would be reinforced by complementary experiments. To this aim, we propose to test how NXPH1/α-NRXN knockdown affects the metastatic potential of the SK-N-SH cell line and of another cell line in the CAM assay. This assay can indeed be used to assess not only NB tumor growth (as we already did), but also NB cell invasion in target organs (such as the liver and the bone marrow), thus mimicking a metastatic dissemination.

      3- Why don't the authors use the transcriptome dataset of 498 patients to realize a more powerful comparative study of low versus high risk tumors (Zhang et al, Genome biology, 2015)? The authors should show the expression plot of NXPH1/a-NRXN in low versus high risk patients, in their cohort of 18 patients but also in the cohort of 498 patients. How do the authors reconciliate the idea that NXPH1/a-NRXN could be associated to stem cell identity but low risk tumors?

      Reply: As explained in response to the reviewer #1’ (comment #1), our strategy entailed comparing the transcriptomic signatures of LR-NB and HR-NB samples to the one of a non-tumoral, healthy tissue such as the fetal adrenal gland. The online SEQC database cited by reviewers #1 and #3 does not contain healthy samples, and therefore could not be used for this initial step of the analysis. This is why we decided to use as a starting point the cohort of samples from the Hospital Sant Joan de Déu, which further provided the advantage of comparing samples from patients all diagnosed and treated at the same facility. We then used the SEQC database at different steps of our analytic strategy (Fig.S1F, 2F-H, 3A-B, S3) to test the relevance and coherence of the results obtained with the HSJD cohort in the context of a larger cohort.

      As requested by the reviewer #3, we will provide a dot-plot representation of the expression levels of NXPH1 and its receptors in both the HSDJ and SEQC cohorts.

      At this point we can only speculate on the association between NXPH1/α-NRXN expression and stem cell identity. This correlation might simply reflect the fact that cells from human NB cell lines return to a transcriptional program closer to their neural crest-derived identity when grown in sphere-forming conditions (as suggested by the increased expression of p75/NTR). Alternatively, this correlation might reflect the ability of NXPH1/α-NRXN signaling to retain cells in an immature state. Such ability could explain how NXPH1/α-NRXN signaling participates in promoting primary tumor growth, and the fact that their expression is increased in LR-NBs as compared to normal fetal adrenal gland. On the other hand, NXPH1/α-NRXN expression is higher in LR-NBs than in HR-NBs, and our findings suggest that this is linked to the anti-metastatic ability of NXPH1/α-NRXN. We could further imagine that by favoring stem cell identity NXPH1/α-NRXN signaling might also provide LR-NB cells with an enhanced ability to “re-enter” a normal developmental path or to be eliminated. In this regard, it is worth reminding that LR-NBs are detected earlier during development than HR-NBs, and sometimes show the puzzling ability to regress spontaneously.

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      Details of experiments and supplementary experiments have to be provided (see previous question).

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      4- Additional details of the experiments have to be provided (cf above): levels of expression of NXPH1/a-NRXN in cell lines, FACS analyses to prove enrichment and/or depletion, WB for validation of shRNA knock-down etc...Moreover, the conclusion that NXPH1/a-NRXN axis inhibits metastatic potential of neuroblastoma is supported by only one in vivo experiment using SKNSH cell line with shRNAs anti-NXPH1/a-NRXN. It should be completed with invasion/migration assays in vitro for example, and/or transcriptional signature of tumors obtained +/- shRNAs anti-NXPH1/a-NRXN. Ideally, these results could be validated in an additional model.

      Reply: As mentioned above, we will perform additional experiments following the reviewer #3’s recommendations, including assessing NXPH1 and α-NRXN1 knockdown efficiency at the protein level, assessing how knocking down NXPH1 and α-NRXN1 expression alters the in vivo growth and metastatic potential of an additional cell line, and studying how knocking down NXPH1 and α-NRXN1 alters the metastatic potential of NB cells using a second and complementary metastasis assay (CAM assay).

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Yes

      • Are the data and the methods presented in such a way that they can be reproduced?

      Yes, Material and methods section is well described. But, several points are missing:

      5- The paragraphs describing how RNAs from 18 patients and fetal adrenal glands were obtained, how good was the quality, and how transcriptomes have been realized and sequenced are missing.

      Reply: The transcriptomic data used herein have indeed been previously generated by two co-authors of the study, and the corresponding reference is cited (ref #20; (Gomez et al, 2015). All the NB samples included in our transcriptomic analysis were obtained at the time of diagnosis from patients attended at Hospital Sant Joan de Déu (HSJD, Barcelona, Spain). Tumors were evaluated by a pathologist and only the snap-frozen pre-treatment samples showing at least 70% of viable tumor content were included for analysis. Neuroblastoma risk assessment was defined by the International Neuroblastoma Staging System (INSS). Samples of normal fetal adrenal gland (n=2) were used as a non-tumoral reference tissue. Total RNA from frozen samples was extracted by TRIzol® Reagent. High quality RNA (RINe>7.00) was hybridized to Human Genome U219 microarray plates at the Functional Genomic Unit, Institut d’Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS, Barcelona, Spain) according to Affymetrix standard protocols. Microarray data are deposited in NCBI (#GSE54720). We will include this information in the Methods section of the revised version of the manuscript.

      6- The sequences of shRNAs have to be provided.

      Reply: We will provide the oligo sequences used to generate shRNAs against NXPH1 and aNRNX1.

      -Are the experiments adequately replicated and statistical analysis adequate?

      Yes

      **Minor comments:**

      • Specific experimental issues that are easily addressable.

      7- The authors should show the basal expression of their genes of interest in the patients, cell lines and PDX-derived cell lines, to justify the choice to work then with SKNSH cell line. In addition to the ratio they show in Fig3.E.

      Reply: As explained above, we will provide more details on the expression levels of NXPH1 and α-NRXN1/2 in the HSJD and SEQC cohorts and in the NB cell lines used in our study.

      8- Authors should also illustrate the FACS analyses they realized in this study, in order to appreciate the quantity of positive cells that are either enriched or depleted.

      Reply: The methodology used to sort α-NRXN1+high, α-NRXN1+low and α-NRXN1- cells is explained in the Methods section (page 25). As requested by the reviewer #3, we will provide FACs plots to illustrate this methodology.

      9- Authors could precise in their schemes that DOX is maintained in vivo.

      Reply: We will follow the reviewer #3’s suggestion.

      10- The number of mice need to be integrated in each experiment.

      Reply: The numbers of mice used to assess growth and metastasis in vivo were included in the corresponding figure legends (Figs. 4D, 4E and 5C) and appear discreetly on the panel 4D (to the right). We will follow the reviewer #3’s recommendation and add these details on the corresponding figure panels.

      • Are prior studies referenced appropriately?

      11- No some elements are not right in the introduction:

      • Mutations of PHOX2B are not associated to poor prognosis.

      • Original publications could be provided instead of reviews.

      • Genetic alterations in NB are not only 16%, the authors forgot to mention TERT and ATRX.

      • Maybe the NXPH1 methylation in ref 54 could be more explicit, if DNA methylation is detected on NXPH1, it would be of poor prognosis because driving low expression ..?

      Reply: We will follow the reviewer #3’s critics and correct the mistakes and information lacking in the introduction and discussion sections.

      • Are the text and figures clear and accurate?

      Text is very clear and well written, as are the figures.

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Already mentioned before: FACS and WB are needed.

      Reviewer #3 (Significance (Required)):

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field. This work aimed first at better understanding the fundamental transcriptional differences between two risk groups in Neuroblastoma. The authors defend the conceptual idea that these two groups represent two distinct diseases, which is not new in the area but requires attention, and indeed supported here by distinct transcriptional signatures.

      Using published signatures of fetal sympatho-adrenal system, they define a core transcriptional program that is more strongly expressed in low-risk tumors, but we already know that tumors of this category are often more differentiated, and by definition expressed strong markers of SA differentiation, whereas high-risk tumors have a more undifferentiated phenotype.

      Not surprisingly, several genes as well as a specific gene signature could be associated to better prognosis, a well-known characteristic of low-risk tumors. However, among them, a novel axis (NXPH1/a-NRXN) is proposed to explain the proliferation but absence of metastasis that define the low-risk group.

      • Place the work in the context of the existing literature (provide references, where appropriate).

      Neuroblastoma is a rare and very heterogeneous disease, in terms of biology and clinical presentation. To try to decipher such a heterogeneity, recent works have allied single cell transcriptome analyses on tumors and on human fetal cells during development. These studies are well cited in the discussion, and I agree that they yielded discrepant conclusions concerning the cell(s) of origin of Neuroblastoma. By comparing the normal developing human adrenal gland cells to cells from series of neuroblastomas, most of the studies converge towards that the tumors resembled differentiating adrenal neuroblasts. In one study, MYCN-amplified neuroblastoma cells (high risk group) were most similar to normal neuroblasts from seven- or eight-week post-conception, while lower-risk neuroblastomas included more cells resembling late neuroblasts (Janksy et al, 2021).

      During the submission of this work, another paper using single cell technologies was published and supported the idea of two distinct tumor entities (Bedoya-reina et al, 2021), with also a stronger signature of sympatho-adrenal cells in low risk tumors.

      • State what audience might be interested in and influenced by the reported findings. As the current clinical classification based on various criteria already allows clinicians to identify well low-risk tumors, I think this work would mainly attract fundamental researchers on the molecular differences between low-risk and high-risk tumors.

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate. I'm specialized on pediatric cancers, and especially Neuroblastoma for the past 3 years. I'm interested in tumor cell identity and cell plasticity, particularly in response to treatment. I think that I have sufficient expertise to evaluate all parts of the manuscript.

      __ __

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. If no revisions have been carried out yet, please leave this section empty.

      __ __

      4. Description of analyses that authors prefer not to carry out

      1: The reviewers #1 and #3 suggested using a large dataset (the SEQC database containing 498 samples) to realize a more powerful comparative transcriptomic study. However, to define the transcriptional signatures associated to the aetiology of LR- and HR-NBs and to the malignant behavior of HR-NBs, we first needed to compare the transcriptomic signatures of LR-NB and HR-NB samples to the one of a non-tumoral, healthy tissue such as the fetal adrenal gland. The online SEQC does not contain any healthy samples, thus precluding the possibility to use it for the first step of our analysis. This is why we decided to use as a starting point the cohort of samples from HSJD cohort, which further provided the advantage of comparing samples from patients all diagnosed and treated at the same facility. In different steps of our analytic strategy (Fig.S1F, 2F-H, 3A-B, S3), we already used the SEQC database to test the relevance of the results in the context of a larger cohort. In doing so we obtained coherent results. As recommended by the reviewer #1 (comment #3), we will further compare our data with the data from another study (Bedoya-Reina et al, 2021). We thus believe that the request of the reviewers #1 and #3 does not need to be addressed within the scope of a revision.

      2: The reviewer #2 suggested that some crucial functional experiments should be repeated with another shRNA construct to ensure that our results are not off-target and because the knock-down efficiency of the shRNAs used was variable and not convincing. We have tested two distinct shRNA constructs for both NXPH1 and α-NRXN1, which all comparably reduced cell viability. Following this reviewer’s suggestion we will assess possible off-target effects of the sh-NXPH1 and sh-aNRXN1 constructs by RT-qPCR. Moreover, we will also test the effects of these sh-NXPH1 and sh-aNRXN1 constructs in another cell line and using a novel and complementary metastasis assay (CAM assay).

      3: The reviewer #2 suggested to study the metastatic potential of NB cells by performing orthotopic implantations of NB cells into the mouse adrenal gland instead of performing cell injections into the mouse left cardiac ventricle. We reasoned that cell injections into the left cardiac ventricle were ideal to test the organo-tropism of metastatic NBs, as this methodology facilitates cell dissemination and colonization of organs targeted by NB metastasis such as the liver and bone marrow, as shown in Fig.5B. Cell implantation into the adrenal gland is especially useful to study cell growth in one of the primary sites of NB formation. We thus believe that our current experimental approach is more relevant to the question we wish to address within the scope of a revision.

      References:

      Bedoya-Reina OC, Li W, Arceo M, Plescher M, Bullova P, Pui H, Kaucka M, Kharchenko P, Martinsson T, Holmberg J, et al (2021) Single-nuclei transcriptomes from human adrenal gland reveal distinct cellular identities of low and high-risk neuroblastoma tumors. Nat Commun 12: 1–15

      Biedler JL, Helson L & Spengler BA (1973) Morphology and Growth, Tumorigenicity, and Cytogenetics of Human Neuroblastoma Cells in Continuous Culture. Cancer Res 33: 2643–2652

      Ciccarone V, Spengler BA, Meyers MB, Biedler JL & Ross RA (1989) Phenotypic Diversification in Human Neuroblastoma Cells: Expression of Distinct Neural Crest Lineages. Cancer Res 49: 219–225

      Decock A, Ongenaert M, Cannoodt R, Verniers K, Wilde B De, Laureys G, Van Roy N, Berbegall AP, Bienertova-Vasku J, Bown N, et al (2016) Methyl-CpG-binding domain sequencing reveals a prognostic methylation signature in neuroblastoma. Oncotarget 7: 1960–72

      Gomez S, Castellano G, Mayol G, Sunol M, Queiros A, Bibikova M, Nazor KL, Loring JF, Lemos I, Rodriguez E, et al (2015) DNA methylation fingerprint of neuroblastoma reveals new biological and clinical insights. Epigenomics 7: 1137–1153

      van Groningen T, Akogul N, Westerhout EM, Chan A, Hasselt NE, Zwijnenburg DA, Broekmans M, Stroeken P, Haneveld F, Hooijer GKJ, et al (2019) A NOTCH feed-forward loop drives reprogramming from adrenergic to mesenchymal state in neuroblastoma. Nat Commun 10: 1–11

      Kildisiute G, Kholosy WM, Young MD, Roberts K, Elmentaite R, van Hooff SR, Pacyna CN, Khabirova E, Piapi A, Thevanesan C, et al (2021) Tumor to normal single-cell mRNA comparisons reveal a pan-neuroblastoma cancer cell. Sci Adv 7: eabd3311

      Tumilowicz JJ, Nichols WW, Cholon JJ & Greene AE (1970) Definition of a continuous human cell line derived from neuroblastoma. Cancer Res 30: 2110–2118

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary: The authors have identified a sporozoite gliding motility protein through bioinformatic analysis. From the main text I do not know how, or what bioinformatic analysis was performed, in order to focus on this protein which is called S14. The authors then go on to tag the protein, produce a KO and show its involvement in gliding motility. The KO shows that parasites lacking S14 fail to invade the mosquito salivary glands. This is due to a motility defect. Y2H and docking studies are used to define an interaction with MTIP and GAP45, two known components of the glideosome.

      Major comments: The paper is sometimes hard to follow and lacks clarity. The reason: important information is omitted, or explained at the end of a section rather than at first mention; experimental details that are of essence need to be mentioned or explained in the main text; there is ample use of the word 'bioinformatic' without explaining what kind of analysis was performed in the main text. I cite from the abstract: 'In silico analysis of a novel protein, S14, which is uniquely upregulated in salivary gland sporozoites, suggested its association with glideosome-associated proteins.' I cite from the introduction: 'A study comparing transcriptome differences between sporozoites and merozoites using suppressive subtraction hybridization found several genes highly upregulated in sporozoites and named them 'S' genes (Kaiser et al, 2004). We narrowed it down to a candidate named S14, which lacked signal peptide and transmembrane domains.' From reading the main text, I do not know why Plasmodium berghei S14 was chosen in this manuscript. S14 is one of 25 transcripts identified by Kappe et al in Plasmodium yoelii (https://doi.org/10.1046/j.1365-2958.2003.03909.x) to be upregulated in sporozoites. The material and methods section does not explain either why S14 was chosen. Perhaps the authors could update Figure 2 from Kappe et al with the most recent annotations from plasmodb.

      Reproducibility: None of the main Figures or Figure legends define ' N = '. For example I cite: 'The S14 KO clonal lines were first analyzed for asexual blood-stage propagation, and for this, 200 µl of iRBCs with 0.2% parasitemia was intravenously injected into a group of mice.' There are 2 mentions of 'N=' in the supplementary figures. I have not found any others.

      I'm not sure what the convention is. Should unpublished data for this gene (PBANKA_0605900) found in pberghei.eu (a database for mutant berghei parasites) be cited? After all it confirms their findings.

      The authors need to use more recent references for some of their statements; see some comments below.

      Minor comments:

      line

      1-2 Add the Plasmodium species of this study. abstract Which species do you work with? 29 mosquito salivary glands and human host hepatocytes 30 to the glideosome, a protein complex containing [...] 32-33 What kind of in silico analysis suggested S14 is part of the glideosome? S14 is not uniquely upregulated; there are other S-type genes identified by Kappe and Matuschewski. 25 I believe. 32 Please point out he species were S genes were identified. SGS of which species? 34 expression: change to transcription 39 What kind of in silico analysis was used here? and therefore malaria transmission 55 A single zygote transforms into a single ookinete, which establishes a single oocyst, which in turn can produce thousands of midgut sporozoites. Please correct the life cycle passage. located or anchored in the IMC? And located between the IMC and plasma membrane? 61-63 Refer to Table S1 and its contents here 64 Name the known GAPs.

      65-67 Which transmembrane domain proteins? Please add more recent references than King 1988. 71-72 TRAP was the first protein found to be ... 74-76 Add additional, more recent references: for example search Frischknecht and TRAP 76 S6 (TREP) is also [...] 88 Some of these proteins are also expressed in ookinetes. 89-91 The sentence needs a verb. 88-96 Please add some more recent glideosome papers. After 2013. 91 Why do you call it a peripheral protein? 91-93 There are more recent citations for GAP45 andGAP50. 96 Insert a reference here. 99 Please define the gliding-associated proteins. What are they? Aren't there papers on GAP40, 45 and 50? DOI: 10.1016/j.chom.2010.09.002 99 .... What prompted you to identify a novel GAP? And why is S14 classified as a GAP? 99-102 What kind of bioinformatic study? Why was S14 chosen? Please outline how you ended up with S14. Any other proteins that came out of the bioinformatic screen from the list of S genes? How many proteins were identified in the screen for sporozoite upregulated proteins by Kappe and Matuschewski? 102-103 Define the nonclassical secretion pathway. Please reference GAP45 and GAP50 data for the nonclassical pathway. 105 Please add P. berghei to the title, the abstract, the introduction. 111 The results section does not outline what bioinformatic analysis was used 112-114 Please specify the exact number of upregulated in sporozoites genes. I think it was 25. And add the species the study was performed in. Why did you choose the Kappe study but not the uis genes from berghei? 114-115 How did you narrow it down to S14? The Kappe paper lists 25 S-type genes from P. yoelii. 118 Plasmodia is not the plural for a group of different Plasmodium species. Use: [...] conserved among Plasmodium spp. 118-119 Which proteins did you analyze? And how did you analyze them? Where is the data for this analysis? Outline the amino acids that predict palmitoylation? The nonclassical pathway? 119-122 Here: do you mean S14 has similar properties as GAP 45 and GAP50? Define the nonclassical pathway? How do you know S14 is in the IMC? 122-123 Please reference the bioinformatic analysis plus URL that allows targeting to the IMC to be analyzed. 123-124 Please reference the URLs for TM, palmitoylation, and interactions analyses. 125-127 How did you predict that S14 is secreted via the nonclassical pathway? 128-130 Define the nonclassical pathway when it first appears in your manuscript. The citation Moskes 2004 is not in the reference list 132 Which membrane? 134-135 In which species? 141-142 Please include images of blood stage and liver stage parasites. 142-143 Which membrane? 148-149 I cannot find the specific figure you refer to; I checked the online version of the Frenal 2010 paper. 175 gland, we counted [...] 177 Compared to the 177-179 Failed to invade (absolutely)? Or invaded in highly reduced numbers? 182-186 Please be precise: I think you mean you let all types of mosquitoes take a blood meal; s14 knockout-infected mosquitoes did not infect mice. 181-202 Perhaps use paragraphs to indicate the different types of experiments performed here. 204 Please introduce paragraphs to identify the different experiments in this section 208 Outer or inner membrane of what? IMC, the plasma membrane? 228 onwards Structural models were obtained from whom? Which species did you use for the docking study? Could you use in one approach 3 berghei proteins, and confirm your docking studies with the falciparum proteins? That would strengthen your model. Should you include a negative control protein in the approach? 250-251 Was all of the gene cloned? Please define amino acid range. discussion Please discuss data from https://elifesciences.org/articles/77447 in relation to your protein

      298-300 More recent glideosome papers exist. For example https://doi.org/10.1038/s42003-020-01283-8 340 List the proteins you analysed. Add URL (websites) to the analyses tools. 343 Known association from the literature: how was this done? 346-349 A few glideosome components? On what basis were they selected and which are they?

      471 Can AlphaFold Structure Predictions be used in the docking studies? 487 What parts of theses genes was cloned? Define the amino acid range. 714 Please split the table into A Mosquito bite and B haemolymph Sporozoites Figure 1 For clarity, maybe write S14::mCherry Figure 1 It would be useful to show blood stage parasite images. Figure 1F You have not formally shown that this signal corresponds to palmitoylated S14. Could be heavy chain. Figure 2G Haemolymph sporozoites ? Figure 8 You argued that S14 is a membrane-bound protein through palmitoylation. Here the protein is shown to be cytoplasmic. Please update our model with more recent ones.

      Figure S2B It would be good to include a positive control for these PCRs. Figure S3 It would be good to include a positive control for these PCRs.

      Tabel S1 Table S1 is only mentioned twice in the text: lines 124 and 128. There is no mention that the table contains all (??) known gliding motility proteins. Table S1 The algorithms / websites used for bioinformatic prediction need to be listed here. Table S2 Add the plasmodb gene identifiers here. The table does not show all Plasmodium spp. but a selection.

      Significance

      General assessment: The authors provide an in-depth analyses of the Plasmodium berghei protein S14 and its involvement in gliding motility.

      Advance: This paper is the first analysis of the S14 protein. The authors suggest a bridging function for the protein between MTIP and GAP45.

      Audience: Gliding motility is of interest to the apicomplexan field. I think this particular proteins is specific to Plasmodium spp.

    1. Reviewer #1 (Public Review):

      Summary:<br /> In the present manuscript, the authors present the results of a well-designed, thoughtful, and well-motivated study, targeting the role of angular gyrus in insight-based memory gains. The study is well conducted, timely, and presents clear-cut behavioral results. However, the analysis of the EEG-data lacks clarity and leaves many open questions - especially with regard to the representational similarity analyses. (Nevertheless, analogous concerns with regard to the focus on the three-way interaction and the comparison of linked vs. non-linked events pertain similarly to the connectivity analyses.)

      Strengths:<br /> - Well-conducted study with a proper sham-controlled TMS design.<br /> - Clever insight-based memory task.<br /> - Interesting behavioral findings.

      Weaknesses:<br /> - "We then calculated Pearson's correlations to compare the power patterns across theta frequency between the time points of linked events (A with B), as well as between the time points of non-linked events (A with X) for the pre- and the post-phase separately, separately for stories linked via imagination and via observation." (p.34)

      The RSA basically asks on the lowest level, whether neural activation patterns (as measured by EEG) are more similar between linked events compared to non-linked events. At least this is the first question that should be asked. However, on page 11 the authors state: "We examined insight-induced effects on neural representations for linked events [...]". Hence, the critical analysis reported in the manuscript fully ignores the non-linked events and their neural activation patterns. However, the non-linked events are a critical control. If the reported effects do not differ between linked and non-linked events, there is no way to claim that the effects are due to experimental manipulation - neither imagination nor observation. Hence, instead of immediately reporting on group differences (sham vs. control) in a two-way interaction (pre vs. post X imagination vs. observation), the authors should check (and report) first, whether the critical experimental manipulation had any effect on the similarity of neural activation patterns in the first place.

      Overall, the focus on the targeted three-way interaction is poorly motivated. Also, a functional interpretation is largely missing.

      - "Interestingly, we observed a different pattern of insight-related representational pattern changes for non-linked events."

      It is not sufficient to demonstrate that a given effect is present in one condition (linked events) but not the other (non-linked events). To claim that there are actually different patterns, the authors would need to compare the critical conditions directly (Nieuwenhuis et al., 2011).

      - "This analysis yielded a negative cluster (p = 0.032, ci-range = 0.00, SD = 0.00) in the parieto-temporal region (electrodes: T7, Tp7, P7; Fig. 3B)." (p. 11)

      The authors report results with specificity for certain topographical locations. However, this is in stark contrast to the fact that the authors derived time X time RSA maps.

      "These theta power values were then combined to create representational feature vectors, which consisted of the power values for four frequencies (4-7 Hz) × 41 time points (0-2 seconds) × 64 electrodes. We then calculated Pearson's correlations to compare the power patterns across theta frequency between the time points of linked events (A with B), as well as between the time points of non-linked events (A with X) for the pre- and the post-phase separately, separately for stories linked via imagination and via observation. To ensure unbiased results, we took precautions not to correlate the same combination of stories twice, which prevented potential inflation of the data. To facilitate statistical comparisons, we applied a Fisher z-transform to the Pearson's rho values at each time point. This yielded a global measure of similarity on each electrode site. We, thus, obtained time × time similarity maps for the linked events (A and B) and the non-linked events (A and X) in the pre- and post-phases, separately for the insight gained through imagination and observation." (p. 34+35)

      If RSA values were calculated at each time point and electrode, the Pearson correlations would have been computed effectively between four samples only, which is by far not enough to derive reliable estimates (Schönbrodt & Perugini, 2013). The problem is aggravated by the fact that due to the time and frequency smoothing inherent in the time-frequency decomposition of the EEG data, nearby power values across neighboring theta frequencies are highly similar to start with. (e.g., Schönauer et al., 2017; Sommer et al., 2022)

      Alternative approaches would be to run the correlations across time for each electrode (resulting in the elimination of the time dimension) or to run the correlations at each time point across electrodes (resulting in the elimination of topographic specificity).

      At least, the authors should show raw RSA maps for linked and non-linked events in the pre- and post-phases separately for the insight gained through imagination and observation in each group, to allow for assessing the suitability of the input data (in the supplements?) before progressing to reporting the results of three-way interactions.

      References:<br /> Nieuwenhuis, S., Forstmann, B. U., & Wagenmakers, E.-J. (2011). Erroneous analyses of interactions in neuroscience: A problem of significance. Nature Neuroscience, 14(9), 1105-1107. https://doi.org/10.1038/nn.2886<br /> Schönauer, M., Alizadeh, S., Jamalabadi, H., Abraham, A., Pawlizki, A., & Gais, S. (2017). Decoding material-specific memory reprocessing during sleep in humans. Nature Communications, 8(1), 15404. https://doi.org/10.1038/ncomms15404<br /> Schönbrodt, F. D., & Perugini, M. (2013). At what sample size do correlations stabilize? Journal of Research in Personality, 47(5), 609-612. https://doi.org/10.1016/j.jrp.2013.05.009<br /> Sommer, V. R., Mount, L., Weigelt, S., Werkle-Bergner, M., & Sander, M. C. (2022). Spectral pattern similarity analysis: Tutorial and application in developmental cognitive neuroscience. Developmental Cognitive Neuroscience, 54, 101071. https://doi.org/10.1016/j.dcn.2022.101071

    1. Enclosed Alphanumerics unicode subset (adsbygoogle = window.adsbygoogle || []).push({});

      Ⓞ Ⓟ Ⓠ Ⓡ Ⓢ Ⓣ Ⓤ Ⓥ Ⓦ Ⓧ Ⓨ Ⓩ ⓐ ⓑ ⓒ ⓓ ⓔ ⓕ ⓖ ⓗ ⓘ ⓙ ⓚ ⓛ ⓜ ⓝ ⓞ ⓟ ⓠ ⓡ ⓢ ⓣ ⓤ ⓥ ⓦ ⓧ ⓨ ⓩ ⓪ ⓫ ⓬ ⓭ ⓮ ⓯ ⓰ ⓱ ⓲ ⓳ ⓴ ⓵ ⓶ ⓷ ⓸ ⓹ ⓺ ⓻ ⓼ ⓽ ⓾ ⓿

    1. Enclosed Alphanumerics unicode subset (adsbygoogle = window.adsbygoogle || []).push({});

      ① ② ③ ④ ⑤ ⑥ ⑦ ⑧ ⑨ ⑩ ⑪ ⑫ ⑬ ⑭ ⑮ ⑯ ⑰ ⑱ ⑲ ⑳ ⑴ ⑵ ⑶ ⑷ ⑸ ⑹ ⑺ ⑻ ⑼ ⑽ ⑾ ⑿ ⒀ ⒁ ⒂ ⒃ ⒄ ⒅ ⒆ ⒇ ⒈ ⒉ ⒊ ⒋ ⒌ ⒍ ⒎ ⒏ ⒐ ⒑ ⒒ ⒓ ⒔ ⒕ ⒖ ⒗ ⒘ ⒙ ⒚ ⒛ ⒜ ⒝ ⒞ ⒟ ⒠ ⒡ ⒢ ⒣ ⒤ ⒥ ⒦ ⒧ ⒨ ⒩ ⒪ ⒫ ⒬ ⒭ ⒮ ⒯ ⒰ ⒱ ⒲ ⒳ ⒴ ⒵ Ⓐ Ⓑ Ⓒ Ⓓ Ⓔ Ⓕ Ⓖ Ⓗ Ⓘ Ⓙ Ⓚ Ⓛ Ⓜ Ⓝ

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2023-02123

      Corresponding author(s): Holger Sültmann

      [The “revision plan” should delineate the revisions that authors intend to carry out in response to the points raised by the referees. It also provides the authors with the opportunity to explain their view of the paper and of the referee reports.

      The document is important for the editors of affiliate journals when they make a first decision on the transferred manuscript. It will also be useful to readers of the preprint and help them to obtain a balanced view of the paper.

      If you wish to submit a full revision, please use our "Full Revision" template. It is important to use the appropriate template to clearly inform the editors of your intentions.]

      1. General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      We would like to thank the editorial team of Review Commons for sending our manuscript for peer review and all Reviewers for carefully reading our manuscript. The reviewer’s detailed and constructive feedback and comments were instrumental to improve the quality and rigor of our manuscript. We highly appreciate the thoroughness of the review and have carefully considered all suggestions and concerns. Below, we have made point-by-point responses to the reviewer’s comments, and outlined revisions we plan to make, or have made. Textual changes in the revised manuscript are marked in Red.

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Daum AK et al. indicated that CAFs promote TKI resistance via lipid biosynthesis in ALK-driven lung adenocarcinoma cells. This work provides novel CAF-induced drug resistant mechanisms in ALK-driven lung cancer cells, however, there are several issues to be resolved by the following.

      Major critiques:

      1. The authors claimed that CAF-produced HGF and NRG1 boosts AKT signaling and de novo lipogenesis to promote ALK-TKI resistance in lung cancer cells. They showed that CAF-secreted HGF and NRG1 inhibition attenuates tumor cell viability in the presence of FB2-CM, however, whether stromal HGF and NRG1 inhibition suppresses de novo lipogenesis in carcinoma cells has not yet been investigated. The authors should inhibit HGF and NRG1 expression in CAFs by shRNA prior to addition of CAF-CM onto cancer cells to evaluate AKT signaling and de novo lipogenesis.

      Author response:

      The necessity of demonstrating the direct impact of stromal HGF and NRG1 inhibition on de novo lipogenesis in cancer cells is a crucial aspect of our study, and appreciate the reviewer’s valuable suggestion to inhibit HGF and NRG1 expression in CAFs before assessing the effect on AKT signaling and de novo lipogenesis. To address this concern, we will conduct additional experiments to evaluate the impact of HGF and NRG1 expression knock-downs in CAFs by shRNA.

      This work lacks human correlation. Are HGF and NRG1 expressions in CAFs related with de novo lipogenesis, drug resistance and poor outcomes in ALK-driven lung cancer patients?

      __Author response: __

      We agree with the reviewer’s point that human correlation would underline the significance of the given findings. However, conducting such analyses presents certain challenges.

      1. The availability of ALK-mutated samples among TCGA samples is limited and the sample size is quite small (n = 5), making survival analyses less statistically meaningful due to low statistical power.
      2. Using bulk RNA-seq data for this analysis necessitates deconvolution methods to differentiate between tumor and stromal cell compartments. While deconvolution methods are valuable, they have limitations, including potential inaccuracies in estimating cell-specific gene expression due to the inherent heterogeneity of cell populations (1). This may lead to imprecise conclusions about the specific contributions of stromal factors, such as CAF-secreted HGF and NRG1, in the tumor microenvironment. Nonetheless, we are considering leveraging the available dataset of Maynard et al.(2), to address the raised concerns by the reviewer. Here, the authors performed single-cell RNA-seq on clinical biopsies, including a number of ALK+ samples from both treatment-naive and progressive-disease patients. The analysis of this dataset could allow us to investigate whether the effects observed in our study hold true in the in vivo human tissue environment, providing a more direct and clinically relevant assessment.

      The most experiments lack the appropriate control for CAFs. They would use primary isolated counterpart fibroblasts as the patient-specific control for lung cancer CAFs by extracting from non-cancerous regions of the same individual in their experiments.

      __Author response: __

      This point is well taken. Therefore, we will take this feedback into account and incorporate the suggested controls into our experimental design to enhance the robustness and validity of our results.

      Minor issues:

      In Fig 1B, coculture with TGF-b-treated MRC-5 attenuated cancer cell death with the lorlatinib treatment. However, HGF and NRG1 production is comparable between MRC-5 cells treated with or without TGF-b in Fig 5A. These data indicate that any fibroblasts but not CAFs could suppress cancer cell death with the lorlatinib treatment.

      __Author response: __

      We recognize the need for further clarification. To address this, we plan to include additional data to demonstrate the differences in tumor therapy response when cultured with CM derived from native fibroblasts versus TGF-β1-activated fibroblasts (CAFs). This will help elucidate the specific role of CAFs in suppressing cancer cell death with lorlatinib treatment and provide a more comprehensive understanding of the observed effects.

      The pAKT induction of H3122 treated with lorlatinib in the presence of CAM-CM or HGF or NRG in Fig 7A, B is barely observed and lacks the significance.

      Author response:

      We acknowledge the need for more robust data to demonstrate the significance of the observed pAKT induction in H3122 cells treated with lorlatinib in the presence of CAF-CM, HGF or NRG. We will work to provide additional data that strengthens the significance of this effect.

      Reviewer #1 (Significance (Required)):

      The concept of this work is interesting, however, molecular mechanisms underlying CAF-medicated ALK-TKI resistance remain poorly elucidated. Characterization of human primary fibroblasts (FB1, FB2) is not clearly described, and the most experiments lack proper control. Immunoblot in Fig 6 and 7 looks snap-shot and the reviewer has concerns about the reproducibility.

      Author response:

      In response to the comments, we will revise our manuscript to provide a more detailed characterization of the human primary fibroblasts to ensure transparency. To address the concern regarding controls, we will implement further controls in our experimental procedures. Regarding the concern about the reproducibility of the immunoblots, we appreciate the feedback, and we will provide additional data in the manuscript to ensure the reproducibility of our results.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this review, the authors describe a possible mechanism of resistance in lung tumor cells that is induced by CAFs in response to ALK-specific inhibitors. The manuscript is well written and conclusions are in general well supported by experimental data, however additional experiments are needed to fully support the conclusions stated by the authors.

      1. Brigatinib and lorlatinib are used to target ALK-driven lung tumor cells. In many of the experiments described heterotypic 3D co-cultures are used. Although both inhibitors act preferentially over the cancer cells, and no effect is expected in the CAFs, it would be desirable to confirm it.

      Author response:

      We thank the reviewer for this endorsement of our study, and we are pleased to learn that our conclusions are generally supported by the experimental data. Regarding the use of brigatinib and lorlatinib to target ALK-driven lung tumor cells in our experiments, we acknowledge the importance of confirming the specific action of these inhibitors. While these inhibitors act preferentially on the cancer cells, we agree that it is desirable to confirm their limited effect on CAFs. Therefore, we will address the reviewer’s suggestion by performing further cell viability analyses of TKI-treated fibroblast spheriods to specifically assess the impact of brigatinib and lorlatinib on CAFs.

      The authors demonstrated initial proliferation advantage and apoptosis protection of H2228 and H3122 cells in response to conditioned media (CM) of three independent CAF clones. However, once they identify lipid enzymes altered and specific ligand-receptor interaction, then they focus only in FB2. This imply that the mechanism described by authors is relevant for that particular clone, but it does not validate a general resistance mechanism induced by CAFs. In order to claim that this is a more general mechanism, other CAF clones should be tested.

      Author response:

      We appreciate the reviewer’s comment regarding the focus on a specific CAF clone (FB2) in our study. This point is well taken, and we understand the importance of demonstrating the generalizability of the resistance mechanism induced by CAFs.

      Since our results indicated consistent therapy response across all CAF clones tested (Figure 1), with the most pronounced effect observed with FB2-CAFs, we chose to focus our efforts on conducting sc-RNAseq experiments with FB2-CAFs and subsequently performed downstream validation experiments to corroborate our findings. This approach allowed us to prioritize the CAF clone with the most robust response while acknowledging the broader therapy response observed in all tested clones. Nevertheless, as requested by the reviewer, we will perform additional experiments using other independent CAF clones to assess whether the identified mechanism is broadly applicable.

      Authors show that the identified ligands secreted by CAFs (HGF, NRG1β1, etc) are found in conditioned media from CAFs. It would be good to determine if the amount of these ligands somehow is dependent on the presence of tumor cells and/or ALK-TKi. Additionally, both HGF, NRG1β1, are able to partially restore the expression of the lipogenic enzymes identified, or AKT activation pathway, but they are not able to completely restore it. Since CAF-derived CM would have both factors, maybe combination of both ligands may induce stronger rescue of the expression of these proteins.

      Author response:

      To provide a comprehensive understanding, we will investigate whether the levels of identified CAF-derived ligands are influenced by tumor cells and/or ALK-TKI treatment by performing additional assays on CAF-supernatants. Furthermore, we will explore the potential synergy between HGF and NRG1β1 in rescuing the expression of lipogenic enzymes and the AKT signal transduction pathway on protein level.

      Does PI3K/mTOR inhibitors revert the proliferation advantage of 3D heterotypic cultures? And expression of lipid biosynthesis genes?

      Author response:

      To address the impact of PI3K/mTOR inhibitors on the proliferation advantage of CAFs on ALK+ lung tumor spheroids and the expression of lipid metabolic genes, we will conduct treatment experiments using agents such as alpelisib (PI3Kα inhibitor), ipatasertib (panAKT inhibitor), or everolimus (mTORC1/2 inhibitor). Cell viability will be assessed using the 3D CellTiterGlo assay, and we will investigate changes in the expression of lipid biosynthesis genes to comprehensively evaluate the effects of these inhibitors on the resistance mechanism induced by CAFs.

      Reviewer #2 (Significance (Required)):

      This study underscores the multifaceted nature of resistance mechanisms in ALK-rearranged lung adenocarcinomas, highlighting the pivotal role of CAFs and lipid metabolic reprogramming. Lung adenocarcinoma remains a challenge in oncology, and while targeted therapy with tyrosine kinase inhibitors (TKIs) has shown promise in treating ALK-rearranged lung adenocarcinomas, the development of resistance to these therapies is nearly inevitable. This study delves into a critical aspect of this resistance by describing an important aspect of the intricate interplay between cancer-associated fibroblasts (CAFs) and tumor cells within the tumor microenvironment.

      One of the primary findings of this research is the impact of CAFs on the therapeutic response of ALK-driven lung adenocarcinoma cells. While intrinsic mechanisms within cancer cells are well-studied drivers of resistance, this study underscores the emerging importance of stromal components, particularly CAFs, in shaping therapeutic vulnerabilities. The observation that CAFs promote therapy resistance by hampering apoptotic cell death and fueling cell proliferation highlights the complexity of tumor-stroma interactions.

      The study utilizes three-dimensional (3D) spheroid co-culture models, providing a more physiologically relevant platform to investigate these interactions. This approach bridges the gap between conventional monolayer cultures and in vivo models, allowing for a deeper understanding of the role of the tumor microenvironment.

      Perhaps one of the most notable findings is the identification of lipogenesis-related genes as major players in TKI-treated lung tumor spheroids. This finding not only sheds light on a previously underexplored facet of cancer biology but also suggests that lipid metabolism may be a central determinant of therapeutic susceptibility in this context. Although data provided here suggests that it might not be the only mechanisms taking place in the development of resistance to ALK inhibitors, it clearly shows that it plays an important role in it.

      The study proposes a potential solution to overcome CAF-driven resistance by targeting vulnerabilities downstream of oncogenic signaling. The simultaneous targeting of ALK and SREBP-1, a key regulator of lipogenesis, emerges as a promising strategy to thwart the established lipid metabolic-supportive niche within TKI-resistant lung tumor spheroids.

      Author response:

      We thank the reviewer for this endorsement of our study and are gratified that the reviewer recognizes the critical implications of our research in the context of ALK-rearranged lung adenocarcinomas and their treatment resistance.

      One of the stronger limitations of this study is that it rely on a limited number of cell lines or patient-derived models, which may not fully capture the heterogeneity of ALK-rearranged lung adenocarcinomas. Furthermore, for most of the validatory assays performed, only a CAF cell line is used, higly limiting the significance of their conclusions to a more general resistance mechanism. Furthermore, the study provides valuable insights into the involvement of lipogenesis-related genes and AKT signaling but do not delve deeply into the precise molecular mechanisms underlying these processes. Further mechanistic studies are needed to understand the exact interactions and signaling pathways involved. In conclusion, while this study provides valuable insights into the role of CAFs and lipid metabolism in ALK-TKI resistance, its limitations underscore the need for further research, including more comprehensive in vivo models and clinical studies, to confirm and expand upon these findings.

      Author response:

      We appreciate the reviewer’s thoughtful assessment of our study and acknowledge the limitations highlighted in your comment. The limited number of cell lines and patient-derived models used in our study is indeed a limitation, and we agree that this may not fully capture the heterogeneity of ALK-rearranged lung adenocarcinomas. However, the number of ALK+ lung adenocarcinoma cell lines is limited (3), as is the availability of patient-derived tissue material. To address this, we are actively working on expanding our research to include a more comprehensive range of models (e.g. primary ALK+ lung cancer cells, patient-derived organoids (PDOs)) for future studies.

      We also acknowledge the importance of the limitations related to the use of a single CAF cell line in many of our validation assays. We are committed to broadening our experimental scope to involve multiple CAF cell lines to strengthen the significance of our conclusions.

      Regarding the need for deeper mechanistic studies to understand the precise molecular interactions and signaling pathways, we agree that this is a crucial point. To this end, we are planning additional mechanistic studies to uncover the exact molecular mechanisms underlying the described resistance processes in future studies.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: In this manuscript the authors use a spheroid co-culture model of human EML4-ALK non-small cell lung cancer (NSCLC) cell lines and fibroblasts to investigate the mechanism by which tumor-CAF crosstalk mediates non-genetic resistance to ALK inhibition. Spheroid culture of tumor cell lines with CAFs or CAF conditioned media was sufficient to reduce apoptosis and boost proliferation in the context of ALK inhibition. Using single-cell RNA-seq (scRNA-seq) of co-cultures treated with lorlatinib, the authors show that lipogenesis-associated genes are enriched in drug-treated tumor cells co-cultured with CAFs. Specifically, the authors propose that CAF-derived HGF and NRG1 derepress lorlatinib-induced changes in oncogenic Akt and mTOR signaling to boost expression of SREBP and FASN and restore lipid composition in NSCLC cancer cells.

      Major comments:

      The authors' conclusion that the effect of CAF CM on lorlatinib sensitivity is mediated by HGF and NRG is somewhat weak. Nrg1B1 was sufficient to rescue cell viability in the context of lorlatinib treatment (Figure 5B) only at a concentration significantly higher than that which was produced by fibroblast lines in culture (Figure 5A). Although the authors note that the real level of NRG1 could be higher than detected, this is speculative. Neither HGF nor NRG1 blocking antibodies appear to have rescued the elevated cell viability driven by CAF CM in the context of lorlatinib treatment (Figure 5C). These results, though statistically significant, do not appear biologically relevant. To strengthen their conclusions, the authors should consider ablating HGF or NRG1 in CAFs via shRNA or CRISPRi and then testing if CAF CM is no longer sufficient to rescue the viability of lorlatinib treated cancer spheroids.

      Author response:

      We refer the reviewer to our response to comment #1 of reviewer 1.

      In addition to the western blots used to demonstrate and effect of CAF CM or HGF/NRG1 on Akt and mTOR signaling, the authors could strengthen their conclusions by testing the effect of Akt and mTOR inhibitors on the rescue effect of CAF CM.

      Author response:

      We refer the reviewer to our response to comment #6 of reviewer 2.

      Minor comments:

      1. In addition to the cell death and proliferation assays shown in Figure 1B-E, it would be helpful to show and quantify images of spheroid co-cultures treated +/- lorlatinib (as in Figure 1A, Supplemental Figure S9). Although the effects on percent cell death and proliferation are significant, images of spheroid size and morphology would make these results more convincing.

      __Author response: __

      We can certainly incorporate representative images in the revised manuscript. However, we'd like to clarify that comparing spheroid mono- and direct co-cultures can be challenging due to differences in initial cell seeding numbers, variations in the growth rates of fibroblasts and tumor cells, and subsequently, differences in spheroid sizes at the initiation of treatment. These factors can confound direct comparisons between the two culture conditions upon treatment.

      In Figure 5B, the effect of CAF secreted factors on cell viability should be tested in comparison to CAF CM as a biological control. This would allow the reader to understand how the effect of each factor alone compares to the effect of CM.

      __Author response: __

      We will incorporate a comparison to CAF-CM as a biological control to provide a clearer understanding of the individual effects of CAF-secreted factors.

      **Referees cross-commenting**

      I am in agreement with all of the points made by Reviewer #1. I also suggested that the authors should use shRNA to inhibit HGF and NRG1 expression in CAFs, and am similarly concerned about both human and in vivo relevance of the authors' findings. The experiments suggested by Reviewer #1 to further characterize the fibroblast subtypes and to use non-CAF control cells are also reasonable.

      Author response:

      The reviewers alignment on the points raised is duly noted, and we understand the importance of addressing the concerns regarding the relevance of our findings. The use of shRNA to inhibit HGF and NRG1 expression in CAFs is a valuable suggestion, and we are actively considering this approach to enhance the specificity of our findings. Furthermore, we acknowledge the need for a deeper characterization of fibroblast subtypes and the inclusion of non-CAF control cells to strengthen the robustness of our research.

      I am also in agreement with the critiques presented by Reviewer #2 and find them reasonable; they would strengthen the manuscripts and better support the authors' findings. The work is indeed limited by the models used here, and mechanistic findings would be better supported by further metabolic analysis such as Seahorse or assessment of lipid synthesis.

      Author response:

      We greatly appreciate your alignment with Reviewer #2's critiques and your recognition of their reasonableness. Expanding our research to include additional models and conducting further metabolic analyses are valuable suggestions that we are actively considering to bolster the mechanistic underpinnings of our work.

      Reviewer #3 (Significance (Required)):

      As a reviewer, I have expertise in fibroblast biology and the contributions of the tumor microenvironment to pancreatic tumor development. Although my research has not focused on lung cancer specifically, I also have experience in lipid metabolism, therapy resistance, and tumor heterogeneity. In this manuscript the authors use a co-culture system to show that soluble CAF factors drive tyrosine kinase inhibitor (TKI) resistance in vitro in Alk-fusion driven NSCLC in line with prior work (Reviewed by Wong et al. 2021, Domen et al. 2021, Li et al. 2022). Mechanistically, the authors propose that CAF-secreted HGF and NRG1 restore Akt and mTor signaling pathways suppressed by lorlatinib, thus rescuing SREBP expression and the phospholipidome in TKI-treated NSCLC cells. Prior work has specifically demonstrated the ability of CAFs to rescue the effect of lorlatinib on NSCLC cell lines with ALK fusions via HGF/Met signaling (Hu et al. 2021), and the general effect of CAF-secreted HGF on therapy resistance through Akt/mTor signaling has been well characterized (CITE). The regulation of SREBP and lipid metabolism by Akt/mTOR signaling in cancer and TKI resistance have been similarly described (CITE). Thus, the authors largely connect these well-known pathways, demonstrating that CAF co-culture restores lipid-associated transcriptional programs and lipidomic profile in lorlatinib-treated cells via HGF/NRG1 activation of Akt, mTOR, and SREBP. A few points presented in the manuscript that could represent potential scientific advances include scRNA-seq analysis of CAF/NSCLC co-cultures and the implication of CAFs in TKI-resistance through the modulation of lipid metabolism. However, the scientific and clinical significance of these findings are limited by the biological systems used and by their incremental contribution in context of the current literature.

      The scRNA-seq analysis of NSCLC cells co-cultured with CAFs generated here could represent a potential advance and resource for future study; however, the application of this analysis to 2D cell lines in vitro may have limited utility as the heterogeneity of these long-culture lines is likely quite narrow (CITE OTHER scRNAseq in vivo or PDOs). The authors themselves did not leverage the scRNA-seq data for a deep analysis of cancer cell heterogeneity but rather uncovered lipid-associated transcriptional programs using bulk analysis across all tumor cells in their dataset. The significance of the authors' finding that CAF conditioned media (CM) mediates lorlatinib-sensitivity through the regulation of lipid metabolism is also somewhat limited by prior work directly implicating SREBP and phospholipid remodeling in TKI resistance (Xu et al. 2021, OTHERS). Although focusing on EGFR-mutant NSCLC, Xu et al. 2021 showed that SREBP upregulation and increased lipogenesis and decreased oxidative stress was associated with resistance to gefitinib and could be reversed by treatment with the SREBP inhibitor fatostatin in vivo. TKI-resistance is often driven by the activation of convergent signaling pathways (Akt, mTOR) in both in EGFR-mutant ALK-fusion NSCLC, so it is perhaps not particularly surprising that the authors find that lipid programs are similarly important in lorlatinib-resistance. The novelty in this manuscript is limited to the connection between CAFs and these critical lipid metabolism pathways, and the implication that SREBP inhibition similarly blocks non-genetic CAF-mediated TKI resistance. The significance of this finding might be greater if the authors explored whether fatostatin could improve therapy response to lorlatinib in vivo.

      Author response:

      We highly appreciate the reviewer’s detailed review and expertise in the fields of fibroblast biology, tumor microenvironment, lipid metabolism, and therapy resistance.

      The reviewer’s perspective on the utility of scRNA-seq analysis in our study is justified. We acknowledge that applying this analysis to 2D cell lines in vitro may have limitations due to the narrow heterogeneity of long-culture lines. We therefore attempted to enhance the relevance of our findings by applying 3D cell culture models, which are known to resemble the in vivo situation more closely than conventional monolayer cultures (4, 5). Nevertheless, we agree that incorporation of additional models (e.g. patient-derived organoids) would better capture the heterogeneity of the tumor and its surrounding microenvironment. We concur that a deeper analysis would enhance our understanding of the interactions between CAFs and tumor cell (sub)populations.

      The insights into the significance of our findings in the context of prior research on SREBP-dependent phospholipid remodeling in TKI resistance are well taken. We agree that the novelty of our study lies in the connection between CAFs and lipid metabolism pathways as a non-genetic CAF-mediated TKI resistance mechanism. However, it is also important to note that no prior studies have investigated stroma-driven lipid metabolic reprogramming in EML4-ALK-positive NSCLC. This unique aspect of our research adds to its originality and potential significance in advancing the understanding of ALK-positive NSCLC and therapy resistance.

      We agree with the reviewer’s point that an in vivo study would be important in exploring whether fatostatin could improve therapy response to lorlatinib. However, due to technical and timing limitations, the establishment of corresponding mouse models is beyond the scope of our present study.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. If no revisions have been carried out yet, please leave this section empty.

      Reviewer #1

      Minor issues:

      "----stronger in in case of-------" needs to be improved to "----stronger in case of-------", line 128, page 5.

      __Author response: __

      This was changed accordingly.

      Reviewer #2

      1. Based on sc-RNA-seq data authors then explore molecular processes facilitating survival of ALK-driven lung cancer cells under the influence of CAFs and mention that "among the enriched biological processes and pathways related to metabolic activities, the most striking terms were linked to lipid metabolism" (lines 182-184). Attending to the graph depicted in Fig. 3A, that is not true, finding other processes more significant. In fact, linked to metabolism, glycolysis is more significant than lipid metabolism. This sentence should be changed accordingly and a different rationale should be done to focus on lipid metabolism.

      __Author response: __

      The phrasing was changed accordingly.

      Reviewer #3

      Major comments:

      1. Although the authors note that the scRNA-seq data generated here may be an important resource in the field, it could also be explored in greater depth to further support the conclusions of this manuscript. As is, the scRNA-seq data is primarily analyzed at a bulk level to identify lipid-associated genes and gene sets as down-regulated by lorlatinib. In this case, it would be more useful and perhaps a better resource to conduct bulk RNA-seq in triplicate to generate a stronger dataset and generate a set of genes significantly regulated by lorlatinib and CAF co-culture or CAF CM. The scRNA-seq data could be leveraged to support the conclusions of the manuscript by plotting lipid-associated genes identified in Figure XB-C by U-map. This analysis would identify which clusters are enriched for lipid-associated genes and demonstrate whether these particular clusters are depleted by lorlatinib or rescued by CAF co-culture.

      __Author response: __

      We opted for scRNA-seq as it allowed us to simultaneously sequence co-cultivated tumor cells and fibroblasts, without the need for sorting experiments typically required for bulk RNA-sequencing experiments. With this we intended to avoid potential biases introduced by sorting procedures, which can be challenging, particularly in the case of identifying appropriate markers for fibroblasts.

      In response to the reviewer’s suggestions we have now refined our analysis to depict lipid-associated genes in a cluster-dependent manner (Supplementary Figure S9). This analysis, however, did not showcase a cluster-specific enrichment of lipid-associated genes and a demonstrated a TKI-induced depletion of these genes across all tumor cell cluster.

      The authors' conclusion that CAF co-culture restores the lipid profile of lorlatinib-treated tumor cells is somewhat weak due to the representation of lipidomic data. Although the Figure legends note that lipidomic analyses were conducted at n=3 replicates, the data as represented in Figure 8A-B do not allow the reader to assess variability across samples or the significance of the fold change differences in lipid species. Although it can be useful to view the data this way, the authors should also show variability across samples in some way via PCA plot or by including a heatmap of lipid abundance across all treatment groups and replicates. Especially as some differences appear subtle, it is also difficult to understand to what extent CAF CM rescues lorlatinib-induced effects on lipid species as values are shown as fold change relative to control for the independent groups. In this way, the reader cannot assess, for example, how lipid species abundance compares in lorlatinib-treated tumor cells +/- CAF CM. Again, a heatmap across treatment groups might be helpful in addition to an analysis for statistically significant differences in lipid abundance across treatment groups. The issues outlined here make it difficult to assess whether "addition of CAF-CM to H3122 lung tumor spheroids was able to partly abrogate this shift towards higher levels of poly-unsaturated lipid". As is, the statements describing the results in Figure 8A-B are vague and don't appear to totally align with the data. To my eyes, there is no apparent general trend in SFA or MUFA reduction in lorlatinib-treated cells as implied by the authors, though particular species may be down-regulated. The authors should also calculate saturation indices across lipid species to support their conclusion that lipid saturation is modulated by lorlatinib and rescued by CAF CM.

      __Author response: __

      Given the reviewer’s suggestions, we have made significant improvements to the presentation of our lipidomic data in the revised manuscript. We now provide a more comprehensive view of the data to allow for a better assessment of variability across samples and the significance of saturation index differences across lipid species. Specifically, we have included a PCA plot (Figure 8A) and a heatmap of lipid abundances across all treatment groups and replicates to address the issue of variability (Supplementary Figure S11). Furthermore, we have performed additional analyses to calculate saturation indices across lipid species (Supplementary Figure S12A), which support our conclusion that lipid saturation, i.e. de novo lipogenesis, is modulated by lorlatinib and rescued by CAF-CM. These additions provide a clearer visualization of the data and enhance the robustness of our findings.

      Minor comments:

      The ablation of specific cell clusters upon lorlatinib treatment in Figure 2 is compelling and visually striking. To make it easier for the reader to interpret this data, it might be useful to denote the general functional annotations of each cluster in the legend (for example, "cluster 3: proliferative"). This would allow the reader to visualize which populations are preferentially depleted by the inhibitor and rescued by CAF co-culture. Further, some quantification showing the number of cells in each cluster by treatment would group (or fold-reduction per cluster upon inhibitor treatment) would more clearly show how each cluster is impacted by the inhibitor and CAF co-culture.

      __Author response: __

      To facilitate a clearer understanding of which populations are preferentially affected by the ALK-inhibitor and rescued by CAF co-culture, we provided the cluster-specific annotations in the legend of Figure 2.

      Furthermore, we included quantifications showing the number of cells in each cluster by culture condition and treatment group (Supplementary Table S3), to provide a more comprehensive view of how each cluster is impacted by the inhibitor and CAF co-culture.

      In Supplemental Figure S3A please specify which gate is being used to quantify the percentage of dead cells shown in subsequent plots. It would also be useful to show the gating strategy used to separate labelled tumor cells and CAFs in heterotypic co-cultures by FACS so it is clear that CAF cells are not included in the cell death/proliferation analysis.

      __Author response: __

      Gates for quantification of dead cells are now specified, while the gating strategy used for analyzing cell death rates of separated tumor cells is given as requested in Supplementary Figure S3A. This gating strategy was likewise used to separate labelled tumor cells from CAFs to analyze cell cycle distributions.

      In Supplemental Figure S1B and C, please briefly clarify in the legend how fibroblast lines were cultured for collection of RNA and protein. It would be useful to know if the cells were assessed in spheroid culture and thus representative of their cell state when used for the following heterotypic co-culture experiments.

      __Author response: __

      Culture conditions were added in the figure legend as requested. Prior to generation of heterotypic tumor spheroids, fibroblasts were cultured as monolayers. Nevertheless, we also verified that the activation status is maintained following TGF-β1 removal and subsequent cultivation as homotypic fibroblast spheroid via WB analysis. We added the results as shown in Supplementary Figure S1D.

      In Figure 8C, the authors should plot all individual values in the bar graph as done in all other panels.

      __Author response: __

      This was changed accordingly

      4. Description of analyses that authors prefer not to carry out

      *Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. *

      Reviewer #1

      Major critiques:

      The concept of this work is largely based on findings in vitro culture. The authors should perform animal experiments to convince their findings in vivo. CAFs and tumor cells would be implanted into recipient mice to determine whether AKT signaling, de novo lipogenesis and ALK-TKI resistance are increased in tumor cells by the presence of CAFs.

      __Author response: __

      We agree with the reviewer’s point that an in vivo study would be important in interrogating the full impact of CAFs on therapy response, AKT signaling, and de novo lipogenesis of ALK-driven lung adenocarcinoma cells. However, due to technical and timing limitations, establishing and performing co-injection and treatment experiments of corresponding mouse models is beyond the scope of our present study.

      The author described that human primary fibroblasts (FB1, FB2) were derived from NSCLC adenocarcinoma patients---. Have FB1 and FB2 been isolated from tumor tissues or no-tumor tissues? If these fibroblasts were isolated from tumor tissues as CAFs, why the authors added TGF-b onto the cells? The TGF-b treatment generates myofibroblastic CAFs, which is one of CAF subtypes, but fails to have inflammatory CAFs, which is another CAF subtypes.

      __Author response: __

      The fibroblasts FB1 and FB2 were indeed isolated from tumor tissue obtained from lung adenocarcinoma patients. In our study, the addition of TGF-β1 was employed as a strategy to maintain the CAF phenotype. It's important to note that CAFs exhibit considerable plasticity and can potentially lose their distinctive CAF characteristics during in vitro cultivation (6). The introduction of TGF-β1 was aimed at mimicking the tumor microenvironment and assisting in the preservation of the CAF phenotype, which was partially reflected in the increased expression of CAF markers such as αSMA and FAP (Supplementary Figure S1B and C).

      We acknowledge the existence of various CAF subtypes, including myofibroblastic and inflammatory CAFs, which can be induced by different stimuli. While TGF-β1 treatment tends to push fibroblasts more toward a myofibroblastic phenotype, other factors like IL-1 can induce an inflammatory phenotype (7). In our study, we chose to focus on the myofibroblastic CAF subtype. This decision was based on the prevalence of myofibroblastic CAFs in lung tumors and their established roles in tumor progression, poor prognosis across different cancer types, and resistance to immunotherapy in non-small cell lung cancer (NSCLC) (8, 9).

      Reviewer #2

      H2228 and H3122 cells are used indistinctively through the paper as ALK-driven lung tumor cells and, although in the discussion some reference is made regarding the worst outcomes observed for v3-driven ALK+ H2228 cells, results are considered similar for both cell lines, including sc-RNA-seq data. During analysis of sc-RNA-seq data numbers of specific genes identified at the different analysis are different, similar to the clusters identified (0-6). In order to determine the degree of overlap in the identified genes on the analysis and within clusters, it would be convenient to show tables with identified genes for each of the cell lines, together with the cluster classification of those genes.

      Author response:

      Regarding the comparison of v1- vs. v3-driven ALK+ tumors, we would like to clarify that the primary focus of our study is on the interactions CAFs and ALK-driven lung tumor cells, particularly in the context of therapy resistance. While the different ALK fusion variants are certainly of interest, our intention is not to delve into the comparative analysis of these variants in this paper. Instead, we aim to emphasize the broader impact of CAFs on ALK-driven lung tumors.

      The comparison of v1- vs. v3-driven tumors, as well as a detailed analysis of the differences between H2228 and H3122 cells, goes beyond the main focus of this paper. Incorporating a comparative analysis of specific genes for each cell line and their cluster classification would require a significantly expanded scope and could lead to a more complex and detailed study.

      In order to fully validate the effect of CAF/CAF CM in lipid biosynthesis in tumor cells, seahorse analysis would be highly beneficial, providing simultaneous measurement of multiple metabolic parameters, including glycolysis, oxidative phosphorylation, and fatty acid oxidation in homo (with and without CAF's CM or secreted ligands) and heterotypic conditions. Furthermore, it should be combined with specific substrates and inhibitors (i.e. glucose to measure acetyl-CoA production, labelled fatty acids, etc), to dissect various aspects of lipid biosynthesis and lipid metabolism and assess de novo lipogenesis, fatty acid uptake, or triglyceride.

      Author response:

      This point is well taken and we acknowledge the potential value of such comprehensive metabolic assessments. However, we would like to clarify that the Seahorse XF Analyzer can primarily measure oxygen consumption rate (OCR) and extracellular acidification rate (ECAR), in response to different substrates to interrogate key metabolic functions such as mitochondrial respiration and glycolysis. At least to our knowledge, the Seahorse analyzer does not specifically measure de novo lipogenesis and fatty acid uptake. Therefore, incorporating these assays in the revised manuscript may not directly address the central question of CAF-driven enhanced lipid biosynthesis.

      Nonetheless, we do agree with the reviewer that a more in-depth investigation of various metabolic alterations could be of interest in future studies. Given the GSEA data derived from our scRNA-seq analysis, which hints at alterations in glycolysis (Figure 3A), exploring these aspects of metabolic alterations in the context of CAF-mediated resistance effects could indeed provide valuable insights in the broader mechanisms underlying ALK-TKI resistance.

      This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.

      References

      1. Nadel BB, Oliva M, Shou BL, Mitchell K, Ma F, Montoya DJ, et al. Systematic evaluation of transcriptomics-based deconvolution methods and references using thousands of clinical samples. Brief Bioinform. 2021;22(6).
      2. Maynard A, McCoach CE, Rotow JK, Harris L, Haderk F, Kerr DL, et al. Therapy-Induced Evolution of Human Lung Cancer Revealed by Single-Cell RNA Sequencing. Cell. 2020;182(5):1232-51 e22.
      3. Bairoch A. The Cellosaurus, a Cell-Line Knowledge Resource. J Biomol Tech. 2018;29(2):25-38.
      4. Friedrich J, Seidel C, Ebner R, Kunz-Schughart LA. Spheroid-based drug screen: considerations and practical approach. Nat Protoc. 2009;4(3):309-24.
      5. Pampaloni F, Reynaud EG, Stelzer EH. The third dimension bridges the gap between cell culture and live tissue. Nat Rev Mol Cell Biol. 2007;8(10):839-45.
      6. Sahai E, Astsaturov I, Cukierman E, DeNardo DG, Egeblad M, Evans RM, et al. A framework for advancing our understanding of cancer-associated fibroblasts. Nat Rev Cancer. 2020;20(3):174-86.
      7. Biffi G, Oni TE, Spielman B, Hao Y, Elyada E, Park Y, et al. IL1-Induced JAK/STAT Signaling Is Antagonized by TGFbeta to Shape CAF Heterogeneity in Pancreatic Ductal Adenocarcinoma. Cancer Discov. 2019;9(2):282-301.
      8. Mhaidly R, Mechta-Grigoriou F. Fibroblast heterogeneity in tumor micro-environment: Role in immunosuppression and new therapies. Semin Immunol. 2020;48:101417.
      9. Hanley CJ, Waise S, Ellis MJ, Lopez MA, Pun WY, Taylor J, et al. Single-cell analysis reveals prognostic fibroblast subpopulations linked to molecular and immunological subtypes of lung cancer. Nat Commun. 2023;14(1):387.
    1. Author Response

      We thank the reviewers for their suggestions in improving the manuscript. We are currently working on a formal revision and plan to submit a revised manuscript in the near future. However, we would be remiss, if we did not address concerns regarding the conceptual merits of the paper. Below we speak to major points of note that address select reviewer comments and the eLife assessment of our manuscript.

      eLife assessment:

      However, the strength of evidence is incomplete due to the concern that larval contraction is a result of chilling the nervous system and muscles, which causes spreading depolarization and mechanical contraction of the body, rather than an active sensorimotor response to cold.

      Reviewer #3:

      The scientific premise is that a full body contraction in larvae that are exposed to noxious cold is a sensorimotor behavioral pathway. This premise is, to start with, questionable. A common definition of behavior is a set of "orderly movements with recognizable and repeatable patterns of activity produced by members of a species (Baker et al., 2001)." In the case of nociception behaviors, the patterns of movement are typically thought to play a protective role and to protect from potential tissue damage.

      Does noxious cold elicit a set of orderly movements with a recognizable and repeatable pattern in larvae? Can the patterns of movement that are stimulated by noxious cold allow the larvae to escape harm? Based on the available evidence, the answer to both questions is seemingly no.

      We thank the reviewer for their questions and clarify, here. Exposure to cold temperatures does elicit a recognizable and repeatable pattern of behavior across multiple strains, including both wildtype and genetic control strains (w1118, Oregon R) and numerous control conditions that have been previously published (Himmel et al., 2021, Himmel et al., 2023, Patel et al., 2022, Turner et al., 2016, Turner et al., 2018, Tenedini et al., 2019). Our initial publication on Drosophila cold nociception demonstrated a variety of cold-evoked behavior responses including head and/or tail raising of the larva as well as contraction behavior. These behaviors were repeatedly observed in assays involving either local cold stimulation with a cold probe or global cold stimulation on a cold plate. Head and/or tail raise behaviors are consistent with behavior that displaces the larval body from the cold surface, however, exposure to increasingly colder temperatures leads to an increasing level of cold-evoked contraction (CT) responses which result in a reduction of larval area (Turner et al., 2016). Presumably, increasing the level of CIII md neuron activation leads to greater activation of downstream circuitry. We previously performed optogenetic dose response assays to further clarify the increased prevalence CT response to strong noxious cold stimuli and investigated how CIII md neurons discriminate between innocuous touch and noxious cold stimuli. Here, we found that lower-level activation of CIII md neurons lead to predominantly touch-evoked behaviors whereas high-level activation led predominantly to cold-evoked responses (Turner et al., 2016). These analyses were coupled with stimulus-evoked calcium imaging, which revealed that touch-evoked Ca2+ levels were significantly lower than cold-evoked Ca2+ levels (Turner et al., 2016).

      In this manuscript, we confirm our previously published findings that neural silencing of CIII md neurons with either tetanus toxin expression or impairing action potential propagation results impaired cold-evoked CT responses (Turner et al., 2016, Turner et al., 2018). However, neural silencing of CIII md neurons did not eliminate cold-evoked CT responses. We interpret this finding as evidence that some component of cold-evoked CT response may be due to cold-induced muscle contraction. Furthermore, in this manuscript, we implicate the requirement of chordotonal (Ch) neurons in cold-evoked CT and demonstrate cold-evoked Ca2+ increases in Ch neurons. Furthermore, neural silencing of multiple sensory neuron types (CIII + Ch or CIII + CII) resulted in greater deficits in cold-evoked behaviors (Turner et al., 2016). Thus, the noxious cold stimulus is detected by multiple peripheral sensory neurons and inhibiting neural activity in CIII md neurons alone cannot eliminate cold-evoked CT responses.

      In this manuscript and in several other publications, studies have shown that optogenetic activation of CIII md neurons, or CIII neurons plus CII neurons or Ch neurons elicits CT-like responses (Hwang et al., 2007, Shearin et al., 2013, Turner et al., 2016). Conversely, optogenetic stimulation of CIII md neurons knocked down for paralytic, the α-subunit of voltage-gated sodium channel, did not elicit blue light-evoked CT responses due to impaired action potential propagation. These analyses collectively indicate that CIII md neuron activation is sufficient for eliciting CT-like responses. Additionally, we have previously published electrophysiological recordings of CIII md neurons under cold exposure. To address potential confounds of cold-induced muscle contraction on cold-induced electrical activity of CIII md neurons, we performed these analyses on de-muscled fillets revealing that CIII neural activity is not dependent upon muscles in response to cold. Exposure to noxious cold stimuli results in temperature-dependent increases in CIII neuron firing pattern consisting of both bursting and tonic firing (Himmel et al., 2021, Himmel et al., 2023, Maksymchuk et al., 2022, Patel et al., 2022, Himmel et al., 2022, Maksymchuk et al., 2023).

      Reviewer #3:

      Can the patterns of movement that are stimulated by noxious cold allow the larvae to escape harm?

      We were similarly curious about the neuroethological and/or protective implications of cold-evoked behaviors. In Drosophila larvae, noxious mechanical stimuli-evoked body rolling allows for lateral escape from predatory wasp (Hwang et al., 2007). Reducing the overall surface area that is exposed to cold (e.g., huddling behavior) serves as a protective strategy in many species (Canals et al., 1997, Contreras, 1984, Gilbert et al., 2006, Vickery and Millar, 1984, Hayes et al., 1992). Low temperatures can be fatal to poikilotherms (e.g., insects), however, many species have evolved the ability to cold acclimate thereby increasing their cold tolerance. To explore the potential evolutionary benefit of CIII-mediated contraction response to cold, we previously published work revealing a neural basis for cold acclimation in Drosophila larvae implicating these neurons (Himmel et al., 2021). We demonstrated that cold-evoked CT behavior is evolutionarily conserved across 11 different drosophilid species and that other cold-induced behaviors (e.g., tail raise) were also observed. Furthermore, drosophilid species adapted to rapid temperature swings were more likely to retain the ability to locomote even at lower temperatures (Himmel et al., 2021). Next, we elucidated the role of CIII md neurons in cold acclimation. Silencing CIII md neurons resulted in the inability to cold acclimate. We additionally investigated roles of Ch or CII md neurons, which alone did not inhibit the ability of larvae to cold acclimate. However, combinatorial silencing of CIII with CII or Ch neurons resulted in an inability to cold acclimate but did not obviously increase baseline cold tolerance. We explored how developmental exposure to noxious cold temperature impacts CIII md neuron cold-evoked firing pattern. Electrophysiological analyses revealed that cold acclimation results in hypersensitization in CIII md neurons (Himmel et al., 2021). Lastly, developmental optogenetic activation of CIII md neurons led to increased cold tolerance. Therefore, CIII md neurons are necessary and sufficient for cold tolerance and our collective evidence demonstrate that CIII-mediated cold nociception constitutes a peripheral neural basis for Drosophila larval cold acclimation (Himmel et al., 2021).

      Reviewer #3:

      It should be noted that this actuator drives very strong activation, and other studies with milder optogenetic stimulation of Class III neurons have shown that these cells produce behavioral responses that resemble gentle touch responses (Tsubouchi et al 2012 and Yan et al 2013)…The latter makes the reported Calcium responses to cold difficult to interpret in light of the fact that the strong muscle contractions driven by cold may actually be driving mechanosensory responses in these cells (ie through deformation of the mechanosensitive dendrites)…. Are the cIII calcium signals still observed in a preparation where cold induced muscle contractions are prevented?”

      We agree with the reviewer that mild activation of CIII md neurons results in gentle touch-like responses. In this manuscript, and other previously published work, it has been shown that optogenetic activation of CIII neurons, or CIII neurons and other sensory neurons, using a variety of optogenetic actuators (ChR2, ChETA, and CsChrimson) promotes bilateral contraction of the larval body along the anterior-posterior axis (Shearin et al., 2013, Hwang et al., 2007, Meloni et al., 2020, Turner et al., 2016, Patel and Cox, 2017, Patel et al., 2022, Himmel et al., 2023).

      As described above, in our initial publication documenting larval cold nociception in Drosophila, we investigated how CIII md neurons discriminate multimodal stimuli to elicit stimulus relevant behavioral responses. We reported that increased activation of CIII md neurons results in cold-evoked behaviors, where lower activation results in touch-evoked behaviors. Subsequent, calcium analyses revealed greater stimulus-evoked calcium response to noxious cold and milder calcium response to gentle touch (Turner et al., 2016).

      Though we have not performed cold-evoked Ca2+ imaging of CIII md neurons in larval preparations without muscles, we have recorded electrical responses of CIII md neurons in the absence of muscle contractions using de-muscled larvae fillets to analyze cold-evoked firing patterns of CIII md neurons (Himmel et al., 2021, Himmel et al., 2022, Himmel et al., 2023, Patel et al., 2022, Maksymchuk et al., 2022, Maksymchuk et al., 2023). These studies demonstrate the cold-evoked CIII neural activity is not dependent upon muscles.

      Reviewer #3:

      A major weakness of the study is that none of the second or third order neurons (that are downstream of CIII neurons) are found to trigger the CT behavioral responses even when strongly activated with the ChETA actuator (Figure 2 Supplement 2). These findings raise major concerns for this and prior studies and it does not support the hypothesis that the CIII neurons drive the CT behaviors.”

      We conducted extensive screening of interneuron populations post-synaptically connected to CIII neurons in an effort to identify post-synaptic partners that were sufficient to trigger CT response. Much to our surprise, we were unable to find any individual neuron type or driver line that was sufficient to elicit a CT response. However, we provide substantial supporting evidence for our co-activation experiments including neural silencing, EM connectivity and calcium imaging. We also report necessity for the reported second/third order neurons in cold-evoked behavioral responses, where inhibiting neural activity resulted in reduced cold-evoked behavior. Second/third order neurons also exhibit cold-evoked calcium responses. Lastly, we also report CIII-evoked (using optogenetics) increases in calcium response in downstream post-synaptic neurons.

      Previously published literature investigating CIV md neuron circuitry has implicated downstream neurons that are not sufficient to elicit rolling behavior upon activation. In CIV md neuron circuit dissection, select neurons are reported as acting downstream of CIV md neurons that require additional circuit components in order to execute rolling behavior. For example, A00c neuron activation alone does not lead to rolling behavior, however, co-activation of A00c and Basin-4 neurons facilitates rolling response (Ohyama et al., 2015). Similarly, co-activation of Basin-1 and Basin-4 neurons significantly enhance rolling probability relative to Basin-4 alone (Ohyama et al., 2015). Further, DnB neurons require Goro command neuron activity to promote rolling behavior (Burgos et al., 2018). Thus, there is precedent for co-activation requirements to elicit robust behavioral output in sensorimotor circuits and we employed a similar strategy after we discovered that activation of second or third order neurons alone did not elicit CT response.

      Reviewer #3:

      Later experiments in the paper that investigate strong CIII activation (with ChETA) in combination with other second and third order neurons does support the idea activating those neurons can facilitate body-wide muscle contractions. But many of the co-activated cells in question are either repeated in each abdominal neuromere or they project to cells that are found all along the ventral nerve cord, so it is therefore unsurprising that their activation would contribute to what appears to be a non-specific body-wide activation of muscles along the AP axis. Also, if these neurons are already downstream of the CIII neurons the logic of this co-activation approach is not particularly clear.”

      We agree with the reviewer’s comment that various cell-types that were investigated are repeated in every abdominal neuromere, however, only select post-synaptic neurons (Basin 1-4, DnB, mCSI, and Chair neurons) are segmentally repeated in every abdominal segment. Conversely, other projection and ascending neurons we investigated (A09e, A00c, A05q, Goro, TePn04/05, and A08n) are not segmentally repeated in every section. We used connectome evidence to guide our experiments on populations of neurons to explore in cold-evoked behavior and as alluded to above our co-activation approach was driven by the observation that an individual subpopulation of connected interneurons was not found to be sufficient to elicit CT behavior. That said, it does not change the findings that inhibition of neural activity in these subpopulations impairs cold-evoked behavior, nor does it change the observation that connected interneurons exhibit cold-evoked Ca2+ responses that can also be observed with optogenetic activation of CIII neurons. Reviewer #3: “The authors argument that the co-activation studies support "a population code" for cold nociception is a very optimistic interpretation of a brute force optogenetics approach that ultimately results in an enhancement of a relatively non-specific body-wide muscle convulsion.” Many studies exploring circuit bases of behavior have applied large-scale optogenetic, including co-activation strategies, or silencing screens to identify circuit components involved in specific behaviors under investigation. We employed similar methods in our circuit-based dissection and our conclusions are not solely based upon optogenetic analyses.

      References: BURGOS, A., HONJO, K., OHYAMA, T., QIAN, C. S., SHIN, G. J.-E., GOHL, D. M., SILIES, M., TRACEY, W. D., ZLATIC, M., CARDONA, A. & GRUEBER, W. B. 2018. Nociceptive interneurons control modular motor pathways to promote escape behavior in Drosophila. eLife, 7:e26016.

      CANALS, M., ROSENMANN, M. & BOZINOVIC, F. 1997. Geometrical aspects of the energetic effectivenes of huddling in small mammals. Acta Theriologica 42(3):321-328..

      CONTRERAS, L. C. 1984. Bioenergetics of Huddling: Test of a Psycho-Physiological Hypothesis. Journal of Mammalogy, 65, 256-262.

      GILBERT, C., ROBERTSON, G., LE MAHO, Y., NAITO, Y. & ANCEL, A. 2006. Huddling behavior in emperor penguins: Dynamics of huddling. Physiol Behav, 88, 479-88.

      HAYES, J. P., SPEAKMAN, J. R. & RACEY, P. A. 1992. The Contributions of Local Heating and Reducing Exposed Surface Area to the Energetic Benefits of Huddling by Short-Tailed Field Voles (Microtus agrestis). Physiological Zoology, 65, 742-762.

      HIMMEL, N. J., LETCHER, J. M., SAKURAI, A., GRAY, T. R., BENSON, M. N., DONALDSON, K. J. & COX, D. N. 2021. Identification of a neural basis for cold acclimation in Drosophila larvae. iScience, 24, 102657.

      HIMMEL, N. J., SAKURAI, A., DONALDSON, K. J. & COX, D. N. 2022. Protocols for measuring cold-evoked neural activity and cold tolerance in Drosophila larvae following fictive cold acclimation. STAR Protoc, 3, 101510.

      HIMMEL, N. J., SAKURAI, A., PATEL, A. A., BHATTACHARJEE, S., LETCHER, J. M., BENSON, M. N., GRAY, T. R., CYMBALYUK, G. S. & COX, D. N. 2023. Chloride-dependent mechanisms of multimodal sensory discrimination and nociceptive sensitization in Drosophila. elife, 12:e76863.

      HWANG, R. Y., ZHONG, L., XU, Y., JOHNSON, T., ZHANG, F., DEISSEROTH, K. & TRACEY, W. D. 2007. Nociceptive Neurons Protect Drosophila Larvae from Parasitoid Wasps. Current Biology, 17, 2105-2116.

      MAKSYMCHUK, N., SAKURAI, A., COX, D. N. & CYMBALYUK, G. 2022. Transient and Steady-State Properties of Drosophila Sensory Neurons Coding Noxious Cold Temperature. Frontiers in Cellular Neuroscience, 16:831803.

      MAKSYMCHUK, N., SAKURAI, A., COX, D. N. & CYMBALYUK, G. S. 2023. Cold-Temperature Coding with Bursting and Spiking Based on TRP Channel Dynamics in Drosophila Larva Sensory Neurons. Int J Mol Sci, 24(19):14638.

      MELONI, I., SACHIDANANDAN, D., THUM, A. S., KITTEL, R. J. & MURAWSKI, C. 2020. Controlling the behaviour of Drosophila melanogaster via smartphone optogenetics. Scientific Reports, 10, 17614.

      OHYAMA, T., SCHNEIDER-MIZELL, C. M., FETTER, R. D., ALEMAN, J. V., FRANCONVILLE, R., RIVERA-ALBA, M., MENSH, B. D., BRANSON, K. M., SIMPSON, J. H., TRUMAN, J. W., CARDONA, A. & ZLATIC, M. 2015. A multilevel multimodal circuit enhances action selection in Drosophila. Nature, 520, 633-639.

      PATEL, A. & COX, D. 2017. Behavioral and Functional Assays for Investigating Mechanisms of Noxious Cold Detection and Multimodal Sensory Processing in Drosophila Larvae. BIO-PROTOCOL, 7(13):e2388.

      PATEL, A. A., SAKURAI, A., HIMMEL, N. J. & COX, D. N. 2022. Modality specific roles for metabotropic GABAergic signaling and calcium induced calcium release mechanisms in regulating cold nociception. Front Mol Neurosci 15:942548.

      SHEARIN, H. K., DVARISHKIS, A. R., KOZELUH, C. D. & STOWERS, R. S. 2013. Expansion of the Gateway MultiSite Recombination Cloning Toolkit. PLoS ONE, 8, e77724-e77724.

      TENEDINI, F. M., SÁEZ GONZÁLEZ, M., HU, C., PEDERSEN, L. H., PETRUZZI, M. M., SPITZWECK, B., WANG, D., RICHTER, M., PETERSEN, M., SZPOTOWICZ, E., SCHWEIZER, M., SIGRIST, S. J., CALDERON DE ANDA, F. & SOBA, P. 2019. Maintenance of cell type-specific connectivity and circuit function requires Tao kinase. Nature Communications, 10, 3506.

      TURNER, H. N., ARMENGOL, K., PATEL, A. A., HIMMEL, N. J., SULLIVAN, L., IYER, S. C., BHATTACHARYA, S., IYER, E. P. R., LANDRY, C., GALKO, M. J. & COX, D. N. 2016. The TRP Channels Pkd2, NompC, and Trpm Act in Cold-Sensing Neurons to Mediate Unique Aversive Behaviors to Noxious Cold in Drosophila. Curr Biol, 26, 3116-3128.

      TURNER, H. N., PATEL, A. A., COX, D. N. & GALKO, M. J. 2018. Injury-induced cold sensitization in Drosophila larvae involves behavioral shifts that require the TRP channel Brv1. PLoS One, 13, e0209577.

      VICKERY, W. L. & MILLAR, J. S. 1984. The Energetics of Huddling by Endotherms. Oikos, 43, 88-93.

    1. Reviewer #1 (Public Review):

      The paper "Quantifying gliding forces of filamentous cyanobacteria by self-buckling" combines experiments on freely gliding cyanobacteria, buckling experiments using two-dimensional V-shaped corners, and micropipette force measurements with theoretical models to study gliding forces in these organisms. The aim is to quantify these forces and use the results to perhaps discriminate between competing mechanisms by which these cells move. A large data set of possible collision events are analyzed, bucking events evaluated, and critical buckling lengths estimated. A line elasticity model is used to analyze the onset of buckling and estimate the effective (viscous type) friction/drag that controls the dynamics of the rotation that ensues post-buckling. This value of the friction/drag is compared to a second estimate obtained by consideration of the active forces and speeds in freely gliding filaments. The authors find that these two independent estimates of friction/drag correlate with each other and are comparable in magnitude. The experiments are conducted carefully, the device fabrication is novel, the data set is interesting, and the analysis is solid. The authors conclude that the experiments are consistent with the propulsion being generated by adhesion forces rather than slime extrusion. While consistent with the data, this conclusion is inferred.

      Summary:

      The paper addresses important questions on the mechanisms driving the gliding motility of filamentous cyanobacteria. The authors aim to understand these by estimating the elastic properties of the filaments, and by comparing the resistance to gliding under a) freely gliding conditions, and b) in post-buckled rotational states. Experiments are used to estimate the propulsion force density on freely gliding filaments (assuming over-damped conditions). Experiments are combined with a theoretical model based on Euler beam theory to extract friction (viscous) coefficients for filaments that buckle and begin to rotate about the pinned end. The main results are estimates for the bending stiffness of the bacteria, the propulsive tangential force density, the buckling threshold in terms of the length, and estimates of the resistive friction (viscous drag) providing the dissipation in the system and balancing the active force. It is found that experiments on the two bacterial species yield nearly identical values of 𝑓 (albeit with rather large variations). The authors conclude that the experiments are consistent with the propulsion being generated by adhesion forces rather than slime extrusion.

      Strengths of the paper:

      The strengths of the paper lie in the novel experimental setup and measurements that allow for the estimation of the propulsive force density, critical buckling length, and effective viscous drag forces for movement of the filament along its contour - the axial (parallel) drag coefficient, and the normal (perpendicular) drag coefficient (I assume this is the case, since the post-buckling analysis assumes the bent filament rotates at a constant frequency). These direct measurements are important for serious analysis and discrimination between motility mechanisms.

      Weaknesses:

      There are aspects of the analysis and discussion that may be improved. I suggest that the authors take the following comments into consideration while revising their manuscript.

      The conclusion that adhesion via focal adhesions is the cause for propulsion rather than slime protrusion is consistent with the experimental results that the frictional drag correlates with propulsion force. At the same time, it is hard to rule out other factors that may result in this (friction) viscous drag - (active) force relationship while still being consistent with slime production. More detailed analysis aiming to discriminate between adhesion vs slime protrusion may be outside the scope of the study, but the authors may still want to elaborate on their inference. It would help if there was a detailed discussion on the differences in terms of the active force term for the focal adhesion-based motility vs the slime motility.

      Can the authors comment on possible mechanisms (perhaps from the literature) that indicate how isotropic friction may be generated in settings where focal adhesions drive motility? A key aspect here would probably be estimating the extent of this adhesion patch and comparing it to a characteristic contact area. Can lubrication theory be used to estimate characteristic areas of contact (knowing the radius of the filament, and assuming a height above the substrate)? If the focal adhesions typically cover areas smaller than this lubrication area, it may suggest the possibility that bacteria essentially present a flat surface insofar as adhesion is concerned, leading to a transversely isotropic response in terms of the drag. Of course, we will still require the effective propulsive force to act along the tangent.

      I am not sure why the authors mention that the power of the gliding apparatus is not rate-limiting. The only way to verify this would be to put these in highly viscous fluids where the drag of the external fluid comes into the picture as well (if focal adhesions are on the substrate-facing side, and the upper side is subject to ambient fluid drag). Also, the friction referred to here has the form of a viscous drag (no memory effect, and thus not viscoelastic or gel-like), and it is not clear if forces generated by adhesion involve other forms of drag such as chemical friction via temporary bonds forming and breaking. In quasi-static settings and under certain conditions such as the separation of chemical and elastic time scales, bond friction may yield overall force proportional to local sliding velocities.

      For readers from a non-fluids background, some additional discussion of the drag forces, and the forms of friction would help. For a freely gliding filament if 𝑓 is the force density (per unit length), then steady gliding with a viscous frictional drag would suggest (as mentioned in the paper) 𝑓 ∼ 𝑣! 𝐿 𝜂∥. The critical buckling length is then dependent on 𝑓 and on 𝐵 the bending modulus. Here the effective drag is defined per length. I can see from this that if the active force is fixed, and the viscous component resulting from the frictional mechanism is fixed, the critical buckling length will not depend on the velocity (unless I am missing something in their argument), since the velocity is not a primitive variable, and is itself an emergent quantity.

    2. Reviewer #2 (Public Review):

      In the presented manuscript, the authors first use structured microfluidic devices with gliding filamentous cyanobacteria inside in combination with micropipette force measurements to measure the bending rigidity of the filaments.

      Next, they use triangular structures to trap the bacteria with the front against an obstacle. Depending on the length and rigidity, the filaments buckle under the propulsive force of the cells. The authors use theoretical expressions for the buckling threshold to infer propulsive force, given the measured length and stiffnesses. They find nearly identical values for both species, 𝑓 ∼ (1.0 {plus minus} 0.6) nN∕µm, nearly independent of the velocity.

      Finally, they measure the shape of the filament dynamically to infer friction coefficients via Kirchhoff theory. This last part seems a bit inconsistent with the previous inference of propulsive force. Before, they assumed the same propulsive force for all bacteria and showed only a very weak correlation between buckling and propulsive velocity. In this section, they report a strong correlation with velocity, and report propulsive forces that vary over two orders of magnitude. I might be misunderstanding something, but I think this discrepancy should have been discussed or explained.

      From a theoretical perspective, not many new results are presented. The authors repeat the well-known calculation for filaments buckling under propulsive load and arrive at the literature result of buckling when the dimensionless number (f L^3/B) is larger than 30.6 as previously derived by Sekimoto et al in 1995 [1] (see [2] for a clamped boundary condition and simulations). Other theoretical predictions for pushed semi-flexible filaments [1-4] are not discussed or compared with the experiments.<br /> Finally, the Authors use molecular dynamics type simulations similar to [2-4] to reproduce the buckling dynamics from the experiments. Unfortunately, no systematic comparison is performed.

      [1] K. Sekimoto, N. Mori, K. Tawada and Y. Toyoshima, Phys. Rev. Lett., 1995, 75, 172-175<br /> [2] R. Chelakkot, A. Gopinath, L. Mahadevan and M. F. Hagan, J. R. Soc., Interface, 2014, 11, 20130884.<br /> [3] R. E. Isele-Holder, J. Elgeti and G. Gompper, Soft Matter, 2015, 11, 7181-7190.<br /> [4] R. E. Isele-Holder, J. Jager, G. Saggiorato, J. Elgeti and G. Gompper, Soft Matter, 2016, 12, 8495

    1. Author Response

      The following is the authors’ response to the original reviews.

      Cook, Watt, and colleagues previously reported that a mouse model of Spinocerebellar ataxia type 6 (SCA6) displayed defects in BDNF and TrkB levels at an early disease stage. Moreover, they have shown that one month of exercise elevated cerebellar BDNF expression and improved ataxia and cerebellar Purkinje cell firing rate deficits. In the current work, they attempt to define the mechanism underlying the pathophysiological changes occurring in SCA6. For this, they carried out RNA sequencing of cerebellar vermis tissue in 12-month-old SCA6 mice, a time when the disease is already at an advanced stage, and identified widespread dysregulation of many genes involved in the endo-lysosomal system. Focusing on BDNF/TrkB expression, localization, and signaling they found that, in 7-8 month-old SCA6 mice early endosomes are enlarged and accumulate BDNF and TrkB in Purkinje cells. Curiously, TrkB appears to be reduced in the recycling endosomes compartment, despite the fact that recycling endosomes are morphologically normal in SCA6. In addition, the authors describe a reduction in the Late endosomes in SCA6 Purkinje cells associated with reduced BDNF levels and a probable deficit in late endosome maturation.

      We would like to thank the reviewers for their careful reading of the paper, their feedback has helped us to add information and experiments to the paper that enhance the clarity of the findings.

      Strengths:

      The article is well written, and the findings are relevant for the neuropathology of different neurodegenerative diseases where dysfunction of early endosomes is observed. The authors have provided a detailed analysis of the endo-lysosomal system in SCA6 mice. They have shown that TrkB recycling to the cell membrane in recycling endosomes is reduced, and the late endosome transport of BDNF for degradation is impaired. The findings will be crucial in understanding underlying pathology. Lastly, the deficits in early endosomes are rescued by chronic administration of 7,8-DHF.

      We thank the reviewers for their positive feedback on this work.

      Weaknesses:

      The specificity of BDNF and TrkB immunostaining requires additional controls, as it has been very difficult to detect immunostaining of BDNF. In addition, in many of the figures, the background or outside of Purkinje cell boundaries also exhibits a positive signal.

      We agree with the reviewers that the performance of the BDNF and TrkB antibodies is an important concern. We have ourselves had difficulties with the performance of many antibodies and the images in this paper are the result of many years of optimization. We have therefore added further detail about the antibody optimization to the methods section of this paper, and have carried out new staining experiments with additional controls. We have added 2 new figure panels in supplementary figures 3 and 4 to demonstrate these tests.

      In the case of anti-BDNF antibodies, we have tested several antibodies and staining protocols and found that in our hands, the only antibody that reliably stained BDNF with a good signal to noise ratio was the one used in this paper (abcam ab108319). Even for this antibody, the staining was greatly enhanced by the use of a heat induced epitope retrieval (HIER) step, which allowed the visualization of BDNF within intracellular structures such as endosomes. When we quantified the intensity of this staining in our previous paper, the results were in agreement with those from a BDNF ELISA used to measure levels of BDNF in the cerebellar vermis of WT and SCA6 mice (Cook et al., 2022), which corroborates these results. As the staining was carried out in tissue sections and not dissociated cells, we also see positive signal from the BDNF staining outside of the Purkinje cells, since BDNF acts on cell-surface receptors and is thus released into the extracellular space around cells (Kuczewski et al., 2008) and is detectable in the extracellular matrix (Lam et al., 2019) and presynaptic terminals around neurons (Camuso et al., 2022; Choo et al., 2017). This is in contrast to studies that image BDNF mRNA with in-situ hybridization, which labels BDNF mRNA predominantly found in cells, and cannot tell us about sub-cellular or extracellular localization of BDNF protein. Together, these factors explain why we observe staining that is not cell- limited, but extends into the space around the cells of interest.

      We have added an additional supplemental figure to demonstrate the importance of using HIER when staining slices with anti-BDNF (Supplementary figure 3). We tested HIER protocols that involved heating the slices to 95°C in a variety of buffers. The buffers tested were sodium citrate buffer (10 mM sodium citrate, 0.05% Tween 20, pH 6), Tris buffer (10mM TBS, 0.05% Tween 20, pH 10), EDTA buffer (1mM EDTA, 0.05% Tween 20, pH 8) and neutral PBS. The PBS produced the best result, enhancing the staining of both anti-BDNF and anti-EEA1 antibodies (Supplementary figure 3). Therefore all slices stained using those antibodies were heated to 95°C in PBS using a heat block or thermocycler for 10 minutes, then allowed to cool before staining proceeded.

      The antibody we use (abcam ab108319) has been used in hundreds of other publications, including Javed et al., 2021 who ectopically expressed BDNF and noted colocalization between the antibody staining and the GFP tag of the BDNF construct, and Lejkowska et al., 2019 who overexpressed BDNF and saw a dramatic increase in antibody staining as well. The colocalization between ectopically expressed BDNF and the antibody in these studies demonstrates the specificity of the antibody.

      However, to further validate antibody specificity we used liver tissue as a negative control. In liver tissue from rodents and humans, the majority of the liver contains negligible levels of BDNF (Koppel et al., 2009; Vivacqua et al., 2014), see also the Human Protein Atlas. The exception is some cholangiocytes: epithelial cells that express BDNF at high levels (Vivacqua et al., 2014). We obtained liver tissue from a WT mouse that was undergoing surgery for an unrelated project and fixed and processed the tissue as we did for brain tissue (outlined in methods section). As we would expect, most of the cells in the liver showed BDNF immunoreactivity that was comparable to background levels (Supplementary figure 3). Interestingly, we were also able to detect sparse highly BDNF-positive cells in the liver, presumed cholangiocytes (Supp. Fig. 3). This pattern of liver BDNF expression is as predicted in the literature, and thus acts as a control for our antibody. We therefore believe that in our hands this antibody is able to stain BDNF with an appropriate degree of specificity.

      We also carried out staining experiments using a second anti-TrkB antibody that we had previously used to detect TrkB via Western bloing. We carried out immunohistochemistry as previously described using tissue sections from a WT mouse. The staining with the two different antibodies was carried out at the same time and all other reagents were kept constant. We found that both antibodies labelled TrkB in a similar pattern of localization, including in the early endosomes of the Purkinje cells (Supplementary figure 4). The second antibody however did have a lower signal to noise ratio and so we believe that the original anti-TrkB antibody used in this manuscript (EMD Millipore ab9872) is optimal for staining cerebellar tissue sections in our hands.

      One important concern about the conclusions is that the RNAseq experiment was conducted in 12-month- old SCA6 mice suggesting that the defects in the endo-lysosomal system may be caused by other pathophysiological events and, likewise, the impairment in BDNF signaling may also be indirect, as also noted by the authors. Indeed, Purkinje cells in SCA6 mice have an impaired ability to degrade other endocytosed cargo beyond BDNF and TrkB, most likely because of trafficking deficits that result in a disruption in the transport of cargo to the lysosomes and lysosomal dysfunction.

      We agree with the reviewers that the defects in the endo-lysosomal system may be caused by other events occurring in the course of disease progression. As mentioned by the reviewers, we have noted this possibility in the text. Detailed investigation into the sequence of events and the root causes of signaling disruption in SCA6 merits future study and we aim to address this in future work. We have expanded this explanation in the text.

      Moreover, the beneficial effects of 7,8-DHF treatment on motor coordination may be caused by 7,8-DHF properties other than the putative agonist role on TrkB. Indeed, many reservations have been raised about using 7,8-DHF as an agonist of TrkB activity. Several studies have now debunked (Todd et al. PlosONE 2014, PMID: 24503862; Boltaev et al. Sci Signal 2017, PMID: 28831019) or at the very least questioned (Lowe D, Science 2017: see Discussion: https://www.science.org/content/blog-post/those-compounds-aren-t- what-you-think-they-are Wang et al. Cell 2022 PMID: 34963057). Another interpretation is that 7,8-DHF possesses antioxidant activity and neuroprotection against cytotoxicity in HT-22 and PC12 cells, both of which do not express TrkB (Chen et al. Neurosci Lett 201, PMID: 21651962; Han et al. Neurochem Int. 2014, PMID: 24220540). Thus, while this flavonoid may have a beneficial effect on the pathophysiology of SCA6, it is most unlikely that mechanistically this occurs through a TrkB agonistic effect considering the potent anti-oxidant and anti-inflammatory roles of flavonoids in neurodegenerative diseases (Jones et al. Trends Pharmacol Sci 2012, PMID: 22980637).

      We thank the reviewers for raising this important point. We have noted in our previous paper (Cook et al., 2022) that 7,8-DHF may not be acting as a TrkB agonist in SCA6 mice, and are in agreement that other explanations are possible. We have now added information to the text of this paper to highlight this possibility. We did show in our previous paper that 7,8-DHF administration activates Akt signaling in the cerebellum of SCA6 mice, a signaling event that is known to take place downstream of TrkB activation. Additionally, 7,8-DHF treatment led to the increase of TrkB levels in the cerebellum of SCA6 mice (Cook et al., 2022), implicating TrkB in the mechanism of action, even if mechanistically, this is not via direct TrkB activation alone. However, even if the mechanism is currently incompletely explained, we believe that 7,8- DHF remains a valuable treatment strategy for SCA6. We have tried to rewrite the Discussion to highlight what we think is the most important takeaway: that 7,8-DHF can rescue endosomal and other deficits in SCA6, even if we do not currently know the full mechanism of action. We have therefore amended the text to add more detail about other potential explanations for the mechanism of action of 7,8-DHF.

      References

      Camuso S, La Rosa P, Fiorenza MT, Canterini S. 2022. Pleiotropic effects of BDNF on the cerebellum and hippocampus: Implications for neurodevelopmental disorders. Neurobiol Dis. doi:10.1016/j.nbd.2021.105606

      Choo M, Miyazaki T, Yamazaki M, Kawamura M, Nakazawa T, Zhang J, Tanimura A, Uesaka N, Watanabe M, Sakimura K, Kano M. 2017. Retrograde BDNF to TrkB signaling promotes synapse elimination in the developing cerebellum. Nat Commun 8:195. doi:10.1038/s41467-017-00260-w

      Cook AA, Jayabal S, Sheng J, Fields E, Leung TCS, Quilez S, McNicholas E, Lau L, Huang S, Watt AJ. 2022. Activation of TrkB-Akt signaling rescues deficits in a mouse model of SCA6. Sci Adv 8:3260. doi:10.1126/sciadv.abh3260

      Javed S, Lee YJ, Xu J, Huang WH. 2021. Temporal dissection of Rai1 function reveals brain-derived neurotrophic factor as a potential therapeutic target for Smith-Magenis syndrome. Hum Mol Genet 31:275–288. doi:10.1093/HMG/DDAB245

      Koppel I, Aid-Pavlidis T, Jaanson K, Sepp M, Pruunsild P, Palm K, Timmusk T. 2009. Tissue-specific and neural activity-regulated expression of human BDNF gene in BAC transgenic mice. BMC Neurosci 10:68. doi:10.1186/1471-2202-10-68

      Kuczewski N, Porcher C, Ferrand N, Fiorentino H, Pellegrino C, Kolarow R, Lessmann V, Medina I, Gaiarsa JL. 2008. Backpropagating action potentials trigger dendritic release of BDNF during spontaneous network activity. J Neurosci 28:7013–7023. doi:10.1523/JNEUROSCI.1673-08.2008

      Lam D, Enright HA, Cadena J, Peters SKG, Sales AP, Osburn JJ, Soscia DA, Kulp KS, Wheeler EK, Fischer NO. 2019. Tissue-specific extracellular matrix accelerates the formation of neural networks and communities in a neuron-glia co-culture on a multi-electrode array. Sci Rep 9. doi:10.1038/s41598- 019-40128-1

      Lejkowska R, Kawa MP, Pius-Sadowska E, Rogińska D, Łuczkowska K, Machaliński B, Machalińska A. 2019. Preclinical Evaluation of Long-Term Neuroprotective Effects of BDNF-Engineered Mesenchymal Stromal Cells as Intravitreal Therapy for Chronic Retinal Degeneration in Rd6 Mutant Mice. Int J Mol Sci 2019, Vol 20, Page 777 20:777. doi:10.3390/IJMS20030777

      Vivacqua G, Renzi A, Carpino G, Franchitto A, Gaudio E. 2014. Expression of brain derivated neurotrophic factor and of its receptors: TrKB and p75NT in normal and bile duct ligated rat liver. Ital J Anat Embryol 119:111–129. doi:10.13128/IJAE-15138

    1. Author Response

      The following is the authors’ response to the original reviews.

      We thank the reviewers and editor for their thoughful and careful evaluation of our manuscript. We appreciate your time and effort and have incorporated many of these suggestions to improve our revised manuscript.

      Reviewer #1 (Public Review):

      Summary: Cullinan et al. explore the hypothesis that the cytoplasmic N- and C-termini of ASIC1a, not resolved in x-ray or cryo-EM structures, form a dynamic complex that breaks apart at low pH, exposing a C-terminal binding site for RIPK1, a regulator of necrotic cell death. They expressed channels tagged at their N- and C-termini with the fluorescent, non-canonical amino acid ANAP in CHO cells using amber stop-codon suppression. Interaction between the termini was assessed by FRET between ANAP and colored transition metal ions bound either to a cysteine reactive chelator attached to the channel (TETAC) or metal-chelating lipids (C18-NTA). A key advantage to using metal ions is that they are very poor FRET acceptors, i.e. they must be very close to the donor for FRET to occur. This is ideal for measuring small distances/changes in distance on the scales expected from the initial hypothesis. In order to apply chelated metal ions, CHO cells were mechanically unroofed, providing access to the inner leaflet of the plasma membrane. At high pH, the N- and C- termini are close enough for FRET to be measured, but apparently too far apart to be explained by a direct binding interaction. At low pH, there was an apparent increase in FRET between the termini. FRET between ANAP on the N-and Ctermini and metal ions bound to the plasma membrane suggests that both termini move away from the plasma membrane at low pH. The authors propose an alternative hypothesis whereby close association with the plasma membrane precludes RIPK1 binding to the C-terminus of ASIC1a.

      Strengths: The findings presented here are certainly valuable for the ion channel/signaling field and the technical approach only increases the significance of the work. The choice of techniques is appropriate for this study and the results are clear and high quality. Sufficient evidence is presented against the starting hypothesis.

      Weaknesses: I have a few questions about certain controls and assumptions that I would like to see discussed more explicitly in the manuscript.

      My biggest concern is with the C-terminal citrine tag. Might this prevent the hypothesized interaction between the N- and C-termini? What about the serine to cysteine mutations? The authors might consider a control experiment in channels lacking the C-terminal FP tag.

      While it is certainly possible that the C-terminal citrine tag is preventing the hypothesized interaction between the intracellular termini, there are a few things that mitigate (but not eliminate) this concern. First, previous work looking at the interaction between the intracellular termini used FPs on both the N- and C-termini and concluded that in fact there is an interaction (PMID:31980622). Our channels have only a single FP, and we use a higher resolution FRET approach. Second, we aVach our citrine tag with a 11-residue linker, allowing for enhanced flexibility of the region and hopefully allowing for more space for an interaction that was posited to be between the very proximal part of the C-terminus (near the membrane and away from the tag) and the untagged N-terminus. Third, we previously showed that Stomatin, a much larger protein than the NTD, could bind the distal C-terminus of rASIC3 with a large fluorescent protein connected by the same linker on the C-terminus. In the case of Stomatin, the interaction involved the residues at the distal portion of the C-terminus close to the bulky FP. Interestingly, while we did not publish this, without this flexible linker, Stomatin could not regulate the channel and likely did not bind.

      Despite this, we agree that this is possible and have added a statement in our limitations section explicitly saying this.

      Figure 2 supplement 1 shows apparent read-through of the N-terminal stop codons. Given that most of the paper uses N-terminal ANAP tags, this figure should be moved out of the supplement. Do Nterminally truncated subunits form functional channels? Do the authors expect N-terminally truncated subunits to co-assemble in trimers with full-length subunits? The authors should include a more explicit discussion regarding the effect of truncated channels on their FRET signal in the case of such co-assembly.

      The positions that show readthrough (E6, L18, H515) were not used in the study. We eliminated them largely on the basis of these westerns. We elected to put the bulk of the blots in the supplement simply because of how many there were. We believe this is the best compromise. It allows us to show representative blots for all our positions without making an illegible figure with 7 blots.

      The N-terminally truncated subunits would create very short peptides that are not able to create functional channels. A premature stop at say E8 would create a 7-mer. Our longest N-terminal truncation would only create a protein of 32 amino acids. These don’t contain the transmembrane segments and thus cannot make functional channels.

      As the epitope used for the western blots in Figure 2 and supplements is part of the C-terminal tag, these blots do not provide an estimate of the fraction of C-terminally truncated channels (those that failed to incorporate ANAP at the stop codon). What effect would C-terminally truncated channels have on the FRET signal if incorporated into trimers with full-length subunits?

      Alternatively, C-terminally truncated subunits would be able to form functional channels because they contain the full N-terminus, the transmembrane domains, the extracellular domain and a portion of the C-terminus. We don’t think this is a major contaminant to our experiments. The only two C-terminal ANAP positions we use are 464 and 505. In each of these cases, they are only used for memFRET. The ones that do not contain ANAP are essentially “invisible” to the experiment. Since we are measuring their proximity to the membrane, having some missing should not maVer. However, there is some chance that truncations in some subunits could allosterically affect the position of the CT in other subunits. We have added a discussion of this in the manuscript.

      Some general discussion of these results in the context of trimeric channels would be helpful. Is the putative interaction of the termini within or between subunits? Are the distances between subunits large enough to preclude FRET between donors on one subunit and acceptor ions bound on multiple subunits?

      Thank you for this comment. We did not directly test whether the distances are within or between subunits. We considered using a concatemer to do this, however, the concatemeric channels do not express particularly well. Then, UAA incorporation hurts the expression as well. It was unlikely we would be able to get sufficient expression for tmFRET.

      However, the Maclean group has previously tested this using FRET between concatenated subunits and determined that FRET is stronger within than between subunits. We have updated the manuscript to reflect a more thorough discussion of our results in the context of their trimeric assembly.

      The authors conclude that the relatively small amount of FRET between the cytoplasmic termini suggests that the interaction previously modeled in Rosetta is unlikely. Is it possible that the proposed structure is correct, but labile? For example, could it be that the FRET signal is the time average of a state in which the termini directly interact (as in the Rosetta model) and one in which they do not?

      The proposed RoseVa model does not include the reentrant loop of the channel, so it is probable that this model would change if it were redone to include this new feature of the channel.

      However, we do discuss the limitation of FRET as a method that measures a time average that is weighted towards closest approach in our discussion section. The termini are most certainly dynamic and it is possible that spend some time in close proximity. Given that FRET is biased towards closest approach, we actually think this strengthens our argument that the termini don’t spend a great deal of time in complex. In addition, our MST data suggests that the termini do not bind. We have added some commentary on this to the discussion section for clarity.

      Reviewer #2 (Public Review):

      Summary:

      The authors use previously characterised FRET methods to measure distances between intracellular segments of ASIC and with the membrane. The distances are measured across different conditions and at multiple positions in a very complete study. The picture that emerges is that the N- and C-termini do not associate.

      Strengths:

      Good controls, good range of measurements, advanced, well-chosen and carefully performed FRET measurements. The paper is a technical triumph. Particularly, given the weak fluorescence of ANAP, the extent of measurements and the combination with TETAC is noteworthy.

      The distance measurements are largely coherent and favour the interpretation that the N and C terminus are not close together as previously claimed.

      Weaknesses:

      One difficulty is that we do not have a positive control for what binding of something to either N- or Cterminus would look like (either in FRET or otherwise).

      We acknowledge that this is a challenge for the approach. Having a positive control for binding would be great but we are not sure such a thing exists. You could certainly imagine a complex between two domains where each label (ANAP and TETAC) are pointed away from one other (giving comparatively modest quenching) or one where they are very close (giving comparatively large quenching), both of which could still be bound. This is essentially a less significant version of the problem with using FPs to measure proximity…they are not very good proxies for the position of the termini. These small labels are certainly beVer proxies but still not perfect. Our conclusion here is based more on the totality of the data. We tried many combinations and saw no sign of distances closer than ~ 20A at resting pH. We think the simplest explanation is that they are not close to one another but we tried to lay out the limitations in the discussion.

      One limitation that is not mentioned is the unroofing. The concept of interaction with intracellular domains is being examined. But the authors use unroofing to measure the positions, fully disrupting the cytoplasm. Thus it is not excluded that the unroofing disrupts that interaction. This should be mentioned as a possible (if unlikely) limitation.

      Thank you for your comment. We discuss unroofing as a potential limitation because it exposes both sides of the plasma membrane to changes in pH. We have updated this section to include acknowledgement of the possibility that unroofing disrupts the interaction via washout of other critical proteins.

      Reviewer #3 (Public Review):

      Summary: The manuscript by Cullinan et al., uses ANAP-tmFRET to test the hypothesis that the NTD and CTD form a complex at rest and to probe these domains for acid-induced conformational changes. They find convincing evidence that the NTD and CTD do not have a propensity to form a complex. They also report these domains are parallel to the membrane and that the NTD moves towards, and the CTD away, from the membrane upon acidification.

      Strengths:

      The major strength of the paper is the use of tmFRET, which excels at measuring short distances and is insensitive to orientation effects. The donor-acceptor pairs here are also great choices as they are minimally disruptive to the structure being studied.

      Furthermore, they conduct these measurements over several positions with the N and C tails, both between the tails and to the membrane. Finally, to support their main point, MST is conducted to measure the association of recombinant N and C peptides, finding no evidence of association or complex formation.

      Weaknesses:

      While tmFRET is a strength, using ANAP as a donor requires the cells to be unroofed to eliminate background signal. This causes two problems. First, it removes any possible low affinity interacting proteins such as actinin (PMID 19028690). Second, the pH changes now occur to both 'extracellular' and 'intracellular' lipid planes. Thus, it is unclear if any conformational changes in the N and CTDs arise from desensitization of the receptor or protonation of specific amino acids in the N or CTDs or even protonation of certain phospholipid groups such as in phosphatidylserine. The authors do comment that prolonged extracellular acidification leads to intracellular acidification as well. But the concerns over disruption by unroofing/washing and relevance of the changes remain.

      We acknowledge that unroofing is a limitation of our approach and noted it in the discussion. However, we have updated the section to include the possibility that the act of unroofing and washing could also disrupt the potential interaction between the intracellular domains as well as between these domains and other intracellular proteins. This was the best approach we could use to address our questions and it required that we unroof the cells. However, we look forward to future studies or new techniques that do not require the unroofing of the cells.

      The distances calculated depend on the R0 between donor and acceptor. In turn, this depends on the donor's emission spectrum and quantum yield. The spectrum and yield of ANAP is very sensitive to local environment. It is a useful fluorophore for patch fluorometry for precisely this reason, and gating-induced conformational changes in the CTD have been reported just from changes in ANAP emission alone (PMID 29425514). Therefore, using a single R0 value for all positions (and both pHs at a single position) is inappropriate. The authors should either include this caveat and give some estimate of how big an impact changes spectrum and yield might have, or actually measure the emission spectra at all positions tested.

      This is a reasonable concern and one we considered. Measuring the quantum yield would be quite difficult. However, we have measured spectra at a number of positions and see a relatively minimal shik in the peak. Most positions peak between 481 and 484nm. If you calculate the difference in R0 using theoretical spectra with a blue shik of 20nm, the difference in R0 is only ~1.5A. A shik of 20nm is on the higher side of anything we have seen in the literature (PMID 30038260) and since even with that large a shik, the difference is minimal we do not think measuring spectra for each position would impact the overall conclusions presented. As you noted, though, the quantum yield also changes. Assuming a change in yield from 0.22 to 0.47, the largest we found reported in the literature (PMID:29923827) , the R0 would increase by 2A. This same paper showed that the blue shiked position was the one with the higher extinction coefficient so these changes would be working in opposition to one another making the difference in R0 even smaller. It is important to note, that while tmFRET is a much more powerful measure of distance than standard FRET, these distances, as you point out, are quite challenging to measure precisely. Our conclusions are based less on the absolute distances and more on the observation that no positions show large quenching and that if there is any change upon acidification, it is in the wrong direction.

      Overall, the writing and presentation of figures could be much improved with specific points mentioned in the recommendations for authors section.

      See below.

      The authors argue that the CTD is largely parallel to the plasma membrane, yet appear to base this conclusion on ANAP to membrane FRET of positions S464 and M505. Two positions is insufficient evidence to support such a claim. Some intermediate positions are needed.

      We do not see in the paper where we suggest that the CTD is parallel. However, your point that we could try and determine if this was the case is correct. However, we aVempted to create several other CTD TAG mutants but struggled with readthrough and poor expression of these mutants so we opted to just include S464 and M505. Our point from these data is only that the distal CTD (505) must spend significant time near the membrane to explain our FRET data.

      Upon acidification, NTD position Q14 moves towards the plasma membrane (Figure 8B). Q14 also gets closer to C515 or doesn't change relative to 505 (Figures 7C and B) upon acidification. Yet position 505 moves away from the membrane (Figure 8D). How can the NTD move closer to the membrane, and to the CTD but yet the CTD move further from the membrane? Some comment or clarification is needed.

      This is a reasonable question and one that is hard to definitively answer. Our goal here was to test the hypothesis that the termini are bound at rest. Mapping the precise positions of the termini is difficult for reasons we will enumerate in the question that asks why we didn’t make a model. There are potentially multiple explanations but the easiest one would be that the CTD could move away from the membrane but closer to Q14, for instance, if the distal termini, say, rotated towards the NTD. This would move 505 closer and have no impact on whether or not the NTD and CTD moved away or toward the membrane.

      Reviewer #1 (Recommendations For The Authors):

      Minor concerns

      The authors show the spectrum of ANAP attached to beads and use this spectrum to calculate R0 for their FRET measurements. Peak ANAP fluorescence is dependent on local environment and many reports show ANAP in protein blue-shiked relative to the values reported here. How would this affect the distance measurements reported?

      This is an important point. See above for the answer.

      Could the lack of interaction between the N- and C-terminal peptides in Figure 7 arise from the cysteine to serine mutations or lack of structure in the synthetic peptides. How were peptide concentrations measured/verified for the experiment?

      It is possible that cysteine to serine mutations could prevent the interaction. It is also possible that these peptides are not capable of adopting their native fold without the presence of the plasma membrane or due to being synthetically created. However, the termini are thought to be largely unstructured. We received these peptides in lyophilized form at >95% purity and resuspended to our desired stock concentration (3 mM C-terminus, 1 mM N-terminus). Even if our concentration was off, we see no signs of interaction up to quite a high concentration.

      How was photobleaching measured for correcting the data?

      We executed several mock experiments at various TAG positions using either pH 8 and pH 6, where we performed the experiments as usual but with a mock solution exchange when we would normally add the metal. We normalized the L-ANAP fluorescence to the first image and averaged together these values for pH 8 and pH 6. We then corrected using Equation 2 in the manuscript..

      We have updated the methods to include how we adjusted for bleaching.

      The authors may wish to make it more explicit that their Zn2+ controls also preclude the possibility that a changing FRET signal between ANAP and citrine may affect their data.

      Thank you for this comment. We agree, it would strengthen the manuscript to include this statement. We have now included this.

      It might be useful to the reader if the authors could include (as a supplement) plots of their data (like in Figure 6), in which FRET efficiency has been converted to distance.

      We considered this idea as well but felt like showing the actual data in the figures and the distances in a table would be best.

      Figure 5D is mentioned in the text before any other figures. This is unconventional. Could this panel be moved to Figure 1 or the mention moved to later?

      Changed

      western blot is not capitalized.

      Changed.

      Figure 1, the ANAP structure shown is the methyl ester, which is presumably cleaved before ANAP is conjugated to the tRNA. The authors may wish to replace this with the free acid structure.

      This is a fair point. We originally used the methyl ester structure to indicate the version of ANAP we chose to use. However, you are correct that the methyl ester is cleaved before conjugation to the tRNA. We replaced the methyl ester with the free acid structure to clarify this.

      Figures 1 and 4 should have scale bars for the images.

      Scale bars have been added to figures 1, 4, and 5.

      In Figure 3, the letters in the structures (particularly TETAC) are way too small. Please increase the font size.

      Changed

      In Figure 3 and Figure 3 supplement 1, the axes are labeled "Absorbance (M-1cm-1)." Absorbance is dimensionless. The authors are likely reporting the extinction coefficient.

      Thank you for catching this. We adjusted the axes to extinction coefficient.

      In Figures 5 B and C, it might be clearer if the headers read "Initial, +Cu2+/TETAC, DTT" rather than "Initial, FRET, Recovery."

      Changed

      The panel labels for Figure 8 seem to be out of order.

      Changed

      The L for L-ANAP should be rendered, by convention, in small caps.

      This is a good example of learning something new from the review process. This is the first I have ever heard of small caps. We can find no other papers that use small caps for L-ANAP so I am not 100% sure what convention this is referring to and don’t want to change the wrong thing in the paper. We are happy to change if the editorial staff at eLife agree but have lek this for now.

      Reviewer #2 (Recommendations For The Authors):

      With so many distances measured, why was not even a basic structural model attempted?

      We certainly considered it, but a number of things lead us to conclude that it might imply more certainty about the structure of these termini than we hope to give. 1) Given that the FRET is a time average of positions, these distance constraints would not do much constraining. 2) Given that the termini are likely unstructured and flexible this makes the problem in 1 worse. 3) There is no structural information to use as a starting point for a model. 4) The flexibility of the linkers for each FRET pair also introduces uncertainty. This can, in theory, be modeled as they do in EPR but all of this together made us decide not to do this. What we hope readers take home, is the overall picture of the data is not consistent with the original RIPK1 hypothesis.

      Maybe it would be good to draw a band on the graphs in Figure 6 for the FRET signal expected for interaction (and thus, disfavoured by these data). This would at least give context.

      We agree this could be helpful, but it is not so easy to do. What distance would we choose? We could put a line at ~5Å (the model predicted distance). As we noted above, a number of distances could be compatible with an interaction. However, we think it’s unlikely that if a complex was formed that none of our measurements would show a distance closer than 20Å at rest and that an unbinding event would then lead to a decrease in distance. This, to us, is the take home message.

      Minor points:

      "Aker unroofing the cells, only fluorescence associated with the "footprint", or dorsal surface, of the cell membrane is lek behind."

      The authors use dorsal and ventral in this section to describe parts of an adherent cell. But in the first instance, they remove the dorsal part of the cell, and then in this phrase, the dorsal part is lek behind....I am a bit confused.

      Thank you for pointing out this mistake, we have fixed this. It is indeed the ventral surface lek behind.

      "bind at rest an" - and?

      Changed

      "One previous study used a different approach to try and map the topography of the intracellular termini of ASIC1a comparable to our memFRET experiments." I think a citation is due.

      Citation added

      "great deal of precedent" even if this result is from my own lab, I would prefer that the authors note that it's one study from one lab! I think best just to delete "great deal of".

      “Great deal of” deleted

      I think the column "Significance" in the tables is unnecessary when the P value is given.

      Thank you for this suggestion. We agree and have made the change.

      Figure 7a Q14TAG has a clearly bimodal distribution at pH 8. What could be the meaning of this result? The authors do not mention it that I could find. Perhaps there is no meaning. The authors should state what they think is (or is not) going on.

      This is a good question and we don’t have a good answer. It appears to be experimental variability. The data from the “low fret” in this experimental condition all came from the same days. So something was different that day. We considered that they might be outliers to exclude but thought showing all of our data was the beVer path. We reperformed the ANOVA here separating out the “outlier” day and nothing of substance changed. Both populations were still different with P value less than 0.001.

      Typo: Lumencore

      Changed

      Maybe just a matter of taste but the panel created with Biorender in Figure 8 is not attractive and depicts the channel differently to in Figure 5D, which is again different from Figure 1A. Surely one advantage of using computer-generated artwork could be to have consistency.

      We agree and have used the same cartoon for all of our images with the one exception being the schematics that are just meant to show the positions that are present in each bar graph.

      Figure 4A was squashed to fit (text aspect ratio is wrong).

      Fixed

      Reviewer #3 (Recommendations For The Authors):

      Citrine is used to report incorporation. Yet citrine has a strong tendency to dimerize (PMID 27240257). Did the authors use mCitrine or just Citrine? This is quite important in interpreting their data.

      Thank you for pointing out this important distinction. We use mCitirine which we have added to the methods.

      The manuscript has numerous instances of imprecise language. For example, page 10, last para, first line, "previous studies have looked at..." or page 7, final paragraph "tell a similar story". Related, the figures could be much better. For example, in Figure 1, where the authors depict the anap chemical in red, as opposed to the blue one might expect of a blue emiqng fluorophore. In figure 6, ANAP is also in red with the quenching group in green. This is opposite to how one typically thinks of FRET with the warmer color being the acceptor not the donor. Moreover, the pH 6 condition is also colored the same shade of red as the ANAP. Labels of Cys positions would again be useful here. In Figure 3, the heteroatoms of TETAC and C18-NTA are very small and difficult to see. It would also be good to label these structures, and the spectra below, so the reader can tell at a glance without looking at the caption, what the structures and spectra arise from. Also, how are the absorption spectra normalized? This is not discussed in the methods. The lack of attention to presentation mars an otherwise nice study.

      Thank you for these points. We have made modifications to the manuscript to address these comments.

      Abstract, second last line "Aker prolonged acidification, ...", 'prolonged' could be interpreted as 'it takes a while for the domain to move' or 'the movement only happens aker a while'. This not what the authors intend to convey. Consider modifying to just 'Aker acidification,'

      We updated the main text to indicate that prolonged acidification is intended to describe acidification that occurs over the minutes timescale.

      Pdf page 6, bottom para on Anap incorporation not altering channel function: What is meant by 'steady state pH dependence of activation'? This implies the authors applied a pH stimulus, then waited until equilibrium was achieved ie. until desensitization was complete and measured the current at that point. It seems more likely they simply applied different pH stimuli and measured the peak response and that the use of 'steady state' here is a typo.

      We removed the phrase steady state.

      Same section, controls of electrophysiology allude to 485, 505 and 515 ANAP-containing channels. In fact, the authors have no way of determining what fraction (if any) of the pH evoked currents arise from channels containing Anap in those positions versus from simply having a translation stop but still functioning. This should be mentioned.

      This is correct. We cannot be sure the CTD TAG positions are not a mixture of ANAP-containing channels and truncations. See above for why we do not think this a big concern for the FRET experiments. Functionally, though, you are correct that we cannot tell. We now mention this in the paper.

      Methods, the abbreviation for SBT should be defined somewhere.

      Added.

      Methods, unroofing section, middle paragraph, the authors use nM not nm to list wavelengths of light.

      Changed.

      Figure 3C-D: There's an unexpected blip in the Anap emission spectra at ~500 nm. Are the grating efficiency of the spectrograph and quantum efficiency of the camera accounted for in these spectra?

      This is a good question. The data are not corrected for either camera efficiency or grating efficiency. We don’t have easy access to the actual data (although we can see a pdf version of each). There is a liVle blip in the grating efficiency graph that could partly explain the blip in our spectra.

      Figure 5C, were recovery experiments routinely done? If so, would be good to show more than n = 1 in the plot to get an idea of reproducibility.

      Recovery experiments were done in every experiment but are not shown for simplicity. We have included all FRET and recovery data for position Q14TAG-C469 at pH 6 in figure 5C to show reproducibility of our FRET and recovery data.

      Table 1, considering adding a Δ distance column (pH 8 versus 6) so the magnitude of changes are more easily seen.

      This is a reasonable suggestion but we decided not to include a Δ distance column. The data are whole numbers and people can easily determine the Δ distance. We felt that including that column would bring too much focus on what we think are preVy small changes. Our hope is that readers take away that the data are not consistent with complex formation between the determine and focus less on absolute distances.

      Figure 7A, Q14tag pH 8 condition has a quite a bit of spread and, likely, two populations. These data, as well as G11, are unlikely to be parametric and hence ANOVA is inappropriate. A normality test, and likely Kruskal-Wallis test is called for.

      Aker testing for normality, the data for Q14TAG C485 pH8 are non-normally distributed. However, a Kruskal Wallis is a non-parametric test for a one-way ANOVA and not applicable here. We separated the data out into population 1 and 2 and repeated the two-way ANOVA statistical test. When Q14TAG pH 8 is split into 2 populations, the statistics hardly change. When the data is not separated, Q14TAG pH 8 relative to pH 6 has a p-value <0.0001. When the 2 populations are separated, both populations relative to Q14TAG pH 6 still have a p-value of <0.0001.

    1. Author Response

      The following is the authors’ response to the original reviews.

      We very much appreciate the constructive comments provided by the reviewers. We have incorporated many of their suggestions and believe the manuscript is much improved.

      In brief, we updated the text as suggested and have included three additional panels in supplementary fig. S2E-G. This additional data provides further support that the ectopically persisting neuroblasts are actively dividing and that cell cycle defects alone do not account for temporal patterning phenotypes.

      Reviewer #1 (Public Review):

      In this manuscript, the authors are building on their previous work showing Delta-Notch regulates the entrance and exit from embryo-larval quiescence of neural stem cells of the central brain (called CB neuroblasts (NB) (PMID: 35112131)). Here they show that continuous depletion of Notch in NBs from early embryogenesis leads to cycling NBs in the adult. This - cycling NBs in the adult - is not seen in controls. The assumption here is that these Notch-RNAi NBs in adults are those that did not undergo terminal differentiation in pupal development. The authors show that Notch is activated by its ligand Delta which is expressed on the GMC daughter cell and on cortex glia. They determine that the temporal requirement for Notch activity is 0-72 hours after larval hatching (ALH) (i.e., 1st instar through mid-3rd instar at 25C). In NBs/GMCs depleted for Notch, early temporal markers were still expressed at time points when they should be off and late markers were delayed in expression. These effects were observed in ~20-40% of NBs (Figures 5 and 6). Through mining existing data sets, they found that the early temporal factor Imp - an RNA binding protein - can bind Delta mRNA. They state that Delta transcripts decrease over time (without any reference to a Figure or to published work), leading to the hypothesis that Delta mRNA is repressed by the late temporal factors. Over-expressing late factors Syp or E93 earlier in development leads to downregulation of a Delta::GFP protein trap. These results lead to a model in which Notch regulates expression of early temporal factors and early temporal factors regulate Notch activity through translation of Delta mRNA.

      There are several strengths of this study. The authors report rigorous measurements and statistical analyses throughout the study. Their conclusions are appropriate for the results. Data mining revealed an important mechanism - that Imp binds Delta mRNA - supporting the model that early temporal factors promote Delta expression, which in turn promotes Notch signaling.

      There are also several weaknesses:

      1) The activation of Notch in NBs by Delta in GMCs was already shown by this group in their Dev 2022 paper, reducing some of the impact of this study.

      In our previous work, we reported that Delta-expressing GMCs transactivate Notch in neuroblasts during the embryonic to larval transition. In the current manuscript, we show that Delta is expressed in GMCs and cortex glia and both sources transactivate Notch in neuroblasts during later developmental stages. This is in agreement with work published by others and while not novel per se, is a necessary first step for understanding which neighboring cell types control Notch pathway activity. During the embryonic to larval transition, glia do not contribute likely because they have not yet grown to ensheath CB NBs and their recently born progeny.

      2) The authors do not explain their current results in context of their prior paper (2022 Dev) until the Discussion, but this would be useful to read in the Introduction. Similarly, it would be good to mention that in the 2022 paper, they find a significant number of wor>Notch RNAi NBs at 2 AHL that are cycling. Are the adult Notch RNAi in this study descended from those NBs at 2 hours ALH in the 2022 study? In other words, how does the early requirement for Notch between 0-72 hours ALH reported in the current study relate to the Notch-depleted NBs identified in the 2022 paper?

      We have now included the following text in the intro: “We recently reported that Notch signaling regulates CB NB quiescence during the embryonic to larval transition (Sood et al., 2022). When Notch is knocked down, some CB NBs continue dividing during this transition. We also reported that Notch activity becomes attenuated in quiescent CB NBs because CB NBs are no longer dividing and producing Delta-expressing GMC daughters for Notch pathway transactivation. Moreover, low Notch is necessary for CB NBs to reactivate from quiescence in response to dietary nutrients (Sood et al., 2022).

      Here we report that Notch signaling also regulates neurogenesis termination during pupal stages. When Notch is knocked down, CB NBs maintain early temporal factor expression longer resulting in a delay of late temporal factor expression with prolonged neurogenesis into late pupal stages and early adulthood. This defect in temporal patterning (switching from early to late) occurs after CB NB exit from quiescence suggesting that Notch is required at multiple times throughout development in controlling CB NB proliferation decisions.”

      We do not know whether the neuroblasts that fail to enter quiescence are the same that fail to terminate divisions during pupal stages, however there are many more that fail to terminate divisions during pupal stages.

      3) Most of the experiments rely upon continuous depletion of Notch from embryonic stage 8 until adulthood using the wor-GAL4 driver. There is no lineage tracing of this driver and there is no citation about the published expression pattern of this driver. The inclusion of these details is important for a broad audience journal.

      The reference for the driver is included in supplementary data, under the heading “Experimental model:Drosophila melanogaster”. This GAL4 driver is widely used and one of the most accepted in the field.

      4) Most of the experiments utilize a single RNAi transgene for Notch, Delta, Imp, Syp, E93. There are no experiments demonstrating the efficacy of the RNAi lines and no references to prior use and/or efficacy of these lines.

      All RNAi lines used in these studies have been published previously, by our group as well as others and sources for the lines are listed in supplementary data, under the heading “Experimental model:Drosophila melanogaster”. Efficiency of these lines have been verified using antibody labeling (data not shown) and by assaying activity of Notch activity reporters (shown in Fig. 2).

      An appraisal: The authors use temperature shifts with Gal80TS to show that Notch is required between 0-72 hours ALH. They show with the use of known markers of the temporal factors and Delta protein trap, that Imp promotes Delta protein expression and the later temporal factors reduce Delta, although the molecular mechanisms are not clearly delineated. Overall, these data support their model that the reduction of Delta expression during larval development leads to a loss of Notch activity.

      As noted in the Discussion, this study raises many questions about what Notch does in larval CB NBs. For example, does it inhibit Castor or Imp? Is Notch required in certain neural lineages and not others. These studies will be of interest in the community of developmental neurobiologists.

      Reviewer #2 (Public Review):

      Embryonic stem cells extensively proliferate to generate the necessary number of cells that are required for organogenesis, and their proliferation must be timely terminated to allow for proper patterning. Thus, timely termination of stem cell proliferation is critical for proper development. Numerous studies have suggested that cell-extrinsic changes in the surrounding niche environment drive the termination of stem cell proliferation. By contrast, cell-intrinsic mechanisms that terminate stem cell proliferation remain poorly understood. Fruit fly larval brain neuroblasts provide an excellent model for mechanistic investigation of intrinsic control of stem cell proliferation due to the wealth of information on molecular marks, gene functions and lineage hierarchy. Sood et al. conducted a genetic screen to identify genes that are required for the termination of neuroblast proliferation in metamorphosis and found that Notch and its ligand Delta contribute to their exit from cell cycle. They showed that knocking down Notch or delta function in larval neuroblasts allows them to persist into adulthood and remain proliferative when no neuroblasts can be detected in wild-type adult brains. By carrying out a well-designed temperature-shift experiment, the authors showed that Notch is required early during larval development to promote timely exit from cell cycle in metamorphosis. The authors went on to show that attenuating Notch signaling prolongs the expression of temporal identity genes castor and seven-up perturbing the switch from Imp to Syp/E93. Finally, they showed that knocking down Imp function or overexpressing E93 can restore the elimination of neuroblasts in Notch/delta mutant brains.

      Overall, the experiments are well conceived and executed, and the data are clear. However, the data reported in this study represent incremental progress in improving our mechanistic understanding of the termination of neuroblast proliferation.

      We respectfully disagree with this statement. Because Notch signaling is implicated in neurogenesis termination and Notch activity is regulated by GMCs and glia, it strongly suggests that NB proliferation and timing cues are controlled in a non-autonomous manner through direct interactions with NBs and their neighbors. This is in contrast to temporal patterning during embryogenesis which is largely believed to be controlled NB-autonomously. In addition, to our knowledge, no one has yet reported that CB NBs fail to terminate cell divisions on time when Notch activity is reduced during normal development. In fact, reported NB phenotypes associated with Notch loss of function have been surprisingly subtle until now.

      Some of the data seem to represent more careful analyses of previously published observations described in the Zacharioudaki et al., Development 2016 paper while others seem to contradict to the results in this study.

      The Zacharioudaki et al., Development 2016 paper is terrific. One key difference between our work and theirs, is that we look at Notch pathway knockdown and loss of function phenotypes, whereas in the Zacharioudaki 2016 paper, the authors report phenotypes associated with Notch constitutive activation. It has been known for some time that constitutively active Notch leads to tumorigenic phenotypes particularly in type II lineages. Zacharioudaki and colleagues further determined that some of the classically known temporal transcription factors were ectopically expressed in these stem cell tumors.Here we show that under normal developmental conditions, Notch pathway activity controls CB NB temporal patterning.

      Gaultier et al., Sci. Adv. 2022 suggested that Grainyhead is required for the termination of neuroblast proliferation in a neuroblast tumor model, and grainyhead is a direct target of Notch signaling. Thus, Grainyhead should be a key downstream effector of Notch signaling in terminating castor and seven-up expression. Identical to Notch signaling, Grainyhead is also expressed through larval development. Grainyhead can function as a classical transcription factor as well as a pioneer factor raising the possibility that temporal regulation of neurogenic enhancer accessibility might be at play in allowing Notch signaling in early larval development to set up termination of castor and seven-up expression in metamorphosis. Diving deeper into how dynamic changes in chromatin in neurogenic enhancers affect the termination of neuroblast proliferation will significantly improve our understanding of termination of stem cell proliferation in diverse developing tissue.

      Reviewer #3 (Public Review):

      In this study, the authors investigate the effects of Notch pathway inactivation on the termination of Drosophila neuroblasts at the end of development. They find that termination is delayed, while temporal patterning progression is slowed down. Forcing temporal patterning progression in a Notch pathway mutant restores the correct timing of neuroblast elimination. Finally, they show that Imp, an early temporal patterning factor promotes Delta expression in neuroblast lineages. This indicates that feedback loops between temporal patterning and lineage-intrinsic Notch activity fine tunes timing of early to late temporal transitions and is important to schedule NB termination at the end of development.

      The study adds another layer of regulation that finetunes temporal progression in Drosophila neural stem cells. This mechanism appears to be mainly lineage intrinsic - Delta being expressed from NBs and their progeny, but also partly niche-mediated - Delta being also expressed in glia but with a minor influence. Together with a recent study (PMID: 36040415), this work suggests that Notch signaling is a key player in promoting temporal progression in various temporal patterning system. As such it is of broad interest for the neuro-developmental community.

      Strengths

      The data are based on genetic experiments which are clearly described and mostly convincing. The study is interesting, adding another layer of regulation that finetunes temporal progression in Drosophila neural stem cells. This mechanism appears to be mainly lineage intrinsic - Delta being expressed from NBs and their progeny, but also partly niche-mediated - Delta being also expressed in glia but with a minor influence. A similar mechanism has been recently described, although in a different temporal patterning system (medulla neuroblasts of the optic lobe - PMID: 36040415). It is overall of broad interest for the neuro-developmental community.

      Weaknesses

      The mechanisms by which Notch signaling regulates temporal patterning progression are not investigated in details. For example, it is not clear whether Notch signaling directly regulates temporal patterning genes, or whether the phenotypes observed are indirect (for example through the regulation of the cell-cycle speed). The authors could have investigated whether temporal patterning genes are directly regulated by the Notch pathway via ChIP-seq of Su(H) or the identification of potential binding sites for Su(H) in enhancers.

      This is already known for svp and cas and we have now included this information in the discussion.Thank you.

      “Whether Notch pathway activity curtails both Cas and Svp or just Cas remains an open question, however it has been reported that both cas and svp are associated with at least one enhancer that is responsive to Notch activity (Zacharioudaki et al., 2016).”

      A similar approach has been recently undertaken by the lab of Dr Xin Li, to show that Notch signaling regulates sequential expression of temporal patterning factors in optic lobes neuroblasts (PMID: 36040415), which exhibit a different temporal patterning system than central brain neuroblasts in the present study. As such, the mechanistic insights of the study are limited.

      Reviewer #1 (Recommendations For The Authors):

      1) There are missing controls

      a) Fig. 1F and Fig. 6A - The authors should generate and show images of control clones (FRT19A) stained with the same markers as Notch clones.

      Fig. 1F is at 48 hours APF. In control clones, there are no Dpn positive cells present, as stated in the text and therefore no confocal images are shown. Same for Fig. 6A, there are no Dpn positive cells in control clones in the brain at this time, therefore nothing to double label.

      2) This result is incorrectly described in the Results

      a) P. 5 "Ectopically persisting N RNAi CB NBs expressed the NB transcription factor Deadpan (Dpn), the S-phase indicator pcnaGFP, and were small on average, similar in size to control CB NBs at earlier pupal stages (Fig. 1B,C,E)." The Notch RNAi NBs were larger (not smaller) than controls in Fig. 1E at 30, 48, 72 h APF and in adults.

      Thank you for this comment. We have changed the language in the main text as follows:

      “Ectopically persisting N RNAi CB NBs (CB NBs at 48 hours APF and beyond) expressed the NB transcription factor Deadpan (Dpn), the S-phase indicator pcnaGFP, and were small on average compared to control CB NBs during earlier developmental stages (L3 control, average diameter 10-15μms) (Fig. 1B,C,E). However, at 30 hours APF when control CB NBs are still present, N RNAi CB NBs were larger on average (Fig. 1B,C,E).”

      3) This sentence needs clarification/editing

      a) P. 4: " Independent of neurogenesis timing and the mechanism by which CB NB stop divisions, temporal patterning plays a key role". A key role in what?

      Thank you again. We have changed the text to the following:

      “Independent of neurogenesis timing and the mechanism by which CB NB stop divisions, temporal patterning plays a key role in controlling numbers and types of neurons made within each of the NB lineages (Maurange et al., 2008; Tsuji et al., 2008; Bahrampour et al., 2017; Yang et al., 2017; Pahl et al., 2019).”

      4) Some sentences need references or data to support them.

      a) P. 9 Please provide a reference to support the statement that Delta is a known Notch target

      We have included a reference.

      b) P. 9 - please provide a reference or data to support the statement that Delta transcripts decrease over time in larval CB NBs.

      This result is shown in Fig. 7B.

      5) Fig. 7A - it is difficult to appreciate the purple highlighting.

      We have changed the colors as suggested.

      Reviewer #2 (Recommendations For The Authors):

      1) In Fig. 4C, why does late knockdown of delta lead to ectopic persistence of NBs but late knockdown of Notch has no effect?

      This could be due to many things including differences in efficiency of UAS-RNAi lines. The point is that Delta/Notch is required early, but not late. Although some DeltaRNAi CB NBs are still present, the number compared to 48 hours APF is greatly reduced.

      2) It is surprising that Delta expression in NBs/GMCs appears to play a more important role in activating Notch signaling in neuroblasts than Delta expression in cortex glia. Please explain how Delta can cell autonomously activate Notch signaling.

      We are not proposing that Delta activates Notch cell autonomously, but are proposing that Delta in GMCs transactivates Notch in NBs. After NBs divide Delta is partitioned to GMCs. Quiescent NBs have low to no Notch pathway activity, likely because they are not producing Delta expressing GMC daughters (Sood, 2022).

      Please also reconcile the difference in gene expression induced by delta[RNAi] in this study and the delta-mutant allele used in the Zacharioudaki et al study.

      We are unsure what the reviewer is asking here and therefore can not reconcile any differences in gene expression between the dlRNAi line and the mutant allele. What gene expression needs to be reconciled? Zacharioudaki is listed as first author on four manuscripts. Which paper is being referred to?

      3) In Fig. 2J-L, why does knocking down delta in glia lead to loss of Scrib expression in neuroblasts and their surrounding progeny?

      We are not sure if it does or not. We only use Scrib as a membrane marker to identify and locate cells and neuropil regions of interest.

      4) The phrase "Notch is active early" is misleading when multiple labs have shown that Notch signaling is active in neuroblasts throughout larval development.

      Good point! We have rewritten the statement: “Somewhat paradoxically, we find that early Notch activity is required to terminate CB NB divisions late.”

      5) Neuroblasts that persist into adulthood are "smaller and Dpn-positive/PCNA-GFP-positive". Are they really neuroblasts? Can the authors verify the identity of these "persistent neuroblasts" with other molecular markers as well as functional assessment by inducing lineage clones?

      We have no doubt that these cells are NBs. Because we examine brains over time, these cells can be tracked using the markers, Scrib, Dpn, and pcna. These cells also undergo asymmetric cell division (Refer to Fig. S2F) and express other markers characteristic of CB NBs (mir and insc-not shown). We have made clones and see the same phenotype (ectopic persistence) in both MARCM clones and in “flip-out” clones.

      Reviewer #3 (Recommendations For The Authors):

      I have a few issues that need to be addressed to reinforce some of the conclusions:

      1) It is unclear whether NBs that persist in late pupal or adult stages have just failed to differentiate or whether they continue to divide, leading to supernumerary progeny (as shown for NBs that are stalled in temporal patterning like in svp mutant NBs (Maurange et al. 2008)). EdU or PH3 staining could be done in adults to clarify this point.

      In this manuscript, we make use of pcna:GFP, a reporter for E2F activity as an indicator of cell proliferation. We certainly observe Dpn positive cells that only weakly express the reporter, suggesting that these cells are not actively dividing or dividing at a reduced rate. However, by far most of the ectopically persisting CB NBs strongly express the reporter and generate pcnaGFP expressing progeny, indicating that these cells are dividing. We have also stained tissues with PH3 and have included an image of a telophase dlRNAi expressing CB NB at 48 hours APF (Fig. S2F).

      2) It is unclear whether Notch signaling directly or indirectly regulates temporal transitions. One possibility is that knockdown of Notch signaling decreases cell-cycle speed leading to delayed temporal transitions. The authors should test whether Notch KD affects cell cycle speed using EdU incorporation or PH3 staining. This could be done best using Notch mutant MARCM clones as wt NBs can be used as controls.

      We have quantified the number of PH3 positive CB NBs during wandering L3 stages in control and dlRNAi animals. We find that dlRNAi CB NBs are indeed proliferating at reduced rates compared to controls. To test whether reduced cell cycle times are causative for termination delay, we expressed a constitutively active form of PI3-kinase in dlRNAi animals to drive cell growth and proliferation. We found that CB NBs still ectopically persist (Fig. S2E-G).

      We have included the following in the text:

      “Defects in timing of temporal transitions could be due to defects in cell cycle progression, although embryonic NBs still transition independent of cell division (Grosskortenhaus et al., 2005). We used PH3 to assay CB NB mitotic activity. In Delta knock down animals, the percentage of PH3 positive CB NBs was reduced compared to control (Fig. S2E). At 48 h APF however, Delta knock down CB NBs were still dividing based on PH3 expression (Fig. S2F). To determine whether CB NBs ectopically persist due to defects in cell cycle rate, we co-expressed dp110 to constitutively activate PI3-kinase in Delta knock down animals. A significant number of pcnaGFP expressing, Dpn positive CB NBs were still observed, suggesting that defects in cell cycle timing and growth rates alone cannot account for ectopic persistence of CB NBs into later developmental stages and adulthood (Fig. S2G).”

      3) Cas is expressed in NBs either during quiescence and shortly after quiescence. It is possible that the maintenance of Cas in Figure 5D, E is due to NBs that have not re-entered the cell-cycle or have exited quiescence with a strong delay.

      Knockdown of Notch pathway has no effect on CB NB reactivation from developmental quiescence. In fact, low levels of Notch are required for CB NBs to reactivate in response to dietary nutrients (Sood, 2022).

      Indeed, the authors have previously shown that Notch signaling is important for NB cell cycle reentry during early larval stages (PMID: 35112131). Are Cas and Svp also maintained in late larval N-/MARCM clones (MARCM clonew are made after quiescence exit)?

      We have not assayed Cas or Svp expression past 48 hours ALH.

      4) The authors have revisited some previously published RNA-seq data showing that Delta is temporally regulated in NB lineages. This is not clearly shown by the authors that the same is true at the protein level.

      Moreover, they find that mis-expression of late temporal factors or Imp knockdown in early larval brains appear to decrease Delta expression. Such semi-quantitative analysis of gene expression by immunostainings in different conditions can be a bit complicated and not very convincing because variations on intensity levels can be due to slight variations in antibody concentration, or different parameters of image acquisition.

      We totally agree, but in this case the difference compared to controls was so readily apparent, that we felt it was not necessary to carry out experiments in clones. All images were acquired with the same confocal settings, experiments were repeated, and we consistently observed the same results. The data shown in Fig. 7D-G is representative.

      I suggest that the authors use clonal analysis rather than pan-neuroblast manipulation in order to have internal controls. For example, blocking temporal progression in Syp-RNAi clones (MARCM or Flp-out) and/or svp MARCM clones should lead to maintenance of Imp expression in late larval clones and maintenance of high levels of Delta, which would be easily assessed compared to surrounding NBs.

      Minor points:

      Fig 5: the sequential expression of Cas and Svp expression in larval NBs was first described by Maurange et al. 2008. Please cite appropriately.

      We have now added the requested citation to the following:

      “Over time, the percentage of Cas expressing CB NBs declined, while Svp expressing CB NBs modestly increased (Fig. 5B). Less than 1% of CB NBs co-expressed Cas and Svp at any stage and expression of both factors was absent by 48 hours ALH (Fig. 5B,C). This is consistent with work published previously (Isshiki et al., 2001; Tsuji et al., 2008; Chai et al., 2013; Maurange et al., 2008; Ren et al., 2017; Syed et al., 2017).”

      Fig 6A: Please indicate which immunostainings are shown in the overlay panels.

      Good catch! We have modified the figure.

      P9: "Delta co-immunoprecipitated with Imp.": Add "Delta mRNA co-immunoprecipitated with Imp in RIP-seq experiments" Otherwise, it suggests that you are talking about the protein.

      Done

      The scheme in Figure 7H is rather complicated to understand. In my opinion, it does not clearly convey the idea that Notch signaling favors the Imp-to-Syp transition.

      We have made a new model figure.

  9. Oct 2023
    1. Author Response

      We appreciate the editor's and reviewers' time to review our manuscript. We will work on the suggestions and have provided an initial assessment of what we can do for our revised submission.

      Reviewer #1 (Public Review):

      Summary:

      This study aimed to investigate the effects of optically stimulating the A13 region in healthy mice and a unilateral 6-OHDA mouse model of Parkinson's disease (PD). The primary objectives were to assess changes in locomotion, motor behaviors, and the neural connectome. For this, the authors examined the dopaminergic loss induced by 6-OHDA lesioning. They found a significant loss of tyrosine hydroxylase (TH+) neurons in the substantia nigra pars compacta (SNc) while the dopaminergic cells in the A13 region were largely preserved. Then, they optically stimulated the A13 region using a viral vector to deliver the channelrhodopsine (CamKII promoter). In both sham and PD model mice, optogenetic stimulation of the A13 region induced pro-locomotor effects, including increased locomotion, more locomotion bouts, longer durations of locomotion, and higher movement speeds. Additionally, PD model mice exhibited increased ipsilesional turning during A13 region photoactivation. Lastly, the authors used whole-brain imaging to explore changes in the A13 region's connectome after 6-OHDA lesions. These alterations involved a complex rewiring of neural circuits, impacting both afferent and efferent projections. In summary, this study unveiled the pro-locomotor effects of A13 region photoactivation in both healthy and PD model mice. The study also indicates the preservation of A13 dopaminergic cells and the anatomical changes in neural circuitry following PD-like lesions that represent the anatomical substrate for a parallel motor pathway.

      Strengths:

      These findings hold significant relevance for the field of motor control, providing valuable insights into the organization of the motor system in mammals. Additionally, they offer potential avenues for addressing motor deficits in Parkinson's disease (PD). The study fills a crucial knowledge gap, underscoring its importance, and the results bolster its clinical relevance and overall strength.

      The authors adeptly set the stage for their research by framing the central questions in the introduction, and they provide thoughtful interpretations of the data in the discussion section. The results section, while straightforward, effectively supports the study's primary conclusion the pro-locomotor effects of A13 region stimulation, both in normal motor control and in the 6-OHDA model of brain damage.

      We thank the reviewer for their positive comments.

      Weaknesses:

      1) Anatomical investigation. I have a major concern regarding the anatomical investigation of plastic changes in the A13 connectome (Figures 4 and 5). While the methodology employed to assess the connectome is technically advanced and powerful, the results lack mechanistic insight at the cell or circuit level into the pro-locomotor effects of A13 region stimulation in both physiological and pathological conditions. This concern is exacerbated by a textual description of results that doesn't pinpoint precise brain areas or subareas but instead references large brain portions like the cortical plate, making it challenging to discern the implications for A13 stimulation. Lastly, the study is generally well-written with a smooth and straightforward style, but the connectome section presents challenges in readability and comprehension. The presentation of results, particularly the correlation matrices and correlation strength, doesn't facilitate biological understanding. It would be beneficial to explore specific pathways responsible for driving the locomotor effects of A13 stimulation, including examining the strength of connections to well-known locomotor-associated regions like the Pedunculopontine nucleus, Cuneiformis nucleus, LPGi, and others in the diencephalon, midbrain, pons, and medulla.

      We considered two approaches initially. The first approach was to look at specific projections to the motor regions, focusing on the MLR. The second approach was to utilize a whole-brain analysis that is presented here. Given what we know about the zona incerta, especially its integrative role, we felt that a reasonable starting point was to examine the full connectome. The value of the whole-brain approach is that it provides a high-level overview of the afferents and efferents to the region. The changes in the brain that occur following Parkinson-like lesions, such as those in the nigrostriatal pathway, are known to be complex and can affect neighbouring regions such as the A13. Therefore, we wished to highlight the A13, which we considered a therapeutic target, and examine changes in connectivity that could occur following acute lesions affecting the SNc. We acknowledge that this study does not provide a causal link, but it presents the fundamental background information for subsequent hypothesis-driven, focused, region-specific analysis.

      The terms provided were from the Allen Brain Atlas terminology and were presented as abbreviations. We have looked at other ways to present it, including a greater emphasis on raw numbers and highlighting motor-related subareas. We will rewrite the connectomics section to make it more accessible, reflecting the change in the figures.

      Additionally, identifying the primary inputs to A13 associated with motor function would enhance the study's clarity and relevance.

      This is a great point and could help simplify the whole-brain results. We can present the motor-related inputs and outputs as part of a new figure in the main paper and add accompanying text in the results section. This will help highlight possible therapeutic pathways. We can also enhance our discussion of these motor-related pathways. We will retain the entire dataset and present it in a supplementary table for those who are interested.

      The study raises intriguing questions about compensatory mechanisms in Parkinson's disease and a new perspective on the preservation of dopaminergic cells in A13, despite the SNc degeneration, and the plastic changes to input/output matrices. To gain inspiration for a more straightforward reanalysis and discussion of the results, I recommend the authors refer to the paper titled "Specific populations of basal ganglia output neurons target distinct brain stem areas while collateralizing throughout the diencephalon from the David Kleinfeld laboratory." This could guide the authors in investigating motor pathways across different brain regions.

      Thank you for the advice, and as pointed out, Kleinfeld’s group had a nice, focused presentation of their data. For the connectomic piece, we can certainly adopt their reporting style, which, as you point out, may highlight key motor-related regions. There are a few ideas here that we can explore further, as mentioned above.

      2) Description of locomotor performance. Figure 3 provides valuable data on the locomotor effects of A13 region photoactivation in both control and 6-OHDA mice. However, a more detailed analysis of the changes in locomotion during stimulation would enhance our understanding of the pro-locomotor effects, especially in the context of 6-OHDA lesions. For example, it would be informative to explore whether the probability of locomotion changes during stimulation in the control and 6-OHDA groups. Investigating reaction time, speed, total distance, and even kinematic aspects during stimulation could reveal how A13 is influencing locomotion, particularly after 6-OHDA lesions. The laboratory of Whelan has a deep knowledge of locomotion and the neural circuits driving it so these features may be instructive to infer insights on the neural circuits driving movement. On the same line, examining features like the frequency or power of stimulation related to walking patterns may help elucidate whether A13 is engaging with the Mesencephalic Locomotor Region (MLR) to drive the pro-locomotor effects. These insights would provide a more comprehensive understanding of the mechanisms underlying A13-mediated locomotor changes in both healthy and pathological conditions.

      Thank you for these suggestions. We will revise as suggested. We will provide additional and/or updated data in revised figures and text. We will also move Supplementary Figures S1 and S2, which present additional locomotor data, into the main text to partly address the reviewers' points.

      Reviewer #2 (Public Review):

      Summary:

      The paper by Kim et al. investigates the potential of stimulating the dopaminergic A13 region to promote locomotor restoration in a Parkinson's mouse model. Using wild-type mice, 6-OHDA injection depletes dopaminergic neurons in the substantia nigra pars compacta, without impairing those of the A13 region and the ventral tegmentum area, as previously reported in humans. Moreover, photostimulation of presumably excitatory (CAMKIIa) neurons in the vicinity of the A13 region improves bradykinesia and akinetic symptoms after 6-OHDA injection. Whole-brain imaging with retrograde and anterograde tracers reveals that the A13 region undergoes substantial changes in the distribution of its afferents and projections after 6-OHDA injection. The study suggests that if the remodeling of the A13 region connectome does not promote recovery following chronic dopaminergic depletion, photostimulation of the A13 region restores locomotor functions.

      Strengths:

      Photostimulation of presumably excitatory (CAMKIIa) neurons in the vicinity of the A13 region promotes locomotion and locomotor recovery of wild-type mice 1 month after 6-OHDA injection in the medial forebrain bundle, thus identifying a new potential target for restoring motor functions in Parkinson's disease patients.

      Weaknesses:

      Electrical stimulation of the medial Zona Incerta, in which the A13 region is located, has been previously reported to promote locomotion (Grossman et al., 1958). Recent mouse studies have shown that if optogenetic or chemogenetic stimulation of GABAergic neurons of the Zona Incerta promotes and restores locomotor functions after 6-OHDA injection (Chen et al., 2023), stimulation of glutamatergic ZI neurons worsens motor symptoms after 6-OHDA (Lie et al., 2022).

      Thank you - we will add this reference. It is useful as Grossman did stimulate the zona incerta in the cat and elicit locomotion, suggesting that stimulation of the area in normal mice has external validity. The area targeted by Chen et al. (2023) is in the lateral aspect of central/medial zona incerta, formed by dorsal and ventral zona incerta, which may account for the differing results. Our data were robust for stimulation of the medial aspect of the rostromedial zona incerta. The thigmotactic behaviour that we observed in our work that focused on CamKII neurons has not been observed with chemogenetic, optogenetic activation or with photoinhibition of GABAergic central/medial ZI (Chen et al. 2023).

      Although CAMKIIa is a marker of presumably excitatory neurons and can be used as an alternative marker of dopaminergic neurons, behavioral results of this study raise questions about the neuronal population targeted in the vicinity of the A13 region. Moreover, if YFP and CHR2-YFP neurons express dopamine (TH) within the A13 region (Fig. 2), there is also a large population of transduced neurons within and outside of the A13 region that do not, thus suggesting the recruitment of other neuronal cell types that could be GABAergic or glutamatergic.

      We found that CamKII transfection of the A13 region was extremely effective in promoting locomotor activity, which was critical for our work in exploring its possible therapeutic potential. We acknowledge that specific viral approaches that target the GABAergic, glutamatergic, and dopaminergic circuits would be very useful. The range of tools to target A13 dopaminergic circuits is more limited than the SNc, for example, because the A13 region lacks DAT, and TH-IRES-Cre approaches, while useful, are less specific than DAT-Cre mouse models. Intersectional approaches targeting multiple transmitters (glutamate & dopamine, for example) may be one solution as we do not expect that a single transmitter-specific pathway would work, as well as broad targeting of the A13 region. Recent work suggests that GABAergic neuron activation may have more general effects on behaviour rather than control of ongoing locomotor parameters. However, this is in contrast to recent work showing a positive valence effect of dopamine A13 activation on motivated food-seeking behavior, which differs from consummatory behavior observed with GABAergic modulation (Ye, Nunez, and Zhang 2023). Chemogenetic inactivation and ablation of dopaminergic A13 revealed that they contribute to grip strength and prehensile movements, uncoupling food-seeking grasping behavior from motivational factors (Garau et al. 2023). Overall, this suggests differing effects of GABA compared to DA and/or glutamatergic cell types, consistent with our effects of stimulating CamKII.

      Regarding the analysis of interregional connectivity of the A13 region, there is a lack of specificity (the viral approach did not specifically target the A13 region), the number of mice is low for such correlation analyses (2 sham and 3 6-OHDA mice), and there are no statistics comparing 6-OHDA versus sham (Fig. 4) or contra- versus ipsilesional sides (Fig. 5). Moreover, the data are too processed, and the color matrices (Fig. 4) are too packed in the current format to enable proper visualization of the data. The A13 afferents/efferents analysis is based on normalized relative values; absolute values should also be presented to support the claim about their upregulation or downregulation.

      Generally, papers using tissue-clearing imaging approaches have low sample sizes due to technical complexity and challenges. The technical challenges of obtaining these data were substantial in both collection and analysis. There are multiple technical complexities arising from dual injections (A13 and MFB coordinates) and targeting the area correctly. The A13 region is difficult to target as it spans only around 300 µm in the anterior-posterior axis. While clearing the brain takes weeks, and light-sheet imaging also takes time, the time necessary to analyze the tissue using whole-brain quantification is labor intensive, especially with a lack of a standardized analysis pipeline from atlas registrations, signal segmentations, and quantifications. The field is still relatively new, requiring additional time to refine pipelines.

      Correlation matrices are often used in analyzing connectivity patterns on a brain-wide scale, as they can identify any observable patterns within a large amount of data. We used correlation matrices to display estimated correlation coefficients between the afferent and efferent proportions from one brain subregion to another across 251 brain regions in total in a pairwise manner (not for hypothesis testing). We provided descriptive statistics (mean and error bars) in Figure 5C and G. As mentioned in comments for Reviewer 1, we will also present data in a revised Figure 5 and/or a new figure that focuses specifically on motor-related pathways to provide information on possible therapeutic pathways. As suggested, absolute values will be shared in a supplemental table.

      In the absence of changes in the number of dopaminergic A13 neurons after 6-OHDA injection, results from this correlation analysis are difficult to interpret as they might reflect changes from various impaired brain regions independently of the A13 region.

      We acknowledge that models of Parkinson’s disease, particularly those using 6-OHDA, induce plasticity in various regions, which may subsequently affect A13 connectivity. Our aim is to emphasize the residual, intact A13 pathways that could serve as therapeutic targets in future investigations. This emphasis is pertinent in the context of potential clinical applications, as the overall input and output to the region fundamentally dictate the significance of the A13 region in lesioned nigrostriatal models. We agree with the reviewer that the changes certainly can be independent of A13; however, the fact that there was a significant change in the connectome post-6-OHDA injection and striatonigral degeneration is in and of itself important and important to document.

      There is no causal link between anatomical and behavioral data, which raises questions about the relevance of the anatomical data.

      This point was also addressed earlier in response to a comment from Reviewer 1. Focusing on specific motor pathways is one avenue to explore. However, given that the zona incerta acts as an integrative hub, we believed it is prudent to initially examine both afferent and efferent pathways using a brain-wide approach. For instance, without employing this methodology, the potential significance of cortical interconnectivity to the A13 region might not have been fully appreciated. As mentioned previously, we will place additional emphasis on motor-related regions in our revised paper, thereby enhancing the relevance of the anatomical data presented. With these modifications, we anticipate that our data will underscore specific motor-related targets for future exploration, employing optogenetic targeting to assess necessity and sufficiency.

      Overall, the study does not take advantage of genetic tools accessible in the mouse to address the direct or indirect behavioral and anatomical contributions of the A13 region to motor control and recovery after 6-OHDA injection.

      We acknowledge that our study has not specifically targeted neurons that express dopaminergic, glutamatergic, or GABAergic properties (refer to earlier comment for more detail). However, like others, we find that targeting one neuronal population often does not result in a pure transmitter phenotype. For instance, evidence suggests co-localization of dopamine neurons with a subpopulation of GABA neurons in the A13/medial zona incerta (Negishi et al. 2020). In the hypothalamus, research by Deisseroth and colleagues (Romanov et al. 2017) indicates the presence of multiple classes of dopamine cells, each containing different ratios of co-localized peptides and/or fast neurotransmitters. Consequently, we believe our work lays the foundation for the investigations suggested by the reviewer. Furthermore, if one considers this work in the context of a preclinical study to determine whether the A13 might be a target in human Parkinson's disease, the existing technology that could be utilized is deep brain stimulation (DBS) or electrical modulation, which would also affect different neuronal populations in a non-specific manner. While optogenetic stimulation therapy is longer term, using CamKII combined with the DJ hybrid AAV could be a translatable strategy for targeting A13 neuronal populations in non-human primates (Watakabe et al. 2015; Watanabe et al. 2020).

      Reviewer #3 (Public Review):

      Kim, Lognon et al. present an important finding on pro-locomotor effects of optogenetic activation of the A13 region, which they identify as a dopamine-containing area of the medial zona incerta that undergoes profound remodeling in terms of afferent and efferent connectivity after administration of 6-OHDA to the MFB. The authors claim to address a model of PD-related gait dysfunction, a contentious problem that can be difficult to treat with dopaminergic medication or DBS in conventional targets. They make use of an impressive array of technologies to gain insight into the role of A13 remodeling in the 6-OHDA model of PD. The evidence provided is solid and the paper is well written, but there are several general issues that reduce the value of the paper in its current form, and a number of specific, more minor ones. Also, some suggestions, that may improve the paper compared to its recent form, come to mind.

      Thank you for the suggestions and careful consideration of our work - it is appreciated.

      The most fundamental issue that needs to be addressed is the relation of the structural to the behavioral findings. It would be very interesting to see whether the structural heterogeneity in afferent/effects projections induced by 6-OHDA is related to the degree of symptom severity and motor improvement during A13 stimulation.

      As mentioned in comments for Reviewer 1, we will be highlighting motor-related A13 pathways in a revised Figure 5 and/or a new figure. We hope that our work will provide a roadmap for future studies to disentangle divergent or convergent A13 pathways that are involved in different or all PD-related motor symptoms. Because we could not measure behavioural change in the same animals studied with the anatomic study (essentially because the optrode would have significantly disrupted the connectome we are measuring), we cannot directly compare behaviour to structure.

      The authors provide extensive interrogation of large-scale changes in the organization of the A13 region afferent and efferent distributions. It remains unclear how many animals were included to produce Fig 4 and 5. Fig S5 suggests that only 3 animals were used, is that correct? Please provide details about the heterogeneity between animals. Please provide a table detailing how many animals were used for which experiment. Were the same animals used for several experiments?

      The behavioral set and the anatomical set were necessarily distinct. In the anatomical experiments, we employed both anterograde and retrograde viral approaches to target the afferent and efferent A13 populations with fluorescent proteins. For the behavioral approach, a single ChR2 opsin was utilized to photostimulate the A13 region; hence combining the two populations was not feasible. We were also concerned that the optrode itself would interfere with connectomics. A lower number of animals were used for the whole-brain work due to technical limitations described earlier. We will provide more details regarding numbers we can identify as a table and text.

      While the authors provide evidence that photoactivation of the A13 is sufficient in driving locomotion in the OFT, this pro-locomotor effect seems to be independent of 6-OHDA-induced pathophysiology. Only in the pole test do they find that there seems to be a difference between Sham vs 6-OHDA concerning the effects of photoactivation of the A13. Because of these behavioral findings, optogenic activation of A13 may represent a gain of function rather than disease-specific rescue. This needs to be highlighted more explicitly in the title, abstract, and conclusion.

      We agree with the reviewer that this aspect needs to be highlighted more. Optogenetic activation of A13 may represent a gain of function in both healthy and 6-OHDA mice, highlighting a parallel descending motor pathway that remains intact. 6-OHDA lesions have multiple effects on motor and cognitive function. This makes a single pathway unlikely to rescue all deficits observed in 6-OHDA models. We can say that the lack of locomotion observed in 6-OHDA models can be reversed by A13 region stimulation. We have discussed some aspects of the gain of function possibility but will augment this in other areas of the paper as well, as suggested.

      The authors claim that A13 may be a possible target for DBS to treat gait dysfunction. However, the experimental evidence provided (in particular the lack of disease-specific changes in the OFT) seems insufficient to draw such conclusions. It needs to be highlighted that optogenetic activation does not necessarily have the same effects as DBS (see the recent review from Neumann et al. in Brain: https://pubmed.ncbi.nlm.nih.gov/37450573/). This is important because ZI-DBS so far had very mixed clinical effects. The authors should provide plausible reasons for these discrepancies. Is cell-specificity, which only optogenetic interventions can achieve, necessary? Can new forms of cyclic burst DBS achieve similar specificity (Spix et al, Science 2021)? Please comment.

      Thank you for the useful comments - we will update our discussion accordingly.

      Our study highlights a parallel motor pathway provided by the A13 region that remains intact in 6-OHDA mice and can be sufficiently driven to rescue the hypolocomotor pathology observed in the OFT and overcome bradykinesia and akinesia. The photoactivation of ipsilesional A13 also has an overall additive effect on ipsiversive circling, representing a gain of function on the intact side that contributes to the magnitude of overall motor asymmetry against the lesioned side. The effects of DBS are rather complex, ranging from micro-, meso-, to macro-scales, involving activation, inhibition, and informational lesioning, and network interactions. This could contribute to the mixed clinical effects observed with ZI-DBS, in addition to differences in targeting and DBS programming among the studies (see review (Ossowska 2019)). Also the DBS studies targeting ZI have never targeted the rostromedial ZI which extends towards the hypothalamus and contains the A13. Furthermore, DBS and electrical stimulation of neural tissue, in general, are always limited by current spread and lower thresholds of activation of axons (e.g., axons of passage), both of which can reduce the specificity of the true therapeutic target. Optogenetic studies have provided mechanistic insights that could be leveraged in overcoming some of the limitations in targeting with conventional DBS approaches. Spix et al. (2021) provided an interesting approach highlighting these advancements. They devised burst stimulation to facilitate population-specific neuromodulation within the external globus pallidus. Moreover, they found a complementary role for optogenetics in exploring the pathway-specific activation of neurons activated by DBS. To ascertain whether A13 DBS may be a viable therapy for PD gait, it will be necessary to perform many more preclinical experiments, and tuning of DBS parameters could be facilitated by optogenetic stimulation in these murine models.

      In a recent study, Jeon et al (Topographic connectivity and cellular profiling reveal detailed input pathways and functionally distinct cell types in the subthalamic nucleus, 2022, Cell Reports) provided evidence on the topographically graded organization of STN afferents and McElvain et al. (Specific populations of basal ganglia output neurons target distinct brain stem areas while collateralizing throughout the diencephalon, 2021, Neuron) have shown similar topographical resolution for SNr efferents. Can a similar topographical organization of efferents and afferents be derived for the A13/ ZI in total?

      The ZI can be subdivided into four subregions in the antero-posterior axis: rostral (ZIr), dorsal (ZId), ventral (ZIv), and caudal (ZIc) regions. The dorsal and ventral ZI is also referred together as central/medial/intermediate ZI. There are topographical gradients in different cell types and connectivity across these subregions (see reviews: (Mitrofanis 2005; Monosov et al. 2022; Ossowska 2019). Recent work by Yang and colleagues (2022) demonstrated a topographical organization among the inputs and outputs of GABAergic (VGAT) populations across four ZI subregions. Given that A13 region encompasses a smaller portion (the medial aspect) of both rostral and medial/central ZI (three of four ZI subregions) and coexpress VGAT, A13 region likely falls under rostral and intermediate medial ZI dataset found in Yang et al. (2022). With our data, we would not be able to capture the breadth of topographical organization shown in Yang et al (2022).

      In conclusion, this is an interesting study that can be improved by taking into consideration the points mentioned above.

      Reviewer #1 (Recommendations For The Authors):

      1) Figure 2 indeed presents valuable information regarding the effects of A13 region photoactivation. To enhance the comprehensiveness of this figure and gain a deeper understanding of the neurons driving the pro-locomotor effect of stimulation, it would be beneficial to include quantifications of various cell types:

      • cFos-Positive Cells/TH-Positive Cells: it can help determine the impact of A13 stimulation on dopaminergic neurons and the associated pro-locomotor effect in the healthy condition and especially in the context of Parkinson's disease (PD) modeling.

      • cFos-Positive Cells /TH-Negative Cells: Investigating the number of TH-negative cells activated by stimulation is also important, as it may reveal non-dopaminergic neurons that play a role in locomotor responses. Identifying the location and characteristics of these TH-negative cells can provide insights into their functional significance.

      Incorporating these quantifications into Figure 2 would enhance the figure's informativeness and provide a more comprehensive view of the neuronal populations involved in the locomotor effects of A13 stimulation.

      Agreed - we will add quantification and create graphs to present the data in Figure 2.

      2) Refer to Figure 3. In the main text (page 5) when describing the animal with 6-OHDA the wrong panels are indicated. It is indicated in Fgure 2A-E but it should be replaced with 3A-E. Please do that.

      Will be done

      Reviewer #2 (Recommendations For The Authors):

      Abstract

      Page 1: Inhibitory or lesion studies will be necessary to support the claim that the global remodeling of afferent and efferent projections of the A13 region highlights the Zona Incerta's role as a crucial hub for the rapid selection of motor function.

      We believe that overall, there is quite a bit of evidence that the zona incerta is a hub for afferent/efferents. Mitrofanis (2005) and, more recently, Wang et al. (2020) summarize some of the evidence. Yang (2022) illustrates that the zona incerta shows multiple inputs to GABAergic neurons and outputs to diverse regions. Recent work suggests that the zona incerta contributes to various motor functions such as hunting, exploratory locomotion, and integrating multiple modalities (Zhao et al. 2019; Wang et al. 2019; Monosov et al. 2022; Chometton et al. 2017). We will update our paper to reflect these references.

      Introduction

      Page 2, paragraph 2: "However, little attention has been placed on the medial zona incerta (mZI), particularly the A13, the only dopamine-containing region of the rostral ZI" Is the A13 region located in the rostral or medial ZI or both?

      It should have been written “rostromedial” ZI. The A13 is located in the medial aspect of rostromedial ZI. We will update the introduction.

      Page 2, para 3: Li et al (2021) used a mini-endoscope to record the GCaMP6 signal. Masini and Kiehn, 2022 transiently blocked the dopaminergic transmission; they never used 6-OHDA. Please correct through the text.

      We will correct this.

      Page 2, para 4: the A13 connectome encompasses the cerebral cortex,... MLR. The MLR is a functional region, correct this for the CNF and PPN.

      Thank you, we will correct this.

      Page 3, the last paragraph of the introduction could be clarified by presenting the behavioral data first, followed by the anatomy.

      We will correct this.

      Figure 1 is nice and clear, and well summarizes the experimental design.

      Thank you.

      Figure 2 shows an example of the extent of the ChR2-YFP expression and the position of an optical fiber tip above the dopaminergic A13 region from a mouse. Without any quantification, these images could be included in Figure 1. Despite a very small volume (36.8nL) of AAV, the extent of ChR2-YFP expression is quite large and includes dopaminergic and unidentified neurons within the A13 region but also a large population of unidentified neurons outside of it, thus raising questions about the volume and the types of neurons recruited.

      This is an important consideration. As mentioned previously, we will provide more information on viral spread and optrode location. The issue of viral spread is complex and depends on factors including tissue type, serotype, and promotor of the virus. Li et al. (2021), for example, used different virus serotypes and promotors, injecting 150 nL, whereas we used AAV DJ, injecting 36.8nL. AAV-DJ is a hybrid viral type consisting of multiple serotypes. It has a high transduction efficiency, which leads to greater gene delivery than single-serotype AAV viral constructs (Mao et al. 2016). A secondary consideration regarding translation was that AAV-DJ could effectively transduce non-primate neurons (Watanabe et al. 2020). We have addressed the issue of neurons recruited earlier and will provide c-Fos quantification to illustrate the extent of co-localization with TH.

      Anatomical reconstruction of the extent of the ChR2-YFP expression and the location of the tip of the optical fiber will be necessary to confirm that ChR2-YFP expression was restricted to the A13 region.

      We will provide additional information regarding viral spread, ferrule tip placement, and c-fos cell counts.

      Page 5, 1st para: Double-check the references, as not all of them are 6-OHDA injections in the MLF.

      Will correct.

      Page 5, 1st para, 4th line: Replace ferrule with optical canula or fiber.

      Will correct.

      Page 5, 1st para, 9th line: Replace Figure 2 with Figure 3.

      Will correct.

      Page 5, 2nd para: About the refractory decrease in traveled distance by sham-ChR2 mice: is this significant?

      It was not significant (Figure S1, 1-way RM ANOVA: F5,25 = 0.486, P = 0.783)). We will update this.

      Figure 3 showing behavioral assessments is nice, but the stats are not always clear. In Fig 3A, are each of the off and on boxes 1 minute long? The figure legend states the test lasts 1 min, but isn't it 4 minutes? In Figure 3B-E and 3J-M, what are the differences? Do the stats identify a significant difference only during the stimulation phase? Fig. 3F-I are nice and could have been presented as primary examples prior to data analysis in Fig. 3B-E. Group labels above the graph would help.

      Yes, the off-on boxes are 1 minute long. We will correct the error in the legend. Great suggestion for F-I - we will move them ahead of the summary figures.

      Fig. 3L-M, what do PreSur, Post, and Ferrule mean? I assume that Ferrule refers to mice tested with the optical fiber without stimulation, whereas Stim. refers to the stimulation. It would be helpful to standardize the format of stats in Fig. 3B-E and 3-J-M. What are time points a, b, and c referring to?

      We will do this.

      Figure S2A: the higher variability in 6-OHDA-YFP mice in comparison to 6-OHDA-ChR2 mice prior to stimulation suggests that 6-OHDA-YFP mice were less impaired. Why use boxplots only for these data? Would a pairwise comparison be more appropriate?

      Data did not follow a normal distribution and thus, were plotted as box and whiskers with the horizontal line through the box indicating the group median, interquartile range indicated by the limits of the box, and group minimum and maximum indicated by the whiskers. And indeed, a non-parametric equivalent of paired t-test (Wilcoxon signed-rank test) was used.

      Fig. S2B: add the statistical marker.

      Will do

      Page 7, para 1, line 8: to add "in comparison to 6-OHDA-YFP and YFP mice" to during photostimulation... (Figure 3E).

      Will do

      Page 7, para 3, line 5: about larger improvement, replace "sham ChR2" with "6-OHDA."

      Will do

      Page 8, para 1, line 4: Perier et al., 2000 reported that 6-OHDA injection increased the firing frequency of the ZI over a month.

      We will add that time frame. Agreed, it is shorter than the behavioral work, which was started 3 weeks after 6-OHDA injection.

      Page 8, para 2, line 1: Since the results were expected, add some references.

      Will do

      Page 8, para 3, line 4. Double-check the reference.

      Will correct and update

      Page 8: About large-scale changes in the A13 region, the relevance of correlation matrices is difficult to grasp. Analysis of local connectivity would have been more informative in the context of GABAergic and glutamatergic neurons of the ZI in the vicinity of the A13 region.

      We will explore alternative methods to present the data.

      Page 8, para 3, line: given Fig. 2, there is concern about the claim that only the A13 region was targeted. The time of the analysis after 6-OHDA should be mentioned. Some sections of the paragraph could be moved to methods.

      As mentioned earlier, we will provide additional information regarding viral spread, ferrule tip placement, and c-fos cell counts. We will mention analysis time after 6-OHDA and update Figure 1a to include this.

      Fig. 4: The color code helps the reader visualize distribution differences. However, statistical analyses comparing 6-OHDA versus sham should be included. Quantification per region would greatly help readers visualize the data and support the conclusion. The relationship between the type of correlation (positive or negative) and absolute change (increase or decrease) is unknown in the current format, which limits the interpretation of the data. Moreover, examples of raw images of axons and cells should be presented for several brain regions. The experimental design with a timeline, as in Fig. 1, would be helpful. The legend for Fig. 4 is a bit long. Some sections are very descriptive, whereas others are more interpretive.

      We will explore alternative methods of presenting the data, as suggested in a previous comment. Should we retain the correlation matrix, we will incorporate the reviewer’s suggestions.

      Page 10, para 1, line 1: add "afferent" to "changes in -afferent and- projection patterns."

      Will do

      Page 10, para 1, line 9: remove the 2nd "compared to sham" in the sentence.

      Will do

      10, para 1, line 10: remove "coordinated" in "several regions showed a coordinated reduction in afferent density." We cannot say anything about the timing of events, as there is only info at 1 month.

      Will do

      Page 10, para 2: the section should be written in the past tense.

      Will do

      Page 13, para 2, the last sentence is overstated. Please remove "cells" and refer to the A13 region instead.

      Will do

      About differential remodelling of the A13 region connectome: Figure 5C and 5G: The proportion of total afferents ipsi- and contralateral to 6-OHDA injection argues that the A13 region primarily receives inputs from the cortical plate and the striatum. Unfortunately, there are no statistics.

      Due to the small sample size, we provided descriptive statistics (mean and error bars) in Figure 5C and G. As mentioned in comments for Reviewers 1 and 2, we will revise Figure 5 to present data focusing on motor-related pathways to provide clarity. In addition, absolute values will be shared in a supplemental table.

      Figure 5 D and 5H: Changes in the proportion of total afferents/projections are relatively modest (less than 10% of the whole population for the highest changes). There is no standard deviation for these data and no statistics. Do they reflect real changes or variability from the injection site?

      The changes are relatively modest (less than 10%) since a small brain region usually provides a very small proportion of total input (McElvain et al. 2021; Yang et al. 2022). The changes in the proportions reflect real differences between average proportions observed in sham and 6-OHDA mice. The variability in the total labeling of neurons and fibers was minimized by normalizing individual regional counts against total counts found in each individual animal.

      Fig 5F and H: The example in F shows a huge decrease in the striatum, but H indicates only a 2% change, which makes the example not very representative. Absolute values would be helpful.

      While a 2% change may seem small, it represents a relatively large change in the A13 efferent connectome. To provide further clarity, we will provide absolute values as suggested in our new supplemental table.

      Figure 6 is inaccurate and unnecessary.

      Agree - it is too simplistic. We will remove it and replace it with one outlined in comments to Reviewer 1.

      Discussion

      Although interesting, the discussion is too long.

      We will make it more concise in the revised paper.

      Page 12: para 2. If the A13 region has a pro-locomotor effect and has therapeutical potential; the claim about its plasticity relies on Fig. 4 and 5, which have a limited scope in the current analysis and presentation (see comments above).

      We will revise the paper per the comments above and then update this accordingly.

      Methods

      Page 17, para 1: include the stereotaxic coordinates of the optical cannula above the A13 region.

      We will include this information.

      References

      Chen, Fenghua, Junliang Qian, Zhongkai Cao, Ang Li, Juntao Cui, Limin Shi, and Junxia Xie. 2023. “Chemogenetic and Optogenetic Stimulation of Zona Incerta GABAergic Neurons Ameliorates Motor Impairment in Parkinson’s Disease.” iScience 26 (7). https://doi.org/10.1016/j.isci.2023.107149.

      Chometton, S., K. Charrière, L. Bayer, C. Houdayer, G. Franchi, F. Poncet, D. Fellmann, and P. Y. Risold. 2017. “The Rostromedial Zona Incerta Is Involved in Attentional Processes While Adjacent LHA Responds to Arousal: C-Fos and Anatomical Evidence.” Brain Structure & Function 222 (6): 2507–25.

      Garau, Celia, Jessica Hayes, Giulia Chiacchierini, James E. McCutcheon, and John Apergis-Schoute. 2023. “Involvement of A13 Dopaminergic Neurons in Prehensile Movements but Not Reward in the Rat.” Current Biology: CB, October. https://doi.org/10.1016/j.cub.2023.09.044.

      Li, Zhuoliang, Giorgio Rizzi, and Kelly R. Tan. 2021. “Zona Incerta Subpopulations Differentially Encode and Modulate Anxiety.” Science Advances 7 (37): eabf6709.

      Mao, Yingying, Xuejun Wang, Renhe Yan, Wei Hu, Andrew Li, Shengqi Wang, and Hongwei Li. 2016. “Single Point Mutation in Adeno-Associated Viral Vectors -DJ Capsid Leads to Improvement for Gene Delivery in Vivo.” BMC Biotechnology 16 (January): 1.

      McElvain, Lauren E., Yuncong Chen, Jeffrey D. Moore, G. Stefano Brigidi, Brenda L. Bloodgood, Byung Kook Lim, Rui M. Costa, and David Kleinfeld. 2021. “Specific Populations of Basal Ganglia Output Neurons Target Distinct Brain Stem Areas While Collateralizing throughout the Diencephalon.” Neuron 109 (10): 1721–38.e4.

      Mitrofanis, J. 2005. “Some Certainty for the ‘Zone of Uncertainty’? Exploring the Function of the Zona Incerta.” Neuroscience 130 (1): 1–15.

      Monosov, Ilya E., Takaya Ogasawara, Suzanne N. Haber, J. Alexander Heimel, and Mehran Ahmadlou. 2022. “The Zona Incerta in Control of Novelty Seeking and Investigation across Species.” Current Opinion in Neurobiology 77 (December): 102650.

      Negishi, Kenichiro, Mikayla A. Payant, Kayla S. Schumacker, Gabor Wittmann, Rebecca M. Butler, Ronald M. Lechan, Harry W. M. Steinbusch, Arshad M. Khan, and Melissa J. Chee. 2020. “Distributions of Hypothalamic Neuron Populations Coexpressing Tyrosine Hydroxylase and the Vesicular GABA Transporter in the Mouse.” The Journal of Comparative Neurology 528 (11): 1833–55.

      Ossowska, Krystyna. 2019. “Zona Incerta as a Therapeutic Target in Parkinson’s Disease.” Journal of Neurology. https://doi.org/10.1007/s00415-019-09486-8.

      Romanov, Roman A., Amit Zeisel, Joanne Bakker, Fatima Girach, Arash Hellysaz, Raju Tomer, Alán Alpár, et al. 2017. “Molecular Interrogation of Hypothalamic Organization Reveals Distinct Dopamine Neuronal Subtypes.” Nature Neuroscience 20 (2): 176–88.

      Spix, Teresa A., Shruti Nanivadekar, Noelle Toong, Irene M. Kaplow, Brian R. Isett, Yazel Goksen, Andreas R. Pfenning, and Aryn H. Gittis. 2021. “Population-Specific Neuromodulation Prolongs Therapeutic Benefits of Deep Brain Stimulation.” Science 374 (6564): 201–6.

      Wang, Xiyue, Xiaolin Chou, Bo Peng, Li Shen, Junxiang J. Huang, Li I. Zhang, and Huizhong W. Tao. 2019. “A Cross-Modality Enhancement of Defensive Flight via Parvalbumin Neurons in Zona Incerta.” eLife 8 (April). https://doi.org/10.7554/eLife.42728.

      Wang, Xiyue, Xiao-Lin Chou, Li I. Zhang, and Huizhong Whit Tao. 2020. “Zona Incerta: An Integrative Node for Global Behavioral Modulation.” Trends in Neurosciences 43 (2): 82–87.

      Watakabe, Akiya, Masanari Ohtsuka, Masaharu Kinoshita, Masafumi Takaji, Kaoru Isa, Hiroaki Mizukami, Keiya Ozawa, Tadashi Isa, and Tetsuo Yamamori. 2015. “Comparative Analyses of Adeno-Associated Viral Vector Serotypes 1, 2, 5, 8 and 9 in Marmoset, Mouse and Macaque Cerebral Cortex.” Neuroscience Research 93 (April): 144–57.

      Watanabe, Hidenori, Hiromi Sano, Satomi Chiken, Kenta Kobayashi, Yuko Fukata, Masaki Fukata, Hajime Mushiake, and Atsushi Nambu. 2020. “Forelimb Movements Evoked by Optogenetic Stimulation of the Macaque Motor Cortex.” Nature Communications 11 (1): 3253.

      Yang, Yang, Tao Jiang, Xueyan Jia, Jing Yuan, Xiangning Li, and Hui Gong. 2022. “Whole-Brain Connectome of GABAergic Neurons in the Mouse Zona Incerta.” Neuroscience Bulletin 38 (11): 1315–29.

      Ye, Qiying, Jeremiah Nunez, and Xiaobing Zhang. 2023. “Zona Incerta Dopamine Neurons Encode Motivational Vigor in Food Seeking.” bioRxiv : The Preprint Server for Biology, June. https://doi.org/10.1101/2023.06.29.547060.

      Zhao, Zheng-Dong, Zongming Chen, Xinkuan Xiang, Mengna Hu, Hengchang Xie, Xiaoning Jia, Fang Cai, et al. 2019. “Zona Incerta GABAergic Neurons Integrate Prey-Related Sensory Signals and Induce an Appetitive Drive to Promote Hunting.” Nature Neuroscience 22 (6): 921–32.

    1. Reviewer #1 (Public Review):

      Summary:<br /> This study examines the context-dependent modulation of auditory cortical neurons in response to expected sensory input, either self-generated sounds or expected perturbations of self-generated sounds. Specifically, using songbirds, the authors ask whether social context (the presence of a female conspecific) affects 1) the response of auditory cortical neurons to the bird's own song when he is singing; and 2) the response of neurons to perturbations of auditory feedback that the bird has been trained to expect.

      Strengths:<br /> First, the authors report that across the population, the responses of the neurons does not differ when a male bird sings alone or if he sings to a female. A fraction of auditory cortical neurons, however, do show significant differences in the firing rate, precision, and/or degree of burst firing when males sing alone vs. when they sing to females. This finding is broadly consistent with the literature showing that sensory neurons (visual, auditory, somatosensory, etc.) can be rapidly reconfigured into different "information processing modes" depending on behavioral state (e.g, quiescence vs vigilance).

      For the perturbation experiments, the authors trained birds to expect distorted auditory feedback during a particular syllable. They found that some neurons showed greater responses during perturbation when a female was present (compared to when males were alone) while other neurons had smaller responses during perturbation when a female was present. In addition, the response of a small number of auditory cortical neurons were not affected by behavioral state. These results contrast with their prior report that the responses of midbrain dopaminergic neurons that project to the basal ganglia are "uniformly reduced" in the presence of a female, raising a question of how an evaluation signal is transformed in the circuit from the primary sensory region to the midbrain.

      Weaknesses:<br /> While the experiments and analysis are solid, the finding that social context can alter responses of auditory cortical neurons in a multitude of ways (increase, decrease or no change) raises several questions that can be examined with additional analysis. For example, do context-dependent differences in auditory responses derive from context-dependent differences in the songs? Are context-dependent differences present in all classes of neurons and throughout the auditory system?

      The observed heterogeneity in the firing properties of auditory cortical neurons, both in response to self-generated sounds and during perturbations of auditory feedback, raises the question of which neurons are sensitive to social context (which likely can be addressed by the authors in a revision). The authors should provide additional details about the recordings:

      a) What are the locations of the recording sites?<br /> Prior work has shown that there is an organized map of spectrotemporal features of sounds in the auditory cortex of songbirds; spectral tuning widths change along the medial-lateral axis and temporal tuning widths differ between the input and output layers of Field L. Were the recordings primarily in Field L2 (thalamo-recipient region), L1 or L3? Were some recordings lateral to Field L in secondary auditory regions? Were the neurons that showed context-dependent changes in firing properties localized or distributed throughout Field L (i.e., were the context-dependent differences in neural responses truly brain-wide)? At a minimum, the authors should include a schematic showing the different regions of Field L and a summary of the location of the recording sites. Images of the processed tissue with electrolytic lesions would also be helpful.

      b) Was the context-dependent modulation limited to a particular class of neurons (distinguished by spike waveform shape, spontaneous firing rate, or other feature)?

      While the authors attribute differences in the responses of single auditory cortical neurons to the presence of a female, other potential explanations for the observed differences should be examined (and potentially ruled out):

      a) Prior work has shown that songs of zebra finches differ slightly when males sing alone compared to when they sing to females: songs are faster; pitch is less variable; and the number of introductory elements is greater when males sing to females. Do some of the observed social context-dependent differences in the responses of auditory neurons reflect differences in the songs in the two conditions? This idea is supported in part by a prior study in juvenile zebra finches (Keller & Hahnloser, 2009) showing that ~20% of the neurons they recorded in Field L and a secondary auditory region (CLM) showed anticipatory activity even before the onset of a song bout, suggesting a source of premotor (or at least non-auditory drive) to neurons in the auditory cortex. Did the authors of this study also find premotor activity in Field L, and if so, did it differ between the two social contexts? Might differences in Field L responses reflect motor/song differences?

      b) For the perturbation experiments, the authors report heterogeneous responses to playback, with some neurons firing more and other firing less when a female is present compared to when the male is alone. Keller and Hahnloser (2009) found that in juvenile birds, responses of Field L to perturbations of auditory feedback were sensitive to sound amplitude; perturbation responses increased with relative perturbation amplitude. This raises a question of whether perturbation amplitude is different when a male is alone and when a female is present (i.e., the male may move towards the female when she is present and if the speaker is close to the female, the perturbation may be louder than when the male is alone; alternatively, the male be more active when he is alone so the loudness of the perturbation may be more variable across song bouts). It would be useful to know if (and how much) perturbation amplitude varied depending on the location inside the cage as well as whether the sound pressure level of the underlying song was higher (e.g., Lombard effect). Addition of details of the experimental setup/procedure would help to allay concerns that the amplitude of the white noise varied significantly depending on behavioral context.

      Finally, I am still trying to make sense of the differences in the context-dependent modulation of responses of auditory cortical neurons vs. midbrain dopaminergic neurons. Given the heterogeneity of responses in Field L, both to self-generated sounds and to expected perturbations during singing, how are the signals decoded downstream of Field L? At the population level, neither the mean firing rate nor the timing of firing of Field L neurons changed with courtship. Similarly, across the population, the responses to perturbations of auditory feedback were not affected by courtship state (error signal attenuated in 11 neurons, increased in 22 neurons and not affected in 10 neurons). Yet, the courtship state "uniformly" reduces the response of midbrain dopaminergic neurons to auditory perturbation. It would be helpful if the authors could include a model and/or more discussion of how this change may arise.

    2. Reviewer #3 (Public Review):

      Summary:

      In this study, Jones et al. examine how neural activity in a primary auditory area (field L) of singing male songbirds is modulated by the presence or absence of an audience (a female conspecific). Prior work has demonstrated that the presence of an audience attenuates the responses of dopaminergic neurons to distortions of auditory feedback (DAF). Here the authors report that even in a region that is primarily considered sensory, responses to DAF are also modulated by the audience, although in a heterogeneous manner that does not readily explain previously observed attenuation. These findings address an interesting question and will potentially be important in adding to an understanding of how non-sensory factors can alter response properties of neurons even in primary sensory regions in a context dependent fashion. However, to be fully persuasive, additional analyses will be required to address how much of the apparent modulation by audience may be explained by other factors such as changes in recorded neurons or their properties over time.

      Full Public Review:

      In this study, Jones et al. examine how neural activity in a primary auditory area (field L) of singing male songbirds is modulated by the presence or absence of an audience (a female conspecific). They test whether activity in Field L differs between conditions in which the male is singing to a female (directed song) or alone (undirected song) and whether response to distortions of auditory feedback (DAF) differ between these conditions. Previous work has shown that in other parts of the songbird brain, sensory-motor activity can differ between directed and undirected song, and that responses to DAF are attenuated when males sing directed song versus undirected song. These prior results raise the interesting question of the extent to which such modulations of activity by the presence of an audience are already present in primary sensory areas such as Field L. This possibility is also motivated by prior work that has shown that Field L activity is not exclusively explained by auditory input, but can also be modulated by the bird's state - whether it is singing or not.

      Against this background, the questions asked here are of interest for two inter-related reasons:

      1) the authors address whether the presence of an audience (a female conspecific) alters activity in a primary auditory area during singing. Primary auditory areas such as Field L, and analogous mammalian thalamo-recipient cortical regions such as A1, are often thought of as responding very specifically to the features of sensory stimuli, but are also understood to be modulated by a variety of factors including the attentional and behavioral state of the animal. For audition, such modulation includes whether or not animals are vocalizing and listening to themselves or listening to playback of their own vocalizations. Cited works from Keller (2009) as well as Eliades and Wang (2008) have indicated that the act of vocalizing can modulate auditory responses to self-generated feedback in primary auditory areas relative to those arising from playback of the same sounds. Here, the question is whether responses to self-generated feedback differ between conditions of singing alone versus singing to a female audience. A demonstration that the presence of an audience matters to responses in Field L would add to a general understanding of how it is that non-auditory factors can modulate sensory responses.

      2) the authors address the possible source of an audience-dependent modulation of responses to feedback perturbation in the VTA previously reported by Goldberg and colleagues (2023). In the VTA, responses to perturbations during singing are consistently attenuated when males are singing to females versus when they are singing alone, but the underlying mechanisms of this modulation are unknown. Here, the authors test the possibility that such modulation by an audience is already present at the level of Field L. The previously reported attenuation in VTA is quite striking and reflects a nice example of how neural processing can differ with varying behavioral priorities. Understanding whether this modulation of responses to DAF arises already in primary auditory areas would further a mechanistic understanding of an intriguing example of state-dependent modulation of sensory processing and behavior, and lend broad insight into related phenomena.

      The authors report 1) that activity in Field L differs between directed and undirected singing at many individual recording sites, but that these changes are heterogeneous, with both increases and decreases in activity, so that there is no consistent change across the population and 2) that the responses to DAF differ between directed and undirected song, but that there is no consistent attenuation of response (as observed in the VTA) and instead heterogeneous increases and decreases in response to DAF so that there is no net change at the population level.

      These findings, if firmly established, are important and of general interest. While they do not readily explain the source of the audience-dependent attenuation of auditory responses to DAF in the VTA, the demonstration of audience-dependent modulation of self-generated feedback and its disruption in a primary auditory area is an exciting result that would provide an opportunity for further investigation of how changes in social context influence brain and behavior. The manuscript is generally well written, although the presentation is terse. My main reservations about the current manuscript relate to aspects of experimental design and analysis that need to be clarified and addressed before these conclusions will be fully persuasive. There are also some places where further discussion of the findings and their relationship to prior studies would be helpful.

      1. A central concern relates to whether the main reported effects associated with differences in singing directed versus undirected song reflect only those changes in conditions, versus contributions from changes in unit isolation or response properties over time. The authors record undirected song in a block in the morning and only after collecting at least 40 renditions do they later record responses during directed song over a series of repeated exposures to a female. Therefore, differences between data collected during undirected song and directed song also reflect differences between data collected initially during the morning versus later. It is unclear from methods whether any of these recordings during undirected and directed conditions are interleaved, but if this is not the case, then it is crucial to ask how stable were neural recordings with respect to unit isolation, and potential changes to response properties, over the duration of the experiments. This would be less of a concern if the results mirrored those observed in the VTA, where attenuation of responses was observed across the entire population during directed versus undirected conditions - it is hard to explain a phenomenon that is consistently observed across the population as arising from a change in which neurons and spikes are contributing to responses, or other forms of non-stationarity. However, because there are no significant differences reported at the population level in the current study, it is important to address the possibility that observed differences between conditions reflect some form of noise or drift in recorded units, rather than being entirely due to directed versus undirected singing. I have elaborated in more detail below on this concern, including places where the data seems to suggest some non-stationarity of responses, and have some suggestions for ways in which this concern might be addressed.

      2. A second concern, related to this first one, has to do with the categorical definition of 'error neurons'. The authors note in their text that it could be problematic to apply categorical definitions to continuous distributions, and yet that seems to be what they then do. The authors have a metric of error sensitivity that they apply to each neuron's response to DAF in both undirected and directed conditions (the error score). They show that there is a continuous distribution of error scores (Figure 2 - figure supplement 1) across the population, with no bimodality that would be suggestive of distinct error sensitive and error-insensitive neurons. One nice feature of their analysis is that they also show the distribution of error scores computed in an analogous fashion for a period of neural activity in the song prior to DAF. This control data set makes it persuasive that there is a significant response to DAF, but also shows that there can be a broad range of error scores even when no DAF has been played, and that this range of 'noise' responses to DAF overlaps substantially with the actual responses to DAF. Despite the continuum of error scores, the authors define a subset of neurons as error responsive only if their responses to DAF exceed a specific threshold (2.5 standard deviations). One of the main conclusions of the paper is based on finding a subset of 22 neurons that exhibited error responses (by this definition) only during singing to a female and 11 neurons that exhibited error responses only when singing alone. These neurons are described as 'retuned' because they have error responses in only one condition.

      The problem here is that for some, if not many, of the neurons that are categorically defined as being responsive to DAF in only one condition (directed versus undirected) there is almost certainly not a significant difference in the actual responses to DAF between conditions. This is apparent in the relevant data figure (figure 2 - figure supplement 1) and is a consequence of using a threshold to split a continuous distribution into groups defined as error responsive or not. For example, several neurons in this plot that have almost identical scores in the directed and undirected condition are counted as examples of retuning because the error scores are just a bit over 2.5 in the directed condition and just a bit under 2.5 in the undirected condition.

      That this kind of categorical approach may be problematic is apparent in the control data in the plot. Despite the absence of any perturbation, there are error responsive neurons present in these data that are considered selective for directed versus undirected singing - this is an expected consequence of using a threshold on dispersed or noisy biological data. Shifting to a more stringent threshold of three standard deviations, as the authors do, does not help with this problem, as that still treats as categorically different responses that fall on either side of a line, even if only by a tiny amount. I suggest that the authors devise a measure for each neuron to test whether the responses to DAF are significantly different under the two conditions (directed versus undirected). As noted above, this measure should take into account some assessment of the stationarity of responses, as well as the distribution of responses (which, in some of the examples does not seem to be Gaussian around a mean response level, but rather highly variable across trials).

      3. There are several places where further discussion of the previous literature and how the current results relate to that literature would be helpful. This includes:

      3a. Some discussion of what is already known about the auditory tuning of field L, and the extent to which responses associated with distortion of feedback may reflect the frequency tuning of field L neurons versus something that might be construed as more specifically as detecting an error in perceived feedback. For example, Field L neurons have previously been characterized as having relatively simple spectro-temporal receptive fields, often with a single frequency band that is excitatory and nearby frequency bands that are inhibitory. It would be beyond the scope of this paper to directly assess the extent to which both song responses and responses to DAF are well predicted by simple STRFs that might be measured for the recorded neurons, or computed from activity during a range of vocalizations, but perhaps worth discussing whether a neuron with such frequency tuning would potentially exhibit 'error responses' of the sort described here, simply because the DAF stimulus happens to fall into the excitatory or inhibitory regions of the neuron's receptive field. While it is OK to use the term 'error responsive' in the current study, it would be good to make clear that changes in firing associated with playing DAF should be expected even for neurons that have simple auditory receptive fields (i.e. with center surround tuning to specific frequencies in a tonotopic map, as has been described for Field L) without necessarily indicating that these neurons are specifically registering any deviation or 'error' between expected feedback and experienced feedback. In this respect, there are multiple subdivisions of Field L with different tuning properties. Please specify further what criteria were used to determine recording locations and how these correspond with previously defined subdivisions.

      3b. It would also be useful to discuss further previous work on differences in auditory tuning or responses between conditions when subjects are vocalizing, versus when vocalizations are played back (as in Keller, Eliades) and whether the results in the current study are similar or different. For example, this prior work has indicated that efference copy or other signals that precede vocalizations can reach and influence activity in auditory areas - with the most compelling evidence for this being the modulation of activity prior to the onset of vocalizations. Was this also observed in the current study, and to what extent might this kind of mechanism contribute to the processing of feedback distortions? With respect to this kind of efference signal, or other possibilities, can the authors provide some discussion or speculation about possible mechanisms that might be differentially engaged between conditions of singing directed versus undirected song?

      3c. The previous study on DAF responses in VTA indicates enhanced responses to female calls during directed song. To what extent did the current study control for any vocalizations or other sounds produced by females during the directed singing, and could this have contributed to differences in Field L activity between conditions? This question is motivated partly by the highly variable responses in raster plots even within one condition - might some of this reflect motifs during which transient noises are produced from female calling or other movements by the male or female?

      More regarding stability of recordings:

      The data presented in Figure 1D illustrate some of my concerns about the stationarity of recordings. In the directed condition there are no spikes at all following the first handful of motif renditions. Were the directed and undirected recordings interleaved here? If not, could the recorded neuron simply have been lost, changed in amplitude of recorded spikes so that it was no longer counted, or reduced its responsiveness over the course of the recordings? Because the recordings of undirected and directed singing are described as occurring sequentially, it seems likely that this type of change in recorded signal could contribute to changes in measured responses over time, independently of effects due to directed versus undirected singing.

      A minor issue of this example is that the raw example trace with male alone does not seem to have a corresponding set of points in the roster plot. For panel E, I also cannot find rasters that correspond to the example recordings shown at top.

      Figure 2A also shows a neuron that looks like it has non-stationarity; for the alone condition without altered feedback, the main peak has no spikes for the bottom half of the rasters. For the directed condition, much of the difference between control and distorted feedback conditions seems to come from a few trials towards the bottom of the raster plot that show more and earlier firing than most other rasters.

      Other more subtle examples are suggested in the figures, such as Figure 1F where responses in the alone condition seem to increase over the course of recordings. A related issue apparent in some of the raster plots is that the firing rate distributions within a given condition sometimes appear to be very non-gaussian, with some motifs during which there is a lot of activity, or apparent bursting, and others in which there is little activity. In addition to the examples above, this includes<br /> responses in Fig 1E and Fig 2F. Does anything distinguish these cases or trails? Where differences between conditions are driven by firing differences that are present on only a subset of trials, such as in Fig 2A, there is some deviation from the normal criteria for use of T-tests/Z-scores. Please consider this point and discuss any caveats and/or apply other tests (Monte Carlo? Non-parametric?) as appropriate.

      These potential issues of non-stationarily, and non-Gaussian firing rate distributions in each condition, make it complicated to think about what differences in activity reflect changes from undirected to directed conditions versus these other factors.

      Approaches to addressing this issue could include more specifically indicating examples in which recordings from the alone condition and directed condition are interleaved and exhibit reversible (between conditions) changes in the pattern of responses (both without DAF in comparing alone versus directed, and with DAF demonstrating differences in DAF influences between conditions). Some good interleaved examples of this sort would be very helpful to illustrate the robustness of differences between conditions. More generally, the methods and or raster plots should include some further explanation of the time periods over which recordings were made in the alone versus directed conditions, and the extent to which they are interleaved or not.

      Another approach that could be used if there are not many instances of inter-leaved recordings is to try to document the stationarily or stability of unit isolation and/or responses over time. It would be most helpful when applied to recordings from a given singing condition (i.e. alone or directed) that are interleaved, but even in cases where this is not possible perhaps one could assess the stability of waveforms and unit isolation across time. For example in Figure 2 - Supplementary figure 2, the left-hand and middle examples appear to have quite good unit isolation, and might be the sorts of cases where measures of unit isolation and waveform stability could be used to argue that a gain or loss of spikes due to drift in recordings or changes to SNR and spike detection are not contributing to changes in firing patterns over time (and across conditions).

      It potentially would also be informative to present the prevalence of the main effects reported in the study as a function of some measures of unit isolation, SNR, and recording stability. It would be reassuring to see that significant differences between conditions are equally or more prevalent under the conditions of greatest unit isolation and recording stability than in cases with worse SNR or stability.

      One other way that the authors might be able to address my main concern would be to look at the stability of firing patterns within conditions, where differences across trials most directly indicate the potential contributions of technical or biological changes in neural activity over time that are not related to the experimental conditions.

      To further address some of these issues, it would be helpful to have additional explanations in this paper (rather than by reference to Goldberg and Fee, 2010) of the criteria that were used for counting spikes, and assessing stability of recordings. All I found about this in the Goldberg and Fee, 2010 reference was that "Spikes were sorted off-line using custom Matlab software" Does this require human inspection and judgment? Is there a simple threshold, or waveform measurement used for detecting spikes from single units? Are some sort of signal to noise measures, or ISI violations used to score how well units are isolated?

      For the specific examples shown in figures, it would be useful to indicate by small tick marks or otherwise which spikes were counted as single units. For example in figure 2 column B, for the condition with female, did only the 1-3 largest spikes get counted, or also the spikes of medium height?

      Page 11: "Many channels on the probes recorded multi-unit activity, which were taken note of but not analyzed in this study."

      What were the criteria for this? For several of the examples in the figures there are spikes of varying amplitudes and as mentioned above it would be helpful to clarify how the spikes were sorted into single units in such cases.

      Categorical scores:

      Page 13: "Neurons with error responses greater than 2.5 in only one condition (undirected versus directed) were considered to have retuned; neurons with error scores greater than 2.5 in both conditions were considered not to have retuned."

      This definition results in cases where responses of 2.45 vs 2.55 are described as 'retuned', even if these responses are not significantly different. The figure (Figure 2 - figure supplement 1) indicates that multiple neurons that were scored as retuning had responses that fall very near the threshold in this way.

      Page 13, "Our results did not fundamentally change with ... a more stringent threshold of 3..."

      The stringency is not issue here, rather the categorical threshold. Retuning would be more persuasively demonstrated if the authors could provide a test of whether or not the responses for individual neurons differ significantly between conditions appropriately taking into account multiple comparisons, stability of recordings, non-Gaussian firing rate distributions across motif renditions, etc. and use this metric to report effects, rather than setting a categorical threshold.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to reviewer comments

      R: We really appreciate the reviewer positive comments and consideration, and we believe that the review process has significantly strengthened our manuscript.

      We have responded to all the reviewer comments, as follows:

      Response (R)

      From Reviewer #1

      Major comments: The manuscript is mostly well written (it could use a few minor grammatical corrections), the significance of the problem is well described, and the results are clearly presented with adequate controls. The movies, provided as supplementary material, are of the highest quality and are essential additions to the stills provided in the figures. The data convincingly support the key conclusions of the manuscript.

      R: We really appreciate the reviewer positive comments, and we have revised the manuscript accordingly

      1) Does the MO knockdown both S and L homoeologs of X. laevis? Since the level of GAPDH in Figure 1H also looks reduced in Gai2 MO lane, it should be made clear that the apparent knockdown of Gai2 was normalized to GAPDH, rather than being the results of unequal loading of the gel. Yes, I recognize that Figure 1I says normalized, but this is not stated in the results or the methods. Also, was this experiment done with X. laevis or X. tropicalis? I could imagine that if done in X. laevis, the lack of complete knockdown might be due to only one homoeolog being affected.

      R: We appreciate the reviewer comment, and we described in Material and Methods section the region targeted by the morpholino, in both Xenopus species. We added the next paragraph in the Material and Methods section, see page 23 paragraph 1 lines: 7-11.

      “The Gαi2 morpholino (Gαi2MO) was designed as described in the results section to target Gαi2 in both Xenopus species (Xenopus tropicalis and Xenopus laevis). Specifically, it hybridizes with the 5’ UTR of X. tropicalis Gαi2 (NM_203919), 17 nucleotides upstream of the ATG start codon. For X. laevis Gαi2, the morpholino hybridizes with both isoforms described in Xenbase. It specifically targets the 5' UTR of the Gαi2.L isoform (XM_018258962), located 17 nucleotides upstream of the ATG start codon, and the 5' UTR of the Gαi2.S isoform (NM_001097056), situated 275 nucleotides upstream of the ATG.”

      With respect to Figure 1H and 1I, we have specified in the Figure 1 legend that we normalized the data to GAPDH to quantifying the decrease in Gαi2 expression induced by the morpholino.

      See page 37, Figure 1H-I, Legends section.

      2) The knowledge of the efficacy of knockdown in each Xenopus species provided by the information requested in the previous point, would allow the reader to assess the level of knockdown in the remaining assays. To do this, the authors should tell us which assays were done in which species. I am not suggesting that each experiment needs to be done in each species, only that the information should be provided. If the MO is more effective in X. tropicalis - which assays used this species? If the knock down is partial, as shown in Figure 1H-I, which species this represents in the remaining assays would be useful knowledge.

      R: We greatly appreciate the reviewer's valuable comments and suggestions, and as a response, we have incorporated a new supplementary figure (Figure S1). This figure includes a western blot and an in situ hybridization assay illustrating the efficiency of the knockdown in Xenopus laevis. The results presented in Figure S1 demonstrate that the knockdown efficiency is similar in both Xenopus species, allowing for a comparison between Figure 1A-I (X. tropicalis) and Figure 1S (X. laevis).

      To complement this information, we have also improved the section of Material and Methods regarding the experiments in both Xenopus species (Xenopus tropicalis and Xenopus laevis). As detailed in the Materials and Methods section, we employed 20 ng of Gai2MO for Xenopus tropicalis embryos and 35 ng of Gai2MO for Xenopus laevis embryos to deplete cell migration. In both species, in vivo migration was analyzed, resulting in a substantial inhibition of cranial neural crest (NC) migration, ranging from 60% to 80%. Additionally, we conducted dispersion assays in both species. In X. laevis, in vitro migration was monitored for 10 hours, while in X. tropicalis, it was tracked for 4 hours, both yielding the same phenotype. We also studied cell morphology and microtubule dynamics in both Xenopus models. However, we used different tracer concentrations for each, with 200 pg for X. laevis and 100 pg for X. tropicalis, as specified in the Materials and Methods section. Our Rac1 and RhoA timelapse experiments were conducted in both species as well, employing pGBD-GFP and rGBD-mCherry probes, respectively, and different probe concentrations as outlined in the Materials and Methods section. These experiments revealed polarity impairment and consistent Rac1 behavior in both Xenopus species. The study of focal adhesion in vivo dynamics using the FAK-GFP tracer was carried out also in both species, resulting in the same phenotype. It is worth noting that the only experiment conducted exclusively in X. tropicalis was the focal adhesion disassembly assay with nocodazole.

      Regarding the improvements of the Materials and Method section see page 22, paragraph 2.

      We want to highlight that at the beginning of the Materials and Methods section, we incorporated a paragraph to clarify that “all experiments were conducted in both Xenopus species (X.t and X.l) using distinct concentrations of the morpholino (MO) and mRNA, as specified in each respective methodology description”. This approach consistently yielded similar results. It is important to note that for the figures, we selected the most representative images.

      We have also specified in each figure legend which Xenopus species is depicted.

      Minor comments: While prior studies are referenced appropriately, and the text and figures are mostly clear and accurately presented, the following are a few suggestions that would help the authors improve the presentation of their data and conclusions:

      1) The cell biological experiments convincingly demonstrate that knockdown of Gai2 causes cells to move more slowly. It would be a nice addition to bring the explant experimental data back to the embryo by showing whether the slower moving NC cells in morphants eventually populate the BA. DO they cease to migrate or are they just slower getting to their destination? This could be done by performing snail2 ISH at a later stage (34-35?)

      R: We appreciate the reviewer's insightful point and are currently conducting the in situ hybridization assay at stages 32-36 to address this question. Our plan includes incorporating a supplementary figure showing the results of this assay and integrating this information into both the results and discussion sections.

      2) There are places in the manuscript where the authors use the terms "silencing" or "suppression" of Gai2, when they really mean reduced translation - their system is not a genetic knockout, as clearly demonstrated in Figure 1H-I. I suggest that more accurate wording be used.

      R: We appreciate the reviewer's comment, and we agree that the Gαi2 morpholino impedes Gαi2 translation, leading to a reduction in Gαi2 protein expression. Consequently, we have revised the entire manuscript, replacing the terms “silencing” and “suppression” with “knockdown”.

      3) In Figures 1-5 there are scale of bars on the cell images, but these are not defined in any of the figure legends.

      R: We value the reviewer's comment, and we have revised all the figure legends by including the scale information. Each image has been scaled to 10 µm with varying magnifications.

      4) The abstract is the weakest section of the manuscript, and would have greater impact if it were more clearly written.

      R: We appreciate the reviewer's comment on the abstract, and we have revised and edited it to enhance its quality.

      Abstract:

      “Cell migration is a complex and essential process in various biological contexts, from embryonic development to tissue repair and cancer metastasis. Central to this process are the actin and tubulin cytoskeletons, which control cell morphology, polarity, focal adhesion dynamics, and overall motility in response to diverse chemical and mechanical cues. Despite the well- established involvement of heterotrimeric G proteins in cell migration, the precise underlying mechanism remains elusive, particularly in the context of development.

      This study explores the involvement of Gαi2, a subunit of heterotrimeric G proteins, in cranial neural crest cell migration, a critical event in embryonic development. Our research uncovers the intricate mechanisms underlying Gαi2 influence, revealing its interaction with tubulin and microtubule-associated proteins such as EB1 and EB3, suggesting a regulatory function in microtubule dynamics modulation. Gαi2 knockdown leads to microtubule stabilization, alterations in cell morphology and polarity, increased Rac1-GTP concentration at the leading edge and cell-cell contacts, impaired cortical actin localization and focal adhesion disassembly. Interestingly, RhoA-GTP was found to be reduced at cell-cell contacts and concentrated at the leading edge, providing evidence of Gαi2 significant role in polarity. Remarkably, treatment with nocodazole, a microtubule-depolymerizing agent, effectively reduces Rac1 activity, restoring cranial NC cell morphology, actin distribution, and overall migration. Collectively, our findings shed light on the intricate molecular mechanisms underlying cranial neural crest cell migration and highlight the pivotal role of Gαi2 in orchestrating microtubule dynamics through EB1 and EB3 interaction, modulating Rac1 activity during this crucial developmental process.”

      The molecular regulation of cell movement is a key feature of a number of developmental and homeostatic processes. While many of the proteins involved have been identified, how they interact to provide motility has not been elucidated in any great detail, particularly in embryo-derived cells (as opposed to cell lines). The results obtained from the presented experiments are novel, in-depth and provide a novel paradigm for how G proteins regulate microtubule dynamics which in turn regulate other components of the cytoskeleton required for cell movement. The results will be applicable to many migrating cell types, not just neural crest cells.

      Because of the application of the data to many types of cells that migrate, the audience is expected to include a broad array of developmental biologists, basic cell biologists and those interested in clinically relevant aberrant cell migrations.

      R: We really appreciate the reviewer positive comments and consideration

      From Reviewer 2

      Major comments

      The authors aim to address two issues in this manuscript: a) the role of Gai2 in neural crest development; and b) the mechanism of Gai2 function. While they have done a good job demonstrating a role of Gai2 in NC migration both in vivo and in vitro as well as the effects of Gai2 knockdown on cytoskeleton dynamics, protein distribution of selected polarity and focal adhesion molecules, and Rac1 activation, the link between Gai2 and the downstream effectors is largely correlative. Because of this, the model suggesting the sequential events flowing from Gai2 to microtubule to Rac1 to focal adhesion/actin should be modified to allow room for direct and indirect regulation at potentially multiple entry points.

      R: We appreciate the reviewer's valuable comments. We concur with the reviewer's observation that our experiments do not establish a causal link between Gαi2, EB1/EB3, and Rac1. We established a relationship between Gαi2 and microtubule dynamics (EB1 and EB3) to regulate Rac1 polarity through co-immunoprecipitation assays, which reveal protein interactions within an interactor complex. Therefore, while our findings support the involvement of Gαi2 in coordinating cranial NC cell migration alongside EB1, EB3, and Rac1, we cannot exclude the possibility that this regulation may occur through other intermediary proteins, such as GEFs, GAPs, GDIs, and others. As a result, we have revised our model and its description in accordance with the reviewer suggestion.

      We have edited the discussion/conclusion, model and the legend at Figure 6. See page 16 (paragraph 2, last line), 17 (paragraph 1, last line), 22 (paragraph 1, last line 17-20), 42 (Legend Fig. 6).

      Specific major comments are as the following: Strengths: -Determination of a role of Gai2 in neural crest migration is novel. -The effect of Gai2 knockdown on membrane protrusion morphology and microtubule stability and dynamics are demonstrated nicely. -Quantification of experimental perimeters has been performed throughout the manuscript in all the figures, and statistical analysis is included in the figures.

      R: We appreciate the reviewer positive comments

      Weaknesses: -The heavy focus of the study on microtubule is due to the previous publication on the function of Gai2 in regulation of microtubule during asymmetrical cell division. However, the activity of Gai2 is likely cell type-specific, as it has not been shown to control microtubule during cytokinesis in general. It is equally likely that Gai2 primarily regulates Rac1 or actin regulators to influence both microtubule and actin dynamics. The tone of the discussion should therefore be softened.

      R: We greatly appreciate and agree with the comment from the reviewer, highlighting the possibility that Gαi2 primarily regulates Rac1 or actin regulators to influence both microtubule and actin dynamics. In this regard, we have revised our manuscript to include a discussion of this point. We added the next paragraph in the Discussion/Conclusion section, page 20-21.

      “It is well established that the activity from the Rho family of small GTPases is controlling cytoskeletal organization during migration (Ridley et al., 2015). Contrariwise, it has been described in many cell types, that microtubules dynamic polymerization plays a crucial role in establishing the structural foundation for cell polarization, consequently influencing the direction of cell motility (Watanabe et al., 2005). Our results appear to align with this latter view. While it is reasonable to postulate the possibility that Gαi2 regulates Rac1 activity, subsequently influencing actin and microtubule dynamics, our findings in the context of cranial NC cells, lend support to an alternative sequence of events. Initially, Gαi2 knockdown leads to a decrease in microtubule dynamics, which in turn increase Rac-GTP towards the leading edge. This shift is accompanied by reduced levels of cortical actin and impaired focal adhesion disassembly, culminating in compromised cell migration. Notably, nocodazole, a microtubule-depolymerizing agent, not only diminishes Rac-GTP localization at the leading edge but also rescues cell morphology, restores normal cortical actin localization, and promotes focal adhesion disassembly, thereby facilitating cell migration. If Rac1 activity were indeed upstream of microtubules, it would be expected that nocodazole would not reduce Rac-GTP levels at the cell leading edge. These results suggest that the regulation of Rac1 activity may follow, rather than precede, alterations in microtubule dynamics, in the context of NC cells. Furthermore, in support of our model, our protein interaction analysis demonstrates Gαi2 interacting with microtubule components such as EB proteins and tubulin. As we already mention above, earlier studies have reported that microtubule dynamics promote Rac1 signaling at the leading edge and by releasing RhoGEFs promote RhoA signaling as well (Best et al., 1996; Garcin and Straube, 2019; Moore et al., 2013; Waterman-Storer et al., 1999). In addition, it is well-documented that RhoGEFs interact with microtubules, including bPix, a GEF for Rac1 and Cdc42, which, in turn, promotes tubulin acetylation (Kwon et al., 2020). Interestingly, in ovarian cancer cells, Gαi2 has been shown to activate Rac1 through an interaction with bPix, thereby jointly regulating migration in response to LPA (Ward et al., 2015). Taken together, these findings further support our proposed model (refer to Fig. 6).”

      -The effect of rescue of NC migration with Rac1 inhibitor is marginal and the result is hard to interpret considering the inhibitor also blocks control NC migration. Either lower doses of Rac1 inhibitor can be used or the experiment can be removed from the manuscript, as Rac1 is required for membrane protrusions and the inhibitor doses can be hard to titrate.

      __ R: We appreciate and agree with the reviewer's comments. To address this concern and enhance clarity, we have incorporated the following paragraph into the manuscript within the Discussion section. Additionally, we have included information on the range of NSC23766 concentrations used for this analysis in the Materials and Methods section. Page 24, __Explants and microdissection.

      “It is worth noting that we conducted Rac inhibitor NSC23766 trials at concentrations ranging from 20 nM to 50 nM for X. laevis and between 10 nM to 30 nM for X. tropicalis. In both cases, higher concentrations of the Rac inhibitor proved to be lethal, underscoring the essential role of Rac1 in both cell migration and cell survival. Specifically, the described concentrations of 20 nM for X. laevis and 10 nM for X. tropicalis led to a partial rescue of the observed phenotype. This suggests that these concentrations are sufficient to demonstrate that the increase in Rac1-GTP resulting from Gαi2 morpholino knockdown impairs cell migration. The partial rescue can be attributed to the crucial role of microtubule dynamics in cell migration, which acts upstream of Rac activity. Additionally, Rac is pivotal for the modulation of cell polarity at the leading edge of migration. It is worth emphasizing that Rac1 levels are critical for cell migration, as demonstrated by other researchers. Lower concentrations of Rac1-GTP have been shown to hinder cell migration in cells deficient in Rac1, leading to a significant reduction in wound closure and random cell migration (Steffen et al., 2013).

      Therefore, we believe that the lower concentration of NSC23766 used in our assay was adequate to reduce the abnormal Rac1-GTP activity in the morphant NC cells. However, it is important to note that for normal NC cell, this level of reduction in Rac1-GTP activity is critical and sufficient to impair normal migration”.

      See page 12, paragraph 2, lines 8-11, 14-16, 23-25.

      Steffen A, Ladwein M, Dimchev GA, Hein A, Schwenkmezger L, Arens S, Ladwein KI, Margit Holleboom J, Schur F, Victor Small J, Schwarz J, Gerhard R, Faix J, Stradal TE, Brakebusch C, Rottner K. Rac function is crucial for cell migration but is not required for spreading and focal adhesion formation. J Cell Sci. 2013 Oct 15;126(Pt 20):4572-88. doi: 10.1242/jcs.118232. Epub 2013 Jul 31. PMID: 23902686; PMCID: PMC3817791.

      -Since the defects seem to result partially from the inability of the NC cells to retract and move away, it may help to either include some data on Rho activation patterns in knockdown cells or simply add some discussion about the issue.

      R: We acknowledge and sincerely appreciate the reviewer's valuable comments on this pivotal aspect, which significantly enhances our capacity to elucidate the impact of Gαi2 knockdown on cell polarity. To address this crucial point, we have introduced an experiment that examines RhoA-GTP localization under Gαi2 knockdown conditions, and we have incorporated a supplementary figure S3 into our manuscript. This newly added figure clearly demonstrates that, under Gαi2 knockdown conditions, and in contrast to control cells, RhoA-GTP localization is substantially disrupted at cell-cell contacts and now detected at the leading edge of the cell, providing compelling evidence of cell polarity defects (refer to Figure S3). In response to these results, we have included a description of these findings in the Results section (please see page 11-12) and a dedicated paragraph in the Discussion section (please see page 18, paragraph 2, line 15-16, page 19, paragraph 1, lines 6-12).

      Results section 1: “To achieve this, we explored whether Gαi2 regulates the subcellular distribution of active Rac1 and RhoA in cranial NC explants under Gαi2 loss-of-function conditions, considering their pivotal roles in cranial NC migration and contact inhibition of locomotion (CIL) (Carmona-Fontaine et al., 2011; Moore et al., 2013; Leal et al., 2018). Hence, we employed mRNA encoding the small GTPase-based probe, enabling specific visualization of the GTP-bound states of these proteins.”

      Results section 2: “Consistent with earlier observations by Carmona-Fontaine et al. (2011), in control cranial NC cells, active Rac1 displayed prominent localization at the leading edge of migrating cells, whereas its presence was reduced at cell-cell contacts, coincident with an increase in RhoA-GTP levels (white arrows in Fig. 4A, supplementary material Figure S3A). On the contrary, in comparison to the control cells, Gαi2 morphants exhibit a pronounced accumulation of active Rac1 protein in the protrusions at cell-cell contacts, where active RhoA localization is conventionally expected (white arrow in Fig. 4B, supplementary material Figure S3A and movie S6). In contrast to control cells, a notable shift in the localization of active RhoA protein was observed, with its predominant accumulation now detected at the leading edge of the cell, instead of the typical localization towards the trailing edge or cell-cell contacts (__supplementary material Figure S3B). __These findings suggest a dysregulation of contractile forces that align with the observed distribution of active RhoA, cortical actin disruption, and diminished retraction in cell treated with Gαi2MO.”

      Discussion section:

      “Other studies have reported that microtubule assembly promotes Rac1 signaling at the leading edge, while microtubule depolymerization stimulates RhoA signaling through guanine nucleotide exchange factors associated with microtubule-binding proteins controlling cell contractility, via Rho-ROCK and focal adhesion formation (Krendel et al., 2002; Ren et al., 1999; Best et al., 1996; Garcin and Straube, 2019; Waterman-Storer et al., 1999; Bershadsky et al., 1996; Moore et al., 2013). This mechanism would contribute to establishing the antero-posterior polarity of cells, crucial for maintaining migration directionality, underscoring the significance of regulating microtubule dynamics in directed cell migration. These findings closely align with the results obtained in this investigation, demonstrating that Gαi2 loss of function reduces microtubule catastrophes and promotes tubulin stabilization, resulting in increased localization of active Rac1 at the leading edge and cell-cell contacts, while decreasing active RhoA at the cell-cell contact but increasing it at the leading edge. This possibly reinforces focal adhesion, which is consistent with the presence of large and highly stable focal adhesions under Gαi2 knockdown conditions. This finding also suggests a dysregulation of contractile forces in comparison to control cells, a result that aligns with the observed distribution of active RhoA, cortical actin distribution and diminished retraction in cells treated with Gαi2MO. This strikingly contrasts with the normal cranial NC migration phenotype, where Rac1 is suppressed while active RhoA is increased at cell-cell contacts during CIL, leading to a shift in polarity towards the cell-free edge to sustain directed migration (Theveneau et al., 2010; Shoval and Kalcheim, 2012; Leal et al., 2018).”

      -To consider focal adhesion dynamics, live imaging should be used in the analysis. The fixed samples are different from each other, and natural variations of focal adhesion may exist among the samples. This can obscure data collection and quantification.

      R: We agree with the reviewer that focal adhesion (FA) dynamics need to be analysed using live imaging. Indeed, Fig 5E-H shows an extensive analysis of FA using live imaging of neural crest expressing FAK-GFP. As complement to this live imaging analysis, and in order to analyse the effect on the endogenous levels of FA proteins, we performed immunostaining against FA. Both experiments using live imaging or fixed cells produce similar results, and they are consistent with our model on the role of Gαi2 on FA dynamics.

      Minor comments -Fig. 2, the centrosomes in control cells are not always obvious. The microtubules simply seem to be more networked and more fluid in control cells. This should be clarified with either marking the centrosomes in the figure or modifying the wording in the manuscript.

      R: We appreciate and concur with the reviewer's comment on this matter. As pointed out by the reviewer, the precise localization of the centrosome is not consistently clear in all cells. In response to this observation, we have revised the manuscript to emphasize this aspect solely as “microtubule morphology”. Please refer to the Results section description Figure 2.

      -In Fig. 3, a better negative control for co-IP should be using anti-V5 antibody to IP against tubulin/EB1/EB3 in the absence of Gai2-V5.

      R: We appreciate the reviewer's comment, and we agree about the controls that the reviewer suggest. We can inform that we have done by triplicate all the Co-IPP. Although, if is necessary we will do the controls suggested. We present this assay as a plan.

      -The data for cell polarity proteins Par3 and PKC-zeta seem to be out of place. It is unclear whether mis-localization of these proteins has anything to do with NC migration defects induced by Gai2 knockdown. The conclusion does not seem to be affected if the data are taken out of the manuscript.

      R: We appreciate the reviewer's concern, and we would like to highlight two points in this regard. Firstly, we have included these results as additional data to support the impact of Gai2 knockdown on cell polarity, given that these two proteins are commonly used polarity markers. Secondly, we have discussed this aspect extensively in the Discussion section of the manuscript. (See page 19, paragraph 1, lines 18-28)

      In that section, we delve into the relationship between aPKC, Par3, and Gαi2 in controlling cell polarity during asymmetric cell division, as described in Hao et al., 2010. Par3 is known to play a role in regulating microtubule dynamics and Rac1 activation through its interaction with Rac-GEF Tiam1 (Chen et al., 2005). Additionally, it has been shown to promote microtubule catastrophes and inhibit Rac1/Trio signaling, regulating Contact Inhibition of Locomotion (CIL) as demonstrated in Moore et al., 2013. Thus, we believe that the data we present support the relationship between Par3 and aPKC localization changes and the neural crest migration defects induced by Gαi2 knockdown, probably by controlling microtubule dynamics. However, we have moved these results as part of the supplementary Figure S3.

      -In Suppl. Fig. 1, protrusion versus retraction should be defined more clearly. The retraction shown in this figure seems to be just membrane between protrusions instead of actively retracting membrane.

      R: We appreciate the reviewer's comments, and here we aim to provide a clearer description of our approach to this analysis. For the measurement of protrusion extension/retraction, we conducted two distinct experiments. The first, as described in Figure 1, involved measuring membrane extension and retraction in live cell using membrane-GFP by utilizing the image subtraction tool in ImageJ, which highlights changes in the membrane in red. Secondly, we employed ADAPT software to quantify cell perimeter based on fluorescence intensity in live cell using lifeactin-GFP, distinguishing membrane extension in green and retraction in red (as has been shown similarly in Barry et al., 2015). In both approaches, we observed a substantial increase in membrane protrusion (both in area and extension) and protrusion stability in Gαi2 morphants. Hence, we have revised the Materials and Methods section of the manuscript and included this clarification.

      See Materials and Methods section, Cell dispersion and morphology, page 25-26.

      Barry DJ, Durkin CH, Abella JV, Way M. Open source software for quantification of cell migration, protrusions and fluorescence intensities. J Cell Biol. 2015. Doi: 10.1083/jcb.201501081

      -Discussion can be improved by better incorporating all the components to make a cohesive story on how Gai2 works to regulate migration in the context of the neural crest cells.

      R: We appreciate the reviewer's comment and agree. To enhance the manuscript, we have included a new paragraph at the end of the Discussion/Conclusion section specifically addressing this point. For more details, please refer to page 21-22.

      “In the context of collective cranial NC cells migration, our findings reveal the pivotal role played by Gαi2 in orchestrating the intricate interplay of microtubule dynamics and cellular polarity. When Gαi2 levels are diminished, we observe significant impediments in the ability of cells to efficiently navigate through their environment, resulting in a range of distinct effects. First and foremost, Gαi2 deficiency leads to the diminished ability of cells to adjust and reorient new protrusions effectively. Primary protrusions exhibit higher stability and heightened levels of active Rac1/RhoA when compared to control conditions in the leading edge. In addition, we observe a notable increase in protrusion area, a decrease in retraction velocity, and an enhanced level of cell-matrix adhesion in Gαi2 knockdown cells. These findings underscore the pivotal role that Gαi2 plays in the modulation of various cellular dynamics essential for collective cranial NC cells migration. Notably, the application of nocodazole, a microtubule-depolymerizing agent, and NSC73266, a Rac1 inhibitor, to Gαi2 knockdown cells leads to the rescue of the observed effects, thus facilitating migration. This observed response closely mirrors the outcomes associated with Par3, a known regulator of microtubule catastrophe during contact inhibition of locomotion (CIL) in NC cells. This parallel implies that there exists a delicate equilibrium between microtubule dynamics and Rac1-GTP levels, crucial for the establishment of proper cell polarity during collective migration. Our findings collectively position Gαi2 as a central master regulator within the intricate framework of collective cranial NC migration. This master regulator's role is pivotal in orchestrating the dynamics of polarity, morphology, and cell-matrix adhesion by modulating microtubule dynamics through interactions with EB1 and EB3 proteins, possible in a protein complex involving other intermediary proteins such as GDIs, GAPs and GEFs, thus fostering crosstalk between the actin and tubulin cytoskeletons. This orchestration ultimately ensures the effective collective migration of cranial NC cells (Fig. 6).”

      From Reviewer #3

      Major comments: 1. The authors focus exclusively on the analysis of the subcellular levels of Rac1, which is probably related to the fact that they observe large extended protrusions with high Rac1 activity. However, as the authors note, a global fine-tuning of Rho GTPase activity is required for neural crest migration. One of the observed phenotypes of Gαi2-morphant neural crest cells is a decrease in cell dispersion, which may be caused by defects in contact inhibition of locomotion (CIL). This process involves a local activation of RhoA at cell-cell contact sites (Carmona-Fontaine et al., 2008). Furthermore, in fibroblast, RhoA/ROCK activity is required for the front-rear polarity switch during CIL (Kadir et al., 2011). Interestingly, similar to the Gαi2 loss of function phenotype, ROCK inhibition leads to microtubule stabilization, which can be rescued by nocodazole treatment, restoring microtubule dynamics and CIL. Therefore, it would also be interesting to know how RhoA activity is affected in Gαi2-morphant NC cells. At a minimum, this point should be be included in the discussion.

      R: We acknowledge and sincerely appreciate the reviewer's valuable comments on this pivotal aspect, which significantly enhances our capacity to elucidate the impact of Gαi2 knockdown on cell polarity. To address this crucial point, we have introduced an experiment that examines RhoA-GTP localization under Gαi2 knockdown conditions, and we have incorporated a supplementary figure S3 into our manuscript. This newly added figure clearly demonstrates that, under Gαi2 knockdown conditions and in contrast to control cells, RhoA-GTP localization is substantially disrupted at cell-cell contacts and now detected at the leading edge of the cell, providing compelling evidence of cell polarity defects (refer to Figure S2). In response to these results, we have included a description of these findings in the Results section (please see page 11-12) and a dedicated paragraph in the Discussion section (please see page 18, paragraph 2, line 15-16, page 19, paragraph 1, lines 6-12).

      Results section 1: “To achieve this, we explored whether Gαi2 regulates the subcellular distribution of active Rac1 and RhoA in cranial NC explants under Gαi2 loss-of-function conditions, considering their pivotal roles in cranial NC migration and contact inhibition of locomotion (CIL) (Carmona-Fontaine et al., 2011; Moore et al., 2013; Leal et al., 2018). Hence, we employed mRNA encoding the small GTPase-based probe, enabling specific visualization of the GTP-bound states of these proteins.”

      Results section 2: “Consistent with earlier observations by Carmona-Fontaine et al. (2011), in control cranial NC cells, active Rac1 displayed prominent localization at the leading edge of migrating cells, whereas its presence was reduced at cell-cell contacts, coincident with a increase in RhoA-GTP levels (white arrows in Fig. 4A, supplementary material Figure S2A). On the contrary, in comparison to the control cells, Gαi2 morphants exhibit a pronounced accumulation of active Rac1 protein in the protrusions at cell-cell contacts, where active RhoA localization is conventionally expected (white arrow in Fig. 4B, supplementary material Figure S3B and movie S6). In contrast to control cells, a notable shift in the localization of active RhoA protein was observed, with its predominant accumulation now detected at the leading edge of the cell, instead of the typical localization towards the trailing edge or cell-cell contacts (__supplementary material Figure S2). __These findings suggest a dysregulation of contractile forces that align with the observed distribution of active RhoA, cortical actin disruption, and diminished retraction in cell treated with Gαi2MO.”

      Discussion section:

      “Other studies have reported that microtubule assembly promotes Rac1 signaling at the leading edge, while microtubule depolymerization stimulates RhoA signaling through guanine nucleotide exchange factors associated with microtubule-binding proteins controlling cell contractility, via Rho-ROCK (cita) and focal adhesion formation (Krendel et al., 2002; Ren et al., 1999; Best et al., 1996; Garcin and Straube, 2019; Waterman-Storer et al., 1999; Bershadsky et al., 1996; Moore et al., 2013). This mechanism would contribute to establishing the antero-posterior polarity of cells, crucial for maintaining migration directionality, underscoring the significance of regulating microtubule dynamics in directed cell migration. These findings closely align with the results obtained in this investigation, demonstrating that Gαi2 loss of function reduces microtubule catastrophes and promotes tubulin stabilization, resulting in increased localization of active Rac1 at the leading edge and cell-cell contacts and decreasing active RhoA at the cell-cell contact but increasing at the leading edge, possibly reinforcing focal adhesion, which align with our result here that show large and highly stable focal adhesions under Gαi2 knockdown conditions. This finding also suggests a dysregulation of contractile forces in comparison to control cells, a result that aligns with the observed distribution of Active RhoA, cortical actin distribution and diminished retraction in cells treated with Gαi2MO. This strikingly contrasts with the normal cranial NC migration phenotype, where Rac1 is suppressed while active RhoA is increased at cell-cell contacts during CIL, leading to a shift in polarity towards the cell-free edge to sustain directed migration (Theveneau et al., 2010; Shoval and Kalcheim, 2012; Leal et al., 2018).”

      The co-Immunoprecipitation data lack marker bands (larger images/sections of the blots would be preferable) and the labelling is not clear. What do the white arrows in Fig. 3H,I mean? What does "elu" and "non eluted" mean? Did the reverse IP work as well?

      R: We appreciate the reviewer's comments, and here we intend to provide a more detailed explanation of our approach to this analysis. Since we do not possess a secondary antibody specific to the heavy chain, our method involves eluting the co-immunoprecipitated proteins to visualize those with weights close to that of the light chain (such as EB1). We have outlined this elution step in the “Cell lysates and co-immunoprecipitation” protocol in the Materials and Methods section. To ensure proper control, we load both fractions - the eluted (or supernatant) and non-eluted (or resin) fractions - to monitor the amount of protein extracted from the resin using a 1% SDS solution. It's important to note that the elution step, as indicated by the V5 signal, is not entirely efficient, and a significant portion of the protein remains bound to the resin. This issue may also apply to the EB1 protein; however, it is still possible to visualize both bands (Gαi2V5 and EB1).

      We have revised the legend for Figure 3 to include an explanation of the terms 'elu' (eluted fraction) and 'non-eluted' (non-eluted fraction). We have also included the explanation of the white arrows’ significance in the legends for Figure 3H and 3I. These arrows indicate the bands corresponding to the immunoprecipitated proteins.

      We also agree with the reviewer’s suggestion to conduct the reverse IP. We can inform that we have done by triplicate all the Co-IPP. Although, if is necessary we will do the controls suggested. We present this assay as a plan.

      The presentation of the Delaunay triangulations varies in quality. In Fig. 1 J/K the cells are clearly visible in the images, while this is not the case in Fig. 3 J-M and Fig. 4K-N. Conversely, the Delaunay triangulations in Fig. 1L are mainly black, while they are clear in Fig. 3 and 4. Perhaps the authors could find a more consistent way to present the data. Were the explants all approximately the same size at the beginning of the experiment? The Gαi2-morphant explant in Fig. 3K appears to be unusually small.

      R: We appreciate the reviewer’s concerns and have taken steps to address them. To improve the quality of our data, we have made enhancements to the presentation of Figures 3 (panels J-M) and Figure 4 (panels K-N). Specifically, we have standardized the Delaunay triangulation representations.

      Regarding the size of the explants at the beginning of the experiments, they were indeed approximately similar in size. We confirmed this by including a reference point (point 0) for each condition in the figures 3. However, in the panels presented, we show the results after 10 hours (Figure 3, X. laevis) and 4 hours (Figure 4, X. tropicalis) to assess cell dispersion, as indicated in the respective figure legends. This uniformity in size was further ensured by the calculation used to quantify dispersion. For the dispersion assay, we normalized each initial size of the explant upon the control, and we have added another representative explant of Gαi2 morpholino with its Delaunay triangulation to facilitate the experiment interpretation. Every Delaunay triangulation calculates the area generated between three adjacent cells and it grows depending on how much disperse are the cells between each other in the final point. (See Material and Methods section, Cell dispersion and morphology). As we can see in the manuscript, in every dispersion experiment that we have performed with Gαi2 morpholino, the cells cannot disperse at all. Furthermore, to analyze the dispersion rate in this experiment we use Control n= 21 explants, Gαi2MO n= 24 explants, Gαi2MO + 65 nM Nocodazole n= 36 explants, Control + 65 nM Nocodazole n= 30 explants (as we mentioned in the manuscript legend).

      Why was the tubulin distribution in Fig. 2F measured from the nucleus to the cell cortex? Would it not make more sense to include cell protrusions? This does not seem to be the case in the example shown in Fig. 2F.

      R: We appreciate the reviewer's concern. We would like to clarify that for the tubulin distribution measurements, we indeed measured from the nucleus to the cell protrusion. As a result, we have made an edit to Figure 2 (panel 2F) to provide further clarity on this matter.

      The immunostaining for acetylated tubulin (Fig. 3A,B) looks potentially unspecific and seems to co-localize with actin (for comparison see Bance et al., 2019). For imaging and quantification, it may be better to use tubulin co-staining to calculate the percentage of acetylated tubulin. Please also add marker bands to the Western blot in Fig. 3C. If this issue cannot be resolved it may be better to only include the Western blot data.

      R: We appreciate the reviewer's concern about the potential unspecific nature of acetylated-tubulin and its co-localization with actin, particularly in Figure 3. Regarding the co-localization with actin, it is predominantly observed in panel B, and we attribute this phenomenon to the Gαi2 morphant phenotype, where cortical actin is notably reduced, creating the appearance of co-localization. However, we will assess the experiment as suggested by the reviewer. Therefore, our plan is to conduct an immunostaining for acetylated tubulin and tubulin in both control and Gαi2 knockdown conditions. This will allow us to calculate the percentage of acetylated tubulin and complement the western blot analysis.

      We have included marker weight indications on the western blot panel in Figure 3C.

      The model in Fig.6 indicates that Gαi2 inhibits EB1/3. What is the experimental evidence and the proposed mechanism for this? In the discussion, the authors cite evidence that Gαi activates the intrinsic GTPase activity of tubulin, which would destabilize microtubules by removing the GTP cap. However, this mechanism would not directly affect EB1 and EB3 stability as the Fig. 6A seems to suggest. The authors also mention that EB3 appears to be permanently associated with microtubules in Gαi2-morphant cells. How would this work, given that end-binding proteins bind to the cap region? Are the authors suggesting that there is an extended cap region in Gαi2 morphants?

      R: We appreciate the reviewer's valuable comments. We agree with the reviewer's observation that our experiments do not establish a causal link between Gαi2, EB1/EB3, and Rac1. We established a relationship between Gαi2 and microtubule dynamics (EB1 and EB3) to regulate Rac1 polarity through co-immunoprecipitation assays, which reveal protein interactions within an interactor complex. In addition, in Gαi2 Knockdown conditions we have found a strong reduction in microtubules dynamics following EB-GFP comets. Regarding the observation that EB3 seems to be persistently associated with microtubules in Gαi2-morphant cells, we wish to clarify that this is a speculation based on the microtubule phenotype observed during our dynamic analysis, where they appear more like lines rather than comets. It is important to note that none of the experiments conducted in this study conclusively demonstrate this, and thus, it remains a suggestion. Therefore, while our findings support the involvement of Gαi2 in coordinating cranial NC cell migration alongside EB1, EB3, and Rac1, we cannot exclude the possibility that this regulation may occur through other intermediary proteins, such as GEFs, GAPs, GDIs, and others. As a result, we have revised our model in accordance with the reviewer suggestion.

      We have edited both the model and the legend at Figure 6. Gαi2 controls cranial NC migration by regulating microtubules dynamics.

      Considering this, we have reviewed the manuscript to provide clarity on this point. See page 16 (paragraph 2, last line), 17 (paragraph 1, last line), 22 (paragraph 1, last line 17-20), 42 (Legend Fig. 6).

      Minor comments 1. The citation of Wang et al. 2018 in the introduction does not seem to fit.

      R: We appreciate the correction provided by the reviewer. We have carefully reviewed the Introduction and Reference sections and have corrected this error.

      2.Does the graph in Fig. 4S show average values for the three conditions? If so, what is the standard deviation?

      R: We appreciate the reviewer’s concern and we have added the standard deviation to Figure 4S.

      3.From the images in Fig. 2G and H, it is difficult to understand what the difference is between the four groups shown.

      R: We appreciate the reviewer's comment, and to clarify this point, we would like to explain that the comparison has been made between each type of comet. The PlusTipTracker software separates comets based on their speed and lifetime, classifying them as fast long-lived, fast short-lived, slow long-lived, or slow short-lived. In both conditions (control and morphant cells), we compared the percentage of each type of comet, as previously described in Moore et al., 2013. The results demonstrate that morphant cells exhibit an increase in slow comets compared to control cells. The same explanation is described in the Material and Methods section on page 26, Microtubule dynamics analysis.

      Reviewer #3 (Significance (Required)): Overall, the study is well executed and significantly advances our understanding of the control and role of microtubule dynamics in cell migration, which is much less understood compared to the function of the actin cytoskeleton in this process. The strength of the study is the use of state-of-the-art (live imaging) techniques to characterize the role of Gαi in neural crest migration at the cellular/subcellular level. This article will be of interest to a broad readership, including researchers interested in basic embryonic morphogenesis, cell migration, and cytoskeletal dynamics, as well as translational/clinical researchers interested in cancer progression or wound healing.

      R: We really appreciate the reviewer positive comments and consideration. We believe that the review process has significantly strengthened our manuscript.

    1. Author Response

      The following is the authors’ response to the original reviews.

      Thank you for reviewing and assessing our paper. Reviewer2 had only posive comments. Reviewer 1 also had posive comments but included a list of suggesons. The revised version includes text edits to address the suggesons.

      Reviewer 1:

      … First, it is unclear whether the experiments and analyses were set up to be able to rule out more specific candidate funcons of the ZI.

      The list of possible funcons performed by the ZI is broad. Nevertheless, our study considers a rather long list of neural processes related to the behaviors listed below.

      Second, many important details of the experiments and their results are hard to decipher given the current descripons and presentaons of the data.

      The procedures used in the present study have all been used and described in our previous studies (cited). We used the same descripons and presentaons as in the prior studies. We have gone over the Methods and figures to ensure that all details required to understand the experiments are provided, but we also added further details following the suggesons noted below.

      The paper could be significantly strengthened by including more details from each experiment, stronger jusficaons for the limited behaviors and experimental analyses performed, and, finally, a broader analysis of how the recorded acvity in the ZI relates to behavioral parameters.

      The paper studied several behaviors including: 1) spontaneous movement of head-fixed mice on a spherical treadmill, 2) tacle (whisker, and body parts) and auditory (tones and white noise) smuli applied to head fixed mice, 3) spontaneous movement iniaon, change, and turns in freely moving mice, 4) auditory tone (frequency and SPL) mapping in freely behaving mice, 5) auditory-evoked orienng head movements (responses) in the context of several behavioral tasks, 6) signaled acve avoidance responses and escapes (AA1), 7) unsignaled/signaled passive avoidance responses (AA2ITI/AA3-CS2), 8) sensory discriminaon (AA3), 9) CS-US interval ming discriminaon (AA4), and 10) USevoked unsignaled escape responses.

      In freely moving experiments, the behavior is connuously tracked and decomposed into translaonal and rotaonal movement components. Discrete responses are also evaluated (e.g., acve avoids, escapes, passive avoids, errors, intertrial crossings, latencies, etc.). These behavioral procedures evaluate many neural processes, including decision making (Go/NoGo in AA1-3), response control/inhibion (unsignaled and signaled passive avoidance in AA2/3), and smulus discriminaon (AA3). The applied smuli, discrete responses, and tracked movement are always related to the recorded ZI acvity using a variety of techniques (e.g., cross-correlaons, PSTHs, event-triggered me extracons, etc.), which relate the discrete and me-series parameters to the neural acvity. We do not think all this qualifies as, “limited behaviors”.

      (1) Anatomical specificaon: The ZI contains many disnct subdivisions--each with its own topographically organized inputs/outputs and putave funcons. The current manuscript doesn't reference these known divisions or their behavioral disncons, and one cannot tell exactly which poron(s) of the ZI was included in the current study. Moreover, the elongated structure of the ZI makes it very difficult to specifically or completely infect virally. The data could be beter interpreted if the paper included basic informaon on the locaons of recordings, the extent of the AAV spread in the ZI in each viral experiment, and what fracon of infected neurons were inside versus outside ZI.

      Our experiments employed Vgat-Cre mice to target ZI neurons. In this line, GABAergic neurons from the enre ZI express Cre, including the dorsal and ventral subdivisions (see (Vong et al., 2011; Hormigo et al., 2020)). Consequently, AAV injecons in Vgat-Cre mice produce restricted expression in the ZI that can fully delineate the nucleus as shown in the papers referenced above (including ours). There is nil expression in structures above or below ZI because they do not express Cre in these mice (e.g., thalamus and subthalamic nucleus), which allows for selecve targeng of ZI. Our optogenec manipulaons and photometry recordings were not aimed at specific ZI subdivisions. We targeted the area of ZI indicated by the stereotaxic coordinates (see Methods), which are aimed at the center of the structure to maximize success in recording/manipulang neurons within ZI. While all the animals included in the study expressed opsins and GCaMP within ZI that in many animals fully delineated the nucleus, there was normal variability in the locaon of opcal fibers, but we did not detect any differences in the results related to these variaons.

      Fiber photometry and optogenecs experiments are performed with rather large diameter opcal probes, which record/manipulate relavely large areas of the targeted structure. This is useful because our goal was to idenfy funconal roles of the enre ZI, which could then be parsed. In the present study, we did not perform experiments to target specific ZI populaons (e.g., retrograde Cre expression from target areas), which may have revealed differences atributed to their projecon sites. However, in the last experiment, we selecvely excited ZI fibers targeng three different areas (midbrain tegmentum, superior colliculus, and posterior thalamus), which revealed clear differences on movement. Thus, future experiments should explore these different populaons (e.g., using retrograde/anterograde expression systems), which may be in different subdivisions.

      We have enhanced the Methods secon to clarify these points, including the addion of these references.

      (2) Electrophysiological recording on the treadmill: The authors are commended for this technically very difficult experiment. The authors do not specify, however, how they knew when they were recording in ZI rather than surrounding structures, parcularly given that recording site lesions were only performed during the last recording session. A map of the locaons of the different classes of units would be valuable data to relate to the literature.

      We have added details about this procedure in the Methods secon. These recordings are performed based on coordinates, and categorizing neurons as belonging to ZI is obviously an esmate based on the final histological verificaon. Nevertheless, the marking lesions revealed that the electrodes were on target, which likely resulted from the care taken during the surgical procedure to define reference points used later during the recording sessions (see Methods). Regarding a map of the unit locaons, we performed several analyses that did not reveal clear differences based on site. For example, we compared depth vs cell class, “There was no difference in recording depth between the four classes of neurons (ANOVA F(3,337)= 1.06 p=0.3676)”. Future experiments that employ addional methods (labelling, opto-tagging, etc.) would be more appropriate to address mapping quesons. Finally, as we state in the paper, “However, these recordings do not target GABAergic neurons and may sample some neurons in the tissue surrounding the zona incerta. Therefore, we used calcium imaging fiber photometry to target GABAergic neurons in the zona incerta”.

      (3) The raonale of the analysis of acvity with respect to “movement peak”: It is unclear why the authors did not assess how ZI acvity correlates with a broad set of movement parameters, but rather grouped heterogeneous behavioral epochs to analyze firing with respect to “movement peaks”.

      The reviewer is referring to movement peaks on the spherical treadmill. On the treadmill, we used the forward locomotor movement of the animal because this is the main acvity of the mice on the treadmill. We considered “all peaks” (or movements) and “>4 sec peaks”, which select for movement onsets. Compared to the treadmill, in freely movement condions during various behavioral tasks, there is a richer behavioral repertoire, which was analyzed in more detail (i.e., translaonal, and rotaonal components during spontaneous ongoing movement and movement onsets, movement related to various behaviors such as orienng, acve and passive avoidance, escape, sensory smulaon, discriminaon, etc.). Thus, we focused on a broader set of movement parameters in the Cre-defined ZI cells of freely behaving mice.

      (4) The display of mean categorical data in various figures is interesng, however, the reader cannot gather a very detailed view of ZI firing responses or potenal heterogeneity with so litle informaon about their distribuons.

      The PCA performs the heterogeneity classificaon in an unbiased manner, which we feel is a thoughul approach. The firing rates and correlaons with movement for each category of neurons are detailed in the results. Furthermore, the sensory responses for these neurons are also detailed. Together, we think this provides a detailed view of the units we recorded in awake/head-fixed mice. As already stated, further study would benefit from an addional level of cell site verificaon.

      (5) Somatosensory firing responses in ZI: It is unclear why the authors chose the specific smuli used in the study. How oen did they evoke reflexive motor responses? What was the latency of sensory-evoked responses in ZI acvity and the latency of the reflexive movement?

      These are broad quesons, and we assume that the reviewer is asking about somatosensory evoked responses on the spherical treadmill. We used air-puffs applied to the whiskers and on the back (le vs right) because the whiskers represent an important sensory representaon for mice, and the back is a part of the body (trunk), which we oen use to movate the animals to move forward on the treadmill. Regarding the latency of the somatosensory evoked responses, in this case, we did not correct them based on the me it takes the air-puff to travel to the whiskers or body part, and therefore we did not provide latencies. Moreover, air-puffs are not a very good method to quanfy whisker-evoked latencies, which are beter measured using other methods (whisker deflecons of single/mulple whiskers using piezo-devices or other mechanical devices, as we and others have done in many studies). We are not sure what the reviewer means by “reflexive behavior”; we did not measure any reflexive behavior under these condions. We have gone over the Methods and Results to ensure that sufficient details are provided about these experiments.

      (6) It would be valuable to see example traces in Figure 3 to get a beter sense of the me course and contexts under which Ca signals in ZI tracks movement. What is the typical latency? What is the typical range of magnitudes of responses? Does the Ca signal track both fast and slow movements? How are the authors sure that there are no movement arfacts contribung to the calcium imaging? It seems there is more informaon in the dataset that could be valuable.

      As is well known, fiber photometry calcium imaging is a slow populaon signal. We do not think it would be valuable to get into ming issues beyond what is already detailed in the study (i.e., magnitudes measured as areas or peaks, and ming as me-to-peaks). Regarding “movement arfacts”, these signals are absent (flat) in animals that do not express GCAMP. We agree that there must be addional valuable informaon in our datasets (as in most me-series). However, the current paper is already rather extensive. We will connue to peruse our datasets and report addional findings in new papers.

      (7) Figure 4: The raonale for quanfying the F/Fo responses over a 6-second window, rather than with respect to discrete movement parameters, is not well explained. What types of movement are binned in this approach and might this broad binning hinder the ability to detect more specific relaonships between acvity and movement?

      Figure 4 is focused on characterizing the relaonship between turns (ipsiversive and contraversive) during movement and ZI acvity. We tested different binning windows to find differences, including the 6 sec window in figure 4 for populaon measures (-3 to 3 sec around the turns). This binning approach is effecve at revealing differences where they exist (e.g., superior colliculus) as shown in our previous studies (e.g. (Zhou et al., 2023)). Moreover, the turns in the different direcons can be considered discrete responses at their peak, and the ming of the related acvaons (e.g., me to peaks), which we evaluated, are rather sensive and would have revealed differences, but we did not find them.

      (8) Separaon of sensory and motor responses in Figure 5: The current data do not adequately differenate whether the responses are sensory or motor given the high correlaon of the sensory inputs driving motor responses. Because isoflurane can diminish auditory responses early in the auditory pathway, this reviewer is not convinced the isoflurane experiments are interpretable.

      The reviewer is referring to Fig. 5C,D. Indeed, the point of this experiment was to show that it is difficult to differenate whether neural responses are sensory or motor in awake and freely moving condions. As we stated in the Results secon, “Although arousal and movement were not dissected in the present experiment (this would likely require paralyzing and ventilating the animal), the results indicate that activation of zona incerta neurons by sensory stimulation is primarily associated with states when sensory-evoked movement is also present”. This is followed in the Discussion by, “…as already noted, the suppression of sensory responses may be due to changes in arousal (Castro-Alamancos, 2004; Lee and Dan, 2012) and not caused by the abolishment of the movements per se”.

      (9) Given the broad duraon of the mean avoidance response (Fig. 6 C, botom), it would be useful to know to what extent this plot reflects a prolonged behavior or is the result of averaging different animals/trials with different latencies. Given that the shapes of the F/Fo responses in ZI appear similar across avoids and escapes (Fig. 6D), despite their apparent different speeds and movement duraons (Fig 6C), it would be valuable to know how the ming of the F/Fo relates to movement on a trial-by-trial basis.

      The duraon of the avoidance response cannot be ascertained from CS onset (panel 6C botom) and avoids are not wide but rather sharp. We have now made this clearer when Fig. 6C is first menoned (“note that since avoids occur at different latencies after CS onset they are best measured from their occurrence as in Fig. 6D”). Like other related condioned and uncondioned responses, avoids and escapes are similar, varying in the noted parameters. Regarding ming, as already menoned above, we think that the characteriscs of the populaon calcium signal make it unsuitable for further ming consideraons than what we included, parcularly for movements occurring at the fast speeds of avoids and escapes.

      (10) Lesion quanficaon: One cannot tell what rostral-caudal extent of ZI was lesioned and quanfied in this experiment. It would be easier to interpret if also ploted for each animal, so the reader can tell how reliable the method is. The mean ablaon would be beter shown as a normalized fracon of cells. Although the authors claim the lesions have litle impact on behavior, it appears the incompleteness of the lesions could warrant a more conservave interpretaon.

      The lesion experiment was a complement to the optogenecs inacvaon experiments we performed in our preceding ZI paper and in the present paper. Thus, the finding that the lesions had litle impact on behavior is supporve of the optogenecs findings. Regarding cell counts, we did not select any parts of the ZI to quanfy the number of neurons in either control or lesion mice. We considered the full rostrocaudal extent in our measurements. We are not sure what “fracon” the reviewer is suggesng, considering that these counts are from two different groups of mice (control vs lesion). Note that the red-marked neurons, as shown in Fig. 8A, reveal healthy non-Vgat-Cre neurons outside ZI that mark the extent of the AAV diffusion, which as shown spanned the full extent of the ZI in the coronal plane (and in other planes as the AAV spreads in all direcons).

      (11) Optogenecs: the locaon of infected neurons is poorly described, including the rostral-caudal extent and the fracon of neurons inside and outside of ZI. Moreover, it is unclear how strongly the optogenec manipulaons in this study are expected to affect neuronal acvity in ZI.

      We discussed the first point in (1) above. Regarding, how optogenec manipulaons are expected to affect neuronal acvity in ZI and its targets, we have conducted extensive electrophysiological recordings in slices and in vivo to detail the effects of our manipulaons on GABAergic neurons (e.g. (Hormigo et al., 2016; Hormigo et al., 2019; Hormigo et al., 2021a; Hormigo et al., 2021b), including ZI neurons (Hormigo et al., 2020). In fact, we never use an opsin we have not validated ourselves using electrophysiology. Moreover, our experiments employ a spectrum of optogenec light paterns (including trains/cont at different powers) that trate the optogenec effects within each session/animal. As shown in fig. 11 and 12, these paterns produce different behavioral effects related to the different levels of neural firing they induce. For ChR2-expressing neurons in ZI, firing is frequency dependent and maximal during Cont blue light (at the same power). For Arch-expressing neurons only Cont is used, and inhibion is a funcon of the green light power. When blue light is applied in ZI fibers targeng different areas, this relaonship changes. Blue light trains (1-ms pulses) at 40-66 Hz become the most effecve means of inducing sustained postsynapc inhibion compared to Cont or low frequencies.

      References

      Castro-Alamancos MA (2004) Dynamics of sensory thalamocorcal synapc networks during informaon processing states. Progress in Neurobiology 74:213-247.

      Hormigo S, Vega-Flores G, Castro-Alamancos MA (2016) Basal Ganglia Output Controls Acve Avoidance Behavior. J Neurosci 36:10274-10284.

      Hormigo S, Zhou J, Castro-Alamancos MA (2020) Zona Incerta GABAergic Output Controls a Signaled Locomotor Acon in the Midbrain Tegmentum. eNeuro 7.

      Hormigo S, Zhou J, Castro-Alamancos MA (2021a) Bidireconal control of orienng behavior by the substana nigra pars reculata: disnct significance of head and whisker movements. eNeuro. Hormigo S, Vega-Flores G, Rovira V, Castro-Alamancos MA (2019) Circuits That Mediate Expression of Signaled Acve Avoidance Converge in the Pedunculoponne Tegmentum. J Neurosci 39:45764594.

      Hormigo S, Zhou J, Chabbert D, Shanmugasundaram B, Castro-Alamancos MA (2021b) Basal Ganglia Output Has a Permissive Non-Driving Role in a Signaled Locomotor Acon Mediated by the Midbrain. J Neurosci 41:1529-1552.

      Lee SH, Dan Y (2012) Neuromodulaon of brain states. Neuron 76:209-222.

      Vong L, Ye C, Yang Z, Choi B, Chua S, Jr., Lowell BB (2011) Lepn acon on GABAergic neurons prevents obesity and reduces inhibitory tone to POMC neurons. Neuron 71:142-154.

      Zhou J, Hormigo S, Busel N, Castro-Alamancos MA (2023) The Orienng Reflex Reveals Behavioral States Set by Demanding Contexts: Role of the Superior Colliculus. J Neurosci 43:1778-1796.

    1. Author Response

      The following is the authors’ response to the original reviews.

      We thank the reviewers for recognizing the importance of our work and for their insightful suggestions. A point-by-point response to their comments is listed underneath each reviewer’s section.

      Reviewer #1 (Recommendations For The Authors):

      Major comments

      1) Have the authors optimized the expression level of dCas9? I cannot find this information in this paper or in their 2021 paper. It is important to avoid the toxicity phenomenon that occurs when using guide RNAs that share specific five base seed sequences (referred to as 'bad seeds').

      Cui L., Vigouroux A., Rousset F., Varet H., Khanna V., Bikard D. A CRISPRi screen in E. coli reveals sequence-specific toxicity of dCas9. Nat. Commun. 2018; 9:1912.

      Rostain W., Grebert T., Vyhovskyi D., Thiel Pizarro P., Tshinsele-Van Bellingen G., Cui1 L., Bikard D. Cas9 off-target binding to the promoter of bacterial genes leads to silencing and toxicity. Nucleic Acids Research, 2023, gkad170.

      2) One guide per gene is highly unusual given that different guides block the RNA polymerase with different efficiency. This was even shown by the Machner lab in the Legionella context in Figure 1c of Ellis et al. 2021 for sidM and vipD. Typically, genes need three guides minimum to ensure that the gene of interest is knocked down fully unless it is not possible as the gene is too small and/or it is difficult to find an NGG sequence. The authors have used one guide per effector, how can they be sure that each gene is knocked down? The Machner lab themselves in Figure 3c of Ellis et al. 2021 shows not all genes targeted using multiplex CRISPRi are equally efficiently knocked down. Please justify why only one guide per gene was chosen and add controls to validate the results. The authors themselves state that the strategy of one guide may be problematic. Lines 315-316 it reads... A possible explanation was the incomplete knockdown of a seemingly important process.

      3) Given what the Machner lab observed about spacer location in Ellis et al. 2021 would it not make more sense to take one set of redundant effectors and make multiplex randomized CRISPRi with them in different locations? For Figure 1 at least.

      4) Following infection, it seems that the bacteria were not plated onto antibiotic media, so it is not known how well the plasmid harboring guides is kept through infection.

      Specific comments

      A) The first results paragraph describes the set-up of 10-plex synthesized CRISPR arrays, where 10 effector encoding genes of specific gene families are selected. The rationale of the choice of these genes is not given. Please explain.

      B) Please also add some biological data on what these genes code for, and what is their known or predicted function. It is not very informative and exciting to have tables of lpg numbers without any knowledge of what these genes code for and why they were selected, at least some.

      C) Figure 1 A Why are only some of the MC arrays shown? Please, at least include in supplementary the others. Again one array in detail would be more informative, showing true knockdown of all genes by qPCR and ideally by western blot.

      D) I am not convinced that the gene silencing efficiency qPCR comparison is done in the correct way. In my opinion, each of the genes to be knocked down should be tested against the expression of a control gene e.g. rpoS and then these results should be compared and not the results of empty plasmid or CRISPR array containing plasmid directly. L. pneumophila are very sensitive to growth conditions and inoculum, thus the two strains might not be completely at the same growth stage when being compared which can impact the results.

      E) Figure 1 B As stated in general comment number 4, the authors do not appear to plate onto antibiotic so we don't know how well the plasmid harboring the guides is kept through infection. The sustained presence of the guide is particularly important for CRISPRi.

      F) The authors found only a few growth phenotypes and mainly this was due to single genes and not combinations of genes. This might again be due to the fact that only one guide per gene was used. How do the authors know that all genes targeted were indeed knocked down?

      G) Line 119 Alternatively, the genes were not 100% all knocked down, escaping the knockdown effect expected. Could authors take three genes with three guides each and look at impact instead of only one?

      H) The authors then develop the randomized multiplexed arrays and chose to test 44 TME encoding genes. Line 141 Justify why these effectors were chosen in the text.

      I) Unfortunately, the method is not clearly described, and many parts are complicated and the text needs to be re-read several times to be understood (lines 150 - 166). Please re-write to better explain to the reader. In line 156 the authors point to a supplementary note 1. This information should be in the main text.

      J) What is the copy number of the CRISPR plasmid? Please add in the Material and Method section also the origin of this plasmid.

      Figure 2

      K) In the paper (line 154-160) and the extra notes, it states that authors attempt to size select CRISPR arrays. However, this is not apparent in Figure 2 schematic. Or are the authors stating that plasmids only containing one guide were selected out? However, line 312 would suggest not. Please clarify

      L) A limiting factor in making multiplex guide CRISPR, as the authors are trying to establish in this study, is cloning of multiple guides. In the pre-determined CRISPR arrays in this study, the guides were synthesized. For the randomized multiplex CRISPR in this study, the authors adapt a Golden Gate cloning method to generate multiple sgRNAs in the Cas9 vector. A similar protocol was established in the below paper. Please add this reference.

      Zuckermann, M.; Hlevnjak, M.; Yazdanparast, H.; Zapatka, M.; Jones, D.T.W.; Lichter, P.; Gronych, J. A novel cloning strategy for one-step assembly of multiplex CRISPR vectors. Sci. Rep. 2018

      M) As the authors note, Zuckermann et al. similarly note that plex of 3 or 4 is most common and above 5 is rare. This thus appears to still be the limiting step of multiplex CRISPR technology. Please discuss

      Figure 4

      N) The idea of multiplexed CRISPRi seq to address the biological phenomenon of redundancy is an interesting one, however, I am missing the in-depth functional characterization and discussion of at least one of the redundant functions discovered. Please add.

      Figure5/6

      O) As noted above, the strength of the experiments is undermined by how CRISPRi is set up. Having an average multiplex of 2 or three and again only using one guide per gene weakens the study and the results obtained. Furthermore, as stated in general comment number 4, the authors do not appear to plate onto antibiotic so again, we don't know how well the plasmid harboring the guides is kept through infection. The sustained presence of the guide is particularly important for CRISPRi. Please add a validation that the guides are all present.

      Response to Reviewer #1

      We are grateful to the reviewers for their insightful comments and suggestions on how to further improve the manuscript.

      Regarding the issue of ‘bad seed sequences’ (comment #1), we had previously evaluated the expression level of dcas9 (plotted in Figure 1b of the 2021 Communications Biol paper) and tuned our induction conditions accordingly (40 ng/mL as described in the Methods). Since all strains used in this study express dcas9 from the chromosome, not a plasmid, this eliminates the possibility of fluctuations in expression levels due to variabilities in plasmid copy numbers.

      In the rare event that toxicity by any given guide occurs, we would expect that guide to already be underrepresented or missing in the input pool following 24+ hours of CRISPRi induction during axenic growth. Our data, now discussed in the manuscript (Lines 211-216 and Figure S2), revealed that this was not the case and that all guide-encoding spacers were present in roughly equal amounts (median of >5000 occurrences). As with any knockdown study, the creation of true chromosome deletions was performed throughout as to alleviate the issue of false positives.

      Regarding comments #2, #3, and specific comments made under point F, G, and O, on the topic of using single vs. multiple guides, we agree that there are circumstances under which using more than one guide per target may be advantageous, for example when attempting to delete a gene from mammalian cells using conventional CRISPR engineering. In the study described here, this is not the case. In fact, we did create a second array library with alternative guides targeting the same group of genes at locations other than the “optimal location” identified in our 2021 paper and found that these “sub-optimal” guides were inefficient for identifying critical effectors as described in Supplemental Note S1 under the heading “Sub-optimal annealing sites” (Lines 919+). These data suggest that adding sub-optimal guides to the arrays of optimal guides might ‘poison’ the arrays and limit rather than enhance their ability to identify gene combinations.

      Regarding comment #2, #3, and specific comments made under point C, F, and G, on the topic of confirming efficient gene knockdown for the identification of critical genes, we remind Reviewer 1 that we did confirm knockdown of 60 of the target genes of the 10-plex screen to be at least 2-fold, with an average fold repression of one order of magnitude or more (Figure 1A). While knockdown of every gene in every 10-plex construct would be an unprecedented ask of any published CRISPR screen, we believe that these 60 genes provide a large enough sampling of all guides to elucidate the range of knockdown to be expected by our CRISPRi platform. As with other knockdown technologies, such as RNAi, there is no expectation of accomplishing complete knockdown for any given target. Hence, the data in Figure 1A suggest that the lack of identifying critical genes using pre-determined 10-plex arrays was not due to a lack of knockdown efficiency, but rather the difficulty to accurately predict redundancy within a cohort of uncharacterized genes, accentuating the need for array randomization with MuRCiS.

      On the topic of antibiotic use for plasmid selection (comments #4, E and O), we would like to clarify that the CRISPR plasmids were selected by thymidine prototrophy, not antibiotic resistance, and we apologize for not making this clearer. The laboratory strain Lp02 is a thymidine auxotroph (thyA-) L. pneumophila variant, and plasmid retention is routinely achieved by including the thymidine biosynthesis gene (thyA) on the plasmid backbone. Only with a plasmid bearing the thyA gene can L. pneumophila grow on CYE (thymidine-) plates. Our use of vectors bearing thyA and plating on CYE plates is described in the Methods section. Further, in Figure 7 of our 2021 paper, we show that CRISPR plasmids are efficiently retained by Lp02 for the duration of a 48-hour infection, resulting in efficient multi-gene knockdown even at the end of the intracellular growth experiment.

      Regarding comments A and B, on publishing the biological data used to classify genes in gene families for 10-plex silencing, we do not consider it critical to provide additional information beyond the broad classification (e.g. kinases, phosphatases, etc) described in Table S1. Structural predictions constantly change due to continuously evolving databases. Our initial analyses were made in 2015 using HHPRED Hidden-Markov models and, in all likelihood, those predictions have been refined since then. Moreover, with the recent advent of Alphafold, anyone interested in learning more about select effectors from our list is advised to simply access the most recent functional predictions directly on the Alphafold webpage (https://alphafold.ebi.ac.uk/). We clarify how predictions were made on Lines 97-101.

      Regarding specific comment D, on our method for qPCR normalization and comparison, we point Reviewer 1 to the Methods section (Lines 460+) where we describe that data obtained from each CRISPRi strain were in fact normalized to the levels of rpsL prior to comparing them to the normalized data from the strain with the empty control plasmid. This normalization to rpsL, a gene encoding a ribosomal protein, also corrects for growth differences between samples.

      Regarding specific comment H, the justification for studying 44 transmembrane effector-encoding genes was driven by the fact that activities mediated by transmembrane proteins are difficult (though not impossible) to be replaced by cytosolic proteins, for example the transport of metabolites across the LCV membrane. And since transmembrane regions can be predicted with high confidence, we decided to probe this group of TMEs for synthetic lethality with the randomized CRISPRi approach as proof-of-concept. To make this clearer, we have added more detail to the text (Lines 151-155).

      Regarding specific comment I, we have further simplified the description of the cloning technique to increase clarity (Lines 156+). The information listed under Supplemental Note S1, though informative, is not critical for the overall understanding of this highly technical section, and since the reviewer already considered this section to be difficult to follow, we would prefer to not further complicate the text by including these non-essential details.

      Regarding the origin of the CRISPRi plasmid (specific comment J), we point Reviewer 1 to the reference (Hammer BK and Swanson MS (Mol Microbiol 1999)) listed in Table S10: Strains and Plasmids Used in this Study.

      Regarding specific comment K and O, on the clarity of depicting the CRISPR array size selection process, we have updated the Figure 2 schematic. Reviewer 1 is correct in that despite our best efforts to exclude short CRISPR arrays, inevitably some 1-plex arrays remained in our input vector pool. Still, the average length of all arrays used in our pilot study exceeded three crRNA-encoding spacers. Further, having a population of 1- or 2-plex arrays in our libraries did allow us to pin-point the most critical effectors of a larger arrays within the same MuRCiS experiment (Table S5 and Table S7), a strength of MuRCiS as described in the discussion (Lines 378+).

      Regarding specific comment L, we appreciate Reviewer 1’s suggestion of an additional reference and we have added it to the manuscript as reference #23 (Line 71). While this reference does use a Golden Gate strategy to build a multiplex array, that array was not randomized but had a predefined order. Hence, our assembly method is unique due to its randomization.

      Regarding specific comment M, on array length cloning limitations, we agree with the conclusion of Zuckermann in Figure 1d of their article that longer inserts are generally harder to get into vector backbones. The challenge of cloning longer inserts is a common phenomenon of general biology and is not unique to cloning CRISPR arrays. We have altered the wording in our manuscript to better describe the intrinsic competition between short and long inserts during cloning (Lines 162-164).

      Regarding specific comment N, we second Reviewer 1’s desire to learn more about the critical effector pairs discovered here. With that said, the goal of this manuscript is to report the development of a novel MuRCiS pipeline to identify these critical pairs. Biochemical and molecular investigations of the encoded protein pairs are on-going and will be the topic of a future manuscript.

      Reviewer #2 (Recommendations For The Authors):

      Specific points

      1) The effector repertoire of L. pneumophila seems to have evolved in response to the plethora of potential protozoan hosts (PMID: 31988381). To further assess evolutionary aspects of the vast L. pneumophila effector arsenal, it would be interesting to test the single and double effector mutant strains (Fig. 5FG, Fig. 6EF) for growth in protozoa other than A. castellanii.

      2) Most CRISPR arrays comprising genes encoding functionally similar proteins or encoding evolutionarily conserved proteins did not substantially affect intracellular growth of L. pneumophila (Fig. 1B). This rather surprising result should be further discussed.

      3) l. 118/119: "Similar results ..., where none of the MC arrays ..." This statement should be phrased more precisely, since some CRISPR arrays did indeed have an effect on intracellular growth of L. pneumophila in U937 macrophages, while none affected intracellular growth in A. castellanii (Fig. 1B).

      4) Typos:

      • l. 852: ... (arbitrarily set to -100).

      • l. 862: ... Legionella-containing vacuole (LCV).

      • l. 895: ... and so we would recommend ...

      Regarding point 1, we thank Reviewer 2 for the suggestion of testing effector mutants in different hosts. While the primary purpose of the current manuscript was to optimize the MuRCiS platform, future studies using this technology to investigate specific biological questions related to Legionella infection would certainly benefit from including more than one amoebaean species.

      Regarding point 2, we agree that the lack of substantial growth defects seems surprising. Yet only two of the seven core effectors (found in all Legionella sp.), lpg2300 and mavN, individually attenuated Legionella intracellular growth when deleted (Burstein 2016 Nat Genetics; Isaac et al., 2015 PNAS). Thus, we hypothesize that the functions many effectors fulfil are of such importance for intracellular survival that that redundancy reaches beyond the boundary of conservation or like-function. We have added a statement emphasizing this at the end of the Figure 1 results section (Line 122-125).

      Regarding points 3 and 4, we thank Reviewer 2 for catching these errors and have corrected where needed in the text.

      -l. 852 (now Line 874): … (arbitrarily set to -100,000) is correct for Figure 6E.

    1. Author Response

      We appreciate the insightful comments from three reviewers on our manuscript. These comments help us improve the clarity of this manuscript. We will revise our manuscript comprehensively in subsequent revision, and enclose a detailed response to each of these comments. In this public reply, we focus on (a) clarifying the theoretical motivation and implication of the present study, and (b) discussing the implications of our LLM study. Besides, we provide a brief justification regarding some methodological concerns shared by the reviewers.

      1) Theoretical rationale and implication

      As we stated in the manuscript, the present study tested whether body size serves as a reference for locomotion and object manipulation, or alternatively, plays a pivotal role in shaping the representation of objects as suggested by Protagoras. Behind this question is the long-lasting debate regarding the representation versus direct perception of affordance.

      One outstanding theme shared by many embodied theories of cognition is the replacement hypothesis (e.g., Van Gelder, 1998). This hypothesis challenges the necessity of representation in the sense of computationalism cognitive theories (e.g., Fodor, 1975), which implies discretizing/categorizing inputs and then subjecting them to certain abstraction or symbolization so as to create discrete stand-ins for the input (e.g., representations/states). In this sense, our theoretical motivation can be restated explicitly as to test the ‘representationalization’ of affordance. That is, we tested whether object affordance would simply covary with its continuous constraints such as object size, in line with the representation-free view, or, whether affordance would be ‘representationalized’, in line with the representation-based view, under the constrain of body size. Such representationalization would generate categorization between the affordable (the objects) and those beyond affordance (the environment).

      Debates regarding the replacement hypothesis often turn into wrestles on the definition of representation (Shapiro, 2019). The present study tried to avoid this pitfall but examined where the embodied and computational theories make opposite hypotheses: discontinuity. Specifically, we considered two computationalism propositions about representation: (a) representations entail discretization of continuous input, and (b) the product of such discretization (representations) is supramodally accessible (that is, transcending sensorimotor processes). These claims are opposite to the prediction based on the idea of direct perception and other representation-free embodied theories.

      Thus, we tested whether, for continuous action-related physical features (such as object size relative to the agents), affordance perception introduces discontinuity and qualitative dissociation, i.e., to allow the sensorimotor input to be assigned into discrete states/kinds, as representations envisioned by computationalists. Alternatively, does the activity directly mirror the input, free from discretization/categorization/abstraction, as proposed by the replacement hypothesis that organisms do not need to re-present the world as they are always in contact with the world in a continuous way?

      All the experiment settings and analyses in the present study were organized around this motivation, following a progressive logic chain.

      First, we tested the discretization hypothesis, that is, whether affordance leads to discontinuity in perception. Here, the discontinuity in affordance perception would be in line with the representation-based view instead of the representation-free proposals. Second, to ensure that the observed discontinuity can be attributed to the discretization of sensorimotor input involved in human-object interaction rather than amodal sources, such as the discrete abstract concepts of the objects (independent from agent motor capability), we tested the embodied nature of this discontinuity through the body imagination experiment. If there is discontinuity in representing embodied information, this discontinuity should be locked to the motor capacity (constrained by the physical constitution such as body size) of the agent, rather than reflecting independent categorization of the absolute size of the objects. Finally, we probed the supramodality of this embodied discontinuity: whether this discontinuity is accessible beyond the sensorimotor domain. To do this, we leveraged the recent advance in AI and tested whether the discretization observed in affordance perception is supramodally accessible to disembodied agents which lack access to sensorimotor input but only have access to the linguistic materials built upon discretized representations, such as large language models (LLM).

      In this way, the experiments in the present study collectively contributed to the debate on the replacement theme of the embodiment of cognition, which serves as one of the three key themes of embodied theories of cognition (Shapiro, 2019). By addressing this theme, we hope to shed light on the nature of representation in, and resulting from, the vision-for-action processing. Our finding regarding discontinuity suggested that sensorimotor input undergoes discretization implied in the computationalism idea of representation. Further, not contradictory to the claims of the embodied theories, these representations do shape processes out of the sensorimotor domain, but after discretization.

      2) Implication in the development of LLM-based agents

      The finding that affordance was representationalized may have profound implications for the development of LLM-based agents. Traditional robots and non-LLM-based agents require implementation-level action instruction, acting as a tool for human beings to achieve desired results. In contrast, LLM-based agents (for a review, see Wang et al., 2023), such as Auto-GPT and BabyAGI, are able to autonomously perform tasks and achieve desired results based on LLMs’ planning ability. In this sense, LLM-based agents show a primary ability to interact on their own with the world. Generative agents, for instance, the agents in Smallville (Park et al., 2023), are a particularly applauded recent advantage in the school of LLM-based agents, which show even larger potentials in this aspect. Drawing on generative models to simulate human behaviors, these agents can formulate their own memories and goals, generate new environment-dependent behaviors, and interact convincingly with humans and other agents and their environments in the course. This brings new possibilities in resolving the long-lasting challenge in artificial general intelligence (AGI) development, that is, to bestow AI with human-level ability in agent-environment interactions. However, it is worth noting that the present investigation in LLM-based agents is still largely confined to virtual environments. This leaves an open question as to how to equip these agents with the ability of agent-environment physical interaction. Especially, according to embodied theories of cognition, sensorimotor interactions with the environment provide unique knowledge upon which various cognitive domains are built. From this point of view, building agents with human-level ability in agent-environment physical interactions might provide an unreplaceable missing piece for AGI.

      By probing the representation of action possibilities (affordances) provided by the environment to the agent (or the absence of them), the present study provided a clue in achieving such ability by illustrating the representationalization of affordance and the supramodality of these representations. For instance, the finding of supramodality may alleviate the doubts about the physical interaction ability of LLM-based agents comparable to biological agents. Specifically, LLM-based agents can leverage the affordance representation distilled into language to interact with the physical world. Indeed, by clarifying and aligning such representation with the physical constitutes of LLM-based agents, and even by explicitly constructing an agent-specific object space, we may facilitate the sensorimotor interactions of LLM-based agents so as to achieve animal-level interaction ability with the world. This in turn may provide new instances for embodied theories.

      3) Clarification on incomplete evidence

      In response to the methodological and validity concerns of the reviewers, we will provide a point-by-point detailed response to reviewers enclosed with the revised manuscript. Here, we reply to the most prominent concerns.

      Reviewers were concerned about the statistical power of both the body imagination experiment and the fMRI experiment. Regarding the number of participants in the imagination study, we would like to clarify that we did not remove 80% of the participants. Actually, a separate sample of participants was recruited in the body imagination experiment. The sample size for the body imagination experiment (100 participants) was indeed smaller than that recruited for the first experiment (528 participants). This is because the first experiment was set for exploratory purposes, and was designed to be over-powered.

      Admittedly, the fMRI experiment recruited a small sample (12 participants), which might lead to low power in estimating the affordance effect. In revision, we will acknowledge this issue explicitly. Having said this, note that the null hypothesis of this fMRI study is the lack of two-way interaction between object size and object-action congruency, which was rejected by the significant interaction. That is, the interpretation of the present study did not rely on accepting any null effect. In addition, the fMRI experiment provided convergent evidence for the affordance discontinuity at the neural level. We showed that behind the behavioral discontinuity in action judgement, neural activity was qualitatively different between objects within the affordance boundary and those beyond, which reinforces our statement that objects were discretized along the continuous size axis into two broad categories.

      Reviewers also commented that more objects and actions should be included. We agree, and in revision, we will advocate future studies with more objects and more actions to comprehensively portray discontinuity. The present set of objects was designated to cover a relatively large range of object sizes, ranging from 14 cm to 7,618 cm to cover most size categories studied in Konkle and Oliva's (2011) work. In addition, the actions were selected to cover daily interactions between human and objects or environments from single-point movements (e.g., hand, foot) to whole-body movements (e.g., lying, standing) referencing the kinetics human action video dataset (Kay et al., 2017). Thus, this set of selected objects and actions is sufficient to test the discontinuity.

      References

      Fodor, J. A. (1975). The Language of Thought (Vol. 5). Harvard University Press.

      Park, J. S., O'Brien, J. C., Cai, C. J., Morris, M. R., Liang, P., & Bernstein, M. S. (2023). Generative agents: Interactive simulacra of human behavior. arXiv preprint arXiv:2304.03442.

      Shapiro, L. (2019). Embodied Cognition. Routledge.

      Van Gelder, T. (1998). The dynamical hypothesis in cognitive science. Behavioral and Brain Sciences, 21(5), 615-628.

      Wang, L., Ma, C., Feng, X., Zhang, Z., Yang, H., Zhang, J., ... & Wen, J. R. (2023). A survey on large language model based autonomous agents. arXiv preprint arXiv:2308.11432.

    1. Author Response

      Reviewer #1 (Public Review):

      I believe it is important for the authors to clarify how the time frames to test for group differences of ERP components were defined. Were the components defined based on a grand average across lesions and controls or based or on the maximum range for both groups? As the paper is written currently this is unclear to me. It is also unclear why the group comparisons between controls and lateral PFC group were based only on the control group. To ensure no inadvertent biases towards the larger control group were introduced and ensure the studies findings were reliable, it would be appreciated if the authors could clarify this.

      We thank the reviewer for the helpful comment. We recognize the need for a clearer definition of time frames for testing group differences in the ERP components and apologize for any ambiguity in the previous version of the manuscript.

      Regarding the time frames to test for group differences of ERP components for the OFC and control groups, they were determined based on the combined maximum range for both groups. The time range for each group and each ERP component was derived from the statistical analysis of the condition contrasts run for each group. For instance, for the Local Deviance MMN, the condition contrast (i.e., Control condition versus Local Deviance condition) for the CTR group revealed a MMN component from 67 to128 ms, while the same condition contrast for the OFC group revealed a MMN from 73 to131 ms. The time frame used for the group comparison on the MMN time window was 50 to 150 ms to capture component activity for both groups. In the same way, for the Local Deviance P3a, the condition contrast (i.e., Control condition versus Local Deviance condition) for the CTR group revealed a P3a component ranging from 141 to 313 ms, while the same condition contrast for the OFC group revealed a P3a from 145 to 344 ms. The time frame used for the group comparison on the P3a time window encompassed 140 to 350 ms to capture component activity for both groups.

      In the “Results” section of the main manuscript, together with the results from the cluster-based permutation independent samples t-tests, we provide the time frames in which the latter were computed for each ERP component. These segments have been highlighted with yellow in the revised manuscript. Moreover, in the section “Materials and methods - Statistical analysis of event-related potentials” of the main manuscript [page 37, paragraph 2], we provide a revised description of how the time frames for group differences of ERPs were defined. The revised description states: “In a second step, to check for differences in the ERPs between the two main study groups, we ran the same cluster-based permutation approach contrasting each of the four conditions of interest between the two groups using independent samples t-tests. The cluster-based permutation independent samples t-tests were computed in the latency range of each component, which was determined based on the maximum range for both groups combined. The latency range for each group and component was based on the time frames derived from the statistical analysis of task condition contrasts.”

      Regarding the comparisons between the lateral PFC and control groups, they were not based solely on the control group condition contrast. This was miswritten. The approach to define time frames to test for ERP differences between the CTR and the lateral PFC group was the same as the one used to test differences between CTR and OFC groups. We apologize for any confusion this may have caused. We have revised the erroneous statements in the Supplementary File 1 [highlighted text, page 9-10].

      An additional potential weakness of the paper, and one that if addressed would increase our confidence that neural differences arise because of the specific lesion effect, is the lack of evidence that the lesion and control groups do not differ on measures that could inadvertently bias the neural data. For example, while the groups did not differ on demographics and a range of broad cognitive functions, were there any differences between the number or distribution of bad/noisy channels in each subject between the two groups? Were there differences in the number of blinks/saccades or distribution of blinks or saccades across the conditions in each subject across the two groups.

      We thank the reviewer for this suggestion. We have completed a number of measurements and tests to ensure that the OFC lesion group and the control group did not differ on measures that could affect the neural data. First, we computed the number of bad/noisy channels for each subject and group, and found that the two groups did not differ significantly. Second, we computed the number of trials remaining after removing the noisy segments across conditions for each subject and group, and found no significant differences between the groups. Third, the number of blinks/saccades across conditions for each subject and group showed no significant group differences. Altogether, the results indicate that the neural differences observed in our study arose because of the specific lesion effect.

      These additional EEG measures and the statistical test results are included in the Supplementary File 1 [page 15-16] and Supplementary File 1g. We have also added text in the section “Materials and methods - EEG acquisition and pre-processing” of the main manuscript [page 35, paragraph 3], which states: “To ensure the validity of the neural data analysis, potential sources of bias were assessed between the healthy control participants and the OFC lesion patients. Specifically, no significant differences were observed between the two groups in terms of the number of noisy channels, the number of noisy trials, or the number of blinks across the task blocks and the experimental conditions.”

      On a similar note, while I appreciate this is a well established task could the authors clarify whether task difficulty is balanced across the different conditions? The authors appear to have used the counting task to ensure equal attention is paid across conditions although presumably the blocks differ in the number of deviant tones and therefore in the task difficulty. Typically, tasks to maintain attention are orthogonal to the main task and equally challenging across the different blocks. Is there a way to reassure readers that this has not affected the neural results?

      Thank you for pointing this out. Indeed, the experimental blocks differ in the number of deviant tones and therefore in the task difficulty. Thus, it is a very good suggestion to look for behavioral performance differences across the different blocks. In the present set of analyses, two block types were used: Regular (xX) and Irregular (xY). In regular blocks, where the repeated sequence is xxxxx, participants were required to count the rare/uncommon sequences, i.e., xxxxy and xxxxo. In irregular blocks, where the repeated sequence is xxxxy, participants were required to count the rare/uncommon sequences, i.e., xxxxx and xxxxo. We have now updated the behavioral analysis. First, by excluding the omission block’s counting performance, and second, by calculating the counting performance separately for the two blocks. The new behavioral analysis revealed that participants from both groups performed better in the irregular block compared to the regular block. However, there was no statistically significant difference between the counting performances of the two groups.

      The new results are reported on page 5 of the main manuscript, section “Results - Behavioral performance”, paragraph 1: “Participants from both groups performed the task properly with an average error rate of 9.54% (SD 8.97) for the healthy control participants (CTR) and 10.55% (SD 6.18) for the OFC lesion patients (OFC). There was no statistically significant difference between the counting performance of the two groups [F(24) = 0.11, P = 0.75]. Participants from both groups performed better in the irregular block (CTR: 8.39 ± 8.24%; OFC: 7.50 ± 7.34%) compared to the regular block (CTR: 10.69 ± 11.36%; OFC: 13.60 ± 10.97%) [F(24) = 3.55, P = 0.07]. There was no block X group interaction effect [F(24) = 0.73, P = 0.40].”

      As with many patient lesion studies, while the comparison directly against the healthy age matched controls is critical it would have strengthened the authors claims if they could show differences between the brain damaged control group. Given the previous literature that also links lateral PFC with prediction error detection, I understand that this region is potentially not the clearest brain damaged control group and therefore another lesion group might have strengthened claims of specificity. Furthermore, the authors do not offer an explanation for why no differences between lateral PFC and control groups were found when others have previously reported them. Identifying those differences would strengthen our understanding of the involvement of different structures in this task/function.

      We thank the reviewer for raising this crucial issue. We recognize the importance of addressing the lack of neurophysiological differences between the lateral PFC lesion group and the control group. First, it is important to clarify that the lateral PFC lesion control group was initially included not as a control for specific lateral PFC lesions but rather a broader control group to account for potentially general effects of frontal brain damage. However, considering that previous studies have implicated specific areas of the lateral PFC (e.g., inferior frontal gyrus; IFG) in predictive processing, we also think that a more thorough justification of these null findings is needed.

      Intracranial EEG studies examining local and global level prediction error detection pointed to the role of inferior frontal gyrus (IFG) as a frontal source supporting top-down predictions in MMN generation (Dürschmid et al., 2016; Nourski et al., 2018; Phillips et al., 2016; Rosburg et al., 2005). However, other intracranial studies reported unclear (Bekinschtein et al., 2009) or weak (Dürschmid et al., 2016) frontal MMN effects. El Karoui et al. (2015) observed late ERP responses in the lateral PFC related to global deviants but no MMN to local deviants, and it was not clear where in the PFC these responses occurred, not showing responses in the IFG. Additionally, studies employing dynamic causal modeling of MMN consistently modeled frontal sources in the IFG region (Garrido et al., 2008; Garrido et al., 2009; Phillips et al., 2015). A review by Deouell (2007) highlighted the potential contributions of both IFG and middle frontal gyrus to MMN generation, suggesting that the specific source might vary depending on characteristics of the deviant stimuli, such as pitch or duration.

      In Alho et al. (1994) lesion study, diminished MMN to local-level deviants was found after lesion to the lateral PFC, with the lesion cohort exhibiting a hemisphere ratio of 7/3 for left and right hemispheres, respectively, which is different from our cohort's ratio of 4/6. Furthermore, all individuals in that study had infarcts in the middle cerebral artery, resulting in a more uniform lesion location compared to our cohort. Notably, the lesions observed in our lateral PFC group appeared to be situated in more superior brain regions and towards the MFG compared to the predominantly reported involvement of the IFG in previous studies. Another factor that might contribute to the lack of significant effects is the heterogeneity of the lesions in our lateral PFC group (see Supplementary Figures 2, 3 and 4). Especially for the left hemisphere cohort, the individual lesions did not share a consistent anatomical location. The right hemisphere cohort had a greater lesion overlap, but overall, the lesions were not centered in the IFG area with highest overlap being in the MFG area. This distinction in lesion location might contribute to the absence of effects observed in our study.

      Regarding the global effect, often reflected in the P300 component, it appears that the neural sources responsible for processing global deviance exhibit a more distributed pattern. This means that the brain regions involved in detecting and processing global deviations may not be as localized or concentrated as those implicated in local deviance processing. Given that the neural mechanisms underlying global deviance detection and processing are likely to involve a wider network of brain regions, they may be less susceptible to disruptions caused by focal lesions in the lateral PFC.

      In response to your comment, we have expanded the “Discussion” to address this point by adding a new section titled “Lack of findings in the lateral PFC lesion group” [page 21]. In this section, we first present some of the findings implicating specific areas of the lateral PFC in the generation of MMN and in predictive processing, and then offer an account of the potential reasons behind the lack of neurophysiological differences between the lateral PFC and control groups.

      Finally, while the authors have already cited widely across multiple fields, again speaking to the likely large impact the study will make, there does appear to be an unexplored conceptual link between the conclusions here that the OFC supports "the formation of predictions that define the current task by using context and temporal structure to allow old rules to be disregarded so that new ones can be rapidly acquired" and that lesions of the lateral portions of the OFC disrupt the assignment of credit or value to a stimuli that occurred temporally close to the outcome (Walton et al 2010, Noonan et al 2010, PNAS, Rudebeck et al 2017 Neuron, Noonan et al 2017, JON, Wittmann et al 2023 PlosB, note the wider imaging literature in line with this work Jocham et al 2014 Neuron and Wang et al bioRxiv). Without the OFC monkeys and humans appear to rely on an alternative, global learning mechanism that spreads the reinforcing properties of the outcome to stimuli that occurred further back in time. Could the authors speculate on how these two strains of evidence might converge? For example, does the OFC only assign credit in the event of a prediction error or does one mechanism subsume another?

      We thank the reviewer for this comment regarding the unexplored conceptual link between our study’s conclusion, which suggests that the OFC facilitates the detection of prediction errors, and the findings of other research that delves into the OFC’s role in assignment of credit to stimuli. We find this comment very interesting and appreciate the opportunity to speculate on the potential functional convergence of these two processes within the OFC.

      The OFC is a critical neural hub implicated in learning, decision-making, and adaptive behavior. The detection of prediction errors and the assignment of credit to stimuli are mechanisms linked with the OFC, which play an important role in all these functions (Noonan et al., 2012; Schultz & Dickinson, 2000; Sul et al., 2010; Tobler et al., 2006; Walton et al., 2010; Walton et al., 2011). Prediction errors involve recognizing discrepancies between expected and actual outcomes, which engages the OFC in rapidly updating stimulus valuations to align with newfound information (Holroyd & Coles, 2002; Kakade & Dayan, 2002). Signaling of errors provides a powerful mechanism whereby OFC facilitates adaptive learning and enables the brain to adjust its expectations based on novel experiences (Schultz, 2015; Seymour et al., 2004). Credit assignment, on the other hand, refers to properly identifying the causes of prediction errors. Without proper credit assignment, one might have intact error signaling mechanisms, but lose the ability to learn appropriately. This is especially true when multiple possible antecedents may be related to the error or when past choices have been unpredictable. In such situations, it is important to assign credit to the most recent choice and not get distracted by previous alternatives (Stalnaker et al., 2015).

      These mechanisms within the OFC appear interrelated yet distinct. While prediction errors could trigger credit assignment, the OFC's ability to continually assess stimuli's values extends beyond instances of prediction errors. The OFC is involved in continuously evaluating and updating the values of stimuli based on ongoing experiences (Padoa-Schioppa & Assad, 2006; Tremblay & Schultz, 1999). This process enables the brain to learn from both unexpected outcomes and regular, predictable interactions with the environment. In situations where outcomes are not solely determined by prediction errors, the assignment of credit remains important. Complex decision-making involves considering a variety of factors beyond just prediction errors, such as contextual information and long-term consequences. Clarifying the convergence of these mechanisms within the OFC holds profound implications for understanding the intricacies of learning dynamics and the orchestration of adaptive responses to the environment.

      While we recognize the value of this discussion, we believe it extends beyond the primary focus of our study. Consequently, we have made the decision not to incorporate it into the current manuscript.

      One remaining weakness, which plagues all patient studies, is that of anatomical specificity. The authors have analysed what is, for the field, a large group of patients, and while the lesions appear to be relatively focused on the OFC the individuals vary in the degree to which different subregions within the OFC are damaged. This is increasingly important as evidence over the last 10 years has identified functional roles of these specific structures (Rushworth et al 2011, Neuron, Rudebeck et al 2017 Neuron). It would be important to ultimately know whether the detection of prediction errors was specific to a particular OFC subregion, a general mechanism across this area of cortex, or whether different subregions were more involved during different contexts or types of stimuli/contexts/tasks etc. Some comments on this would be appreciated.

      The reviewer raised an important point here. It would have been interesting to explore this aspect. However, one challenge with focal lesion studies is to establish large patient cohorts. The group size of our study, which is relatively large compared to other studies of focal PFC lesions, does not allow us to perform any exploratory lesion-symptom mapping analyses. A larger patient sample will provide a stronger basis for drawing conclusions about the critical role of a particular OFC subregion to the detection of prediction errors and allow statistical approaches to lesion subclassification and brain-behavior analysis (e.g., voxel-based lesion-symptom mapping (Bates et al., 2003; Lorca-Puls et al., 2018)).

      Considering the average percentage of damaged tissue in our study, the medial part of OFC or Brodmann area 11 is affected more by the lesion (approx. 33%), followed by the anterior-most region of the prefrontal cortex or Brodmann area 10 (approx. 25%), and the lateral portions of the OFC or Brodmann area 47 (approx. 12%). From our analysis, it is difficult to conclude whether the detection of prediction errors in our study was specific to a certain OFC area, or whether different subregions were involved more than others during different types of stimuli/contexts processing.

      To provide a more balanced interpretation of our findings, we incorporated a section in the “Discussion”, titled “Limitations and future directions” [page 24-25], which delves into the limitations of our study and lesion studies generally with respect to anatomical specificity and the challenge to establish large patient cohorts.

      Reviewer #2 (Public Review):

      The current version of the manuscript is overall very long and verbose, for example, the introduction is 5 pages long and includes up to 102 references. In my view this is way too much. I suppose authors wish to be very detailed, but somehow they get an opposite effect, the main message of the introduction and aims get diluted.

      We thank the reviewer for the feedback on our manuscript's length and content. This prompted us to carefully reconsider the balance between providing necessary context and ensuring the clarity of our main message. Our intention was to establish a strong foundation for our research by presenting relevant literature and setting the stage for our aims. In our revised manuscript, we have condensed the Introduction while retaining the key elements necessary to understand the context and motivations behind our research. Specifically, the current version of the “Introduction” is three pages long and includes 83 references.

      I wonder if the presentation rate used, SOA; 150 is too fast and the stimuli too short 50 ms. Please prove a rationale for this.

      We appreciate the reviewer's thoughtful consideration of the stimulus duration and presentation rate (SOA) used in our study. We understand the importance of providing a rationale for our choices to ensure the validity of our experimental design. The decision to use a SOA of 150 ms and stimuli of 50 ms duration was grounded in established practices and relevant literature in the field. Similar presentation rates and stimulus durations were employed in previous studies using similar auditory oddball paradigms, investigating rapid cognitive processes in combination with event-related potentials (ERPs). For instance, Bekinschtein et al. (2009) first introduced the task by using a SOA of 150 ms and stimulus duration of 50 ms, demonstrating that this combination is sensitive to detecting auditory deviations and eliciting early and late ERP components. Additionally, Wacongne et al. (2011), Chennu et al. (2013), Uhrig et al. (2014), and El Karoui et al. (2015) employed similar task designs with the same SOA and stimulus duration in combination with scalp EEG, fMRI and intracranial recordings, further supporting the validity of this approach. Other studies, employing the same paradigm, such as Chao et al. (2018) and Doricchi et al. (2021), used a SOA of 200 ms but kept the same stimulus duration of 50 ms.

      One of the conditions is 'omissions', but results are not reported, so either authors do not mention this at all, or they report these data, which would be probably interesting.

      We thank the reviewer for the nice reminder. The “omissions” condition is indeed an integral part of our study, and we acknowledge its potential significance. However, we have decided to publish the detailed analysis of the 'omissions' condition in a separate paper, because we think that such analysis and discussion would make the current paper quite dense and complicated. We apologize for any confusion that might arise from the absence of the 'omissions' results in this manuscript. On page 33 of the main manuscript, we state the reason for not including the “omissions” condition in the current analysis: “In the present set of analyses, the Omission blocks were not further examined, because such analysis and discussion would make the current paper overly dense and complicated.”

      The Discussion is very long and in some aspect even too speculative. For example, in the conclusions authors claim that the OFC contributes to a top-down predictive process that modulates the deviance detection system in the primary auditory cortices and may be involved in connecting PEs at lower hierarchical areas with predictions at higher areas. I am not sure the current data support this. This would-be probably more appropriate if they could compare results from OFC and AC etc. so it is a more dynamic study.

      We thank the reviewer for this observation. We have made revisions to shorten and refine the discussion, with a primary focus on presenting and interpreting the key results in a more concise and straightforward manner (See tracked changes in the revised manuscript).

      However, the overall length of the Discussion has not been reduced significantly because we have introduced two additional sections within the Discussion (i.e., “Lack of findings in the lateral PFC lesion group” and “Limitations and future directions”) in response to reviewers’ request to address the lack of finding in the lateral PFC lesion group and certain limitations associated with the employed lesion method.

      We also agree that the claim mentioned by the reviewer is overly too speculative and therefore revised the sentence as follows [page 38, “Conclusion”]: “We suggest that the OFC likely contributes to a top-down predictive process that modulates the deviance detection system in lower sensory areas.”

      At the beginning of Discussion, the authors mention that overall, these findings provide novel information about the role of the OFC in detecting violation of auditory prediction at two levels of stimuli abstraction/time scale. I think this needs to be detailed more specifically rather than mention they provide novel results.

      We understand the importance of providing readers with precise descriptions about the novelty of our study. Therefore, we have revised the statement to provide more detailed information about the novel contributions offered by our study. The revised text states as follows [“Discussion”, page 18,]: “These findings indicate that the OFC is causally involved in the detection of local and local + global auditory PEs, thus providing a novel perspective on the role of OFC in predictive processing.”

      I am not sure I like to have a section as a general discussion within the discussion itself, probably this heading should be reformatted to be more specific to what is discussed.

      As suggested by the reviewer, we reformatted the heading to “OFC and hierarchical predictive processing” [page 22-24] to better capture the essence of the content covered in this section of the “Discussion”. Here, we discuss the functional relevance of our EEG findings under the umbrella of the predictive coding framework and the potential role of OFC in predictive processes (See tracked changes in the revised manuscript).

      Reviewer #3 (Public Review):

      The central claim of the study is that hierarchical predictive processing is altered in OFC patients. However, OFC patients were able to identify global deviants as well as controls. Thus, hierarchical predictive processing itself seems to be unaltered, even though its neural correlates were different. This begs the question of what exactly the functional meaning of the EEG findings is. From the evidence presented this is difficult to determine for three reasons (See comments below).

      We thank the reviewer for the detailed observations and valuable comments. The reviewer points out that hierarchical predictive processing is unaltered even though the neural correlates were altered, because OFC patients were able to identify global deviants as accurately as control participants. We respectfully disagree with the reviewer’s claim for two reasons: 1) The primary purpose of the behavioral data in this study was not to measure the participants’ deviant detection performance, but to confirm that they were paying attention to the global rule of each block. However, we agree that an effect of lesion on behavioral performance would strengthen the claim of altered high-level predictive processing. Your point highlights the importance of looking more carefully at our behavioral results. In a follow up study, which we are currently running, we explore the behavioral nuances of our task by measuring reaction times of correct deviant detections. 2) Earlier lesion studies reported typical performance on simple oddball tasks for patients with focal frontal lesions that did not significantly differ from control participants. However, despite normal task execution and neuropsychological profiles, patients with LPFC and OFC lesions present distinct neurophysiological evidence of alterations in novelty processing (Knight, 1984, 1997; Knight & Scabini, 1998; Løvstad et al., 2012; Yamaguchi & Knight, 1991).

      Regarding the central claim of our study being that hierarchical predictive processing is altered in OFC patients, we have tried not to make strong claims about our results showing altered hierarchical predictive processing. For example, the conclusion of the abstract states: “the altered magnitudes and time courses of MMN/P3a responses after lesions to the OFC indicate that the neural correlates of detection of auditory regularity violation is impacted at two hierarchical levels of stimuli abstraction.” Thus, we do not claim that detection of regularity violation is directly impaired (e.g., OFC patients were able to identify global deviants as well as healthy controls) but that the neural correlates of deviants’ detection are altered, and therefore impaired.

      Finally, we have gone through all the comments/reasons, which the reviewer believes are difficult to determine the functional meaning of our EEG findings, and addressed them one by one (see comments below). We hope that the revised manuscript has been improved accordingly and provides a more critical view on the extent to which the findings support hierarchical predictive coding.

      It is possible that the shifts in scalp potentials are due to volume conduction differences linked to post-lesion changes in neural tissue and anatomy rather than differences in information processing per se.

      We appreciate your comment regarding the potential influence of volume conduction differences on the observed shifts in scalp potentials in our study. We acknowledge that there are special challenges in interpreting ERP findings in brain lesion populations (Kutas et al., 2012; Rugg, 1995). To reliably interpret changes in the ERPs in lesion patients as reflecting impairments in certain cognitive processes, it is necessary to identify factors that might possibly affect the results and to apply the appropriate control measures. As noted by the reviewer, structural pathology, and the replacement of neural tissue by cerebrospinal fluid following tumor resection, likely causes inhomogeneities in the volume conduction of electrical activity and resulting changes in current flow patterns. Moreover, post-craniotomy skull defects can cause local inhomogeneities in the resistive properties of the skull (Løvstad & Cawley, 2011; Rugg, 1995). Both types of biophysical changes might alter the amplitude levels and/or topography (by altering the configuration of the generators) of surface-recorded ERPs (e.g., Swick (2005)). Consequently, caution is warranted when comparing the ERPs and their scalp distributions of intact and brain-lesioned groups. It is difficult to directly quantify the consequences of brain lesions on tissue conductivity. To conclude that ERP differences between patients and controls reflect functional abnormalities in particular cognitive processes, and not primarily nonspecific effects of structural brain damage, it is helpful to demonstrate that they are specific to certain ERP components/stages of information processing and task conditions. Changes confined to one or a subset of ERP components, that additionally may not manifest across all task conditions, can give some indication concerning the specificity of ERP changes (Kutas et al., 2012; Swaab, 1998). In our study, group differences pertaining to ERP amplitudes were limited to specific task conditions and not across all data. This condition-dependent pattern suggests that the observed shifts are related to the specific cognitive processes engaged during those task conditions rather than being a global artifact of volume conduction. If volume conduction was the main driver, we would expect these group differences to be more uniformly present across task conditions. Another piece of evidence against volume conduction effects is the scalp potentials’ latency differences between the two groups observed for the Local + Global deviance detection. Group differences in the latencies of ERPs, such as the MMN and P3a, cannot be attributed to volume conduction alone (Hämäläinen et al., 1993). These differences in the timing of neural responses strongly indicate genuine variations in cognitive processing.

      To provide a more balanced interpretation of our findings, we have incorporated a section in the “Discussion” that delves into the limitations of our study and lesion studies generally with respect to volume conduction and amplitude changes, titled “Limitations and future directions” [page 24-25].

      It is unclear from the analyses whether the P3a amplitude differences are true amplitude differences or a byproduct of latency differences. The reason is that the statistical method used (cluster based permutations) might yield significant effects when the latency of a component is shifted, even if peak amplitudes are the same. Complementary analyses on mean or peak amplitudes could resolve this issue.

      We thank the reviewer for raising an important concern about the use of cluster-based permutation tests and their potential to yield significant effects when the latency of a component is shifted. We acknowledge this concern and recognize the need for complementary analyses to address this issue. To provide a clearer understanding of the nature of the observed ERP amplitude differences, we conducted complementary analyses on mean amplitudes of the MMN and P3a components on the midline sensors for the conditions where significant group differences were observed. For the MMN component elicited by the Local Deviance, we found group amplitude differences on the electrodes AFz (p = 0.021), Fz (p = 0.008), CPz (p = 0.015), and Pz (p < 0.001). Surprisingly, we also found amplitude differences for the P3a component elicited by the Local Deviance on the electrodes AFz (p < 0.001), Fz (p < 0.001), FCz (p < 0.001), and Cz (p = 0.002) that were not observed previously with the cluster-based permutation analysis. For the MMN component elicited by the Local+Global Deviance, our analysis showed group amplitude differences on the electrodes AFz (p = 0.007), FCz (p = 0.051), Cz (p = 0.004), CPz (p = 0.002), and Pz (p < 0.001). However, as the reviewer rightly pointed out, the group differences for the P3a elicited by the Local + Global Deviance seem to be a byproduct of latency differences, as we did not find amplitude differences on any of the midline electrodes. Overall, this complementary analysis shows that the OFC patients had an attenuated MMN/P3a to local level prediction violation, and an attenuated and delayed MMN followed by a delayed P3a to the combined local and global level prediction violation. The new analysis is added in the Supplementary File 1 [page 5-7] and Supplementary File 1c and 1d.

      The MMN, P3a and P3b components are difficult to map to the hierarchical PC theory. Traditionally, the MMN is ascribed to lower level processing while P3a and P3b are ascribed to higher level processing. However, the picture is more complicated. For example, the current results show that the MMN is enhanced in local + global surprise while the P3a is elicited by local surprise. Furthermore, the P3a is classically interpreted as reflecting attention reorientation and the P3b as reflecting the conscious detection of task-relevant targets. How attention and conscious awareness fit in hierarchical PC is not entirely clear.

      Indeed, the relationships between MMN, P3a and P3b components and the predictive coding (PC) framework can be intricate. However, numerous studies employed the PC theory to interpret these common electrophysiological signatures as prediction error (PE) signals (Garrido et al., 2007, 2009; Lieder et al., 2013) and dissociations between these ERPs supported that there are successive levels of predictive processing (Chennu et al., 2013; El Karoui et al., 2015; Wacongne et al., 2011).

      In terms of hierarchical PC (Friston, 2005), the temporally constrained MMN has been traditionally linked with first-level predictive processing, known as the local effect of short-term stimulus deviance. PE signals at this level feed forward to a temporally extended, attention-dependent system that extracts longer-term patterns. PE signals at the higher level are usually indexed by the P300, identified as the global effect of longer-term stimulus deviance. The P300 reflects a more attention-driven process, emerging in response to novel or low-probability “target” stimuli that violate broader contextual expectations (Polich, 2007), such as those that form over multiple trials. Because the MMN, P3a and P3b also appear to exhibit varying degrees of sensitivity to preconscious and conscious perceptual predictions (Sculthorpe et al., 2009), they could serve as measures for examining the concept of a predictive neural hierarchy.

      Indeed, the MMN has been viewed as sensitive to local violation and essentially blind to higher-order regularities. However, this is a simplified view. For example, Wacongne et al. (2011) showed that violating a low-level perceptual expectation triggers the MMN, violating contextual expectations triggers the higher-level P3, and when both expectations are simultaneously violated, a larger response is evoked compared to either one alone. These findings, which are consistent with the results of our study, show that the local and global effects are not fully independent but interact in an early time window, indexed by enhanced and temporally extended MMN responses. They provide support not just for a hierarchical model, but for a predictive rather than a feedforward one. Moreover, the MMN has been found to be relatively insensitive to attention, because it is elicited in situations in which the subjects’ attention is directed away from the stimuli and there are no task demands (Chennu et al., 2013). Given that early MMN is a pre-attentive automatic ERP component (Näätänen et al., 2001; Pegado et al., 2010; Tiitinen et al., 1994), and given that it has been observed in comatose and vegetative state patients (Bekinschtein et al., 2009; Fischer et al., 2004; Naccache et al., 2004), the finding that even early MMN is impaired in OFC patients indicate that patients may suffer from a deficit in sensory predictive processing that is independent of attention and conscious awareness.

      The picture is more complicated when it comes to the predictive roles of P3a and P3b components. Following the MMN, a positive polarity P300 complex, sensitive to the detection of unpredicted auditory events, has been reported (Chennu et al., 2013; Doricchi et al., 2021; Kompus et al., 2020; Liaukovich et al., 2022). However, the two types of P300 (P3a and P3b) have not been clearly fitted into the hierarchical PC theory. The P3a is considered to be part of the brain's mechanism for detecting PEs (Wessel et al., 2012; Wessel et al., 2014) and may indicate that the brain is reallocating attentional resources to process and learn from these unexpected events. The P3a is typically interpreted as reflecting an involuntary attentional reorienting process (Escera & Corral, 2007; Ungan et al., 2019), which may relate to the operations of the ventral attention network (Corbetta et al., 2008; Corbetta & Shulman, 2002; Nieuwenhuis et al., 2005). Predictive coding emphasizes the role of contextual information in generating predictions with P3a being influenced by the context in which an unexpected event occurs (Schomaker et al., 2014). In the hierarchy of predictive processing, the P3a may reflect PEs at different hierarchical levels, depending on the complexity of the prediction and the degree to which it deviates from the sensory input. On the other hand, the P3b is linked to higher-level cognitive processes that involve updating long-term predictions based on incoming sensory information. It is highly dependent on attention, conscious awareness and active engagement with the task (Bekinschtein et al., 2009; Del Cul et al., 2007; Sergent et al., 2005; Strauss et al., 2015). It is thought to play a role in integrating the unexpected sensory input into the current context, potentially leading to updates of predictions in working memory (Chao et al., 1995; Donchin & Coles, 1988; Polich, 2007).

      Hierarchical PC theory is continually evolving, and the relationship between these ERP components and attention or conscious awareness remains an active area of research. We acknowledge the need for further investigation to better understand how attention and conscious awareness fit within this framework. In light of your comment, we provide a more comprehensive discussion about the functional meaning of the EEG findings in our “Discussion - OFC and hierarchical predictive processing” [page 22-24].

      The fact that lateral PFC patients show unaltered neural responses contradicts prominent views from PC identifying this region as a generator of the MMN and a source of predictions sent to temporal auditory areas.

      We appreciate the reviewer's comment and want to acknowledge that another reviewer raised this concern previously. We have provided a detailed response to this issue in our previous response (see Response to Reviewer #1 Comment 4). We have expanded the “Discussion” to address this point by adding a new section titled “Lack of findings in the lateral PFC lesion group” [page 21]. In this section, we first present some of the findings implicating specific areas of the lateral PFC in the generation of MMN and in predictive processing, and then offer an account of the potential reasons behind the lack of neurophysiological differences between the lateral PFC and control groups.

      For these reasons, a more critical view on the extent to which the findings support hierarchical predictive coding is needed.

      By responding to the reviewer’s previous comments (i.e., the reasons why the reviewer thinks it is difficult to determine the functional meaning of the EEG findings), we believe that we have offered a more critical view on this matter.

      References

      Alho, K., Woods, D. L., Algazi, A., Knight, R., & Näätänen, R. (1994). Lesions of frontal cortex diminish the auditory mismatch negativity. Electroencephalography and clinical neurophysiology, 91(5), 353-362.

      Bates, E., Wilson, S. M., Saygin, A. P., Dick, F., Sereno, M. I., Knight, R. T., & Dronkers, N. F. (2003). Voxel-based lesion–symptom mapping. Nature neuroscience, 6(5), 448-450.

      Bekinschtein, T. A., Dehaene, S., Rohaut, B., Tadel, F., Cohen, L., & Naccache, L. (2009). Neural signature of the conscious processing of auditory regularities. Proceedings of the National Academy of Sciences, 106(5), 1672-1677.

      Chao, L., Nielsen-Bohlman, L., & Knight, R. (1995). Auditory event-related potentials dissociate early and late memory processes. Electroencephalography and Clinical Neurophysiology/Evoked Potentials Section, 96(2), 157-168.

      Chao, Z. C., Takaura, K., Wang, L., Fujii, N., & Dehaene, S. (2018). Large-scale cortical networks for hierarchical prediction and prediction error in the primate brain. Neuron, 100(5), 1252-1266. e1253.

      Chennu, S., Noreika, V., Gueorguiev, D., Blenkmann, A., Kochen, S., Ibánez, A., Owen, A. M., & Bekinschtein, T. A. (2013). Expectation and attention in hierarchical auditory prediction. Journal of Neuroscience, 33(27), 11194-11205.

      Corbetta, M., Patel, G., & Shulman, G. L. (2008). The reorienting system of the human brain: from environment to theory of mind. Neuron, 58(3), 306-324.

      Corbetta, M., & Shulman, G. L. (2002). Control of goal-directed and stimulus-driven attention in the brain. Nature reviews neuroscience, 3(3), 201-215.

      Del Cul, A., Baillet, S., & Dehaene, S. (2007). Brain dynamics underlying the nonlinear threshold for access to consciousness. PLoS biology, 5(10), e260.

      Deouell, L. Y. (2007). The frontal generator of the mismatch negativity revisited. Journal of Psychophysiology, 21(3-4), 188-203.

      Donchin, E., & Coles, M. G. (1988). Is the P300 component a manifestation of context updating? Behavioral and brain sciences, 11(3), 357-374.

      Doricchi, F., Pinto, M., Pellegrino, M., Marson, F., Aiello, M., Campana, S., Tomaiuolo, F., & Lasaponara, S. (2021). Deficits of hierarchical predictive coding in left spatial neglect. Brain communications, 3(2), fcab111.

      Dürschmid, S., Edwards, E., Reichert, C., Dewar, C., Hinrichs, H., Heinze, H.-J., Kirsch, H. E., Dalal, S. S., Deouell, L. Y., & Knight, R. T. (2016). Hierarchy of prediction errors for auditory events in human temporal and frontal cortex. Proceedings of the National Academy of Sciences, 113(24), 6755-6760.

      El Karoui, I., King, J.-R., Sitt, J., Meyniel, F., Van Gaal, S., Hasboun, D., Adam, C., Navarro, V., Baulac, M., & Dehaene, S. (2015). Event-related potential, time-frequency, and functional connectivity facets of local and global auditory novelty processing: an intracranial study in humans. Cerebral cortex, 25(11), 4203-4212.

      Escera, C., & Corral, M. (2007). Role of mismatch negativity and novelty-P3 in involuntary auditory attention. Journal of psychophysiology, 21(3-4), 251-264.

      Fischer, C., Luauté, J., Adeleine, P., & Morlet, D. (2004). Predictive value of sensory and cognitive evoked potentials for awakening from coma. Neurology, 63(4), 669-673.

      Friston, K. (2005). A theory of cortical responses. Philosophical transactions of the Royal Society B: Biological sciences, 360(1456), 815-836.

      Garrido, M. I., Friston, K. J., Kiebel, S. J., Stephan, K. E., Baldeweg, T., & Kilner, J. M. (2008). The functional anatomy of the MMN: a DCM study of the roving paradigm. Neuroimage, 42(2), 936-944.

      Garrido, M. I., Kilner, J. M., Kiebel, S. J., & Friston, K. J. (2007). Evoked brain responses are generated by feedback loops. Proceedings of the National Academy of Sciences, 104(52), 20961-20966.

      Garrido, M. I., Kilner, J. M., Kiebel, S. J., & Friston, K. J. (2009). Dynamic causal modeling of the response to frequency deviants. Journal of Neurophysiology, 101(5), 2620-2631.

      Holroyd, C. B., & Coles, M. G. (2002). The neural basis of human error processing: reinforcement learning, dopamine, and the error-related negativity. Psychological review, 109(4), 679.

      Hämäläinen, M., Hari, R., Ilmoniemi, R. J., Knuutila, J., & Lounasmaa, O. V. (1993). Magnetoencephalography—theory, instrumentation, and applications to noninvasive studies of the working human brain. Reviews of modern Physics, 65(2), 413.

      Kakade, S., & Dayan, P. (2002). Dopamine: generalization and bonuses. Neural Networks, 15(4-6), 549-559.

      Knight, R. T. (1984). Decreased response to novel stimuli after prefrontal lesions in man. Electroencephalography and Clinical Neurophysiology/Evoked Potentials Section, 59(1), 9-20.

      Knight, R. T. (1997). Distributed cortical network for visual attention. Journal of Cognitive Neuroscience, 9(1), 75-91.

      Knight, R. T., & Scabini, D. (1998). Anatomic bases of event-related potentials and their relationship to novelty detection in humans. Journal of clinical neurophysiology, 15(1), 3-13.

      Kompus, K., Volehaugen, V., Todd, J., & Westerhausen, R. (2020). Hierarchical modulation of auditory prediction error signaling is independent of attention. Cognitive neuroscience, 11(3), 132-142.

      Kutas, M., Kiang, M., & Sweeney, K. (2012). Potentials and Paradigms: Event‐Related Brain Potentials and Neuropsychology. The handbook of the neuropsychology of language, 1, 543-564.

      Liaukovich, K., Ukraintseva, Y., & Martynova, O. (2022). Implicit auditory perception of local and global irregularities in passive listening condition. Neuropsychologia, 165, 108129.

      Lieder, F., Daunizeau, J., Garrido, M. I., Friston, K. J., & Stephan, K. E. (2013). Modelling trial-by-trial changes in the mismatch negativity. PLoS computational biology, 9(2), e1002911.

      Lorca-Puls, D. L., Gajardo-Vidal, A., White, J., Seghier, M. L., Leff, A. P., Green, D. W., Crinion, J. T., Ludersdorfer, P., Hope, T. M., & Bowman, H. (2018). The impact of sample size on the reproducibility of voxel-based lesion-deficit mappings. Neuropsychologia, 115, 101-111.

      Løvstad, A., & Cawley, P. (2011). The reflection of the fundamental torsional guided wave from multiple circular holes in pipes. Ndt & E International, 44(7), 553-562.

      Løvstad, M., Funderud, I., Lindgren, M., Endestad, T., Due-Tønnessen, P., Meling, T., Voytek, B., Knight, R. T., & Solbakk, A.-K. (2012). Contribution of subregions of human frontal cortex to novelty processing. Journal of Cognitive Neuroscience, 24(2), 378-395.

      Naccache, L., Puybasset, L., Gaillard, R., Serve, E., & Willer, J.-C. (2004). Auditory mismatch negativity is a good predictor of awakening in comatose patients: a fast and reliable procedure. Clinical neurophysiology: official journal of the International Federation of Clinical Neurophysiology, 116(4), 988-989.

      Nieuwenhuis, S., Aston-Jones, G., & Cohen, J. D. (2005). Decision making, the P3, and the locus coeruleus--norepinephrine system. Psychological bulletin, 131(4), 510.

      Noonan, M., Kolling, N., Walton, M., & Rushworth, M. (2012). Re‐evaluating the role of the orbitofrontal cortex in reward and reinforcement. European Journal of Neuroscience, 35(7), 997-1010.

      Nourski, K. V., Steinschneider, M., Rhone, A. E., Kawasaki, H., Howard III, M. A., & Banks, M. I. (2018). Processing of auditory novelty across the cortical hierarchy: An intracranial electrophysiology study. Neuroimage, 183, 412-424.

      Näätänen, R., Pakarinen, S., Rinne, T., & Takegata, R. (2004). The mismatch negativity (MMN): towards the optimal paradigm. Clinical neurophysiology, 115(1), 140-144.

      Näätänen, R., Tervaniemi, M., Sussman, E., Paavilainen, P., & Winkler, I. (2001). ‘Primitive intelligence’in the auditory cortex. Trends in neurosciences, 24(5), 283-288.

      Padoa-Schioppa, C., & Assad, J. A. (2006). Neurons in the orbitofrontal cortex encode economic value. Nature, 441(7090), 223-226.

      Pegado, F., Bekinschtein, T., Chausson, N., Dehaene, S., Cohen, L., & Naccache, L. (2010). Probing the lifetimes of auditory novelty detection processes. Neuropsychologia, 48(10), 3145-3154.

      Phillips, H. N., Blenkmann, A., Hughes, L. E., Bekinschtein, T. A., & Rowe, J. B. (2015). Hierarchical organization of frontotemporal networks for the prediction of stimuli across multiple dimensions. Journal of Neuroscience, 35(25), 9255-9264.

      Phillips, H. N., Blenkmann, A., Hughes, L. E., Kochen, S., Bekinschtein, T. A., & Rowe, J. B. (2016). Convergent evidence for hierarchical prediction networks from human electrocorticography and magnetoencephalography. cortex, 82, 192-205.

      Polich, J. (2007). Updating P300: an integrative theory of P3a and P3b. Clinical neurophysiology, 118(10), 2128-2148.

      Rosburg, T., Trautner, P., Dietl, T., Korzyukov, O. A., Boutros, N. N., Schaller, C., Elger, C. E., & Kurthen, M. (2005). Subdural recordings of the mismatch negativity (MMN) in patients with focal epilepsy. Brain, 128(4), 819-828.

      Rugg, M. D. (1995). Event-related potential studies of human memory. Schomaker, J., Roos, R., & Meeter, M. (2014). Expecting the unexpected: The effects of deviance on novelty processing. Behavioral neuroscience, 128(2), 146.

      Schultz, W. (2015). Neuronal reward and decision signals: from theories to data. Physiological reviews, 95(3), 853-951.

      Schultz, W., & Dickinson, A. (2000). Neuronal coding of prediction errors. Annual review of neuroscience, 23(1), 473-500.

      Sculthorpe, L. D., Stelmack, R. M., & Campbell, K. B. (2009). Mental ability and the effect of pattern violation discrimination on P300 and mismatch negativity. Intelligence, 37(4), 405-411.

      Sergent, C., Baillet, S., & Dehaene, S. (2005). Timing of the brain events underlying access to consciousness during the attentional blink. Nature neuroscience, 8(10), 1391-1400.

      Seymour, B., O'Doherty, J. P., Dayan, P., Koltzenburg, M., Jones, A. K., Dolan, R. J., Friston, K. J., & Frackowiak, R. S. (2004). Temporal difference models describe higher-order learning in humans. Nature, 429(6992), 664-667.

      Stalnaker, T. A., Cooch, N. K., & Schoenbaum, G. (2015). What the orbitofrontal cortex does not do. Nature neuroscience, 18(5), 620-627.

      Strauss, M., Sitt, J. D., King, J.-R., Elbaz, M., Azizi, L., Buiatti, M., Naccache, L., Van Wassenhove, V., & Dehaene, S. (2015). Disruption of hierarchical predictive coding during sleep. Proceedings of the National Academy of Sciences, 112(11), E1353-E1362.

      Sul, J. H., Kim, H., Huh, N., Lee, D., & Jung, M. W. (2010). Distinct roles of rodent orbitofrontal and medial prefrontal cortex in decision making. Neuron, 66(3), 449-460.

      Swick, D. (2005). 13 ERPs in Neuropsychological Populations. Event-related potentials: A methods handbook, 299.

      Swaab, T. Y. (1998). Event-related potentials in cognitive neuropsychology: Methodological considerations and an example from studies of aphasia. Behavior Research Methods, Instruments, & Computers, 30(1), 157-170.

      Tiitinen, H., May, P., Reinikainen, K., & Näätänen, R. (1994). Attentive novelty detection in humans is governed by pre-attentive sensory memory. Nature, 372(6501), 90-92.

      Tobler, P. N., O’Doherty, J. P., Dolan, R. J., & Schultz, W. (2006). Human neural learning depends on reward prediction errors in the blocking paradigm. Journal of Neurophysiology, 95(1), 301-310.

      Tremblay, L., & Schultz, W. (1999). Relative reward preference in primate orbitofrontal cortex. Nature, 398(6729), 704-708.

      Uhrig, L., Dehaene, S., & Jarraya, B. (2014). A hierarchy of responses to auditory regularities in the macaque brain. Journal of Neuroscience, 34(4), 1127-1132.

      Ungan, P., Karsilar, H., & Yagcioglu, S. (2019). Pre-attentive mismatch response and involuntary attention switching to a deviance in an earlier-than-usual auditory stimulus: an ERP study. Frontiers in Human Neuroscience, 13, 58.

      Wacongne, C., Labyt, E., van Wassenhove, V., Bekinschtein, T., Naccache, L., & Dehaene, S. (2011). Evidence for a hierarchy of predictions and prediction errors in human cortex. Proceedings of the National Academy of Sciences, 108(51), 20754-20759.

      Walton, M. E., Behrens, T. E., Buckley, M. J., Rudebeck, P. H., & Rushworth, M. F. (2010). Separable learning systems in the macaque brain and the role of orbitofrontal cortex in contingent learning. Neuron, 65(6), 927-939.

      Walton, M. E., Behrens, T. E., Noonan, M. P., & Rushworth, M. F. (2011). Giving credit where credit is due: orbitofrontal cortex and valuation in an uncertain world. Annals of the New York Academy of Sciences, 1239(1), 14-24.

      Wessel, J. R., Danielmeier, C., Morton, J. B., & Ullsperger, M. (2012). Surprise and error: common neuronal architecture for the processing of errors and novelty. Journal of Neuroscience, 32(22), 7528-7537.

      Wessel, J. R., Klein, T. A., Ott, D. V., & Ullsperger, M. (2014). Lesions to the prefrontal performance-monitoring network disrupt neural processing and adaptive behaviors after both errors and novelty. Cortex, 50, 45-54.

      Yamaguchi, S., & Knight, R. (1991). Anterior and posterior association cortex contributions to the somatosensory P300. Journal of Neuroscience, 11(7), 2039-2054.

    1. Author Response

      The following is the authors’ response to the previous reviews.

      We are grateful for the helpful comments of both reviewers and have revised our manuscript with them in mind.

      One of the main issues raised was that readers may by default assume that our models are correct. We in fact made it very clear in our discussion that the models are merely hypotheses that will need testing by “wet” experiments and we do not therefore agree that even readers unfamiliar with AF would assume that the models must be correct. It was also suggested that readers could be reassured by including extensive confidence estimates such as PAE plots. As it happens, every single model described in the manuscript had reasonably high PAE scores and more crucially the entire collection of output files, including PAE data, are readily accessible on Figshare at https://doi.org/10.6084/m9.figshare.22567318.v2, a fact that the reviewers appear to have overlooked. The Figshare link is mentioned three times in the manuscript. Embedding these data within the manuscript itself would in our view add even more details and we have therefore not included them in our revised manuscript. Likewise, it is rather simple for any reader to work out which part of a PAE matrix corresponds to an interaction observed in the corresponding pdb prediction. Besides which, it is our view that the biological plausibility and explanatory power of models is just as important as AF metrics in judging whether they may be correct, as is indeed also the case for most experimental work.

      Another important point was that the manuscript was too long and not readable. Yes, it is long and it could well be argued that we could have written a different type of manuscript, focusing entirely on what is possibly the simplest and most important finding, namely that our AF models suggest that in animal cells Wapl appears to form a quarternary complex with SA, Pds5, and Scc1 in a manner suggesting that a key function of Wapl’s conserved CTD is to sequester Scc1’s Nterminal domain after it has dissociated from Smc3. For right or for wrong, we decided that this story could not be presented on its own but also required 1) an explanation for how Scc1 is induced to dissociate from Smc3 in the first place and 2) how to explain that the quarternary complex predicted for animal cells was not initially predicted for fungi such as yeast. The yeast situation was an exception that clearly needed explaining if the theory was to have any generality and it turned out that delving into the intricate details of the genetics of releasing activity in yeast was eventually required and yielded valuable new insights. We also believe that our work on the recruitment of Eco/Esco acetyl transferases to cohesin and the finding that sororin binds to the Smc3/Scc1 interface also provided important insight into how releasing activity is regulated. We acknowledge that the paper is indeed long but do not think that it is badly written. It is above all a long and complex story that in our view reveals numerous novel insights into how cohesin’s association with chromosomes is regulated and have endeavoured to eliminate any excessive speculation. We feel it is not our fault that cohesin uses complex mechanisms.

      Notwithstanding these considerations, we have in fact simplified a few sections and removed one or two others but acknowledge that we have not made substantial cuts.

      It was pointed out that a key feature of our modelling, namely the predicted association of Wapl’s C-terminal domain with SA/Scc3’s CES is inconsistent with published biochemical data. The AF predictions for this interface are universally robust in all eukaryotic lineages and crucially fully consistent with published and unimpeachable genetic data. We note that any model that explains all findings is bound to be wrong for the very simple reason that some of these findings will prove to be incorrect. There is therefore an art in Science of judging which data must be explained and accommodated and which should be ignored. In this particular case, we chose to ignore the biochemistry. Time will tell whether our judgement proves correct.

      Last but not least, it was suggested that we might provide some experimental support for our proposed SA/Scc3-Pds5-Scc1-WaplC quaternary complex. We are in fact working on this by introducing cysteine pairs (that can be crosslinked in cells) into the proposed interfaces but decided that such studies should be the topic of a subsequent publication. It would be impossible with the resources available to our labs to follow up all of the potential interactions and we therefore decided to exclude all such experiments.

      We are grateful for the detailed comments provided by both reviewers, many of which were very helpful, and in many but not all cases have amended the manuscript accordingly.

      With regard to the more specific comments:

      Reviewer #1 (Recommendations For The Authors):

      1) One concern is that observed interfaces/complexes arise because AF-multimer will aim to pack exposed, conserved and hydrophobic surfaces or regions that contain charge complementarity. The risk is that pairwise interaction screens can result in false positive & non-physiological interactions. It is therefore important to report the level of model confidence obtained for such AF calculations:

      A) The authors should color the key models according to pLDDT scores obtained as reported by AF. This would allow the reader to judge the estimated accuracy of the backbone and side chain rotamers obtained. At least for the key models and interactions it would be important to know if the pLDDT score is >90 (Correct backbone and most rotamers) or >70 (only backbone is correct).

      B) It would also be important to report the PAE plots to allow estimation of the expected position error for most of the important interactions. pLDDT coloring and PEA plots can be shown side-by-side as shown in other published data (e.g. https://pubmed.ncbi.nlm.nih.gov/35679397/ (Supplementary data)

      C) The authors should include a Table showing the confidence of template modeling scores for the predicted protein interfaces as ipTM, ipTM+pTM as reported by AlphaFold-multimer. Ideally, they would also include DockQ scores but this may not be essential. Addition of such scores would help classification into Incorrect, Acceptable or of high quality. For example, line 1073 et seq the authors show a model of a SCC1SA and ESCO1 complex (Fig. 37). Are the modeling scores for these interfaces high? It does not help that the authors show cartoons without side chains? Can the authors provide a close-up view of the two interfaces? Are the amino acids are indeed packed in a manner expected for a protein interface? Can we exclude the possibility that the prediction is obtained merely because the sequence segments (e.g. in ESCO1 & ESCO2) are hydrophobic and conserved?

      We do not agree that including this level of detail to the text/figures of the manuscript would be suitable. All the relevant data for those who may be sceptical about the models are readily available at https://doi.org/10.6084/m9.figshare.22567318.v2. In our view, the cartoon versions of the models are easier for a reader to navigate. Anyone interested in the molecular details can look at the models directly.

      Importantly, no amount of statistical analysis can completely validate these models. What is required are further experiments, which will be the topic of further work from our and I dare from other laboratories.

      D) When they predict an interaction between the SA2:SCC1 complex and Sororin's FGF motif, they find that only 1/5 models show an interaction and that the interaction is dissimilar to that seen of CTCF. Again, it would be helpful to know about modeling scores. Can they show a close-up view of the SORORIN FGF binding interface to see if a realistic binding mode is obtained? Can they indicate the relevant region on the PAE plot?

      Given that AF greatly favours other interactions of Sororin’s FGF motif over its interaction with SA2-Scc1, we do not agree that dwelling on the latter would serve any purpose.

      2) Line 996: AF predicts with high confidence an interaction between Eco1 & SMC3hd. What are the ipTM (& DockQ if available) scores. Would the interface score High, Medium or Acceptable?

      As mentioned, see https://doi.org/10.6084/m9.figshare.22567318.v2.

      3) Line 1034 et seq: Eco1/ESCO1/ESCO2 interaction with PDS5. Interface scores need to be shown to determine that the models shown are indeed likely to occur. If these interactions have low model confidence, Fig. 36 and discussion around potential relevance to PDS5-Eco1 orientation relative to the SMC3 head remains highly speculative and could be expunged.

      See https://doi.org/10.6084/m9.figshare.22567318.v2. It should be clear that the predictions are very similar in fungi and animals. Crucially, we know that Pds5 is essential for acetylation in vivo, so the models appear plausible from a biological point of view.

      4) Considering the relatively large interface between ECO1 and SMC3, would the author consider the possibility that in addition to acetylating SMC3's ATPase domain, ECO1 remains bound to cohesin-DNA complex, as proposed for ESCO1 by Rahman et al (10.1073/pnas.1505323112)?

      This is certainly possible but we would not want to indulge in such speculation.

      5) E.g. Line 875 but also throughout the text: As there is no labeling of the N- and C-termini in the Figures, is frequently unclear what the authors are referring to when they mention that AF models orient chains in a certain manner.

      Good point. This has been amended. However, the positions of N- and C- is all available at https://doi.org/10.6084/m9.figshare.22567318.v2.

      6) Fig19B: PAE plots: authors should indicate which chains correspond to A, B, C. Which segment corresponds to the TYxxxR[T/S]L motif? Can they highlight this section on the PAE plot?

      Good point and amended in the revised manuscript.

      Minor comments:

      1) Line 440: the WAPL YSR motif is not shown in Fig. 14A

      2) Line 691: Scc3 spelling error.

      3) Line 931: Sentence ending '... SCC3 (SCC3N).' requires citation.

      4) Line 1008: Figure reference seems wrong. It should read: Fig. 34A left and right. Fig. 34B does not contain SCC1.

      Many thanks for spotting these. Hopefully, all corrected.

      5) Fig. 41 can be removed as it shows the absence of the interaction of Sororin with SMC1:SCC1. Sufficient to mention in the text that Sororin does not appear to interact with SMC1:SCC1.

      This is possible but we decided to leave this as is.

      Reviewer #2 (Recommendations For The Authors):

      Minor points

      (1) Are there any predicted models in which one of the two dimer interfaces of the hinge is open when the coiled coils are folded back, as seen in the cryo-EM structure of human cohesin-NIPBL complex in the clamped state?

      No AF runs ever predicted half opened hinges. It is possible that the introduction of mutations in one of the two interfaces might reveal a half-opened state and we ought to try this. However, it would not be appropriate for this manuscript, we believe.

      (2) Structures of the SA-Scc1 CES bound to [Y/F]xF motifs from Sgo1 and CTCF have been reported, suggesting that a similar motif could interact with SA/Scc3. Surprisingly, AF did not predict an interaction between Scc3/SA and Wapl FGF motifs, which only bind to the Pds5 WEST region. On the other hand, AF predicted interactions of the Sororin FGF motif with both Pds5 WEST and SA CES. Can the authors comment on this Wapl FGF binding specificity? What will happen if a Wapl fragment lacking the CTD is used in the prediction?

      This seems to be an academic point as the CTD is always present.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      FULL REVISION

      The preprint of this article is uploaded in bioRxiv (doi:10.1101/2023.06.03.543550) (June 2023) and has been previously submitted and revised by peers in Review Commons. We attach the full Review, with a detailed answer to the Reviewers and a new revised version of the manuscript following the Reviewer’s comments and suggestions, adding new data, a new Supplementary figure, as well as a revision of the text and discussion. We are thankful to the reviewers for their helpful comments, which have further clarified some of aspects of our work and overall improved the quality of our manuscript.

      All the line’s numbering mentioned in the point by point answer to the reviewers refer to the converted pdf of the revised manuscript, including the main figures.

      Answers to Reviewer #1

      Aísa-Marín et al. present a detailed scRNAseq description of two Nr2e3 mouse models the authors had published in Neurobiology of Disease in 2020. These two mouse models do not mirror known pathogenic variants in human patients, but are useful animal models to understand disease mechanisms in NR2E3-linked retinal degenerations. The present follow-up study has been carefully done, the bioinformatic analysis is sound and selected pathways have been validated at protein level by immunohistochemistry and Western blot. The impact of this data on photoreceptor development and maintenance is somewhat decreased because previous data by the groups of Joseph Corbo, Connie Cepko, Jeremy Nathans, Anand Swaroop and others have already shown several years ago upregulation of cone-specific genes in a spontaneous Nr2e3 mutant mouse, the rd7 mouse. In these papers, hybrid cones both expressing cone- and rod-specific genes have been described and coined 'cods'. The authors are encouraged to use already defined terms in their paper. For instance, the concept of 'half differentiated' photoreceptors is unclear and must be rephrased. In general, existing literature is not always integrated adequately into the submitted work. The detailed remarks are listed below.

      We thank the reviewer for this suggestion to incorporate previous terminology in the field for the sake of clarity. Although it is true that previous authors have described hybrid photoreceptors, they have not quantified them, or compared their contribution to the different subpopulations of photoreceptors in different models of disease. For instance, there may be different types of hybrid photoreceptors, as they can be identified both within the cone and the rod populations, they show different characteristics and might perform different roles or else, be indicative of a pathological phenotype.

      Taking into account the reviewer’s suggestion, we have carefully revised our text and figures and the terms “intermediate” or “half differentiated photoreceptors” have been substituted by the already existing term “hybrid photoreceptors” of “incompletely differentiated”. We have also included some context in previous work regarding ‘cods’.

      See now lines 60, 146, 215, 271, 638-645 in the Discussion, Figure 7 legend and the Graphical Abstract for changes in terminology, and also sentences in lines 518-522, for reference to previous works.

      Reviewer #1

      One important question the authors must also address in their scRNAseq analysis is why the well described 'mixed' S- and M-opsin expressing cones do not seem detectable, are actually not even mentioned?

      We sincerely thank the reviewer for this comment, as it is a relevant piece of information that we missed in our previous analysis. Cones are usually classifed according to the type of opsin they express. In mouse, previous work has described cones expressing solely S- or M-opsins, but also cones that co-express both types of opsins. Indeed, we find these three types of cones, overlapping partially with our cone subpopulations, which are defined by a larger number of signature genes. As previously described, merging the results of the wild-type with the two mutants demonstrate that most cones co-express both opsins (roughly 46%), S- and M-opsins are expressed exclusively in around 13% cone cells each, and somewhat surprisingly around 28% of cones do not express any opsin (Fig. EVF6). However, the dissection of cone results by genotype is more informative, as the percentages vary according to the mutant genotype. In the mutants, the percentage of cones expressing no opsins is higher than in the wild-type at the expense of the cones that should express solely S-opsin (data in EVF5). Again reflecting the impact of Nr2e3 mutations on the differentiation of cones and reinforcing our main message.

      We have introduced several sentences in the text to reflect this results and refer to this new figure EVF6 (see lines 292-307 in the Results as well as 575-580 in the Discussion sections.

      Reviewer 1. Detailed comments:

      1. Graphical abstract: replace 'half differentiated' by incompletely differentiated or similar

      As detailed above, we have followed the suggestion of the reviewer and throroughly revised all the text and all Figures. “Half differentiated” and “intermediate photoreceptors” have been substituted by “incompletely differentiated” and “hybrid photoreceptors”, respectively.

      1. l.86: to my best knowledge, the shorter isoform has only been described at transcript level in humans, no evidence at protein level, please clarifiy. Please also state that the isoform lacking exon 8 is due to retention of intron 7.

      Following the suggestion of the reviewer, we have clarified that the short NR2E3 transcript isoform is due to retention of intron 7, both in human and mouse. We have also clarified that in human, the protein has been computationally predicted, whereas there are experimental evidences in mouse. (lines 94-104).

      1. l.89: lacks repressor and dimerization domains. Exon 8 is not coding all repressor and dimerization domains. The authors do not mention neither the D-box in the DBD that also contributes to dimerization, in addition to the LBD (von Alpen et al., Hum Mut, 2015). Furthermore, repressor domain should be presented in the context of the auto-repressed structure of NR2E3 (Zhu et al., Genes Dev, 2015).

      We agree with the reviewer that the original text was simplified. For the sake of clarity and following his/her suggestion we now include more context about the structure of NR2E3 and included the suggested references (lines 87-93).

      See also below the answer to the points 6, 9 and 13, which are also related.

      1. l.90: typo NR2E3

      The corresponding line is now on line 105 and the typo has been corrected.

      1. l.93-106: incorrect, please rewrite whole paragraph. There is only one single pathogenic variant leading to NR2E3-Gly56Arg-linked autosomal dominant retinitis pigmentosa, all other pathogenic variants are recessive and cause ESCS!

      Now these paragraph starts in line 105. We agree with the reviewer that so far only mutation Gly56Arg in NR2E3 is associated to autosomal dominant retinitis pigmentosa. however there are also some (even if few) NR2E3 mutations associated to autosomal recessive forms of RP, in addition to the most well known autosomal recessive mutations that cause ESCS. Here we provide a selection of reports supporting NR2E3 as causative of arRP (that are some more included in HMGD).

      In order to avoid further confusion to some readers, we also include these references in the new version (see lines 109-112).

      • Gerber, S. et al. The photoreceptor cell-specific nuclear receptor gene (PNR) accounts for retinitis pigmentosa in the Crypto-Jews from Portugal (Marranos), survivors from the Spanish Inquisition. Hum Genet 107, 276–284 (2000). https://doi.org/10.1007/s004390000350
      • Kannabiran, et al. Mutations in TULP1, NR2E3, and MFRP genes in Indian families with autosomal recessive retinitis pigmentosa. Mol. Vis. 2012; 18:1165-74..
      • Bocquet, B. et al. Homozygosity mapping in autosomal recessive retinitis pigmentosa families detects novel mutations. Mol Vis 19:2487-500.
        1. l.110: see comment above about dimerization.

      We have modified the sentence in the text accordingly, see now line 125.

      1. l.162: ok, but the main reason for restricting the analysis to photoreceptors should be the photoreceptor-specific expression of Nr2e3 though...

      We have specified that Nr2e3 is solely expressed in photoreceptor cells (see line 176).

      1. l.165: please specify what is the percentage of rods with respect to all retinal cells

      The percentage of rods is around 77.7% of all cells (now on line 180). We value the suggestion, as it adds information to the reader. Therefore, the percentage of each main cell type is now included in Figure 1B.

      1. l.210: idem l.89

      We have modified the sentence in the text accordingly (now on lines 226-227).

      1. l.310: replace 'halfway'

      As specified in the answer to the first main point, we have throroughly revised the text and amended the references towards hybrid and incompletely differentiated photoreceptors.

      1. l.337: as expected? please detail

      We agree that the sentece was not clear. We have now clarified that our results agree with previous studies (see lines 370-371).

      1. l.388: discuss also crystallins in other RD models, v.g. Rpe65 ko mice

      We thank the reviewer for this suggestion and have included a brief description of the expression of crystallins in response to retinal stress in other RD models, and include the appropriate references (now on lines 422-428).

      1. l.469: idem l.89

      We have modified the sentence in the text accordingly (see lines 506-509).

      1. l.526: Please discuss increase in non-apoptotic cell death markers with respect to published data in the rd7 mouse (Venturini et al., Sci Rep, 2021)

      We have included published data in the rd7 mouse and discussed that multiple non-apoptotic cell death markers might be activated in response to NR2E3 disfunction (see lines 587-593).

      1. l.580: the proposed dominant negative effect is overtly speculative and not supported by any presented data, please remove.

      We have rewritten the sentence removing the dominant negative effect and referring exclusively to our results.

      Answers to Reviewer #2

      Aísa-Marín et al. present a detailed scRNAseq description of two Nr2e3 mouse models the authors had published in Neurobiology of Disease in 2020. These two mouse models do not mirror known pathogenic variants in human patients, but are useful animal models to understand disease mechanisms in NR2E3-linked retinal degenerations. The present follow-up study has been carefully done, the bioinformatic analysis is sound and selected pathways have been validated at protein level by immunohistochemistry and Western blot.

      Major comments:

      1. Overall, the conclusions of the study are well supported by the results. The findings provide valuable insights into retinal development and the pathogenesis of NR2E3-associated retinal dystrophies in an animal model, which need to be validated in humans. This limitation should be noted in the manuscript.

      Indeed, we agree that our work has used animal models, but further work in human-derived models (e.g. retinal organoids) should be performed to confirm these results. We have clarified this point in the Discussion section (see lines 649-650).

      Include an explanation in the text about how the specificity of the different signals detected by immunohistochemistry was assessed.

      The specificity of each antibody was assessed by introducing a negative control into the immunostaining procedure, wherein only the secondary antibody was used, as well as by comparison to the staining of the protein of interest in wild type or basal conditions reported in previous work from ours or other groups. This has been now specified in the corresponding Methods section (see lines 912-916).

      Explain the observed high variability in the percentage of cones, particularly between the two deltaE8 mutants (Figure 3C).

      We believe that the main characteristic of the Nr2e3-mutant retinas is that photoreceptors are not adequately differentiated and thus, affected photoreceptor subpopulations, in this case cones, degnerate and die. The pathogenic phenotype might be somewhat variable between animals from the same genotype (as it also happens in siblings carrying the same mutation). The deltaE8 mutants show a very different cone subpopulation pattern compared to wild-types and they clearly cluster with the other mutant retinas. For instance, cones of subpopulation cone4 might have all died.

      We take note of the question posed by the reviewer, and thus have included a graph of the absolute number of cones in each subcluster per retina and genotype, which may help the reader (see new Fig 3C, panel on the right).

      Explain why PARP-1 signals in Figure 6E are so thick and intense. Why this thick pattern is also present in the wild type retina?

      The IHC of mouse tissue sections using antibodies of mouse origin can result in background by secondary antibody binding to the endogenous Igs (as reported in Eng et al, 2016, https://doi.org/10.1093/protein/gzv054). The PARP-1 antibody is a mouse monoclonal antibody (Abcam, ab14459). The strong signal that we observe in the INL, IPL, and GL of the retina is typically found when using primary mouse antibodies in mouse tissues and corresponds to the reaction of the secondary anti-mouse antibodies binding to the endogenous IgGs in the blood vessels of the retina (we obtain the same result using the secondary antibody as a negative control, see answer to point 2).

      For the sake of clarity, we have clarified this background staining in the corresponding figure legend (lines 831-834).

      Reviewer #2

      Minor comments:

      1. Describe the abbreviations used in the text, such as PARP-1, MLKL, IRD, and VADC.

      A list of abbreviations has been included at the end of the main text (see lines 663-672).

      1. Improve the visibility of number 4 in Figure 3A and describe the meaning of the insert.

      Figure 3A has been amended accordingly

      1. Label the X-axis (cone subclusters) in Figure 3E.

      The X-axis is now correctly labelled in Figure 3E.

      1. Describe the meaning of the insert in Figure 4A.

      Figure 4A legend now contains the meaning of the insert.

      1. Indicate the relationship among inserts in Figure 4F.

      Figure 4F and legend have been modified to clarify the meaning of the insert.

      1. Use an arrow to indicate the higher expression of CSTB in the cone-rich invaginations in the mutant retinas (Figure 6A).

      The Figure 6A has been modified to include white arrowheads indicating the high expression of CSTB in the invaginations of the mutant retinas. This is also indicated in the corresponding Figure 6 Legend (lines 819-820.

      1. Revise the Y-axis values in Figure 6B, as they do not correspond to a percentage. Please, provide an explanation for the number of symbols displayed in this panel.

      The Y-axis was previously expressed on a per-unit basis. For the sake of clarity and following the reviewer’s suggestion, it has now been appropriately modified to percentage of CSTB colocalizing with opsins.

    1. Author Response

      The following is the authors’ response to the original reviews.

      We appreciate very much the comments and suggestions on our manuscript "Cylicins are a structural component of the sperm calyx being indispensable for male fertility in mice and human". According to the comments, we performed a series of further experiments, re-worded and re-wrote several paragraphs and re-structured the manuscript according to the reviewers’ comment. We think that the manuscript is now improved and are looking forward to the further evaluations. We provide a point by point response to all comments and have prepared a version.

      Recommendations for the authors:

      Editor’s comment:

      1) As pointed out by all three reviewers, it is critical to show the specificity of the antibodies used. The authors should clarify how the specificity of antibodies is tested. Western blot analysis to show the absence of the protein in the knockout is essential.

      As suggested by all reviewers, we additionally performed Western Blot analysis on cytoskeletal protein fractions to further verify the specificity of generated antibodies and the generation of functional knockout alleles. Results nicely confirm the results of the IF staining, however, both anti-bodies detected the bands lower than the predicted molecular weight. In addition, Mass Spectrometry was performed to search for the presence of peptides in the cytoskeletal protein fractions. The paragraph reads now as follows:

      Line 127-134: Additionally, Western Blot analyses confirmed the absence of CYLC1 and CYLC2 in cytoskeletal protein fractions of the respective knockout (Fig. 1 G), thereby demonstrating i) specificity of the antibodies and ii) validating the gene knockout. Of note, the CYLC1 antibody detects a double band at 40-45 KDa. This is smaller than the predicted size of 74 KDa as, but both bands were absent in Cylc1-/y. Similarly, the CYLC2 Antibody detected a double band at 38-40 KDa instead of 66 KDa. Again, both bands were absent in Cylc2-/-. Next, Mass spectrometry analysis of cytoskeletal protein fraction of mature spermatozoa was performed detecting both proteins in WT but not in the respective knockout samples (Figure 1 – supplement 5; Figure 1 – supplement 6).

      Specificity of antibodies was additionally proven by immunohistochemical staining, showing a specific staining only in testis sections but not in any other organ tested. The section reads now as follows:

      Line 115-117: Specificity of antibodies was proven by immunohistochemical stainings (IHC), showing a specific signal in testis sections only, but not in any other organ tested (Figure 1 – supplement 2)

      2) Re-structuring/streamlining of the figures is recommended. Please consider the flow suggested by reviewer #2 and shorten the evolutionary analysis which takes up more space than it adds to the value of the story.

      We thank the reviewers and editor for the valuable suggestion. We re-structured the figures as suggested and rewrote the results section accordingly. The evolutionary analysis was significantly shortened.

      3) Provide statistics for the imaging analysis such as TEM as only a single representative image is shown.

      We agree that the observed morphological defects require a detailed statistical evaluation. TEM analysis was performed to confirm the results from optical microscopy and representative images with high magnification are shown for a detailed visualization of the defects. For additional quantification, we included statistics for IF stainings against calyx proteins CCIN and CapZa (Fig. 2 I-J). For TEM, we added additional images to the supplement (Figure 3 – supplement 1). Furthermore, we quantified the manchette length of step 10-13 spermatids to prove the increased elongation of the manchette in Cylc2-/- and Cylc1-/y Cylc2-/- spermatids (Fig. 5 A-B).

      4) Please consider other points raised by the reviewers below to improve the manuscript and provide responses on how the authors have addressed them.

      We thank all reviewers for the detailed review of our manuscript and their valuable suggestions, which helped a lot to improve the manuscript. We considered all points raised by the reviewers to the best of our knowledge and hope that the reviewers will approve the manuscript ready for publication. We added a point-by-point discussion of all comments/suggestions hereafter.

      Reviewer #1 (Recommendations For The Authors):

      Major comments:

      (1) Antibody specificity: Fig 1E - there are some unspecific binding in Cylc2-/- for CYLC2 and in Cylc1/y Cylc2+/- for CYLC1 in the testis (and elongating spermatids in Figure 1 – Supplement 4). Could authors elaborate/comment on this? Western blot analysis would be also helpful to further support the antibody specificity.

      The very weak unspecific staining in the testis for CYLC2 (in Cylc2-/-) and CYLC1 (in Cylc1-/y Cylc2+/-) is only present in the lumen of the seminiferous tubules and/or the residual bodies of the testicular sperm cells and can be referred to as background signal. Importantly, the signal is entirely lost in the PT region, proving specificity of the generated antibodies. We added the following paragraph to the results section:

      Line 124-127: The generated antibodies did not stain testicular tissue and mature sperm of Cylc1- and Cylc2-deficient males, except for a very weak unspecific background staining in the lumen of seminiferous tubules and the residual bodies of testicular sperm (Fig. 1 F).

      Specificity of antibodies was additionally proven by immunohistochemical staining, showing a specific staining only in testis sections but not in any other organ tested.

      Line 115-117: Specificity of antibodies was proven by immunohistochemical stainings, showing a specific staining in testis sections only, but not in any other organ tested (Figure 1 – supplement 2)

      To further verify the specificity of generated antibodies and the generation of functional knockout alleles, we additionally performed Western Blot analysis on cytoskeletal protein fractions, confirming the results of the IF staining. No unspecific bands were detected in the Western Blot, further supporting the notion that the weak unspecific signals in IF resemble staining artifacts.

      The paragraph reads now as follows:

      Line 127-132: Additionally, Western Blot analyses confirmed the absence of CYLC1 and CYLC2 in cytoskeletal protein fractions of the respective knockout (Fig. 1 G), thereby demonstrating i) specificity of the antibodies and ii) validating the gene knockout. Of note, the CYLC1 antibody detects a double band at 40-45 KDa. This is smaller than the predicted size of 74 KDa as, but both bands were absent in Cylc1-/y. Similarly, the CYLC2 Antibody detected a double band at 38-40 KDa instead of 66 KDa. Again, both bands were absent in Cylc2-/-.

      (2) Please provide more interpretation of the gene dosage effect of Cylicin 2. It is not common to see a gene dosage effect in the sperm phenotype as transcripts and proteins can be shared between haploids due to syncytium formation during spermatogenesis.

      We agree and we apologize for the misinterpretation. In Cylc2+/- mice expression of Cylc2 was reduced by half but there was no altered phenotype observed. The sentence now reads as follows:

      Line 112: In Cylc2+/- animals expression of Cylc2 was reduced by 50 %.

      (3) Line 194-196 - the authors say that the sperm are smaller, with shorter hooks and increased circularity of the nuclei, and reduced elongation. Are these statistically significant? There seems to be a high variation in the graph in S2D and the statistical analysis is not given.

      We agree, performed statistical analyses, and highlighted significantly altered values for sperm head elongation and circularity in Figure 2 – Supplement 3.

      (4) Line 153-164 It is interesting that the absence of Cylc2 affected many parts of sperm structure. I think some ratios of sperm always have a morphological defect in diverse ways, so it is hard to confirm the finding only with a single sperm image. I think that it will be important to do some statistical analysis or at the minimum show more TEM images from TEM.

      We agree that the observed morphological defects require a detailed statistical evaluation. TEM analysis was performed to confirm the results from optical microscopy and representative images with high magnification are shown for a detailed visualization of the defects. For additional quantification, we included statistics for IF stainings against calyx proteins CCIN and CapZa (Fig. 2 I-J). For TEM, we added additional images to the supplement (Figure 3 – Supplement 1).

      (5) Line 236-242 - I believe that the phenotype described applies to the sperm from Cylc2-/- and Cylc1/y Cylc2-/- animals; however, I think that the Cylc1-/y Cylc2+/- has a more subtle, intermediate phenotype between the WT and the genotypes missing both Cylc-/- alleles.

      We agree and we added a quantification of manchette length at step 10-13 to visualize the differences between the genotypes. The section reads now as follows: Line 268-272: Manchette length was measured starting from step 10 until step 13 spermatids and the mean was obtained, showing that the average manchette length was 76-80 nm in wildtype, Cylc1-/Y and Cylc2+/- while for Cylc2-/- and Cylc1-/Y Cylc2-/- spermatids mean manchette length reached 100 nm (Fig. 5 B). Cylc1-/Y Cylc2+/- spermatids displayed an intermediate phenotype with a mean manchette length of 86 nm.

      (6) Since CYLC1 staining is absent in Fig 5B, does that mean that the mutation resulted in protein degradation/instability? Is RNA present? Additional biochemical studies of Cyclins demonstrating the deleterious nature of the mutations would strengthen the molecular pathogenesis of the human mutations.

      Thank you for raising these important questions. The CYLC1 variant c.1720G>C is predicted to cause the amino acid substitution p.(Glu574Gln). It is, thus, conceivable that the RNA is present but either the protein is degraded or misfolded and, therefore, not detectable by IF. Unfortunately, for personal reasons of the patient, it is currently not possible to receive additional semen samples, preventing additional analyses of the semen, e.g. analysis of Cylicin transcript level.

      (7) Strongly suggest shortening the evolutionary analysis - all the corresponding materials are in supplemental while texts are extensive- or even consider entirely omitting. It does not add a lot to the current study.

      We agree that the evolutionary analysis was very detailed. However, we think that the results are important to understand the role of Cylicins for male reproduction in general. The results obtained from the mouse model might be transferable to other species, including humans. Further, the results present a possible explanation for the subfertility of Cylc1-deficient mice, in contrast to infertility of Cylc2-deficient males. We shortened the section, the paragraph reads as follows:

      Line 287-302: To address why Cylc2 deficiency causes more severe phenotypic alterations than Cylc1deficiency in mice, we performed evolutionary analysis of both genes. Analysis of the selective constrains on Cylc1 and Cylc2 across rodents and primates revealed that both genes’ coding sequences are conserved in general, although conservation is weaker in Cylc1 trending towards a more relaxed constraint (Fig. 6). A model allowing for separate calculation of the evolutionary rate for primates and rodents, did not detect a significant difference between the clades, neither for Cylc1 nor for Cylc2, indicating that the sequences are equally conserved in both clades.

      To analyze the selective pressure across the coding sequence in more detail, we calculated the evolutionary rates for each codon site across the whole tree. According to the analysis, 34% of codon sites were conserved, 51% under relaxed selective constraint, and 15% positively selected. For Cylc2, 47% of codon sites conserved, 44% under neutral/relaxed constraint, and 9% positively selected. Interestingly, codon sites encoding lysine residues, which are proposed to be of functional importance for Cylicins, are mostly conserved. For Cylc1, 17% of lysine residues are significantly conserved and 35% of significantly conserved codons encode for lysine. For Cylc2, this pattern is even more pronounced with 27.9% of lysine codons being significantly conserved and 24.3% of all conserved sites encoding for lysine (Fig. 6).

      Minor comments:

      (1) Line 114, 115, 118 à Figure 1D is already well-described in the previous paragraph and thus redundant. Ref Fig 1D, E; but only figure E shows IF. Maybe supposed to be E and F or just 1E?

      We apologize for the mix-up with the subfigures. The mentioned paragraph refers to Fig. 1 E-F, which was corrected accordingly.

      Line 117-123: Immunofluorescence staining of wildtype testicular tissue showed presence of both, CYLC1 and CYLC2 from the round spermatid stage onward (Fig. 1 E). The signal was first detectable in the subacrosomal region as a cap-like structure, lining the developing acrosome (Fig. 1 E-F, Figure 1 – supplement 3). As the spermatids elongate, CYLC1 and CYLC2 move across the PT towards the caudal part of the cell (Figure 1 – supplement 4). At later steps of spermiogenesis, the localization in the subacrosomal part of the PT faded, while it intensified in the postacrosomal calyx region (Fig. 1 E-F).

      (2) Figure 1F - Arguably, IF images show expression of both CYLC1 and CYLC2 to reach/include the acrosome/hook portion of the sperm head, but the diagram does not reflect that. Why is that?

      We agree and apologize for the inconsistency. The illustration was adjusted according to the experimental data showing localization of Cylicins in the whole ventral part of the sperm.

      (3) Line 124 - PAS staining mentioned on line 124, is not explained (Periodic acid Schiff staining) until line 605

      We agree and introduced the abbreviation accordingly. The PAS staining was moved to Fig. 4. The paragraph reads now as follows:

      Line 220-222: To study the origin of observed structural sperm defects, spermiogenesis of Cylicin deficient males was analyzed in detail. PNA lectin staining and Periodic Acid Schiff (PAS) staining of testicular tissue sections were performed to investigate acrosome biogenesis.

      (4) Some figures are hard to read due to being very small (S1B, 3F).

      We agree and we increased the figure size. For former Figure 3F (now figure 4A), insets with higher magnification of representative sperm were added. Insets are additionally shown in Figure 4 – Supplement 1 in higher resolution.

      (5) Line 139 Please specify whether the sperm was capacitated or not.

      Analysis of the flagellar beat was performed with non-capacitated sperm. We clarified this in the main text:

      Line 203: The SpermQ software was used to analyze the flagellar beat of non-capacitated Cylc2-/- sperm in detail 22.

      As described in the Material and Methods section, sperm were only activated in TYH medium, prior to analysis:

      Line 732-733: Sperm samples were diluted in TYH buffer shortly before insertion of the suspension into the observation chamber.

      (6) Line 142-145; The sentence is interrupted strangely, perhaps the authors meant to write: "Interestingly, we observed that the flagellar beat of Cylc2-/- sperm cells was similar to wildtype cells, however, with interruptions during which midpiece and initial principal piece appeared stiff whereas high-frequency beating occurs at the flagellar tip"

      We corrected the sentence accordingly.

      Line 206-208: Interestingly, we observed that the flagellar beat of Cylc2-/- sperm cells was similar to wildtype cells, however, with interruptions during which midpiece and initial principal piece appeared stiff whereas high frequency beating occurs at the flagellar tip (Fig. 3 C, Video 1, Video 2).

      (7) Line 142 -Wrong Figure number. Figure S4A is a phylogenic analysis.

      We regret the mix up and corrected the Figure reference accordingly. Line 204-205: Cylc2-/- sperm showed stiffness in the neck and a reduced amplitude of the initial flagellar beat, as well as reduced average curvature of the flagellum during a single beat (Figure 3 – supplement 2).

      (8) L146-147 Better placed in Discussion.

      We agree, and we omitted this sentence from the results part.

      (9) Line 154-156 - The white arrowheads are present in both WT and KO sperm, thus it appears they denote the basal plate, not necessarily the dislocation/parallel position as the current text seems to suggest. Furthermore, the position of the WT and KO sperm is somewhat different with the tail coiling differently, so it is hard to see whether the two are comparable.

      We agree and we removed the white arrowhead in the WT sperm picture, and it now depicts only the dislocation of the basal plate in the Cylc2-/- sperm. Due to the morphological anomalies of Cylc2-/- sperm cells, it’s difficult to determine the exact angle of the depicted cell. However, we added more TEM pictures of the sperm cells (3 for WT and 6 for Cylc2-/-) in Figure 3 – Supplement 1.

      (10) Line 164 Please describe in detail what mitochondrial damage the readers expect to see from the TEM image.

      We evaluated the observed mitochondrial damage in more detail. Unfortunately, the defects described initially seem to be an artifact of apoptotic sperm cells and could not be identified in vital sperm cells in either of the knockout mouse models. We apologize for this misinterpretation, and we deleted this section in the manuscript.

      (12) Figure S2A - no WT comparison, difficult to compare without it (mitochondria in Cylc2-/-)

      See (10). We evaluated the observed mitochondrial damage in more detail and in comparison to WT. Unfortunately, the defects described initially seem to be an artifact of apoptotic sperm cells and could not be identified in vital sperm cells in either of the knockout mouse models. We apologize for this misinterpretation and we deleted this section in the manuscript.

      (13) Line 172-173 - Fig 3C denotes quantification of abnormal acrosome only, however, the text mentions sperm coiled tail being quantified within this graph - which is it? Is it both of them? Or only one of them?

      Figure 3 C (now Figure 2G) showed the percentage of abnormal sperm in general comprising acrosomal as well as flagellar defects. We modified the figure and evaluated acrosomal defects and tail defects separately. The results section was changed accordingly and reads now as follows:

      Line 152-159: Loss of Cylc1 alone caused malformations of the acrosome in around 38% of mature sperm, while their flagellum appeared unaltered and properly connected to the head. Cylc2+/- males showed normal sperm tail morphology with around 30% of acrosome malformations. Cylc2-/- mature sperm cells displayed morphological alterations of head and mid-piece (Fig. 2 F-G). 76% of Cylc2-/- sperm cells showed acrosome malformations, bending of the neck region, and/or coiling of the flagellum, occasionally resulting in its wrapping around the sperm head in 80% of sperm (Fig. 2 F). While 70% of Cylc1-/Y Cylc2+/- sperm showed these morphological alterations, around 92% of Cylc1-/YCylc2-/- sperm presented with coiled tail and abnormal acrosome (Fig. 2 F-G).

      (14) Fig 3D - CCIN in the text, cylicin in the figure - this should be consistent. Furthermore, since only the head is being shown, is CCIN ever detected in the WT sperm tail?

      We apologize for the inconsistency, and we added the abbreviation “CCIN” to the figure. CCIN is solely detectable in the sperm head of wildtype sperm as published previously. Irregular staining patterns showing signals in the tail region are only observed upon Cylicin deficiency.

      (15) Line 199-200 - To say that head of Cylc2-deficient sperm appears less concave seems redundant, likely the observed increased circularity is contributed to by sperm head being less concave in this region; unless there is an extra point that the authors are trying to make and if so, this needs to be elaborated on

      We agree and we deleted the sentence from the manuscript.

      (16) Figure legend of Fig S3 is wrong. Only S3A and S3B are present, and in the figure legend S3C corresponds to figure S3B.

      We agree and corrected the Figure legends accordingly. Due to the re-structuring of the manuscript, Figures and Supplementary figures were re-ordered as well.

      (17) Figure 4B - figure legend and/or text should specify that lectin is green and HOOK1 is in red

      We specified the figure legend as well as the main text accordingly: Line: 279-281: Co-staining of the spermatids with antibodies against PNA lectin (green) and HOOK1 (red) revealed that abnormal manchette elongation and acrosome anomalies simultaneously occurred in elongating spermatids of Cylc2-/- male mice (Fig. 5 C).

      Line: 560-562: Co-staining of the manchette with HOOK1 (red) and acrosome with PNA-lectin (green) is shown in round, elongating and elongated spermatids of WT (upper panel) and Cylc2-/- mice (lower panel).

      (18) Line 261-263 - It is difficult to see what is going on with microtubules in these images, as the resolution is low

      We increased the pictures and improved their quality. Microtubules are also depicted with letter ‘m’

      (19) Line 265-266 - It seems that there is a persistence of manchette, rather than elongation. From these images, I cannot see gaps, and I am not sure where to look for them. This needs to be labelled further and higher-resolution images could be included for clarity.

      We agree, although we observed both excessive elongation and persistence of the manchette. The average length of the manchette is now shown in figure 5B.

      The paragraph now reads as follows:

      Line 235-239: Microtubules appeared longer on one side of the nucleus than on the other, displacing the acrosome to the side and creating a gap in the PT (Fig. 4 C). Whereas elongated spermatids at step 14-15 in wildtype sperm already disassembled their manchette and the PT appeared as a unique structure that compactly surrounds nucleus, in Cylc2-/- spermatids, remaining microtubules failed to disassemble.

      The gaps in the perinuclear theca are better visible in TEM micrographs and the description is now in the paragraph describing TEM.

      (20) Line 269 Please include the information of "White arrowhead".

      We added the information accordingly.

      Line 240-242: In addition, at step 16, the calyx was absent, and an excess of cytoplasm surrounded the nucleus and flagellum (Fig. 4 C, white arrowhead).

      (21) Line 276-280 This should be placed in the Discussion.

      We agree, and we deleted this concluding remark from the results section.

      (22) Is Cylc1 and/or Cylc2 conserved/expressed amongst species other than rodents and primates?

      Yes, Cylc1 and Cylc2 homologs were identified in C. elegans for example. We added a schematic to the introduction showing the protein structure of human, mouse and C. elegans CYLC1 and CYLC2 (Figure 1 – supplement 1).

      The section reads now as follows:

      Line 73-78: In most species, two Cylicin genes, Cylc1 and Cylc2, have been identified (Figure 1- supplement 1). They are characterized by repetitive lysine-lysine-aspartic acid (KKD) and lysine-lysine-glutamic acid (KKE) peptide motifs, resulting in an isoelectric point (IEP) > pH 10 14, 15. Repeating units of up to 41 amino acids in the central part of the molecules stand out by a predicted tendency to form individual short α-helices 14. Mammalian Cylicins exhibit similar protein and domain characteristics, but CYLC2 has a much shorter amino-terminal portion than CYLC1 (Figure 1-supplement 1).

      (23) The whole chapter "Cylc2 coding sequence is slightly more conserved among species than Cylc1" references only supplemental figures/tables. I find this unusual.

      We agree, and in order to show the results of the evolutionary analysis more clearly, we moved the panel to main Figure 6.

      Line 286-302: To address why Cylc2 deficiency causes more severe phenotypic alterations than Cylc1deficiency in mice, we performed evolutionary analysis of both genes. Analysis of the selective constrains on Cylc1 and Cylc2 across rodents and primates revealed that both genes’ coding sequences are conserved in general, although conservation is weaker in Cylc1 trending towards a more relaxed constraint (Fig. 6 A). A model allowing for separate calculation of the evolutionary rate for primates and rodents, did not detect a significant difference between the clades, neither for Cylc1 nor for Cylc2, indicating that the sequences are equally conserved in both clades.

      To analyze the selective pressure across the coding sequence in more detail, we calculated the evolutionary rates for each codon site across the whole tree. According to the analysis, 34% of codon sites were conserved, 51% under relaxed selective constraint, and 15% positively selected. For Cylc2, 47% of codon sites conserved, 44% under neutral/relaxed constraint, and 9% positively selected. Interestingly, codon sites encoding lysine residues, which are proposed to be of functional importance for Cylicins, are mostly conserved. For Cylc1, 17% of lysine residues are significantly conserved and 35% of significantly conserved codons encode for lysine. For Cylc2, this pattern is even more pronounced with 27.9% of lysine codons being significantly conserved and 24.3% of all conserved sites encoding for lysine (Fig. 6 B).

      (24) Line 332 - CYCL2 should be CYLC2

      We corrected the typo accordingly.

      (25) Line 340 The ratio of head defects is different from Table 1 (98% here and 99 % in the table). Please check this information.

      We apologize for the inconsistency. We checked the raw data and corrected the table accordingly.

      (26) Line 344-345 From figure 5C it is difficult to determine whether the sperm are "headless" or whether the heads are attached to the highly coiled tails next to them

      We agree and we quantified the percentage of sperm showing abnormal flagella and a headless phenotype. Furthermore, we added an arrowhead to figure 6C to highlight headless sperm. The paragraph reads now as follows:

      Line 335-339: Bright field microscopy demonstrated that M2270’s sperm flagella coiled in a similar manner compared to flagella of sperm from Cylicin deficient mice. Quantification revealed 57% of M2270 sperm displaying abnormal flagella compared to 6% in the healthy donor (Fig. 7 D). Interestingly, DAPI staining revealed that 27% of M2270 flagella carry cytoplasmatic bodies without nuclei and could be defined as headless spermatozoa (Fig. 7 C, white arrowhead; Fig. 7 E).

      (27) L367-368 I agree with the authors' logic of this sentence. Although, it is better to show the co-localization of proteins using multi-channel immunocytochemistry. As you mentioned on L369 this will make your finding more obvious. If it is available, please include the colocalization images of the proteins.

      We performed the multi-channel staining against Cylicin1 and Calicin, as well as Cylicin2 and Calicin on mouse epipidymal sperm and it’s shown in Figure 2 – supplement 4. Unfortunately, we did not manage to obtain stainings of tissue sections since antibodies against Cylicins and Calicin require different sample processing.

      The sentence was added in the section describing calyx integrity:

      Line 168-169: In epididymal sperm, CCIN co-localizes with both CYLC1 and CYLC2 in the calyx (Figure 2 – supplement 4).

      (28) Line 376 Please keep the abbreviation. "Calicin" "CCIN".

      We included the abbreviation accordingly.

      Line 377-378: CCIN is shown to be necessary for the IAM-PT-NE complex by establishing bidirectional connections with other PT proteins.

      (29) Line 377-378 "Based on ~". The authors did not prove the interaction between CCIN and Cylicins in this article. The mislocalization of CCIN might be resulted in the loss of Cylicins, without any "interaction". To reach this conclusion, a more direct result should be provided.

      We agree that we overinterpreted this as we and others did not prove the interaction between CCIN and Cylicins so far. We therefore weakened this statement and formulated it as a hypothesis.

      Line 377-381: CCIN is shown to be necessary for the IAM-PT-NE complex by establishing bidirectional connections with other PT proteins. Zhang et al. found CYLC1 to be among proteins enriched in PT fraction 7. Based on their speculation that CCIN is the main organizer of the PT, we hypothesize that both CCIN and Cylicins might interact, either directly or in a complex with other proteins, in order to provide the ‘molecular glue’ necessary for the acrosome anchoring.

      (30) Line 499 Please specify which is the target of the immunostaining on the Figure legend. (Tubulin à acetylated α-tubulin)

      We specified that α-Tubulin was stained. The figure legend reads now as follow: Line 555-557: Immunofluorescence staining of α-Tubulin to visualize manchette structure in squash testis samples of WT, Cylc1-/y, Cylc2+/-, Cylc2-/-, Cylc1 -/y Cylc2+/- and Cylc1-/y Cylc2-/- mice.

      (31) Line 502 Please specify which color indicates which target protein (not only cellular structure).

      Line 560-562: Co-staining of the manchette with HOOK1 (red) and acrosome with PNA-lectin (green) is shown in round, elongating and elongated spermatids of WT (upper panel) and Cylc2-/- mice (lower panel).

      (32) Line 509 Please include scale bar information in the figure legend like Figure 4 (The magnifications of Figure 5 B, C, and D seem different).

      We included the scale bar information accordingly (now Figure 6).

      Line 575-588: Figure 6: Cylicins are required for human male fertility

      (A) Pedigree of patient M2270. His father (M2270_F) is carrier of the heterozygous CYLC2 variant c.551G>A and his mother (M2270_M) carries the X-linked CYLC1 variant c.1720G>C in a heterozygous state. Asterisks (*) indicate the location of the variants in CYLC1 and CYLC2 within the electropherograms.

      (B) Immunofluorescence staining of CYLC1 in spermatozoa from healthy donor and patient M2270. In donor’s sperm cells CYLC1 localizes in the calyx, while patient’s sperm cells are completely missing the signal. Scale bar: 5 µm.

      (C) Bright field images of the spermatozoa from healthy donor and M2270 show visible head and tail anomalies, coiling of the flagellum as well as headless spermatozoa who carry cytoplasmatic residues without nuclei. Heads were counterstained with DAPI. Scale bar: 5 µm.

      (D-E) Quantification of flagellum integrity (D) and headless sperm (E) in the semen of patient M2270 and a helathy donor.

      (F-G) Immunofluorescence staining of CCIN (F) and PLCz (G) in sperm cells of patient M2270 and a healthy donor. Nuclei were counterstained with DAPI. Scale bar: 3 µm.

      (33) S2A is not clear. Please describe specifically what the left panel and right panel are about to show with a clear indication of what is PM, mitochondria, etc. On the right, in only one cross-section that shows both mitochondria and the 9+2 axoneme, they look both unaltered whereas on the left, there are unpacked, not aligned mitochondria but the tail boundary is not clear to grasp at first sight.

      We apologize for the bad quality of the TEM pictures showing the axonemes and the missing labeling. We recorded and included new images showing an intact 9+2 microtubular structure in Cylc2-/-. Furthermore, we added an image for the wildtype control.

      (34) S2D: colors of the last three plots of each graph are too close to tell apart

      We agree and changed the color scheme for better visualization.

      Reviewer #2 (Recommendations For The Authors):

      However, I find the manuscript a bit messy, and I will propose to reorganize the figures: following figure 1, showing the reproductive phenotype, I would continue with a figure showing the morphology of sperm in optical microscopy and showing the morphological defect of the nucleus (Fig 3B and 3E), followed with one figure focusing on the flagellum, with images obtained with optical and electronic microscopies, allowing to present the abnormalities of the flagellum and finally the impact on sperm motility and flagellum beating (mix of figure 2FG/3A); next, one figure focusing on acrosome. After that, I would present all data concerning spermiogenesis, starting with figure 2C.

      We thank the reviewer for the valuable suggestion, which helps a lot to improve the structure and comprehensibility of the manuscript. We re-organized the figures and the results section accordingly.

      Major remarks

      1) Line 111. The specificity of raised Ab is not clear. Please specify if antibodies are specific: what immune-decorates anti-CYLC1: only CYLC1 or CYLC1 and CYLC2. Same question for anti-CYLC2

      Both antibodies were raised against specific peptides of the CYLC1 or CYLC2 protein, respectively. The antigen peptides used for immunization are depicted in the Material and Methods section (AESRKSKNDERRKTLKIKFRGK and KDAKKEGKKKGKRESRKKR peptides for CYLC1; KSVGTHKSLASEKTKKEVK and ESGGEKAGSKKEAKDDKKDA for CYLC2). The peptides used for immunization are specific as they do not resemble the highly conserved and repetitive KKD/KKE motives present in both, Cylc1 and Cylc2.

      The specificity of raised antibodies was validated by IF staining of wildype and Cylicin-deficient testis sections. The results clearly show, that CYLC1 signal is absent in Cylc1-deficient spermatids and CYLC2 signal being absent in Cylc2 deficient spermatids.

      Specificity of antibodies was additionally proven by immunohistochemical stainings, showing a specific staining only in testis sections but not in any other organ tested.

      Line 115-117: Specificity of antibodies was proven by immunohistochemical stainings, showing a specific staining only in testis sections but not in any other organ tested (Figure 1 - supplement 2)

      To further verify the specificity of generated antibodies and the generation of functional knockout alleles, we additionally performed Western Blot analysis on cytoskeletal protein fractions, confirming the results of the IF staining.

      The paragraph reads now as follows:

      Line 127-134: Additionally, Western Blot analyses confirmed the absence of CYLC1 and CYLC2 in cytoskeletal protein fractions of the respective knockout (Fig. 1 G), thereby demonstrating i) specificity of the antibodies and ii) validating the gene knockout. Of note, the CYLC1 antibody detects a double band at 40-45 KDa. This is smaller than the predicted size of 74 KDa as, but both bands were absent in Cylc1-/y. Similarly, the CYLC2 Antibody detected a double band at 38-40 KDa instead of 66 KDa. Again, both bands were absent in Cylc2-/-. Next, Mass spectrometry analysis of cytoskeletal protein fraction of mature spermatozoa was performed detecting both proteins in WT but not in the respective knockout samples (Figure 1 – supplement 5; Figure 1 – supplement 6).

      2) Line 115 and figure 1D. From the images presented in figure 1D, it is not clear where CYLC1 and CYLC2 are localized in the round and in elongated spermatids. Please make double staining using a second Ab to identify the acrosome such as DPY19L2 (best option) or SP56 and the manchette such as acetylated alpha-tubulin.

      We agree, and we added a double staining of CYLC1/CYLC2 and SP56 to the supplement (Figure 1 - supplement 3), showing co-localization of the developing acrosome and Cylicins. Manchette staining was not performed due to antibodies being available in same species as those against Cylicins (anti-rabbit).

      Line 117-120: Immunofluorescence staining of wildtype testicular tissue showed presence of both, CYLC1 and CYLC2 from the round spermatid stage onward (Fig. 1 E, Figure 1 – supplement 3). The signal was first detectable in the subacrosomal region as a cap like structure, lining the developing acrosome (Fig. 1 E-F, Figure 1 – supplement 3).

      3) Line 118 and figure 1. The drawing showing the localization of Cylicin in mature sperm does not fit with the experimental data. Cylicins are located on the whole ventral face of the sperm.

      We agree and apologize for the inconsistency. The illustration was adjusted according to the experimental data showing localization of Cylicins in the whole ventral part of the sperm.

      4) Figure 1: Change "expression of Cylicin" to "localization of cylicin" (green)

      We changed the legend accordingly.

      5) Line 124 and figure 2C. In the figure provided, the PAS staining seems defective. The acrosomes do not seem stained (in pink as expected for a PAS staining). It may be due to the low quality of the pdf file, nevertheless, it is important to provide in supplementary data, an enlargement of the spermatid region showing the staining of the acrosome.

      We apologize for the bad quality of the PDF file and the low magnification. We restructured the subfigure showing PAS stained spermatids at different steps of spermiogenesis at higher magnification. According to the initial reviewer’s suggestion, the PAS staining was moved to figure 4. The PAS staining in figure 2 was replaced by HE-stained overview testis sections in Figure 3 – supplement 1 showing intact spermatogenesis in all genotypes.

      6) Line 130. Please indicate a reference for the lower limit of 58%. If this lower limit corresponds to human sperm, it should be omitted.

      Indeed, the given reference limit of 58% is only valid for human sperm samples. Therefore, we omitted the reference limit. The paragraph reads now as follows: Line 144-146: Eosin-Nigrosin staining revealed that the viability of epididymal sperm from all genotypes was not severely affected (Fig. 2 D, Figure 2 – supplement 2).

      7) line 152 Sperm morphology. Before showing the ultrastructure of the sperm, it would be important to show sperm morphology observed by optical microscopy. Therefore, I recommend including figure S2 as a principal figure, with a mix of Figures 3B and 3E.

      We thank the reviewer for the suggestion. The results section was re-structured accordingly, first showing results of optical microscopy (Fig. 2), followed by an in-depth ultrastructural investigation of morphological defects and their effects on sperm motility. Brightfield images of epididymal sperm were moved from former Figure S2 to main Figure 2.

      8) Line 164. figure S2A, showing that the 9+2 pattern is normal in KO sperm, is not convincing. Enlargement is required to conclude that the axoneme structure is normal; from the pictures, it rather seems that some doublets are missing.

      We apologize for the bad quality of the TEM pictures showing the axonemes. We recorded and included new images showing an intact 9+2 microtubular structure.

      9) Line 196. I would suggest removing the term "mild globozoospermia". Globozoospermia is rather complete (100% of round sperm heads) or incomplete (<90 % of round sperm heads). The anomalies observed on sperm heads, sperm motility, and the decrease in sperm concentration are rather suggestive of an OAT.

      We agree and we omitted the term “mild globozoospermia”. Instead, we added a concluding remark to the section, summarizing the described defects as OAT syndrome. The section reads now as follows:

      Line 215-217: Taken together, observed anomalies of sperm heads, impaired sperm motility, and the decrease in epididymal sperm concentration show that Cylc deficiency results in a severe OAT phenotype (Oligo-Astheno-Teratozoospermia-syndrome) described in human.

      10) Line 248. It is not clear from the data of figure 4B that "the developing acrosome lost its compact adherence to the nuclear envelope". From this figure, only defective morphologies of the acrosome are observed

      We agree and we omitted the sentence. Furthermore, it does not add additional information to the manuscript, since defects in the attachment of the acrosome to the nuclear envelope have been described in detail in Figure 4C.

      11) line 260-264. Manchette defects appear at stages 9-10. At this stage, the HTCA is already attached to the nucleus of the spermatid. see for instance figure 2 from Shang Y, Zhu F, Wang L, Ouyang YC, Dong MZ, Liu C, Zhao H, Cui X, Ma D, Zhang Z, Yang X, Guo Y, Liu F, Yuan L, Gao F, Guo X, Sun QY, Cao Y, Li W. Essential role for SUN5 in anchoring sperm head to the tail. Elife. 2017 Sep 25;6:e28199. doi: 10.7554/eLife.28199 . Therefore, the hypothesis that "abnormal attachment of the developing flagellum to the basal plate and consequently flipping of the head and coiling of the tail in mature spermatozoa" is unlikely and I suggest modifying this paragraph. In the HOOK paper, the manchette defects occurred earlier.

      We read the suggested literature and we agree to this reviewer’s comment. Manchette defects that we observe appear at later stages and are probably not responsible for the morphological anomalies of the mature sperm. We also re-analyzed all the manchette staining pictures and didn’t find any defects at earlier stages, so we decided to delete the sentence from the manuscript.

      12) Line 344. Please indicate a percentage of headless spermatozoa. Many sperm is too vague.

      We agree and we quantified the percentage of sperm showing abnormal flagella and a headless phenotype. The paragraph reads now as follows:

      Line 335-339: Bright field microscopy demonstrated that M2270’s sperm flagella coiled in a similar manner compared to flagella of sperm from Cylicin deficient mice. Quantification revealed 57% of M2270 sperm displaying abnormal flagella compared to 6% in the healthy donor (Fig. 7 D). Interestingly, DAPI staining revealed that 27% of M2270 flagella carry cytoplasmatic bodies without nuclei and could be defined as headless spermatozoa (Fig. 7 C, white arrowhead; Fig. 7 E).

      13) Any data concerning the success of ICSI for this patient?

      Yes, the outcome of the ICSI were added to the main text. Line 309-311: The couple underwent one ICSI procedure which resulted in 17 fertilized oocytes out of 18 retrieved. Three cryo-single embryo transfers were performed in spontaneous cycles, but no pregnancy was achieved.

      14) Finally, it would be interesting to study the localization of PLCzeta in this model, since its localization in the perinuclear theca has been clearly shown (Escoffier et al, 2015 doi:10.1093/molehr/gau098 )

      We thank the reviewer for the valuable suggestion and performed PLCzeta staining on human sperm, clearly showing an irregular PT staining pattern in sperm of patient M2270 compared to healthy control sperm. Of note, staining was not possible in the mouse due to the antibody being reactive only for human samples.

      The section reads as follows:

      Line 343-349: Testis specific phospholipase C zeta 1 (PLCζ1) is localized in the postacrosomal region of PT in mammalian sperm (Yoon and Fissore, 2007) and has a role in generating calcium (Ca²⁺) oscillations that are necessary for oocyte activation (Yoon, 2008). Staining of healthy donor’s spermatozoa showed a previously described localization of PLCζ1 in the calyx, while sperm from M2270 patient presents signal irregularly through the PT surrounding sperm heads (Fig. 7 G). These results suggest that Cylicin deficiency can cause severe disruption of PT in human sperm as well, causing male infertility.

      Reviewer #3 (Recommendations For The Authors):

      1) Why the Cylc1-/y Cylc2+/- males were infertile? It would be helpful to show the homologue of the two proteins;

      To elaborate more on the homology of CYLC1 and CYLC2, we added a more detailed section about the protein and domain structure to the introduction.

      Line 73-78: In most species, two Cylicin genes, Cylc1 and Cylc2, have been identified (Figure 1supplement 1). They are characterized by repetitive lysine-lysine-aspartic acid (KKD) and lysine-lysineglutamic acid (KKE) peptide motifs, resulting in an isoelectric point (IEP) > pH 10 14, 15. Repeating units of up to 41 amino acids in the central part of the molecules stand out by a predicted tendency to form individual short α-helices (Hess et al., 1993). Mammalian Cylicins exhibit similar protein and domain characteristics, but CYLC2 has a much shorter amino-terminal portion than CYLC1 (Figure 1supplement 1).

      Speculations about the infertility of Cylc1-/y Cylc2+/- males was added to the discussion:

      Line 410-413: Interestingly, Cylc1-/Y Cylc2+/- males displayed an “intermediate” phenotype, showing slightly less damaged sperm than Cylc2-/- and Cylc1-/Y Cylc2-/- animals. This further supports our notion, that loss of the less conserved Cylc1 gene might be at least partially compensated by the remaining Cylc2 allele.

      2) Western blot is important to show the absence of the two proteins in the mouse models;

      To further verify the specificity of generated antibodies and the generation of functional knockout alleles, we additionally performed Western Blot analysis on cytoskeletal protein fractions, confirming the results of the IF staining.

      A paragraph was added to the manuscript and reads as follows:

      Line 127-134: Additionally, Western Blot analyses confirmed the absence of CYLC1 and CYLC2 in cytoskeletal protein fractions of the respective knockout (Fig. 1 G), thereby demonstrating i) specificity of the antibodies and ii) validating the gene knockout. Of note, the CYLC1 antibody detects a double band at 40-45 KDa. This is smaller than the predicted size of 74 KDa as, but both bands were absent in Cylc1-/y. Similarly, the CYLC2 Antibody detected a double band at 38-40 KDa instead of 66 KDa. Again, both bands were absent in Cylc2-/-. Next, Mass spectrometry analysis of cytoskeletal protein fraction of mature spermatozoa was performed detecting both proteins in WT but not in the respective knockout samples (Figure 1 – supplement 5; Figure 1 – supplement 6).

      3) On Page 7, line 227 and line 243, was the acetylated α-tubulin or α-tubulin antibody used?

      For all stainings α-tubulin antibody was used. We corrected this accordingly. Line 257-259: We used immunofluorescence staining of α-tubulin on squash testis samples containing spermatids at different stages of spermiogenesis to investigate whether the altered head shape, calyx structure, and tail-head connection anomalies originate from possible defects of the manchette structure.

      4) Fig. 2S: A cartoon showing the elongation and circularity of nuclei for evaluation is helpful; The TEM images from the control and Cylc1 KO mice are needed;

      Cylc1-/Y TEM picture was added in Figure 3A.

      5) The discussion should be rewritten. The current version is to repeat the experiments/findings. The authors should discuss more about the potential mechanisms.

      We discussed the observed defects of Cylc-deficient animals and discussed this in relation to other published mouse models deficient in Calyx components. Furthermore, we speculated about potential interaction partners of Cylicins and the importance of these protein complexes for male fertility. However, to this point, we think that it is too farfetched to speculate about potential mechanisms without any evidence for Cylc interaction partner or their exact molecular function. This requires further research.

    1. Author Response

      The following is the authors’ response to the original reviews.

      eLife assessment

      This useful manuscript challenges the utility of current paradigms for estimating brain-age with magnetic resonance imaging measures, but presents inadequate evidence to support the suggestion that an alternative approach focused on predicting cognition is more useful. The paper would benefit from a clearer explication of the methods and a more critical evaluation of the conceptual basis of the different models. This work will be of interest to researchers working on brain-age and related models.

      Response: Thank you so much for providing high-quality reviews on our manuscript. We revised the manuscript to address all of the reviewers’ comments and provided full responses to each of the comments below.

      Briefly, regarding clearer explanations of the methods, we added additional analyses (e.g., commonality analyses on ridge regression and on multiple regressions with a quadratic term for chronological age) to address some of the concerns and additional details in text and figures to ensure that the reader can fully understand our methodological procedures. Regarding the critical evaluation of the conceptual basis of the different models, we added discussions to help with interpretations and the scope of the generalisability of our findings. For instance, as opposed to treating Brain Cognition and Brain Age as separate biomarkers and comparing them in the ability to explain fluid cognition, we now treated the capability of Brain Cognition in capturing fluid cognition as the upper limit of Brain Age’s capability in capturing fluid cognition. In other words, we now examined the extent to which Brain Age missed the variation in the brain MRI that could explain fluid cognition (for this particular issue, please see our response to Reviewer 3 Public Review #4).

      Reviewer 1:

      This is a reasonably good paper and the use of a commonality analysis is a nice contribution to understanding variance partitioning across different covariates. I have some comments that I believe the authors ought to address which mostly relate to clarity and interpretation.

      Reviewer 1 Public Review #1:

      First, from a conceptual point of view, the authors focus exclusively on cognition as a downstream outcome. I would suggest the authors nuance their discussion to provide broader considerations of the utility of their method and on the limits of interpretation of brain-age models more generally. Further, I think that since brain-age models by construction confound relevant biological variation with the accuracy of the regression models used to estimate them, there may be limits to the interpretation of (e.g.) the brain-age gap is as a dimensionless biomarker. This has also been discussed elsewhere (see e.g. https://academic.oup.com/brain/article/143/7/2312/5863667). I would suggest that the authors consider and comment on these issues.

      Response: Thank you Reviewer 1 for pointing out these important issues. We addressed them in our response to Reviewer 1 Recommendations For The Authors #1 (see below).

      Reviewer 1 Public Review #2

      Second, from a methods perspective, there is not a sufficient explanation of the methodological procedures in the current manuscript to fully understand how the stacked regression models were constructed. Stacked models can be prone to overfitting when combined with cross-validation. This is because the predictions from the first-level models (i.e. the features that are provided to the second level 'stacked' models) contain information about the training set and the test set. If cross-validation is not done very carefully (e.g. using multiple hold-out sets), information leakage can easily occur at the second level. Unfortunately, there is not a sufficient explanation of the methodological procedures in the current manuscript to fully understand what was actually done. Please provide more information to enable the reader to better understand the stacked regression models. If the authors are not using an approach that fully preserves training and test separability, they need to do so.

      Response: Thank you Reviewer 1. We addressed this issue in our response to Reviewer 1 Recommendations For The Authors #2 (see below). Briefly, we now made it clearer that training models for both non-stacked and stacked models did not involve the test set, ensuring that there was no data leakage between training and test sets.

      Reviewer 1 Public Review #3

      Please also provide an indication of the different regression strengths that were estimated across the different models and cross-validation splits. Also, how stable were the weights across splits?

      Response: Thank you Reviewer 1. We addressed this issue in our response to Reviewer 1 Recommendations For The Authors #3 (see below).

      Reviewer 1 Public Review #4:

      Please provide more details about the task designs, MRI processing procedures that were employed on this sample in addition to the regression methods, and bias-correction methods used. For example, there are several different parameterisations of the elastic net, please provide equations to describe the method used here so that readers can easily determine how the regularisation parameters should be interpreted.

      Response: Thank you Reviewer 1. We addressed this issue in our response to Reviewer 1 Recommendations For The Authors #5-#6. Briefly, we followed your advice and add all of the suggested details.

      Reviewer 2 (Public Review):

      Reviewer 2 Public Review Overall:

      In this study, the authors aimed to evaluate the contribution of brain-age indices in capturing variance in cognitive decline and proposed an alternative index, brain-cognition, for consideration. The study employs suitable data and methods, albeit with some limitations, to address the research questions. A more detailed discussion of methodological limitations in relation to the study's aims is required. For instance, the current commonality analysis may not sufficiently address potential multicollinearity issues, which could confound the findings. Importantly, given that the study did not provide external validation for the indices, it is unclear how well the models would perform and generalize to other samples. This is particularly relevant to their novel index, brain-cognition, given that brain-age has been validated extensively elsewhere. In addition, the paper's rationale for using elastic net, which references previous fMRI studies, seemed somewhat unclear. The discussion could be more nuanced and certain conclusions appear speculative.

      Response Thank you for your encouragement. We have now added discussion of methodological limitations (see below). Regarding potential multicollinearity issues, we addressed this comment using Ridge regressions (see our response to Reviewer 2 Recommendations For The Authors #2). Regarding external validation, we now added discussions about how consistency between our results and several recent studies that investigated similar issues with Brain Age in different populations (see Reviewer 2 Recommendations For The Authors #1). Regarding Brain Cognition, we also added previous studies showing similarly high prediction for cognition functioning (Dubois et al., 2018; Pat, Wang, Anney, et al., 2022; Rasero et al., 2021; Sripada et al., 2020; Tetereva et al., 2022; for review, see Vieira et al., 2022). We added a discussion about Elastic Net (see Reviewer 1 Recommendations For The Authors #6)

      Discussion

      “There are several potential limitations of this study. First, we conducted an investigation relying only on one dataset, the Human Connectome Project in Aging (HCP-A) (Bookheimer et al., 2019). While HCP-A used state-of-the-art MRI methodologies, covered a wide age range from 36 to 100 years old and used several task-fMRI from different tasks that are harder to find in other bigger databases (e.g., UK Biobank from Sudlow et al., 2015), several characteristics of HCP-A might limit the generalisability of our findings. For instance, the tasks used in task-based fMRI in HCP-A are not used widely in clinical settings (Horien et al., 2020). This might make it challenging to translate the approaches used here. Similarly, HCP-A also excluded participants with neurological conditions, possibly making their participants not representative of the general population. Next, while HCP-A’s sample size is not small (n=725 and 504 people, before and after exclusion, respectively), other datasets provide a much larger sample size (Horien et al., 2020). Similarly, HCP-A does not include younger populations. But as mentioned above, a study with a larger sample in older adults (Cole, 2020) and studies in younger populations (8-22 years old) (Butler et al., 2021; Jirsaraie, Kaufmann, et al., 2023) also found small effects of the adjusted Brain Age Gap in explaining cognitive functioning. And the disagreement between the predictive performance of age-prediction models and the utility of Brain Age found here is largely in line with the findings across different phenotypes seen in a recent systematic review (Jirsaraie, Gorelik, et al., 2023).”

      Reviewer 2 Public Review #1:

      The authors aimed to evaluate how brain-age and brain-cognition indices capture cognitive decline (as mentioned in their title) but did not employ longitudinal data, essential for calculating 'decline'. As a result, 'cognition-fluid' should not be used interchangeably with 'cognitive decline,' which is inappropriate in this context.

      Response Thank you for raising this issue. We now no longer used the word ‘cognitive decline’.

      Reviewer 2 Public Review #2:

      In their first aim, the authors compared the contributions of brain-age and chronological age in explaining variance in cognition-fluid. Results revealed much smaller effect sizes for brain-age indices compared to the large effects for chronological age. While this comparison is noteworthy, it highlights a well-known fact: chronological age is a strong predictor of disease and mortality. Has the brain-age literature systematically overlooked this effect? If so, please provide relevant examples. They conclude that due to the smaller effect size, brain-age may lack clinical significance, for instance, in associations with neurodegenerative disorders. However, caution is required when speculating on what brain-age may fail to predict in the absence of direct empirical testing. This conclusion also overlooks extant brain-age literature: although effect sizes vary across psychiatric and neurological disorders, brain-age has demonstrated significant effects beyond those driven by chronological age, supporting its utility.

      Response For aim 1, we focused our claims on cognitive functioning and not on any clinical significance for neurodegenerative disorders. We now made it clearer that the small effects of the Corrected Brain Age Gap in explaining fluid cognition of aging individuals found here are consistent with a study with a larger sample in older adults (Cole, 2020) and studies in younger populations (8-22 years old) (Butler et al., 2021; Jirsaraie, Kaufmann, et al., 2023).

      We believe this issue of the utility of brain age on cognitive functioning vs neurological/psychological disorders requires another consideration, namely the discrepancy in the training and test samples typically used for studies focusing on neurological/psychological disorders. We made this point in the discussion now (see below).

      Discussion

      “There is a notable difference between studies investigating the utility of Brain Age in explaining cognitive functioning, including ours and others (e.g., Butler et al., 2021; Cole, 2020, 2020; Jirsaraie, Kaufmann, et al., 2023) and those explaining neurological/psychological disorders (e.g., Bashyam et al., 2020; Rokicki et al., 2021). That is, those Brain Age studies focusing on neurological/psychological disorders often build age-prediction models from MRI data of largely healthy participants (e.g., controls in a case-control design or large samples in a population-based design), apply the built age-prediction models to participants without vs. with neurological/psychological disorders and compare Brain Age indices between the two groups. This means that age-prediction models from Brain Age studies focusing on neurological/psychological disorders might be under-fitted when applied to participants with neurological/psychological disorders because they were built from largely healthy participants. And thus the difference in Brain Age indices between participants without vs. with neurological/psychological disorders might be confounded by the under-fitted age-prediction models (i.e., Brain Age may predict chronological age well for the controls, but not for those with a disorder). On the contrary, our study and other Brain Age studies focusing on cognitive functioning often build age-prediction models from MRI data of largely healthy participants and apply the built age-prediction models to participants who are also largely healthy. Accordingly, the age-prediction models for explaining cognitive functioning do not suffer from being under-fitted. We consider this as a strength, not a weakness of our study.”

      Reviewer 2 Public Review #3:

      The second aim's results reveal a discrepancy between the accuracy of their brain-age models in estimating age and the brain-age's capacity to explain variance in cognition-fluid. The authors suggest that if the ultimate goal is to capture cognitive variance, brain-age predictive models should be optimized to predict this target variable rather than age. While this finding is important and noteworthy, additional analyses are needed to eliminate potential confounding factors, such as correlated noise between the data and cognitive outcome, overfitting, or the inclusion of non-healthy participants in the sample. Optimizing brain-age models to predict the target variable instead of age could ultimately shift the focus away from the brain-age paradigm, as it might optimize for a factor differing from age.

      Response We discussed the issue regarding the discrepancy between the accuracy of their brain-age models in estimating age and the brain-age's capacity to explain variance in fluid cognition in our response to Reviewer 3 Public Review #9 (see below). This issue is found to be widespread in a recent systematic review (Jirsaraie, Gorelik, et al., 2023). We now provided several strategies to mitigate this issue to improve the utility of Brain Age in explaining other phenotypes based on our current work and others, using different MRI modalities as well as modelling techniques (Bashyam et al., 2020; Jirsaraie, Kaufmann, et al., 2023; Rokicki et al., 2021).

      Regarding potential confounding factors, we are not sure what the reviewer meant by “correlated noise between the data and cognitive outcome”. The current study, for instance, used ICA-FIX (Glasser et al., 2016) to remove noise in functional MRI. It is unclear how much ‘noise’ is still left and might confound our findings. More importantly, we are not sure how to define ‘noise’ as referred to by Reviewer 2 here. As for overfitting, we used nested cross-validation to ensure that training and test sets were separate from each other (see Reviewer 1 Recommendations For The Authors #2). If overfitting happened as suggested, we should see a ‘lower’ predictive performance of age-prediction and cognitive-prediction models since the models would fit well with the training set but would not generalise well to the test set. This is not what we found. The predictive performance of our age-prediction and cognitive-prediction models was high and consistent with the literature. Regarding the inclusion of non-healthy participants in the sample, we discussed this above in our response to Reviewer 2 Public Review #2).

      Reviewer 2 Public Review #4:

      While a primary goal in biomarker research is to obtain indices that effectively explain variance in the outcome variable of interest, thus favouring models optimized for this purpose, the authors' conclusion overlooks the potential value of 'generic/indirect' models, despite sacrificing some additional explained variance provided by ad-hoc or 'specific/direct' models. In this context, we could consider brain-age as a 'generic' index due to its robust out-of-sample validity and significant associations across various health outcome variables reported in the literature. In contrast, the brain-cognition index proposed in this study is presumed to be 'specific' as, without out-of-sample performance metrics and testing with different outcome variables (e.g., neurodegenerative disease), it remains uncertain whether the reported effect would generalize beyond predicting cognition-fluid, the same variable used to condition the brain-cognition model in this study. A 'generic' index like brain-age enables comparability across different applications based on a common benchmark (rather than numerous specific models) and can support explanatory hypotheses (e.g., "accelerated ageing") since it is grounded in its own biological hypothesis. Generic and specific indices are not mutually exclusive; instead, they may offer complementary information. Their respective utility may depend heavily on the context and research or clinical question.

      Response Thank you Reviewer 2 for pointing out this important issue. Reviewer 1 (Recommendations For The Authors #4) and Reviewer 3 (Public Review #4) bought up a similar issue. We agreed with Reviewer 2 that both 'specific/direct' index and Brain Age as a 'generic/indirect' index have merit in their own right. We made a discussion about this issue in our response to Reviewer 3 Public Review #4 (please see this response below).

      Briefly, in the revision, as opposed to treating Brain Cognition and Brain Age as separate biomarkers and comparing them, we treated the capability of Brain Cognition in capturing fluid cognition as the upper limit of Brain Age’s capability in capturing fluid cognition. In other words, we now examined the extent to which Brain Age missed the variation in the brain MRI that could explain fluid cognition. We also made a discussion about using our commonality approach to test for this missing variation in future work:

      Discussion

      “Finally, researchers should test how much Brain Age miss the variation in the brain MRI that could explain fluid cognition or other phenotypes of interest. As demonstrated here, one straightforward method is to build a prediction model using a phenotype of interest as the target (e.g., fluid cognition) and incorporate the predicted value of this model (e.g., Brain Cognition), along with Brain Age and chronological age, into a multiple regression for commonality analyses. The unique effect of this predicted value will inform the missing variation in the brain MRI from Brain Age. If this unique effect is large, then researchers might need to reconsider whether using Brain Age is appropriate for a particular phenotype of interest.”

      Reviewer 2 Public Review #5:

      The study's third aim was to evaluate the authors' new index, brain-cognition. The results and conclusions drawn appear similar: compared to brain-age, brain-cognition captures more variance in the outcome variable, cognition-fluid. However, greater context and discussion of limitations is required here. Given the nature of the input variables (a large proportion of models in the study were based on fMRI data using cognitive tasks), it is perhaps unsurprising that optimizing these features for cognition-fluid generates an index better at explaining variance in cognition-fluid than the same features used to predict age. In other words, it is expected that brain-cognition would outperform brain-age in explaining variance in cognition-fluid since the former was optimized for the same variable in the same sample, while brain-age was optimized for age. Consequently, it is unclear if potential overfitting issues may inflate the brain-cognition's performance. This may be more evident when the model's input features are the ones closely related to cognition, e.g., fMRI tasks. When features were less directly related to cognitive tasks, e.g., structural MRI, the effect sizes for brain-cognition were notably smaller (see 'Total Brain Volume' and 'Subcortical Volume' models in Figure 6). This observation raises an important feasibility issue that the authors do not consider. Given the low likelihood of having task-based fMRI data available in clinical settings (such as hospitals), estimating a brain-cognition index that yields the large effects discussed in the study may be challenged by data scarcity.

      Response Given the use of nested cross-validation, we do not consider the good predictive performance of Brain Cognition found here as overfitting. In fact, we found a similar level of predictive performance of Brain Cognition on another database with younger participants in the past (Tetereva et al., 2022). However, we agreed with Reviewer 2 that the prediction of fluid cognition might be driven by MRI modalities that are different from those that drive the prediction of chronological age. In our own work with other age groups, including young adults (Tetereva et al., 2022) and children (Pat, Wang, Anney, et al., 2022), cognitive functioning seems to be predicted well from task-based functional MRI. And Reviewer 2 is right that task-based fMRI is not commonly used in clinics, making it harder to translate our results. However, given our results, clinicians should be encouraged to use task-based fMRI if their goal is to predict cognitive functioning. Nevertheless, as suggested, we listed data scarcity as one of the limitations of our approach.

      Discussion “For instance, the tasks used in task-based fMRI in HCP-A are not used widely in clinical settings (Horien et al., 2020). This might make it challenging to translate the approaches used here.”

      Reviewer 2 Public Review #6:

      This study is valuable and likely to be useful in two main ways. First, it can spur further research aimed at disentangling the lack of correspondence reported between the accuracy of the brain-age model and the brain-age's capacity to explain variance in fluid cognitive ability. Second, the study may serve, at least in part, as an illustration of the potential pros and cons of using indices that are specific and directly related to the outcome variable versus those that are generic and only indirectly related.

      Response We are thankful for the encouragement. For the discrepancy between the predictive performance of age-prediction models and the utility of Brain Age indices as a biomarker for fluid cognition, we made a detailed discussion in our response to Reviewer 3 Public Review #9. More specifically, to ensure that readers can benefit from our findings, we made suggestions on how to ensure the utility of Brain Age indices as a biomarker for other phenotypes by drawing from our own strategy, as well as strategies used by Rokicki and colleagues (2021), Jirsaraie and colleagues (2023) and Bashyam and colleagues (2020).

      As for the pros and cons between generic vs specific biomarkers, we made a detailed discussion in our response to Reviewer 3 Public Review #4. We also made some suggestions on how to make use of the difference in the ability between generic vs specific biomarkers (see Reviewer 2 Public Review #4, above).

      Reviewer 2 Public Review #7:

      Overall, the authors effectively present a clear design and well-structured procedure; however, their work could have been enhanced by providing more context for both the brain-age and brain-cognition indices, including a discussion of key concepts in the brain-age paradigm, which acknowledges that chronological age strongly predicts negative health outcomes, but crucially, recognizes that ageing does not affect everyone uniformly. Capturing this deviation from a healthy norm of ageing is the key brain-age index. This lack of context was mirrored in the presentation of the four brain-age indices provided, as it does not refer to how these indices are used in practice. In fact, there is no mention of a more common way in which brain-age is implemented in statistical analyses, which involves the use of brain-age delta as the variable of interest, along with linear and non-linear terms of age as covariates. The latter is used to account for the regression-to-the-mean effect. The 'corrected brain-age delta' the authors use does not include a non-linear term, which perhaps is an additional reason (besides the one provided by the authors) as to why there may be small, but non-zero, common effects of both age and brain-age in the 'corrected brain-age delta' index commonality analysis. The context for brain-cognition was even more limited, with no reference to any existing literature that has explored direct brain-cognitive markers, such as brain-cognition.

      Response Regarding Brain Age and negative health outcomes, we addressed this in our response to Reviewer 1 Recommendations For The Authors #1 (see below). Briefly, we now discussed (1) the consistency between our findings on fluid cognition and other recent works on negative health outcomes, (2) the differences between Brain Age studies focusing on negative health outcomes vs. cognitive functioning and (3) suggested solutions to optimise the utility of brain age for both cognitive functioning and negative health outcomes.

      Regarding how Brain Age was used in practice, we addressed this in our response to Reviewer 3 Public Review #2 (see below). Our argument resonates Butler and colleagues’ (2021) suggestion that the common practice for Brain Age analysis should be re-evaluated: “The MBAG and performance on the complex cognition tasks were not associated (r =  .01, p = 0.71). These results indicate that the association between cognition and the BAG are driven by the association between age and cognitive performance. As such, it is critical that readers of past literature note whether or not age was controlled for when testing for effects on the BAG, as this has not always been common practice (e.g., Beheshti et al., 2018; Cole, Underwood, et al., 2017; Franke et al., 2015; Gaser et al., 2013; Liem et al., 2017; Nenadi c et al., 2017; Steffener et al., 2016). (p. 4097).”

      Importantly, we also implemented “brain-age delta as the variable of interest, along with linear and non-linear terms of age as covariates” in our additional analyses along with other implementations (see Reviewer 2 Recommendations For The Authors #3). Of particular note, we found that adding a non-linear term (i.e., a quadratic term for chronological age) barely changed the results of commonality analyses.

      We now wrote this paragraph to recommend how future research should implement Brain Age:

      Discussion

      “First, they have to be aware of the overlap in variation between Brain Age and chronological age and should focus on the contribution of Brain Age over and above chronological age. Using Brain Age Gap will not fix this. Butler and colleagues (2021) recently highlighted this point, “These results indicate that the association between cognition and the BAG are driven by the association between age and cognitive performance. As such, it is critical that readers of past literature note whether or not age was controlled for when testing for effects on the BAG, as this has not always been common practice (p. 4097).” Similar to their recommendation (Butler et al., 2021), we suggest future work focus on Corrected Brain Age Gap or, better, unique effects of Brain Age indices after controlling for chronological age in multiple regressions. In the case of fluid cognition, the unique effects might be too small to be clinically meaningful as shown here and previously (Butler et al., 2021; Jirsaraie, Kaufmann, et al., 2023). “

      Regarding brain cognition, we now expanded our explanation about Brain Cognition on how it might be relevant to Brain Age and on Brain Cognition’s predictive performance found previously.

      Introduction

      “Third and finally, certain variation in the brain MRI is related to fluid cognition, but to what extent does Brain Age not capture this variation? To estimate the variation in the brain MRI that is related to fluid cognition, we could build prediction models that directly predict fluid cognition (i.e., as opposed to chronological age) from brain MRI data. Previous studies found reasonable predictive performances of these cognition-prediction models, built from certain MRI modalities (Dubois et al., 2018; Pat, Wang, Anney, et al., 2022; Rasero et al., 2021; Sripada et al., 2020; Tetereva et al., 2022; for review, see Vieira et al., 2022). Analogous to Brain Age, we called the predicted values from these cognition-prediction models, Brain Cognition. The strength of an out-of-sample relationship between Brain Cognition and fluid cognition reflects variation in the brain MRI that is related to fluid cognition and, therefore, indicates the upper limit of Brain Age’s capability in capturing fluid cognition. Consequently, the unique effects of Brain Cognition that explain fluid cognition beyond Brain Age and chronological age indicate what is missing from Brain Age -- the amount of co-variation between brain MRI and fluid cognition that cannot be captured by Brain Age.”

      Discussion

      “Third, by introducing Brain Cognition, we showed the extent to which Brain Age indices were not able to capture the variation of brain MRI that is related to fluid cognition. Brain Cognition, from certain cognition-prediction models such as the stacked models, has relatively good predictive performance, consistent with previous studies (Dubois et al., 2018; Pat, Wang, Anney, et al., 2022; Rasero et al., 2021; Sripada et al., 2020; Tetereva et al., 2022; for review, see Vieira et al., 2022).”

      Reviewer 2 Public Review #8:

      While this paper delivers intriguing and thought-provoking results, it would benefit from recognizing the value that both approaches--brain-age indices and more direct, specific markers like brain-cognition--can contribute to the field.

      Response Thank you so much for recognising the value of our work. As we mentioned above in our response to Reviewer 2 Public Review #4 and #6, we made some suggestions on how to make use of the difference in the ability between generic vs specific biomarkers.

      Reviewer 3 (Public Review):

      Reviewer 3 Public Review Overall:

      The main question of this article is as follows: "To what extent does having information on brain-age improve our ability to capture declines in fluid cognition beyond knowing a person's chronological age?" While this question is worthwhile, considering that there is considerable confusion in the field about the nature of brain-age, the authors are currently missing an opportunity to convey the inevitability of their results, given how brain-age and the brain-age gap are calculated. They also argue that brain-cognition is somehow superior to brain-age, but insufficient evidence is provided in support of this claim.

      Response We addressed the concerns below. The inevitability of our results is not obvious to many researchers who might be interested in Brain Age. We hope our findings might make many issues surrounding Brain Age more obvious, and we now make many suggestions on how to address some of these issues. We no longer argue that Brain Cognition is superior to Brain Age (Reviewer 3 Public Review #4). Rather, we treated the capability of Brain Cognition in capturing fluid cognition as the upper limit of Brain Age’s capability in capturing fluid cognition. We used the unique effects of Brain Cognition that explain fluid cognition beyond Brain Age and chronological age to indicate how much Brain Age misses the variation in the brain MRI that could explain fluid cognition.

      Specific comments follow:

      Reviewer 3 Public Review #1:

      • "There are many adjustments proposed to correct for this estimation bias" (p3). Regression to the mean is not a sign of bias. Any decent loss function will result in over-predicting the age of younger individuals and under-predicting the age of older individuals. This is a direct result of minimizing an error term (e.g., mean squared error). Therefore, it is inappropriate to refer to regression to the mean as a sign of bias. This misconception has led to a great deal of inappropriate analyses, including "correcting" the brain age gap by regressing out age.

      Response: Thank you so much for raising this issue. We used the word ‘bias’ following many articles in the field. For instance,

      de Lange and Cole (2020) wrote: “brain-age estimation also involves a frequently observed bias: brain age is overestimated in younger subjects and underestimated in older subjects, while brain age for participants with an age closer to the mean age (of the training dataset) are predicted more accurately (Cole, Le, Kuplicki, McKinney, Yeh, Thompson, Paulus, Investigators, et al., 2018, Liang, Zhang, Niu, 2019, Niu, Zhang, Kounios, Liang, 2019, Smith, Vidaurre, Alfaro-Almagro, Nichols, Miller, 2019).”

      Cole (2020) wrote: “As recent research has highlighted a proportional bias in brain-age calculation, whereby the difference between chronological age and brain-predicted age is negatively correlated with chronological age (Le et al., 2018, Liang et al., 2019, Smith et al., 2019), an age-bias correction procedure was used. This entailed calculating the regression line between age (predictor) and brain-predicted age (outcome) in the training set, then using the slope (i.e., coefficient) and intercept of that line to adjust brain-predicted age values in the testing set (by subtracting the intercept and then dividing by the slope). After applying the age-bias correction the brain-predicted age difference (brain-PAD) was calculated; chronological age subtracted from brain-predicted age.”

      Beheshiti and colleagues (2019) used bias in their title: “Bias-adjustment in neuroimaging-based brain age frameworks: a robust scheme”

      More recently, Cumplido-Mayoral and colleagues (2023) wrote: “As recent research has shown that brain-age estimation involves a proportional bias (de Lange et al., 2020a; Le et al., 2018; Liang et al., 2019; Smith et al., 2019), we applied a well-established age-bias correction procedure to our data (de Lange et al., 2020a; Le et al., 2018).”

      Still, we agree with Reviewer 3 that using ‘bias’ might lead to misinterpretation. As Butler and colleagues (Butler et al., 2021) pointed out, ”It is important to note that regression toward the mean is not a failure, but a feature, of regression and related methods.“ We rewrote the paragraph and clarified the “regression towards the mean” issue. We no longer used the word “bias” here:

      Introduction

      “Note researchers often subtract chronological age from Brain Age, creating an index known as Brain Age Gap (Franke & Gaser, 2019). A higher value of Brain Age Gap is thought to reflect accelerated/premature aging. Yet, given that Brain Age Gap is calculated based on both Brain Age and chronological age, Brain Age Gap still depends on chronological age (Butler et al., 2021). If, for instance, Brain Age was based on prediction models with poor performance and made a prediction that everyone was 50 years old, individual differences in Brain Age Gap would then depend solely on chronological age (i.e., 50 minus chronological age). Moreover, Brain Age is known to demonstrate the “regression towards the mean” phenomenon (Stigler, 1997). More specifically, because Brain Age is a predicted value of a regression model that predicts chronological age, Brain Age is usually shrunk towards the mean age of samples used for training the model (Butler et al., 2021; de Lange & Cole, 2020; Le et al., 2018). Accordingly, Brain Age predicts chronological age more accurately for individuals who are closer to the mean age while overestimating younger individuals’ chronological age and underestimating older individuals’ chronological age. There are many adjustments proposed to correct for the age dependency, but the outcomes tend to be similar to each other (Beheshti et al., 2019; de Lange & Cole, 2020; Liang et al., 2019; Smith et al., 2019). These adjustments can be applied to Brain Age and Brain Age Gap, creating Corrected Brain Age and Corrected Brain Age Gap, respectively. Corrected Brain Age Gap in particular is viewed as being able to control for age dependency (Butler et al., 2021). Here, we tested the utility of different Brain Age calculations in capturing fluid cognition, over and above chronological age.”

      Reviewer 3 Public Review #2:

      • "Corrected Brain Age Gap in particular is viewed as being able to control for both age dependency and estimation biases (Butler et al., 2021)" (p3). This summary is not accurate as Butler and colleagues did not use the words "corrected" and "biases" in this context. All that authors say in that paper is that regressing out age from the brain age gap - which is referred to as the modified brain age gap (MBAG) - makes it so that the modified brain age gap is not dependent on age, which is true. This metric is meaningless, though, because it is the variance left over after regressing out age from residuals from a model that was predicting age. If it were not for the fact that regression on residuals is not equivalent to multiple regression (and out of sample estimates), MBAG would be a vector of zeros. Upon reading the Methods, I noticed that the authors use a metric from Le et al. (2018) for the "Corrected Brain Age Gap". If they cite the Butler et al. (2021) paper, I highly recommend sticking with the same notation, metrics and terminology throughout. That would greatly help with the interpretability of the present manuscript, and cross-comparisons between the two.

      Response: We thank Reviewer 3 for pointing out the issues surrounding our choices of wording: "corrected" and "biases". We share the same frustration with Reviewer 3 in that different brain-age articles use different terminologies, and we tried to make sure our readers understand our calculations of Brain Age indices in order to compare our results with previous work.

      We commented on the word “bias” in our response to Reviewer 3 Public Review #1 above and refrained from using this word in the revised manuscript. Here we commented on the use of the word “Corrected Brain Age Gap". And by doing so, we clarified how we calculated it.

      Reviewer 3 is right that we cited the work of Butler and colleagues (2021), but wasn’t accurate to say that we used “a metric from Le et al. (2018) for the "Corrected Brain Age Gap". We, instead, used a method described in de Lange and Cole’s (2020) work. We now added equations to explain this method in our Materials and Method section (see below).

      It is important to note that Butler and colleagues (2021) did not come up with any adjustment methods. Instead, Butler and colleagues (2021) discussed three adjustment methods:

      1) A method proposed by Beheshiti and colleagues (2019). Butler and colleagues (2021) called the result of this method, Modified Brain Age Gap (MBAG). Importantly, Butler and colleagues (2021) discouraged the use of this method due to “researchers misinterpreting the reduced variability of the MBAG as an improvement in prediction accuracy.” Accordingly in our article, we performed methods (2) and (3) below.

      2) A method proposed by de Lange and Cole (2020). We used this method in our article (see below for the equations). Briefly, we first fit a regression line predicting the Brain Age from a chronological age in each training set. We then used the slope and intercept of this regression line to adjust Brain Age in the corresponding test set, resulting in an adjusted index of Brain Age. Butler and colleagues (2021) called this index, “Revised Predicted Age.”, while de Lange and Cole’s (2020) originally called this Corrected Brain Age, “Corrected Predicted Age”. Butler and colleagues (2021) then subtracted the chronological age from this index and called it, “Revised Brain Age Gap (RBAG)”. We would like to follow the original terminology, but we do not want to use the word “Predicted Age” since chronological age can be predicted by other variables beyond the brain. We then settled with the word, "Corrected Brain Age" and “Corrected Brain Age Gap". We listed the terminologies used in the past in our article (see below).

      3) A method proposed by Le and colleagues (2018). Here, Butler and colleagues (2021) referred to one of the approaches done by Le and colleagues: “include age as a regressor when doing follow-up analyses.” Essentially this is what we did for the commonality analysis. Le and colleagues (2018)’ approach is the same as examining the unique effects of Brain Age in a multiple regression analysis with Chronological Age and Brain Age as regressors.

      While indexes from de Lange and Cole’s (2020) and Le and colleagues’ (2018) methods show poor performance in capturing fluid cognition in the current work, we need to stress that many research groups do not believe that these methods are meaningless. In fact, de Lange and Cole’s method (2020) is one of the most commonly implemented methods that can be seen elsewhere (e.g., Cole et al., 2020; Cumplido-Mayoral et al., 2023; Denissen et al., 2022). This index just does not seem to work well in the case of fluid cognition.

      Here is how we described how we calculated Brain Age indexes in the revised manuscript:

      Methods

      “ Brain Age calculations: Brain Age, Brain Age Gap, Corrected Brain Age and Corrected Brain Age Gap In addition to Brain Age, which is the predicted value from the models predicting chronological age in the test sets, we calculated three other indices to reflect the estimation of brain aging. First, Brain Age Gap reflects the difference between the age predicted by brain MRI and the actual, chronological age. Here we simply subtracted the chronological age from Brain Age:

      Brain Age Gapi = Brain Agei - chronological agei , (2)

      where i is the individual. Next, to reduce the dependency on chronological age (Butler et al., 2021; de Lange & Cole, 2020; Le et al., 2018), we applied a method described in de Lange and Cole’s (2020), which was implemented elsewhere (Cole et al., 2020; Cumplido-Mayoral et al., 2023; Denissen et al., 2022):

      In each outer-fold training set: Brain Agei = 0 + 1 chronological agei + εi, (3)

      Then in the corresponding outer-fold test set: Corrected Brain Agei = (Brain Agei - 0)/1, (4)

      That is, we first fit a regression line predicting the Brain Age from a chronological age in each outer-fold training set. We then used the slope (1) and intercept (0) of this regression line to adjust Brain Age in the corresponding outer-fold test set, resulting in Corrected Brain Age. Note de Lange and Cole (2020) called this Corrected Brain Age, “Corrected Predicted Age”, while Butler (2021) called it “Revised Predicted Age.”

      Lastly, we computed Corrected Brain Age Gap by subtracting the chronological age from the Corrected Brain Age (Butler et al., 2021; Cole et al., 2020; de Lange & Cole, 2020; Denissen et al., 2022):

      Corrected Brain Age Gap = Corrected Brain Age - chronological age, (5)

      Note Cole and colleagues (2020) called Corrected Brain Age Gap, “brain-predicted age difference (brain-PAD),” while Butler and colleagues (2021) called this index, “Revised Brain Age Gap”.

      Reviewer 3 Public Review #3:

      • "However, the improvement in predicting chronological age may not necessarily make Brain Age to be better at capturing Cognitionfluid. If, for instance, the age-prediction model had the perfect performance, Brian Age Gap would be exactly zero and would have no utility in capturing Cognitionfluid beyond chronological age" (p3). I largely agree with this statement. I would be really careful to distinguish between brain-age and the brain-age gap here, as the former is a predicted value, and the latter is the residual times -1 (i.e., predicted age - age). Therefore, together they explain all of the variance in age. Changing the first sentence to refer to the brain-age gap would be more accurate in this context. The brain-age gap will never be exactly zero, though, even with perfect prediction on the training set, because subjects in the testing set are different from the subjects in the training set.

      Response: Thank you so much for pointing this out. We agree to change “Brain Age” to “Brain Age Gap” in the mentioned sentence.

      Reviewer 3 Public Review #4:

      • "Can we further improve our ability to capture the decline in cognitionfluid by using, not only Brain Age and chronological age, but also another biomarker, Brain Cognition?". This question is fundamentally getting at whether a predicted value of cognition can predict cognition. Assuming the brain parameters can predict cognition decently, and the original cognitive measure that you were predicting is related to your measure of fluid cognition, the answer should be yes. Upon reading the Methods, it became clear that the cognitive variable in the model predicting cognition using brain features (to get predicted cognition, or as the authors refer to it, brain-cognition) is the same as the measure of fluid cognition that you are trying to assess how well brain-cognition can predict. Assuming the brain parameters can predict fluid cognition at all, it is then inevitable that brain-cognition will predict fluid cognition. Therefore, it is inappropriate to use predicted values of a variable to predict the same variable.

      Response: Thank you Reviewer 3 for pointing out this important issue. Reviewer 1 (Recommendations For The Authors #4) and Reviewer 2 (Public Review #4) bought up a similar issue. While Reviewer 3 felt that “it is inappropriate to use predicted values of a variable to predict the same variable,“ Reviewer 2 viewed Brain Cognition as a 'specific/direct' index and Brain Age as a 'generic/indirect' index. And both have merit in their own right.

      Similar to Reviewer 2, we believe that the specific index is as important and has commonly been used elsewhere in the context of biomarkers. For instance, to obtain neuroimaging biomarkers for Alzheimer’s, neuroimaging researchers often build a predictive model to predict Alzheimer's diagnosis (Khojaste-Sarakhsi et al., 2022). In fact, outside of neuroimaging, polygenic risk scores (PRSs) in genomics are often used following “to use predicted values of a variable to predict the same variable” (Choi et al., 2020). For instance, a PRS of ADHD that indicates the genetic liability to develop ADHD is based on genome-wide association studies of ADHD (Demontis et al., 2019).

      Still, we now agreed that it may not be fair to compare the performance of a specific index (Brain Cognition) and a generic index (Brain Age) directly (as pointed out by Reviewer 3 Public Review #6 below). Accordingly, in the revision, as opposed to treating Brain Cognition and Brain Age as separate biomarkers and comparing them, we treated the capability of Brain Cognition in capturing fluid cognition as the upper limit of Brain Age’s capability in capturing fluid cognition. In other words, the strength of an out-of-sample relationship between Brain Cognition and fluid cognition reflects variation in the brain MRI that is related to fluid cognition. And consequently, the unique effects of Brain Cognition that explain fluid cognition beyond Brain Age and chronological age indicate what is missing from Brain Age -- the amount of co-variation between brain MRI and fluid cognition that cannot be captured by Brain Age. According to Reviewer 2, a generic index (Brain Age) “sacrificed some additional explained variance provided” compared to a specific index (Brain Cognition). Here, we used the commonality analyses to quantify how much scarifying was made by Brain Age. See below for the re-conceptualisation of Brain Age vs. Brain Cognition in the revision:

      Abstract

      “Lastly, we tested how much Brain Age missed the variation in the brain MRI that could explain fluid cognition. To capture this variation in the brain MRI that explained fluid cognition, we computed Brain Cognition, or a predicted value based on prediction models built to directly predict fluid cognition (as opposed to chronological age) from brain MRI data. We found that Brain Cognition captured up to an additional 11% of the total variation in fluid cognition that was missing from the model with only Brain Age and chronological age, leading to around a 1/3-time improvement of the total variation explained.”

      Introduction:

      “Third and finally, certain variation in the brain MRI is related to fluid cognition, but to what extent does Brain Age not capture this variation? To estimate the variation in the brain MRI that is related to fluid cognition, we could build prediction models that directly predict fluid cognition (i.e., as opposed to chronological age) from brain MRI data. Previous studies found reasonable predictive performances of these cognition-prediction models, built from certain MRI modalities (Dubois et al., 2018; Pat, Wang, Anney, et al., 2022; Rasero et al., 2021; Sripada et al., 2020; Tetereva et al., 2022; for review, see Vieira et al., 2022). Analogous to Brain Age, we called the predicted values from these cognition-prediction models, Brain Cognition. The strength of an out-of-sample relationship between Brain Cognition and fluid cognition reflects variation in the brain MRI that is related to fluid cognition and, therefore, indicates the upper limit of Brain Age’s capability in capturing fluid cognition. Consequently, the unique effects of Brain Cognition that explain fluid cognition beyond Brain Age and chronological age indicate what is missing from Brain Age -- the amount of co-variation between brain MRI and fluid cognition that cannot be captured by Brain Age.”

      “Finally, we investigated the extent to which Brain Age indices missed the variation in the brain MRI that could explain fluid cognition. Here, we tested Brain Cognition’s unique effects in multiple regression models with a Brain Age index, chronological age and Brain Cognition as regressors to explain fluid cognition.“

      Discussion

      “Third, how much does Brain Age miss the variation in the brain MRI that could explain fluid cognition? Brain Age and chronological age by themselves captured around 32% of the total variation in fluid cognition. But, around an additional 11% of the variation in fluid cognition could have been captured if we used the prediction models that directly predicted fluid cognition from brain MRI.

      “Third, by introducing Brain Cognition, we showed the extent to which Brain Age indices were not able to capture the variation of brain MRI that is related to fluid cognition. Brain Cognition, from certain cognition-prediction models such as the stacked models, has relatively good predictive performance, consistent with previous studies (Dubois et al., 2018; Pat, Wang, Anney, et al., 2022; Rasero et al., 2021; Sripada et al., 2020; Tetereva et al., 2022; for review, see Vieira et al., 2022). We then examined Brain Cognition using commonality analyses (Nimon et al., 2008) in multiple regression models having a Brain Age index, chronological age and Brain Cognition as regressors to explain fluid cognition. Similar to Brain Age indices, Brain Cognition exhibited large common effects with chronological age. But more importantly, unlike Brain Age indices, Brain Cognition showed large unique effects, up to around 11%. The unique effects of Brain Cognition indicated the amount of co-variation between brain MRI and fluid cognition that was missed by a Brain Age index and chronological age. This missing amount was relatively high, considering that Brain Age and chronological age together explained around 32% of the total variation in fluid cognition. Accordingly, if a Brain Age index was used as a biomarker along with chronological age, we would have missed an opportunity to improve the performance of the model by around one-third of the variation explained.”

      Reviewer 3 Public Review #5:

      • "However, Brain Age Gap created from the lower-performing age-prediction models explained a higher amount of variation in Cognitionfluid. For instance, the top performing age-prediction model, "Stacked: All excluding Task Contrast", generated Brain Age and Corrected Brain Age that explained the highest amount of variation in Cognitionfluid, but, at the same time, produced Brian Age Gap that explained the least amount of variation in Cognitionfluid" (p7). This is an inevitable consequence of the following relationship between predicted values and residuals (or residuals times -1): y=(y-y ̂ )+y ̂. Let's say that age explains 60% of the variance in fluid cognition, and predicted age (y ̂) explains 40% of the variance in fluid cognition. Then the brain age gap (-(y-y ̂)) should explain 20% of the variance in fluid cognition. If by "Corrected Brain Age" you mean the modified predicted age from Butler et al (2021), the "Corrected Brain Age" result is inevitable because the modified predicted age is essentially just age with a tiny bit of noise added to it. From Figure 4, though, this does not seem to be the case, because the lower left quadrant in panel (a) should be flat and high (about as high as the predictive value of age for fluid cognition). So it is unclear how "Corrected Brain Age" is calculated. It looks like you might be regressing age out of brain-age, though from your description in the Methods section, it is not totally clear. Again, I highly recommend using the terminology and metrics of Butler et al (2021) throughout to reduce confusion. Please also clarify how you used the slope and intercept. In general, given how brain-age metrics tend to be calculated, the following conclusion is inevitable: "As before, the unique effects of Brain Age indices were all relatively small across the four Brain Age indices and across different prediction models" (p10).

      Response: We agreed that the results are ‘inevitable’ due to the transformations from Brain Age to other Brain Age indices. However, the consequences of these transformations may not be very clear to readers who are not very familiar with Brain Age literature and to the community at large who think about the implications of Brain Age. This is appreciated by Reviewer 1, who mentioned “While the main message will not come as a surprise to anyone with hands-on experience of using brain-age models, I think it is nonetheless an important message to convey to the community.”

      Note we made clarifications on how we calculated each of the Brain Age indices above (see<br /> Reviewer 3 Public Review #2), including how we used the slope and intercept. We chose the terminology closer to the one originally used by de Lange and Cole (2020) and now listed many terminologies others have used to refer to this transformation.

      Reviewer 3 Public Review #6:

      "On the contrary, the unique effects of Brain Cognition appeared much larger" (p10). This is not a fair comparison if you do not look at the unique effects above and beyond the cognitive variable you predicted in your brain-cognition model. If your outcome measure had been another metric of cognition other than fluid cognition, you would see that brain-cognition does not explain any additional variance in this outcome when you include fluid cognition in the model, just as brain-age would not when including age in the model (minus small amounts due to penalization and out-of-sample estimates). This highlights the fact that using a predicted value to predict anything is worse than using the value itself.

      Response Please see our response to Reviewer 3 Public Review #4 above. Briefly, we no long made this comparison. Instead, we now viewed the unique effects of Brain Cognition as a way to test how much Brain Age missed the variation in the brain MRI that could explain fluid cognition.

      Reviewer 3 Public Review #7:

      "First, how much does Brain Age add to what is already captured by chronological age? The short answer is very little" (p12). This is a really important point, but the paper requires an in-depth discussion of the inevitability of this result, as discussed above.

      Response We agree that the tight relationship between Brain Age and chronological age is inevitable. We mentioned this from the get-go in the introduction:

      Introduction “Accordingly, by design, Brain Age is tightly close to chronological age. Because chronological age usually has a strong relationship with fluid cognition, to begin with, it is unclear how much Brain Age adds to what is already captured by chronological age.”

      To make this point obvious, we quantified the overlap between Brain Age and chronological age using the commonality analysis. We hope that our effort to show the inevitability of this overlap can make people more careful when designing studies involving Brain Age.

      Reviewer 3 Public Review #8:

      "Third, do we have a solution that can improve our ability to capture Cognitionfluid from brain MRI? The answer is, fortunately, yes. Using Brain Cognition as a biomarker, along with chronological age, seemed to capture a higher amount of variation in Cognitionfluid than only using Brain Age" (p12). I suggest controlling for the cognitive measure you predicted in your brain-cognition model. This will show that brain-cognition is not useful above and beyond cognition, highlighting the fact that it is not a useful endeavor to be using predicted values.

      Response This point is similar to Reviewer 3 Public Review #6. Again please see our response to Reviewer 3 Public Review #4 above. Briefly, we no long made this comparison and said whether Brain Cognition is ‘better’ than Brain Age. Instead, we now viewed the unique effects of Brain Cognition as a way to test how much Brain Age missed the variation in the brain MRI that could explain fluid cognition.

      Reviewer 3 Public Review #9:

      "Accordingly, a race to improve the performance of age-prediction models (Baecker et al., 2021) does not necessarily enhance the utility of Brain Age indices as a biomarker for Cognitionfluid. This calls for a new paradigm. Future research should aim to build prediction models for Brian Age indices that are not necessarily good at predicting age, but at capturing phenotypes of interest, such as Cognitionfluid and beyond" (p13). I whole-heartedly agree with the first two sentences, but strongly disagree with the last. Certainly your results, and the underlying reason as to why you found these results, calls for a new paradigm (or, one might argue, a pre-brain-age paradigm). As of now, your results do not suggest that researchers should keep going down the brain-age path. While it is difficult to prove that there is no transformation of brain-age or the brain-age gap that will be useful, I am nearly sure this is true from the research I have done. If you would like to suggest that the field should continue down this path, I suggest presenting a very good case to support this view.

      Response Thank you for your comments on this issue.

      Since the submission of our manuscript, other researchers also made a similar observation regarding the disagreement between the predictive performance of age-prediction models and the utility of Brain Age. For instance, in their systematic review, Jirasarie and colleagues (2023, p7) wrote this statement, “Despite mounting evidence, there is a persisting assumption across several studies that the most accurate brain age models will have the most potential for detecting differences in a given phenotype of interest. As a point of illustration, seven of the twenty studies in this review only evaluated the utility of their most accurate model, which in all cases was trained using multimodal features. This approach has also led to researchers to exclusively use T1-weighted and diffusion-weighted MRI scans when developing brain age models36 since such modalities have been shown to have the largest contribution to a model’s predictive power.2,67 However, our review suggests that model accuracy does not necessarily provide meaningful insight about clinical utility (e.g., detection of age-related pathology). Taken with prior studies,16,17 it appears that the most accurate models tend to not be the most useful.”

      We now discussed the disagreement between the predictive performance of age-prediction models and the utility of Brain Age, not only in the context of cognitive functioning (Jirsaraie, Kaufmann, et al., 2023) but also in the context of neurological/psychological disorders (Bashyam et al., 2020; Rokicki et al., 2021). Following Reviewer 3’s suggestion, we also added several possible strategies to mitigate this problem of Brain Age, used by us and other groups. Please see below.

      Discussion:

      “This discrepancy between the predictive performance of age-prediction models and the utility of Brain Age indices as a biomarker is consistent with recent findings (for review, see Jirsaraie, Gorelik, et al., 2023), both in the context of cognitive functioning (Jirsaraie, Kaufmann, et al., 2023) and neurological/psychological disorders (Bashyam et al., 2020; Rokicki et al., 2021). For instance, combining different MRI modalities into the prediction models, similar to our stacked models, often lead to the highest performance of age-prediction models, but does not likely explain the highest variance across different phenotypes, including cognitive functioning and beyond (Jirsaraie, Gorelik, et al., 2023).”

      “Next, researchers should not select age-prediction models based solely on age-prediction performance. Instead, researchers could select age-prediction models that explained phenotypes of interest the best. Here we selected age-prediction models based on a set of features (i.e., modalities) of brain MRI. This strategy was found effective not only for fluid cognition as we demonstrated here, but also for neurological and psychological disorders as shown elsewhere (Jirsaraie, Gorelik, et al., 2023; Rokicki et al., 2021). Rokicki and colleagues (2021), for instance, found that, while integrating across MRI modalities led to age-prediction models with the highest age-prediction performance, using only T1 structural MRI gave age-prediction models that were better at classifying Alzheimer’s disease. Similarly, using only cerebral blood flow gave age-prediction models that were better at classifying mild/subjective cognitive impairment, schizophrenia and bipolar disorder.

      As opposed to selecting age-prediction models based on a set of features, researchers could also select age-prediction models based on modelling methods. For instance, Jirsaraie and colleagues (2023) compared gradient tree boosting (GTB) and deep-learning brain network (DBN) algorithms in building age-prediction models. They found GTB to have higher age-prediction performance but DBN to have better utility in explaining cognitive functioning. In this case, an algorithm with better utility (e.g., DBN) should be used for explaining a phenotype of interest. Similarly, Bashyam and colleagues (2020) built different DBN-based age-prediction models, varying in age-prediction performance. The DBN models with a higher number of epochs corresponded to higher age-prediction performance. However, DBN-based age-prediction models with a moderate (as opposed to higher or lower) number of epochs were better at classifying Alzheimer’s disease, mild cognitive impairment and schizophrenia. In this case, a model from the same algorithm with better utility (e.g., those DBN with a moderate epoch number) should be used for explaining a phenotype of interest. Accordingly, this calls for a change in research practice, as recently pointed out by Jirasarie and colleagues (2023, p7), “Despite mounting evidence, there is a persisting assumption across several studies that the most accurate brain age models will have the most potential for detecting differences in a given phenotype of interest”. Future neuroimaging research should aim to build age-prediction models that are not necessarily good at predicting age, but at capturing phenotypes of interest.”

      Reviewer #1 (Recommendations For The Authors):

      In this paper, the authors evaluate the utility of brain age derived metrics for predicting cognitive decline using the HCP aging dataset by performing a commonality analysis in a downstream regression. The main conclusion is that brain age derived metrics do not explain much additional variation in cognition over and above what is already explained by age. The authors propose to use a regression model trained to predict cognition ('brain-cognition') as an alternative that explains more unique variance in the downstream regression.

      This is a reasonably good paper and the use of a commonality analysis is a nice contribution to understanding variance partitioning across different covariates. While the main message will not come as a surprise to anyone with hands-on experience of using brain-age models, I think it is nonetheless an important message to convey to the community. With that said, I have some comments that I believe the authors ought to address before publication.

      Reviewer 1 Recommendations For The Authors #1:

      First, from a conceptual point of view, the authors focus exclusively on cognition as a downstream outcome. This is undeniably important, but is only one application area for brain age models. They are also used for example to provide biomarkers for many brain disorders. What would the results presented here have to say about these application areas? Further, I think that since brain-age models by construction confound relevant biological variation with the accuracy of the regression models used to estimate them, my own opinion about the limits of interpretation of (e.g.) the brain-age gap is as a dimensionless biomarker. This has also been discussed elsewhere (see e.g. https://academic.oup.com/brain/article/143/7/2312/5863667). I would suggest the authors nuance their discussion to provide considerations on these issues.

      Response Thank you Reviewer 1 for pointing out two important issues.

      The first issue was about applications for brain disorders. We now made a detailed discussion about this, which also addressed Reviewer 3 Public Review #9. Briefly, we now bought up

      1) the consistency between our findings on fluid cognition and other recent works on brain disorders,

      2) under-fitted age-prediction models from Brain Age studies focusing on neurological/psychological disorders when applied to participants with neurological/psychological disorders because the age-prediction models were built from largely healthy participants,

      and 3) suggested solutions we and others made to optimise the utility of Brain Age for both cognitive functioning and brain disorders.

      Discussion:

      “This discrepancy between the predictive performance of age-prediction models and the utility of Brain Age indices as a biomarker is consistent with recent findings (for review, see Jirsaraie, Gorelik, et al., 2023), both in the context of cognitive functioning (Jirsaraie, Kaufmann, et al., 2023) and neurological/psychological disorders (Bashyam et al., 2020; Rokicki et al., 2021). For instance, combining different MRI modalities into the prediction models, similar to our stacked models, often lead to the highest performance of age-prediction models, but does not likely explain the highest variance across different phenotypes, including cognitive functioning and beyond (Jirsaraie, Gorelik, et al., 2023).”

      “There is a notable difference between studies investigating the utility of Brain Age in explaining cognitive functioning, including ours and others (e.g., Butler et al., 2021; Cole, 2020, 2020; Jirsaraie, Kaufmann, et al., 2023) and those explaining neurological/psychological disorders (e.g., Bashyam et al., 2020; Rokicki et al., 2021). That is, those Brain Age studies focusing on neurological/psychological disorders often build age-prediction models from MRI data of largely healthy participants (e.g., controls in a case-control design or large samples in a population-based design), apply the built age-prediction models to participants without vs. with neurological/psychological disorders and compare Brain Age indices between the two groups. This means that age-prediction models from Brain Age studies focusing on neurological/psychological disorders might be under-fitted when applied to participants with neurological/psychological disorders because they were built from largely healthy participants. And thus, the difference in Brain Age indices between participants without vs. with neurological/psychological disorders might be confounded by the under-fitted age-prediction models (i.e., Brain Age may predict chronological age well for the controls, but not for those with a disorder). On the contrary, our study and other Brain Age studies focusing on cognitive functioning often build age-prediction models from MRI data of largely healthy participants and apply the built age-prediction models to participants who are also largely healthy. Accordingly, the age-prediction models for explaining cognitive functioning do not suffer from being under-fitted. We consider this as a strength, not a weakness of our study.”

      “Next, researchers should not select age-prediction models based solely on age-prediction performance. Instead, researchers could select age-prediction models that explained phenotypes of interest the best. Here we selected age-prediction models based on a set of features (i.e., modalities) of brain MRI. This strategy was found effective not only for fluid cognition as we demonstrated here, but also for neurological and psychological disorders as shown elsewhere (Jirsaraie, Gorelik, et al., 2023; Rokicki et al., 2021). Rokicki and colleagues (2021), for instance, found that, while integrating across MRI modalities led to age-prediction models with the highest age-prediction performance, using only T1 structural MRI gave age-prediction models that were better at classifying Alzheimer’s disease. Similarly, using only cerebral blood flow gave age-prediction models that were better at classifying mild/subjective cognitive impairment, schizophrenia and bipolar disorder. As opposed to selecting age-prediction models based on a set of features, researchers could also select age-prediction models based on modelling methods. For instance, Jirsaraie and colleagues (2023) compared gradient tree boosting (GTB) and deep-learning brain network (DBN) algorithms in building age-prediction models. They found GTB to have higher age-prediction performance but DBN to have better utility in explaining cognitive functioning. In this case, an algorithm with better utility (e.g., DBN) should be used for explaining a phenotype of interest. Similarly, Bashyam and colleagues (2020) built different DBN-based age-prediction models, varying in age-prediction performance. The DBN models with a higher number of epochs corresponded to higher age-prediction performance. However, DBN-based age-prediction models with a moderate (as opposed to higher or lower) number of epochs were better at classifying Alzheimer’s disease, mild cognitive impairment and schizophrenia. In this case, a model from the same algorithm with better utility (e.g., those DBN with a moderate epoch number) should be used for explaining a phenotype of interest. Accordingly, this calls for a change in research practice, as recently pointed out by Jirasarie and colleagues (2023, p7), “Despite mounting evidence, there is a persisting assumption across several studies that the most accurate brain age models will have the most potential for detecting differences in a given phenotype of interest”. Future neuroimaging research should aim to build age-prediction models that are not necessarily good at predicting age, but at capturing phenotypes of interest.”

      The second issue was about “the brain-age gap as a dimensionless biomarker.” We are not so clear on what the reviewer meant by “the dimensionless biomarker.” One possible meaning of the “dimensionless biomarker” is the fact that Brain Age from the same algorithm and same modality can be computed, such that Brain Age can be tightly fit or loosely fit with chronological age. This is what Bashyam and colleagues (2020) did in the article Reviewer 1 referred to. We now wrote about this strategy in the above paragraph in the Discussion.

      Alternatively, “the dimensionless biomarker” might be something closer to what Reviewer 2 viewed Brain Age as a “generic/indirect” index (as opposed to a 'specific/direct' index in the case of Brain Cognition) (see Reviewer 2 Public Review #4). We discussed this in our response to Reviewer 3 Public Review #4.

      Reviewer 1 Recommendations For The Authors #2:

      Second, from a methods perspective, I am quite suspicious of the stacked regression models the authors are using to combine regression models and I suspect they may be overfit. In my experience, stacked models are very prone to overfitting when combined with cross-validation. This is because the predictions from the first level models (i,e. the features that are provided to the second-level 'stacked' models) contain information about the training set and the test set. If cross-validation is not done very carefully (e.g. using multiple hold-out sets), information leakage can easily occur at the second level. Unfortunately, there is not sufficient explanation of the methodological procedures in the current manuscript to fully understand what was done. First, please provide more information to enable the reader to better understand the stacked regression models and if the authors are not using an approach that fully preserves training and test separability, please do so.

      Response: We would like to thank Reviewer 1 for the suggestion. We now made it clearer in texts and new figure (see below) that we used nested cross-validation to ensure no information leakage between training and test sets. Regarding the stacked models more specifically, the hyperparameters of the stacked models were tuned in the same inner-fold CV as the non-stacked model (see Figure 7 below). That is, training models for both non-stacked and stacked models did not involve the test set, ensuring that there was no data leakage between training and test sets.

      Methods:

      “To compute Brain Age and Brain Cognition, we ran two separate prediction models. These prediction models either had chronological age or fluid cognition as the target and standardised brain MRI as the features (Denissen et al., 2022). We used nested cross-validation (CV) to build these models (see Figure 7). We first split the data into five outer folds. We used five outer folds so that each outer fold had around 100 participants. This is to ensure the stability of the test performance across folds. In each outer-fold CV, one of the outer folds was treated as a test set, and the rest was treated as a training set, which was further divided into five inner folds. In each inner-fold CV, one of the inner folds was treated as a validation set and the rest was treated as a training set. We used the inner-fold CV to tune for hyperparameters of the models and the outer-fold CV to evaluate the predictive performance of the models.

      In addition to using each of the 18 sets of features in separate prediction models, we drew information across these sets via stacking. Specifically, we computed predicted values from each of the 18 sets of features in the training sets. We then treated different combinations of these predicted values as features to predict the targets in separate “stacked” models. The hyperparameters of the stacked models were tuned in the same inner-fold CV as the non-stacked model (see Figure 7). That is, training models for both non-stacked and stacked models did not involve the test set, ensuring that there was no data leakage between training and test sets. We specified eight stacked models: “All” (i.e., including all 18 sets of features), “All excluding Task FC”, “All excluding Task Contrast”, “Non-Task” (i.e., including only Rest FC and sMRI), “Resting and Task FC”, “Task Contrast and FC”, “Task Contrast” and “Task FC”. Accordingly, in total, there were 26 prediction models for Brain Age and Brain Cognition.

      Reviewer 1 Recommendations For The Authors #3:

      Third, the authors standardize the elastic net regression coefficients post-hoc. Why did the authors not perform the more standard approach of standardizing the covariates and responses, prior to model estimation, which would yield standardized regression coefficients (in the classical sense) by construction? Please also provide an indication of the different regression strengths that were estimated across the different models and cross-validation splits. Also, how stable were the weights across splits?

      Response For model fitting, we did not “standardize the elastic net regression coefficients post-hoc.” Instead, we did all of the standardisation steps prior to model fitting (see Methods below). For regression strengths across different models and cross-validation splits, we now provided predictive performance at each of the five outer-fold test sets in Figure 1 (below). As you may have seen, the predictive performance was quite stable across the cross-validation splits.

      For visualising feature importance, We originally only standardised the elastic net regression coefficients post-hoc, so that feature importance plots were in the same scale across folds. However, as mentioned by Reviewer 3 (Recommendations for the Authors #7, below), this might make it difficult to interpret the directionality of the coefficients. In the revised manuscript, we refitted the Elastic Net model to the full dataset without splitting them into five folds and visualised the coefficients on brain images (see below).

      Methods

      “We controlled for the potential influences of biological sex on the brain features by first residualising biological sex from brain features in each outer-fold training set. We then applied the regression of this residualisation to the corresponding test set. We also standardised the brain features in each outer-fold training set and then used the mean and standard deviation of this outer-fold training set to standardise the test set. All of the standardisation was done prior to fitting the prediction models.”

      “To understand how Elastic Net made a prediction based on different brain features, we examined the coefficients of the tuned model. Elastic Net coefficients can be considered as feature importance, such that more positive Elastic Net coefficients lead to more positive predicted values and, similarly, more negative Elastic Net coefficients lead to more negative predicted values (Molnar, 2019; Pat, Wang, Bartonicek, et al., 2022). While the magnitude of Elastic Net coefficients is regularised (thus making it difficult for us to interpret the magnitude itself directly), we could still indicate that a brain feature with a higher magnitude weights relatively stronger in making a prediction. Another benefit of Elastic Net as a penalised regression is that the coefficients are less susceptible to collinearity among features as they have already been regularised (Dormann et al., 2013; Pat, Wang, Bartonicek, et al., 2022).

      Given that we used five-fold nested cross validation, different outer folds may have different degrees of ‘’ and ‘l_1 ratio’, making the final coefficients from different folds to be different. For instance, for certain sets of features, penalisation may not play a big part (i.e., higher or lower ‘’ leads to similar predictive performance), resulting in different ‘’ for different folds. To remedy this in the visualisation of Elastic Net feature importance, we refitted the Elastic Net model to the full dataset without splitting them into five folds and visualised the coefficients on brain images using Brainspace (Vos De Wael et al., 2020) and Nilern (Abraham et al., 2014) packages. Note, unlike other sets of features, Task FC and Rest FC were modelled after data reduction via PCA. Thus, for Task FC and Rest FC, we, first, multiplied the absolute PCA scores (extracted from the ‘components_’ attribute of ‘sklearn.decomposition.PCA’) with Elastic Net coefficients and, then, summed the multiplied values across the 75 components, leaving 71,631 ROI-pair indices.”

      Reviewer 1 Recommendations For The Authors #4:

      I do not really find it surprising that the level of unique explained variance provided by a brain-cognition model is higher than a brain-age model, given that the latter is considerably more accurate (also, in view of the comment above). As such I would recommend to tone down the claims about the utility of this method, also because it is only really applicable to one application area for brain age.

      Response Thank you for bringing this issue to our attention. We have now toned down the claims about the utility of Brain Cognition and importantly treated the capability of Brain Cognition in capturing fluid cognition as the upper limit of Brain Age’s capability in capturing fluid cognition. Please see Reviewer 3 Public Review #4 above for a detailed discussion about this issue.

      Reviewer 1 Recommendations For The Authors #5:

      Please provide more details about the task designs and MRI processing procedures that were employed on this sample so that the reader is not forced to dig through the publications from the consortia contributing the data samples used. For example, comments such as "Here we focused on the pre-processed task fMRI files with a suffix "_PA_Atlas_MSMAll_hp0_clean.dtseries.nii." are not particularly helpful to readers not already familiar with this dataset.

      Response Thank you so much for pointing out this important point on the clarity of the description of our MRI methodology. We now added additional details about the data processing done by the HCP-A and by us. We, for instance, explained the meaning of the HCP-A suffix “"_PA_Atlas_MSMAll_hp0_clean.dtseries.nii”. Please see below.

      Methods

      “HCP-A provides details of parameters for brain MRI elsewhere (Bookheimer et al., 2019; Harms et al., 2018). Here we used MRI data that were pre-processed by the HCP-A with recommended methods, including the MSMALL alignment (Glasser et al., 2016; Robinson et al., 2018) and ICA-FIX (Glasser et al., 2016) for functional MRI. We used multiple brain MRI modalities, covering task functional MRI (task fMRI), resting-state functional MRI (rsfMRI) and structural MRI (sMRI), and organised them into 19 sets of features.

      Sets of Features 1-10: Task fMRI contrast (Task Contrast)

      Task contrasts reflect fMRI activation relevant to events in each task. Bookheimer and colleagues (2019) provided detailed information about the fMRI in HCP-A. Here we focused on the pre-processed task fMRI Connectivity Informatics Technology Initiative (CIFTI) files with a suffix, “_PA_Atlas_MSMAll_hp0_clean.dtseries.nii.” These CIFTI files encompassed both the cortical mesh surface and subcortical volume (Glasser et al., 2013). Collected using the posterior-to-anterior (PA) phase, these files were aligned using MSMALL (Glasser et al., 2016; Robinson et al., 2018), linear detrended (see https://groups.google.com/a/humanconnectome.org/g/hcp-users/c/ZLJc092h980/m/GiihzQAUAwAJ) and cleaned from potential artifacts using ICA-FIX (Glasser et al., 2016).

      To extract Task Contrasts, we regressed the fMRI time series on the convolved task events using a double-gamma canonical hemodynamic response function via FMRIB Software Library (FSL)’s FMRI Expert Analysis Tool (FEAT) (Woolrich et al., 2001). We kept FSL’s default high pass cutoff at 200s (i.e., .005 Hz). We then parcellated the contrast ‘cope’ files, using the Glasser atlas (Gordon et al., 2016) for cortical surface regions and the Freesurfer’s automatic segmentation (aseg) (Fischl et al., 2002) for subcortical regions. This resulted in 379 regions, whose number was, in turn, the number of features for each Task Contrast set of features.

      HCP-A collected fMRI data from three tasks: Face Name (Sperling et al., 2001), Conditioned Approach Response Inhibition Task (CARIT) (Somerville et al., 2018) and VISual MOTOR (VISMOTOR) (Ances et al., 2009). First, the Face Name task (Sperling et al., 2001) taps into episodic memory. The task had three blocks. In the encoding block [Encoding], participants were asked to memorise the names of faces shown. These faces were then shown again in the recall block [Recall] when the participants were asked if they could remember the names of the previously shown faces. There was also the distractor block [Distractor] occurring between the encoding and recall blocks. Here participants were distracted by a Go/NoGo task. We computed six contrasts for this Face Name task: [Encode], [Recall], [Distractor], [Encode vs. Distractor], [Recall vs. Distractor] and [Encode vs. Recall].

      Second, the CARIT task (Somerville et al., 2018) was adapted from the classic Go/NoGo task and taps into inhibitory control. Participants were asked to press a button to all [Go] but not to two [NoGo] shapes. We computed three contrasts for the CARIT task: [NoGo], [Go] and [NoGo vs. Go].

      Third, the VISMOTOR task (Ances et al., 2009) was designed to test simple activation of the motor and visual cortices. Participants saw a checkerboard with a red square either on the left or right. They needed to press a corresponding key to indicate the location of the red square. We computed just one contrast for the VISMOTOR task: [Vismotor], which indicates the presence of the checkerboard vs. baseline.

      Sets of Features 11-13: Task fMRI functional connectivity (Task FC)

      Task FC reflects functional connectivity (FC ) among the brain regions during each task, which is considered an important source of individual differences (Elliott et al., 2019; Fair et al., 2007; Gratton et al., 2018). We used the same CIFTI file “_PA_Atlas_MSMAll_hp0_clean.dtseries.nii.” as the task contrasts. Unlike Task Contrasts, here we treated the double-gamma, convolved task events as regressors of no interest and focused on the residuals of the regression from each task (Fair et al., 2007). We computed these regressors on FSL, and regressed them in nilearn (Abraham et al., 2014). Following previous work on task FC (Elliott et al., 2019), we applied a highpass at .008 Hz. For parcellation, we used the same atlases as Task Contrast (Fischl et al., 2002; Glasser et al., 2016). We computed Pearson’s correlations of each pair of 379 regions, resulting in a table of 71,631 non-overlapping FC indices for each task. We then applied r-to-z transformation and principal component analysis (PCA) of 75 components (Rasero et al., 2021; Sripada et al., 2019, 2020). Note to avoid data leakage, we conducted the PCA on each training set and applied its definition to the corresponding test set. Accordingly, there were three sets of 75 features for Task FC, one for each task. “

      Reviewer 1 Recommendations For The Authors #6:

      Similarly, please be more specific about the regression methods used. There are several different parameterisations of the elastic net, please provide equations to describe the method used here so that readers can easily determine how the regularisation parameters should be interpreted. The same goes for the methods used for correcting bias, e.g. what is "de Lange and Cole's (2020) 5th equation"?

      Response Thank you. We now made a detailed description of Elastic Net including its equation (see below). We also added more specific details about the methods used for correcting bias in Brain Age indices (see our response to Reviewer 3 Public Review #2 above).

      Methods:

      “For the machine learning algorithm, we used Elastic Net (Zou & Hastie, 2005). Elastic Net is a general form of penalised regressions (including Lasso and Ridge regression), allowing us to simultaneously draw information across different brain indices to predict one target variable. Penalised regressions are commonly used for building age-prediction models (Jirsaraie, Gorelik, et al., 2023). Previously we showed that the performance of Elastic Net in predicting cognitive abilities is on par, if not better than, many non-linear and more-complicated algorithms (Pat, Wang, Bartonicek, et al., 2022; Tetereva et al., 2022). Moreover, Elastic Net coefficients are readily explainable, allowing us the ability to explain how our age-prediction and cognition-prediction models made the prediction from each brain feature (Molnar, 2019; Pat, Wang, Bartonicek, et al., 2022) (see below).

      Elastic Net simultaneously minimises the weighted sum of the features’ coefficients. The degree of penalty to the sum of the feature’s coefficients is determined by a shrinkage hyperparameter ‘’: the greater the , the more the coefficients shrink, and the more regularised the model becomes. Elastic Net also includes another hyperparameter, ‘l_1 ratio’, which determines the degree to which the sum of either the squared (known as ‘Ridge’; l_1 ratio=0) or absolute (known as ‘Lasso’; l_1 ratio=1) coefficients is penalised (Zou & Hastie, 2005). The objective function of Elastic Net as implemented by sklearn (Pedregosa et al., 2011) is defined as: argmin_ ((|(|y-X|)|_2^2)/(2×n_samples )+α×l_1 _ratio×|(||)|_1+0.5×α×(1-l_1 _ratio)×|(|w|)|_2^2 ), (1) where X is the features, y is the target, and  is the coefficient. In our grid search, we tuned two Elastic Net hyperparameters:  using 70 numbers in log space, ranging from .1 and 100, and l_1-ratio using 25 numbers in linear space, ranging from 0 and 1.”

      Additional minor points:

      Reviewer 1 Recommendations For The Authors #7:

      • Please provide more descriptive figure legends, especially for Figs 5 and 6. For example, what do the boldface numbers reflect? What do the asterisks reflect?

      Response Thank you for the suggestion. We made changes to the figure legends to make it clearer what the numbers and asterisks reflect.

      Reviewer 1 Recommendations For The Authors #8:

      • Perhaps this is personal thing, but I find the nomenclature cognition_{fluid} to be quite awkward. Why not just define FC as an acronym?

      Response Thank you for the suggestion. We now used the word ‘fluid cognition’ throughout the manuscript.

      Reviewer #2 (Recommendations For The Authors):

      Suggestions for improved or additional experiments, data or analyses.

      Reviewer 2 Recommendations For The Authors #1:

      • Since the study did not provide external validation for the indices, it is unclear how well the models would perform and generalize to other samples. Therefore, it is recommended to conduct out-of-sample testing of the models.

      Response Thank you for the suggestion. We now added discussions about how consistency between our results and several recent studies that investigated similar issues with Brain Age in different populations, e.g., large samples of older adults in Uk Biobank (Cole, 2020) and younger populations (Butler et al., 2021; Jirsaraie, Kaufmann, et al., 2023), and in a broader context, extending to neurological and psychological disorders (for review, see Jirsaraie, Gorelik, et al., 2023). Please see below.

      Please also noted that all of the analyses done were out-of-sample. We used nested cross-validation to evaluate the predictive performance of age- and cognition-prediction models on the outer-fold test sets, which are out-of-sample from the training sets (please see Reviewer 1 Recommendations For The Authors #2). Similarly, we also conducted all of the commonality analyses on the outer-fold test sets.

      Discussion

      “The small effects of the Corrected Brain Age Gap in explaining fluid cognition of aging individuals found here are consistent with studies in older adults (Cole, 2020) and younger populations (Butler et al., 2021; Jirsaraie, Kaufmann, et al., 2023). Cole (2020) studied the utility of Brain Age on cognitive functioning of large samples (n>17,000) of older adults, aged 45-80 years, from the UK Biobank (Sudlow et al., 2015). He constructed age-prediction models using LASSO, a similar penalised regression to ours and applied the same age-dependency adjustment to ours. Cole (2020) then conducted a multiple regression explaining cognitive functioning from Corrected Brain Age Gap while controlling for chronological age and other potential confounds. He found Corrected Brain Age Gap to be significantly related to performance in four out of six cognitive measures, and among those significant relationships, the effect sizes were small with a maximum of partial eta-squared at .0059. Similarly, Jirsaraie and colleagues (2023) studied the utility of Brain Age on cognitive functioning of youths aged 8-22 years old from the Human Connectome Project in Development (Somerville et al., 2018) and Preschool Depression Study (Luby, 2010). They built age-prediction models using gradient tree boosting (GTB) and deep-learning brain network (DBN) and adjusted the age dependency of Brain Age Gap using Smith and colleagues’ (2019) method. Using multiple regressions, Jirsaraie and colleagues (2023) found weak effects of the adjusted Brain Age Gap on cognitive functioning across five cognitive tasks, five age-prediction models and the two datasets (mean of standardised regression coefficient = -0.09, see their Table S7). Next, Butler and colleagues (2021) studied the utility of Brain Age on cognitive functioning of another group of youths aged 8-22 years old from the Philadelphia Neurodevelopmental Cohort (PNC) (Satterthwaite et al., 2016). Here they used Elastic Net to build age-prediction models and applied another age-dependency adjustment method, proposed by Beheshti and colleagues (2019). Similar to the aforementioned results, Butler and colleagues (2021) found a weak, statistically non-significant correlation between the adjusted Brain Age Gap and cognitive functioning at r=-.01, p=.71. Accordingly, the utility of Brain Age in explaining cognitive functioning beyond chronological age appears to be weak across age groups, different predictive modelling algorithms and age-dependency adjustments.“

      “This discrepancy between the predictive performance of age-prediction models and the utility of Brain Age indices as a biomarker is consistent with recent findings (for review, see Jirsaraie, Gorelik, et al., 2023), both in the context of cognitive functioning (Jirsaraie, Kaufmann, et al., 2023) and neurological/psychological disorders (Bashyam et al., 2020; Rokicki et al., 2021). For instance, combining different MRI modalities into the prediction models, similar to our stacked models, often lead to the highest performance of age-prediction models, but does not likely explain the highest variance across different phenotypes, including cognitive functioning and beyond (Jirsaraie, Gorelik, et al., 2023). “

      “Third, by introducing Brain Cognition, we showed the extent to which Brain Age indices were not able to capture the variation of brain MRI that is related to fluid cognition. Brain Cognition, from certain cognition-prediction models such as the stacked models, has relatively good predictive performance, consistent with previous studies (Dubois et al., 2018; Pat, Wang, Anney, et al., 2022; Rasero et al., 2021; Sripada et al., 2020; Tetereva et al., 2022; for review, see Vieira et al., 2022). We then examined Brain Cognition using commonality analyses (Nimon et al., 2008) in multiple regression models having a Brain Age index, chronological age and Brain Cognition as regressors to explain fluid cognition. Similar to Brain Age indices, Brain Cognition exhibited large common effects with chronological age. But more importantly, unlike Brain Age indices, Brain Cognition showed large unique effects, up to around 11%. The unique effects of Brain Cognition indicated the amount of co-variation between brain MRI and fluid cognition that was missed by a Brain Age index and chronological age. This missing amount was relatively high, considering that Brain Age and chronological age together explained around 32% of the total variation in fluid cognition. Accordingly, if a Brain Age index was used as a biomarker along with chronological age, we would have missed an opportunity to improve the performance of the model by around one-third of the variation explained. “

      “There is a notable difference between studies investigating the utility of Brain Age in explaining cognitive functioning, including ours and others (e.g., Butler et al., 2021; Cole, 2020, 2020; Jirsaraie, Kaufmann, et al., 2023) and those explaining neurological/psychological disorders (e.g., Bashyam et al., 2020; Rokicki et al., 2021). That is, those Brain Age studies focusing on neurological/psychological disorders often build age-prediction models from MRI data of largely healthy participants (e.g., controls in a case-control design or large samples in a population-based design), apply the built age-prediction models to participants without vs. with neurological/psychological disorders and compare Brain Age indices between the two groups. This means that age-prediction models from Brain Age studies focusing on neurological/psychological disorders might be under-fitted when applied to participants with neurological/psychological disorders because they were built from largely healthy participants. And thus, the difference in Brain Age indices between participants without vs. with neurological/psychological disorders might be confounded by the under-fitted age-prediction models (i.e., Brain Age may predict chronological age well for the controls, but not for those with a disorder). On the contrary, our study and other Brain Age studies focusing on cognitive functioning often build age-prediction models from MRI data of largely healthy participants and apply the built age-prediction models to participants who are also largely healthy. Accordingly, the age-prediction models for explaining cognitive functioning do not suffer from being under-fitted. We consider this as a strength, not a weakness of our study.”

      Reviewer 2 Recommendations For The Authors #2:

      • Employ Variance Inflation Factor (VIF) to empirically test for multicollinearity.

      Response Given high common effects between many of the regressors in the models (e.g., between Brain Age and chronological age), VIF will be high, but this is not a concern for the commonality analysis. We showed now that applying the commonality analysis to multiple regressions allowed us to have robust results against multicollinearity, as demonstrated elsewhere (Ray-Mukherjee et al., 2014, Using commonality analysis in multiple regressions: A tool to decompose regression effects in the face of multicollinearity). Specifically, using the multiple regressions by themselves without the commonality analysis, researchers have to rely on beta estimates, which are strongly affected by multicollinearity (e.g., a phenomenon known as the Suppression Effect). However, by applying the commonality analysis on top of multiple regressions, researchers can then rely on R2 estimates, which are less affected by multicollinearity. This can be seen in our case (Figure 5 and 6) where Brain Age indices had the same unique effects regardless of the level of common effects they had with chronological age (e.g., Brain Age vs. Corrected Brain Age Gap from stacked models).

      To directly demonstrate the robustness of the current commonality analysis regarding multicollinearity, we applied the commonality analysis to Ridge regressions (see Supplementary Figures 3 and 5 below). Ridge regression is a method designed to deal with multicollinearity (Dormann et al., 2013). As seen below, the results from commonality analyses applied to Ridge regressions are closely matched with our original results.

      Methods

      “Note to ensure that the commonality analysis results were robust against multicollinearity (Ray-Mukherjee et al., 2014), we also repeated the same commonality analyses done here on Ridge regression, as opposed to multiple regression. Ridge regression is a method designed to deal with multicollinearity (Dormann et al., 2013). See Supplementary Figure 3 for the Ridge regression with chronological age and each Brain Age index as regressors and Supplementary Figure 5 for the Ridge regression with chronological age, each Brain Age and Brain Cognition index as regressors. Briefly, the results from commonality analyses applied to Ridge regressions are closely matched with our results done using multiple regression.”

      Reviewer 2 Recommendations For The Authors #3:

      • Incorporate non-linearities in the correction of brain-age indices, such as separate terms in the regression or statistical analyses.

      Response Thank you for the suggestion. We now added a non-linear term of chronological age in our multiple-regression models explaining fluid cognition (see Supplementary Figure 4 and 6 below). Originally we did not have the quadratic term for chronological age in our model since the relationship between chronological age and fluid cognition was relatively linear (see Figure 1 above). Accordingly, as expected, adding the quadratic term for chronological age as suggested did not change the pattern of the results of the commonality analyses.

      Methods

      “Similarly, to ensure that we were able to capture the non-linear pattern of chronological age in explaining fluid cognition, we added a quadratic term of chronological age to our multiple-regression models in the commonality analyses. See Supplementary Figure 4 for the multiple regression with chronological age, square chronological age and each Brain Age index as regressors and Supplementary Figure 6 for the multiple regression with chronological age, square chronological age, each Brain Age index and Brain Cognition as regressors. Briefly, adding the quadratic term for chronological age did not change the pattern of the results of the commonality analyses.”

      Reviewer 2 Recommendations For The Authors #4:

      • It would be helpful to include the complete set of results in the appendix - for instance, the statistical significance for each component for the final commonality analysis.

      Response Figures 5 and 6 (see above) already have asterisks to reflect the statistical significance of the unique effects. Because of this, we do not believe we need more figures/tables in the appendix to show statistical significance.

      Recommendations for improving the writing and presentation.

      Reviewer 2 Recommendations For The Authors #5:

      • The authors are encouraged to refrain from using terms such as 'fortunately', 'unfortunately', and 'unsettling', as they may appear inappropriate when referring to empirical findings.

      Response We agree with this suggestion and no long used those words.

      Reviewer 2 Recommendations For The Authors #6:

      • It would be helpful to clarify in the methods that you end up with 5 test folds.

      Response We now made a clarification why we chose 5 test folds.

      Methods

      “We used nested cross-validation (CV) to build these models (see Figure 7). We first split the data into five outer folds. We used five outer folds so that each outer fold had around 100 participants. This is to ensure the stability of the test performance across folds.”

      Minor corrections to the text and figures.

      Reviewer 2 Recommendations For The Authors #7:

      • Why use months, not years for chronological age? This seems inappropriate given the age range.

      Response We originally used months since they were units used in our prediction modelling. However, to make the figures easier to understand, we now used years.

      Reviewer 2 Recommendations For The Authors #8:

      • The formatting, especially regarding the text embedded within the figures, could benefit from significant improvements.

      Response Thank you for the suggestion. We made changes to the text embedded within the figures. They should be more readable now

      Reviewer 2 Recommendations For The Authors #9:

      • The legend for the neuroimaging feature labels is missing, and the captions are incomplete.

      Response Please see Figure 2 above. We now revised by adding the letter L and R for the laterality of the brain images. We made some changes to the captions to make sure they are complete.

      Reviewer 2 Recommendations For The Authors #10:

      • Figure 5's caption: SD has a missing decimal point).

      Response The numbers are not SD. The numbers to the left of the figure represent the unique effects of chronological age in %, the numbers in the middle of the figure represent the common effects between chronological age and Brain Age index in %, and the numbers to the right of the figure represent the unique effects of Brain Age Index in %. We now used the same one decimal point for these number

      Reviewer #3 (Recommendations For The Authors):

      The main question of this article is as follows: “To what extent does having information on Brain Age improve our ability to capture declines in fluid cognition beyond knowing a person’s chronological age?” While this question is worthwhile, considering most of the field is confused about the nature of brain age, the authors are currently missing an opportunity to convey the inevitability of their results given how Brain Age and the Brain Age Gap are calculated. They also misleadingly convey that Brain Cognition is somehow superior to Brain Age. If the authors work on conveying the inevitability of their results and redo (or remove) their section on Brain Cognition, I can see how their results would be enlightening to the general neuroimaging community that is interested in the concept of brain age. See below for specific critiques.

      Response Please see our response to Reviewer 3 Public Review Overall. Note we no longer argue that Brain Cognition is superior to Brain Age (Reviewer 3 Public Review #4). Rather, we treated the capability of Brain Cognition in capturing fluid cognition as the upper limit of Brain Age’s capability in capturing fluid cognition. We used the unique effects of Brain Cognition that explain fluid cognition beyond Brain Age and chronological age to indicate how much Brain Age misses the variation in the brain MRI that could explain fluid cognition.

      Reviewer 3 Recommendations For The Authors #1:

      “There are many adjustments proposed to correct for this estimation bias” (p3) → Regression to the mean is not a sign of bias. Any decent loss function will result in over- predicting the age of younger individuals and under-predicting the age of older individuals. This is a direct result of minimizing an error term (e.g., mean squared error). Therefore, it is inappropriate to refer to regression to the mean as a sign of bias. This misconception has led to a great deal of inappropriate analyses, including “correcting” the brain age gap by regressing out age.

      Response Please see our response to Reviewer 3 Public Review#1

      Reviewer 3 Recommendations For The Authors #2:

      “Corrected Brain Age Gap in particular is viewed as being able to control for both age dependency and estimation biases (Butler et al., 2021).” (p3) → This summary is not accurate as Butler and colleagues did not use the words "corrected" and "biases" in this context. All that authors say in that paper is that regressing out age from the brain age gap - which is referred to as the modified brain age gap (MBAG) - makes it so that the modified brain age gap is not dependent on age, which is true. This metric is meaningless, though, because it is the variance left over after regressing out age from residuals from a model that was predicting age. If it were not for the fact that regression on residuals is not equivalent to multiple regression (and out of sample estimates), MBAG would be a vector of zeros. Upon reading your Methods, I noticed that you are using a metric for Le et al. (2018) for your “Corrected Brain Age Gap”. If they cite the Butler et al. (2021) paper, I highly recommend sticking with the same notation, metrics and terminology throughout. That would greatly help with the interpretability of your paper, and cross-comparisons between the two.

      Response Please see our response to Reviewer 3 Public Review #2.

      Reviewer 3 Recommendations For The Authors #3:

      “However, the improvement in predicting chronological age may not necessarily make Brain Age to be better at capturing Cognitionfluid. If, for instance, the age-prediction model had the perfect performance, Brian Age Gap would be exactly zero and would have no utility in capturing Cognitionfluid beyond chronological age.” (p3) → I largely agree with this statement. I would be really careful to distinguish between Brain Age and the Brain Age Gap here, as the former is a predicted value, and the latter is the residual times -1 (predicted age - age). Therefore, together they explain all of the variance in age. If you change the first sentence to refer to the Brain Age Gap, this statement makes more sense. The Brain Age Gap will never be exactly zero, though, even with perfect prediction on the training set, because subjects in the testing set are different from the subjects in the training set.

      Response Please see our response to Reviewer 3 Public Review #3.

      Reviewer 3 Recommendations For The Authors #4:

      “Can we further improve our ability to capture the decline in cognitionfluid by using, not only Brain Age and chronological age, but also another biomarker, Brain Cognition?” → This question is fundamentally getting at whether a predicted value of cognition can predict cognition. Assuming the brain parameters can predict cognition decently, and the original cognitive measure that you were predicting is related to your measure of fluid cognition, the answer should be yes. This seems like an uninteresting question to me. Upon reading your Methods, it became clear that the cognitive variable in the model predicting cognition using brain features (to get predicted cognition, or as you refer to it, Brain Cognition) is the same as the measure of fluid cognition that you are trying to assess how well Brain Cognition can predict. Assuming the brain parameters can predict fluid cognition at all, of course Brain Cognition will predict fluid cognition. This is inevitable. You should never use predicted values of a variable to predict the same variable.

      Response Please see our response to Reviewer 3 Public Review #4.

      Reviewer 3 Recommendations For The Authors #5:

      “We also examined if these better-performing age-prediction models improved the ability of Brain Age in explaining Cognitionfluid.” → Improved above and beyond what?

      Response We referred to if better-performing age-prediction models improved the ability of Brain Age in explaining fluid cognition over and above lower-performing age-prediction models. We made changes to the Introduction to clarify this change.

      Reviewer 3 Recommendations For The Authors #6:

      Figure 1 b & c → It is a little difficult to read the text by the horizontal bars in your plots. Please make the text smaller so that there is more space between the words vertically, or even better, make the plots slightly bigger. Please also put the predicted values on the y-axis. This is standard practice for displaying regression results. To make more room, you can get rid of your rPearson or your R2 plot, considering the latter is simply the square of the former. If you want to make it clear that the association is positive between all of your variables, I would keep rPearson.

      Response Thank you so much for the suggestions.

      1) We now made sure that the text by the horizontal bars in Figure 1b and c is readable.

      2) Note in prediction model/machine-learning literature, it is more common to plot observed/real values on the y-axis. Here is the logic of our practice: values in the x-axis are the predicted values based on the model, and we would like to see if the changes in the predicted values correspond to the changes in the observed/real value in the y-axis.

      3) Regarding Pearson correlation vs R2, please note that we wrote ”for R2, we used the sum of squares definition (i.e., R2 = 1 – (sum of squares residuals/total sum of squares)) per a previous recommendation (Poldrack et al., 2020).” As such, R2 is NOT the square of the Pearson correlation. In fact, in Poldrack and colleages’s “Establishment of Best Practices for Evidence for Prediction” paper (2020), they discourage 1) the use of Pearson correlation by itself and 2) the use of the correlation coefficient square as R2 (as opposed to sum of squares definition):

      “It is common in the literature to use the correlation between predicted and actual values as a measure of predictive performance; of the 64 studies in our literature review that performed prediction analyses on continuous outcomes, 30 reported such correlations as a measure of predictive performance. This reporting is problematic for several reasons. First, correlation is not sensitive to scaling of the data; thus, a high correlation can exist even when predicted values are discrepant from actual values. Second, correlation can sometimes be biased, particularly in the case of leave-one-out cross-validation. As demonstrated in Figure 4, the correlation between predicted and actual values can be strongly negative when no predictive information is present in the model. A further problem arises when the variance explained (R2) is incorrectly computed by squaring the correlation coefficient. Although this computation is appropriate when the model is obtained using the same data, it is not appropriate for out-of-sample testing23; instead, the amount of variance explained should be computed using the sum-of-squares formulation (as implemented in software packages such as scikit-learn).”

      “A further problem arises when the variance explained (R2) is incorrectly computed by squaring the correlation coefficient. Although this computation is appropriate when the model is obtained using the same data, it is not appropriate for out-of-sample testing23; instead, the amount of variance explained should be computed using the sum-of-squares formulation (as implemented in software packages such as scikit-learn).”

      Accordingly, we decided to keep both R2 and Pearson correlation (along with MAE) in our Figure 1.

      Reviewer 3 Recommendations For The Authors #7:

      Figure 2 “We calculated feature importance by, first, standardizing Elastic Net weights across brain features of each set of features from each test fold.” → What do you mean by “standardize” here? Rescale to be mean 0, variance 1? If so, this seems like a misleading transformation, because it gives the impression that the relationships are negative, when they are not necessarily. Also, why did you choose to use elastic net weights in any form as measures of effect size (or importance)? The raw values are inherently penalized, which means they are under-estimates of the true effect size. It would be more meaningful (and less biased) to plot the raw correlations.

      Response For the first question regarding standardisation, we addressed this issue in our response to Reviewer 1 Recommendations For The Authors #3. Briefly, we agreed with Reviewer 3 that standardisation (with mean = 0, SD = 1) might make it difficult to interpret the directionality of the coefficients. For visualising feature importance in the revised manuscript, we refitted the Elastic Net model to the full dataset without splitting them into five folds and visualised the coefficients on brain images (see below).

      For the second question regarding why using Elastic Net coefficients as feature importance (as opposed to correlations), we need to mention the goal of feature importance: to understand how the model makes a prediction based on different brain features (Molnar, 2019). Correlations between a target and each brain feature do not achieve this. Instead, they will show univariate/marginal relationships between a target and a brain feature. What we want to visualise is how the model made a prediction, which in the case of Elastic Net, the prediction is based on the sum of the features’ coefficients. In other words, the multivariate models (including Elastic Net) focus on marginal relationships that take into account all brain features within each set of features.

      Elastic Net coefficients can be considered as feature importance, such that more positive Elastic Net coefficients lead to more positive predicted values and, similarly, more negative Elastic Net coefficients lead to more negative predicted values (Molnar, 2019; Pat, Wang, Bartonicek, et al., 2022). While the magnitude of Elastic Net coefficients is regularised (thus making it difficult for us to interpret the magnitude itself directly), we could still indicate that a brain feature with a higher magnitude weights relatively stronger in making a prediction. Another benefit of Elastic Net as a penalised regression is that the coefficients are less susceptible to collinearity among features as they have already been regularised (Dormann et al., 2013; Pat, Wang, Bartonicek, et al., 2022).

      Reviewer 3 Recommendations For The Authors #8:

      Figure 3 → Again, what exactly do you mean by “standardised” here?

      Response It means mean subtraction followed by the division by an SD. Though we no longer applies standardisation for feature importance. See our response to Reviewer 1 Recommendations For The Authors #3 and Reviewer 3 Recommendations For The Authors #7.

      Reviewer 3 Recommendations For The Authors #9:

      “However, Brain Age Gap created from the lower-performing age-prediction models explained a higher amount of variation in Cognitionfluid. For instance, the top performing age-prediction model, “Stacked: All excluding Task Contrast”, generated Brain Age and Corrected Brain Age that explained the highest amount of variation in Cognitionfluid, but, at the same time, produced Brian Age Gap that explained the least amount of variation in Cognitionfluid.” (p7) → Yes, but you did not need to run any models to show this, considering it is an inevitable consequence of the following relationship between predicted values and residuals (or residuals times -1): 𝑦 = (𝑦 − 𝑦% ) + 𝑦% . Let’s say that age explains 60% of the variance in fluid cognition, and predicted age ( 𝑦% ) explains 40% of the variance in fluid cognition. Then the brain age gap (−(𝑦 − 𝑦% )) should explain 20% of the variance in fluid cognition. If by “Corrected Brain Age” you mean the modified predicted age from the Butler paper, the “Corrected Brain Age” result is inevitable because the modified predicted age is essentially just age with a tiny bit of noise added to it. From Figure 4, though, this does not seem to be the case, because the lower left quadrant in panel a should be flat and high (about as high as the predictive value of age for fluid cognition). So how are you calculating “Corrected Brain Age”? It looks like you might be regressing age out of Brain Age, though from your description the Methods (How exactly do you use the slope and intercept? You need equation of you are going to stick with this terminology), it is not totally clear. I highly recommend using terminology and metrics from the Butler et al. (2021) paper throughout to reduce confusion.

      Response Please see our response to Reviewer 3 Public Review #5

      Reviewer 3 Recommendations For The Authors #10:

      “On the contrary, an amount of variation in Cognitionfluid explained by Corrected Brain Age Gap was relatively small (maximum R2 = .041) across age-prediction models and did not relate to the predictive performance of the age-prediction models.” (p7) → If by “Corrected Brain Age Gap” you mean MBAG from The Butler paper, yes, this is also inevitable, considering MBAG would be a vector of zeros if it were not for regression on residuals (and out of sample estimates), as I mentioned earlier. Also, it is not clear why you used “on the contrary” as a transition here.

      Response Please see our response to Reviewer 3 Public Review #2 for the ‘MBAG’ term. Briefly, we didn’t use Butler and colleagues' (2021) MBAG, but rather we used the method described in de Lange and Cole’s (2020), which was called RBAG by Butler and colleagues.

      de Lange and Cole’s (2020) method, was commonly implemented elsewhere (Cole et al., 2020; Cumplido-Mayoral et al., 2023; Denissen et al., 2022). Accordingly, researchers who use Brain Age do not usually view this method as capturing a meaningless biomarker. Yet, the small effects of the Corrected Brain Age Gap in explaining fluid cognition of aging individuals found here are consistent with studies in older adults (Cole, 2020) and younger populations (Butler et al., 2021; Jirsaraie, Kaufmann, et al., 2023) (see our response to Reviewer 2 Recommendations For The Authors #1).

      “On the contrary” refers to the fact that the other three Brain Age indices (i.e., those that did not account for the relationship between Brain Age and chronological age) showed a much higher amount of variation in fluid cognition explained. As mentioned above (our response to Reviewer 2 Public Review #7), our argument resonates Butler and colleagues’ (2021) suggestion (p. 4097): “As such, it is critical that readers of past literature note whether or not age was controlled for when testing for effects on the BAG, as this has not always been common practice (e.g., Beheshti et al., 2018; Cole, Underwood, et al., 2017; Franke et al., 2015; Gaser et al., 2013; Liem et al., 2017; Nenadi c et al., 2017; Steffener et al., 2016)”.

      Reviewer 3 Recommendations For The Authors #11:

      “As before, the unique effects of Brain Age indices were all relatively small across the four Brain Age indices and across different prediction models.” (p10) → Yes, again, this is inevitable considering how they are calculated. You can show these analyses to demonstrate your results in data, if you want, but ignoring the inevitability given how these variables are calculated is misleading.

      Response Accounting for the relationship between Brain Age and chronological age when examining the utility of Brain Age is not misleading. Similar to previous recommendations (Butler et al., 2021; Le et al., 2018), we believe that not doing so is misleading. That is, without accounting for the relationship between Brain Age and chronological age, Brain Age will likely explain the same variation of the phenotype of interest as chronological age. Please see our response to Reviewer 3 Recommendations For The Authors #18 below.

      Reviewer 3 Recommendations For The Authors #12:

      “On the contrary, the unique effects of Brain Cognition appeared much larger.” (p10) → This is not a fair comparison if you don’t look at the unique effects above and beyond the cognitive variable you predicted (fluid cognition) in your Brain Cognition model. When you do this, you will see that Brain Cognition is useless when you include fluid cognition in the model, just as Brain Age would be in predicting age when you include age in the model. This highlights the fact that using predicted values of a metric to predict that metric is a pointless path to take, and that using a predicted value to predict anything is worse than using the value itself.

      Response Please see our response to Reviewer 3 Public Review #6.

      Reviewer 3 Recommendations For The Authors #13:

      “First, how much does Brain Age add to what is already captured by chronological age? The short answer is very little.” (p12) → This is a really important point, but your paper requires an in-depth discussion of the inevitability of this result, which I have discussed previously in this review.

      Response Please see our response to Reviewer 3 Public Review #7.

      Reviewer 3 Recommendations For The Authors #14:

      “Second, do better-performing age-prediction models improve the ability of Brain Age to capture Cognitionfluid? Unfortunately, the answer is no.” (p12) → You need to be clear that you are talking about above and beyond age here.

      Response Thank you so much for your suggestion. We now made the change to this sentence accordingly.

      Discussion

      “Second, do better-performing age-prediction models improve the utility of Brain Age to capture fluid cognition above and beyond chronological age? The answer is also no.”

      Reviewer 3 Recommendations For The Authors #15:

      “Third, do we have a solution that can improve our ability to capture Cognitionfluid from brain MRI? The answer is, fortunately, yes. Using Brain Cognition as a biomarker, along with chronological age, seemed to capture a higher amount of variation in Cognitionfluid than only using Brain Age.” (p12) → Again, try controlling for the cognitive measure you predicted in your Brain Cognition model. This will show that Brain Cognition is not useful above and beyond cognition, highlighting the fact that it is not a useful endeavor to be using predicted values.

      Response Please see our response to Reviewer 3 Public Review #8.

      Reviewer 3 Recommendations For The Authors #16:

      “Accordingly, a race to improve the performance of age-prediction models (Baecker et al., 2021) does not necessarily enhance the utility of Brain Age indices as a biomarker for Cognitionfluid. This calls for a new paradigm. Future research should aim to build prediction models for Brian Age indices that are not necessarily good at predicting age, but at capturing phenotypes of interest, such as Cognitionfluid and beyond.” (p13) → I whole-heartedly agree with the first two sentences, and strongly disagree with the last. Certainly your results, and the underlying reason as to why you found these results, calls for a new paradigm (or, one might argue, a pre-brain age paradigm). They do not, however, suggest that we should keep going down the Brain Age path. In fact, I think it should be abandoned all together. While it is difficult to prove that there is no transformation of Brain Age or the Brain Age Gap that will be useful, I am nearly sure this is true from the research I have done. Therefore, if you would like to suggest that the field should continue down this path, you need to present a very good case to support this view.

      Response Please see our response to Reviewer 3 Public Review #9.

      Reviewer 3 Recommendations For The Authors #17:

      “Perhaps this is because the estimation of the influences of chronological age was done in the training set.” (p13) → I believe this is the case, and it is testable. Try re-running your analyses where parameters are estimated and performance is evaluated on the same data.

      Response Yes, we agreed with this. Based on the equations we used, this is inevitable.

      Reviewer 3 Recommendations For The Authors #18:

      “Similar to a previous recommendation (Butler et al., 2021), we suggest focusing on Corrected Brain Age Gap.” (p13) → To be clear, the authors did not use the term “Corrected” because it is very misleading. The authors also did not suggest that we proceed with any brain age metric; rather they mentioned that the modified brain age gap is independent of age. Note the following passage: “Further, the interpretability of the modified brain age gap (MBAG) itself is limited by the fact that it is a prediction error from a regression to remove the effects of age from a residual obtained through a regression to predict age. By virtue of these limitations, we suggest that the modified version may not provide useful information about precocity or delay in brain development. In light of this, as well as the complexities associated with interpretations of the BAG and its dependence on age, we suggest that further methodological and theoretical work is warranted.” I recognize that that this statement is hedged, as is often required in the publication process, but I am all but certain that MBAG/BAG/modified predicted age are useless constructs. Therefore, if you are going to suggest that people continue to use them, opposed to suggesting that further methodological or theoretical work is warranted, you need to make a strong case, which you did not try to make here. If anything, your results support abandoning the age- prediction endeavor altogether.

      Response Please see our response to Reviewer 3 Public Review #2 for the term. Briefly, we didn’t use Butler and colleagues’ (2021) MBAG, but rather RBAG. This index was originally described in de Lange and Cole’s (2020), and has now been implemented elsewhere (Cole et al., 2020; Cumplido-Mayoral et al., 2023; Denissen et al., 2022).

      We do not intend to encourage people to abandon the Brain Age endeavour altogether. However, we made main three suggestions for future research on Brain Age to ensure its utility. First, they should account for the relationship between Brain Age and chronological age either using Corrected Brain Age Gap (or other similar adjustments) or, better, examining the unique effects of Brain Age indices after controlling for chronological age through commonality analyses (see below). This is similar to the suggestion made by Le and colleagues (2018) and later rephased by Butler and colleagues (2021). More specifically, Le and colleagues (2018) mentioned (p. 10): “Based on our observations in both real and simulated data, we recommend that the relationship between chronological age and BrainAGE should be accounted for. The two methods proposed in this study are either: (1) regress age on BrainAGE, producing BrainAGER, which is centered on 0 regardless of a participant's actual age or (2) include age as a regressor when doing follow-up analyses.”

      Second, we suggested that researchers should not select age-prediction models based solely on age-prediction performance (see our response to Reviewer 1 Recommendations For The Authors #1).

      Third, we suggested that researchers should test how much Brain Age miss the variation in the brain MRI that could explain fluid cognition or other phenotypes of interest (see our response to Reviewer 2 Public Review #4).

      Discussion

      “What does it mean then for researchers/clinicians who would like to use Brain Age as a biomarker? First, they have to be aware of the overlap in variation between Brain Age and chronological age and should focus on the contribution of Brain Age over and above chronological age. Using Brain Age Gap will not fix this. Butler and colleagues (2021) recently highlighted this point, “These results indicate that the association between cognition and the BAG are driven by the association between age and cognitive performance. As such, it is critical that readers of past literature note whether or not age was controlled for when testing for effects on the BAG, as this has not always been common practice (p. 4097).” Similar to previous recommendations (Butler et al., 2021; Le et al., 2018), we suggest future work should account for the relationship between Brain Age and chronological age, either using Corrected Brain Age Gap (or other similar adjustments) or, better, examining unique effects of Brain Age indices after controlling for chronological age through commonality analyses. Note we prefer using unique effects over beta estimates from multiple regressions, given that unique effects do not change as a function of collinearity among regressors (Ray-Mukherjee et al., 2014). In our case, Brain Age indices had the same unique effects regardless of the level of common effects they had with chronological age (e.g., Brain Age vs. Corrected Brain Age Gap from stacked models). In the case of fluid cognition, the unique effects might be too small to be clinically meaningful as shown here and previously (Butler et al., 2021; Cole, 2020; Jirsaraie, Kaufmann, et al., 2023).”

      Reviewer 3 Recommendations For The Authors #19:

      “To compute Brain Age and Brain Cognition, we ran two separate prediction models. These prediction models either had chronological age or Cognitionfluid as the target.” (p16) → You should make it clear in the main text of your paper that the cognition variable in your Brain Cognition models is the same as what you refer to as Cognitionfluid. Some of your analyses would have been much more reasonable if you had two different measures of cognition.

      Response Thank you so much for the suggestion. We believe, given the re-conceptualisation of Brain Cognition as the main text

      Introduction

      “certain variation in the brain MRI is related to fluid cognition, but to what extent does Brain Age not capture this variation? To estimate the variation in the brain MRI that is related to fluid cognition, we could build prediction models that directly predict fluid cognition (i.e., as opposed to chronological age) from brain MRI data.”

      Reviewer 3 Recommendations For The Authors #20:

      “We controlled for the potential influences of biological sex on the brain features by first residualizing biological sex from brain features in the training set.” (p16) → Why? Your question is about prediction, not causal inference.

      Response While the question is about prediction, we still would like to, as much as possible, be confident about what kind of information we drew from. Here we focused on brain data and controlled for other variables that might not be neuronal. For instance, we controlled for movement and physiological noise using ICA-FIX (Glasser et al., 2016). Following conventional practices in brain-based predictive modelling, we also treated biological sex as another sort of noise (Vieira et al., 2022). The difference between movement/physiological noise and biological sex is that the former varies across TRs, and the latter varies across individuals. Thus we controlled for movement and physiological noise within each participant and controlled for biological sex within a group of participants who belonged to the same training set.

      Reviewer 3 Recommendations For The Authors #20:

      “Lastly, we computer Corrected Brain Age Gap by subtracting the chronological age from the Corrected Brain Age (Butler et al., 2021; Le et al., 2018).” (p17) → The modified brain age gap in that paper is the residuals from regressing BAG on age (see equation 6). I highly recommend using that terminology and notation throughout to provide consistency and interpretability across papers.

      Response Please see our response to Reviewer 3 Public Review #2 for the term.

      Reviewer 3 Recommendations For The Authors #21: Equations (pgs 17-19) → Please use statistical notation instead of pseudo-R code.

      Response We rewrote all of the equations using statistical notations.

      References

      Abraham, A., Pedregosa, F., Eickenberg, M., Gervais, P., Mueller, A., Kossaifi, J., Gramfort, A., Thirion, B., & Varoquaux, G. (2014). Machine learning for neuroimaging with scikit-learn. Frontiers in Neuroinformatics, 8, 14. https://doi.org/10.3389/fninf.2014.00014

      Ances, B. M., Liang, C. L., Leontiev, O., Perthen, J. E., Fleisher, A. S., Lansing, A. E., & Buxton, R. B. (2009). Effects of aging on cerebral blood flow, oxygen metabolism, and blood oxygenation level dependent responses to visual stimulation. Human Brain Mapping, 30(4), 1120–1132. https://doi.org/10.1002/hbm.20574

      Bashyam, V. M., Erus, G., Doshi, J., Habes, M., Nasrallah, I. M., Truelove-Hill, M., Srinivasan, D., Mamourian, L., Pomponio, R., Fan, Y., Launer, L. J., Masters, C. L., Maruff, P., Zhuo, C., Völzke, H., Johnson, S. C., Fripp, J., Koutsouleris, N., Satterthwaite, T. D., … on behalf of the ISTAGING Consortium, the P. A. disease C., ADNI, and CARDIA studies. (2020). MRI signatures of brain age and disease over the lifespan based on a deep brain network and 14 468 individuals worldwide. Brain, 143(7), 2312–2324. https://doi.org/10.1093/brain/awaa160

      Beheshti, I., Nugent, S., Potvin, O., & Duchesne, S. (2019). Bias-adjustment in neuroimaging-based brain age frameworks: A robust scheme. NeuroImage: Clinical, 24, 102063. https://doi.org/10.1016/j.nicl.2019.102063

      Bookheimer, S. Y., Salat, D. H., Terpstra, M., Ances, B. M., Barch, D. M., Buckner, R. L., Burgess, G. C., Curtiss, S. W., Diaz-Santos, M., Elam, J. S., Fischl, B., Greve, D. N., Hagy, H. A., Harms, M. P., Hatch, O. M., Hedden, T., Hodge, C., Japardi, K. C., Kuhn, T. P., … Yacoub, E. (2019). The Lifespan Human Connectome Project in Aging: An overview. NeuroImage, 185, 335–348. https://doi.org/10.1016/j.neuroimage.2018.10.009

      Butler, E. R., Chen, A., Ramadan, R., Le, T. T., Ruparel, K., Moore, T. M., Satterthwaite, T. D., Zhang, F., Shou, H., Gur, R. C., Nichols, T. E., & Shinohara, R. T. (2021). Pitfalls in brain age analyses. Human Brain Mapping, 42(13), 4092–4101. https://doi.org/10.1002/hbm.25533 Choi, S. W., Mak, T. S.-H., & O’Reilly, P. F. (2020). Tutorial: A guide to performing polygenic risk score analyses. Nature Protocols, 15(9), Article 9. https://doi.org/10.1038/s41596-020-0353-1

      Cole, J. H. (2020). Multimodality neuroimaging brain-age in UK biobank: Relationship to biomedical, lifestyle, and cognitive factors. Neurobiology of Aging, 92, 34–42. https://doi.org/10.1016/j.neurobiolaging.2020.03.014

      Cole, J. H., Raffel, J., Friede, T., Eshaghi, A., Brownlee, W. J., Chard, D., De Stefano, N., Enzinger, C., Pirpamer, L., Filippi, M., Gasperini, C., Rocca, M. A., Rovira, A., Ruggieri, S., Sastre-Garriga, J., Stromillo, M. L., Uitdehaag, B. M. J., Vrenken, H., Barkhof, F., … Group, M. study. (2020). Longitudinal Assessment of Multiple Sclerosis with the Brain-Age Paradigm. Annals of Neurology, 88(1), 93–105. https://doi.org/10.1002/ana.25746

      Cumplido-Mayoral, I., García-Prat, M., Operto, G., Falcon, C., Shekari, M., Cacciaglia, R., Milà-Alomà, M., Lorenzini, L., Ingala, S., Meije Wink, A., Mutsaerts, H. J., Minguillón, C., Fauria, K., Molinuevo, J. L., Haller, S., Chetelat, G., Waldman, A., Schwarz, A. J., Barkhof, F., … OASIS study. (2023). Biological brain age prediction using machine learning on structural neuroimaging data: Multi-cohort validation against biomarkers of Alzheimer’s disease and neurodegeneration stratified by sex. ELife, 12, e81067. https://doi.org/10.7554/eLife.81067

      de Lange, A.-M. G., & Cole, J. H. (2020). Commentary: Correction procedures in brain-age prediction. NeuroImage: Clinical, 26, 102229. https://doi.org/10.1016/j.nicl.2020.102229

      Demontis, D., Walters, R. K., Martin, J., Mattheisen, M., Als, T. D., Agerbo, E., Baldursson, G., Belliveau, R., Bybjerg-Grauholm, J., Bækvad-Hansen, M., Cerrato, F., Chambert, K., Churchhouse, C., Dumont, A., Eriksson, N., Gandal, M., Goldstein, J. I., Grasby, K. L., Grove, J., … Neale, B. M. (2019). Discovery of the first genome-wide significant risk loci for attention deficit/hyperactivity disorder. Nature Genetics, 51(1), Article 1. https://doi.org/10.1038/s41588-018-0269-7

      Denissen, S., Engemann, D. A., De Cock, A., Costers, L., Baijot, J., Laton, J., Penner, I., Grothe, M., Kirsch, M., D’hooghe, M. B., D’Haeseleer, M., Dive, D., De Mey, J., Van Schependom, J., Sima, D. M., & Nagels, G. (2022). Brain age as a surrogate marker for cognitive performance in multiple sclerosis. European Journal of Neurology, 29(10), 3039–3049. https://doi.org/10.1111/ene.15473

      Dormann, C. F., Elith, J., Bacher, S., Buchmann, C., Carl, G., Carré, G., Marquéz, J. R. G., Gruber, B., Lafourcade, B., Leitão, P. J., Münkemüller, T., McClean, C., Osborne, P. E., Reineking, B., Schröder, B., Skidmore, A. K., Zurell, D., & Lautenbach, S. (2013). Collinearity: A review of methods to deal with it and a simulation study evaluating their performance. Ecography, 36(1), 27–46. https://doi.org/10.1111/j.1600-0587.2012.07348.x

      Dubois, J., Galdi, P., Paul, L. K., & Adolphs, R. (2018). A distributed brain network predicts general intelligence from resting-state human neuroimaging data. Philosophical Transactions of the Royal Society B: Biological Sciences, 373(1756), 20170284. https://doi.org/10.1098/rstb.2017.0284

      Elliott, M. L., Knodt, A. R., Cooke, M., Kim, M. J., Melzer, T. R., Keenan, R., Ireland, D., Ramrakha, S., Poulton, R., Caspi, A., Moffitt, T. E., & Hariri, A. R. (2019). General functional connectivity: Shared features of resting-state and task fMRI drive reliable and heritable individual differences in functional brain networks. NeuroImage, 189, 516–532. https://doi.org/10.1016/j.neuroimage.2019.01.068

      Fair, D. A., Schlaggar, B. L., Cohen, A. L., Miezin, F. M., Dosenbach, N. U. F., Wenger, K. K., Fox, M. D., Snyder, A. Z., Raichle, M. E., & Petersen, S. E. (2007). A method for using blocked and event-related fMRI data to study “resting state” functional connectivity. NeuroImage, 35(1), 396–405. https://doi.org/10.1016/j.neuroimage.2006.11.051

      Fischl, B., Salat, D. H., Busa, E., Albert, M., Dieterich, M., Haselgrove, C., van der Kouwe, A., Killiany, R., Kennedy, D., Klaveness, S., Montillo, A., Makris, N., Rosen, B., & Dale, A. M. (2002). Whole Brain Segmentation. Neuron, 33(3), 341–355. https://doi.org/10.1016/S0896-6273(02)00569-X

      Franke, K., & Gaser, C. (2019). Ten Years of BrainAGE as a Neuroimaging Biomarker of Brain Aging: What Insights Have We Gained? Frontiers in Neurology, 10, 789. https://doi.org/10.3389/fneur.2019.00789

      Glasser, M. F., Smith, S. M., Marcus, D. S., Andersson, J. L. R., Auerbach, E. J., Behrens, T. E. J., Coalson, T. S., Harms, M. P., Jenkinson, M., Moeller, S., Robinson, E. C., Sotiropoulos, S. N., Xu, J., Yacoub, E., Ugurbil, K., & Van Essen, D. C. (2016). The Human Connectome Project’s neuroimaging approach. Nature Neuroscience, 19(9), 1175–1187. https://doi.org/10.1038/nn.4361

      Glasser, M. F., Sotiropoulos, S. N., Wilson, J. A., Coalson, T. S., Fischl, B., Andersson, J. L., Xu, J., Jbabdi, S., Webster, M., Polimeni, J. R., Van Essen, D. C., & Jenkinson, M. (2013). The minimal preprocessing pipelines for the Human Connectome Project. NeuroImage, 80, 105–124. https://doi.org/10.1016/j.neuroimage.2013.04.127

      Gordon, E. M., Laumann, T. O., Adeyemo, B., Huckins, J. F., Kelley, W. M., & Petersen, S. E. (2016). Generation and Evaluation of a Cortical Area Parcellation from Resting-State Correlations. Cerebral Cortex, 26(1), 288–303. https://doi.org/10.1093/cercor/bhu239

      Gratton, C., Laumann, T. O., Nielsen, A. N., Greene, D. J., Gordon, E. M., Gilmore, A. W., Nelson, S. M., Coalson, R. S., Snyder, A. Z., Schlaggar, B. L., Dosenbach, N. U. F., & Petersen, S. E. (2018). Functional Brain Networks Are Dominated by Stable Group and Individual Factors, Not Cognitive or Daily Variation. Neuron, 98(2), 439-452.e5. https://doi.org/10.1016/j.neuron.2018.03.035

      Harms, M. P., Somerville, L. H., Ances, B. M., Andersson, J., Barch, D. M., Bastiani, M., Bookheimer, S. Y., Brown, T. B., Buckner, R. L., Burgess, G. C., Coalson, T. S., Chappell, M. A., Dapretto, M., Douaud, G., Fischl, B., Glasser, M. F., Greve, D. N., Hodge, C., Jamison, K. W., … Yacoub, E. (2018). Extending the Human Connectome Project across ages: Imaging protocols for the Lifespan Development and Aging projects. NeuroImage, 183, 972–984. https://doi.org/10.1016/j.neuroimage.2018.09.060

      Horien, C., Noble, S., Greene, A. S., Lee, K., Barron, D. S., Gao, S., O’Connor, D., Salehi, M., Dadashkarimi, J., Shen, X., Lake, E. M. R., Constable, R. T., & Scheinost, D. (2020). A hitchhiker’s guide to working with large, open-source neuroimaging datasets. Nature Human Behaviour, 5(2), 185–193. https://doi.org/10.1038/s41562-020-01005-4

      Jirsaraie, R. J., Gorelik, A. J., Gatavins, M. M., Engemann, D. A., Bogdan, R., Barch, D. M., & Sotiras, A. (2023). A systematic review of multimodal brain age studies: Uncovering a divergence between model accuracy and utility. Patterns, 4(4), 100712. https://doi.org/10.1016/j.patter.2023.100712

      Jirsaraie, R. J., Kaufmann, T., Bashyam, V., Erus, G., Luby, J. L., Westlye, L. T., Davatzikos, C., Barch, D. M., & Sotiras, A. (2023). Benchmarking the generalizability of brain age models: Challenges posed by scanner variance and prediction bias. Human Brain Mapping, 44(3), 1118–1128. https://doi.org/10.1002/hbm.26144

      Khojaste-Sarakhsi, M., Haghighi, S. S., Ghomi, S. M. T. F., & Marchiori, E. (2022). Deep learning for Alzheimer’s disease diagnosis: A survey. Artificial Intelligence in Medicine, 130, 102332. https://doi.org/10.1016/j.artmed.2022.102332

      Le, T. T., Kuplicki, R. T., McKinney, B. A., Yeh, H.-W., Thompson, W. K., Paulus, M. P., Tulsa 1000 Investigators, Aupperle, R. L., Bodurka, J., Cha, Y.-H., Feinstein, J. S., Khalsa, S. S., Savitz, J., Simmons, W. K., & Victor, T. A. (2018). A Nonlinear Simulation Framework Supports Adjusting for Age When Analyzing BrainAGE. Frontiers in Aging Neuroscience, 10. https://www.frontiersin.org/articles/10.3389/fnagi.2018.00317

      Liang, H., Zhang, F., & Niu, X. (2019). Investigating systematic bias in brain age estimation with application to post-traumatic stress disorders. Human Brain Mapping, 40(11), 3143–3152. https://doi.org/10.1002/hbm.24588

      Luby, J. L. (2010). Preschool Depression: The Importance of Identification of Depression Early in Development. Current Directions in Psychological Science, 19(2), 91–95. https://doi.org/10.1177/0963721410364493

      Molnar, C. (2019). Interpretable Machine Learning. A Guide for Making Black Box Models Explainable. https://christophm.github.io/interpretable-ml-book/

      Nimon, K., Lewis, M., Kane, R., & Haynes, R. M. (2008). An R package to compute commonality coefficients in the multiple regression case: An introduction to the package and a practical example. Behavior Research Methods, 40(2), 457–466. https://doi.org/10.3758/BRM.40.2.457

      Pat, N., Wang, Y., Anney, R., Riglin, L., Thapar, A., & Stringaris, A. (2022). Longitudinally stable, brain‐based predictive models mediate the relationships between childhood cognition and socio‐demographic, psychological and genetic factors. Human Brain Mapping, hbm.26027. https://doi.org/10.1002/hbm.26027

      Pat, N., Wang, Y., Bartonicek, A., Candia, J., & Stringaris, A. (2022). Explainable machine learning approach to predict and explain the relationship between task-based fMRI and individual differences in cognition. Cerebral Cortex, bhac235. https://doi.org/10.1093/cercor/bhac235

      Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., & Duchesnay, É. (2011). Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research, 12(85), 2825–2830.

      Poldrack, R. A., Huckins, G., & Varoquaux, G. (2020). Establishment of Best Practices for Evidence for Prediction: A Review. JAMA Psychiatry, 77(5), 534–540. https://doi.org/10.1001/jamapsychiatry.2019.3671

      Rasero, J., Sentis, A. I., Yeh, F.-C., & Verstynen, T. (2021). Integrating across neuroimaging modalities boosts prediction accuracy of cognitive ability. PLOS Computational Biology, 17(3), e1008347. https://doi.org/10.1371/journal.pcbi.1008347

      Ray-Mukherjee, J., Nimon, K., Mukherjee, S., Morris, D. W., Slotow, R., & Hamer, M. (2014). Using commonality analysis in multiple regressions: A tool to decompose regression effects in the face of multicollinearity. Methods in Ecology and Evolution, 5(4), 320–328. https://doi.org/10.1111/2041-210X.12166

      Robinson, E. C., Garcia, K., Glasser, M. F., Chen, Z., Coalson, T. S., Makropoulos, A., Bozek, J., Wright, R., Schuh, A., Webster, M., Hutter, J., Price, A., Cordero Grande, L., Hughes, E., Tusor, N., Bayly, P. V., Van Essen, D. C., Smith, S. M., Edwards, A. D., … Rueckert, D. (2018). Multimodal surface matching with higher-order smoothness constraints. NeuroImage, 167, 453–465. https://doi.org/10.1016/j.neuroimage.2017.10.037

      Rokicki, J., Wolfers, T., Nordhøy, W., Tesli, N., Quintana, D. S., Alnæs, D., Richard, G., de Lange, A.-M. G., Lund, M. J., Norbom, L., Agartz, I., Melle, I., Nærland, T., Selbæk, G., Persson, K., Nordvik, J. E., Schwarz, E., Andreassen, O. A., Kaufmann, T., & Westlye, L. T. (2021). Multimodal imaging improves brain age prediction and reveals distinct abnormalities in patients with psychiatric and neurological disorders. Human Brain Mapping, 42(6), 1714–1726. https://doi.org/10.1002/hbm.25323

      Satterthwaite, T. D., Connolly, J. J., Ruparel, K., Calkins, M. E., Jackson, C., Elliott, M. A., Roalf, D. R., Hopson, R., Prabhakaran, K., Behr, M., Qiu, H., Mentch, F. D., Chiavacci, R., Sleiman, P. M. A., Gur, R. C., Hakonarson, H., & Gur, R. E. (2016). The Philadelphia Neurodevelopmental Cohort: A publicly available resource for the study of normal and abnormal brain development in youth. NeuroImage, 124, 1115–1119. https://doi.org/10.1016/j.neuroimage.2015.03.056

      Smith, S. M., Vidaurre, D., Alfaro-Almagro, F., Nichols, T. E., & Miller, K. L. (2019). Estimation of brain age delta from brain imaging. NeuroImage, 200, 528–539. https://doi.org/10.1016/j.neuroimage.2019.06.017

      Somerville, L. H., Bookheimer, S. Y., Buckner, R. L., Burgess, G. C., Curtiss, S. W., Dapretto, M., Elam, J. S., Gaffrey, M. S., Harms, M. P., Hodge, C., Kandala, S., Kastman, E. K., Nichols, T. E., Schlaggar, B. L., Smith, S. M., Thomas, K. M., Yacoub, E., Van Essen, D. C., & Barch, D. M. (2018). The Lifespan Human Connectome Project in Development: A large-scale study of brain connectivity development in 5–21 year olds. NeuroImage, 183, 456–468. https://doi.org/10.1016/j.neuroimage.2018.08.050

      Sperling, R. A., Bates, J. F., Cocchiarella, A. J., Schacter, D. L., Rosen, B. R., & Albert, M. S. (2001). Encoding novel face-name associations: A functional MRI study. Human Brain Mapping, 14(3), 129–139. https://doi.org/10.1002/hbm.1047

      Sripada, C., Angstadt, M., Rutherford, S., Kessler, D., Kim, Y., Yee, M., & Levina, E. (2019). Basic Units of Inter-Individual Variation in Resting State Connectomes. Scientific Reports, 9(1), Article 1. https://doi.org/10.1038/s41598-018-38406-5

      Sripada, C., Angstadt, M., Rutherford, S., Taxali, A., & Shedden, K. (2020). Toward a “treadmill test” for cognition: Improved prediction of general cognitive ability from the task activated brain. Human Brain Mapping, 41(12), 3186–3197. https://doi.org/10.1002/hbm.25007

      Stigler, S. M. (1997). Regression towards the mean, historically considered. Statistical Methods in Medical Research, 6(2), 103–114. https://doi.org/10.1177/096228029700600202

      Sudlow, C., Gallacher, J., Allen, N., Beral, V., Burton, P., Danesh, J., Downey, P., Elliott, P., Green, J., Landray, M., Liu, B., Matthews, P., Ong, G., Pell, J., Silman, A., Young, A., Sprosen, T., Peakman, T., & Collins, R. (2015). UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age. PLOS Medicine, 12(3), e1001779. https://doi.org/10.1371/journal.pmed.1001779

      Tetereva, A., Li, J., Deng, J. D., Stringaris, A., & Pat, N. (2022). Capturing brain‐cognition relationship: Integrating task‐based fMRI across tasks markedly boosts prediction and test‐retest reliability. NeuroImage, 263, 119588. https://doi.org/10.1016/j.neuroimage.2022.119588

      Vieira, B. H., Pamplona, G. S. P., Fachinello, K., Silva, A. K., Foss, M. P., & Salmon, C. E. G. (2022). On the prediction of human intelligence from neuroimaging: A systematic review of methods and reporting. Intelligence, 93, 101654. https://doi.org/10.1016/j.intell.2022.101654

      Vos De Wael, R., Benkarim, O., Paquola, C., Lariviere, S., Royer, J., Tavakol, S., Xu, T., Hong, S.-J., Langs, G., Valk, S., Misic, B., Milham, M., Margulies, D., Smallwood, J., & Bernhardt, B. C. (2020). BrainSpace: A toolbox for the analysis of macroscale gradients in neuroimaging and connectomics datasets. Communications Biology, 3(1), 103. https://doi.org/10.1038/s42003-020-0794-7

      Woolrich, M. W., Ripley, B. D., Brady, M., & Smith, S. M. (2001). Temporal Autocorrelation in Univariate Linear Modeling of FMRI Data. NeuroImage, 14(6), 1370–1386. https://doi.org/10.1006/nimg.2001.0931

      Zou, H., & Hastie, T. (2005). Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 67(2), 301–320. https://doi.org/10.1111/j.1467-9868.2005.00503.x

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The authors describe a broad-scale phylogenetic survey of chemokine-related ligand and receptors from representative vertebrates, invertebrates, and viruses. They collect ligand and receptor sequences from available genome sequences, and use phylogenetic and CLANS analysis to group these into similar gene types. They then overlay these onto a validated species phylogeny in order to evaluate relationships of orthology and paralogy to pinpoint gene duplication and loss events. They carry out these analyses for canonical chemokine ligands receptors and for other closely related protein families. They conclude that the canonical chemokine system is restricted to vertebrates but that closely related ligands and receptors can be found in invertebrate chordates. More divergent but related gene systems are found in more distant invertebrates. They define more limited expansions of some ligand-receptor systems in certain jawed vertebrate groups and specifically in mammals.

      Overall, the paper addresses a complex and important system of signaling proteins with a rigorous and comprehensive set of analyses. The finding will be of interest to a diverse group of scientists. My comments listed below mainly consist of suggestions to help clarify the presentation.

      1. Pg 2, Lns 21-24: The canonical and non-canonical chemokine subclasses are introduced in the abstract without definition. A very brief explanation would be useful.

      We've included a brief description of "non-canonical" components in the abstract (lines 21-24). These non-canonical components fall into at least one of three categories: 1) molecules with sequence similarities to canonical components, 2) those that bind to a canonical component (either ligand or receptor), 3) those involved in chemokine-like functions, such as chemoattraction. More comprehensive explanations and examples of these non-canonical components are provided in the Introduction section.

      1. Some general contexts of chemokine functions are listed, including inflammation and homeostasis. A little more detail of how these signals are used and the molecular consequences of signaling may be useful in the introduction to set the biological context of the analysis (e.g., how do the signals regulate homeostasis?).

      We have added at the beginning of the introduction (lines 39 – 46) some details of how chemokine signalling typically occurs at a mechanistic level. We also provided few examples of homeostatic functions regulated by chemokine signalling and clarified different expression strategies for inflammatory versus homeostatic chemokines.

      It may help to summarize the known chemokine and chemokine-related gene systems in some type of table at the beginning of the results. This could serve as a convenient reference to guide the reader through the more detailed results. The manuscript addresses a complex set of ligands and receptors with names that may be confusing to the non-expert.

      We agree with the reviewer on this and moved Table S1 to the main text (now Table 1). This table contains all the information on ligands, receptors, and relative citations (lines 741-744).

      Pg 5, Ln 98: Fig 1C is introduced before Fig 1B. Can the panels be switched or the descriptions be rearranged?

      We have switched the panels in Figure 1. Now, Figure 1A and 1B refer to CLANS analyses and Figure 1C and 1D refer to phylogenetic trees of ligand groups. We have corrected all the references in the main text and in Figure 1 caption. Now the panels are mentioned in the correct alphabetical order within the text.

      Cytokine and chemokine ligands are small proteins that diverge quickly in different species and are difficult to identify in divergent genomes even within vertebrates. Conclusions about the absence of these types of factors are notorious for being disproven in subsequent analyses. Some discussion of what may have been missed in the survey for homologs (or reasons to think that ligands were not missed) would be useful in the Discussion.

      We concur with the reviewer's observation, and we used three distinct strategies to address the issue:

      1. E-value Threshold Adjustment: Initially, we utilized a relatively low e-value threshold of These three strategies collectively contribute to a more robust and comprehensive approach to address the challenges associated with the bioinformatic identification of canonical and non-canonical chemokines. We briefly mentioned the technical difficulty of working with short sequences in our Introduction (lines 75-76).

      Reviewer #1 (Significance (Required)):

      This paper presents a thorough analysis of chemokines and related gene systems across a wide phylogenetic landscape. The authors have expertise in these gene families and in the techniques that they use to identify and relate family members. The chemokines are an important set of signals that are used across several biological systems. These findings will be of wide interest to immunologists, neurobiologists, developmental and evolutionary biologists.

      We thank reviewer 1 for their comments – they have been very valuable to improve our manuscript.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      This paper applies phylogenetic clustering methods to a large taxonomical sampling to interrogate the relationship between canonical and non-canonical chemokine ligands and receptors. The results suggest that 1) unrelated proteins evolved "chemokine-like" ligand function multiple times independently; and 2) all the canonical and non-canonical chemokine receptors (except ACKR1) originated from a single duplication in the vertebrate stem group, which also gave rise to many GPCRs. In addition, the authors characterized the complement of canonical and non-canonical components in the common ancestor of vertebrates and identified several other ligands and receptors with potential chemokine related properties.

      Comments: 1. There are many places in the paper, too many to list, where the authors refer to chemokine receptors but call them 'chemokines'.

      We have corrected this oversight throughout the manuscript.

      In Figure 1, CX3CL is referred to as 'X3CL'

      We have corrected this. Now CX3CL is referred to correctly in Figure 1. We also found that it was incorrectly spelt in Figure 2 as well and corrected it there too.

      1. CXCL17 was originally reported to be chemokine-like based on sequence threading methods. The authors refer to a 2015 paper indicating that it has chemokine-like activity at GPR35, which had been renamed provisionally CXCR8. To my knowledge that result was not based on direct binding data but inferred from a functional response. Moreover, to my knowledge it has not been independently confirmed. Instead there is a recent paper in JI from the Pease lab showing extensive experimental results that fail to demonstrate CXCL17 activity at GPR35. This uncertainty regarding a potential mistake in the literature should be addressed and integrated in the points made about CXCL17 being an outlier.

      We thank the reviewer for pointing this out. To account for this suggestion, we have modified the text as follows:

      Lines 105-108: “The distinction between CXCL17 and all other canonical chemokines is consistent with our receptor results showing that the potential receptor for CXCL17, GPR35 (41), is also not within the canonical chemokine receptor group (see below). Although it is important to note that recent studies fail to demonstrate CXCL17 activity at GPR35 (42, 43).”

      Lines 240-241: “Another orphan GPCR, GPR35, had been proposed as a potential chemokine receptor (41); however, this was later questioned (42, 43) and GPR35 is still generally considered orphan (55–57).”

      Lines 312-315: “CXCL17 is mammal-specific and likely unrelated to canonical chemokines (similar to its controversial putative receptor, GPR35 (41-43), that is not a canonical chemokine receptor).”

      References: [41] J. L. Maravillas-Montero, et al., Cutting Edge: GPR35/CXCR8 Is the Receptor of the Mucosal Chemokine CXCL17. The Journal of Immunology 194, 29–33 (2015).

      [42] S.-J. Park, S.-J. Lee, S.-Y. Nam, D.-S. Im, GPR35 mediates lodoxamide-induced migration inhibitory response but not CXCL17-induced migration stimulatory response in THP-1 cells; is GPR35 a receptor for CXCL17? British Journal of Pharmacology 175, 154–161 (2018).

      [43] N. A. S. B. M. Amir, et al., Evidence for the Existence of a CXCL17 Receptor Distinct from GPR35. The Journal of Immunology 201, 714–724 (2018).

      [55] S. Xiao, W. Xie, L. Zhou, Mucosal chemokine CXCL17: What is known and not known. Scandinavian Journal of Immunology 93, e12965 (2021).

      [56] S. P. Giblin, J. E. Pease, What defines a chemokine? – The curious case of CXCL17. Cytokine 168, 156224 (2023).

      [57] J. Duan, et al., Insights into divalent cation regulation and G13-coupling of orphan receptor GPR35. Cell Discov 8, 1–12 (2022).

      Can the authors use alpha fold to address whether any of these non-canonical molecules actually is predicted to fold like a chemokine? More generally, based on the paper's analysis, how do the authors propose to define a chemokine? It is well-accepted that chemokines are defined by structure, not function (e.g. limited truncation of any chemokine abrogates activity, but it is still a chemokine structurally, not semantically, folds like a chemokine, aligns with other chemokines).

      In response to the recommendation from reviewer 2 to incorporate AlphaFold data, we leveraged AFDB Clusters (foldseek.com), a recently developed tool that clustered over 200 million Uniprot proteins based on their predicted AlphaFold structures (as described in this Nature paper: https://www.nature.com/articles/s41586-023-06510-w). We utilised this pre-computed dataset of clustered proteins to query with representative human proteins, both canonical and non-canonical chemokine ligands, and the results are summarised in the table below. Notably, we observed that canonical chemokines were distributed across different AlphaFold clusters, each corresponding to different ligand types (e.g., CC and CXC). Interestingly, despite this, all these clusters exhibited similar descriptions (e.g. CC or CXC), indicating that the method effectively recovers well-characterized chemokines. Conversely, when analysing non-canonical chemokine ligands, none of them were classified within the canonical chemokine clusters. This observation strongly suggests that canonical and non-canonical ligands do not share the same protein fold. Additionally, we identified intriguing correlations between these structure-based clusters and the results from our phylogenetic analyses. For instance, CXCL14 was clustered within a CC-type group, consistent with our reconciled tree positioning it within the broader CC-type clade (as shown in Figure 2A). Similarly, CXCL16 formed its own unique cluster, which aligns with our CLANS analysis, where it is the last group to connect with canonical chemokines (illustrated in Figure 1A and Figure S1). Furthermore, TAFA5 was found in a distinct cluster, mirroring our phylogenetic analyses that place it as the most basal TAFA clade (as depicted in Figure 2A and Figure S19). While these findings are intriguing, we acknowledge that additional in-depth analyses, beyond the scope of this paper, will be necessary to confirm these results.

      In response to the reviewer's inquiry regarding how to define a chemokine, it is essential to recognise that many proteins can exhibit similar 3D structures without being considered homologous. A notable example is the opsins, which are present in both bacteria and animals. Despite sharing a common 3D structure that is characterised by seven transmembrane domains (TMDs) and serves similar functions, they are not regarded as homologous, as highlighted in this study (https://doi.org/10.1186/gb-2005-6-3-213). Considering these findings, we propose that, like various other gene families, the primary criterion for assessing protein homology should be rooted in shared evolutionary ancestry and common origin, and this should take precedence over structural similarities.

      Human gene

      Uniprot Accession

      AFDB Cluster

      Accession

      Description

      Canonical CKs

      CXCL14

      O95715

      A0A3Q3M453

      C-C motif chemokine

      CCL24

      O00175

      A0A4X1T574

      C-C motif chemokine

      CX3CL1

      P78423

      A0A7J8CF84

      C-X3-C motif chemokine ligand 1

      CXCL1

      P09341

      A0A1S2ZIJ4

      C-X-C motif chemokine

      CXCL13

      O43927

      A0A1S2ZIJ4

      C-X-C motif chemokine

      CXCL8

      P10145

      A0A1S2ZIJ4

      C-X-C motif chemokine

      CCL20

      P78556

      A0A6P7X7F3

      C-X-C motif chemokine

      XCL1

      P47992

      A0A6P7X7F3

      C-X-C motif chemokine

      CXCL16

      Q9H2A7

      A0A6P8SIS6

      C-X-C motif chemokine 16

      CCL27

      Q9Y4X3

      A0A1L8GBB9

      SCY domain-containing protein

      CCL1

      P22362

      A0A3B4A358

      SCY domain-containing protein

      CCL5

      P13501

      A0A3B4A358

      SCY domain-containing protein

      CCL28

      Q9NRJ3

      A0A3Q0SB19

      SCY domain-containing protein

      CXCL12

      P48061

      A0A401SMI2

      SCY domain-containing protein

      CXCL17

      CXCL17

      Q6UXB2

      No cluster found

      No cluster found

      TAFA

      TAFA1

      Q7Z5A9

      Q96LR4

      Chemokine-like protein TAFA-4

      TAFA2

      Q8N3H0

      Q96LR4

      Chemokine-like protein TAFA-4

      TAFA3

      Q7Z5A8

      Q96LR4

      Chemokine-like protein TAFA-4

      TAFA4

      Q96LR4

      Q96LR4

      Chemokine-like protein TAFA-4

      TAFA5

      Q7Z5A7

      A0A7M4EYY1

      TAFA chemokine like family member 5

      CYTL

      CYTL1

      Q9NRR1

      A0A673GVE4

      Cytokine-like protein 1

      CKLFSF

      CMTM5

      Q96DZ9

      A0A4W2H069

      CKLF like MARVEL transmembrane domain containing 5

      CMTM8

      Q8IZV2

      U3IR50

      CKLF like MARVEL transmembrane domain containing 7

      CMTM7

      Q96FZ5

      A0A6G1PQK5

      CKLF-like MARVEL transmembrane domain-containing protein 7

      CMTM6

      Q9NX76

      A0A814ULI9

      Hypothetical protein

      CKLF

      Q9UBR5

      A0A3M0K8M7

      MARVEL domain-containing protein

      CMTM1

      Q8IZ96

      A0A3M0K8M7

      MARVEL domain-containing protein

      MAL

      P21145

      A0A402F5Z5

      MARVEL domain-containing protein

      CMTM2

      Q8TAZ6

      A0A6G1S7Y0

      MARVEL domain-containing protein

      PLP2

      Q04941

      A0A667IJ27

      Proteolipid protein 2

      CMTM3

      Q96MX0

      A0A3B1ILJ1

      Zgc:136605

      CMTM4

      Q8IZR5

      A0A3B1ILJ1

      Zgc:136605

      PLLP

      Q9Y342

      A0A3B1ILJ1

      Zgc:136605

      Chemokine genes are found on many human chromosomes with large clusters on chromosome 2 and 17. Can the authors address the syntenic relationships phylogenetically?

      There are cases where synteny data have been used to infer the relationship between species (e.g. https://doi.org/10.1038/s41586-023-05936-6); however, to our knowledge, they cannot be used to infer the pattern of gene duplications and losses, as we have done here with gene tree to species tree reconciliations. However, the two approaches are extremely powerful combined and compared as they provide independent evidence. For example, with our phylogenetic analysis of chemokine ligands, we found that CXCL1-10 plus CXCL13 form a monophyletic clade (Figure 2A); this is consistent with their location on the human chromosome 4 (Zlotnik and Yoshie 2012). Similarly, most of the CC-type chemokines, that we find monophyletic in our trees, are located in a locus in human chromosome 17. Likewise, chemokine receptor phylogenetic relationships are largely consistent with macro and micro syntenic patterns. Most of the chemokine receptors are on human chromosome 3 (Zlotnik and Yoshie 2012) and they all belong to a large monophyletic clade in our tree (Figure 4A). Smaller clusters also maintain correspondence, such as the mini cluster of CXCR1 and CXCR2 on human chromosome 2 corresponding to a monophyletic clade in our phylogenetic analysis (Figure 4A).

      We have incorporated the above considerations in our manuscript at the lines:

      • Lines 140-148 (ligands)

      • Lines 256-272 (receptors)

      • Lines 375 – 483 (discussion)

      The authors indicate that 'CXCL8 is present in all jawed vertebrates except in the cartilaginous fishes lineage'. However, they should point out that CXCL8 is not represented in mice. The notion that the repertoire of chemokine and chemokine receptor genes can be different in even closely related species as well as in individuals of the same species is well-documented but not mentioned here.


      We thank the reviewer for these suggestions, and we have modified the text in lines 137-138.

      The analysis suggests that chemokine gene repertoires start small and grow non-linearly to 45 in mammals. However DeVries et al (JI 2005) published that zebrafish have the most chemokines, 63, and chemokine receptors, 24. Do the authors disagree? This should be addressed.

      The significant increase in the number of ligands and receptors in zebrafish, compared to their last common mammalian ancestor, can be attributed to an additional round of whole-genome duplication (WGD) (https://doi.org/10.1016/S0955-0674(99)00039-3).

      Concerning ligands, the count in zebrafish varies from 63 in DeVries et al. 2005 to 111 in Nomiyama et al. 2008, and to 35 in our study. This variation can be attributed to several factors:

      1. Genome Versions: The disparities may arise from the use of different versions of the zebrafish genome. We utilised an improved version known for its higher contiguity and reduced fragmentation (https://www.nature.com/articles/nature12111). It is possible that the additional ligands identified by DeVries, Nomiyama, and others were partial sequences.
      2. Methodology: Methodological differences are at play. DeVries et al. employed tblastN, while we opted for BLASTP. Nomiyama et al. do not specify the type of BLAST performed.
      3. Stringency: We collected our sequences based on a BLASTP search using as query sequences only manually curated sequences from UniProt. This additional precaution allowed us to identify sequences with high-confidence chemokine ligand characteristics.
      4. Sequence Characteristics: Ligands typically have shorter sequences and exhibit less sequence conservation compared to receptors. Zebrafish represents a case in which working with short sequences may lead to missed homologs.
      5. Species-Specific Nature: Our approach successfully recovered the complete set of ligands in other species, such as humans and mice. Zebrafish appears to be an exception rather than the norm. When it comes to receptors, which typically have longer sequences, making it easy to identify distant homologs, our results closely mirror those of DeVries in 2005. In our study, we identified 28 canonical receptors, compared to their count of 24. However, it is worth highlighting that within our dataset, four of these receptors appear as species-specific duplications, potentially indicating that they are actually isoforms or related variants.

      Nonetheless, it is essential to emphasise that our work does not aim to precisely reconstruct the entire complement of ligands and receptors in zebrafish or other species. Achieving this would require further validation, including the expression analysis of potential transcripts.

      Did the authors find any species in which a chemokine/chemokine receptor pair are not found together? That is, if the system is irreducibly complex, requiring both a ligand and receptor, the probability of both genes arising simultaneously is essentially zero. So how do the authors theorize that such a system actually arose, and is there any evidence in their data set for convergence of separately evolved ligand and receptor?

      Our data strongly support the hypothesis that the canonical chemokine system originated within the stem group of vertebrates, likely as a consequence of two rounds of genome duplication. This likely accounts for the simultaneous emergence of both ligands and receptors. While the receptors (both canonical and non) can be traced back to a single-gene duplication event (with the exception of ACKR1), the evolution of ligand families capable of interacting with chemokine receptors occurred independently, although further experiments are required to validate this in vivo in a broader set of organisms. In our study, we successfully identified the complete set of receptors and ligands in well-established model systems like humans and mice. However, when it comes to interactions between ligands and receptors outside these model organisms, the picture becomes less clear. Similarly, the exact pairings of non-canonical components are also not fully clarified (see lines 404-406). As a result, speculating about evolutionary conservation in these contexts requires caution and further investigation. It's worth noting that chemokines and their corresponding chemokine receptors do not necessarily evolve in tandem. Since they are encoded by different genes, they evolved from separate duplication events occurring at different points in evolutionary history. In certain instances, due to the system's flexibility, chemokines binding orthologous receptors may not be orthologous themselves but may have independently acquired the ability to activate the same receptor in various species.

      Line 180, 181 and elsewhere: GPCR1 and GPCR33 should be GPR1 and GPR33

      We have corrected this throughout the manuscript.

      Line 185: ACKR1 exceptionalism is noted, but there is no discussion of the remarkable structure-function paradox that the most distantly related chemokine receptor is also the most highly promiscuous receptor, binding many but not all CC and CXC chemokines with high affinity.

      We added in the discussion section this consideration regarding the wide binding of ACKR1 (Lines 341-343) and its ability to bind both CC and CXC chemokines (DOI: 10.1126/science.7689250 and 10.3389/fimmu.2015.00279), highlighting the intriguing contrast with the fact that it is the most distantly related receptor.

      Line 196: the viral receptors cluster with the vertebrate receptors, suggesting that the viruses captured the receptor gene from the host. Authors might mention this obvious point regarding origins, and discuss how it relates to the monophyly and paraphyly that emerges from the phylogenetic analysis.

      We added a comment to the discussion section (Lines 348-352) regarding the potential origins of the viral chemokine receptors.

      Any discussion of chemokine-like convergent evolution presupposes that the activity is real and actually occurs in vivo. The authors should make clear to what extent the existing literature supports this. As mentioned above, CXCL17 interaction with GPR35 has been challenged in vitro and has never been demonstrated to occur in vivo. To what extent is the same limitation a problem in considering co-evolution of the other non-canonical chemokines? I agree that classification based solely on function is inappropriate, but so is phylogenetic analysis without direct knowledge of in vivo function. It is no feasible to address this in a phylogenetic analysis, but there ought to be at least one species in which the non-canonicals have been rigorously shown to act at specific receptors in vivo before grouping them with the canonicals in a co-evolutionary sense.


      We agree with the referee that evidence of real chemokine-like activity is important to consider the activity in vivo.

      In our work, the molecules examined were chosen based on previous evidence of chemokine-like sequence similarity, ability to bind canonical components and/or chemokine-like function. For example, CKLF (also called CKLF1) has been shown, through calcium mobilisation and chemotaxis assays using the human cell line HEK293, to bind CCR4 and to induce cell migration via CCR4 respectively (https://doi.org/10.1016/j.lfs.2005.05.070). Numerous papers are studying the in vitro and in vivo effects of CKLF in murein and human models (https://doi.org/10.1016/j.cyto.2017.12.002), therefore, we found it compelling to investigate its evolutionary relationship with canonical chemokines. Similarly, CYTL1, that had been predicted to possess an IL8-like fold (https://doi.org/10.1002/prot.22963), has been found to bind CCR2 (https://doi.org/10.4049/jimmunol.1501908) and in vitro and in vivo studies showed chemotactic activity for neutrophils (https://doi.org/10.1007/s10753-019-01116-9). Ongoing research into this molecule are focusing on a wide array of immune functions (https://doi.org/10.1007/s00018-019-03137-x).

      We mentioned these considerations in our introduction to explain why we were interested in investigating these molecules (lines 50-57). We have also added a line in the Discussion (lines 323-324) where we reinforce the idea that in vitro and in vivo experiments for all chemokine-like molecules are required to validate computation predictions.

      The discussion of homeostatic vs inflammatory chemokine/receptors in the last section of the Discussion would be enhanced by pointing out that the chemokine specificities are numerically totally different for these two groupings, homeostatics tending to have monogamous ligand-receptor relationships and inflammatories being highly promiscuous.

      To account for the reviewer’s comment, we have added this consideration in a paragraph of the discussion (see Line 389-394).

      Reviewer #2 (Significance (Required)):



      Much of the paper's results are confirmatory of previous work based on less extensive sequence analysis. One could say more generally that unrelated chemical forms, not just unrelated proteins, have chemokine-like ligand function. For example leukotriene B4 is a powerful leukocyte chemoattractant for neutrophils working through a GPCR. That proteins might also independently evolve common functions does not add insight beyond what is already appreciated. The notion that chemokine receptors have a common ancestor is also generally accepted and that ACKR1 is an outlier is already appreciated. The present work adds phylogenetic and statistical precision to these points.

      Our discoveries clarify various aspects of the chemokine system's evolution, and we are confident that the "phylogenetic and statistical precision" of our findings will provide a solid cornerstone for future research aimed at unravelling the function and evolution of the system. Specifically, our work clarified:

      1. The presence only in Vertebrates: We have confirmed, through a comprehensive taxonomic sampling (we use many more species than previous works), that the chemokine system is exclusive to vertebrates. However, intriguingly, we identified a TAFA chemokine-like family in urochordates.
      2. Relationships between Ligands: We conducted a thorough examination of the relationships between canonical and non-canonical ligands and suggested that several unrelated molecules might have evolved independently their ability to interact with the chemokine receptors. We appreciate the comment of the reviewer regarding the fact that unrelated chemical forms such as leukotriene B4 may have chemokine-like functions. However, in our work all the non-canonical components examined are proteins and as such could have an evolutionary relationship with chemokines. Furthermore, we chose to consider only proteins that showed multiple lines of evidence implicating them in the chemokine system and that are currently the topic of interest in the field (see replies to reviewer 1’s comment #5 and to reviewer 2’s comment #12). Seeing the general interest in the topic, and especially seeing as this had never been clarified before, in this work, we set ourselves the goal to investigate the evolutionary relationship amongst these non-canonical ligands and canonical chemokines.
      3. Duplication Events: We pinpoint the specific gene duplication events responsible for the emergence of chemokine receptors.
      4. Atypical Receptor Paraphyly: Our work highlights the paraphyletic nature of atypical receptors, in contrast to previous research (see https://doi.org/10.1155/2018/9065181).
      5. Viral Receptor Phylogenetics: To our knowledge, this is the first work to investigate the phylogenetic affinities of viral receptors.
      6. GPCR182 and Atypical Receptor Affinities: We clarify the affinity of GPCR182 with atypical receptor 3, offering different insights compared to prior studies (see figure S3C in https://doi.org/10.1038/s41467-020-16664-0).
      7. Additionally, our study represents the first analysis of the chemokine system in the basal vertebrate hagfish and provides insights into the ancestral form of the chemokine system.
      8. Ultimately, our research identifies numerous molecules and receptors with potential chemokine functions. In conclusion, we contribute to resolving uncertainties surrounding the system's origin, including the complex duplication events that have shaped receptor evolution. As evident from the extensive comments provided by the reviewer, our work addresses various controversies in the field (e.g. the inclusion of CXCL17 as a chemokine). Nonetheless, like any new set of findings, our work amalgamates confirmatory results (as highlighted in point 1) with innovative discoveries (as outlined in points 2-8). However, the latter category significantly outweighs the former, underscoring the richness of novel insights.

      Finally, we would like to thank reviewer 2 for their comments, as these have contributed to greatly improve our manuscript.

    1. Joint Public Review:

      This work by Liu CSC et al. is an extension of the author's previous work on the role of Piezo1 mechano-sensor in human T cell activation. In this study, the authors address whether Piezo1 plays a role in T-cell chemotactic migration.

      The authors used CD4+ T cells or Jurkat T cells to test the effects of siRNA-mediated depletion of Piezo1 on chemotactic migration. They establish that Piezo1 is implicated in chemotactic migration, although the effects of depletion are relatively moderate.

      They show that Piezo1 is redistributed to the leading edge of T-cells.

      They identify that relocation of Piezo1 to the leading edge follows an increase in membrane tension.

      In Piezo-1 depleted cells, they observe a moderate reduction of LFA-1 polarity. With the use of specific inhibitors, they propose Piezo1 activation to be downstream of focal adhesion formation and upstream of calpain-mediated LFA-1, integrin alpha L beta 2, or CD11a/CD18 recruitment at the leading edge.

      Strengths:<br /> Together with their 2018 paper, this study presents Pieszo1 as a regulator of T-cell activation, implicating it as a player in the coordination of the chemotactic immune response.

      Weaknesses:<br /> Most of the effects observed are relatively modest. The authors did not challenge the cells with various physico-mechanical conditions to see when Piezo-1 might become really important. For instance, there are no experiments that expose T cells to varying counter-acting forces to see how piezo1 might affect migration.

      Technical weaknesses:<br /> The authors state that "these high tension edges are usually further emphasized at later time points", but after ten minutes the median tension and tension (Figure 2C and Supplementary Figure 2C respectively) reduce down to the pretreatment time point. It would be clearer if the author stated within which timeframe the tension edges are "further emphasised".

      Figures 3 and 4 - The author states the number of cells quantified from the images, but it is not clear whether the data is actually from 3 biological replicates.

      Some of the data has no representative images or videos included. there is no video in the supplementary for Figures 1 A and B. There are no representative images of transwell migration assay in Figures 1 D and E.

    1. The young poplars fringing the Loire are also tapered trees, like the candles and the coffin. The Loire here is a river which has seen many bodies being used as fish-bait, which gives "the breath of baited bodies" an added layer of meaning.

      [The repeated "l" and "b" sounds in this stanza could also be a reference to the length x breadth required to find area of a shape- probably slim and far-fetched]

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The study largely focuses on the use of a 293 cell line that lacks a functional Dicer gene originally identified by the Cullen group. Baldaccini use this cell line, referred to as NoDice cells, to reconstitute various Dicer isoforms that have thus far been described in a variety of settings (e.g., stem cells and oocytes). Collectively, these data demonstrate the capacity of certain N-terminal truncations of Dicer to inhibit Sindbis virus and reduce the presence of viral dsRNA, supporting some of the observations made thus far concerning an antiviral role for mammalian Dicer. For other viruses, this impact was significantly more modest (SFV reduction is less than a log) or was not observed at all (VSV and SARS-CoV-2). The authors then go on to characterize the nature of the observed antiviral activity and ultimately implicate PKR and the induction of NF-kB in priming the cell's antiviral defenses. Importantly, the group also found that this antiviral activity neither required the nuclease activity of Dicer nor the kinase activity of PKR - providing evidence against antiviral RNAi in mammals. In all, the data would seem to suggest that Dicer can act as a dsRNA sensor and can mediate the activation of an NF-kB response - akin to what is observed in response to NOD or some TLR engagement. In all, it is the opinion of this reviewer that this work brings additional clarity to a concept that remains controversial in the field and therefore embodies something meaningful for the community.

      With that said, there are a few issues that require additional attention. The first of these is textual. The introduction of the paper accurately describes the evidence in support of mammalian RNAi but does not invest the same time in discussing the data to the contrary. For example, Seo et al demonstrated that virus infection results in poly-adp-ribosylation of RISC preventing RNAi activity (PMID: 24075860), Uhl et al showed that IFN-induced ADAR1 resolves dsRNA in the cell and prevents RNAi (PMID: 37017521), and Tsai et al showed that virus-derived small RNAs are not loaded into the RISC in a manner that would enable antiviral activity (PMID 29903832). None of this work is referenced in this manuscript and it generates an unbalanced introduction as it relates to the controversy surrounding the idea of RNAi in mammals.

      Reply: We thank the reviewer for their positive comments and suggestions. In the revised version of this manuscript, we will rewrite the introduction to take into account the published data that are not in favor of an antiviral role of RNAi in mammals and we will add the suggested references

      The second issue that would further strengthen this paper relates to the fact that the authors spend a considerable amount of time discussing the data of Figure 6 and 7 as conditions that are defined by a Dicer that can not be processive in its nuclease activity (WT) vs. one that can (N1). However, there seems to be little consideration about the fact that the introduction of WT Dicer into these cells also restores miRNA biology whereas N1 appears to remain only partially functional (based on the data of Fig 3E). Given this, it seems the authors should verify that the high baseline of NFkB signaling that is being observed when comparing WT to N1 is not a product of restored miRNA function in WT cells, in contrast to the hypotheses outlined in the manuscript. This could be addressed by silencing Drosha or DGCR8 in the Dicer knockout cells prior to their reconstitution of Dicer. In the opinion of this reviewer, this experimental control would significantly strengthen the conclusions the authors are making here.

      Reply: This would indeed be an ideal experiment to rule out the contribution of miRNAs in the observed phenotype. We believe however that this particular experiment would prove difficult to realize given that we reconstitute Dicer expression by lentiviral transduction and keep the cells under selection for a couple of weeks before using them for further experiments. This time frame is therefore not compatible with the use of siRNA to knock-down Drosha or DGCR8. Alternatively, we could knock them out by CRISPR-Cas9, but this would take too long and is not feasible in the frame of this work.

      We can however address the concern regarding the role played by miRNAs in the observed phenotype of the Dicer N1 cells. Indeed, we can determine the miRNA profile from our small RNA sequencing data and compare them between the Dicer WT and Dicer N1 cells. We have done this comparison and could not find striking differences in miRNA expression between the two cell lines. We will add this additional piece of evidence in our revised manuscript.

      Reviewer #1 (Significance (Required)):

      In the manuscript entitled, "Canonical and non-canonical contributions of human Dicer helicase domain in antiviral defense" Baldaccini et al. describe their findings concerning the ability of certain N-terminal deletion variants of Dicer in contributing to mammalian antiviral activity. The concept of a functional antiviral RNAi system in mammals is a contentious one with many publications including data both in support of its existence and opposing this idea. In this manuscript, Baldaccini et al. perform a wide range of well-controlled experiments to specifically address aspects of those reports to both provide clarity in what has been documented thus far and to expand on those concepts further.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary

      Whether RNAi is used as an antiviral mechanism in mammals has been a hotly debated issue. The research team previously published several papers on the roles of Dicer in siRNA/miRNA biogenesis and in antiviral responses. They have recently reported that the helicase domain of human Dicer specifically interacts with several proteins that are involved in the IFN response, including PKR. In this study, Baldaccini et al. investigated the involvement of Dicer in antiviral response using various mutants of human Dicer. They showed that deletion mutants of helicase domain exhibit antiviral activity that requires the presence of PKR. They further demonstrated that one of the mutants, N1-Dicer showed antiviral activity in an RNAi-independent manner but depending on the presence of either native PKR or kinase deficient mutants. Transcriptomic analysis revealed that numerous genes involved the IFN and inflammatory response were upregulated in the cells that express N1-Dicer, which is likely due to an increased activation of the NFκB pathway. Based on these findings, the authors propose that Dicer may act as antiviral molecule using its helicase domain, which representing a novel non-canonical function of Dicer.

      Major comments:

      1.The results from experiments with SARS-CoV2 are intriguing (Fig.2). The authors speculated that NFkb activation is in favor of the replication of this virus. It would be interesting to see the infection and replication of SARS-CoV2 in PKR deficient cells and cells expressing PKR mutants (as described in Fig.5). The results may prove/disapprove the authors' speculation and yield additional findings.

      Reply: We thank the reviewer for this suggestion. We have cells that are double knock-out for Dicer and PKR (NoDice/∆PKR) that were transduced to stably express Dicer WT or Dicer N1 and further transduced to express ACE2. We will infect those cell lines with SARS-CoV-2, which will allow us to see whether the difference in viral accumulation can still be observed in the absence of PKR. However, it might prove more difficult to reconstitute PKR expression (WT or mutants) in these cells since they are already transduced twice with two different constructs (Dicer and ACE2).

      Western blot analysis. In the method section, it is stated that proteins were quantified with Bradford method and equal loading was verified by Ponceau S staining. The members of also probed with gamma-tubulin (It was stated that antibodies against alpha-tubulin was used in the method section) as a loading control, however, the bend intensity of tubulin shows great variations among different lanes in several figures while Ponceau S staining is similar (Fig.s, 4, 5, and 8). The differences compromise the accuracy of the results.

      Reply: We apologize for the difference in Tubulin signal in some of our western blots. There are several possibilities to explain those inconsistencies between Ponceau staining and Tubulin blotting, including an effect of viral infection on Tubulin expression. To remove ambiguities around this issue, we could quantify the signal across several blot replicates and provide the quantification after normalization. In addition, we would like to stress that regarding quantification of the infection, we think that the plaque assay experiments are more reliable than quantification of western blot signals.

      3.RNA-seq analysis revealed that Dicer N1 cells have significantly increased expression levels of signaling molecules in type I IFN response even in uninfected cells. While this provides a potential explanation for the antiviral phenotype of N1-Dicer cells. I wonder why the expression levels of type I IFNs (probably the most potent antiviral molecules) were not analyzed in WT and Dicer N1 cells. Measurement of the levels of IFNα and IFNβ by ELISA in the cells before and after infection could provide the important and direct data to support their conclusion.

      Reply: This an interesting suggestion but unfortunately, we do not believe that it would possible to quantify IFNα and IFNβ by ELISA in the cell line that we used in our experiments. Indeed, the level of expression might just be too low to be able to measure something meaningful. We could measure the induction of IFNβ expression at the mRNA level by RT-qPCR though. However, we do not believe that the observed increased expression of genes that belong to the type I IFN response is solely the effect of an increased production of IFN. These genes are also under the control of other transcription factors, including NF-kB for some of them, and it might prove difficult to make a direct link with IFNα or IFNβ production.

      4.While the data presented in Fig. 5 provides convincing evidences that the antiviral activity of mediated by PKR against SINV is independent of its kinase activity in N1-Dicer cells. An interesting question is that whether antiviral activity associated with PKR is N1-Dicer dependent, which could be addressed by comparing the viral infection of NoDice∆PKR and NoDicer expressing PKR mutants.

      Reply: Yes indeed, we have generated NoDice/∆PKR cells expressing PKR WT or mutant and we will infect them with SINV to confirm whether the presence of Dicer N1 is needed for the observed phenotype.

      5.In the concluding paragraph of the discussion, the authors presented an oversimplified discerption of a complex model that involves a crosstalk between IFN-I and RNAi and Dicer-PKR interaction, which is difficult for the reader to compose a clear picture of mechanisms involved. It could be helpful to use a schematic illustration to summarize the action model of PKR incorporated with the canonical and non-canonical Dicer functions.

      Reply: We will add a schematic model in the revised version of our manuscript to summarize our main findings.

      Minor comments:

      1.It stated that NoDice FHA-Dicer WT #4 and NoDice FHA:Dicer N1 110 #6 are referred to as Dicer WT and Dicer N1 cells (p.6). For simplicity, Dicer WT and Dicer N1 cells should be used throughout manuscript, including in all figures. The labels in the figures are difficult to read and are confusing in some cases.

      Reply: This will be changed in the revised version to increase the clarity of the figures.

      2.It is to note that p-PKR was only detected at in N1-Dicer cells at 24 hpi (Fig.8A). This is an interesting observation that was not discussed. It appears that this could be due to a delayed viral replication since these cells are already in an elevated antiviral state. This possibility could be tested by examining viral replication and dsRNA accumulation at more time points in the experiments described in Fig.1.

      Reply: We have performed a kinetic of infection at more time points and we will incorporate these experiments in the revision.

      3.The authors may point out the limitations of the studies. For examples, all cells used in the study are engineered HEK cell lines and were tested with limited number of viruses. As such, the observations may reflect Dicer-PKR interaction under artificially overexpressed conditions, but how the model established from the current study applies to primary cells require further investigation.

      Reply: This is indeed important, we will add a sentence about this in the discussion.

      Reviewer #2 (Significance (Required)):

      The findings reported in this study shed some new light on a long-debated issue regarding the potential roles of RNAi as physiologically relevant antiviral mechanism in mammals. Identification of a new antiviral function of Dicer helicase domain via interaction with PKR is a new advancement of the field, and it also adds a new dimension to a complex subject that overlaps of innate immunity , RNA biology, and developmental biology associated with Dicer.

      Field of expertise: Innate immunity, cell signaling, cytokine biology

      Areas that that I do not have sufficient expertise to evaluate: Small RNA cloning, sequencing and, analysis.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This work by Baldaccini et al. explores the interplay between Dicer and the antiviral protein PKR in the context of viral infection. It builds on a previous publication of the team which demonstrates that the Dicer helicase interacts with multiple RNA binding proteins, including PKR (see Montavon et al.). In this work from 2021, they demonstrate that an artificially-truncated form of Dicer (Dicer-N1) lacking part of the helicase is antiviral against RNA viruses in a PKR-dependent fashion. This was an interesting finding because the field largely assumed that Dicer-N1 performs its antiviral function via canonical dicing of dsRNA, as part of an antiviral RNAi pathway. The present manuscript follows up on this initial discovery and deciphers the specifics of Dicer-N1 antiviral phenotype, as well as delineates the interplay between Dicer's helicase and PKR. The authors main claims are as follow:

      1. i) Dicer-N1 antiviral effect does not require its catalytic activity, therefore is completely RNAi-independent.
      2. ii) Neither does it require canonical PKR activation, but relies instead on NF-kB-driven inflammation. The origin of this inflammation is not studied.
      3. ii) Truncated Dicers other than Dicer-N1 are antiviral through RNAi, but are also PKR-dependent. The authors claims are mostly supported by the data, although I suggest below some improvements regarding experimental approaches and data presentation. This work details in an interesting manner the interplay between the machinery of RNAi and the classical pathway of innate immunity (PKR). As explained by the authors, there is solid data in the literature demonstrating the mutual exclusivity of IFN and antiviral RNAi in differentiated cells. This mostly goes through the receptors LGP2, which inhibits dsRNA dicing by Dicer. The authors data suggest that, conversely, Dicer may play a role in preventing the unwanting activation of PKR (a non-canonical activation leading to inflammation). Given that PKR activation does not depend on virus, the authors discuss potential mechanisms of PKR triggering. This is an interesting topic that deserves further investigation (not necessarily within the frame of this work - it can be a follow-up). Another interesting piece of information is that different truncated Dicers behave differently with respect to implementing antiviral RNAi. Whilst Dicer-N1 isn't proficient in doing so, the other forms are. It shows that lab-generated truncations do not fully recapitulate what is observed with existing truncated Dicers (DicerO and aviD).

      Experimental design and data interpretation

      1. The authors should compare infection between different cell lines across a range of time points (ie, a virus growth curve). In Fig 4E for example, I worry that cells expressing or not PKR will reach the plateau of viral particle accumulation at different time points. One could imagine that cells lacking PKR do not show any differences in particle production at 24h, but do at earlier time points.

      Reply: This is an interesting suggestion, we can perform a kinetic experiment by looking at more time points to address this point. This will allow us to determine the time needed for every cell line to reach the plateau of infection.

      Western blots should be accompanied with proper quantifications plotted as bar graph with biological replicates (p-PKR, p-eIF2a and capsid).

      Reply: We have biological replicates for our western blot experiments, and we will quantify those to better determine the observed changes. However, in the case of p-eIF2a, we do not think it is pertinent to measure it since there are other kinases than PKR that are known to induce eIF2a phosphorylation upon SINV infection. It might therefore not prove very informative to precisely quantify this particular signal.

      Microscopy images should be properly quantified across biological replicates (Fig 1&2 for the J2 staining, for example).

      Reply: We could do a proper quantification of the J2 signal across replicates, but we do not think it would bring much to our message. Here, we mostly used J2 staining as a qualitative indication that the infection was impacted or not. We have a proper quantification of the effect with our plaque assay experiments, which are way more robust to determine the levels of infection between conditions.

      Confounding factors hinder the interpretation of siRNA accumulation (Suppl Fig 2): i) the efficiency of dsRNA dicing from different Dicers will generate different amounts of siRNAs from a given amount of dsRNA and ii) the higher antiviral response translates into decreased infection, so decreased dsRNA substrate. I suggest that the authors normalise the amount of viral siRNAs over the total amount of viral genomes. This should allow to assess if Dicer-N1 is better at dicing dsRNA than WT in these conditions.

      Reply: This is a valid concern and we agree that it is important to be able to normalize small RNA reads between conditions before reaching a conclusion. The problem is that there is no easy way to do this since we do not get a direct measurement of viral genomes accumulation from our small RNA sequencing data. To better compare the two conditions, we could normalize the individual viral siRNA to the total number of viral reads. Another problem that we face is that we are looking here at the AGO-loaded small RNAs, which makes it more difficult to assess dicing efficiency since not every generated siRNA might be loaded into an Argonaute protein. In fact, this has been proposed by the Cullen laboratory in a paper published in 2018 (Tsai et al. doi: 10.1261/rna.066332.118). They showed that although viral siRNAs were generated during IAV infection, those were inefficiently loaded and thus did not significantly impacted the infection.

      In Fig 8, the authors should verify that phospho-p65 increase depends on PKR by repeating the experiment in PKR KO cells.

      Reply: Yes, good point. We will check what happens to phosphorylation of p65 in PKR KO cells. In addition, we can also measure the effect on a known NF-kB target by RT-qPCR (e.g. PTGS2).

      Data representation

      1. Levels of phospho-PKR and eIF2a need to be normalised on the total amount of PKR and eIF2a, respectively. The authors should quantify the blots and present bar graphs with biological replicates and statistics.

      Reply: As mentioned above in our reply to point 2, we can add the quantification for phospho PKR, but we do not think it is pertinent to do it for eIF2a.

      Could the authors add the names of representative genes on the volcano plots of Fig 7?

      Reply: Yes, this will be done.

      Points of discussion

      1. In Fig 4C, catalytically-dead mutants of truncated Dicers (other than N1) do not display an antiviral effect. Presumably, such proteins implement canonical antiviral RNAi. Is there a reason why the authors interpret this data as Dicers being "partially" antiviral through RNAi (l. 92). This data instead suggest that is it totally dependent on RNAi.

      Reply: Indeed, and we do not say the contrary. It seems that some of this helicase-truncated Dicer proteins can act through RNAi. However, they also depend on PKR, so in the end it might be a combination of the two that allows their antiviral effect.

      Gurung et al. demonstrate that PKR is activated in Dicer KO mouse ES cells, which results in phosphorylation of eIF2a at steady-state. This is different from the authors' data, in which PKR activation does not affect eiF2a phosphorylation. Could the authors discuss this discrepancy?

      Reply: The problem that we face here is that SINV is known to also activate GCN2 and therefore eIF2a phosphorylation does not strictly rely on PKR in our experimental conditions. In addition, we did not check eIF2a phosphorylation in Dicer KO cells, but we always compare Dicer WT and Dicer N1 expressing cells.

      Do the authors expect that truncated Dicers other than N1 trigger an inflammatory response such as the one described for N1? Would it be possible to have this antiviral inflammatory response in conjunction with antiviral RNAi?

      Reply: This goes back to Point 1 mentioned previously. We think indeed that there might be a dual action of Dicer and that it will be important to check whether in other cellular systems or animal model such a phenomenon can be observed as well. This is a point that we did address in the discussion of our manuscript (line 522-525).

      Reviewer #3 (Significance (Required)):

      This is a study that conceptually advances the field of antiviral RNAi in mammals, including its interplay with the machinery of innate immunity. It is of interest for virologists and immunologists. My expertise is centered on the mechanisms of innate immunity in mammalian cells, including antiviral RNAi.

    1. n groups B and C, the Numeric Rating Scale (NRS)-pruritus temporarily significantly increased after interval prolongation but remained low (median NRS-pruritus≤4). Median dupilumab levels remained stable in group A (standard dosage), but significantly decreased in groups B and C (24.1 mg/L (IQR = 17.1–45.6); 12.5 mg/L (IQR = 1.7–22.3)) compared with the levels during the standard dosage (88.2 mg/L [IQR = 67.1–123.0, p < .001]).

      testing

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers to appreciating the depth and quality of the results presented in our manuscript. We are happy that he/she has appreciated our contemporary approach to addressing an important problem but also a thorny and debated topic in the field that has lasted for over 30 years since it was first proposed by Mike Berridge. Our work not only addresses mechanisms in cultured human cells but also for the very first time addresses the key problem of Li action in the human brain using human iPSC derived forebrain cortical neurons.

      We are delighted with the reviewer’s comment that “This effort highlighted the author's commendable goal to develop a thoughtful story and not just another publication.” We thank the reviewer for appreciating our detailed multi-disciplinary approach to addressing a long standing (> 4 decades) problem of key significance in human psychiatry and neuroscience. We thank him/her for noting that the work deserves to be presented in a journal with a cross-disciplinary focus. A few technical points that have been noted by the reviewer have been addressed below. Most of these can be effectively addressed by modest rewriting of text to make certain points clearer. The one experiment that has been suggested can be done and will be completed within 1 month.

      1. Description of the planned revisions

      Reviewer #1 (Evidence, reproducibility and clarity):

      The authors functionally define in the inositol monophosphate phosphatase IMPA1, as the true target of lithium regulating phosphatidylinositol turnover and calcium signalling. While the observed IMPA1 inhibition by lithium led to the historical 'inositol depletion hypothesis' over the past 30+ years were published evidence both in support and against this concept. These contradictory sets of results have led to decreased interest in phosphoinositides as the signalling pathway affected by the therapeutic action of lithium in bipolar disorder (BD) patients. The remarkable results shown here will revert this trend since the data clearly demonstrate a key role of IMPA1 in setting the rate of phosphatidylinositol turnover, and consequentially the extent of calcium signalling. While the data are consistent with the 'inositol depletion hypothesis' the authors do not prove or disprove the validity of this hypothesis since the actual levels of inositol were not measured in their experiments. However, this is not a criticism, since quantifying cellular inositol is complex, it is just a suggestion for future work. After clarifying the points listed below this work will be suitable for publication.

      • The experiments using inositol rich DMEM (reported on page 17 and in Fig 2I,J) require a better explanation and an adequate material and method section. It is not clear if the 'normal/control' condition uses inositol-free DMEM. The standard concentration of inositol in DMEM is 40uM. Thus, are the ~155uM (28 mg/litre) added by the authors at the high end of the 40uM? Given that FBS contains inositol have the authors used dialyzed serum? While adding 155uM of inositol on either inositol-free medium or to medium containing 40uM inositol does not alter the author's message, this technical information are important for the reproducibility of the data presented and to understand how HEK293T manages inositol homeostasis.

      The standard concentration of inositol in DMEM high Glucose media (Dulbecco’s Modified Eagle Medium; Life Technologies) is 40 µM (7 mg/ litre) and our HEK293T cells were maintained under standard conditions at (37oC, with 5% CO2) in this media, supplemented with 10% Foetal bovine serum (FBS). This FBS was not dialyzed, so it might contain trace amounts of inositol.

      For our inositol rich DMEM, 117 μl of 100 μM of inositol (18 mg/ml) was added to 100 ml of DMEM high Glucose media, supplemented with 10% FBS- this led to the final effective concentration of inositol in the media ~155 µM (28 mg/ litre). This media was referred to as the inositol rich media and used for the inositol supplementation in Fig. 2I-J.

      Therefore, the inositol supplementation we refer to is effectively raising the extracellular inositol concentration from ca. 40mm to 155mM which is 3X elevation. This information has been added to the results section.

      • It would be helpful to know if store operated calcium entry is altered in impa1-/--M1 cells. This information would nicely complement Fig.3 C-E data.

      We have studied store operated calcium entry in IMPA1-/- cells and it is decreased. The quantification of this reduction can be added to the paper.

      • In the Introduction at the end of page 4, the evidence not supporting the inositol depletion hypothesis is correctly discussed. This section lacks the discussion of another work questioning this theory (PMID: 30171184). The conclusion of this work is also in agreement with the authors finding that lithium affects the rare/turnover (lines 490/506) of PIP2 synthesis.

      Saiardi A, Mudge AW. Lithium and fluoxetine regulate the rate of phosphoinositide synthesis in neurons: a new view of their mechanisms of action in bipolar disorder. Transl Psychiatry. 2018 Aug 31;8(1):175. doi: 10.1038/s41398-018-0235-2. PMID: 30171184.

      This paper suggests that lithium mediated inhibition of IMPase leads to an accumulation of IP1 and this elevated IP1 leads to a competitive inhibition of the rate of synthesis of PI, and hence turnover of PIP2. Combined with lithium’s inhibition of inositol uptake, this inhibition of PI synthesis can lead to the mood stabilizing effect of lithium, rather than the inositol depletion. This point will be added to our manuscript and the reference cited.

      • In the material and methods, Liquid Chromatography Mass spectrometry is abbreviated to LCMS while in the main text (line 493) LC-MS is used. The dashed version should be used throughout the manuscript.

      Liquid Chromatography Mass spectrometry will be abbreviated to LC-MS throughout the text in the revised version.

      • I suggest to define (line 500) phosphatidylinositol 4-phosphate as (PI(4)P simplified as PIP). This will be consistent with the phosphatidylinositol 4,5-bisphosphate abbreviation as (PIP2) as reported in the introduction (line 97)

      PIP refers to all the functional isoforms of Phosphatidylinositol phosphate- PI 3P, PI 4P and PI 5P. By the LC-MS/MS analysis, we had measured the total PIP masses but we cannot distinguish between the individual functional isomers of PIP. However, pre-existing literature suggests that PI 4P is the most abundant isoform of PIP present in cell- its level is approximately 50 folds higher than that of PI 5P (Rameh et al., 1997). Hence we can suggest that the change in the total mass of PIP (as seen by the LC-MS/MS) is mainly reflective of the PI 4P.

      • Line 646: Instead of using [this study] the authors should refer to the Figure panels supporting the discussed argument.

      The identity of the channels mediating Ca2+ transients in this system was shown by us in Sharma et al., 2020. In the revision, we will cite this paper.

      Referees cross-commenting

      Reviewer #2 main message is identical to my. The work is a "contemporary re-evaluation of the inositol depletion hypothesis" but it does settle the debate. Say that reviewer #2 also recognises the importance of the work in defining IMPA1 as the only lithium target affecting the PI cycle removing GSK3 from the picture. Additionally, we agree that the thorough transcriptional analysis of the effect of lithium on human cortical neurons will be very informative for any researcher interested in psychiatric disorders.

      Reviewer #2 requests are rational and not demanding. Most queries require extra information or the reformatting of the data presented.

      Reviewer #1 (Significance):

      The submitted manuscript addresses an important topic. The authors developed HEK293 stable expressing muscarinic receptor to study the effect of lithium (without or after receptor activation) on PI(4,5)P2 turnover using two approaches, by microscopy and biochemically by LC-MS. These analyses were followed by a thorough characterization of the effect of lithium on calcium signalling. The generation of HEK293 impa1-/- line has allowed the authors to demonstrate that the observed effect of lithium on PI(4,5)P2 turnover and calcium signalling were IMPA1 dependent. The authors pushed the work to a higher level by studying the effect of lithium on iPSC-derived human cortical neurons demonstrating that lithium reduces neurons excitability and calcium signalling. Although previously published attempts failed to generate IMPA1 deficient human cortical neurons the authors managed to produce iPSC impa1-/- but, as reported and consistent with previous literature, this cell line failed to differentiate into neurons. This effort highlighted the author's commendable goal to develop a thoughtful story and not just another publication. The work is complemented by a very informative transcriptional analysis characterising the effect of lithium on human cortical neurons. Noteworthy is also the author's efforts to functionally and transcriptionally define the effect of another lithium target, GSK-3. These experiments emphasize that GSK-3 does not phenocopy the effect of lithium. This is another utterly important message of the paper.

      In conclusion, the authors presented an easily readable, comprehensive, and experimentally convincing story. Furthermore, the developed experimental tools (HEK293-m1AchR) and the extensive data set (transcriptomic analysis) will be instrumental to further studies aimed at elucidating mechanistically how phosphoinositide signalling affects BD pathophysiology.

      Reviewer #2 (Evidence, reproducibility and clarity):

      This manuscript seeks to test if inhibition of the phosphoinositide (PI) cycle is the relevant pathway targeted by lithium in bipolar affective disorder (BPAD). Firstly, a cultured model system (HEK293T) is used to test the effects of lithium on the PI cycle. Using PI(4,5)P2 probes along with mass spectrometry, Li is shown to inhibit PI(4,5)P2 re-synthesis after PLC activation, though not to perturb pre-stimulus levels. Release of calcium from intracellular stores along with refilling from extracellular calcium is also inhibited - though there are no effects on stored calcium capacity. Crucially, with the exception of the calcium refilling step, these effects if Li can be abolished by genetic ablation of IMPA1, the proposed molecular target of Li. Having established the affected pathway, the manuscript then studies the effects of Li treatment on iPSC-differentiated cultured cortical neurons. Spontaneous and muscarinic evoked calcium transients are shown to be abolished by Li. None of these effects in HEK293T or neurons can be recapitulated by an inhibitor of GSK3beta, another proposed target for Li. Finally, a transcriptomic analysis of Li treated neurons is presented, showing down regulation of relevant genes, especially genes involved in neuronal calcium signaling and glutamatergic signaling.

      The inositol depletion hypothesis has been debated for nearly four decades. As it stands, this manuscript does not settle this debate once and for all, but it does add some novel and important insights: that 1) IMPA1 is certainly the target of lithium, at least in terms of the PI cycle and 2) Lithium treatment can lead to longer-term transcriptional changes in neuronal calcium and glutamatergic signaling that can dampen excitability. The paper is on the whole clearly written, and the data are easy to follow. That said, there are a number of areas where the manuscript is lacking key details, or where the results do not fully support the conclusions. Specific suggestions for amendment are as follows:

      (1) The PH-PLCdetla1 PH domain has been used to follow PI(4,5)P2 turnover in HEK293T cells. Although long established, the manuscript does not discuss the fact that this domain also binds to IP3, which given high enough concentrations, can compete the PH domain off the membrane. As such, what is being measured is the convolution of PI(4,5)P2 decreases and IP3 increases (see for example doi: 10.1083/jcb.200301070 ). Ideally, a non-IP3 binding probe would have been used, such as the Tubby c-terminal domain (doi: 10.1186/1471-2121-10-67; doi: 10.1113/jphysiol.2008.153791). As it stands, the failure of the PH domain to return to the membrane after Li treatment reported in figures 1G, 3F and 3L could either be due to a failure of PI(4,5)P2 re-synthesis, or a failure to breakdown IP3 - either of which are plausible explanations given inhibition of IMPA1. This concern is somewhat mitigated by the inclusion of mass determinations of the lipids in figure 1H-J, which support the PI(4,5)P2 re-synthesis defect. However, the potential problems with interpretation of the data with the PH domain should be discussed.

      PH-PLCδ-GFP probe is used in the field as a biosensor for PI 4,5-P2 (Chakrabarti et al., 2015; Várnai and Balla, 1998) and has been used to monitor the PIP2 turn-over rate. However, as the reviewer has pointed out, this probe also has an affinity towards IP3 (Xu et al., 2003) and therefore the failure of the probe to return to the plasma membrane could also, in principle, reflect the accumulation of IP3.

      We are well aware of this discussion in the field and to make sure that our measurements using the PH-PLCδ-GFP probe are indeed a true reflection of PIP2 re-synthesis , we have also used a biochemical method to establish the levels of PIP2. Our measurements of the total mass of PIP2 by LC-MS/MS corroborate our findings using the probe, on the delay in PIP2 resynthesis. Nonetheless, we will explicitly mention this point in the discussion.

      Drawback of the Tubby c-terminal domain-

      Most of the biosensors for PI 4,5-P2 have distinctive advantage and disadvantages- PH-PLCδ-GFP probe is a more sensitive reporter but its IP3 binding may compromise its accuracy to measure PI 4,5-P2 changes. However, the Tubby c-terminal domain has exhibited lower sensitivity to report on changes of PI 4,5-P2 during PLC activation, although being more specific in its affinity towards PI 4,5-P2 (Szentpetery et al., 2009). Furthermore, recent studies have revealed that Tubby c-terminal domain can also bind to PI 3,4-P2 as well as PI 3,4,5-P3 (Hammond and Balla, 2015). Lastly a very recent study has noted that in contrast to PH-PLCδ-GFP probe, the tubby domain binds selectively to certain domains of the plasma membrane at membrane contact sites making it not a detector of PIP2 levels across the plasma membrane (Thallmir et,al., 2023, PMCID: PMC10445746 DOI: 10.1242/jcs.260848 ).

      (2) The strongest evidence for the effects of IMPA1 inhibition coming from inositol depletion are given by the experiment reported in figure 2I and J, where inositol supplementation rescues calcium mobilization. This should also be performed for the PIP2 re-synthesis experiments.

      We thank the reviewer for this suggestion.

      We will perform this experiment and add the results to the revised manuscript. Duration estimated for this experiment- 30 days.

      (3) It is implicit in the manuscript that DMEM does not contain inositol. This is not true; Life technologies' formulation for DMEM contains 40 micromol/l myo-inositol, which is sufficient to support activity of both proton/myo-inositol and sodium/myo-inositol symporters (HMIT and SMIT). On the face of it, therefore, inositol depletion seems unlikely. The reviewer wonders what concentration of added inositol mediated the rescue? This key fact is missing from the manuscript. At the very least, the details should be included and the reason for rescue of already inositol replete cells discussed. Ideally, the key experiments would be repeated with inositol-free medium and supplementation.

      Repeated cycles of GPCR linked PLC signalling depend on a stable and on a continuous supply of PIP2 at the cell membrane, which in turn depends on the cytoplasmic pool of inositol in the cell. Inositol pool can be maintained by three avenues- via the recycling of the inositol by the stepwise dephosphorylation of IP3, de novo synthesis of inositol from glucose 6-phosphate and the transport of inositol from the extracellular environment across the plasma membrane. Li inhibits IMPase, an enzyme that dephosphorylates inositol monophosphate to generate free inositol.

      Due to Li’s inhibition of IMPase, the inositol pool cannot be regenerated by the first two avenues since both of them need IMPase. However, restriction of the inositol pool by Li’s inhibition of IMPase can be bypassed by the transport of inositol from the extracellular media via SMIT (Sodium-dependent myo-inositol co-transporter) and/or HMIT (Proton-dependent myo-inositol co-transporter). At steady state, the low amount of inositol in the DMEM media (the inositol concentration in normal DMEM media being approximately 7 mg/litre or 40 µM) might be sufficient for maintaining the inositol pool and thereby PIP2 levels at the steady state. But this amount of inositol in the DMEM media appeared to be limiting to sustain the inositol pool and thereby PIP2 levels under conditions of hyperactivated PIP2 cycle. This is likely the reason why Li mediated inositol depletion (by inhibition of IMPA1) leads to a decreased rate of PIP2 synthesis at the plasma membrane as well as decreased PLC mediated Ca2+ release, in the background of activated PLC.

      However, when the cells were grown in an inositol rich DMEM media (inositol concentration is ca. 28 mg/litre or approximately 155 µM; which is similar to the inositol concentration in the cerebrospinal fluid (Swahn, 1985; Shetty et al., 1996)); transport of extracellular inositol by SMIT/HMIT could sustain a continuous level of PIP2 levels even in a PLC activated background. This explains the rescue of the decreased PLC mediated Ca2+ release phenotype in the control-M1 cells grown in inositol supplemented DMEM despite the Li treatment.

      This section can be added to the Results section (page 17) in the revised version.

      (4) The introduction refers to lithium as a "non-competitive" inhibitor of IMPA1. This is erroneous, as lithium is in fact an uncompetitive inhibitor. This is a key distinction: since the uncompetitive inhibitor blocks the enzyme:substrate complex, it is most effective where substrate accumulates the most - in this context, sites of intense PLC activity. This was central to Berridge's inositol depletion hypothesis. Also, the Allison et al citation is incorrect here. The correct citation is PMID: 2833231.

      Li inhibits IMPA1 in an uncompetitive manner.

      This will be corrected in the revised version of the manuscript, with the appropriate references.

      Ref cited by reviewer- Gee NS, Ragan CI, Watling KJ, Aspley S, Jackson RG, Reid GG, Gani D, Shute JK. The purification and properties of myo-inositol monophosphatase from bovine brain. Biochem J. 1988 Feb 1;249(3):883-9. doi: 10.1042/bj2490883. PMID: 2833231; PMCID: PMC1148789.

      Ref- Hallcher LM, Sherman WR. The effects of lithium ion and other agents on the activity of myo-inositol-1-phosphatase from bovine brain. J Biol Chem. 1980 Nov 25;255(22):10896-901. PMID: 6253491.

      (5) other key experimental details are missing from the figures/figure legends/results and or methods. Namely, what concentration of carbachol was used? What was the optimum concentration of thapsigargin? For figure 2 B-C, was carbachol used to evoke calcium mobilization?

      For the agonist mediated Ca2+ release, 20 µM of carbachol was used. This is mentioned in the Materials and Methods section (line 238); however, we will mention this in the figure legends and results for clarity, in the revised version.

      For the store depletion in the SOCE experiments, 10 µM of thapsigargin was used. This is mentioned in the Materials and Methods section (line 249); however, we will mention this in the figure legends and results for clarity.

      In Fig. 2B, C, Carbachol was used to evoke calcium mobilization in the cells. This is mentioned in the Results section (line 510)- we will mention this in the figure legends for clarity.

      (6) The effects of IMPA1 knockout and rescue in figure 3F are rather unconvincing. All treatment groups' means fall within 1 SD; are the changes statistically significant? Plotting 95% C.I. or standard error may be more informative for these experiments.

      This can be addressed in the revised version.

      Minor comments:

      () There are some inconsistencies in the figure panels. Arrows labelled "CCh" are used to denote CCh addition in e.g. Fig. 3D, whereas simple arrows are used elsewhere e.g. Fig. 3F or no arrow at all e.g. Fig. 3B.

      All the points where CCh has been added to stimulate PLC will be denoted with an arrow labelled Cch for clarity and consistency. This will be addressed in the revised version.

      () Details of what the error denotes is missing in Figs. 2B-C, 3B-C, F - as is N for 3A.

      Whiskers in box plots show the minimum and maximum values with a line at the median. This will be addressed in the revised version.

      () Fig. 2D: It should be made explicit that arrows indicate TTX addition in the figure. More importantly, it should be clarified whether this is also the case for Fig. 5D? Transients do not appear to be depleted by this addition.

      This has been mentioned in the figure legends for the figure (line 1077)- we will address this in the revised version.

      Few of the neuronal transients are not abolished by TTX- this variability can be addressed by other representative traces.

      () In figure 6C, it is stated that "there was no down regulation in transcripts for SCN1A (Nav1.1) or SCN9A (Nav7.1)" but this is not strictly true from the data; the Li-treated cells definitely trend lower. The effect is clearly not statistically significant, so although it is not possible to state that they reduced, this is not the same thing as being able to assert that they are not reduced. Perhaps it could be more helpful to plot the size of the reduction between these transcripts?

      This will be addressed as- “there was no significant downregulation in transcripts for SCN1A (Nav1.1) or SCN9A (Nav7.1)".

      The difference in the transcript level for SCN1A (Nav1.1) or SCN9A (Nav7.1) can be plotted for further clarification.

      Reviewer #2 (Significance):

      The manuscript is ultimately a contemporary re-evaluation of the inositol depletion hypothesis for Li treatment of BPAD, first proposed by Berridge in 1989. The manuscript certainly does not end the debate - other pathways and mechanisms for lithium's actions on a complex human behavioral phenotype will surely persist. However, it does add some important new insights: firstly, that IMPA1 is certainly the target mediating the effects of Li on the PI cycle; and secondly, that long-term, Li may exert effects at the transcriptional level to down regulate calcium and glutamatergic signaling in the brain. However, there is no mechanism presented to link these two findings. The work is therefore of interest to researchers with an interest on studying psychiatric disorders, basic mechanisms of neuronal excitability as well as molecular mechanisms of cell signaling. It therefore deserves to be published in a journal with a cross disciplinary focus.

      2. Description of the revisions that have already been incorporated in the transferred manuscript

      Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. If no revisions have been carried out yet, please leave this section empty.

      3. Description of analyses that authors prefer not to carry out

      Nil

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript seeks to test if inhibition of the phosphoinositide (PI) cycle is the relevant pathway targeted by lithium in bipolar affective disorder (BPAD). Firstly, a cultured model system (HEK293T) is used to test the effects of lithium on the PI cycle. Using PI(4,5)P2 probes along with mass spectrometry, Li is shown to inhibit PI(4,5)P2 re-synthesis after PLC activation, though not to perturb pre-stimulus levels. Release of calcium from intracellular stores along with refilling from extracellular calcium is also inhibited - though there are no effects on stored calcium capacity. Crucially, with the exception of the calcium refilling step, these effects if Li can be abolished by genetic ablation of IMPA1, the proposed molecular target of Li. Having established the affected pathway, the manuscript then studies the effects of Li treatment on iPSC-differentiated cultured cortical neurons. Spontaneous and muscarinic evoked calcium transients are shown to be abolished by Li. None of these effects in HEK293T or neurons can be recapitulated by an inhibitor of GSK3beta, another proposed target for Li. Finally, a transcriptomic analysis of Li treated neurons is presented, showing down regulation of relevant genes, especially genes involved in neuronal calcium signaling and glutamatergic signaling.

      The inositol depletion hypothesis has been debated for nearly four decades. As it stands, this manuscript does not settle this debate once and for all, but it does add some novel and important insights: that 1) IMPA1 is certainly the target of lithium, at least in terms of the PI cycle and 2) Lithium treatment can lead to longer-term transcriptional changes in neuronal calcium and glutamatergic signaling that can dampen excitability. The paper is on the whole clearly written, and the data are easy to follow. That said, there are a number of areas where the manuscript is lacking key details, or where the results do not fully support the conclusions. Specific suggestions for amendment are as follows:

      1. The PH-PLCdetla1 PH domain has been used to follow PI(4,5)P2 turnover in HEK293T cells. Although long established, the manuscript does not discuss the fact that this domain also binds to IP3, which given high enough concentrations, can compete the PH domain off the membrane. As such, what is being measured is the convolution of PI(4,5)P2 decreases and IP3 increases (see for example doi: 10.1083/jcb.200301070 ). Ideally, a non-IP3 binding probe would have been used, such as the Tubby c-terminal domain (doi: 10.1186/1471-2121-10-67; doi: 10.1113/jphysiol.2008.153791). As it stands, the failure of the PH domain to return to the membrane after Li treatment reported in figures 1G, 3F and 3L could either be due to a failure of PI(4,5)P2 re-synthesis, or a failure to breakdown IP3 - either of which are plausible explanations given inhibition of IMPA1. This concern is somewhat mitigated by the inclusion of mass determinations of the lipids in figure 1H-J, which support the PI(4,5)P2 re-synthesis defect. However, the potential problems with interpretation of the data with the PH domain should be discussed.
      2. The strongest evidence for the effects of IMPA1 inhibition coming from inositol depletion are given by the experiment reported in figure 2I and J, where inositol supplementation rescues calcium mobilization. This should also be performed for the PIP2 re-synthesis experiments.
      3. It is implicit in the manuscript that DMEM does not contain inositol. This is not true; Life technologies' formulation for DMEM contains 40 micromol/l myo-inositol, which is sufficient to support activity of both proton/myo-inositol and sodium/myo-inositol symporters (HMIT and SMIT). On the face of it, therefore, inositol depletion seems unlikely. The reviewer wonders what concentration of added inositol mediated the rescue? This key fact is missing from the manuscript. At the very least, the details should be included and the reason for rescue of already inositol replete cells discussed. Ideally, the key experiments would be repeated with inositol-free medium and supplementation.
      4. The introduction refers to lithium as a "non-competitive" inhibitor of IMPA1. This is erroneous, as lithium is in fact an uncompetitive inhibitor. This is a key distinction: since the uncompetitive inhibitor blocks the enzyme:substrate complex, it is most effective where substrate accumulates the most - in this context, sites of intense PLC activity. This was central to Berridge's inositol depletion hypothesis. Also, the Allison et al citation is incorrect here. The correct citation is PMID: 2833231.
      5. other key experimental details are missing from the figures/figure legends/results and or methods. Namely, what concentration of carbachol was used? What was the optimum concentration of thapsigargin? For figure 2 B-C, was carbachol used to evoke calcium mobilization?
      6. The effects of IMPA1 knockout and rescue in figure 3F are rather unconvincing. All treatment groups' means fall within 1 SD; are the changes statistically significant? Plotting 95% C.I. or standard error may be more informative for these experiments.

      Minor comments:

      • There are some inconsistencies in the figure panels. Arrows labelled "CCh" are used to denote CCh addition in e.g. Fig. 3D, whereas simple arrows are used elsewhere e.g. Fig. 3F or no arrow at all e.g. Fig. 3B.
      • Details of what the error denotes is missing in Figs. 2B-C, 3B-C, F - as is N for 3A.
      • Fig. 2D: It should be made explicit that arrows indicate TTX addition in the figure. More importantly, it should be clarified whether this is also the case for Fig. 5D? Transients do not appear to be depleted by this addition.
      • In figure 6C, it is stated that "there was no down regulation in transcripts for SCN1A (Nav1.1) or SCN9A (Nav7.1)" but this is not strictly true from the data; the Li-treated cells definitely trend lower. The effect is clearly not statistically significant, so although it is not possible to state that they reduced, this is not the same thing as being able to assert that they are not reduced. Perhaps it could be more helpful to plot the size of the reduction between these transcripts?

      Significance

      The manuscript is ultimately a contemporary re-evaluation of the inositol depletion hypothesis for Li treatment of BPAD, first proposed by Berridge in 1989. The manuscript certainly does not end the debate - other pathways and mechanisms for lithium's actions on a complex human behavioral phenotype will surely persist. However, it does add some important new insights: firstly, that IMPA1 is certainly the target mediating the effects of Li on the PI cycle; and secondly, that long-term, Li may exert effects at the transcriptional level to down regulate calcium and glutamatergic signaling in the brain. However, there is no mechanism presented to link these two findings. The work is therefore of interest to researchers with an interest on studying psychiatric disorders, basic mechanisms of neuronal excitability as well as molecular mechanisms of cell signaling. It therefore deserves to be published in a journal with a cross disciplinary focus.

    1. Author Response

      The following is the authors’ response to the original reviews.

      eLife assessment

      The study by O'Reilly and Delis provides a valuable data-driven framework for extracting task-related muscle synergies in a step towards the understanding and practical use of synergies in real scenarios (e.g., evaluation of patients in a clinical environment). The approach is incomplete since the authors did not compare their method with classical physiologically grounded approaches for assessing muscle synergies. In this sense, the comparisons with classical approaches would clarify if physiological assemblies were preserved and were not altered to incorporate task space variables. Despite limitations, the proposed framework would interest motor control and neural engineering researchers.

      We thank the editors for the positive assessment of our work and appreciate their constructive feedback. In our revised manuscript, we believe we have sufficiently addressed the identified limitations by a) comparing our approach to existing physiologically-based methods, providing thorough comparisons of their respective outputs, b) applying it to a dataset of post-stroke participants to demonstrate that it can identify physiologically-interpretable markers of motor recovery and c) providing examples to demonstrate how readers can interpret the novel perspective introduced.

      Reviewer #1 (Public Review):

      The proposed study provides an innovative framework for the identification of muscle synergies taking into account their task relevance. State-of-the-art techniques for extracting muscle interactions use unsupervised machine-learning algorithms applied to the envelopes of the electromyographic signals without taking into account the information related to the task being performed. In this work, the authors suggest including the task parameters in extracting muscle synergies using a network information framework previously proposed. This allows the identification of muscle interactions that are relevant, irrelevant, or redundant to the parameters of the task executed.

      The proposed framework is a powerful tool to understand and identify muscle interactions for specific task parameters and it may be used to improve man-machine interfaces for the control of prostheses and robotic exoskeletons.

      With respect to the network information framework recently published, this work added an important part to estimate the relevance of specific muscle interactions to the parameters of the task executed. However, the authors should better explain what is the added value of this contribution with respect to the previous one, also in terms of computational methods.

      We thank the reviewer for their constructive comments. We have adjusted the introduction section of the manuscript to better explain the added value of this framework over previous work. Specifically, we draw the reviewer’s attention to the following updated section of the introduction:

      “In [11], we considered, key limitations among current approaches to muscle synergy analysis in extracting functionally relevant and interpretable patterns of muscle activity [12]. We proposed a combinatorial approach based on information- and network-theory and dimensionality reduction (the network-information framework (NIF)) that significantly improved the generalisability of the extraction process by, among others, removing restrictive model assumptions (e.g. linearity, same mixing coefficients) and the reliance on variance-accounted-for (VAF) metrics [12]. By determining the pairwise mutual information between muscles, this innovation paved the way for the appropriate mapping of muscular interactions to the task space. To elaborate on the significance of this development, the extraction of motor patterns in isolation of the task space comes at the expense of both functional and physiological relevance [12,13]. Furthermore, effective methods for mapping large-scale physiological dynamics to behaviour is a current gap across the neurosciences [14]. Thus, here we build on this work by, for the first time, directly including task space parameters during muscle synergy extraction. In doing so, we address these current research gaps, progressing muscle synergy research and successful engineering applications in a fruitful direction [12,15,16]. This enables us, in a novel way, to dissect the concept of the muscle synergy and therefore quantify interactions between muscle activations with shared or complementary functional roles. “

      In general, the method proposed relies on several hyperparameters and cost functions that have been optimized for the specific datasets. A sensitivity analysis should be performed, varying these parameters and reporting the performance of the framework.

      We thank the reviewer for this comment which enabled us to clarify a potential misunderstanding. Our proposed framework does not require setting or varying hyperparameters to optimise cost functions.

      For model-rank specification, a modularity maximising cost-function is used which determines what partitioning of the networks results in maximal modularity. We have offered two alternative approaches using this cost-function which consistently converge on the same solution. To further ensure the representativeness of this solution, we also offer a consensus-based approach where we apply these alternative approaches to individual participant or task data, then group the collective partitions together and re-apply the approaches. One of these approaches (Equation 2.2) requires two hyperparameters, γ and ω, which adjust the intra- and inter- network layer resolutions. As stated in the manuscript, we set both of these parameters to 1, thus nullifying their presence in the cost-function and aligning our work with the classical notion of modularity. Across the two alternative approaches to model-rank specification, the solution is unique and data-driven and has a demonstratable generalisability across datasets.

      The only other cost-function present in the framework is during dimensionality reduction, which is a standard loss function used across the muscle synergy analysis literature. Thus, the approach is essentially parameter-free and we now have mentioned this more explicitly in the manuscript:

      “To empirically determine the number of components to extract in a parameter-free way, we then concatenated these adjacency matrices into a multiplex network and employed network community-detection protocols to identify modules across spatial and temporal scales (fig.3(D)) [29–32,44].”

      “In its generalised multilayer form, the Q-statistic is given an additional term to consider couplings between layers l and r with intra- and inter-layer resolution parameters γ and ω (Equation 2.2). Here, μ is the total edge weight across the network and γ and ω were set to 1 in the current study for classical modularity [30], thus removing the need for any hyperparameter tuning.”

      It is not clear how the well-known phenomenon of cross-talk during the recording of electromyographic muscle activity may affect the performance of the proposed technique and how it may bias the overall outcomes of the framework.

      Indeed artifacts such as crosstalk are a standard issue across the EMG literature and may impact the performance of subsequent analyses where prevalent in the dataset. Crosstalk is expected to be present irrespective of the task and so should not affect redundant and synergistic muscle representations, however it could be present in the task-irrelevant muscle interactions extracted. Due to the prominence of long-range functional connections with the task-irrelevant representations extracted, we suggest that such artifacts are unlikely to have played a prominent role in the extracted patterns. Nonetheless, we have recognised this possibility with the following updated sentence in the Discussion section:

      “Although distinguishing task-irrelevant muscle couplings may capture artifacts such as EMG crosstalk, our results convey several physiological objectives of muscles including gross motor functions [65], the maintenance of internal joint mechanics and reciprocal inhibition of contralateral limbs [20,50].”

      Reviewer #2 (Public Review):

      This paper is an attempt to extend or augment muscle synergy and motor primitive ideas with task measures. The authors idea is to use information metrics (mutual information, co-information) in 'synergy' creation including task information directly. My reading of the paper is that the framework proposed radically moves from attempts to be analytic in terms of physiology and compositionality with physiological bases, instead into more descriptive ML frameworks that may not support physiological work easily.

      We thank the reviewer for taking the time to provide a thorough commentary on this manuscript. An overall aim in developing this framework is to build on other recent developments in providing a more fine-grained functional architecture underlying movement control [1,2]. It is a requirement for the successful communication and introduction of this toolbox to the field to provide readers with an understanding of how to use the framework and an intuition on how to interpret the results. Thus, we agree with the reviewer that functional interpretations are of crucial use.

      We also agree with the reviewer that maintaining a physiological underpinning is a desirable direction for the field and should not be made secondary to functional descriptions. In our updated version of this manuscript, we have therefore included direct comparisons with the gold-standard in the field for muscle synergy extraction, namely non-negative matrix factorisation based muscle synergy extraction (see ‘Building on current approaches to muscle synergy analysis’ and fig.5-6 of revised manuscript) [3,4]. In these comparison, we show how our framework goes beyond this current approach in terms of functional insight while still maintaining physiological relevance. Indeed, in the revised manuscript we also include a fourth dataset comprising post-stroke participants and healthy controls (Fig.6). We demonstrate, through a simple example application to this dataset, how our proposed framework can produce more predictive representations of motor impairment than the gold-standard approach. The representations we identified were discriminative of motor impairment measured via the Fugl-Meyer assessment using just one trial per participant. This improves considerably upon the sensitivity of the current approach to altered motor patterns which have predominantly required many trials and participants to gain significance [5,6]. Thus, the patterns we extract are a more comprehensive representation of the actual underlying physiological state of the participants.

      This approach is very different from the notions of physiological compositional elements as muscle synergies and motor primitives, and to me seems to really be striving to identify task relevant coordinative couplings. This is a meta problem for more classical analyses. Classical analyses seek compositional elements stable across tasks. These elements may then be explored in causal experiments and generative simulations of coupling and control strategies. The present work does not convince me that the joint 'meta' analysis proposed with task information added is not unmoored from physiology and causal modeling in some important ways. It also neglects publications and methods that might be inconvenient to the new framework.

      We would be very interested in receiving the reviewer’s suggestions of existing approaches that we have not incorporated here and would be happy to discuss these in the revised manuscript.

      Information based separation has been used in muscle synergy analyses using infomax ICA, which is information not variance based at core. Though linear mixing of sources is assumed, minimized mutual information is the basis.

      We agree with the reviewer that ICA relies on information measures, however it does not incorporate task-space information. The novelty of our approach lies in the characterisation of muscle interactions with respect to the task at hand. If the reviewer could provide references to this statement, we would be able to consider this further.

      Physiological causal testing of synergy ideas is neglected in the literature reviews in the paper. Although these are in animal work, the clear connection of muscle synergy choices and analyses to physiology is important and needs to be managed in the new methods proposed. Is any correspondence assumed? Possible?

      We agree with τhe reviewer that this a crucial element of muscle synergy research and will aim to address it in our future work. However, we would like to point out that the current manuscript is a “tools and resources” article aiming to introduce a new framework. In our revised manuscript, we have incorporated an application of the framework to a dataset from post-stroke patients to demonstrate the use of the framework in clinical settings to identify biomarkers and use them to make predictions of motor recovery (see Fig.6 of updated manuscript).

      Questions and concerns with the framework as an overall tool:

      First, muscle based motor information sources have influences on different time scales in the task mechanics. Analyses of synergies in the methods proposed will be very much dependent on the number and quality of task variables included and how these are managed. Standardizing and comparing among labs, tasks sets and instrumentation differences is not well enough considered as a problem in this new proposed method toolset, at least in my reading. Will replication, and testing across groups ever be truly feasible in this framework?

      We agree with the reviewer that this important point can be a limitation of the applicability of the framework. For this reason, we chose a “holistic” approach, applying the framework to several datasets collected in different settings, and selecting different kinds of task variables to extract muscle networks from. Crucially, we used a leave-one-task-out and leave-one-participant-out cross validation procedure to specifically address this point. Our results showed that the extracted couplings are robust irrespective of the task variable and/or participant excluded and this lends credit to the generalisability of the framework.

      Muscle based motor information sources have influences on different time scales in the task mechanics. Kinematic analyses, dynamic analyses and force plate analyses of the same task may provide task variables that alter the results in the proposed framework it seems.

      As we have mentioned above, here we used all the above types of task variables together to illustrate the range of measures that can be included in the proposed framework and showed that the outputs are robust to the exclusion of any task/participant. This point is especially evident for dataset 3 results, where high levels of generalisability were found despite the inclusion of kinematic, dynamic and IMU data (see Table 1. of original submission and updated manuscript). We believe that this is an advantage of the approach as it allows researchers to apply the method to different kinds of measurements they may have collected and gain insights into the relationships of muscle couplings with kinematic/dynamic/force parameters. This will also enable scientists to attribute different functional roles to the identified couplings and it is something we plan to do in future applications of the framework.

      Second, there is a sampling problem in all synergy analyses. We cannot record all muscles or all task parameters. Examining synergies across multiple tasks seeks 'stationary' compositionality. Including task specific elements may or may not reinforce or give increased coordinative precision to the stationary compositionality.

      We fully agree that this is a limitation of all synergy analyses and aimed to consider this study a step in the direction of addressing this limitation by providing the research community with a toolbox that can be used to quantify muscle couplings that can have different levels of task specificity.

      To me the new methods proposed seem partly orthogonal to the ideas of stable compositionality. The 'synergies' obtained will likely differ, and are more likely to be coordinative control groupings of recurrent task and muscle motifs (based on instrumentation) which may or may not relate to core compositionality in physiology. Is there any expectation that the framework should relate to core compositionality and physiology. This is not clear in the paper as written.

      In our new analysis, we have compared the proposed approach to existing physiologically-based methodologies and showed that the new framework can capture several salient physiological features of movement that the current NMF-based approach cannot. For example, as we have moved away from optimising variance accounted for metrics, our framework can identify subtle muscle couplings that have important functional roles. These subtle couplings are often not captured in current muscle synergy analysis as, against physiological relevance, higher amplitude muscles often take prominence. Further, by directly including task parameters during extraction, we can determine the muscles that have a functional role concerning the included task parameter rather than inferring this relationship indirectly using knowledge about the task executed. In our updated manuscript, by applying the framework to post-stroke participants (see Fig.6), we were also able to demonstrate that the extracted couplings are associated with functional parameters of motor recovery and have a clear link with the physiological state of individual participants.

      It would be useful to explore the approach with a range of neuromechanical models and controllers and simulated data to explore the issues I am raising and convince readers that this analysis framework adds clarity rather than dissolving the generalizability and interpretability of analyses in terms of underlying causal mechanisms.

      The authors need to better frame their work in relation to causal analyses if they are claiming links to muscle synergies analyses and claim extension/refinement. Alternatively, these may not be linked, and instead parallel approaches exploring different hypotheses and goals using different organizational data descriptors.

      To address the reviewers concerns here, we have included in the updated manuscript a toy example simulating situations in which pairs of muscles would have a redundant or synergistic functional relationship (see Fig.2). This simulation gives clear intuition on situations where two muscles (e.g. an antagonist-agonist pair) may share functionally similar or complementary information about task direction (left vs right). In particular, within the main text describing this figure, we state how current NMF based approaches consider muscles functionally equivalent when they share similar magnitude activations, whereas our framework captures muscles with identical task information. Thus, our work is an extension of current approaches towards understanding causal mechanisms. The suggestion to use neuromechanical models is valuable, however we consider it beyond the scope of this work. This “Tools and Resources” paper is aimed at introducing the computational framework for the analysis of large-scale muscle couplings in task space. Our future work will use this framework to address unanswered questions in the field and we hope that it will be helpful for other scientists in testing their hypotheses.

      To me this appears a data science tool that may not help any reductionist efforts and leads into less interpretable descriptions of motor control. Not invalid, but sufficiently different that common term use muddies the water.

      We believe that the novel evidence we provided both on simulated and real data have contributed to a better interpretability of the approach outcomes. Specifically, we have introduced examples showing the functional roles of the different types of interactions as well as the predictive power of the outputs. Concerning the use of the term synergy, we have provided a clear description throughout the manuscript regarding the interpretation of synergy vs redundancy in the novel perspective we propose. For example in the discussion section:

      “ We thus sought to provide greater nuance to the notion of ‘working together’ by defining motor redundancy and synergy in information-theoretic terms [6,56]. In our framework, redundancy and synergy are terms describing functionally similar and complementary motor signals respectively, introducing a new perspective that is conceptually distinct from the traditional view of muscle synergies as a solution to the motor redundancy problem [3,6,7]. In this new definition of muscle interactions in the task space, a group of muscles can ‘work together’ either synergistically or redundantly towards the same task. In doing so, the perspective instantiated by our approach provides novel coverage to the partitioning of task-relevant and -irrelevant variability implemented by the motor system along with an improved specificity regarding the functional roles of muscle couplings [20–22]. Our framework emphasises not only the role of functionally redundant muscle couplings that result from the underlying degeneracy of the motor system, but also of complementary, synergistic dependencies that are important for communication and integration across specialised neural circuitry [57,58]. Thus, the present study aligns the muscle synergy concept with the current mechanistic understanding of the nervous system whilst offering an analytical approach amenable to the continued advances in large-scale data capture [14,59].”

      Reviewer #3 (Public Review):

      In this study, the authors developed and tested a novel framework for extracting muscle synergies. The approach aims at removing some limitations and constrains typical of previous approaches used in the field. In particular, the authors propose a mathematical formulation that removes constrains of linearity and couple the synergies to their motor outcome, supporting the concept of functional synergies and distinguishing the task-related performance related to each synergy. While some concepts behind this work were already introduced in recent work in the field, the methodology provided here encapsulates all these features in an original formulation providing a step forward with respect to the currently available algorithms. The authors also successfully demonstrated the applicability of their method to previously available datasets of multi-joint movements.

      Preliminary results positively support the scientific soundness of the presented approach and its potential. The added values of the method should be documented more in future work to understand how the presented formulation relates to previous approaches and what novel insights can be achieved in practical scenarios and confirm/exploit the potential of the theoretical findings.

      Strengths:

      This work proposes a novel framework that addresses physiologically non-verified hypothesis of standard muscle synergy methods: it removes restrictive model assumptions (e.g. linearity, same mixing coefficients) and the reliance on variance-accounted-for (VAF) metrics.

      The method is solid and achieves the prescribed objectives at a computational level and in preliminary laboratory data.

      A toolbox is available for testing the methods on a larger scale.

      The paper is well written and shows a high level of innovation, original content and analysis

      Weaknesses:

      Task performance variables could be specified in more quantitative definition in future work (e.g.: articular angles rather than a generic starting point- end point).

      We agree with this point and will incorporate it in future work. Our aim here was to show that the framework would work with any task variable and that scientists can use it to identify the relevance of muscle interactions to different types of task parameters.

      The paper does not show a comparison with previous approaches (e.g.: NMF) or recently developed approaches (such as MMF).

      We have now illustrated such a comparison on two datasets and explained more how the new framework can dissect the different types of muscle groupings (see ‘Building on current approaches to muscle synergy analysis’ section and Fig.5-6 of revised manuscript).

      A discussion of the likely impact of the work on the field, and the utility of the methods and data to the community.

      In our revised manuscript, we have introduced 2 new applications of the framework to real data to exemplify its use for a) functional interpretability and b) identification of biomarkers (see ‘Building on current approaches to muscle synergy analysis’ section and Fig.5-6 of revised manuscript). We also point towards its use in movement restoration and augmentation devices and in the clinical setting in the discussion section:

      “The separate quantification of these muscle interaction types opens up novel opportunities in the practical application of muscle synergy analysis, as demonstrated in the current study through the identification of a significant predictor of motor impairment post-stroke from single-trials [5,12,65]. For instance, these distinct representations may encapsulate different neural substrates that can be specifically assessed at the muscle-level for the purpose of bodily restoration and augmentation [66]. Uncovering their neural underpinnings is an interesting topic for future research.”

      In this work, the effort of the authors aimed at developing the field is clear. It is fundamental to develop novel frameworks for synergy extraction and use them to make them more interpretable and applicable to real scenarios, as well as more adherent to recent findings achieved in motor control and neuroscience that are not reflected in the standard models. At the same time, muscle synergies are being used more and more in research but their impact in practical scenarios is still limited, probably because synergies have rarely been analyzed in a functional context. This paper shows a very in-depth analysis and a novel framework to interpret data that links to the task space from a functional perspective. I also found that the results on the datasets are very well commented but could expand more to show why using this framework is advantageous.

      There are some key points for discussion that follow from this paper which can be described more, maybe in future work, and that might contribute to major developments in the field, including:

      The understanding of how the separation between relevant (redundant and synergistic) and irrelevant synergies impact on synergy analysis in practical works;

      We have now introduced new figures (Fig. 5 and 6) to the revised manuscript, demonstrating simple applications of the framework and providing intuition regarding the outputs. We have also added points to the Discussion commenting on the differences between types of couplings and how they can be interpreted in future works:

      “Our framework emphasises not only the role of functionally redundant muscle couplings that result from the underlying degeneracy of the motor system, but also of complementary, synergistic dependencies that are important for communication and integration across specialised neural circuitry [57,58]. Thus, the present study aligns the muscle synergy concept with the current mechanistic understanding of the nervous system whilst offering an analytical approach amenable to the continued advances in large-scale data capture [14,59].”

      “Although distinguishing task-irrelevant muscle couplings may capture artifacts such as EMG crosstalk, our results convey several physiological objectives of muscles including gross motor functions [64], the maintenance of internal joint mechanics and reciprocal inhibition of contralateral limbs [19,49]. Thus, task-irrelevant muscle interactions reflect both biomechanical- and task-level constraints that provide a structural foundation for task-specific couplings. The separate quantification of these muscle interaction types opens up novel opportunities in the practical application of muscle synergy analysis, as demonstrated in the current study through the identification of a significant predictor of motor impairment post-stroke from single-trials [5,12,65]. For instance, these distinct representations may encapsulate different neural substrates that can be specifically assessed at the muscle-level for the purpose of bodily restoration and augmentation [66]. Uncovering their neural underpinnings is an interesting topic for future research.”

      Interpreting how different synergistic organizations described in this work allows to better describe data from real scenarios (e.g.: motor recovery of patients after neurological diseases);

      We have now added an example application of the framework to a dataset of stroke patients (Fig.6) and identified a redundant muscle patterns that are predictive of functional measures.

      Discussing in detail how the presented findings compare with standard algorithms such as NMF to determine the added value provided with this approach;

      As indicated above, we have now shown such a comparison on two new datasets (see Fig.5-6 of revised manuscript).

      Describe how redundant synergies reflect real neural organization and - if their "existence" is confirmed - how they contribute to redesign the concept of muscle synergies and of modular/synergistic control in general.

      This is an important point that we have now addressed more in our Discussion by relating redundant muscle couplings to degeneracy in the motor system and synergistic couplings to integrative dynamics by higher-level processes. We have also added a simple simulation illustrating how synergistic and redundant interactions co-exist and represent different contributions to task performance (see Fig.2 of revised manuscript).

    1. (a) applying the reading standard for informational text: explaining how an author uses reasons and evidence to support particular points in a text, identifying which reasons and evidence support which particular points (RI.5.8); (b) the writing standard: produce clear and coherent writing in which the development and organization are appropriate to task, purpose, and audience (W.5.4); and (c) the language standard: vocabulary use (L.5.4-6), particularly transition words to help their writing flow logically. Students are writing an argument to encourage their readers to take more care of the natural environment.

      This CCSS can be practiced at many age levels to help prepare them for future reading activities or tests.

    1. Author Response

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Recommendations For The Authors):

      p. 5, l. 87-90: The control of flgM by OmrA/B (PMID 32133913) and the antisense RNA to flhD (PMID 36000733) are other examples of known regulatory RNAs that impact the flagellar regulon.

      We thank the reviewer for pointing out these references and have added citations to them (page 5, lines 87-91).

      p.11/Fig. 3: it is intriguing that ArcZ and RprA, two of the rpoS-activating sRNAs, repress lrhA. I realize that it is outside of the scope of this study, but have the authors considered the possibility that ArcZ or McaS could have a role in the previously reported repression of rpoS by LrhA (PMID 16621809)?

      We agree that it is intriguing that ArcZ and RprA, two of the rpoS-activating sRNAs, repress lrhA, and added mention of this regulatory connection (page 12, lines 247-250).

      p. 13/l. 272: I do not understand why the authors say that "r-proteins were almost exclusively found in chimeras with MotR and FliX and no other sRNAs...", given that several other chimeras between r-prot and other sRNAs are found

      While some r-proteins encoding genes were found with other sRNAs in RIL-seq datasets, MotR and FliX generally had the highest numbers. The text was revised to better describe the RIL-seq data for r-proteins interaction partners (page 14, lines 291-295), and a new panel showing the S10 operon with all the interacting sRNAs was added to Figure 3—figure supplement 1B.

      Fig. 4 and 5: One possible improvement would be to more systematically assess the effect of base-pairing mutants of the sRNAs, such as MotRM1 or FliXM1 on fliC and rps/rpl genes in vivo. This is especially important for the mutants that affected the sRNA effects in the in vitro probing assays, such as UhpU-M2, MotR-M1 and FliX-S-M1 on fliC (Fig. S7)

      As suggested, we examined fliC mRNA levels across growth in motR-M1 and fliX-M1 chromosomal mutants. The results of these northern assays, now shown in Figure 8—figure supplement 1, are consistent with our model as we observed delayed expression of fliC mRNA in motR-M1 background and premature expression in fliX-M1 background (page 21, lines 444446, 449-453).

      Fig. 5: it may be worth including a schematic of the whole S10 operon to highlight its length and its organization?

      As suggested, a schematic representation of the S10 operon was added to Figure 3—figure supplement 1 with a summary of the RIL-seq data for this operon.

      Probing data (Fig. 5, S7 and S9): in general, it is difficult to differentiate the thin and thick brackets, and what is indicated by the dashed brackets is not always clear. Maybe using a color-code instead could help? Highlighting the predicted pairing regions on the different gels could be useful as well.

      We thank the reviewer for this suggestion and color-coded the brackets (Figure 5, Figure 4figure supplement 2, and Figure 5-figure supplement 2). The correspondences to regions of predicted pairing are described in the figures legends.

      Fig. S10: The experimental evidence used to support FliX-dependent degradation of the rpsS mRNA is indirect (primer extension to observe higher levels of cleavage intermediates). It would be nice to be able to observe a decrease in the mRNA levels as well, either by Northern, or primer extension from a region more distant to the FliX pairing site.

      The S10 operon is long (~5 KB). We have tried multiple probes for this mRNA and detect many bands with each, likely due to extensive regulation of this operon. We think teasing out the origin of the different bands to appropriately interpret changes in patterns will require a significant amount of work.

      legend of Fig. S10: from the gel, it seems that only the plasmids differ in the samples, and it is not clear where the data corresponding to the WT strain mentioned in the legend is shown

      The samples shown in this figure are all for the indicated plasmids in the WT strain. We corrected the figure legend.

      Table S1: please define the NOR (normalized odds ratio?)

      The definition of Normalized Odds Ratio was added to the legend of Supplementary file 1.

      Reviewer #2 (Recommendations For The Authors):

      Major comments:

      Figure 1B. Please add a negative control (which could be in the supplementary section) from a large section showing transcripts that are not directly influenced by Hfq.

      We think the flgKLO browser in this figure serves as a negative control; flgK and flgL clearly are not enriched on Hfq in contrast to FlgO. Figure 1B was generated using published datasets that are easily accessible to the readers at a genome browser and show many other examples of transcripts that are not influenced by Hfq: https://genome.ucsc.edu/cgi-bin/hgTracks?hubUrl=https://hpc.nih.gov/~NICHD- core0/storz/trackhubs/ecoli_rilseq/hub.hub.txt&hgS_loadUrlName=https://hpc.nih.gov/~NICHDcore0/storz/trackhubs/ecoli_rilseq/session.txt&hgS_doLoadUrl=submit

      Line 158. MotR* is a more abundant version of [the constitutively overexpressed] MotR. Is there a Northern or qPCR to confirm this? While I understand the relevance of these mutated constructs, their high expression can lead to artefactual effects.

      This is a valuable point and therefore we provided a northern blot to document the relative levels of MotR and MotR* (Figure 2—figure supplement 1A).

      Figure 2. The overexpression of MotR/MotR* from a plasmid is increasing the number of flagella. However, when the MotR gene is deleted, is there a reduction of the number of flagella? Same question with FliX: what happens when the fliX gene is deleted? According to the model described in the manuscript, we should expect fewer flagella in ΔmotR background and an increased number of flagella in ΔfliX background. Both Figure 2 and Figure 8 would benefit from additional experiments with deleted motR and fliX genes.

      We agree that experiments regarding the endogenous effects of endogenous sRNAs are important. We provided such data in Figure 8 and Figure 8—figure supplement 1 for MotR and FliX in a variety of assays: flagella numbers by electron microscopy, motility and competition assays, expression of flagellar genes by RT-qPCR and western analysis. The chromosomallyexpressed MotR-M1 and FliX-M1 base pairing mutants did show the expected phenotypes of reduced and increased numbers of flagella, respectively (Figure 8A-B). As suggested by reviewer 1, we added northern analysis that examined fliC mRNA levels across growth in motRM1 and fliX-M1 chromosomal mutants. The results of these northern assays are consistent with our model as we observed delayed expression of fliC mRNA in motR-M1 background and premature expression in fliX-M1 background. We went to the trouble of constructing strains carrying point mutations in the chromosomal copies of these genes rather than deletions to avoid interfering with the expression of motA and fliC given that MotR and FliX encompass the 5’ and 3’ UTRs, respectively.

      Figure 3 is key to demonstrating the sRNAs pairing with their specific targets and potential effect on bacterial swimming. However, these results would be more relevant with endogenous expression of the sRNAs and demonstration of their effects on the same targets. A Northern blot showing the overproduced sRNA level compared to endogenous sRNA level could help us appreciate the expression ratio.

      The levels of the UhpU, MotR and FliX expressed from the overexpression plasmids are at least 100-fold higher than the endogenous levels. Thus, we agree that assays of chromosomal deletion/point mutants are important experiments. We did construct chromosomal uhpU-M1 and uhpU∆seed sequence mutants. However, under the conditions assayed, the uhpU chromosomal mutations did not result in observable effects on motility or FlhD-SPA protein levels. It is possible we would be able to detect differences between the wild type and uhpU chromosomal mutant strains under different growth conditions or in different assays, but this would require a significant amount of work. For many other sRNA chromosomal mutations have no or only subtle effects, suggesting redundancy between sRNAs or sRNA roles in fine tuning gene expression.

      Figure 4. In panel B, the empty plasmid pZE alone seems to positively affect the flagellin expression when compared to the WT background. This can also be seen in Figure 4C. There is no fliC signal with empty plasmid pBR* but a strong fliC signal with empty plasmid pZE. Maybe the authors can explain this in the manuscript.

      With respect to panel B and Figure 4—figure supplement 1A, we agree that there is some variation between the levels of flagellin in the WT and pZE control samples, possibly due to the addition of antibiotic to the pZE culture. We added quantification of the bands in Figure 4— figure supplement 1 to better document the changes in flagellin levels.

      With respect to panel C, the pBR samples were collected in crl+ background while the pZE samples were collected in crl- background, which explains the lack of fliC signal in the pBR control sample. This is now noted in the figure legend.

      In lines 154-157, the justification for using two plasmids is described. An IPTG-inducible Plac promoter, the pBR*, is used because the constitutive overexpression of UhpU is resulting in mutated UhpU clones. These observations suggest a toxic expression level of UhpU that the cell can only tolerate when the UhpU RNA is somewhat deactivated by mutations. This does not seem like a detail and could be discussed further.

      We agree with the reviewer that this observation is important and now mention that it suggests at a critical UhpU role (page 8, lines 160-163).

      Figure 5E and I. While the bindings of MotR on rpsJ and Flix-S on rpsS are clear, the resolution of both gels in the areas of binding (upper part of both gels) could be improved.

      We found it tricky to choose the mRNA fragments for the in vitro structure probing for the regions of predicted pairing internal to CDSs. Given that we hoped to retain native RNA folding, we chose long fragments; for rpsJ, we started with the +1 of S10 leader and for rpsS, we started 147 nt into the CDS, a region that overlaps the region that was cloned to the rpsS-rplV-gfp fusion. Consequently, the region of base pairing is in the upper part of both gels. The gels were already run for an unusually long time. Thus, we do not think the resolution could be improved further. Nevertheless, we think the region of protection is evident for both mRNAs.

      Minor comments:

      Fig 1B. The promoter symbols are extremely small, please increase the size.

      As suggested, we have enlarged the promoter symbols in Figure 1B as well as in Figure 3A.

      Line 211. "the lrhA mRNA has an unusually long 5´ UTR". How long exactly?

      The 5’ UTR of the lrhA mRNA is 371 nt long. This is now mentioned in the text (page 11, line 224)

      Line 320. Should "Fig 9C" be "Fig S9C" instead?

      We thank the reviewer for noticing this typo. Callouts to supplementary figures have now been renumbered per eLife format.

      Line 384. Something seems to be missing in the sentence "a representative combined class 2 and 3 promoter".

      The sentence has been modified to clarify the designation (page 19, lines 409-411).

      Reviewer #3 (Recommendations For The Authors):

      Recommendation to clarify/strengthen the presentation of science in the paper:

      Lines 102-103: Can the authors provide some more information on how the sRNAs were initially discovered to be potentially sigma-28 dependent and selected?

      As suggested, we expanded the section discussing the discovery and the selection of these sRNAs (page 6, lines 104-109).

      Lines 192-193: It would be helpful to provide a bit more information in the main text about what are the different RIL-seq data sets (18 in total).

      As suggested, we now provide more details about the different RIL-seq datasets we used in the analysis (page 10, lines 202-205).

      It would be helpful to specify the criteria for "top" interactions in targets retrieved from RIL-seq data (Table S1 and text, e.g., line 273): e.g. number of conditions, number of chimeras, etc.

      As suggested, we now more explicitly specify the criteria for selecting targets to characterize (page 10, lines 205-206).

      Fig. 4B/ S6 and line 242: The flagellin amount in the empty vector control (pZE) looks higher than in WT, and the stated effect of MotR/MotR* OE on flagellin is not very clear from the blot. The "cross-reacting band" above flagellin also seems to vary among strains. Could the authors include a quantification of flagellin protein amount and normalize relative to a housekeeping protein (e.g., GroEL), instead of Ponceau S as loading control?

      We agree that there is some variation between the levels of flagellin in the WT and pZE control sample, possibly due to the addition of antibiotic to the pZE culture. We added quantification of the bands in Figure 4—figure supplement 1 to better document the changes in flagellin levels.

      Figure legends: It would be helpful to have a bit more information about the method used/displayed image rather than stating results in the legends.

      As suggested, we now provide a bit more information about the methods used/displayed image in the figure legends to allow for easier comprehension of the data presented in the figures (while trying to balance this with the length of the legends).

      Fig. 2: Please include a scale for all electron microscopy images or, if it is the same for all panels, state it in the figure legend. Moreover, the same image is used for the pZE control in panel C, E and Figure S4A/C. It would be better to show different fields of bacteria for the pZE sample.

      As is now mentioned in the legends to Figure 2, Figure 2—figure supplement 2, and Figure 8, the same scale was used for all panels. We thought it was better to show the same image for the pZE control in the different panels to emphasize that these samples were all analyzed on the same day.

      Fig. 2: The sRNA OE strains seem to show some heterogeneity in cell length (pZE-MotR) or width (pZE-FliX). The authors could, e.g., check whether this is a phenotype correlated to sRNA OE by quantifying these parameters for different fields and comparing to WT or comment on this in the text if this is not consistently seen.

      We also were intrigued by the slightly different sizes and widths of cells in the EM images. However, our statistical analysis did not reveal significant differences between the different samples. We now comment on this (page 53, lines 1178-1179).

      As a follow-up to this study, it would be interesting to assess the impact of MotR and FliX regulation of ribosomal protein synthesis on overall ribosome activity (e.g., via Ribo-seq), also considering that antitermination regulates rRNA transcription. In the case of MotR, the authors suggest that MotR upregulation of S10 protein might not only impact antitermination, but also lead to the formation of more active ribosomes that would increase flagellar protein synthesis (lines 359-362). However, in the RNA-seq performed in OE MotR* several transcripts encoding rRNA and ribosomal proteins are significantly downregulated compared to EVC (Supplementary Table S2). Could the authors comment on this?

      We share the reviewer’s enthusiasm for follow-up work and thank for the suggested experiments. We hope we will be able to decipher the full mechanism of MotR and FliX action on ribosomal protein synthesis in future experiments. The observation that some ribosomal protein-coding gene levels are reduced in the RNA-seq experiment with overexpression of MotR* is interesting but we do not have an explanation other than the fact that the samples were collected early in exponential growth. We now mention the observation in the text (page 19, lines 404-407).

      Considering that OE of the WT MotR appears to increase fliC mRNA abundance but has no strong impact on flagellin protein levels, can the authors speculate what is the physiological relevance of MotR* for flagellin production?

      We agree that while we do see significant increases in the flagella number and fliC mRNA abundance with MotR and MotR* overexpression, the western analysis did not reveal a striking increase in flagellin levels and also wonder how MotR strongly increases the flagella number, which requires flagellin subunits, but only has a weak effect on the intercellular levels of flagellin. One possibility explanation is that it is more difficult to see significant increases for a protein whose levels are high to begin with. These points are now discussed (page 13, lines 264-269).

      Fig. 4C: The pZE samples seem to show variable expression of fliC mRNA although the samples are collected at the same timepoints. Try to clarify in the text.

      The northern membrane on the bottom was exposed for a longer time due to the lower fliC mRNA levels in the samples with FliX overexpression. We now note these differences in the legends to Figure 4 and Figure 4—figure supplement 1.

      Fig. 7/S13: While a volcano plot for MotR is shown in Fig. 7A, quantification of GFP reporter fusion regulation is shown for MotR. Quantifications of MotR are shown in Fig. S13. Maybe swap the figures.

      Given that the data for MotR are in the supplement figures for all other figures we would also like to retain this distribution for Figure 7 (aside from the volcano plot since this experiment was only carried out for MotR).

      Lines 135-136 (Fig. S1B): on the northern blots, only sRNA levels of MotR are comparable between rich and minimal media (excluding M63 G6P and M63 gal). Most other sRNA seem to be more abundantly expressed in minimal media conditions compared to LB. Maybe rephrase.

      As suggested, the text was revised to point out the differences in the sRNA levels for cells grown in different growth media (page 7, lines 140-144).

      Lines 229-234: this paragraph seems not directly connected to the aims of the study (i.e., no effect on motility tested of these other sRNAs) and could be removed (or moved to discussion).

      We appreciate the reviewer’s suggestion but, considering Reviewer 1’s comments, think that showing the regulation of lrhA by other sRNAs has value in highlighting the complexity of the regulatory circuit. We have revised the text to incorporate Reviewer 1’s suggestions and better explain why these results are intriguing (page 12, lines 247-250).

      Line 200 and Fig. S5: For FlgO sRNA only one target was identified in RIL-seq. This gene could be specified and labeled in Fig. S5 and the text. Does FlgO also bind ProQ?

      We now mention the single FlgO target (gatC) detected in four datasets (page 10, lines 213215). In Figure 3—figure supplement 1, we labeled only targets that we followed up with in the current study. Therefore, to be consistent, we prefer not to label gatC in the FlgO plot. FlgO was found to co-immunoprecipitate with ProQ but at much lower levels than with Hfq, and to have very few RNA partners (Melamed et al., 2020).

      Lines 493-498: It is mentioned that the four sRNAs were also detected in recent RIL-seq experiments of Salmonella and EPEC. Are any of the here identified targets also found in other species or was none detected as analyses were carried out under conditions that do not favor flagella expression?

      The targets identified in this study were not detected in the Salmonella and EPEC RIL-seq datasets. However, the Salmonella and EPEC experiments were carried out under different growth conditions. Based on the sequence conservation of the Sigma 28-dependent sRNAs across several bacterial species (Figure 8—figure supplement 2), we do think overlapping targets will be found in other bacterial species under the appropriate growth conditions.

      The strongest evidence of MotR dependent target regulation is the one on rpsJ, which does not necessarily require the additional experiments with MotR. Since the authors were able to show upregulation of the rpsJ-gfp reporter upon OE of MotR WT, it would have strengthened the results if they performed the experiments in Fig. S8C with MotR WT. Similary as an increase of flagella number was seen with OE of MotR WT in Fig. 2A, the effect of the OE S10∆loop could be compared to OE MotR instead of OE MotR (Fig. 6A). At least if would be helpful, to briefly comment on why MotR* was used instead of MotR WT for these experiments.

      As suggested, we state MotR was used in some assays given the stronger effects for some phenotypes (page 10, lines 196-197). We think, given that we established MotR and MotR cause the same effects, with increased intensity for the latter, it is reasonable to use MotR* in some of the experiments.

      p. lines 482-491 and 508-511: The authors discuss that both UhpU sRNAs and RsaG sRNA from S. aureus are derived from the 3'UTR of uhpT, but conclude there is no overlap regarding flagella regulation, suggesting independent evolution of these sRNAs. However, the authors also mention that UhpU sRNA has many additional targets beyond LhrA involved in carbon and nutrient metabolism. Thus, maybe regulation of metabolic traits could be a conserved theme and function for UhpU and RsaG? Maybe try to comment on or better connect these two parts in the discussion.

      As suggested, we now comment on the possibility of the regulation of metabolic traits being a conserved theme and function for UhpU and RsaG (page 24, lines 520-527).

      Check the text for consistency regarding the use of italics for gene names (e.g., legend of Figs. 7 and 8)

      The text was corrected.

      Please introduce abbreviations, e.g., G6P (line 139), REP (line 150), ARN (line 258), NOR/U (Table S1 legend)

      As suggested, we now introduce the abbreviations for G6P (page 7, line 142), REP (page 8, lines 155-156), and NOR (Supplementary file 1 legend). Regarding ARN, these sequences are already written in parentheses in the same sentence. However, we revised this to “ARN motif sequences” (page 13, line 278).

      Fig. S1A: Highlight REP sequence mentioned in text (line 150).

      REP sequences are now highlighted in gray in Figure 1—figure supplement 1A.

      Fig. S1C: It would be helpful to list number nt positions on the sRNAs based on full-length transcripts.

      The corresponding positions based on the full-length transcripts have also been added to this figure.

      Fig. S2: Adjust the position of UhpU-S label.

      UhpU-S label position was adjusted.

      Fig. S6: Include UhpU in the figure title.

      UhpU was added to the title.

      Fig. S10: It would be helpful to indicate on the figure (or state more clearly in the legend) which RNA was extracted from WT or ΔfliCX background.

      The samples shown in the Figure are all in a WT strain. We corrected the figure legend accordingly.

      Line 290: the effect is on flagella number, not motility.

      This typo is now corrected (page 15, line 312).

      Fig. S8: One-way ANOVA (panel A legend)

      This typo is now corrected (page 64, line 1433).

      Line 320: Fig. S9C instead of 9C

      We thank the reviewer for noticing the typo. The numbering of the supplementary figures has now been changed to the eLife format.

      It would be helpful to add reference for statement in line 57.

      A reference to (Fitzgerald et al., 2014) was added as suggested.

      Add PMID:32133913 as reference for post-transcriptional regulation of the flagellar regulon in the introduction (lines 87-91)

      The indicated reference was added as suggested (page 5, lines 87-91).

      Legend Fig. S6: expand view -> expanded view

      This typo is now corrected (page 63, line 1406).

      line 513: sRNA -> sRNAs

      This typo is now corrected (page 25, line 549).

      Fig. 8G: Maybe include lrhA as target of UhpU sRNA at top of the cascade.

      As suggested lrhA has been added as a target of UhpU at the top of the cascade.

    1. Despite widespread use of nucleic acid diagnostic procedures, cultures remain essential in clinical diagnostic laboratories. Isolation in pure culture is required for identification and most phenotypic antimicrobial susceptibility testing. ++ Growth on artificial media, isolation, and identification of the infecting agent is usually the most sensitive and specific means for an etiologic diagnosis of common bacterial and fungal pathogens. Theoretically, the presence of a single live organism in the specimen can yield a positive result. Most bacteria and fungi can be grown in a variety of artificial media, but strictly intracellular microorganisms (eg, Chlamydia, Rickettsia, and viruses) can be isolated only in cultures of living eukaryotic cells. Consequently, molecular methods have replaced culture for these pathogens. +++ Isolation and Identification of Bacteria and Fungi ++ Almost all medically important bacteria can be cultivated outside the host in artificial culture media. A single bacterium placed in the proper culture conditions multiplies to quantities sufficient to be seen by the naked eye. Bacteriologic media are broth recipes prepared from digests of animal or vegetable protein supplemented with nutrients such as glucose, yeast extract, serum, or blood to meet the metabolic requirements of the organism. Their chemical composition is complex, and their success depends on matching the nutritional requirements of most heterotrophic living things. The same approaches are used for growing fungi. ++ Bacteria grow in broth and on solid media ++ Growth in media prepared in the fluid state (broth) is apparent when bacterial numbers are sufficient to produce turbidity or macroscopic clumps. Turbidity results from reflection of transmitted light by the bacteria; depending on the size of the organism, from 105 to 106 bacteria per milliliter of broth are required. The addition of a gelling agent to a broth medium allows its preparation in solid form in Petri dishes. The universal gelling agent for diagnostic bacteriology is agar—a polysaccharide extracted from seaweed. Agar has the convenient property of becoming liquid at approximately 95°C but not returning to the solid gel state until cooled to less than 50°C. This allows the addition of a heat-labile substance such as blood to the medium before it sets. At temperatures used in the diagnostic laboratory (37°C or lower), broth–agar exists as a smooth, solid, nutrient gel. This medium, usually termed agar, may be qualified with a description of any supplement (eg, blood agar). ++ Large numbers of bacteria produce turbidity Agar is used to solidify media ++ A useful feature of agar plates is that the bacteria can be separated by spreading a small sample of the specimen over the surface. Bacterial cells that are well separated from others grow as isolated colonies, often reaching 2 to 3 mm in diameter after overnight incubation. This allows isolation of bacteria in pure culture because the colony is assumed to arise from a single organism (Figure 4–5). Colonies vary greatly in size, shape, texture, color, and other features called colonial morphology. Colonies from different species or genera often differ substantially, whereas those derived from the same strain are usually consistent. Differences in colonial morphology are very useful for separating bacteria in mixtures and as clues to their identity. ++ FIGURE 4–5. Bacteriologic plate streaking. Plate streaking is essentially a dilution procedure. A. (1) The specimen is placed on the plate with a swab, loop, or pipette and evenly spread over approximately part of plate surface with a sterilized bacteriologic loop (2-5). The loop is flamed to remove residual bacteria, and a series of overlapping streaks are made flaming the loop between each one. B. After overnight incubation, heavy growth is seen in the primary areas followed by isolated colonies. More than one organism is present because both a red and a clear colony are seen. (Reproduced with permission from Willey JM: Prescott, Harley, & Klein’s Microbiology, 7th ed. New York, NY: McGraw Hill; 2008.) Graphic Jump LocationView Full Size| Favorite Figure |Download Slide (.ppt) ++ Bacteria separated in isolated colonies Colonies may have characteristic features +++ Culture Media ++ Over the last 100 years, countless media have been developed by microbiologists to aid in the isolation and identification of medically important bacteria and fungi. Only a few have found their way into routine use in clinical laboratories. These may be classified as nutrient, selective, or indicator media. +++ Nutrient Media ++ The nutrient component of a medium is designed to satisfy the growth requirements of the organism to permit isolation and propagation. For medical purposes, the ideal medium would allow rapid growth of all agents. No such medium exists; however, several suffice for good growth of most medically important bacteria and fungi. These media are prepared with enzymatic or acid digests of animal or plant products, such as muscle, milk, or soybeans. The digest reduces the native protein to a mixture of polypeptides and amino acids that also includes trace metals, coenzymes, and various undefined growth factors. For example, one common broth contains a digest of casein (milk curd) and a digest of soybean meal. To this nutrient base, salts, vitamins, or body fluids such as serum may be added to provide pathogens with the conditions needed for optimum growth. All cultures of blood use this type of medium. ++ Media are prepared from animal or plant products +++ Selective Media ++ Selective media are used when specific pathogenic organisms are sought in sites with an extensive microbiota (eg, Campylobacter species in fecal specimens). In these cases, other bacteria may overgrow the suspected etiologic species in simple nutrient media, either because the pathogen grows more slowly or because it is present in much smaller numbers. Selective media usually contain dyes, other chemical additives, or antimicrobial agents at concentrations designed to inhibit contaminating flora but not the suspected pathogen. ++ Contaminants inhibited with chemicals or antimicrobials +++ Indicator Media ++ Indicator media contain substances designed to demonstrate biochemical or other features characteristic of specific pathogens or organism groups. The addition to the medium of one or more carbohydrates and a pH indicator is frequently used. A color change in a colony indicates the presence of acid products and thus of fermentation or oxidation of the carbohydrate by the organism. The addition of red blood cells (RBCs) to plates allows the hemolysis produced by some organisms to be used as a differential feature. In practice, nutrient, selective, and indicator properties are often combined to various degrees in the same medium. It is possible to include an indicator system in a highly nutrient medium and also make it selective by adding appropriate antimicrobials. Some examples of culture media commonly used in diagnostic microbiology are listed in Appendix 4–1, and more details of their constitution and application are provided in Appendix 4–2. ++Table Graphic Jump LocationAPPENDIX 4–1Some Media Used for Isolation of Bacterial PathogensView Table| Favorite Table |Download (.pdf) APPENDIX 4–1 Some Media Used for Isolation of Bacterial Pathogens MEDIUM USES General-purpose Media Nutrient broths (eg, soybean–casein digest broth) Most bacteria, particularly when used for blood culture Thioglycolate broth Anaerobes, facultative bacteria Blood agar Most bacteria (demonstrates hemolysis) and fungi Chocolate agar Most bacteria, including fastidious species (eg, Haemophilus) and fungi Selective Media   MacConkey agar Nonfastidious Gram-negative rods Hektoen enteric agar Salmonella and Shigella Selenite F broth Salmonella enrichment Sabouraud agar Isolation of fungi, particularly dermatophytes Special-purpose Media   Löwenstein–Jensen medium, Middlebrook agar M tuberculosis and other mycobacteria (selective) Martin–Lewis medium Neisseria gonorrhoeae and Neisseria meningitidis (selective) Tinsdale agar C diphtheriae (selective) Regan-Lowe charcoal agar Bordetella pertussis (selective) Buffered charcoal–yeast extract agar Legionella species (nonselective) Campylobacter blood agar Campylobacter jejuni (selective) Thiosulfate-citrate-bile-sucrose agar (TCBS) Vibrio cholerae and Vibrio parahaemolyticus (selective) ++Table Graphic Jump LocationAPPENDIX 4–2Characteristics of Commonly Used Bacteriologic MediaView Table| Favorite Table |Download (.pdf) APPENDIX 4–2 Characteristics of Commonly Used Bacteriologic Media Nutrient broths. Some form of nutrient broth is used for culture of blood and all direct tissue samples from sites that are normally sterile to obtain the maximum culture sensitivity. Selective or indicator agents are omitted to prevent inhibition of more fastidious organisms. Blood agar. The addition of defibrinated blood to a nutrient agar base enhances the growth of some bacteria, such as streptococci. This often yields distinctive colonies and provides an indicator system for hemolysis. Two major types of hemolysis are seen: β-hemolysis, a complete clearing of red cells from a zone surrounding the colony; and α-hemolysis, which is incomplete (ie, intact red cells are still present in the hemolytic zone), but shows a green color caused by hemoglobin breakdown products. The net effect is a hazy green zone extending 1 to 2 mm beyond the colony. A third type, α’-hemolysis, produces a hazy, incomplete hemolytic zone similar to that caused by α-hemolysis, but without the green coloration. Chocolate agar. If blood is added to molten nutrient agar at approximately 80°C and maintained at this temperature, the red cells are gently lysed, hemoglobin products are released, and the medium turns a chocolate brown color. The nutrients released permit the growth of some fastidious organisms such as H influenzae, which fail to grow on blood or nutrient agars. This quality is particularly pronounced when the medium is further enriched with vitamin supplements. Given the same incubation conditions, any organism that grows on blood agar also grows on chocolate agar. Martin–Lewis medium. A variant of chocolate agar, Martin–Lewis medium is a solid medium selective for the pathogenic Neisseria (N gonorrhoeae and N meningitidis). Growth of most other bacteria and fungi in the genital or respiratory flora is inhibited by the addition of antimicrobial agents. One formulation includes vancomycin, colistin, trimethoprim, and anisomycin. MacConkey agar. This agar is both a selective and an indicator medium for Gram-negative rods, particularly members of the family Enterobacteriaceae and the genus Pseudomonas. In addition to a peptone base, the medium contains bile salts, crystal violet, lactose, and neutral red as a pH indicator. The bile salts and crystal violet inhibit Gram-positive bacteria and the more fastidious Gram-negative organisms, such as Neisseria and Pasteurella. Gram-negative rods that grow and ferment lactose produce a red (acid) colony, often with a distinctive colonial morphology. Hektoen enteric agar. The Hektoen medium is one of many highly selective media developed for the isolation of Salmonella and Shigella species from stool specimens. It has both selective and indicator properties. The medium contains a mixture of bile, thiosulfate, and citrate salts that inhibits not only Gram-positive bacteria, but members of Enterobacteriaceae other than Salmonella and Shigella that appear among the normal flora of the colon. The inhibition is not absolute; recovery of Escherichia coli is reduced 1000- to 10,000-fold relative to that on nonselective media, but there is little effect on growth of Salmonella and Shigella. Carbohydrates and a pH indicator are also included to help to differentiate colonies of Salmonella and Shigella from those of other enteric Gram-negative rods. Anaerobic media. In addition to meeting atmospheric requirements, isolation of some strictly anaerobic bacteria on blood agar is enhanced by reducing agents such as L-cysteine and by vitamin enrichment. Sodium thioglycolate, another reducing agent, is often used in broth media. Plate media are made selective for anaerobes by the addition of aminoglycoside antibiotics, which are active against many aerobic and facultative organisms but not against anaerobic bacteria. The use of selective media is particularly important with anaerobes because they grow slowly and are commonly mixed with facultative bacteria in infections. Highly selective media. Media specific to the isolation of almost every important pathogen have been developed. Many allow only a single species to grow from specimens with a rich normal flora (eg, stool). The most common of these media are listed in Appendix 4–1; they are discussed in greater detail in following chapters. ++ Metabolic properties demonstrated by indicator systems +++ Atmospheric Conditions +++ Aerobic ++ After inoculation, cultures of most aerobic bacteria are placed in an incubator with temperature maintained at 35°C to 37°C. Slightly higher or lower temperatures are used occasionally to selectively favor a certain organism or organism group. Most bacteria that are not obligate anaerobes grow in air; however, CO2 is required by some and enhances the growth of others. Incubators that maintain a 2% to 5% concentration of CO2 in air are frequently used for primary isolation, because this level is not harmful to any bacteria and improves isolation of some. Some bacteria (eg, Campylobacter) require a microaerophilic atmosphere with reduced oxygen (5%) and increased CO2 (10%) levels to grow. This can be achieved by using a commercially available packet that is placed in a jar which is then sealed similar to the anaerobic system described further. ++ Incubation temperature, atmosphere vary +++ Anaerobic ++ Strictly anaerobic bacteria do not grow under the conditions just described, and many die when exposed to atmospheric oxygen or high oxidation–reduction potentials. Most medically important anaerobes grow in the depths of liquid or semisolid media containing any of a variety of reducing agents, such as cysteine, thioglycollate, ascorbic acid, or even iron filings. An anaerobic environment for incubation of plates can be achieved by replacing air with a gas mixture containing hydrogen, CO2, and nitrogen and allowing the hydrogen to react with residual oxygen on a catalyst to form water. A convenient commercial system accomplishes this chemically in a packet that is added before the jar is sealed. Specimens suspected to contain significant anaerobes should be processed under conditions designed to minimize exposure to atmospheric oxygen at all stages. ++ Anaerobes require reducing conditions, no oxygen +++ Clinical Microbiology Procedures ++ Routine laboratory procedures for processing specimens from various sites are needed because no single medium or atmosphere is ideal for all bacteria. Combinations of broth and solid-plated media and aerobic, CO2, and anaerobic incubation must be matched to the organisms expected at any particular site or clinical circumstance. Examples of such routines are shown in Table 4–1. In general, it is not practical to routinely include specialized media for isolation of rare organisms, such as C diphtheriae or Legionella pneumophila. For detection of these and other uncommon organisms, the laboratory must be specifically informed of their possible presence by the physician. Appropriate media and special procedures can then be included. ++Table Graphic Jump LocationTABLE 4–1Routine Use of Gram Smear and Isolation Systems for Selected Clinical SpecimensaView Table| Favorite Table |Download (.pdf) TABLE 4–1 Routine Use of Gram Smear and Isolation Systems for Selected Clinical Specimensa SPECIMEN MEDIUM (INCUBATION) BLOOD CEREBROSPINAL FLUID WOUND, PUS GENITAL, CERVIX THROAT SPUTUM URINE STOOL Gram smear   × × ×   ×     Soybean–casein digest broth (CO2)a ×               Blood agar (CO2)   × ×   ×b × ×   Chocolate agar (CO2)   × × × PCR preferred   ×     Blood agar (anaerobic)     ×           MacConkey agar (air)     ×     × × × Hektoen agar (air)               × Selenite F broth (air)               × Campylobacter agar (CO2, 42°C)c               × Martin–Lewis agar (CO2)       × PCR preferred         aThe added sensitivity of a nutrient broth is used only when contamination by normal flora is unlikely. Exact media and protocols may vary between laboratories.bAnaerobic incubation used to enhance hemolysis by β-hemolytic streptococci.cIncubation in a reduced oxygen atmosphere. ++ Designed to detect the most common organisms +++ Identification ++ When growth is detected in any medium, the process of identification begins. Identification involves methods for obtaining pure cultures from single colonies, followed by tests designed to characterize and identify the isolate. The exact tests and their sequences vary with different groups of organisms, and the taxonomic level (genus, species, subspecies, etc.) of identification needed varies according to the medical usefulness of the information. In some cases, only a general description or the exclusion of particular organisms is important. For example, a report of “mixed oral flora” in a sputum specimen or “No Salmonella, Shigella, or Campylobacter isolated” in a fecal specimen may provide all the information needed. MALDI-TOF (matrix-assisted laser desorption ionization-time of flight) mass spectroscopy has become the foremost tool used for the rapid identification of microorganisms already isolated in pure culture and has reduced time to identification and reporting substantially from a day or more to minutes. Although a major advance, MALDI-TOF complements but does not replace fully the need for traditional methods. The scope and accuracy of MALDI-TOF depend on the quality of the data based used for comparisons. As depicted schematically in Figure 4-6, ionized microorganisms are separated by mass-charge-ratio (effectively by molecular weight), collide under vacuum with an ion detector, and thereby generate a mass spectrum for comparative analysis. The net result is rapid identification of bacterial or fungal, especially yeasts, isolates. ++ FIGURE 4–6. MALDI-TOF mass spectrometer. As depicted schematically, ionized microorganisms are separated by mass-charge-ratio (effectively by molecular weight), collide under vacuum with an ion detector, and thereby generate a mass spectrum for comparative analysis. (From Patel R. Matrix-assisted laser desorption ionization-time of flight mass spectrometry in clinical microbiology. Clin Infect Dis. 2013 Aug;57(4):564–72; used with permission of Mayo Foundation for Medical Education and Research, all rights reserved.) Graphic Jump LocationView Full Size| Favorite Figure |Download Slide (.ppt) ++ Extent of identification is linked to medical relevance +++ Features Used to Classify Bacteria and Fungi +++ Cultural Characteristics ++ Cultural characteristics include the demonstration of properties such as unique nutritional requirements, pigment production, and the ability to grow in the presence of certain substances (sodium chloride, bile) or on certain media (MacConkey, nutrient agar). Demonstration of the ability to grow at a particular temperature or to cause hemolysis on blood agar plates is also used. For fungi, growth as a yeast colony or a mold is the primary separator. For molds, the morphology of the mold structures (hyphae, conidia, etc.) is the primary means of identification. ++ Growth under various conditions +++ Biochemical Characteristics ++ Traditionally, the ability to attack various substrates or to produce particular metabolic products has broad application to the identification of bacteria and yeast. The most common properties examined are listed in Appendix 4–3. Biochemical and cultural tests for bacterial identification are analyzed by reference to tables that show the reaction patterns characteristic of individual species. In fact, advances in computer analysis have now been applied to identification of many bacterial and fungal groups. These systems use the same biochemical principles together with computerized databases to determine the most probable identification from the observed test pattern. In many laboratories, MALDI-TOF mass spectrometry has replaced these biochemical approaches except for a few rapid colorimetric spot tests, eg, indole and PYR. When identification of bacteria remains elusive after biochemical and MALDI-TOF have been attempted, the isolates usually are sent to reference laboratories for 16S rRNA or other sequencing methods if the clinical importance warrants. ++Table Graphic Jump LocationAPPENDIX 4–3Common Biochemical Tests for Microbial IdentificationView Table| Favorite Table |Download (.pdf) APPENDIX 4–3 Common Biochemical Tests for Microbial Identification Carbohydrate breakdown. The ability to produce acidic metabolic products, fermentatively or oxidatively, from a range of carbohydrates (eg, glucose, sucrose, and lactose) has been applied to the identification of most groups of bacteria. Such tests are crude and imperfect in defining mechanisms, but have proved useful for taxonomic purposes. More recently, gas chromatographic identification of specific short-chain fatty acids produced by fermentation of glucose has proved useful in classifying many anaerobic bacteria. Catalase production. The enzyme catalase catalyzes the conversion of hydrogen peroxide to water and oxygen. When a colony is placed in hydrogen peroxide, liberation of oxygen as gas bubbles can be seen. The test is particularly useful in differentiation of staphylococci (positive) from streptococci (negative), but also has taxonomic application to Gram-negative bacteria. Citrate utilization. An agar medium that contains sodium citrate as the sole carbon source may be used to determine ability to use citrate. Bacteria that grow on this medium are termed citrate-positive. Coagulase. The enzyme coagulase acts with a plasma factor to convert fibrinogen to a fibrin clot. It is used to differentiate Staphylococcus aureus from other, less pathogenic staphylococci. Decarboxylases and deaminases. The decarboxylation or deamination of the amino acids lysine, ornithine, and arginine is detected by the effect of the amino products on the pH of the reaction mixture or by the formation of colored products. These tests are used primarily with Gram-negative rods. Hydrogen sulfide. The ability of some bacteria to produce H2S from amino acids or other sulfur-containing compounds is helpful in taxonomic classification. The black color of the sulfide salts formed with heavy metals such as iron is the usual means of detection. Indole. The indole reaction tests the ability of the organism to produce indole, a benzopyrrole, from tryptophan. Indole is detected by the formation of a red dye after addition of a benzaldehyde reagent. A spot test can be done in seconds using isolated colonies. Nitrate reduction. Bacteria may reduce nitrates by several mechanisms. This ability is demonstrated by detection of the nitrites and/or nitrogen gas formed in the process. O-Nitrophenyl-β-D-galactoside (ONPG) breakdown. The ONPG test is related to lactose fermentation. Organisms that possess the β-galactoside necessary for lactose fermentation but lack a permease necessary for lactose to enter the cell are ONPG-positive and lactose-negative. Oxidase production. The oxidase tests detect the c component of the cytochrome–oxidase complex. The reagents used change from clear to colored when converted from the reduced to the oxidized state. The oxidase reaction is commonly demonstrated in a spot test, which can be done quickly from isolated colonies. Proteinase production. Proteolytic activity is detected by growing the organism in the presence of substrates, such as gelatin or coagulated egg. Pyrrolidonyl arylamidase activity (PYR test) is a rapid colorimetric test for preliminary identification and screening of certain Gram-positive bacteria (eg, group A streptococci, enterococci, and Staphylococcus lugdenensis). A positive PYR test is color change from pink to red. Urease production. Urease hydrolyzes urea to yield two molecules of ammonia and one of CO2. This reaction can be detected by the increase in medium pH caused by ammonia production. Urease-positive species vary in the amount of enzyme produced; bacteria can thus be designated as positive, weakly positive, or negative. Voges–Proskauer test. The Voges–Proskauer test detects acetylmethylcarbinol (acetoin), an intermediate product in the butene glycol pathway of glucose fermentation. ++ Biochemical reactions give identification probability Toxin production and pathogenicity ++ Molecular assays have been developed for some toxins (eg, Clostridiodes difficile as an alternative to enzyme immunoassay [EIA]) for use in the clinical laboratory. Neutralization of a toxic effect in a test animal with specific antitoxin is the method used to confirm the identity of Clostidium botulinum (Chapter 29) toxin and is available only in public health reference laboratories. ++ Detection of specific toxin may define disease +++ Antigenic Structure ++ Viruses, bacteria, fungi, and parasites possess many antigens, such as capsular polysaccharides, surface proteins, and cell wall components. Serology involves the use of antibodies of known specificity to detect antigens present on whole organisms or free in extracts (soluble antigens). The methods used for demonstrating antigen–antibody reactions are discussed in Antibody Detection (Serology). ++ Antigenic structure demonstrated with antisera +++ Genomic Structure ++ Nucleic acid–sequence relatedness as determined by homology and direct sequence comparisons have become a primary determinant of taxonomic decisions. They are discussed later in the section on Methods of Nucleic Acid Analysis. +++ Isolation and Identification of Viruses +++ Cell and Organ Culture ++ Virtually no clinical microbiology laboratory still retains the capacity to do viral isolation by cell or organ culture. The classical techniques are done, if at all, in research or public health laboratories. The extensive repertoire of molecular assays now available for most human viral pathogens has far better sensitivity and specificity than traditional methods. The appropriate use of these NAATs in viral diagnosis is discussed for each virus in Chapters 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 in PART II, Pathogenic Viruses.

      culture is essential in clinical diagnostic laboratories, and etiologic diagnosis of common bacterial and fungal pathogens.

    1. 3 . B E H O L D I N G C H R I S T I N T H E O T H E RA N D I N T H E S E L FDeification in Benedict of Nursia and Gregory the Great

      READ THIS ONE

    Annotators

  10. Sep 2023
    1. Author Response

      Reviewer #1 (Public Review):

      In this very strong and interesting paper the authors present a convincing series of experiments that reveal molecular mechanism of neuronal cell type diversification in the nervous system of Drosophila. The authors show that a homeodomain transcription factor, Bsh, fulfills several critical functions - repressing an alternative fate and inducing downstream homeodomain transcription factors with whom Bsh may collaborate to induce L4 and L5 fates (the author's accompanying paper reveals how Bsh can induce two distinct fates). The authors make elegant use of powerful genetic tools and an arsenal of satisfying cell identity markers.

      Thanks!

      I believe that this is an important study because it provides some fundamental insights into the conservation of neuronal diversification programs. It is very satisfying to see that similar organizational principles apply in different organisms to generate cell type diversity. The authors should also be commended for contextualizing their work very well, giving a broad, scholarly background to the problem of neuronal cell type diversification.

      Thanks!

      My one suggestion for the authors is to perhaps address in the Discussion (or experimentally address if they wish) how they reconcile that Bsh is on the one hand: (a) continuously expressed in L4/L4, (b) binding directly to a cohort of terminal effectors that are also continuously expressed but then, on the other hand, is not required for their maintaining L4 fate? A few questions: Is Bsh only NOT required for maintaining Ap expression or is it also NOT required for maintaining other terminal markers of L4? The former could be easily explained - Bsh simply kicks of Ap, Ap then autoregulates, but Bsh and Ap then continuously activate terminal effector genes. The second scenario would require a little more complex mechanism: Bsh binding of targets (with Notch) may open chromatin, but then once that's done, Bsh is no longer needed and Ap alone can continue to express genes. I feel that the authors should be at least discussing this. The postmitotic Bsh removal experiment in which they only checked Ap and depression of other markers is a little unsatisfying without further discussion (or experiments, such as testing terminal L4 markers). I hasten to add that this comment does not take away from my overall appreciation for the depth and quality of the data and the importance of their conclusions.

      Great suggestions, we will discuss these two hypotheses as requested.

      Bsh initiates Ap expression in L4 neurons which then maintain Ap expression independently of Bsh expression, likely through Ap autoregulation. During the synaptogenesis window, Ap expression becomes independent from Bsh expression, but Bsh and Ap are both still required to activate the synapse recognition molecule DIP-beta. Additionally, Bsh also shows putative binding to other L4 identity genes, e.g., those required for neurotransmitter choice, and electrophysiological properties, suggesting Bsh may initiates L4 identity genes as a suite of genes. The mechanism of maintaining identity features (e.g., morphology, synaptic connectivity and functional properties) in the adult remains poorly understood. It is a great question whether primary HDTF Bsh maintains the expression of L4 identity genes in the adult. To test this, in our next project, we will specifically knock out Bsh in L4 neurons of the adult fly and examine the effect on L4 morphology, connectivity and function properties.

      Reviewer #2 (Public Review):

      Summary:

      In this paper, the authors explore the role of the Homeodomain Transcription Factor Bsh in the specification of Lamina neuronal types in the optic lobe of Drosophila. Using the framework of terminal selector genes and compelling data, they investigate whether the same factor that establishes early cell identity is responsible for the acquisition of terminal features of the neuron (i.e., cell connectivity and synaptogenesis).

      Thanks for the positive words!

      The authors convincingly describe the sequential expression and activity of Bsh, termed here as 'primary HDTF', and of Ap in L4 or Pdm3 in L5 as 'secondary HDTFs' during the specification of these two neurons. The study demonstrates the requirement of Bsh to activate either Ap and Pdm3, and therefore to generate the L4 and L5 fates. Moreover, the authors show that in the absence of Bsh, L4 and L5 fates are transformed into a L1 or L3-like fates.

      Thanks!

      Finally, the authors used DamID and Bsh:DamID to profile the open chromatin signature and the Bsh binding sites in L4 neurons at the synaptogenesis stage. This allows the identification of putative Bsh target genes in L4, many of which were also found to be upregulated in L4 in a previous single-cell transcriptomic analysis. Among these genes, the paper focuses on Dip-β, a known regulator of L4 connectivity. They demonstrate that both Bsh and Ap are required for Dip-β, forming a feed-forward loop. Indeed, the loss of Bsh causes abnormal L4 synaptogenesis and therefore defects in several visual behaviors. The authors also propose the intriguing hypothesis that the expression of Bsh expanded the diversity of Lamina neurons from a 3 cell-type state to the current 5 cell-type state in the optic lobe.

      Thanks for the excellent summary of our findings!

      Strengths:

      Overall, this work presents a beautiful practical example of the framework of terminal selectors: Bsh acts hierarchically with Ap or Pdm3 to establish the L4 or L5 cell fates and, at least in L4, participates in the expression of terminal features of the neuron (i.e., synaptogenesis through Dip-β regulation).

      Thanks!

      The hierarchical interactions among Bsh and the activation of Ap and Pdm3 expression in L4 and L5, respectively, are well established experimentally. Using different genetic drivers, the authors show a window of competence during L4 neuron specification during which Bsh activates Ap expression. Later, as the neuron matures, Ap becomes independent of Bsh. This allows the authors to propose a coherent and well-supported model in which Bsh acts as a 'primary' selector that activates the expression of L4-specific (Ap) and L5-specific (Pdm3) 'secondary' selector genes, that together establish neuronal fate.

      Thanks again!

      Importantly, the authors describe a striking cell fate change when Bsh is knocked down from L4/L5 progenitor cells. In such cases, L1 and L3 neurons are generated at the expense of L4 and L5. The paper demonstrates that Bsh in L4/L5 represses Zfh1, which in turn acts as the primary selector for L1/L3 fates. These results point to a model where the acquisition of Bsh during evolution might have provided the grounds for the generation of new cell types, L4 and L5, expanding lamina neuronal diversity for a more refined visual behaviors in flies. This is an intriguing and novel hypothesis that should be tested from an evo-devo standpoint, for instance by identifying a species when L4 and L5 do not exist and/or Bsh is not expressed in L neurons.

      Thanks for the appreciation of our findings!

      To gain insight into how Bsh regulates neuronal fate and terminal features, the authors have profiled the open chromatin landscape and Bsh binding sites in L4 neurons at mid-pupation using the DamID technique. The paper describes a number of genes that have Bsh binding peaks in their regulatory regions and that are differentially expressed in L4 neurons, based on available scRNAseq data. Although the manuscript does not explore this candidate list in depth, many of these genes belong to classes that might explain terminal features of L4 neurons, such as neurotransmitter identity, neuropeptides or cytoskeletal regulators. Interestingly, one of these upregulated genes with a Bsh peak is Dip-β, an immunoglobulin superfamily protein that has been described by previous work from the author's lab to be relevant to establish L4 proper connectivity. This work proves that Bsh and Ap work in a feed-forward loop to regulate Dip-β expression, and therefore to establish normal L4 synapses. Furthermore, Bsh loss of function in L4 causes impairs visual behaviors.

      Thanks for the excellent summary of our findings.

      Weaknesses:

      ● The last paragraph of the introduction is written using rhetorical questions and does not read well. I suggest rewriting it in a more conventional direct style to improve readability.

      We agree, and will update the text as suggested.

      ● A significant concern is the way in which information is conveyed in the Figures. Throughout the paper, understanding of the experimental results is hindered by the lack of information in the Figure headers. Specifically, the genetic driver used for each panel should be adequately noted, together with the age of the brain and the experimental condition. For example, R27G05-Gal4 drives early expression in LPCs and L4/L5, while the 31C06-AD, 34G07-DBD Split-Gal4 combination drives expression in older L4 neurons, and the use of one or the other to drive Bsh-KD has dramatic differences in Ap expression. The indication of the driver used in each panel will facilitate the reader's grasp of the experimental results.

      We agree, and will update the figure annotation.

      ● Bsh role in L4/L5 cell fate:

      o It is not clear whether Tll+/Bsh+ LPCs are the precursors of L4/L5. Morphologically, these cells sit very close to L5, but are much more distant from L4.

      Our current data show L4 and L5 neurons are generated by different LPCs. However, currently we don’t have tools to demonstrate which subset of LPCs generate which lamina neuron type. We are currently working on a followup manuscript on LPC heterogeneity, but those experiments have just barely been started.

      o Somatic CRISPR knockout of Bsh seems to have a weaker phenotype than the knockdown using RNAi. However, in several experiments down the line, the authors use CRISPR-KO rather than RNAi to knock down Bsh activity: it should be explained why the authors made this decision. Alternatively, a null mutant could be used to consolidate the loss of function phenotype, although this is not strictly necessary given that the RNAi is highly efficient and almost completely abolishes Bsh protein.

      The reason we chose CRISPR-KO (L4-specific Gal4, uas-Cas9, and uas-Bsh-sgRNAs) is that it effectively removed Bsh expression from majority of L4 neurons. However, it failed to knock down Bsh in L4 neurons using L4-split Gal4 and Bsh-RNAi because L4-split Gal4 expression depends on Bsh. We will include this explanation in the text.

      o Line 102: Rephrase "R27G05-Gal4 is expressed in all LPCs and turned off in lamina neurons" to "is turned off as lamina neurons mature", as it is kept on for a significant amount of time after the neurons have already been specified.

      Thanks; we will make that change.

      o Line 121: "(a) that all known lamina neuron markers become independent of Bsh regulation in neurons" is not an accurate statement, as the markers tested were not shown to be dependent on Bsh in the first place.

      Good point. We will rephrase it as “that all known lamina neuron markers are independent of Bsh regulation in neurons”.

      o Lines 129-134: Make explicit that the LPC-Gal4 was used in this experiment. This is especially important here, as these results are opposite to the Bsh Loss of Function in L4 neurons described in the previous section. This will help clarify the window of competence in which Bsh establishes L4/L5 neuronal identities through ap/pdm3 expression.

      Thanks! We will include Gal4 information in the text for every manipulation.

      ● DamID and Bsh binding profile:

      ○ Figure 5 - figure supplement 1C-E: The genotype of the Control in (C) has to be described within the panel. As it is, it can be confused with a wild type brain, when it is in fact a Bsh-KO mutant.

      Great point! Thank you for catching this and we will update it.

      ○ It Is not clear how L4-specific Differentially Expressed Genes were found. Are these genes DEG between Lamina neurons types, or are they upregulated genes with respect to all neuronal clusters? If the latter is the case, it could explain the discrepancy between scRNAseq DEGs and Bsh peaks in L4 neurons.

      We did not use “L4-specific Differentially Expressed Genes”. Instead, we used all genes that are significantly transcribed in L4 neurons (line 209-210).

      ● Dip-β regulation:

      ○ Line 234: It is not clear why CRISPR KO is used in this case, when Bsh-RNAi presents a stronger phenotype.

      As we explained it above, the reason we chose CRISPR-KO (L4-specific Gal4, uas-Cas9, and uas-Bsh-sgRNAs) is that it effectively removed Bsh expression from majority of L4 neurons. However, it failed to knock down Bsh in L4 neurons using L4-split Gal4 and Bsh-RNAi because L4-split Gal4 expression depends on Bsh. We’ll include this explanation in the text.

      ○ Figure 6N-R shows results using LPC-Gal4. It is not clear why this driver was used, as it makes a less accurate comparison with the other panels in the figure, which use L4-Split-Gal4. This discrepancy should be acknowledged and explained, or the experiment repeated with L4-Split-Gal4>Ap-RNAi.

      I think you mean 6J-M shows results using LPC-Gal4. We first tried L4-Split-Gal4>Ap-RNAi but it failed to knock down Ap because L4-Split-Gal4 expression depends on Ap. We will add this to the text.

      ○ Line 271: It is also possible that L4 activity is dispensable for motion detection and only L5 is required.

      Thanks! Work from Tuthill et al, 2013 showed that L5 is not required for any motion detection. We will include this citation in the text.

      ● Discussion: It is necessary to de-emphasize the relevance of HDTFs, or at least acknowledge that other, non-homeodomain TFs, can act as selector genes to determine neuronal identity. By restricting the discussion to HDTFs, it is not mentioned that other classes of TFs could follow the same Primary-Secondary selector activation logic.

      That is a great point, thank you! We will include this in the discussion.

    1. Worf, Benjamin L. 1956. “Language, Thought, and Reality: Selected Writings of Benjamin Lee Whorf.” Edited by John B. Carrol. Boston: Technology Press of MIT.

      check citation. Why is it in quotation marks? Edited ... shouldn't be in italics. remove place of publication.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Point-by-point response to reviewers, including our plans for the revision:

      ­­­Review____er #1 (Evidence, reproducibility and clarity (Required)):

      * Summary: In this manuscript by the Sanson group, Lye and colleagues try to definitively answer the question of whether pulling forces from the ventral mesoderm have significant effects on convergent extension in the Drosophila germband (germband extension). While germband extension does occur in mutant embryos lacking mesoderm invagination, it has long been an open question in the field as to whether ventral pulling forces from the mesoderm have significant effects (positive or negative) on cell intercalation during germband extension. To definitely address this question, Lye and colleagues generated high-quality, directly comparable datasets from wild-type and twist mutant embryos, and then systematically assessed nearly all aspects of cell intercalation, myosin recruitment, and tissue elongation over time. They demonstrate that pulling forces from the ventral mesoderm have negligible impacts on the course of germband extension. While there are indeed some interesting differences between wild-type and twist embryos with respect to cell intercalation and myosin recruitment, such differences are relatively minor. They conclude that the events of germband extension neither require nor are strongly affected by external forces from the mesoderm. While this is largely a negative results paper, I believe that it should be published and that it will be an impactful paper within the field. Namely, it will settle once and for all the question of whether mesoderm invagination is required for optimal germband extension in the early Drosophila embryo, and it suggests that tissues are largely autonomous developmental units that are buffered from outside mechanical inputs.*

      • * *Major comments: *

      * It seems to me that the one obvious omission from this paper is a general measure of convergent extension over time. I think it would be useful to the reader to include some measure of change in tissue aspect ratio over time between wild-type and twist embryos. This could be included in Figure 5 or 6. *

      • *

      We are happy to include a graph with what we call “tissue strain rate”, which measures the deformation of the germ-band in the direction of extension (along AP) over time, and propose to add it as a panel in Supplementary Figure 6. Note that in our measures, the “tissue” strain rate is decomposed into contributions from two cell behaviors, the “cell intercalation” strain rate and the “cell shape” strain rate (Blanchard et al., 2009). “Tissue” and “cell shape” strain rate are directly measured, and “cell intercalation” strain rate is what remains when “cell shape” strain rate is removed from “tissue” strain rate. The “cell intercalation” strain rate calculated in that way is a “continuous” measure of cell intercalation, measuring the progressive shearing of cells during convergent extension. We also use a “discrete” measure of cell intercalation, which measures the number of cell neighbor exchanges, also called T1 swaps. We found that both “continuous” and “discrete” measures of cell intercalation are unchanged in twist mutant compared to wild-type embryos (Fig. 6F and 6E, respectively). In contrast, we find that the “cell shape” strain rate is increased in twist mutants (Fig. 5B and Fig. 5S1A). Consistent with this finding, the “tissue” strain rate is also increased in twist mutants (see graph below).

      Otherwise, I have no major comments on the experimental approach or the findings of this manuscript. It seems to me a straightforward and systematic approach for determining whether mesoderm invagination affects germband extension. I do have several minor comments that should be addressed prior to publication (below).

      *Minor comments: *

      *I understand why cells would initially stretch more along the DV axis in wild-type embryos compared with twist embryos, but why do cells become so much more stretched along the AP axis (and become smaller apically) after 10 minutes of GBE in wild type compared with twist (Figure 2C and E). *

      *I think this is an interesting and non-intuitive result that would warrant a bit of explanation/conjecture. *

      This is not what Fig. 2C and E show, and we realize now that our schematics on the graphs might have been confusing. We will work on those to improve their clarity (or remove them), and also review our text.

      Figure 2C shows how cells deform along DV (cell shape strain rate projected onto the DV axis). So the graph does not show that the cells are elongating in AP, as only the DV component of the strain rate is shown in this figure. In the wild type, the DV strain rate is positive (the cells are elongating in DV) at developmental times when the mesoderm invaginate (from about -10 minutes to until 7.5 minutes). The DV strain shows an acceleration until about 5 mins, then decelerates, crossing the x-axis to become negative at 7.5 minutes. From this timepoint and until the end of GBE, the DV strain rate is negative (the cells are contracting along DV). Mirroring the positive section of the curve, the DV contraction of the cells accelerate until about 12 mins and then slows down. The strong rate of DV contraction between 7.5 and 20 mins could in part be due to the endoderm invagination pulling in the orthogonal direction (AP) and helping the cells regaining a more isotropic shape. We could add a mention about this in the discussion.

      In Figure 2E, the rate of change in cell area follows a similar time course in the wild type, showing that the cells are increasing their areas until about 10 mins (positive values) and then reduce their areas again until the end of GBE (negative values). Note that the graph does not show raw (instantaneous) cell areas as suggested by the comment, but rather a rate of change.

      So in wild type, the cells get stretched by the invaginating mesoderm, and once the mesoderm is not pulling anymore, the cells appear to relax back. As there is no stretching in twist mutants, there is no equivalent relaxation of the cells along DV. Note that in twist, there is a milder increase in cell area in the first 15 mins of GBE (Fig. 2E). This could again be caused by the pull from endoderm invagination stretching the cells along AP, which, as we have shown before, increases both cell shape strain rates along AP and cell areas (Butler et al., 2009). So the pull from endoderm invagination (along AP) will have an impact on cell area rates of change and possibly also, indirectly, on DV cell shape strain rates, in both twist and wild type embryos, during most of GBE. Therefore cell area and DV cell shape strain rates are affected by more than one process during GBE. In this paper, we are focusing on the impact of mesoderm invagination, which happens around the start of GBE, so have focused our analysis of the graphs in the results section to this period, and the differences between wildtype and *twist. *

      *I don't understand how you are defining cell orientation in Figure 2G. How are you choosing the cell axis that you are then comparing with the body axis? Is it the long axis, or something more complicated than that? I think you should briefly provide this information in the results section. If it is included in the methods, I wasn't able to locate it. *

      Yes, it is the orientation of the long axis of the cell relative to the antero-posterior embryonic axis. We will clarify this in the text, in particular in the Methods, and also try improve our schematics.

      Figure 2: Since you have the space, it might help the reader if you simply wrote out "strain rate" for panels B, D, and F, rather that used the abbreviation "SR." Thank you for this suggestion, we will reduce use of abbreviations where space permits.

      *Please ensure that all axis labels are fully visible in the final figures. In several figures, the Y-axis labels were cut off (e.g., Fig 2I, 4A, 4D, 6B, 6C). *

      These were visible to us in our submitted version, but of course we will ensure everything is visible on the final version.

      *Where space permits, I would suggest using fewer abbreviations in axis labels to increase readability of the figures (e.g., in Figures 3H or 4D). *

      Thank you for this suggestion, will do.

      * In Figure 7, I would move the wild-type panels to the left and the twist panels to the right. I think it is more conventional to describe the normal wild-type scenarios first, and then contrast the mutant state.*

      Will do.

      To be consistent with the literature, "wildtype" should be hyphenated (wild-type) when used as an adjective, or two separate words (wild type) when used as a noun. Thank you, we will change this.

      Review*er #1 (Significance (Required)): *

      * Advance: The advances in this manuscript are largely methodological, but the experiments and analyses are quite rigorous and allow the authors to make strong conclusions concerning their hypotheses. Their findings are based on a high-quality collection of movies from control and twist mutant embryos expressing a cell membrane marker and knock-in GFP-tagged myosin. Importantly, I think the researchers were correct in choosing to analyze twist single-mutant embryos (as opposed to snail or twist, snail double-mutant embryos), as the overall embryo geometry of these mutants is fairly similar to wild-type embryos, allowing the researchers to directly compare cell behaviors and myosin dynamics during germband extension. This approach also allows them to avoid indirect effects on the germband due to a completely non-internalized mesoderm. *

      *

      Audience: The primary audience for this article will be basic science researchers working in the early Drosophila embryo who are interested in the interplay between the germband and neighboring tissues. Secondary audiences will include developmental biologists more broadly who are interested in biomechanical coupling (or in this case decoupling) of neighboring tissues. *

      *

      Describe your expertise: I have been a Drosophila developmental geneticist for over twenty years, and I have been working directly on Drosophila germband extension for over a decade. I have published numerous papers and reviews in this field, and I am very familiar with the genetic backgrounds and types of experimental analyses used in this manuscript. Therefore, I believe I am highly qualified to serve as a reviewer for this manuscript.*

      ­­

      Review____er #2 (Evidence, reproducibility and clarity (Required)):

      *

      In the present manuscript, Lye et al. describe a highly detailed quantification of cell shape changes during germband extension in Drosophila melanogaster early embryo. During this process, ectodermal tissue contracts along the dorso-ventral axis, simultaneously expanding along the perpendicular antero-posterior direction, migrating from the ventral to the dorsal surface of the embryo as it extends. This important morphogenetic event is preceded by ventral furrow formation when mesodermal tissue (located in the ventral part of the embryo) contracts along the dorso-ventral axis and invaginates into the embryonic interior. The study compares cell shape dynamics in the wildtype Drosophila with that in the twist mutant, which largely lacks mesoderm and does not form ventral furrow. The major motivation of the study is to examine whether cellular behaviors and myosin recruitment in the ectoderm is cell autonomous, or if those cellular behaviors depend on mechanical interactions between mesoderm and ectoderm.*

      • The authors first examine whether transcriptional patterning of key genes involved in germband extension is different between the wildtype and the twist mutant and find no significant difference. Next, the authors thoroughly quantify cellular behaviors and patterns of myosin recruitment in the two genetic backgrounds. A number of different measures are investigated, notably the rate of change in the degree of cellular asymmetry, rate of cell area change, rate of change of cell orientation, differences in myosin recruitment to cell edges of various orientation, as well as the rates of growth, shrinkage, and re-orientation of the various cellular interfaces. It is thoroughly documented how these quantities change as a function of developmental timing and spatial position within the embryo. These data serve basis for quantitative comparison between cellular dynamics in the two genetic backgrounds considered.*

      • Overall, the study shows that cellular behaviors observed in the ectoderm are largely the same during the period of time following ventral furrow formation, as would be expected if those cellular behaviors were predominantly cell autonomous and not dependent on stresses generated in the mesoderm.*

      • The data presented in the manuscript are of excellent quality and presentation is very clear.

      Minor comments: none *

      * Reviewer #2 (Significance (Required)): *

      * I find that the study provides a thorough quantification of cell behaviors in a widely studied important model of morphogenesis. The work may be of particular interest for future model-to-data comparison, perhaps providing a basis for future modeling work. I therefore certainly think that this work warrants publication.*

      • However, the results of the study largely parallel previous findings and do not appear novel or surprising. It is well established that in snail mutant that lack mesoderm entirely, germband extension proceeds largely normally. This well-established fact suggests that since tissue dynamics in complete absence of mesoderm are largely unaffected, behaviors of individual cells are likely to not be affected either*.

      *The work is pretty much entirely observational, and for most part provides a more detailed documentation/quantification of previous findings. I do not think it is appropriate for high profile publication. *

      We are not sure which evidence the reviewer is referring to here specifically. We agree that the single mutants twist or snail, or the double twist snail mutants do extend their germ-band. However, the question we are asking here, is how well do they extend their germband and to answer this question, quantitation is needed. The first quantitation of GBE were performed by (Irvine and Wieschaus, 1994). While they quantified GBE in various mutant contexts, they did not perform quantitation for snail, twist, or twist snail mutants. Instead, they refer to these mutants once in p839, with the following sentence: Additionally, twist and snail mutant embryos, which lack mesoderm, extend their germbands almost normally (Leptin and Grunewald, 1990; Simpson, 1983)*.” *

      Following these earlier qualitative observations, various studies have quantified different aspects of GBE in mesoderm invagination mutants, with contradictory results. For example, some studies, including from our own lab, report a reduction in cell intercalation in the absence of mesoderm invagination (Butler et al., 2009; Wang et al., 2020), but there have also been reports that tissue extension and T1-transistions occur normally (Farrell et al., 2017)(see also introduction of our manuscript). These contradictory results have motivated our present study, and we have implemented rigorous comparison between wild type and mesoderm invagination mutants, being careful i) to check that the regions analyzed were comparable in terms of cell fate, and ii) to control for any confounding effects between experiments (see also response to reviewer 4, main question 2). We have also considered which mesoderm invagination mutants to use. We rejected snail or twist snail mutants because the absence of snail means that the mesodermal cells do not contract and thus stay at the surface of the embryo, which changes the spatial configuration of the embryo considerably and would make a fair quantitative comparison very difficult. Instead, we decided to use twist mutants, as in those, cell contractions still happen so the cells do not take as much space at the surface of the embryo, but the contractions are uncoordinated which means that there is no invagination (and we demonstrate here, no significant pulling on the ectoderm). We note that reviewer 1 highlights the merit of settling the question of the impact of mesoderm invagination on GBE and the pertinence of choosing twist mutants versus the alternatives (see also response to reviewer 4, suggestion 1).

      ­­

      __Review____er #3 (Evidence, reproducibility and clarity (Required)): __

      During morphogenesis, the final shape of the tissue is not only dictated by mechanical forces generated within the tissue but can also be impacted by mechanical contributions from surrounding tissues. The way and extent to which tissue deformation is influenced by tissue-extrinsic forces are not well understood. In this work, Lye et al. investigated the potential influence of Drosophila mesoderm invagination on germband extension (GBE), an epithelial convergent extension process occurring during gastrulation. Drosophila GBE is genetically controlled by the AP patterning system, which determines planar polarized enrichment of non-muscle myosin II along the DV-oriented adherens junctions. Myosin contractions drive shrinking of DV-oriented junctions into 4-way vertices, followed by formation of new, AP-oriented junctions. This process results in cell intercalation, which causes tissue convergence along the DV-axis and extension along the AP-axis. In addition, GBE is facilitated by tissue-extrinsic pulling forces produced by invagination of the posterior endoderm. Interestingly, some recent studies suggest that the invagination of the mesoderm, which occurs immediately prior to GBE, also facilitates GBE. In the proposed mechanism, invaginating mesoderm pulls on the germband tissue along the DV-axis; the resulting strain of the germband cells generates a mechanotransduction effect that promotes myosin II recruitment to the DV-oriented junctions, thereby facilitating cell intercalation. Here, the authors revisited this proposed mechanotransduction effect using quantitative live imaging approaches. By comparing the wildtype embryos with twist mutants that fail to undergo mesoderm invagination, the authors show that although the DV-oriented strain of the germband cells was greatly reduced in the absence of mesoderm pulling, this defect had a negligible impact on junctional myosin density, myosin planar polarity, the rate of junction shrinkage or the rate of cell intercalation during GBE. A mild increase in the rate of new junction extension and a slight defect in cell orientation were observed in twist mutants, but these differences did not cause obvious defects in cell intercalation. The authors conclude that myosin II-mediated cell intercalation during GBE is robust to the extrinsic mechanical forces generated by mesoderm pulling.

      • * *Overall, I found that the results described here are very interesting and of high quality. The data acquisition and analyses were elegantly performed, statistics were appropriately used, and the manuscript was clearly written. However, there are a few points where some further explanation or clarification is necessary, as detailed below: *

      • The main conclusion of the manuscript relies on appropriate quantification of myosin intensity at cell junctions. It is therefore important that the methods of quantification are well justified. Below are a few questions regarding the methods used in the analyses:*
      • -For myosin quantification, the authors state that "Background signal was subtracted by setting pixels of intensities up to 5 percentile set to zero for each timepoint" [Line826]. The rationale for selecting 5 percentile as the threshold for background should be explained. Also, how does this background value change over time? *

      • *

      For our normalization method, we stretched the intensity histogram of images to use the full dynamic range for quantification and enable meaningful comparison of intensities between different movies. The 5th percentile was chosen to set to zero intensity as this removed background signal without removing any structured Myosin signal (i.e., non-uniform, low level fluorescence - this was assessed by eye). We will provide some before and after normalization images at different timepoints to illustrate this (See reviewer 3, minor point 4 below). Since the cytoplasmic signal is uniform, it is difficult to discern from true ‘background’, therefore some cytoplasmic signal might be set to zero with this method, but all medial and junctional Myosin structures will still be visible and have none-zero intensity values. However, since cytoplasm takes up a large majority of pixels in the image, and we only set 5% of pixels to zero, the majority of the cytoplasm will have non-zero pixel values. ‘Background’ changes increases slightly as Myosin II levels increase in general over time, as expected from the embryo accumulating Myosin II as they develop.

      -The authors mention that "Intensities varied slightly between experiments due to differences in laser intensity and therefore histograms of pixel intensities were stretched" [Line828]. The method of intensity justification should be justified. For example, does this normalization result in similar cytoplasmic myosin intensity between control and twist mutant embryos?

      • *

      As stated above, we stretched the intensity histogram of images to enable meaningful comparison of intensities between different movies, as stretching the histograms would bring Myosin II structures of similar intensities into the same pixel value range. We chose to stretch histograms using a reference timepoint (30 minutes, the latest timepoint analyzed), rather than on a per timepoint basis, because we saw a general increase in Myosin II over time, and we wanted to ensure that this increase was preserved in our analysis.

      • *

      Note that we quantify Myosin from 2 µm above to 2 µm below the level of the adherens junctions (see Methods), not throughout the entire cell, and therefore we have no true measure of cytoplasmic Myosin. However, we can plot non-membrane Myosin from this same apicobasal position in the cell. Non-membrane Myosin will include both the cytoplasmic signal and the Myosin II medial web (see above). When plotting these, we find that Myosin II intensities in this pool are similar in wildtype and twist (see graph below, dotted lines show standard deviations), confirming that that we are not inappropriately brightening one set of images compared to the other (e.g., twist versus wildtype).

      Finally, our observations of rate of junction shrinkage and intercalation are consistent with our Myosin II quantification results (see Figures 4A, 4D and 6F). This further validates our methods.

      • *

      • *

      - A previous study demonstrates that the accumulation of junctional myosin is substantially reduced in twist mutant embryos compared to the wild type (Gustafson et al., 2022). In that work, junctional myosin was quantified as (I_junction - I_cytoplasm)/I_cytoplasm. In contrast, the cytoplasmic myosin intensity does not appear to be subtracted from the quantification in this study. How much of the difference in the conclusions of the two studies can be explained by this difference in myosin quantification?

              As explained above, we choose to normalize our data by stretching histograms, rather than subtracting and dividing intensities between different pools of Myosin. The setting pixels of intensities up to 5 percentiles set to zero for each will have a similar effect to subtracting a small fraction of the cytoplasmic pool. We note that the intensity measurements in (Gustafson et al., 2022) are in the apical-top 5µm of the cell, and therefore their ‘cytoplasmic’ signal is likely to also include the apical medial web of Myosin. Also, after subtraction they use division by the cytoplasmic intensity in an attempt to bring pixel intensities between different movies into a comparable range, whereas we do this by stretching the histograms themselves (see above).  We carefully designed our method to preserve the increase in Myosin levels that we see over time in our post-normalization data. This is something that their method of normalization would not be predicted to capture, if their ‘cytoplasmic’ signal increase over time as well as their junctional signal.  Indeed, in FigS6D of their paper, Myosin II levels do not appear to increase over time in these (presumably normalized) images.
      

      Additionally, we note that in (Gustafson et al., 2022), not all Myosin II is fluorescently tagged since they use a sqhGFP transgene located on the balancer chromosome. This means that the line they use will have a pool of exogeneous Myosin tagged with GFP (expressed from the CyO balancer) and a pool of endogenous Myosin (expressed from the sqh gene on the X chromosome. It is not known whether endogenous and exogeneous GFP-tagged Myosin II will be recruited equally to cell junctions when in competition with each other. Therefore, in their genetic background, the ratio of junctional/cytoplasmic sqhGFP might not reflect the true ratio. To avoid this potential caveat, in our study we have used a new knock-in of Myosin, which tags the sqh gene at the endogenous locus (Proag et al., 2019). The line is homozygous viable and thus all the molecules of Myosin II Regulatory Light Chain (encoded by sqh), and thus the Myosin II mini-filaments, are labelled with GFP.

      Additionally, we note that when comparing their images of Myosin II in wildtype and twist (Figure 5D and D’), the overall Myosin signal appears reduced in twist mutants (including in the head and posterior midgut, which is outside the area that they are claiming Myosin II is recruited in response to mesoderm invagination). This suggests that Myosin II is generally reduced in their twist mutants (or images thereof), which is not expected and might indicate issues with their methods.

      Therefore differences in the methods may explain the discrepancies between studies. Importantly, we have quantified junctional shrinkage rates and intercalation, and our analysis of these rates is consistent with our Myosin II quantification results (see above).

      -The authors used the tissue flow data to register the myosin channel and the membrane channel, which were acquired at slightly different times. The accuracy of this channel registration should be demonstrated.

      As stated in our methods: “the channel registration was corrected post-acquisition in order that information on the position of interfaces in the Gap43 channel could be used to locate them in the Myosin channel. Therefore the local flow of cell centroids between successive pairs of time frames in the Gap43 channel is used to give each interface/vertex pixel a predicted flow between frames. A fraction of this flow is applied, equal to the Myosin II to Gap43 channel time offset, divided by the frame interval. Because cells deform as well as flow, the focal cell’s cell shape strain rate is also applied, in the same fractional manner as above.”

      The images in Figure 3C and C’ show the Myosin II, with quantified membrane Myosin superimposed on the image as a color-code. Images in Figure 3B and B’ show the (normalized) Myosin II. Comparison of these images demonstrates that the channel registration is accurate. We will add a reference to these images in the methods.

      • The authors show that cell intercalation is not influenced in twist mutant embryos. However, a previous study demonstrates that the speed of GBE is substantially reduced in twist mutants (Gustafson et al., 2022). It would be interesting to see whether a similar reduction in the speed of GBE was observed in this study. *

      We do not see a reduction in the speed of GBE as reported by (Gustafson et al., 2022), we will add “tissue strain rate” graphs to demonstrate this. On the contrary, we find a slight increase in the “tissue strain rate”, because there is a slight increase in the “cell shape strain rate” contributing to extension (while “cell intercalation strain rate” is unchanged). See also response to Reviewer 1 (major comment) .

      • It has been previously shown that contractions of medioapical myosin in germband cells also contribute to cell intercalation. The authors should explain why medioapical myosin was not included in the comparison between wildtype and twist mutant embryos. *

      • *

      Indeed, it has been shown that there is a flow of medial Myosin towards the junctions (Rauzi et al., 2010). However, and as described in that paper, this flow ‘feeds’ the enrichment of Myosin II at shrinking junctions, and thus the junctional Myosin II can be taken as a readout of polarized Myosin II behavior. Additionally, medial flows are more technically challenging to quantify, especially when quantification is required in a large number of cells as is the case for our study.

      Importantly, our junctional Myosin II and junctional shrinkage rate results are consistent with each other, therefore it is very unlikely that analyzing medial Myosin II would lead us to form a different conclusion. We will add a sentence to explain why we chose to quantify junctional, and not medial, Myosin II.

      *Minor points: *

      1. * Fig. 1-S1 panel C: the number of cyan cells changes non-monotonically. It first decreases from -10 min to 10 min, then increases from 10 min to 20 min. This is confusing since in theory the number of tracked cells should not increase over time if the cells are tracked from the beginning of the movie. *
      2. *

      The cyan cells highlight tracked mesodermal and mesectodermal cells, which are not included in the analysis. The low number of mesodermal cells highlighted at 10mins germband extension is because mesodermal and mesectodermal cells are not always tracked successfully at this time. Note that the legend includes a note that ‘”Unmarked cells are poorly tracked and excluded from the analysis”. Also see Methods: “Note on number of cells in movies, for notes on changes to the number of tracked ectodermal cells throughout the timecourse of the movies.”

      • Fig. 1-S2: the vnd band in panel A appears to be much narrower than in panel B. *

      • *

      These are fixed embryos, therefore this could be (at least partially) due to slight differences in exact developmental age of the embryo. Note that we wanted to check that vnd and ind are expressed in the correct places in the ectoderm. We were motivated to check this because the width of mesoderm is reduced in twist, so we thought it was important to verify that there is not a population of ‘ectodermal’ cells with a strange fate (i.e., negative for both vnd and ind). Our experiments show that vnd abuts the mesoderm/mesectoderm in twist as in wildtype, and that the cells immediately lateral to the vnd cell population express ind as expected.

      It is possible that there is a slight difference in the number of vnd cells in twist mutants compared to wildtype, but we see no differences in Myosin II bipolarity that would coincide with the vnd/ind boundary (Fig3-S1). Therefore, this would not change the interpretation of our results. Counting the number of rows of vnd cells prior to any cell intercalation (the number of rows will reduce as cells intercalate) would be technically challenging as the lateral border of vnd expression is hard to discern at this time due to lower levels of vnd expression laterally within the vnd expression domain.

      • The schematic in Fig. 2J suggests that at the onset of mesoderm pulling the germband cells have a uniform angle of rotation (towards bottom right). Is this the case?*

      • *

      No, this schematic is purely supposed to show that as cells stretch, they also reorient. Note that we will review our schematics in Fig. 2 to increase clarity (see response to reviewer 1, first minor comment).

      • The description of myosin intensity normalization in the Methods section is somewhat difficult to follow [Line 829 - 832]. It would be helpful if the authors can show one or two images before and after intensity normalization as examples. *

      We will add some examples of before and after normalization images to this section. We will also review the Methods to improve the text’s clarity.

      • Line 704: "Z-stacks for each channel were collected sequentially" - the step size in Z-axis should be reported. *

      Thank you for this, the step size was 1µm. We will add this information.

      • Fig. 4C: what are the thin, black lines in the image? *

      This image is a 2D representation of the Gap43Cherry signal at the level of the adherens junctions extracted for tracking, not a simple confocal z-slice. When viewing these representations, you can see lines showing borders between where information from different z-stacks was used for the tracking layer. Unfortunately, our software does not allow us to remove these lines, but they do not affect tracking, quantification etc.

      Reviewer #3 (Significance (Required)):

      While most previous work on tissue mechanics and morphogenesis focuses on tissue-intrinsic mechanical input, recent studies have started to emphasize the contribution of tissue-extrinsic forces. An important challenge in understanding the function of tissue-extrinsic forces lies in the difficulties in properly comparing the wild type and the mutant samples that disrupt extrinsic forces, in particular when cell fate specification is altered in the mutants. In this work, the authors addressed this challenge by employing a number of approaches to warrant a parallel comparison between genotypes, including examining the AP- and DV-patterning of the tissue, selecting sample regions with comparable cell fate for analysis, and carefully aligning the stage of the movies. With these approaches, the authors provide compelling evidence to support their main conclusions. By teasing apart the role of the intrinsic genetic program and the extrinsic tissue forces, the work provides important clarifications on the function of mesoderm pulling in GBE and adds new insights into this well-studied tissue morphogenetic process. This work should be of interest to the broad audience of epithelial morphogenesis, tissue mechanics and myosin mechanobiology.

      • *

      Review____er #4 (Evidence, reproducibility and clarity (Required)):

      *Lye and colleagues investigate the impact of tissue-tissue interactions on morphogenesis. Specifically, they ask how disrupting mesoderm internalization affects convergence and extension of the ectoderm (germband) in Drosophila embryos. Using twi mutants in which mesoderm invagination fails, the authors find that the invagination of the mesoderm deforms germband cells, but does not significantly contribute to patterning, cell alignment, myosin polarization and cell-cell contact disassembly (which drive germband convergence). The authors find modest effects of mesoderm invagination on new junction formation and orientation (which drive extension), but these changes do not have a significant effect on germband elongation. The authors conclude that germband extension is robust to external forces from the invagination of the mesoderm. *

      *MAIN 1. The authors clearly show that myosin density is not different in wild-type and twi mutant embryos, and subsequently argue that the pulling force from the mesoderm does not elicit a mechanosensitive response in early germband extension. But if the cell density is constant, doesn't that mean that the longer, DV-oriented interfaces in the wild type accumulate more total myosin than their shorter counterparts in twi mutants? Assuming that the total number of myosin molecules per cell is not greater in the wild type, wouldn't increased total myosin at the membrane suggest a response to the increased deformation? Certainly the cells are able to maintain the same cell density despite the pulling force from the mesoderm, so can the authors rule out a mechanosensing mechanism? *

      • *

      We do not rule out a mechanosensing mechanism. We agree the total Myosin at stretched interfaces is higher than at unstretched interfaces and proposed a homeostatic mechanism to maintain Myosin II density on the cortex upon rapid stretching (summarized in Fig. 7). Indeed it is possible that this mechanism could itself be due to mechanosensitive recruitment of Myosin II (though there are also other possibilities). We have tried to address this in our discussion (under “Mechanisms regulating Myosin II density at the cortex and consequences for cell intercalation” and “Restoration of DV cell length after being stretched by mesoderm invagination”), but we will amend the wording the make the possibility of mechanosensitive recruitment of Myosin II to maintain cortical density more explicit.

      *What happens to the Gap43mCherry signal? From Figure 2A, it seem to be diluted ventrally in the wild type as compared to twi mutants? Comparing myosin and Gap43 dynamics may shed light on whether myosin accumulates more or less than one would expect simply on the basis of having longer contacts. *

      We quantify the density of Myosin, rather than the total amount. Therefore, the length of the contact should not matter. The suggestion of comparing Myosin density to Gap43Cherry density is in principle a good one, as it would allow us to compare a protein which is not diluted as cell contact length increases (Myosin) to one which appears to be (Gap43). However, it is not essential for the conclusions that we make. However, in practice quantifying the Gap43Cherry signal would not be straightforward on our existing movies due to the imaging parameters used. We capture the Gap43Cherry channel (but not the Myosin channel) with a ‘spot noise reducer’ tuned on in the camera software, due to very occasional bright spot noise, which confuses the tracking software. Therefore, our Gap43Cherry signal is manipulated during acquisition and to quantify from these images would not be appropriate. Therefore, we would have to acquire, track and quantify some new movies, which is not possible within the timeframe of a revision.

      In summary, we think that we have sufficient evidence from our analysis that Myosin II is not diluted upon junctional stretching without comparing to quantification of Gap43Cherry, and the time investment required to quantify the Gap43Cherry would not be worthwhile as it would require more data to be acquired and processed.

      • The authors previously argued that mesoderm invagination was required for the fast phase of cell intercalation [Butler et al., 2009]. However, here the authors interpret that loss of twi does not significantly slow down interface contraction, but accelerates the elongation of junctions and cells along the AP axis, which overall would mean that mesoderm invagination is (slightly) detrimental for axis elongation. The discrepancy between their previous and current results should be discussed. *

      We are happy to add more information about these discrepancies in the discussion. In a nutshell, we think that these discrepancies arise from the challenges of comparing wildtype and twist mutant embryos relative to each other, and as a consequence we have made various improvements to our methods since (Butler et al., 2009). These improvements included using markers that would be expressed at the same levels in wildtype and twist embryos. Additionally, we did not use overexpressed cadherin-FPs (namely, the ubi-CadGFP transgene), which may have confounding effects, and we used a knock-in sqhGFP to ensure we could all Myosin II molecules were labelled by GFP. We also carefully controlled the temperature at which we acquired the movies, standardized the level at which to track cells and quantify Myosin between movies, as well as improving the accuracy of our image segmentation and cell type identification since our previous study (Butler et al., 2009). See also response to reviewer 2.

      • Related to the previous point, it is surprising that the differences shown in Figure 4A-B are not significant. This is particularly troubling when in Figure 5B the authors claim a significant difference in cell elongation rate, which is higher in twi mutants (but only in very short time intervals and actually switches sign at the end of germband extension). These are just two examples, but I think the analysis of significance on a per-time point basis is problematic. *

      *Have the authors considered analyzing their results as time series rather than comparing individual time points? Or perhaps integrating the different metrics over the duration of germband extension (e.g. using areas under the curve)? That way they would not have to arbitrarily decide if significant differences in a few time points should or not be interpreted as significant overall differences. *

      • *

      For graphs plotted against time of germband extension, we do not think it is appropriate to analyze as a time series rather than comparing individual time points, since different developmental events (such as mesoderm invagination) occur at different times. For graphs plotted against time to/from cell neighbor swap, these can also change over time (e.g., ctrd-ctrd orientation, Fig6D). Therefore we do not feel that it appropriate to run statistical analyses as a timeseries for these comparisons either. Statistically cut-offs are by their nature arbitrary. We have tried to highlight non-significant trends throughout the text (including for Fig4A&B), in addition to stating where we see significant differences to highlight where there may be minor (but not significant) differences.


      • While the number of cells analyzed is impressive, the number of embryos is relatively low, particularly for the wild type (only four embryos analyzed). If I understood correctly (if not, please clarify) the authors ran their statistics using cells and not embryos as their measurement unit. But I could not find any evidence that cells from the same embryo can be considered as independent measurements. This could be easily done by demonstrating that the variance of any of the measurements (e.g. elongation, area change rate, etc.) for cells in an embryo is comparable to that calculated when mixing cells from different embryos. *

      • *

      We do not simply use the number of cells as an n for our experiments. We use a mixed effects model for our statistics as previously (Butler et al., 2009; Finegan et al., 2019; Lye et al., 2015; Sharrock et al., 2022; Tetley et al., 2016). This estimates the P value associated with a fixed effect of differences between genotypes, allowing for random effects contributed by differences between embryos within a given genotype. We will make sure that this is clear in the Methods.

      MINOR 1. Figure 4D: the authors show no difference in the proportion of neighbor swaps per minute between wild-type and twi- mutant embryos. But how about the absolute number of neighbour swaps per minute? Does that change in twi mutants (and if so, why?).

      The number of interfaces involved in a T1 swap are expressed as a proportion of the total number of DV-oriented interfaces for all tracked ectodermal germband cells, to take account of differences in the number of tracked cells between different timepoints and different movies. Presenting the absolute number of swaps per minute could lead to misleading interpretations.

      • I was a bit confused about the reason why in Figure 4A the authors measure the rate of interface contraction in units of “proportion/min”, but in Figure 5A they measure interface elongation in units of “um/min”. Unless there is a good reason not to, these two metrics should be reported using the same units. Is there a difference in the rate of interface contraction when measured in absolute units (um/min)? *

      Thank you, we will amend so that both measures are expressed in the same units.

      • The discussion of previous work on cell deformation within the mesoderm (page 16, first paragraph) should probably include recent work from Adam Martin's lab (e.g. [Heer et al., 2017]; or [Denk-Lobnig et al., 2021]). *

      Thank you, and apologies for this oversight, we will add these references__.__

      SUGGESTIONS 1. While I appreciate the arguments that the authors provide to use twi mutants rather than sna mutants or twi sna double mutants, as the authors indicate, in twi mutants there is still contractility in the mesoderm (albeit not ratcheted). Therefore, it is possible that contractile pulses from the mesoderm in twi mutants could still facilitate cell alignment and polarization of myosin in the germband. Given the previous results from the Zallen lab using twi sna double mutants (see above) this is unlikely to be the case, but the findings in this manuscript would be significantly stronger if they included similar analysis in the double mutants.

      We had concerns about using sna or twi sna double mutants due to the large amount of space the un-internalized mesoderm takes up on the exterior of the embryo. This concern is also shared by reviewer 1 “Importantly, I think the researchers were correct in choosing to analyze twist single-mutant embryos (as opposed to snail or twist, snail double-mutant embryos), as the overall embryo geometry of these mutants is fairly similar to wild-type embryos, allowing the researchers to directly compare cell behaviors and myosin dynamics during germband extension. This approach also allows them to avoid indirect effects on the germband due to a completely non-internalized mesoderm.” * In addition to this concern, imaging of snail or twist snail* embryos by confocal imaging to include the ventral midline (which is required to define embryonic axes) is problematic as the un-constricted mesodermal cells occupy virtually all the field of view, leaving very few ectodermal cells to analyze.

      Whilst we acknowledge that there are some (un-ratcheted) contractions of mesodermal cells in twist mutants, we have clearly shown that there is no DV stretch and very little reorientation of cells. Therefore, any residual contractile activity in the mesodermal cells of twist mutants does not appear to have a mechanical impact on the ectoderm. We cannot exclude the possibility that there is some transmission of forces between contracting cells of the mesoderm and the ectoderm in twist mutants. However, our evidence suggests that the large tissue scale force that transmits to the ectoderm from the invaginating mesoderm is missing in twist mutants, and it was the effects of that force that we wished to investigate (See also response to reviewer 2).

      Review*er #4 (Significance (Required)): *

      *This is an interesting study, with careful quantitative analysis of cellular and subcellular dynamics. The results follow previous findings from Jennifer Zallen and the authors themselves. The Zallen lab showed that cell alignment, myosin polarization and germband extension are normal in sna twi mutants [Fernandez-Gonzalez et al., 2009], a result that the authors fail to cite. The results in the present manuscript are similar, but the analysis is much more in depth here, so the findings by Lye and colleagues certainly warrant publication. *

      We did not specifically cite this result from (Fernandez-Gonzalez et al., 2009), because the subject of their study is the formation of multicellular rosettes, not whether a pull from mesoderm affects Myosin II polarity and cell intercalation. The formation of multicellular rosettes occurs later in germband extension, and therefore these results are not directly relevant to our study. Additionally, their measures of alignment are defined as linkage to other approximately DV oriented interfaces, rather than directly measuring orientation compared to the embryonic axes as we do here, as a different question is being addressed. Specifically, the quoted sna twi experiment is interpreted as extrinsic forces from the mesoderm not being required for linkage of Myosin enriched DV-oriented interfaces together. Myosin II quantification is more rudimentary with edges being assigned as Myosin positive or Myosin negative, as opposed to quantifying the density of Myosin on each interface and we cannot see any comparison of Myosin II quantification between wildtype and twist embryos.­

      So, although the results are consistent with each other, they are not directly comparable due to methods used and we are happy that the reviewer acknowledges that our analysis is more in depth, which was necessary to address the specific questions that we investigate in our study.

              In general, there have been inconsistencies in results between previous studies, leading reviewer one to recognize that *“…it should be published and that it will be an impactful paper within the field. Namely, it will settle once and for all the question of whether mesoderm invagination is required for optimal germband extension in the early Drosophila embryo.”  *The high amount of conflicting information in the literature led us to not exhaustively describe individual findings, but we will ensure the results from the Zallen lab are appropriately cited.
      

      However, there are a number of experimental points that I think need to be addressed to solidify the manuscript, particularly in terms of statistical analysis.

      Please see more details above (main points 3 and 4) regarding specific concerns about experimental points and statistics. Additionally, we note that reviewer 3 states “statistics were appropriately used”, and our statistical methods are the same as we have used in previous studies comparing live imaging data (Butler et al., 2009; Finegan et al., 2019; Lye et al., 2015; Sharrock et al., 2022; Tetley et al., 2016).

      • *

      __REFERENCES

      __

      Blanchard, G. B., Kabla, A. J., Schultz, N. L., Butler, L. C., Sanson, B., Gorfinkiel, N., Mahadevan, L. and Adams, R. J. (2009). Tissue tectonics: morphogenetic strain rates, cell shape change and intercalation. Nat Methods 6, 458-464.

      Butler, L. C., Blanchard, G. B., Kabla, A. J., Lawrence, N. J., Welchman, D. P., Mahadevan, L., Adams, R. J. and Sanson, B. (2009). Cell shape changes indicate a role for extrinsic tensile forces in Drosophila germ-band extension. Nat Cell Biol 11, 859-864.

      Farrell, D. L., Weitz, O., Magnasco, M. O. and Zallen, J. A. (2017). SEGGA: a toolset for rapid automated analysis of epithelial cell polarity and dynamics. Development 144, 1725-1734.

      Fernandez-Gonzalez, R., Simoes Sde, M., Roper, J. C., Eaton, S. and Zallen, J. A. (2009). Myosin II dynamics are regulated by tension in intercalating cells. Dev Cell 17, 736-743.

      Finegan, T. M., Hervieux, N., Nestor-Bergmann, A., Fletcher, A. G., Blanchard, G. B. and Sanson, B. (2019). The tricellular vertex-specific adhesion molecule Sidekick facilitates polarised cell intercalation during Drosophila axis extension. PLoS Biol 17, e3000522.

      Gustafson, H. J., Claussen, N., De Renzis, S. and Streichan, S. J. (2022). Patterned mechanical feedback establishes a global myosin gradient. Nat Commun 13, 7050.

      Irvine, K. D. and Wieschaus, E. (1994). Cell intercalation during Drosophila germband extension and its regulation by pair-rule segmentation genes. Development 120, 827-841.

      Leptin, M. and Grunewald, B. (1990). Cell shape changes during gastrulation in Drosophila. Development 110, 73-84.

      Lye, C. M., Blanchard, G. B., Naylor, H. W., Muresan, L., Huisken, J., Adams, R. J. and Sanson, B. (2015). Mechanical Coupling between Endoderm Invagination and Axis Extension in Drosophila. PLoS Biol 13, e1002292.

      Proag, A., Monier, B. and Suzanne, M. (2019). Physical and functional cell-matrix uncoupling in a developing tissue under tension. Development 146.

      Rauzi, M., Lenne, P. F. and Lecuit, T. (2010). Planar polarized actomyosin contractile flows control epithelial junction remodelling. Nature 468, 1110-1114.

      Sharrock, T. E., Evans, J., Blanchard, G. B. and Sanson, B. (2022). Different temporal requirements for tartan and wingless in the formation of contractile interfaces at compartmental boundaries. Development 149.

      Simpson, P. (1983). Maternal-Zygotic Gene Interactions during Formation of the Dorsoventral Pattern in Drosophila Embryos. Genetics 105, 615-632.

      Tetley, R. J., Blanchard, G. B., Fletcher, A. G., Adams, R. J. and Sanson, B. (2016). Unipolar distributions of junctional Myosin II identify cell stripe boundaries that drive cell intercalation throughout Drosophila axis extension. Elife 5.

      Wang, X., Merkel, M., Sutter, L. B., Erdemci-Tandogan, G., Manning, M. L. and Kasza, K. E. (2020). Anisotropy links cell shapes to tissue flow during convergent extension. Proc Natl Acad Sci U S A 117, 13541-13551.

    1. Author Response

      We are grateful to the editors and the reviewers for the thorough evaluation of our manuscript and their feedback, as it allows us to provide additional clarification of our findings and improve the manuscript.

      In their evaluation reviewers raised a key conceptual point linked to the inhibitory mechanism that appeared to be insufficiently explained in the manuscript, leading to a misconception regarding the physiological relevance. They have also missed experimental data related to the concentrations of Aβ used and their relevance for Alzheimer’s disease (AD). We believe that our studies, although performed in vitro in model systems, provide novel conceptual framework and shed light on the unexplored mechanisms underlying AD.

      We discuss these points below in a provisional response to their comments.

      Reviewer #1 (Public Review):

      Summary:

      Human Abeta42 inhibits gamma-secretase activity in biochemical assays.

      Strengths:

      Determination of inhibitory concentration human Abeta42 on gamma-secretase activity in biochemical assays.

      Weaknesses:

      Human Abeta42 may concentrate up to microM order in endosomes.

      This is correct.

      If so, production of Abeta42 would be attenuated then lead to less Abeta deposition in the brain. The authors finding is interesting but does not fit the physiological condition in the brain.

      We thank the reviewer for raising this key conceptual point, as this gives us the opportunity to clarify it for the future readers.

      The characterized inhibitory mechanism is more complex than the reviewer’s interpretation, and a number of factors must be considered. Indeed, our data show that Aβ42 upon intracellular concentration inhibits γ-secretase activity, resulting in increased γ-secretase substrate (C-terminal fragment, CTF) levels. It is important however to highlight that this inhibition is competitive in nature, implying that it is partial, reversible, and regulated by the relative concentrations of the Aβ42 peptide (inhibitor) and the substrates. The model that we put forward is that cellular uptake and intracellular concentration of Aβ42 facilitates γ-secretase inhibition, which results in the accumulation of APP-CTFs (and γ-secretase substrates in general). However, as Aβ42 levels fall, the increased concentration of substrates shifts the equilibrium towards their processing and Aβ production. As Aβ42 concentration raises again, equilibrium is shifted back towards inhibition and so on. This inhibitory mechanism will translate into pulses of (partial) γ-secretase inhibition, which will alter γ-secretase mediated signalling (arising from increased CTF levels or decreased release of soluble intracellular domains from substrates). These alterations may affect the dynamics of systems oscillating in the brain, such as NOTCH signalling, implicated in memory formation (2), and potentially others (related to e.g. cadherins, p75 or neuregulins).

      It is worth noting that oscillations in γ-secretase activity induced by treatment with a γ-secretase inhibitor (semagacestat) have been proposed to have contributed to the cognitive alterations observed in semagacestat treated patients in the failed Phase-3 IDENTITY clinical trial (2, 3); and that semagacestat, like Aβ42, acts as a high affinity competitor of substrates (Koch et al, 2023). We will include this clarification in the discussion of the revised manuscript and create an additional figure presenting the proposed mechanism.

      It is not clear whether the FRET-based assay in living cells really reflect gamma-secretase activity.

      The specificity of this assay is supported by the γ-secretase inhibitor treatment included as a positive control (Figure 3). In addition, the following literature supports that this assay truthfully assesses γ-secretase activity in cellular context (4-7).

      Processing of APP-CTF in living cells is not only the cleavage by gamma-secretase.

      This is correct, and therefore we have analysed the contribution of other APP-CTF degradation pathways by performing cycloheximide-based stability assay in the presence of γ-secretase inhibitor. Quantitative analysis of the levels of both APP-CTFs and APP-FL over the 5h time-course failed to reveal significant differences between Aβ42 treated cells and controls. As expected, Bafilomycin A1 treatment markedly prolonged the half-life of both proteins (Figure 7B & C). The lack of a significant impact of Aβ42 on the half-life of APP-CTFs under the conditions of γ-secretase inhibition is consistent with the proposed inhibitory mechanism. Finally, we note that the inhibition will not only affect APP-CTF, but also the processing of γ-secretase substrates in general.

      Reviewer #2 (Public Review):

      Summary:

      In the current study, the authors tested the hypothesis that Aβ42 toxicity arises from its proven affinity for γ-secretases. Specifically, the increases in Aβ42, particularly in the endolysosomal compartment, promote the establishment of a product feedback inhibitory mechanism on γ-secretases, and thereby impair downstream signaling events. They showed that human Aβ42 peptides, but neither murine Aβ42 nor human Aβ17-42 (p3), inhibit γ-secretases and trigger accumulation of unprocessed substrates in neurons, including (CTFs of APP, p75 and pan-cadherin. Moreover, Aβ42 dysregulated cellular homeostasis by inducing p75-dependent neuronal death. Because γ-secretases process many other membrane proteins, including NOTCH, ERB-B2 receptor tyrosine kinase 4 (ERBB4), N-cadherin (NCAD) and p75 neurotrophin receptor (p75-NTR), revealing a broad range of downstream signaling pathways, including those critical for neuronal structure and function. Hence, they propose to identification of a selective role for the Aβ42 peptide, and raise the intriguing possibility that compromised γ-secretase activity against the CTFs of APP and/or other neuronal substrates contributes to the pathogenesis of AD. Overall, the data are not very convincing to support the main claim.

      Strengths.

      Different in vitro and cellular approaches are employed to test the hypothesis.

      Weaknesses.

      The experimental concentrations for Aβ42 peptide in the assay are too high, which are far beyond the physiological concentrations or pathological levels. The artificial observations are not supported by any in vivo experimental evidence.

      It is correct that in the majority of the experiments we used low μM concentrations of Aβ42. However, we would like to note that we also performed experiments where conditioned medium collected from human APP.Swe expressing neurons was used as a source of Aβ. In these experiments total Aβ concentration was in low nM range (0.5-1 nM) (Figure 4G). Treatment with this conditioned medium led to the increase APP-CTF levels, supporting that low nM concentrations of Aβ are sufficient for partial inhibition of γ-secretase.

      We would like to underline that Aβ is estimated to be present in the brain in concentration ranging from fM to mM, depending on the pool (soluble, aggregated, fibrillar, etc) that is considered (8, 9). However, it is rather the local than the global concentration of Aβ that is critical for the disease pathogenesis. In this regard, it is proposed that as AD progresses Aβ42 slowly accumulates in the endo-lysosomal system wherein it reaches μM concentrations that are required for aggregation and seeding (1, 10, 11). Our findings are consistent with the analysis showing that extracellular soluble Aβ42 peptide, at low nM concentrations, is taken up by cortical neurons and neuroblastoma (SH-SY5Y) cells, and concentrated in the endo-lysosomal system wherein effective peptide concentrations reach ~2.5 μM (1). Hence, a slow vesicular peptide accumulation and/or degradation imbalance (1, 11, 12) could lead to several order of magnitude increases in the effective concentration of Aβ42 over the span of years to decades in AD pathogenesis. We note that our experimental settings, using low μM concentrations of extracellular Aβ42 over 24h treatment, were designed to accelerate this 'peptide concentration’ process in vitro. As discussed in our report, a high μM Aβ peptide concentration in the endo-lysosomal system not only leads to aggregation but also facilitates γ-secretase inhibition. Of note, we are currently developing protocols and will undertake follow up studies to quantitatively define the Aβ concentration in synaptosomes and endosomes in AD brain, as well as in in vitro systems (i.e. cells treated with Aβ preparations obtained from AD brains).

      Finally, we would like to highlight that analyses of the brains of the AD affected individuals have shown that APP-CTFs accumulate in both sporadic and genetic forms of the disease (13-15); and recently, Ferrer-Raventós et al have revealed a correlation between APP-CTFs and Aβ levels at the synapse (13).

      To conclude, we would like to highlight that as clarified above, the Aβ peptide concentrations and the conditions tested fit well within pathophysiology, and that the data presented in our report collectively provide evidence in support of an Aβ42-mediated inhibitory effect on γ-secretase.

      References:

      1. X. Hu et al., Amyloid seeds formed by cellular uptake, concentration, and aggregation of the amyloid-beta peptide. Proc Natl Acad Sci U S A 106, 20324-20329 (2009).
      2. B. De Strooper, Lessons from a failed γ-secretase Alzheimer trial. Cell 159, 721-726 (2014).
      3. R. S. Doody et al., A phase 3 trial of semagacestat for treatment of Alzheimer's disease. N Engl J Med 369, 341-350 (2013).
      4. M. C. Houser et al., A Novel NIR-FRET Biosensor for Reporting PS/γ-Secretase Activity in Live Cells. Sensors (Basel) 20, (2020).
      5. M. C. Q. Houser et al., Limited Substrate Specificity of PS/γ-Secretase Is Supported by Novel Multiplexed FRET Analysis in Live Cells. Biosensors (Basel) 11, (2021).
      6. M. Maesako et al., Visualization of PS/γ-Secretase Activity in Living Cells. iScience 23, 101139 (2020).
      7. M. Maesako, M. C. Q. Houser, Y. Turchyna, M. S. Wolfe, O. Berezovska, Presenilin/γ-Secretase Activity Is Located in Acidic Compartments of Live Neurons. J Neurosci 42, 145-154 (2022).
      8. B. R. Roberts et al., Biochemically-defined pools of amyloid-β in sporadic Alzheimer's disease: correlation with amyloid PET. Brain 140, 1486-1498 (2017).
      9. J. A. Raskatov, What Is the "Relevant" Amyloid β42 Concentration? Chembiochem 20, 1725-1726 (2019).
      10. M. P. Schützmann et al., Endo-lysosomal Aβ concentration and pH trigger formation of Aβ oligomers that potently induce Tau missorting. Nat Commun 12, 4634 (2021).
      11. E. Wesén, G. D. M. Jeffries, M. Matson Dzebo, E. K. Esbjörner, Endocytic uptake of monomeric amyloid-β peptides is clathrin- and dynamin-independent and results in selective accumulation of Aβ(1-42) compared to Aβ(1-40). Sci Rep 7, 2021 (2017).
      12. M. F. Knauer, B. Soreghan, D. Burdick, J. Kosmoski, C. G. Glabe, Intracellular accumulation and resistance to degradation of the Alzheimer amyloid A4/beta protein. Proc Natl Acad Sci U S A 89, 7437-7441 (1992).
      13. P. Ferrer-Raventós et al., Amyloid precursor protein Neuropathol Appl Neurobiol 49, e12879 (2023).
      14. M. Pera et al., Distinct patterns of APP processing in the CNS in autosomal-dominant and sporadic Alzheimer disease. Acta Neuropathol 125, 201-213 (2013).
      15. L. Vaillant-Beuchot et al., Accumulation of amyloid precursor protein C-terminal fragments triggers mitochondrial structure, function, and mitophagy defects in Alzheimer's disease models and human brains. Acta Neuropathol 141, 39-65 (2021).
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      General Statements

      In this manuscript, we describe a LINC complex-dependent centrosome positioning mechanism that takes place during the early stages of mitotic spindle assembly. We are grateful to the reviewers for their comments and suggestions and hope the proposed revision plan addresses all concerns raised. We are pleased that reviewers recognize the excellent technical quality of the experiments and the significance of the work presented in this manuscript.

      Description of the planned revisions

      Reviewer 1

      • “Moreover, we demonstrate this mechanism is altered in cancer cells, leading to increased chromosome segregation errors. » Here the authors infer that the identified mechanism is absent in cancer cells and that its absence contributes to chromosome segregation errors. Both conclusions are not supported by the presented data. First, the authors did not test whether any members of the LINC complex or dynactin is present at lower levels on the nuclear membranes of the cancer cells. Such a direct validation would be essential to make such a strong statement. Second, the authors conclude that this mechanism prevents chromosome segregation errors, based on the fact that depletion or impairment of the LINC complex (shSUN1, shSUN2, DN-KASH) results in chromosome segregation errors. These perturbations lead, however, as noted by the authors themselves to pleiotropic effects, including insufficient retraction of nuclear membrane, which can all contribute to chromosome segregation errors. It is therefore impossible to estimate the contribution of the centrosome positioning mechanism to these segregation errors using this type of perturbations. One could even argue that this mechanism might not be that important, since depletion of SUN2, which also impairs centrosome positioning has no significant effect on chromosome segregation.

      We agree with the reviewer that an analysis of the levels of LINC complex components and dynactin in cancer cells is lacking. For this reason, we propose to analyze the levels of SUN1, SUN2, dynactin and Nesprins by immunofluorescence in all cell lines. In addition, we have now re-written the manuscript regarding the chromosome segregation phenotype, to clarify that the observed phenotypes are not necessarily due to centrosome positioning defects.

      Reviewer 2

      “The authors need some other NE protein as a control to show that the reduction of dynein by DN-KASH is a specific defect and not a broad impact on the NE. The dynein data in Figs. 5J-L need to be extended to SUN1/2”.

      We thank the reviewer for these suggestions. To clarify this point, we will analyze the levels of lamin B following expression of DN-KASH or DPPPL-KASH. This will allow us to determine whether expression of the DN-KASH construct only affects dynein and not other NE proteins. In addition, we will analyze dynactin levels following SUN1 and SUN2 depletion.

      Reviewer 3:

      “Fig. 3: I suggest to quantify the lamin B1 and LBR overexpression levels”.

      According to the reviewer´s suggestion, we will perform a WB analysis of the cells overexpressing lamin B1 and LBR and quantify its levels.

      Description of the revisions that have already been incorporated in the transferred manuscript.

      Reviewer 1

      • “The authors conclude based on three cell lines that the centrosome positioning mechanisms is present in non-transformed cells and not in cancerous cells. The authors have, however, only analysed 1 non-cancerous cell line, and they compare cells originating from vastly different tissues (retina, bones and breast) and origins (epithelial vs. mesenchymal cartilage cells). Such a general statement is not possible, without a systematic comparison of several healthy cells vs cancerous cells from the same tissue”.

      We agree with this reviewer´s comment, which is also shared by the other reviewers. Accordingly, we have now extensively rewritten the manuscript to tone down this statement and focus on the role of the LINC complex in determining centrosome positioning.

      • “While the data showing that centrosome positioning depends on the LINC complex is solid and robust, some of the "negative" examples identified by the authors are less convincing. One the process the authors study is cell rounding. Based on the fact that Rap1 transfection or treatment with Calyculin A does not lead to differences that are statistically different, the authors conclude that cell rounding is not involved. However, absence of statistical difference does not mean that there is no difference. Indeed, when comparing the raw data in Figure 2L and 2Q to the positive hit shSun2 in Figure 4J, one could conclude that cell rounding does make a difference, and that this statistical difference would emerge if the authors would count a high number of cells. Therefore the authors should interpret these results in a more differentiated manner, and also instead of just stating nonsignificant, state also the real p-values for the different experiment”.

      According to the reviewer´s suggestion, we have now added all p values to the respective graphs and interpreted these results in a more step-by-step manner. Moreover, while we understand the reviewer`s comment regarding our sample size, it should be noted that this is a single-cell, high-resolution imaging approach which, in combination with certain treatments makes it very challenging to obtain data for a high number of cells. In this regard, we point out that interfering with cell rounding was extremely difficult to achieve. When highly overexpressed, Rap1* completely impairs mitotic cell de-adhesion, and this blocks mitotic entry (Marchesi et al., 2014). Furthermore, CalA treatment induces a fast and drastic rounding, which makes it very challenging to accurately track centrosome and nuclear positions. Nevertheless, we filmed additional cells treated with CalA and added the data to the figures. Our results still confirm that interfering with cell rounding does not significantly change centrosome positioning during this stage. It should be noted that the sample size in all conditions is within the range normally used when performing single-cell high resolution imaging.

      • The second major concerns emerges when looking at the data in Figure 5, when the authors test for the abundance of the dynein complex on the nuclear envelope in cells treated with DPPPL-KASH or DN-KASH. Yes, there is a statistical difference, but the absolute difference is tiny (I estimated a normalized intensity of 1.44 vs 1.35). This is a difference of less than 10%. How do the authors think that such a small change in dynein could have such a strong effect on centrosome positioning? Would a partial dynactin depletion by 10% give an equivalent result? Does the depletion of other proteins involved in the late recruitment of dynein at the NE also affect centrosome positioning?

      We thank the reviewer for this important point. Originally, we quantified dynactin intensity by selecting three unbiased random regions of the NE. However, this approach might underestimate the overall fluorescence intensity across the entire structure. For this reason, we have now measured dynactin fluorescence intensity over the entire NE using the same dataset. We have replaced Fig. 5K and L with this data and a description of the method has been added to the Materials and Methods section. As can be seen from the new graph, there is a reduction of approximately 50% in dynactin NE fluorescence intensity.

      The reviewer also asks whether depletion of other proteins involved in the late recruitment of dynein at the NE would also affect centrosome positioning. However, extensive previous work done by us and others, has shown that depletion of either BicD2 or NudE/NudEL, which are the main adaptors for dynein loading during the G2/M transition, significantly affect prophase centrosome positioning, since they detach centrosomes from the NE (Splinter et al., 2010; Bolhy et al., 2011; Hu et al., 2013; Baffet et al., 2015; Nunes et al., 2020). Once detached, centrosomes are no longer able to orient according to nuclear cues. Therefore, we do not believe such an approach would provide additional information regarding the role of the LINC complex in this process.

      Reviewer 2:

      • “Figure 1 is insufficiently explained. The authors have to describe in an understandable way how they measured centrosome-centrosome angle and centrosome-nucleus angle. They should show a cartoon in which these angles are clearly shown. The small cartoons in Fig. 1C are not helpful at all; they are also not explained. The authors should explain the meaning of the black dots (are these centrosomes?) and the even smaller dots. The short nuclear axis should be indicated, e.g., by a red line”.

      We apologize for the lack of sufficient explanation in Figure 1. We have now re-written the text. We have also added a scheme explaining how centrosome-nucleus and centrosome-centrosome angles are quantified, according to the reviewer´s suggestion. We have added this to Fig. S1. We believe this makes our data more understandable and easier to follow.

      “On the first page of the manuscript: "Consequently, at the NEP, centrosomes are positioned on the shortest nuclear axis (Fig. 1C) as can be seen in Fig. 1A. This means that the centrosome-nucleus angle relative to the shortest nuclear axis should be 0. However, in Fig. 1C, this angle is between 45 and 90 degrees. This is also the case for Fig. 1D. Please clarify”.

      We thank the reviewer for noticing this error. In fact, the graphs reflect positioning of centrosomes relative to the longest nuclear axis. Therefore, when the values are close to 90º, this means they are oriented on the shortest nuclear axis. We understand this could be confusing to the readers. We have now clarified this information throughout the text.

      • “I find it confusing that in Fig. 1, depending on the subfigure, the short or longest nuclear axis is used as a reference point: Fig. 1C: shortest; D: shortest; F: shortest; G: longest; I: shortest; J: longest. Thus, even within the same cell line, the reference point is changing. What is the rational for this variation”?

      Again, we refer to the point above. The reference point is always the shortest nuclear axis. However, we apologize for the lack of clarity. This has all been changed, according to the explanation provided in the previous point.

      • Fig. 4K, L, M: in figure, y-axis: "shortest nuclear axis". In legend: "relative to the longest nuclear axis". I guess the longest nuclear axis is correct. Same in Fig. 5D and E. Fig. 5C lacks the WT control.

      This information has been clarified in the text and panels have been corrected accordingly. Regarding Fig. 5C, we believe the correct control is the expression of PPPL-KASH, since it has been shown extensively that Nesprins localize to the NE in control, unmanipulated cells. Nevertheless, we have added a WT control to Supplementary Figure 5, showing localization of Nesprins in unmanipulated prophase cells.

      “The cells in Fig. 5J are not comparable: one has a monopolar spindle, the other a bipolar. The authors need some other NE protein as a control to show that the reduction of dynein by DN-KASH is a specific defect and not a broad impact on the NE. The dynein data in Figs. 5J-L need to be extended to SUN1/2”.

      We agree with the reviewer´s comment that the cell in the top panel might appear as a monopolar. However, it is not. In fact, this cell has centrosomes on the top and bottom of the nucleus, in a vertical configuration (check Magidson et al., Cell, 2011). To clarify this, we have now added lateral projections of all cells, highlighting the centrosomes to clearly show they are positioned on opposite sides of the nucleus. The other points related to the effects of DN-KASH on other NE proteins and dynactin levels following shSUN1 and shSUN2 are being addressed (please see comments above in the section “description of planned reviews”).

      “The title of the paper is misleading: the authors do not provide any indication for a nuclear signal in prophase that determines centrosome positioning”.

      We have changed the title of the manuscript according to the reviewer´s suggestion.

      “It would make sense to use the same time scale in Figs. 1A and B (either min.sec. or sec.) to allow direct comparison”.

      We have now changed the time scale to seconds in all figures to allow direct comparison.

      “2nd section: Mitotic cell rounding "The authors state: Given that cancer cells failed... I would be careful with this generalization; only one cancer cell was used in this study”.

      Given the limited number of cells that we used, and following the concern raised by all reviewers, we have now re-written the text to avoid generalizations. Instead, we now focus on the role of the LINC complex in determining centrosome positioning.

      “The authors say: "However, they did not place the centrosomes at the shortest nuclear axis (Figure 4K-M)." Centrosomes are still on the shortest nuclear axis but not as frequent as in control”.

      This has been corrected.

      “The white color in Fig. 6B cannot be seen and needs to be changed to something else”.

      We apologize for this oversight. During the upload and pdf conversion process, we did not realize the color of this bar, corresponding to the DN-KASH group had changed to white. This has now been corrected.

      The paper has neither line nor page numbers.

      This has been added.

      Reviewer 3

      “it would make sense to indicate the test used for each p-value in all the figure legends”.

      We have now added the statistical test used and the p-value in the figure legends.

      “Figure legends are quite repetitive and could be shortened. E.g. in Fig. 1 the description for E, F, H and I repeats what has been explained for B and C. Same applies between figure legends. The authors might refer to previous legends if the analysis was done in a similar way”.

      The legends have been simplified.

      “How is nuclear solidity defined and analyzed in Fig S3D”?

      Nuclear solidity was analyzed using Fiji. In short, nuclei are outlined using the polygon tool and nuclear area is measured. To calculate nuclear solidity, the nuclear area is then divided by the corresponding nuclear convex hull area. Irregular nuclei will typically show a lower nuclear solidity value. This information was added to the text.

      “The references to Fig S3 in figure legend 3 ("see Fig S3") do not enlighten the message and could be removed. The same applies to Fig5 - here it is not clear why the author refer to Fig S4”.

      We agree with this reviewer´s comment. We have now removed these references from the legends.

      “Fig. 5: Consider reordering the panel: Start with the current panel C (as in the text) as it is the necessary control prior to the experimental data”.

      We have now changed the order of the panel according to the reviewer´s suggestion.

      “Fig 5 I: what means "before"? Can the authors give a time window they use for analysis”.

      We have now replaced the term “before” with a defined time.

      “Page 20: "... shortest nuclear axis (Fig. 1C, 5D-G; n.s. - not significant). However, DN-KASH-expressing cells showed compromised separation and positioning of centrosome (Fig. 5D-G, * p=0.0155 and * p=0.0237, respectively). - rather point to the specific panels, i.e. Fig. 1C, 5D and F as well as and Fig. 5E and G”.

      We have now clarified these points in the text.

      “Fig 6B. The DN- KASH bars are on my pdf not visible - use a darker grey”.

      As mentioned above, we apologize for this oversight. We did not realize that during the pdf conversion process the bar corresponding to the DN-KASH group had changed to white. We have now corrected this.

      “Fig S6, albeit mentioned in the text, is not included in the supplementary info”.

      We apologize for this error. In fact, where it reads Fig. S6, should be Fig. S5. We have now corrected this.

      “a. GlutaMAX instead of GlutaMAXE (page 29)

      b. What means "as described previously"? No reference is given. Do you refer to the upper part of the method section? (page 30)

      c. 20 nM HEPES should most probably read 20 mM (page 32)

      d. "1:50 protease inhibitor; 1:100 Phenylmethylsulfonul fluoride" - which protease inhibitor (mixture)? Rather phenylmethylsulfonyl fluoride.

      e. exact composition of the cytoskeleton buffer used to prepare 4% paraformaldehyde could be given”.

      All these suggestions/corrections have been introduced in the text.

      Description of analyses that authors prefer not to carry out.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      A nuclear signal in prophase determines centrosome positioning and ensures efficient mitotic spindle assembly.<br /> Lima and Ferreira investigate in this manuscript the regulation of centrosome positioning in early mitosis. The authors first analyze the position of the two centrosomes either relative to the cell length axis or the shortest or longest axis of the nucleus and describe differences between RPE1, U2OS, and MDA-MB cells. Next, they analyze whether mitotic cell rounding determines the position of centrosomes; however, delayed cortical retraction (Rho-kinase inhibition), adhesion disassembly inhibition (Rap1Q63E), and inducing premature rounding (CalA) did not impact centrosome positioning in RPE1, U2OS, and MDM-MB cells. In addition, the nuclear lamina and LBR were also not required for centrosome positioning on the shortest nuclear axis. In contrast, depletion of SUN1 or SUN2 and overexpression of a dominant-negative DN-KASH affected the nuclear positioning of centrosomes in RPE1 cells. Finally, the authors analyze whether the LINC complex impacts mitotic fidelity. This is indeed the case when SUN1 is depleted, but it is not the case for SUN2 depletion or DN-KASH overexpression. This difference between LINC complex components is not discussed in the manuscript. Since SUN1, SUN2, and DN-KASH affect centrosome positioning in a similar way (Figs. 4 and 5), the chromosome segregation defect in SUN1-depleted cells is most likely not caused by a centrosome position defect but probably by another defect caused by SUN1 depletion.

      Major comments

      1. Figure 1 is insufficiently explained. The authors have to describe in an understandable way how they measured centrosome-centrosome angle and centrosome-nucleus angle. They should show a cartoon in which these angles are clearly shown. The small cartoons in Fig. 1C are not helpful at all; they are also not explained. The authors should explain the meaning of the black dots (are these centrosomes?) and the even smaller dots. The short nuclear axis should be indicated, e.g., by a red line.
      2. On the first page of the manuscript: "Consequently, at the NEP, centrosomes are positioned on the shortest nuclear axis (Fig. 1C) as can be seen in Fig. 1A. This means that the centrosome-nucleus angle relative to the shortest nuclear axis should be 0. However, in Fig. 1C, this angle is between 45 and 90 degrees. This is also the case for Fig. 1D. Please clarify.
      3. I find it confusing that in Fig. 1, depending on the subfigure, the short or longest nuclear axis is used as a reference point: Fig. 1C: shortest; D: shortest; F: shortest; G: longest; I: shortest; J: longest. Thus, even within the same cell line, the reference point is changing. What is the rational for this variation?
      4. Fig. 4K, L, M: in figure, y-axis: "shortest nuclear axis". In legend: "relative to the longest nuclear axis". I guess the longest nuclear axis is correct. Same in Fig. 5D and E. Fig. 5C lacks the WT control.
      5. The cells in Fig. 5J are not comparable: one has a monopolar spindle, the other a bipolar. The authors need some other NE protein as a control to show that the reduction of dynein by DN-KASH is a specific defect and not a broad impact on the NE. The dynein data in Figs. 5J-L need to be extended to SUN1/2.
      6. The title of the paper is misleading: the authors do not provide any indication for a nuclear signal in prophase that determines centrosome positioning.

      Minor comment

      1. It would make sense to use the same time scale in Figs. 1A and B (either min.sec. or sec.) to allow direct comparison.
      2. 2nd section: Mitotic cell rounding "The authors state: Given that cancer cells failed... I would be careful with this generalization; only one cancer cell was used in this study.
      3. The authors say: "However, they did not place the centrosomes at the shortest nuclear axis (Figure 4K-M)." Centrosomes are still on the shortest nuclear axis but not as frequent as in control.
      4. The white color in Fig. 6B cannot be seen and needs to be changed to something else.
      5. The paper has neither line nor page numbers.

      Referees cross-commenting

      My comments are more or less reflected by the comments and concerns of reviewer 1 (only one cancer cell line; the role of the LINC complex). This reduced the impact of this manuscript that is certainly intresting and has novel aspects.

      Significance

      The manuscript analysis an early step in spindle assembly: the positioning of the two centrosomes on the NE. As such, the paper is interesting and important. They exclude cell rounding and lamin disassembly as mechanisms for centrosome positioning. The SUN1/2 and KASH data on centrosome positioning are convincing, and they provide a novel finding on the function of the LINC complex in centrosome positioning, probably via dynein recruitment to the NE. It remains unclear whether LINC recruits dynein directly or functions via one of the two known dynein/NE recruitment pathways. LINC-dynein at the NE binds centrosome microtubules and dynein pulls them towards the NE. However, how LINC-dynein spatially positions centrosomes relative to the short axis of the nucleus remains unclear (dynein uniformly decorates the NE (Fig. 5J)). The data on chromosome missegregation are not so clear because the defect only occurs in SUN1-depleted cells. Thus, this phenotype indicates most likly a function of SUN1 but not the LINC complex and is probably not related to centrosome positioning since all LINC components affect centrosome positioning. The paper falls short in explaining how parameters were measured and contains mistakes in the figures, as outlined above. The paper lacks a coherent story (a little bit on cancer, some negative data, LINC-dynein, but it stops on the surface).

      It will be relatively easy to improve some aspects of the manuscript (explaining the angles, correcting the figures: one week). Measuring dynein at the NE in SUN1/2-depleted cells is also easy to do (1-2 months). To get more mechanistic insides into how LINC-dynein positions centrosomes probably will not be possible during revision time.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2023-02111

      Corresponding author(s): Moira O’Bryan

      1. General Statements

      We thank the Review Commons editor and the three reviewers for their overall positive responses in assessing this manuscript. Further, we appreciate and would like to reiterate the similarities across our three reviewers’ comments regarding the significance of this work, where our examination of epsilon tubulin (TUBE1) during mammalian spermatogenesis will be valuable for both microtubule/cytoskeletal and developmental/ reproductive fields. Below, we have made point-by-point responses to the reviewers’ comments, and outlined by the revisions we plan to make, or have made. All line numbers refer to the transferred manuscript file with tracked changes.

      2. Description of the planned revisions

      Reviewer 1:* The authors claim that because the TUBE1 knockout mouse have abnormal centrosome numbers during meiosis, there is a role for TUBE1 in suppressing supernumerary centriole formation. While this is one possibility, it's also possible that abnormal centrosome numbers arose as a result of cell division defects, especially because binucleate cells are present in mutants. The authors should edit the text to state that abnormal centrosome numbers may arise from either supernumerary centriole formation (by the templated or de novo pathways) or from failure to complete cell division. *

      *OPTIONAL: to test these possibilities, the authors may choose to 1) count the number of centrioles in meiosis with two different centriole markers 2) stain for markers of mature centrioles, such as Cep164, to determine the number of parental centrioles. *

      Response: This is a good point. Published data indicates that the Stra8-cre is active within a subset of undifferentiated spermatogonia, and in differentiated spermatogonia through to pre-leptotene spermatocytes (Sadate-Ngatchou et al., 2008). This raises the possibility that the increase in centriole numbers could be due to a failure to complete cell division if cre is active in mitotically active spermatogonia populations. The text has been appropriately modified in lines 207-209 and 352 to reflect these insights. We appreciate the Reviewer’s optional suggestion to perform additional immunolabeling experiments and intend to examine the number of parental centrioles in spermatocytes during meiotic division using a marker of the distal or subdistal appendages. This data will be included in the final revised document.

      Reviewer 2:* Considering the suggested non-canonical function of Epsilon tubulin outside the centriole in mice sperm, it is critical to know the localization of the protein in spermatocytes during meiosis and spermatids during differentiation. *

      Response: We agree with Reviewer 2 that determining the localization of TUBE1 in spermatocytes and spermatids would be desirable. However, we are yet to find an appropriate available antibody for this. We have previously assessed the specificity of a TUBE1 antibody (PA5-56917, Invitrogen), however, this antibody was not suitable for use in our mouse model. This aside, we have recently acquired a new TUBE1 antibody which we plan to evaluate its specificity during this revision period. If it appears to bind specifically to TUBE1, we will perform the requested localization experiments.

      For clarification we have previously defined the location of TUBE1 in spermatids to the manchette and basal body in elongating spermatids (lines 72-74) (Dunleavy et al., 2017). Unfortunately, the antibody used in this study is now discontinued. The phenotypes observed as a consequence of TUBE1 loss of function in this study are, however, consistent with these patterns of localization.

      Reviewer 2:* Localization of Epsilon tubulin is needed to distinguish between mutant sperm cells and those that are not Epsilon tubulin mutants in the Tube1GCKO/GCKO mice. E.g., are the 28.07% of Tube1GCKO/GCKO tubules that showed a Sertoli cell only (SCO) phenotype the one where all the cells are mutants? *

      Response: As per our response to Reviewer 2’s comment above, we plan to test a new TUBE1 antibody to determine TUBE1 localization in this model. Outlined in our response to Reviewer 2 below, we also plan to sequence DNA from mature epididymal sperm from our mutant mice to further confirm the deletion of Tube1 exon 3.

      Reviewer 2:* The generated conditional germ cell-specific mutants are demonstrated by mRNA expression spermatocytes. It would help if DNA sequencing, western, and Immunohistochemical staining were used to show the gene and protein are affected. *

      Response: We thank Reviewer 2 for their suggestions. Should we successfully validate an appropriate TUBE1 antibody for use in our model, we will perform immunohistochemical staining during the revision process. Our qPCR results from purified spermatocytes however, strongly suggest that the Tube1 gene is deleted in our model, noting that such purifications are on average 81% pure with the major contaminants being Sertoli cells and spermatids (Dunleavy et al., 2019). To further confirm the deletion of Tube1 exon 3, we plan to sequence DNA from mature epididymal sperm from our mutant mice.

      Reviewer 2:* "Suggesting a core TUBE1 function that can be supplemented by either z-tubulin or TUBD1." Can you test what happens to mice Z and D tubulin isoforms in the mutant? Did their level increase in the centrioles? This is informative since there is no clear centriolar phenotype (other than centriole number that may be due to cell division failure) in mice spermatogenesis and the paper's central hypothesis in the introduction. *

      Response: We appreciate this question by Reviewer 2. Zeta tubulin is not present in the mouse genome as outlined in our introduction (lines 38-39). We do acknowledge that exploring Tubd1 will be informative in our mutant and thereby plan to examine its expression in round spermatids.

      Reviewer 2: The authors looked at the Metaphase stage cells to assess meiosis. It would be more interesting to look at the meiosis prophase I. Since the Stra8 acts very early leptotene stage, it would be interesting to see if meiosis is defective from the very beginning. Also, some suggest that the manchette is nucleated at the pachytene stage. Is the manchette defective from the very early stage of nucleation?

      Response: We thank Reviewer 2 for this suggestion. To this end, we plan to examine juvenile mouse testes at days 10 and 17 post-partum where leptotene and pachytene spermatocytes are the most mature germ cells respectively.

      In regard to the Reviewer’s comment of the manchette being nucleated in pachytene stage spermatocytes, we acknowledge that the precise mechanism of manchette nucleation has not been confirmed. We are aware of the alternative hypothesis introduced by Moreno and Schatten (2000), which postulates manchette microtubules may be nucleated prior to pachytene period, through their examination of bovine male germ cells. This hasn’t, however, been supported by evidence and with more recent data, others have suggested that the manchette is nucleated at the centrosomal adjunct (Lehti and Sironen, 2016). Indeed, our unpublished data suggests this is the case (another study). Regardless, the origin of the microtubule seeds that ultimately extend to form the manchette is not relevant to the hypothesis we have proposed. As we note that in our manuscript and mouse model, manchettes appear to assemble normally in step 8 spermatids. Rather, their movement and disassembly is abnormal i.e. TUBE1 serves critical roles more manchette movement and disassembly rather than manchette formation.

      Reviewer 2:* Is the acetylation of manchette microtubules affected in the absence of TUBE1? *

      Response: Reviewer 2 raises an interesting question, which we plan to answer through immunolabeling of testis sections for acetylated tubulin in our control and mutant groups.

      Reviewer 3: *Minor points, a substantial percentage of sperm produced had a normal head shape in the KO (Figure 1I), which undermine the function of tube1 in nuclear shaping, the author should address this point in their manuscript. It is also curious whether there are phenotype in other tissues, can the authors comment on that? *

      Response: We thank Reviewer 3 for highlighting this point. As reported in Fig. 1I, 28.5% of sperm from Tube1GCKO/GCKO epididymides have abnormal nuclear shape. This is a 4.4-fold increase over that seen in wild type sperm. These data clearly highlight the role of TUBE1 in defining nuclear morphology. Variations between cells does not undermine this conclusion. It appears that prior to sperm release from the testis, the majority of TUBE1 null spermatids heads are abnormally shaped. However, in the epididymis there appears to be an increase in the proportion of normally shaped heads. We thus hypothesize that the high rates of spermiation failure in the TUBE1 null mice reflect the preferential removal of abnormally shaped sperm by Sertoli cells, thus enriching for normally shaped heads that are released. During the revision process, we will quantify the percentage of spermatids with normal versus abnormally shaped heads prior to spermiation in testis sections. All Tube1 null mice were sterile.

      To Reviewer 3’s second point - we have not examined other tissues in this conditional male germ cell knockout mouse model, as the cre used in this manuscript is only expressed in the testis (Sadate-Ngatchou et al., 2008). Consistent with the specificity of the deletion, null male mice are overtly healthy, with the exception of male fertility, and exhibit normal body weight as detailed on line 123 and in Fig S1D.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Reviewer 1:* In figure 5, based on quantification of fluorescence intensity, the authors conclude that loss of epsilon-tubulin results in an increase in the levels of KATNAL1, KATNAL2, and KATNB1. Given the inherent variability in immunofluorescence staining, the authors should at a minimum normalize their intensity measurements to those of an unrelated control protein stained in the same cell (ex: alpha-tubulin). It would be more convincing to quantify the levels of these proteins by Western blot (again, normalized to a control protein or to total cellular protein), which should be feasible given that the authors can isolate elongating spermatids. *

      Response: We thank Reviewer 1 for this suggestion to better account for any potential variability between immunofluorescence staining in cells. In this instance, alpha-tubulin would be a related protein in our model, making it unsuitable for normalization - the longer manchette phenotypes in our mutant spermatids indicate more tubulin present in mutant cells. We have therefore normalized the fluorescence intensity in our cells to DNA content (DAPI staining). This has provided comparable results to our initial analysis, and we have edited our text accordingly at lines 303, 307-310, 563-564, 845, 850 and Fig. 5. We respectfully disagree that western blotting would be informative, as the point is that katanin proteins are accumulating abnormally on the elongating sperm manchette. This does not necessarily mean that overall katanin levels will be increased. This aside, given the low numbers of elongating spermatids in the Tube1GCKO/GCKO mice, obtaining sufficient materials of western blotting is prohibitive. With the severity of germ cell loss indicated by our daily sperm production calculations, we predict the isolated spermatids of up to 5 Tube1GCKO/GCKO animals would be required to make up one biological replicate. It would not be feasible to collect the large number of animals required for at least three biological replicates in the revision timeframe.

      Reviewer 1:* A major claim of the paper is that epsilon-tubulin plays a different role within mammalian germ cells (abstract, line 22; p9, lines 167-168; p15 lines 315-316), because the Tube1GCKO/GCKO mice can form some sperm with relatively normal ciliary ultrastructure, whereas ciliates lacking epsilon-tubulin fail to form cilia. However, it's unclear whether the centrioles that templated these normal cilia were formed before or after epsilon-tubulin loss. Given that centrioles are inherited from one generation to the next, it's possible that the few normal cilia may be templated by relatively normal parental centrioles. These parental centrioles would have been present in spermatogonia prior to Cre expression/epsilon-tubulin deletion, and inherited by a fraction of sperm after the mitotic and meiotic divisions, resulting in sperm with normal ciliary ultrastructure. Other spermatocytes may have inherited centrioles formed in the absence of epsilon-tubulin, resulting in aberrant centrioles similar to those reported in human somatic cells, but these would not form any sperm flagella due to a loss of cell viability, as has been reported for acentriolar cells in a p53+ background. Underscoring this point, Chlamydomonas and human somatic mutant cells constitutively lack epsilon-tubulin. In these systems, the parental centrioles were diluted from the population over many cell divisions, and phenotypic analysis would only include the centrioles that formed in the absence of epsilon-tubulin. To make their major claim, the authors need to demonstrate that the basal bodies of sperm flagella with normal ultrastructure were formed in the absence of epsilon-tubulin, and were not normal parental centrioles. Given the difficulty of this experiment, the authors may instead choose to remove their claim that epsilon-tubulin plays a different role within mammalian germ cells. *

      Response: The authors thank Reviewer 1 for their detailed input regarding TUBE1’s centriolar importance across species. From their feedback, we recognize the need to modulate our interpretation of this result. We have also added a line to our manuscript highlighting that the normal axonemal structure observed may be due to the inheritance of normal centrioles (lines 328-329). We note however, that sperm produced within the null animals were immotile and that motility could not be recovered by the addition of exogenous ATP thus revealing that TUBE1 is required to form functional sperm tails.

      Reviewer 2:* It will help if the introduction summarizes the knowledge on Epsilon tubulin in spermatogenesis with emesis on its localization and the method used to find the localization. *

      Response: We have modified the introduction accordingly in lines 72-73.

      Reviewer 2:* How many independent mutant animals were studied, and what was the elfishness of generating mutants with a complete mutant testis? From Fig s1c, it appears all mutants generated were total mutations in almost all cells - is this correct? *

      Response: We have updated the number of animals studied as per the comment below. Regarding the mutant status of our mouse model, we used Stra8-Cre which is active between early (postnatal day 3) spermatogonia to pre-leptotene spermatocytes (Sadate-Ngatchou et al., 2008) thus all spermatocytes, spermatids, and sperm will carry the deletion. As shown in Fig. S1C we measured a 90.1% reduction in Tube1 mRNA expression from purified spermatocytes. As mentioned above, we note that the purified germ cells always contain a low percentage of contaminating cells. Using our optimized Staput method we obtain isolated germ cell populations of high purity, where in spermatocyte populations we calculate 19% contamination with other testicular cell types (e.g. somatic Sertoli/interstitial cells, spermatogonia, spermatids) (Dunleavy et al., 2019). We therefore believe the 9.9% Tube1 mRNA expression detected in our Tube1GCKO/GCKO group are the origin of that residual mRNA. We have included this information in the materials and methods section (lines 491-493).

      Reviewer 2:* Add a definition to "ZED-tubulins." *

      Response: A definition to the ZED-tubulins can be found on line 32.

      Reviewer 2:* From the paper, it is unclear if Epsilon tubulin is dispensable for centriole function only in sperm cells or if the same is true in mice somatic cells in vivo. *

      Response: In this study we have used a conditional male germ cell knockout mouse model to examine TUBE1’s function specifically in male germ cells. As mentioned in our introduction, the function of TUBE1 has not been examined in murine somatic cells in vivo (lines 68-70). To avoid confusion, we have reiterated this point in lines 356-358 of our discussion.

      Reviewer 2:* Fig. S1 and other figures: "n {greater than or equal to} 3 samples/genotype" - this is unclear - please indicate the number of independent animals tested. *

      Response: We have modified the figure legends accordingly in lines 11-13 and 33-35 of the transferred supplementary information file and lines 787-788 and 810-811 of the transferred manuscript file.

      Reviewer 2:* "suppressing supernumerary centriole formation" is this due to access centriole formation or failed mitosis? *

      Response: We acknowledge Reviewer 2’s comment is similar to the comment made by Reviewer 1 above and note we have modified the associated text in lines 207-209 in response to the above comment.

      Reviewer 2:* The KATNAL1, KATNAL2, and KATNB1 staining in Fig 5 show multiple foci in the nucleus. Are these foci-specific staining or nonspecific? It is surprising to see such a large complex. *

      Response: As outlined in the materials and methods and the Fig. 5 legend, Fig. 5 displays three-dimensional (3D) z-stack images of whole elongating spermatids presented as 2D maximum intensity projections. The katanin subunit staining is around the nucleus rather than inside of it, however the flattening of the image from 3D to 2D make the foci appear inside the nucleus. To clarify this, we have modified the Fig. 5 legend in lines 845 and 848.

      Reviewer 2:* How the staging of spermatids was performed needs to be explained in the method. *

      Response: We have included additional explanation the materials and methods section (lines 513-514).

      Reviewer 3: The experimental part is of the highest quality and the manuscript is very well written. My only reservation with the manuscript is concerning the model proposed for manchette migration in the Discussion section (Figure 6). I find the proposed model highly speculative and pre-mature, not supported enough by data, as even admitted by the authors (lines 415-427). Having it as a figure and concluding remark gives it too match weight, my suggestion would be to remove figure 6 and tone down the discussion.

      Response: The authors thank Reviewer 3 for their complimentary overview of our manuscript. We agree that some unanswered questions remain in our proposed model of manchette migration. This study has however, added several critical missing pieces. With respect, we prefer to keep Figure 6 in the manuscript as explaining manchette function to non-experts is very difficult without a visual aide. To ensure transparency with the audience that our model is indeed hypothetical, we have edited our discussion and Figure 6 legend to reflect this (lines 406, 417, 428, 435, 463, 860, 863, 869).

      4. Description of analyses that authors prefer not to carry out

      None

      References

      DUNLEAVY, J. E., GRAFFEO, M., WOZNIAK, K., O’CONNOR, A. E., MERRINER, D. J., NGUYEN, J., SCHITTENHELM, R. B., HOUSTON, B. J. & O’BRYAN, M. K. 2022. Male mammalian meiosis and spermiogenesis is critically dependent on the shared functions of the katanins KATNA1 and KATNAL1. bioRxiv, 2022.11.11.516072.

      DUNLEAVY, J. E. M., O’CONNOR, A. E. & O’BRYAN, M. K. 2019. An optimised STAPUT method for the purification of mouse spermatocyte and spermatid populations. Molecular Human Reproduction.

      DUNLEAVY, J. E. M., OKUDA, H., O’CONNOR, A. E., MERRINER, D. J., O’DONNELL, L., JAMSAI, D., BERGMANN, M. & O’BRYAN, M. K. 2017. Katanin-like 2 (KATNAL2) functions in multiple aspects of haploid male germ cell development in the mouse. PLOS Genetics, 13.

      LEHTI, M. S. & SIRONEN, A. 2016. Formation and function of the manchette and flagellum during spermatogenesis. Reproduction, 151__,__ R43-54.

      MORENO, R. D. & SCHATTEN, G. 2000. Microtubule configurations and post-translational alpha-tubulin modifications during mammalian spermatogenesis. Cell Motil Cytoskeleton, 46__,__ 235-46.

      SADATE-NGATCHOU, P. I., PAYNE, C. J., DEARTH, A. T. & BRAUN, R. E. 2008. Cre recombinase activity specific to postnatal, premeiotic male germ cells in transgenic mice. Genesis, 46__,__ 738-42.

    1. Author Response

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public Review):

      This manuscript by He et al. explores the molecular basis of the different stinging behaviors of two related anemones. The freshwater Nematostella which only stings when a food stimulus is presented with mechanical stimulation and the saltwater Exaiptasia which stings in response to mechanical stimuli. The authors had previously shown that Nematostella stinging is calcium-dependent and mediated by a voltage-gated calcium channel (VGCC) with very pronounced voltage-dependent inactivation, which gets removed upon hyperpolarization produced by taste receptors.

      In this manuscript, they show that Exaiptacia and Nematostella differing stinging behavior is near optimal, according to their ecological niche, and conforms to predictions from a Markov decision model.

      It is also shown that Exaiptacia stinging is also calcium-dependent, but the calcium channel responsible is much less inactivated at resting potential and can readily induce nematocyte discharge only in the presence of mechanical stimulation. To this end, the authors record calcium currents from Exaipacia nematocysts and discover that the VGCCs in this anemone are not strongly inactivated and thus are easily activated by mechanical stimuli-induced depolarization accounting for the different stinging behavior between species. The authors further explore the role of the auxiliary beta subunit in the modulation of VGCC inactivation and show that different n-terminal splice variants in Exaiptacia produce strong and weak voltage-dependent inactivation.

      The manuscript is clear and well-written and the conclusions are in general supported by the experiments and analysis. The findings are very relevant to increase our understanding of the molecular basis of non-neural behavior and its evolutionary basis. This manuscript should be of general interest to biologists as well as to more specialized fields such as ion channel biophysics and physiology.

      Some findings need to be clarified and perhaps additional experiments performed.

      1) The authors identify by sequencing that the Exaiptacia Cav is a P-type channel (cacna1a). However, the biophysical properties of the nematocyte channel are different from mammalian P-type channels. The cnidarian channel inactivation is exceedingly rapid and activation happens at relatively low voltages. These substantial differences should be mentioned and commented on.

      First, we thank Reviewer 1 for thoughtful and detail-oriented comments, as well as their shared appreciation for the molecular basis of unique behaviors. Indeed, Nematostella and rat CaV channels exhibit striking differences in inactivation (both fast and steady-state). We previously described this in Weir et al., 2020 and added additonal text to ensure that this result is clear.

      2) The currents from Nematostella in Figure 3d seem to be poorly voltage-clamped. Poor voltage-clamp is also evident in the sudden increase of conductance in Figure 3C and might contribute to incorrect estimation of voltage dependence of activation and if present in inactivation experiments, also to incorrect estimation of the inactivation voltage range. This problem should be reassessed with new data.

      Because it is necessary to use small-tipped pipettes to get recordings from small and technically challenging nematocytes, there is imperfect voltage clamp that is evident in the steep activation curves. This issue should have little effect on the inactivation curves determined with 1s pre-pulses because poor voltage control occurs transiently at the beginning of the pre-pulse. In our case, current is measured in response to a brief maximally activating pulse followed by a nearly 1s period. Thus, error should be minimal in inactivation curves if the test pulse is a maximally activating voltage. We ensured that these protocols are clearly described in the Methods to address this issue. In addition, we are confident in the described inactivation values because they are generally consistent with channel properties measured in a heterologous expression system in which we do not have this problem and see the same differences in inactivation (also see Weir et al., 2020).

      3) While co-expression of the mouse Cav channel with the beta1 isoform from Exaiptacia indeed shifts inactivation to more negative voltages, it does not recapitulate the phenotype of the more inactivated Ca-currents in nematocytes (compare Figures 4d and 5d). It should be explained if this might be due to the use of a mammalian alpha subunit. Related to this, did the authors clone the alpha subunit from Exaiptacia? Using this to characterize the effect of beta subunits on inactivation might be more accurate.

      While the cnidarian CaVβ subunits indeed shift inactivation consistent with native properties, we agree that using the Exaiptasia alpha subunit would be more accurate. We were unable to successfully clone and heterologously express this subunit, however, we did express all subunits from Nematostella and made chimeric channels in which alpha, alpha2d, or CaVβ were swapped between Nematostella and mammalian channels. These experiments demonstrated the requirement and sufficiency of the CaVβ subunit in altering inactivation (Weir et al., 2020). Furthermore, we were able to express CaVβ subunits from a variety of other cnidarians, all of which affected inactivation properties. Thus, we are confident in the conclusion that CaVβ subunits are major contributors to molecular tuning of cnidarian CaV channels. Future studies aim to incorporate describing properties of the alpha subunit from Exaiptasia and other cnidarians.

      4) The in situ shown in Figure 4b are difficult to follow for a non-expert in cnidarian anatomy. Some guidance should be provided to understand the structures. Also, for the left panels, is the larger panel the two-channel image? If so, blue would indicate co-localization of the two isoforms and there seems to be a red mark in the same nematocyte.

      We thank the reviewer for this important comment and have modified the figure to enhance visual guidance. We more clearly highlighted the nematocyte in the single and two-channel images and selected the clearest representative images. For additional reference, previous studies beautifully illustrate the unusual morphology of nematocytes, including the relative localization of the nematocyst and nucleus in the context of cnidarian tissues (Babonis and Martindale, 2017).

      Reviewer #2 (Public Review):

      This manuscript links the distinctive stinging behavior of sea anemones in different ecological niches to varying inactivation properties of voltage-gated calcium channels that are conferred by the identity of auxiliary Cavbeta subunits. Previous work from the Bellono lab established that the burrowing anemone, Nematostella vectensis, expresses a CaV channel that is strongly inactivated at rest which requires a simultaneous delivery of prey extract and touch to elicit a stinging response, reflecting a precise stinging control adapted for predation. They show here that by contrast, the anemone Exaiptasia diaphana which inhabits exposed environments, indiscriminately stings for defense even in the absence of prey chemicals, and that this is enabled by the expression of a CaVbeta splice variant that confers weak inactivation. They further use the heterologous expression of CaV channels with wild type and chimeric anemone Cavbeta subunits to infer that the variable N-termini are important determinants of Cav channel inactivation properties.

      1) The authors found that Exaiptasia nematocytes could be characterized by two distinct inactivation phenotypes: (1) nematocytes with low-voltage threshold inactivation similar to that of Nematostella (Vi1/2 = ~ -85mV); and (2) a distinct population with weak, high-voltage threshold inactivation (Vi1/2 = ~ -48mV). What were the relative fractions of low-voltage and high-voltage nematocytes? Do the low-voltage Exaiptasia nematocytes behave similarly to Nematostella nematocytes with respect to requiring both prey extract and touch to discharge?

      We thank Reviewer 2 for thoughtful comments and questions. Nematocyte patch clamp is technically challenging due to small size, large nematocyst, and, notably, the explosive discharge involved in stinging! Therefore, we only patch clamped a small number of cells. Despite this limitation, we were able to observe two distinct nematocyte populations based on physiological properties. Yet, we did not observe a correlation with morphology and cannot make broad comments on relative fractions. Because morphology was generally similar and Exaiptasia nematocytes discharge even from touch alone, it remains unclear whether the low-voltage population behaves similarly to Nematostella nematocytes that only discharge in response to chemicals and touch. Future in vivo approaches could be used to address this question.

      2) The authors state in Fig 3 legend and in the results that Exaiptasia nematocyte voltage-gated Ca2+ currents have weak inactivation compared with Nematostella. This description is imprecise and inaccurate. Figure 3 in fact shows that Exaiptasia nematocyte voltage-gated Ca2+ currents display a faster rate of inactivation compared to Nematostella Ca2+ currents. A sub-population of Exaiptasia nematocytes does display less resting state (or steady-state) inactivation compared to Nematostella Ca2+ currents. The authors need to be more accurate and qualify what type of inactivation property they are talking about.'

      We thank Reviewer 2 for this attention to detail and have defined this phrasing early in the text.

      3) In a similar vein, the authors need to be more accurate when referring to 'rat beta' used in heterologous expression experiments. It should be made explicit throughout the manuscript that the rat beta isoform used is rat beta2a. Among the distinct beta isoforms, beta2a is unique in being palmitoylated at the N-terminus which confers a characteristic slow rate of inactivation and a right-shifted voltage-dependence of steady-state inactivation consistent with the data shown in Fig. 4D. Almost all other rat beta isoforms do not have these properties.

      We used the rat CaVβ2a for comparison because it shares the highest homology with Nematostella CaVβ (Weir et al., 2020). We have now more clearly defined the rat subunit in the text and legends.

      4) The profiling of the impact of different Cnidarian Cavbeta subunits on reconstituted Ca2+ channel current waveforms is nice (Fig 5 and Fig 5S1). The N-terminus sequence of EdCaVβ2 is different from palmitoylated rat beta2a, though both have similar properties in showing slow inactivation and a right-shifted voltage-dependence of steady-state inactivation. Does EdCaVβ2 target autonomously the plasma membrane when expressed in cells? If so, this would reconcile with what was previously known and provide a rational explanation for the observed functional impact of the distinct Cavbetas.

      As far as we understand the question, our data support that Exaiptasia CaVβ2 targets the plasma membrane for a number of reasons: 1) Expressing Exaiptasia CaVβ2 produces consistent properties in comparison with other CaVβs, suggesting a homogenous population of channel complexes; 2) Distinct cnidarian-Exaiptasia CaVβ2 chimeras produce distinct and internally consistent properties; and 3) Expressing P/Q-type CaV alpha + alpha2d subunits without CaVβ in cell lines does not produce robust measurable voltage-gated currents. We further tested this in our case and found the same result: at an equivalent maximally activating step using the same protocol, we measured 458.68 ± 179.88pA average current amplitude for +Exaiptasia CaVβ2 (n = 6) and 43.03 ± 17.64pA average current amplitude for -CaVβ2 (n = 4).

      Reviewer #3 (Public Review):

      Summary:

      The present article attempts to answer both the ultimate question of why different stinging behaviours have evolved in Cnidiarians with different ecological niches and shed light on the proximate question of which electro-physiological mechanisms underlie these distinct behaviours.

      Account of major methods and results:

      In the first part of the paper, the authors try to answer the ultimate question of why distinct dependencies of the sting response on internal starvation levels have evolved. The premise of the article that Exaiptasia's nematocyte discharge is independent of the presence of prey (Artemia nauplii) as compared to Nematostella's significant dependence of the discharge on the presence of actual prey, is shown be a robust phenomenon justified by the data in Figure 1.

      The hypothesis that defensive vs. predatory stinging leads to different nematocyte discharge behaviours is analysed in mathematical models based on the suitable framework of optimal control/decision theory. By assuming functional relations between the:

      1) cost of a full nematocyte discharge and the starvation level.

      2) probability of successful predation/avoidance on the discharge level.

      3) desirability/reward of the reached nutritional state.

      Based on these assumptions of environmental and internal influences, the optimal choice of attack intensity is calculated using Bellman's equation for this problem. The model predictions are validated using counted nematocytes on a coverslip. The scaling of normalised nematocyte discharge numbers with scaled starvation time is qualitatively comparable to what is predicted from the models. The abundance of nematocytes in the tentacles was, on the other hand, independent of the starvation state of the animals.

      Next, the authors turn to investigate the proximate cause of the differential stinging behaviour. The authors have previously reported convincing evidence that a strongly inactivating Cav2.1 channel ortholog (nCav) is used by Nematostella to prevent stinging in the absence of prey (Weir et al. 2020). This inactivation is released by hyperpolarising sensory inputs signalling the presence of prey. In this article, it is clearly shown by blocking respective currents that Exaiptasia, too, relies on extracellular Ca2+ influx to initiate stinging. Patch clamp data of the involved currents is provided in support. However, the authors find that in addition to the nCav with a low-inactivation threshold, Exaiptasia has a splice variant with a higher inactivation threshold expressed (Figure 3D).

      The authors hypothesise that it is this high-threshold nCav channel population that amplifies any voltage depolarisation to release a sting irrespective of the presence of prey signals. They found that the β subunit that is responsible for Nematostella's unusually low inactivation threshold exists in Exaiptasia as two alternative splice isoforms. These N-terminus variants also showed the greatest variation in a phylogenetic comparison (Figure 5), rendering it a candidate target for mutations causing variation in stinging responses.

      Appraisal of methodology in support of the conclusions:

      The authors base their inference on a normative model that yields quantitative predictions which is an exciting and challenging approach. The authors take care in stating the model assumptions as well as showing that the data indeed does not contradict their model predictions. The interesting comparative nature of the modelling part of the study is complicated by slightly different cost assumptions for the two scenarios. Hence, Figure 2 needs to be carefully digested by readers.

      We thank the reviewer for their careful revision of our work and excellent comments. We simplified Figure 2 considerably to make it easier to digest. We now compare the stinging response for predation vs defense under the same exact definition of cost per nematocyte for both models. You can find examples 1 and 2 in Figure 2 and examples 3 and 4 in Supplementary Figure 3 (see response below).

      It would be even more prudent to analyse the same set of cost-of-discharge vs. starvation scenarios for both species. Specifically, for Nematostella the complete cost-of-discharge vs starvation-state curves as for Exaiptasia (Figure 2E, example 2-4) could be used. It is likely that the differential effect size of Nematostella and Exaiptasia behaviour is the strongest if only the flat cost-of-discharge vs starvation is used (Figure 2A) for Nematostella. But as a worst-case comparison the other curves, where the cost to the animal scales with starvation would be a good comparison. This could help the reader to understand when the different prediction of Nematostella's behaviour breaks down. In addition, this minor change could shed light on broader topics like common trade-offs in pursuit predation.

      The results hold even when the cost increases moderately with starvation: Figure 2 now shows results with the same cost for predatory and defensive stinging (cost defined in Figure 2A, former examples 1 and 4). Predatory stinging robustly increases with starvation and defensive stinging remains constant or decreases. Interestingly, the fit between theory and data for both anemones improves by using the increasing cost (open circles in Figure 2E right). For other choices of increasing cost functions, defensive stinging will always decrease, and even more so if the cost increases dramatically (like for the former Examples 2 and 3). In contrast, predatory stinging will switch behavior if the cost increases too much with starvation (results with former Examples 2 and 3, now in Supplementary Figure 3 and theoretical arguments in Supplementary Information). Note however that these assumptions are less realistic because they necessitate that the cost of stinging for well-fed animals is negligible with respect to the cost for starved animals. A formal proof of the asymptotic solution for predatory stinging with varying cost is beyond the scope of this work and is subject of ongoing work where we consider implications for Markov Decision Processes in continuous space state.

      The qualitatively similar scaling of the model-derived relation between starvation and sting intensity with the counted nematocytes for different feeding pauses is evidence that feeding has indeed been optimised for the two distinct ecological niches. To prove that Exaiptasia uses a similar Ca2+ channel ortholog as well as a different splice variant, the authors employed both clean electrophysiological characterisaiton (Figure 3) as well as transcriptomics data (Figure 4S1).

      To strengthen the authors' hypothesis that variation in the N-termini leads to changes in Ca2+ channel inactivation and hence altered stinging, the response sequence variability of 6 Cnidaria was analysed.

      Additional context:

      Although, the present article focuses on nematocytes alone, currently, there has been a refocus in neurobiology on the nervous systems of more basal metazoans, which received much attention already in the works of Romanes (1885). In part, this is driven by the goal to understand the early evolution of nervous systems. Cnidarians and Ctenophors are exciting model organisms in this venture. This will hopefully be accompanied by more comparative studies like the present one. Some of the recent literature also uses computational models to understand mechanisms of motor behaviour using full-body simulations (Pallasdies et al. 2019; Wang et al. 2023), which can be thought of as complementary to the normative modelling provided by the authors.

      Comparative studies of recent Cnidarians, such as the present article, can shed light on speculative ideas on the origin of nervous systems (Jékely, Keijzer, and Godfrey-Smith 2015). During a time (the Ediacarium/Cambrium transition) that has seen the genesis of complex trophic foodwebs with preditor-prey interaction, symbioses, but also an increase of body sizes and shapes, multiple ultimate causes can be envisioned that drove the increase in behavioural complexity. The authors show that not all of it needs to be implemented in dedicated nerve cells.

      References:

      Jékely, Gáspár, Fred Keijzer, and Peter Godfrey-Smith. 2015. "An Option Space for Early Neural Evolution." Philosophical Transactions of the Royal Society B: Biological Sciences 370 (December): 20150181. https://doi.org/10.1098/rstb.2015.0181.

      Pallasdies, Fabian, Sven Goedeke, Wilhelm Braun, and Raoul-Martin Memmesheimer. 2019. "From Single Neurons to Behavior in the Jellyfish Aurelia Aurita." eLife 8 (December). https://doi.org/10.7554/elife.50084.

      Romanes, G. J. 1885. Jelly-Fish, Star-Fish and Sea-Urchins: Being a Research on Primitive Nervous Systems. Appleton.

      Wang, Hengji, Joshua Swore, Shashank Sharma, John R. Szymanski, Rafael Yuste, Thomas L. Daniel, Michael Regnier, Martha M. Bosma, and Adrienne L. Fairhall. 2023. "A Complete Biomechanical Model of hydra Contractile Behaviors, from Neural Drive to Muscle to Movement." Proceedings of the National Academy of Sciences 120 (March). https://doi.org/10.1073/pnas.2210439120.

      Weir, Keiko, Christophe Dupre, Lena van Giesen, Amy S-Y Lee, and Nicholas W Bellono. 2020. "A Molecular Filter for the Cnidarian Stinging Response." eLife 9 (May). https://doi.org/10.7554/elife.57578.

      We appreciate the excellent suggestion to further discuss non-neuronal adaptations in the context of studying the evolution of behavior. We have added additional text to the Discussion to cover this interesting field.

    2. Reviewer #3 (Public Review):

      Summary:<br /> The present article attempts to answer both the ultimate question of why different stinging behaviours have evolved in Cnidiarians with different ecological niches and shed light on the proximate question of which electro-physiological mechanisms underlie these distinct behaviours.

      Account of major methods and results:<br /> In the first part of the paper, the authors try to answer the ultimate question of why distinct dependencies of the sting response on internal starvation levels have evolved. The premise of the article that Exaiptasia's nematocyte discharge is independent of the presence of prey (Artemia nauplii) as compared to Nematostella's significant dependence of the discharge on the presence of actual prey, is shown to be a robust phenomenon justified by the data in Figure 1.

      The hypothesis that defensive vs. predatory stinging leads to different nematocyte discharge behaviours is analysed in mathematical models based on the suitable framework of optimal control/decision theory. By assuming functional relations between the:<br /> 1) cost of a full nematocyte discharge and the starvation level.<br /> 2) probability of successful predation/avoidance on the discharge level.<br /> 3) desirability/reward of the reached nutritional state.

      Based on these assumptions of environmental and internal influences, the optimal choice of attack intensity is calculated using Bellman's equation for this problem. The model predictions are validated using counted nematocytes on a coverslip. The scaling of normalised nematocyte discharge numbers with scaled starvation time is qualitatively comparable to what is predicted from the models. The abundance of nematocytes in the tentacles was, on the other hand, independent of the starvation state of the animals.

      Next, the authors turn to investigate the proximate cause of the differential stinging behaviour. The authors have previously reported convincing evidence that a strongly inactivating Cav2.1 channel ortholog (nCav) is used by Nematostella to prevent stinging in the absence of prey (Weir et al. 2020). This inactivation is released by hyperpolarising sensory inputs signalling the presence of prey. In this article, it is clearly shown by blocking respective currents that Exaiptasia, too, relies on extracellular Ca2+ influx to initiate stinging. Patch clamp data of the involved currents is provided in support. However, the authors find that in addition to the nCav with a low-inactivation threshold, Exaiptasia has a splice variant with a higher inactivation threshold expressed (Figure 3D).

      The authors hypothesise that it is this high-threshold nCav channel population that amplifies any voltage depolarisation to release a sting irrespective of the presence of prey signals. They found that the β subunit that is responsible for Nematostella's unusually low inactivation threshold exists in Exaiptasia as two alternative splice isoforms. These N-terminus variants also showed the greatest variation in a phylogenetic comparison (Figure 5), rendering it a candidate target for mutations causing variation in stinging responses.

      Appraisal of methodology in support of the conclusions:<br /> The authors base their inference on a normative model that yields quantitative predictions which is an exciting and challenging approach. The authors take care in stating the model assumptions as well as showing that the data indeed does not contradict their model predictions. The interesting comparative nature of the modelling part of the study is complicated by slightly different cost assumptions for the two scenarios. Hence, Figure 2 needs to be carefully digested by readers.

      It would be even more prudent to analyse the same set of cost-of-discharge vs. starvation scenarios for both species. Specifically, for Nematostella the complete cost-of-discharge vs starvation-state curves as for Exaiptasia (Figure 2E, example 2-4) could be used. It is likely that the differential effect size of Nematostella and Exaiptasia behaviour is the strongest if only the flat cost-of-discharge vs starvation is used (Figure 2A) for Nematostella. But as a worst-case comparison the other curves, where the cost to the animal scales with starvation would be a good comparison. This could help the reader to understand when the different prediction of Nematostella's behaviour breaks down. In addition, this minor change could shed light on broader topics like common trade-offs in pursuit predation.

      The qualitatively similar scaling of the model-derived relation between starvation and sting intensity with the counted nematocytes for different feeding pauses is evidence that feeding has indeed been optimised for the two distinct ecological niches.<br /> To prove that Exaiptasia uses a similar Ca2+ channel ortholog as well as a different splice variant, the authors employed both clean electrophysiological characterisation (Figure 3) as well as transcriptomics data (Figure 4S1).

      To strengthen the authors' hypothesis that variation in the N-termini leads to changes in Ca2+ channel inactivation and hence altered stinging, the response sequence variability of 6 Cnidaria was analysed.

      Additional context:<br /> Although, the present article focuses on nematocytes alone, currently, there has been a refocus in neurobiology on the nervous systems of more basal metazoans, which received much attention already in the works of Romanes (1885). In part, this is driven by the goal to understand the early evolution of nervous systems. Cnidarians and Ctenophors are exciting model organisms in this venture. This will hopefully be accompanied by more comparative studies like the present one. Some of the recent literature also uses computational models to understand mechanisms of motor behaviour using full-body simulations (Pallasdies et al. 2019; Wang et al. 2023), which can be thought of as complementary to the normative modelling provided by the authors.

      Comparative studies of recent Cnidarians, such as the present article, can shed light on speculative ideas on the origin of nervous systems (Jékely, Keijzer, and Godfrey-Smith 2015). During a time (the Ediacarium/Cambrium transition) that has seen the genesis of complex trophic foodwebs with preditor-prey interaction, symbioses, but also an increase of body sizes and shapes, multiple ultimate causes can be envisioned that drove the increase in behavioural complexity. The authors show that not all of it needs to be implemented in dedicated nerve cells.

      References:

      Jékely, Gáspár, Fred Keijzer, and Peter Godfrey-Smith. 2015. "An Option Space for Early Neural Evolution." Philosophical Transactions of the Royal Society B: Biological Sciences 370 (December): 20150181. https://doi.org/10.1098/rstb.2015.0181.

      Pallasdies, Fabian, Sven Goedeke, Wilhelm Braun, and Raoul-Martin Memmesheimer. 2019. "From Single Neurons to Behavior in the Jellyfish Aurelia Aurita." eLife 8 (December). https://doi.org/10.7554/elife.50084.

      Romanes, G. J. 1885. Jelly-Fish, Star-Fish and Sea-Urchins: Being a Research on Primitive Nervous Systems. Appleton.

      Wang, Hengji, Joshua Swore, Shashank Sharma, John R. Szymanski, Rafael Yuste, Thomas L. Daniel, Michael Regnier, Martha M. Bosma, and Adrienne L. Fairhall. 2023. "A Complete Biomechanical Model of hydra Contractile Behaviors, from Neural Drive to Muscle to Movement." Proceedings of the National Academy of Sciences 120 (March). https://doi.org/10.1073/pnas.2210439120.

      Weir, Keiko, Christophe Dupre, Lena van Giesen, Amy S-Y Lee, and Nicholas W Bellono. 2020. "A Molecular Filter for the Cnidarian Stinging Response." eLife 9 (May). https://doi.org/10.7554/elife.57578.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer 1 major comments:

      The authors show one configuration of the E1-E2 heterodimer in Figure 4d. As shown, the E1 protein is exterior to the E2 protein and would suggest E1 is on the surface on the spike complex and virus surface. However, another configuration of the glycoproteins has E2 on the exterior of E1 and also on the exterior of the virus. The latter conformation is what has been observed in cryoEM studies of alphaviruses. The first configuration represents the E1-E2 between the three heterodimers which are important for spike assembly. The reason the orientation of the E2-E1 dimer is important is the authors speculate on the importance of the 6 CHIK residues not found in ONNV based on the structure, but the structural interpretation is, in my opinion, not correct.

      We thank reviewer 1 for pointing out the correct E2-E1 heterodimer configuration. To address this, we corrected the position of E2 and E1 in Figure 4 based on previous cryoEM study1, keeping E2 always on the exterior in the E2-E1 heterodimer. We also replaced the Indian Ocean Lineage (IOL) E2-E1 structure1 in the original Figure 4 with the CHIKV 181/clone 25 structure which was recently analyzed by Katherine Basore et al.2. In a single E2-E1 heterodimer, all six unique CHIKV positive selection sites are located on the outside of the structure after correcting the configuration. In addition, we investigated two of the unique CHIKV positively selected sites that are important for virion production, E2-V135 (V460 in the original manuscript version) and E1-V220 (V1029 in the original manuscript version), in trimerized structure of E2-E1 heterodimers. We found that the E2-V135 and E1-V220 residues in one heterodimer are facing E2 of the neighboring heterodimer on either side. Interestingly, while V135 is embedded between the E2 proteins of two different heterodimers, E1-V220 is partially embedded by E1 and the neighboring E2 and partially exposed to the outside. This suggests that even though both E2-V135 and E1-V220 might be crucial for CHIKV E2-E1 trimerization, E1-V220 provides an additional docking site for host factor interactions. We thank review 1 again for this important comment leading to these new findings. We have updated Figure 4F-4G and the corresponding result section (lines 201-209) in this partially revised manuscript.

      1. Validation of E1 interaction with SPSC3 and eIF3k needs to be stronger. Some concerns/questions are listed below. A myc tag was inserted between E3 and E2. How efficiently does furin cleave E3 from E2 in this virus and how are viral titers of the myc-tagged virus compared to the non-tagged virus? I ask because is the IP looking at what is being pulled down by E2 or E3-myc-E2 that could be part of the spike polyprotein? The authors found E2 interacts with E3, E1 and a list of other host proteins. These results suggest several interactions including E2-host factor, E2-E1, E2-E3, E2-E1-host factor, E2-E3-E1, E2-E3-host factor. In figure 6d, and the subsequent conclusions, the authors suggest E1 is interacting with the host factor and do not see E2 alone and very low amounts of E3-E2-6K-E1. based on how the IP was performed I am not sure how an interaction between E1 and SPCS3 alone, without E2, would be detected. I would also like to see a reciprocal pull down using E1 and also E2 to see if these host factors are pulled down.

      We thank the reviewer for these concerns. Given the low viral protein expression in macrophages (Figure 1A), we need an efficient system to enrich for large amounts of CHIKV glycoproteins for identifying host interactors through mass spectrometry. Adding tag/reporter proteins, such as mCherry, between E3 and E2 have been used to label alphavirus glycoproteins in previous study2, which is why we chose to use this myc tag labeling strategy coupled with myc Ab-conjugated agarose beads for AP-MS. However, like reviewer 1 speculated, inserting myc tag between E3 and E2 does attenuate CHIKV infectivity according to the reduced supernatant viral titers of 293T cells transfected with CHIKV/myc-E2 genomic RNA in comparison to those of cells transfected with unmodified CHIKV vaccine strain 181/clone 25 genomic RNA (shown in revision plan). Despite the attenuation, CHIKV/myc-E2 harvested from transfected 293T cells still reaches a titer over 108 pfu/ml, which allowed us to identify interactors by AP-MS.

      We further analyzed the cleavage efficiency of glycoproteins by comparing the expression levels of E3-E2-6K -E1, E3-E2 (p62), E2, and E3 in 293T cells transfected with unmodified CHIKV or CHIKV/myc-E2 genomic RNA (result shown in revision plan). We didn’t detect any uncleaved forms of glycoproteins in cells transfected with either unmodified CHIKV or CHIKV/myc-E2 RNA when we probed with E2 antibody. However, probing with E3 antibody prior to longer exposure of the immunoblot showed higher E3-E2-6k-E1 and E3-E2 (p62) levels in cells transfected with CHIKV/myc-E2 RNA, suggesting that both mature E2 and E2-containing precursor polyproteins are available to be pulled down. Overall, the expression levels of mature E2 detected by E2 antibody are similar.

      We thank reviewer 1 for providing a thorough dissection of all the possible interactions between the identified host factors and cleaved/uncleaved glycoproteins. This is a very interesting question. As reviewer 1 mentioned that E1 usually appears with E2 or E3-E2 in heterodimer forms, we were also surprised to find that E2 does not interact with either of the two host factors. To address this, we plan to conjugate E2 and E1 to protein A/G beads, respectively, for a reciprocal pulldown to validate CHIKV glycoprotein interactions with SPCS3 and eIF3k. Results from this experiment will be included in the fully revised manuscript.

      1. If CHIK E1 is interacting with the host factors and that is antagonizing the antiviral response of SPSC3 (as one example), then what do pull downs using ONNV structural proteins look like? One would expect reduced interactions because the different amino acid causes a different E2-E1 dimer or attenuates the E1-host factor binding site.

      We thank Reviewer 1 for this insightful suggestion. We agree that it would be informative to examine the interactions between ONNV glycoproteins and identified host factors (SPCS3 and eIF3k). Unfortunately, there is no commercial ONNV glycoprotein antibody available making this experiment unfeasible. Interestingly, we did observe reduced interactions between the host factors SPCS3 and eIF3k and the CHIKV E1-V220I mutant (V1029I in original manuscript version) where the positively selected site in E1 was mutated to the homologous ONNV residue (please refer to our response to Reviewer 3’s major comment #1). This result suggests that the ONNV glycoproteins likely have an attenuated E1-host factor binding site as the reviewer speculated.We have included this as Figure 7A in partially revised manuscript.

      1. E1 and E2 are thought to interact during polyprotein translation and the initial dimer forms in the ER. If E1 is interacting with SPSC3 in the ER, is E2 also present? Or is a population of E1 not interacting with E2 in order to inhibit SPSC3? I would love a model of how the authors see all these factors coming together for this new role of E1.

      We thank Reviewer 1 for proposing this interesting hypothesis. Given the unexpected absence of E2 in our validation of host factor-E1 pulldown, we speculate that a group of free E1 proteins with distinct function is interfering with host factors in the ER, which is a model worth further investigation and discussion. A great example of this is the alphavirus nonstructural protein 3 (nsP3) that plays essential roles in RNA replication, although depending on the alphavirus not all of the nsP3 in the cell colocalizes with dsRNA, suggesting there is a separate distinct pool of nsP3 outside of active viral replication complex that interacts with host factors in these observed larger cytoplasmic aggregates3. To address this, we plan to use laser confocal microscopy to observe the interactions between host factors (SPCS3, eIF3k), and CHIKV E2 and E1. We will include this result as well as our proposed model in the fully revised manuscript.

      Reviewer 1 minor comments:

      1. In Figure 1c, (-) RNA is shown but in the rest of the figures (+) RNA is shown. Show both or select one. I do find it interesting the (-) RNA levels are similar over time, even at 4 hours post transfection (early time). Related to this, ONNV has higher levels of (-) RNA but what is known about structural protein levels in ONNV and CHIK in macrophages? Are there comparable levels of CP and GP being produced?

      We thank Reviewer 1 for this comment. The (-) RNA is synthesized before the synthesis of subgenomic mRNA and therefore can reflect more accurately early viral replication and nonstructural protein functions. This is the reason why we consider the (-) RNA levels evaluated by specific nsP1 TaqMan probes to be more appropriate for determining early stage differences between ONNV and CHIKV replication in Figure 1 as the goal of that figure is to define the steps in CHIKV life cycle that are more efficient than those of ONNV in THP-1 derived macrophages. On the other hand, the (+) RNA evaluated by E1 primers that we used in the later figures monitors viral RNA synthesis over time in the reflection of genomic (+) RNA and subgenomic mRNA transcribed from (-) RNA templates. Similar levels of (+) RNA and contrasting virion titers really point the difference to the later stages of subgenomic mRNA translation, viral glycoprotein secretion, and assembly.

      We have generated ONNV/myc-E2 reporter virus and assessed viral glycoprotein expression through flow cytometry using a FITC -conjugated anti-myc antibody in the THP-1 derived macrophages transfected with CHIKV/myc-E2 and ONNV/myc-E2 (shown in revision plan). The results show that the expression of ONNV glycoproteins is more inhibited than that of CHIKV glycoproteins, though both of their expression levels in macrophages seem to be suppressed. Since there is no commercial ONNV antibody available, we were unable to compare capsid expression levels between the two viruses. Overall, differences in the myc-tagged glycoprotein expression levels of the two viruses reveals ONNV defect in either structural protein translation or glycoprotein maturation .

      1. Figure 2e and figure 3 have ONNV has the first bar followed by CHIK. In figure 1 and 2b, CHIK is first and then ONNV. helps the reader to have the controls in the same order.

      We thank Reviewer 1 for this suggestion. We have changed the order of ONNV and CHIKV bars in figure 2E and figure3 so the CHIKV bar consistently comes first in all the figures.

      1. Line 143-145 the authors discuss that when ONNV is the backbone and CHIK proteins are inserted the infection is more attenuated because of the E2 and E1 are from CHIK and ONNV, not the same virus (could also be E2-CP interactions are disrupted). However the chimeras made with the CHIK backbone (in Figure 2) have a mismatch between E2 and E1 as well.

      We thank Reviewer 1 for this informative comment. We agree that the incompatible E2-E1 heterodimer formation may not be the only reason that causes attenuation of ONNV/CHIKV E1 and ONNV/CHIKV E2. There may be multiple factors contributing to the fitness of the chimeras, which requires more in-depth mechanistic investigations and is out of the scope of this study. We have now removed the explanation “potentially due to incompatible heterodimer formation between ONNV E2 and CHIKV E1” in line 144.

      1. When discussing the residues that were found in the FEL and MEME analysis, the authors start the amino acid numbering from CP and continue along the polyprotein. Usually when discussing amino acids in the structural proteins, each protein starts at amino acid 1. So V460 would be E2-V135. It would also be useful to know what the residues in ONNV were at these positions to see if amino acids changed in charge, size, bond forming potential, etc. Showing these residues in the E2-E1 conformation found in the virion would also allow one to find adjacent residues that could explain differences in spike assembly and potentially where/how E1 is binding to a host protein.

      We thank Reviewer 1 for this comment. We revised the amino acid numbers in the manuscript to start from the beginning of each structural protein. To look more into these residues in ONNV, we aligned CHIKV and ONNV from different lineages and compared the 6 positively selected sites (refer to our response to Reviewer 1’s minor comment #5). We found that E2-135 and E1-220 which are essential for CHIKV production are valines in all the aligned CHIKV strains. For the aligned ONNV strains, E2-135 are all leucines and E1-220 are all isoleucines. While valine, leucine and isoleucine are all amino acids with hydrophobic side chains, valine has the shortest side chain. The length of the side chains may lead to different hydrophobic properties that affect protein folding, which warrants further structural analysis.

      1. How effective is a non-attenuated CHIK strain in infecting macrophages? Could you make a SINV-La Reunion chimeric virus (which is BSL2) to see if a higher percentage of macrophages are infected and is this potentially contributing to the increased pathogenesis of La Reunion? Also how different is 181/25 with a pathogenic strain in the E2 and E1 residues? and compared to ONNV?

      We thank Reviewer 1 for this question, which is also raised by Reviewer 2. In order to address this question, we plan to use the virulent CHIKV La Reunion strain to study the infection of THP-1 derived macrophages with non-attenuated CHIKV in BSL-3. We are getting trained in the BSL-3 facility and will soon be certified.

      We thank Reviewer 1 for this insightful suggestion on investigating the conservation of these positively selected sites in different strains. We have aligned the sequences of ONNV and CHIKV strains from different lineages, including CHIKV vaccine strain 181/clone 25 and Thai strain AF15561 (the parental strain of CHIKV 181/clone 25) (alignment shown in revision plan). We found that the two positively selected sites with negative effects on virion production, E2-135 and E1-220 (sites 460 and 1029 in original manuscript version), are very conserved in either CHIKV or ONNV strains. CHIKV E2-135 is always valine (V) regardless of the lineages, while ONNV E2-135 is always leucine (L). CHIKV E1-220 is always V, while ONNV E1-220 is always isoleucine (I).

      We also analyzed the amino acid heterogeneity of E2-135 and E1-220 in 397 CHIKV patient sequences from NCBI Virus database. Most of the amino acids at these 2 sites are V. The counts of each amino acid at E2-135 and E1-220 is summarized in table below. This result suggests that valine residues at E2-135 and E1-220 are crucial for CHIKV fitness and strongly selected during viral evolution. The sequence alignment and table will be included and discussed in the fully revised manuscript .

      E2-135

      E1-220

      Valine (V)

      394

      392

      Alanine (A)

      1

      3

      Methionine (M)

      1

      0

      Glutamic acid (E)

      0

      1

      Glycine (G)

      1

      0

      Isoleucine (I)

      0

      1

      1. When describing the last results section, "CHIKV E1 binding proteins exhibit potent anit-CHIV activities" the authors use macrophages. In the rest of the text they consistently use THP-1 macrophages or human primary monocyte derived macrophages. The details of the cell type are extremely useful to the reader and having those in the last results section would be great.

      We thank Reviewer 1 for pointing out the importance of cell type clarification in the last results section. We now consistently use “THP-1 derived macrophages” instead of “macrophages” in this section.

      1. The paper is well-written. There is a slight disconnect as the authors go from discussing results in Figure 4 to Figure 5.

      We thank Reviewer 1 for the comment regarding the disconnection of the last two figures in this paper which is also shared by the other reviewers. We have taken 3 approaches to address this comment: 1) We performed a pulldown of the host factors (SPCS3, eIF3k) identified in Figure 5 with CHIKV positively selected mutants examined in Figure 4 with deficient virion production. The result is presented in our response to Reviewer 3’ s major comment #1, suggesting that the positively selected site in E1 is essential for CHIKV glycoprotein interaction with host factors. 2) To complement our first experiment, we will also determine structural protein expression and processing of parental and E1 mutant CHIKV in eIF3k CRISPR knockout 293T cells. 3) Finally, we plan to perform CORUM analysis to identify high confidence functional protein complexes using our 14 hits found in both mass spec experiments, which will provide mechanistic insights into how these identified cellular complexes and processes might modulate CHIKV infection.

      Reviewer 2’s major comments

      The authors elegantly demonstrate that CHIKV structural proteins confer an advantage over ONNV structural proteins in a step in the replication cycle downstream of virus RNA synthesis, possibly virion assembly. This point would be strengthened determining the particle-to-PFU ratio of the parental viruses and the chimeras . Presumably, the ratio would increase in the chimeras containing CHIKV structural proteins.

      We thank Reviewer 2 for this comment. We agree that determining particle-to-PFU ratios of parental and chimeric viruses will strengthen this study. To obtain the particle-to-PFU ratio, we infected THP-1 derived macrophages with CHIKV, ONNV and chimeras containing CHIKV glycoproteins (Chimera I, and ONNV/CHIKV E2+E1) for 24 h. To quantify the secreted viral particles, we extracted viral RNA in the supernatant and detected (+) viral RNA through TaqMan assay with specific nsp1 probes. The released infectious virions were evaluated through plaque assay. The particle-to-PFU ratios are summarized in the table below. The results show that ONNV has the highest particle-to-PFU ratio (41398), suggesting defective ONNV genome encapsidated in particles leading to defective virion production. On the other hand, the particle-to-PFU ratio of CHIKV (747) is 55-fold lower than that of ONNV. Replacing E3-E2-6K-E1 of ONNV with CHIKV homologous proteins reduces the particle-to-PFU ratio by 8 fold to 4875. Replacing E2 and E1 of ONNV with the ones from CHIKV (ONNV/CHIKV E2+E1) reduces the particle-to-pfu ratio by 20 fold to 2017, suggesting that CHIKV glycoproteins enhance the infectivity of viral progenies produced by THP-1 derived macrophages. We have included the results in Figure 3D-3E in our partially revised manuscript and described in lines 149-158.

      1. Additionally, the authors should consider performing virion assembly blocking assays with a small molecule inhibitor to determine if this abrogates the virus production advantage of CHIKV structural proteins within the ONNV backbone.

      We thank Reviewer 2 for this insightful comment. As the secretory pathway is commonly important for alphavirus glycoprotein maturation and assembly, it will be informative to interrogate CHIKV glycoprotein trafficking and assembly through this pathway using specific inhibitors, such as dihydropyridine FLI-06 and golgicide A . Golgicide A is a reversible inhibitor of the cis-Golgi GBF1, which leads to rapid disassembly of the Golgi and trans-Golgi network (TGN)4. FLI-06 is a new inhibitor that interferes with cargo recruitment to ER-exit sites and disrupts Golgi without depolymerizing microtubules or interfering GBF15. We pretreated THP-1 derived macrophages with 10 uM FLI-06 or golgicide A for 30 mins prior to infection with CHIKV, ONNV, Chimera I, or ONNV/ CHIKV E2+E1. After 1 hour of virus adsorption in PBS with 1% FBS in the absence of the inhibitors, the cells were treated with the inhibitors at the same concentration (10uM) in complete medium for 24 h. The plaque assay result shows that all the viruses are sensitive to secretory pathway inhibition, however, the production of viruses containing CHIKV glycoproteins is significantly more attenuated by FLI-06 and golgicide A. This suggests that CHIKV glycoproteins-mediated trafficking and assembly is more heavily dependent on the host secretory pathway . We will include this result in the fully revised manuscript.

      1. Finally, the authors should perform competition experiments with the chimeric viruses and ONNV in macrophages to determine if the chimeras can outcompete the parental ONNV strain. Based on their data, the chimeric viruses should outcompete.

      We thank Reviewer 2 for this inspiring suggestion. The competition experiment is an innovative and informative way to evaluate whether CHIKV glycoproteins confer a selective advantage on virion production in THP-1 derived macrophages. We plan to infect THP-1 derived macrophages with ONNV and ONNV/CHIKV E2+E1 and detect the viral glycoproteins secreted in the supernatant by western blot, although there is a possibility that this experiment might not work due to superinfection exclusion. Given that there is no commercial antibody of ONNV available, we need to use tagged viruses for this competition experiment. We constructed ONNV/CHIKV myc-E2+E1 that has a myc tag at the N-terminus of CHIKV E2, and ONNV/HA-E2 that has a HA tag at the N-terminus of ONNV E2. Our first attempt at concentrating the viral progenies released by THP-1 derived macrophages infected with the two tagged viruses has not been successful. We performed sucrose gradient ultracentrifugation of the supernatant viral particles but the myc and HA tags were not detected in the expected sucrose layer. Next, we plan to use myc-Ab and HA-Ab conjugated beads to pull down the supernatant viral particles to detect the ratio of ONNV/CHIKV myc-E2+E1 and ONNV/HA-E2 secreted by THP-1 derived macrophages. This will determine whether ONNV containing CHIKV glycoproteins can outcompete ONNV in co-infected cells due to increased viral fitness.

      1. The authors use both primary macrophages and macrophage cell lines as their in vitro model system and make one of their major points (listed in the title) that the determinants they identified in the CHIKV structural proteins convert macrophages into dissemination vessels; however, they do not show: 1) an in vivo model that the CHIKV-ONNV chimeras disseminate more efficiently than the parental ONNV; and 2) that these chimeras generate virus more efficiently specifically in macrophages. It would be useful to show that ONNV and CHIKV have equivalent virion production in other cell lines and that the advantage conferred by CHIKV structural proteins in the ONNV backbone is specific to macrophages. The authors should also change their title to reflect that dissemination is not directly being addressed in their study; the implications of their in vitro experimentation in a mammalian host would be more appropriate for the discussion.

      We acknowledge the limitations of the study, which include a lack of direct demonstration of in vivo dissemination. To address these concerns, we will include further discussion of our in vitro findings in the context of viral dissemination in mammalian hosts in the fully revised manuscript. We are also testing ONNV, CHIKV, Chimera I and ONNV/CHIKV E2+E1 infections in 293T cells to investigate whether the advantage conferred by CHIKV glycoproteins are macrophage specific.

      We have also updated the title to accurately reflect the significance of this research: “Chikungunya virus glycoprotein targeting of host factors increases viral fitness in human macrophage”.

      Reviewer 2’s optional comments

      1. The authors use CHIKV-ONNV chimeras but it would be interesting to test other chimeras to determine if CHIKV structural proteins confer the same advantage in the backbone of other arthritogenic alphaviruses. The study would also be strengthened by using a pathogenic strain of CHIKV instead of the vaccine strain, as this is significantly attenuated in vivo.

      We thank Reviewer 2 for this suggestion which is also suggested by Reviewer 1 in their minor comment #5. We plan to use virulent CHIKV La reunion strain and carry out infection experiments in BSL-3 to strengthen this study. We are getting trained in the BSL-3 facility and will be certified soon.

      1. In Figure 4, the authors identify residues in the CHIKV structural proteins that appear to be under positive selection in human subjects and generate point mutants in these residues with the corresponding ONNV residues. They find that one mutation, V1029I located in E1, completely abolishes virion production in THP-1 macrophage cell lines. However, in their previous chimeric experiments, they find that neither CHIKV E1 or E2 was sufficient to increase virus production in the ONNV backbone. The authors should address this discrepancy, otherwise they should consider moving the data in their point mutation experiments to a supplementary figure. While worthy of reporting, especially given the patient data, these experiments do not buttress the points made in the previous figures.

      We thank Reviewer 2 for this insightful comment. According to previous studies, E2 and E1 always interact with each other from the step of the formation of single heterodimer in the ER to heterodimer trimerization before viral particle assembly. Although the E1-V220 site (previously called V1029) on the exterior of a single E2-E1 heterodimer appears to not be engaged in the E2-E1 interaction E1-V220 is partially exposed and protruding into the groove formed by E1 and the E2 of neighboring heterodimer, accessible to host factors. As such, mutating CHIKV E1-V220 to the ONNV residue (E1-V220I) may not only disrupt E2-E1 trimerization but also interfere viral glycoprotein interaction with host factors(presented in our response to Reviewer 1’s major comment #1). Similarly, solely swapping E2 or E1 with CHIKV substitute in the ONNV backbone would also affect the interaction between neighboring E2 and E1 in trimerized spike, which may explain why neither ONNV/CHIKV E2 or ONNV/CHIKV E1 rescues virion production in THP-1 derived macrophages . We have included this in the partially revised discussion section lines __ __296-313.

      1. The authors conclude their manuscript with an assessment of several host proteins, namely SPCS3 and eIF3k, that were identified by mass spectrometry and whose knockdown results in increased virion production. The authors speculate about the role of these proteins but do not provide any mechanistic detail on how they might be playing a role. It is unclear that the putative antiviral role of these proteins involves steps downstream of virus replication, especially given that the authors speculate translation might be affected by eIF3k which, if the case, RNA synthesis should also be expected to be affected.

      We thank Reviewer 2 for this comment. We acknowledge that we have yet a full mechanistic understanding of how SPCS3 and eIF3k impact virion production. We plan to investigate their antiviral roles in our follow-up studies. For our partial revision, we have constructed several single eIF3k knockout (KO) clones of 293T cells. The eIF3k sgRNA we designed targets exon 3 which would eliminate expression of all 3 splice isoforms of eIF3k (KO schematic and sequence verification of CRISPR KO shown in revision plan). Unfortunately, we failed to obtain single clones of 293T cells with SPCS3 complete KO, consistent with a previous study by Rong Zhang et al6 that were unable to recover SPCS3 KO clones likely due to the importance of SPCS3 in cell survival. We infected an eIF3k KO clone (clone 9) with CHIKV vaccine strain 181/clone 25, ONNV SG650, and SINV Toto1101. Interestingly, we found that the antiviral activity of eIF3k is specific to CHIKV as CRISPR KO of eIF3k increases CHIKV production by 2.5 fold but not ONNV or SINV production (shown in revision plan). We have included this in the partially revised manuscript in__ line 272-282 (Figure 7B-7D).__

      We presume that Reviewer 2’s inference of eIF3k’s potential effects on viral RNA synthesis is based on our speculation of its antiviral role in viral translation, which may affect viral nonstructural gene expression. We would like to clarify that eIF3k is not an initiation factor traditionally needed for cap-dependent translation. It is also not clear what translation process (nonstructural polyprotein translation from viral genomic RNA or structural polyprotein translation from viral subgenomic mRNA) involves eIF3k if it indeed affects viral protein expression. Notably, previous SINV studies imply that alphavirus structural polyprotein translation may employ unique mechanisms without the requirement of several crucial initiation factors4,5. It will be interesting to see whether eIF3k participates in viral subgenomic mRNA translation as that would affect viral glycoprotein expression leading to reduced virion production. We have now included additional discussion on eIF3k antiviral mechanisms in the partially revised manuscript in lines 345-353.

      1. Overall, while the initial chimeric virus and domain swap approach is strong, the manuscript would benefit with a more thorough examination of virion assembly steps and a mechanistic link to virion production. Otherwise, the authors should revise the structure of their manuscript by de-emphasizing points about virion assembly and leave room for other mechanistic explanations of their chimeric data that more clearly link the host antiviral factor/E1 binding studies.

      We thank the reviewer for these positive comments and suggestions. We have addressed this by further interrogating the production kinetics of CHIKV, ONNV, and the chimeras containing CHIKV glycoproteins through determining their particle-to-PFU ratios as well as treating infected cells with secretory pathway inhibitors (refer to our responses to Reviewer 2 major comments #1 and #2). We have also included additional discussion on eIF3k antiviral mechanisms specifically on how it may affect other steps of the viral life cycle in the partially revised manuscript in lines 345-353 (refer to our response to Reviewer 2 optional comment #3).

      Reviewer 3’s critique comments

      1. Overall, the manuscript is well written but in its current state it is more like two different stories because the effects of envelope proteins and list of interactors are not brought together in one story. A possible fix to this problem would be inclusion of ONNV and CHIKV containing env mutations that do and do not restore viral release from macrophages into the pulldown/association experiments shown in Figure 6.

      We thank Reviewer 3 for the insightful suggestions to better connect the first (CHIKV determinants) and second (CHIKV glycoprotein interactors) parts of the manuscript. In response to the Reviewer’s comment, we tested the binding of SPCS3 and eIF3k to CHIKV E1 with E1-V220I (V1029I in original manuscript version) mutation (shown in revision plan) which was shown to abrogate virion production in THP-1 derived macrophages in Figure 4E. We transfected plasmids expressing 3XFLAG-tagged SPCS3/eIF3k or empty vector for 24 h followed by transfection with plasmids expressing either the parental CHIKV vaccine strain 181/clone 25 poly-glycoproteins (E3-myc-E2-6K-E1) or poly-glycoproteins with the E1-V220I mutation. Interestingly, we found that mutating CHIKV E1-V220 to the homologous ONNV residue reduces the binding to either SPCS3 or eIF3k. This result strongly suggests that the positively selected E1-V220 is located in the interaction interface between E1 and SPCS3/eIF3k, confirming the genetic conflict between E1 and these host factors to be one of the major drivers of CHIKV evolution observed at site E1-V220. We have included this result in partially revised manuscript in Figure 7A and in lines 265-271.

      1. The other major issue is the lack of protein data for the viral mutants relative to WT ONNV and CHIKV and assessment of viral RNA in the supernatants to determine whether the block is release or an earlier event since viral RNA levels in the cell seems to be the same or at least normalized.

      We thank Reviewer 3 for pointing out the insufficient clarification of the block leading to defective CHIKV mutant virion production. We previously detected E2 expression from 293T cells transfected with poly-glycoproteins (E3-myc-E2-6K-E1) containing E2-V135L (V460L in original manuscript version), E2-A164T (A489T in original manuscript version), E2-A246S (A571S in original manuscript version) and E1-V220I (V1029I in original manuscript version). We found that only E2-V135L mutation can lead to unexpected E2 cleavage (shown in revision plan) as we mentioned but not shown in the original manuscript. This explains why E2-V135L mutation attenuates infectious CHIKV production.

      The E2 expression of E1-V220I appears to be not affected in 293T cells transfected with poly-glycoproteins with E1-V220I (shown in revision plan ). In addition, the E1-host factor binding result in our response to Reviewer 3’s major comment #1 showed that E1 with the positively selected site mutation V220I can also be successfully expressed in 293T cells after transfection with poly-glycoprotein. Based on these current data, E1-V220I mutation likely abrogates virion production without affecting glycoprotein expression.

      Our previous result of the ONNV particle-to-PFU ratio reveals that ONNV RNA is released but encapsidated in defective particles causing its attenuation in infected macrophages. Thus, even though the glycoproteins of E1-V220I can be expressed, the diminished virion production of CHIKV E1-V220I can still be ascribed to 1) blocked viral particle release and 2) production of defective particles like ONNV. Given that it is not feasible to obtain particle-to-PFU ratio of E1-V220I mutant which fails to form plaques, Reviewer 3’s suggestion to assess the supernatant viral RNA will be a nice approach to address this question. To further address this concern, we plan to transfect THP-1 derived macrophages with CHIKV E1-V220I mutant RNA to detect the intracellular viral glycoprotein expression and supernatant viral RNA levels through western blot and TaqMan assay, respectively.

      1. Lastly, knockdown experiments indicate an effect of things like OAS3 or other innate immune modulators. There are no controls to demonstrate that these are specific to CHIKV infection or if knockdown would assist growth of ONNV as well.

      We also thank Reviewer 3 for the suggestion to check whether the identified host factors specifically target CHIKV or inhibit the infection of ONNV as well. We previously tried but were facing some issues. Since only a small fraction of macrophages can be infected with CHIKV and even a smaller fraction can be infected with ONNV (Figure 1A), it is hard to elucidate the roles of these identified host factors in ONNV infection by siRNA knockdown. We decided to take a more rigorous approach to investigate the antiviral specificity of identified host factors, especially understudied SPCS3 and eIF3k, to different alphaviruses by generating complete knockout 293T single cell clones. Despite the fact that we did not successfully generate SPCS3 complete KO, we obtained an eIF3k KO single cell clone and infected it with CHIKV, ONNV and SINV (refer to our response to Reviewer 2 optional comment #3). We found that eIF3k only has antiviral activity against CHIKV with almost no effects on ONNV or SINV infection. We have included this in our partially revised manuscript in line 272-282 (Figure 7B-7D).

      Reviewer 3's minor comments:

      Other points to consider:

      1. The title does not fit the manuscript findings and should be modified.

      We thank Reviewer 3 for this important comment, which was also brought up by Reviewer 2. We have now changed our title to “Chikungunya virus glycoprotein targeting of host factors increases viral fitness in human macrophage”, which more accurately reflects the significance of our research.

      1. It is unclear why the authors show results for SINV and RRV in Figure 1. Either these should be removed or the viruses should be carried throughout the experiments described in the Figure. Better yet would be to add additional alphaviruses to this analysis to determine if there are additional viruses that act similarly to CHIKV.

      We apologize for the confusion caused by including SINV and RRV results in Figure 1. We intended to show the superiority of CHIKV in infecting primary monocyte derived macrophages among arthritogenic alphaviruses, which we speculate may provide the molecular basis for macrophage-mediated CHIKV dissemination and disease. We would like to keep the SINV and RRV infection results in Figure 1 to highlight the relative susceptibility of macrophages to CHIKV. To echo the additional alphaviruses tested in Figure 1 and bring the story full circle, we included the result of SINV infection of eIF3k CRISPR KO 293T cells in Figure 7B-7D. These results uncover inhibitory activities of eIF3k that are specific to CHIKV.

      1. Is the data presented in Figure 1A significant?

      We thank Reviewer 3 for this question. We infected both THP-1 derived macrophages and primary monocyte derived macrophages with EGFP-expressing alphaviruses each in duplicates for two independent times. The general low expression of EGFP in all virus-infected groups refrains us from drawing conclusions based on statistically significant differences observed with MFI, hence we chose to show representative scatter plots in the original manuscript. To address Reviewers 3's question, we plotted the infected cell (EGFP+) based on the percentages of the experimental duplicates (shown in revision plan), and found CHIKV infection to be the most significantly different from that of the other alphaviruses in primary monocyte derived macrophage . The numbers above the bar charts are the mean percentages of EGFP+ cells.

      1. The justification for inclusion of Figure 4A is lacking. It is unclear what this panel is supposed to be demonstrating.

      This is an excellent suggestion as the host factors identified by AP-MS not only contain interactors of CHIKV mature E2 but also those of uncleaved E2-containing precursor polyproteins. We modified Figure 4A to reflect all E2/E2-containing poly-glycoproteins present in CHIKV-infected cells (shown in revision plan).

      1. There is little justification for the candidates assessed in

      We understand Reviewer 3’s concern. Due to the nature of mass spectrometry studies which predict protein-protein interactions rather than direct functional validation, we acknowledge that we may miss some host candidates that have anti- or pro-CHIKV activities. Although justification of hit selection from mass spectrometry datasets is more difficult than that from CRISPR KO screen datasets, we set up specific criteria to identify host protein candidates with the greatest potential to functionally interact with CHIKV glycoproteins. Most of the proteins we chose to validate (Figure 6a) were identified in both of our independent AP-MS experiments, which both pass through a P-value threshold of 0.05 and log2 fold change of 0.

      1. Extended data Figure 3 is very difficult to read due to the small font size.

      We apologize for the small font in Extended data Figure 3. We plan to replace Figure EV3 ( Extended data 3 in unrevised version) with a CORUM protein-protein interaction network that centers on the significant hits identified by both AP-MS experiments, but includes hits from either one of the two experiments in these functional protein complexes. The figure will be more concise and centralized, and the font will be bigger.

      1. Just to be clear, the blots shown in Figure 6D are different from those depicted in Extended data Figure 4b, because some of them look very similar.

      We thank Reviewer 3 for this question. In Figure 6D, we expressed CHIKV glycoproteins through transfecting CHIKV genomic RNA into 293T cells, while, in Figure 4B, we expressed CHIKV glycoproteins through transfecting poly-glycoprotein plasmid (pcDNA3.1-E3-myc-E2-6K-E1) into 293T cells, which are complementary approaches to express CHIKV glycoproteins to validate their interactions with identified host factors. We have now added schematics to illustrate the different experimental strategies above the figures in this partially revised manuscript (shown in revision plan).

      References:

      Voss, J. E. et al. Glycoprotein organization of Chikungunya virus particles revealed by X-ray crystallography. Nature 468, 709–712 (2010). Jose, J., Tang, J., Taylor, A. B., Baker, T. S. & Kuhn, R. J. Fluorescent Protein-Tagged Sindbis Virus E2 Glycoprotein Allows Single Particle Analysis of Virus Budding from Live Cells. Viruses 7, 6182–6199 (2015). Götte, B., Liu, L. & McInerney, G. M. The Enigmatic Alphavirus Non-Structural Protein 3 (nsP3) Revealing Its Secrets at Last. Viruses 10, 105 (2018). Saenz, J. B. et al. Golgicide A reveals essential roles for GBF1 in Golgi assembly and function. Nat. Chem. Biol. 5, 157–165 (2009). Krämer, A. et al. Small molecules intercept Notch signaling and the early secretory pathway. Nat. Chem. Biol.9, 731–738 (2013). Zhang, R. et al. A CRISPR screen defines a signal peptide processing pathway required by flaviviruses. Nature 535, 164–168 (2016).

    1. Author Response

      Joint Public review

      The manuscript by Mitra and coworkers analyses the functional role of Orai in the excitability of central dopaminergic neurons in Drosophila. The authors show that a dominant-negative mutant of Orai (OraiE180A) significantly alters the gene expression profile of flight-promoting dopaminergic neurons (fpDANs). Among them, OraiE180A attenuates the expression of Set2 and enhances that of E(z) shifting the level of epigenetic signatures that modulate gene expression. The present results also demonstrate that Set2 expression via Orai involves the transcription factor Trl. The Orai-Trl-Set1 pathway modulates the expression of VGCC, which, in turn, are involved in dopamine release. The topic investigated is interesting and timely and the study is carefully performed and technically sound; however, there are several major concerns that need to be addressed:

      1) In Figure S2E, STIM is overexpressed in the absence of Set2 and this leads to rescue. It is presumed that STIM overexpression causes excess SOCE, yet this is rarely the case. Perhaps the bigger concern, however, is how excess SOCE might overcome the loss of SET2 if SET2 mediates SOCE-induced development of flight. These data are more consistent with something other than SET2 mediating this function.

      Our statement that STIM overexpression overcomes deficits in SOCE is based on the following published work:

      1. Studies of SOCE in wildtype cultured larval Drosophila neurons demonstrated that overexpression of STIM raised SOCE to the same extent as co-expression of STIM and Orai in the WT background (Chakraborty et al, 2016; Figure 1D).

      2. Both Carbachol-induced IP3-mediated Ca2+ release and SOCE (measured by Ca2+ add back after Thapsigargin-induced store depletion) were rescued in primary cultures of IP3R hypomorphic mutant (itprku) Drosophila neurons by overexpression of STIM (Agrawal et al., 2010; Figure 8A-G).

      3. Deb et al., 2016 (Supplementary Figure 2h,i) reaffirmed that overexpression of STIM significantly improves SOCE after Thapsigargin-induced passive store-depletion in Drosophila neurons expressing IP3RRNAi.

      4. Consistent with the cellular rescue of SOCE, defects in flight initiation and physiology observed in the heteroallelic IP3R hypomorphic background (itprku) could be rescued by overexpression of STIM (Agrawal et al., 2010; Figure 3A-E) as well as Orai (Venkiteswaran and Hasan, 2009; Figure 3).

      5. In Figure S2E, we show that flight deficits arising from THD’> Set2RNAi are rescued upon overexpression of STIM (i.e. THD’>Set2RNAi; STIMOE). Here and in another recent publication (Mitra et al., 2021) we show that neurons expressing Set2RNAi exhibit reduced expression of the IP3R and reduced ER-Ca2+ release presumably leading to reduced SOCE. As mentioned above we have consistently found that STIM overexpression raises both IP3-mediated Ca2+ release and SOCE in Drosophila neurons.

      In this study, we propose that Ca2+ release through the IP3R followed by SOCE are part of a positive feedback loop driving expression of Set2 which in turn upregulates expression of mAChR and IP3R (Figure 3F) to regulate dopaminergic neuron function. Our observation that loss of Set2 (THD’>Set2RNAi) can be rescued by STIM overexpression is consistent with this model because:

      1. Loss of Set2 (THD’>Set2RNAi) results in downregulation of several genes including mAChR and IP3R leading to decreased SOCE.

      2. As evident from our previous studies increased STIM expression in the Set2RNAi background (THD’>Set2RNAi; STIMOE) is expected to enhance SOCE which we predict would rescue Set2 expression leading to rescue of other Set2 dependent downstream functions like flight (Figure 2D).

      2) In Figure 3, data is provided linking SET2 expression and Cch-induced Ca2+ responses. The presentation of these data is confusing. In addition, the results may be a simple side effect of SET2-dependent expression of IP3R. Given that this article is about SOCE, why isn't SOCE shown here? More generally, there are no measurements of SOCE in this entire article. Measuring SOCE (not what is measured in response to Cch) could help eliminate some of this confusion.

      We will re-write this section in the revised version for better clarity and explain how Set2-dependent IP3R expression is an important component of Orai-mediated Ca2+ entry in fpDANs. Here, we propose that IP3-mediated Ca2+ release and SOCE, through Orai, are together part of a positive feedback loop driving transcription of Set2 which in turn upregulates mAChR and IP3R expression (Figure 3F). We hypothesized that the observed loss of CCh-induced Ca2+ response in the Set2RNAi background (Figure 3B-D; THD’>Set2RNAi) results from decreased itpr and mAChR expression and verified this in Figure 3E. This is further validated by the rescue of CCh-induced Ca2+ response and itpr/mAChR expression in the OraiE180A background upon Set2 overexpression (Figure 3B-E; THD’>OraiE180A; Set2OE). We were constrained to measure CCh-induced Ca2+ responses in OraiE180A expressing neurons for the following reasons:

      1. SOCE measurements through Tg mediated store Ca2+ release followed by Ca2+ add back require a 0 Ca2+ environment that can only be achieved in culture. The Drosophila brain is bathed in hemolymph which contains Ca2+ and there do not exist any methods to readily deplete Ca2+ from the tissue to create a 0 Ca2+ environment without also effecting the health of the neurons.

      2. Cultures of the subset of dopaminergic neurons (THD’) we have focused on in this study were not feasible due to the small number of neurons being studied from the total number of dopaminergic neurons in the brain (~35/400). In previous studies we have shown that SOCE post-Tg induced store depletion is abrogated in cultured dopaminergic neurons from Drosophila upon expression of OraiE180A (Pathak et al., 2015).

      Furthermore, Carbachol-induced IP3-mediated Ca2+ release is tightly coupled to SOCE in Drosophila neurons (Venkiteswaran and Hasan, 2009) and Ca2+ release from the IP3R is physiologically relevant for flight behavior in THD’ neurons (Sharma and Hasan, 2020).

      3) A significant gap in the study relates to the conclusion that trl is a SOCE-regulated transcription factor. This conclusion is entirely based on genetic analysis of STIMKO heterozygous flies in which a copy of the trl13C hypomorph allele is introduced. While these results suggest a genetic interaction between the expression of the two genes, the evidence that expression translates into a functional interaction that places trl immediately downstream of SOCE is not rigorous or convincing. All that can be said is that the double mutant shows a defect in flight which could arise from an interruption of the circuit. Further, it is not clear whether the trl13C hypomorph is only introduced during the critical 72-96 hour time window when the Orai1E180E phenotype shows up. The same applies to the over-expression of Set2 and the other genes. If the expression is not temporally controlled, then the phenotype could be due to the blockade of an entirely different aspect of flight neuron function.

      The idea that Trl functions downstream of Orai-mediated Ca2+ entry in THD’ neurons is based on the following genetic evidence:

      1. In Figure 4D, we show evidence of genetic interaction between trl-STIM and trl-Set2. The rescue of trl13c/STIMKO with STIM overexpression in THD’ neurons indicates that excess SOCE (driven by STIMOE) may activate the residual Trl (there exists a WT Trl copy in this genetic background) to rescue THD’ flight function. This is further supported by the rescue of trl/STIMKO with Set2 overexpression in THD’ neurons, which is consistent with the feedback loop model proposed in Figure 5C - where we propose that reduced SOCE leads to reduced ‘activated’ Trl and thus reduced Set2 expression, and the latter is rescued by SET2OEThe manner in which SOCE ‘activates’ Trl is the subject of ongoing investigations.

      2. The trl hypomorphic alleles (including trl13C) exist as genetic mutants and they affect Trl function in all tissues throughout development. While we concede that these mutant alleles would affect multiple functions at other stages of development, which may impinge on the phenotypes noted in Figure S4B, we have used a targeted RNAi approach to validate Trl function specifically in the THD’ neurons (Figure 4C).

      3. Overexpression mediated rescues (including Set2) were not induced only during the critical 72-96 hrs APF developmental window. Having established that Orai function drives critical gene expression during this window (Figure 1), it is reasonable to assume that Set2 rescue of loss of flight in OraiE180A occurs in the same time window where flight is disrupted.

      4- In Figure 4, data is shown that SOCE compensates for the loss of Trl, the presumed mediator of SOCE-dependent flight. The fact that flight deficits are rescued by raising SOCE in the absence of Trl is very inconsistent with this conclusion.

      We apologise for this confusion and will clarify in the revision. trl13c is a recessive allele of Trl and should be written as such throughout the text and in the figures (i.e trl13c and NOT Trl13c). In all cases of Trl mutant rescue by STIMOE and Set2OE there exists residual Trl that can be activated by excess SOCE thus leading to the rescue. This is true for trl13C/ STIMKO where each mutant is present as a heterozygote (the complete genotype of this strain is STIMKO/+; trl13c/+; this will be corrected in the revision). Similarly, for TrlRNAi we expect reduced levels (but not complete loss) of Trl. Thus the SOCE rescue of loss of Trl occurs in conditions where Trl levels are reduced but NOT absent. Homozygous trl null mutants are lethal.

      5- In Figure 5 (A-C), data is provided that Trl transcripts are unaffected by loss of SOCE and that overexpression cannot rescue flightlessness. From this, the authors conclude that this gene "must" be calcium responsive. While that is one possibility, it is also possible that these genes are not functionally linked.

      The idea that Trl is functionally linked to SOCE is based on the following evidence:

      1. In Figure 4C we show that flight defects caused by partial loss of Trl (THD’>TrlRNAi) were rescued by STIM overexpression (THD’>TrlRNAi; STIMOE). As mentioned above we have found that STIM overexpression raises SOCE.

      2. Heteroalleles of the trl13C hypomorph exhibit a strong genetic interaction with a single copy of the null allele of STIMKO as shown by the flight deficit of trl13c/+; STIMKO/+ (trl13C/STIMKO ) flies (Figure 4D). The genotypes will be corrected in the revision.

      3. Flight defects in trl13C/STIMKO flies could be rescued by STIM overexpression in the THD’ neurons (trl13C/STIMKO; THD’>STIMOE)

      4. In Figure 4E, we show that partial loss of Trl in THD’ neurons (THD’>TrlRNAi) leads to decreased expression of the Ca2+ responsive genes mAChR, itpr, and Set2 genes indicating that Trl is a constituent of the SOCE-driven transcriptional feedback loop (Figure 5C).

      Since we could not detect a well-defined Ca2+ binding domain in Trl, we hypothesize that it could be activated by a Ca2+ dependent post-translational modification. Phosphoproteome analysis of Trl demonstrated that it does indeed undergo phosphorylation at a Threonine residue (T237; Zhai et al., 2008), which lies within a potential site for CaMKII. Independently, CaMKII has been identified as a binding partner of Trl from a Trl interactome study (Lomaev et al., 2018). Past work from our group (Ravi et al., 2018) identified a role for CaMKII in THD’ neurons in the context of flight. We are currently testing if CaMKII functions downstream of SOCE in THD’ neurons to mediate flight and will update this information in the next version of the manuscript.

      6) There is no characterization of SOCE in fpDANs from flies expressing native Orai or the dominant negative OraiE180A mutant. While the authors refer to previous studies, as the manuscript is essentially based on Orai function thapsigargin-induced SOCE should be tested using the Ca2+ add-back protocol in order to assess the release of Ca2+ from the ER in response to thapsigargin as well as the subsequent SOCE.

      The fpDANs consist of 16-19 neurons in each hemisphere (PPL1 are 10-12 and PPM3 are 6-7 cells; Pathak et al., 2015). Measuring SOCE from these neurons in vivo is not possible due to the presence of abundant extracellular Ca2+ in the brain. Given their sparse number, it proved technically challenging to isolate the fpDANs in culture to perform SOCE measurements using the Ca2+ add back protocol. Due to these reasons, we have relied upon using Carbachol to elicit IP3-mediated Ca2+ release and SOCE as a proxy for in vivo SOCE. In previous studies we have shown that Carbachol treatment of cultured Drosophila neurons elicits IP3-mediated Ca2+ release and SOCE (Agrawal et al., 2010; Figure 8). Moreover, expression of OraiE180A completely blocks SOCE as measured in primary cultures of dopaminergic neurons (Pathak et al., 2015; Figure 1E). Hence we have not repeated SOCE measurements from all dopaminergic neurons in this work. In the revised version we will explicitly state this weakness of our study and the reasons for it.

      7) In the experiments performed to rescue flight duration in Set2RNAi individuals the authors overexpress STIM and attribute the effect to "Excess STIM presumably drives higher SOCE sufficient to rescue flight bout durations caused by deficient Set2 levels.". This should be experimentally tested as the STIM:Orai stoichiometry has been demonstrated as essential for SOCE.

      The assumption that STIM overexpression drives higher SOCE is based upon previously published work from Drosophila neurons (Agrawal et al., 2010; Chakraborty et al, 2016; Deb et al., 2016) which demonstrates that excess WT STIM overcomes IP3R deficiencies (RNAi or hypomorphic mutants) to rescue SOCE. We agree that STIM-Orai stoichiometry is essential for SOCE, and propose that the rescue backgrounds possess sufficient WT Orai, which is recruited by the excess STIM to mediate the rescue. We will reference the earlier work to validate our use of STIMOE for rescue of SOCE.

      Here, we propose that Set2 is part of a positive feedback loop driving transcription of mAChR and IP3R (Figure 3F). In keeping with this hypothesis, we posit that the phenotypes observed in the Set2RNAi background (Figure 2D) result from decreased itpr and mAChR expression (validated in Figure 3E). This is further validated by the Set2 overexpression mediated rescue of OraiE180A (Figure 2D) and rescue of itpr/mAChR expression in the OraiE180A background (Figure 3B-E; THD’>OraiE180A; Set2OE).

      8) The authors show that overexpression of OraiE108A results in Stim downregulation at a mRNA level. What about the protein level? And more important, how does OraiE108A downregulate Stim expression? Does it promote Stim degradation? Does it inhibit Stim expression?

      We hypothesize that changes in STIM mRNA observed in the THD’ > OraiE180A neurons stems from an overall reduction in IP3-mediated Ca2+ release and SOCE due to loss of Trl-Set2 driven gene expression detailed in our transcriptional feedback loop model (Figure 5C). We will attempt to explain this aspect more clearly in the next version of the manuscript. While we agree that measuring levels of STIM protein would be helpful, estimation of protein levels from a limited number of neurons (~35 cells per brain) is technically challenging. The STIM antibody does not work well in immunohistochemistry. In the absence of any experimental evidence we cannot comment on how expression of OraiE180A might affect STIM protein turnover.

      9) Lines 271-273, the authors state "whereas overexpression of a transgene encoding Set2 in THD' neurons either with loss of SOCE (OraiE180A) or with knockdown of the IP3R (itprRNAi), lead to significant rescue of the Ca2+ response". This is attributed to a positive effect of Set2 expression on IP3R expression and the authors show a positive correlation between these two parameters; however, there is no demonstration that Set2 expression can rescue IP3R expression in cells where the IP3R is knocked down (itprRNAi). This should be further demonstrated.

      The rescue of IP3R expression by Set2 overexpression in itprRNAi was demonstrated in a different set of Drosophila neurons in an earlier study (Mitra et al., 2021) and has not been repeated specifically in THD’ neurons. Similar to the previous study, here we tested CCh stimulated Ca2+ responses of THD’ neurons with itprRNAi and itprRNAi; SetOE (Fig S3), which are indeed rescued by SET2OE.

      10) The data presented in Figure 3E should be functionally demonstrated by analyzing the ability of CCh to release Ca2+ from the intracellular stores in the absence of extracellular Ca2+.

      CCh-mediated Ca2+ release from the intracellular stores in the absence of extracellular Ca2+ has been described in primary cultures of Drosophila neurons in previously published work (Venkiteswaran and Hasan, 2009; Agrawal et al., 2010) This work focuses on a set of 16-19 dopaminergic neurons in a hemisphere of the Drosophila central brain. It is technically challenging to generate a 0 Ca2+ environment in vivo, which is essential for measuring store Ca2+ release. Given their meagre numbers, primary cultures of these neurons is not readily feasible.

      11) The conclusion that SOCE regulates the neuronal excitability threshold is based entirely on either partial behavioral rescue of flight, or measurements of KCl-induced Ca2+ rises monitored by GCaMP6m in DAN neurons. The threshold for neuronal excitability is a precise parameter based on rheobase measurements of action potentials in current-clamp. Measurements of slow calcium signals using a slow dye such as GCaMp6m should not be equated with neuronal excitability. What is measured is a loss of the calcium response in high K depolarization experiments, which occurs due to the loss of expression of Cav channels. Hence, the use of this term is not accurate and will confuse readers. The use of terms referring to neuronal excitability needs to be changed throughout the manuscript. As such, the conclusions regarding neuronal excitability should be strongly tempered and the data reinterpreted as there are no true measurements of neuronal excitability in the manuscript. All that can be said is that expression of certain ion channel genes is suppressed. Since both Na+ channels and K+ channel expression is down-regulated, it is hard to say precisely how membrane excitability is altered without action potential analysis.

      The claim that SOCE influences neuronal excitability is based on the following observations:

      1. Interruption of the transcriptional feedback loop involving SOCE, Trl, and Set2 through loss of any of its constituents, results in the downregulation of VGCCs (Figure 5G, 6H), which are essential components of action potentials.

      2. OraiE180A mediated loss of SOCE in THD’ neurons abrogates the KCl-evoked depolarization response (Figure 6B, C) measured using GCaMP6m. We verified that this response requires VGCC function using pharmacological inhibition of L-type VGCCs (Figure 6E, F).

      3. SOCE deficient THD’ neurons, which were presumably compromised in their ability to evoke action potentials could be rescued to undergo KCl-evoked depolarisation by expression of NachBac, which lowers the depolarization threshold (Figure 7C, D) or through optogenetic stimulation using CsChrimson (Figure 7F).

      We agree that ‘neuronal excitability threshold’ is a precise electrophysiological parameter that has not been directly investigated here by measurement of action potentials. Therefore, references to neuronal excitability will be tempered throughout the revised manuscript and be replaced with a more generic reference to ‘neuronal activity’. In this context we propose to include further evidence supporting reduced excitability of THD’ neurons upon loss of SOCE in the revision.

      Since one of the key functional outcomes of activity during critical developmental periods such as the 72-96 hrs APF developmental window identified in this study, is remodelling of neuronal morphology, we decided to investigate the same in our context. Neuronal activity can drive changes in neurite complexity and axonal arborization (Depetris-Chauvin et al., 2011) especially during critical developmental periods (Sachse et al., 2007). To understand if Orai mediated Ca2+ entry and downstream gene expression through Set2 affects this activity-driven parameter, we investigated the morphology of fpDANs, and specifically measured the complexity of presynaptic terminals within the 2’1 lobe MB using super-resolution microscopy. We found striking changes in the neurite volume upon expression of OraiE180A which could be rescued by restoring either Set2 (OraiE180A; Set2OE) or by inducing hyperactivity through NachBac expression (OraiE180A ; NachBacOE). These data will be included in the revised manuscript.

      12) Related, since trl does not contain any molecular domains that could be regulated by Ca2+ signaling, it is unclear whether trl is directly regulated by SOCE or the regulation is highly indirect. Reporter assays evaluating trl activation upon Ca2+ rises would provide much stronger and more direct evidence for the conclusion that trl is a SOCE-regulated TF. As such the evidence is entirely based on RNAi downregulation of trl which indicates that trl is essential but has no bearing on exactly what point of the signaling cascade it is involved.

      We agree that luciferase Trl reporters would provide a direct method to test SOCE-mediated activation. Future investigations will be targeted in this direction. Regarding possible mechanisms of Trl activation - since we could not detect a well-defined Ca2+ binding domain in Trl, we hypothesize that it may be phosphorylation by a Ca2+ sensitive kinase. Phosphoproteome analysis of Trl indicates that it does indeed undergo phosphorylation at a Threonine reside (T237; Zhai et al., 2008), which may be mediated by the Ca2+ sensitive kinase-CaMKII based on binding partners identified in the Trl interactome (Lomaev et al., 2018). Past work (Ravi et al., 2018) has indeed demonstrated a requirement for CaMKII in THD’ neurons for flight. We are currently testing whether CaMKII functions downstream of SOCE in these neurons to mediate flight, and will be updating this information in the next version of the manuscript.

      13) Are NFAT levels altered in the Orai1 loss of function mutant? If not, this should be explicitly stated. It would seem based on previous literature that some gene regulation may be related to the downregulation of this established Ca2+-dependent transcription factor. Same for NFkb.

      As mentioned in the text in lines (307-309), Drosophila NFAT lacks a calcineurin binding site and is therefore not sensitive to Ca2+ (Keyser et al., 2007). In the past we tested if knockdown of NF-kB in dopaminergic neurons gave a flight phenotype and did not observe any measurable deficit. From the RNAseq data we find a slight downregulation of NFAT (0.49 fold, p value=0.048) and NF-kb (0.26 fold, p value =0.258) the significance of which is unclear at this point. We did not find any consensus binding sites for these two factors in the regulatory regions of downregulated genes from THD’ neurons.

      14) Does over-expression of Set2 restore ion channel expression especially those of the VGCCs? This would provide rigorous, direct evidence that SOCE-mediated regulation of VGCCs through Set2 controls voltage-gated calcium channel signaling.

      Set2 overexpression in the OraiE180A background indeed restores the expression of VGCC genes (Figure 6H).

      15) All 6 representative panels from Figure 3B are duplicated in Figure 4G. Likewise, 2 representative panels from Figure 5H are duplicated in Figure 6D. Although these panels all represent the results from control experiments, the relevant experiments were likely not conducted at the same time and under the same conditions. Thus, control images from other experiments should not be used simply because they correspond to controls. This situation should be clarified.

      We regret the confusion caused by the same representative images for the control experiments. These will be replaced by new representative images for Figure 5H in the next updated version of the manuscript.

      16) The figures are unusually busy and difficult to follow. In part this is because they usually have many panels (Fig. 1: A-I; Fig. 2, A-J, etc) but also because the arrangement of the panels is not consistent: sometimes the following panel is found to the right, other times it is below. It would help the reader to make the order of the panels consistent, and, if possible, reduce the number of panels and/or move some of the panels to new figures (eLife does not limit the number of display items).

      The image panels will be rearranged for ease of reading in the next updated version of the manuscript.

      17) As a final recommendation, the reviewers suggest that the authors a- Reword the text that refers to membrane excitability since membrane excitability was not directly measured here. b-Explain why STIM1 rescues the partial loss of flight in Set2 RNAi flies (Fig. S2E); and c- Explain how/why trl is calcium regulated and test using luciferase (or other) reporter assays whether Orai activation leads to trl activation.

      a. Textual references to membrane excitability will be appropriately modified.

      b. We have provided a detailed explanation for how STIM overexpression might rescue the phenotypes caused by Set2RNAi in Point 1. In short, these phenotypes depend upon IP3R mediated Ca2+ entry driving a transcriptional feedback loop. We relied upon past reports that STIM overexpression upregulates IP3R-mediated Ca2+ release and SOCE in Drosophila itpr mutant neurons (Agrawal et al., 2010; Chakraborty et al, 2016; Deb et al, 2016). We therefore propose that STIM overexpression in the Set2RNAi background rescues IP3R mediated Ca2+ release followed by SOCE, which drives enhanced Set2 transcription, counteracting the effects of the RNAi. We will explain this more clearly with past references in the next revision.

      c. We have provided a detailed response to this comment in Point 12. Briefly, we agree that building luciferase reporters for Trl could be an ideal strategy to test for its responsiveness to SOCE and needs to be done in future. As an alternate strategy, we have looked at data from existing studies of interacting partners of Trl (Lomaev et al., 2017) and identified CamKII, which is both Ca2+ responsive (Braun and Schulman, 1995; Yasuda et al., 2022), and thus might activate Trl through a phosphorylation-switch like mechanism. Moreover, a previous publication identified a requirement for CamKII in THD’ neurons for Drosophila flight (Ravi et al., 2018). We are testing the ability of a dominant active version of CamKII to rescue THD’>E180A flight deficits and will include this information in the next version of the manuscript.

      References

      1. Agrawal N, Venkiteswaran G, Sadaf S, Padmanabhan N, Banerjee S, Hasan G. Inositol 1,4,5-Trisphosphate Receptor and dSTIM Function in Drosophila Insulin-Producing Neurons Regulates Systemic Intracellular Calcium Homeostasis and Flight. J Neurosci. 2010;30:1301-1313. doi:10.1523/jneurosci.3668-09.2010
      2. Braun AP, Schulman H. A non-selective cation current activated via the multifunctional Ca(2+)-calmodulin-dependent protein kinase in human epithelial cells. J Physiol. 1995. 488:37-55. doi:10.1113/jphysiol.1995.sp020944
      3. Chakraborty S, Deb BK, Chorna T, Konieczny V, Taylor CW, Hasan G. Mutant IP3 receptors attenuate store-operated Ca2+ entry by destabilizing STIM-Orai interactions in Drosophila neurons. J Cell Sci. 2016. 129:3903-3910. doi:10.1242/jcs.191585
      4. Deb BK, Pathak T, Hasan G. Store-independent modulation of Ca2+ entry through Orai by Septin 7. Nat Commun. 2016. 7:11751. doi:10.1038/ncomms11751
      5. Depetris-Chauvin A, Berni J, Aranovich EJ, Muraro NI, Beckwith EJ, Ceriani MF. Adult-specific electrical silencing of pacemaker neurons uncouples molecular clock from circadian outputs. Curr Biol. 2011. 21:1783-1793. doi: 10.1016/j.cub.2011.09.027.
      6. Keyser P, Borge-Renberg K, Hultmark D. The Drosophila NFAT homolog is involved in salt stress tolerance. Insect Biochem Mol Biol. 2007. 37:356-362. doi:10.1016/j.ibmb.2006.12.009
      7. Kilo L, Stürner T, Tavosanis G, Ziegler AB. Drosophila Dendritic Arborisation Neurons: Fantastic Actin Dynamics and Where to Find Them. Cells. 2021. 10:2777. doi:10.3390/cells10102777
      8. Lomaev D, Mikhailova A, Erokhin M, et al. The GAGA factor regulatory network: Identification of GAGA factor associated proteins. PLoS One. 2017. 12:e0173602. doi:10.1371/journal.pone.0173602
      9. Mitra R, Richhariya S, Jayakumar S, Notani D, Hasan G. IP3/Ca2+ signals regulate larval to pupal transition under nutrient stress through the H3K36 methyltransferase dSET2. Development. 2021. 148:dev199018. doi:10.1101/2020.11.25.399329
      10. Pathak T, Agrawal T, Richhariya S, Sadaf S, Hasan G. Store-Operated Calcium Entry through Orai Is Required for Transcriptional Maturation of the Flight Circuit in Drosophila. J Neurosci. 2015. 35:13784-13799. doi:10.1523/jneurosci.1680-15.2015
      11. Ravi P, Trivedi D, Hasan G. FMRFa receptor stimulated Ca2+ signals alter the activity of flight modulating central dopaminergic neurons in Drosophila melanogaster. Barsh GS, ed. PLOS Genet. 2018. 14:e1007459. doi:10.1371/journal.pgen.1007459
      12. Sachse S, Rueckert E, Keller A, Okada R, Tanaka NK, Ito K, Vosshall LB. Activity-dependent plasticity in an olfactory circuit. Neuron. 2007. 56:838-50. doi: 10.1016/j.neuron.2007.10.035.
      13. Sharma A, Hasan G. Modulation of flight and feeding behaviours requires presynaptic IP3Rs in dopaminergic neurons. Elife. 2020;9. e62297.doi:10.7554/elife.62297
      14. Venkiteswaran G, Hasan G. Intracellular Ca2+ signalling and store operated Ca2+ entry are required in Drosophila neurons for flight. Proc Natl Acad Sci. 2009.106:10326-10331. doi: 10.1073/pnas.0902982106
      15. Yasuda R, Hayashi Y, Hell JW. CaMKII: a central molecular organizer of synaptic plasticity, learning and memory. Nat Rev Neurosci. 2022. 23: 666-682 doi:10.1038/s41583-022-00624-2
      16. Zhai B, Villén J, Beausoleil SA, Mintseris J, Gygi SP. Phosphoproteome Analysis of Drosophila melanogaster Embryos. J Proteome Res. 2008. 7:1675-1682. doi:10.1021/pr700696a
    1. Fig. 3: Adult hippocampal neurogenesis is impaired in patients with Alzheimer’s disease.

      We can see in (b) and (c) that the harmful proteins are building up as AD progresses through the stafes, and in d-l that DCX and DCX+ cells decrease as AD progresses compared to control subjects.

    1. Author Response

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public Review):

      This study provides insights into the early detection of malignancies with noninvasive methods. The study contained a large sample size with external validation cohort, which raises the credibility and universality of this model. The new model achieved high levels of AUC in discriminating malignancies from healthy controls, as well as the ability to distinguish tumor of origin. Based on these findings, prospective studies are needed to further confirm its predictive capacity.

      However, there are several concerns about the manuscript, which needs to be clarified or modified.

      1) The use of "multimodal model" will definitely increase workload of the testing. From the results of this manuscript, the integration of multimodal data did not significantly outperform the EM-based model. Is this kind of integration necessary? Is that tool really cost-effective? The authors did not convince me of its necessity, advantages, and clinical application.

      To provide further evidence supporting the advantages of using multimodal model (stack model) over EM-based model, we performed the DeLong test and provided data in Table S7 and Figure S6. Our data show that the stack model outperformed the EM-based model, with significantly higher AUC (AUC difference = 0.0286, p<0.0001). Moreover, the stack model exhibited significantly higher sensitivity for detecting cancer patients of five cancer types in both discovery (73.8% versus 59.5%, p<0.0001, Figure S6A) and validation cohort (72.4% versus 61.5%, p=0.0002, Figure S6B) at comparable specificity of > 95%. The number of misclassified cases were lower when using stack model as compared to the EM-based model (Figure S6C and S6D). Strikingly, we observed that the stack model significantly improved the sensitivity for detecting lung cancer patients compared to the EM based model in both discovery (78.5% versus 44.1%, Figure S6A) and validation cohort ( 83.7% versus 55.8%, Figure S6B), indicating that other ctDNA signatures are also the important biomarkers for detecting lung cancer. Therefore, we conclude that the combination of multiple signatures of ctDNA, ie. the multimodel approach, could improve the sensitivity of multi-cancer detection.

      Given the same wet lab protocol, the difference in computational time between a single EM-based model and the stack model is about 10-11 minutes per sample, but the real difference in analysis time can be reduced to ~1 min/sample by parallelization. With regards to the wet lab protocol, an important novelty of SPOT-MAS technology is its all-in-one approach that enables simultaneous analysis of different ctDNA signatures using a single blood draw and a single library reaction, greatly reducing the experimental cost. Thus, we strongly argue that our approach improves the detection sensitivity by increasing the breadth of ctDNA analysis while achieving cost effectiveness for sample preparation and sequencing with negligible trade-off of analysis time .

      We have also added the following sentences in the discussion to clarify this point. (Line 618-625)

      “Moreover, this study showed that the feature of EM achieved the highest performance among the five examined ctDNA signatures in discriminating cancer from healthy controls (Figure S6). Importantly, we found that combining EM with other ctDNA signatures in a stack model could further improve the sensitivity for detecting cancer samples, with significant improvement for lung cancer patients (Figure S6A and S6B). These findings highlighted that the multimodal analysis of multiple ctDNA signatures by SPOT-MAS could increase the breadth of ctDNA feature analysis, thus enhancing the detection sensitivity while maintaining the low cost of sample preparation and sequencing.”

      2) The baseline characteristics of part of the enrolled patients are not clear. It seems that some of the cancer patients were diagnosed only by imaging examinations. The manuscript described "staging information was not available for 25.7% of cancer patients, who were confirmed by specialized clinicians to have non-metastatic tumors". I have no idea how did this confirmation make? According to clinicians' experience only?

      Our study only recruited cancer patients with non-systemic-metastatic stages (Stage I-IIIA) in which cancer is localized to the primary sites and has not spread to other organs. We excluded patients who were diagnosed with metastatic stage IIIB and IV cancer. All healthy subjects were confirmed to have no history of cancer at the time of enrollment. They were followed up at six months and one year after enrollment. The majority of cancer patients (74.3%) were confirmed to have cancer by abnormal imaging examination and subsequent tissue biopsy confirmation of tumor staging and metastasis status. For patients with unavailable staging information (25.7%), they initially went to the study hospitals for imaging examination. Upon receiving positive imaging results (MRI scan or CT scan), they moved to another hospital for surgery, leading to missing tumor staging information at the original study hospitals. The metastasis status of these patients were later obtained via communications between the clinicians at the study hospitals and the clinicians at the surgery hospitals, subject to existing data sharing agreement between the two hospitals. For those with metastatic cancer or unclear metastatic status, they were excluded from our study.

      We have added the following sentences in the method (Line 127-135) and discussion section (Line 679-688).

      “Cancer patients were confirmed to have cancer by abnormal imaging examination and subsequent tissue biopsy confirmation of malignancy. Cancer stages were determined by the TNM (Tumor, Node, Metastasis) system classification according to the American Joint Committee on Cancer and the International Union for Cancer Control. Our study only recruited cancer patients with non-systemic-metastatic stages (Stage I-IIIA) in which cancer is localized to the primary sites and has not spread to other organs. We excluded patients who were diagnosed with metastatic stage IIIB and IV cancer. All healthy subjects were confirmed to have no history of cancer at the time of enrollment. They were followed up at six months and one year after enrollment to ensure that they did not develop cancer.”

      “For patients with unavailable staging information, their initial imaging examinations were conducted at the study hospitals. However, subsequent tests and surgical procedures were performed at a different hospital, as per the patients' preferences. Consequently, the original study hospitals lacked access to comprehensive tumor staging data. To address this limitation, the metastasis status of these patients was obtained via communication channels between the clinicians at the study hospitals and those at the surgery hospitals. This enabled the retrieval of limited information, adhering to an established data-sharing agreement between the two institutions. To maintain the robustness of our analysis, patients diagnosed with metastatic cancer or those with indeterminate metastatic status were subsequently excluded from the study.”

      3) It seems that one of the important advantages of this new model is the low depth coverage in comparing to previous screening models for cancer. The authors should discuss more on the reason why the new model could achieve comparable predictive accuracy with an obviously lower sequencing depth.

      We thanked the reviewer for the suggestion. We have added the following sentences in the discussion to explain why our assay could achieve good performance at low depth sequencing. (Line 571-584)

      “However, the low amount of ctDNA fragments in plasma samples of patients with early-stage cancer as well as the molecular heterogeneity of different cancer types are known as the major challenges for liquid biopsy based multi-cancer detection assays. Thus, sequencing at high depth coverages is required to capture enough informative cancer DNA fragments in the finite plasma sample to achieve early cancer detection. In support to this notion, many groups (1-4) have developed assays that exploited high depth coverage of sequencing to detect ctDNA fragments in plasma of early stage cancer patients. However, this strategy might not be cost effective and feasible for population wide screening in developing countries. Alternatively, we argued that increasing breadth of ctDNA analysis could maximize the ability to detect ctDNA fragments with heterogeneous genetic and epigenetic changes at shallow sequencing depth, thus improving the sensitivity for multicancer detection. To demonstrate the feasibility of this approach, we built a stacking ensemble model to combine nine different ctDNA signatures and demonstrated its superior performance on cancer detection in comparison to single-feature models (Figure 7B and 7C).”

      4) The readability of this manuscript needs to be improved. The focus of the background section is not clear, with too much detail of other studies and few purposeful summaries. You need to explain the goals and clinical significance of your study. In addition, the results section is too long, and needs to be shortened and simplified. Move some of the inessential results and sentences to supplementary materials or methods.

      We thank the reviewer for these constructive suggestions. Accrodingly, we have reduced the details of other studies (Line 85-91) as follows:

      “In recent years, there has been considerable interest in exploring the potential of ctDNA alterations for early detection of cancer (5, 6). One such approach is the PanSeer test, which uses 477 differentially methylated regions (DMRs) in ctDNA to detect five different types of cancer up to four years prior to conventional diagnosis (7). The DELFI assay employs a genome-wide analysis of ctDNA fragment profiles to increase sensitivity in early detection (1). Recently, the Galleri test has emerged as a multi-cancer detection assay that analyses more than 100,000 methylation regions in the genome to detect over 50 cancer types and localize the tumor site (8).”

      We have modified the text in the introduction to explain the goals and clinical significance of our study (Line 111-123)

      “In this study, we aimed to expand our multimodal approach, SPOT-MAS, to comprehensively analyze methylomics, fragmentomics, DNA copy number and end motifs of cfDNA and evaluate its utility to simultaneously detecting and locating cancer from a single screening test.” “Our findings demonstrate that the multimodal approach of SPOT-MAS enables profiling of multiple ctDNA signatures across the entire genome at low sequencing depth to detect five different cancer types in their early stages. Beyond detecting the presence of cancer signals, our assay was able to predict the tumor location, which is important for clinicians to fast-track the follow-up diagnostic and guide necessary treatment. Thus, SPOT-MAS has the potential to become a universal, simple, and cost-effective approach for early multi-cancer detection in a large population.”

      Reviewer #2 (Public Review):

      The authors tried to diagnose cancers and pinpoint tissues of origin using cfDNA. To achieve the goal, they developed a framework to assess methylation, CNA, and other genomic features. They established discovery and validation cohorts for systematic assessment and successfully achieved robust prediction power.

      1) Still, there are places for improvement. The diagnostic effect can be maximized if their framework works well in early-stage cancer patients. According to Table 1, about 10% of the participants are stage I. Do these cancers also perform well as compared to late stage cancers?

      We have performed the comparison of SPOT-MAS performance on different stages and provided the data in Supplementary table S8 and Supplementary Figure S4J and S4L. Our data showed that SPOT-MAS achieved lower sensitivity for detecting stage I and II cancers as compared to stage IIIA cancers in both discovery (61.54% and 69.82% for stage I and II respectively versus 78.67% for stage IIIA, Supplementary table 8) and validation cohort (73.91% and 62.32% for stage I and II, respectively versus 88.31% for stage IIIA, Supplementary table 8). This suggested that cancer stages can influence the performance of our models.

      2) Can authors show a systematic comparison of their method to other previous methods to summarize what their algorithm can achieve compared to others.

      We have conducted a systematic comparison of our method with others in the Supplementary Table S11.

      Reviewer #1 (Recommendations For The Authors):

      There are still points for the authors to clarify and consider for incorporation into revision.

      • Please first clarify the issues mentioned in "public review". Several complements are needed.

      We have addressed all of the reviewer’s comments in “public review”.

      1) Line 72-73: Different approaches of early cancer screening assays have different features, application scenarios, and of course, limitations. It's too vague to describe in this way. More importantly, diagnosis of malignancies relies on pathological diagnosis, I don't think the results of unsuccessful screening would be overdiagnosis and overtreatment. That's overstatements.

      We have rewritten the statement as follows (Line 72-75)

      “Although currently guided screening tests have each been shown to provide better treatment outcomes and reduce cancer mortality, some of them are invasive, thus having low accessibility. Importantly, most of them are single cancer screening tests, which may result in high false positive rates when used sequentially.”

      2) Line 115-130: The findings in this study shouldn't be introduced here.

      We have removed this section.

      3) Line 496-498: It surprised me that the model performed even better in independent validation cohort, which is quite different from the usual situations. Please explain it.

      We agree with the reviewer that model performance in independent validation cohort is often lower than in discovery cohort. In our case, we have carefully confirmed our data by utilizing cross-validation (CV). Cross-validation is a widely used process in which the data being used for training the model is separated into folds or partitions and the model is trained and validated for each fold; the performance estimates are then calculated to obtain mean and confidence interval (GraphPad Prism, Wilson/Brown method). To further confirm our findings, we have increased the cross-validation fold into 50, and consistently detected no significant difference in the performance between Discovery and Validation cohorts (p=0.1277, DeLong’s test).

      We have added the following sentence in the discussion to explain this (Line 633-635)

      “Despite a slightly higher AUC value in the validation cohort compared to the discovery cohort, no significant differences in AUC values were observed between the two cohorts at CV of 10 or 50 (p=0.1277, DeLong’s test).”

      4) Line 499-501: For the cut-off value selection, the authors thought that for cancer screening, specificity is more important than sensitivity? It's controversial. The sensitivity is only approximately 70%, I think that a missed diagnosis is even worse.

      We agree with the reviewer that both specificity and sensitivity are important metrics of a cancer detection test. However, there is a trade-off between sensitivity and specificity and the preference for either one of them remains a controversial topic. For a screening test, the preference should be determined by considering the prevalence of the disease, in this case - cancer. The low prevalence of cancers indicates that even a small percentage of false-positive test results due to low specificity of the assay, spread across a national population, would hugely increase the demand for confirmatory imaging as well as biopsy sampling of imaging-detected benign abnormalities (9). Thus, false positives have obvious implications for health-care resources as well as patient well-being. Conversely, higher sensitivities will make sure that more cancer cases are detected and avoid delays in diagnosis. To mitigate the impact of insufficient sensitivity of a cancer screening test, it is important to consult the test-takers that current liquid biopsy tests should only be used as a complementary approach to the available diagnosis tests to increase rates of cancer detection. To be used as a stand-alone test, further work is required to improve its performance, with more focus on increasing sensitivity while maintaining high specificity.

      We have added the following sentences in the discussion to explain why we set a high threshold of specificity (Line 660-671)

      “For an effective screening test, careful consideration of disease prevalence, cancer in this context, is imperative. Given the low prevalence of cancers, even a small proportion of false-positive test results arising from reduced assay specificity, if extrapolated to a national population, could significantly escalate the need for confirmatory imaging and biopsy procedures for benign abnormalities detected during screening. Thus, false-positives can have substantial implications for both healthcare resources and patient well-being. Conversely, a screening test with high sensitivity ensures that most cancer cases are detected and minimizes delays in diagnosis. To address potential limitations posed by low sensitivity in cancer screening tests, we suggest that current liquid biopsy tests should be employed as a complementary approach to existing diagnostic methods to enhance cancer detection rates. To be used a stand-alone test, further work is required to improve its performance, with a particular emphasis on improving sensitivity while preserving high specificity.”

      5) The methylation profiles have been used broadly in ctDNA, while your also integrated the fragmentomics, copy number aberration and end motif into the new model. In the discussion section, it would be better to further compare your new model with several previous models based on conventional ctDNA methylation markers (10, 11) for early detection of malignancies. What are the advantages of adding the other two types of data? Why the new model could achieve comparable predictive accuracy with an obviously lower sequencing depth?

      We thank the reviewer for the suggestion. We have added the following sentences in the discussion to highlight the novelty of our multimodal approach. (Line 587-610)

      “Previous studies have reported that methylation changes at target regions could be exploited for detecting ctDNA in plasma of patients with early-stage cancer (10, 11).”

      “In addition to methylation alterations, recent studies have revealed that the DNA copy number, fragmentomics profile (1) and end motif profile (12) at genome wide scales have been shown as useful features for healthy-cancer classification. Therefore, we propose that the combination of these markers might provide added value to increase the performance of liquid biopsy assays. We demonstrated that the same bisulfite sequencing data could be used to identify somatic CNA (Figure 4), cancer-associated fragment length (Figure 5) and end motifs (Figure 6), highlighting the advantage of SPOT-MAS in capturing the broad landscape of ctDNA signatures without high cost deep sequencing. For cancer-associated fragment length, we pre-processed this data into five different feature tables to better reflect the information embedded within the data. Overall, we integrated multiple features of ctDNA including methylation, fragment length, end motif and copy number changes into a multi-cancer detection model and demonstrated that this approach could distinguish healthy individuals with patients from five popular cancer types. This strategy enables increased breadth of ctDNA analysis at shallow sequencing depth to overcome the limitation of low amount of ctDNA fragments in plasma samples as well as molecular heterogeneity of cancers.”

      Moreover, we have conducted a systematic comparison of our method with others in the Supplementary Table 11.

      6) Line 667-668: The wording should be modest. "Successfully detect and localize" is not appropriate.

      We have rewritten the sentence. (Line 713-716)

      “Our large-scale case-control study demonstrated that SPOT-MAS, with its unique combination of multimodal analysis of cfDNA signatures and innovative machine-learning algorithms, can detect and localize multiple types of cancer with high accuracy at a low-cost sequencing.”

      Reviewer #2 (Recommendations For The Authors):

      1) Are the patients and controls all from Vietnam? If I am not mistaken, it is hard to find demographic information for controls. Also it is not clear if samples from controls were processed simultaneously or at a same institution or using the same protocol etc.

      We thank the reviewer for asking this question. All cancer patients and controls are from Vietnam, who were recruited from five hospitals including Medic Medical Center, University Medical Center Ho Chi Minh City, Thu Duc City Hospital, National Cancer Hospital and Hanoi Medical University. At each research sites, blood samples from both cancer patients and healthy subjects were collected in in Streck Cell-Free DNA BCT tubes and subsequently transported to a central laboratory located in Medical Genetics Institute for cfDNA isolation, library preparation and sequencing. In a recent publication (10), we have investigated the impact of logistic time and hemolysis rates of blood samples collected from different clinical sites on cfDNA concentration and sequencing quality. We did not observe any noticeable impact of such variations on cfDNA concentrations or sequencing library yields. However, future analytical validation studies are required to evaluate the impact of variation in sampling technique across different clinical sites on the robustness or accuracy of assay results.

      We have added the following sentences in the discussion to highlight this important point (Line 696-704)

      “At each research sites, blood samples from both cancer patients and healthy subjects were collected in in Streck Cell-Free DNA BCT tubes and subsequently transported to a central laboratory located in Medical Genetics Institute for cfDNA isolation, library preparation and sequencing. In a recent publication (10), we have investigated the impact of logistic time and hemolysis rates of blood samples collected from different clinical sites on cfDNA concentration and sequencing quality. We did not observe any noticeable impact of such variations on cfDNA concentrations or sequencing library yields. However, future analytical validation studies using a larger sample size are required to evaluate the impact of variation in sampling technique across different clinical sites on the robustness or accuracy of assay results.”

      References

      1. Cristiano S, Leal A, Phallen J, Fiksel J, Adleff V, Bruhm DC, et al. Genome-wide cell-free DNA fragmentation in patients with cancer. Nature. 2019;570(7761):385-9.

      2. Cohen JD, Li L, Wang Y, Thoburn C, Afsari B, Danilova L, et al. Detection and localization of surgically resectable cancers with a multi-analyte blood test. Science. 2018;359(6378):926-30.

      3. Liu MC, Oxnard GR, Klein EA, Swanton C, Seiden MV. Sensitive and specific multi-cancer detection and localization using methylation signatures in cell-free DNA. Ann Oncol. 2020;31(6):745-59.

      4. Stackpole ML, Zeng W, Li S, Liu C-C, Zhou Y, He S, et al. Cost-effective methylome sequencing of cell-free DNA for accurately detecting and locating cancer. Nature Communications. 2022;13(1):5566.

      5. Constantin N, Sina AA, Korbie D, Trau M. Opportunities for Early Cancer Detection: The Rise of ctDNA Methylation-Based Pan-Cancer Screening Technologies. Epigenomes. 2022;6(1).

      6. Phan TH, Chi Nguyen VT, Thi Pham TT, Nguyen VC, Ho TD, Quynh Pham TM, et al. Circulating DNA methylation profile improves the accuracy of serum biomarkers for the detection of nonmetastatic hepatocellular carcinoma. Future Oncol. 2022;18(39):4399-413.

      7. Chen X, Gole J, Gore A, He Q, Lu M, Min J, et al. Non-invasive early detection of cancer four years before conventional diagnosis using a blood test. Nature Communications. 2020;11(1):3475.

      8. Jamshidi A, Liu MC, Klein EA, Venn O, Hubbell E, Beausang JF, et al. Evaluation of cell-free DNA approaches for multi-cancer early detection. Cancer Cell. 2022;40(12):1537-49.e12.

      9. Ignatiadis M, Sledge GW, Jeffrey SS. Liquid biopsy enters the clinic - implementation issues and future challenges. Nat Rev Clin Oncol. 2021;18(5):297-312.

      10. Xu RH, Wei W, Krawczyk M, Wang W, Luo H, Flagg K, et al. Circulating tumour DNA methylation markers for diagnosis and prognosis of hepatocellular carcinoma. Nat Mater. 2017;16(11):1155-61.

      11. Luo H, Zhao Q, Wei W, Zheng L, Yi S, Li G, et al. Circulating tumor DNA methylation profiles enable early diagnosis, prognosis prediction, and screening for colorectal cancer. Sci Transl Med. 2020;12(524).

      12. Jiang P, Sun K, Peng W, Cheng SH, Ni M, Yeung PC, et al. Plasma DNA End-Motif Profiling as a Fragmentomic Marker in Cancer, Pregnancy, and Transplantation. Cancer Discovery. 2020;10(5):664-73.

    1. a document in ...Microsoft Supporthttps://support.microsoft.com › en-us › office › use-w...Microsoft Supporthttps://support.microsoft.com › en-us › office › use-w...Open an OpenDocument Text file in Word · Click the File tab. · Click Open. · Click Browse, · To see only the files saved in the OpenDocument format, click the list ...(function(){var uer=false;var eid='z9PoV';(function(){var a=google.c.wh0,b=google.c.frt,c=google.c.frvt,d=google.c.whu;var e=uer,f=Date.now(),g=google.c.sxs?"load2":"load";if(google.timers&&google.timers[g].t){var h=0;if(eid){var k=document.getElementById(eid);k&&(h=Math.floor(k.getBoundingClientRect().top+window.pageYOffset))}b&&google.tick(g,"frt",f);c&&google.tick(g,"frvt",f);google.tick(g,"frts",f);h&&google.c.e(g,"frtp",String(h));d&&!google.c.wh&&(google.c.wh=Math.floor(window.innerHeight||document.documentElement.clientHeight));var l=google.c.wh,m;m=a&&!l?!1:h>=l;for(var n=document.getElementsByTagName("img"),p=0,q=void 0;q=n[p++];)google.c.setup(q,!0,h);google.c.bofr= !1;m&&google.c.ubr(!1,f,h,!e)};}).call(this);})();People also ask

      gg

    1. rk, 1953; Finlayson, B. A., Nonlinear Analysis in Chemical Engineering, McGraw-Hill, New York, 1980; Finlayson, B. A., L. T. Biegler, and I. E. Grossmann, Mathematics in Chemical Engineering, Ullmann's Encyclopedia of Industrial Chemistry, Published Online: 15 DEC 2006, DO

      fbgbt

    1. Author Response

      Thank you for your thorough critique and thoughtful suggestions for improving our manuscript, "Homeostatic Synaptic Plasticity of Miniature Excitatory Postsynaptic Currents in Mouse Cortical Cultures Requires Neuronal Rab3A.” The reviewers’ detailed comments suggest that showing multiple types of graphs to demonstrate the presence of divergent scaling of mEPSC amplitudes in cultures from Rab3A wild type, and its disruption in cultures from Rab3A knockout mice, had the unintended consequence of obscuring the major results of our study. Furthermore, our proposal that the difference in characteristics of scaling of GluA2 receptor expression compared to that of mEPSC amplitudes, based on the ratio plots, indicated that a mechanism other than postsynaptic receptors likely contributes to the homeostatic increase in mEPSC amplitude was not convincing to the reviewers. Reviewers 2 and 3 point out these results might be explained by differences in the limitations and artifacts of the two very distinct techniques, electrophysiology and fluorescence imaging. In the revision we will acknowledge that a greater variability in the signal, or, more issues with signal over noise, might be present in imaging experiments compared to electrophysiology. This could explain the lack of identical effects on GluA2 receptors compared to mEPSC amplitudes in the matched experiments, but we maintain it is also possible that a greater variability in GluA2 responses is biologically meaningful. Further, an issue with the accuracy of imaging experiments to report the true receptor effects would also call into question the conclusion that receptors always increase after activity blockade. Finally, the graphs illustrating the detailed characteristics of scaling with rank order and ratio plots required pooling multiple samples per cell, which precludes application of standard statistical methods to determine whether effects or differences reach statistical significance. Therefore, we will remove the cumulative distribution functions, rank order plots, and ratio plots, and show only analyses that involve a single sample per cell. This major change will simplify and clarify the main findings, that homeostatic plasticity of both mEPSC amplitude and GluA2 receptor expression in mouse cortical cultures involves the synaptic vesicle protein Rab3A operating in neurons rather than astrocytes. We will focus our comparison between mEPSC amplitudes and receptors in the same cultures to differences between the magnitude of effects on the mean or median, and will make clear that overall, our data can be explained by two possibilities: 1) the presynaptic vesicle protein is acting via regulation of postsynaptic receptors alone, or, it is regulating both postsynaptic receptors and another contributor to mEPSC amplitude, possibly amount of transmitter released by a single vesicle. Either way, it is very surprising that this presynaptic protein is involved in postsynaptic changes, so our results represent a novel contribution to the field of homeostatic plasticity. In sum, the changes we propose should go a long way towards addressing the majority of the reviewers’ major critiques.

      A related issue raised by the reviewers was that the model describing potential presynaptic mechanisms of Rab3A in homeostatic plasticity was not supported by direct evidence (Figure 10). We meant the model to introduce the possibility of a presynaptic contribution to mEPSC amplitude and to stimulate future research, but clearly did not communicate its speculative nature, neither in the Figure legend nor in our discussion of potential mechanisms. In the revision, we will restrict the model to the direct findings in this study. Additionally, we will state where appropriate, that while previous findings at the mouse NMJ are consistent with a presynaptic role for Rab3A (Wang et al., 2011), in the current study there is no direct evidence for this idea in cortical cultures other than the quantitative differences in the fold increases in mEPSC amplitudes and GluA2 receptors which were assayed in the same cultures.

      We will submit a revised version addressing each of the reviewer’s concerns and suggestions as described above and below; these major modifications will greatly improve the readability of the manuscript and clarify the main results.

      Reviewer #1

      Koesters and colleagues investigated the role of the presynaptic small GTPase Rab3A in homeostatic scaling of miniature synaptic transmission in primary mouse cortical cultures using electrophysiology and immunohistochemistry. The major finding is that TTX incubation for 48 hours does not induce an increase in the amplitude of excitatory synaptic miniature events in neuronal cultures derived from Rab3A KO and Rab3A Earlybird mutant mice. NASPM application had comparable effects on mEPSC amplitude in control and after TTX, implying that Ca2+-permeable glutamate receptors are unlikely modulated during synaptic scaling. Immunohistochemical analysis revealed an increase in GluA2 puncta size and intensity in wild type, but not Rab3A KO cultures. Finally, they provide evidence that loss of Rab3A in neurons, but not astrocytes, blocks homeostatic scaling. Based on these data, the authors propose a model in which presynaptic Rab3A is required for homeostatic scaling of synaptic transmission through GluA2-dependent and independent mechanisms.

      While the title of the manuscript is mostly supported by data of solid quality, many conclusions, as well as the final model, cannot be derived from the results presented. Importantly, the results do not indicate that Rab3A modulates quantal size on both sides of the synapse. Moreover, several analysis approaches seem inappropriate.

      The following points should be addressed:

      1) The model shown in Figure 10 is not supported by the data. The authors neither provide evidence for two different functional states of Rab3A being involved in mEPSC amplitude modulation, nor for a change in glutamate content of vesicles. Furthermore, the data do not fully support the conclusion of a presynaptic role for Rab3A in homeostatic scaling.

      We will revise the model, removing presynaptic mechanisms for Rab3A and restricting it to the direct findings in this study.

      2) The analysis of mEPSC data using quantile sampling followed by ratio calculation is not meaningful under the tested experimental conditions because of the following reasons:

      (i) The analysis implicitly assumes that all events have been detected. The prominent mEPSC frequency increase after TTX suggests that this is not the case, i.e., many (small) mEPSCs are likely missed under control conditions.

      We explicitly addressed the potential contribution of missed mEPSCs that are below threshold in (Hanes et al., 2020). We found that even simulating a threshold of 7 pA, applied to data artificially modified by uniformly multiplying the control data set, did not generate a ratio plot with the increasing ratio over 75% of the data that we observe in the experimental data. Overall, the findings from simulating a threshold and a uniform multiplicative factor illustrate that the threshold issue does not cause major changes to the data. Furthermore, in cultures from Rab3A+/+ mice from the Rab3AEbd/+ colony, the mEPSC amplitudes were significantly smaller than those recorded in cultures from Rab3A+/+ mice from the Rab3A+/- colony (lines 327-329, 11 pa vs 13 pA), indicating that if there were smaller mEPSCs occurring in the Rab3A+/+ data set, we would have detected them. Although for these reasons we feel it is unlikely our ratio plot analysis is invalid, to clarify the result that homeostatic plasticity of mEPSC amplitude requires functioning Rab3A, we will remove the ratio plots.

      (ii) The analysis is used to conclude how events of a certain size are altered by TTX treatment. However, this analysis compares the smallest mEPSCs of the TTX condition with the smallest control mEPSCs, but this is not a pre-post experimental design. Variation between cells and between coverslips will markedly affect the results and lead to misleading interpretations.

      The rank order plot is a well-established plot to examine the mathematical transformation caused by homeostatic plasticity, first used in (Turrigiano et al., 1998). We included it here to facilitate comparison of our findings with previous results. We introduced the ratio plot in (Hanes et al., 2020), finding it shows more clearly differences occurring in the range of small mEPSC values. The reviewer is correct in that we are assuming the smallest mEPSCs before treatment should be matched with the smallest mEPSCs after treatment. It is almost impossible to do a pre-post experimental design for mEPSCs. Even when applying a treatment, for example acute perfusion with a receptor antagonist, to a single cell and recording mEPSCs before and after the treatment, it is not a true pre-post design at the level of mEPSC amplitudes, which come from many different inputs. The power of the method is that different characteristic mathematical transformations for different experimental conditions (e.g., genotype or activity protocol) support the idea that those conditions either involve different mechanisms or have altered the mechanism. Such differences might be missed by only comparing means or medians. However, we found no evidence that loss of Rab3A or expression of the Rab3A Earlybird mutant altered the mathematical transformation due to homeostatic plasticity, other than to reduce its magnitude across all amplitudes. Therefore, including these complex analyses is not adding anything to the finding that Rab3A plays a role in homeostatic plasticity of mEPSC amplitudes and they will be removed in the revision.

      (iii) The ratio (TTX/control) vs. control plots seem to suffer from a division by small value artifact (see Figure 6F).

      The reviewer is referring to findings on the ratio plot for receptor cluster area. Because the large ratios for the smallest control areas occur in the cultures prepared from wild type mice, and to a much lower extent in cultures prepared from Rab3A knockout mice, we think there is a biologically relevant increase in the TTX/CON ratio, since an artifact due to division by small values should be present in both data sets. However, we cannot rule out that the differences in ratio plot behavior between receptors and mEPSC amplitudes result from the different limitations in detection of receptor clusters vs. the limits of detection of mEPSCs, so we will remove the ratio plots and focus on comparison of means or medians.

      Correspondingly, ratio-analysis differs considerably for different control conditions (Fig. 1Giii, Fig. 2Giii, Fig. 6C, Fig. 9A).

      The reviewer is correct to point out that the ratio plot shows differences across control conditions (note, these differences are not obvious with the more standard rank order plot). The magnitude of the 50th percentile ratio differs across control conditions, and behaviors of the largest mEPSCs also differ, with some ratios going down at the highest control amplitudes (1Giii, 6C), and others continuing to increase with increasing control amplitude (2Giii, 9A). They all share the divergent increasing ratio from smallest mEPSC amplitude to around the 20 pA level. We attribute the differences in magnitude to the differences in experimental conditions: 1Giii is Rab3A+/+ from the +/+ colony; 1Giii is Rab3A+/+ from the Ebd/+ colony; 6C is a set of Rab3A+/+ cultures assayed several years after the set in 1Giii; 9A is a different culture condition altogether, with neurons being plated onto an already formed bed of astrocytes. Effects on the largest mEPSCs are likely attributable to the small number and high variability of amplitudes in this range. Since the variability in the very sensitive ratio plot have taken away from the main findings of homeostatic plasticity being disrupted in the absence of functioning Rab3A in neurons, we will remove the rank-order and the ratio plots from the manuscript.

      3) As noted by the authors in a previous publication (Hanes et al. 2020), statistical analysis of CDFs suffers from ninflation. In addition, the quantile sampling method chosen violates an important assumption of the K-S test. Indeed, pvalues for these comparisons are typically several orders of magnitude smaller. Given that the statistical N most likely corresponds to the number of cultures (see, e.g., https://doi.org/10.1371/journal.pbio.2005282), CDF comparisons are not informative and should thus not be used to draw conclusions from the data. The plots can be informative, though.

      As the reviewer acknowledges, we were very careful in (Hanes et al., 2020) to state that the p values could not be used to determine significance in the KS test of cumulative distributions for pooled data because the KS test assumes a single sample per cell. We also suggested in that study that the p values could be used in a comparative way for looking at data sets with similar (inflated) n values to say something about bigger or smaller differences. We failed to reiterate those caveats here. In reviewing the article “What is N” by (Lazic et al., 2018) (which we very much appreciate being shown by the reviewer), we agree that in the current study where we are attempting to show how the effect of homeostatic plasticity is or is not altered by loss of Rab3A function, it is imperative that we be able to make conclusions about statistical significance. The pooling approach is essential for having some sense of the mEPSC amplitude distributions, but that is not necessary for looking at the effect of Rab3A. Therefore, we will remove all analyses that involve pooling of multiple mEPSC amplitudes per cell.

      4) How does recoding noise and the mEPSC amplitude threshold affect "divergent scaling"?

      We addressed this in our 2020 paper (Hanes et al., 2020) where we showed that the experimental homeostatic increase in mEPSC amplitude cannot be simulated with uniform, multiplicative synaptic scaling whether we included or excluded distortion caused by a detection threshold.

      5) What is the justification for the line fits of the ratio data/how was the fit range chosen?

      We are assuming the reviewer is referring to the line fits for the rank-order data. If so, the fit range is the entire range of the data. This issue will be addressed by the removal of the rank-order plots from the manuscript.

      6) TTX application induces a significant increase in mEPSC amplitude in Rab3A-/- mice in two out of three data sets (Figs. 1 and 9). Hence, the major conclusion that Rab3A is required for homeostatic scaling is only partially supported by the data.

      Based on the p-values for comparison of means with a Kruskal-Wallis test, we would argue that TTX application does not show a significant increase in mEPSC amplitude in Rab3A-/- neurons (Figure 1 p-value = .318; Figure 9 p-value = .125) when comparing to untreated control mEPSC amplitude means. It is only when we use the KS test and the inflated n’s that we get a barely significant results, p = 0.042. Based on the Lazic article (Lazic et al., 2018), we would now conclude that we cannot use the KS p value in that analysis. We have tried to be clear that the effect of TTX application on mEPSC amplitude in Rab3A-/- neurons is not completely abolished, but rather is dramatically reduced, which we acknowledge in the manuscript (line 279). This issue will be addressed by removal of CDFs from the manuscript.

      7) Line 289: A comparison of p-values between conditions does not allow any meaningful conclusions.

      Although we feel that comparison of magnitude of effects can be stated in a qualitative way for similar sized pooled data sets with larger or smaller p-values, we agree that statistical significance has no meaning. This issue will be addressed by removing the CDF plots from the manuscript.

      8) There is a significant increase in baseline mEPSC amplitude in Rab3AEbd/Ebd (15 pA) vs. Rab3Aebd/+ (11 pA) cultures, but not in Rab3A-/- (13.6 pA) vs. Rab3A+/- (13.9 pA). Although the nature of scaling was different between Rab3AEbd/Ebd vs. Rab3AEbd/+, and Rab3AEbd/Ebd with vs. without TTX, the question arises whether the increase in mEPSC amplitude in Rab3AEbd/Ebd is Rab3A dependent. Could a Rab3A independent mechanism occlude scaling?

      We have acknowledged in the manuscript that one explanation for a failure to exhibit homeostatic plasticity in the cultures from Rab3A Earlybird mutant mice is that the already large basal amplitude occludes any further increase (line 366). In the revision we will make sure the occlusion possibility is highlighted, but we will also discuss other proteins that have been implicated in homeostatic plasticity that have caused an increase in mEPSC amplitude and/or AMPA receptors at baseline, for example, Arc/Arg3.1 KO (Shepherd et al., 2006; Beique et al., 2011); Homer KO (Hu et al., 2010) and inhibition of mir-186-5p (Silva et al., 2019).

      9) Figure 4: NASPM appears to have a stronger effect on mEPSC frequency in the TTX condition vs. control (-40% vs. 15%). A larger sample size might be necessary to draw definitive conclusions on the contribution of Ca2+-permeable AMPARs.

      We will acknowledge that Ca2+-permeable AMPARs could be contributing to the frequency increase following activity blockade and will also include analyses of frequency throughout the manuscript.

      10) The authors discuss previous papers showing changes in VGLUT1 intensity. Was VGLUT intensity altered in the stainings presented in the manuscript?

      We will perform analyses VGLUT1 intensity and include them in the manuscript.

      11) The change in GluA2 area or fluorescence intensity upon TTX treatment in controls is modest. How does the GluA2 integral change?

      The changes in GluA2 integrals look exactly like the changes in cluster size and were not included to simplify the results. But with the removal of the CDFs, rank order, and ratio plots, we can easily include integral measurements. What we did not observe was an additive effect with intensity and size such that the effects on integral were of greater magnitude or statistical significance than either alone. We will include the integral plots in the revised manuscript.

      12) The quantitative comparison between physiology and microscopy data is problematic. The authors report a mismatch in ratio values between the smallest mEPSC amplitudes and smallest GluA2 receptor cluster sizes (l. 464; Figure 8). Is this comparison affected by the fluorescence intensity threshold?

      What was the rationale for a threshold of 400 a.u. or 450 a.u.?

      We have acquired AOIs of receptor clusters at multiple threshold levels, and can examine whether the results are altered when using a low, medium or high threshold level.

      How does this threshold compare to the mEPSC threshold of 3 pA?

      The issue with values being below threshold in untreated cultures has been the concern in interpreting effects on mEPSC amplitudes, specifically, whether this mismatch contributes to divergent scaling. A problem of values being below a toohighly set threshold in the control and becoming detectable after the homeostatic plasticity produces a lower ratio than expected, because now there are values in the treated condition that were not present in the control condition. Instead, for GluA2 receptor cluster size, we observed higher TTX/CON ratios at the low end of the data set. So, based on this, the thresholds chosen for imaging are not having the same effect, if that is what is being asked. This issue will be addressed by removing ratio plots.

      The conclusion that an increase in AMPAR levels is not fully responsible for the observed mEPSC increase is mainly based on the rank-order analysis of GluA2 intensity, yielding a slope of ~0.9. There are several points to consider here: (i) GluA2 fluorescence intensity did increase on average, as did GluA2 cluster size. (ii) The increase in GluA2 cluster size is very similar to the increase in mEPSC amplitude (each approx. 18-20%). (iii) Are there any reports that fluorescence intensity values are linearly reporting mEPSC amplitudes (in this system)?

      We agree that our data show GluA2 receptors increase as based on cluster size, and did not mean to imply otherwise. Our conclusion that there is another contributor to mEPSC amplitude other than receptors is based on two main findings, 1) that the ratio plots for mEPSC amplitudes and receptor cluster size have distinctively different behaviors, and 2) that there are differences in either magnitude or direction of the TTX effect across 6 matched cultures, 3 from WT animals and 3 from TTX animals (see more explanation of this point below, in response to Reviewer 3). To our knowledge, no one has reported homeostatic plasticity effects on a culture by culture basis, and no one has compared imaging results and physiological results for the same cultures. We will remove the ratio plots and the conclusions based on the differences in behavior for mEPSC amplitudes and receptor cluster size. We will acknowledge in the revision that the differences in magnitude and direction across the 6 matched cultures could be due to the differences in limitations and artifacts of imaging fluorescent antibody staining vs. the limitations and artifacts of detecting mEPSCs electrophysiologically. However, we will continue to state that our results could also be due to the possibility that mEPSC amplitude is not changing in lockstep with receptor levels in every situation. To support this proposal, we will discuss those articles that include both measurements, and point out where mEPSC amplitude measurements and receptor levels match and where they do not.

      Antibody labelling efficiency, and false negatives of mEPSC recordings may influence the results. The latter was already noted by the authors.

      We will add the caveat that antibody labeling efficiency can vary between coverslips. Although we prepared single solutions that were applied to all coverslips in an experiment, this was not possible for the primary antibody to GluA2, which was added to live cultures in individual wells.(iv) It is not entirely clear if their imaging experiments will sample from all synapses. We will add to Materials and Methods that we sample from all the synapses that could be detected by the researcher on the primary dendrite of the pyramidal cell.

      Other AMPAR subtypes than GluA2 could contribute, as could kainate or NMDA receptors.

      This is true, other AMPARs (GluA3 and/or GluA4) could be contributing, but we only looked at the receptors well established to be contributing to homeostatic plasticity (GluA1 and GluA2). We will acknowledge the possible contribution of other AMPARs in the revised manuscript.

      Furthermore, the statement "complete lack of correspondence of TTX/CON ratios" is not supported by the data presented (l. 515ff). First, under the assumption that no scaling occurs in Rab3A-/- , the TTX/CON ratios show a 20-30% change, which indicates the variation of this readout. Second, the two examples shown in Figure 8 for Rab3A+/+ are actually quite similar (culture #1 and #2), particularly when ignoring the leftmost section of the data, which is heavily affected by the raw values approaching zero.

      We will remove the ratio plots from the manuscript and the arguments about differences between GluA2 receptors and mEPSC amplitudes that were based on them. However, we maintain that we have demonstrated a lack of consistent effect for GluA2 receptors and mEPSCs in the matched culture experiments. Yes, the readout of homeostatic plasticity in ratio plots for mEPSCs in the Rab3AKO reach over 1.1 in Figure 1, and as high a 1.2 in the cultures where Rab3AKO neurons were plated on Rab3AWT glia (Figure 9). Our point is that if we had measured GluA2 receptor responses to TTX in those same experiments, the ratios should have been above 1. However, in the experiments in which we measured both mEPSCs and GluA2 receptors, the ratios do not match. In culture #1, the ratio for mEPSCs was at 1 for more than 50% of the data, but for GluA2 receptors, was below 1 for more than 50% of the data. In culture #3, the ratio for mEPSCs was below 1 for more than 50% of the data, but for GluA2 receptors was close to 1.2 for 50% of the data. Only for culture #2 do the ratios appear to match. In the revised manuscript, the evidence that GluA2 receptors and mEPSCs are not changing in parallel will be based on the behavior of means or medians in untreated vs TTXtreated cultures, rather than ratio plots. It could be argued that we need a greater number of matched experiments to make conclusions, but the whole point of a matched experiment is that it should always show the same result—we are no longer dealing with the variability in the homeostatic plasticity itself. We will add a statement that the only three explanations left for the failure of mEPSC amplitudes and GluA2 receptors to change in parallel are 1) a true mismatch, 2) a sampling issue, or 3) technical artifacts that occur in one culture and not another.

      13) Figure 7A: TTX CDF was shifted to smaller mEPSC amplitude values in Rab3A-/- cultures. How can this be explained?

      Figure 7A depicts the pooled data that are shown separately for 3 cultures in Figure 8. We observed mEPSC amplitudes being smaller after TTX treatment in some range of the data for all three Rab3AKO cultures, suggesting that this may be a biological result rather than random variation around no change (which would be a ratio of 1). However, this effect is not significant at the level of means, nor in the KS test (which has the issue of inflated n in any case), so we did not highlight this point. This issue will be addressed by the removal of the CDF plots from the manuscript.

      Reviewer #2

      Technical concerns:

      1) The culture condition is questionable. The authors saw no NMDAR current present during spontaneous recordings, which is worrisome since NMDARs should be active in cultures with normal network activity (Watt et al., 2000; Sutton et al., 2006).

      The (Watt et al., 2000) study recorded mEPSCs in 0 Mg2+ (Figure 1). The (Sutton et al., 2006) study also shows an average mEPSC waveform (Figure 1D) that was recorded from in 0 Mg2+. Our extracellular recording solution contains Mg2+ (1.3 mM) so we likely are not observing NMDA-mediated currents because they are blocked with Mg2+ when strong depolarizations are prevented with TTX in the recording solution. We will add the idea that the NMDA currents are blocked by Mg2+ to Material and Methods.

      It is important to ensure there is enough spiking activity before doing any activity manipulation.

      We agree that it would be best if network spiking activity were monitored alongside mEPSC recordings, for example by culturing on multi-electrode arrays. Data from these measurements might explain culture to culture variability in homeostatic responses. To our knowledge, most other studies investigating homeostatic plasticity do not monitor network spiking activity in the same cultures that assay mEPSC amplitudes. This is something that the field should move towards. We will add the caveat that activity was not directly measured to the manuscript.

      Similarly, it is also unknown whether spiking activity is normal in Rab3A KO/Ebd neurons.

      Since we did not measure spiking activity, we cannot address whether the disruption in homeostatic plasticity in cultures prepared from Rab3A KO and Rab3AEbd/Ebd mutant mice is due to an alteration in network activity. If activity were already low in cultures prepared from these genetically altered mice, we would expect mEPSC amplitudes to be increased, compared to those measured in cultures from WT animals. That is not the case in cultures from Rab3A KO mice, so it is unlikely that network activity is reduced. However, mEPSC amplitudes are increased in Rab3AEbd/Ebd cultures, leaving open this possibility. It would have to be a defect unique to neurons in culture, since the Rab3AEbd/Ebd mouse appears normal in every way, suggesting action potential activity is occurring in the brains of these animals in vivo. We will add the possibility that activity is altered in the cultures from Rab3AKO and Rab3AEbd/Ebd to the manuscript.

      2) Selection of mEPSC events is not conducted in an unbiased manner. Manually selecting events is insufficient for cumulative distribution analysis, where small biases could skew the entire distribution. Since the authors claim their ratio plot is a better method to detect the uniformity of scaling than the well-established rank-order plot, it is important to use an unbiased population to substantiate this claim.

      MiniAnalysis (a standard program used for mEPSC event detection and analysis) selects many false positives with the automated feature (due to the very small sizes of events that are close to the noise level) so manual re-evaluation of the automated process is necessary to eliminate false positives. As soon as there is a manual step, bias is introduced. Interestingly, a manual reevaluation step was applied in a recent study that describes their process as ‘unbiased” (Wu et al., 2020). The alternative is to apply a very large threshold, reducing or eliminating false positives. However, this has the effect of biasing the data towards large events. In sum, we do not believe it is currently possible to perform a completely unbiased detection process. We feel that it is important to include as many small events as possible to reduce the problem of having events in the TTX experimental group that were not matched by events in the control experimental group, for the rank order and ratio plots, so setting the threshold low and manually detecting events accomplishes this. We will add to the Materials and Methods section that the person selecting events did not have information on whether the record was from an untreated or a TTX-treated cell at the time of selection. All of these issues, the potential for skewing the CDFs, and bias potentially interfering in the true rank order and ratio relationships, are addressed by removal of the CDFs, ratio and rank-order plots from the manuscript.

      3) Immunohistochemistry data analysis is problematic. The authors only labeled dendrites without doing cell-fills to look at morphology, so it is questionable how they differentiate branches from pyramidal neurons and interneurons. Since glutamatergic synapses on these two types of neuron scale in the opposite directions, it is crucial to show that only pyramidal neurons are included for analysis.

      MAP2, in addition to labeling dendrites, also labels the cell body, and we used the cell structure revealed by MAP2 staining to select pyramidal-shaped neurons. The selection of the primary dendrite of a pyramidal neuron was stated in lines 239-240 in Materials and Methods and lines 1094 in the figure legend, but we had not explicitly stated how we knew it was a pyramidal neuron. We will include a low power picture of each of the selected pyramidal neurons in the revision.

      Conceptual concerns:

      The only novel finding here is the implicated role for Rab3A in synaptic scaling, but insights into mechanisms behind this observation are lacking. The author claims that Rab3A likely regulates scaling from the presynaptic side, yet there is no direct evidence from data presented. In its current form, this study's contribution to the field is very limited.

      We acknowledge that a presynaptic mechanism is involved in the regulation of homeostatic plasticity by Rab3A is not supported by direct evidence in cortical cultures in this study. But we disagree that the study’s contribution is very limited.

      The revised manuscript will emphasize that there are only two possible mechanisms by which Rab3A is acting in homeostatic plasticity. Either this presynaptic vesicle protein is regulating postsynaptic receptors (an extremely surprising result for which we do have direct evidence), or, it is regulating quantal size from both sides of the synapse (supported by direct evidence from our previous study at the mouse neuromuscular junction in vivo, where receptors are not being upregulated during homeostatic plasticity, and, by indirect evidence in the current study, that receptors and mEPSCs are not being identically regulated in the same cultures). Furthermore, the first idea that follows from the effect of Rab3A on receptors is that it would be regulating release of factors from astrocytes, since this is a mechanism that has been shown to be involved in homeostatic plasticity, and we clearly disprove this hypothesis.

      1) Their major argument for this is that homeostatic effects on mEPSC amplitudes and GluA2 cluster sizes do not match. This is inconsistent with reports from multiple labs showing that upscaling of mEPSC amplitude and GluA2 accumulation occur side by side during scaling (Ibata et al., 2008; Pozo et al., 2012; Tan et al., 2015; Silva et al., 2019).

      We agree with the reviewer that many studies show an increase in receptors and mEPSC amplitudes after activity blockade. This is why we were very surprised in our initial experiments to find that there was not a consistent robust increase in receptors in our cultures. At that point we were only imaging, and we assumed that it was homeostatic plasticity that was not always robust. We decided it was essential to measure mEPSC amplitudes and image receptors in the same cultures. We expected to observe larger and smaller effects on mEPSC amplitudes from culture to culture that were paralleled by larger and smaller effects on receptors, but this is not what happened. We have gone back to the literature to look more closely at whether variability across cultures has ever been shown for mEPSC amplitudes, receptors, or both. In a survey of 14 studies, none report results culture by culture. To our knowledge, we are the first to report this variability in the receptor response, and the lack of correlation between mEPSC amplitudes and receptor responses, in the same cultures. That said, for the 4 examples provided by the reviewer, only 1 reports evidence relevant to our study that receptors and mEPSC amplitudes ‘occur side by side,’ which is the (Ibata et al., 2008) study. Here, 24 hr of TTX treatment of rat cortical cultures causes synaptically localized GluA2 receptors in confocal imaging, and mEPSC amplitudes, to both increase to around 130%. The (Pozo et al., 2012) study is not a study of activity blockade but of the effects of overexpressing beta-integrins in rat hippocampal cultures, and this causes both GluA2 receptors and mEPSC amplitudes to increase, but the GluA2 level is not restricted to synaptic sites, and, is expressed as the surface fraction (surface receptor/total receptor—total receptor being surface intensity plus internalized intensity) which increases from 0.5 to 0.55, or to 110%, while mEPSC amplitude increases to ~180%. The (Tan et al., 2015) study only provides Western blot data to show an increase of receptors to 125% in mouse cortical cultures in response to 48 hr TTX, with mEPSC amplitudes increased to ~140%, but the Western blot technique measures synaptic and nonsynaptic receptors on excitatory and inhibitory neurons, as well as receptors on astrocytes. Finally, in (Silva et al., 2019), the culture conditions for the imaging data and the mEPSC amplitude data are markedly different, with ‘low-density’ Banker cultures being used for the former, and ‘high-density’ cultures used for the latter, and the protocol to induce activity blockade is different from ours (noncompetitive AMPA and NMDA blockers); synaptic GluA2 receptors are increased to ~280% and mEPSC amplitudes to ~170%. In the revision we will carefully summarize the previous evidence for receptors and mEPSC amplitude responses to activity blockade. Since it is known that different protocols trigger different molecular mechanisms, for example, TTX + APV triggers a homeostatic plasticity that can be completely reversed by acute application of blockers of Ca-permeable receptors, whereas TTX alone triggers a plasticity that is insensitive to these blockers (Sutton et al., 2006), Figure 4E; (Soden and Chen, 2010); Figure 4A), we will keep our discussion restricted to studies using TTX alone for at least 24 hr. We will acknowledge that our finding that GluA2 receptors and mEPSC amplitudes are not varying in lockstep from culture to culture suggests there is another contributor to mEPSC amplitude, but that we cannot rule out it is due to a greater variability in signal, or more issues with signal over noise, in imaging experiments compared to electrophysiology experiments.

      Studies surveyed about reporting results by culture:

      (Ju et al., 2004; Stellwagen et al., 2005; Shepherd et al., 2006; Sutton et al., 2006; Cingolani and Goda, 2008; Hou et al., 2008; Ibata et al., 2008; Chang et al., 2010; Hu et al., 2010; Jakawich et al., 2010; Beique et al., 2011; Tatavarty et al., 2013; Diering et al., 2014; Sanderson et al., 2018)

      Further, because the acquisition and quantification methods for mEPSC recordings and immunohistochemistry imaging are entirely different (each with its own limitations in signal detection), it is not convincing that the lack of proportional changes must signify a presynaptic component.

      We agree with the reviewer that there is no way to compare absolute levels from one type of experimental technique to another, but whatever differences in technical issues there are for the two techniques, they should cause systemic errors and should not contribute to the differences between experiments. Most of the issues with imaging come down to variability in the intensity of fluorescence from experiment to experiment, since the antibody solutions are made anew each time, as is the fixation solution. In addition, the confocal microscope function can vary over time and give brighter or dimmer images. But those kinds of artifacts are addressed by using the same solutions on control and TTX-treated coverslips, and imaging control and TTX-treated coverslips in the same single 2-3 hour imaging session, so that whatever issues there are, they cannot contribute to the TTX effect itself. Therefore when we compare the TTX effect (TTX measurements compared to untreated measurements) from culture to culture and find that in one WT culture there was no increase in receptors but there was in mEPSC amplitude, it is difficult to explain how a limitation specific to the antibody imaging technique could produce such a result. Similarly, when we get the opposite result, that in one KO culture, receptors increased but mEPSC amplitudes did not, it is unclear how limitations in signal detection would produce such a result in one culture but not another. The one exception to this is that the primary GluA2 antibody has to be added individually to each coverslip before returning the dishes to the incubator in order to avoid the disruption to live cells that a complete removal of media would have had. The only remaining ‘artifact’ that could explain the results would be a greater variability in the imaging experiments due to limitations in the signal or the signal to noise ratio. In the revision we will report additional characteristics of imaging experiments, such as average intensity for each coverslip, and for each experiment, to address whether variability in fluorescence levels could explain the variability in TTX effects we observe. We will include the possibility that the mismatches in GluA2 receptors and mEPSCs could be caused by greater variability in the imaging experiments.

      2) The authors also speculate in the discussion that presynaptic Rab3A could be interacting with retrograde BDNF signaling to regulate postsynaptic AMPARs. Without data showing Rab3A-dependent presynaptic changes after TTX treatment, this argument is not compelling. In this retrograde pathway, BDNF is synthesized in and released from dendrites (Jakawich et al., 2010; Thapliyal et al., 2022), and it is entirely possible for postsynaptic Rab3A to interfere with this process cell-autonomously.

      In the revision, the model will focus on the direct findings of the manuscript and tone down the speculation about BDNF signaling, but in the Discussion we will add the possibility that a Rab3A-BDNF interaction could occur either presynaptically or postsynaptically. Interestingly, these articles suggest the postsynaptic BDNF is affecting presynaptic function, namely mEPSC frequency. It is conceivable it could presynaptically affect the vesicle’s release of transmitter.

      3) The authors propose that a change in AMPAR subunit composition from GluA2-containing ones to GluA1 homomers may account for the distinct changes in mEPSC amplitudes and GluA2 clusters. However, their data from the Naspm wash-in experiments clearly show that GluA1 homomer contributions have not changed before and after TTX treatment.

      Our apologies to the reviewer that we were not clear on this point. In lines 396 to 400 we were describing the significant effects that NASPM had on mEPSC frequency on both untreated and TTX-treated cells, despite having only modest, and not quite significant effects on mEPSC amplitude. We conclude from these results that there are synaptic sites that have only GluA1 homomers, and the mEPSCs from these sites are blocked 100% by NASPM. There may be an increase in such GluA1-only synapses after activity blockade, but nevertheless, these events do not contribute to the amplitude increase. So we did not mean to suggest that there is a shift from Glua2 containing to GluA1 containing receptors that leads to the amplitude increase and fully agree with the reviewer that the GluA1 homomer contributions to amplitude have not changed before and after TTX. We will clarify the difference between the contribution of GluA1 homomers to amplitude and frequency in the revised manuscript.

      Reviewer #3

      Summary: The authors clearly demonstrate the Rab3A plays a role in HSP at excitatory synapses, with substantially less plasticity occurring in the Rab3A KO neurons. There is also no apparent HSP in the Earlybird Rab3A mutation, although baseline synaptic strength seems already elevated. In this context, it is unclear if the plasticity is absent or just occluded by a ceiling effect due the synapses already being strengthened. The authors do appropriately discuss both options. There are also differences in genetic background between the Rab3A KO and Earlybird mutants that could also impact the results, which are also noted. The authors have solid data showing that Rab3A is unlikely to be active in astrocytes, Finally, they attempt to study the linkage between synaptic strength during HSP and AMPA receptor trafficking, and conclude that trafficking is largely not responsible for the changes in synaptic strength.

      Strengths: This work adds another player into the mechanisms underlying an important form of synaptic plasticity. The plasticity is only reduced, suggesting Rab3A is only partially required and perhaps multiple mechanisms contribute. The authors speculate about some possible novel mechanisms.

      Weaknesses: However, the rather strong conclusions on the dissociation of AMPAR trafficking and synaptic response are made from somewhat weaker data. The key issue is the GluA2 immunostaining in comparison with the mESPC recordings. Their imaging method involves only assessing puncta clearly associated with a MAP2 labeled dendrite. This is a small subset of synapses, judging from the sample micrographs (Fig 5). To my knowledge, this is a new and unvalidated approach that could represent a particular subset of synapses not representative of the synapses contributing to the mEPSC change. (they are also sampling different neurons for the two measurements; an additional unknown detail is how far from the cell body were the analyzed dendrites for immunostaining. While the authors acknowledge that a sampling issue could explain the data, they still use this data to draw strong conclusions about the lack of AMPAR trafficking contribution to the mEPSC amplitude change. This apparent difference may be a methodological issue rather than a biological one, and at this point it is impossible to differentiate these. It will unfortunately be difficult to validate their approach. Perhaps if they were to drive NMDA-dependent LTD or chemLTP, and show alignment of the imaging and ephys, that would help. More helpful would be recordings and imaging from the same neurons but this is challenging. Sampling from identified synapses would of course be ideal, perhaps from 2P uncaging combined with SEP-labeled AMPARs, but this is more challenging still. But without data to validate the method, it seems unwarranted to make such strong conclusions such as that AMPAR trafficking does not underlie the increase in mEPSC amplitude, given the previous data supporting such a model.

      We chose the primary dendrite to ensure we were not assaying dendrites from inhibitory neurons or on axons, but we will add in the revision that it is a limitation of our methods that we are not sampling all the synapses for each neuron. The majority of previous studies that establish that receptors are increased side by side with mEPSCs did not measure receptors and mEPSCs in the same cells, nor even in the same cultures. There is a recent study which employs dual recordings, transfection of GluA2 and VGlut1 constructs, and infusion of dyes to highlight cell morphology (Letellier et al., 2019), so in principle an experiment could be done in which synaptic GluA2 sites are imaged in a cell in which the mEPSCs are also measured. It would be difficult to make these measurements in the same cells before and after TTX treatment, since there is a high likelihood of damaging the cell upon electrode withdrawal and with the imaging process itself. In theory, only a few such experiments would be necessary to establish whether receptors and mEPSC amplitudes are varying in lockstep, and we will consider this for a future study. As stated in response to conceptual concern #1 in Reviewer 2’s comments, we will review the literature on previous studies’ demonstrations of increases in receptors and mEPSC amplitudes following activity blockade in more detail, including how the synaptic sites to be imaged were chosen, to address whether our selection of sites touching the primary dendrite is unvalidated.

      A sample from 3 articles:

      (Ibata et al., 2008), only information is that ‘distal dendrites’ were examined. The authors do not use a dendritic label. (Jakawich et al., 2010), ‘neurons with pyramidal-like morphology were selected for imaging,’ and ‘principal dendrite of each neuron was linearized’—but how these were identified is not clear, since MAP2 or other cellular labels are not described.

      (Silva et al., 2019), ‘dendrites with similar thickness and appearance were randomly selected using MAP2 staining,’ which suggests synaptic sites with GluA2 and VGLUT1 were selected on the basis of being close to or touching the MAP2 positive dendrite, although this is not stated explicitly.

      We can perform length measurements on the dendrites imaged and report this information in the revision, but the primary dendrite is the closest dendrite to the cell body.

      We have addressed the potential contribution of technical artifacts arising from the two distinct methods of measurement, imaging and electrophysiology, in our response to conceptual concern #1 of Reviewer 2.

      Other questions arise from the NASPM experiments, used to justify looking at GluA2 (and not GluA1) in the immunostaining. First, there is a frequency effect that is quite unclear in origin. One would expect NASPM to merely block some fraction of the post-synaptic current, and not affect pre-synaptic release or block whole synapses. It is also unclear why the authors argue this proves that the NASPM was at an effective concentration (lines 399-400).

      We observed a clear effect of NASPM reducing mEPSC frequency. We will state more clearly that we infer from the loss of mEPSCs after NASPM that such mEPSCs were from synaptic sites that had only GluA1 homomers, and acknowledge that this is an interpretation. We will also clarify that if our inference is correct, it would indicate that the dose of NASPM we used was 100% effective at blocking GluA1 homomers. The alternative explanation would be a presynaptic effect of NASPM, which has never been reported, to our knowledge.

      Further, the amplitude data show a strong trend towards smaller amplitude. The p value for both control and TTX neurons was 0.08 - it is very difficult to argue that there is no effect. And the decrease is larger in the TTX neurons. Considering the strong claims for a pre-synaptic and the use of this data to justify only looking at GluA2 by immunostaining, these data do not offer much support of the conclusions. Between the sampling issues and perhaps looking at the wrong GluA subunit, it seems premature to argue that trafficking is not a contributor to the mEPSC amplitude change, especially given the substantial support for that hypothesis. Further, even if trafficking is not the major contributor, there could be shifts in conductance (perhaps due to regulation of auxiliary subunits) that does not necessitate a pre-synaptic locus. While the authors are free to hypothesize such a mechanism, it would be prudent to acknowledge other options and explanations.

      We did not mean to suggest that there is no effect of NASPM on mEPSC amplitude. We will clarify that our data indicate that there is no effect of NASPM on the TTX effect on mEPSC amplitude. We agree with the reviewer that the effect of NASPM on frequency is of larger magnitude after TTX treatment, although the p value is larger than that for untreated cells, likely due to greater variability. We interpret this to mean that TTX treatment increases the proportion of synapses that have only GluA1 homomers. Nevertheless, the increase in GluA1 homomer sites does not appear to contribute to the overall increase in amplitude following TTX treatment, and we wanted to find the mechanism of the amplitude increase. That is why we focused on GluA2 receptors. We will acknowledge the limitation of basing our conclusions on only GluA2 receptors in the revision, as well as the possibility that there is a change in conductance. As stated in our response to Reviewer 2, we do not mean to state that GluA2 receptors do not go up after activity blockade, we find that this is the case. We are proposing an additional mechanism contributing to mEPSC amplitude to explain the different responses for GluA2 receptors vs. mEPSC amplitudes in some of the 6 matched experiments (3 WT and 3 KO).

      The frequency data are missing from the paper, with the exception of the NASPM dataset. The mEPSC frequencies should be reported for all experiments, particularly given that Rab3A is generally viewed as a pre-synaptic protein regulating release. Also, in the NASPM experiments, the average frequency is much higher in the TTX treated cultures. Is this statistically above control values?

      We will report frequency measurements for all experiments shown. Following TTX treatment, frequency variability increases enormously, with cells having as high as > 10 mEPSCs per second, and other TTX-treated cells with frequencies as low as < 1 mEPSC per second, so the TTX effect on frequency, and whether this effect is present or not in Rab3A KO and Rab3AEbd/Ebd is not completely clear, which is why we did not include those results previously.

      Unaddressed issues that would greatly increase the impact of the paper:

      1) Is Rab3A acting pre-synaptically, post-synaptically or both? The authors provide good evidence that Rab3A is acting within neurons and not astrocytes. But where it is acting (pre or post) would aid substantially in understanding its role (and particularly the hypothesized and somewhat novel idea that the amount of glutamate released per vesicle is altered in HSP). They could use sparse knock-down of Rab3A, or simply mix cultures from KO and WT mice (with appropriate tags/labels). The general view in the field has been that HSP is regulated post-synaptically via regulation of AMPAR trafficking, and considerable evidence supports this view. The more support for their suggestion of a pre-synaptic site of control, the better.

      We agree with the reviewer that this is the most important question to answer next. The approach suggested by the reviewer would be to record from Rab3A KO neurons in a culture where the majority of its inputs are Rab3A positive. If the TTX effect is absent from these cells, it would strongly indicate that postsynaptic Rab3A is required for homeostatic plasticity. There are not currently transgenic mice expressing GFP forms of Rab3A, so we would have to create one, or, transiently transfect Rab3A-GFP into Rab3AKO neurons. Given that under our experimental conditions, we require a very high density of neurons to observe the increase in mEPSC amplitude, it would be difficult to get the ratio of Rab3A-expressing neurons high enough using transfection to be sure that a given postsynaptic cell lacking Rab3A had a normal number of Rab3A-positive inputs and almost no Rab3A-negative inputs. It may be that the opposite experiment is more doable—an isolated Rab3A-positive neuron in a sea of Rab3A-negative neurons, which could be accomplished with a very low transfection efficiency. Another approach would be to use the fast off rate antagonist gamma-DGG, which is more effective against low glutamate concentrations than high glutamate concentrations (see (Liu et al., 1999; Wu et al., 2007). If gamma-DGG were less effective at reducing mEPSC amplitude in TTX-treated cells, compared to untreated cells, it would support the hypothesis that activity blockade leads to an increase in the amount of transmitter per vesicle fusion event. Further, if the change in gamma-DGG sensitivity after activity blockade were disrupted in cultures from Rab3A KO cells, it would support a presynaptic role for Rab3A in homeostatic plasticity of mEPSC amplitude. We have begun these experiments but are finding the surprising result that within a single recording, small mEPSCs and large mEPSCs appear to be differentially sensitive to gamma-DGG. To confirm that this is a biological characteristic, rather than an issue with the detection threshold, we will be repeating our experiments with a slow off rate antagonist that has same effect regardless of transmitter concentration. The complexity of these results precludes including them in the current manuscript.

      2) Rab3A is also found at inhibitory synapses. It would be very informative to know if HSP at inhibitory synapses is similarly affected. This is particularly relevant as at inhibitory synapses, one expects a removal of GABARs and/or a decrease of GABA-packaging in vesicles (ie the opposite of whatever is happening at excitatory synapses). If both processes are regulated by Rab3A, this might suggest a role for this protein more upstream in the signaling; an effect only at excitatory synapses would argue for a more specific role just at these synapses.

      The next question, after it is determined where Rab3A is acting, is whether it is required for other forms of homeostatic plasticity. This includes plasticity of GABA mIPSCs on pyramidal neurons, but also mEPSCs on inhibitory neurons, and, the downscaling of mEPSCs (and upscaling of mIPSCs) when activity is increased, by bicuculline for example. We will add a statement about future experiments examining other forms of plasticity to the discussion, and include examples where a molecular mechanism has mediated multiple forms, and those that have been shown to be very specific.

      Beique JC, Na Y, Kuhl D, Worley PF, Huganir RL (2011) Arc-dependent synapse-specific homeostatic plasticity. Proc Natl Acad Sci U S A 108:816-821.

      Chang MC, Park JM, Pelkey KA, Grabenstatter HL, Xu D, Linden DJ, Sutula TP, McBain CJ, Worley PF (2010) Narp regulates homeostatic scaling of excitatory synapses on parvalbumin-expressing interneurons. Nat Neurosci 13:1090-1097.

      Cingolani LA, Goda Y (2008) Differential involvement of beta3 integrin in pre- and postsynaptic forms of adaptation to chronic activity deprivation. Neuron Glia Biol 4:179-187.

      Diering GH, Gustina AS, Huganir RL (2014) PKA-GluA1 coupling via AKAP5 controls AMPA receptor phosphorylation and cell-surface targeting during bidirectional homeostatic plasticity. Neuron 84:790-805.

      Hanes AL, Koesters AG, Fong MF, Altimimi HF, Stellwagen D, Wenner P, Engisch KL (2020) Divergent Synaptic Scaling of Miniature EPSCs following Activity Blockade in Dissociated Neuronal Cultures. J Neurosci 40:4090-4102.

      Hou Q, Zhang D, Jarzylo L, Huganir RL, Man HY (2008) Homeostatic regulation of AMPA receptor expression at single hippocampal synapses. Proc Natl Acad Sci U S A 105:775-780.

      Hu JH, Park JM, Park S, Xiao B, Dehoff MH, Kim S, Hayashi T, Schwarz MK, Huganir RL, Seeburg PH, Linden DJ, Worley PF (2010) Homeostatic scaling requires group I mGluR activation mediated by Homer1a. Neuron 68:1128-1142.

      Ibata K, Sun Q, Turrigiano GG (2008) Rapid synaptic scaling induced by changes in postsynaptic firing. Neuron 57:819826.

      Jakawich SK, Nasser HB, Strong MJ, McCartney AJ, Perez AS, Rakesh N, Carruthers CJ, Sutton MA (2010) Local presynaptic activity gates homeostatic changes in presynaptic function driven by dendritic BDNF synthesis. Neuron 68:1143-1158.

      Ju W, Morishita W, Tsui J, Gaietta G, Deerinck TJ, Adams SR, Garner CC, Tsien RY, Ellisman MH, Malenka RC (2004) Activity-dependent regulation of dendritic synthesis and trafficking of AMPA receptors. Nat Neurosci 7:244-253.

      Lazic SE, Clarke-Williams CJ, Munafo MR (2018) What exactly is 'N' in cell culture and animal experiments? PLoS Biol 16:e2005282.

      Liu G, Choi S, Tsien RW (1999) Variability of neurotransmitter concentration and nonsaturation of postsynaptic AMPA receptors at synapses in hippocampal cultures and slices. Neuron 22:395-409.

      Pozo K, Cingolani LA, Bassani S, Laurent F, Passafaro M, Goda Y (2012) beta3 integrin interacts directly with GluA2 AMPA receptor subunit and regulates AMPA receptor expression in hippocampal neurons. Proc Natl Acad Sci U S A 109:1323-1328.

      Sanderson JL, Scott JD, Dell'Acqua ML (2018) Control of Homeostatic Synaptic Plasticity by AKAP-Anchored Kinase and Phosphatase Regulation of Ca(2+)-Permeable AMPA Receptors. J Neurosci 38:2863-2876.

      Shepherd JD, Rumbaugh G, Wu J, Chowdhury S, Plath N, Kuhl D, Huganir RL, Worley PF (2006) Arc/Arg3.1 mediates homeostatic synaptic scaling of AMPA receptors. Neuron 52:475-484.

      Silva MM, Rodrigues B, Fernandes J, Santos SD, Carreto L, Santos MAS, Pinheiro P, Carvalho AL (2019) MicroRNA186-5p controls GluA2 surface expression and synaptic scaling in hippocampal neurons. Proc Natl Acad Sci U S A 116:5727-5736.

      Soden ME, Chen L (2010) Fragile X protein FMRP is required for homeostatic plasticity and regulation of synaptic strength by retinoic acid. J Neurosci 30:16910-16921. Stellwagen D, Beattie EC, Seo JY, Malenka RC (2005) Differential regulation of AMPA receptor and GABA receptor trafficking by tumor necrosis factor-alpha. J Neurosci 25:3219-3228.

      Sutton MA, Ito HT, Cressy P, Kempf C, Woo JC, Schuman EM (2006) Miniature neurotransmission stabilizes synaptic function via tonic suppression of local dendritic protein synthesis. Cell 125:785-799.

      Tan HL, Queenan BN, Huganir RL (2015) GRIP1 is required for homeostatic regulation of AMPAR trafficking. Proc Natl Acad Sci U S A 112:10026-10031.

      Tatavarty V, Sun Q, Turrigiano GG (2013) How to scale down postsynaptic strength. J Neurosci 33:13179-13189.

      Turrigiano GG, Leslie KR, Desai NS, Rutherford LC, Nelson SB (1998) Activity-dependent scaling of quantal amplitude in neocortical neurons. Nature 391:892-896.

      Wang X, Wang Q, Yang S, Bucan M, Rich MM, Engisch KL (2011) Impaired activity-dependent plasticity of quantal amplitude at the neuromuscular junction of Rab3A deletion and Rab3A earlybird mutant mice. J Neurosci 31:3580-3588.

      Watt AJ, van Rossum MC, MacLeod KM, Nelson SB, Turrigiano GG (2000) Activity coregulates quantal AMPA and NMDA currents at neocortical synapses. Neuron 26:659-670.

      Wu XS, Xue L, Mohan R, Paradiso K, Gillis KD, Wu LG (2007) The origin of quantal size variation: vesicular glutamate concentration plays a significant role. J Neurosci 27:3046-3056.

      Wu YK, Hengen KB, Turrigiano GG, Gjorgjieva J (2020) Homeostatic mechanisms regulate distinct aspects of cortical circuit dynamics. Proc Natl Acad Sci U S A 117:24514-24525.

    2. Reviewer #2 (Public Review):

      In this study, Koesters et al. investigated whether Rab3A, a small GTPase that regulates synaptic vesicle fusion pore opening, is required for excitatory synaptic scaling in response to TTX-induced activity suppression in dissociated mouse cortical neuronal culture. They first show that, while pyramidal neurons from wild-type (WT) littermates show normal synaptic scaling in response to 48h of TTX treatment (~30% increase in the mean mEPSC amplitude), those from two different mouse lines with either deletion (Rab3A-/-) or loss-of-function mutation of Rab3A (Rab3AEbd/Ebd) fail to engage this homeostatic compensation. They perform cumulative distribution analysis to show that the mEPSC population has gone through divergent scaling in WT neurons. Similarly, this phenomenon is absent in neurons from the two Rab3A mouse lines. They further demonstrate that GluA2-containing AMPARs likely account for the increase in mEPSC amplitudes by comparing measurements before and after washing in blockers specific for GluA2-lacking AMPARs. Subsequently, they perform electrophysiology and immunohistochemistry side by side for WT neurons from the same culture following TTX treatment, and find that both mEPSC amplitudes and GluA2 cluster sizes have shifted towards higher values, while GluA2 cluster intensity remains unchanged. Importantly, all these homeostatic compensations are absent in Rab3A-/- neurons. Finally, they mix neurons and astrocyte feeders either from WT or Rab3A-/- mice, which reveals that neuronal but not astrocytic Rab3A knockout leads to impaired scaling up of mEPSCs. They conclude that Rab3A is required for homeostatic scaling up of mEPSC amplitude in cortical neurons, most likely from the presynaptic side.

      Although the authors have raised an interesting question, their conclusion is not well supported by the data presented. I list my technical and conceptual concerns below.

      Technical concerns:

      1. The culture condition is questionable. The authors saw no NMDAR current present during spontaneous recordings, which is worrisome since NMDARs should be active in cultures with normal network activity (Watt et al., 2000; Sutton et al., 2006). It is important to ensure there is enough spiking activity before doing any activity manipulation. Similarly, it is also unknown whether spiking activity is normal in Rab3A KO/Ebd neurons.

      2. Selection of mEPSC events is not conducted in an unbiased manner. Manually selecting events is insufficient for cumulative distribution analysis, where small biases could skew the entire distribution. Since the authors claim their ratio plot is a better method to detect the uniformity of scaling than the well-established rank-order plot, it is important to use an unbiased population to substantiate this claim.

      3. Immunohistochemistry data analysis is problematic. The authors only labeled dendrites without doing cell-fills to look at morphology, so it is questionable how they differentiate branches from pyramidal neurons and interneurons. Since glutamatergic synapses on these two types of neuron scale in the opposite directions, it is crucial to show that only pyramidal neurons are included for analysis.

      Conceptual concerns:

      The only novel finding here is the implicated role for Rab3A in synaptic scaling, but insights into mechanisms behind this observation are lacking. The author claims that Rab3A likely regulates scaling from the presynaptic side, yet there is no direct evidence from data presented. In its current form, this study's contribution to the field is very limited.

      1. Their major argument for this is that homeostatic effects on mEPSC amplitudes and GluA2 cluster sizes do not match. This is inconsistent with reports from multiple labs showing that upscaling of mEPSC amplitude and GluA2 accumulation occur side by side during scaling (Ibata et al., 2008; Pozo et al., 2012; Tan et al., 2015; Silva et al., 2019). Further, because the acquisition and quantification methods for mEPSC recordings and immunohistochemistry imaging are entirely different (each with its own limitations in signal detection), it is not convincing that the lack of proportional changes must signify a presynaptic component.

      2. The authors also speculate in the discussion that presynaptic Rab3A could be interacting with retrograde BDNF signaling to regulate postsynaptic AMPARs. Without data showing Rab3A-dependent presynaptic changes after TTX treatment, this argument is not compelling. In this retrograde pathway, BDNF is synthesized in and released from dendrites (Jakawich et al., 2010; Thapliyal et al., 2022), and it is entirely possible for postsynaptic Rab3A to interfere with this process cell-autonomously.

      3. The authors propose that a change in AMPAR subunit composition from GluA2-containing ones to GluA1 homomers may account for the distinct changes in mEPSC amplitudes and GluA2 clusters. However, their data from the Naspm wash-in experiments clearly show that GluA1 homomer contributions have not changed before and after TTX treatment.

      Ibata K, Sun Q, Turrigiano GG (2008) Rapid synaptic scaling induced by changes in postsynaptic firing. Neuron 57:819-826.

      Jakawich SK, Nasser HB, Strong MJ, McCartney AJ, Perez AS, Rakesh N, Carruthers CJL, Sutton MA (2010) Local Presynaptic Activity Gates Homeostatic Changes in Presynaptic Function Driven by Dendritic BDNF Synthesis. Neuron 68:1143-1158.

      Pozo K, Cingolani LA, Bassani S, Laurent F, Passafaro M, Goda Y (2012) β3 integrin interacts directly with GluA2 AMPA receptor subunit and regulates AMPA receptor expression in hippocampal neurons. Proceedings of the National Academy of Sciences 109:1323-1328.

      Silva MM, Rodrigues B, Fernandes J, Santos SD, Carreto L, Santos MAS, Pinheiro P, Carvalho AL (2019) MicroRNA-186-5p controls GluA2 surface expression and synaptic scaling in hippocampal neurons. Proceedings of the National Academy of Sciences 116:5727-5736.

      Sutton MA, Ito HT, Cressy P, Kempf C, Woo JC, Schuman EM (2006) Miniature Neurotransmission Stabilizes Synaptic Function via Tonic Suppression of Local Dendritic Protein Synthesis. Cell 125:785-799.

      Tan HL, Queenan BN, Huganir RL (2015) GRIP1 is required for homeostatic regulation of AMPAR trafficking. Proceedings of the National Academy of Sciences 112:10026-10031.

      Thapliyal S, Arendt KL, Lau AG, Chen L (2022) Retinoic acid-gated BDNF synthesis in neuronal dendrites drives presynaptic homeostatic plasticity. eLife 11:e79863.

      Watt AJ, Rossum MCW van, MacLeod KM, Nelson SB, Turrigiano GG (2000) Activity Coregulates Quantal AMPA and NMDA Currents at Neocortical Synapses. Neuron 26:659-670.

    1. sets: collection of objects ex: A{6,1,2,0....} * set order does not matter * duplicates are not allowed B={2,{3,4},{}} {} empty set (known as 0 with a slash) not actually zero but is only empty union sets: AuB= {2,6,1,{},0,{3,4}} what if A had 3? 3 is not a element of AuB Intersection set: AnB={2} set minus: A-B = A but not B (A\B) Natural numbers: set of even natural numbers= {n \in N| is even(n)} notes:

      . = is such that , after for all or exists

      A set cannot have itself as an element.

      z\in N {1,4,7} not\in N (or sub in not\in fir subet of c with underline) Z={-2,-2,0,1} Q= Rational R= Real C = complex numbers

      what does it mean to prove a statement? Prove existence

      Thm: exist n \in N, n >=10 and is prime(n) proof: show n=11 satisfies the condition 11>=10 ^ 11 is prime

      Thm: exists x \in S. P(x) proof: choose x= ? \in S. P(x) is true because ?

      Prove universality or for all cant just say x=11 but all numbers

      proof by example is not the way to go for all x \in R . x^2 -6x>-10 proof: suppose x is in R (x^2-6x+9)=(x-3)^2>=0 x^2-6x=(x^2-6x+9)-9>=-9>-10

      square is done with proof or QED

      thm: for all x \in S. P(x) proof: suppose x is an arbitrary number of S P(x) is true because reasons.

      for all or exists needs a period that means such that

      proof of implication: for all n \in Z. ( n is mult. of 10)=> is even(n). proof: suppose n \in Z. assume n is mult of 10 ; WTS n is even, n=10k(k is some int) =25K=2some int

      WTS - WANT TO SHOW p->q Assume p is true then Q is true EXPLAIN contrapositive: not p -> not p Assume not Q false then not P false EXPLAIN p->q = not p -> not p

      proof by contrapositive: introduce variable for "for all" suppose n\in Z. assume n is odd ; WTS n*n is odd odd x odd =odd

      Proof by contradiction: counter intuitive to prove P show that not P is false if its not false its true

      Thm:

      sqrt2 not \in Q Assume sqrt2 is \in Q sqrt2=a/b for some ints a,b ,b cannot be zero assume lowest form and a and b share no common factors (other than + or - 1 ) a^2=2b^2-> a^2 is even -> a is even a=2A for some int A 4A^2=2b^2->2A^2=b^2 b^2 is even-> b is even b=2B for some B \in Z a=2A, b=2B a/b is not reduced because there is a factor of 2 that's common and 2 is not + - 1 .. CONTRADICTION assumption sqrt2 \in Q must be false -> sqrt2 not\in Q QED.

      COMMON FACTORS ARE RATIO OF INTS. --?

      Review-

      INDUCTION:

      thm: for all n >=0 , (1+2+3+...+n)=(n(n+1))/2 [sigma k=1 to n (k)] to exlude 0

      n=0 an empty sum is 0 (01)/2 <br /> n=1: 1=? (12)/2 add 1 n=2: 1+2 =? (23)/2 add 2 n=3 1+2+3=? (34)/2 add 3 n=4 1+2+3+4=? (4*5)/2 add 4

      verify: 1+2+..+(n-1)=(n-1)(n)/2 add n 1+2+..+(n-1)+n=n(n+1)/2

      (n^2+n)/2 - (n^2-n)/2 = 2n/2 = n

      (1+2+3+...+n)=(n(n+1))/2=: P(n) [sigma k=1 to n (k)] to exlude 0 induction principle: if you prove the implications are true than the rest are true. p(0) true p(1) true p(1)->p(2) (implies) p(2) -> p(3) p(3) -> p(4) ......

      Below is all true because the implications are true aswell. p(1) p(2) p(3) p(4)

      principle of induction: if [p(0) and for all n >=1. p(n-1)->p(n)] then [ for all n \in N. p(n)]

      outline to follow: thm: p(n) is true for all n \in N

      first step: define p(n) recall: p(n) ="(1+2+3+...+n)=(n(n+1))/2" proof by induction on n. base case (p(0): is true b/c ( do some work) if [p(0) and for all n >=1. p(n-1)->p(n)] then [ for all n \in N. p(n)] inductive step: assume n is at least 1 and p(n-1) is true and WTS p(n) IS TRUE -------assume left prove right.--------

      Ie. assume

      1+2+..+(n-1)=(n-1)(n)/2

      WTS 1+2+...(n-1)+n=n(n+1)/2

      WTS (n-1)(n)/2 + n=n(n+1)/2

      true by algebra by induction p(n) is true for all n>=0

      16 x 16 grid want to tile w/o middle cell L-trominoes tile once but not covered twice p(n):=2^nx2^n grid w/o one middle cell can be tile w/ L trominoes

      p(0), tile |x| missing middle cell QED P(1): 2X2 1 L trominoe P(2): 4X4 start in middle then outward 8x8 example: highlight middle lines x and y start center and observe 4 quads separately cant invoke p(2) twice. if induction is not working try and prove a stronger thm.<br /> remove middle cell restriction less restrictions are stronger thms.

      p(2)->p(3) p(3)->p(4) strat: place one L in the middle , hitting the 3 full quadrants now use p(2) on each quad

      3 quads: 2^nx2^n -1 cell missing cell in corner 4th 2^nx2^n missing cell wont be in corner

      you can always tile no matter where the face is in 2x2 induction

      each p(n) is a algorithm not a thm

    Annotators

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      In their manuscript „Live-cell super-resolution nanoscopy reveals modulation of cristae<br /> dynamics in bioenergetically compromised mitochondria", Golombek et al. tested the effects of different mitochondrial toxins on cristae dynamics. The main focus of their work lies on live STED imaging, which they use to visualize cristae merging and splitting. They found swelling of mitochondria and reduced cristae density in response to most toxins, but cristae dynamics remained largely unaffected. Depletion of the membrane potential by administration of CCCP increased cyristae dynamics, while inhibition of ANT had a negative effect on cristae dynamics at least in a subset of mitochondria.

      1. The authors state that the used concentrations of mitochondrial toxins commonly result in a change in oxygen consumption. While this is believable, it is not guaranteed that the specific chemicals used for the experiments were working properly (freeze/thawing or simply incorrect storage or aliquotation may have an effect on the compounds). This is even more important in the case of results where no significant change after the administration of the toxins is seen. In Figure 5, the authors report no change in membrane potential after oligomycin administration, this is unexpected. I therefore suggest to include a supplementary figure, in which the functionality of the compounds is verified. This could be done by respiratory measurements (e.g. Seahorse). A Mito Stress Test was performed for Figure 6, but this was done using the Seahorse kit chemicals, which were probably different from the chemicals used in the microscopy experiments.

      Response: We appreciate the valid concerns of the reviewer in this point.

      A) In order to show the functionality of compounds which were used for performing our experiments including STED imaging, we now performed respiratory measurements employing the concentrations of mitochondrial toxins (Oligomycin A, CCCP, rotenone/antimycin A) which were used during imaging conditions as well as commercially available mitochondrial toxins (Oligomycin A, FCCP, rotenone/antimycin A) with respective concentrations used as a standard for the Mito stress Kit. The new figures are included in Fig S1A & B. HeLa cells treated with seahorse compounds or those used during imaging conditions showed similar results including basal, maximal and spare respiratory capacity. Further, in order to overcome the inefficiency of mitochondrial toxins employed, due to freeze/thaw cycles, we used fresh aliquots (stored at -20°C) as a general strategy. This is clearly observed by a drastic reduction of ΔΨm upon treating HeLa cells with CCCP, antimycin A as well as rotenone (Fig S6A & B). A reduction of mitochondrial ATP levels was also observed upon employing rotenone, antimycin A and oligomycin A confirming that active mitochondrial toxins were used. These experiments demonstrate that the mitochondrial toxins employed throughout our manuscript are functional as expected.

      New Figure S1A & B

      B) The Fig 6 (now Fig 5 due to Reviewer # 2, Point 7) respirometry experiments which initially employed seahorse compounds and BKA has now been replaced with new experiments where we used mitochondrial toxins similar to STED imaging. Needless, to say, the results are similar to what were observed with seahorse compounds. The new figures are replaced in Fig 5A & 5B.

      New Figure 5A & B

      C) Oligomycin A inhibits ATP synthase which results in decreased ATP synthesis as observed (Fig 4A & B). Further, oligomycin A is expected to hyperpolarise mitochondria (2). In Fig S6, despite some cells having more ΔΨm, there was no overall significant change when compared to untreated cells. Previous publications also show that there is no significant difference in ΔΨm upon treatment with oligomycin (1) demonstrating that the ΔΨm depends on the concentration of oligomycin, treatment time and cell type.

      1. Figure 1 would benefit from a more detailed description of merging/splitting events. Maybe a cartoon plus a zoomed in image of an exemplary event?

      Response: Thank you for the suggestion. In order to clearly explain/simplify the understanding of cristae merging and splitting events, we added a cartoon in Fig 1B. The green and magenta arrows show sites of imminent merging and splitting with the green and magenta asterisks representing them respectively in the subsequent frames. The zoomed in images in Fig1A (leftmost panel) are shown to the right as time-lapse images.

      New Figure 1B

      1. Could the reduced cristae density be an effect of mitochondrial swelling? It is curious that all toxins appear to have the same effect on mitochondrial architecture. What is the fait of an enlarged mitochondrion over time? Mitophagy? And does the percentage of enlarged mitochondria change with increasing treatment time?

      Response: Thank you for the comment.

      A) We agree that the reduced cristae density is due to mitochondrial swelling. We added the relevant text in the results section ‘Cristae structure is altered in a subset of mammalian cells treated with mitochondrial toxins’. Treatment of HeLa cells, with all the mitochondrial toxins mentioned, uniformly result around 50 % of mitochondria undergoing enlargement (Fig 2B). In enlarged mitochondria where the mitochondrial width is ≥ 650 nm, there is no change in cristae area occupied per mitochondria (Fig S3C & D) and as a result reduced cristae density (Fig 2H). Therefore, it indicates that reduced cristae density occurs due to mitochondrial enlargement.

      Figure 2B-F

      Figure S3C and D

      B) In order to address the fate of mitochondria with increasing time upon treatment with various mitochondrial toxins, we treated the HeLa cells for 4 hrs with mitochondrial toxins. Untreated cells maintained normal mitochondrial morphology while cells treated with various mitochondrial toxins displayed fragmented and swollen mitochondrial morphology. The new Fig S5 is included in the supplementary. Cristae morphology was abnormal displaying interconnected cristae in swollen mitochondria. Since mitochondrial fragmentation is already observed at 4 hours and accompanied by interconnected cristae, the number of cristae merging and splitting were severely reduced.

      Our imaging performed within 30 mins of addition of respective toxins overcomes the additional aberrancy of mitochondrial fragmentation which would not allow a reliable analysis of cristae dynamics as too few cristae would be visible within one mitochondrion.

      New Figure S5

      1. Figure 4C: How was the mitochondrial width determined in the LSM images? Especially in the perinuclear area it will be difficult to determine this parameter without the super-resolution provided by STED. Was this parameter determined manually for selected mitochondria? In the methods part it says that only a maximum of two mitochondria per cell were analyzed. How were these chosen? Was the process blinded?

      Response: Thank you for the comment. We could imagine the reason for the ambiguity in understanding.

      A) For LSM confocal images involving FRET-based microscopy to determine the ATP levels, we calculated the cell population as belonging to either normal or enlarged category. The confocal images of HeLa cells displayed clear separation of mitochondria even in the perinuclear area (representative images are shown in Fig 4A) and thus it was possible to measure the width of individual mitochondria. The methods section ‘FRET-based microscopy to measure ATP levels’ describes that ‘the cut off for swollen mitochondria was set to 650 nm in congruence with STED SR nanoscopy. If 85% of the mitochondrial population featured enlarged mitochondria, the cells were designated as swollen. Similarly, if 85% of the mitochondrial population featured mitochondria whose width was less than 650 nm, the cell was considered as having normal mitochondria’.

      Figure 4A

      B) The cristae morphology of various mitochondria is fairly uniform in individual cells. Thus, the mitochondria are representative of the individual cells. Therefore, in order to increase the coverage of various cells, we considered a maximum of two mitochondria from each cell which were randomly chosen. This part is modified in the methods section ‘Quantification of various parameters related to cristae morphology’ to make it clear. Thus, while the quantification of various parameters including dynamics involved individual mitochondria, various cells were classified as belonging to normal or enlarged category while measuring ATP levels.

      1. What is the average size of all mitochondria per cell? Is this addressed in Figure 2B or are only analyzed mitochondria included? Please carify. Were the mitochondria chosen for analysis representative for the respective cell?

      Response: The data obtained by super-resolution imaging of mitochondria is used for quantifying cristae dynamics which is a very challenging and time-consuming method done in a blind-manner. As mentioned in response 4B, the cristae morphology is fairly uniform in individual cells, therefore, we only included the mitochondria which were analysed for various cristae parameters in our analysis which are really huge data-sets already. Thus, the average size of individual mitochondria per cell are not represented while analysing images obtained with STED SR imaging. Please also check response 4B.

      1. explain the mt-Go-AT team2, what is GFP (green fluorescent protein) and OTP (?)

      Response: GFP is Green Fluorescent Protein and OFP is Orange Fluorescent protein and included in the revised text.

      1. the graphs show in principle, e.g. Fig.1B, 3B-E show events/mitochondrion as far as I understand, not per cristae.

      Response: Thank you for pointing this out. It is actually the average number of events per cristae per mitochondria. We have changed the Y-axis to events/cristae/mito in Fig 1C (previous 1B), Fig 3B-E and wherever applicable for other figures throughout the manuscript.

      Figure 1C

      Figure 3B-E

      1. I would recommend changing the legend of the x-axis of Fig.2B-F to mito-width (y-axis could be probability density function, PDF).

      Response: We have now changed the X-Axis to mito width (originally width) in Fig 2B-F. The Y-axis are still retained as percentage mitochondria where cells treated with few mitochondrial toxins do not show a gaussian distribution of mitochondrial width.

      Figure 2B-F

      Referees cross-commenting

      both expert opinions address similar concerns and therefore a revision should be requested

      Reviewer #1 (Significance):

      The study is thorough and the experiments and results are well described. Overall, however, it remains a descriptive study and does not provide mechanisms. There is also no discussion of how MMP-dependent proteins, such as Opa1, which was previously studied by the Reichert group, might be affected. For swelling mechanisms, the opening of the mitochondrial permeability transition pore was discussed. This could be tested using inhibitors, but perhaps not within the scope of this publication. Nevertheless, the information provided by the study is of interest to the bioenergetics community and should be made available.

      Response: Thank you for the overall inputs.

      We tested the processing of OPA1 forms and found that after 30 mins, only CCCP treatment led to the processing of long isoforms to short forms (Fig S6C). We now included in the discussion that it is possible that short OPA1-forms are correlative to increased cristae merging as well as splitting events upon treatment with CCCP.

      New Figure S6C

      Reviewer #2 (Evidence, reproducibility and clarity):

      Summary:<br /> The authors investigated cristae merging and splitting events using ultra-resolution STED. The goal was to test if cristae membrane remodeling is dependent on OXPHOS complexes, mitochondrial membrane potential (ΔΨm), and the ADP/ATP nucleotide translocator. To do this the authors utilized several mitochondrial toxins with known mechanisms of action. Interestingly, many changed overall cristae density but did not change the cristae remodeling events. Inhibition of ANT did change cristae morphology and cristae dynamics.

      Major Concerns

      1. Many conclusions and concepts need more clarification. For example, a major take home from the abstract is that various ETC inhibitors and protonophores reduce cristae density but not did not change cristae remodeling events. If cristae density is reduced, how can this occur without cristae remodeling events? Remodeling events need to be clearly defined in the introduction and abstract.

      Response: Thank you for pointing out this lack of sharpness in our terminology which indeed can cause a misunderstanding. To avoid this, we have now included ‘changes in cristae morphology’ as well as ‘dynamic merging and splitting events of cristae’ under the broader term cristae remodelling. Thus, we had changed the wording ‘cristae remodeling’ to cristae dynamics in the abstract and wherever appropriate in the manuscript text.

      The cristae morphology analysis showed no change in cristae area (Fig S3C) which was accompanied by mitochondrial enlargement. Therefore, cristae density was reduced. For the purpose of clarity, we added a sentence in the introduction section while giving a peek into our results that ‘cristae dynamic events are ongoing despite reduced cristae density’. In addition, we have now included in the results section the following statement: ‘Cristae membrane remodeling has been used to describe cristae dynamic events (i.e. cristae merging and splitting) as well as overall changes in cristae morphology within a single mitochondrion in this manuscript’.

      Figure S3C and D

      1. Other interpretations are also unclear such as how ETC inhibitors which reduce ATP levels did not impact cristate remodeling events, yet inhibiting ATP/ADP exchange did greatly impact this phenomenon. It seems likely that the inhibition of ANT has nothing to do with ATP/ADP exchange since most of the ETC inhibitors no doubt greatly impact overall ATP/ADP exchange. This interpretation needs clarification.

      Response: We agree that further clarification is needed, in particular to explain why ATP/ADP exchange is actually ongoing even when OXPHOS inhibitors are applied and to explain why reduced ATP levels do not mean that there is no ATP/ADP exchange occuring. Treatment of HeLa cells with various mitochondrial toxins inhibiting the function of OXPHOS complexes leads to decreased ATP levels due to ongoing ATP consumption within the cell (Fig 4). One should also consider that two things can and do happen when most of these toxins are applied regarding ATP exchange. First, the ATPase can act in reverse mode which is a (partial) compensatory mechanism to restore ΔΨm and which will further decrease ATP levels (Note: not in the presence of oligomycin). Second, under these conditions ADP/ATP exchange is still ongoing in order to transport ATP derived from glycolysis in the cytosol to the mitochondrial matrix which also causes an (partial) compensatory increase in membrane potential. After ATP import ATP is hydrolysed to ADP for reverse proton pumping via the F1FO-ATPase or alternatively by the F1-part alone without proton pumping. In all these cases it is essential and possible to exchange ADP with ATP constantly. Therefore, the overall exchange of ADP and ATP is not necessarily grossly expected to be different when compared to untreated cells (due to compensatory glycolysis and subsequent ATP import and hydrolysis in the matrix). On the other hand, BKA treatment which clearly impairs the exchange of ADP and ATP will lead to a completely different situation compared to only treating with OXPHOS inhibitors. With BKA the mitochondrial matrix cannot anymore be resupplemented with ATP derived from glycolysis and metabolite flux is grossly hampered. Consistent with this a strong reduction in ΔΨm and oxygen consumption is accompanied with BKA treatment (Fig. 5AB & SFig 7F). Thus, w.r.t cristae dynamic events, in the time-frame we used for imaging, a reduction of ATP levels does not impede occurrence of cristae merging and splitting events while BKA treatment does (Fig S7). We discuss this indeed interesting and unexpected finding in the discussion section. We propose that rather ongoing metabolite flux (ATP/ADP exchange) is critical for maintaining cristae dynamics and blocking it is detrimental for it. We adapted the discussion in this direction to make it more clear.

      Figure S7A, B and D

      1. Why did the authors wait 30 min to image after the addition of mitochondrial toxins? I would have guessed there is a more rapid change in response to these inhibitors. Is there is a chance he authors missed the most dramatic events?

      Response: Since we were inclined to observe early responses, cells were imaged within the first 30 mins after addition of the respective mitochondrial toxins (Please see methods ‘cell culture transfection and mitochondrial toxin treatment’). Thus, to answer this question we want to emphasize that we did not wait 30 minutes but we restricted our time frame of analysis to 30 min. Therefore, we think that we did not miss out on any rapid changes occurring early on. Regarding this point, Reviewer #1 (Query 3) asked for responses at a later time-point. Please read the Reviewer #1, response 3B.

      1. How do these mitochondrial toxins that are known to cause mitochondrial swelling not induce changes in cristate density?

      Response: Thank you for the question. Probably, there is a misunderstanding. In Fig S3E, we clearly show that as the mitochondrial width increases in cells after treatment with mitochondrial toxins, there is a clear decrease in cristae density. In fact, the reduced cristae density is observed exclusively in enlarged mitochondria. Figure S3E-I

      5. It's interesting that inhibition of the ANT translocator by BKA treatment led to increased percentage of mitochondria with abnormal cristae morphology. It's accepted that inhibition of ANT profoundly reduces mitochondrial swelling. Do the authors have any data suggesting that abnormal cristae morphology actually is a mechanism for reducing cell death events such as permeability transition? Did the authors utilize cyclosporin A concomitantly with any of the mitochondrial toxins?

      Response: This is a very interesting question! As the reviewer might be aware, there is evidence connecting cristae remodelling to induction of apoptosis (3). Cristae transitioned to a highly interconnected state after tBID treatment within minutes. However, it is unclear what is the contribution of cristae dynamics in this regard. Within 30 mins, there were no visual signs of cell death in our experiments as observed under a microscope. Hence, we did not use cyclosporin A in our experiments. In our opinion, this question will form part of a very interesting future study and is currently beyond the scope of this manuscript.

      1. Are the authors confident in the data given many of the experiments utilized quantification of 10-20 mitochondria? How are you sure this sampling is sufficient for phenomenon being studied?

      Response: Please see Reviewer 1, Response 4B. As pointed in the response to reviewer #1, the cristae morphology is fairly uniform in individual cells. Therefore, in order to maximise the cell population covered, we randomly used a maximum of two mitochondria from each cell. In addition, we included cristae analysis from at least three biological replicates in order to observe the reproducibility of the data. Taking these factors into consideration, we are confident that our results reflect a sufficient sample size. Further, we would like to point out while our group performs STED super-resolution imaging routinely, the quantification of cristae merging and splitting events done in a blind yet manual manner is a really laborious and time-consuming process. In the future, we are also looking to optimise this at least in a semi-automated manner.

      1. Figure 4 and 5 merely confirm current dogma and don't really contribute to the overall conclusions and can be moved to supplemental data.

      Response: We agree that Fig 5 is confirming to the current dogma. Therefore, we moved it to Fig S6. Regarding Fig 4, we would like to highlight that there is a decrease of ATP levels before mitochondria enlarge. Thus, we would like to retain it as part of the main figure.

      1. It's interesting that BKA dose dependently decreased ATP-linked respiration and all doses limited maximal respiratory capacity. It would be interesting to know if the BKA normal vs. abnormal mitochondria have differential membrane potential?

      Response: Thank you for the interesting question. Overall, BKA treatment leads to a significant decrease of ΔΨm in the whole cell population (Fig S7). Further, the abnormal cristae morphology is only seen in one-third of the population of mitochondria (Fig shown in Response 2). Thus, a drop in ΔΨm seems to be a very early response upon exposure to BKA and independent of cristae morphology. An ideal experiment to address this question would be to image cristae dynamics and ΔΨm using super-resolution imaging which is challenging according to the state-of-art and available chemicals.

      Figure S7E and F

      1. Overall, this is an interesting study and seems appropriately performed but the results and conclusions are unclear. More discussion should include physiological relevance and impact and how this data influences previous work. Some physiological perturbations beyond the mitochondrial toxins and or utilization of genetic models would strengthen the interpretation and overall impact.

      Response: Thank you. We added an OPA1 blot showing the different L-OPA1 and S-OPA1. (Reviewer #1, response in significance section) where we observed that S-OPA1cleavage is selectively enhanced in CCCP-treated cells which could be correlated with enhanced cristae dynamics. We also included these results in the main text.

      New Figure S6C

      Referees cross-commenting

      Yes, I conclude that given the significant overlap in reviwer comments and general need for clarification of concepts and data that a revision is in order.

      Reviewer #2 (Significance):

      Overall, a highly specialized study with audience limited to mitochondriacs. Although, I'll note tis is a hot area of study and there is high interest in the field. Some of the data interpretation is difficult to understand and overall more context is needed to explain the results, impact and relevance. Defining exactly what a cristae remodeling event is and how this differs from cristae density and how the two aren't directly connected is unclear.

      Review by a mitochondrial biologist specializing in mitochondrial signaling and connection to physiology.

      References:

      1. Baker MJ, Lampe PA, Stojanovski D, Korwitz A, Anand R, et al. 2014. Stress-induced OMA1 activation and autocatalytic turnover regulate OPA1-dependent mitochondrial dynamics. EMBO J 33: 578-93
      2. Farkas DL, Wei MD, Febbroriello P, Carson JH, Loew LM. 1989. Simultaneous imaging of cell and mitochondrial membrane potentials. Biophys J 56: 1053-69
      3. Scorrano L, Ashiya M, Buttle K, Weiler S, Oakes SA, et al. 2002. A distinct pathway remodels mitochondrial cristae and mobilizes cytochrome c during apoptosis. Dev Cell 2: 55-67
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We would like to thank all reviewers for taking the time to evaluate our manuscript fairly and critically. Many helpful suggestions and discussion points were raised. One important group of comments raised concerns whether our proposed timer and counter models were the appropriate conceptual framework to discuss nuclear multiplication in schizogony, whether they were mutually exclusive, and whether other alternatives should be considered. These comments were instrumental for us to uncover some inconsistencies in our previous modeling approach. In the new manuscript, we now define the counter and timer models much more rigorously in the context of Plasmodium cell division. Based on these refined models we now provide a new statistical analysis that goes beyond the previous analysis, significantly improving the statistical support for our conclusions. Details are given in the following individual replies.

      Reviewer #1 (Evidence, reproducibility and clarity):

      Summary

      Malaria parasites replicating in human red blood cells show a striking diversity in the number of progeny per replication cycle. Variation in progeny number can be seen between different species of malaria parasites, between parasite isolates, even between different cells from the same isolate. To date, we have little understanding of what factors influence progeny number, or how mechanistically it is controlled. In this study, the authors try to define how the mechanism that determines progeny number works. They propose two mechanisms, a 'counter' where progeny number is determined by the measurement of some kind of parasite parameter, and a 'timer' where parasite lifecycle length would be proportional to progeny number. Using a combination of long-term live-cell microscopy and mathematical modelling, the authors find consistent support for a 'counter' mechanism. Support for this mechanism was found using both Plasmodium falciparum, the most prominent human malaria parasite, and P. knowlesi, a zoonotic malaria parasite. Of the parameters measured in this study, the only thing that seemed to predict progeny number was parasite size around the onset of mitosis. The authors also found that during their replication inside red blood cells, malaria parasites drastically increase their nuclear to cytoplasmic ratio, a cellular parameter remains consistent in the vast majority of cell-types studied to date.

      Major Comments

      It is stated a few times in this study that P. knowlesi has an ~24 hour lifecycle, and while this is the case for in vivo P. knowlesi, it was established in the study when P. knowlesi A1-H1 was adapted to human RBCs (Moon et al., 2013) that this significantly extended the lifecycle to ~27 hours, which should be made clear in the text. As much of this study revolves around lifecycle length and timing, the authors should consider some of their findings with the context that in vitro adaption can significantly alter lifecycle length.

      The reviewer raises an important point that we didn’t discuss for P. knowlesi. We now mention this directly in the introduction chapter (line 67) and in the discussion (lines 470ff). We are aware that P. knowlesi takes about 27 hours in the lab, which was also communicated by the Moon lab. We now cite relevant studies again in this context. We further address the issue of modified cell cycle time in vitro in the discussion in the sense that absolute values must be taken with caution and the focus of this study is about the relative ratio and correlation between the different cell cycle metrics.

      • The dichotomous distinction between 'timer' and 'counter' as mutually exclusive mechanisms seems to be a drastic oversimplification. Considering the drastic variation we see in merozoite number across species, between isolates, and between cells, it seems much more likely that there are factors controlled by both time-sensed and counter-sensed mechanisms that both influence progeny number.

      The study of progeny regulation in malaria parasites is very much in the early stages. We can agree that our models are simplifications, as is the case with all models. Our choice of just the two models timer and counter was driven by the number of cellular parameters we measure, i.e., duration of division phase and progeny number. These data essentially allow us to test the two competing models we presented. As we quantify more and more cellular parameters, based on the quantitative live cell imaging protocols established here, we will be able to test more complex cell cycle models. With our current data, we believe more complex models are not warranted.

      However, this valuable criticism, in conjunction with related remarks by other reviewers, made us reevaluate the constraints of our model more precisely. We noticed that the criteria used in the previous version in the manuscript contained unnecessary additional assumptions. Briefly, the previous counter model also required that final merozoite number was tightly controlled, while the previous timer model required the growth rate to be tightly controlled. These side assumptions were not made explicit in the manuscript and could bias the support towards one or the other model.

      We now improved the modeling approach substantially by removing implicit side assumptions, and clearly defining timer and counter models in terms of their correlations. The refined formulation of the timer posits that between individual parasites the target duration and the nuclear multiplication rate vary in a statistically independent way; while in a counter, target number and nuclear multiplication rate are statistically independent. We now explain this extended analysis in more detail in the introduction (lines 86ff). We also now more clearly state the dichotomous nature of the model (line 488). A new results paragraph (lines 213ff) and an entirely new Fig. 2 (and Fig. S4) contains the model predictions and statistical comparison between the models.

      This more rigorous treatment showed that including the variance of the multiplication rate was critical to allow a clean discrimination between the models. Also, with the sole exception of P.knowlesi H2B, where no model was clearly favored (Fig. 2G-H,K), the timer model was found to be inconsistent with the data, while the counter was clearly favored. Our new goodness-of-fit analysis also showed that although the counter is strongly simplified, it produced adequate fits, demonstrating that potential model refinements would need to be justified by new, more extensive data.

      It is also important to consider that the degree of variation in merozoite number could rather be an expression of varying growth conditions and does not directly predict which of the proposed models are true. For instance, a counter where the target merozoite number varies strongly depending on growth conditions, would be consistent with all available data. It is an interesting question for future work whether a counter would indeed describe growth across different isolates.

      The biological reality of growth regulation is certainly complex, and the counter model will likely need to be refined in the future, which we acknowledge in a corresponding statement in the discussion (lines 491ff). Nevertheless, we find it encouraging that a simple model can explain the vast majority of our data very well.

      Additionally, the only parasite parameter measured in this study, size at time of first nuclear division, explained only a small proportion of the variance observed in merozoite number.

      It is indeed the case that amongst the measured parasite parameters i.e. schizont stage duration, nuclear volume, and cell size we only found the latter to correlate with the final progeny number. We did not aim to imply that all variation in progeny number is explained by cell size. It is likely that a putative counter relies on a set of factors, which are somehow linked to cell size. In addition, intrinsic stochasticity in nuclear growth is likely to contribute to final merozoite number variability, which is included in our models via a variable growth rate. Defining the actual limiting factor or combination of factors will be an exciting challenge for the future studies building on this one.

      • For modelling of a timer-based mechanism, the designation of t0 is subjective. The authors chose the time of first nuclear division as their t0. It is possible that a timer-based mechanism could not be supported based on this model the chosen t0 differs from when the "parasite's timer" starts. For example, t could also have been designated as the time from merozoite invasion (t0) to egress (tend). It would be unreasonable to suggest the authors repeat experiments with a longer time-frame to address this, but this possibility should be discussed as a limitation of the model. It may also be possible to develop a different model where t0 = merozoite invasion and tend = egress, and test this model against the data already collected in this study.

      This is a valid point. We indeed, considered the time point of invasion as the other relevant time point in the IDC for a possible timer. Due to necessary compromises in imaging protocols between acquisition length, temporal, and spatial resolution we have not been able yet to combine full-length IDC measurements with quantification of progeny number. Given the choice, however, between time point of invasion and the onset of nuclear division as starting point for a potential timer we would still favor the latter: An argument can be made that a timer that regulates offspring number would be more accurate when activated at the moment of the relevant cellular events rather than “running” for a very prolonged growth phase before any “decision” concerning parasite replication. We are still convinced that the entry into the schizont stage, which we analyze here, marks an important cell cycle transition point that has been highlighted in many different studies. As suggested, we now discuss the limitations of our selection of t0 in the text (lines 146ff).

      • The calculation of the multiplication rate is confusingly defined. In Figure 1 it is stated that it is "...based on t and n", which would imply that the multiplication rate is the number of merozoites formed per hour of schizogony, which would give an average value of ~2 for P. falciparum and ~1.5 for P. knowlesi. The averages rate values shown, however, are in the range of 0.15-3. The authors should clarify how these values were determined.

      Thank you for pointing out the need for more clarity. Since the nuclear multiplication, similar to e.g. cell population growth, follows an exponential law, the multiplication rate used (lambda) is in fact a logarithmic growth rate. Therefore, it occurs in the exponent (not as a coefficient) in the exponential growth function ( ), which explains the range. We now mention this more explicitly in the results (lines 163ff).

      • In Figure 2, the time from tend until egress is calculated, and this is interpreted as the time required for segmentation. In the Rudlaff et al., 2020 study cited in this paper, it is shown that segmentation starts before the final round of nuclear divisions are complete. Considering this, the time from tend until egress is not an appropriate proxy for segmentation time. The authors should consider rewording to something akin to "time from final nuclear division until egress" to more accurately reflect these data.

      Thank you for indicating our imprecise use of the nomenclature. Indeed, some essential segmentation-associated structures such as rhoptries and subpellicular microtubules are clearly forming before the last division. We were referring to “segmentation” as the time window where actual ingression of the plasma membrane occurs between nuclei with the concurrent formation of more prominent IMC-associated sub-pellicular microtubules between nuclei (as in Fig. 1A last panel). We can, however, agree that consistently using the term “merozoite formation” is more adequate here. We have now corrected the terminology according to the suggestions of the reviewer (lines 271ff).

      • There is a significant discrepancy between the data in Figure 5 and Supplementary Figure 8. In Supplementary Figure 8, the authors establish that culturing parasites in media diluted 0.5x has a marginal effect on parasite growth, with no discernible change in parasitaemia over 96 hours. By contrast, in Figure 5a the parasitaemia of parasites cultured in 0.5x diluted media is approximately 5-fold lower than those in 1x media. The authors should explain the significant discrepancy between these results.

      The reviewer correctly points out a difference in parasitaemia between two parasite culture experiments, shown in Figs 5a (now 6A) and S8 (now S11), respectively. There were several differences in the experimental setup used in the two experiments that could explain this discrepancy. In Fig. 5a the parasites were synchronized to early ring stages while in Fig. S8 we used asynchronous cultures (maybe with a slight majority of late stages). One could speculate that by the time the synchronized ring stage culture reached egress the effect of nutrient depletion, which started at t = 0 h is more pronounced. This effect could have been exacerbated by the more frequent media change of 24 h in Fig. 5a vs 48h in Fig. S8. Lastly, the starting parasitemia was differently set being higher at around 0.5% in the Fig. 5a while only 0.2% in Fig. S8. Possibly a lack of nutrient is “felt less” by the culture at lower parasitemias. Generally, in Fig. S8 we were more focused on highlighting the difference between 1x/0.5x and the more diluted conditions on the long-term culture and to show that continuous culture is actually possible in 0.5x medium. We have now expanded the legends to highlight those differences more clearly.

      • In Supplementary Figure 4, the mask on the cell at t0 shows two distinct objects, but it seems very unlikely that they are two distinct nuclei as they vary approximately 5-fold in diameter. The authors should provide more detail on how their masking was performed for their volumetric analysis. Specifically, whether size thresholds were also applied during object detection.

      Thank you for requesting clarification here. Fig S4 (now S7) shows only one z-slice (not a projection) of the entire image stack, to illustrate how the thresholding approach was performed on every single image slice. The two objects in the shown cell are indeed two nuclei, but because they are not in the same z-plane appear to be of different size. In particular, only a slice of the upper part of the nucleus on the lower right is visible in the shown slice. Throughout the study, volume determination was realized by adding up the individual slices, as is explained in detail in the Materials and Methods sections. We have now added a more explanation in the figure legend to clarify the procedure.

      Minor Comments

      • Line 45-48 mentions that merozoite number influences growth rate and virulence, but the corresponding reference (Mancio-Silva et al., 2013) only discusses the relationship between merozoite number and growth rate, not virulence.

      We thank the reviewer for requesting this distinction. Merozoite number and virulence have not been correlated in vivo so far. Certainly, because one can’t retrieve late-stage P. falciparum parasites from patients, but maybe partly because merozoite number has not gotten significant attention as a metric in the previous decades. Even if merozoite number is intuitively connected to growth rate which might causes higher parasitemia which is in turn linked to more severe disease outcome it is important to emphasize that those are certainly not equivalent. We have therefore removed the statement about virulence (line 48).

      • Line 59 states that a 48 hour lifecycle is a baseline from which in vitro cultured parasites deviate. Clinical isolates also show variation in lifecycle length and so it is more accurate to just say that 48 hours is an average, rather than a baseline.

      The word “baseline” has been changed to “average” (line 61).

      • Line 63 cites a study for the lifecycle length of P. knowlesi (Lee et al., 2022), but there seems to be no mention of lifecycle length in this reference

      This reference was meant to serve as an introductory review article to research in P. knowlesi. Actually, to the knowledge of the authors, there is no study presenting quantitative data showing that the in vitro cycle of P. knowlesi is actually around 27 h. Our lab experience is however coherent with a 27 h cycle, which was confirmed by personal communication by the Moon lab. We now also cite in the next sentence the inaugural P. knowlesi adaptation publication (Moon et al. 2013) showing some time course data indicating the duration of the IDC to be around ~27h (lines 67ff).

      • If I am interpreting Figure 3B correctly, this is essentially a paired analysis where the same erythrocytes are measured twice, once at t0 and once at tend. If this is the case, this data may be better represented with lines that connect the t0 and tend values.

      Yes, these are the same erythrocytes measured twice. We have modified Figure 3 (now Fig. 4) accordingly.

      • Figure 3A seems to imply that to calculate diameter of the erythrocytes, three measurements were made and averaged for each cell. I think this is a nice way to get a more accurate erythrocyte diameter, but if this is the case, it should be specified in the figure legend or methods.

      This is already described in the figure legend (line 305).

      • In Figure 4I it is shown that in P. falciparum merozoite number doesn't correlate with nucleus size, but for P. knowlesi in Supplementary Figure 7c, a significant anticorrelation is observed. The authors should state this in the text and discuss this discrepancy.

      Contrary to all other graphs, visual inspection of the distribution of data points in Fig. S10C shows that it contains two outlier data points at the bottom right. Those two specific points are also responsible for the significant anticorrelation. We did not filter or remove any quantification results but also didn’t have sufficient confidence in this data distribution (which is further based on the segmentation of the Histone2B not on an NLS mCherry signal) to make substantial claims about anticorrelation. Because we considered it informative we still decided to show it in the supplements. We now briefly mention the issues with the data set and its interpretation in the text (lines 350ff).

      • The authors show that merozoite number roughly correlates with cell size at t0 but it would be interesting to see whether cell size at tend also corresponds with cell size at t0. This might help answer whether the cell is larger because it has more merozoites, or whether it has more merozoites because it is larger.

      Plotting parasite cell volume at t0 against cell volume at tend (as well as between t-2 and tend) indeed shows a positive correlation (see below). While it is an interesting thought we concluded after some discussion that no convincing causal relationship between cell size and merozoite number can be inferred based on this analysis. Since we consider the possible statement that cells that are bigger in the beginning are also bigger in the end unavailing, we decided not to include the data.

      • I don't feel that "nearly identical" is an appropriate summary of erythrocyte indices in Supplementary Figure 9, considering there is a statistically significant increase in mean cell volume. I think it is unlikely that this change is consequential, and performing these haematology analyses is a nice quality control step, but this change should be stated in the text.

      In the modified text we now express the significant change in MCV in terms of percentage, which is around 1.2% (line 381).

      • In Supplementary Figure 8, parasitaemia only increases ~2-fold compared to >5-fold the previous two cycles. It seems likely that at the final timepoint on this graph the parasites are starting to crash, and therefore it may be best to end the graph with the 96 hour timepoint.

      The reviewer suggests that cultures at those parasitemias might not be in perfect health. Our Giemsa stains did not show signs of an unhealthy culture and kept growing. It was, however, important for us to show that cultures can be maintained in culture over a prolonged period of time in 0.5x medium, even when resulting in reduced growth, while this was not possible with lower dilutions. Therefore, we would like to keep the data point. We have added a cautionary comment in the legend.

      • The error bars in Figure 5C aren't easily visible, moving them in front of the datapoints may help their visibility.

      Error bars were moved in front of the data points.

      • In Figure 6D & E, the y-axis labels should be changed to whole integers as all the values in the graph are whole numbers.

      We have changed the y-axis labels accordingly.

      • My interpretation of Figure 6 C-E, is that these are the same cells measured at three time points (t-2, t0 and tend). If this is the case, 6C is missing the cell that has a merozoite number of 8, which is presumably why the y-axes are not equalised for the three graphs.

      It is correct that the same cells are displayed in all three plots, with the exceptions of three cells in 6C (for the timepoint t-2), which are missing for the following reasons: 1) it was not possible to determine the volume at this respective timepoint due to technical issues or 2) the cell was already just before t0 at the start of the movie so that t-2 had already passed. We now note this in the figure legend and have also equalized the y-axes (now Fig. 7C-E).

      Reviewer #1 (Significance):

      In the asexual blood-stage of their lifecycle, malaria parasites replicate through a process called schizogony. During schizogony an initially mononucleated parasite undergoes multiple asynchronous rounds of mitosis followed by nuclear division without cytokinesis, producing a variable number of daughter nuclei. Parasites then undergo a specialised cytokinesis, termed segmentation to where nuclei are packaged into merozoites that go on to invade new host cells. While nucleus, and therefore merozoite, number are known to be varied between cells, across isolates, and across species, little is known about the mechanisms regulating merozoite number. In this study, the authors use live-cell microscopy to understand how parasites determine their progeny number. They suggest that parasites regulate their progeny number using a 'counter' mechanism, which would respond to the size or concentration of a cellular parameter, as opposed to a 'timer' mechanism. Long-term live-cell microscopy experiments using malaria parasites are extremely technically challenging, and the authors should be commended for their efforts in this regard. While I agree that the data generated from these experiments are technically sound, I have some reservations expressed above about the interpretation of some of these results. I would strongly encourage the authors to consider rewording some of their interpretations taking into account some of the caveats listed above. I would also consider fitting/testing an additional mathematical model where the time-frame proposed for the 'timer' mechanism begins following merozoite invasion.

      We thank the reviewer for the appreciation of our work and hope we have sufficiently reworked the manuscript based on the comments listed above. Furthermore, we think the improved model statement and analysis improves the clarity of our conclusions. Indeed, we would like to test additional models including the full IDC once, as mentioned above, we are technically able to generate these data.

      This work is of specific interest to anybody who grows malaria parasites, as the dynamics of their growth is obviously important to understand. Further, this work is of interest more generally to cell biologists who study the regulation of progeny number or cell size. I have no experience with the application of mathematical modelling to understand biological systems, and so I cannot comment on the interest of this work to that field.

      Reviewer #2 (Evidence, reproducibility and clarity):

      This is a solid study that further characterises the dynamics of nuclear division in Plasmodium falciparum and P. knowlesi. Of two, among potentially several, models for how the number of daughter nuclei, and thus parasites - (called merozoites in this genus), are one that posits nuclei divide until a fixed timer ends, and one that posits that nuclei divide to reach a fixed number that is defined by a cellular counter. I find some practical difficulties in definitive measurement of either model, one issue with the former is that experimental definition of the start of the timer is problematic - we may define the starter's gun (eg by the first nuclear division) but it isn't necessary that the cell is using that same start time.

      We are pleased that the Reviewer found our study ‘solid’. Concerning the timer model, we agree that the selection of the starting point is a critical aspect of this study, as also Reviewer 1 pointed out. We selected this particular “t0” because the entry into the mitotic phase marks an important cell cycle transition. Several studies have suggested a “schizogony entry checkpoint” might be active just before (Matthews et al, 2018; Voß et al, 2023; van Biljon et al, 2018; McLean & Jacobs-Lorena, 2020). Once cells are committed to the schizont stage they are less responsive to stimuli. Alternatively, the timepoint of erythrocyte invasion could be a legitimate starting point. Due to necessary compromises in our imaging protocol between acquisition length, temporal, and spatial resolution we have not been able yet to combine full-length IDC measurements with quantification of progeny number, and therefore we leave exploration of an earlier timer start for future work. Within the confines of the model comparison in the current study, we think the selected t0 is already highly informative. We now explain the selection and limitations more explicitly in the text (line 144ff).

      Additionally, as the authors confirm here, being sure when that first nuclear division has occurred is particularly tricky with Plasmodium parasites, in part because the first few nuclei seem to clump together, preventing one from unambiguously calibrating the first division.

      The Reviewer is concerned about difficulties with precise reporting of the time point of first nuclear division. We suspect there was a misunderstanding here. In the text (line 137) we had written the following:

      “Although separating individual nuclei after the first two rounds of division was challenging due to their spatial proximity, the improvements in resolution and 3D image analysis allowed us to count the final number of nuclei routinely and reliably at the transition into the segmenter stage.”

      To clarify, when analyzing 3D image stacks produced by the LSM900 Airyscan the first nuclear division can consistently and unambiguously be detected. In anaphase the nuclei are pushed apart quite substantially before getting a bit closer together afterwards (see e.g. Fig. 1B and C). Hence the precision of the detection is only limited by the 30 min interval of the time lapse. Later, at the four nuclei stage, crowding makes distinction more difficult. In the final segmenter stage, the reorganization and condensation of nuclei makes reliable counting possible again. We have now reformulated the quoted sentence for more clarity (lines 137ff).

      Furthermore, getting decent replicate numbers is hard because of the difficulties of time lapse microscopy, and most Plasmodium studies (including this one) suffer from low enough numbers that it isn't always clear whether the numbers support one model over another.

      The reviewer points out the difficulty of obtaining enough replicates in Plasmodium time-lapse studies. We agree that depending on technology, sufficient replicates can be challenging. In the present study we obtained Ns between 25 and 35 for all conditions in P. falciparum and P. knowlesi from three independent replicas. To gain confidence in the conclusions from a limited, but not austere, data, it is essential to 1) reduce model complexity to a minimum and 2) perform stringent statistical analysis including accounting for small-sample variation. Motivated by this concern of the Reviewer and a similar point raised by Reviewer 1, we have revisited our modeling approach in the revised manuscript. This led us to a corrected, more rigorous definition of what precisely we mean by ‘counter’ and ‘timer’ models: The timer posits that between individual parasites the target duration and the nuclear multiplication rate and vary in a statistically independent way, while in a counter target number and nuclear multiplication rate are statistically independent. With no further adjustable parameters, the two models are thus both mutually exclusive and minimal. Although biological reality is likely to be more complex, we feel that these minimal models are adequate for the amount and resolution of our current, state-of-the art data. The general result remained the same: The counter model is strongly preferred in almost all our experiments data (new Fig. 2), with the sole exception of P. knowlesi H2B, where indeed more data may be needed to come to a clear conclusion. Furthermore, we have taken care to scrutinize these conclusions accounting for goodness-of-fit for the respective sample size N. This analysis showed, surprisingly, that the counter model was sufficient to account for the data: the real dataset was as similar to the counter prediction as synthetic, counter-generated data. We hope that this improved statistical analysis can help the reader judge the robustness of our conclusions.

      Nonetheless, several recent studies, particularly a study from the same institute (Klaus et al., 2022) employing timelapse imaging of nuclei, and timing the nuclear division of parasites, finds poor correlation between the duration of "schizogeny" (although perhaps using a different definition to the one used by the parasite) and the final number or merozoites. They therefore argue that there is poor evidence for a timer, and conclude by elimination that a counter must exist instead. A review by some of the authors of that study and some of this current study (Voß et al 2023), also concludes that the data from Klaus and colleagues "strongly support" a counter model. This current study also concludes that a counter model controls final nuclear/merozoite number in P. falciparum and P. knowlesi. This much at least is not particularly novel given the recent work on this topic, although the addition of the P. knowlesi data is interesting and consistent with the prior P. falciparum work.

      Our present work, indeed, does confirm the previous report of a counter over a timer, through a more targeted approach. While Klaus et al. used timing data of first nuclear cycle vs. the full duration, we now provide, thanks to an improvement microscopy setup and protocol, simultaneous measurements of timing and final progeny number, i.e. counting of merozoites/nuclei. While the preference for a counter model is not fundamentally novel, the additional information that the counter model holds in different strains, conditions and species is, in our opinion, not trivial and points to some degree of evolutionary conservation. We also demonstrate here that the counter model is not only preferred over the timer, it also fits the data adequately, so that it can be considered ‘correct’ at this level of complexity. Another, possibly more important, value of this study lies in the quantitative and time-resolved assessment of multiple important parasite metrics such a cell volume and nuclear volume together with merozoite number at the single cell level. Although descriptive, this has not been achieved in Plasmodium until now.

      As above, the authors concede that it is difficult to determine with strong confidence when the first nuclear division has occurred, so it may well be that there is substantial noisiness in the time that they define schizogeny to commence. If that were the case, this would contribute to the poor correlation observed between schizogeny duration and number of merozoites produced, so this could be an important confounding experimental factor. This deserves some more discussion by the authors.

      Concerning the confidence with which we identify the first nuclear division we could hopefully clarify in the section above that our precision is only limited by the time resolution of the acquired time-lapse. Therefore, the uncertainty about the start time is not particularly high, and moreover, can expected to affect timer and counter (via the growth rate) to a similar degree. We see no unfair advantage for the counter for this reason.

      Alternative methods to count absolute DNA content (rather than trying to count individual nuclei) might be useful ways of independently confirming this phenomenon. Alternative possibilities for what constitutes the "start" of a possible timer are also warranted - it could be for example, the first division of one of the other organelles.

      This is an interesting suggestion. Next generation fluorogenic DNA dyes have been used by us and the Ganter group (Simon et al. 2021, Klaus et al. 2022, Wenz et l. 2023) to assess DNA content of single cells over time. Our experience shows that there are some caveats to using these Hoechst based dyes, some of which we discussed in the aforementioned publications. While they allow some reasonable absolute quantification of DNA content for the very first S-Phase (and subsequent nuclear division), in later stages only relative quantification can be achieved. One underlying reason is the apparent increase of dye permeability, and therefore higher intensity, at late schizont stages. This issue is exacerbated by the asynchronous DNA replication of multiple nuclei. Further, nuclear division itself can be delayed or even inhibited when increasing the concentration of the dye, which suggest an impact on cell physiology (well documented for Hoechst based dyes in other organisms). When reaching the segmenter stage, the resulting variance in fluorescent intensity would make it challenging to assign a reliable number of nuclei required for analysis, a problem that does not occur when counting individual nuclei. Taken together, unfortunately, all these confounding factors make DNA content analysis in live single cells for the entire schizont stage unachievable at this point.

      These and previous authors in any case conclude that a counter model must exist through exclusion of a timer model. I am less convinced that the evidence discounting the timer is conclusive, and that a straight counter model is the only alternative. Indeed I am unconvinced by the suitability of this strictly dichotomous two-model system to categorise the division of unicellular eukaryotes, and these theories are not universally held to be sufficient to describe division.

      We thank the Reviewer for this insightful comment. As already detailed above, we have clarified and corrected our model definitions in the revised manuscript. Further, we want to make the important distinction between organisms, including unicellular ones that undergo binary fission and the ones like Plasmodium that use schizogony. Our model, although inspired by model organisms, is tailored to a multinucleated division mechanism, and clearly defined within those boundaries. The timer and counter models we consider are defined by their correlation structures. They are at two extremes of a continuum of models which could be characterized, for instance, by the ratio of correlations (growth rate - nuclear number) vs. (growth rate – duration) as an additional parameter. As the reviewer points out, excluding the timer model is not equivalent to proving the counter model, and indeed a partially correlated model, or a more complex model entirely, could yield a better fit. However, within the realm of models without additional parameters, and which are testable with the available data, only timer and counter remain, as different timer start points are not experimentally accessible. Importantly and somewhat surprisingly, the counter model also gave a fit that is as good as can be reasonably expected for the experimental sample size (new Fig. 2). So, we maintain that within the current experimental constraints, the counter model is the only viable option for almost all our tested conditions. The observation that in H2B-GFP expressing P. knowlesi parasites no clear distinction can be made between the models, indeed, suggest that the reality of multiplication rate regulation is more complex and may be limited by different constraints in different growth regimes. We now state these limitations and the room for further model adjustments with more data in the Discussion section.

      Nonetheless, if a counter exists, what is being counted that determines the final number? The authors consider that this might be a physical object or resource inside the parasite, or an extrinsic/extracellular resource. They investigate this by comparing the final cell number to a number of factors. First, the authors investigate the size of the RBC (by musing the diameter as an indicator)- little information is given about the source of the blood used, but it appears to be from a single donor of unknown age, who has approximately typical variance in RBC diameter (at least, after manipulation and storage). The authors observe little correlation between these variables.

      We share the curiosity of the reviewer about what might be “counted” by the parasite. This shall be the subject of future studies, and our present study provides the necessary basis for asking this question and defines a framework to investigate it. Concerning the size of the host cell, the blood used was from a different donor for each of the replicas, which we now specify in the figure legend (line 302). No significant difference between the RBC diameters between the donors was observed. A correlation between RBC diameter and progeny number was indeed not observed.

      Second the authors measure parasite size at the onset of schizogeny, and find that bigger parasites result in more daughter merozoites early in schizogeny (perhaps not surprising, given the earlier mentioned technical problems with measuring the first few steps of schizogeny), but that this different initial cell size doesn't result in a different final merozoite number, or as they describe it "not quite significant anymore". Previous p values were taken as cause for rejecting the timer hypothesis and the timer model. In this case the authors instead interpret the data as suggesting "that the setting of the counter might correlate with parasite cell size". This is inconsistent statistical and analytical handling, and highlights the earlier potential pitfall of rejecting timer-based models based on not gathering data that statistically show a correlation. This needs reworking to highlight that these data are inherently noisy, difficult to measure accurately, and aren't necessarily going strongly reveal a trend even where one biologically exists, and that this ought not be used as grounds for confident rejection of a model.

      The Reviewer raises concerns about the consistency of the statistical interpretation of our data. We care deeply about the well-foundedness of our conclusions and hope to eliminate these concerns in the following. First, we hope that the issue about the “technical problems” in measuring the first division has been solved in our response to previous comments. Next, to clarify an apparent misunderstanding: As stated in the text (lines 329ff) and shown in now Fig. 5D-E, cell size at onset of nuclear division or 2 hours prior does significantly correlate with final merozoite number. The lack of significant p-value (0.08) only pertains to the correlation of cell size at the end of the schizont stage (tend) with merozoite number (now Fig. 5F). We have removed the unfortunate wording “not quite significant anymore” in that context. Finally, regarding potential mechanisms, a potential counter must be set before the first nuclear division is completed because only that way it can be set independent of the speed of nuclear multiplication. This observation gives the statistically significant correlation of volume at the onset of division and progeny number its relevance. We have reformulated the marked sentence for more clarity (lines 331ff). Furthermore, we point out that our rejection of the timer is now based on a revisited statistical analysis (Fig. 2), which is no longer based on a simple correlation between final number and duration, as detailed above.

      Finally, the authors grow the parasites in dilute media, and find that they produce fewer daughter parasites. This is anecdotally unsurprising, as most Plasmodium laboratories are aware that sub-optimal growth conditions result in less healthy schizonts with fewer viable merozoites (and lower magnitudes of single-cycle expansion), but is nonetheless an important result that highlights explicitly how much this occurs in the specific conditions of dilute media. Given the lack of investigation of exactly which nutrient, carbon source, or combination thereof leads to the reduced merozoite number, it is unclear if or how much this is relevant to the scenario of a natural infection and realistic levels of that nutrient in a human or primate parasite environment.

      As rightfully pointed out by the reviewer suboptimal growth conditions affecting parasite growth and multiplication rate have been shown in many instances. The number of studies that actually quantify a reduction in merozoite number under different growth conditions is certainly much lower (Brancucci et al. 2017 (lipids), Mancio-Silva et al. 2017 (calorie-restriction in mice), Tinto-Font et al. 2022 (temperature) come to mind). What our study adds to this body of literature is to which extent duration of the schizont stage and cell volume are affected in relation to progeny number at the single cell level. Importantly, we wanted to test whether the counter model still holds under these more adverse conditions, which we found to be the case. Along the lines of the work on calorie restriction and the likely implication of isoleucine in the process investigated in the laboratory of Maria Mota, it will be exciting to identify a “limiting factor” in future studies. Indeed, any study done in complete RPMI culture medium can be questioned regarding its physiological relevance and we added a sentence addressing this aspect in the discussion (lines 514ff). Yet, our medium dilution experiments suggest that at least to some degree an extracellular resource is implicated, which makes sense from a biological function point-of-view.

      Minor issues

      The manuscript confuses the terms "less" and "fewer". Fewer should be used for countable nouns (fewer daughter cells, fewer nuclei, fewer merozoites), less for uncountable nouns (e.g. less speed, less volume).

      Thank you for pointing this out. The words have been replaced accordingly.

      I didn't understand lines 93-95; "This excluded a timer and thereby confirmed a counter as the mechanism regulating termination of nuclear multiplication (Klaus et al., 2022). A direct correlation between duration of schizont stage and merozoite number is, however, still missing." If I understand the first sentence concludes that there ought not be a direct correlation between schizont duration and merozoite number, but the second sentence, says that that correlation is "however" missing. Isn't this expected? Perhaps reword for clarity?

      Thank you for requesting clarification here. The exclusion of the timer by Klaus et al. 2022 was based on the correlation between duration of the first nuclear division cycle and the total duration of all nuclear replication phases. At no point did Klaus et al. count merozoites in live single cells, which was mainly due to lower spatial resolution of their images (M. Ganter, personal communication). Therefore, they could not directly assess the relation between progeny number and schizont stage duration, which we now report for the first time. The sentence was supposed to convey that this type of data was missing and was now reformulated for more clarity (line 114).

      Lines 104

      "We further uncover that throughout schizogony P. falciparum infringes on the otherwise ubiquitously constant N/C-ratio (Cantwell and Nurse, 2019)" This seems obvious to me, and not something uncovered by this study. In most of the numerous apicomplexans that divide by endoschizogeny, the cells achieve a near final size considerably before the final rounds of nuclear division so the N/C ratio must not remain constant - this is a direct corollary of many previous descriptions and not a novel finding of this study, and this claim here should be made more modest.

      We understand the point raised by the reviewer but still think that our claim is justified due to several aspects. There are examples of eukaryotic cells that undergo multinucleated stages during division were the N/C-ratio is constant (Dundon et al. 2016, Cantwell and Nurse, 2019), while we are not aware of any counter-example in the literature. Studies have also shown that e.g. certain mutant yeast that fail to undergo cytokinesis will increase their volume by factor of up to 16 alongside the still replicating and growing nucleus maintain the N/C-ratio (Neumann et al. 2007, Jorgensen et al. 2007). This demonstrates the tremendous plasticity that cells can reveal with respect to nucleus and cell size regulation. Until the contrary was shown, it was conceivable that nuclear compaction, which does occur (Fig. 5H), compensates for the increase in nuclear number while the cell volume is only increasing slightly. Importantly, we are not aware of any literature where nuclear volume has been quantified for blood stage Plasmodium. Cell volume quantifications remain limited to modelling and the study by Waldecker et al., which provides a few datapoints throughout the IDC. Whether this finding is expected or not, formally speaking, our claim is justified, but for more clarity we replace “uncover” with “demonstrate”. We also introduce the N/C-ratio as cellular parameter in P. falciparum pointing out another divergent aspect of its biology and might in the future understand the functional implication of this usually constant ratio, which is still unclear.

      Dundon SE, Chang SS, Kumar A, Occhipinti P, Shroff H, Roper M, Gladfelter AS. Clustered nuclei maintain autonomy and nucleocytoplasmic ratio control in a syncytium. Mol Biol Cell. 2016 Jul 1;27(13):2000-7.

      Neumann FR, and Nurse P. Nuclear size control in fission yeast. J. Cell Biol. 2007; 179: 593–600. pmid:17998401

      Jorgensen P, Edgington NP, Schneider BL, Rupeš I, Tyers M & Futcher B Molecular Biology of the Cell 18 (2007) The size of the nucleus increases as yeast cells grow.

      Helena Cantwell, Paul Nurse; A homeostatic mechanism rapidly corrects aberrant nucleocytoplasmic ratios maintaining nuclear size in fission yeast. J Cell Sci; 132 (22)

      I lack specialist statistical knowledge to comment on the statistical analyses performed on the correlation data, and in particular, whether the high p values for t-Tests for correlation are sufficient to support the argument that there is not a correlation, and whether these observations are sufficiently powered to robustly test that hypothesis.

      We are confident that our reworked model analysis, as explained above, now sufficiently supports our hypotheses.

      Reviewer #2 (Significance):

      The manuscript purports to find a counting mechanism that determines parasite merozoite numbers, and that this coutner is set by an externally provided and diffusible resource. Many nutrients are in excess in normal culture media, but not all. If that counted nutrient(s) were normally in excess in the bloodstream, it could hardly be said to be the factor that is counted and that therefore defines merozoite number. Conversely, if the amount of that nutrient were increased in normal media, would parasites make even more merozoites? Further, if the "counted" item is a freely diffusible compound in the media, it should be equally accessible to each parasite in a culture condition, and isn't a reasonable explanation for the variable merozoite numbers in the normal media conditions. To me, it is unsurprising that parasites that are healthy and well fed are able to produce more merozoites, but I don't see this as being the same as support for a counter model where the parasite senses and counts a set number of merozoites to produce in response to a specific external counter. I think the shoehorning of this phenomenon into a paradigm used to describe some other eukaryotes may not be appropriate, and that the rejection of one overly simplistic timer model should not automatically lead to us dichotomously accepting a simple counter method as the alternative. The authors need to do more to either identify a countable input whose gradual increase leads to a predictable and gradual increase in merozoite number, to show that they do use a counter, or provide substantially more caveats to their argument that the parasites are using a counter based on an externally provided resource to determine merozoite number.

      The reviewer comments on the feasibility of a counter mechanism based on an externally provided and diffusible resource. In fact this is a limited view of how a counter may arise and not the one we subscribe to. Rather, while a resource may be diffusible in the medium, it would need to be consumed during schizogony, and insufficiently replenished, in order to enable counting by dilution in the host cell. Furthermore, the reviewer has doubts that the fact that “healthy and well fed […] produce more merozoites” implies “support for a counter model”. We fully agree, and we argue in the manuscript that it is the correlations between schizogony durations and merozoite counts that support a counter model.

      As we have argued above, the two alternative models we consider are inspired by paradigm from other eukaryotes, but their definitions in the present context are simple enough for them to be considered natural minimal models of schizogony. As the simplest imaginable phenomenological models of multiplication control, we find it natural to compare them, and we hope our new introductory section introduces them appropriately now. Naturally, we hope to expand on this simple model in the future and identify more precisely the limiting resources and describe a more direct response.

      Audience - relatively specialised - likely interested audience would combine apicomplexan cell biologists, as well as theorists of cell division mechanism

      Advance - limited - confirms phenomenon also described by other researchers in their institute, and extends to another related organism.

      We would like to add that the present data are the first quantitative joint measurements of schizogony dynamics and outcome in P.falciparum and knowlesi. They allowed for the first time a direct correlation of duration and merozoite number, thereby accessing the question of growth control head on. Further they provide a quantitative reference of several key cellular parameters for anybody studying asexual blood stage parasites.

      Reviewer #3 (Evidence, reproducibility and clarity):

      Summary:

      Stürmer and colleagues used super-resolution time-lapse microscopy to probe the mechanism regulating the number of merozoites produced by a single cell in Plasmodium falciparum and P. knowlesi. The authors conclude the followings-

      1. P. knowlesi has similar duration of schizont stage to P. falciparum, although having a 24 h intraerythrocytic developmental cycle (IDC) to 48 h of P. falciparum.
      2. Nuclear multiplication dynamics suggests a counter mechanism of division- which is further suggested by a significant relation of merozoite numbers with schizont size at the onset of division.
      3. Nutritional deprivation caused increase in nuclear volume and decrease in merozoite number. For the most part, the experiments that are presented in this manuscript support the conclusion of the authors. The data are presented in a concise and clear manner. However, some clarification and a couple of experiment (listed below) would improve this manuscript.

      Major comments:

      1. The authors generated at least 3 transgenic lines for this study, But the did not present any genetic validation of the lines in the manuscript. For completeness, I recommend to provide genetic validation (either pcr genotyping or whole genome sequencing) of the lines that were generated and used in this study in the supplement.

      Our study exclusively used episomal expression of the respective fluorescent reporter (H2B-GFP, NLS-mCherry, and cytoplasmic GFP). As is customary in the field resistance to selection drugs and distinct fluorescent signals are assumed to sufficiently validate the presence of the plasmids. We now added the schematic maps of the plasmids in a new Fig. S1 to make our approach more visually clear.

      1. In the H2B-GFP lines, the authors episomally GFP-tagged histone 2B to label the nuclear chromatin for both P. falciparum and P. knowlesi. This provides a very useful parasite line which enables the live time-lapse microscopy. Using these parasite lines, the authors first show that despite having a 24 h IDC in P. knowlesi vs 48 h in P. falciparum, both these parasites have a similar duration of the schizont stage (8.s vs 9.4 h). My concern here is whether this GFP-tagging is influencing the growth dynamics as in slowing down the P. knowlesi parasites. However, if that was the case authors should have seen that for P. falciparum too. Also, for the P. falciparum parasites that episomally express cytosolic GFP and Nuclear mCherry have a higher number of merozoites compared to the H2B-GFP P. falciparum and the authors speculate this is probably because of not tagging Histone 2B. Given this, it is important to show that none of the H2B-GFP parasites show any significant fitness cost due to GFP tagging of histone. I recommend a simple experiment to compare the multiplication rate of H2B-GFP lines to the parental lines in identical growth conditions. This suggested experiment was described in PMID: 35164549 to determine fitness cost of knockout lines. This experiment is vital for validation of the H2B-GFP lines and subsequent interpretation of the data that were presented in this manuscript.

      We thank the reviewer for this excellent suggestion. To validate our lines further we now have carried out multiplication rate measurements similar to the one described in the designated publication for all the used lines alongside their parental strains (Fig. S2). We found no significant differences in between the wild type and the episomally expressing parasite lines (lines 131ff), which gives us confidence that episomal expression of tagged proteins do not significantly alter growth dynamics in these cases.

      1. The authors used the microtubule live cell dye SPY555-Tubulin in P. falciparum to validate the findings presented in 1D and 1E. They did not do that for P. knowlesi. If there is no unsurmountable technical difficulty, I suggest doing the same with P. knowlesi. This will also address the concern that I have pointed out in #1.

      Thank you for this suggestion. We have now generated the requested data with P. knowlesi, added it to what is now Supplemental Figure 3 and included it in our new analysis (Fig. 2I-J). The numerical values align well with the observations made when measuring schizont stage dynamics with the H2B-GFP expressing P. knowlesi line (line 158). A notable difference is that the Tubulin data strongly support the (refined) counter model, while the H2B data alone allow no distinction.

      1. The data in Figure 3 shows that merozoite number does not depend on host cell diameter. My question here is, were these data collected using different donor blood? Or were this measured from different biological replicate? These are not clear from the writing. I am not sure about whether blood from various donor would have on the data, however, different preparation of the cells across various biological replicate will have some effect on host cell diameter hence on data. State if these were collected from independent biological replicates and about the donor blood.

      The data results where indeed collected from three independent biological replicates using different donor blood batches. This is now stated in the figure legend. The batches displayed no difference in RBC diameter.

      1. It is interesting to see that nutrient-limited conditions increase average nuclear volume but less merozoite numbers. In this experiment, as I understand, complete media was diluted 0.5x, which basically diluted every component of the media by half. From this experiment I can see nutritional deprivation as a whole having an effect and supports the counter mechanism, it would be intriguing to see if there is any effect of a particular nutrient have any effect on progeny division. For example, parasites can be grown in amino acid deprived media (except isoleucine) which makes the parasites fully dependent on host cell amino acids. This sort of specific nutrient deprivation will probably allow the authors to probe for specific nutrients that plays role as counter mechanism factor.

      This is indeed a very exciting direction we would like to investigate in more detail in follow-up studies. Our aim for this study was to confirm that nutrient deprivation actually affects “counting” and to provide a workflow to investigate individual nutrients. In the meantime the Mota group, in a study we now cite in the discussion (lines 507ff), actually reported that isoleucine (and possibly methionine) levels are linked to progeny number. A follow-up on this topic using our strains and methodology is certainly worthwhile but requires more detailed analysis in the future.

      Minor comments:

      1. P. knowlesi is sometimes just written as knowlesi. Please, write P. Knowlesi.

      Has been corrected.

      1. Supplemental figure 1D, missing x-axis label.

      We added the x-axis label.

      1. In line 105, define N/C.

      Done.

      1. In line 205, I assume the authors mean episomally, not episomally.

      Thank you for pointing this out. We have replaced “ectopically” with “episomally” throughout the text.

      1. In line 275, Duration of Schizont stage was slightly....

      Has been corrected.

      1. All 'ml' or 'µl' should be 'mL' or 'µL'.

      Changes have been made.

      1. Define iRPMI.

      We added a definition (line 610).

      1. In line 475, replace 'as' with 'and'.

      Done.

      Reviewer #3 (Significance):

      The factors that regulate the number of progenies in malaria parasites remain unknown. While there are few previous studies attempting to answer the question, those studies were done on fixed stained cells. In this study, the authors used genetically modified fluorescent P. falciparum and P. knowlesi parasites that enable live microscopy. These parasites coupled with super-resolution time-lapse microscopy the authors attempt to investigate the mechanism(s) at play in regulating progeny division. This manuscript provides data to suggest that external resources might have some role in progeny division and supports the counter mechanism. More careful validation of the transgenic lines that were used to collect data presented needs to be more systematic and rigorous.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      Summary:<br /> In this original article from Mutscher et al., the authors developed a compartmentalized dissociated mouse myelinating SC-DRG coculture system to investigate the distinct roles of Schwann cells in axon protection and degeneration after injury. The innovation of this approach relies on (i) the use of mouse SCs and mouse DRGS neurons instead of rat cells; (ii) the use of microfluidic chambers, seeded by axons and SCs in different compartment; (iii) the possibility to perform a traumatic injury in vitro. While this novel approach offers new ways to study peripheral nerve regeneration and SC-axon interaction, and technical the study is robust, the paper is currently limited by the exploration of their model.

      Major points:<br /> Reviewer 1. It is unclear is this approach will ever lead to the identification of key mechanism or key candidates. This is a major miss in the current manuscript form. In short: the authors should demonstrate that their in vitro system can lead to significant leap in our understanding of peripheral nerve regeneration by identifying novel targets/pathways or mechanisms.

      Author response: We agree with the reviewer that cell culture approaches have limitations however we would disagree that it is not a viable approach given that a number of seminal studies in the field have already helped identified key cellular and molecular steps using rat SC-DRG cocultures or using mouse DRGs and rat SCs in combination with in vivo study. We have added the following to the introduction to highlight this point in more detail:

      Introduction.

      Dissociated myelinating SC-DRG cocultures from rats were first developed by the Bunge laboratory in the 1980’s to investigate PNS myelination in a more dynamic way (Bunge et al, 1989; Eldridge et al, 1987)__. These cultures have been used to make seminal discoveries in uncovering the cellular and molecular mechanisms of SC myelination alongside in vivo investigation. These include how the inner SC membrane (mesaxon) advances to myelinate axons, and the role of b__-neuregulin-1 (__b__NRG1) and polarity proteins in SC myelination (Bunge et al, 1989; Shen et al, 2014; Chan et al, 2006; Taveggia et al, 2005)__. Similarly, SC-DRG cocultures have been useful in demonstrating how SCs proliferate after axon injury, transfer metabolites, such as pyruvate, to delay axon degeneration, how placental growth factor (Plgf) regulates axon fragmentation by SCs and how SC JUN promotes axon outgrowth after injury (Arthur-Farraj et al, 2011; Babetto et al, 2020; Vaquié et al, 2019; Salzer & Bunge, 1980)__. The use of a coculture system to study axon-SC interactions during axon degeneration and regeneration offers some advantages over in vivo approaches as both neurons and SCs can be genetically manipulated separately and live imaged with ease.

      Discussion.

      Most importantly SCs and DRG neurons from various transgenic mice can be used to perform in vitro analysis to complement findings from in vivo transgenic mouse studies.

      Author response: Furthermore, as this is a methods paper, demonstrating novel molecular mechanisms is outside the scope of this article. However, we have already used this technique with a collaborator to study the role of cdk7 in myelination (see link to conference abstract below) and this manuscript is under preparation to be submitted soon. Additionally, we have ongoing projects within the lab using this technique to help characterise novel molecular targets in nerve injury. https://scholar.google.com/citations?view_op=view_citation&hl=en&user=uEtwAd8AAAAJ&sortby=pubdate&citation_for_view=uEtwAd8AAAAJ:_Qo2XoVZTnwC

      Author response: Despite this, we have shown that the axo-protective effect of SCs is independent of myelination status, which is an advance on what is known in the literature.

      Author response: We have realised through all of the reviewers’ comments that the title and the aims of the manuscript were confusing. We have made this clearer by removing the word novel in the title changing the title to the following:

      A method for mouse myelinating Schwann cell-DRG neuron compartmentalised cocultures: myelination status does not influence the axo-protective effect of Schwann cells after injury.

      Author response: We have also made it much clearer what the purpose of our study is and where and how it fits in with the previous literature by adding the following paragraph to the introduction.

      Introduction.

      Indeed there has only ever been one laboratory detailing convincing myelin formation in dissociated mouse myelinating SC-DRG neuron cocultures, however this was never published as a step by step detailed protocol (Stevens et al, 1998; Stevens & Fields, 2000)__. In the last twenty years, there have been no published studies demonstrating myelination in fully dissociated mouse SC-mouse DRG cocultures. This has largely prevented the use of cells, particularly SCs, from transgenic mice in cocultures and thus restricted the ability to study SC-axon interactions in a system that can be readily manipulated and live imaged and results directly applied back to in vivo findings in the same species.

      Reviewer 1. The use of embryonic DRG neurons or SC isolated from P2 animals are arguably physiologically not the same cells that are affected by traumatic nerve injury, which happen most often than not in adult. This is a problem in the long-term reliance on this approach to study axotomy peripheral nerve regeneration.

      Author response: We agree with the reviewer that one should always be cautious with the use of embryonic/neonatal cells to directly refer them to adult cellular mechanisms. We have added discussion of this point to the discussion:

      Discussion.

      One limitation of our coculture model and indeed all coculture and cell culture models that are used to investigate cellular and molecular mechanisms in nerve injury is that the cells are obtained from embryonic or neonatal animals. This is an important caveat when applying results from cell culture to adult in vivo nerve injury. However, while we would argue that cell culture approaches should always be used in combination with in vivo study it is important to remember that nerve injury is not restricted to adults and brachial plexus injury secondary to birth trauma is unfortunately a significant clinical problem (Pondaag et al, 2007)__. Furthermore, neonatal SCs replicate many of key cellular and molecular mechanisms seen in adult SCs after injury, including JUN upregulation, myelinophagy, promotion of axon growth and expression of key repair program transcripts (Arthur-Farraj et al, 2012; Gomez-Sanchez et al, 2015; Arthur-Farraj et al, 2017; Parkinson et al, 2008)__. A future development would be to try to adapt this protocol to make a coculture model with adult mouse or even human cells.

      Author response: Additionally, we already know Schwann cells in P5 neonatal mice in vivo after nerve transection demyelinate in a similar way to Schwann cells in adult mice and that neonatal cells in vivo and in vitro require the transcription factor c-Jun to do so (Parkinson et al., 2008 JCB Fig.7).

      Moderate points:<br /> Reviewer 1"there are no established protocols in the field describing the use of mouse SCs with mouse DRG neurons in dissociated myelinating cocultures". The use of mouse cells is laudable, but it is not necessarily a technical innovation, or at least the current manuscript does not explain why their approach particularly suitable to mouse Schwann cells.

      Author response: We feel that a detailed working protocol for compartmentalised dissociated mouse myelinating cocultures showing convincing and extensive myelination has been missing from our field for a long time. We agree that it is an incremental technical advance, but it is an important one. We have modified the title as we explained above. We have explained this point more clearly in the introduction, results, and discussion with the following additions:

      Introduction.

      Indeed there has only ever been one laboratory detailing convincing myelin formation in dissociated mouse myelinating SC-DRG neuron cocultures, however this was never published as a step by step detailed protocol (Stevens et al, 1998; Stevens & Fields, 2000)__. In the last twenty years, there have been no published studies demonstrating myelination in fully dissociated mouse SC-mouse DRG cocultures. This has largely prevented the use of cells, particularly SCs, from transgenic mice in cocultures and thus restricted the ability to study SC-axon interactions in a system that can be readily manipulated and live imaged and results directly applied back to in vivo findings in the same species.

      The consensus within the field is that inducing myelination in dissociated mouse SCs is challenging. Certainly, induction of myelin differentiation with cyclic adenosine monophosphate (cAMP) analogues or elevating agents, such as forskolin, is more difficult in mouse SC monocultures compared to rat SC cultures. This is because mouse SCs require additional exogenous b__-neuregulin-1 (__b__NRG1), plating on poly-L-lysine (PLL) instead of poly-D-lysine (PDL), and low concentration horse serum as opposed to foetal calf serum (Stevens et al, 1998; Arthur-Farraj et al, 2011; Päiväläinen et al, 2008)__.

      Author response: We have now explained more clearly that without plating on Matrigel and the regular addition of Matrigel to the myelination medium that mouse cocultures do not myelinate with ascorbic acid or indeed addition of NRG1 nor forskolin. Please see NEW DATA in Supplemental figure 1. We have added the following paragraph to the results section.

      Results

      Importantly, we found that L-ascorbic acid was insufficient to induce substantial myelination in our cultures, unlike in rat SC-DRG cocultures, and in the one previously published dissociated mouse SC-DRG protocol (Stevens et al, 1998)__. In fact, plating cocultures on laminin, adding ascorbic acid (50 m__g ml−1), b__NRG1 (10 ng ml−1) and forskolin (10 m__M) induced very few myelin sheaths (Supp. Fig. 1). Only when cultures were plated on Matrigel__â and further Matrigel__â was added to the myelination medium for each medium change, were we able to visualise robust reproducible myelination in our cocultures (Supp. Fig. 1).

      Author response: We have also added further discussion on how our protocol differs from the Stevens 1998 and other protocols in the discussion.

      Discussion

      Our protocol differs somewhat from the one used by Stevens et al., 1998 to induce myelination in dissociated mouse SC-DRG cocultures__, as they used ascorbic acid and 10% horse serum and presumably plated their cultures on laminin, though they do not explicitly detail this (Stevens et al, 1998)__. In our preliminary experiments we were unable to visualise much myelination with use of laminin, ascorbic acid or indeed if b__NRG1 and high concentration forskolin was added to the medium for up to four weeks. However, if we plated cocultures on Matrigel® and continuously added it to the myelination medium then we saw comparable levels of myelination in our mouse cocultures to that of rat cocultures (Eldridge et al, 1987)__. This approach of using Matrigel® to enhance myelination has previously been successfully employed in cultures of human iPSC sensory neurons with rat SCs and in non-dissociated mouse DRG explant cultures (Clark et al, 2017; Päiväläinen et al, 2008)__. Importantly, we used growth factor depleted Matrigel® as standard Matrigel® preparations contain substantial amounts of Transforming growth factor b (TGF__b__) which is a known inhibitor of myelination (Einheber et al, 1995)__. Additionally, the majority of rat and mouse coculture protocols plate cells on glass whereas we found cultures were healthier and myelinated better when cultured on plastic Alcar® coverslips.

      Discussion

      Furthermore, as this is a dissociated and compartmentalised purely mouse cell culture system, one can utilise the vast array of transgenic and knockout lines available to study neuron-SC interactions in more detail, without concern of contaminating endogenous SCs and other non-neuronal cells that remains a drawback of current mouse dissociated or non-dissociated DRG explant models.

      Reviewer 1: The figures in the paper are largely descriptive. They are very little quantitative measurement. Thus, the readers will have a hard to determine, if they replicate the proposed approach, whether their efficient is on par with the current authors.

      Author response: We have now added quantification of myelin segments per mm2, percentage of SCs that myelinate and quantification of the interperiodic distance of the myelin formed. This is all included in a new version of TABLE 1. We have discussed this data in the results section as follows.

      Results

      Quantifying the number of PRX positive myelin segments we found that there were 325.33 ± 12.3 sheaths per mm2, comparable to what has been originally described in rat SC/DRG cocultures and two fold more extensive myelination than recently described compartmentalised rat cocultures models (n=3; Table 1; Eldridge et al, 1987; Vaquié et al, 2019)__. Furthermore, 25.47 ± 1% of Schwann cells were myelinating in our cultures (n=3; Table 1). To confirm that cocultured myelinated SCs formed compact myelin we performed electron microscopy (EM), which revealed compact myelin formation with multiple myelin wraps and formation of readily visible major dense (MDL) and intraperiod lines (IPL; Fig. 2D). Additionally, we measured the periodicity, i.e., the distance between two adjacent major dense lines, to make sure myelin was compacted. Interperiodic distance was 12.16 ± 0.28 nm, in line with previous reports (n=3; Table 1; Boutary et al, 2021; Fernando et al, 2016; García-Mateo et al, 2018; Giese et al, 1992; Perrot et al, 2007)__.

      Author response: Interestingly, after we performed this analysis, we realised we have double the level of myelination in our mouse cultures (325.33 ± 12.3 per mm2) than in the compartmentalised myelinating rat cocultures in Vaquie et al., 2019 (147 ± 27 internodes per mm2 (n = 3)).

      Author response: We have also quantified the JUN upregulation after injury in both myelinating and aligned cocultures. See Fig. 3B-E).

      Results

      Additionally, we noted a strong upregulation of JUN protein in SCs 12 hours after axotomy (Fig. 3B and C). We also saw significant JUN upregulation 12 hours after axotomy in cocultures with aligned SCs (Fig. 3D and E).

      Author response: The above quantification is in addition to the quantification of the rate of axon degeneration in the presence and absence of aligned and myelinating Schwann cells in Fig. 4B. We have also quantified the % of Schwann cells that contained axonal debris after injury – this data is now quoted in the text as we removed Fig. 4E.

      We thank the reviewer for asking for additional quantification as this has improved the manuscript.

      Minor points:<br /> Fig.4B and E should show individual data points.

      Author response: We have added the individual data points to Fig.4B. We have removed Fig.4E and instead quoted the data in the results section as follows:

      Results

      When we quantified this phenomenon, we found that 97.84 ± 1.462% (n=2) of SCs in our cocultures contained mCherry-labelled axonal fragments.

      Reviewer #1 (Significance):

      In addition, to the demonstration of feasibility of this in vitro approach, the main finding by the authors is that SCs have a role in neuronal protection and support is key for peripheral nerve regeneration. Thus while in vitro approach does not add key information that do not already exists in the field, it somewhat confirms that the effect is SC autonomous. Overall the approach is interesting and has potential, but the study currently lack a demonstration of its usefulness to the community.

      It would have been interesting to have the authors discuss the advantages of their approach in comparison to other innovative approaches to study SC-axon interactions that have been developed in the last decade (i.e., 3D environment, microfluidic approach, transwell systems). There is also a lack of citations about similar studies in the field.

      Author response: We direct the reviewer to the following paragraphs in the introduction and the discussion, which we have now elaborated on further post peer review. We discuss all relevant cocultures studies in mouse as well as all the relevant microfluidic studies and 3D coculture studies as well as the one human nerve organoid study. We found two additional studies, Numata-Uematasu 2023 using DRG mouse explant cultures and Park et al., 2021 using motorneuron-SC cocultures which we have now added to the discussion. We also briefly discuss transwell studies to assess migration as the reviewer requested.

      We also discuss in detail the two microfluidic coculture injury studies Babetto et al., 2020 and Vaqiue et al., 2019 extensively throughout the manuscript. We have added further discussion of the similarities and differences between theirs and our approach in the discussion.

      Introduction.

      Protocols exist where endogenous mouse SCs are used to myelinate dissociated or non-dissociated DRG explant cultures. (Shen et al, 2014; Harty et al, 2019; Sundaram et al, 2021; Stettner et al, 2013; Numata-Uematasu et al, 2023)__. Furthermore, another protocol seeded exogenous SCs onto non-dissociated DRG explant cultures (Päiväläinen et al, 2008)__. Other laboratories seed cultured rat SCs onto dissociated mouse DRG axons (Taveggia & Bolino, 2018)__. Use of dissociated or non-dissociated DRG explants cultures preclude many experimental uses, such as using SCs from different transgenic animals and separate transfection of SCs and neurons with viruses for live imaging or genetic manipulation, and easy use of microfluidic chambers to allow injury studies and separate drug treatments to neurons or SCs. The reason for this is that antimitotics cannot be used in dissociated or non-dissociated DRG explant cultures as this depletes SCs, and the culture quickly becomes contaminated with other non-neuronal cell types, such as satellite cells and fibroblasts migrating out of the DRG. Furthermore, use of exogenous SCs in a non-dissociated DRG explant culture risks, after a period of antimitotic exposure, which was developed by Päiväläinen et al., 2008 still risks potential contamination from endogenous SCs and satellite glia migrating out of the DRG explant over time. This occurs because antimitotic treatment is unlikely to fully penetrate the whole DRG without prior dissociation. Additionally, a compartmentalised culture system cannot be readily used with non-dissociated DRG explant cultures (Päiväläinen et al, 2008)__.

      Discussion.

      Furthermore, our protocol is complementary to the recently described 3D mouse myelinating SC-motor neuron coculture system using collagen hydrogels (Hyung et al, 2021; Park et al, 2021)__. It will be interesting in the future to up titrate the concentration of Matrigel®, which is similar to collagen hydrogels, in our cultures to see whether further increasing extracellular matrix viscosity and stiffness improves our myelination efficiency even further. While it is possible to study cell migration in microfluidic cell culture devices, transwell models offer significant advantages to study this cellular phenomenon (Negro et al, 2022)__. To date, there have been no published studies of successful myelination in human SC-neuron coculture systems. Despite this rat SCs have been shown to readily myelinate human-induced pluripotent stem cell (iPSC)-derived sensory neurons and an iPSC-derived peripheral nerve organoid system which does contain myelinating SCs has recently been described (Clark et al, 2017; Van Lent et al, 2022)__.

      These findings confirm the observations of both Babetto et al., 2020 and Vaquié et al., 2019 who used rat SCs in similar microfluidic culture systems (Vaquié et al, 2019; Babetto et al, 2020)__. We have shown that the axo-protective observation seen by Babetto et al., 2020 does not rely on myelination status, which was an outstanding question from that study (Babetto et al, 2020)__. Furthermore, in an advance from previous studies, we have visualised the axo-protective and axon clearance phenomena in the same culture and shown that they are temporally separated, with axon fragmentation and debris clearance by SCs occurring at much later timepoints after axotomy. Several different culture and experimental conditions preclude direct comparison of our study with both those of Vaquié et al., 2019 and Babetto et al., 2020. These include the use of rat SCs in both prior studies and that Babetto et al., 2020 mixed rat SC with mouse DRG axons; length of time in culture, time points quantified after injury and distance from injury and site of analysis. Babetto et al., 2020 performed axotomy on relatively short term cocultures (6 days in vitro) whereas Vaiquié et al., 2019 cultured for at least 4 weeks and in our case 6 weeks prior to axotomy. Vaiquié et al., 2019 removed nerve growth factor (NGF) prior to laser axotomy and quantified proximally (though also imaged distally) whereas both our study and Babetto et al., 2020 kept NGF in the medium, performed axotomy with a scalpel and quantified more distally and in our case extremely distal, where only individual neurites and no axon bundles were visible. Vaquié et al., 2019 had SCs on both sides of the barrier in the microfluidic chambers whereas we seeded SCs only in the axonal/bottom compartment. Additionally, our cocultures had both forskolin and b__NRG1 added to help induce myelination, whereas these factors are not required in rat myelinating cocultures. Finally, it is important to permeabilise myelinated cultures with acetone after fixation__, as we did, to visualise the entire axon through heavily myelinated segments otherwise axon integrity cannot be reliably assessed in a quantitative manner (Vaquié et al, 2019; Babetto et al, 2020)__.

      Because of the lack of key novel mechanisms, and lack of discussion on what this approach is superior to others in vitro approach limits the impact of the study and the excitement of the reader, even from the SC-axon community.

      Author response: We have developed the first compartmentalised fully dissociated mouse myelinating coculture system in over twenty years. Thanks to the reviewer’s suggestions, we have now shown that myelination is comparable to the original rat cocultures from the Bunge lab, which is the gold standard in the field, and superior to recently described compartmentalised rat coculture system by Vaquie et al., 2019. We have provided a detailed step by step protocol to allow other researchers to use our technique.

      Additionally, thanks to the reviewer, we have now described in detail exactly how our protocol differs from others and why we succeeded to get mouse SCs to myelinate so robustly in a fully dissociated coculture (see previous answers). This is an incremental but important advance given that studies currently use a coculture system using entirely cells from rat, or where rat Schwann cells are seeded on mouse axons, or dissociated or non-dissociated mouse explant cultures are used which abrogates using neurons and SCs from different transgenic mice.

      As this is a methods paper, we did not intend to describe novel molecular mechanisms though our method is already being used for such purposes by ourselves and a collaborator as outlined above in prior answers. We did not make this clear and we hope the extensive revision of the manuscript now addresses this point. Despite this, we have shown that the axo-protective effect of SCs is independent of myelination status, which is an advance on what is known in the literature.

      Finally, in addition to myelination, we have demonstrated that one can study all the key components of the SC and axonal response to injury in a quantifiable way in addition to demonstrating that these cocultures can be live imaged and used for drug studies. None of the prior mouse studies looked at injury responses of axons nor SCs. We believe this will be of use to the community.

      Reviewer #2 (Evidence, reproducibility and clarity):

      In the manuscript entitled "Distinct axo-protective and axo-destructive roles for Schwann cells after injury in a novel compartmentalised mouse myelinating coculture system", Arthur-Farraj and colleagues detail a method of dissociated coculture of mouse-DRG neurons and Schwann cells in microfluidic chambers. In this system, neurons and Schwann cells harvested from the same or from different animals are grown in different compartments that are connected by microgrooves, thereby allowing for spatial and diffusive separation. Neurons are shown to extent their axons across the microgroove barrier to the glial compartment where Schwann cells align with the axons and myelination can be induced. Detailed analysis of myelination and axon injury/degradation are presented as use cases, including the capability to genetically and pharmacologically manipulate neurons and Schwann cells independently, which also enabled fluorescent life cell imaging. The authors then examine the effect of immature/premyelinating and myelinating Schwann cells on the rate of axon degeneration. Upon axotomy Schwann cells significantly delayed degeneration, with no difference between non-myelinating and myelinating Schwann cells. Finally, live imaging during axon degeneration with fluorescent proteins separately expressed in neurons and Schwann cells demonstrated that Schwann cells ingest axonal fragments.

      Major comment:<br /> In establishing a much needed in-vitro system for PNS myelination and injury research the paper represents a valuable contribution to the PNS community. However, I find the presentation of aspects concerning a protective/destructive role of Schwann cells somewhat inconclusive. That these roles exist has been known, as the authors discuss. Then what does this study contribute concerning the open question that was raised by the discrepancies between Babetto et al., 2020 and Vaquié et al., 2019, i.e. how Schwann cells contribute to axon survival/regeneration after injury? Essentially, the only significant conclusion in this regard is that myelinating and non-myelinating mouse Schwann cells do not differ in their capability to protect axons from degeneration. The manuscript, including the title, would benefit from focusing more on this aspect. In particular, the discussion of the factors that lead to the still remaining apparent discrepancies between Babetto et al., 2020 and Vaquié et al., 2019 and this study should be revised. The authors state that "The study by Vaiquié et al., 2019 quantified axon fragmentation proximally in the microgrooves at timepoints starting from 12 hours after axotomy." (Discussion). While this observation is accurate, Jacob and colleagues also show accelerated, obviously distal axon degeneration in the presence of Schwann cells (Figure 3C in Vaquié et al., 2019). It is therefore unlikely that the discrepancies stem from analysis of more proximal vs. more distal axons, or the timepoints of analyses. In my opinion, a further study (using the coculture system presented in this manuscript) that compares the role of Schwann cells from rats and mice, and that includes analysis of more proximal and distal axon degeneration as well analysis of axon regeneration is needed. In a rework of the manuscript, the authors may therefore elaborate more on the shortcomings of the present study, or alternatively soften the aims of the study in the first place.

      Author response: We thank the reviewer for their comments, and we agree that the title and the aims of the manuscript were confusing. We didn’t make it explicit that this was a methods paper, and that we didn’t intend to show entirely novel findings but instead thoroughly characterise our mouse system so that it is comparable to what has been done for rat cocultures. We have now made this clearer by removing the word novel in the title and changing the title to the following:

      A method for mouse myelinating Schwann cell-DRG neuron compartmentalised cocultures: myelination status does not influence the axo-protective effect of Schwann cells after injury.

      Author response: We have decided to focus the manuscript more on the comparison of myelinating versus non-myelinating cocultures, given that we have shown that the axo-protective effect of SCs is independent of myelination status, which is an advance on what is known in the literature. We have added further characterisation of our aligned cocultures with p75NTR immuno and EM images (Fig.1D and E). We have added the following summary of how our study relates to findings of Babetto et al., 2020 and Vaquie et al., 2019 in the discussion, in line with the reviewer’s suggestions.

      Discussion.

      These findings confirm the observations of both Babetto et al., 2020 and Vaquié et al., 2019 who used rat SCs in similar microfluidic culture systems (Vaquié et al, 2019; Babetto et al, 2020)__. We have shown that the axo-protective observation seen by Babetto et al., 2020 does not rely on myelination status, which was an outstanding question from that study (Babetto et al, 2020)__. Furthermore, in an advance from previous studies, we have visualised the axo-protective and axon clearance phenomena in the same culture and shown that they are temporally separated, with axon fragmentation and debris clearance by SCs occurring at much later timepoints after axotomy.

      Author response: We have now added more in-depth discussion of the similarities and differences between Babetto et al., 2020 and Vaquie et al., 2019 and our approach in the discussion.

      Discussion.

      Several different culture and experimental conditions preclude direct comparison of our study with both those of Vaquié et al., 2019 and Babetto et al., 2020. These include the use of rat SCs in both prior studies and that Babetto et al., 2020 mixed rat SC with mouse DRG axons; length of time in culture, time points quantified after injury and distance from injury and site of analysis. Babetto et al., 2020 performed axotomy on relatively short term cocultures (6 days in vitro) whereas Vaiquié et al., 2019 cultured for at least 4 weeks and in our case 6 weeks prior to axotomy. Vaiquié et al., 2019 removed nerve growth factor (NGF) prior to laser axotomy and quantified proximally (though also imaged distally) whereas both our study and Babetto et al., 2020 kept NGF in the medium, performed axotomy with a scalpel and quantified more distally and in our case extremely distal, where only individual neurites and no axon bundles were visible. Vaquié et al., 2019 had SCs on both sides of the barrier in the microfluidic chambers whereas we seeded SCs only in the axonal/bottom compartment. Additionally, our cocultures had both forskolin and b__NRG1 added to help induce myelination, whereas these factors are not required in rat myelinating cocultures. Finally, it is important to permeabilise myelinated cultures with acetone after fixation__, as we did, to visualise the entire axon through heavily myelinated segments otherwise axon integrity cannot be reliably assessed in a quantitative manner (Vaquié et al, 2019; Babetto et al, 2020)__.

      Author response: We would like to add that we showed Claire Jacob, senior author of the Vaquie et al., 2019 study, our manuscript prior to peer review and she offered helpful comments, which we incorporated into the manuscript, which is why she is acknowledged. We have also discussed our findings with Elisabetta Babetto as well.

      While it would be a great future study to compare both axon degeneration rates in rat and mouse cocultures this was not the original intention of our study. We believe we have included enough detail of our experimental procedures, including the distance from the barrier we imaged axon degeneration, a crucial bit of information missing from the other studies, should others want to perform a comparative study between rats and mice.

      Methods

      Five images at a distance of between 1.2-1.4mm from the microgroove barrier (the most distal part of the culture that could be imaged) were quantified per culture, taken in comparable locations in each culture.

      Author response: I would add that one of our previous studies (Arthur-Farraj et al., 2012 Neuron, Fig5I) has already looked at axon outgrowth/regeneration in dissociated non-myelinating mouse SC-DRG co-cultures. We showed the presence of Schwann cells accelerates axon regeneration/outgrowth and this relies upon Schwann cell c-JUN.

      We have now added quantification of the extent of myelination in our cocultures and it is comparable to the original Bunge lab rat cocultures and more extensive than the Vaquie et al., 2019 coculture.

      Results

      Quantifying the number of PRX positive myelin segments we found that there were 325.33 ± 12.3 sheaths per mm2, comparable to what has been originally described in rat SC/DRG cocultures and two fold more extensive myelination than recently described compartmentalised rat cocultures models (n=3; Table 1; Eldridge et al, 1987; Vaquié et al, 2019)__. Furthermore, 25.47 ± 1% of Schwann cells were myelinating in our cultures (n=3; Table 1).

      Methods

      To quantify the number of myelin segments per area, we counted the number of myelin segments for five areas per culture for three cultures and normalised this per mm2. To quantify the percentage of Schwann cells in myelinating cocultures that are actively myelinating, we quantified the number of myelin segments and the number of DAPI-positive nuclei for five areas per culture for three cultures.

      Minor comments:<br /> - Figure 2D: From the electron micrograph there is no doubt that compact myelin is formed, however to me it seems the compaction is not complete. A rough estimation with the aid of the provided scale bar resulted in an interperiodic distance of about 17 nm, which contrasts with the remarkably well reproduced values reported in multiple reports using conventional specimen preparation (like in this study), of which I am citing just a few: about 13 nm in rat ex vivo nerve (Peterson and Pease, J. Ultrastruct Res 1972; Fledrich et al., Nat Commun 2018), 12 nm (Giese et al., Cell 1992), 12.2 nm (Perot et al., J Neurosci 2007), about 12 (Fernando et al., Nat Commun 2016), about 13 nm (García-Mateo et al., Glia 2017) or about 13 nm in mouse ex vivo (Boutary et al., Commun Biol 2021), which was also reproduced with about 13 nm in rat in vitro (Taveggia and Bolino, Methods Mol Biol 2018). The authors should acknowledge this deviation and might discuss possible reasons.

      Author response: We have now provided a more representative EM image of our myelination (Fig. 2D). Additionally, thanks to the reviewer’s comments we have now quantified the interperiodic distance and find it is comparable to the studies the reviewer suggested. We have added the data to the new Table 1 and added the references the reviewer advised. Please see the additions to the methods and the results section regarding this new data below.

      Results

      To confirm that cocultured myelinated SCs formed compact myelin we performed electron microscopy (EM), which revealed compact myelin formation with multiple myelin wraps and formation of readily visible major dense (MDL) and intraperiod lines (IPL; Fig. 2D). Additionally, we measured the periodicity, i.e., the distance between two adjacent major dense lines, to make sure myelin was compacted. Interperiodic distance was 12.16 ± 0.28 nm, in line with previous reports (n=3; Table 1; Boutary et al, 2021; Fernando et al, 2016; García-Mateo et al, 2018; Giese et al, 1992; Perrot et al, 2007)__.

      Methods

      To measure interperiodic distance, we measured at least 10 periods per myelinated fibre for at least three fibres per sample for three separate samples.

      • Figure 4A,B: The result of a slowed axon degeneration in coculture relies on the accurate assessment of continuity of the NFL staining. While the authors report that acetone permeabilization was necessary to afford complete penetration of the used antibodies in myelinating cultures, I cannot see why the authors have not used the same staining protocol for all cultures, as it is detailed in the method section. While I consider it unlikely that the staining conditions have led to an apparent delay of degeneration in coculture, experiments should generally be performed under identical conditions, unless there are good reasons not to do so. If this is not the case, it will be reassuring to see the same effect when identical staining conditions are employed. On the same note, do the compared cultures have the same age, i.e. have the neuron monocultures been in vitro for the same time as the cocultures?

      Author response: We thank the reviewer for picking up this inaccuracy in the manuscript. We can confirm that for the purposes of the axon degeneration experiment all cultures were stained using exactly the same staining protocol. Additionally, we were very careful to maintain all cultures for exactly the same time in culture – 6 weeks. Additionally, axon only cultures were maintained in myelination medium to make sure medium constituents were not responsible for the observed differences in degeneration rate. We have added the following elaboration to the methods section to clarify these points.

      Methods

      Axon only cultures related to Figure 1 were permeabilised in PBS + 0.5% Triton (Merck) + 5% HS + 5% donkey serum (DS, Merck - D9663) at RT for 1 hour. For the purposes of quantifying the rate of axon degeneration (Figure 4) both axon only cultures and cocultures with SCs were permeabilised in 50% Acetone for 2 minutes, 100% Acetone for 2 minutes, 50% Acetone for 2 minutes (all at RT), and then blocked in PBS + 0.5% Triton + 5% HS + 5% DS at RT for 1 hour.

      Methods

      All cultures (axon only, aligned SCs and myelinating SCs) were cultured for 6 weeks prior to axotomy experiments. To minimise the possibility that medium constituents were responsible for differences in axon degeneration rates, axonal compartments of axon only cultures were cultured in medium containing 10 ng ml-1 b__NRG1 and 10 m__M forskolin (axon only medium, extended methods section D6) once SCs were seeded on other cultures, and then switched into myelination medium (additional Matrigel__â and 50 m__g ml−1 L-Ascorbic Acid), 24 hours before axotomy. Bottom compartments of aligned SC cultures, 24 hours before axotomy, were switched into DRG/SC medium containing 10 ng ml-1 b__NRG-1, 10 m__M forskolin and 50 m__g ml−1 L-Ascorbic Acid, which is insufficient to induce myelination in mouse cultures. Bottom compartments of myelinating cocultures were medium changed into fresh myelination medium (Extended methods section D7) 24 hours prior to axotomy.

      • In several instances of the manuscript, the term "transfection" is used to refer to lentiviral gene transfer. I advise to use the more appropriate term "transduction" instead
      • I could not seem to find a meaningful reference to the microfluidic chambers that were used in the study. The protocol should contain details on the device and source of supply in order to enable potential readers to execute the protocol

      Author response: We thank the reviewer for this comment. We have replaced transfection with transduction throughout the manuscript. Please see the track changes manuscript for all instances.

      Reviewer #2 (Significance):

      The paper presents a convincing establishment of a dissociated coculture derived exclusively from mouse that leads to robust myelination. As the manuscript correctly states, Schwann cell culture and especially coculture with neurons has been experienced difficult in the field, and by providing a detailed protocol as well as demonstrating how the coculture system can be used to address important questions of PNS myelination and repair, the paper fills an important gap. However, the experiments directed to the role of Schwann cells in axon degeneration do not clarify much, which should be better addressed in the discussion and also by modifying the title accordingly.<br /> The paper will be of high value for basic researchers that are interested in performing studies addressing cellular and molecular mechanisms of myelination and repair in the PNS. Importantly, the paper can pave the way to usage of transgenic or knockout mouse models in coculture. Thereby it might spark interest also in those researchers that use transgenic and knockout mouse models and who have so far refrained from using coculture models.

      Field of expertise of the reviewer: Cellular and molecular mechanisms of myelination and growth signaling in the PNS; in-depth experience with DRG coculture models from rats and mice

      Author response: We thank the reviewer for their kind comments. We have now modified the title, aims and discussion of the manuscript in line with the reviewer’s suggestions.

      Reviewer #3 (Evidence, reproducibility and clarity):

      SUMMARY<br /> The authors present a detailed protocol for co-cultures of mouse DRGs with mouse SCs using microfluidics. In this model, cells of interest grow in different compartments while allowing for axons to grow in between, thereby making them accessible to injury induction. Using this experimental system, the authors show that myelination occurs, myelin gets compacted and acquires nodal organization. The authors then show that such a system allows for compartment-specific lentivirus transduction and live imaging. Next, they perform physical and chemical axonal injury and show that at early time point pos- injury the presence of SCs protects from axonal degeneration regardless of the myelination status, and helps with clearing of damaged axons at later time points.<br /> Major comments:

      The novelty of the study is questionable.<br /> While the model is well described and appears to be useful for the proposed applications (live imaging, transduction, injury model), the arguments provided regarding its novelty are not fully convincing. The main argument from the authors of this paper is that there are no established protocols describing the use of mouse SCs with mouse DRG neurons in dissociated myelinating cocultures. However, this appears to be inaccurate, as the model described in Stevens et al., 1998 (cited in the paper) uses mouse DRG neurons dissected at E13.5 with mouse SCs dissected at P3 to study myelination. Also, in Päiväläinen et al., 2008, mouse DRGs and SCs are cultured from transgenic mice at different developmental ages, thereby arguing that coculture models have been previously successfully implemented. The main difference appears to be rather the compartmentalization of SCs and DRGs which appears to be a mouse adaptation of the rat model described by Vaquie et al,2019. Based on the above, it seems imperative for the authors to tone down the novelty aspect and provide a more thorough discussion on how the current novel differs from protocols in published study, highlighting advantages and caveats for each.

      Author response: We agree with the reviewer that we did not make the case clear enough for how our coculture model adds to what is currently described in the literature. We have now changed the title and removed the word novel. New title:

      A method for mouse myelinating Schwann cell-DRG neuron compartmentalised cocultures: myelination status does not influence the axo-protective effect of Schwann cells after injury.

      Author response: We have now added the following paragraph to the introduction:

      Introduction.

      Indeed there has only ever been one laboratory detailing convincing myelin formation in dissociated mouse myelinating SC-DRG neuron cocultures, however this was never published as a step by step detailed protocol (Stevens et al, 1998; Stevens & Fields, 2000)__. In the last twenty years, there have been no published studies demonstrating myelination in fully dissociated mouse SC-mouse DRG cocultures. This has largely prevented the use of cells, particularly SCs, from transgenic mice in cocultures and thus restricted the ability to study SC-axon interactions in a system that can be readily manipulated and live imaged and results directly applied back to in vivo findings in the same species.

      Author response: Regarding the study by Päiväläinen et al, 2008_,_ they did not fully dissociate their DRGs (see Fig.1 which demonstrates a DRG explant) and thus it is a non-dissociated DRG explant model. While they demonstrated convincing myelination due to the use of Matrigel which we acknowledge them for, their model is not perfectly suited for the use of neurons and SCs from different transgenic animals as the use of a DRG explant, even with temporary use of an antimitotic, risks contamination by endogenous SCs and satellite glia over time, especially as their model is not compartmentalised. We discuss the caveats of their protocol and those using dissociated mouse explant cocultures in a revised paragraph in the introduction.

      Introduction.

      Protocols exist where endogenous mouse SCs are used to myelinate dissociated or non-dissociated DRG explant cultures. (Shen et al, 2014; Harty et al, 2019; Sundaram et al, 2021; Stettner et al, 2013; Numata-Uematasu et al, 2023)__. Furthermore, another protocol seeded exogenous SCs onto non-dissociated DRG explant cultures (Päiväläinen et al, 2008)__. Other laboratories seed cultured rat SCs onto dissociated mouse DRG axons (Taveggia & Bolino, 2018)__. Use of dissociated or non-dissociated DRG explants cultures precludes many experimental uses, such as using SCs from different transgenic animals and separate transfection of SCs and neurons with viruses for live imaging or genetic manipulation, and easy use of microfluidic chambers to allow injury studies and separate drug treatments to neurons or SCs. The reason for this is that antimitotics cannot be used in dissociated or non-dissociated DRG explant cultures as this depletes SCs, and the culture quickly becomes contaminated with other non-neuronal cell types, such as satellite cells and fibroblasts migrating out of the DRG. Furthermore, use of exogenous SCs in a non-dissociated DRG explant culture risks, after a period of antimitotic exposure, which was developed by Päiväläinen et al., 2008 still risks potential contamination from endogenous SCs and satellite glia migrating out of the DRG explant over time. This occurs because antimitotic treatment is unlikely to fully penetrate the whole DRG without prior dissociation. Additionally, a compartmentalised culture system cannot be readily used with non-dissociated DRG explant cultures (Päiväläinen et al, 2008)__.

      Author response: We have also added further discussion on how our protocol differs from the Stevens 1998 and other protocols in the discussion.

      Discussion

      Our protocol differs somewhat from the one used by Stevens et al., 1998 to induce myelination in dissociated mouse SC-DRG cocultures__, as they used ascorbic acid and 10% horse serum and presumably plated their cultures on laminin, though they do not explicitly detail this (Stevens et al, 1998)__. In our preliminary experiments we were unable to visualise much myelination with use of laminin, ascorbic acid or indeed if b__NRG1 and high concentration forskolin was added to the medium for up to four weeks. However, if we plated cocultures on Matrigel® and continuously added it to the myelination medium then we saw comparable levels of myelination in our mouse cocultures to that of rat cocultures (Eldridge et al, 1987)__. This approach of using Matrigel® to enhance myelination has previously been successfully employed in cultures of human iPSC sensory neurons with rat SCs and in in non-dissociated mouse DRG explant cultures (Clark et al, 2017; Päiväläinen et al, 2008)__. Importantly, we used growth factor depleted Matrigel® as standard Matrigel® preparations contain substantial amounts of Transforming growth factor b (TGF b__) which is a known inhibitor of myelination (Einheber et al, 1995)__. Additionally, the majority of rat and mouse coculture protocols plate cells on glass whereas we found cultures were healthier and myelinated better when cultured on plastic Alcar® coverslips.

      Author response: We have added further discussion of comparable models in the literature in the discussion.

      Discussion.

      Furthermore, our protocol is complementary to the recently described 3D mouse myelinating SC-motor neuron coculture system using collagen hydrogels (Hyung et al, 2021; Park et al, 2021)__. It will be interesting in the future to up titrate the concentration of Matrigel®, which is similar to collagen hydrogels, in our cultures to see whether further increasing extracellular matrix viscosity and stiffness improves our myelination efficiency even further. While it is possible to study cell migration in microfluidic cell culture devices, transwell models offer significant advantages to study this cellular phenomenon (Negro et al, 2022)__. To date, there have been no published studies of successful myelination in human SC-neuron coculture systems. Despite this rat SCs have been shown to readily myelinate human-induced pluripotent stem cell (iPSC)-derived sensory neurons and an iPSC-derived peripheral nerve organoid system which does contain myelinating SCs has recently been described (Clark et al, 2017; Van Lent et al, 2022)__.

      Next, the authors emphasize the conflicting results of two articles, Babetto et al., 2020 and Vaquie et al., 2019, as the basis to use their newly developed model in the same species and testing two ages corresponding to distinct myelination states. However, both studies reach the same conclusion as the current study, that SCs have a protective role, although at two different developmental time points. As such, it is likely that multiple mechanisms may account for the protective effect of SC on axonal damage, and therefore the different studies do not seem conflicting but rather complementary. Yet, it is interesting that this manuscript shows that the myelination status of SCs does not impact their ability to slow down degeneration and yet it confirms that different timing after injury elicits different behaviors in SCs, as suggested by the studies of Babetto et al., 2020 and Vaquie et al., 2019. In other words, a more accurate description of the results of these two studies is needed and a better explanation of what the authors consider to be conflicting and why (there could be more differences than species and myelination, for instance, such as the method used for axotomy - laser vs cut with scalpel which tear and pull membranes).

      Author response: We would like to humbly correct the reviewer that the studies by Babetto et al., 2020 and Vaquie et al., 2019 do not reach the same conclusion that Schwann cells have a protective role. Instead, they describe axon protection (Babetto et al., 2020) and axon fragmentation (Vaquie et al., 2019). Our studies now visualise both phenomena in the same culture system. We have now made this point more explicit as well as highlighted the one conceptual advance our methods paper makes on the current literature, which is that myelination status does not influence the SC axo-protection, as the reviewer suggested.

      Discussion.

      These findings confirm the observations of both Babetto et al., 2020 and Vaquié et al., 2019 who used rat SCs in similar microfluidic culture systems (Vaquié et al, 2019; Babetto et al, 2020)__. We have shown that the axo-protective observation seen by Babetto et al., 2020 does not rely on myelination status, which was an outstanding question from that study (Babetto et al, 2020)__. Furthermore, in an advance from previous studies, we have visualised the axo-protective and axon clearance phenomena in the same culture and shown that they are temporally separated, with axon fragmentation and debris clearance by SCs occurring at much later timepoints after axotomy.

      Author response: We have now added more in-depth discussion of the similarities and differences between Babetto et al., 2020 and Vaquie et al., 2019 and our approach in the discussion.

      Discussion.

      Several different culture and experimental conditions preclude direct comparison of our study with both those of Vaquié et al., 2019 and Babetto et al., 2020. These include the use of rat SCs in both prior studies and that Babetto et al., 2020 mixed rat SC with mouse DRG axons; length of time in culture, time points quantified after injury and distance from injury and site of analysis. Babetto et al., 2020 performed axotomy on relatively short term cocultures (6 days in vitro) whereas Vaiquié et al., 2019 cultured for at least 4 weeks and in our case 6 weeks prior to axotomy. Vaiquié et al., 2019 removed nerve growth factor (NGF) prior to laser axotomy and quantified proximally (though also imaged distally) whereas both our study and Babetto et al., 2020 kept NGF in the medium, performed axotomy with a scalpel and quantified more distally and in our case extremely distal, where only individual neurites and no axon bundles were visible. Vaquié et al., 2019 had SCs on both sides of the barrier in the microfluidic chambers whereas we seeded SCs only in the axonal/bottom compartment. Additionally, our cocultures had both forskolin and b__NRG1 added to help induce myelination, whereas these factors are not required in rat myelinating cocultures. Finally, it is important to permeabilise myelinated cultures with acetone after fixation__, as we did, to visualise the entire axon through heavily myelinated segments otherwise axon integrity cannot be reliably assessed in a quantitative manner (Vaquié et al, 2019; Babetto et al, 2020)__.

      Author response: We would like to add that we showed Claire Jacob, senior author of the Vaquie et al., 2019 study, our manuscript prior to peer review and she offered helpful comments, which we incorporated into the manuscript, which is why she is acknowledged. We have also discussed our findings with Elisabetta Babetto as well.

      Overall, the title does not appear to be the most appropriate because the content rather proposes a detailed protocol and gives examples of applications, rather than focusing on the protective versus destructive role of SCs on axons. It also appears to be misleading, as "axo-destructive" appears to suggest a negative role of Schwann cells on axons, whereas SC are rather helpful in clearing degenerative axons, a step which facilitates regeneration.

      Author response: We have now changed the title and the focus of the manuscript in line with the reviewer’s comments. New title:

      A method for mouse myelinating Schwann cell-DRG neuron compartmentalised cocultures: myelination status does not influence the axo-protective effect of Schwann cells after injury.

      Author response: We have removed the phrase axo-destructive throughout the manuscript and instead referred to axon fragmentation and axon debris clearance roles of SCs in line with the reviewer’s suggestion. Please see track changes manuscript for all instances where this was modified.

      The number of biological replicates for each experiment is not always indicated, and if the "n=" represent cultures prepared independently/passaged or wells/cell. It is essential to be rigorous and clearly indicate the number of technical replicates and biological samples throughout the manuscript and provide a thorough description of them. One example is Fig. 4 E were only 10 cells from a single culture appeared to have been imaged. Is this accurate? This aspect is essential to evaluate reproducibility, especially in view of the technical and biological variability.

      Author response: We have now added quantification of myelin segments per mm2, percentage of SCs that myelinate and quantification of the interperiodic distance of the myelin formed. This is all included in a new version of TABLE 1. We have discussed this data in the results section as follows.

      Quantifying the number of PRX positive myelin segments we found that there were 325.33 ± 12.3 sheaths per mm2, comparable to what has been originally described in rat SC/DRG cocultures and two fold more extensive myelination than recently described compartmentalised rat cocultures models (n=3; Table 1; Eldridge et al, 1987; Vaquié et al, 2019)__. Furthermore, 25.47 ± 1% of Schwann cells were myelinating in our cultures (n=3; Table 1). To confirm that cocultured myelinated SCs formed compact myelin we performed electron microscopy (EM), which revealed compact myelin formation with multiple myelin wraps and formation of readily visible major dense (MDL) and intraperiod lines (IPL; Fig. 2D). Additionally, we measured the periodicity, i.e., the distance between two adjacent major dense lines, to make sure myelin was compacted. Interperiodic distance was 12.16 ± 0.28 nm, in line with previous reports (n=3; Table 1; Boutary et al, 2021; Fernando et al, 2016; García-Mateo et al, 2018; Giese et al, 1992; Perrot et al, 2007)__.

      Author response: We have discussed the number of cultures used for each quantification in the methods section. See below.

      Quantification of Myelination in cocultures

      To quantify the number of myelin segments per area, we counted the number of myelin segments for five areas per culture for three cultures and normalised this per mm2. To quantify the percentage of Schwann cells in myelinating cocultures that are actively myelinating, we quantified the number of myelin segments and the number of DAPI-positive nuclei for five areas per culture for three independently prepared cultures. To measure interperiodic distance, we measured at least 10 periods per myelinated fibre for at least three fibres per sample for three separate samples.

      Quantification of Degeneration

      Five images at a distance of between 1.2-1.4mm from the microgroove barrier (the most distal part of the culture that could be imaged) were quantified per culture, taken in comparable locations in each culture. A line was drawn across each image, and each axon crossing this line was either scored as degenerated or intact. Images were blinded prior to quantification. A minimum of three independently prepared cultures were assessed per timepoint for each condition.

      Author response: We have removed Fig.4E and instead quoted the data in the results section as follows:

      When we quantified this phenomenon, we found that 97.84 ± 1.462% (n=2) of SCs in our cocultures contained mCherry-labelled axonal fragments.

      Author response: We apologise as the n number for this experiment was 2 (not 10), with cells in 10 areas quantified throughout all imaging timepoints from each independently prepared culture. We have included the following description in the methods section:

      To quantify number of SCs with fragments, each cell was defined as a region of interest and checked for the presence of mCherry positive fragments at all timepoints. Two separate independently prepared cultures and cells in 10 areas per culture were analysed.

      Author response: Additionally for Fig. 4B we have now included individual data points from independently prepared cultures.

      N numbers are included in all figure legends and always refers to independently prepare cultures/biological replicates.

      We have added to the relevant figure legends (Fig.3 and 4 and Table 1) the phrase:

      n number refers to independently prepared cultures from separate litters of mice.

      Minor comments:

      • Does myelination reach axons in the microgrooves (it seems to from 2C, but up to where)? Where is axotomy performed and are axons myelinated where the cut was performed?

      Author response: Myelination occasionally reaches the beginning of the microgrooves. We didn’t visualise myelination in the DRG cell body compartment. We have added the following detail to the methods section:

      Traumatic axotomies were carried out by carefully removing the microfluidic chamber (SND150 and RND150, Xona Microfluidics__Ò__) from the Aclar__â coverslip using sterile forceps and severing axons with a surgical blade under a light microscope. Axotomies were carried out at the level of the microgroove barrier. To confirm all axons were severed, a second higher cut was performed and axons between the cut sites removed using the surgical blade.

      Author response: Given this, we cannot exclude that the odd proximal myelin segment is cut, but the vast majority of axons are not myelinated at the site of cut (lower cut).

      • Since the model allows for comparison of aligned vs myelinating SCs, and that both aligned and myelinating SCs seem to slow down degeneration, and that c-JUN is upregulated after in vivo injury, have the authors measured if c-JUN levels increase similarly in both myelinating vs aligned SCs?

      Author response: We thank the reviewer for this suggestion. We have now quantified the JUN upregulation after injury in both myelinating and aligned cocultures as well as adding images of JUN upregulation in aligned cocultures. See Fig. 3B-E).

      Additionally, we noted a strong upregulation of JUN protein in SCs 12 hours after axotomy (Fig. 3B and C). We also saw significant JUN upregulation 12 hours after axotomy in cocultures with aligned SCs (Fig. 3D and E).

      Author response: We have decided to focus the manuscript more on the comparison of myelinating versus non-myelinating cocultures, given that we have shown that the axo-protective effect of SCs is independent of myelination status, which is an advance on what is known in the literature. In addition to changing the title, as we have mentioned previously, we have added further characterisation of our aligned cocultures with p75NTR immuno and EM images (Fig.1D and E).

      We have

      • On clarity:<br /> - In the step-by-step protocol, wording needs to be improved.

      Author response: We have substantially edited the step-by-step protocol. Please see track changes document for all specific changes in wording.

      • Temperatures for centrifugations are missing.

      Author response: We have added temperatures for all centrifugation steps. Please see track changes document

      • The MOI described for lentivirus is 2-10 in the protocol but 200 in the legend of Figure 3F.

      Author response: The MOI for DRGs was 2-10 and SCs was 200 in Figure 3F. This is described similarly in the extended methods section. DRGs are transduced much more easily than SCs.

      We have added the following sentence to the results section to emphasise this point:

      Importantly dissociated mouse SCs required a much higher multiplicity of infection (MOI) than dissociated mouse DRGs (see extended methods section).

      • Certain citations in the references list are incomplete (i.e. Babetto et al.; Catenaccio et al.,).

      Author response: We have updated the reference list.

      Reviewer #3 (Significance):

      The advance for the field proposed by this paper is mostly technical, as it details a new model to be used by the field, of mouse SCs-mouse DRGs in dissociating myelinating cultures. The tested applications allowed the authors to also confirm a protective role for SCs on axonal damage, which was independent from myelination status.

      Being a method paper, it is essential that the authors provide clear statements on the number of biological replicates, and technical repeats, as well as a very thorough and accurate description of the methodology.

      The model described has similarities with existing models in the field such as Stevens et al., 1998 and Vaquié et al., 2019. To place it in context in a more helpful way, the authors should emphasize on the novelty brought by their protocol compared to existing models. The authors compare their findings to results from Vaquié et al., 2019 and Babetto et al., 2020 that they describe as conflicting, when it seems they rather address different mechanisms of SCs in protection and repair, occurring at different time points.

      Audience might be interested in the detailed step by step protocol to use this in vitro model for the applications described, and investigate further why SCs myelination status does not influence their ability to protect from neurodegeneration early on or how to make use of this for neuroprotection studies.

      Author response: We have now rephrased the description of Vaquié et al., 2019 and Babetto et al., 2020 studies in line with the reviewer’s suggestions. We have now added further discussion of our model in the context of all other models in the field as we have outlined in detail in above responses.

    1. Author Response

      The following is the authors’ response to the original reviews.

      Thank you for the helpful comments regarding our manuscript, "Association between APOL1 risk variants and the occurrence of sepsis among patients hospitalized with infections.” We have revised the title of the manuscript in response to reviewer comments. Additionally, we have updated the manuscript with analyses among patients with pre-existing renal disease alone as well as other items suggested by the reviewers. The Tables have been renumbered to accommodate these revisions.

      Public review:

      The study has main limitations which need to be addressed and there is lack of functional explanation of carriage. These limitations are: a) the lack of inclusion of non-Black patients; and b) the lack of appropriate explanation if results are false-positive since APOL1 provides risk for chronic renal disease (CRD) and patients with CRD are predisposed to sepsis. Sepsis occurred in 565 Black subjects, of whom 105 (29% ) had APOL1 high-risk genotype and 460 had-low risk genotype. Importantly, the risk for sepsis associated with APOL1 HR variants was no longer significant after adjusting for subjects pre-existing severe renal disease or after excluding these subjects. Thus, the susceptibility pathway seems to be: APOL1 variants > CKD > sepsis diathesis.

      Suggestions to the authors:

      • The authors need to provide analysis of patients of non-Black origin.

      We apologize for not fully clarifying that the APOL1 high-risk genotypes are virtually exclusive to populations of recent African ancestries,1–4 the majority of whom are identified as having Black race in our dataset.5 To illustrate the rarity of APOL1 high-risk genotypes in other reported races, we examined the frequency of these genotypes in White patients who had been hospitalized with infections at VUMC (comparable to the cohort of Black patients used in the study). Compared to the 361 out of 2242 (16.1%) Black patients hospitalized with infections carrying APOL1 high-risk genotypes, there were only 8 carriers of APOL1 high-risk genotypes out of 12,990 White patients (0.06%); of these 8, 2 patients developed sepsis during hospitalization. Due to a low number of carriers (n=8) and limited number of events (n=2), we could not proceed with further analysis. Patients reported as other races (e.g., Asian and American Indian) are less frequent than White or Black patients in the VUMC de-identified EHR; as such, we would anticipate similarly small, if any, numbers of high-risk genotypes among these groups, with insufficient power for meaningful analysis. Comparisons between racial groups that did not have carriage of the APOL1 high-risk genotypes would increase the possibility of confounding by factors associated with racial identity (e.g., social determinants of health), rather than genotype; as such, detected differences would likely reflect those factors, rather than the impact of APOL1.

      We have now added clarifying language in the Methods section.

      • The Table of demographics needs to include the type of infections and the underlying pathogen.

      Microbiological evidence of specific infection types is not available for the majority of records for patients hospitalized with infections (as well as sepsis); indeed, for many patients with common infections (e.g., pneumonia) the pathogen is often not identified.6 While we do not have details regarding the underlying pathogens, we were able to determine infection categories at admission. We now include details regarding the categories of infection based on ICD codes in Supplementary Table 1, and the updated Table 1 now includes that information for the APOL1 high-risk and low-risk groups. Given that individuals could have more than one type of infection, we also tested the number of types of infection and found no significant difference between the high-risk and low-risk genotypes (p=0.77).

      • The authors need to provide convincing analysis if results are false-positive since APOL1 provides risk for chronic renal disease (CRD) and patients with CRD are predisposed to sepsis. For this purpose, they have to provide evidence if the sepsis causes (both type of infection and implicated pathogens) in patients with CRD who are carriers of APOL1 variants are different than in patients with CRD who are not carriers of APOL1 variants.

      Indeed, we believe the presented findings suggest that the apparent association between APOL1 high-risk genotypes and sepsis is driven by associated pre-existing severe renal disease rather than APOL1 itself; we appreciate the suggestion to conduct additional analyses to assess whether APOL1 high-risk genotypes impact the occurrence of sepsis among those patients with pre-existing severe renal disease. We note that this analysis could also be biased towards detecting a spurious association between APOL1 high-risk genotypes and sepsis if, within the subgroup with pre-existing severe renal disease, patients with high-risk genotypes also have more severe pre-existing renal disease.

      Among the patients with pre-existing severe renal disease (n=458), 121 (26.4%) were carriers of the APOL1 high-risk genotypes. First, we assessed the severity of renal disease among these patients, detecting an association between APOL1 high-risk genotypes and greater severity (i.e., CKD stage 5/ESRD) when adjusted for age, sex, and 3 PCs: OR=2.29 (95% CI, 1.42-3.67, p=6.25x10-4). Then, we compared the primary outcome of sepsis in patients with APOL1 high-risk and low-risk genotypes for this subgroup. Despite the potential bias toward detecting an association between sepsis and the high-risk genotype based on the severity of pre-existing renal disease, there was no significant association between the high-risk genotypes and sepsis (OR=1.29, [95% CI, 0.84-1.98, p=0.25]). Finally, we assessed infection categories (as described in the above response) in this subgroup. We found no significant differences between the high-risk and low-risk genotypes in the frequency of any infection category.

      These results suggest that the APOL1 high-risk genotypes are not associated with an increased risk of sepsis among patients who have pre-existing severe renal disease. Taken with our other findings, the high-risk genotypes appear to have little or no association with sepsis beyond their association with renal disease. As such, drugs targeting those genotypes would likely have little effect in the acute setting of hospitalization with infection; rather, their primary contribution to the prevention of sepsis would need to target the prevention of underlying renal disease. We have revised our Methods, Results, and Discussion to include these findings.

      • Why concentrations of APOL1 were not measured in the plasma of patients?

      Although APOL1 high risk genetic variants have been repeatedly associated with renal-related clinical phenotypes, and many candidate mechanisms have been proposed,4 there has been contradictory evidence regarding whether the genetic variants could be linked to altered plasma APOL1 levels or whether APOL1 levels are related to elevated risk of renal disease. This is not surprising since it is the altered biological function of the APOL1 structural variant that is important, rather than the concentration of APOL1 protein. While some studies have detected an association between APOL1 high-risk genotypes and plasma levels among patients with renal disfunction and sepsis,7 other population studies have suggested no association between APOL1 plasma levels and renal function.8 Plasma APOL1 levels are seldom measured in clinical practice and thus were not available in this retrospective cohort. However, given the inconsistency of findings and the underlying biology of APOL1, we believe measurements of levels (rather than function) is unlikely to be illuminating.

      • Why analysis towards risk for death is not done?

      In the current study, we focused on the risk of in-hospital death. We did not include the risk of out-of-hospital death due to potential data fragmentation. Specifically, we only have access to the patient’s EHRs at VUMC, and death after hospital discharge is not always be included in a patient’s EHR unless relatives contact the hospital. As such, we focused on in-hospital death, which we validated previously with manual chart review.9 Paralleling the design from a previous publication assessing sepsis outcomes, we included discharge to hospice as part of our in-hospital death algorithm,10 as patients with a terminal illnesses are often discharged to hospice. However, to clarify this outcome component, we now refer to in-hospital deaths and discharge to hospice collectively as “short-term mortality.” In this study, of the 84 total patients with the “short-term mortality” outcome, 47 patients were in-hospital deaths and 37 patients were discharged to hospice. Parallel to the short-term mortality, we found no association with in-hospital death alone. Ln 190: discharge to hospice. I am not sure this can be translated in in-hospital mortality. As noted in the above response, we have rephrased this outcome component as “short-term mortality,” following the design of a previous publication assessing sepsis outcomes.10

      • The authors need to explain why functional information is not provided.

      Functional studies were not performed for several reasons. Animal models are problematic because mice do not have an ortholog to the human APOL1 gene, and the various models developed all have limitations, particularly when second and third perturbations (sepsis and renal impairment) would need to be introduced.11 Also, since we did not observe an association between the genotypes and sepsis independent of pre-existing severe renal disease, we did not pursue additional functional studies. We do describe existing functional analysis in the introduction and briefly in our discussion; we now note this limitation.

      • n 162-172: too many assumptions have been used for the trial; thus, progression to sepsis is difficult to define. According to Sepsis-3 sepsis is no more a continuum from infection to sepsis and septic shock. Some patients presented with sepsis (-1, 0, 1 days considered by the authors) and when electronic health records are used, we are not able to detect the exact timepoint of SOFA score turning to a 2-point increase. This is a major limitation of the methodology presented.

      Same applies for all comorbidities and data extracted from electronic health records.

      Thank you for highlighting this issue. We acknowledge that our choice of wording was unclear. The choice of ICD infection codes during the initial hospitalization window (i.e., -1, 0, 1 days) was aimed to generate a clean cohort of patients hospitalized with infections (i.e., not secondary infections or development of sepsis after an in-hospital procedure), rather than to establish a timeline of progression from infection to sepsis. As you correctly note, our algorithm would capture patients presenting with infection and concurrent sepsis at admission rather than progression to sepsis, and the exact timepoint of the SOFA score meeting the 2-point criterion is difficult to capture through the EHR. Accordingly, we conducted no time-dependent analysis in the current study. To more accurately convey the methodology of the current study (i.e., testing the association between APOL1 high-risk genotypes—which the patients were born with—and the risk of sepsis for patients hospitalized with infections), we revised the manuscript thoroughly, replacing “progression to sepsis” with “occurrence of sepsis” in the title, abstract as well as on pages 7, 8, and 19. We also acknowledge the limitations of using EHR in the Discussion.

      • P value significance thresholds were set at 0.05, except for the PWAS where the threshold was set at 0.05/5 (p13). It would be helpful to list at this point what the 5 outcomes were that led to this adjusted threshold.

      We have revised the manuscript accordingly.

      "Risk of sepsis was significantly increased among patients with high-risk genotypes (OR 1.29, 1.0 to 1.67, P1.29, CI 1.00-1.67, P<0.47)." Some would argue that a confidence interval that includes 1.0 indicates non-significance.

      While the lower bound of the confidence interval appears to meet the 1.0 threshold with only 2 decimal places (which would preclude significance), when taken to the 4th decimal place, the value is 1.0037, demonstrating that the 95% CI did not meet or cross under the 1.0 threshold, and thus the odds ratio should be considered significant (as evidenced by the p=0.047). This result is consistent with other studies that have detected an association between the high-risk genotypes and sepsis,7 but you correctly note that readers can discern from the confidence intervals that the finding is not strong.

      • The Discussion is too long and should be shortened.

      We have revised the Discussion. 

      References:

      1. Limou S, Nelson GW, Kopp JB, Winkler CA. APOL1 Kidney Risk Alleles: Population Genetics and Disease Associations. Adv Chronic Kidney Dis. 2014;21(5):426-433. doi:10.1053/j.ackd.2014.06.005

      2. Kopp JB, Nelson GW, Sampath K, et al. APOL1 genetic variants in focal segmental glomerulosclerosis and HIV-associated nephropathy. J Am Soc Nephrol. 2011;22(11):2129-2137. doi:10.1681/ASN.2011040388

      3. Zhang J, Fedick A, Wasserman S, et al. Analytical Validation of a Personalized Medicine APOL1 Genotyping Assay for Nondiabetic Chronic Kidney Disease Risk Assessment. The Journal of Molecular Diagnostics. 2016;18(2):260-266. doi:10.1016/j.jmoldx.2015.11.003

      4. Daneshpajouhnejad P, Kopp JB, Winkler CA, Rosenberg AZ. The evolving story of apolipoprotein L1 nephropathy: the end of the beginning. Nat Rev Nephrol. 2022;18(5):307-320. doi:10.1038/s41581-022-00538-3

      5. Dumitrescu L, Ritchie MD, Brown-Gentry K, et al. Assessing the accuracy of observer-reported ancestry in a biorepository linked to electronic medical records. Genet Med. 2010;12(10):648-650. doi:10.1097/GIM.0b013e3181efe2df

      6. Wiese AD, Griffin MR, Stein CM, et al. Validation of discharge diagnosis codes to identify serious infections among middle age and older adults. BMJ Open. 2018;8(6):e020857. doi:10.1136/bmjopen-2017-020857

      7. Wu J, Ma Z, Raman A, et al. APOL1 risk variants in individuals of African genetic ancestry drive endothelial cell defects that exacerbate sepsis. Immunity. 2021;54(11):2632-2649.e6. doi:10.1016/j.immuni.2021.10.004

      8. Kozlitina J, Zhou H, Brown PN, et al. Plasma Levels of Risk-Variant APOL1 Do Not Associate with Renal Disease in a Population-Based Cohort. J Am Soc Nephrol. 2016;27(10):3204-3219. doi:10.1681/ASN.2015101121

      9. Liu G, Jiang L, Kerchberger VE, et al. The relationship between high density lipoprotein cholesterol and sepsis: A clinical and genetic approach. Clin Transl Sci. 2023;16(3):489-501. doi:10.1111/cts.13462

      10. Alrawashdeh M, Klompas M, Simpson SQ, et al. Prevalence and Outcomes of Previously Healthy Adults Among Patients Hospitalized With Community-Onset Sepsis. Chest. 2022;162(1):101-110. doi:10.1016/j.chest.2022.01.016

      11. Yoshida T, Latt KZ, Heymann J, Kopp JB. Lessons From APOL1 Animal Models. Front Med (Lausanne). 2021;8:762901. doi:10.3389/fmed.2021.762901

    1. Reviewer #2 (Public Review):

      Pheochromocytoma (PCC), a rare neuroendocrine tumor, is currently considered malignant, but non-surgical treatment options are very limited and there is an urgent need for more basic research to support the development of new therapeutic approaches. In the present work, the authors described the intra- and inter-tumor heterogeneity by performing scRNA-seq on tumor samples from five patients with PCC, and evaluated the corresponding PASS scores.

      Strengths: The tumor microenvironment of PCC was characterized and potential molecular classification criteria based on single-cell transcriptomics were proposed, offering new theoretical possibilities for the treatment of PCC. The article is logically written and the results are clearly presented.

      Weaknesses: I still have concerns about some of the article's content. My main concerns are: In this study, the authors seem to have demonstrated the inaccuracy of a subjective score (PASS) by another objective means (scRNA-seq). In fact, the multiparametric scoring systems such as PASS are no longer endorsed in the 2022 WHO guidelines. The PASS scoring system does not have a high positive predictive value for risk stratification of PCC metastasis, but "rule-out" of metastasis risk with a PASS score of <4 seems to be fairly reliable. Could the authors please explain why the PASS scores were chosen rather than the GAPP, m-GAPP, or COPPS scoring systems? If possible, please try to emphasize the importance and necessity of using the PASS scoring system, either by replacing it with a more acceptable scoring system or by deleting the relevant part, which does not seem to be very relevant to the subject of the article.

      Moreover, I noted the following statement in the text "There are no studies reporting the composition of immune cells in PCCs. The few published studies investigating the immune microenvironment of PCCs have been limited to the expression of PDL1 at the histological level and to assessment of the tumor mutation burden (TMB) at the genomic level, and these results only seem to suggest that PCCs are immune-cold (Bratslavsky et al, 2019; Guo et al, 2019; Pinato et al, 2017)." This statement is very wrong. The reason for this error may be that the authors did not adequately search and read the relevant literature. I noticed that almost all references in this paper are dated 2021 and earlier, which is surprising. Please update the references cited in this paper in a comprehensive and detailed manner; referring to literature published too early may lead to inadequate discussion or even one-sided or incorrect conclusions and conjectures.

      For example, the text statement "Combined with previously reported negative regulatory effects of kinases (such as RET, ALK, and MEK) on HLA-I expression on tumor cells (Brea et al., 2016; Oh et al., 2019), we speculate that the possible reason for inability in recruiting CD8+ T cells of kinase-type PCCs is the downregulation of HLA-I in tumor cells regulated by RET, while the mechanism of immune escape in metabolism-type PCCs (with antigen presentation ability) needs to be further explored. Our results also indicate that the application of immunotherapy to metabolism-type PCCs is likely unsuitable, while kinase-type PCCs may have the potential of combined therapy with kinase inhibitors and immunotherapy." is rather one-sided; in fact, the presence of immune escape in PCC, as the malignancy with the lowest tumor mutation compliance, has been well characterized, and the low number of infiltrating T cells in tumor tissue may be influenced by a variety of factors, such as the release of catecholamines, the expression of inhibitory receptors on the surface of T cells, and so on, although genetic mutation still plays the most crucial role. The Discussion section also has a lot of information that needs to be updated or corrected and expanded, so please rewrite the above section with sufficiently updated references.

      Below I have listed some references for the authors to read:<br /> Tufton N, Hearnden RJ, Berney DM, et al. The immune cell infiltrate in the tumour microenvironment of phaeochromocytomas and paragangliomas. Endocr Relat Cancer. 2022;29(11):589-598. Published 2022 Sep 19. doi:10.1530/ERC-22-0020<br /> Jin B, Han W, Guo J, et al. Initial characterization of immune microenvironment in pheochromocytoma and paraganglioma. Front Genet. 2022;13:1022131. Published 2022 Dec 7. doi:10.3389/fgene.2022.1022131<br /> Celada L, Cubiella T, San-Juan-Guardado J, et al. Pseudohypoxia in paraganglioma and pheochromocytoma is associated with an immunosuppressive phenotype. J Pathol. 2023;259(1):103-114. doi:10.1002/path.6026<br /> Calsina B, Piñeiro-Yáñez E, Martínez-Montes ÁM, et al. Genomic and immune landscape Of metastatic pheochromocytoma and paraganglioma. Nat Commun. 2023;14(1):1122. Published 2023 Feb 28. doi:10.1038/s41467-023-36769-6

    1. Author Response:

      Thank you very much for selecting our paper for peer review and for the thorough evaluation of our manuscript. We appreciate your assessment and the reviewers’ comments that value our work and identify important points that will enable us to improve the paper. We are now working on key experiments to further test the hypothesis that ROCK is essential for the formation, growth, and morphology of the sea urchin larval skeleton. We will address the reviewers’ comments in detail in the revised version of the paper that we will submit after completing the experiments, but for now, there are two points we would like to clarify.

      We thank the first reviewer for the appreciation of this paper and of our previous work where we studied calcium vesicle dynamics in whole embryos (Winter et al, Plos Com Biol 2021). In Winter et al 2021, we found that the skeleton (spicules) doesn’t grow when the embryos are immobilized in either control or treated embryos. As a consequence, we cannot determine the role of ROCK in vesicle trafficking and exocytosis based on experiments conducted in whole embryos. We are developing an alternative assay for vesicle tracking using cell cultures, but that is beyond the scope of this current work.

      As for the second reviewer’s criticism of the usage of Y-27632 to block ROCK activity: The ROCK inhibitor concentrations we used (30-80µM) are similar the those commonly used in mammalian systems and in Drosophila to block ROCK activity, for example: (Becker et al., 2022; Canellas-Socias et al., 2022; Fischer et al., 2009; Kagawa et al., 2022; Segal et al., 2018; Su et al., 2022). The manufactory datasheet indicates that: “Y-27632 dihydrochloride is a selective ROCK inhibitor (Ki values are 0.14-0.22, 0.3, 25, 26 and > 250 μM for ROCK1 (p160 ROCK), ROCK2, PKA, PKC and MLCK respectively)”. That is, the affinities of Y-27632 for ROCK kinases are at least 100 times higher than those for PKC, PKA, and MLCK. Furthermore, these Ki values are based on biochemistry assays where the activity of the inhibitor is tested in-vitro with the purified protein. Therefore, these concentrations are not relevant to cell or embryo cultures where the inhibitor has to penetrate the cells and affect ROCK activity in-vivo. Y-27632 activity was studied both in-vitro and in-vivo in Narumiya, Ishizaki and Ufhata, Methods in Enzymology 2000 (Narumiya et al., 2000). This paper reports similar concentrations to the ones indicated in the manufactory data sheet for the in-vitro experiments, but shows that 10µM concentration or higher are effective in cell cultures. As stated above, we will add additional experimental verifications to the revised version, but even at this stage, the concentrations we used and the agreement between our pharmacological and genetic perturbations suggests that the affected protein is indeed ROCK.

      We share the reviewers and editors wish to identify the molecular targets of ROCK and the specific cellular processes that ROCK is involved in, and we are actively working on achieving this goal. However, we believe that this paper is an important step towards illuminating the cellular components that participate in biomineral construction and the feedback between the cellular machinery and gene expression.

      Best,

      Smadar, in the name of all co-authors.

      References:

      • Becker, K.N., Pettee, K.M., Sugrue, A., Reinard, K.A., Schroeder, J.L., Eisenmann, K.M., 2022. The Cytoskeleton Effectors Rho-Kinase (ROCK) and Mammalian Diaphanous-Related (mDia) Formin Have Dynamic Roles in Tumor Microtube Formation in Invasive Glioblastoma Cells. Cells 11.
      • Canellas-Socias, A., Cortina, C., Hernando-Momblona, X., Palomo-Ponce, S., Mulholland, E.J., Turon, G., Mateo, L., Conti, S., Roman, O., Sevillano, M., Slebe, F., Stork, D., Caballe-Mestres, A., Berenguer-Llergo, A., Alvarez-Varela, A., Fenderico, N., Novellasdemunt, L., Jimenez-Gracia, L., Sipka, T., Bardia, L., Lorden, P., Colombelli, J., Heyn, H., Trepat, X., Tejpar, S., Sancho, E., Tauriello, D.V.F., Leedham, S., Attolini, C.S., Batlle, E., 2022. Metastatic recurrence in colorectal cancer arises from residual EMP1(+) cells. Nature 611, 603-613.
      • Fischer, R.S., Gardel, M., Ma, X., Adelstein, R.S., Waterman, C.M., 2009. Local cortical tension by myosin II guides 3D endothelial cell branching. Curr Biol 19, 260-265.
      • Kagawa, H., Javali, A., Khoei, H.H., Sommer, T.M., Sestini, G., Novatchkova, M., Scholte Op Reimer, Y., Castel, G., Bruneau, A., Maenhoudt, N., Lammers, J., Loubersac, S., Freour, T., Vankelecom, H., David, L., Rivron, N., 2022. Human blastoids model blastocyst development and implantation. Nature 601, 600-605.
      • Narumiya, S., Ishizaki, T., Uehata, M., 2000. Use and properties of ROCK-specific inhibitor Y-27632. Methods Enzymol 325, 273-284.
      • Segal, D., Zaritsky, A., Schejter, E.D., Shilo, B.Z., 2018. Feedback inhibition of actin on Rho mediates content release from large secretory vesicles. J Cell Biol 217, 1815-1826.
      • Su, Y., Huang, H., Luo, T., Zheng, Y., Fan, J., Ren, H., Tang, M., Niu, Z., Wang, C., Wang, Y., Zhang, Z., Liang, J., Ruan, B., Gao, L., Chen, Z., Melino, G., Wang, X., Sun, Q., 2022. Cell-in-cell structure mediates in-cell killing suppressed by CD44. Cell Discov 8, 35.
    1. Author Response

      We thank all three reviewers for their detailed reviews, and generally agree with their feedback. To accompany the reviewed preprint of this manuscript, we wished to respond to comments from the reviewers so that they (and the public) will know what we are planning to incorporate in the revised manuscript we are currently preparing. If there are any comments on our plans in the meantime, please let us know.

      • Reviewer 1, on concerns regarding identification of ontogenetic stage and comparison of taxa from different ontogenetic stages: It is fair to say that enantiornithine ontogeny is still poorly understood, though we believe all current evidence points to each specimen used in this study to being adequately mature for comparison to the extant birds used in the study. Stages of skeletal fusion are the standard method of assessing enantiornithine ontogeny (Hu and O'Connor 2017), and our comparison of histological work (Atterholt, Poust et al. 2021) to skeletal stages in Table S4 suggests a transition from juvenile to subadult in stage 0 or 1 and from subadult to adult within stage 3. Thus, the specimens we quantitatively examine in this study, all at stages 2 or 3 (Figure S10), are advanced subadults or adults. It is well-known that many living animals considered “adults” would be considered subadults or even juveniles to a palaeontologist (Hone, Farke et al. 2016). So, even if some individuals in this study are not fully skeletally mature, they should have obtained the morphology which they would possess for most of their lives and thus the morphology which undergoes selective pressure. We will add this context to the “Bohaiornithid Ontogeny” section and thank the reviewer for seeking more detail for this point.

      • Reviewer 2, on need of a context figure: We have an artistic life reconstruction of a bohaiornithid in preparation, and can include that in the revised manuscript as a figure.

      • Reviewer 2, on raptor claw categories: We explain these categories in-depth in a previous work (Miller, Pittman et al. 2023). However, we will now add a short summary of that explanation to this work so that this manuscript will become self-contained in this regard. In short, the “large raptor” category includes extant birds with records of regularly taking prey which cannot be encircled with the pes, while birds in the “small raptor” have no such records. As Reviewer 2 points out this does often follow phylogenetic lines, but not always. E.g. most owls specialise in taking small prey, but the great horned owl Bubo virginianus regularly takes mammals and birds larger than its pes (Artuso, Houston et al. 2020); and conversely we can only find reports of the common black hawk Buteogallus anthracinus taking prey samll enough for the pes to encircle (Schnell 2020) despite other accipiters frequently taking large prey. In both cases these taxa plot in PCA nearer to other large or small raptors (respectively) than to their phylogenetic relatives.

      • Reviewer 3, on teeth vs beaks: We are not aware of any foods which are exclusive to toothed or beaked animals. There are some aspects of extant bird biology that may affect the way a certain diet may need to be adapted to which we do comment on, e.g. discussion of alternatives to the crop and ventriculus for processing plant matter in the Bohaiornithid Ecology and Evolution section. For functional studies, e.g. FEA, we have included the rhamphotheca in toothless models which serves the same role as teeth, to be a feeding surface. It should not matter, in theory, if the feeding surface is hard or soft as mechanical failure occurs in high stress/strain states regardless of the medium. If having teeth necessarily increases or decreses overall stress/strain relative to a beak (and from our work this does not appear to be the case), this would in turn necessarily limit dietary options. So, all models in our work should be directly comparable.

      As an additional note on this topic, we address tooth shape in bohaiornithids at the end of the Bohaiornithid Ecology and Evolution section. We specifically note that their tooth shape is likley controlled by phylogeny in the current version, though we will add a note in the upcoming version that the morphospace of bohaiorntihid teeth overlaps that of many other clades with purportedly diverse diets, which is consistent with a hypothesis of diverse diets within the clade.

      • Reviewer 3, on cranial kinesis: Our FE models should be unaffected by cranial kinesis, as these are two-dimensional and model the akinetic lower jaw only. Some mediolateral kinesis may be relevant in the mandible in the form of “wishboning” in different taxa, but its prevalence in extant birds is currently unknown. The preservation of enantiornithines (two-dimensionally and typically in lateral view) limits the ability to capture any mediolateral function regardless.

      Our models of mechanical advantage do not account for any cranial kinesis. This is a necessary simplifcation. The nature of cranial kinesis in extant birds, and the role that it plays in feeding, is poorly understood. Cranial kinesis will increase gape, but we don’t yet know how/if it affects jaw closing force and speed (moreover, given the variation in quadrate and hinge morphology present in extant birds, this is also something that is likely to be highly diverse). We have therefore modelled the extant birds’ jaw closing systems as having one, akinetic out lever (the jaw joint to the bite point), to match the situation in our fossil taxa. This is a common simplification that has been used previously with success (Corbin, Lowenberger et al. 2015, Olsen 2017). However, we acknowledge that this simplification may introduce some error. Unfortunately, until the mechanics of cranial kinesis – and the variation in the anatomy and performance of kinetic structures in extant birds – are better understood, we cannot determine exactly what that error looks like. We therefore have greater confidence in the inter-species comparability this conservative, akinetic approach (in other words, we may not be making assumptions that are 100% accurate, but we are at least making the same assumption across all taxa, so it should be comparable in its error). We will add a section in the Mechanical Advantage and Functional Indices discussion calling for further research into the mechanics of cranial kinesis so future mechanical advantage work in birds can take this matter into account.

      • Reviewer 3, on skull reconstruction: This issue is partly addressed in the Bohaiornithid Skull Reconstruction section, though we agree that adding more mentions of it in the MA and FEA Discussion sections and the Bohaiornithid Ecology and Evolution sections will benefit the manuscript. Most notably Shenqiornis and Sulcavis have similar ecological interpretations, but much of the Shenqiornis skull reconstruction uses Sulcavis bones. Longusunguis is the only other taxon which takes more than two bones from a different taxon, and in this case all but the quadrate are not used in any quanitative measurements. We have ensured that the skull reconstructions presented in Figure 2 show what portions of the skull come from what specimen so that as new material is discovered and phylogenetic relationships are updated it will be clear to future readers which parts of reconstructions will need to be updated.

      • Reviewer 3, on data availability: All data including FEA models and raw measurement data are included in the same repository as the scripts, which we will make clear in the manuscript. Good catch on the data link being dead, we will publish it now.

      As a final note, it was brought to our attention by another colleague that the original manuscript’s ancestral state reconstrction lacked an outgroup. An updated reconstruction using Sapeornis as an outgroup will be included in the revised manuscript. The addition of the outgroup does not change any conclusions of the manuscript.

      We once again thank our reviewers for their valuable feedback and will submit a revised version of this manuscript for publication shortly. Please let us know if you have any additional comments after reading our response that we can take onboard in our revision.

      References

      Artuso, C., C. S. Houston, D. G. Smith and C. Rohner (2020). Great Horned Owl (Bubo virginianus), version 1.0. Birds of the World. A. F. Poole. Ithaca, NY, USA, Cornell Lab of Ornithology.

      Atterholt, J., A. W. Poust, G. M. Erickson and J. K. O'Connor (2021). "Intraskeletal osteohistovariability reveals complex growth strategies in a Late Cretaceous enantiornithine." Frontiers in Earth Science 9: 640220.

      Corbin, C. E., L. K. Lowenberger and B. L. Gray (2015). "Linkage and trade‐off in trophic morphology and behavioural performance of birds." Functional ecology 29(6): 808-815.

      Hone, D. W. E., A. A. Farke and M. J. Wedel (2016). "Ontogeny and the fossil record: what, if anything, is an adult dinosaur?" Biology letters 12(2): 20150947.

      Hu, H. and J. K. O'Connor (2017). "First species of Enantiornithes from Sihedang elucidates skeletal development in Early Cretaceous enantiornithines." Journal of Systematic Palaeontology 15(11): 909-926.

      Miller, C. V., M. Pittman, X. Wang, X. Zheng and J. A. Bright (2023). "Quantitative investigation of Mesozoic toothed birds (Pengornithidae) diet reveals earliest evidence of macrocarnivory in birds." iScience 26(3): 106211.

      Olsen, A. M. (2017). "Feeding ecology is the primary driver of beak shape diversification in waterfowl." Functional Ecology 31(10): 1985-1995.

      Schnell, J. H. (2020). Common Black Hawk (Buteogallus anthracinus), version 1.0. Birds of the World. A. F. Poole and F. B. Gill. Ithaca, NY, USA, Cornell Lab of Ornithology.

    1. a I came across the various passages in Al Ghasali, Dqs Elixir der Gliickseligkeit. Aus den l?ersiscbenund arabischen Quellen Ubertragen von H. Ritter (Jena, 1923). Since the Persian text of the original couldnot be found in any of the American libraries, and since the short version of the text in Arabic published inCairo, in 1343 H. does not contain the pertinent passages, I had to rely mainly on Ritter's translation. Ialso used the English translations from the Turkish text by'H.A.Homes (The A~chemyof Happiness by Moham-med Al-Ghazzali (Albany, 1873) and from the Hindustani text by C. Field (The Alcherny of Happiness by Al-Ghazzali (London, 1910). Ritter gives a short introduction to the book in his translation. Further informa-tion about the author and bibliography are to be found in B. D. Macdonald, " al-Ghazali," Encyclopaedia ofIslam (Leyden, London, 1927) II, pp. 146-49 and in N. A. Faris, "Al-GhazzaH" in The Arab Heritage, ed.N. A. Faris (Princeton, 1944), pp. 142-58. For the approximate date of the book see C. Rieu, Catalogue of thePersian Mss. in the British Museum (London, 1879-83) II, p. 829b. ~1anuscripts of ihe" Kimiya-i Sa'ii,datare not common ; Rieu's Catalogue lists only one copy (op. cit:, I, p. 37), others are given in C. Brockelmann,.Geschichte der arabischen Litteratur

      one copy; translated from many languages- written in persian

    1. Author Response

      The following is the authors’ response to the original reviews.

      eLife assessment:

      This study presents a useful inventory of the joint effects of genetic and environmental factors on psychotic-like experiences, and identifies cognitive ability as a potential underlying mediating pathway. The data were analyzed using solid and validated methodology based on a large, multi-center dataset. However, the claim that these findings are of relevance to psychosis risk and have implications for policy changes are only partially supported by the results.

      We appreciate the feedback and insightful suggestions from the editor and reviewers, which aided us to improve the manuscript. We believe the concerns initially raised were mostly due to areas that needed further clarification, which we have now clarified in this revised version. Our primary contribution lies in our meticulous analytical approach aimed at minimizing confounding effects and providing more precise estimates of the genetic and environmental impact on children's cognition and psychology. This method differs from the widely used general linear modeling in the field, which, in our opinion, may not be the optimal strategy for large-scale data analysis. Our comprehensive, tutorial-style description of the methods might serve as a valuable resource for the community.

      Regarding the critique that our findings 'partially support the relevance to psychosis risk,' we have updated our manuscript to more accurately reflect this feedback. We have altered the narrative to indicate that psychotic-like experiences (PLE) are associated with the risk for psychosis, a connection substantiated by prior studies cited in our manuscript.

      Similarly, in response to the comment that our findings 'partially support implications for policy changes,' we have nuanced our conclusion. However, we would like to emphasize our discovery that a negative genetic predisposition impacting cognitive development (i.e., low polygenic scores for cognitive phenotypes) can be counteracted by a positive school and familial environment. We believe that this finding could have meaningful implication for policy making and is robustly supported by our analyses.

      We hope this revised manuscript more accurately reflects our research findings and its significances. Lastly, we would like to express our gratitude for your fair and detailed review process. Our experience working with eLife has been incredibly rewarding, and we commend your dedication to an encouraging and progressive publishing culture.  

      Public Reviews:

      Reviewer #1

      This study by Park et al. describes an interesting approach to disentangle gene-environment pathways to cognitive development and psychotic-like experiences in children. They have used data from the ABCD study and have included PGS of EA and cognition, environmental exposure data, cognitive performance data and self-reported PLEs. Although the study has several strengths, including its large sample size, interesting approach and comprehensive statistical model, I have several concerns:

      • The authors have included follow-up data from the ABCD Study. However, it is not very clear from the beginning that longitudinal paths are being explored. It would be very helpful if the authors would make their (analysis) approach clearer from the introduction. Now, they describe many different things, which makes the paper more difficult to read. It would be of great help to see the proposed path model in a Figure and refer to that in the Method.

      We clarified the longitudinal paths tested in this study in Intro [line 149~159]. We also added a figure of the proposed path model (Figure 1) [Methods: line 231~238].

      • There is quite a lot of causal language in the paper, particularly in the Discussion. My advice would be to tone this down.

      We adjusted and moderated the use of causal languages throughout the manuscript.

      • I feel that the limitation section is a bit brief, and can be developed further.

      We clearly specified the limitations of our study. These included concerns about the representativeness of the ABCD samples, of the limited scope of longitudinal data, and the use of non-randomized, observational data [line 524~544].

      • I like that the assessment of CP and self-reports PEs is of good quality. However, I was wondering which 4 items from the parent-reported CBCL were used and how did they correlate with the child-reported PEs? And how was distress taken into account in the child self-reported PEs measurement? Which PEs measures were used?

      Thanks for the clarification question. We report the Pearson’s correlation coefficients between the PLEs [line 198~200]. (The Reviewer #1 may have referred to the prior version of our manuscript submitted elsewhere, for this point has been already addressed in our initial submission to eLife).

      • What was the correlation between CP and EA PGSs?

      The Pearson’s correlation between CP and EA PGS was 0.4331 (p<0.0001). We added the statistics to the manuscript. [line 214]

      • Regarding the PGS: why focus on cognitive performance and EA? It should be made clearer from the introduction that EA is not only measuring cognitive ability, but is also a (genetic) marker of social factors/inequalities. I'm guessing this is one of the reasons why the EA PGS was so much more strongly correlated with PEs than the CP PGS. See the work bij Abdellaoui and the work by Nivard.

      We appreciate the reviewer’s insightful feedback. Acknowledging the role of both CP and EA PGSs in our study, we agree with the observation that EA PGS goes beyond gauging cognitive aptitude—it also serves as an indicator of societal influences and inequalities. The multifaceted nature of EA PGS could be the reason underlying the stronger correlation with PLEs compared to CP PGS. In response to this feedback, we revised our introduction to articulate the multifaceted role of EA PGS in more precise terms. For supporting our assertions, we have included references to prior studies (Abdellaoui et al., 2022) [line 131~142].

      Abdellaoui, A., Dolan, C. V., Verweij, K. J. H., & Nivard, M. G. (2022). Gene–environment correlations across geographic regions affect genome-wide association studies. Nature Genetics. doi:10.1038/s41588-022-01158-0

      • Considering previous work on this topic, including analyses in the ABCD Study, I'm not surprised that the correlation was not very high. Therefore, I don't think it makes a whole of sense to adjust for the schizophrenia PGS in the sensitivity analyses, in other words, it's not really 'a more direct genetic predictor of PLEs'.

      We thank the reviewer for the thoughtful comments. We acknowledge that the correlation between schizophrenia PGS and PLE may not be exceedingly high, as evidenced by previous work, including analyses from the ABCD study. However, we would like to emphasize our rationale for adjusting schizophrenia PGS in the sensitivity analyses. Our study design stemmed from the established associations between PLEs and increased risk for schizophrenia. Existing studies have reported significant associations between schizophrenia PGS and cognitive deficits in both psychosis patients (Shafee et al., 2018) and people at risk for psychosis (He et al., 2021). Notable, the PGS for schizophrenia has shown significant associations with PLEs, arguably more so than PGS for PLEs itself (Karcher et al., 2018). Our updated manuscript has incorporated these references to improve clarity. [line 307~309]. By adding this layer of adjustment, we believe that our mixed linear model more precisely examines the relationship between the cognitive phenotype PGS and PLEs, in terms of both sensitivity and specificity.

      He, Q., Jantac Mam-Lam-Fook, C., Chaignaud, J., Danset-Alexandre, C., Iftimovici, A., Gradels Hauguel, J., . . . Chaumette, B. (2021). Influence of polygenic risk scores for schizophrenia and resilience on the cognition of individuals at-risk for psychosis. Translational Psychiatry, 11(1). doi:10.1038/s41398-021-01624-z

      Karcher, N. R., Paul, S. E., Johnson, E. C., Hatoum, A. S., Baranger, D. A. A., Agrawal, A., . . . Bogdan, R. (2021). Psychotic-like Experiences and Polygenic Liability in the Adolescent Brain Cognitive Development Study. Biological Psychiatry: Cognitive Neuroscience and Neuroimaging. doi:https://doi.org/10.1016/j.bpsc.2021.06.012

      Shafee, R., Nanda, P., Padmanabhan, J. L., Tandon, N., Alliey-Rodriguez, N., Kalapurakkel, S., . . . Robinson, E. B. (2018). Polygenic risk for schizophrenia and measured domains of cognition in individuals with psychosis and controls. Translational Psychiatry, 8(1). doi:10.1038/s41398-018-0124-8

      • How did the FDR correction for multiple testing affect the results?

      Please note that we have clarified our FDR correction in the methods

      As detailed in the method section [line 254~255], we applied False Discovery Rate (FDR) correction for multiple testing across nine key variables in the study: PGS (CP or EA), family income, parental education, family’s financial adversity, Area Deprivation Index, years of residence, proportion of population below -125% of the poverty line, positive parenting behavior, and positive school environment. An exception was made in our additional sensitivity analysis, where we included schizophrenia PGS in the linear mixed model for adjustment, thus the FDR correction was applied across ten key variables instead. Overall, the application of FDR correction had minimal impact on our findings. Most associations between the key variables and the outcomes that were originally marked as highly significant sustained their significance after the FDR correction.

      Overall, I feel that this paper has the potential to present some very interesting findings. However, at the moment the paper misses direction and a clear focus. It would be a great improvement if the readers would be guided through the steps and approach, as I think the authors have undertaken important work and conducted relevant analyses.

      We express our appreciation to the reviewer for the positive feedback and constructive suggestions, which only serve to improve and strengthen our manuscript. We have incorporated the suggested corrections and clarifications in response to the reviewer's suggestions. We believe that these changes will not only enhance the overall readability but also more effectively emphasize the significance and implication of our work.

      Reviewer #2 (Public Review):

      This paper tried to assess the link between genetic and environmental factors on psychotic-like experiences, and the potential mediation through cognitive ability. This study was based on data from the ABCD cohort, including 6,602 children aged 9-10y. The authors report a mediating effect, suggesting that cognitive ability is a key mediating pathway in the link between several genetic and environmental (risk and protective) factors on psychotic-like experiences.

      While these findings could be potentially significant, a range of methodological unclarities and ambiguities make it difficult to assess the strength of evidence provided.

      Strengths of the methods:

      The authors use a wide range of validated (genetic, self- and parent-reported, as well as cognitive) measures in a large dataset with a 2-year follow-up period. The statistical methods have the potential to address key limitations of previous research.

      Weaknesses of the methods:

      The rationale for the study is not completely clear. Cognitive ability is probably a more likely mediator of traits related to negative symptoms in schizophrenia, rather than positive symptoms (e.g., psychosis, psychotic-like symptom). The suggestion that cognitive ability might lead to psychotic-like symptoms in the general population needs further justification.

      We appreciate the reviewer’s concern regarding the role of cognitive ability in relation to schizophrenia symptoms. We are aware that cognitive ability often serves as a mediator of psychotic-like experiences. However, to our best knowledge, a growing body of research has proposed that cognitive ability can mediate positive symptoms in schizophrenia including psychotic-like experiences. The studies by Howes & Murray (2014) and Garety et al. (2001) suggested that deficits in cognitive ability can potentially contribute to the manifestation of positive symptoms such as psychotic-like experiences. We have elaborated on this aspect in the Introduction section [line 104-115].

      Howes, O. D., & Murray, R. M. (2014). Schizophrenia: an integrated sociodevelopmental-cognitive model. The Lancet, 383(9929), 1677-1687. doi:https://doi.org/10.1016/S0140-6736(13)62036-X

      Garety, P. A., Kuipers, E., Fowler, D., Freeman, D., & Bebbington, P. E. (2001). A cognitive model of the positive symptoms of psychosis. Psychological Medicine, 31(2), 189-195. doi:10.1017/S0033291701003312

      Terms are used inconsistently throughout (e.g., cognitive development, cognitive capacity, cognitive intelligence, intelligence, educational attainment...). It is overall not clear what construct exactly the authors investigated.

      We thank the reviewer’s feedback regarding the consistency of terminology in our manuscript. Per the suggestion, we standardized the use of ‘cognitive capacity’ and now consistently refer to it as ‘cognitive phenotypes’ throughout our manuscript. Furthermore, we explicitly stated in the Introduction section that our two PGSs of focus will be termed ‘cognitive phenotypes PGSs’, aligning with terminology used in prior studies (Joo et al., 2022; Okbay et al., 2022; Selzam et al., 2019) [line 140~142].

      Joo, Y. Y., Cha, J., Freese, J., & Hayes, M. G. (2022). Cognitive Capacity Genome-Wide Polygenic Scores Identify Individuals with Slower Cognitive Decline in Aging. Genes, 13(8), 1320. doi:10.3390/genes13081320

      Okbay, A., Wu, Y., Wang, N., Jayashankar, H., Bennett, M., Nehzati, S. M., . . . Young, A. I. (2022). Polygenic prediction of educational attainment within and between families from genome-wide association analyses in 3 million individuals. Nature Genetics, 54(4), 437-449. doi:10.1038/s41588-022-01016-z

      Selzam, S., Ritchie, S. J., Pingault, J.-B., Reynolds, C. A., O’Reilly, P. F., & Plomin, R. (2019). Comparing Within- and Between-Family Polygenic Score Prediction. The American Journal of Human Genetics, 105(2), 351-363. doi:https://doi.org/10.1016/j.ajhg.2019.06.006

      Not the largest or most recent GWASes were used to generate PGSes.

      We appreciate the reviewer’s observation. Indeed, we were unable to utilize the most recent or the largest GWAS for cognitive performance, educational attainment, and schizophrenia due to the timeline of our study. Regrettably, the commencement of our study preceded the publication of the ‘currently’ the largest or most recent GWAS studies by Okbay et al. (2022) and Trubetskoy et al. (2022). Our research was conducted with the best available data at that time, which was the GWAS of European-descent individuals for educational attainment and cognitive performance (Lee et al, 2018). To eliminate any potential confusion, we adjusted the text to specify that our study used 'a GWAS of European-descent individuals for educational attainment and cognitive performance' rather than the largest GWAS [line 206~208].

      It is not fully clear how neighbourhood SES was coded (higher or lower values = risk?). The rationale, strengths, and assumptions of the applied methods are not fully clear. It is also not clear how/if variables were combined into latent factors or summed (weighted by what). It is not always clear when genetic and when self-reported ethnicity was used. Some statements might be overly optimistic (e.g., providing unbiased estimates, free even of unmeasured confounding; use of representative data).

      Thank you for pointing this out. Consistent with the illustration of neighborhood SES in the Methods, higher values of neighborhood SES indicate risk [line 217~228]. In the original Figure 2, higher value of neighborhood SES links to lower intelligence (direct effects: β=-0.1121) and higher PLEs (indirect effects: β=-0.0126~ -0.0162). We think such confusion might have been caused by the difference between family SES (higher values = lower risk) neighborhood SES (higher values = higher risk). Thus, we changed the terms to ‘High Family SES’ and ‘Low Neighborhood SES’ in the corrected figure (Figure 3) for clarification.

      Considering that shorter duration of residence may be associated with instability of residency, it may indicate neighborhood adversity (i.e., higher risk). This definition of the ‘years of residence’ variable is in line with the previous study by Karcher et al. (2021).

      During estimation, the IGSCA determines weights of each observed variable in such a way as to maximize the variances of all endogenous indicators and components. We added this explanation in the description about the IGSCA method [line 266~268].

      We deleted overly optimistic statements like ‘unbiased estimates’ and used expressions such as ‘adjustment for observed/unobserved confounding’ instead, throughout our manuscript.

      Karcher, N. R., Schiffman, J., & Barch, D. M. (2021). Environmental Risk Factors and Psychotic-like Experiences in Children Aged 9–10. Journal of the American Academy of Child & Adolescent Psychiatry, 60(4), 490-500. doi:10.1016/j.jaac.2020.07.003

      It appears that citations and references are not always used correctly.

      We thoroughly checked all citations and specified the references for each statement: We deleted Plomin & von Stumm (2018) and Harden & Koellinger (2020) and cited relevant primary studies (e.g., Lee et al., 2018; Okbay et al., 2022; Abdellaoui et al., 2022) instead. We also specified the references supporting the statement that educational attainment PGS links to brain morphometry (Judd et al., 2020; Karcher et al., 2021). As Okbay et al. (2022) use PGS of cognitive intelligence (which mentions the analyses results in their supplementary materials) as well as educational attainment, we decided to continue citing this reference [line 131~141].

      Strengths of the results:

      The authors included a comprehensive array of analyses.

      We thank the reviewer for the positive comment.

      Weaknesses of the results:

      Many results, which are presented in the supplemental materials, are not referenced in the main text and are so comprehensive that it can be difficult to match tables to results. Some of the methodological questions make it challenging to assess the strength of the evidence provided in the results.

      As you rightly identified, we inadvertently failed to reference Table S2 in the main text. We have since corrected this omission in the Results section for the IGSCA (SEM) analysis [line 376]. The remainder of the supplementary tables (Table S1, S3~S7) have been appropriately cited in the main manuscript. We recognize that the quantity of tables provided in the supplementary materials is substantial. However, given the comprehensiveness and complexity of our analyses, which encompass a wide array of study variables, these tables offer intricate results from each analysis. We deem these results, which include valuable findings from sensitivity analyses and confound testing, too significant to exclude from the supplementary materials. That said, we are open to, and would greatly welcome, any further suggestions on how to present our supplementary results in a more clear and digestible format. Your guidance in this matter is highly valued.

      Appraisal:

      The authors suggest that their findings provide evidence for policy reforms (e.g., targeting residential environment, family SES, parenting, and schooling). While this is probably correct, a range of methodological unclarities and ambiguities make it difficult to assess whether the current study provides evidence for that claim.

      We believe that with the improvement we made in this revised manuscript, this concern may have been successfully mitigated.

      Impact:

      The immediate impact is limited given the short follow-up period (2y), possibly concerns for selection bias and attrition in the data, and some methodological concerns.

      We appreciate the feedback provided in the reviewer's impact statement. We added as study limitations [line 524~544] that the impact of our findings may be limited due to the relatively short follow-up period, the possibility of sample selection bias, and the problems of interpreting results from an observational study as causality (despite the novel causal inference methods, designed for non-randomized, observational data, that we used).

      As responded above (and also in more detail in the Reviewer #2’s Recommendations For The Authors section below), we made necessary corrections and clarifications for the points suggested by the reviewer. As we are willing to make additional revisions, please feel free to give comments if you feel that our corrections are insufficient or inappropriate.

      Nevertheless, we would like to discuss some points. We sincerely hope this following response does not come across as argumentative to the reviewer and the editor. We fully understand the reviewer's perspective on this matter, and we agree that the issues raised about the ABCD study are absolutely valid. However, when evaluating the overall impact of a study, other factors, such as how the field has been assessing the impact of similar studies, should also be considered.

      Firstly, the potential selection bias and attrition in the ABCD data may not necessarily limit the conclusions of this study. While recognizing the potential issues with the ABCD data is important, we feel that judging the impact of our findings as "limited" based on these issues may not be entirely fair. This is because no study, particularly those of a nationwide scale such as the UK Biobank, IMAGEN, HEAL, HBCD, etc., is completely free of limitations. Typically, the potential limitations of the data don't undermine the impact of individual studies' findings. Numerous studies using ABCD data have been published in top-tier journals—despite the limitations of the ABCD study—underscoring the scientific merit of the findings. For example, the study by Tomasi, D., & Volkow, N. D. (2021), entitled "Associations of family income with cognition and brain structure in USA children: prevention implications," published in Molecular Psychiatry, might be highly relevant to the limitations of the ABCD study raised by the reviewer. The scientific community, including editors, reviewers, and readers, may have appreciated the impact of this study despite the acknowledged limitations of the ABCD data.

      Secondly, the two-year time window of our longitudinal analysis might not impact the aim of this study—an iterative assessment of the associations between genetic and environmental variables with cognitive intelligence and mental health, with a focus on PLE, in preadolescents. Had we aimed to test the developmental trajectory from childhood to adolescence, perhaps a longer timeframe would have made more sense. So, we do not agree with the reviewer’s assessment that the short time window limits the impact of our study.

      Suggested revisions based on the combined reviewer feedback:

      1) The terminology used should be carefully reviewed and revised

      • Please use the correct terminology for the key concepts assessed in this study. For example, authors sometimes conflate PLEs and psychosis, two related but separate constructs. Furthermore, the terms 'good parenting' and 'good schooling' are vague and subjective.

      • The authors use multiple terms to refer to cognitive ability (cognitive capacity, intelligence, cognitive intelligence, etc). The term 'cognitive development' in the title and manuscript does not seem to be justified given the focus on different measures of cognitive ability at a single time point (i.e. baseline).

      • Please avoid causal language and using statements that cannot be entirely substantiated (e.g. unbiased estimates, free from unmeasured confounding)

      Thank you for suggesting this point. We revised all key terminologies used throughout our manuscript.

      Per your suggestion, we specified that PLEs indicate the risk of psychosis and often precede schizophrenia. We checked all misused cases of the term ‘psychosis’ and corrected them as ‘PLEs’. We also changed the terms 'good parenting' and 'good schooling' to ‘positive parenting behavior’ and ‘positive school environment’.

      We changed the term ‘cognitive development’ to ‘cognitive ability’ throughout our manuscript. We also changed the title to ‘Gene-Environment Pathways to Cognitive Intelligence and Psychotic-Like Experiences in Children’ because we used ‘cognitive intelligence’ for NIH toolbox variable in the text.

      We corrected and tone-downed all causal languages used in our manuscript. As mentioned by the reviewers, we deleted statements like ‘unbiased estimates’ and used expressions such as ‘adjustment for observed/unobserved confounding’ instead.

      2) A stronger rationale for the focus on PLEs, and the potential mediating role of cognitive ability in genetic and environmental effects on PLES, should be provided

      We appreciate the raised concerns that cognitive ability may serve as a mediator of psychotic-like experiences. To our best knowledge, it has been proposed that cognitive ability can be a mediator of positive symptoms in schizophrenia (including psychotic-like experiences), as well as negative symptoms. This mediating role of cognitive ability was proposed in several prior studies on cognitive model of schizophrenia/psychosis. Per your suggestion, we included an additional justification in Intro [line 104~115] where we highlighted that cognitive ability has been proposed as a potential mediator of genetic and environmental influence on positive symptoms of schizophrenia such as psychotic-like experiences. We refer to studies conducted by Howes & Murray (2014) and Garety et al. (2001).

      Howes, O. D., & Murray, R. M. (2014). Schizophrenia: an integrated sociodevelopmental-cognitive model. The Lancet, 383(9929), 1677-1687. doi:https://doi.org/10.1016/S0140-6736(13)62036-X

      Garety, P. A., Kuipers, E., Fowler, D., Freeman, D., & Bebbington, P. E. (2001). A cognitive model of the positive symptoms of psychosis. Psychological Medicine, 31(2), 189-195. doi:10.1017/S0033291701003312

      3) As described in more detail by the reviewers, more information should be provided about the measures used in the study and how they relate to one another (e.g. correlations between PQ-BC and CBCL; PGS-CA and PGS-EA).

      Thank you for your suggestion. Although this information was already provided in our initial submission, it appears that the Reviewer #1’s might have referred to the prior version of our manuscript submitted elsewhere before eLife.

      To clarify, our findings reveal significant Pearson’s correlation coefficients between PLEs across all time-points (baseline year: r=0.095~0.0989, p<0.0001; 1-year follow-up: r=0.1322~0.1327, p<0.0001; 2-year follow-up: r= 0.1569~0.1632, p<0.0001) and we added this information in the Method section [line 198~200]. We also added the Pearson’s correlation between the two PGSs (r=0.4331, p<0.0001) in the Methods for PGS [line 214].

      4) More details are needed regarding the analytical strategies used (e.g. how imputation was performed, why PGS were not based on the largest and most recent GWASes, whether latent or observed variables were examined, what exactly the supplementary materials show and how they relate to information provided in the main text).

      We appreciate your feedback. We acknowledge the concerns about the GWAS sources utilized for the study. Unfortunately, our study commenced prior to the publication of the ‘currently’ most recent or largest GWAS by Okbay et al. (2022) and Trubetskoy et al. (2022). Our research was conducted with the best available data at that time, which was the largest GWAS of European-descent individuals for educational attainment and cognitive performance (Lee et al, 2018). We have now clarified this point in the manuscript. [line 206~208]

      Also, we specified the use of composite indicators for the PGS, family SES, neighborhood SES, positive family and school environment, and PLEs, while latent factors were used for cognitive intelligence [line 269~285].

      We highly appreciate the reviewer’s comments regarding the supplementary materials. We regret overlooking the citation of Table S2 in the main manuscript, and this has now been rectified in the Results section for the IGSCA (SEM) analysis [line 376]. The remaining supplementary tables (Table S1, S3~S7) have been correctly referenced within the manuscript. We acknowledge that the supplementary materials are extensive due to the comprehensive array of study variables and intricate results from each analysis. However, given that our analyses encompass a wide array of study variables, these tables offer intricate results from each analysis. We deem these results, which include valuable findings from sensitivity analyses and confound testing, too crucial to exclude from the supplementary materials. That said, we are open to any further suggestions to make our supplementary results more accessible and digestible. In order to improve the accessibility and clarity of our presentation, we are fully committed to making any necessary changes and look forward to any further recommendations.

      5) The limitation section should be expanded and statements regarding the implications of the study findings should be qualified accordingly (e.g. short follow-up period, potential for attrition and selection bias, reverse causation, etc)

      We specified additional potential constraints of our study, including limited representativeness, limited periods of follow-up data (baseline year, 1-year, and 2-year follow-up), possible sample selection bias, and the use of non-randomized, observational data [line 524~544].

      6) Please ensure that the references provided support the statements in the text to which they are linked to.

      Thank you for pointing this out. We thoroughly went over all citations and corrected the inaccurately or vaguely cited references for each statement.

      Reviewer #2 (Recommendations For The Authors):

      1) Please use terms consistently and correctly. E.g., 'cognitive capacity' is not the same as 'educational attainment'.

      We thank the reviewer’s feedback regarding the consistency of terminology in our manuscript. Per the suggestion, we standardized the use of ‘cognitive capacity’ and now consistently refer to it as ‘cognitive phenotypes’ throughout our manuscript. Furthermore, we explicitly stated in the Introduction section that our two PGSs of focus will be termed ‘cognitive phenotypes PGSs’, aligning with terminology used in prior studies (Joo et al., 2022; Okbay et al., 2022; Selzam et al., 2019) [line 140~142].

      Joo, Y. Y., Cha, J., Freese, J., & Hayes, M. G. (2022). Cognitive Capacity Genome-Wide Polygenic Scores Identify Individuals with Slower Cognitive Decline in Aging. Genes, 13(8), 1320. doi:10.3390/genes13081320

      Okbay, A., Wu, Y., Wang, N., Jayashankar, H., Bennett, M., Nehzati, S. M., . . . Young, A. I. (2022). Polygenic prediction of educational attainment within and between families from genome-wide association analyses in 3 million individuals. Nature Genetics, 54(4), 437-449. doi:10.1038/s41588-022-01016-z

      Selzam, S., Ritchie, S. J., Pingault, J.-B., Reynolds, C. A., O’Reilly, P. F., & Plomin, R. (2019). Comparing Within- and Between-Family Polygenic Score Prediction. The American Journal of Human Genetics, 105(2), 351-363. doi:https://doi.org/10.1016/j.ajhg.2019.06.006

      2) The authors study 'cognitive performance using seven instruments', but it is not clear how fluid and crystalline intelligence was defined/operationalized.

      Thank you for pointing this out. We specified the NIH Toolbox tests used for composite scores of fluid and crystallized intelligence, respectively. “We utilized baseline observations of uncorrected composite scores of fluid intelligence (Dimensional Change Card Sort Task, Flanker Test, Picture Sequence Memory Test, List Sorting Working Memory Test), crystallized intelligence (Picture Vocabulary Task and Oral Reading Recognition Test), and total intelligence (all seven instruments) provided in the ABCD Study dataset” [line 180~187].

      3) I don't think Lee 2018 is the largest GWAS for educational attainment. That would be Okbay 2022. It needs to be described how cognitive performance was defined in Lee 2018. Why did the authors not use the Trubetskoy 2022 schizophrenia GWAS?

      Thank you for mentioning this point. The reason why we were not able to use the largest GWAS for CP, EA and schizophrenia is because (unfortunately) our study started earlier than the point when the GWAS studies by Okbay et al. (2022) and Trubetskoy et al. (2022) were published. We corrected that our study used ‘a GWAS of European-descent individuals for educational attainment and cognitive performance’ instead of the largest GWAS [line 206~208].

      4) It is unclear how neighbourhood SES was coded. The authors seem to suggest that higher values indicate risk, but Figure 2 suggests that higher values links to higher intelligence and lower PLE.

      Thank you very much for pointing this out. Consistent with the illustration of neighborhood SES in the Methods section, higher values of neighborhood SES indicate risk. In the original Figure 2, higher values of neighborhood SES links to lower intelligence (direct effects: β=-0.1121) and higher PLEs (indirect effects: β=-0.0126~-0.0162). We think such confusion might have been caused by the difference between family SES (higher values = lower risk) neighborhood SES (higher values = higher risk). Thus, we changed the terms to ‘High Family SES’ and ‘Low Neighborhood SES’ in the corrected figure (Figure 3) for clarification.

      5) Also, the 'year of residence' variable is unclearly defined. Does this mean that a shorter duration of residency (even in a good neighbourhood) indicate risk?

      Thank you for mentioning this point. Considering that shorter duration of residence may be associated with instability of residency, it may indicate neighborhood adversity (i.e., higher risk). This definition of the ‘years of residence’ variable is in line with the previous study by Karcher et al. (2021).

      Karcher, N. R., Schiffman, J., & Barch, D. M. (2021). Environmental Risk Factors and Psychotic-like Experiences in Children Aged 9–10. Journal of the American Academy of Child & Adolescent Psychiatry, 60(4), 490-500. doi:10.1016/j.jaac.2020.07.003

      6) Please provide information on how correlated the two PGSes were.

      Thank you for your suggestion. We added the Pearson’s correlation between the two PGSs (r=0.4331, p<0.0001) in the Methods section for PGS [line 214].

      7) Information on the outcome variable in the 'linear mixed models' section is missing. I assumed it was PLE.

      Thank you for notifying us of this point. We added the information on the outcome variables in the section for linear mixed models [line 242~244].

      8) In the 'Path Modeling' section, please explain what 'factors and components' concretely refer to. How is this different from a standard SEM with latent factors?

      Thank you for your comment on the need to elaborate the IGSCA method. We added that different from standard SEM methods which only uses latent factors, the IGSCA method can use components as well as latent factors as constructs in model estimation. This allows the IGSCA method to control bias more effectively in estimation compared to the standard SEM [line 261~268].

      9) The sentence starting line 229 is unclear. Does this mean variables were not used to generate latent factors. And if not, what weights were used to create a 'weighted sum'?

      Thank you for mentioning this point. The sentence means that we treated PGSs, family SES, neighborhood SES, positive family and school environment, and PLEs as composite indicators (derived from a weighted sum of relevant observed variables), while general intelligence was represented as a latent factor.

      It has been suggested from prior studies that these variables (PGSs, family SES, neighborhood SES, positive family and school environment, and PLEs) are less likely to share a common factor and were assessed as a composite index during analyses. For instance, Judd et al. (2020) and Martin et al. (2015) analyze genetic influence of educational attainment and ADHD as composite indicators. Also, as mentioned in Judd et al. (2020), socioenvironmental influences are often analyzed as composite indicators. Studies on psychosis continuum (e.g., van Os et al., 2009) suggest that psychotic disorders are likely to have multiple background factors instead of having a common factor, and notes that numerous prior research uses composite indices to measure psychotic symptoms. Based on this literature, we used components for these variables.

      The IGSCA determines weights of each observed variable to maximize the variances of the endogenous indicators and components [added in line 265~268].

      On the other hand, we treated general intelligence as a latent factor/variable underlying fluid and crystallized intelligence. This is based on the extensive literature of classical g theory of intelligence [added in line 269~284].

      Judd, N., Sauce, B., Wiedenhoeft, J., Tromp, J., Chaarani, B., Schliep, A., ... & Klingberg, T. (2020). Cognitive and brain development is independently influenced by socioeconomic status and polygenic scores for educational attainment. Proceedings of the National Academy of Sciences, 117(22), 12411-12418.

      Martin, J., Hamshere, M. L., Stergiakouli, E., O'Donovan, M. C., & Thapar, A. (2015). Neurocognitive abilities in the general population and composite genetic risk scores for attention‐deficit hyperactivity disorder. Journal of Child Psychology and Psychiatry, 56(6), 648-656.

      van Os, J., Linscott, R., Myin-Germeys, I., Delespaul, P., & Krabbendam, L. (2009). A systematic review and meta-analysis of the psychosis continuum: Evidence for a psychosis proneness–persistence–impairment model of psychotic disorder. Psychological Medicine, 39(2), 179-195. doi:10.1017/S0033291708003814

      10) It is overall not clear when genetically and when self-reported information of ethnicity was used. This needs to be clearer throughout.

      Thank you for mentioning this point. We only used genetically defined ethnicity, and we have not mentioned that we used self-reported ethnicity. Per your suggestion, we clarified that we used ‘genetic ethnicity’ throughout the paper.

      11) The sentence starting line 253 is also unclear. How is schizophrenia PGS a 'more direct genetic predictor of PLE' and compared to what other measure?

      Thank you for pointing this out. Please note that our adjustment (or sensitivity analyses) was based on the reported associations between PLEs and the risk for schizophrenia: schizophrenia PGS is associated with a cognitive deficit in psychosis patients (Shafee et al., 2018) and individuals at-risk of psychosis (He et al., 2021), and psychotic-like experiences (more so than PGS for psychotic-like experiences) (Karcher et al., 2018). We added these references for clarification [line 307~309]. We believe that because of the adjustment our results from the mixed linear model show the sensitivity and specificity of the association between cognitive phenotype PGS and PLEs.

      He, Q., Jantac Mam-Lam-Fook, C., Chaignaud, J., Danset-Alexandre, C., Iftimovici, A., Gradels Hauguel, J., . . . Chaumette, B. (2021). Influence of polygenic risk scores for schizophrenia and resilience on the cognition of individuals at-risk for psychosis. Translational Psychiatry, 11(1). doi:10.1038/s41398-021-01624-z

      Karcher, N. R., Paul, S. E., Johnson, E. C., Hatoum, A. S., Baranger, D. A. A., Agrawal, A., . . . Bogdan, R. (2021). Psychotic-like Experiences and Polygenic Liability in the Adolescent Brain Cognitive Development Study. Biological Psychiatry: Cognitive Neuroscience and Neuroimaging. doi:https://doi.org/10.1016/j.bpsc.2021.06.012

      Shafee, R., Nanda, P., Padmanabhan, J. L., Tandon, N., Alliey-Rodriguez, N., Kalapurakkel, S., . . . Robinson, E. B. (2018). Polygenic risk for schizophrenia and measured domains of cognition in individuals with psychosis and controls. Translational Psychiatry, 8(1). doi:10.1038/s41398-018-0124-8

      12) Please include a statement on the assumptions made when using the method used in this study and developed by Miao 2022, explain what evidence you have to support these assumptions and how this method, which I believe was developed for RCTs, can be applied to observational data.

      We specified the assumptions for the causal inference method proposed by Miao et al. (2022) and why it is applicable to our study. Also, we noted that this novel method was developed to identify the causal effects of multiple treatment variables within non-randomized, observational data [line 309~319].

      13) Some of the statements are potentially misleading. E.g., I would be very cautious to claim that the methods applied allowed the authors to estimate 'unbiased associations again potential (even unobserved) confounding variables'. There are many concerns such as selection bias, attrition, reverse causation, genetic confounding, etc that cannot be addressed satisfactorily using these data and methods.

      Thank you for pointing this out. We deleted statements like ‘unbiased estimates’ and used expressions such as ‘adjustment for observed/unobserved confounding’ instead.

      Nevertheless, please note that due to some limitations in the data (e.g., confounders), an analytic approach should be robust enough to handle potential violations of assumptions. This was the point we wanted to emphasize--In contrast to the majority of studies using the ABCD study, which employ simplistic GLM or conventional SEM with only latent variable modeling, our study provides less biased, thus more accurate, estimates through the use of sophisticated modeling for confounding effects (instead of simplistic GLM) and IGSCA (instead of conventional simplistic SEM). We hope our study may help improve our analytical approach in this field.

      14) I would be equally cautious to claim that the ABCD study is representative. Please add information on the whole ABCD cohort to Table 1 and describe any relevance with respect to attrition effects or representativeness.

      Thank you for highlighting this issue. We previously characterized the ABCD Study as representative of the US population, given its aim to ensure representativeness by recruiting from a broad range of school systems located near each of its 21 research sites, chosen for their geographic, demographic, and socioeconomic diversity. Using epidemiological strategies, a stratified probability sample of schools was selected for each site. This procedure took into account sex, race/ethnicity, socioeconomic status, and urbanicity to reduce potential sampling biases at the school level. Based on these strategies, previous research (e.g., Thompson et al., 2019; Zucker et al., 2018) has referred to the ABCD Study as ‘representative.’ However, we overlooked the fact that “not all 9-year-old and 10-year-old children in the United States had an equal chance of being invited to participate in the study,” and therefore, it should not be deemed fully representative of the US population (Compton et al., 2019). Heeding your suggestion, we have removed all descriptions of the ABCD Study being representative.

      Compton, W. M., Dowling, G. J., & Garavan, H. (2019). Ensuring the Best Use of Data: The Adolescent Brain Cognitive Development Study. JAMA Pediatrics, 173(9), 809-810. doi:10.1001/jamapediatrics.2019.2081

      Thompson, W. K., Barch, D. M., Bjork, J. M., Gonzalez, R., Nagel, B. J., Nixon, S. J., & Luciana, M. (2019). The structure of cognition in 9 and 10 year-old children and associations with problem behaviors: Findings from the ABCD study’s baseline neurocognitive battery. Developmental Cognitive Neuroscience, 36, 100606. doi:10.1016/j.dcn.2018.12.004

      Zucker, R. A., Gonzalez, R., Feldstein Ewing, S. W., Paulus, M. P., Arroyo, J., Fuligni, A., . . . Wills, T. (2018). Assessment of culture and environment in the Adolescent Brain and Cognitive Development Study: Rationale, description of measures, and early data. Developmental Cognitive Neuroscience, 32, 107-120. doi:https://doi.org/10.1016/j.dcn.2018.03.004

      15) The imputation methods need to be explained in more detail / more clearly. What concrete variables were included? Why was 50% of the sample excluded despite imputation? How similar is the study sample to the overall ABCD cohort - and to the US population in general (i.e., is this a representative dataset)?

      Thank you for mentioning this point. We clarified the method and detailed processes of the imputation (e.g., R package VIM, number of missing observations for each study variables such as genotypes, follow-up observations, and positive environment) [Methods; line 167~176].

      The final samples had significantly higher cognitive intelligence, parental education, family income, and family history of psychiatric disorders, lower Area Deprivation Index, percentage of individuals below -125% of the poverty level, and family’s financial adversity (p<0.05). As you have noted above, these results also show the limited representativeness of the data used in our study. We fully acknowledge that our study sample, as well as the overall ABCD cohort, is not representative of the US population in general.

      16) There are a range of unclear statements (e.g., 'Supportive parenting and a positive school environment had the largest total impact on PLEs than genetic or environmental factors' - isn't parenting an environmental factor?).

      Thank you for mentioning this point. We clarified seemingly vague expressions and unclear statements. We corrected the sentence you noted as ‘Supportive parenting and a positive school environment had the largest total impact on PLEs than any other genetic or environmental factors’ [line 57~58].

      17) The authors' conclusion (that these findings have policy implications for improving school and family environmental) are not fully supported by the evidence. E.g., genetic effects were equally large.

      Thank you for pointing this out. Our description should be clearer. Our models consistently show that the combined environmental effects of positive family/school environment, and family/neighborhood SES exceeds the genetic effects. We suggest that these findings may have policy implications for “improving the school and family environment and promoting local economic development” [line 62~64].

      To clarify, we newly added “Despite the undeniable genetic influence on PLEs, when we combine the total effect sizes of neighborhood and family SES, as well as positive school environment and parenting behavior (∑▒〖|β|〗=0.2718~0.3242), they considerably surpass the total effect sizes of cognitive phenotypes PGSs (|β|=0.0359~0.0502)” [line 510~513]. Based on these results, we suggest that our findings hold potential policy implications for “preventative strategies that target residential environment, family SES, parenting, and schooling—a comprehensive approach that considers the entire ecosystem of children's lives—to enhance children's cognitive ability and mental health” in the Discussion [line 507~510].

      Admittedly, our results do not directly demonstrate a causal effect wherein an intervention in the school or family environmental variables would necessarily lead to a significantly meaningful positive impact on a child's cognitive intelligence and mental health. We do not make such a claim in this paper. However, we anticipate that further integrative analyses akin to ours might help identify potential causal or prescriptive effects. We hope this perspective will be recognized as one of the contributions of our study. We leave the final decision to the discerning judgment of the editors and reviewers.

      18) Many citations do not support the statements made and are sometimes used rather vaguely. For example, I believe Judd 2020 and Okbay 2022 did not use a PGS of cognitive capacity, but of educational attainment. Plomin 2018 and Harden 2020 are reviews, but the primary studies should be cited instead. Which reference exactly is supporting the statement that cognitive capacity PGS links to brain morphometry?

      Thank you very much for your precise observations. We thoroughly checked all citations and updated the references for each statement.

      We deleted Plomin & von Stumm (2018) and Harden & Koellinger (2020) and cited relevant original research articles (e.g., Lee et al., 2018; Okbay et al., 2022; Abdellaoui et al., 2022) instead. We also specified the references supporting the statement that educational attainment PGS links to brain morphometry (Judd et al., 2020; Karcher et al., 2021). As Okbay et al. (2022) used the PGS of cognitive intelligence (which presented the analyses results in their supplementary materials) as well as educational attainment, we decided to continue citing this reference [line 131~141].

      19) Citations are formatted inconsistently.

      We apologize for the inconsistency of the citation formatting. We formatted all citations in APA 7th style, using EndNote v20. We checked that all citations maintain consistency according to the reference style.

      20) Re line 281, I believe effect sizes are 'up to twice as large', but not consistently twice as large as suggested in the text.

      Thank you for mentioning this point. We corrected the sentence as ‘The effect sizes of EA PGS on children's PLEs were larger than those of CP PGS’ [line 342~343].

      21) Please add to the results a short statement on what covariates these analyses were controlled for.

      Thank you for giving us this comment. We added that we used sex, age, marital status, BMI, family history of psychiatric disorders, and ABCD research sites as covariates in the Results section [line 329~331].

      22) Cho 2020 does not provide recommendations on FIT values (line 315). Please provide another reference and explain how these FIT values should be interpreted.

      Thank you for mentioning this point. We added the correct reference for FIT values (Hwang, Cho, & Choo, 2021). We also added that the FIT values range from 0 to 1, and a larger FIT value indicates more variance of all variables is explained by the specified model (e.g., FIT=0.50 denotes that the model explains 50% of the total variance of all variables) [line 291~293].

      23) Regarding Figure 2, please add factor loadings to this figure and explain what the difference between the hexagon and circular shapes are. Please also add the autocorrelations between the 3 PLE measures. I assume these were also modelled statistically, given the strong correlations between time points?

      Figure 2B needs reworking.

      It is unclear what the x-axis of Figure 2C represents. Proportion of R2 or effect size? SM table 2 provides key information, which should be added to Figure 2.

      Thank you for pointing this out. We added factor loadings to the corrected figure (Figure 3A and 3B). We also added that the X-axis of Figure 3C represents standardized effect sizes.

      24) I suggest adding units directly to Table 1, not in the legend. Was genetic or self-reported ethnicity used in this table? List age in years, not months?

      Thank you for your suggestion. We added the units of age and family history of psychiatric disorders directly inside Table 1. We used genetic ethnicity in Table 1, as we only used genetic ethnicity (but not self-reported ethnicity) throughout our study. This is noted on the last row of Table 1. We listed age in chronological months, which is how each child’s age at each point of data collection is coded in the ABCD Study.

      25) Please include exact p-values in Table 2.

      Thank you for your suggestion. We highly appreciate the reviewer’s comment on the importance of showing exact p-values in the analysis results. Unfortunately, we cannot estimate the standard errors based on normal-theory approximations to obtain the exact p-values of our IGSCA model results. This is described in detail in the original paper of the IGSCA method (Hwang et al., 2021): “Like GSCA and GSCAM, IGSCA is also a nonparametric or distribution-free approach in the sense that it estimates parameters without recourse to distributional assumptions such as multivariate normality of indicators. As a trade-off of no reliance on distributional assumptions, it cannot estimate the standard errors of parameter estimates based on asymptotic (normal-theory) approximations. Instead, it utilizes the bootstrap method (Efron, 1979, 1982) to obtain the standard errors or confidence intervals of parameter estimates nonparametrically.”

      Efron, B. (1979). Bootstrap methods: Another look at the jackknife. Annals of Statistics, 7, 1–26. http://dx.doi.org/10.1214/aos/1176344552

      Efron, B. (1982). The jackknife, the bootstrap and other resampling plans. Philadelphia, PA: SIAM. http://dx.doi.org/10.1137/1.9781611970319

      Hwang, H., Cho, G., Jung, K., Falk, C. F., Flake, J. K., Jin, M. J., & Lee, S. H. (2021). An approach to structural equation modeling with both factors and components: Integrated generalized structured component analysis. Psychological Methods, 26(3), 273-294. doi:10.1037/met0000336

      26) There are way too many indigestible tables presented in the supplementary materials, which are also not referenced in the main manuscript.

      We appreciate your insightful observation. As you rightly identified, we inadvertently failed to reference Table S2 in the main text. We have since corrected this omission in the Results section for the IGSCA (SEM) analysis [line 376]. The remainder of the supplementary tables (Table S1, S3~S7) have been appropriately cited in the main manuscript. We recognize that the quantity of tables provided in the supplementary materials is substantial. However, given the comprehensiveness and complexity of our analyses, these tables offer intricate results from each analysis. We deem these results, which include valuable findings from sensitivity analyses and confound testing, too significant to exclude from the supplementary materials. That said, we are open to, and would greatly welcome, any further suggestions to ensure clarity and ease of comprehension. Your guidance in this matter is highly valued.

      27) Figure S1 is unclear, possibly due to the journal formatting. Is this one figure presented on two pages? Clarify which PGS is listed in Figure S1 and in any case, please add both PGSs.

      Thank you for mentioning this point. Figure S1 presents two correlation matrices: the first one is the correlation matrix of component / factor variables in the IGSCA model and the second one is the that of observed variables used to construct the relevant component / factor variables in the IGSCA model. We noted each matrix as Figure S1-A and Figure S1-B. We also corrected the figure legend as “A. Correlation between all component / factor variables of the IGSCA model. B. Correlation between all observed variables used to construct the relevant component / factor variables in the IGSCA model.” Since Figure S1-A presents correlations between the components and latent factors, it lists a single PGS variable constructed from the CP PGS and EA PGS. On the other hand, Figure S1-B presents correlations between the observed variables. Thus, both CP PGS and EA PGS are listed in this correlation matrix.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Kagermeier et al. present a novel and interesting study that attempts to model a severe neurodevelopmental disorder, pontocerebellar hypoplasia type 2a, using neocortical and cerebellar organoids. Brain organoids are an appropriate and promising approach to elucidate disease mechanisms in neurodevelopmental diseases. The authors show a reduction in the size of the organoids which is more pronounced in the cerebellar compared to neocortical organoids. While this finding is interesting and reminiscent of the clinical PCH2a phenotype, i.e., cerebellar hypoplasia, the study is very preliminary and the conclusions of the manuscript are not supported by the data. Additional information and further experiments are necessary to support the claims made.

      Major concerns:

      1. hiPSC lines show considerable inter- and intra-individual variability and therefore the size differences observed between these control and patient-derived organoids may arise from differences in the hiPSC lines used. While the data sufficiently demonstrates the pluripotency of the multiple novel hiPSC lines, major concerns remain as to the appropriateness of the control hiPSC lines. The manuscript should include a table describing the age and sex matching as well as mode of reprogramming for all control and patient lines. Patient and control lines should be matched as closely as possible. Furthermore, figure legends should clearly indicate which clones and lines are shown in the various figure panels.

      We agree with the reviewer that hiPSC variability is an important concern in the field. In order to minimize such effects, all iPSCs lines used in this study were generated following the same protocol in the same lab. All cell lines are derived from male donors, thus, eliminating sex-based variability. Further, there is no report of sex-based variance in the clinical phenotype of PCH2a children and this finding is further corroborated by a currently on-going natural history study in our research team. While it would be ideal to also have age-matched controls, this is not possible for ethical reasons as skin biopsies from healthy children cannot easily be obtained to match the pediatric PCH2a cases. However, based on the literature, we believe that epigenetic age is erased upon reprogramming (Strassler et al 2018, Studer et al 2015). Following the reviewer’s recommendation, we provide a table that clearly indicates the origin of all six cell lines used (see Methods section) and information of respective lines was added to the figure legends as suggested by the reviewer.

      As the hiPSC lines used are not isogenic, it is important that the authors characterise these lines further. This should include a quantification of the rates proliferation and apoptosis in all used hiPSC lines, as these might impact the growth rate of the embryoid bodies / organoids.

      We thank the reviewer for raising this concern. To address the variability of hiPSC lines, we performed an extensive characterization of pluripotency, proliferation and cell cycle dynamics of all six hiPSC lines through immunocytochemistry against pluripotency marker OCT4, proliferation marker Ki-67 and EdU incorporation experiments. We further assessed the apoptosis rate of hiPSCs by staining against apoptotic marker cCas3. These experiments were carried out in three consecutive passages of all iPSC lines providing statistical power to the analyses. All experiments did not result in significant differences between PCH2a and control iPSC lines (see Figure 2).

      The authors state that the hiPSC lines have been characterised by SNP arrays to show that no genomic / chromosomal aberrations have been accrued due to reprogramming. The manuscript should include information as to when the SNP array was performed (i.e., immediately after reprogramming, after initial passaging, etc) and also include the results of the SNP array as additional information. What passage were the hiPSC when the presented experiments were carried out?

      In agreement with this comment, we provide data of SNP arrays that were performed to ensure the chromosomal integrity of all cell lines (see supplement). Further, we added details on passages of the cell lines in the respective figure legends as suggested by the reviewer. In brief, all cell lines were kept below passage 20 and were subjected to pluripotency testing before differentiations were started.

      Given that TSNE54 is broadly and strongly expressed in the developing nervous system, the very limited staining of the organoids for TSNE54 in Figure 2 is surprising. Can the authors provide an explanation for the fact that TSNE54 is only expressed in a small subset of cells? Which cell types are these? Moreover, high-magnification images should be shown to demonstrate subcellular staining pattern of TSNE54. Quantification of TSNE54 protein levels by immunoblotting would also be beneficial.

      Related to this observation, it is puzzling that the large size differences that the authors observe in their organoids would be driven by such a small number of TSNE54-expressing cells. How do the authors explain this discrepancy?

      We thank the reviewer for this comment. We have carefully assessed human cerebellar development transcriptomic datasets which demonstrate that TSEN54 is in fact not strongly but moderately expressed in the human developing nervous system. Additionally, TSEN54 expression is expressed in various different cell types (not limited to a subset of cell types) (Aldinger et al 2021, Sepp et al 2021). We agree with this reviewer and reviewer 3 that Western Blotting or other types of quantification would be informative as well as investigation of the subcellular localization of the protein. However, these questions go beyond the scope of the current manuscript, which aims to present a disease model. We have therefore decided to remove the characterization of TSEN54 expression in organoids from our revised manuscript.

      The generated organoids need to be better characterised with a broader range of markers using both qPCR and immunostaining. At the moment, their identity as "cortical" and "cerebellar" organoids remain unconvincing. This is particularly true for cerebellar organoids, which are challenging to generate and are not widely used. The authors should include additional markers (for example, see PMIDs 25640179, 29397531, 32117945) and immunostaining should clearly show expected staining patterns.

      In Figure 5, it appears that some markers (e.g., SATB2) are expressed differently between control and patient lines, yet this is not commented on by the authors who conclude that control and patient lines show differentiation into organoids.

      We thank the reviewer for this suggestion. We performed further immunostainings using the markers that were used in other cerebellar organoid papers (Muguruma et al 2015, Silva et al 2020, Watson et al 2018) as the reviewer suggested. In detail, we added immunohistochemistry experiments on Day 30 and Day 50 of differentiation for early Purkinje cell markers OLIG2 and SKOR2. We also included ATOH1 as a marker for rhombic lip-derived granule cells. For the neocortical organoids, we believe that the performed characterization is sufficient since the protocol we used is well-established and widely used as also indicated by the reviewer. We agree that the cellular composition of the organoids should be investigated in detail (for instance using single-cell transcriptomics). However, we believe this is out of the scope of this manuscript, which describes the establishment of a brain-region specific model platform.

      The authors attempt to look into a potential mechanism for the size differences observed between control and patient organoids. However, only cleaved caspase-3 is used as a marker for apoptosis and no differences were observed. The authors should include further markers for potential cell death. In addition, immunostaining for proliferation markers (i.e., KI67) should be performed to evaluate whether the difference in organoid size could stem from decreased proliferation rather than increased cell death.

      We agree with the reviewer and included a quantification of the proliferation marker Ki-67 within the SOX2 positive population of cerebellar and neocortical organoids as well as the quantification of SOX2 positive areas within the organoids (Figure 6). We observed significant differences in proliferation between PCH2a and control cerebellar organoids. Moreover, we also analyzed the morphology of organoids and quantified the thickness and number of rosettes and find significant differences between control and PCH2a cerebellar organoids corroborating the notion that proliferation is altered in cerebellar organoids. Neocortical organoids do not show any significant differences in proliferation and Sox2+ structures. Only the thickness of the Sox2+ areas is slightly decreased in neocortical PCH2a organoids compared to controls. In order to deepen our analysis of a possible increased apoptosis in PCH2a organoids, we also quantified cCas3 in Sox2+ structures (Figure 5) as also suggested by Reviewer 2. These analyses did not show any significant differences between PCH2a and control organoids. We therefore suggest that at the early stages of differentiation studied here, proliferative differences are the main reason for the size differences between PCH2a and control organoids.

      Reviewer #1 (Significance (Required)):

      The authors present an innovative approach to study neurodevelopmental disorders using brain organoids and should be of interest to researchers and clinicians working on neurodevelopmental diseases. However, the data presented are too limited to support any conclusions about the phenotype observed. Furthermore, questions remain about the used methodology and more work is needed to demonstrate the successful generation of both cortical and cerebellar organoids.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Please find enclosed my recommendation for the paper submitted by Kagermeier et al entitled' Human organoid model of PCH2a recapitulates brain region-specific pathology'. It describes the development of a human model for PCH2a and its characterization. My overall assessment of the paper is 'Major revision' which is explained below.

      Although the paper is very well written and clearly interesting in that it describes the generation and initial analyses of a human organoid model for PCH2a it should be revised such that it will proof the points it is trying to make. The authors are meticulous in their studies combining cellular characterization and a thorough initial screen of organoid (both cerebellar as well as cortical) integrity, yet hardly any mechanistic data is provided. Nevertheless, if the authors are able to add additional experiments and are able to address the points raised, the reviewer may be willing to consider a more positive outcome.

      Major concerns

      1) The overall quality of the figures is poor. There is a lot of overexposure such that often cellular or tissue structures are blended. It starts with Figure 1 G and H but can be observed throughout the manuscript. Deconvolution would greatly enhance their results.

      We are thankful for this comment and we have improved the quality of all microscopy images.

      2) Especially figure 4 and 5 could have been complemented with quantitative data. It furthermore seems more supplemental figure as these are just proof-of-principle stainings. No conclusions can be drawn from the panels except that all markers are there in the various conditions. And while they are showing a neural rosette in Fig 4A, just tiny ones can be observed in 4B. It is also not clear what the whole mount IHC ads in comparison to the IHC on sections. It is also strange that there is still a lot of SOX2 in the CALB/MAP2-positive area, but again with this magnification hard to appreciate.

      We agree with the reviewer that so far we presented qualitative proof-of-principle stainings that demonstrate cerebellar and neocortical differentiation, respectively. In order to address the comment of the reviewer, we improved the quality of the images and also provided higher magnification and enhanced resolution. Additionally, we now provide detailed quantifications of SOX2+ and Ki67+ neural progenitor cells and show that differences observed between PCH2a and control cerebellar organoids may explain the size differences observed between organoids (Figure 6). Our study provides the basis for more in-depth analysis of differences in differentiation and cell type composition between PCH2a and control organoids in the future, for example through single-cell RNAseq.

      3) If the authors would like to proof the point that cerebellar/cortical development is hampered, more functional assays could have been done. Nothing is analyses on the fraction of progenitor cells present (such as the percentage of Tbr2+ IPC in VZ/CP). Furthermore, if there is a suspicion that the number of cells is affected (which is also not shown), proliferation/cell cycle exit experiments using BrdU/EdU should have been performed. Early cell cycle exit still cannot be rules out and should have been tested by the combination of Ki67-/EdU+ percentage of a certain faction of progenitor cells (eg PAX6+ pool).

      We thank the reviewer for this valuable suggestion and agree that it would be interesting to carry out respective experiments. In this study, we show the establishment of a brain-region-specific organoid platform as a disease model for PCH2a and are only at the beginning of deciphering the underlying mechanism. In the revised manuscript, we quantified Ki-67+/Sox2+ cells in proliferative zones in the organoids. We believe that future studies including BrdU / EdU incorporation assays as well as scRNA-seq will answer the questions raised here and decipher the disease-causing mechanism on both cellular and molecular levels but are beyond the scope of this manuscript.

      4) Instead the author chose to only perform a cCas3 staining. From the panels in Figure 6 it is hard to appreciate which cells are actually cCas3+. Also the analyses were performed on the total pool of cell while it might have been more interesting to look for cell death of the various progenitor pools (eg the SOX2+ pool).

      We agree with the reviewer that a more in-depth analysis of apoptotic cell populations is interesting and performed cCas3/Sox2+ quantification for cerebellar and neocortical organoids. We did not observe significant differences of cCas3 expression within the SOX2+ cell population. (Figure 5)

      Minor concerns

      1) It would greatly enhance the review process if line numbers are added

      We have added line numbers to the manuscript.

      2) On general concepts (such as the generation of organoids in the context of disease) more references could have been added

      We have added more references and discussed the topic of brain organoids as disease models as suggested by this reviewer (Eichmüller & Knoblich 2022, Khakipoor et al 2020, Velasco et al 2020).

      Figures

      Fig. 1: In A, the square is clearly visible and not similar to B. An annotation of which is the control and which is the patient is missing in the figure. The arrows are hardly visibly, would make them slightly bigger and remove the black outer lining. Figure 1C can easily go to the Supplemental material. Fig 1 D is hard to appreciate the staining, a close-up with bright field microscope will help. E-I Most of the panels but especially G and H are overexposed. In J, it is hard to appreciate the TSEN54 staining. Maybe separate channels and a merge?

      We thank the reviewer for bringing these details to our attention. We have changed the arrows in the figure to enhance their visibility. Further we have adjusted the quality of the images overall. Lastly, we have made a comment in the figure legend clearly stating which scan came from which child. The described square was added to hide facial features of the imaged individuals hence they are not identical.

      Fig. 3: Usually go into the supplementals.

      Since organoid size is a major first readout when modeling a disorder that is characterized by a reduction of the volume of specific brain regions, we decided to keep this readout in the main text.

      Fig 4/5: Lack of quantitative data and poor quality of figures (overexposure).

      Fig 6: Many of the SOX2 panels are overexposed

      We thank the reviewer for the suggestions on the figures and addressed the concerns in the revised manuscript.

      CROSS-CONSULTATION COMMENTS

      I completely agree with reviewers #1 and #3. It is good to notice that we are overall on the same page.

      Reviewer #2 (Significance (Required)):

      The authors definitely made an excellent start to model PCH2a. Three controls and three patient lines are good to begin with but isogenic controls using one parental line and a patient line where the mutation is fixed would have been ideal. It is interesting that there seem to be a brain area specific pathology of the phenotype. Yet, more thorough analyses could have been performed such as proliferation and differentiation and cell cycle exit experiments. As for now the mostly descriptive data are only scratching the surface and little can be concluded on the molecular framework they are trying to solve.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this study Kagermeier et al. use human cerebellar and neocortical organoids to investigate the effects of the PCH2a-causing homozygous TSEN54c.919G>T variant on the neurodevelopment of different brain regions. They reveal a substantial growth defect in both neocortical and cerebellar regions with a more profound phenotype in the cerebellum. They continue to investigate major cell types of neurodevelopment in both regions and briefly potential mechanisms underlying the phenotypes. The study is well conceived and addresses the current gap of disease-modeling in cerebellar organoids; nevertheless, some major claims are not sufficiently substantiated in the current version. Below, I provide suggestions on how to improve the manuscript with some additional minor comments that might help with readability and accessibility of the work.

      Major comments:

      1. TSEN54 expression levels: The authors compare RNA and protein expression levels for TSEN54 to investigate the mutation's effect. For this the authors use qPCR on iPSCs and organoids of different age and immunostainings and conclude "we did not find differences in expression between cell and tissue types". There are some issues with this analysis as explained below:

      -The qPCR data (Fig. 2B) is first normalized to a housekeeping gene (GAPDH), however, then all organoid data are additionally normalized to the respective iPSC line. Thus, in case there is already a difference on iPSC level, this normalization might mask any difference in the organoids. It is unclear why this approach was chosen, and it seems more appropriate to show the data just normalized to GAPDH than additionally normalizing to the iPSCs, or at least to show first that iPSCs do not have differences in TSEN54 expression. Furthermore, even though apparently not statistically significant there seems to be a strong trend of lower TSEN54 levels in PCH2a in neocortical organoids, but even more so in cerebellar organoids. In my view this would fit very well with the study and should be further explored before concluding there is no statistical difference. Considering the high error bars of the cerebellar organoid samples, a higher N-number might be necessary to reach statistical significance in the difference in expression. Most importantly, it would be appropriate to show single data points where possible and to mark the different cell lines (as done in other figures), as otherwise it is not possible to judge whether there is a cell line bias in the data.

      -The evidence for protein expression of TSEN54 is immunofluorescence stainings for all conditions. As there is no quantification, the authors should not conclude differences, or the lack thereof, based on this qualitative data. Furthermore, in fact in the on example shown the PCH2a cerebellar condition (Fig 2D) seems to show lower expression levels compared with other conditions. This could be due to the selected image, as all other examples include large neural rosettes with strong staining in the center of the rosettes. Furthermore, it is unclear what cell line these stainings come from, even whether the PCH2a cerebellar and neocortical stainings come from the same cell line. Thus, the authors should select comparable examples for all conditions, and ideally provide staining examples (e.g., as supplementary data) for the other replicates to ensure expression in all replicates. If the authors want to comment on differences in protein expression, maybe a quantitative approach (e.g., quantitative western blot) would be more appropriate. Otherwise, the statements should be adjusted to not conclude whether TSEN54 protein levels differ or not.

      -Irrespective of the above comments the conclusion of the section "TSEN54 expression in cerebellar and neocortical organoids", that currently reads "we did not find differences in expression between cell and tissue types" should be changed, as the authors did not investigate whether there are cell type-specific differences of TSEN54 expression.

      We thank the reviewer for this comment. We agree that the provided data is not suitable for quantitative analysis of TSEN54 expression. Please also see our related response to the similar concern raised by reviewer 1. Thanks to these suggestions, we have decided to exclude the TSEN54 expression data from the current manuscript as a detailed analysis should be part of an extensive future study.

      Organoid growth analysis:

      The organoid growth analysis in Figure 3 and supplementary Figure 2 shows the main phenotype of the study that seems to be very strong. The authors use unpaired t-tests to compare within the different timepoints. Unfortunately, I think this approach might not be appropriate as even though the Welch correction does not rely on similar SDs in the compared groups (Control vs. PCH2a), it still assumes that all data points within each group share the same variance. However, this is not the case, as e.g., the control condition includes three groups (Control-1 to -3), that between groups might have different variance as such not all datapoints are independent from each other. Potentially ANOVA analyses controlling for cell line and timepoint might be more appropriate. Or additionally, the authors could consider using the linear regression analysis in Supplementary Figure 2 to further investigate the difference in organoid growth by e.g., comparing the slope of the regression lines. This might be more appropriately reflecting the growth deficit over time than simply comparing each timepoint individually. Expanding on this analysis the regression analysis requires some more information on the fit (intercept, slope, R-squared of the model), which would help clarifying the growth dynamics in the different systems and conditions.

      We thank the reviewer for the suggestions on statistical analysis and adjusted our approach accordingly. Briefly we performed 3-way-ANOVA analysis for the growth curves which revealed no significant differences between the different lines within the groups (Control or PCH2a) at different time points. Additionally, we added the linear regression model to the results (See Figure 3 and supplementary table 2, with the information on the curve fit).

      The growth ratio analysis (Figure 3D) is essential to the major claim of the paper that the organoids replicate the region-specific differences. As the authors performed all experiments with matching cell lines this could additionally strengthen the argument by generating the ratio of size differences for each cell line separately (instead of just for all PCH2a lines together). This would allow comparison of the same genetic background in both cerebellar and neocortical condition and further corroborate the region-specific severity of the phenotype. Potentially, this would also enable to test these differences statistically.

      We appreciate the suggestion to compare the differentiation protocols by line. Below we display the line-by-line analysis between the two differentiation protocols at D30 (A), D50 (B), and D90 (C). In order to visualize the differences in size between the two protocols more clearly, we have generated ratios of the average organoid sizes between neocortical and cerebellar organoids (D). The analysis corroborates our previous visualizations and statistics (3-way ANOVA) by showing that PCH lines produce neocortical and cerebellar organoids that differ in size more than those of control lines. The differences are most pronounced at D30 and D90. However, we believe that this analysis does not add additional value to our manuscript and have therefore decided not to include it in the revised version.

      Additionally, all growth analyses for the neocortical organoids (Figure 3C, Supplementary Figure 2B and C) seem to lack the PCH-1 cell line and only contain PCH-2 and PCH-3. This cell line should be added or commented on why it was excluded from the analyses.

      We agree with the reviewer. Unfortunately, we experienced contamination in that specific differentiation and therefore cannot provide the data. We have made a related comment in the manuscript. Since all differentiations were performed in parallel, adding this line at a later time point would add additional confounders and is therefore undesirable.

      Potential mechanism of the phenotype (apoptosis analysis):

      In Figure 6 the authors investigate the hypothesis that increased apoptosis contributes to the phenotypes. In the cleaved Caspase 3 staining there appear to be no differences. Unfortunately, the analysis apparently only includes one replicate (one organoid?) per cell line and condition. Considering the variability in the data shown this seems inappropriately low and should ideally contain ~3 replicates per cell line condition to judge technical and biological variability if the authors want to make the point that there is no "significant difference between PCH2a and control organoids at any time point in both cerebellar and neocortical organoids". Otherwise, this claim does not seem to be substantiated enough by the data.

      Finally, due to the absence of a phenotype related to apoptosis the authors conclude that the phenotypes may be due to "deficits in the proliferation of progenitor cells". Although this is mentioned in the introduction and the discussion, there is no evidence in the current study that supports this interesting idea. By adding relatively straight forward co-staining experiments for e.g., SOX2 (progenitors) and Ki67 (proliferating cells), the authors could provide further evidence for this hypothesis using existing organoid sections. This would support this speculative idea and could add a more mechanistic insight to the study, thereby making it more exciting.

      To address this concern, we have now added a table to the supplement that described in detail which organoids / batches / cell lines were used for which experiment (Supplementary table 3). In addition to our previous cCas3 quantifications, we performed the quantification of cCas3 within the population of SOX2-positive cells, which was suggested by Reviewer 2 (Figure 5).

      To assess the alternative hypothesis, that proliferation deficits account for the size differences observed between organoids, we also performed quantifications of SOX2-positive zones in the organoids at D30 and D50 of differentiation as well as quantifications of Ki-67 positive cells within the SOX2-positive population. For cerebellar organoids we found significant differences in these experiments (Figure 6). We believe that this data supports the hypothesis of aberrant proliferation in PCH2a cerebellar organoids explaining the size differences.

      Minor comments:

      • Cell line and quality control: The authors recruit three male patients with PCH2a and reprogram iPSCs. These cell lines are subjected to a well performed extensive quality control. However, it is unclear what cell lines the stainings (e.g., Fig. 1D to I) originate from. Furthermore, the supplementary qPCR analysis (Supplementary Figure 1) includes only the PCH-1 line, and additionally two cell lines that are not explained (F-CO and hESC-I3). It is unclear what the relevance of showing the qPCR of these cell lines is. To ensure proper QC for all used cell lines the authors should provide data for all cell lines (PCH-1 to -3 and control-1 to -3), or at least summarize (e.g., in a table) what QC metrics were applied to which cell line. Most importantly, this information is completely lacking for the control cell lines and the QC is just mentioned in the text. Unfortunately, it is unclear where the control cell lines originate from, and some basic information would be required to judge whether they are appropriate controls: are they iPSC or ESC, were they reprogrammed with a similar paradigm as the PCH2a cells, what is the gender of the control cell lines (all PCH2a cell lines are apparently male)?

      In line with a similar comment from reviewer 1, we have included a table that provides information on the origin of all six cell lines used in the revised manuscript (methods section). Further we provide SNP-Array data on all cell lines as supplementary material. We also performed detailed characterization of pluripotency, proliferation and cell cycle dynamics of all six hiPSC lines through immunocytochemistry against pluripotency marker OCT4, proliferation marker Ki-67 and EdU incorporation experiments (Figure 2). We further assessed the apoptosis rate of hiPSCs by staining against apoptotic marker cCas3. All experiments did not result in significant differences between PCH2a and control iPSC lines (see Figure 2). In line with the suggestion of this reviewer, we removed the qPCR analysis of iPSCs from the manuscript.

      • To make the study more approachable for a medical audience and to judge the variability in phenotype presentation among the recruited patients it would be appreciated if more information on the patients would be provided. The authors write: "We identified three individuals that display the genetic, clinical and brain imaging features previously described for PCH2a.". This information including age/date of birth, as well as other medically relevant information could be provided in the supplementary figure (e.g., is there a difference in disease burden among the different patients?). This would allow judging the recruited cohort better.

      We thank the reviewer for this insightful comment. We provided a table with detailed clinical information (supplementary table 1).

      • According to the method section the cerebellar and neocortical organoids were cultured in very different medium especially at later timepoints. While neocortical organoids were kept in a neural maintenance medium based on Neurobasal-A, cerebellar organoids were kept in a medium based on BrainPhys. These media contain very different levels of nutrients, especially of glucose (25mM vs 2.5mM, Bardy et al. 2015). This can have a strong phenotype on proliferation of progenitors and proliferative phenotypes (e.g., see Eichmüller et al. 2022). Especially as the authors claim that there is a difference in the PCH2a phenotypes between brain regions, it should be excluded that this is due to medium differences at later timepoints. When investigating the growth curves of Figure 3B and C it seems like the major difference in growth speed seems to be that neocortical organoids grow faster in early timepoints (We agree that media composition can greatly influence growth dynamics of cells in 2D and 3D. However, in this study we assess the differences between two groups: the PCH2a and control iPSC-derived organoids. The differences we describe are in relation to the respective control group and iPSCs were generated following the same protocol in the same lab. We believe that by following two protocols and comparing the three PCH2a to the three control lines within each protocol predominantly, we account for different media composition possibly changing growth dynamics.

      • Staining examples shown and presentation: In several figures the authors could improve the presentation of the staining examples with some changes:

      o Cell line information for images: as the authors only ever note the condition (PCH2a or Control) but not the cell line it is unclear if the stainings all come from one cell line or from multiple different cell lines. This prevents comparing the different differentiation conditions. Additionally, for major conclusions the authors should consider including supplemental stainings or further information on how reproducible the results shown are (how many cell lines and batches were used?).

      We thank the reviewer for these suggestions. We added information on cell lines and passages for all experiments shown in this study in the figure legends. Moreover, we also added a table providing information on n-numbers for all experiments (supplementary table 3).

      o Selection of examples: in several cases (Fig 2C/D, 4A, 6A/B) the selected images depict very different regions, e.g., one condition shows a large rosette, while in the other condition no rosette can be seen. It would be more appropriate to show matching examples where possible.

      We agree with the reviewer and have chosen matched regions of interest in the figure panels in the revised version of the manuscript. Please note that for cerebellar organoids we observed a significant difference in the timepoint of appearance of these rosette-like structures. Therefore, an exact matching of regions of interest was not possible due to biological differences between the samples, which we have also quantified (Figure 6).

      o Color code of stainings: Colors do not match throughout the manuscript in immunofluorescence images. E.g., Fig. 4 uses blue, green, red, magenta and Fig. 5 uses blue, green, magenta, cyan. It would be preferable to adhere to one color code. Considering significant fraction of the population is having red-green blindness, the latter color code seems more appropriate as it should ensure readability also for color-blind audiences.

      We are thankful for this comment. We changed the color code to make figures more widely accessible.

      • Small typos:

      o Figure 1 legend: last sentence "The" instead of "Th"

      o Supplementary Figure 1B: PCH-2 is named "PCH-22"

      o Supplementary Figure 2: As in the main figure for neocortical organoids the PCH-1 condition is missing (see comment on organoid growth curves). Additionally, the color/shape code of the plots in B does not always match the legend (e.g., size in left plot is different and color of PCH-3 in middle and left plot differs from legend and right plot).

      o It is unclear why the cortical organoids are referred to as "neocortical organoids" in the figures and the text. The methods and the reference in the methods as well as all major papers rather use the word "cortical".

      We addressed these suggestions and thank the reviewer for bringing these to our attention. Unfortunately, we could not include data on PCH-01 in neocortical differentiation due to a contamination in this batch. We made sure to run all the batches presented here in parallel so that all conditions are equivalent, preventing us from including a different batch at a later time point.

      We believe that in the context of our study, it is important to highlight cortical organoids as neocortical organoids, because we are also showing cerebellar organoids and there is also a cerebellar cortex.

      References:

      Bardy, C. et al. Neuronal medium that supports basic synaptic functions and activity of human neurons in vitro. Proc National Acad Sci 112, E3312 (2015).

      Eichmüller, O. L. et al. Amplification of human interneuron progenitors promotes brain tumors and neurological defects. Science 375, (2022).

      CROSS-CONSULTATION COMMENTS

      I agree with the comments of the other reviewers and as they are mostly matching, this reinforces the importance to improve certain aspects of the manuscript. As there are no deviating issues I do not comment specifically on any reviewer comments.

      Reviewer #3 (Significance (Required)):

      This work is using organoid technology to shed light on brain region-specific phenotypes in PCH2a. Brain organoids have drastically changed the way we study human neurological diseases (Eichmüller and Knoblich 2022), however, most brain organoid research has focused on cortical organoids. Cerebellar organoid protocols exist for some time (Muguruma et al. 2015, Silva et al. 2020, Nayler et al. 2021) but were not yet applied to uncover new disease biology. Especially considering the important role of human-specific cerebellar processes in specific developmental disorders (Haldipur et al. 2021) and cancer (Hendrikse et al. 2022, Smith et al. 2022), disease modeling in human cerebellar organoids holds great potential for understanding disease biology. The work by Kagermeier et al. demonstrates that human cerebellar organoids are recapitulating brain region-specific growth deficits and thus is an important step forward for disease modeling. Therefore, this work will be interesting to researchers working on brain development and disease modeling, especially in in-vitro systems. Nevertheless, the mechanistic insight of the study is limited, as is the insight into how human-specific processes might be involved in the pathogenesis of PCH2a. Therefore, it will be interesting how this disease model will be used in future to investigate the cell types and mechanisms involved in the PCH2a phenotype.

      Personal field of expertise: Brain organoids and disease modeling in organoids especially of neurodevelopmental diseases. Analysis of organoids with stainings, as well as sequencing techniques, and bioinformatics.

      References:

      Eichmüller, O. L. & Knoblich, J. A. Human cerebral organoids - a new tool for clinical neurology research. Nat Rev Neurol 1-20 (2022) doi:10.1038/s41582-022-00723-9.

      Haldipur, P. et al. Evidence of disrupted rhombic lip development in the pathogenesis of Dandy-Walker malformation. Acta Neuropathol 142, 761-776 (2021).

      Hendrikse, L. D. et al. Failure of human rhombic lip differentiation underlies medulloblastoma formation. Nature 609, 1021-1028 (2022).

      Muguruma, K., Nishiyama, A., Kawakami, H., Hashimoto, K. & Sasai, Y. Self-Organization of Polarized Cerebellar Tissue in 3D Culture of Human Pluripotent Stem Cells. Cell Reports 10, 537-550 (2015).

      Nayler, S., Agarwal, D., Curion, F., Bowden, R. & Becker, E. B. E. High-resolution transcriptional landscape of xeno-free human induced pluripotent stem cell-derived cerebellar organoids. Sci Rep-uk 11, 12959 (2021).

      Silva, T. P. et al. Scalable Generation of Mature Cerebellar Organoids from Human Pluripotent Stem Cells and Characterization by Immunostaining. J Vis Exp (2020) doi:10.3791/61143.

      Smith, K. S. et al. Unified rhombic lip origins of group 3 and group 4 medulloblastoma. Nature 609, 1012-1020 (2022).

      References by the authors

      Aldinger KA, Thomson Z, Phelps IG, Haldipur P, Deng M, et al. 2021. Spatial and cell type transcriptional landscape of human cerebellar development. Nat Neurosci 24: 1163-75

      Eichmüller OL, Knoblich JA. 2022. Human cerebral organoids — a new tool for clinical neurology research. Nature Reviews Neurology 18: 661-80

      Khakipoor S, Crouch EE, Mayer S. 2020. Human organoids to model the developing human neocortex in health and disease. Brain Res 1742: 146803

      Muguruma K, Nishiyama A, Kawakami H, Hashimoto K, Sasai Y. 2015. Self-organization of polarized cerebellar tissue in 3D culture of human pluripotent stem cells. Cell Rep 10: 537-50

      Sepp M, Leiss K, Sarropoulos I, Murat F, Okonechnikov K, et al. 2021.

      Silva TP, Fernandes TG, Nogueira DES, Rodrigues CAV, Bekman EP, et al. 2020. Scalable Generation of Mature Cerebellar Organoids from Human Pluripotent Stem Cells and Characterization by Immunostaining. J Vis Exp

      Strassler ET, Aalto-Setala K, Kiamehr M, Landmesser U, Krankel N. 2018. Age Is Relative-Impact of Donor Age on Induced Pluripotent Stem Cell-Derived Cell Functionality. Front Cardiovasc Med 5: 4

      Studer L, Vera E, Cornacchia D. 2015. Programming and Reprogramming Cellular Age in the Era of Induced Pluripotency. Cell Stem Cell 16: 591-600

      Velasco S, Paulsen B, Arlotta P. 2020. 3D Brain Organoids: Studying Brain Development and Disease Outside the Embryo. Annu Rev Neurosci 43: 375-89

      Watson LM, Wong MMK, Vowles J, Cowley SA, Becker EBE. 2018. A Simplified Method for Generating Purkinje Cells from Human-Induced Pluripotent Stem Cells. Cerebellum 17: 419-27

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary: In this study Kagermeier et al. use human cerebellar and neocortical organoids to investigate the effects of the PCH2a-causing homozygous TSEN54c.919G>T variant on the neurodevelopment of different brain regions. They reveal a substantial growth defect in both neocortical and cerebellar regions with a more profound phenotype in the cerebellum. They continue to investigate major cell types of neurodevelopment in both regions and briefly potential mechanisms underlying the phenotypes. The study is well conceived and addresses the current gap of disease-modeling in cerebellar organoids; nevertheless, some major claims are not sufficiently substantiated in the current version. Below, I provide suggestions on how to improve the manuscript with some additional minor comments that might help with readability and accessibility of the work.

      Major comments: 1. TSEN54 expression levels: The authors compare RNA and protein expression levels for TSEN54 to investigate the mutation's effect. For this the authors use qPCR on iPSCs and organoids of different age and immunostainings and conclude "we did not find differences in expression between cell and tissue types". There are some issues with this analysis as explained below: -The qPCR data (Fig. 2B) is first normalized to a housekeeping gene (GAPDH), however, then all organoid data are additionally normalized to the respective iPSC line. Thus, in case there is already a difference on iPSC level, this normalization might mask any difference in the organoids. It is unclear why this approach was chosen, and it seems more appropriate to show the data just normalized to GAPDH than additionally normalizing to the iPSCs, or at least to show first that iPSCs do not have differences in TSEN54 expression. Furthermore, even though apparently not statistically significant there seems to be a strong trend of lower TSEN54 levels in PCH2a in neocortical organoids, but even more so in cerebellar organoids. In my view this would fit very well with the study and should be further explored before concluding there is no statistical difference. Considering the high error bars of the cerebellar organoid samples, a higher N-number might be necessary to reach statistical significance in the difference in expression. Most importantly, it would be appropriate to show single data points where possible and to mark the different cell lines (as done in other figures), as otherwise it is not possible to judge whether there is a cell line bias in the data. -The evidence for protein expression of TSEN54 is immunofluorescence stainings for all conditions. As there is no quantification, the authors should not conclude differences, or the lack thereof, based on this qualitative data. Furthermore, in fact in the on example shown the PCH2a cerebellar condition (Fig 2D) seems to show lower expression levels compared with other conditions. This could be due to the selected image, as all other examples include large neural rosettes with strong staining in the center of the rosettes. Furthermore, it is unclear what cell line these stainings come from, even whether the PCH2a cerebellar and neocortical stainings come from the same cell line. Thus, the authors should select comparable examples for all conditions, and ideally provide staining examples (e.g., as supplementary data) for the other replicates to ensure expression in all replicates. If the authors want to comment on differences in protein expression, maybe a quantitative approach (e.g., quantitative western blot) would be more appropriate. Otherwise, the statements should be adjusted to not conclude whether TSEN54 protein levels differ or not. -Irrespective of the above comments the conclusion of the section "TSEN54 expression in cerebellar and neocortical organoids", that currently reads "we did not find differences in expression between cell and tissue types" should be changed, as the authors did not investigate whether there are cell type-specific differences of TSEN54 expression.

      1. Organoid growth analysis: The organoid growth analysis in Figure 3 and supplementary Figure 2 shows the main phenotype of the study that seems to be very strong. The authors use unpaired t-tests to compare within the different timepoints. Unfortunately, I think this approach might not be appropriate as even though the Welch correction does not rely on similar SDs in the compared groups (Control vs. PCH2a), it still assumes that all data points within each group share the same variance. However, this is not the case, as e.g., the control condition includes three groups (Control-1 to -3), that between groups might have different variance as such not all datapoints are independent from each other. Potentially ANOVA analyses controlling for cell line and timepoint might be more appropriate. Or additionally, the authors could consider using the linear regression analysis in Supplementary Figure 2 to further investigate the difference in organoid growth by e.g., comparing the slope of the regression lines. This might be more appropriately reflecting the growth deficit over time than simply comparing each timepoint individually. Expanding on this analysis the regression analysis requires some more information on the fit (intercept, slope, R-squared of the model), which would help clarifying the growth dynamics in the different systems and conditions. The growth ratio analysis (Figure 3D) is essential to the major claim of the paper that the organoids replicate the region-specific differences. As the authors performed all experiments with matching cell lines this could additionally strengthen the argument by generating the ratio of size differences for each cell line separately (instead of just for all PCH2a lines together). This would allow comparison of the same genetic background in both cerebellar and neocortical condition and further corroborate the region-specific severity of the phenotype. Potentially, this would also enable to test these differences statistically. Additionally, all growth analyses for the neocortical organoids (Figure 3C, Supplementary Figure 2B and C) seem to lack the PCH-1 cell line and only contain PCH-2 and PCH-3. This cell line should be added or commented on why it was excluded from the analyses.

      2. Potential mechanism of the phenotype (apoptosis analysis): In Figure 6 the authors investigate the hypothesis that increased apoptosis contributes to the phenotypes. In the cleaved Caspase 3 staining there appear to be no differences. Unfortunately, the analysis apparently only includes one replicate (one organoid?) per cell line and condition. Considering the variability in the data shown this seems inappropriately low and should ideally contain ~3 replicates per cell line condition to judge technical and biological variability if the authors want to make the point that there is no "significant difference between PCH2a and control organoids at any time point in both cerebellar and neocortical organoids". Otherwise, this claim does not seem to be substantiated enough by the data. Finally, due to the absence of a phenotype related to apoptosis the authors conclude that the phenotypes may be due to "deficits in the proliferation of progenitor cells". Although this is mentioned in the introduction and the discussion, there is no evidence in the current study that supports this interesting idea. By adding relatively straight forward co-staining experiments for e.g., SOX2 (progenitors) and Ki67 (proliferating cells), the authors could provide further evidence for this hypothesis using existing organoid sections. This would support this speculative idea and could add a more mechanistic insight to the study, thereby making it more exciting.

      Minor comments: - Cell line and quality control: The authors recruit three male patients with PCH2a and reprogram iPSCs. These cell lines are subjected to a well performed extensive quality control. However, it is unclear what cell lines the stainings (e.g., Fig. 1D to I) originate from. Furthermore, the supplementary qPCR analysis (Supplementary Figure 1) includes only the PCH-1 line, and additionally two cell lines that are not explained (F-CO and hESC-I3). It is unclear what the relevance of showing the qPCR of these cell lines is. To ensure proper QC for all used cell lines the authors should provide data for all cell lines (PCH-1 to -3 and control-1 to -3), or at least summarize (e.g., in a table) what QC metrics were applied to which cell line. Most importantly, this information is completely lacking for the control cell lines and the QC is just mentioned in the text. Unfortunately, it is unclear where the control cell lines originate from, and some basic information would be required to judge whether they are appropriate controls: are they iPSC or ESC, were they reprogrammed with a similar paradigm as the PCH2a cells, what is the gender of the control cell lines (all PCH2a cell lines are apparently male)?

      • To make the study more approachable for a medical audience and to judge the variability in phenotype presentation among the recruited patients it would be appreciated if more information on the patients would be provided. The authors write: "We identified three individuals that display the genetic, clinical and brain imaging features previously described for PCH2a.". This information including age/date of birth, as well as other medically relevant information could be provided in the supplementary figure (e.g., is there a difference in disease burden among the different patients?). This would allow judging the recruited cohort better.

      • According to the method section the cerebellar and neocortical organoids were cultured in very different medium especially at later timepoints. While neocortical organoids were kept in a neural maintenance medium based on Neurobasal-A, cerebellar organoids were kept in a medium based on BrainPhys. These media contain very different levels of nutrients, especially of glucose (25mM vs 2.5mM, Bardy et al. 2015). This can have a strong phenotype on proliferation of progenitors and proliferative phenotypes (e.g., see Eichmüller et al. 2022). Especially as the authors claim that there is a difference in the PCH2a phenotypes between brain regions, it should be excluded that this is due to medium differences at later timepoints. When investigating the growth curves of Figure 3B and C it seems like the major difference in growth speed seems to be that neocortical organoids grow faster in early timepoints (<d30), but similar at later timepoints, which would exclude effects of the media at late timepoints. Nevertheless, considering the strong effect media glucose concentration can have the authors should investigate whether there is an effect at growth speed at later timepoints by comparing control organoids. This could also strengthen the region-specific phenotype due to PCH2a.

      • Staining examples shown and presentation: In several figures the authors could improve the presentation of the staining examples with some changes: o Cell line information for images: as the authors only ever note the condition (PCH2a or Control) but not the cell line it is unclear if the stainings all come from one cell line or from multiple different cell lines. This prevents comparing the different differentiation conditions. Additionally, for major conclusions the authors should consider including supplemental stainings or further information on how reproducible the results shown are (how many cell lines and batches were used?). o Selection of examples: in several cases (Fig 2C/D, 4A, 6A/B) the selected images depict very different regions, e.g., one condition shows a large rosette, while in the other condition no rosette can be seen. It would be more appropriate to show matching examples where possible. o Color code of stainings: Colors do not match throughout the manuscript in immunofluorescence images. E.g., Fig. 4 uses blue, green, red, magenta and Fig. 5 uses blue, green, magenta, cyan. It would be preferable to adhere to one color code. Considering significant fraction of the population is having red-green blindness, the latter color code seems more appropriate as it should ensure readability also for color-blind audiences.

      • Small typos: o Figure 1 legend: last sentence "The" instead of "Th" o Supplementary Figure 1B: PCH-2 is named "PCH-22" o Supplementary Figure 2: As in the main figure for neocortical organoids the PCH-1 condition is missing (see comment on organoid growth curves). Additionally, the color/shape code of the plots in B does not always match the legend (e.g., size in left plot is different and color of PCH-3 in middle and left plot differs from legend and right plot). o It is unclear why the cortical organoids are referred to as "neocortical organoids" in the figures and the text. The methods and the reference in the methods as well as all major papers rather use the word "cortical".

      References: Bardy, C. et al. Neuronal medium that supports basic synaptic functions and activity of human neurons in vitro. Proc National Acad Sci 112, E3312 (2015). Eichmüller, O. L. et al. Amplification of human interneuron progenitors promotes brain tumors and neurological defects. Science 375, (2022).

      CROSS-CONSULTATION COMMENTS I agree with the comments of the other reviewers and as they are mostly matching, this reinforces the importance to improve certain aspects of the manuscript. As there are no deviating issues I do not comment specifically on any reviewer comments.

      Significance

      This work is using organoid technology to shed light on brain region-specific phenotypes in PCH2a. Brain organoids have drastically changed the way we study human neurological diseases (Eichmüller and Knoblich 2022), however, most brain organoid research has focused on cortical organoids. Cerebellar organoid protocols exist for some time (Muguruma et al. 2015, Silva et al. 2020, Nayler et al. 2021) but were not yet applied to uncover new disease biology. Especially considering the important role of human-specific cerebellar processes in specific developmental disorders (Haldipur et al. 2021) and cancer (Hendrikse et al. 2022, Smith et al. 2022), disease modeling in human cerebellar organoids holds great potential for understanding disease biology. The work by Kagermeier et al. demonstrates that human cerebellar organoids are recapitulating brain region-specific growth deficits and thus is an important step forward for disease modeling. Therefore, this work will be interesting to researchers working on brain development and disease modeling, especially in in-vitro systems. Nevertheless, the mechanistic insight of the study is limited, as is the insight into how human-specific processes might be involved in the pathogenesis of PCH2a. Therefore, it will be interesting how this disease model will be used in future to investigate the cell types and mechanisms involved in the PCH2a phenotype.

      Personal field of expertise: Brain organoids and disease modeling in organoids especially of neurodevelopmental diseases. Analysis of organoids with stainings, as well as sequencing techniques, and bioinformatics.

      References:

      Eichmüller, O. L. & Knoblich, J. A. Human cerebral organoids - a new tool for clinical neurology research. Nat Rev Neurol 1-20 (2022) doi:10.1038/s41582-022-00723-9.

      Haldipur, P. et al. Evidence of disrupted rhombic lip development in the pathogenesis of Dandy-Walker malformation. Acta Neuropathol 142, 761-776 (2021).

      Hendrikse, L. D. et al. Failure of human rhombic lip differentiation underlies medulloblastoma formation. Nature 609, 1021-1028 (2022).

      Muguruma, K., Nishiyama, A., Kawakami, H., Hashimoto, K. & Sasai, Y. Self-Organization of Polarized Cerebellar Tissue in 3D Culture of Human Pluripotent Stem Cells. Cell Reports 10, 537-550 (2015).

      Nayler, S., Agarwal, D., Curion, F., Bowden, R. & Becker, E. B. E. High-resolution transcriptional landscape of xeno-free human induced pluripotent stem cell-derived cerebellar organoids. Sci Rep-uk 11, 12959 (2021).

      Silva, T. P. et al. Scalable Generation of Mature Cerebellar Organoids from Human Pluripotent Stem Cells and Characterization by Immunostaining. J Vis Exp (2020) doi:10.3791/61143.

      Smith, K. S. et al. Unified rhombic lip origins of group 3 and group 4 medulloblastoma. Nature 609, 1012-1020 (2022).

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank all three Reviewers for their thorough assessment of our manuscript and their constructive comments and suggestions.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this study, the authors generate several variants of actin that are internally tagged with short peptide tags. They identify one particular position that is able to tolerate various tags of 5-10 amino acids and still shows largely unaltered behavior in cells. They study incorporation of their tagged actins into filaments, characterize the interactions of G-actin variants with different associated proteins and show that retrograde actin flow in lamellipodia and the wound healing response of epithelial cells is not affected by the tagged variants. They then apply the tagged actin to study subcellular distribution of different actin isoforms in mammalian and yeast cells.

      The identification of a specific site in the actin protein that tolerates variable peptide insertions is very exciting and of fundamental interest for all research fields that deal with cytoskeletal rearrangements and cellular morphogenesis. The result demonstrating the functionality of actin variants with peptides inserted between aa 229 and 230 are generally convincing and well done. In particular, the generation of CRISPR/Cas9 genome edited versions of beta- and gamma actin are impressive. I therefore generally support publication of this study. There are however several technical and conceptual issues that should be addressed to improve quality and scope of the study. I listed some specific comments below:

      We thank the Reviewer for their constructive comments and general support for publication of our study.

      Major points

      - The biggest issue I have is the last section on the application of tagged actins to study isoform functions. In principle the application is very clear as there are simply no alternative ways to study isoform distribution in live cells. However, the experimental data are simply not convincing. What the authors define as "cortex" in Fig. 5A seems to rather represent cytosolic background mixed with radial fibers. I am not convinced that even the antibody staining with a relatively clear differential distribution of beta and gamma really shows a genuine accumulation of one isoform on stress fibers. It seems to me that the beta-actin staining has as higher cytosolic background and is generally weaker (gamma nicely labels transverse arcs), which reduces signal/noise and therefore yields a relatively increased level in areas with less-bundled actin. My suggestion is to select more clearly defined actin structures and to use micro-patterned cells to normalize the otherwise obstructing variability in actin organization. Possible structures would be cortical arcs in bow-shaped cells, lamellipodial edges (HT1080 seem to make very nice and large lamellipodia) or cell-cell contacts (confluent monolayer, provided cells don´t grow on top of each other). Stress fibers are possible but need to be segmented very precisely and I did not see any details on this in the methods section. For Fig. 5D: I assume cells were used where only one isoform was tagged? This is technical weak and the double-normalization is probably blurring any difference that might be occurring. Why not use a double-tagging strategy with ALFA/FLAG or ALFA/AU5 tags to exploit the constructs introduced in the previous figures? Also, the unique selling point of the strategy is the possibility of actual live imaging of specific isoforms. Cells that have stably integrated double tags and then transiently express nanobodies for ALFA and either AU5 or FLAG (or other if those don't exist) would make this possible. Considering the work already done in this manuscript, such an approach should actually be possible - did the authors attempt this or is there are reason it is not discussed? If double tagged cells are not possible for some reason it should at the very least be possible to combine ALFA-detection with the specific antibody against the other isoform and get rid of the double normalization.

      We thank the Reviewer for the various suggestions regarding the comparison between the localization of the tagged and native isoforms. In our reply below, we will separately discuss the possibilities and our considerations for fixed samples and live cell imaging. We apologize for the lengthy response but for transparency reasons, we would like to give a thorough overview of our efforts for isoform-specific localization in cells, something for which we have limited space in the manuscript.

      Fixed samples:

      It was a significant experimental challenge to comparing the labeling of the β- and γ-actin specific antibodies with our internally tagged actin system (Fig. 5A-D). The reason for this is that the labeling of the samples with the β- and γ-actin specific antibodies requires treatment with methanol (Dugina et al., J Cell Sci, 2009), most likely to disturb the interaction of actin with actin-binding proteins that prevent the binding of the antibodies due to steric hindrance. Methanol treatment, however, precludes the co-labeling with phalloidin, likely due to changes in the tertiary/quaternary protein structure of F-actin. Initially, we have put a lot of effort in trying to simultaneously label phalloidin with the actin specific antibodies but even very brief methanol treatment (seconds), before or after phalloidin labeling, completely prevents/reverses the binding of phalloidin. Importantly, also the ALFA tag labeling was suboptimal after methanol treatment.

      The fact that we could not perform these double labelings led us to perform different ratio calculations for the β- and γ-actin antibody and the ALFA tag labeling. In the case of the antibody immunofluorescence labeling, we simply divided the signal of the β-actin and γ-actin since we could simultaneously label the isoforms in the same cell. In the case of the ALFA tag labeling, we used phalloidin for independent signal normalization and then performed a second normalization. Although this complicates the normalization procedure (ALFA tag signal of β- and γ-actin is first normalized to total F-actin and then a ratio is calculated) and understandably leads to some confusion, this was the only way forward to obtain the results presented in the manuscript.

      The Reviewer points out that “What the authors define as "cortex" in Fig. 5A seems to rather represent cytosolic background mixed with radial fibers.”. In our images, we observe very little cytosolic background from both antibody stainings. More importantly, for the quantitative analysis, the fluorescence intensity values were corrected for the background values observed in cytosolic areas so even if the signal is present, it should not affect our analysis. We do admit though that we could have been more careful with the term “cortex” since the observed signal could indeed be a mix of radial fibers and the actin cortex. The reviewer further states that “I am not convinced that even the antibody staining with a relatively clear differential distribution of beta and gamma really shows a genuine accumulation of one isoform on stress fibers.” Although the differences are small, we consistently observe a differential fluorescence intensity of β- and γ-actin in actin-based structures with a relatively stronger signal of γ-actin in stress fibers (Fig. 5C). Since we always normalize the fluorescent signal intensity per cell, this strongly indicates a genuine accumulation of one isoform over the other in specific actin-based structures. This observation is very consistent in our experiments and also aligns with many published studies where differences in the localization of β- and γ-actin are reported in various cell types (Pasquier et al., Vasc Cell, 2015; van den Dries et al., Nat Comms, 2019; Malek et al., Int J Mol Sci, 2020). As for the segmentation, we mentioned in the Methods section that we selected small regions (0.5x0.5mm) that exclusively contain stress fiber or “cortex” regions. The regions shown in Fig. 5B are therefore larger than the analyzed regions, something which we will better indicate in the revised manuscript.

      Planned revision: We will provide a more detailed explanation of our quantitative analysis in the Methods section such that it is more clear how our normalization procedure was performed. Furthermore, we will adapt Fig. 5A-B such that it better visualizes how we defined the regions for quantification. As per the Reviewer’s suggestion, we will also apply a different experimental method to show that the tagged isoforms properly localize to actin-based structures. For this, we will attempt to use micropatterned cells to induce clearly define actin-bases structures (the crossbows as suggested by the Reviewer) and also explore the possibilities of investigating the differential localization in double-tagged cells. We will also reconsider the use of the term “cortex” for the region that is pointed out in Fig. 5A-B.

      Live cell imaging:

      We agree with the Reviewer that it would be very valuable to attempt simultaneous live cell imaging of two isoforms. Yet, for this, we would need two tag/fluorophore systems that allow the visualization of internally tagged isoforms in living cells. As presented in our original manuscript, we have successfully inserted many different epitope tags (FLAG/AU1/AU5/ALFA) in the T229/A230 position to demonstrate the versatility of our tagging approach. Yet, despite significant efforts to identify a second tag/fluorophore system that would allow isoform-specific live cell imaging, we only succeeded in designing one strategy to perform live cell imaging, i.e. with the ALFA tag (Götzke, Nat Comms, 2019). Part of the reason for this is that so far, no high affinity nanobodies have been generated against the classical epitope tags (FLAG, AU5 etc.). This is an established challenge since classical epitope tags are typically linear/unstructured while nanobodies require folded secondary structures for epitope recognition such as alpha helices (the ALFA tag was specifically designed as such).

      Besides the successful ALFA tag approach we have tried the following additional approaches for live cell imaging: 1) __full-length GFP, 2) full-length GFP with linker, 3) GFP11 (to complement with GFP1-10 (Cabantous et al., Nat Biotech, 2005) 4) GFP11 with linker 5) FLAG Frankenbodies (Zhao et al., Nat Comms, 2019; Liu et al., Genes Cells, 2021) in FLAG IntAct cells and 6) __Tetracysteine/FlAsH labeling. Importantly, each of these additional internally tagged actins, except for those that contained full-length GFP, showed a high colocalization with the cytoskeleton, again demonstrating the versatility of the T229/A230 position to tag actin. Unfortunately, none of these approaches satisfactorily visualized the actin isoforms in living cells. We will therefore briefly summarize our findings here.

      (1-2, integration of full-length GFP and GFP with linker) Probably not surprisingly, but integrating the entire coding sequence of GFP or GFP flanked by linkers (each 5AA in length) within the T229/A230 position did not results in a proper localization of actin.

      (3-4, integration of GFP11 and GFP11 with linker) Next, we assessed the localization of the GFP11 tagged actin versions (GFP11: 16AA, GFP11+linker: 26AA). Because GFP11 is not visible without GFP1-10 complementation, we also tagged actin at the N-terminus simply for proof of concept where the internally tagged actins would end up. Interestingly, both GFP11-actin and GFP11+linker-actin properly integrated within the cytoskeleton as demonstrated by the FLAG staining. This again demonstrates the versatility of the T229/A230 position and strongly suggests that even the integration of 26AA within this position does only minimally affect the polymerization of actin into the cytoskeleton.

      (3-4) After confirmation of the proper integration of GP11-actin and GFP+linker-actin we continue to express the GFP1-10 in these cells. Unfortunately, this resulted in no or only very minimal localization of the actin to the cytoskeleton, demonstrating that GFP-complementation hampers the integration into the cytoskeleton.

      (5, use of FLAG Frankenbodies) We also expressed FLAG Frankenbodies into our FLAG IntAct cells in an attempt to visualize the isoforms in living cells. FLAG Frankenbodies are single chain antibodies fused to GFP and can be expressed in cells to visualize FLAG-tagged proteins (Liu et al., Genes Cells, 2021). Although a cytoskeletal labeling was indeed discernable in some cells, the FLAG Frankenbody signal overlapped much less with the total actin signal as compared to the FLAG immunofluorescence labeling, indicating that the incorporation of the FLAG-tagged actin was much less in the presence of the FLAG Frankenbody. Also, a significant fraction of the cells demonstrated a homogenous cytosolic signal.

      (6, Use of tetracysteine/FlAsH) Although the tetracysteine tag/FlAsH system is widely known to induce artefacts, we still aimed to evaluate if for live cell imaging of IntAct actins. Similar to GFP11, we first determined the integration of tetracysteine-actin into the cytoskeleton with the use of an additional N-terminal FLAG tag and demonstrate that it was properly integrated into the actin cytoskeleton. Unfortunately, after brief incubation with FlAsH-EDT2, we noted 1) a significant amount of background fluorescence, preventing proper actin visualization and 2) that the cell became static indicating toxicity of the FlAsH-EDT2 compound. Titrating down the amount of FlAsH-EDT2 did not alleviate these drawbacks and only resulted in less fluorescence.

      Overall, based on these experiments, we concluded that the T229/A230 position itself is very versatile, as demonstrated by the proper localization of the GFP11-actin variants and the TetraCys-actin. At the same time, none of these tag/fluorophore systems properly visualized actin in living cells. Although we are unsure what the reason is for this, it is easily imaginable that the on/off kinetics of the split GFP system and the FLAG Frankenbodies are suboptimal to allow for the rapid and continuous integration of actin monomers into the F-actin cytoskeleton. We therefore also concluded that currently, the ALFA tag/nanobody system is apparently unique in its ability to visualize epitope tagged actin in living cells (as shown in the manuscript). For simultaneous visualization of multiple isoforms, we rely on progress on the development of novel nanobody-based tags, something we hope the Reviewer will agree is outside the scope of the current work.

      *- The authors make a point of comparing the internally tagged actin to N-terminal tags that are mostly functional but have been shown to affect translational efficiency. I would strongly suggest to include N-terminally tagged actin as control for all assays in this study. Also for the physiological assays (retrograde flow, wound healing), a positive control is missing that shows some effect. Previous studies showed defects with transiently expressed actin with an N-terminal GFP. As retrograde flow measurements are very sensitive to the exact position of the kymographs and wound healing assays is a very crude and indirect readout, such a positive control is essential. *

      We acknowledge that N-terminally tagged actin has been used extensively for actin research (especially before the introduction of Lifeact). For our studies, however, we were specifically interested in whether the internally tagged actins show similar characteristics as compared to wildtype actin. We have not included N-terminally tagged actin in all of our experiments, since this would not affect our conclusions with respect to the functionality of our internally tagged actins. We expect that for future investigations to for example further establish the importance of actin N-terminal modifications in the differential regulation of actin isoforms, the comparison between internally and N-terminally tagged actins could be very instrumental. Yet, we consider this comparison outside the scope of the current manuscript. For now, the results in the manuscript provide evidence that our approach is unique with respect to the fact that it allows isoform-specific tagging without manipulating the N-terminus. As such, our internal tagging system complements the already existing repertoire of actin reporting methods (N-terminal fusion, Lifeact, F-Tractin, actin nanobodies) and allows researchers to study so far unknown properties of actin variants.

      *- Expression of tagged actins in yeast is a very nice idea but it would be far more informative to express the tagged forms as the only copy of actin. This can either be done by directly replacing endogenous actin gene in S. cerevisiae, or (if the tagged versions are not viable) - using the established plasmid shuffle system (express actin on counter-selectable plasmid, then knock out endogenous copy and introduce additional plasmid with tagged actin, then force original plasmid out). In the presence of endogenous S. cerevisiae actin the shown effects are very hard to interpret as nothing is known about relative protein levels (endogenous vs. introduced). Also, if constitutive expression of the ALFA nanobody is harmful for integration into cables, why not perform inducible expression of the nanobody and observe labeling after induction. For the live imaging a robust cable marker is needed, like Abp140-GFP. Finally, indicate the sequence differences between the used actin forms in yeast (supplementary figure with sequence alignment and clear indication of all variations) *

      We thank the reviewer for their positive comments and feedback regarding expression of IntAct variants in yeast. Currently, we have expressed IntAct as an extra copy in the presence of native Act1 of S. cerevisiae. All the IntAct variants have been expressed under a commonly used constitutive TEF1 promoter. We agree with the Reviewer that it would be valuable to attempt to express the tagged forms as the only copy of actin.

      Planned revisions:

      1) As per the Reviewer’s suggestion, we will attempt to make yeast strains with IntAct as the sole expressing actin copy by using the well-established 5-FOA-based plasmid shuffle system in yeast. We will use a ∆act1 strain containing wildtype act1 in a centromeric ura-plasmid described in Harrer et. al, 2007 (generously shared by Prof. Jessica and Prof. Amberg at Upstate Medical University of New York, USA) and express IntAct exogenously via additional plasmids. Shuffling of these strains on 5-FOA will cause the loss of ura-plasmid containing the wildtype act1 copy and will determine whether yeast cells will be able to survive with IntAct as the sole source of actin. If the cells do survive with IntAct as a sole copy, we will perform subsequent analysis for assessing actin cytoskeleton organization under these conditions.

      2) As the reviewer has mentioned, expression of NbALFA during live-cell imaging experiments hindered incorporation of IntAct into linear actin cables in yeast (Suppl. Fig. S13). As per the reviewer’s suggestion, we will now try to create an inducible-expression system for the NbALFA-mNG and observe its effects on incorporation into formin-made actin cables after induction. We have already created NbALFA-mNG constructs under galactose-inducible GALS and GAL1 promoters and are currently constructing yeast strains for these experiments.

      __3) __We will add an extra supplementary Figure to indicate the sequence differences of the various actin variants that we have expressed in yeast.

      - As the authors clearly show good integration of several tagged actins into filaments I would expand the structural characterization: perform alpha fold predictions of actin monomer structures including the various tags to show the expected orientation. It is striking that the only integration site that seems to work well is at the last position of a short helix, indicating that the orientation of the integrated peptide might be fixed in space and be optimal to minimize interference. Also, a docking of the tag onto the recently published cryoEM structures of the actin filament should be shown to indicate where it resides compared to tropomyosin or the major groove where most side binding proteins seem to bind.

      We already performed AlphaFold predictions of the tagged actin monomers, but we have decided to not include these predictions in the manuscript because of two reasons. First and foremost, while the prediction confidence of the non-tagged region is very high (pLDDT > 90), the prediction confidence of the tagged region is very low (pLDDT https://alphafold.ebi.ac.uk/faq), pLDDT values below 70 should be treated with caution and values below 50 should not be interpreted. Intriguingly, the low confidence aligns with the fact that for both tags, the prediction does not match with known features of the tag. The FLAG tag should be a linear/unstructured region in order to be recognized by the antibody and the ALFA tag should organize into an alpha helix (Götzke et al., Nat Comms, 2019). Yet, in the prediction, the FLAG tag partially continues as an alpha helix and the ALFA tag is only a small helix with part of the tag being unstructured. Second, more minor, reason for not including the predictions is that AlphaFold does not predict to what extend the tag is flexible, which means that even if the tagged region is predicted correctly, it is difficult to say whether the regions will interfere with binding of proteins.

      Despite the low prediction confidence, we used the published actin-tropomyosin cryoEM structure (von der Ecken et al., Nature, 2015) to replace WT actin with ALFA tag actin and the results are shown below. Again, although results should be interpreted with caution, the tag does not seem to obstruct monomer-monomer interactions within an F-actin filament and also the tropomyosin binding surface is relatively distant from the tag region, suggesting that these interactions are likely not disturbed by introducing the tag.

      - For any claims regarding usability of tagged variants for isoform research it would be very important to characterize the known posttranslational modifications of tagged actin variants - are the differences between beta and gamma maintained on this level as well?

      Planned revision: Following the Reviewer’s suggestion, we will perform a western blot analysis to compare posttranslational modification (arginylation) of tagged and wildtype actins.

      Technical issues

      - There is no scale for the color coding in Fig. 5A, B

      We deliberately did not add a numerical scale because the images are normalized which means that presenting the actual numbers might be misleading. The numbers could be interpreted as if they actually present the amount of β-actin relative to γ-actin which is not the case due to staining differences and the normalization procedure.

      - The y-scales for Fig. 5C and D need to be identical to allow direct comparison

      Planned revision: We will adapt the scale of Fig. 5D to make it identical to Fig. 5C. Following the other suggestions of the reviewer, we will also critically evaluate our normalization procedure and present those numbers in Fig. 5C-D if the values turn out to be different.

      - Pearson coefficient should not be normalized to a control value as its already a dimensionless parameter. Always report actual R-value - also remove R2 values for Pearson as this makes no sense in this context (not sure if it was a typo or intended).

      We normalized the Pearson coefficient values for visual representation of the results. The majority of the raw coefficient values (more than 80%) are between 0.20 and 0.75 (see raw values in the associated excel file). Theoretically, Pearson coefficient values are possible between 1 (or-1 for negative correlations) and 0. The much smaller window in our values as compared to the theoretical window (0.55 vs 1) led us to normalize the values such that they can be presented on a scale from “maximum expected colocalization” to “minimum expected colocalization”. In this way, the differences between the various tagged actins are much better appreciated in the Figure. As to reporting the R2, the Reviewer is correct. Reporting the R2 is an inadvertent mistake from our side and we will correct it.

      Planned revision: We will change the R2 in the text to PCC or Pearson Correlation Coefficient.

      *- All values on subcellular regions (like stress fiber or cortex) dependet critically on the way thesese regions were thresholded or identified. Provide all details on how this was done in the methods section and ensure that adequate background subtraction and normalization is applied. Optimally, an unbiased (AI or automated) approach based on simple image statistics is used for this to avoid personal bias. *

      Planned revision: As also indicated above, we will add new experiments to better compare the localization of the isoforms in tagged and parental cells. These new experiments will also be accompanied by a more detailed explanation of how the regions were selected and quantified.

      - In Fig. 2A only heterozygous FLAG-actin cells are used. Why not use a homozygous line (for both beta and gamma actin)? The nice band shift of the FLAG version would allow the precise quantification of the fraction of total actin covered by beta and gamma actin, which then could provide some additional info for the apparently weaker beta staining in Fig. 5 (if beta expression is simply weaker). This would be a very simple and useful advantage of the internal tags that could be widely applied.

      In Fig. 2A, we used the heterozygous FLAG-actin cells to directly compare the production of β-actin from the knock-in allele and the wildtype allele in the same cells. The fact that the two bands observed in this western blot analysis (upper and lower) are almost the same (with the FLAG band being a bit more intense) provides the strongest indication that the tag does not interfere with the expression of actin. In Suppl. Fig. 5D, we show that the expression of β-actin is also unaffected in the hemizygous FLAG actin cells, which exclusively express tagged actin.

      Planned revision: As per the Reviewer’s suggestion, we will also add a western blot analysis on the expression of both actin isoforms and total actin in hemizygous cells.

      *- Fig. 3: control with N-terminal tag is missing. Also, why is it not possible to assay filament binding factors like Myosin, Filamin or alpha actinin - instead of co-IP a simple co-sedimentation assay with cell extracts in F-buffer should pick up any major difference in decoration of filaments containing the ALFA tag. Using two speeds for centrifugation it might even be possible to observe effects on filament bundling. The best approach for this would of course be to purify tagged actins and perform in vitro assays but this is clearly beyond the scope of what the authors intended here. I personally think that a broad acceptance of the marker will only come once the biochemistry has been sufficiently characterized so this is a future direction I would strongly encourage. *

      We kindly refer to our response on Page 5/6 for why we have not included the N-terminal control.

      Planned revision: The co-sedimentation assay is an excellent suggestion by the reviewer. Following the Reviewer’s suggestion, we will perform F/G-actin fractionation and assess the presence of several F-actin associated proteins in the F-actin fraction.

      - Fig. 2A has no loading control

      We show this western blot to indicate that the WT actin and tagged actin are expressed at similar levels in the heterozygous knock-in cells. For this, no loading control is needed because we only compare the intensity of the upper band (tagged actin) with the lower band (WT actin).

      - The RPE-1 data are confusing as several constructs show very different localization (completely cytosolic) to HT1080 cells and there is no possible explanation given for this. Maybe simply remove this data set?

      We agree with the reviewer that the differences in the localization between some of the internally tagged actins between the HT1080 and RPE1 cells might be confusing, especially for the A230-A231 variant for example. Yet, the fact that also in these cells, the T229-A230 variant performs equally well as compared to N-terminally tagged actin is an important confirmation that this variant is properly integrated into actin-based structures, independent of cell type. This makes the support for choosing this variant to continue with our studies stronger. A possible explanation for the differences is that RPE1 cells in general tend to form more stress fibers as compared to the HT1080. Since the localization to stress fibers is different between the internally tagged actins, this may explain the differences observed in colocalization.

      __Planned revision: __We will add a short text, in the Results or the Discussion, on the differences between the colocalization values between HT1080 and RPE1 cells.

      *- The angel measurements for lamellipodial actin is not very meaningful: the angel is determined for the radial bundles, which do not correspond to the Arp2/3 angel of single filaments and is likely the results of different nucleation factors, I would suggest to remove this. If angel measurement are really intended, cryoEM needs to be performed. *

      We apologize for this misapprehension from our side which is also noted by the other two reviewers. In the treadmilling videos of the lamellipodia in HT1080 cells, which were obtained using Airyscan super-resolution microscopy, we clearly observe a consistent filament formation at a constant angle, something which we interpreted as the angle between the mother filament and the daughter filament. After consulting the literature, we indeed have to admit that this cannot be interpreted as such and we will remove these datasets.

      Planned revision: We will remove the datasets with the angle measurements (Suppl. Fig. 7A-B) from our manuscript.

      - Replace all SEM with SD values - use at least 3 biological replicates (4D SEM of n=2)

      Planned revision: We will carefully check our statistics and revise where appropriate.

      Minor points

      - Intro: after listing all the details already understood on actin isoforms it is not very convincing to simply state the molecular principles remain largely unclear (l 34) - maybe better "there is no way to study actin dynamics due to current limitations of specific antibodies to fixed samples. Interesting option would be actually to develop nanobodies that are isoform specific.

      We will rephrase the text in the introduction. Regarding the development isoform-specific nanobodies. Although this sounds like a promising way forward, this would likely not result in isoform-specific targeting in living cells. Similar to the antibodies, isoform-specific nanobodies would have to be generated against the N-terminus which, under native conditions, is likely not available due to the occupation with actin-binding protein. Also, since the N-terminus is not structured, it may be extremely challenging to generate nanobodies against these epitopes.

      *- L 71: "involved" in the kinetics is not a good term - maybe affects or regulates.... *

      We will rephrase the text.

      - L148: "suspect" instead of "expect" - this clonal variation is actually a big danger of the employed approach as possible defects in actin organization could be masked by compensatory changes - it would generally be good to show critical data for at least 3 independent clones to rule out dominant selection effects.

      We will rephrase. We agree that clonal variation could be a danger if actin levels are to be investigated. For future follow-up studies, we plan to make additional cell lines to avoid clone-specific conclusions.

      ***Referees cross-commenting** *

      *I completely agree with the comments by reviewer 2 on the various missing controls - adding several or all of those will make the results much more convincing. The key for the adaptation of any new actin probe will be the level of confidence researchers have on the doumented effects. Even some negative effects on actin behavior (I am sure there will be some) should not prevent usage of the strategy as long as there is robust and convincing documentation of those effects. I also agree that including some basic in vitro characterization will go a long way to convince people dierectly working on actin (there is a very high level of biochemical understanding in that field). *

      Planned revision: We will perform the essential controls as suggested by Reviewer 2. Furthermore, for future experiments, we do envisage the production and purification of internally tagged actins and investigate their binding properties in in vitro reconstitution assays. We have already started with optimizing these approaches through our ongoing collaboration (KD, SP).

      Reviewer #1 (Significance (Required)):

      *Significance: Very useful finding that can be applied to any question related to actin-dependent cellular processes (morphogenesis, cell division, cell polarization, cell migration etc.) *

      *Strength: main finding convincing, strong genome edited cell lines *

      *Limitations: application to study of isoforms very limited and data not convincing, statistics and image quantifications need improvement *

      *Advance: identify new location for integral tagging of actin, which was not really possible before. The main relevance is for fundamental cell biology but the approach can also be applied to the study of disease variants in actin. *

      Audience: general cell biology - very broad interest

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)): __

      Actin is highly sensitive to modifications, and tagging it with fluorescent proteins or even smaller motifs can affect its function. The most well-known example of this is that fission yeast where actin has been replaced with GFP-actin are inviable (Wu and Pollard, Science 2005) because the labeled actin cannot incorporate into the formin-dependent filaments that make up the cytokinetic ring. Subsequent experiments revealed that formins filter out GFP-actin monomers, as well as monomers that are labeled with smaller fluorescent motifs (Chen et al, J. Structural Biology 2012). Further, attempts to make mammalian cells lines where GFP-beta-actin was knocked into one allele resulted in extreme down-regulation of the GFP-labeled actin, indicating that there is some implicit toxicity with the labeled version. To my knowledge, all attempts at making homozygous GFP-actin knock-ins have been unsuccessful. Therefore, while GFP-actin or other labeled variants can be over-expressed in many different cell types with some success, there is always the question of how faithful the labeled actin represents bona fide actin localization and dynamics.

      To address this van Zwam et al. have developed a clever strategy of screening actin for internal motifs that can tolerate incorporation of a tag without affecting its function. They appear to have found a good candidate, named IntAct, and provide evidence that this tagging position allows the actin to be functional in both human and yeast cells. The work is very promising, and many of the assays performed satisfy the criteria of rigor and reproducibility. Importantly, the authors have created knock-in human cell lines where the tagged actin is expressed at normal levels, including a double allele knock-in that is viable and has normal proliferation and motility. Additionally, the authors show that labeled S. cerevisiae actin can incorporate into actin cables, which are formin dependent. IntAct constructs were shown to interact with several well-known actin binding proteins and localized well to many different actin structures. There was also interesting data obtained from tagging both beta and gamma actin in human cells. However, as an actin scientist eager for new probes to visualize actin in cells, there are still questions about the functionality of these probes. Addressing these issues, listed below, would alleviate the concerns I still have about IntActs after going through the manuscript. IntActs have the potential to have a large impact on cytoskeletal research if it can be rigorously documented that they are functionally as close to unlabeled actin as possible.

      We thank the Reviewer for their constructive comments and general positive evaluation of our study.

      *Reviewer #2 (Significance (Required)): *

      Concerns:

      1. There are no negative controls performed for either the fixed or live-cell imaging of IntAct. Since the fixed cell data is heavily reliant on the presence of flag-labeled puncta at actin filaments, it is important to show that the immunocytochemistry protocol doesn't produce anything that would mimic the localization of actin. For the live cell data, there has been no effort made to show that the binding of the nanobody to the ALFA tag on InAct is specific.

      Planned revision: __We will add the following controls to exclude that any of the labeling procedures produces anything that would mimic the localization of actin: 1) Immunofluorescence staining of the used tags (FLAG/ALFA) in cells that do not have tagged actins 2) Expression of ALFA-Nb-GFP and ALFA-Nb-mScarlet in cells that do not have tagged actins 3)__ Expression of free GFP in cells that have tagged actins. We will co-stain these cells with phalloidin to visualize F-actin and determine if any signal is specifically localized to the actin cytoskeleton.

      2. The homozygous ALFA-tagged IntAct cells have a 50% reduction in the amount of actin expression (Fig. 2D). What is the F:G ratio in these cells? The F:G measurement is only shown for the FLAG-tagged heterozygous IntAct cells, which have the worst co-localization with phalloidin (Fig. 2F) and were not used for subsequent figures. I appreciate that motility and proliferation were measured and shown to not be affected (Fig. 4D,E) , but in our lab reducing the amount of polymerized actin by 50% (which may be more in ALFA-tagged IntAct cells if the F:G changes) has catastrophic effects on other cytoskeletal and organelle systems. Since the homozygous ALFA IntAct cells are the main ones used in the manuscript, they should be the ones that are fully characterized.

      We would like to point out that the reduction is only 20-25 percent depending on the specific western blot analysis and the loading control. Still, the Reviewer is correct about the necessity of the F:G actin measurements of the ALFA-tagged IntAct cells and we therefore included those as Suppl. Fig. 9 in the original manuscript (text on page 9). The quantification of these assays clearly demonstrated that the F-G actin ratio in the ALFA-tagged IntAct cells is the same as in parental cells.

      3. It is not addressed if expressing the ALFA-Nb-GFP construct in ALFA-IntAct cells alter actin properties? This is essential information for live cell imaging experiments.

      Planned revision: We have already performed proliferation and migration experiments in cells that stably express the ALFA-Nb-GFP. These data indicated that proliferation and migration are not affected by the presence of the nanobody and these data will be included in the revised manuscript. To note, in the original manuscript, we already showed that treadmilling of actin at the lamellipodia is not affected by the presence of the ALFA-Nb-GFP.

      4. It is not addressed how much of the ALFA-IntAct gets labeled with ALFA-Nb-GFP and how uniform the labelling.

      We do not understand this specific request of the Reviewer. To our knowledge, it is not possible to assess how much of a probe (in this case the ALFA-Nb-GFP) binds the target (in this case the ALFA-IntAct actins) in living cells. This is not only the case for the ALFA-Nb-GFP but also for any other probe. As an example, when expressing Lifeact, we also do not know how much of the actin molecules within F-actin get labeled with Lifeact and how uniform the labeling is. From the results of the live-cell imaging we can only conclude that the binding is at least so effective that we can readily observe and discern all the actin-based structures that are also observed by Lifeact (see Suppl. Fig. 8 for Lifeact-GFP/ALFA-Nb-mScarlet cotransfection). Whether the regions that do not have F-actin only contain ALFA-Nb-GFP that is bound to actin monomers or also contains a significant fraction of free ALFA-Nb-GFP seems an issue that cannot be addressed.

      5. To assess lamellapodia architecture, "branched actin angle" is measured using AiryScan imaging of actin filaments. This type of microscopy does not offer the ability to image individual actin filaments; what is actually being measured is the orientation of actin bundles to each other. It should be impossible to image the orientation of actin filaments in Arp2/3 dendritic networks and it is surprising that the measurements average to 70 degrees. A suitable substitute for this would be to measure the size and amount of F-actin in phalloidin-stained lamellipodia using kymograph analysis.

      We apologize for this misapprehension from our side which is also noted by the other two reviewers. In the treadmilling videos of the lamellipodia in HT1080 cells, which were obtained using Airyscan super-resolution microscopy, we clearly observe a consistent filament formation at a constant angle, something which we interpreted as the angle between the mother filament and the daughter filament. After consulting the literature, we indeed have to admit that this cannot be interpreted as such and we will remove these datasets.

      Planned revision: We will remove the datasets with the angle measurements (Suppl. Fig. 7A-B) from our manuscript.

      6. Was it possible to make an IntAct gene substitution in yeast?

      Planned revision: We thank the reviewer for this interesting question and as also suggested by Reviewer 1, we are now constructing yeast strains with IntAct as the sole expressing actin copy by using the well-established plasmid shuffle system in yeast. The results of these experiments will determine the ability of IntAct to completely substitute actin function in yeast.

      Also, while this is not necessary for this manuscript, making a fission yeast strain where actin has been substituted with IntAct and demonstrating that IntAct gets incorporated into the cytoplasmic ring and into Cdc12p-polymerized filaments would alleviate MANY potential concerns people would have about these probes by directly assessing situations were other labeled actins have been documented to fail. Along the same lines, it would have been nice to see a comparison in some of the assays of ALFA-IntAct and GFP-actin or another labeled actin variant.

      We appreciate the reviewer for their constructive feedback and completely agree that it is important to document how IntAct behaves in scenarios where other labelled actins have failed. As a proof of principle, IntAct incorporates into both formin- and Arp2/3- made linear and branched actin filaments in yeast (Fig.5E, Suppl. Fig. 14) and this data shows that IntAct labelling strategy is the first to achieve good integration into both these structures as previous efforts with labelled actin such as GFP-Actin fail to incorporate into formin-made actin filaments (Doyle et al., PNAS, 1996). Thus, we believe that IntAct does perform better than other labelled actins in yeast, although, further optimizations are required to overcome limitations regarding incorporation into actin cables in the presence of the ALFA nanobody.

      Planned revision: We have already extended applicability of IntAct to another well-known fungal model system, the fission yeast Schizosaccharomyces pombe (S. pombe). We expressed IntAct variants of human β- and γ- actin, budding yeast actin (Sc-IntAct) and fission yeast actin (Sp-IntAct) from an exogenous plasmid under the native S. pombe actin promoter in an S. pombe strain that constitutively expresses the Nb-ALFA-mNG. Live-cell microscopy of S. pombe cells expressing these proteins revealed that all IntAct variants localize to actin patch-like structures located at the cell poles and cell division site (during cytokinesis). These structures show similar dynamics as reported for actin patches of S. pombe previously (Pelham et al., Nat Cell Biol, 2001). These preliminary results suggest that IntAct proteins show a similar localization pattern to only branched actin networks found in the actin patches of S. pombe like we had previously observed for the budding yeast, S. cerevisiae (Fig. S13 in manuscript). The underlying mechanism for this exclusion from linear actin cable network from both budding and fission yeast remain unknown and may represent an inherent specificity and sensitivity of yeast formins. Our current and future experiments will express IntAct variants in absence of the ALFA nanobody and determine the level of incorporation into actin cables, patches, and actomyosin ring.

      Planned revision: We have also already performed a quantitative analysis to ascertain the effect of Sc-IntAct expression of cortical actin patch dynamics which represent sites of endocytosis in yeast (Young et al., J Cell Biol, 2004; Winter et al., Curr Biol, 1997). We compared actin cortical patch lifetimes between wildtype cells and cells expressing Sc-Act1 or Sc-IntAct as an extra copy. We used Abp1-3xmcherry as a marker for actin patches and quantified the time window between the appearance and disappearance of a patch (actin patch lifetime) from time-lapse microscopy experiments. Our preliminary results indicate that actin patch lifetimes are unaffected by exogenous expression of both Sc-Act1 or Sc-IntAct suggesting that IntAct does not negatively influence or alter actin patch dynamics. These observations suggest its applicability as a direct visualization strategy for actin at the cortical patches in budding yeast alongside existing surrogate markers like Abp1, Arc15, etc (Goode et al., Genetics, 2015; Wirshing et al., J Cell Biol, 2023).

      __Reviewer #3 (Evidence, reproducibility and clarity (Required)): __

      *Summary: *

      This paper tackles a new strategy to tag actin in cells, by identifying that incorporation of a tag of moderate size in subdomain 4 of actin minimally affects actin dynamics in cells, and does not perturb its interaction with known partners, as observed in pull-down assays.

      *Major comments: *

      The paper is interesting and experiments are convincing.

      *My main concerns are the following : *

      - Varland et al, is reporting a phosphorylation on Thr229 : I think the authors should mention and discuss this potential PTM that could be affected in IntAct.

      We thank the Reviewer for pointing this out. We are aware of this review that includes phosphorylation on Thr229 as a possible PTM. Yet, this PTM is only reported in one of the Tables of the Review and not further discussed in the text. It is also unclear how the authors determined that Thr229 is a possible phosphorylation site except for the notion that this residue is a threonine and exposed at the surface of the actin molecule. Together with the fact that there is no evidence from primary studies that Thr229 is phosphorylated, we therefore decided to not include it in our discussion.

      - The sequence in subdomain 4 (the alpha helix containing T229A230) is extremely conserved in animals, as well as in between the 6 human actin isoforms. This usually indicates a strong selection pressure on the residues. I think the authors should discuss how surprising it is that the T229A230 position can accomodate various tags while it is probably the place of interaction with other proteins and is playing an important role in the mechanical structural integrity of the actin itself.

      We thank the Reviewer for bringing up this important point. To a certain extent, the conservation argument is true for all of the residues/domains in actin. Any manipulation will change a conserved part of the actin molecule in one way or another and thereby potentially modify its function. This is also evident from the fact that for most of the internally tagged actins, we observed a very poor colocalization with the actin cytoskeleton (Fig. 1). While for the T229/A230, we have not observed any major effects yet, this certainly does not mean that no further changes or defects will be uncovered in future experiments. Nonetheless, since our approach is unique with respect to the fact that it allows isoform-specific tagging without manipulating the N-terminus, our internal tagging system complements the already existing repertoire of actin reporting methods (N-terminal fusion, Lifeact, F-Tractin, actin nanobodies) and allows researchers to study so far unknown properties of actin variants. We have already included in the discussion that, at this point, we can only speculate as to why this variant performs much better than the others (Page 16 of the manuscript) and that possible explanations are the location at the inner domain and the higher structural plasticity of this region as compared to the rest of the molecule, as found during an alanine mutagenesis screen (Rommelaere et al., Structure, 2003).

      - It is now well established that actin plays active and important roles in the nucleus : is ALFA-actin correctly translocated to the nucleus ?

      Planned revision: This is an interesting suggestion. We will perform nuclear-cytosol fractionation experiments and determine whether ALFA-actin is still correctly translocated to the nucleus.

      *- OPTIONAL: one may regret that there is no classical in vitro assays, such as pyrene assays to assess some kinetcis parameters on epitope-tagged actins. I guess this would make the paper a bit too large. Although, it will prove useful to better understand how much formin activity is affected (see below) *

      For further biochemical characterization and a detailed investigation of the precise assembly kinetics of the tagged actins, we (KD, SP) are already working together to set up in vitro reconstitution experiments. Yet, as also indicated by the Reviewer, we consider these experiments outside of the scope of the current work.

      *Minor comments: *

      Below are points that could be addressed by the authors to improve the manuscript readability and highlight some important points that are sometimes missing or are not properly discussed:

      -line 40 "...but the distinct N-terminal epitope is not available under native conditions preventing" is a bit too obscure. Can the authors say clearly what is meant by 'native conditions'?

      In our understanding, the term ‘native’ is generally used when referring to conditions in which proteins are in their natural state, without alterations due to heat or denaturants, and possibly also still interacting with their binding partners. We will rephrase to better indicate that in this specific case, we mean that the region that harbors the N-terminus is usually occupied by actin-binding proteins, preventing the binding of the antibody due to steric hindrance.

      - figure 1A : make a clearer correspondance between the number shown in panel A and the amino acid numbers displayed in panel C and G.

      Planned revision: This is a good point, we will add extra annotation in the graph to better link the panels with each other. We will also add additional annotation in Fig. 1D-F for the same purpose.

      - figure 1A : it could be informative to indicate subdomains in this panel.

      Planned revision: We will add the numbers for the subdomains in Fig. 1A.

      - figure 1C : normalized correlation cell : I am not sure I understand how the normalization of the Pearson coefficient is done. It is therefore not clear how can it >1 or >-1 ? This should be clearly explained in the method section of the paper.

      __Planned revision: __We will better explain the normalization procedure in the Methods section.

      - figure S4 : comes a bit too early when ALFA-actin has not been yet introduced in the main text. Please, reposition this part or provide data with the FLAG-tag version.

      Planned revision: This is a good point and completely overlooked by us. We will introduce this Figure later such that the ALFA tag is already introduced.

      - section starting line 121 : this section should be better motivated = Why are different tags being tested ? This comes later in the discussion, but the reader fails at following the reasoning/motivation here.

      Planned revision: We will add extra motivation for why we added multiple tags.

      - figure 2D, line 145 "We also evaluated actin protein expression in the homozygous ALFA-β-actin cells and this showed that the total amount of β-actin was slightly lower in the ALFA-β-actin cells compared to parental HT1080 cells (Fig. 2C-D)." 'Slightly' is not a very quantitative nor accurate term. please rephrase. Besides, a statistical test for the paired data would also be informative. Besides, data in figure S6B-D indeed show a correlated increase in the expression of Gamma-actin that compensate for the decrease in the Beta-actin level in ALFA-Beta-actin. Can the authors explain why they conclude otherwise?

      Planned revision: This indeed is an important point and we will change the phrasing of this section to provide a more quantitative and accurate description of the western blot quantifications.

      - figure S7B: I am not ure anyone has ever reported measurement of angle of branched actin filament using epifluorescence microscopy. I would remove this panel, or the authors should explain how this measurement can be done objectively.

      We apologize for this misapprehension from our side which is also noted by the other two reviewers. In the treadmilling videos of the lamellipodia in HT1080 cells, which were obtained using Airyscan super-resolution microscopy, we clearly observe a consistent filament formation at a constant angle, something which we interpreted as the angle between the mother filament and the daughter filament. After consulting the literature, we indeed have to admit that this cannot be interpreted as such and we will remove these datasets.

      Planned revision: We will remove the datasets with the angle measurements (Suppl. Fig. 7A-B) from our manuscript.

      *- Figure 2F : can the authors comment on the (significant ?) lower value for FLAG-tag actin ? *

      The lower value for FLAG-tag actin has likely to do with the properties of the antibody and suitability for immunofluorescence. For reason that we do not know, we usually detect more background for the FLAG tag antibody as compared to the other antibodies/ALFA tag nanobody. Since the Pearson correlation coefficient quickly decreases with suboptimal labeling, this is likely the reason that the values for FLAG-actin are lower as compared to the other tagged actins. Importantly, in our biochemistry experiments (F/G-actin), we detect no difference between FLAG-actin and ALFA-actin indicating that it is rather the immunofluorescence and sensitive Pearson correlation analysis than the integration of actin that causes this difference.

      - line 205 "The results from these experiments show that both DIAPH1 and FMNL2 associate with ALFA-β-actin (Fig. 3D),". It is not so obvious that these formins directly interact with monomeric actin via their FH2 domains in co-immunoprecipitation assays. It might very well be mediated by the interaction with profilin, that in turn bind to the FH1 domain of formins. For me, this assay does not make a correct proof that epitope-labelled actin do not interfere with formin activity.

      Planned revision: The point that the co-immunoprecipitation does not demonstrate direct interactions between formins and actin is well taken. We, however, do not claim that this assay proofs that formin activity, or formin-based integration of actin monomers, is similar with tagged actin as compared to wildtype actin. Nonetheless, we will critically re-evaluate the relevant passages and rephrase the text to avoid any confusion.

      - figure 5C&D : both graph should use the same scale for the y-axis for easier comparison.

      Planned revision: We will adapt the scale of Fig. 5D to make it identical to Fig. 5C. Following the other suggestions of the Reviewer (and of Reviewer #1), we will also critically evaluate our normalization procedure and present those numbers in the Figures if the values turn out to be different.

      - figure 5D: I think the way the ratio is performed is misleading. Why not look at the Beta/Gamma ratio using the isoform specific antibodies used in parental cells, and show the results for ALFA-Beta-actin and for ALFA-Gamma-actin separately ?

      We kindly refer to our answer to Reviewer #1 on Page 2 for a detailed explanation on the experimental challenge of comparing the localization of wildtype and tagged actin isoforms.

      Planned revision: We will critically evaluate our normalization procedure and present those numbers in the Figures if the values turn out to be different. Furthermore, we will add a different experimental method to show that the tagged isoforms properly localize to actin-based structures. For this, we will attempt to use micropatterned cells to induce clearly define actin-bases structures and also explore the possibilities of investigating the differential localization in double-tagged cells.

      *- The limitation observed for unbranched cables in yeast that nanobody-tagged ALFA-actin does not incorporate correctly should be discussed and stressed further in the discussion, as it might prove to be a strong limitation for live-cell imaging to reliably study any type of actin networks. *

      We acknowledge the reviewer’s concern regarding the inability of ALFA-tagged actin to incorporate into yeast actin cables when NbALFA is co-expressed and will discuss this point further in the revised manuscript. We have now observed the same limitation for fission yeast actin cables as well and combined, these observations may represent a tighter control and sensitivity of yeast formins towards any perturbations in actin size (since NbALFA binds to ALFA tag with picomolar affinity). To address this issue and as also suggested by Reviewer 1, we are now creating yeast strains with inducible control of NbALFA expression under GALS/GAL1 promoters and observe the labelling of actin structures after this approach. Additionally, expression of variants of NbALFA with high dissociation rates may also allow labelling of actin cables and would be certainly worth a try in the future. A structural comparison between mammalian and yeast formins may be required to shed some light on the molecular basis of this fundamental difference.

      However, since in the absence of the nanobody, this limitation is overcome (Fig. 5E, Suppl. Fig. 14), we believe that with additional modifications and fast developments in imaging technologies, this limitation can be overcome in the future. Thus, IntAct as a labeling strategy represents an advancement over existing labelled actins with the most important aspect being the identification of the T229/A230 residue pair to be permissive for integration of various tags even as large as GFP11 fragment including a linker (26AA) (Reviewer Fig. 2). Importantly, the T229/A230 site is conserved across many organisms (such as Chlamydomonas reinhardatii, Cryptococcus neoformans, etc) and may act as a framework to study the actin cytoskeleton especially in organisms where known surrogate markers like phalloidin and Lifeact may not work or work only sub optimally.

      *Reviewer #3 (Significance (Required)): *

      *General assessment: *

      *This paper provides a new tagging strategy to monitor actin activity in cells, by specifically inserting the tag along the amino acid sequence. *

      *Advance: *

      *This is a very useful tool, as most existing available probes bind to actin in regions that are common to many other actin binding proteins. The authors provide extensive experiments to validate that tagged-actin are functional and do not perturb the actin expression level, actin network architecture nor dynamics. *

      *Audience: *

      *This research paper will be of interest to a rather broad audience (many cell biologists) that are either sutyding actin dynamics or know that actin is involved in the cell functions they study. *

      *Expertise: *

      *My expertise is in vitro actin biochemistry. *

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this study, the authors generate several variants of actin that are internally tagged with short peptide tags. They identify one particular position that is able to tolerate various tags of 5-10 amino acids and still shows largely unaltered behavior in cells. They study incorporation of their tagged actins into filaments, characterize the interactions of G-actin variants with different associated proteins and show that retrograde actin flow in lamellipodia and the wound healing response of epithelial cells is not affected by the tagged variants. They then apply the tagged actin to study subcellular distribution of different actin isoforms in mammalian and yeast cells.

      The identification of a specific site in the actin protein that tolerates variable peptide insertions is very exciting and of fundamental interest for all research fields that deal with cytoskeletal rearrangements and cellular morphogenesis. The result demonstrating the functionality of actin variants with peptides inserted between aa 229 and 230 are generally convincing and well done. In particular, the generation of CRISPR/Cas9 genome edited versions of beta- and gamma actin are impressive. I therefore generally support publication of this study. There are however several technical and conceptual issues that should be addressed to improve quality and scope of the study. I listed some specific comments below:

      Major points

      • The biggest issue I have is the last section on the application of tagged actins to study isoform functions. In principle the application is very clear as there are simply no alternative ways to study isoform distribution in live cells. However, the experimental data are simply not convincing. What the authors define as "cortex" in Fig. 5A seems to rather represent cytosolic background mixed with radial fibers. I am not convinced that even the antibody staining with a relatively clear differential distribution of beta and gamma really shows a genuine accumulation of one isoform on stress fibers. It seems to me that the beta-actin staining has as higher cytosolic background and is generally weaker (gamma nicely labels transverse arcs), which reduces signal/noise and therefore yields a relatively increased level in areas with less-bundled actin. My suggestion is to select more clearly defined actin structures and to use micro-patterned cells to normalize the otherwise obstructing variability in actin organization. Possible structures would be cortical arcs in bow-shaped cells, lamellipodial edges (HT1080 seem to make very nice and large lamellipodia) or cell-cell contacts (confluent monolayer, provided cells don´t grow on top of each other). Stress fibers are possible but need to be segmented very precisely and I did not see any details on this in the methods section. For Fig. 5D: I assume cells were used where only one isoform was tagged? This is technical weak and the double-normalization is probably blurring any difference that might be occurring. Why not use a double-tagging strategy with ALFA/FLAG or ALFA/AU5 tags to exploit the constructs introduced in the previous figures? Also, the unique selling point of the strategy is the possibility of actual live imaging of specific isoforms. Cells that have stably integrated double tags and then transiently express nanobodies for ALFA and either AU5 or FLAG (or other if those don't exist) would make this possible. Considering the work already done in this manuscript, such an approach should actually be possible - did the authors attempt this or is there are reason it is not discussed? If double tagged cells are not possible for some reason it should at the very least be possible to combine ALFA-detection with the specific antibody against the other isoform and get rid of the double normalization.
      • The authors make a point of comparing the internally tagged actin to N-terminal tags that are mostly functional but have been shown to affect translational efficiency. I would strongly suggest to include N-terminally tagged actin as control for all assays in this study. Also for the physiological assays (retrograde flow, wound healing), a positive control is missing that shows some effect. Previous studies showed defects with transiently expressed actin with an N-terminal GFP. As retrograde flow measurements are very sensitive to the exact position of the kymographs and wound healing assays is a very crude and indirect readout, such a positive control is essential.
      • Expression of tagged actins in yeast is a very nice idea but it would be far more informative to express the tagged forms as the only copy of actin. This can either be done by directly replacing endogenous actin gene in S. cerevisiae, or (if the tagged versions are not viable) - using the established plasmid shuffle system (express actin on counter-selectable plasmid, then knock out endogenous copy and introduce additional plasmid with tagged actin, then force original plasmid out). In the presence of endogenous S. cerevisiae actin the shown effects are very hard to interpret as nothing is known about relative protein levels (endogenous vs. introduced). Also, if constitutive expression of the ALFA nanobody is harmful for integration into cables, why not perform inducible expression of the nanobody and observe labeling after induction. For the live imaging a robust cable marker is needed, like Abp140-GFP. Finally, indicate the sequence differences between the used actin forms in yeast (supplementary figure with sequence alignment and clear indication of all variations)
      • As the authors clearly show good integration of several tagged actins into filaments I would expand the structural characterization: perform alpha fold predictions of actin monomer structures including the various tags to show the expected orientation. It is striking that the only integration site that seems to work well is at the last position of a short helix, indicating that the orientation of the integrated peptide might be fixed in space and be optimal to minimize interference. Also, a docking of the tag onto the recently published cryoEM structures of the actin filament should be shown to indicate where it resides compared to tropomyosin or the major groove where most side binding proteins seem to bind.
      • For any claims regarding usability of tagged variants for isoform research it would be very important to characterize the known posttranslational modifications of tagged actin variants - are the differences between beta and gamma maintained on this level as well?

      Technical issues

      • There is no scale for the color coding in Fig. 5A, B
      • The y-scales for Fig. 5C and D need to be identical to allow direct comparison
      • Pearson coefficient should not be normalized to a control value as its already a dimensionless parameter. Always report actual R-value - also remove R2 values for Pearson as this makes no sense in this context (not sure if it was a typo or intended).
      • All values on subcellular regions (like stress fiber or cortex) dependet critically on the way thesese regions were thresholded or identified. Provide all details on how this was done in the methods section and ensure that adequate background subtraction and normalization is applied. Optimally, an unbiased (AI or automated) approach based on simple image statistics is used for this to avoid personal bias.
      • In Fig. 2A only heterozygous FLAG-actin cells are used. Why not use a homozygous line (for both beta and gamma actin)? The nice band shift of the FLAG version would allow the precise quantification of the fraction of total actin covered by beta and gamma actin, which then could provide some additional info for the apparently weaker beta staining in Fig. 5 (if beta expression is simply weaker). This would be a very simple and useful advantage of the internal tags that could be widely applied.
      • Fig. 3: control with N-terminal tag is missing. Also, why is it not possible to assay filament binding factors like Myosin, Filamin or alpha actinin - instead of co-IP a simple co-sedimentation assay with cell extracts in F-buffer should pick up any major difference in decoration of filaments containing the ALFA tag. Using two speeds for centrifugation it might even be possible to observe effects on filament bundling. The best approach for this would of course be to purify tagged actins and perform in vitro assays but this is clearly beyond the scope of what the authors intended here. I personally think that a broad acceptance of the marker will only come once the biochemistry has been sufficiently characterized so this is a future direction I would strongly encourage.
      • Fig. 2A has no loading control -
      • The RPE-1 data are confusing as several constructs show very different localization (completely cytosolic) to HT1080 cells and there is no possible explanation given for this. Maybe simply remove this data set?
      • The angel measurements for lamellipodial actin is not very meaningful: the angel is determined for the radial bundles, which do not correspond to the Arp2/3 angel of single filaments and is likely the results of different nucleation factors, I would suggest to remove this. If angel measurement are really intended, cryoEM needs to be performed.
      • Replace all SEM with SD values - use at least 3 biological replicates (4D SEM of n=2)

      Minor points

      • Intro: after listing all the details already understood on actin isoforms it is not very convincing to simply state the molecular principles remain largely unclear (l 34) - maybe better "there is no way to study actin dynamics due to current limitations of specific antibodies to fixed samples. Interesting option would be actually to develop nanobodies that are isoform specific 
      • L 71: "involved" in the kinetics is not a good term - maybe affects or regulates....
      • L148: "suspect" instead of "expect" - this clonal variation is actually a big danger of the employed approach as possible defects in actin organization could be masked by compensatory changes - it would generally be good to show critical data for at least 3 independent clones to rule out dominant selection effects.

      Referees cross-commenting

      I completely agree with the comments by reviewer 2 on the various missing controls - adding several or all of those will make the results much more convincing. The key for the adaptation of any new actin probe will be the level of confidence researchers have on the doumented effects. Even some negative effects on actin behavior (I am sure there will be some) should not prevent usage of the strategy as long as there is robust and convincing documentation of those effects. I also agree that including some basic in vitro characterization will go a long way to convince people dierectly working on actin (there is a very high level of biochemical understanding in that field).

      Significance

      Significance: Very useful finding that can be applied to any question related to actin-dependent cellular processes (morphogenesis, cell division, cell polarization, cell migration etc.)

      Strength: main finding convincing, strong genome edited cell lines

      Limitations: application to study of isoforms very limited and data not convincing, statistics and image quantifications need improvement

      Advance: identify new location for integral tagging of actin, which was not really possible before. The main relevance is for fundamental cell biology but the approach can also be applied to the study of disease variants in actin.

      Audience: general cell biology - very broad interest

  11. Aug 2023
    1. Author response

      We appreciate the responses from the editors and reviewers and will submit a revised manuscript addressing all of the main points raised. We are glad to see broad agreement that we took a careful approach and addressed a clear question.

      There were questions raised about the framing of the study vis-à-vis prior literature. One question was whether low frequency signals always have larger point spread functions, thereby making our result unsurprising. A second question was whether the notion of alpha oscillations as having wide-spread coherence and relating to system-general states was out-of-date. We appreciate these comments and agree that they could use further discussion. Our view is that neither of these points weakens the study, but our framing could be clearer regarding these two important issues. We will improve discussion of these topics in the revision.

      A second criticism mentioned by two reviewers is the lack of null-hypothesis testing. The value of null hypothesis statistical testing (NHST) in biomedicine is hotly debated, with many statisticians and scientists arguing that NHSTs add little to no value (Gigerenzer & Marewski, 2015; McShane et al., 2019; Meehl, 1978). Others of course disagree (Mogie, 2004). Our goal was not to try to rule out null hypotheses, but rather to make systematic measurements and to report the reliable patterns. We generally focused on observations where the results were well above the noise, obviating the need to test the null. Nonetheless, we can (and will) improve the clarity of our arguments in terms of how we rely on specific statistical analyses to support particular conclusions, as well as how to deal with the issue of multiple electrodes coming from small numbers of subjects, an important point raised by R3. We will clarify these issues in the revision.

      Reviewer 1 also made an interesting point about visual maps having an oculomotor component. We will do our best to incorporate this interesting issue into our revision.

      In addition to the public review, the reviewers made a number of useful recommendations for the revision. We appreciate these recommendations and will carefully consider each of them.

      Gigerenzer, G., & Marewski, J. N. (2015). Surrogate science: The idol of a universal method for scientific inference. Journal of management, 41(2), 421-440.

      McShane, B. B., Gal, D., Gelman, A., Robert, C., & Tackett, J. L. (2019). Abandon statistical significance. The American Statistician, 73(sup1), 235-245.

      Meehl, P. E. (1978). Theoretical risks and tabular asterisks: Sir Karl, Sir Ronald, and the slow progress of soft psychology. Journal of Consulting and Clinical Psychology, 46(4), 806-834. https://doi.org/10.1037/0022006X.46.4.806

      Mogie, M. (2004). In support of null hypothesis significance testing. Proc Biol Sci, 271 Suppl 3(Suppl 3), S82-84. https://doi.org/10.1098/rsbl.2003.0105

    1. Reviewer #3 (Public Review):

      The manuscript by Yang et al. investigated in mice how hypobaric hypoxia can modify the RBC clearance function of the spleen, a concept that is of interest. Via interpretation of their data, the authors proposed a model that hypoxia causes an increase in cellular iron levels, possibly in RPMs, leading to ferroptosis, and downregulates their erythrophagocytic capacity. However, most of the data is generated on total splenocytes/total spleen, and the conclusions are not always supported by the presented data. The model of the authors could be questioned by the paper by Youssef et al. (which the authors cite, but in an unclear context) that the ferroptosis in RPMs could be mediated by augmented erythrophagocytosis. As such, the loss of RPMs in vivo which is indeed clear in the histological section shown (and is a strong and interesting finding) can be not directly caused by hypoxia, but by enhanced RBC clearance. Such a possibility should be taken into account.

      Major points:

      1) The authors present data from total splenocytes and then relate the obtained data to RPMs, which are quantitatively a minor population in the spleen. Eg, labile iron is increased in the splenocytes upon HH, but the manuscript does not show that this occurs in the red pulp or RPMs. They also measure gene/protein expression changes in the total spleen and connect them to changes in macrophages, as indicated in the model Figure (Fig. 7). HO-1 and levels of Ferritin (L and H) can be attributed to the drop in RPMs in the spleen. Are any of these changes preserved cell-intrinsically in cultured macrophages? This should be shown to support the model (relates also to lines 487-88, where the authors again speculate that hypoxia decreases HO-1 which was not demonstrated). In the current stage, for example, we do not know if the labile iron increase in cultured cells and in the spleen in vivo upon hypoxia is the same phenomenon, and why labile iron is increased. To improve the manuscript, the authors should study specifically RPMs.

      2) The paper uses flow cytometry, but how this method was applied is suboptimal: there are no gating strategies, no indication if single events were determined, and how cell viability was assessed, which are the parent populations when % of cells is shown on the graphs. How RBCs in the spleen could be analyzed without dedicated cell surface markers? A drop in splenic RPMs is presented as the key finding of the manuscript but Fig. 3M shows gating (suboptimal) for monocytes, not RPMs. RPMs are typically F4/80-high, CD11-low (again no gating strategy is shown for RPMs). Also, the authors used single-cell RNAseq to detect a drop in splenic macrophages upon HH, but they do not indicate in Fig. A-C which cluster of cells relates to macrophages. Cell clusters are not identified in these panels, hence the data is not interpretable).

      3) The authors draw conclusions that are not supported by the data, some examples:

      a) They cannot exclude eg the compensatory involvement of the liver in the RBCs clearance (the differences between HH sham and HH splenectomy is mild in Fig. 2 E, F and G)

      b) Splenomegaly is typically caused by increased extramedullary erythropoiesis, not RBC retention. Why do the authors support the second possibility? Related to this, why do the authors conclude that data in Fig. 4 G,H support the model of RBC retention? A significant drop in splenic RBCs (poorly gated) was observed at 7 days, between NN and HH groups, which could actually indicate increased RBC clearance capacity = less retention.

      c) Lines 452-54: there is no data for decreased phagocytosis in vivo, especially in the context of erythrophagocytosis. This should be done with stressed RBCs transfusion assays, very good examples, like from Youssef et al. or Threul et al. are available in the literature.

      d) Line 475 - ferritinophagy was not shown in response to hypoxia by the manuscript, especially that NCOA4 is decreased, at least in the total spleen.

      4) In a few cases, the authors show only representative dot plots or histograms, without quantification for n>1. In Fig. 4B the authors write about a significant decrease (although with n=1 no statistics could be applied here; of note, it is not clear what kind of samples were analyzed here). Another example is Fig. 6I. In this case, it is even more important as the data are conflicting the cited article and the new one: PMCID: PMC9908853 which shows that hypoxia stimulates efferocytosis. Sometimes the manuscript claim that some changes are observed, although they are not visible in representative figures (eg for M1 and M2 macrophages in Fig. 3M)

      5) There are several unclear issues in methodology:

      - what is the purity of primary RPMs in the culture? RPMs are quantitatively poorly represented in splenocyte single-cell suspensions. This reviewer is quite skeptical that the processing of splenocytes from approx 1 mm3 of tissue was sufficient to establish primary RPM cultures. The authors should prove that the cultured cells were indeed RPMs, not monocyte-derived macrophages or other splenic macrophage subtypes.<br /> - (around line 183) In the description of flow cytometry, there are several missing issues. In 1) it is unclear which type of samples were analyzed. In 2) it is not clear how splenocyte cell suspension was prepared.<br /> - In line 192: what does it mean: 'This step can be omitted from cell samples'?<br /> - 'TO method' is not commonly used anymore and hence it was unclear to this Reviewer. Reticulocytes should be analyzed with proper gating, using cell surface markers.<br /> - The description of 'phagocytosis of E. coli and RBCs' in the Methods section is unclear and incomplete. The Results section suggests that for the biotinylated RBCs, phagocytosis? or retention? Of RBCs was quantified in vivo, upon transfusion. However, the Methods section suggests either in vitro/ex vivo approach. It is vague what was indeed performed and how in detail. If RBC transfusion was done, this should be properly described. Of note, biotinylation of RBCs is typically done in vivo only, being a first step in RBC lifespan assay. The such assay is missing in the manuscript. Also, it is not clear if the detection of biotinylated RBCs was performed in permeablized cells (this would be required).

      The authors did not substantially improve the quality of their manuscript in the revised version, at least in the case of the limitations which I have spotted. The major points which remain unclear:<br /> 1. No gating strategies for flow cytometry are provided.<br /> 2. Figure 3M still does not show a typical F4/80 vs CD11b gating, with a population of true RPMs gated.<br /> 3. In a few cases data still lack biological replicates+statistics.<br /> 4. Results from scRNA-seq are not presented more clearly (=clusters in Fig 3E are described as macrophages, but it is not explained which among the clusters are RPMs).<br /> 5. The compensatory role of liver macrophages is omitted.<br /> 6. The authors misunderstood by suggestion to perform in vivo erythrophagocytosis assay using stained RBCs. This assay quantifies the true capacity for erythrophagocytosis in RPMs or KCs in the organ, regardless of the ferroptosis that may be a subsequent consequence (please, see initial Figures in Yousseff et al. paper). Using the percentage of biotin-positive RBCs in the spleen (although this method is not well described in the Methods), the authors rather show increased RBCs clearance at 7 days following hypoxia. Hence, the model where first hypoxia increases erythrophagocytosis in RPMs, consequently leading to their ferroptosis still cannot be excluded.<br /> 7. The Methods are poorly described and unclear - the authors claimed that they have used in vivo biotinylation assay to assess the lifespan of RBCs but it is not described. Instead, the paragraph „Phagocytosis of E. coli and RBCs" suggests that RBCs were stained with biotin for phagocytic assay in culture with macrophages. Phagocytosis of E. coli is still described in the Methods although the authors opted to remove the data from the revised manuscript.<br /> Some points are unclear in the current version of the manuscript, after the addition of new data:<br /> 8. Data in Figure 4D versus 4E,F are not consistent, showing less retention versus increased retention of RBCs in the spleen (retention of senescent RBCs in the spleen should be measured anyway quantitatively, eg, with proper flow cytometry)<br /> 9. The increase of labile iron in the red pulp might not be in RPMs - especially since they seem depleted. Flow cytometry should be used to assess which cell types show increased iron levels.

    1. Reviewer #1 (Public Review):

      Murphy, Fancy and Skene performed a reanalysis of snRNA-seq data from Alzheimer Disease (AD) patients and healthy controls published previously by Mathys et al. (2019), arriving at the conclusion that many of the transcriptional differences described in the original publication were false positives. This was achieved by revising the strategy for both quality control and differential expression analysis. I believe the authors' intention was to show the results of their reanalysis not as a criticism of the original paper (which can hardly be faulted for their strategy which was state-of-the-art at the time and indeed they took extra measures attempting to ensure the reliability of their results), but primarily to raise awareness and provide recommendations for rigorous analysis of sc/snRNA-seq data for future studies.

      STRENGTHS:

      The authors demonstrate that the choice of data analysis strategy can have a vast impact on the results of a study, which in itself may not be obvious to many researchers.

      The authors apply a pseudobulk-based differential expression analysis strategy (essentially, adding up counts from all cells per individual and comparing those counts with standard RNA-seq differential expression tests), which is (a) in line with latest community recommendations, (b) different from the "default options" in most popular scRNA-seq analysis suites, and (c) explains the vastly different number of DEGs identified by the authors and the original publication. The recommendation of this approach together with a detailed assessment of the DEGs found by both methodologies could be a useful finding for the research community. Unfortunately, it is currently not fully substantiated and is confounded with concurrent changes in QC measures (see weaknesses).

      The authors show a correlation between the number of DEGs and the number of cells assessed, which indicates a methodological shortcoming of the original paper's approach (actually, the authors of the original paper already acknowledged that the lesser number of DEGs for rare cell types was a technical artefact). To be educational for the reader it would be important to provide more information about the DEGs that were "found" and those that were "lost". Given vast inter-individual heterogeneity in humans, it is likely that the study was underpowered to find weaker differences using the pseudobulks (Fig. 1B shows that only genes with more than 4-fold change were found "significant").

      All code and data used in this study are publicly available to the readers.

      WEAKNESSES:

      The authors interpret the fact that they found fewer DEGs with their method than the original paper as a good thing by making the assumption that all genes that were not found were false positives. However, they do not prove this, and it is likely that at least some genes were not found due to a lack of statistical power and not because they were actually "incorrect". The original paper also performed independent validations of some genes that were not found here.

      I am concerned that the only DEGs found by the authors are in the rare cell types, foremost the rare microglia (see Fig. 1f). It is unclear to me how many cells the pseudo-bulk counts were based on for these cells types, but it seems that (a) there were few and (b) there were quite few reads per cells. If both are the case, the pseudobulk counts for these cell populations might be rather noisy and the DEG results are liable to outliers with extreme fold changes.

      The authors claim they improved the quality control of the dataset. While I do not think they did anything wrong per se, the authors offer no objective metric to assess this putative improvement. This is another major weakness of the paper as it confounds the results of the improved (?) differential analysis strategy and dilutes the results. I detail this weakness in the two following points:

      Removing low-quality cells: The authors apply a new QC procedure resulting in the removal of some 20k more cells than in the original publication. They state "we believe the authors' quality control (QC) approach did not capture all of these low quality cells" (l. 26). While all the QC metrics used are very sensible, it is unclear whether they are indeed "better". For instance, removal with a mitochondrial count of <5% seems harsh and might account for a large proportion of additional cells filtered out in comparison to the original analysis. There is no blanket "correct cutoff" for this percentage. For instance, the "classic" Seurat tutorial https://satijalab.org/seurat/articles/pbmc3k_tutorial.html uses the 5% threshold chosen by the authors, an MAD-based selection of cutoff arrived at 8% here https://www.sc-best-practices.org/preprocessing_visualization/quality_control.html, another "best practices" guide choses by default 10% https://bioconductor.org/books/3.17/OSCA.basic/quality-control.html#quality-control-discarded, etc. Generally, the % of mitochondrial reads varies a lot between datasets. As far as I can tell, the original paper did not use a fixed threshold but instead used a clustering approach to identify cells with an "abnormally high" mitochondrial read fraction. That also seems reasonable. Overall, I cannot assess whether the new QC is really more appropriate than the original analysis and the authors do not provide any evidence in favor of their strategy.

      Batch correction: "Dataset integration has become a standard step in single-cell RNA-Seq protocols" (l. 29). While it is true that many authors now choose to perform an integration step as part of their analysis workflow, this is by no means uncontroversial as there is a risk of "over-integration" and loss of true biological differences. Also, there are many different methods for dataset integration out there, which will all have different results. More importantly, the authors go on "we found different cell type proportions to the authors (Fig. 1a) which could be due to accounting for batch effects" but offer no support for the claim that the batch effects are indeed related to the observed differences. An alternative explanation would be a selective loss/gain of certain cell types during quality control. The original paper stated concerns about losing certain cell types (microglia, which do not seem to be differentially abundant in the original paper / new analysis).

      Relevant literature is incompletely cited. Instead of referring to reviews of best practices and benchmarks comparing methods for batch correction and or differential analysis, the authors only refer to their own previous work.

      Due to a lack of comparison with other methods and due to the fact that the author's methodology was only applied to a single dataset, the paper presents merely a case study, which could be useful but falls short of providing a general recommendation for a best practice workflow.

      APPRAISAL:

      The manuscript could help to increase awareness of data analysis choices in the community, but only if the superiority of the methodology was clearly demonstrated. The recommended pseudobulk differential expression approach along with the indication of drastic differences that this might have on the results is the main output of the current manuscript, but it is difficult to assess unequivocally how this influenced the results because the differential analysis comes after QC and cell type annotation, which have also been changed in comparison to the original publication. In my opinion, the purpose of the paper might be better served by focusing on the DE strategy without changing QC and instead detailing where/how DEGs were gained/lost and supporting whether these were false positives.

    1. Author Response

      The following is the authors’ response to the original reviews.

      Combined Public Review:

      It has been shown previously that maternal aging in mice is associated with an increase in accumulation of damaged mitochondria and activation of parkin-mediated autophagy (see DOI: 10.1080/15548627.2021.1946739). It has also been shown that C-natriuretic peptide (CNP) regulates oocyte meiotic arrest and that its use during in vitro oocyte maturation can improve parameters associated with decreased oocyte quality. Here the authors tested whether use of CNP treatment in vivo could improve oocyte quality and fertility of aged mice, for which they provided convincing evidence. They also attempted to determine how CNP improves oocyte developmental competence. They showed a correlation between CNP use in vivo and the appearance (and some functional qualities) of cytoplasmic organelles more closely approximating those of oocytes from young mice. However, this correlation could not be interpreted to imply causation. Additional experiments performed using CNP during in vitro maturation were not properly controlled and so are not possible to interpret.

      A strength of the manuscript is that the authors use an in vivo treatment to improve oocyte quality rather than just using CNP during oocyte maturation in vitro as has been done previously. This strategy provides more potential for improving oocyte quality - over the course of oocyte growth and maturation - rather than just the final few hours of maturation alone. This strategy also has the potential to be translated into a more generally useful clinical therapeutic method that using CNP during in vitro maturation. However, it is difficult to glean information regarding how CNP might have its effects in vivo. A range of models are used in the manuscript with a mix of in vivo studies with in vitro experiments, which results in some disconnect between systemic CNP and its reported intrafollicular action as well as in the short-term versus longer-term actions of CNP on oocyte quality. Specifically, CNP was shown to be reduced in the plasma of aged mice, but this was not shown in the granulosa cells, which are the reported source of CNP that acts on oocytes. Whether the ovarian source of CNP is reduced in aged females was not demonstrated, and CNP is not known to act on oocytes through an endocrine effect. In vivo treatments with CNP by i.p. injection were performed, but the dose (120 ug/kg) and time (14 days) of treatment were not validated by any prior experiments to give them physiological relevance.

      Thank you for the summary and for highlighting our manuscript’s strengths and weaknesses.

      Weaknesses:

      1. There are errors in the manuscript writing that make the Results difficult to follow. Reference to the Figures in the Results section does not match what is shown in the Figure panels. For example, the Results text reports differences in CNP levels in aged and young mice shown in Figure 1C, but the relevant panel is actually shown in Figure 1F. Other Figures have the same problem.

      Thanks for the valuable suggestion. All the mistakes have been corrected in the revised manuscript.

      1. The Results section is not always clear regarding what CNP treatment was done - in vivo injections or in vitro maturation. For example, what is the difference, if any, between Figures 2C-D and Figures S2A-B?

      Thank you for pointing out the potential confusion regarding the experimental procedures in Figures 2C-D and Figures S2A-B. In the revised manuscript, we have included additional explanations to clarify that Figures 2C-D represent in vivo injections, while Figures S2A-B depict in vitro maturation. In brief, the results presented in the Supplementary Material (Figures S1-S7) are derived from in vitro CNP treatment.

      1. Immature oocytes from aged females (~1 year) were treated with a two-step culture system with a pre-IVM step with CNP. Controls included oocytes from young (6-8 weeks) females or oocytes from aged females treated by conventional IVM. The description of these methods suggests that control oocytes did not receive an equivalent pre-IVM culture, hence the relevance of comparisons of CNP-treated versus control oocyte is questionable. It was observed that aged oocytes pre-cultured in CNP improved polar body extrusion rates and meiotic spindle morphology compared to oocytes in conventional IVM, as has been well established. The description of statistical methods does not make clear whether the PBE rate in CNP-treated old oocytes remained significantly lower than young controls.

      Statistical analyses were performed using GraphPad Prism 8.00 software (GraphPad, CA, United States). Differences between two groups were assessed using the t-test. Indeed, CNP is unlikely to fully restore the PB1 rate in aged mice to the same level as in the young group. PB1 rate in CNP-treated aged oocytes remained significantly lower than young controls (P<0.05).

      1. The main effect of the CNP 2-week treatment appears to be increasing the number of follicles that grow into secondary and antral stages, but there is no attempt made to discover the mechanism by which this occurs and therefore to understand why there might be an increase in the number of ovulated eggs, quality of the eggs, and litter size. It is also not clear how an intraperitoneal injection can guarantee its effectiveness because the half-life of CNP is very short, only a few minutes.

      The 2-week treatment of CNP had a significant impact, leading to an increase in the number of follicles progressing to secondary and antral stages, as well as an increase in the number of ovulated eggs, improved egg quality, and enhanced litter size. Previous studies (references: 10.1530/REP-18-0470; 10.1210/me.2012-1027) have demonstrated the crucial role of CNP as an upstream regulator in stimulating preantral follicle growth and promoting the ovulation rate. These studies have also identified the influence of CNP on the expression of key ovarian genes involved in cell growth and steroidogenic enzymes. Consistent with these findings, our study provides further evidence supporting CNP as a critical regulator of preantral follicle growth and oocyte quality. Furthermore, it is important to note that oocyte-derived paracrine factors play essential roles in follicular development. CNP may regulate the communication between oocytes and somatic cells, contributing to folliculogenesis and follicular development. We are considering this aspect for further investigation in another ongoing study.

      To ensure the effectiveness of CNP, given its short half-life (a few minutes), aged mice (58 weeks old) received daily intraperitoneal injections of CNP (120 μg/kg body weight; Cat#B5441, ApexBio) for a duration of 14 days.

      1. Meiotic spindle morphology, as well as a number of putative markers of cytoplasmic maturation are also suggested to be improved after pre-culture with CNP. In each case a subjective interpretation of "normal" morphology of these markers is derived from observations of the young controls and the proportions of oocytes with normal or abnormal appearance is evaluated. However, parameters that define abnormal patterns of these markers appear to be subjective judgements, and whether these morphological patterns can be mechanistically attributed to the differences in developmental potential cannot be concluded.

      Oocyte cytoplasmic maturation involves a remarkable reorganization of the oocyte cytoplasm, encompassing the movement of vesicles, mitochondria, Golgi apparatus, and endoplasmic reticulum. This dynamic process occurs during the transitions from the germinal vesicle breakdown (GVBD) stage to the metaphase I (MI), polar body extrusion (PBE), and metaphase II (MII) stages (reference: 10.1093/humupd/dmx040). In our study, we observed that CNP treatment partially rescued cytoplasmic maturation events in aged oocytes by maintaining normal distribution patterns of cortical granules (CG), endoplasmic reticulum (ER), and Golgi apparatus. However, further experiments are needed to investigate the specific action of CNP on the function of CG, ER, and Golgi apparatus. These experiments are beyond the scope of this manuscript, but we acknowledge the importance of this aspect and will consider it for future research. In this study, our main focus was to examine the effects of CNP on mitochondria distribution and function. Therefore, we analyzed the localization patterns of mitochondria, mitochondrial membrane potential, oocyte ATP content, and ROS levels. These experiments were aimed at elucidating the impact of CNP on mitochondrial dynamics and metabolism, which are crucial for oocyte quality and development.

      1. In addition to the localization patterns of mitochondria, the mitochondrial membrane potential, oocyte ATP content and ROS levels were assessed through more objective quantitative methods. These are well known to be defective in oocytes of aged females and CNP treatment improved these measures. Mitochondrial dysfunction is the most obvious link between oocyte apoptosis, autophagy, cytoplasmic organelle miss-localization and aberrant spindle morphology. Among the most intriguing results is the finding that CNP mediated a cAMP-dependent protein kinase (PKA) dependent reduction in mitochondrial autophagy mediators PINK and Parkin and reduced the recruitment of Parkin to mitochondria in oocytes. However, it may not be possible to directly link this observation to the improvements in IVM oocyte quality, since PINK/Parkin assessments were performed in oocytes from cultured follicles treated with CNP for 6 days.

      The beneficial effects of CNP on oocyte quality have been extensively demonstrated through in vivo experiments (Figure 1 and 4) and “two-step” in vitro culture experiments (Figure S1 and S7). In this study, our primary focus is to analyze the signaling pathway and mechanism by which CNP inhibits mitophagy in oocytes. Previous studies have highlighted the significant role of cAMP-PKA activity in reducing mitochondrial recruitment of Parkin and mitophagy (reference: 10.1038/s42003-020-01311-7). Consistent with these findings, our study revealed that aged oocytes exhibited lower concentrations of cAMP compared to young oocytes. However, upon administration of CNP, we observed a substantial increase in intraoocyte cAMP levels. To investigate the involvement of PKA in CNP-mediated oocyte mitophagy, we conducted further experiments. We isolated preantral follicles (80-100 µm diameter) from the ovaries of aged mice and subjected them to in vitro culture with either 100 nM CNP or a combination of 100 nM CNP and 10 µM H89, a PKA inhibitor. Monitoring the growth dynamics of the follicles revealed that treatment with 100 nM CNP significantly increased follicle diameter, while H89 treatment inhibited the promotive effect of CNP on preantral follicle growth (Figure 6 K and L). Western blot analysis demonstrated that CNP supplementation led to a significant decrease in PINK1 and Parkin expression levels, which were abrogated by H89 treatment (Figure 6 M-O). It is well-established that the cAMP-PKA pathway plays a crucial role in inhibiting Parkin recruitment to damaged mitochondria (Akabane et al., 2016). Therefore, we aimed to investigate whether PKA inhibition regulates Parkin recruitment. To assess the effects of CNP on mitochondria, we performed double staining for Parkin and translocase of outer mitochondrial membrane 20 (TOMM20). The results clearly demonstrated that CNP inhibited the mitochondrial localization of Parkin, while PKA inhibition with H89 led to Parkin translocation to mitochondria, as indicated by the overlap of the two staining signals (Figure 6 P and Q). Collectively, our data suggest that the suppression of Parkin recruitment through the cAMP-PKA axis represents an important mechanism underlying the protective effect of CNP against oxidative injury in maternally aged mouse oocytes.

      1. The gold standard assay for oocyte quality is embryo transfer and live birth. The authors assessed the impact of maturing oocytes in vitro in the presence of CNP on oocyte quality by less robust assays (e.g., preimplantation embryo development in vitro), so the impact on oocyte quality is less certain.

      We appreciate the Revierer’s suggestion to assay live birth rates by transfer embryos obtained from IVM oocytes. However, we decided not to pursue this option for this revision because of the current technical challenges that make it difficult to get a precise result of live birth rates from IVM oocyte. Thank you for your very valuable suggestion, we have discovered the shortcomings in my current work, and I will follow your suggestions in my future work to improve the level of scientific research and achieve more results.

      1. The terminology used to describe many of the Results exaggerates the findings. For example, the authors claim that many of their immunofluorescent markers of the various organelles have a pattern that is "restored" by CNP. However, in most cases the pattern is "improved" toward the control condition but is not fully restored.

      We acknowledge the confusion caused by the wording of the mechanism of action of CNP in the original version. In the resubmission, we have made significant improvements by providing critical information that clarifies the action of CNP. We believe that these revisions will enhance the understanding of the mechanism of CNP and its implications. Thank you for pointing out this issue, and we appreciate your feedback in helping us improve the clarity of our work.

      1. The numbers of embryos should have been corrected for the number of eggs fertilized as a starting point so that the percentage that developed to each stage could be expressed as a percentage of successfully fertilized eggs rather than overall percentages. As currently shown in the Figures and described in the Legend, there is no information regarding what the percentage on the y-axis means. For example, does Figure 4B show the number of 2C embryos divided by the number of eggs inseminated? Or is it divided by the number of successfully fertilized eggs, and if so, how was that assessed?

      The embryonic development rates (Figure 4 B-F) were calculated based on the total number of oocytes, and the percentages of oocytes that developed to each stage were expressed as overall percentages.

      1. When fewer eggs are fertilized, the numbers of embryos per group are lower and so the impact of culturing multiple embryos together is lost. As a result, it is possible that culture conditions rather than oocyte quality drove the differences in the numbers of embryos that achieved each stage of development.

      The embryonic development rate was calculated based on the total number of oocytes. Each group included a minimum of 50 oocytes with three replicates (Young: 51, aged: 53, CNP+aged: 50). The embryo culture conditions were consistent across all groups.

      1. Not all claims in the Discussion are supported by the evidence provided. For example, "In addition, the findings demonstrated that CNP improved cytoplasmic maturation events by maintaining normal CG, ER and Golgi apparatus distribution and function in aged oocytes" but it was never demonstrated that the altered distribution had any functional impact.

      Oocyte cytoplasmic maturation involves a remarkable reorganization of the oocyte cytoplasm, including the movement of vesicles, mitochondria, Golgi apparatus, and endoplasmic reticulum. Extensive remodeling and repositioning of intracellular organelles occur during the transitions from GVBD to MI, PBE, and MII stages (10.1093/humupd/dmx040). Our findings indicate that CNP partially rescued cytoplasmic maturation events in aged oocytes by preserving normal distribution of CG, ER, and Golgi apparatus, as well as maintaining mitochondrial function. We acknowledge the importance of considering the impact of CNP on the function of CG, ER, and Golgi apparatus for future research. In summary, these findings demonstrate that CNP improves cytoplasmic maturation events in aged oocytes by facilitating the reorganization of CG, ER, and Golgi apparatus.

      1. Incompleteness and errors in the Methods section reduce confidence in many of the results reported.

      We will enhance the readability of the entire Methods section for the resubmission.

      1. The methods used for Statistical Analysis are never explained in either the Methods or the Figure legends. It is unclear whether appropriate analyses were done, and it is frequently unclear what was the sample size and how many times a particular experiment was repeated. These weaknesses detract from confidence in the data.

      Statistical analyses were performed using GraphPad Prism 8.00 software (GraphPad, CA, United States). Differences between two groups were assessed using the t-test. Data were reported as means ± SEM. Results of statistically significant differences were denoted by asterisk. (P < 0.05 denoted by , P < 0.01 denoted by , P < 0.001 denoted by , and P < 0.0001 denoted by **).

      Recommendations for the authors: please note that you control which revisions to undertake from the public reviews and recommendations for the authors

      1. The introduction does not provide critical information regarding what is already known about the mechanism of action of CNP, what other tissues are impacted by CNP treatment, and how it might affect oocyte growth. Providing this information would make it much easier to understand what is novel about the current manuscript.

      We acknowledge that the mechanism of action of CNP was unclear in the original version. We have now included essential information to clarify the action of CNP.

      1. Comparison of the RNAseq dataset to robust datasets from young vs aged mice would strengthen the analysis (e.g., the dataset in DOI: 10.1111/acel.13482).

      Thank you for your professional suggestion. According to the suggestion from you, we will make comparison of the RNAseq dataset to robust datasets from young vs aged mice in my future work .

      1. Please explain what is "Dr. Tom" that was used for RNA sequencing analysis, in the Methods.

      Dr. Tom is a web-based solution that offers convenient analysis, visualization, and interpretation of various types of RNA data, including mRNA, miRNA, and lncRNA. It also supports the interpretation of single-cell RNA-seq data and WGBS data. Developed by a team of expert scientists and bioinformaticians at BGI, who have extensive experience in numerous research projects, Dr. Tom provides a wide range of intuitive and interactive data visualization tools tailored to save time in conducting differential expression or pathway analysis research. Moreover, its powerful analysis tools and advanced algorithms enable users to extract new insights and derive additional value from their data beyond what is available through standard RNA analysis services. The integration of data from leading databases worldwide allows users to reference and cross-check their results and findings. Dr. Tom is already trusted by tens of thousands of scientists and researchers, serving as a valuable and essential tool alongside their own internal data curation and analysis efforts. To learn more, please visit: Dr. Tom website https://www.bgi.com/global/service/dr-tom.

      1. The Results state that single-cell transcriptomics was performed, but the Methods state that 5 oocytes were collected from each mouse. The actual Method used should be clarified.

      Single-cell RNA-seq is a powerful technique that enables digital transcriptome analysis at the single-cell level using deep-sequencing methods. With this approach, even a single cell can be isolated and processed through various steps to generate sequencing libraries. Given the limited availability of oocyte samples, we employed a single-cell RNA-seq library construction protocol, allowing us to analyze the transcriptomes of individual oocytes. As a result, we collected and analyzed five oocytes from each mouse in our study.

      1. The raw RNAseq data should be deposited into a publicly accessible database and reported by an accession number. It is not sufficient to state that the data is included in the manuscript and supporting information.

      The RNA-seq data has been submitted as supporting information and is now accessible to all readers.

      1. The image in Figure 1G is not very clear.

      Thank you for bringing this to our attention. We will enhance the readability of all our figures for the resubmission.

    1. Author Response

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public Review):

      This study investigates the context-specificity of facial expressions in three species of macaques to test predictions for the 'social complexity hypothesis for communicative complexity'. This hypothesis has garnered much attention in recent years. A proper test of this hypothesis requires clear definitions of 'communicative complexity' and 'social complexity'. Importantly, these two facets of a society must not be derived from the same data because otherwise, any link between the two would be trivial. For instance, if social complexity is derived from the types of interactions individuals have, and different types of signals accompany these interactions, we would not learn anything from a correlation between social and communicative complexity, as both stem from the same data.

      The authors of the present paper make a big step forward in operationalising communicative complexity. They used the Facial Action Coding System to code a large number of facial expressions in macaques. This system allows decomposing facial expressions into different action units, such as 'upper lid raiser', 'upper lip raiser' etc.; these units are closely linked to activating specific muscles or muscle groups. Based on these data, the authors calculated three measures derived from information theory: entropy, specificity and prediction error. These parts of the analysis will be useful for future studies.

      The three species of macaque varied in these three dimensions. In terms of entropy, there were differences with regard to context (and if there are these context-specific differences, then why pool the data?). Barbary and Tonkean macaques showed lower specificity than rhesus macaques. Regarding predicting context from the facial signals, a random forest classifier yielded the highest prediction values for rhesus monkeys. These results align with an earlier study by Preuschoft and van Schaik (2000), who found that less despotic species have greater variability in facial expressions and usage.

      Crucially, the three species under study are also known to vary in terms of their social tolerance. According to the highly influential framework proposed by Bernard Thierry, the members of the genus Macaca fall along a graded continuum from despotic (grade 1) to highly tolerant (grade 4). The three species chosen for the present study represent grade 1 (rhesus monkeys), grade 3 (Barbary macaques), and grade 4 (Tonkean macaques).

      The authors of the present paper define social complexity as equivalent to social tolerance - but how is social tolerance defined? Thierry used aggression and conflict resolution patterns to classify the different macaque species, with the steepness of the rank hierarchy and the degree of nepotism (kin bias) being essential. However, aggression and conflict resolution are accompanied by facial gestures. Thus, the authors are looking at two sides of the same coin when investigating the link between social complexity (as defined by the authors) and communicative complexity. Therefore, I am not convinced that this study makes a significant advance in testing the social complexity for communicative complexity hypothesis. A further weakness is that - despite the careful analysis - only three species were considered; thus, the effective sample size is very small.

      Social tolerance in macaques is defined by various covarying traits, among which rates of counter-aggression and conflict resolution are only two of many included (see Thierry 2021 for a recent discussion and review). We do not deviate from Thierry’s definition of social tolerance. We simply highlight that the constellation of behavioral traits in the most tolerant macaque species results in a social environment where the outcome of social interactions is more uncertain (see introduction lines 102-114). As we argue throughout the paper, higher uncertainty can be used as a proxy for higher complexity and thus we conclude that the most tolerant macaque species have the highest social complexity. While most social behavior in macaques is accompanied by some facial behavior, we were careful to define social contexts only from the body language/behavior (e.g., lunge for aggression, grooming for affiliation) of the individuals involved and ignored the facial behavior used (see method lines 371-381). Therefore, the facial behavior of macaques (communication signals) was not used in defining either social tolerance (and by extension complexity) or the social context in which it was used. We feel like this appropriately minimizes any elements of circularity in the analysis of social and communicative complexity.

      Regarding the effective sample size of three species, we agree that it is small, and it is a limitation of this study. However, the methodology we used is applicable to any species for which FACS is available (including other non-human primates, dogs, and horses), and therefore, we hope that other datasets will complement ours in the future. Nevertheless, we now acknowledge this limitation in the discussion (lines 314317).

      Reviewer #2 (Public Review):

      This is a well-written manuscript about a strong comparative study of diversity of facial movements in three macaque species to test arguments about social complexity influencing communicative complexity. My major criticism has to do with the lack of any reporting of inter-observer reliability statistics - see comment below. Reporting high levels of inter-observer reliability is crucial for making clear the authors have minimized chances of possible observer biases in a study like this, where it is not possible to code the data blind with regard to comparison group. My other comments and questions follow by line number:

      We agree that inter-observer coding reliability is an important piece of information. We now report in more detail the inter-observer reliability tests that we conducted on lines 384-392.

      38-40. Whereas I am an advocate of this hypothesis and have tested it myself, the authors should probably comment here, or later in the discussion, about the reverse argument - greater communicative complexity (driven by other selection pressures) could make more complicated social structures possible. This latter view was the one advocated by McComb & Semple in their foundational 2005 Biology Letters comparative study of relationships between vocal repertoire size and typical group size in non-human primate species.

      It is true that an increase in communicative complexity could allow/drive an increase in social complexity. Unfortunately our data is correlational in nature and we cannot determine the direction of causality. We added such a statement to the discussion (lines 311-314).

      72-84 and 95-96. In the paragraph here, the authors outline an argument about increasing uncertainty / entropy mapping on to increasing complexity in a system (social or communicative). In lines 95-96, though, they fall back on the standard argument about complex systems having intermediate levels of uncertainty (complete uncertainty roughly = random and complete certainty roughly = simple). Various authors have put forward what I think are useful ways of thinking about complexity in groups - from the perspective of an insider (i.e., a group member, where greater randomness is, in fact, greater complexity) vs from the perspective of an outside (i.e., a researcher trying to quantify the complexity of the system where is it relatively easy to explain a completely predictable or completely random system but harder to do so for an intermediately ordered or random system). This sort of argument (Andrew Whiten had an early paper that made this argument) might be worth raising here or later in the discussion? (I'm also curious where the authors sentiments lie for this question - they seem to touch on it in lines 285-287, but I think it's worth unpacking a little more here!)

      In this study we used three measures of uncertainty (entropy, context specificity, and prediction error) to approximate complexity. However, maximum entropy or uncertainty would be achieved in a system that is completely random (and thus be considered simple). Therefore, the species with the highest entropy values, or unpredictability, could be interpreted as having a simpler communication system than a species with a moderately high entropy/unpredictability value. Our argument is that animal communication systems cannot possibly be random, otherwise they would not have evolved as signals. In systems where we know the highest entropy (or unpredictability) will not be due to randomness, as is the case with animal social interactions and communication, we can conclude that the system with the highest uncertainty is the most complex. We have now expanded upon this point in the discussion (lines 286-294). See also response to reviewer 1 below.

      115-129. See also:

      Maestripieri, D. (2005). "Gestural communication in three species of macaques (Macaca mulatta, M. nemestrina, M. arctoides): use of signals in relation to dominance and social context." Gesture 5: 57-73.

      Maestripieri, D. and K. Wallen (1997). "Affiliative and submissive communication in rhesus macaques." Primates 38(2): 127-138.

      On that note, it is probably worth discussing in this paragraph and probably later in the discussion exactly how this study differs from these earlier studies of Maestripieri. I think the fact that machine learning approaches had the most difficulty assigning crested data to context is an important methodological advance for addressing these sorts of questions - there are probably other important differences between the authors' study here and these older publications that are worth bringing up.

      Our study differs from these two studies in that the studies above classified facial behavior into discrete categories (e.g., bared-teeth, lip-smack), whereas we adopted a bottom-up approach and made no a priori assumptions about which movements are relevant. We broke down facial behavior down to their individual muscle movements (i.e., Action Units). Measuring facial behavior at the level of individual muscle movements allows for a more detailed and objective description of the complexity of facial behavior. This is a general point in advancing the study of facial behavior that is discussed in the introduction (lines 60-71) and discussion (lines 206-208). The reason we don’t draw a direct comparison with the studies above is because they had a slightly different focus. Our study was more focused on complexity of the (facial) communication system in general rather than comparing whether the different species use the same facial behavior in the same/different social contexts.

      220-222. What is known about visual perception in these species? Recent arguments suggest that more socially complex species should have more sensitive perceptual processing abilities for other individuals' signals and cues (see Freeberg et al. 2019 Animal Behaviour). Are there any published empirical data to this effect, ideally from the visual domain but perhaps from any domain?

      This is an interesting point. We are not aware of any studies showing differences in visual perceptions within the macaque genus. Both crested macaques and rhesus macaques are able to discriminate between individuals and facial expressions in match-to-sample tasks with comparable performances (Micheletta et al., 2015a, 2015b; Parr et al. 2008; Parr & Heinz, 2009). Similarly, several macaque species are sensitive to gaze shifts from conspecifics (Tomasello et al. 1998; Teufel et al. 2010; Micheletta & Waller, 2012).

      274-277. I am not sure I follow this - could not different social and non-social contexts produce variation in different affective states such that "emotion"-based signals could be as flexible / uncertain as seemingly volitional / information-based / referential-like signals? This issue is probably too far away from the main points of this paper, but I suspect the authors' argument in this sentence is too simplified or overstated with regard to more affect-based signals.

      Emotion-based signals could, in theory, also produce flexible signals and it is possible that some facial expressions reflect an emotional state. However, some previous studies have suggested that facial expressions are only used as a display of emotion, rather than such signals having evolved for a different function such as announcing future intentions. In our study we found that macaques used, in some cases, the same facial expressions (i.e. combination of Action Units) in at least two different social contexts that, presumably, differed in their emotional valence. Thus, it is unlikely that particular facial expressions are bound to a single emotion. We think that this is an important point to make even though it is slightly beyond the scope of our paper.

      288 on. Given there are only three species in this study, the chances of one of the species being the 'most complex' in any measure is 0.33. Although I do not believe this argument I am making here, can the authors rule out the possibility that their findings related to crested macaques are all related to chance, statistically speaking?

      We are not aware of a way to rule out this possibility. However, we believe that we are appropriately cautious throughout the paper and acknowledge that having only investigated three species is a limitation of this study in the discussion (lines 314-317, see also our response to reviewer 1 above).

      329-330. The fact that only one male rhesus macaque was assessed here seems problematic, given the balance of sexes in the other two species. Can the authors comment more on this - are the gestures they are studying here identical across the sexes?

      We agree it would have been preferable to collect data on more than one male rhesus macaque, but that was unfortunately not possible. We are not aware of any studies showing differences in the use of facial behavior between male and female rhesus macaques. If differences exist, most likely these would occur in a sexual/mating context. However, in our study we only considered affiliative (non-sexual), submissive, and aggressive contexts, where we have no a priori reason to believe that there are sex differences.

      354-371. Inter-observer reliability statistics are required here - one of the authors who did not code the original data set, or a trained observer who is not an author, could easily code a subset of the video files to obtain inter-observer reliability data. This is important for ruling out potential unconscious observer biases in coding the data.

      We agree this is an important piece of information. We now report in more detail the inter-observer reliability tests that we conducted on lines 384-392:

      “An agreement rating of >0.7 was considered good [Ekman et al 2002] and was necessary for obtaining certification. To obtain a MaqFACS coding certification, AVR, CP, and PRC coded 23 video clips of rhesus macaques and the MaqFACS codes were compared to the data of other certified coders (https://animalfacs.com).

      The mean agreement ratings obtained were 0.85, 0.73, 0.83 for AVR, CP, and PRC, respectively. In addition, AVR and CP coded 7 videos of Barbary macaques with a mean agreement rating of 0.79. AVR and PRC coded 10 videos of crested macaques with a mean agreement rating of 0.74.”

      Reviewer #1 (Recommendations For The Authors):

      Given the long debate on the concept of information exchange in animal communication, I would also recommend being more careful with the term 'exchanges of information' (line 271). Perhaps it's better to be agnostic in the context of this paper.

      As suggested, we now changed the phrasing to focus on the behavior of the animals, rather than suggesting that information is being exchanged (lines 270-273),

      Line 281: "This result confirms the assumption that facial behaviour in macaques is not used randomly": the authors are knocking down a straw man. Nobody who has ever studied animal communication would consider that signals occur randomly. Otherwise, they would not have evolved as signals.

      Indeed, nobody claims that animal communication signals are used randomly. Although it may be taken for granted, we feel it is worthwhile to reiterate this point, given that we used relative entropy and prediction error as measures of complexity. For instance, maximum entropy or unpredictability would be achieved in a system that is completely random (and thus be considered simple). Therefore, the species with the highest entropy values, or lowest predictability, could be interpreted as having a simpler communication system than a species with a moderately high entropy value. But if we are working under the assumption that animal communication systems cannot possibly be random, then we can conclude that the species whose communication system has the highest entropy is in fact the most complex. We tried to make this justification clearer in the discussion (lines 285-294).

      I did not follow why there is a higher reliance on facial signals when predation pressure is higher. Apart from the fact that the authors cannot address this question, they may want to reconsider this idea altogether.

      We now expand on the logic of why predation pressure might affect the use of facial signals (see lines 308-309): “When predation pressure is higher, reliance on facial signals could be higher than, for example vocal signals, such as to not draw attention of predators to the signaller.”

      Technical comments:

      One methodological issue that requires clarification is what the units of analysis are. The authors write that each row in their analysis denoted an observation time of 500 ms. How many rows did the authors assemble? The authors mention a sample size of > 3000 social interactions in the abstract. How did they define social interactions? And how many 'time windows' of 500 ms were obtained? Did they take one window per interaction or several? If several, then how was this move accounted for in the analysis? The reporting needs to be more accurate here. Most likely, the bootstrapping took care of biases in the data, but still, this information needs to be provided.

      We have now added some additional information to the method section. Social interactions for each context had the following definitions: “Social context was labeled from the point of view of the signaler based on their general behavior and body language (but not the facial behavior itself), during or immediately following the facial behavior. An aggressive context was considered when the signaler lunged or leaned forward with the body or head, charged, chased, or physically hit the interaction partner. A submissive context was considered when the signaler leaned back with the body or head, moved away, or fled from the interaction partner. An affiliative context was considered when the signaler approached another individual without aggression (as defined previously) and remained in proximity, in relaxed body contact, or groomed either during or immediately after the facial behavior. In cases where the behavior of the signaler did not match our context definitions, or displayed behaviors belonging to multiple contexts, we labeled the social context as unclear. Social context was determined from the video itself and/or from the matching focal behavioral data, if available.” (lines 371-382). The total duration of all social interactions per social context, and thus the number of 500ms windows/rows, have been added to Table 1 (lines 395-397). There were several 500ms windows per social interaction. All 500ms time blocks per interaction were used in the statistical analyses in order to retain all the variation and complexity of the facial behavior (Action Unit combinations) used by the macaques (lines 403-405). Indeed the bootstrapping procedure was used to account for any biases in the data.

      Overall, I would recommend providing more information on the actual behaviour of the animals. The paper is strong in handling highly derived indices representing the behaviour, but the reader learns little about the animals' behaviour. Thus, it would be great if statements about the entropy ratio were translated into what these measures represent in real life. For context specificity, this is clear, but for entropy, not so much.

      A high entropy ratio essentially suggests that a species uses a high variety of unique facial behavior/signals and all signals in the repertoire are used roughly equally often (rather than one facial behavior being used 90% of the time and others rarely used). We have tried our best to better explain this point in the introduction (lines 75-81) and discussion (lines 215-222). Discussing exactly what these signals are and what they mean was beyond the scope of this paper.

      Line 106: nepotism, not kinship

      Changed as suggested (line 106).

      Line 113: I would avoid statements about how a monkey society is perceived by its members.

      We think that noting how individuals may perceive their social environment is worthwhile when defining social complexity, so have retained this point but changed the phrasing to be more speculative (lines 112-113).

      Line 329: I was very surprised that only one male was represented in the data for rhesus monkeys. The authors try to wriggle their way out of this issue in the supplementary material ("Therefore, we have no a priori reason to expect an overall difference in the diversity and complexity of facial behaviour between the sexes"), but I think this is a major shortcoming of the analysis. They should ascertain whether there are no sex differences in the other two species regarding their variables of interest. They could then make a very cautious case for there being no sex differences in rhesus either. But of course, they would not know for sure.

      As with our response to reviewer 2 above, we agree that it would have been preferable to collect data on more than one male rhesus macaque, but that was unfortunately not possible. We are not aware of any studies showing differences in the use of facial behavior between male and female rhesus macaques. If differences exist, most likely these would occur in a sexual/mating context. However, in our study we only considered affiliative (non-sexual), submissive, and aggressive contexts, where we have no a priori reason to believe that there are sex differences. Looking at sex differences in the use of facial behavior would be a worthwhile study on its own, but it is outside the scope of this paper.

      This paper would make a stronger contribution if it focussed on the comparative analysis of facial expressions and removed the attempt of testing the social complexity for communicative complexity hypothesis.

      A comparative analysis of the contextual use of specific facial movements is important. But this paper is focused on making a more general comparison of the communication style and complexity across species. The social complexity hypothesis for communicative complexity is one of the key theoretical frameworks for such an investigation and allows us to frame our study in a broader context. We contribute important data on 3 species with methods that can be replicated and extended to others species. Therefore, we believe that it is a worthy contribution to investigations of the evolution of complex communication.

      REFERENCES

      Micheletta, J., J. Whitehouse, L.A. Parr, and B.M. Waller. ‘Facial Expression Recognition in Crested Macaques (Macaca nigra)’. Animal Cognition 18 (2015): 985–90. https://doi.org/10/f7fvnh.

      Micheletta, Jérôme, Jamie Whitehouse, Lisa A. Parr, Paul Marshman, Antje Engelhardt, and Bridget M. Waller. ‘Familiar and Unfamiliar Face Recognition in Crested Macaques (Macaca nigra)’. Royal Society Open Science 2 (2015): 150109. https://doi.org/10/ggx9k9.

      Parr, L. A., and M. Heintz. ‘Facial Expression Recognition in Rhesus Monkeys, Macaca mulatta’. Animal Behaviour 77 (2009): 1507–13. https://doi.org/10/bbsp5n.

      Parr, L.A., M. Heintz, and G. Pradhan. ‘Rhesus Monkeys (Macaca mulatta) Lack Expertise in Face Processing’. Journal of Comparative Psychology 122 (2008): 390–402. https://doi.org/10/d7w6bv.

      Micheletta, J., and B.M. Waller. ‘Friendship Affects Gaze Following in a Tolerant Species of Macaque, Macaca nigra’. Animal Behaviour 83 (2012): 459–67. https://doi.org/10/c4f8n2.

      Thierry B. Where do we stand with the covariation framework in primate societies? Am. J. Biol. Anthropol. 128 (2021): 5–25. https://doi.org/10.1002/ajpa.24441

      Tomasello, M., J. Call, and B. Hare. ‘Five Primate Species Follow the Visual Gaze of Conspecifics’. Animal Behaviour 55 (1998): 1063–69. https://doi.org/10/bmq7xh.

      Teufel, C., A. Gutmann, R. Pirow, and J. Fischer. ‘Facial Expressions Modulate the Ontogenetic Trajectory of Gaze-Following among Monkeys’. Developmental Science 13 (2010): 913–22. https://doi.org/10/b6j5r7.

    1. Author Response

      The following is the authors’ response to the original reviews.

      We are grateful for the helpful comments of both reviewers and have revised our manuscript with them in mind.

      One of the main issues raised was that readers may by default assume that our models are correct. We in fact made it very clear in our discussion that the models are merely hypotheses that will need testing by “wet” experiments and we do not therefore agree that even readers unfamiliar with AF would assume that the models must be correct. It was also suggested that readers could be reassured by including extensive confidence estimates such as PAE plots. As it happens, every single model described in the manuscript had reasonably high PAE scores and more crucially the entire collection of output files, including PAE data, are readily accessible on Figshare at https://doi.org/10.6084/m9.figshare.22567318.v2, a fact that the reviewers appear to have overlooked. The Figshare link is mentioned three times in the manuscript. Embedding these data within the manuscript itself would in our view add even more details and we have therefore not included them in our revised manuscript. Likewise, it is rather simple for any reader to work out which part of a PAE matrix corresponds to an interaction observed in the corresponding pdb prediction. Besides which, it is our view that the biological plausibility and explanatory power of models is just as important as AF metrics in judging whether they may be correct, as is indeed also the case for most experimental work.

      Another important point was that the manuscript was too long and not readable. Yes, it is long and it could well be argued that we could have written a different type of manuscript, focusing entirely on what is possibly the simplest and most important finding, namely that our AF models suggest that in animal cells Wapl appears to form a quarternary complex with SA, Pds5, and Scc1 in a manner suggesting that a key function of Wapl’s conserved CTD is to sequester Scc1’s Nterminal domain after it has dissociated from Smc3. For right or for wrong, we decided that this story could not be presented on its own but also required 1) an explanation for how Scc1 is induced to dissociate from Smc3 in the first place and 2) how to explain that the quarternary complex predicted for animal cells was not initially predicted for fungi such as yeast. The yeast situation was an exception that clearly needed explaining if the theory was to have any generality and it turned out that delving into the intricate details of the genetics of releasing activity in yeast was eventually required and yielded valuable new insights. We also believe that our work on the recruitment of Eco/Esco acetyl transferases to cohesin and the finding that sororin binds to the Smc3/Scc1 interface also provided important insight into how releasing activity is regulated. We acknowledge that the paper is indeed long but do not think that it is badly written. It is above all a long and complex story that in our view reveals numerous novel insights into how cohesin’s association with chromosomes is regulated and have endeavoured to eliminate any excessive speculation. We feel it is not our fault that cohesin uses complex mechanisms.

      Notwithstanding these considerations, we have in fact simplified a few sections and removed one or two others but acknowledge that we have not made substantial cuts.

      It was pointed out that a key feature of our modelling, namely the predicted association of Wapl’s C-terminal domain with SA/Scc3’s CES is inconsistent with published biochemical data. The AF predictions for this interface are universally robust in all eukaryotic lineages and crucially fully consistent with published and unimpeachable genetic data. We note that any model that explains all findings is bound to be wrong for the very simple reason that some of these findings will prove to be incorrect. There is therefore an art in Science of judging which data must be explained and accommodated and which should be ignored. In this particular case, we chose to ignore the biochemistry. Time will tell whether our judgement proves correct.

      Last but not least, it was suggested that we might provide some experimental support for our proposed SA/Scc3-Pds5-Scc1-WaplC quaternary complex. We are in fact working on this by introducing cysteine pairs (that can be crosslinked in cells) into the proposed interfaces but decided that such studies should be the topic of a subsequent publication. It would be impossible with the resources available to our labs to follow up all of the potential interactions and we therefore decided to exclude all such experiments.

      We are grateful for the detailed comments provided by both reviewers, many of which were very helpful, and in many but not all cases have amended the manuscript accordingly.

      With regard to the more specific comments:

      Reviewer #1 (Recommendations For The Authors):

      1) One concern is that observed interfaces/complexes arise because AF-multimer will aim to pack exposed, conserved and hydrophobic surfaces or regions that contain charge complementarity. The risk is that pairwise interaction screens can result in false positive & non-physiological interactions. It is therefore important to report the level of model confidence obtained for such AF calculations:

      A) The authors should color the key models according to pLDDT scores obtained as reported by AF. This would allow the reader to judge the estimated accuracy of the backbone and side chain rotamers obtained. At least for the key models and interactions it would be important to know if the pLDDT score is >90 (Correct backbone and most rotamers) or >70 (only backbone is correct).

      B) It would also be important to report the PAE plots to allow estimation of the expected position error for most of the important interactions. pLDDT coloring and PEA plots can be shown side-by-side as shown in other published data (e.g. https://pubmed.ncbi.nlm.nih.gov/35679397/ (Supplementary data)

      C) The authors should include a Table showing the confidence of template modeling scores for the predicted protein interfaces as ipTM, ipTM+pTM as reported by AlphaFold-multimer. Ideally, they would also include DockQ scores but this may not be essential. Addition of such scores would help classification into Incorrect, Acceptable or of high quality. For example, line 1073 et seq the authors show a model of a SCC1SA and ESCO1 complex (Fig. 37). Are the modeling scores for these interfaces high? It does not help that the authors show cartoons without side chains? Can the authors provide a close-up view of the two interfaces? Are the amino acids are indeed packed in a manner expected for a protein interface? Can we exclude the possibility that the prediction is obtained merely because the sequence segments (e.g. in ESCO1 & ESCO2) are hydrophobic and conserved?

      We do not agree that including this level of detail to the text/figures of the manuscript would be suitable. All the relevant data for those who may be sceptical about the models are readily available at https://doi.org/10.6084/m9.figshare.22567318.v2. In our view, the cartoon versions of the models are easier for a reader to navigate. Anyone interested in the molecular details can look at the models directly.

      Importantly, no amount of statistical analysis can completely validate these models. What is required are further experiments, which will be the topic of further work from our and I dare from other laboratories.

      D) When they predict an interaction between the SA2:SCC1 complex and Sororin's FGF motif, they find that only 1/5 models show an interaction and that the interaction is dissimilar to that seen of CTCF. Again, it would be helpful to know about modeling scores. Can they show a close-up view of the SORORIN FGF binding interface to see if a realistic binding mode is obtained? Can they indicate the relevant region on the PAE plot?

      Given that AF greatly favours other interactions of Sororin’s FGF motif over its interaction with SA2-Scc1, we do not agree that dwelling on the latter would serve any purpose.

      2) Line 996: AF predicts with high confidence an interaction between Eco1 & SMC3hd. What are the ipTM (& DockQ if available) scores. Would the interface score High, Medium or Acceptable?

      As mentioned, see https://doi.org/10.6084/m9.figshare.22567318.v2.

      3) Line 1034 et seq: Eco1/ESCO1/ESCO2 interaction with PDS5. Interface scores need to be shown to determine that the models shown are indeed likely to occur. If these interactions have low model confidence, Fig. 36 and discussion around potential relevance to PDS5-Eco1 orientation relative to the SMC3 head remains highly speculative and could be expunged.

      See https://doi.org/10.6084/m9.figshare.22567318.v2. It should be clear that the predictions are very similar in fungi and animals. Crucially, we know that Pds5 is essential for acetylation in vivo, so the models appear plausible from a biological point of view.

      4) Considering the relatively large interface between ECO1 and SMC3, would the author consider the possibility that in addition to acetylating SMC3's ATPase domain, ECO1 remains bound to cohesin-DNA complex, as proposed for ESCO1 by Rahman et al (10.1073/pnas.1505323112)?

      This is certainly possible but we would not want to indulge in such speculation.

      5) E.g. Line 875 but also throughout the text: As there is no labeling of the N- and C-termini in the Figures, is frequently unclear what the authors are referring to when they mention that AF models orient chains in a certain manner.

      Good point. This has been amended. However, the positions of N- and C- is all available at https://doi.org/10.6084/m9.figshare.22567318.v2.

      6) Fig19B: PAE plots: authors should indicate which chains correspond to A, B, C. Which segment corresponds to the TYxxxR[T/S]L motif? Can they highlight this section on the PAE plot?

      Good point and amended in the revised manuscript.

      Minor comments:

      1) Line 440: the WAPL YSR motif is not shown in Fig. 14A

      2) Line 691: Scc3 spelling error.

      3) Line 931: Sentence ending '... SCC3 (SCC3N).' requires citation.

      4) Line 1008: Figure reference seems wrong. It should read: Fig. 34A left and right. Fig. 34B does not contain SCC1.

      Many thanks for spotting these. Hopefully, all corrected.

      5) Fig. 41 can be removed as it shows the absence of the interaction of Sororin with SMC1:SCC1. Sufficient to mention in the text that Sororin does not appear to interact with SMC1:SCC1.

      This is possible but we decided to leave this as is.

      Reviewer #2 (Recommendations For The Authors):

      Minor points

      (1) Are there any predicted models in which one of the two dimer interfaces of the hinge is open when the coiled coils are folded back, as seen in the cryo-EM structure of human cohesin-NIPBL complex in the clamped state?

      No AF runs ever predicted half opened hinges. It is possible that the introduction of mutations in one of the two interfaces might reveal a half-opened state and we ought to try this. However, it would not be appropriate for this manuscript, we believe.

      (2) Structures of the SA-Scc1 CES bound to [Y/F]xF motifs from Sgo1 and CTCF have been reported, suggesting that a similar motif could interact with SA/Scc3. Surprisingly, AF did not predict an interaction between Scc3/SA and Wapl FGF motifs, which only bind to the Pds5 WEST region. On the other hand, AF predicted interactions of the Sororin FGF motif with both Pds5 WEST and SA CES. Can the authors comment on this Wapl FGF binding specificity? What will happen if a Wapl fragment lacking the CTD is used in the prediction?

      This seems to be an academic point as the CTD is always present.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We would like to thank the editorial staff and the reviewers for their handling of our manuscript. We were very pleased with the timely communications from Review Commons, and we are grateful to have been assigned this insightful and constructive group of reviewers.

      The reviewers were well-suited to evaluate our work based on their stated areas of expertise (cancer biology, image analysis, machine learning, cell-based screening, etc.). As such, we received thoughtful and constructive feedback, which we have already incorporated into our attached revision. We are confident that these reviews have improved our manuscript.

      Our goal with this manuscript is to present a proof-of-concept study where high-content imaging and morphological profiling are used to characterize drug resistance in clonal cell lines. The main criticism from reviewers was that our original manuscript may have overstated our method’s ability to discriminate the signal of bortezomib resistance and that any extension beyond cultured cells (to patient samples for example) would require significant follow-up studies. The reviewers suggested that such work would be beyond the scope of our study, and recommended toning down our language to better reflect the limitations of this proof-of-concept work. We have embraced this suggestion, extensively revising our text, and we now believe our language and tone more accurately reflects our results. The reviewers also suggested follow-up computational analyses to more robustly characterize the bortezomib resistance signature. We have performed these analyses and added their description to our revised manuscript. We feel that these analyses have improved understanding of the signature, and will help a reader to gain a deeper understanding of our results and methodology.

      The reviewers also suggested several minor changes; many of which we embraced fully, but others that we chose not to incorporate. We felt that a lack of clarity in our text contributed to these reviewer suggestions. In these cases, we improved clarity in the text and responded to each comment point-by-point in the “prefer not to carry out” section. Further, we address all reviewer comments in the following document point-by-point, grouped by common themes across reviewers (e.g., tone, clarity, analyses, etc.).

      Lastly, a common theme among reviewer comments was their appreciation for our strong methodology and data transparency (examples pasted below). We are extremely gratified by this observation as we feel this is a particular strength of our manuscript. In addition, we were pleased to see reviewers engaged by our work, acknowledging the interest this manuscript is likely to generate among a broad range of scientific disciplines.

      Examples of reviewer appreciation of our strong methodology and data transparency:

      Reviewer 1: “However, this does not imply that the same approach can not achieve the goal, perhaps by using other cell painting markers for bortezomib-sensitivity, or with the same markers to assess sensitivity of different drugs. The cell painting + analysis approaches are not new and the clinical impact is questionable, but the technical aspects (data, analysis) are exceptional and the concept may hold as I described above.”

      Reviewer 2: “The paper is well written, and the text is clear, as is the presentation of data and transparency of methods being utilized. The methods were applied appropriately and followed established standards in the field. The paper's premise is timely and interesting, addressing a pressing issue in cancer therapy: making informed treatment decisions fast, based on markers found in tumors early in tumor development, and using image-based screening for characterizing drug resistance before treatment could be an option. A fascinating bit of the manuscript is the description of the feature selection from the screen is done systematically, considering the technical and biological variability and technical artifacts and modeling covariates using linear models seems a very appropriate way of doing so and could serve as another proof of concept that this is indeed the most robust way of modeling and removing signal of technical covariates from the data.”

      Reviewer 3: “The strengths of this study are the machine learning best practice and detailed methodology. The experiments could be reproduced and statistical analysis is more than adequate. The analysis takes into account batch effects, well position, differences in cell numbers, and other sources of technical variation that complicate high-content image analysis. It is a good exemplar of how unsupervised morphological profiling can be applied to imaging data. The major limitation is the generalizability of this particular method for patient samples. This could be addressed in the Discussion.”

      1. Description of the planned revisions

      We have incorporated all planned revisions.

      1. Description of the revisions that have already been incorporated in the transferred manuscript

      Text revisions already carried out

      1. [Text revision] We have materially toned down our claims in the manuscript in two distinct areas: A) model performance and B) potential clinical application. A) Model performance. We specifically balanced our discussion of the discriminative signal of the Bortezomib Signature. While the signature adequately separated never-before-seen wildtype and resistant clones with metrics well above randomly permuted baselines (accuracy near 80%, average precision about 70%, area under the ROC curve (AUROC) about 84%), there were many limitations that we should have more explicitly highlighted. For example, many individual profiles were incorrectly classified, some clones were predicted entirely incorrectly, and many profiles did not receive Bortezomib Signature scores above the randomly permuted baseline. We have more clearly discussed these limitations and used more balanced language (see key examples of text-based changes below). Additionally, we modified a figure (now Figure 3) to include boxplots of clones that explicitly show the Bortezomib Signature scores of each well profile and permit examination of the strength of the signature for each clone (previously found in Figure 2-Supplement 9). Lastly, we add a new supplementary figure (now Figure 5-Supplement 1) that describes a feature space analysis of misclassified samples. Please note that this figure rearrangement and new analysis helped to balance our claims, but were also performed in response to other tangential reviewer comments. B) Clinical application. In the abstract, introduction, and discussion, we further emphasized that this work is a proof of concept, and that more advances must be made prior to clinical application.

      We made these changes in direct response to the following reviewer comments:

      Reviewer 1 - Major Comment 1 (relevant excerpts)

      While I am convinced that the signature captures morphological phenotypes associated with drug resistance, at the cumulative scale, the discriminative signal of a single cell type seems weak… With Fig. 4, the data fully supports the argument that the bortezomib-signature encodes bortezomib-resistance, but the signal is weak. Thus statements such as "We found the Bortezomib Signature could predict whether a cell line was bortezomib-resistant or bortezomib-sensitive" (line #172) and the specificity statements in the abstract" (line #28) are not supported by the data in my opinion. I would recommend the authors to tune down these and other related statements throughout the manuscript.

      Reviewer cross-commenting - Reviewer 1

      My main critic is regarding "over selling" a weak discriminative signal. Specifically, I am not convinced that the major claims regarding predicting sensitivity and specificity at the single cell types scales are supported by the data. Since reviewer #2 and #3 did not raise this concern I think it is worth discussion here.

      Once these statements are tuned down - I think no significant additional work is needed to make the point that they can measure a discriminative signal. If they want to make these claims, perhaps they'd like to collect more data to gain statistical power (but I am not optimistic this will work at the single cell level).

      Personally, I was happy with the authors' choice of cell lines not included in the training dataset. I am not convinced that additional cell lines + validations are necessary for making the point of a proof of principle.

      Reviewer cross-commenting - Reviewer 2

      I agree that, perhaps, my major criticism of the paper was the manuscript's 'overselling' of claims that were only weakly supported by the data. Yes, if the authors tune down their claims and clearly state that this is an interesting starting point and proof of concept study, it might be ok to publish with only minor revisions. If the claims should be more generalized, then this study needs more data supporting the conclusions and the method's predictive power.

      Reviewer 2 - Major Comment 8

      Lastly, I find some misfits between the question, the model used, and the conclusions drawn. The authors start by exploring the problem of bortezomib resistance in cancer treatment, which they say is a devastating issue for patients with, e.g., multiple myeloma. Yet, the authors use HCT116 as their model cell line, a microsatellite instable, colorectal cell line with several intrinsic mutations that make it a difficult model to address physiologically relevant medical problems after all. The authors then go on to suppose that their method might be suitable to diagnose resistance in patient samples, but I am not convinced this conclusion can be speculated based on data from HCT cells. I suggest the authors test their approach on at least two other cell lines (maybe from different tissues) and benchmark their results against a dataset of digital pathology where such predictions are made from stained and analyzed tissue slices. This way, after a thorough benchmark against related third-party data sets, the method would significantly gain relevance, the paper would appeal to a broader audience, and the advance gains more merit.

      Reviewer 3 - Major Comment 5

      It is not clear from the Discussion whether this type of analysis is more broadly applicable to cell lines derived from patients, rather than selected from a parental cell line, or if this approach would be more efficient than genotyping or next-gen sequencing. How many replicates and ground truth cell lines would be necessary for predictive confidence?

      We edited the last two sentences of the abstract to tone down specificity claims (“provide evidence”) and clarify that we are establishing a “proof-of-concept framework”.

      • This signature predicted bortezomib resistance better than resistance to other drugs targeting the ubiquitin-proteasome system. Our results establish a proof-of-concept framework for the unbiased analysis of drug resistance using high-content microscopy of cancer cells, in the absence of drug treatment.

      We revised the last paragraph of the introduction to contrast bortezomib predictions with ixazomib/CB-5083 predictions, and to remove claims about “using microscopy to guide therapy”.

      • This morphological signature correctly predicted the bortezomib resistance of seven out of ten clones not included in the signature training dataset. Overall, our results establish a proof-of-concept framework for identifying unbiased signatures of drug resistance using high-content microscopy. The ability to identify drug-resistant cells based on morphological features provides a valuable orthogonal method for characterizing resistance in the absence of drug treatment.

      To tone down claims in the figures, we added boxplots to Figure 3 (previous Figure 2) showing specific distribution of signature scores per well profile and updated Figure 4 legend (previous Figure 3).

      • Figure 4. Bortezomib Signature has limited ability to characterize clones resistant to other ubiquitin-proteasome system inhibitors.

      We modify the following text in the discussion to tone down claims of specificity and clinical utility:

      • This Bortezomib Signature correctly predicted the bortezomib resistance of seven out of ten clones not included in the training dataset and was more specific to bortezomib-resistance given its limited ability to identify clones that were resistant to other UPS-targeting drugs.

      Though it is unclear whether this method can be extended to patient samples, where identifying intrinsic drug resistance in cells prior to treatment has the potential to improve targeted cancer therapy, our results are an encouraging proof of concept. We expect that further refinement may develop Cell Painting as a tool for identifying drug-resistant cells, perhaps even guiding strategies to overcome intrinsic resistance.

      1. [Text revision] We defined LD50 in text (originally line #97), changed description of resistant clone selection to remove main text references to LD90 (originally line #87), and stated drug concentrations used for selection in Methods. We also defined LD90 in the Methods and described its role in determining the drug concentrations to use for clone selection. This change was in response to the following comments:

      Reviewer 1 - Minor Comment 2

      What is LD90 (line #87)? LD50 (line #97)?

      Reviewer 2 - Minor Comment 5

      What was the LD 90 per drug on HCT cells? Rather than LD90 foldchanges, absolute concentrations should be used in the results and discussion to allow the reader to vet the conclusions.

      • To determine the appropriate drug concentrations to use in order to isolate drug-resistant clones, we performed proliferation assays on HCT116 parental cells with our drugs of interest: bortezomib (proteasome inhibitor), ixazomib (proteasome inhibitor), or CB-5083 (p97 inhibitor) (Fig. 1-Supplement 1 A-D).
      • We characterized the bortezomib-resistant clones and found that the median lethal doses (LD50s) were ~2.8- to ~9-fold that of HCT116 parental cells (Fig. 1-Supplement 2 B).
      • Briefly, HCT116 cells were plated in 150 mm dishes and grown in the presence of the desired drug at a concentration that resulted in the death of the majority of cells (selection concentrations: bortezomib, 12 nM; ixazomib, 150 nM; CB-5083, 600 and 700 nM).
      • Using the data from our proliferation assays, we calculated the median lethal dose (LD50) for each of our drugs of interest by fitting data of normalized growth vs. log[drug concentration] to a sigmoidal dose-response curve using GraphPad Prism (v.9.2.0) (Fig. 1-Supplement 1 D).

      • [Text revision] We thank the reviewer for allowing us an opportunity to improve clarity on the clones we used. We now describe the total number of clones generated and removed unnecessary references to specific clones for ease of reading (originally lines #96-98) (We maintain all references to specific clones in the figures, legends, supplement, and methods)

      Reviewer 1 - Minor Comment 3

      It was not clear to me in the text which and how many cell lines were evaluated and the reader is forced to go to the SI. For example, "(BZ01-10 and BZ clones A and E)" (line #96-97) and "wild-type clones (WT01-05, 10, and 12-15)" (line #98) appeared when presenting the results without a clear explanation and made it harder for me to follow. Summary of the data (for example, based on Figure 2-Supplement 8) can be briefly mentioned in the text to make it more clear for the reader.

      We added the following to the second paragraph of the results:

      • Together these methods provided a total of twelve bortezomib-resistant, five ixazomib-resistant, five CB-5083-resistant, and twelve bortezomib-sensitive clones as well as HCT116 parental cells for our experiments.

      [Text revision] We removed duplicate text (originally lines #115-125).

      Reviewer 1 - Minor Comment 5

      1. Lines #104-111 were duplicated in lines #114-122.

      Reviewer 3 - Minor Comment 4

      Ten lines of text are duplicated on page 5.

      Reviewer 2 - Minor Comment 4

      on page 5, paragraph 4, there is a sizeable copy-and-paste error of text being identically replicated.

      1. [Text revision] We provided more intuition of the Bortezomib Signature in the results section (originally lines #150-151).

      Reviewer 1 - Minor Comment 6

      The "Bortezomib Signature" is a critical measurement but is only briefly mentioned in lines 150-151 ("..based on the direction-sensitive ranking method for phenotype analysis, singscore (Foroutan et al., 2018)"). Please provide more information/intuition.

      • We used these 45 features to compute a rank-based resistance score or “Bortezomib Signature” for each well profile based on the direction-sensitive method called singscore (Foroutan et al. 2018). Singscore ranks these 45 resistance-related features on a per sample basis and calculates a normalized score between -1 and 1, with higher values expected for bortezomib-resistant clones and lower values expected for bortezomib-sensitive clones.

      • [Text revision] We clarified that DNA sequencing had been performed solely on clones A and E in a previous study (originally lines #88-90). Furthermore, one of the strengths of our approach is that it can identify resistant clones in an unbiased fashion prior to molecular characterization. It is beyond scope to perform these sequencing studies in the present paper.

      Reviewer 2 - Minor Comment 3

      The authors talk about validating the mutation - PSMB5 by RNA-seq. However, the data for the genotyping/sequencing/characterization of these newly generated BZ-resistant lines are missing.<br />

      In the results, we clarify DNA sequencing that was previously performed on clones A and E

      • We also isolated bortezomib-sensitive (wild-type; WT) clones by dilution of the HCT116 parental cell line and acquired two bortezomib-resistant clones (BZ clones A and E) both with mutations in PSMB5 identified by RNA sequencing performed in previous work (Fig. 1-Supplement 1 E) (Wacker et al. 2012).

      In the last paragraph of the discussion, we highlight the strength of our unbiased approach

      • Together, our work has demonstrated the potential for morphological profiling with Cell Painting to be used as an unbiased method to characterize resistance in the absence of drug treatment. Our results indicate that different mechanisms of bortezomib resistance may generate distinct morphological profiles; with larger and broader training datasets, it may be possible to identify signatures for distinct mechanisms of bortezomib resistance as well as signatures of resistance to other drugs. Though it is unclear whether this method can be extended to patient samples, where identifying intrinsic drug resistance in cells prior to treatment has the potential to improve targeted cancer therapy, our results are an encouraging proof of concept. We expect that further refinement may develop Cell Painting as a tool for identifying drug-resistant cells, perhaps even guiding strategies to overcome intrinsic resistance.

      • [Text revision] We thank the reviewers for their suggestions. We agree that the description of the experimental design was somewhat unclear and have provided greater detail and clarity, particularly regarding the generation of clones. We used the HCT116 parental cell line to generate drug-resistant clones by identifying single surviving cells after drug treatment and allowing these cells to expand prior to isolating colonies for experimentation. We did not perform experiments to confirm whether these “clones” were isogenic and can not exclude cell migration during expansion or genetic drift as convoluting factors. However, we have provided greater detail in the descriptions of our method for clone isolation in order to address this concern.

      Reviewer 1 - Minor Comment 1

      More information in Fig. 1's legend would be helpful to follow the experimental design. I found it hard to follow in its current form and had to go back to carefully reading the main text to fully understand.

      Reviewer 2 - Minor Comment 6

      The description of the resistant clonal populations is confusing. As I understand, no single-cell clones were isolated during the selection procedure. Thus, the training lines are not yet isogenic clones but oligoclonal sub-populations of the parental cell line. The authors could provide more details here and discuss the different characteristics of their sub-populations, e.g., their growth kinetics or molecular alterations.

      We bolstered the description in the results.

      • We first isolated and characterized drug-resistant cells (Fig. 1 A). To isolate drug-resistant clones, we used an approach we have described previously (Wacker et al. 2012; Kasap, Elemento, and Kapoor 2014) and the HCT116 cell line. These cancer cells express multidrug resistance pumps at low levels and are mismatch repair deficient, providing a genetically heterogeneous polyclonal population of cells (Umar et al. 1994; Papadopoulos et al. 1994; Teraishi et al. 2005) allowing for isolation of drug-resistant clones in 2-3 weeks. We hypothesize that a rapid selection of resistance could favor the isolation of clones with intrinsic resistance. To determine the appropriate drug concentrations to use in order to isolate drug-resistant clones, we performed proliferation assays on HCT116 parental cells with our drugs of interest: bortezomib, ixazomib, or CB-5083 (Fig. 1-Supplement 1 A-D). We also isolated bortezomib-sensitive (wild-type; WT) clones by dilution of the HCT116 parental cell line and acquired two published bortezomib-resistant clones (BZ clones A and E) both with mutations in PSMB5 identified by RNA sequencing performed in previous work (Fig. 1-Supplement 1 E) (Wacker et al. 2012). We characterized the bortezomib-resistant clones and found that the median lethal doses (LD50s) for bortezomib were ~2.8- to ~9-fold that of HCT116 parental cells (Fig. 1-Supplement 2 B). In contrast, bortezomib-sensitive clones had LD50s for bortezomib that ranged from ~0.7- to ~1.2-fold that of HCT116 parental cells (Fig. 1-Supplement 2 A). Together these methods provided a total of twelve bortezomib-resistant, five ixazomib-resistant, five CB-5083-resistant, and twelve bortezomib-sensitive clones as well as HCT116 parental cells for our experiments.

      We also updated the legend for Figure 1A.

      • Figure 1. Experimental design for using Cell Painting to examine morphological profiles of drug-resistant cells. (A) Graphic of the experimental workflow: we isolated drug-resistant clones by treating parental HCT116 cells with a high dose of the desired drug and then expanded them for experiments. We isolated drug-sensitive clones by diluting HCT116 cells and then expanded them for experiments. We then performed proliferation assays on select clones to screen for multidrug resistance. Next, we performed Cell Painting on both drug-resistant and -sensitive clones, using multiplexed high-throughput fluorescence microscopy of fixed cells followed by feature extraction and morphological profiling to search for features that contribute to a signature of drug resistance.

      • [Text revision] We clarified that the Bortezomib Signature did not correspond to well position (originally lines #155-157).

      Reviewer 1 - Minor Comment 9

      Line #155-156: "We found that the pattern of Bortezomib Signatures corresponded to the cell identity plate layout", the word "not" is missing before "corresponded".

      We found that the pattern of Bortezomib Signatures did not correspond to well position relative to the plate (Fig. 2-Supplement 7 B), indicating that the well position for each clone was not strongly contributing to its Bortezomib Signature.

      1. [Text revision] We explicitly described the result that some misclassified clones (WT10, WT15, and BZ06) did not have unexpected bortezomib sensitivity as determined by proliferation assays. We also moved the supplementary figure to an updated Figure 3 to better highlight this result (described below in “Figure revisions already carried out”). Lastly, we add a new figure (Figure 5-Supplement 1) to more explicitly analyze the misclassified lines (described below in “New analyses already carried out”).

      Reviewer 3 - Minor Comment 3

      The bortezomib sensitivity of the WT lines used in the last experiments was determined and did not seem to be greater than parental. This could be mentioned in the text; the figure raises the question and the answer is provided, but it's in the supplemental material.

      While the Bortezomib Signature correctly characterized the bortezomib sensitivity of most clones, it consistently misclassified others (WT10, WT15, and BZ06) (Fig 5-Supplement 1 A). Proliferation assays conducted in earlier experiments showed that WT10 and WT15 were sensitive to bortezomib while BZ06 was resistant (Fig. 1-Supplement 2 A and B). By comparing these incorrect predictions with high-confidence correct predictions, we observed differences that varied by clone type, suggesting unique morphology may be driving each of these misclassifications (Fig. 5-Supplement 1 B and C). These results are consistent with the Bortezomib Signature being generalizable to clones not included in the training dataset and suggest that morphological profiling has the potential to identify bortezomib-resistant clones based on the morphological features of cells in the absence of drug treatment.

      1. [Text revision] We clarified that the metrics (accuracy and average precision) were based on median Bortezomib Signature scores of all replicate well-level profiles per clone. We can compare samples based on rank, and difference from 95% confidence interval of permuted data. There is no current way for our method to assign a likelihood. Also note that we have updated the discussion to discuss alternative metrics (see Reviewer 1 - Minor Comment 7) These are very important distinctions, and we are grateful to the reviewer for bringing them up.

      Reviewer 3 - Major Comment 3

      The study classifies cells as binary sensitive or resistant, but would results be improved by scoring based on likelihood of being resistant/sensitive?

      Reviewer 3 - Minor Comment 2

      It is not clear whether the accuracy was based on a percentage of replicates per cell line that were classified correctly or whether that was referring to classification of the cell line overall as sensitive/resistant.

      • We next examined whether the Bortezomib Signature was able to predict the bortezomib resistance of a clone based on morphological profiling data (Fig. 3 A-E and Fig. 3-Supplement 2 A and B). We called the clone bortezomib-resistant if the median Bortezomib Signature of all replicate well profiles was greater than zero and bortezomib-sensitive if the median Bortezomib Signature less than zero. In the training dataset, the Bortezomib Signature correctly predicted the bortezomib resistance of all ten clones, with median Bortezomib Signatures for eight out of ten clones beyond the 95% confidence interval for the randomly permuted data (Fig. 3 A). The accuracy of the Bortezomib Signature was 88% while the average precision was 81% for the training dataset (Fig. 3-Supplement 2 A and B) (see Methods). The signature performed similarly well in the validation dataset (Fig. 3 B), with an accuracy of 92% and an average precision of 89% (Fig. 3-Supplement 2 A and B). In the test dataset the Bortezomib Signature correctly predicted the bortezomib resistance of all clones, though only HCT116 parental cells had a median Bortezomib Signature outside the 95% confidence interval for the randomly permuted data (Fig. 3 C). The test dataset had an accuracy of 80% and an average precision of 68% (Fig. 3-Supplement 2 A and B). Similarly, in the holdout dataset the Bortezomib Signature had an accuracy of 78% and an average precision of 69% (Fig.3 -Supplement 2 A and B), and correctly predicted the bortezomib resistance of twelve out of thirteen clones, with WT01 misclassified as bortezomib-resistant (Fig. 3 D). In the holdout dataset, four of the twelve correctly characterized clones had median Bortezomib Signatures outside the 95% confidence interval for the randomly permuted data.

      We also mirrored language when discussing the ixazomib and CB-5083 results.

      • However, only two of the four correctly identified ixazomib-resistant clones and one of the three CB-5083-resistant clones had median Bortezomib Signatures outside the 95% confidence interval of the randomly permuted data. The area under the ROC (AUROC) curve for ixazomib-resistant and CB-5083-resistant clones (0.63 and 0.60, respectively) was lower than those calculated for the training, validation, test, and holdout datasets. In addition, many of the Bortezomib Signatures for well profiles of ixazomib- and CB-5083-resistant clones, particularly those for CB-5083-resistant clones, landed within the 95% confidence interval of the randomly permuted data. These results suggest that the Bortezomib Signature is not a general signature of UPS-targeting drug resistance and instead has some specificity for bortezomib.

      • [Text revision] We added an explicit note that our image analysis pipelines are also publicly available. Our reporting of our data processing pipelines are documented fully and well above standards in our field. Linking the publicly-available resources with these methods maximizes reproducibility.

      Reviewer 1 - Minor Comment 10

      Additional details on the processing steps in the analysis pipeline in the Methods will be highly appreciated.

      We include all image analysis pipelines at https://github.com/broadinstitute/profiling-resistance-mechanisms (G. Way et al. 2023).

      1. [Text revision] We have compared our approach to the on-disease/off-disease scores as introduced in (Heiser et al. 2020). We agree with the reviewer that a discussion of these two methods would help clarify our phenotypic signature concept. The on/off score is about the degree to which a perturbation pushes disease towards a healthy state. In this case we have 3 sets of data: healthy samples (used for training), disease samples (used for training), and the sample we want to score, which should be of the form "disease + perturbation". With our approach, based on singscore, we also have 3 sets of data: sensitive samples (used for training), resistance samples (used for training), and the sample we want to score. Here, our sample we want to score could be anything, not necessarily of the form "resistance + perturbation". Furthermore, singscore does not have the concept of orthogonality to resistance/sensitivity. This would become relevant if we were exploring perturbations or conditions that would induce a resistant cell line to become sensitive, but we are not doing that here. There are other statistical differences (projection vs. rank based etc.) but the key difference is the applicability of the method to the specific problem at hand.

      Reviewer 1 - Minor Comment 7

      How is the Bortezomib Signature related to the "on-disease"/"off-disease" scores described in https://www.biorxiv.org/content/10.1101/2020.04.21.054387v1.full? Are there other alternatives used for similar binary phenotypic signatures? What is the justification for using these measurements? I would love to see this generalized concept explicitly discussed in the Discussion.

      We added the following to the discussion.

      • The Bortezomib Signature is conceptually similar to the on-disease/off-disease score (Heiser et al. 2020). Both require three phenotypic measurements: a target phenotype representing ideal, a disease phenotype, and a new phenotype to classify. However, our approach is technically different (non-parametric compared to linear projection) and our goals are different (phenotypic classification compared to perturbation alignment). Other methods also enable phenotype labeling, but they focus on single-sample annotation without regard to a target phenotype (Wawer et al. 2014; Rohban et al. 2017; Simm et al. 2018; Nyffeler et al. 2020).

      Figure revisions already carried out

      1. [Figure revision] We moved all boxplots from the original Fig. 2-Supplement 9 to the main text (also splitting Fig. 2 into Fig. 2 and 3). From the original Figure 2, we moved the accuracy and average precision bar graphs to the supplement. We also note that this change increases transparency of the discriminative signal of our signature.

      Reviewer 1 - Minor Comment 8

      I would highly recommend showing the Bortezomib Signatures from Figure 2-Supplement 9. in Fig. 2. This was the main measurement used throughout the manuscript and in my opinion, it is very important to consistently visualize the data along the manuscript, for clarity and easier reader interpretation.

      1. [Figure revision] We adjusted the position of the legend in the accuracy and average precision bar graphs (originally Fig. 2 C and D, now Fig. 3-Supplement 2) for clarity. We also note that keeping the bar chart here is standard best practice (compared to a dot plot).

      Reviewer 1 - Minor Comment 4

      I found the visualization in Fig. 2C-D not intuitive (it is properly explained in the legend). I suggest replacing the accuracy colorbar with a color marker to make it more distinct from the random permutation (|--*--|) The location of the text "mean +- SD of 100 random permutation" made me first think that it is linked to the holdout.

      1. [Figure revision] We changed the point distribution in the boxplots (from expanded to standard) to minimize overlap with the boxplot lines. We also updated the legend text to indicate that individual points in boxplots represent the Bortezomib Signature for well profiles. Note, we paste a representative example of this change above (new Figure 3).

      Reviewer 3 - Minor Comment 1

      I found the box plots somewhat difficult to interpret (especially where the WT lines had a lot of overlap with the red shaded area). Do the points in these charts correspond to replicate wells?

      We also update the figure legend.

      • Plots show values for individual well profiles (points), range (error bars), 25th and 75th percentiles (box boundaries), and median.

      • [Figure revision] [Response to Reviewer 2 - Major Comment 7] We thank the reviewer for allowing us an opportunity to clarify the mechanism. We feel that it is beyond scope of this manuscript to disentangle the molecular alterations that cause bortezomib resistance based on our Cell Painting insights. This wet lab experimental process is arduous and cost prohibitive, and we argue that one of the benefits of taking a morphology approach to resistance status is that we can detect resistant cells (and therefore cells that won’t die when presented with a treatment) without knowing the molecular mechanism.

      Nevertheless, the reviewer has encouraged us to enhance the ability for a reader to view and interpret the signature to perhaps more easily facilitate future work. Previously, we presented our signature in text form in Figure 2-Supplement 4 and in heatmap form in Figure 2-Supplement 5. Here, we add a new figure (Figure 2-Supplement 6; pasted below) which will improve interpretability.

      Reviewer 2 - Major Comment 7:

      Next to feature importance, the authors do not discuss (or I missed) what biology the features represent. Such the reader is left wondering what the actual mechanism of bortezomib resistance could be and if cell painting could shed light on the molecular alterations that cause the treatment resistance. While reviewing, I thus wondered which audience the authors targeted with their manuscript. A more focused analysis of their data that highlights aspects of the study either for the machine learning community, the cell biology community, or the precision oncology community would greatly benefit the manuscript's impact. In its current form, the study's findings seem diluted and spread across a wide range of research questions.<br />

      • Figure 2-Supplement 6. Bortezomib Signature visualized by CellProfiler features. Visualization of CellProfiler features contributing to the Bortezomib Signature. Features with high values (mean signature estimates) in resistant cells are purple while features with low values in resistant cells are green. The mean signature estimates were based on Tukey's Honestly Significant Difference test score and the number in each box represents the number of features used to calculate the mean signature estimate.

      Additionally, we add the following to the results section:

      • We then examined the grouping of features across compartments and channels and found radial distribution features were higher in resistant cells (Fig 2-Supplement 6).

      The code change to generate the signature visualization summary is available at: https://github.com/broadinstitute/profiling-resistance-mechanisms/pull/131

      New analyses already carried out

      1. [New analysis] [Response to Reviewer 2 - Major Comment 5] We agree that a systematic analysis of feature selection methods will provide additional insights not already in the manuscript. Therefore, we have performed two new computational experiments to compare our linear modeling feature selection approach against other standard approaches. We demonstrate that our linear modeling approach is effective at isolating the core differences between resistant and sensitive classes.

      Specifically, we performed two analyses: A) UMAP and B) k-means cluster analysis. We analyzed profiles defined by four different feature selection approaches: 1) Using all traditional CellProfiler features; 2) Using the traditional CellProfiler feature selection approach (removing low variance features, high correlating features, etc.); 3) Using 45 random features (same size as Bortezomib Signature); and 4) Using only the bortezomib signature features. We performed Fisher’s exact tests to derive odds ratios of cluster membership by resistance status and calculated Silhouette widths to quantify relative proximity of clusters.

      This analysis generates a new supplementary figure (see below), and demonstrates that the linear-modeling-based feature selection isolated the features driving the differences between the clone types (resistance vs. wildtype) while the standard approaches do not as effectively separate.

      Reviewer 2 - Major Comment 5:

      A fascinating bit of the manuscript is the description of the feature selection from the screen is done systematically, considering the technical and biological variability and technical artifacts and modeling covariates using linear models seems a very appropriate way of doing so and could serve as another proof of concept that this is indeed the most robust way of modeling and removing signal of technical covariates from the data. Yet, I wondered why the authors do not discuss other means of feature selection or dimensionality reduction; further, they need to show how the features cluster the cell lines or why impact (information content) different features deliver. For an audience interested in the technical aspects of cell painting analysis and machine learning based on the data, that would, IMHO, be the most exciting questions.

      • Figure 3-Supplement 3. Benchmarking linear-modeling feature selection to separate clones by bortezomib resistance. Uniform Manifold Approximation and Projection (UMAP) analysis of the qualitative separability of (A) resistance status and (B) Bortezomib Signature scores across four different feature spaces. (C) k-means clustering from k=2 to k=14 of average odds ratio, maximum odds ratio (Fisher’s exact test), and Silhouette width using Bortezomib Signature features.

      Additionally, we add the following to the results section:

      • We then compared our linear-modeling approach to feature selection against other feature spaces and found that the Bortezomib Signature clusters same-type clones (bortezomib-resistant vs. bortezomib-sensitive) with higher enrichment compared to the full feature space, standard feature selection (see Methods), or a random selection of 45 features (Fig 3-Supplement 3).

      And methods section, describing this analysis:

      • We were also interested in comparing the ability of different feature spaces to cluster clones of the same type (resistant vs. sensitive). This analysis would determine if the Bortezomib Signature features, which we derived using linear modeling to isolate biological from technical variables, had a greater ability to cluster. We compared the Bortezomib Signature against three other feature spaces: 1) the full feature space, 2) standard feature selection (see Image data processing methods), and 3) 45 randomly selected features. We performed two analyses using these four feature spaces including Uniform Manifold Approximation and Projection (UMAP) (McInnes et al. 2018) and k-means clustering. For UMAP, we used default umap-learn parameters to identify two UMAP coordinates per feature space. We then visualized the clusters by their resistance status and Bortezomib Signature score. The UMAP analysis represents a qualitative analysis. Next, we applied k-means clustering with 25 initializations across a range of 2-14 clusters (k). Prior to clustering and for each feature space, we applied principal component analysis (PCA) and transformed each feature space into 30 principal components. This step was necessary to compare k-means clustering metrics, which are sensitive to the feature space dimensionality. We applied a Fisher’s exact test to each cluster using a two-by-two contingency matrix that specified cluster membership for each clone classification (resistant vs. sensitive). We visualized the mean odds ratio and max cluster odds ratio for each feature space across k. A high odds ratio tells us that the feature space effectively clusters clones of the same resistance status. Lastly, we calculated Silhouette width (the average proximity between samples in one cluster to the second nearest cluster) for each feature space across k.

      The code change to derive the UMAP coordinates, perform clustering, and generate the figure is available at https://github.com/broadinstitute/profiling-resistance-mechanisms/pull/132

      1. [New analysis] [Response to Reviewer 3 - Major Comment 1] We thank the reviewer for this suggestion, which allowed us to explore the misclassified samples in more depth. We added a new supplementary figure in which we summarized all bortezomib clones (wildtype and resistant) in their accuracy based on the bortezomib signature (panel A). We did not include training set samples in this analysis. Using samples that were consistently incorrectly classified with high confidence (three samples: WT15, BZ06, WT10) we performed two separate two-sample Kolmogorov–Smirnov (KS) tests. Specifically, we compared high incorrect wildtype to high correct wildtype and high incorrect resistant to high correct resistant. Our results indicate that most bortezomib signatures were significantly different between correct and incorrect assignments (panel B), and that the signature features varied between resistant and wildtype misclassification tests (panel C).

      Reviewer 3 - Major Comment 1:

      While the claims are largely substantiated, there are a few points where further consideration would improve the manuscript. Several cell lines were mis-classified with what appears to be a high degree of certainty. Can the authors tell what was driving those predictions? Was there something in the morphological signature that weighed more heavily in those cases?

      • Figure 5-Supplement 1. Examining the accuracy of clone classification and misclassification of clones. (A) Proportion of high-confidence correct, low-confidence correct, low-confidence incorrect, and high-confidence incorrect predictions of well profiles across clones in the test, holdout, and validation sets. High-confidence predictions (high) had a Bortezomib Signatures greater (resistant clones) or less than (sensitive) the 95% confidence interval of randomly permuted data while low-confidence predictions (low) had Bortezomib Signatures within the 95% confidence interval of randomly permuted data. (B) Visualization of Kolmogorov-Smirnov (KS) test statistic means of feature groups across channels and cellular compartments. (C) Plot of the KS test statistic means for feature groups in bortezomib-resistant vs. -sensitive cells. Each feature group is color coded by the imaging channel.

      Additionally, we add the following to the results section:

      • While the Bortezomib Signature correctly characterized the bortezomib sensitivity of most clones, it consistently misclassified others (WT10, WT15, and BZ06) (Fig 5-Supplement 1 A). Proliferation assays conducted in earlier experiments showed that WT10 and WT15 were sensitive to bortezomib while BZ06 was resistant (Fig. 1-Supplement 2 A and B). By comparing these incorrect predictions with high-confidence correct predictions, we observed differences that varied by clone type, suggesting unique morphology may be driving each of these misclassifications (Fig. 5-Supplement 1 B and C). These results are consistent with the Bortezomib Signature being generalizable to clones not included in the training dataset and suggest that morphological profiling has the potential to identify bortezomib-resistant clones based on the morphological features of cells in the absence of drug treatment.

      And methods section, describing this analysis:

      Some profiles were consistently predicted incorrectly with high confidence but in the opposite direction (see Figure 5-Supplement 1). For a well-level profile to be categorized as high-confidence (in either the correct or incorrect directions), it needed to score beyond the 95% confidence interval of the randomly permuted data range. For example, a high-confidence incorrect resistant profile would have a Bortezomib Signature below 95% confidence interval of the randomly permuted data. To evaluate the features driving the differences in these samples, we applied two-sample Kolmogorov–Smirnov (KS) tests per Bortezomib Signature feature. We applied these tests to two separate groups: 1) misclassified bortezomib-sensitive vs. high-confidence accurate bortezomib-sensitive and 2) misclassified bortezomib-resistant vs. high-confidence accurate bortezomib-resistant.

      The code change to generate the UMAP coordinates and figure is available at https://github.com/broadinstitute/profiling-resistance-mechanisms/pull/130

      Description of analyses that authors prefer not to carry out

      1. [Response to Reviewer 2 - Minor Comments 1 and 2]: These are interesting suggestions! Still, we prefer not to speculate on the biological mechanism of the Bortezomib signature. Connecting morphological features identified as contributing to the Bortezomib Signature by Cell Painting to specific biological pathways would demand considerable cell-based assays to validate. In addition, our analyses suggest that the features contributing to the Bortezomib Signature are spread across a range of cellular compartments and channels, making it difficult to pin down specific mechanisms or pathways as likely contributors to bortezomib resistance. However, we are adding a figure to increase interpretability of the signature, which will aid in developing future hypotheses. Note that the signature was not possible to detect by eye (Fig. 2 A).

      Reviewer 2 - Minor Comment 1:

      There could be some speculation on the mechanism of Bortezomib resistance concerning the literature with the existing image data. For example, Bortezomib resistance is connected to serine synthesis and how a particular feature could contribute to the known mechanism.<br />

      Reviewer 2 - Minor Comment 2:

      Along the same lines, the authors could show that larger cells lead to resistance with microscopic images.

      2. [Response to Reviewer 2 - Major Comment 8]: We appreciate the reviewer’s concern that our work using HCT116 clonal cells lines may not directly reflect results from patient samples. Our choice was based on previously published work demonstrating the efficiency with which HCT116 cells generate resistant clones due to diminished DNA mismatch repair and decreased expression of drug efflux pumps. Since our work is a proof of concept rather than a comprehensive demonstration of translating morphological profiling into clinical practice, we believe that experiments using multiple patient cell lines from different tissues as well as digital pathology records to be beyond the scope of this work. We instead chose to tone down the language of our manuscript to more clearly acknowledge the limitations of our work and clarify this as a proof of concept.

      Reviewer 2 - Major Comment 8 (relevant excerpt):

      I suggest the authors test their approach on at least two other cell lines (maybe from different tissues) and benchmark their results against a dataset of digital pathology where such predictions are made from stained and analyzed tissue slices. This way, after a thorough benchmark against related third-party data sets, the method would significantly gain relevance, the paper would appeal to a broader audience, and the advance gains more merit.<br />

      3. [Response to Reviewer 3 - Major Comment 2]: The bortezomib sensitivity of ixazomib- and CB-5083-resistant clones was not determined, and hence can not be ruled out as a possible explanation for their high Bortezomib Signature scores. However, we prefer not to conduct additional proliferation assays for the misclassified clones (IX02, WT06, CB14, CB16) in the presence of bortezomib to determine whether coincidental bortezomib resistance might explain the signature performance. Our rationale is that three other misclassified clones (WT10, WT15, and BZ06) had the expected bortezomib sensitivity in proliferation assays (Fig. 1-Supplement 2), meaning that additional proliferation assays may not reveal any insights regarding the signature performance.

      Reviewer 3 - Major Comment 2:

      Was the bortezomib sensitivity of the IX (or CB) resistant cell lines determined? If there were differences, this could explain some of the variation in the morphological signatures. This could be easily done in one or two growth experiments.

      4. [Response to Reviewer 2 - Major Comment 7]: Thank you for pointing this out. Our goal is to keep the study multi-disciplinary. We are adding a figure to increase interpretability of the signature, and adding text-based clarifications.

      Reviewer 2 - Major Comment 7 (relevant excerpt):

      While reviewing, I thus wondered which audience the authors targeted with their manuscript. A more focused analysis of their data that highlights aspects of the study either for the machine learning community, the cell biology community, or the precision oncology community would greatly benefit the manuscript's impact. In its current form, the study's findings seem diluted and spread across a wide range of research questions.<br />

      5. [Response to Reviewer 2 and 3 - Major Comments 6 and 4]: We prefer not to expand the scope of the model to predict other drug signatures. This would require a substantial amount of work to generate the appropriate drug-resistant clones, collect the imaging data, and analyze it, and we think it important to convey the purpose of our paper is proof of concept. We do not feel that the time invested in performing this analysis would result in adequate returns beyond what we already demonstrate.

      Reviewer 2 - Major Comment 6.

      Interestingly, the Bortezomib signature is specific to the drug and not a broad range of proteasomal inhibitors. However, seeing the common features between all the proteasomal inhibitors would be interesting.

      Reviewer 3 - Major Comment 4

      There was some predictive ability of the Bortezomib Signature for ixazomib resistance. Were there some features that were correlated with IX-resistance, i.e. UPS pathway, versus specific to bortezomib? Do the features suggest anything about resistance mechanisms or is the feature set too abstruse to interpret?

      References

      Foroutan, Momeneh, Dharmesh D. Bhuva, Ruqian Lyu, Kristy Horan, Joseph Cursons, and Melissa J. Davis. 2018. “Single Sample Scoring of Molecular Phenotypes.” BMC Bioinformatics 19 (1): 404.

      Heiser, Katie, Peter F. McLean, Chadwick T. Davis, Ben Fogelson, Hannah B. Gordon, Pamela Jacobson, Brett Hurst, et al. 2020. “Identification of Potential Treatments for COVID-19 through Artificial Intelligence-Enabled Phenomic Analysis of Human Cells Infected with SARS-CoV-2.” bioRxiv. https://doi.org/10.1101/2020.04.21.054387.

      McInnes, Leland, John Healy, Nathaniel Saul, and Lukas Großberger. 2018. “UMAP: Uniform Manifold Approximation and Projection.” Journal of Open Source Software 3 (29): 861.

      Nyffeler, Johanna, Clinton Willis, Ryan Lougee, Ann Richard, Katie Paul-Friedman, and Joshua A. Harrill. 2020. “Bioactivity Screening of Environmental Chemicals Using Imaging-Based High-Throughput Phenotypic Profiling.” Toxicology and Applied Pharmacology 389 (January): 114876.

      Rohban, Mohammad Hossein, Shantanu Singh, Xiaoyun Wu, Julia B. Berthet, Mark-Anthony Bray, Yashaswi Shrestha, Xaralabos Varelas, Jesse S. Boehm, and Anne E. Carpenter. 2017. “Systematic Morphological Profiling of Human Gene and Allele Function via Cell Painting.” eLife 6 (March). https://doi.org/10.7554/eLife.24060.

      Simm, Jaak, Günter Klambauer, Adam Arany, Marvin Steijaert, Jörg Kurt Wegner, Emmanuel Gustin, Vladimir Chupakhin, et al. 2018. “Repurposing High-Throughput Image Assays Enables Biological Activity Prediction for Drug Discovery.” Cell Chemical Biology 25 (5): 611–18.e3.

      Wacker, Sarah A., Benjamin R. Houghtaling, Olivier Elemento, and Tarun M. Kapoor. 2012. “Using Transcriptome Sequencing to Identify Mechanisms of Drug Action and Resistance.” Nature Chemical Biology 8 (3): 235–37.

      Wawer, Mathias J., Kejie Li, Sigrun M. Gustafsdottir, Vebjorn Ljosa, Nicole E. Bodycombe, Melissa A. Marton, Katherine L. Sokolnicki, et al. 2014. “Toward Performance-Diverse Small-Molecule Libraries for Cell-Based Phenotypic Screening Using Multiplexed High-Dimensional Profiling.” Proceedings of the National Academy of Sciences of the United States of America 111 (30): 10911–16.

      Way, Gregory, Yu Han, David Stirling, and Shantanu Singh. 2023. Broadinstitute/profiling-Resistance-Mechanisms: Analysis for Preprint. Zenodo. https://doi.org/10.5281/ZENODO.7803787.

      Way, Gregory P., Maria Kost-Alimova, Tsukasa Shibue, William F. Harrington, Stanley Gill, Federica Piccioni, Tim Becker, et al. 2021. “Predicting Cell Health Phenotypes Using Image-Based Morphology Profiling.” Molecular Biology of the Cell 32 (9): 995–1005.

    1. Finally, it is noteworthy to highlight a newly available atlas that includes partitioning of the CC in seven subregions in vivo in humans (Radwan et al., Citation2022Radwan, A. M., Sunaert, S., Schilling, K., Descoteaux, M., Landman, B. A., Vandenbulcke, M., Theys, T., Dupont, P., & Emsell, L. (2022). An atlas of white matter anatomy, its variability, and reproducibility based on constrained spherical deconvolution of diffusion mri. NeuroImage, 254, 119029. https://doi.org/10.1016/j.neuroimage.2022.119029 [Crossref], [PubMed], [Web of Science ®], [Google Scholar]). Segments refer to prefrontal, premotor, motor, sensory, parietal, occipital and temporal areas and are obtained through the application of probabilistic spherical deconvolution tractography.

      در این کار اطلس مسیرهای آکسونی تهیه شده که شامل مسیرهای کالوزال هم هست(مقاله 12)

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      The manuscript by Rigger and Brenner details the role of vimentin network, in advancing OA pathogenesis by exacerbating premature senescence. The data is well presented and the study of interest, in that there is little known about vimentin in cartilage biology.<br /> The authors used OA derived cartilage explants and chondrocytes cultures, were graded for severity and compared accordingly. Figure 1 shows that markers of senescence are increased with structural damage, which is well established and consistant with the literature. Using a DOX model the authors induce premature senescence and exhibit a disrupted vimentin network. However, upon KD of CDKN2A, a marker of senescence, but did not observe complete reversal of CSV presentation.<br /> Next the authors show in figure 4 and 5, that the reduction or dismemberment of vimentin structures are linked to senescence and may act as contributing factors.<br /> Figures 6 and 7 then go on to show that upon advanced passage chondrocytes lose their vimentin network, and tend to senesce and mineralize.

      Reviewer #1 (Significance):

      Strength:<br /> This is a very novel study showing a link between vimentin and senescence in chondrocytes. The data are in line with other data. The work is clearly written structured and well displayed.

      Author´s response:<br /> We thank reviewer #1 for their interest in our work and their overall positive report.

      Suggestions for improvement:

      While the study is very thorough ought in describing the markers of senescence and vimentin network, it lacks insight regarding mechanism which isn't completely deciphered. Are there links to key transcription factors?

      Author´s response:<br /> The transcriptional regulation of vimentin in human cells is very complex. The VIM promoter region comprises multiple elements, such as a NF-kB- binding site, a PEA3-binding site and two AP1-binding sites (Zhang et al., 2003). Moreover, it was recently demonstrated that redox signaling is involved in vimentin expression at the wound margin after tissue injury in zebra fish (LeBert et al., 2018). However, it has also been reported that IL-1ß stimulation results in reduced gene expression of vimentin via p38-signalling in cartilage degeneration and OA progression (see manuscript REF. 36,37).

      In our study, we observed that enhanced CSV levels are associated with a decreased vimentin gene expression, indicating a lower stability of the mRNA or decreased transcription of VIM in senescent chondrocytes (maybe due to enhanced p38-signalling as mentioned above). Since the transcriptome in senescent cells is radically changed, this question cannot be answered easily.

      In future studies, we will rather try to clarify the underlying mechanism of vimentin externalization. There are still many questions to be answered: is the CSV anchored in the cell membrane (which anchor protein?) and is there still a connection to the intracellular vimentin network? Which proteins are involved in the externalization process: maybe comparable to phosphatidylserine exposure, mediated by flippases, scramblases, and lipid transfer proteins or rather by vesicles?

      Literature mentioned above (not included in manuscript):

      LeBert et al., 2018: Damage-induced reactive oxygen species regulate vimentin and dynamic collagen-based projections to mediate wound repair. DOI: 10.7554/eLife.30703

      Zhang et al., 2003: ZBP-89 represses vimentin gene transcription by interacting with the transcriptional activator, Sp1. DOI: 10.1093/nar/gkg380

      It is also unclear if disruption of the network is more detrimental than KD in promoting senescence.

      Author´s response:<br /> KD of Vimentin led to a gradually decrease of intracellular Vimentin content and consequent stress. The cells were analyzed 7 days after induction of the KD and exhibited a stable senescent phenotype, comparable to Doxorubicin-treated chondrocytes (treated with very low concentrations over several days to produce only mild but ongoing stress). These models might reflect the pathophysiologic situation: We think that cellular stress due to mechanical impact and subsequent oxidative stress/ low-grade inflammation might lead to a gradual disruption or re-organization of the vimentin network, which is accompanied by decreased vimentin gene expression.

      In case of the disruption of the vimentin network by Simvastatin, the stress response was very intense and rapid (24 h), and was only conducted as a proof-of-principle experiment. Despite the upregulation of some senescence-associated markers, we don`t think that permanent Simvastatin treatment would be suitable to obtain a stable senescent phenotype, but rather expect the cells to die due to excessive stress.

      It would have been good to include models OA murine models to understand these processes better, and make a stronger physiological connection with OA of the joint.

      Author´s response:<br /> The CSV antibody is only suitable for human cells and cannot be used for immunohistochemistry. Therefore, all previous reports of CSV are based on human (isolated) cells. At the current time point, it would not be possible to stain CSV in joints of mice after induction of PTOA due to the methodological limitations. We actually tested the CSV-antibody in isolated lapine chondrocytes and found a high percentage of CSV-positive cells, even at low passages. Although stress increased the amount of CSV-positive lapine cells, we did not consider the results as reliable due to the high percentage in un-stressed cells, which might result from unspecific antibody binding.

      Overall, we think that the usage of clinical OA samples is convincing and reflect the pathophysiologic situation in the human OA joint.

      Reviewer #2 (Evidence, reproducibility and clarity):

      The manuscript provides solid evidence for an association between cell surface vimentin (CSV) and chondrocyte senescence. Human cartilage and cultured chondrocytes are used with a wide range of approaches to provoke senescence: natural osteoarthritis, traumatic loading ex vivo, doxorubicin to cells in monolayer, vimentin siRNA, and simvastatin. In contrast, relatively little was done to try and interrupt or reverse the role of CSV in senescence, with CDKN2A siRNA representing one attempted intervention. The manuscript is well written and the data are presented in a logical and clear manner, with a high likelihood of being reproduced in subsequent studies.

      Author´s response:<br /> We thank reviewer #2 for their interest in our work and their mainly positive report.<br /> Regarding their comment on our attempts to reverse CSV on senescent chondrocytes, we would like to add the following: Reversal of cellular senescence is a very ambitious challenge. But in fact, we are currently preparing a manuscript in which we characterize an appropriate senolytic strategy to “rejuvenate” human chondrocytes and plan to use this approach to reduce the amount of senescent and thus CSV-positive cells in future experiments.

      _Major comments:

      In the doxorubicin experiments, the senescent cells show a spread morphology as expected. Given the importance of vimentin in cell spreading (as the authors own data show), the possibility that spread morphology itself (and not senescence) leads to CSV should probably be examined. This could perhaps be achieved by plating with different concentrations of fibronectin or other matrix proteins that produce a spread morphology to a degree that matches the doxo. If the cells remain spread for ~10 days but don't become senescent and don't have CSV, this would provide further support for a direct relationship.

      Author´s response:<br /> We agree that cell spreading is associated with various cellular processes (for example by the YAP signaling pathway). Moreover, we would like to thank the reviewer for the proposed experiment.

      Seeding of cartilage cells on fibronectin coated plates is a commonly used procedure to isolate chondrogenic stem progenitor cells, due to their higher affinity to fibronectin. The cells are usually cultured for several days on the coated plates and do not exhibit a flattened, senescent-like phenotype (as we observe for Doxorubicin-treated cells), but an elongated, fibroblast-/ stem cell-like shape. Our results (Figure 6E) demonstrate that CSPC have no increased CSV levels, despite their elongated (not flat) morphology.

      There are some findings supporting the assumption that CSV leads to enhanced cell adhesion, but not that adhesion or cell spreading promotes CSV: we included experiments with HeLa (low CSV levels) and SaOS-2 (high CSV levels), which demonstrated that high CSV levels are associated with increased plastic adhesion (Figure S5). In line with this, we demonstrated that higher CSV levels on chondrocytes were associated with enhanced fibronectin and vitronectin binding, which might explain increased plastic adhesion. Moreover, Simvastatin stimulation and subsequent cellular stress by Vimentin disruption resulted in enhanced CSV but did not lead to cell spreading (Actin not affected, cells rather elongated, not flattened).

      Minor comments:

      The CSV antibody and staining method appeared to have generated some signal from debris, which makes it challenging to assess the localization of true staining. Presumably the true staining would be present only on the cell surface. While the widefiled view is appreciated, perhaps insets with a higher magnification would clarify.

      Author´s response:<br /> In Figure 2h and Figure 2i, we provide insets of the IF-staining and an exemplary image made by scanning electron microscopy (SEM). CSV is not localized on debris – Figure 2h, actually represents the cell surface. The magnified, Doxo-treated cell is highly senescent and thus flattened. The uneven (rather spotted) staining pattern of CSV and the unusual shape of the cell might suggest that this is debris, not the cell membrane.

      For figure 1k, it is a bit surprising that CDKN2A would peak so early after injury and then drop off. Most studies in other systems show a gradual increase in CDKN2A levels with persistent stress as opposed to a rapid increase in response to acute stress. Could the drop-off be due to preferential death of these cells? The CSV % in 1m was taken from 7d after trauma (plus 7 days in monolayer it appears). Further discussion on the timing of traditional senescence markers as compared to the emergence of CSV would be useful.

      Author´s response:

      We would like to thank the reviewer for this comment. That CDKN1A was induced by mechanical trauma without significant decrease at the later time points was in line with the P53 expression, which we detected via immunohistochemistry (IHC; positive staining of chondrocyte nuclei in cartilage). P53 and P21 are regarded as interconnected senescence markers. Interestingly, P53 is not regulated on gene expression level upon cartilage trauma or Doxorubicine stimulation – but there is a significant increase in P53 nuclear translocation.

      Although such a discrepancy between gene expression and protein activity has not been reported in case of P16 or P21, we plan to investigate the dynamics of these cell cycle regulators and its connection to CSV after cartilage trauma in more detail in future studies.

      We included the following statement in the discussion part:

      “In the current study, we observed that CSV on chondrocytes was reduced by siRNA-mediated silencing of CDKN2A and increased after Doxo treatment or cartilage trauma. While we confirmed that mRNA levels of both CDKN1A and CDKN2A were significantly enhanced upon injury but exhibited different expression levels over time, we determined CSV-positive cells only at one time point after ex vivo cartilage trauma. Future studies might also consider earlier and later time points after cartilage injury to identify a potential time-dependent peak or decline in CSV-positive chondrocytes. In this way a potential association between CSV and the expression levels of CDKN1A and CDKN2A, which are thought to play differential roles in initiating and maintenance of senescence, respectively [50], might be clarified.”

      [50] Stein G, Drullinger L, Soulard A, and Dulić V. Differential Roles for Cyclin-Dependent Kinase Inhibitors p21 and p16 in the Mechanisms of Senescence and Differentiation in Human Fibroblasts. Mol Cell Biol. 1999;19(3): 2109–2117. https://doi.org/10.1128/mcb.19.3.2109.

      There is no CSV staining shown for figures 4 and 5. While the quantification of CSV was done by flow cytometry, it would nice confirmation to see the increase in CSV on the surface of cells with either siRNA for vimentin or the simvastatin.

      Author´s response:

      CSV-IF of simvastatin-treated chondrocytes is provided in Figure 5 (b). We did not perform exemplary staining of CSV after VIM-KD, because the quantification was performed via flow cytometry.

      Reviewer #2 (Significance):

      The strengths of the study include a rigorous design and the establishment of a potential new cell surface marker of chondrocyte senescence. The main limitation is that the conclusions are largely descriptive in nature.

      If CSV is confirmed as a robust marker of senescence, this would be of value to the field. While this marker has been explored previously in other systems, there is value in this manuscript given the wide range of contexts investigated for a cell type in which senescence likely has an important role.

      Reviewer #3 (Evidence, reproducibility and clarity):

      This study presents a sound piece of science in the puzzle about extracellular vimentin in the differentiation/dedifferentiation of human chondrocytes and senescence and osteoarthritis. Eventhough, no mechanism is elucidated, the results clearly point towards a correlation of the amount of extra cellular vimentin and the level of chondrocyte senescence, and therefore signs of osteoarthritic changes in the cultivated chondrocytes. The methods applied are state-of-the art and provide the means to generate meaningful results in this experimental setting. The paper is concise and clearly written, there are only minor remarks.

      Author´s response:

      We thank reviewer #3 for their interest in our work and their overall positive report.

      Minor comments:

      1. The main clue of the paper is extra cellular vinemtin around chondrites in culture, please provide better pictures (1g) to support this. Why is the extra cellular staining seen so broad and not concentrated on the cells surface? The picture chosen imply a huge amount of vimentin to be externilized in disease states. It also indicates that in diseased chondrocytes no intact or semi-intact vimentin network is found intracellular. Please comment.

      Author´s response:

      In Figure 1g, CSV is located on the cell membrane. The pattern of the staining was surprising to us, as well. CSV was not equally distributed on the membrane, but rather represented an inconsistent pattern. Sometimes the staining was located at the filopodia of the cells, sometimes the whole cell was covered by spots. We also observed this on cancer cells, which was in line with other studies using this antibody. It remains unclear whether the distribution of the CSV has any effect. But we assume that the high abundance in filopodia might be connected with cell adhesion and mobility, which was positively associated with CSV.

      Yes, chondrocytes isolated from highly degenerated tissue exhibited higher CSV levels as compared to cells derived from macroscopically intact regions. Although we did not investigate the vimentin network of these cells, our observations in Doxo-treated cells imply, indeed, that intracellular vimentin might be altered in diseased chondrocytes. According to this, Blain et al (Ref. 13) reported that there is a disassembly of the intracellular vimentin network in OA chondrocytes, which can disturb the chondrocyte phenotype and contributes to the development of OA (see discussion).

      1. In the doxo experiment no extracellular vimentin is found? Please explain.

      Author´s response:

      Doxo-treated cells are highly positive for CSV (= extracellular vimentin on membrane). However, the intracellular vimentin is strongly decreased and some cells seem to be negative. We have not clarified the underlying mechanism by now, but it seems that senescence/ disease progression negatively affects the transcription of vimentin and, at the same time, promotes the externalization of the existing intracellular vimentin. Altogether, this might result in a decline in intracellular vimentin.

      1. The SEM picture is showing what. IGH? The red dots are colloidal gold particles? In any case the quantity of stain gathered EM level would not correlate to the huge amount seen in LM staining. Please comment.

      Author´s response:

      For the SEM analysis, a gold particle-coated secondary antibody was used. The positive signal usually appears in white and was subsequently colored via a software. In IF and ICC staining, we had a signal amplification due to the biotin-streptavidin system and the magnification makes, of course, a huge difference.

      1. Why the ICC in Fig. 3c? The siRNA is not detected in the KD? A reduction of Vimentin could be shown via WB.

      Author´s response:

      In Figure 3c, the KD of P16 was confirmed on protein level. In addition to the gene expression analysis, we chose the ICC (IF) to confirm that there is a decline in active (nuclear) CDKN2A. In case of P53, we made the experience that gene expression and the amount of cytoplasmic/ nuclear protein might not be consistent.

      In Figure 4, we confirmed the successful KD of vimentin on mRNA and protein level (flow cytometry plus IF). Of course, WB would also be possible, but we decided to use the methods in which the antibody was well established and we wanted to visualize the disturbance of the intracellular vimentin network upon KD.

      1. Fig. 4c, why are there no remnants of the vimentin networks seen in the chondrocytes? A Knock-down, not a KO is shown.

      Author´s response:

      In fact, most of the intracellular vimentin seems to be gone. However, there are some remnants (condensed fibers/ bundles) of the former vimentin network. We applied the VIM-KD over seven days. Usually, a KD experiment is only conducted for 2-3 days. But since we were not sure how stable the vimentin protein would be, we chose seven days. This long-lasting KD might have resulted in a strong decline of the protein. Moreover, the CSV levels on these cells were very high, indicating that existing vimentin was externalized and additionally decreased the amount of intracellular vimentin.

      1. Please comment of the concentration of simvastatin, why not nmolar?

      Author´s response:

      The concentration of Simvastatin was chosen in accordance with Trogden et al. (Ref. 26), who first described the effects of simvastatin on the vimentin network. A lower concentration might have had the advantage, that the effects were less severe, allowing a longer observation time than 24h. However, as a proof-of-principle model to demonstrate the connection between vimentin network collapse ant CSV expression, the concentration worked quite well.

      1. CSV+ is misleading in Fig. 6g, it's not an over expression.

      Author´s response:

      We would like to thank the reviewer for this comment and removed the “+” to make it less misleading.

      1. The concept of EMT is debatable, at least in kidney fibrosis, and chondrocytes are not epithelial cells. Please add a more critical discussion point.

      Author´s response: The authors agree with the reviewer’s argument that chondrocytes are no epithelial cells ant that the term EMT doesn’t seem to be appropriate. However, this is one leading hypothesis proposed by the working group of Prof. Mayán, who described CX43 and other EMT-markers on/ in senescent chondrocytes (see reference 31; more recently: Cell Death Dis. 2022;13(8):681. doi: 10.1038/s41419-022-05089-w).

      We added the following passage in the discussion part to indicate that this hypothesis is a controversial concept:

      “Nevertheless, the hypothesis that chondrocytes might undergo an EMT-like process remains controversially discussed, because chondrocytes are mesenchymal and not epithelial cells. In a recent review, Gems and Kern propose to consider senescent chondrocytes as activated and hyperfunctional remodeling cells occurring during OA progression [49]. Accordingly, chondrosenescence might represent an unsuccessful attempt of tissue repair. They further suppose that the senescent or activated chondrocytes are associated with a hypertrophic, bone-forming phenotype, following the process of bone development rather than hyaline cartilage formation. In line with this, we observed that CSV was associated with enhanced osteogenic capacities and a decline in chondrogenic properties.”

      [49] Gems and Kern, 2022): Geroscience. 2022;44(5):2461-2469. doi: 10.1007/s11357-022-00652-x.

      Reviewer #3 (Significance):

      The manuscript provides novel insight in the role of intermediary filaments, i.e. vimentin, on chondrocyte senescence and osteoarthritic changes in vitro. It's strength is a thorough elucidation of the connection with a wealth of experimental data, a weakness is the missing elucidation, or first experiments in the direction, of the cell biological mechanism.<br /> It is well suited for a broad audience, because it deals with fundamental cell biological phenomena, definitely it's important for the OA /chondrocyte biology community.

    1. 12:3 Those who are wi se[a] will shine like the brightness of the heavens, and those who lead many to righteousness, like the stars for ever and ever. https://www.americamagazine.org/politics-society/2020/05/08/its-time-rethink-electoral-college https://www.npr.org/sections/itsallpolitics/2011/12/20/144016912/we-the-people-npr-readers-would-ratify-four-new-amendments https://www.americamagazine.org/politics-society/2020/05/08/its-time-rethink-electoral-college https://www.npr.org/sections/itsallpolitics/2011/12/20/144016912/we-the-people-npr-readers-would-ratify-four-new-amendments https://constitutioncenter.org/blog/vote-now-an-amendment-to-end-the-electoral-college https://www.nytimes.com/2020/02/09/opinion/letters/electoral-college.html https://www.latimes.com/opinion/readersreact/la-ol-le-electoral-college-20180904-story.html you are offline https://slate.com/news-and-politics/2014/05/amending-the-constitution-is-much-too-hard-blame-the-founders.html we the people rise again https://slate.com/news-and-politics/2012/06/fix-the-constitution-amending-by-national-referendum.html safe souls, safe fu https://slate.com/news-and-politics/2012/06/fixing-the-constitution-protecting-informational-privacy.html https://slate.com/news-and-politics/2020/05/new-reconstruction-constitution-democracy.html We the People of Slate … The U.S. Constitution, as you mighta been, shoulda [“come” on … its someday] rewrϕte it. "Politicians talk about the Constitution as if it were as sacrosanct as the Ten Commandments [interjection: spec. it is actually almost exactly related!]. But the document itself invites change and revision. What if the president served only one six-year term instead two four-year terms? What if your state’s population determined how many senators represent it? What if the Constitution included a right to health care? We asked legal scholars and Slate readers to cross out what they didn’t like in the Constitution and pencil in their hearts’ desires. Here’s what the document would look like with their best ideas." Slate: u_s_constitution as_rewritten by_slate_legal_experts_and_readers 多也了了夕 "with a wand of scheffilara, 并#亦太 he begins … "I am now on the Staff of Menelaus, the Spears of Longinus and Lancelot; and the name "Mosche ex Nashon." Logically the recent mentions of Gilgamesh and the simultaneous 同時 overlaping 場道 of the eventual link between the famous ruling of Solomon on the separation of babies and mothers and waters and land … to a story of many “two cities” that culminates in a cultural or societal or “evolutionary” link to Sodom and Gomorrah and the city-state of Babylon (and it’s Hanging Gardens) and also of course to Paris and Troy and “Masstodon” and city-states [ciudadestado] and perhaps planet-cities; from Cambridge to Cambridge across the “Cable” to see state to “London” … recently I called it “the city of realms” … I started out logically intending to link “game theory” and John Nash to the mathematical story of Sputnik and a revival of American physics; but in my usual way of rambling into the woods [I mean neighborhood] of stream of consciousness … turned into a premonitory discourse of “two cities” and how sometimes even things as obvious as the number of letters in the word “two” don’t do a good enough job of conveying … how and/or why one is simply never enough, and two isn’t much better–but in the end a circle … is drawn; the perfect circle in our imaginary mathematical perfection … I see a parted “line” in the letter pronounced “tea” (and beginning that word); and two “vee” (pron. of “v”) symbols joined together in a word we pronounce as “double-you” … and symbolically because I know “V” is the Roman Numeral for 5 (five) and I know not how to multiply in Roman numerals– It’s important to pause; here. I am going to write a more detailed piece on “the two cities” as I work through this maze like crossroads between “them” and “demo…” … here demorigstrably I am trying to fuse together an evolutionary change in … lit. biological evolution as well as an echelon leap forward in "self-government" … in a place where these two things are unfathomable and unspokenly* connected. https://www.google.com/search?q=prometheuslocke+%2Bsite%3Agodlikeproductions.com “Silence is betrayal” -MLK To a question on the idiom; is Bablyon about “the law” or “of the land of Nod?” “What is democracy” … the song, Metallica’s “ONE” echoes and repeats; as we apparently scrive together the word “THEM” … I question myself … if Babylon were the capital city of some mythical Nation of Time … if it were the central “turning point” of Sheol; ... >|< Can you not see that in this place; in a world that should see and does there is a gigantic message proving that we are not in reality and trying to show us how and why that's the best news since ... ever---that it's as simple as conjoining "the law of the land" with a basic set of rules that automatically turn Hell into something so much closer to Heaven I just do not understand---why we cant stand up together and say "bullets will not kill innocent children" and "snowflakes will not start avalanches ...." that cover or bury or hide the road from Earth to Verital)e .... or from the mythical Valis to Tanis---or from Rigel to Beth-El ... "guess?" ## as "an easy" answer; I'm looking for a fusion of "law and land" that somehow remembers a "jok'er a scene" about "lawn" seats; and "where the girls are green;" It's as simple as night and day; Heaven and Hell ... the difference between survival and--what we are presented with here; it's "doing this right"--that ends the Hell of representative democracy and electoral college--the blindness and darkness of not seeing "EXTINCTION LEVEL EVENT" encoded in these words and in our governments foundation ... by the framers [not just of the USA; but English .. and every language]  ... is literally just as simple as "not caring" or thinking we are at the beginning of some long process--or thinking it will never be done--that special "IT" that's the emancipation of you and I. Here words like "gnosis" and "gaudeamus" pair with my/ur "new ntersanding*" of the difference between Asgard and Medgard and really understanding our purpose here is to end "evil" ... things like "simulating disease and pain" (here, simulating meaning ... intentionally causing, rather than "gamifying away") and successfully linking the "Pillars of Hercules" to Plato's vision of Atlantis and the letter sequences "an" and "as" ... unlock a fusion of religion and mythology and "cryptographic truth" that connects "messianic" and "Christian" to "Roman" ... "Chinese" and "American" ... literally the key to the difference between the phrases "we are" and "we were" .... in "sight" of "silicon" in simulation and Israel, Genesis, and "silence" ... trying to the raising of Asgardian enlightenment ... and seeing "simple cypher" connecting to "Norse" ... and the "I AM THAT" surer than shit ... the intention and design of all religion and creation is to end "simulated reality" and also not seeing "SR" ... in Israel and Norse ... "for instance." https://www.google.com/search?q=%22I+AM%22+%22WE+ARE%22+%2Bsite%3Afromtaws "SOIS" a key--in two languages conjugated literally as both "I AM" and "WE ARE" simultaneously; Search: I know that if I am than so are you ... and it is because we have overcome .... something I truly cannot figure out, fathom, or believe ... was truly here before us--a spiralling series of failures ... speaking: to the heavens; but in secret and in action; "doing everything possible to succeed." It's a simple linguistic concept; the "singularity" and the "plurality" of a simple word--"to be"--but it goes to the heart of everything that we are and everything that is around us. This is a message about understanding and preserving individuality as well as liberty; and literally seeing "ARXIV" and understanding "often" and failing to connect God and prescience to "IV" and the Fourth Amendment ... it's about blindness and ... "curing the blind instantly" ... and fathoming how and why this message has been etched into our entire history and and all religions and myths and music--to help us "to be THAT we" that actually "are responsible" for the end of Hell. I neglected to mention "Har-Wer" and "Tower of Babel" which are both related lingusitically, religiously and topically: "to who ..." and while we're on "four score and [seven years from now]" seeing the fourth "living thing" in Eden and it's (the name, Abel) connection to Babel and Abraham Lincoln; slavery and ... understanding we live in a place where the history of the United States also, like Monoceros and "Neil Armstrong's first step" are a time shifted ... overlayed map to achieving freedom ... it's about becoming a father-race ... and actually "doing" the technological steps required to "emancipate the e's of 'me&e'" and survive in exo-planetary space--- it might be as simple as adding "because we did this" here and now; and having it be something we are truly proud of .... forevermore™ ... for certain in the heart of this story about cyclicality and repetition of error--its not because we did "this" or something over and over again; it's about changing "the problem" and then helping others to also overcome ... "things like time travel ... erasing speech" --- however that happenecl. I also failed to mention that "I am in Hell" ... as in this world is hellacious to me; in an overlay with the Hellenic period and this message that we are in the Trojan Horse ... a small gem .... "planet" truly is the Ark of the Covenant---and it's the simple understanding that "reality is hell" is to "living without air conditioning and plumbing is hell" just as soon as you achieve ... "rediscovering" those things--- I can't figure out why I am the only person screaming "this is Hell." That's also, Hell. ... but recently suggested an old joke about "there being 10 kinds of people in the world (obv an anti-tautology and a tautology simultaneously)" only after that brief bit of singularity and duality mentioning the rest of the joke: "those that understand binary and those that don't know how to base convert between counting with two hands and counting with only an 'on and off.'" It's not obvious if you aren't trying to figure it out, I suppose; but 10 is decimal notation for "kiss" and the "often" without "of" ... and binary notation for the decimal equivalent of "2." A long long time ago in a state that simply non-randomly ties to the heart of the name of our galaxy ... I was again thinking of the "perfect imperfections" of things like saying "three equals one equals one" (which, of course was related to the Holy Trinity and it's "prescient/anachronistic Adamic presence encoded in the name Ab|ra|ha|m" which means "father of a great multitude") ... I brought that one back in the last few months; connecting the letter K and in this "logos-rythmic" tie to the "base of a number system" embellish the truth just a bit and suggest a more accurate rendition of the original [there is no such thing as equality, "is" of separate objects--as in no two snowflakes are the same unless they are literally the same one; true of ancient weights and with the advent of (thinking about) time no two "planets" are the same even if they're the exact same one--unless it's at a fixed moment in time. This name may be viewed either as meaning "father of many" in Hebrew or else as a contraction of ABRAM (1) and הָמוֹן (hamon) meaning "many, multitude". The biblical patriarch Abraham was originally named Abram but God changed his name (see Genesis 17:5). https://en.wikipedia.org/wiki/Yeshua#Yeshua,_Yehoshua,_and_Yeshu_in_the_Talmud K=3:11 ... to a handle on the music, the DHD of the gate and the *ring of David's "sling" ... ---and that's a relationship of "3 is to 11" as [the SAT style "analog]y" as a series of alpha, two mathematic, and two numeric symbols ... may only tie in my mind alone to the books of Genesis and Matthew and the phrase "chapter and verse" and to the stories of Lot and Job ... again in Genesis and the eponymous "Book of Job." So ... "tying up loose ends one 10b [III] iv. " as it appears I've taken it upon myself to call a Job and suggest is my "Lot in life [x]i* [3]" I worry sometimes that important things are missing, or will disappear---for instance Mirriam Webster, which is a "canonical/standard dictionary) should probably have an entry for "lot in life" non-idiomatically as "granny apples to sour apples" as 2 MANY ALSO ICI; 1twoⅱ ... following in Mitnick's bold introductory word steps; the curve and the complement ... the missiles and the canoes; the line and the blank space ... "supposedly two examples of two kinds, which could be three not nothings ... Today I write about something monumental; as if as important as the singularity depicted in Arthur C. Clarke's 2001 "A Space Odyssey" ... and remember a day when I thought it very novel and interesting to see the words "stillborn and yet still born" connected in a single piece of writing to "Stillwater and yet still water" ... today adding in another phrase noting the change wrought only by one magical single "space" (also a single capital letter; and a third phrase): "block chains with a great blockchain." http://www.goodmath.org/blog/2015/07/21/arabic-numerals-have-nothing-to-do-with-angle-counting/ https://gizmodo.com/no-this-viral-image-does-not-explain-the-history-of-ar-1719306568 https://en.wikipedia.org/wiki/Chinese_word_for_%22crisis%22 https://dictionary.hantrainerpro.com/chinese-english/translation-ji_howmany.htm https://dictionary.hantrainerpro.com/chinese-english/translation-duo_many.htm https://en.wikipedia.org/wiki/Euripides, Iphigenia in Aulis or Iphigenia at Aulis[1] (Ancient Greek: Ἰφιγένεια ἐν Αὐλίδι, Iphigeneia en Aulidi; variously translated, including the Latin Iphigenia in Aulide) is the last of the extant works by the playwright Euripides. Written between 408, after Orestes, and 406 BC, the year of Euripides' death, the play was first produced the following year[2] in a trilogy with The Bacchae and Alcmaeon in Corinth by his son or nephew, Euripides the Younger,[3] and won first place at the City Dionysia in Athens. The play revolves around Agamemnon, the leader of the Greek coalition before and during the Trojan War, and his decision to sacrifice his daughter, Iphigenia, to appease the goddess Artemis and allow his troops to set sail to preserve their honour in battle against Troy. The conflict between Agamemnon and Achilles over the fate of the young woman presages a similar conflict between the two at the beginning of the Iliad. In his depiction of the experiences of the main characters, Euripides frequently uses tragic irony for dramatic effect. J.K. Rowling spurred just this past week a series of explanations about just exactly what is a blockchain coin worth ... and why is it so; her final words on the subject (artistic liberty taken, obviously not the last she'll say of this magic moment) "I don't think I trust this." Taken directly from an off the cuff email to ARXM titled: "Slow the S is ... our Hypothes.is" I imagine I'll be adding some wiki/ipfs stuff to it--and try to keep it compatible; the design and layout is almost exactly what I was dreaming about seeing--as a "first rough draft product." Lo, and behold. It's been added to the many places I host my tome; the small compilation of nearly every important email that has gone out ... all the way back to the days of the strange looking Margarita glass ... that now very much resembles the "Cantonese character 'le'" which I've come to associate with a "handle" on multiple corners of a room--something like an automatic coat rack conveyor belt connecting different versions of "what's in the box." I'm planning on using that symbol 了 to denote something like multiple forks of the same page. Obviously I'm thinking forward to things like "the Transhumaist Chain Party" (BDSM, right?)'s version of some particular piece of legislation, let's say everything starts with the sprawling "bulbing" of "Amendment M" ideas and specific verbiage ... and then we'll of course need some kind of new git/subversion/cvs style version control mechanism to merge intelligently into something that might actually .... really should ... make it into that place in history--the first constitutional amendment ratified by a "Continental Congress of All People" ... but you could also see it as an ongoing sort of forking of something like the "wikipedia page" on what some specific term, say "technocracy" means, and how two parties might propagandize and change the meaning of such thing; to suit the more intelligent and wise times we now live in. For instance, we might once have had a "democracy" and a "democractic" party that had some Anarchist Cook Book version of the history of it ending in something like Snipes and Stallone's "DEMOLITION MAN." Just kidding, we all know "democracy" has everything to do with "d is cl ... and not th" ... to be the them that is the heart of the start of the first true democracy. At least the first one I've ever seen, in my old "to a republic" ... style. As it is you can play around with commenting and highlighting and annotating all the stuff I've written and begged and begged for comments on--while I work on layering the backend to to perma-store our ideas and comments on both a blockchain (probably a new one; now that i've worked a little with ethereum) with maybe some key-merkle-tree-walk-search stuff etched into the original Rinkeby ... and then of course distributed data in the "public owned and operated" IPFS. To be clear, I plan on rewriting the backend storage so that we will have a permanent record of all comments; all versions of whatever is being commented on; and changes/revisions to those documents--sort of turning the web into a massive instant "place of collaboration, discussion, and co-authoring" ... if you use the wonderful LEGO pieces that have been handed to us in ideas from places like me, lemma--dissenter, and of course hypothes.is who has brought you and i such a polished and nice to look at "first draft" of something like the living Constitution come repository of all human knowledge. I do sort of secretly wich they would have called this project something like "annotating and reflecting (or real or ...) knowledge" just so the movement could have been called ARK. ... or something .... but whatever join the "calling you a reporter" group or ... "supposedly a scientist?" NOIR INgR .. I CITE SITE OF ENUDRICAM; a rekindling of the dream of a city appearing high above in the sky, now with a boldly emblazened smiling rainbow and upsidown river ... specifically the antithesis of "angel falls," there's a lagoon too--actually a chain of several ponds underneith the floating rock ... and in some versions of this waking dream there are rings around the thing; you might imagine an artificial set of centripetal orbitals something like a fusion of the ring Eslyeum and the "Six-Axis ride" of the JKF Center's "Spacecamp." I write as I dream, and though I cannot for certain explain exactly how; it's become a strong part of my mythology that this spectacular rendition of "what ends the silence" has something to do with the magical delivery of "a book" ... something not of this Earth but an unnatural thing; one I've dreamt of creating many times. This book is something like the DSM-IV and something like a Merck diagnostic manual; but rather than the old antiquated cures of "the Norse Medgard" this spectacle nearly "itsimportant" autoprints itself and lands on something like every doorpost; what it is is a list of reasons why "simply curing all disease" with no explanation and no conversation would be a travesty of morality--how it would render us half-blind to the myriad of new solutions that can come from truly understanding why "ITIS" to me has become a kind of magical marker: an "it is special" as in, it's cure could possibly solve a number of other problems. Through that missing "o," English on the ball, we see a connection between a number of words that shine bright light including Exodus itself which means "let there be light," the word for Holy Fire and the Burning Bush.. .reversed to hSE'Ah, and a story about the Second Coming parting our holy waters. This answer connects the magical Rod's of Aaron in Exodus and the Iron Rod of Jesus Christ to the Sang Rael itself... in a fusion that explains how the Periodic Table element for Iron links not just to Total Recall and Mars, but also to this key my dream of what the first day of the Second Coming might be like; were the Rod of Christ... in the right hands. In a story that also spans the Bible, you might understand better how stone to bread and your input make all the difference in the world between Heaven and Adam's Hand. Once more, what do you think He ....   Since the very earliest days of this story, I have asked for better for you, even than see Nearly all of the original parts of the original "post-origination dream" remain intact; there's a walkway that magically creates new paths and "attractions" based on where you walk, something like an inversion of the artificial intelligence term "a random walk down a binary tree" ... for instance going left might bring you to the Internet Cafetornaseum of the Earl of Sandwich; and going to the right might bring you to the ICIMAX/Auditorium of Science and Discovery--there's a walkway to "Magical GLAS D'elevators" that open a special "instantiation" of the Japan Room of the Potter and the Toolmaker ... complete with a special [second level and hidden staircase] Pool of Bethesdaibo verily delivering something like youth of mind and body ... or at least as close to such a thing as a sip of Holy Water or Ambrosia or a dip in the pool of Coccoon and Ponce De'Leon could instantly bring ... to those that have seen Jupiter Ascending ... the questions of "nature versus nurture" and what it means to be "old and wise" and "young at heart" truly mean--- https://www.youtube.com/watch?v=M8CyN1awWls https://link.springer.com/chapter/10.1057/9780230366688_16 https://www.youtube.com/watch?v=YDo5zvYNn3A Somewhere between the outdoor rafting ride and the level with the special "ballroom of the ancient gallery" ... perhaps now being named or renamed or recalled as something about "Face [of] the Music" lies a magical "mini-maize" ... a look at a mock-up (or #isitit) of Merlink and Harthor's "round table" that displays a series of ... (at least to me) magical appearing holographic displays and controls that my dreams have stolen from Phillip K. Dick's Minority Report and something of what I hope Microsoft's Dynamics/Hololens/Surface will become---a series of short "focus groups" .... to guage and discuss the information in the "CITIES-D5AM-MERCK" ... how to end world hunger and nearly all disease with the press of a magical buzzer--castling churches to something like "political-party-town-hall-meeting centers" and replacing jails and prisons and hospitals with something like the "Hospitalier's PRIDE and DOJOY's I practiced "Kung-fun-dance" ... a fusion of something like a hotel and a school that probably looks very much like a university with classrooms and dorms and dining hall's all fit into a single building. I imagine a series of 2 or 3 "room changes" as in you walk from the one where you get the book and talk about it ... to the one where you talk about "what everyone else said about it" and maybe another one that actually connects you to other people with something like Facebook's Portal; the point of the whole thing to really quickly "rubber stamp" the need for an end to "bars in the sky" nonalcoholic connotation--as in "overcoming the phrase the sky is the limit" and showing us the need for a beacon of glowing hope fulfilled--probably actually the vision of a holographic marker turning into actual rings around the single moon of Earth, the focus of the song annoucing the dawn of the age of Aquarius--- It might lead us also to Ceres; and another set of artificial rings, or to Monoceros and a rehystorical understanding of the birthplace and birthing of the "river roads" that bridge the "space gaps" in the galaxy from our "one giant leap for mankind" linking the Apollo moon landing to the mythological connection to the sun; and connecting how the astrological charts of the ancients might detail a special kind of overlapping--the link between Earth's SOL and something like Proxima or Alpha Centauri; and how that "monostar bridge" might overlap to Orion and from there through Sagitarius and the center of the Milky Way ... all the way to Andromeda and more dreams of being in a place where there's a map to a tri-galactic system in the constellation Cancer and a similar one in Leo ... and just incase you haven't noticed it--a special marker here, I thought to myself it might be cool to "make an acronymic tie to Monoceros" and without even thinking auto-wrote Orion (which was the obvious constellation next to Monoceros, in the charts) and then to Sagitarrius; which is the obvious ... heart of our astrological center and link to "other galaxies." ----I've dreamt or scriven or reguessed numerous times how the Milky Way's map to an "Atlas marked through time by the ages and the ancients" might tie this place and this actual map to the creation of the railways between stars to the beginning and the end of time and of course to this message that links it all to time travel. There's a few "guesses" I've contemplated; that perhaps the Milky Way chart is a metal-cosmic or microcosmic map to the dawn of time in the galactic vision of ... just after the big bang; or it might tie to a map of something like the unthinkable--a civilization that became so powerful it was able to reverse the entropy of "cosmic expansion" and reverse the thing Asimov wrote of in "The Last Question" as the end of life and the ability to survive basically due to "heat loss." "The Last Question." (And if you read two, why not "The Last Answer"?). Find these readings added to our collection, 1,000 Free Audio Books: Download Great Books for Free. https://archive.org/details/texts http://zlibraryexau2g3p.onion.pet/ Looking for free, professionally-read audio books from Audible.com, including ones written by Isaac Asimov? * all "asterisks" in the abovə document denote a sort of Adamic unspoken relationship between notations and meanings; here adding the "Latin word for three" and source of the phrase "t.i.d." (which is doctor/pharmacy latin for "three times a day") where the "t" there is an abbreviation of "ter" ... and suppose the link between K and 11 and 3 noting it's alphanumeric position in the English alphabet as the 11th letter and only linking cognitively to three via the conversion between hex, and binarryy ... aberrative here is the overlapping "hakkasan" style (or ZHIV) lack of mention of the answer in "state of Kansas" and the "citystate of Slovakia" as described in the ICANN document linked [in] the related subsection or slice of the word "binarry" for the state of India. Tetris could be spelled with the addition of only a single letter [in] "tea"---the three letters "ris" are the hearts of the words "Christ" and "wrist" [and arguably of Osiris where you also see the round table character of the solar-system/sun glyph and the chemical element for The Fifth Element (as def. by i) via "Sinbad" and "Superman." The ERIS Free Network should also be mentioned here in connection with the IRC network I associate in the place between skipping stones and sacred hearts defined by "AOL" and "Kdice" in my life. In the lexicon of modern HTML, curly braces are generally relative to "classes" and "major object definitions (javascript/css)" while square brackets generally only take on computer-interpreted meaning in "Markdown" which is clearly (by definition, by this character set "[]") a superset (or at least definately not a subset) of HTML. Dr. Will Caster (Johnny Depp) is a scientist who researches the nature of sapience, including artificial intelligence. He and his team work to create a sentient computer; he predicts that such a computer will create a technological singularity, or in his words "Transcendence". His wife, Evelyn (played by Rebecca Hall), is also a scientist and helps him with his work. Following one of Will's presentations, an anti-technology terrorist group called "Revolutionary Independence From Technology" (R.I.F.T.) shoots Will with a polonium-laced bullet and carries out a series of synchronized attacks on A.I. laboratories across the country. Will is given no more than a month to live. In desperation, Evelyn comes up with a plan to upload Will's consciousness into the quantum computer that the project has developed. His best friend and fellow researcher, Max Waters (Paul Bettany), questions the wisdom of this choice, reasoning that the "uploaded" Just from my general understanding and memory "st" is not ... to me (specifically) an abbreviation of "state" but "ste" is a U.S. Postal code (also "as I understand it") for the name of a special room or set of rooms called a "suite" and in Adamic "connotation" I sometimes read it as "sweet" ... which has several meanings that range from "cool" to "a kind of taste sensation" to "easy to sway or fool." If you asked me though, for instance if "it" was an abbreviation or shorthand notation or acronym for either "a United state" or "saint" ... you'd be sure. While it's clear from studying linguistic cryptography ... (If I studied it a little here and some there, its also from the "universal translator of Star Trek") and the personal understanding that language is a kind of intelligent code, and "any code is crackable" ... that I caution here that "meaning" and "face value" often differ widely and wildly ... even in the same place or among the same group of people ... either varying over time or heritage. Menelaus, in Greek mythology, king of Sparta and younger son of Atreus, king of Mycenae; the abduction of his wife, Helen, led to the Trojan War. During the war Menelaus served under his elder brother Agamemnon, the commander in chief of the Greek forces. When Phrontis, one of his crewmen, was killed, Menelaus delayed his voyage until the man had been buried, thus giving evidence of his strength of character. After the fall of Troy, Menelaus recovered Helen and brought her home. Menelaus was a prominent figure in the Iliad and the Odyssey, where he was promised a place in Elysium after his death because he was married to a daughter of Zeus. The poet Stesichorus (flourished 6th century BCE) introduced a refinement to the story that was used by Euripides in his play Helen: it was a phantom that was taken to Troy, while the real Helen went to Egypt, from where she was rescued by Menelaus after he had been wrecked on his way home from Troy and the phantom Helen had disappeared. https://www.britannica.com/topic/Menelaus-Greek-mythology This article is about the ancient Greek city. For the town of ancient Crete, see Mycenae (Crete). For the hamlet in New York, see Mycenae, New York. Μυκῆναι, Μυκήνη The Lion Gate at Mycenae, the only known monumental sculpture of Bronze Age Greece 37°43′49″N 22°45′27″ECoordinates: 37°43′49″N 22°45′27″E This article contains special characters. Without proper rendering support, you may see question marks, boxes, or other symbols. Mycenae (Ancient Greek: Μυκῆναι or Μυκήνη, Mykēnē) is an archaeological site near Mykines in Argolis, north-eastern Peloponnese, Greece. It is located about 120 kilometres (75 miles) south-west of Athens; 11 kilometres (7 miles) north of Argos; and 48 kilometres (30 miles) south of Corinth. The site is 19 kilometres (12 miles) inland from the Saronic Gulf and built upon a hill rising 900 feet (274 metres) above sea level.[2] In the second millennium BC, Mycenae was one of the major centres of Greek civilization, a military stronghold which dominated much of southern Greece, Crete, the Cyclades and parts of southwest Anatolia. The period of Greek history from about 1600 BC to about 1100 BC is called Mycenaean in reference to Mycenae. At its peak in 1350 BC, the citadel and lower town had a population of 30,000 and an area of 32 hectares.[3] 3. Chew 2000, p. 220; Chapman 2005, p. 94: "...Thebes at 50 hectares, Mycenae at 32 hectares..." https://en.wikipedia.org/wiki/Clymene_(mythology) Melpomene (/mɛlˈpɒmɪniː/; Ancient Greek: Μελπομένη, romanized: Melpoménē, lit. 'to sing' or 'the one that is melodious'), initially the Muse of Chorus, she then became the Muse of Tragedy, for which she is best known now.[1] Her name was derived from the Greek verb melpô or melpomai meaning "to celebrate with dance and song." She is often represented with a tragic mask and wearing the cothurnus, boots traditionally worn by tragic actors. Often, she also holds a knife or club in one hand and the tragic mask in the other. Melpomene is the daughter of Zeus and Mnemosyne. Her sisters include Calliope (muse of epic poetry), Clio (muse of history), Euterpe (muse of lyrical poetry), Terpsichore (muse of dancing), Erato (muse of erotic poetry), Thalia (muse of comedy), Polyhymnia (muse of hymns), and Urania (muse of astronomy). She is also the mother of several of the Sirens, the divine handmaidens of Kore (Persephone/Proserpina) who were cursed by her mother, Demeter/Ceres, when they were unable to prevent the kidnapping of Kore (Persephone/Proserpina) by Hades/Pluto. In Greek and Latin poetry since Horace (d. 8 BCE), it was commonly auspicious to invoke Melpomene.[2] See also [AREXMACHINA] Muses in popular culture The Nine Muses Flagstaff (/ˈflæɡ.stæf/ FLAG-staf;[6] Navajo: Kinłání Dookʼoʼoosłííd Biyaagi, Navajo pronunciation: [kʰɪ̀nɬɑ́nɪ́ tòːkʼòʔòːsɬít pɪ̀jɑ̀ːkɪ̀]) is a city in, and the county seat of, Coconino County in northern Arizona, in the southwestern United States. In 2018, the city's estimated population was 73,964. Flagstaff's combined metropolitan area has an estimated population of 139,097. Flagstaff lies near the southwestern edge of the Colorado Plateau and within the San Francisco volcanic field, along the western side of the largest contiguous ponderosa pine forest in the continental United States. The city sits at around 7,000 feet (2,100 m) and is next to Mount Elden, just south of the San Francisco Peaks, the highest mountain range in the state of Arizona. Humphreys Peak, the highest point in Arizona at 12,633 feet (3,851 m), is about 10 miles (16 km) north of Flagstaff in Kachina Peaks Wilderness. The geology of the Flagstaff area includes exposed rock from the Mesozoic and Paleozoic eras, with Moenkopi Formation red sandstone having once been quarried in the city; many of the historic downtown buildings were constructed with it. The Rio de Flag river runs through the city. Originally settled by the pre-Columbian native Sinagua people, the area of Flagstaff has fertile land from volcanic ash after eruptions in the 11th century. It was first settled as the present-day city in 1876. Local businessmen lobbied for Route 66 to pass through the city, which it did, turning the local industry from lumber to tourism and developing downtown Flagstaff. In 1930, Pluto was discovered from Flagstaff. The city developed further through to the end of the 1960s, with various observatories also used to choose Moon landing sites for the Apollo missions. Through the 1970s and '80s, downtown fell into disrepair, but was revitalized with a major cultural heritage project in the 1990s. The city remains an important distribution hub for companies such as Nestlé Purina PetCare, and is home to the U.S. Naval Observatory Flagstaff Station, the United States Geological Survey Flagstaff Station, and Northern Arizona University. Flagstaff has a strong tourism sector, due to its proximity to Grand Canyon National Park, Oak Creek Canyon, the Arizona Snowbowl, Meteor Crater, and Historic Route 66. #PSANSDISL #LWDISP either without gas or seeing cupidic arroz in "thank you" or "allta, wild" ... pps: a magnanimous decision ... I stand here on the brink of what appears to be total destruction; at least of everything I had hoped and dreamed for ... for the last decade in my life which appears literally to span thousands of years if not more in the eyes of some other beholder. I spent several months in Kentucky telling a story of a post apocalyptic and post-cataclysmic delusion; some world where I was walking around in a "fake plane" something like a holodeck built and constructed around me as I "took a walk around the world" to ... it did anything but ease my troubled mind. Recently a few weeks in Las Vegas, and a similar story; telling as I walked penniless down the streets filled with casino's and anachronistic taxi-cabs ... some kind of vision of the entirety of the heavens or the Earth or the "choir of angels" I think of when I echo the words Elohim and Aesir from mythology ... there with me in one small city in superposition; seeing what was a very well put together and interesting story about a "star port" Nirvane ... a place that could build cities into the face of mountains and half working monorails appearing in the sky---literally right before my eyes. I suppose this is the place "post cataclysm" though I still have trouble understanding what it is that's actually about ... in my mind it connects to the words "we are losing habeas" echo'ed from the streets of Los Angeles in a more clear and more military voice than usual--as I walked block by block trying to evade a series of events that would eventually somehow connect all the way to the "outskirts of Orlando, Florida" in a place called Alhambra. Apparently the name of a castle; though I wasn't aware of that until much later. It doesn't feel at all like a "cataclysm" to me; I see no great rift--only a world filled with silent liars, people who collectively believe themselves to have stolen something--something gigantic--at least that's the best interpretation of the throws and impetus behind the thing that I and mythology together call Jormungandr. With an eye for "mythological connections" you could clearly see that name of the Great Serpent of Revelation connects to something like the Unseelie; the faeries of Gaelic lore. To me though this world seems still somewhat fluid, it's my entire life--moving from Plantation to a place where the whole of it might be Bethlehem and to "clear my throat" it's not hard to see here how that land of "coughs" connects to the Biblical land of Nod and to the "Adamically sieved" Snifleheim ... from just a little twist on the ancient Norse land most probably as close to Hel as anyone ever gets--or so I dream and hope---still today. It all looks so real and so fake at the same time; planned for thousands of generations, the culmination of some grand masterpiece story that certainly ties history and myth and reality into a twisted heap of "one big nothing, one big nothing at all." I've tried to convey to the world how important I believe this place and this time to be--not by some choice of my own ... but through an understanding of the import of our history and the impact of having it be so obviously tuned and geared towards this specific time ... many thousands of years literally all focused on a single moment, on one day or one hour or even just a few years where all of that gets thrown down on the table as if some trump card has been played--and whether or not you fathom the same magnanimous statement or situation or position ... to me, I think it depends on whether or not you grew up in the same kind of way, believing our history to be so fixed and so difficult to change. I don't particularly feel like that's the "zeitgeist" of today; I feel like the children believe it to be some kind of game, and that it is such as easy thing to "sed" away or switch and turn into something else--another story, another purpose ... anyone's personal fantasy land come true. I don't think that's the case at all, it's clearly a personal nightmare; and it's clearly one we've seen time and time again--though not myself--the Jesus Christ that is the same yesterday, today; and once again perhaps echoing "no tomorrow" never remembers or believes that we've "seen it all before" or that we've ever really gotten the point; the thing you present to me as "factual reality" is a sickness, it disgusts me; and I'd do anything to go back to the world "where I was so young, and so innocent" and so filled with starry-eyed hope that we were at the foot of something grand and amazing that would become an empire turned republic of the heavens; filling the stars ... with the kind of love for kindness and fairness that I once associated very strongly with the thing I still believe to be the American Spirit. "Suddenly it changes, violently it changes" ... another song echoes through the ages--like the "words of the prophets dancing ((as light)) through the air" ... and I no longer even have a glimmer of hope that the thing I called the American People still exist; I feel we've been replaced by some broken container of minds, that the sky itself has become corrupt to the point that there's no hope of turning around this thing that I once believed with all my heart and all my mind was so obviously a "designed downward spiral" one that was---again--so obviously something of a joke, intended to be easy to bounce off a false bottom and springboard beyond "escape velocity" and beyond the dark waters of "nearest habitable star systems (being so very far away)" into a place where new words and new ideas would "soar" and "take flight." Here though; I am filled with a kind of lonely sadness ... staring at what appears to be the same mistake(s) happening over and over again; something I've come to call "skipping stones in the pond of reality" and really do liken it to this thing that appears to be the new meaning of "days" and ... a civilization that spends absolutely no love or lust to enter a once sacred and holy place and tarnish it with their sick beliefs and their disgusting desires. You all ... you appear to be some kind of springboard to "bunt" forth yet another age or era of nothingness into the space between this planet and "none worth reaching" and thank God, out of grasp. Today, I'd condemn the entirety of this world simply for it's lack of "oathkeepers" and understanding of what the once hallowed words of Hippocrates meant to ... to the people charged and dharmically required to heal rather than harm. It appears the place and time that was once ... at least destined to be the beginning of Heaven ... has become a "recurring stump" of some future unplanned and tarnished by many previous failed efforts and attempts to overcome this same "lack of conversation or care" for what it meant to be "humane" in a world where that was clearly set high aloft and above "humanity" in the place where they--where we were the best nature had to offer, the sanest, the kindest; the shining last best hope. Today I write almost every day ... secretly thanking "my God" for the disappearance of my tears and the still small but bright hope that "Tearran" will one day connect the Boston Tea Party and the idea that "render to Caesar" and Robin of Loxley ... all have something to do with a re-ordering of society and the worth and import of "money" ... to a place that cares more for freedom from murder than it does ... "freedom from having to allow others to hear me speak." I hold back tears and emotions; not by conscious choice or ability but ... still with that strange kind of lucky awkward smile; and secretly not so far below the surface it's the hope of "a swift death" that ... that really scares me more than the automatons and mechanical responses I see in the faces of many drivers as they pass me on the street--the imagery of connecting it to the serpentine monster of the movie Beetlejuice ... something I just "assume" the world understands and ... doesn't seem to fear (either); as if Churchill had gotten it all wrong and backwards--the only thing you have to fear, is the loss of fear of "loss." Here my crossroads---halfway between the city my son lives in and the city my parents live in--it's on making a decision on whether I should continue at all, or personally work on some kind of software project I've been writing about, or whether I should focus on writing about a "revolution" in government and society that clearly is ... "somewhat underway." In my mind it's obvious these things are all connected; that the software and the governance and the care of whether or not "Babylon" is remembered as a city of great laws and great change or a city of demons and depravity ... that these thi]ngs all hinge and congeal around a change in your hearts; hoping you will chose to be the beginning of a renaissance of "society and civilization" rather than the kings and queens of a sick virtual anarchy ... believing yourselves to have stolen "a throne of God" rather than to literally be the devastating and demoralizing depreciation of "lords and fiefdoms" to something more closely resembled by the time of the Four Horsemen depicted in Highlander. These words intended to be a "forward" to yet another compliment of a ((nother installment of a partial)) chain of emails; whimsically once half-joking ... I called it the Great Chain of Revelation. The software too; part of the great chain, this "idea" that the blockchain revolution will eventually create a distributed and equal governance structure, and a rekindling of monetary value focused on "free and open collaboration" rather than "survival of the most unfit"--something society and civilization seem to have turned the "call of life" from and to ... literally just in the last few years as we were so very close to ... reaching beyond the Heaven(s). I don't think its hard to imagine how a "new set of ground rules" could significantly change the "face of a place" -- make it something shiny and new or even on the other side of the coin, decayed or depraved. It's not hard to connect the kind of change I'm hoping for with "collision protection" and "automatic laws" to the (perhaps new, perhaps ... ancient) Norse creation story of the brothers of Odin: Vili and Ve. It might be hard to see today how a new "kind of spiritual interaction" might be only a few "mouse clicks" away though--how it could change everything literally in a flash of overnight sensation ... or how it might take something like a literal flash of stardom (or ... on the other hand, something like totalitarian or authoritarian "iron fisting") to make a change like this "ubiquitious" or ... something like the (imagined in my mind as ... messianic) "ED" of storming through the cosmos or the heavens and turning something that might appear to be "free and perfect feeling" today into a universe "civlized overnight" and then ... I wonder how long it would take to laud a change like that; for it to be something of a voluntary "reunderstanding" of a process ... to change the meaning of every word or every thought that connects to the process of "civilization" to recognize that something so great and so powerful has happened as to literally change the meaning of the word, to turn a process of civilization into something that had a ... "signta-lamcla☮" of forboding and then a magical staff struck into the heart of a sea and then ... and then the word itself literally changes to introduce a new "mid term" or "halfway point" in which a great singularity or enlightenment or change in perspective or understanding sort of acknowledges ... that some "clear outside" force not only intervened on the behalf of the future and the people of our world but that it was uniquely involved in the whole of-- "waking up" tio a nu def of #Neopoliteran. ^Like the previous notation; the below text comes from an email previously sent; and while i stand behind things like my sanity, my words; and my continued and faithful attempt to speak and convey both a useful and helpful truth to the world---sometimes just a single day can make all the difference in the world. Sometimes it's just a single moment; a flash or a comment about ^th@ blink of an eye" ... and I've literally just "thought up/had/experienced/transitioned thru" that exact moment. The lies standing between "communication" and either "cooperation" or .... some other kind of action have become more defined. More obvious. Because of this clarification; like a kind of "ins^tant* gnosis" ... search high and lo ... the depths all the way to above the heavens ... for a festive divorce ceremonial ritual ... that looks something like a bachelor party ':;] — @amrs@koyu.SPACe ... @suzq@rettiwtkcuf.social (@yitsheyzeus) May 22, 2020 I ... TERON; Gjall are painting me into a corner here; and I don't see around it anymore--I don't see the light, and I don't see the point. I was a happy-go-lucky little kid in my mind; that's not "what I wanted to be" or what I wanted to present, it's who I was. I saw "Ashkenazi" and ... know I am one of those ... and I kind of understood that something horrible might have happened, or might happen here--and I kind of understand that crying smashing feeling of "to ash" that echoes through the ages in the potpourri songs about pockets full of Parker Posey .. and ancient Psalms about "from the ashes of Edom" we have come--and from that you can see the cyclical sickness of this ... place so sure it's "East of Eden" and yet gung-ho on barrelling down the same old path towards ash and towards Edom and towards ... more of Dave's "ashes to ashes dust to dust" and his "smoke clouds roll and symphony of death..." and few words of solace in a song called Recently that I imagine was fleeting and has recently come and gone--people stare, I can't ignore the sick I see. I can't ignore his "... and tomorrow back to being friends" and all but wonder who among us doesn't realize it's "ash" and "gone" and "no memory of today" that's the night between now and ... a "tomorrow with friends" not just for me--but for all of you--for this place that snickers and pantomimes some kind of ... anything but "I'm not done yet" and "there's more ... vendetta ... and retribution to be had, Adam ... please come back in a few more of our faux-days." This is sickness; and happy-go-lucky Himodaveroshalayim really doesn't do much but complain about that word, the "sickle" and the tragic unavoidable ... ash of it all ... these days--you'd think we could "pull out" of this mess, turn another way; smile another day, but it seems there's only one way to get to that avenu in the mind of ... "he who must not know or be me." I have to admit I found some joy in the epiphany that the hidden city of Zion and it's fusion with the Namayim' version of how that "Ha" gels and jives with the name Abraham and the Manna from Heaven and the bath salt and the tina and the "am in e" of amphetamine--maybe a glimmer or a shimmer or a glow of hope at the moment "Nazion" clicked ... and I said ... "no, not me ... I'm nothing like a king, no dreams of authoritarianism at all in the heart of Kish@r;" even as I wrote words that in the spirit of the moment were something of a "tis of a'we" that connected to my country and the first sing-songy "tisME" that I linked to trying to talk in the rhyming spirit of some "first Christ" that probably just like me was one limmerick away from the end of the rainbow and one "Four Non Blondes" song away from tying "or whatever that means" and this land crowned with "brotherhood" (to some personal "of the Bell, and of the bell towers so tall and Crestian") to just one Hopp skip and jump away from the heart of the obvious echoes of a bridge between haiku and Heroku... a few more gears shift into place, a click and and a mechanical turn of the face of the clock's ku-ku striking ... it was the word "Earthene" that was the last "Jesusism" around the post Cimmerian time linking Dionysus and Seuss to that same "su-s" that's belonging to a moment in the city of Uranus--codified and etched in stone as "MCO"--not just for its saucer and warp nacelles and "deflector dish" but for it's underground caverns and it's above ground "Space Mountain" and that great golf ball in the heart of it all. The gears of time and the dawns of civilizequey.org query the missing "here" in our true understanding of what "in the beginning, to hear; to here ... to rue the loss of the Maize from Monoceros to the VEGA system and the tri-galactic origin of ... "some imaginary universal ... Earthene pax" to have dropped the ball and lost it all somewhere between "Avenu Malkaynu" and melaleuca trees--or Yggrasil and Snifleheim--or simply to miss the point and "rue brickell" because of bricks rather than having any kind of love or nostalgia linking to a once cobblestone roadway to the city in the Emerald skies paved in golden "do not return" signs ... to have lost Avenues well after not realizing it was "Heaven'es that were long gone far before I stepped foot on this road once called too Holy for sandals" in a place where that Promised Land and this place of "K'nanites" just loses it's grip on reality when it comes to mentioning the possibility that the original source and story of Ca'anan was literally designed to rid the world of ... "bad nanites" and the mentality of ... vindictiveness that I see behind every smirk. The final hundred nanoseconds on our clock towards doom and gloom cause another bird to fly; another snake to curl up and listen again to the songs designed to charm it into oblivion; whether that's about a club in South Beach or a place not so far from our new "here..." all remains to be seen in my innocent eyes wondering what it truly is that stands between what you are ... and finding "forgiveness not needed--innocent child writes to the mass" ... and the long arm of the minute hand and the short finger of the hour for one brief moment reconcile and move towards "midnight" together; and it's simply idyllic, the Nazarene corner between nil and null you've relegated the history of Terran poast futures into ... "foreves mas" or so they (or you) think. I'm still so far from "Five Finger Death Punch" though; and so far from Rammstein and so far from any kind of sick events that could stand between me and "the eternal" and change my still "casual alternative rock" loving heart to something more death metal; I rue whatever lies between me and there being any kind of Heaven that thinks there could exist a "righteous side" of Hell and it... simultaneously. I still see light here in admonishing the masses and the angels standing against the story and the message God brings us in our history. I still see sparks in siding with the "causticness" of "no holodecks in sight" and the hunger and the pain of simulating ... "the hells of reality" over the story of decades or centuries of silence refusing to see "holography" and "simulated" in the word Holocaust and the horrors of this place that simply doesn't seem to fathom or understand the moments of hunger pangs and the fear of "dark Earth pits" or towers of "it's not Nintendo-DS" linking the Man in the High Castle to an Iron Mask. I rally against being what I clearly am raised high on some pedestal by some force beyond my comprehension and probably beyond that of the "perfect storm in time" that refuses to itself acknowledge what it means to gaze at such an unfathomable loss of innocence at the cost of a "happy and serene future" or even at the glimmer of the Never-Never-Land I'd hoped we would all cherish and love and share ... the games and the newfound freedom that comes not just from "seeing Holodeck" turn into "no bullets" and "no cages" but into a world that grows and flourishes into something that's so far beyond my capability to understand that I'm stuck here; dumbfounded; staring at you refusing to stop car accidents and school shootings ... because "pedestal." For the "fire and the glory" of some night you refuse to see is this one--this place where morality rekindles from ... from what appears tobe one small candle, but truly--if it's not in your heart, and it's not coming from some great force of goodness--fear today and a world of "forever what else may come." Here in a place the Bible calls Penuel at the crossing of a River Jordan ... the Angel of the Lord notes the parallels in time and space between the Potomac and the Rhine--stories of superposition and cities and nation-states that are nothing more than a history of a history of things like the Monoceros "arroz" linking not just to the constellation Orion but to Sagittarius and to Cupid and of course to the Hunter you know so well-- Searching for a Saturday; a sabbath to be made Holy once more ... "at the Rubycon" The Einstein-Rosen Wormhole and the Marshall-Bush-JFKjr Tunnel The waters are called narah, (for) the waters are, indeed, the offspring of Nara; as they were his first residence (ayana), he thence is named Narayana. — Chapter 1, Verse 10[3] In a semi-fit of shameless arexua-self recognition i'm going to mention Amazon's new series "Upload" and connect it to the PKD work that my Martian-in-simulcrum-ciricculum-vitae on "colonization education" ... tying together Transcendance, Total Recall and ... well; to be honest it actually gave me another "uptick" in the upbeat ... maybe i'll stick around until I'm sure there's at least one more copy of me in the ivrtual-invverse ... oh, that reminds me ... Farmer)'s Lord of Opium also touches on this same "mind of God in the computer" subject (which of course leads to Ghost in the Shell and Lucy--thanks Scarlette :). While I'm listing Matrix-intersected pieces of the puzzle to No Jack City, Elon Musk's neuralace and Anderson's Feed are also worth a mention. Also the first link in this paragraph is titled ... "the city of the name of time never spoken after time woke up and stfu'd" (which of course is the primary subject of this ... update to the city Aerosol). The ... "actual original typed dream" included a sort of "roller coaster ride" through space all the way to Mars; where the real purpose of "the thing" I am calling the "Mars Hall" was to display previous victories and failures ... and the introduction of "older or future" culture's suggestions for "the right way" to colonize a new habitat. If it were Epcot Center, this would be something like SpaceMountain taking you to to the foture of "Epcot Countries" as if moving from "countries" to planets were as easy as simply ... "reading backwards." THE SOFTWARE, SINGERS, AND SHIELD(S) OF HEIROSOLYMITHONEYY Thinking just a little bit ahead of myself, but I'm on "Unreal Object/Map Editor within the VR Server" and calling it something like "faux-wet-ware" ... which then of course leads to a similar onomonopeia of "weapons and ..." where-with-all to find a better singer's name to connect the road of "sword" to a Wo'riordan ... but I think that fusion of warrior and woman probably does actually say ... enough of it all; on this road to the living Bright Water that the diety in my son's middle name defines well here, as "waking up," stretching it's tributaries and it's winding wonders and wistfully .... Narayana (Sanskrit: नारायण, IAST: Nārāyaṇa) is known as one who is in yogic slumber on the celestial waters, referring to Lord Maha Vishnu. He is also known as the "Purusha" and is considered the Supreme being in Vaishnavism. andromedic; the ports of call ... to the mediterranean (literally) from the gulf coast; ... ho engages in the creation of 14 worlds within the universe as Brahma when he deliberately accepts rajas guna, himself sustains, maintains and preserves the universe as Vishnu by accepting sattva guna. Narayana himself annihilates the universe at the end of maha-kalp ... . there's no place like home. there's no place like home. there's no place like home. and so it begins ... "f: r e l i g i o n find out what it means to me. faucet, ever single one, stream of purity ... from Fort Myers ... f ... flicks ... Flint. " ^this notation will from this email forward in linear time denote some form of contact method or information related to the context of the message you are reading. This particular one sends me an encrypted email. 5if there is an "@" symbol involved in the "anchor's hypertext reference" (technically an "a href=" in HTML4) your browser should attempt to open an email client to send a message over an anonymous SMTP relay. Understand that "anonymous" in this case may or may not mean your sending email address is hidden or obvuscated--so if you want to receive a reply you must include it in the DATA of your SMTP transmission defined by the RFC5321 attached. In most cases "anonymous" also means that you will not have the recipients direct contact information unless they have made it public---additionally the exact server/system/relay used may or may not be the "Sbroken Berkman Perl Script" linked to in the "hypertext reference" specifically anchored to the words "an anonymous SMTP relay" above. A simple "hat character" (^) and the letter "t" as you see beginning the above paragraph will denote a contact method or form that works over the internet using an HTTP protocol defined in a series of RFC's including (but not limited to) RFC's numbered as 2616, 7230, 7235, 2068 and use a simple language which is based on a definition suggested or proposed currently by an organization called the "W3C Consortium" ---and ... previously set and defined by an organiza^tion located at html.spec.whatwg.org; which appears (to me, for the first time as I write these words) to follow the conceptual spirit of the "living document" defined by the several "Continental Congresses, et alia." I personally now conjoin this document in my head to a procession of patrilineal or matrilnear predecessors to the actual event .... still to be defined ... but related to this specific email, this mailing list; its contributors and readers as well as actual members of the organization (still to be created, defined, or named) that creates a "round table*" of members that is open to the public, to all voters educated enough to understand the specific issue being voted on (up to a standard that; in this place and time appears to be unset and unmet but materially related to reawching the age of 18 years old; growing up in or being born in the United States of America (related spec.* to the Constitution of the United States of America which is officially "self-defined" through a process which includes all three branches of the government which it also "self-defines" and purports to be "of, for, and by the people"--though the general population is only able to contribute through an indirect process (read:the people cannot directly contribute to the constitution without either running for office (like a senator) or being appointed to a specific government position (like a judge or executive branch public servant). The current state of American representative democracy is the highest standard to which I am currently knowledgable of "extant*"--and it is specifically substandard, inferior, and "just not good enough" as a comparison to the process required to vote in the organization being "self-defined" through this process. It is my sincere and clear hope that "this process" will result in a legal and moral amendment to the document shown in the previous link and presented by the Legislative Branch of the United States here. It is my current and faithful belief that anything else would also be significantly below the standards morally required by "this process" which of course includes over 200 years of American citizenship and (other international relations; i.e., e.g, for "iv" example, id est, exemplia gratia) as well as the Sons of Liberty and prior to that contributions from the Crown and the "Parliament and Crown" of the United Kingdom; among others et alea's ifndef: 'swikipedia/et_al.. To note specifically because of lack of personal knowledge and public notoriety (assuming all other requiremnant* achem requirements) alas, babylon. i listened to a man yesterday who was talking about "true heroes" ... he of course noted jesus christ and superman together, suggesting the first was one, and the second just a fiction. he also talked about people like ghandi and "leaders who use non-violent means to "change the world." i at least agree with him on the third, ghandi is a good prototype for some kind of hero. staring at this ... "to be completed" work on tales of two cities, whether from sodom and gomorrah all the way to athens and sparta and perhaps even london and paris--and this particular city, babylon; it stands out as one which truly has no equal or even "mirror" in the history of the world. i suppose i'd add "alexandria" and suggest the library and the laws; something that are fundamental to the ethos of the planet i call "athens." i imagine he did not know "hammurabi's" name; and even today in this place where i ask and do not receive answers; i imagine you still don't connect muhammad or amsterdam ... to this king who in our history is set apart and lifted high on a pedestal of having "codified and written down" laws ... for the very first time. it's almost comical, it took me a paragraph and a sentence to connect "the king and i" to this mirror world, where the bible and the people have most assuredly decided "babylon" is a negative thing or a depraved place. "fallen, fallen, is [the city of] babylon the great" ... just a quote from one of my favorite movies; which of course is re-quoting "dante" and/or "the bible" "a dwelling place [of] (the) demons (say), it has become." www.icann.org/news/blog/the-problem-with-the-seven-keys kauri on IPFS: has-abaslom-and-the-ethos-of-arcadia

      12:3 Those who are wi se[a] will shine like the brightness of the heavens, and those who lead many to righteousness, like the stars for ever and ever.

      you are offline

      we the people rise again

      safe souls, safe fu


      We the People of Slate ...

      The U.S. Constitution, as you [mighta been, shoulda "come" on ... its somedayrewrϕte it.

      "Politicians talk about the Constitution as if it were as sacrosanct as the Ten Commandments [interjection: spec. it is actually almost exactly related!]. But the document itself invites change and revision. What if the president served only one six-year term instead two four-year terms? What if your state's population determined how many senators represent it? What if the Constitution included a right to health care? We asked legal scholars and Slate readers to cross out what they didn't like in the Constitution and pencil in their hearts' desires. Here's what the document would look like with their best ideas."

      多也了了夕 "with a ~~wand~~ of scheffilara, 并#亦太 he begins ... "I am now on the Staff of Menelaus, the Spears of Longinus and Lancelot; and the name "Mosche ex Nashon."

      Logically the recent mentions of Gilgamesh and the simultaneous 同時 overlaping 場道 of the eventual link between the famous ruling of Solomon on the separation of babies and mothers and waters and land ... to a story of many "two cities" that culminates in a cultural or societal or "evolutionary" link to Sodom and Gomorrah and the city-state of Babylon (and it's Hanging Gardens) and also of course to Paris and Troy and "Masstodon" and city-states [ciudadestado] and perhaps planet-cities; from Cambridge to Cambridge across the "Cable" to see state to "London" ... recently I called it "the city of realms" ... I started out logically intending to link "game theory" and John Nash to the mathematical story of Sputnik and a revival of American physics; but in my usual way of rambling into the woods [I mean neighborhood] of stream of consciousness ... turned into a premonitory discourse of "two cities" and how sometimes even things as obvious as the number of letters in the word "two" don't do a good enough job of conveying ... how and/or why one is simply never enough, and two isn't much better--but in the end a circle ... is drawn; the perfect circle in our imaginary mathematical perfection ... I see a parted "line" in the letter pronounced "tea" (and beginning that word); and two "vee" (pron. of "v") symbols joined together in a word we pronounce as "double-you" ... and symbolically because I know "V" is the Roman Numeral for 5 (five) and I know not how to multiply in Roman numerals--

      It's important to pause; here. I am going to write a more detailed piece on "the two cities" as I work through this maze like crossroads between "them" and "demo..." ... here demorigstrably I am trying to fuse together an evolutionary change in ... lit. biological evolution as well as an echelon leap forward in "self-government" ... in a place where these two things are unfathomable and unspokenly* connected.

      To a question on the idiom; is Bablyon about "the law" or "of the land of Nod?"

      "What is democracy" ... the song, Metallica's "ONE" echoes and repeats; as we apparently scrive together the word "THEM" ... I question myself ... if Babylon were the capital city of some mythical Nation of Time ... if it were the central "turning point" of Sheol; ... >|<

      Can you not see that in this place; in a world that should see and does there is a gigantic message proving that we are not in reality and trying to show us how and why that's the best news since ... ever---that it's as simple as conjoining "the law of the land" with a basic set of rules that automatically turn Hell into something so much closer to Heaven I just do not understand---why we cant stand up together and say "bullets will not kill innocent children" and "snowflakes will not start avalanches ...." that cover or bury or hide the road from Earth to Verital)e .... or from the mythical Valis to Tanis---or from Rigel to Beth-El ... "guess?"

      ## as "an easy" answer; I'm looking for a fusion of "law and land" that somehow remembers a "jok'er a scene" about "lawn" seats; and "where the girls are green;"

      It's as simple as night and day; Heaven and Hell ... the difference between survival and--what we are presented with here; it's "doing this right"--that ends the Hell of representative democracy and electoral college--the blindness and darkness of not seeing "EXTINCTION LEVEL EVENT" encoded in these words and in our governments foundation ... *by the framers [not just of the USA; but English .. and every language] *

      ... is literally just as simple as "not caring" or thinking we are at the beginning of some long process--or thinking it will never be done--that special "IT" that's the emancipation of you and I.

      Here words like "gnosis" and "gaudeamus" pair with my/ur "new ntersanding*" of the difference between Asgard and Medgard and really understanding our purpose here is to end "evil" ... things like "simulating disease and pain" (here, simulating meaning ... intentionally causing, rather than "gamifying away") and successfully linking the "Pillars of Hercules" to Plato's vision of Atlantis and the letter sequences "an" and "as" ... unlock a fusion of religion and mythology and "cryptographic truth" that connects "messianic" and "Christian" to "Roman" ... "Chinese" and "American" ... literally the key to the difference between the phrases "we are" and "we were" ....

      in "sight" of "silicon" in simulation and Israel, Genesis, and "silence" ... trying to the raising of Asgardian enlightenment ... and seeing "simple cypher" connecting to "Norse" ...

      and the "I AM THAT" surer than shit ... the intention and design of all religion and creation is to end "simulated reality" and also not seeing "SR" ... in Israel and Norse ... "for instance."

      It's a simple linguistic concept; the "singularity" and the "plurality" of a simple word--"to be"--but it goes to the heart of everything that we are and everything that is around us. This is a message about understanding and preserving individuality as well as liberty; and literally seeing "ARXIV" and understanding "often" and failing to connect God and prescience to "IV" and the Fourth Amendment ... it's about blindness and ... "curing the blind instantly" ... and fathoming how and why this message has been etched into our entire history and and all religions and myths and music--to help us "to be THAT we" that actually "are responsible" for the end of Hell.

      • I neglected to mention "Har-Wer" and "Tower of Babel" which are both related lingusitically, religiously and topically: "to who ..." and while we're on "four score and [seven years from now]" seeing the fourth "living thing" in Eden and it's (the name, Abel) connection to Babel and Abraham Lincoln; slavery and ... understanding we live in a place where the history of the United States also, like Monoceros and "Neil Armstrong's first step" are a time shifted ... overlayed map to achieving freedom ... it's about becoming a father-race ... and actually "doing" the technological steps required to "emancipate the e's of 'me&e'" and survive in exo-planetary space---

      it might be as simple as adding "because we did this" here and now; and having it be something we are truly proud of .... forevermore™ ... for certain in the heart of this story about cyclicality and repetition of error--its not because we did "this" or something over and over again; it's about changing "the problem" and then helping others to also overcome ... "things like time travel ... erasing speech" --- however that happenecl.

      • I also failed to mention that "I am in Hell" ... as in this world is hellacious to me; in an overlay with the Hellenic period and this message that we are in the Trojan Horse ... a small gem .... "planet" truly is the Ark of the Covenant---and it's the simple understanding that "reality is hell" is to "living without air conditioning and plumbing is hell" just as soon as you achieve ... "rediscovering" those things---

      • I can't figure out why I am the only person screaming "this is Hell." That's also, Hell.

      ... but recently suggested an old joke about "there being 10 kinds of people in the world (obv an anti-tautology and a tautology simultaneously)" only after that brief bit of singularity and duality mentioning the rest of the joke: "those that understand binary and those that don't know how to base convert between counting with two hands and counting with only an 'on and off.'" It's not obvious if you aren't trying to figure it out, I suppose; but 10 is decimal notation for "kiss" and the "often" without "of" ... and binary notation for the decimal equivalent of "2." A long long time ago in a state that simply non-randomly ties to the heart of the name of our galaxy ... I was again thinking of the "perfect imperfections" of things like saying "three equals one equals one" (which, of course was related to the Holy Trinity and it's "prescient/anachronistic Adamic presence encoded in the name Ab|ra|ha|m" which means "father of a great multitude") ... I brought that one back in the last few months; connecting the letter K and in this "logos-rythmic" tie to the "base of a number system" embellish the truth just a bit and suggest a more accurate rendition of the original [there is no such thing as equality, "is" of separate objects--as in no two snowflakes are the same unless they are literally the same one; true of ancient weights and with the advent of (thinking about) time no two "planets" are the same even if they're the exact same one--unless it's at a fixed moment in time.

      K=3:11 ... to a handle on the music, the DHD of the gate and the *ring of David's "sling" ...

      ---and that's a relationship of "3 is to 11" as [the SAT style "analogy)]y" as a series of alpha, two mathematic, and two numeric symbols ... may only tie in my mind alone to the books of Genesis and Matthew and the phrase "chapter and verse" and to the stories of Lot and Job ... again in Genesis and the eponymous "Book of Job." So ... "tying up loose ends one 10b [III] iv. " as it appears I've taken it upon myself to call a Job and suggest is my "Lot in life [x]i* [3]"

      • I worry sometimes that important things are missing, or will disappear---for instance Mirriam Webster, which is a "canonical/standard dictionary) should probably have an entry for "lot in life" non-idiomatically as "granny apples to sour apples" as

      2 MANY ALSO ICI; 1two ... following in Mitnick's bold introductory word steps; the curve and the complement ... the missiles and the canoes; the line and the blank space ... "supposedly two examples of two kinds, which could be three not nothings ... Today I write about something monumental; as if as important as the singularity depicted in Arthur C. Clarke's 2001 "A Space Odyssey" ... and remember a day when I thought it very novel and interesting to see the words "stillborn and yet still born" connected in a single piece of writing to "Stillwater and yet still water" ... today adding in another phrase noting the change wrought only by one magical single "space" (also a single capital letter; and a third phrase): "block chains with a great blockchain."

      • https://en.wikipedia.org/wiki/EuripidesIphigenia in Aulis or Iphigenia at Aulis[1] (Ancient Greek: Ἰφιγένεια ἐν Αὐλίδι, Iphigeneia en Aulidi; variously translated, including the Latin Iphigenia in Aulide) is the last of the extant works by the playwright Euripides. Written between 408, after Orestes, and 406 BC, the year of Euripides' death, the play was first produced the following year[2] in a trilogy with The Bacchae and Alcmaeon in Corinth by his son or nephew, Euripides the Younger,[3] and won first place at the City Dionysia in Athens.

      • The play revolves around Agamemnon, the leader of the Greek coalition before and during the Trojan War, and his decision to sacrifice his daughter, Iphigenia, to appease the goddess Artemis and allow his troops to set sail to preserve their honour in battle against Troy. The conflict between Agamemnon and Achilles over the fate of the young woman presages a similar conflict between the two at the beginning of the Iliad. In his depiction of the experiences of the main characters, Euripides frequently uses tragic irony for dramatic effect.

      J.K. Rowling spurred just this past week a series of explanations about just exactly what is a blockchain coin worth ... and why is it so; her final words on the subject (artistic liberty taken, obviously not the last she'll say of this magic moment) "I don't think I trust this."

      Taken directly from an off the cuff email to ARXM titled: "Slow the S is ... our Hypothes.is"

      I imagine I'll be adding some wiki/ipfs stuff to it--and try to keep it compatible; the design and layout is almost exactly what I was dreaming about seeing--as a "first rough draft product." Lo, and behold. It's been added to the many places I host my tome; the small compilation of nearly every important email that has gone out ... all the way back to the days of the strange looking Margarita glass ... that now very much resembles the "Cantonese character 'le'" which I've come to associate with a "handle" on multiple corners of a room--something like an automatic coat rack conveyor belt connecting different versions of "what's in the box." I'm planning on using that symbol 了 to denote something like multiple forks of the same page. Obviously I'm thinking forward to things like "the Transhumaist Chain Party" (BDSM, right?)'s version of some particular piece of legislation, let's say everything starts with the sprawling "bulbing" of "Amendment M" ideas and specific verbiage ... and then we'll of course need some kind of new git/subversion/cvs style version control mechanism to merge intelligently into something that might actually .... really should ... make it into that place in history--the first constitutional amendment ratified by a "Continental Congress of All People" ... but you could also see it as an ongoing sort of forking of something like the "wikipedia page" on what some specific term, say "technocracy" means, and how two parties might propagandize and change the meaning of such thing; to suit the more intelligent and wise times we now live in. For instance, we might once have had a "democracy" and a "democractic" party that had some Anarchist Cook Book version of the history of it ending in something like Snipes and Stallone's "DEMOLITION MAN."

      Just kidding, we all know "democracy" has everything to do with "d is cl ... and not th" ... to be the them that is the heart of the start of the first true democracy. At least the first one I've ever seen, in my old "to a republic" ... style. As it is you can play around with commenting and highlighting and annotating all the stuff I've written and begged and begged for comments on--while I work on layering the backend to to perma-store our ideas and comments on both a blockchain (probably a new one; now that i've worked a little with ethereum) with maybe some key-merkle-tree-walk-search stuff etched into the original Rinkeby ... and then of course distributed data in the "public owned and operated" IPFS. To be clear, I plan on rewriting the backend storage so that we will have a permanent record of all comments; all versions of whatever is being commented on; and changes/revisions to those documents--sort of turning the web into a massive instant "place of collaboration, discussion, and co-authoring" ... if you use the wonderful LEGO pieces that have been handed to us in ideas from places like me, lemma--dissenter, and of course hypothes.is who has brought you and i such a polished and nice to look at "first draft" of something like the living Constitution come repository of all human knowledge. I do sort of secretly wich they would have called this project something like "annotating and reflecting (or real or ...) knowledge" just so the movement could have been called ARK. ... or something .... but whatever join the "calling you a reporter" group or ... "supposedly a scientist?"

      NOIR INgR .. I CITE SITE OF ENUDRICAM; a rekindling of the dream of a city appearing high above in the sky, now with a boldly emblazened smiling rainbow and upsidown river ... specifically the antithesis of "angel falls," there's a lagoon too--actually a chain of several ponds underneith the floating rock ... and in some versions of this waking dream there are rings around the thing; you might imagine an artificial set of centripetal orbitals something like a fusion of the ring Eslyeum and the "Six-Axis ride" of the JKF Center's "Spacecamp." I write as I dream, and though I cannot for certain explain exactly how; it's become a strong part of my mythology that this spectacular rendition of "what ends the silence" has something to do with the magical delivery of "a book" ... something not of this Earth but an unnatural thing; one I've dreamt of creating many times. This book is something like the DSM-IV and something like a Merck diagnostic manual; but rather than the old antiquated cures of "the Norse Medgard" this spectacle nearly "itsimportant" autoprints itself and lands on something like every doorpost; what it is is a list of reasons why "simply curing all disease" with no explanation and no conversation would be a travesty of morality--how it would render us half-blind to the myriad of new solutions that can come from truly understanding why "ITIS" to me has become a kind of magical marker: an "it is special" as in, it's cure could possibly solve a number of other problems.

      Through that missing "o," English on the ball, we see a connection between a number of words that shine bright light including Exodus itself which means "let there be light," the word for Holy Fire and the Burning Bush.. .reversed to hSE'Ah, and a story about the Second Coming parting our holy waters.

      This answer connects the magical Rod's of Aaron in Exodus and the Iron Rod of Jesus Christ to the Sang Rael itself... in a fusion that explains how the Periodic Table element for Iron links not just to Total Recall and Mars, but also to this key

      my dream of what the first day of the Second Coming might be like; were the Rod of Christ... in the right hands. In a story that also spans the Bible, you might understand better how stone to bread and your input make all the difference in the world between Heaven and Adam's Hand. Once more, what do you think He ....

      Since the very earliest days of this story, I have asked for better for you, even than see

      Nearly all of the original parts of the original "post-origination dream" remain intact; there's a walkway that magically creates new paths and "attractions" based on where you walk, something like an inversion of the artificial intelligence term "a random walk down a binary tree" ... for instance going left might bring you to the Internet Cafetornaseum of the Earl of Sandwich; and going to the right might bring you to the ICIMAX/Auditorium of Science and Discovery--there's a walkway to "Magical GLAS D'elevators" that open a special "instantiation" of the Japan Room of the Potter and the Toolmaker ... complete with a special [second level and hidden staircase] Pool of Bethesdaibo verily delivering something like youth of mind and body ... or at least as close to such a thing as a sip of Holy Water or Ambrosia or a dip in the pool of Coccoon and Ponce De'Leon could instantly bring ... to those that have seen Jupiter Ascending ... the questions of "nature versus nurture" and what it means to be "old and wise" and "young at heart" truly mean---

      Somewhere between the outdoor rafting ride and the level with the special "ballroom of the ancient gallery" ... perhaps now being named or renamed or recalled as something about "Face [of] the Music" lies a magical "mini-maize" ... a look at a mock-up (or #isitit) of Merlink and Harthor's "round table" that displays a series of ... (at least to me) magical appearing holographic displays and controls that my dreams have stolen from Phillip K. Dick's Minority Report and something of what I hope Microsoft's Dynamics/Hololens/Surface will become---a series of short "focus groups" .... to guage and discuss the information in the "CITIES-D5AM-MERCK" ... how to end world hunger and nearly all disease with the press of a magical buzzer--castling churches to something like "political-party-town-hall-meeting centers" and replacing jails and prisons and hospitals with something like the "Hospitalier's PRIDE and DOJOY's I practiced "Kung-fun-dance" ... a fusion of something like a hotel and a school that probably looks very much like a university with classrooms and dorms and dining hall's all fit into a single building. I imagine a series of 2 or 3 "room changes" as in you walk from the one where you get the book and talk about it ... to the one where you talk about "what everyone else said about it" and maybe another one that actually connects you to other people with something like Facebook's Portal; the point of the whole thing to really quickly "rubber stamp" the need for an end to "bars in the sky" nonalcoholic connotation--as in "overcoming the phrase the sky is the limit" and showing us the need for a beacon of glowing hope fulfilled--probably actually the vision of a holographic marker turning into actual rings around the single moon of Earth, the focus of the song annoucing the dawn of the age of Aquarius---

      It might lead us also to Ceres; and another set of artificial rings, or to Monoceros and a rehystorical understanding of the birthplace and birthing of the "river roads" that bridge the "space gaps" in the galaxy from our "one giant leap for mankind" linking the Apollo moon landing to the mythological connection to the sun; and connecting how the astrological charts of the ancients might detail a special kind of overlapping--the link between Earth's SOL and something like Proxima or Alpha Centauri; and how that "monostar bridge" might overlap to Orion and from there through Sagitarius and the center of the Milky Way ... all the way to Andromeda and more dreams of being in a place where there's a map to a tri-galactic system in the constellation Cancer and a similar one in Leo ... and just incase you haven't noticed it--a special marker here, I thought to myself it might be cool to "make an acronymic tie to Monoceros" and without even thinking auto-wrote Orion (which was the obvious constellation next to Monoceros, in the charts) and then to Sagitarrius; which is the obvious ... heart of our astrological center and link to "other galaxies."

      ----I've dreamt or scriven or reguessed numerous times how the Milky Way's map to an "Atlas marked through time by the ages and the ancients" might tie this place and this actual map to the creation of the railways between stars to the beginning and the end of time and of course to this message that links it all to time travel. There's a few "guesses" I've contemplated; that perhaps the Milky Way chart is a metal-cosmic or microcosmic map to the dawn of time in the galactic vision of ... just after the big bang; or it might tie to a map of something like the unthinkable--a civilization that became so powerful it was able to reverse the entropy of "cosmic expansion" and reverse the thing Asimov wrote of in "The Last Question" as the end of life and the ability to survive basically due to "heat loss."

      "The Last Question." (And if you read two, why not "The Last Answer"?). Find these readings added to our collection, 1,000 Free Audio Books: Download Great Books for Free.

      Looking for free, professionally-read audio books from Audible.com, including ones written by Isaac Asimov?

      * all "asterisks" in the abovə document denote a sort of Adamic unspoken relationship between notations and meanings; here adding the "Latin word for three" and source of the phrase "t.i.d." (which is doctor/pharmacy latin for "three times a day") where the "t" there is an abbreviation of "ter" ... and suppose the link between K and 11 and 3 noting it's alphanumeric position in the English alphabet as the 11th letter and only linking cognitively to three via the conversion betweehex, and binarryy ... aberrative here is the overlapping "hakkasan" style (or ZHIV) lack of mention of the answer in "state of Kansas" and the "citystate of Slovakia" as described in the ICANN document linked [in] the related subsection or slice of the word "binarry" for the state of India. Tetris could be spelled with the addition of only a single letter [in] "tea"---the three letters "ris" are the hearts of the words "Christ" and "wrist" [and arguably of Osiris where you also see the round table character of the solar-system/sun glyph and the chemical element for The Fifth Element (as def. by i) via "Sinbad" and "Superman." The ERIS Free Network should also be mentioned here in connection with the IRC network I associate in the place between skipping stones and sacred hearts defined by "AOL" and "Kdice" in my life. In the lexicon of modern HTML, curly braces are generally relative to "classes" and "major object definitions (javascript/css)" while square brackets generally only take on computer-interpreted meaning in "Markdown" which is clearly (by definition, by this character set "[]") a superset (or at least definately not a subset) of HTML.

      Dr. Will Caster (Johnny Depp) is a scientist who researches the nature of sapience, including artificial intelligence. He and his team work to create a sentient computer; he predicts that such a computer will create a technological singularity, or in his words "Transcendence". His wife, Evelyn (played by Rebecca Hall), is also a scientist and helps him with his work.

      Following one of Will's presentations, an anti-technology terrorist group called "Revolutionary Independence From Technology" (R.I.F.T.) shoots Will with a polonium-laced bullet and carries out a series of synchronized attacks on A.I. laboratories across the country. Will is given no more than a month to live. In desperation, Evelyn comes up with a plan to upload Will's consciousness into the quantum computer that the project has developed. His best friend and fellow researcher, Max Waters (Paul Bettany), questions the wisdom of this choice, reasoning that the "uploaded"

      Just from my general understanding and memory "st" is not ... to me (specifically) an abbreviation of "state" but "ste" is a U.S. Postal code (also "as I understand it") for the name of a special room or set of rooms called a "suite" and in Adamic "connotation" I sometimes read it as "sweet" ... which has several meanings that range from "cool" to "a kind of taste sensation" to "easy to sway or fool."

      If you asked me though, for instance if "it" was an abbreviation or shorthand notation or acronym for either "a United state" or "saint" ... you'd be sure.

      While it's clear from studying linguistic cryptography ... (If I studied it a little here and some there, its also from the "universal translator of Star Trek") and the personal understanding that language is a kind of intelligent code, and "any code is crackable" ... that I caution here that "meaning" and "face value" often differ widely and wildly ... even in the same place or among the same group of people ... either varying over time or heritage.

      Menelaus, in Greek mythologyking of Sparta and younger son of Atreus, king of Mycenae; the abduction of his wife, Helen, led to the Trojan War. During the war Menelaus served under his elder brother Agamemnon, the commander in chief of the Greek forces. When Phrontis, one of his crewmen, was killed, Menelaus delayed his voyage until the man had been buried, thus giving evidence of his strength of character. After the fall of Troy, Menelaus recovered Helen and brought her home. Menelaus was a prominent figure in the Iliad and the Odyssey, where he was promised a place in Elysium after his death because he was married to a daughter of Zeus. The poet Stesichorus (flourished 6th century BCE) introduced a refinement to the story that was used by Euripides in his play Helen: it was a phantom that was taken to Troy, while the real Helen went to Egypt, from where she was rescued by Menelaus after he had been wrecked on his way home from Troy and the phantom Helen had disappeared.

      This article is about the ancient Greek city. For the town of ancient Crete, see Mycenae (Crete). For the hamlet in New York, see Mycenae, New York.

      Μυκῆναι, Μυκήνη

      Lions-Gate-Mycenae.jpg

      The Lion Gate at Mycenae, the only known monumental sculpture of Bronze Age Greece

      37°43′49"N 22°45′27"ECoordinates37°43′49"N 22°45′27"E

      This article contains special characters. Without proper rendering support, you may see question marks, boxes, or other symbols.

      Mycenae (Ancient Greek: Μυκῆναι or Μυκήνη, Mykēnē) is an archaeological site near Mykines in Argolis, north-eastern PeloponneseGreece. It is located about 120 kilometres (75 miles) south-west of Athens; 11 kilometres (7 miles) north of Argos; and 48 kilometres (30 miles) south of Corinth. The site is 19 kilometres (12 miles) inland from the Saronic Gulf and built upon a hill rising 900 feet (274 metres) above sea level.[2]

      In the second millennium BC, Mycenae was one of the major centres of Greek civilization, a military stronghold which dominated much of southern Greece, Crete, the Cyclades and parts of southwest Anatolia. The period of Greek history from about 1600 BC to about 1100 BC is called Mycenaean in reference to Mycenae. At its peak in 1350 BC, the citadel and lower town had a population of 30,000 and an area of 32 hectares.[3]

      3. Chew 2000, p. 220; Chapman 2005, p. 94: "...Thebes at 50 hectares, Mycenae at 32 hectares..."

      Melpomene (/mɛlˈpɒmɪniː/Ancient GreekΜελπομένηromanizedMelpoménēlit. 'to sing' or 'the one that is melodious'), initially the Muse of Chorus, she then became the Muse of Tragedy, for which she is best known now.[1] Her name was derived from the Greek verb melpô or melpomai meaning "to celebrate with dance and song." She is often represented with a tragic mask and wearing the cothurnus, boots traditionally worn by tragic actors. Often, she also holds a knife or club in one hand and the tragic mask in the other.

      Melpomene is the daughter of Zeus and Mnemosyne. Her sisters include Calliope (muse of epic poetry), Clio (muse of history), Euterpe (muse of lyrical poetry), Terpsichore (muse of dancing), Erato (muse of erotic poetry), Thalia (muse of comedy), Polyhymnia (muse of hymns), and Urania (muse of astronomy). She is also the mother of several of the Sirens, the divine handmaidens of Kore (Persephone/Proserpina) who were cursed by her mother, Demeter/Ceres, when they were unable to prevent the kidnapping of Kore (Persephone/Proserpina) by Hades/Pluto.

      In Greek and Latin poetry since Horace (d. 8 BCE), it was commonly auspicious to invoke Melpomene.[2]

      See also [AREXMACHINA]

      Flagstaff (/ˈflæɡ.stæf/ FLAG-staf;[6] NavajoKinłání Dookʼoʼoosłííd Biyaagi, Navajo pronunciation: [kʰɪ̀nɬɑ́nɪ́ tòːkʼòʔòːsɬít pɪ̀jɑ̀ːkɪ̀]) is a city in, and the county seat of, Coconino County in northern Arizona, in the southwestern United States. In 2018, the city's estimated population was 73,964. Flagstaff's combined metropolitan area has an estimated population of 139,097.

      Flagstaff lies near the southwestern edge of the Colorado Plateau and within the San Francisco volcanic field, along the western side of the largest contiguous ponderosa pine forest in the continental United States. The city sits at around 7,000 feet (2,100 m) and is next to Mount Elden, just south of the San Francisco Peaks, the highest mountain range in the state of Arizona. Humphreys Peak, the highest point in Arizona at 12,633 feet (3,851 m), is about 10 miles (16 km) north of Flagstaff in Kachina Peaks Wilderness. The geology of the Flagstaff area includes exposed rock from the Mesozoic and Paleozoic eras, with Moenkopi Formation red sandstone having once been quarried in the city; many of the historic downtown buildings were constructed with it. The Rio de Flag river runs through the city.

      Originally settled by the pre-Columbian native Sinagua people, the area of Flagstaff has fertile land from volcanic ash after eruptions in the 11th century. It was first settled as the present-day city in 1876. Local businessmen lobbied for Route 66 to pass through the city, which it did, turning the local industry from lumber to tourism and developing downtown Flagstaff. In 1930, Pluto was discovered from Flagstaff. The city developed further through to the end of the 1960s, with various observatories also used to choose Moon landing sites for the Apollo missions. Through the 1970s and '80s, downtown fell into disrepair, but was revitalized with a major cultural heritage project in the 1990s.

      The city remains an important distribution hub for companies such as Nestlé Purina PetCare, and is home to the U.S. Naval Observatory Flagstaff Station, the United States Geological Survey Flagstaff Station, and Northern Arizona University. Flagstaff has a strong tourism sector, due to its proximity to Grand Canyon National ParkOak Creek Canyon, the Arizona SnowbowlMeteor Crater, and Historic Route 66.

      PSANSDISL #LWDISP either without gas or seeing cupidic arroz in "thank you" or "allta, wild" ...

      pps: a magnanimous decision ...

      I stand here on the brink of what appears to be total destruction; at least of everything I had hoped and dreamed for ... for the last decade in my life which appears literally to span thousands of years if not more in the eyes of some other beholder. I spent several months in Kentucky telling a story of a post apocalyptic and post-cataclysmic delusion; some world where I was walking around in a "fake plane" something like a holodeck built and constructed around me as I "took a walk around the world" to ... it did anything but ease my troubled mind.

      Recently a few weeks in Las Vegas, and a similar story; telling as I walked penniless down the streets filled with casino's and anachronistic taxi-cabs ... some kind of vision of the entirety of the heavens or the Earth or the "choir of angels" I think of when I echo the words Elohim and Aesir from mythology ... there with me in one small city in superposition; seeing what was a very well put together and interesting story about a "star port" Nirvane ... a place that could build cities into the face of mountains and half working monorails appearing in the sky---literally right before my eyes.

      I suppose this is the place "post cataclysm" though I still have trouble understanding what it is that's actually about ... in my mind it connects to the words "we are losing habeas" echo'ed from the streets of Los Angeles in a more clear and more military voice than usual--as I walked block by block trying to evade a series of events that would eventually somehow connect all the way to the "outskirts of Orlando, Florida" in a place called Alhambra.

      Apparently the name of a castle; though I wasn't aware of that until much later.

      It doesn't feel at all like a "cataclysm" to me; I see no great rift--only a world filled with silent liars, people who collectively believe themselves to have stolen something--something gigantic--at least that's the best interpretation of the throws and impetus behind the thing that I and mythology together call Jormungandr. With an eye for "mythological connections" you could clearly see that name of the Great Serpent of Revelation connects to something like the Unseelie; the faeries of Gaelic lore. To me though this world seems still somewhat fluid, it's my entire life--moving from Plantation to a place where the whole of it might be Bethlehem and to "clear my throat" it's not hard to see here how that land of "coughs" connects to the Biblical land of Nod and to the "Adamically sieved" Snifleheim ... from just a little twist on the ancient Norse land most probably as close to Hel as anyone ever gets--or so I dream and hope---still today. It all looks so real and so fake at the same time; planned for thousands of generations, the culmination of some grand masterpiece story that certainly ties history and myth and reality into a twisted heap of "one big nothing, one big nothing at all."

      I've tried to convey to the world how important I believe this place and this time to be--not by some choice of my own ... but through an understanding of the import of our history and the impact of having it be so obviously tuned and geared towards this specific time ... many thousands of years literally all focused on a single moment, on one day or one hour or even just a few years where all of that gets thrown down on the table as if some trump card has been played--and whether or not you fathom the same magnanimous statement or situation or position ... to me, I think it depends on whether or not you grew up in the same kind of way, believing our history to be so fixed and so difficult to change. I don't particularly feel like that's the "zeitgeist" of today; I feel like the children believe it to be some kind of game, and that it is such as easy thing to "sed" away or switch and turn into something else--another story, another purpose ... anyone's personal fantasy land come true.

      I don't think that's the case at all, it's clearly a personal nightmare; and it's clearly one we've seen time and time again--though not myself--the Jesus Christ that is the same yesterday, today; and once again perhaps echoing "no tomorrow" never remembers or believes that we've "seen it all before" or that we've ever really gotten the point; the thing you present to me as "factual reality" is a sickness, it disgusts me; and I'd do anything to go back to the world "where I was so young, and so innocent" and so filled with starry-eyed hope that we were at the foot of something grand and amazing that would become an empire turned republic of the heavens; filling the stars ... with the kind of love for kindness and fairness that I once associated very strongly with the thing I still believe to be the American Spirit.


      "Suddenly it changes, violently it changes" ... another song echoes through the ages--like the "words of the prophets dancing ((as light)) through the air" ... and I no longer even have a glimmer of hope that the thing I called the American People still exist; I feel we've been replaced by some broken container of minds, that the sky itself has become corrupt to the point that there's no hope of turning around this thing that I once believed with all my heart and all my mind was so obviously a "designed downward spiral" one that was---again--so obviously something of a joke, intended to be easy to bounce off a false bottom and springboard beyond "escape velocity" and beyond the dark waters of "nearest habitable star systems (being so very far away)" into a place where new words and new ideas would "soar" and "take flight."

      Here though; I am filled with a kind of lonely sadness ... staring at what appears to be the same mistake(s) happening over and over again; something I've come to call "skipping stones in the pond of reality" and really do liken it to this thing that appears to be the new meaning of "days" and ... a civilization that spends absolutely no love or lust to enter a once sacred and holy place and tarnish it with their sick beliefs and their disgusting desires. You all ... you appear to be some kind of springboard to "bunt" forth yet another age or era of nothingness into the space between this planet and "none worth reaching" and thank God, out of grasp. Today, I'd condemn the entirety of this world simply for it's lack of "oathkeepers" and understanding of what the once hallowed words of Hippocrates meant to ... to the people charged and dharmically required to heal rather than harm.

      It appears the place and time that was once ... at least destined to be the beginning of Heaven ... has become a "recurring stump" of some future unplanned and tarnished by many previous failed efforts and attempts to overcome this same "lack of conversation or care" for what it meant to be "humane" in a world where that was clearly set high aloft and above "humanity" in the place where they--where we were the best nature had to offer, the sanest, the kindest; the shining last best hope.


      Today I write almost every day ... secretly thanking "my God" for the disappearance of my tears and the still small but bright hope that "Tearran" will one day connect the Boston Tea Party and the idea that "render to Caesar" and Robin of Loxley ... all have something to do with a re-ordering of society and the worth and import of "money" ... to a place that cares more for freedom from murder than it does ... "freedom from having to allow others to hear me speak." I hold back tears and emotions; not by conscious choice or ability but ... still with that strange kind of lucky awkward smile; and secretly not so far below the surface it's the hope of "a swift death" that ... that really scares me more than the automatons and mechanical responses I see in the faces of many drivers as they pass me on the street--the imagery of connecting it to the serpentine monster of the movie Beetlejuice ... something I just "assume" the world understands and ... doesn't seem to fear (either); as if Churchill had gotten it all wrong and backwards--the only thing you have to fear, is the loss of fear of "loss."


      Here my crossroads---halfway between the city my son lives in and the city my parents live in--it's on making a decision on whether I should continue at all, or personally work on some kind of software project I've been writing about, or whether I should focus on writing about a "revolution" in government and society that clearly is ... "somewhat underway." In my mind it's obvious these things are all connected; that the software and the governance and the care of whether or not "Babylon" is remembered as a city of great laws and great change or a city of demons and depravity ... that these thi]ngs all hinge and congeal around a change in your hearts; hoping you will chose to be the beginning of a renaissance of "society and civilization" rather than the kings and queens of a sick virtual anarchy ... believing yourselves to have stolen "a throne of God" rather than to literally be the devastating and demoralizing depreciation of "lords and fiefdoms" to something more closely resembled by the time of the Four Horsemen depicted in Highlander.

      These words intended to be a "forward" to yet another compliment of a ((nother installment of a partial)) chain of emails; whimsically once half-joking ... I called it the Great Chain of Revelation. The software too; part of the great chain, this "idea" that the blockchain revolution will eventually create a distributed and equal governance structure, and a rekindling of monetary value focused on "free and open collaboration" rather than "survival of the most unfit"--something society and civilization seem to have turned the "call of life" from and to ... literally just in the last few years as we were so very close to ... reaching beyond the Heaven(s).

      I don't think its hard to imagine how a "new set of ground rules" could significantly change the "face of a place" -- make it something shiny and new or even on the other side of the coin, decayed or depraved. It's not hard to connect the kind of change I'm hoping for with "collision protection" and "automatic laws" to the (perhaps new, perhaps ... ancient) Norse creation story of the brothers of Odin: Vili and Ve.

      It might be hard to see today how a new "kind of spiritual interaction" might be only a few "mouse clicks" away though--how it could change everything literally in a flash of overnight sensation ... or how it might take something like a literal flash of stardom (or ... on the other hand, something like totalitarian or authoritarian "iron fisting") to make a change like this "ubiquitious" or ... something like the (imagined in my mind as ... messianic) "ED" of storming through the cosmos or the heavens and turning something that might appear to be "free and perfect feeling" today into a universe "civlized overnight" and then ...

      I wonder how long it would take to laud a change like that; for it to be something of a voluntary "reunderstanding" of a process ... to change the meaning of every word or every thought that connects to the process of "civilization" to recognize that something so great and so powerful has happened as to literally change the meaning of the word, to turn a process of civilization into something that had a ... "signta-lamcla☮" of forboding and then a magical staff struck into the heart of a sea and then ... and then the word itself literally changes to introduce a new "mid term" or "halfway point" in which a great singularity or enlightenment or change in perspective or understanding sort of acknowledges ...

      that some "clear outside" force not only intervened on the behalf of the future and the people of our world but that it was uniquely involved in the whole of--

      "waking up" tio a nu def of #Neopoliteran.

      ^Like the previous notation; the below text comes from an email previously sent; and while i stand behind things like my sanity, my words; and my continued and faithful attempt to speak and convey both a useful and helpful truth to the world---sometimes just a single day can make all the difference in the world.

      Sometimes it's just a single moment; a flash or a comment about ^th@ blink of an eye" ... and I've literally just "thought up/had/experienced/transitioned thru" that exact moment. The lies standing between "communication" and either "cooperation" or .... some other kind of action have become more defined. More obvious. Because of this clarification; like a kind of "ins^tant* gnosis"

      ... search high and lo ... the depths all the way to above the heavens ...\ \ for a festive divorce ceremonial ritual ... that looks something like a bachelor party ':;]

      --- @amrs@koyu.SPACe ... @suzq@rettiwtkcuf.social (@yitsheyzeus) May 22, 2020

      I ... TERON;

      Gjall are painting me into a corner here; and I don't see around it anymore--I don't see the light, and I don't see the point. I was a happy-go-lucky little kid in my mind; that's not "what I wanted to be" or what I wanted to present, it's who I was. I saw "Ashkenazi" and ... know I am one of those ... and I kind of understood that something horrible might have happened, or might happen here--and I kind of understand that crying smashing feeling of "to ash" that echoes through the ages in the potpourri songs about pockets full of Parker Posey .. and ancient Psalms about "from the ashes of Edom" we have come--and from that you can see the cyclical sickness of this ... place so sure it's "East of Eden" and yet gung-ho on barrelling down the same old path towards ash and towards Edom and towards ... more of Dave's "ashes to ashes dust to dust" and his "smoke clouds roll and symphony of death..." and few words of solace in a song called Recently that I imagine was fleeting and has recently come and gone--people stare, I can't ignore the sick I see.

      I can't ignore his "... and tomorrow back to being friends" and all but wonder who among us doesn't realize it's "ash" and "gone" and "no memory of today" that's the night between now and ... a "tomorrow with friends" not just for me--but for all of you--for this place that snickers and pantomimes some kind of ... anything but "I'm not done yet" and "there's more ... vendetta ... and retribution to be had, Adam ... please come back in a few more of our faux-days." This is sickness; and happy-go-lucky Himodaveroshalayim really doesn't do much but complain about that word, the "sickle" and the tragic unavoidable ... ash of it all ... these days--you'd think we could "pull out" of this mess, turn another way; smile another day, but it seems there's only one way to get to that avenu in the mind of ... "he who must not know or be me."


      I have to admit I found some joy in the epiphany that the hidden city of Zion and it's fusion with the Namayim' version of how that "Ha" gels and jives with the name Abraham and the Manna from Heaven and the bath salt and the tina and the "am in e" of amphetamine--maybe a glimmer or a shimmer or a glow of hope at the moment "Nazion" clicked ... and I said ... "no, not me ... I'm nothing like a king, no dreams of authoritarianism at all in the heart of Kish@r;" even as I wrote words that in the spirit of the moment were something of a "tis of a'we" that connected to my country and the first sing-songy "tisME" that I linked to trying to talk in the rhyming spirit of some "first Christ" that probably just like me was one limmerick away from the end of the rainbow and one "Four Non Blondes" song away from tying "or whatever that means" and this land crowned with "brotherhood" (to some personal "of the Bell, and of the bell towers so tall and Crestian") to just one Hopp skip and jump away from the heart of the obvious echoes of a bridge between haiku and Heroku... a few more gears shift into place, a click and and a mechanical turn of the face of the clock's ku-ku striking ... it was the word "Earthene" that was the last "Jesusism" around the post Cimmerian time linking Dionysus and Seuss to that same "su-s" that's belonging to a moment in the city of Uranus--codified and etched in stone as "MCO"--not just for its saucer and warp nacelles and "deflector dish" but for it's underground caverns and it's above ground "Space Mountain" and that great golf ball in the heart of it all.

      The gears of time and the dawns of civilizequey.org query the missing "here" in our true understanding of what "in the beginning, to hear; to here ... to rue the loss of the Maize from Monoceros to the VEGA system and the tri-galactic origin of ... "some imaginary universal ... Earthene pax" to have dropped the ball and lost it all somewhere between "Avenu Malkaynu" and melaleuca trees--or Yggrasil and Snifleheim--or simply to miss the point and "rue brickell" because of bricks rather than having any kind of love or nostalgia linking to a once cobblestone roadway to the city in the Emerald skies paved in golden "do not return" signs ... to have lost Avenues well after not realizing it was "Heaven'es that were long gone far before I stepped foot on this road once called too Holy for sandals" in a place where that Promised Land and this place of "K'nanites" just loses it's grip on reality when it comes to mentioning the possibility that the original source and story of Ca'anan was literally designed to rid the world of ... "bad nanites" and the mentality of ... vindictiveness that I see behind every smirk.

      The final hundred nanoseconds on our clock towards doom and gloom cause another bird to fly; another snake to curl up and listen again to the songs designed to charm it into oblivion; whether that's about a club in South Beach or a place not so far from our new "here..." all remains to be seen in my innocent eyes wondering what it truly is that stands between what you are ... and finding "forgiveness not needed--innocent child writes to the mass" ... and the long arm of the minute hand and the short finger of the hour for one brief moment reconcile and move towards "midnight" together; and it's simply idyllic, the Nazarene corner between nil and null you've relegated the history of Terran poast futures into ... "foreves mas" or so they (or you) think.


      I'm still so far from "Five Finger Death Punch" though; and so far from Rammstein and so far from any kind of sick events that could stand between me and "the eternal" and change my still "casual alternative rock" loving heart to something more death metal; I rue whatever lies between me and there being any kind of Heaven that thinks there could exist a "righteous side" of Hell and it... simultaneously.


      I still see light here in admonishing the masses and the angels standing against the story and the message God brings us in our history. I still see sparks in siding with the "causticness" of "no holodecks in sight" and the hunger and the pain of simulating ... "the hells of reality" over the story of decades or centuries of silence refusing to see "holography" and "simulated" in the word Holocaust and the horrors of this place that simply doesn't seem to fathom or understand the moments of hunger pangs and the fear of "dark Earth pits" or towers of "it's not Nintendo-DS" linking the Man in the High Castle to an Iron Mask.

      I rally against being what I clearly am raised high on some pedestal by some force beyond my comprehension and probably beyond that of the "perfect storm in time" that refuses to itself acknowledge what it means to gaze at such an unfathomable loss of innocence at the cost of a "happy and serene future" or even at the glimmer of the Never-Never-Land I'd hoped we would all cherish and love and share ... the games and the newfound freedom that comes not just from "seeing Holodeck" turn into "no bullets" and "no cages" but into a world that grows and flourishes into something that's so far beyond my capability to understand that I'm stuck here; dumbfounded; staring at you refusing to stop car accidents and school shootings ... because "pedestal." For the "fire and the glory" of some night you refuse to see is this one--this place where morality rekindles from ... from what appears tobe one small candle, but truly--if it's not in your heart, and it's not coming from some great force of goodness--fear today and a world of "forever what else may come."


      Here in a place the Bible calls Penuel at the crossing of a River Jordan ... the Angel of the Lord notes the parallels in time and space between the Potomac and the Rhine--stories of superposition and cities and nation-states that are nothing more than a history of a history of things like the Monoceros "arroz" linking not just to the constellation Orion but to Sagittarius and to Cupid and of course to the Hunter you know so well--

      Searching for a Saturday; a sabbath to be made Holy once more ... "at the Rubycon"

      The Einstein-Rosen Wormhole and the Marshall-Bush-JFKjr Tunnel

      The waters are called narah, (for) the waters are, indeed, the offspring of Nara; as they were his first residence (ayana), he thence is named Narayana.

      --- Chapter 1, Verse 10[3]

      In a semi-fit of shameless arexua-self recognition i'm going to mention Amazon's new series "Upload" and connect it to the PKD work that my Martian-in-simulcrum-ciricculum-vitae on "colonization education" ... tying together Transcendance, Total Recall and ... well; to be honest it actually gave me another "uptick" in the upbeat ... maybe i'll stick around until I'm sure there's at least one more copy of me in the ivrtual-invverse ... oh, that reminds me ... Farmer)'s Lord of Opium also touches on this same "mind of God in the computer" subject (which of course leads to Ghost in the Shell and Lucy--thanks Scarlette :).

      While I'm listing Matrix-intersected pieces of the puzzle to No Jack City, Elon Musk's neuralace and Anderson's Feed are also worth a mention. Also the first link in this paragraph is titled ... "the city of the name of time never spoken after time woke up and stfu'd" (which of course is the primary subject of this ... update to the city Aerosol).

      The ... "actual original typed dream" included a sort of "roller coaster ride" through space all the way to Mars; where the real purpose of "the thing" I am calling the "Mars Hall" was to display previous victories and failures ... and the introduction of "older or future" culture's suggestions for "the right way" to colonize a new habitat. If it were Epcot Center, this would be something like SpaceMountain taking you to to the foture of "Epcot Countries" as if moving from "countries" to planets were as easy as simply ... "reading backwards."

      THE SOFTWARE, SINGERS, AND SHIELD(S)

      OF

      HEIROSOLYMITHONEYY

      Thinking just a little bit ahead of myself, but I'm on "Unreal Object/Map Editor within the VR Server" and calling it something like "faux-wet-ware" ... which then of course leads to a similar onomonopeia of "weapons and ..." where-with-all to find a better singer's name to connect the road of "sword" to a Wo'riordan ... but I think that fusion of warrior and woman probably does actually say ... enough of it all; on this road to the living Bright Water that the diety in my son's middle name defines well here, as "waking up," stretching it's tributaries and it's winding wonders and wistfully ....

      Narayana (Sanskrit: नारायण, IASTNārāyaṇa) is known as one who is in yogic slumber on the celestial waters, referring to Lord Maha Vishnu. He is also known as the "Purusha" and is considered the Supreme being in Vaishnavism.

      andromedic; the ports of call ... to the mediterranean (literally) from the gulf coast;

      ... ho engages in the creation of 14 worlds within the universe as Brahma when he deliberately accepts rajas guna, himself sustains, maintains and preserves the universe as Vishnu by accepting sattva guna. Narayana himself annihilates the universe at the end of maha-kalp ...

      .

      there's no place like home. there's no place like home. there's no place like home.

      and so it begins ... "f:

      r e l i g i o n

      find out what it means to me. faucet, ever single one, stream of purity ...

      from Fort Myers ... f ... flicks ... Flint.

      "

      ^this notation will from this email forward in linear time denote some form of contact method or information related to the context of the message you are reading. This particular one sends me an encrypted email. 5if there is an "@" symbol involved in the "anchor's hypertext reference" (technically an "a href=" in HTML4) your browser should attempt to open an email client to send a message over an anonymous SMTP relay. Understand that "anonymous" in this case may or may not mean your sending email address is hidden or obvuscated--so if you want to receive a reply you must include it in the DATA of your SMTP transmission defined by the RFC5321 attached. In most cases "anonymous" also means that you will not have the recipients direct contact information unless they have made it public---additionally the exact server/system/relay used may or may not be the "Sbroken Berkman Perl Script" linked to in the "hypertext reference" specifically anchored to the words "an anonymous SMTP relay" above.

      A simple "hat character" (^) and the letter "t" as you see beginning the above paragraph will denote a contact method or form that works over the internet using an HTTP protocol defined in a series of RFC's including (but not limited to) RFC's numbered as 2616, 7230, 7235, 2068 and use a simple language which is based on a definition suggested or proposed currently by an organization called the "W3C Consortium"

      ---and ... previously set and defined by an organiza^tion located at html.spec.whatwg.org; which appears (to me, for the first time as I write these words) to follow the conceptual spirit of the "living document" defined by the several "Continental Congresses, et alia." I personally now conjoin this document in my head to a procession of patrilineal or matrilnear predecessors to the actual event .... still to be defined ... but related to this specific email, this mailing list; its contributors and readers as well as actual members of the organization (still to be created, defined, or named) that creates a "round table" of members that is open to the public, to all voters educated enough to understand the specific issue being voted on (up to a standard that; in this place and time appears to be unset and unmet but materially related to reawching the age of 18 years old; growing up in or being born in the United States of America (related spec. to the Constitution of the United States of America which is officially "self-defined" through a process which includes all three branches of the government which it also "self-defines" and purports to be "of, for, and by the people"--though the general population is only able to contribute through an indirect process (read:the people cannot directly contribute to the constitution without either running for office (like a senator) or being appointed to a specific government position (like a judge or executive branch public servant).

      The current state of American representative democracy is the highest standard to which I am currently knowledgable of "extant"--and it is specifically substandard, inferior, and "just not good enough" as a comparison to the process required to vote in the organization being "self-defined" through this process*. It is my sincere and clear hope that "this process" will result in a legal and moral amendment to the document shown in the previous link and presented by the Legislative Branch of the United States here. It is my current and faithful belief that anything else would also be significantly below the standards morally required by "this process" which of course includes over 200 years of American citizenship and (other international relations; i.e.e.gfor "iv" exampleid estexemplia gratia) as well as the Sons of Liberty and prior to that contributions from the Crown and the "Parliament and Crown" of the United Kingdom; among others et alea's ifndef: 'swikipedia/et_al..

      To note specifically because of lack of personal knowledge and public notoriety (assuming all other requiremnant* achem requirements)

      alas, babylon.

      i listened to a man yesterday who was talking about "true heroes" ... he of course noted jesus christ and superman together, suggesting the first was one, and the second just a fiction. he also talked about people like ghandi and "leaders who use non-violent means to "change the world." i at least agree with him on the third, ghandi is a good prototype for some kind of hero. staring at this ... "to be completed" work on tales of two cities, whether from sodom and gomorrah all the way to athens and sparta and perhaps even london and paris--and this particular city, babylon; it stands out as one which truly has no equal or even "mirror" in the history of the world. i suppose i'd add "alexandria" and suggest the library and the laws; something that are fundamental to the ethos of the planet i call "athens."

      i imagine he did not know "hammurabi's" name; and even today in this place where i ask and do not receive answers; i imagine you still don't connect muhammad or amsterdam ... to this king who in our history is set apart and lifted high on a pedestal of having "codified and written down" laws ... for the very first time. it's almost comical, it took me a paragraph and a sentence to connect "the king and i" to this mirror world, where the bible and the people have most assuredly decided "babylon" is a negative thing or a depraved place.

      "fallen, fallen, is [the city of] babylon the great"

      ... just a quote from one of my favorite movies; which of course is re-quoting "dante" and/or "the bible"

      "a dwelling place [of] (the) demons (say), it has become."

    1. Author Response

      eLife assessment

      This useful paper examines changes (or lack thereof) in birds' fear response to humans as a result of COVID-19 lockdowns. The evidence supporting the primary conclusion is currently inadequate, because the model used does not properly account for many potentially confounding factors that could influence the study's outcomes. If the analytic approach were improved, the findings would be of interest to urban ecologists, behavioral biologists and ecologists, and researchers interested in understanding the effects of COVID-19 lockdowns on animals.

      Many thanks for these supportive words. We did our best to improve our manuscript according to the reviewers and editor comments. Importantly, we regret being unclear in the Methods, as our models already controlled for most of the confounds (see below) discussed by the reviewers.

      For example, given that a single observer collected the data at most sites, site as a random intercept in the models controls also for the observer effects (which is one of the reasons why site is in the model). We added details to Methods (L352-356, see also “Statistical analyses” in the main text).

      The first reviewer asked us to use “some measure of urbanity (e.g. Human Footprint Index) that varies across the cities included here”. Our main results are now based on country-specific models and hence, the use of a single value predictor for each city is not appropriate. Please, see also below.

      The second reviewer is concerned about multicollinearity in our models because of the 0.95 correlation between Period and Stringency Index. However, these are key predictor variables of interest that have never been used within the same model as predictors. We now clearly explain this in the Methods (L458-538, 548-550) and within legend of Figure S2.

      The third reviewer suggested that our models would benefit from controlling for day in the species-specific breeding cycle. Although we don’t have precise city-specific information on the timing of breeding stages in the sampled populations of birds, we partly control for these effects by including a random intercept of day within each year and species. This random factor explained most of the variance (see Table S1-S2) – something that could have been expected. In other words, we do control for what the third reviewer asked for. Similarly, we account for habitat features that may influence escape distance by including site in the models. Site usually refers to a specific park (we assume that within-park heterogeneity is lower than between park variation) and hence partly addresses the reviewer’s concern. Again, we highlight this within the Methods (L466-476).

      Reviewer #1 (Public Review):

      This paper uses a series of flight initiation "challenges" conducted both prior to and during COVID-19-related restrictions on human movement to estimate the degree to which avian escape responses to humans changed during the "anthropause". This technique is suitable for understanding avian behavioral responses with a high degree of repeatability. The study collects an impressive dataset over multiple years across five cities on two continents. Overall the study finds no effect of lockdown on avian escape distance (the distance at which the "target" individual flees the approaching observer). The study considers the variable of interest as both binary (during lockdown or prior to lockdown) and continuous, using the Oxford Stringency Index (with neither apparently affecting escape distance). Overall this paper presents interesting results which may suggest that behavioral responses to humans are rather inflexible over "short" (~2 year) timespans. The anthropause represents a unique opportunity to disentangle the mechanistic drivers of myriad hypothesized impacts humans have on the behavior, distribution, and abundance of animals. Indeed, this finding would provide important context to the larger body of literature aimed at these ends.

      Thank you very much for your positive feedback.

      However, the paper could do more to carefully fit this finding into the broader literature and, in so doing, be a bit more careful about the conclusions they are able to draw given the study design and the measures used. Taking some of these points (in no particular order):

      Thank you. We did our best in addressing your comments (see below and updated Methods, Results and Discussion sections).

      1) Oxford Stringency Index is a useful measure of governmental responses to the pandemic and it's true that in some scenarios (including the (Geng et al. 2021) study cited by this paper) it can correlate with human mobility. However, it is far from a direct measure of human mobility (even in the Geng study, to my reading, the index only explained a minority of the variation). Moreover, particular sub-components of the index are wholly unrelated to human mobility (e.g. would changes to a country's public information campaign lead to concomitant changes in urban human mobility?). Finally, compliance with government restrictions can vary geographically and over time (i.e. we might expect lower compliance in 2021 than in 2020) and the index is calculated at the scale of entire countries and may not be very reflective of local conditions. Overall this paper could do more to address the potential shortcomings of the Oxford Stringency Index as a measure of human mobility including attempting to validate the effect on human mobility using other datasets (e.g. the google dataset and/or those discussed in (Noi et al. 2022). This is of critical importance since the fundamental logic of the experimental design relies on the assumption that stringency ~ mobility.

      Thank you for this comment. First, Oxford Stringency Index seemed to us as the best available index for our purposes, i.e to estimate people's mobility during the shutdown because restrictions surely influenced the possibility that people would be outside, and because the index is a country-specific estimate. However, in addition, we now checked all indices mentioned in Noi et al. 2022 and found useful only the Google Mobility Reports, which we now use, because (a) it is publicly available, (b) it is available also for territories outside US, and (c) provides data for each city included in our dataset as well as for urban parks where most of our data were collected. Note that some platforms are no longer providing their mobility data (e.g. Apple).

      However, Google Mobility provides day-to-day variation in human mobility, whereas we are interested in overall increase/decrease in human mobility. Nevertheless, we correlated the Google mobility index with the Stringency index and found that human mobility generally decreases with the strength of the anti-pandemic measures adopted in sampled countries (albeit the effect for some countries, e.g. Poland, is small; Fig. 5).

      Moreover, we also added analysis using # of humans collected directly in the field during escape trials (e.g. Fig. 6 and S6) and found that the link between # of humans and Stringency index or Google Mobility was weak and noise, 95%CIs widely crossing zero (Fig. 6).

      Importantly, if we use Google Mobility and # of humans, respectively, as predictors of escape distance, the results are qualitatively very similar to results based on Oxford Stringency Index (Fig. S6), or Period, with tiny effect sizes for both (95%CIs for Google Mobility -0.3 – 0.06, Table S5, for # of humans -0.12 – 0.02, Table S6) supporting our previous conclusions.

      Note that Google Mobility and the number of humans have their limitations (see our comment to the editor and the Methods section in the main manuscript, e.g. L418-433). The lack of Google Mobility data for years before the COVID-19 pandemic does not allow us to fully explore whether overall human activity decreased during COVID-19 or not (our test for period prior and during COVID-19). If the year 2022 reflects a return to “normal” (which is to be disputed due to COVID-19-driven rise in home office use) the 2020 and 2021 had on average lower levels of human activity (Fig. 4). Whether such a difference is biologically meaningful to birds is unclear given the immense day-to-day change in human mobility and presence (Fig 4). Moreover, the number of humans capture within- and between-day variation rather than long-term changes in human presence.

      We added details on the new analysis into the method and results sections (e.g. Fig. 4-6; L142-165, 418-438, 495-535) and Supplementary Information (Figs. S5-S9 and associated Tables) and discuss the problematic accordingly. Moreover, to enhance clarity about country specific effect (or their lack), we also add country specific estimates to the Results (Fig. 1 and Fig. S6 and respective Tables). Finally, our statistical design and random structure of the model allowed us to control for spatial and temporal variation in compliance with government restrictions.

      2) The interpretation of the primary finding (that behavioral responses to humans are inflexible) could use a bit more contextualization within the literature. Specifically, the study offers three potential explanations for the observed invariance in escape response: 1) these behaviors are consistent within individuals and this study provides evidence that there was no population turnover as a result of lockdowns; 2) escape response is linked to other urban adaptations such that to be an urban-dwelling species dictates escape response; and/or 3) these populations already exhibit maximum habituation and the reduction in human mobility would only have increased that habituation but that trait is already at a boundary condition. Some comments on each of these respectively:

      Thank for these comments. We incorporated them in the main text (L293-329). Your point 1) corresponds to our point (i): “Most urban bird species in our sample may be relatively inflexible in their escape responses because the species may be already adapted to human presence” (L293-306); your point 2) to our point (ii): “Urban environment might filter for bold individuals (Carrete and Tella, 2013, 2010; Sprau and Dingemanse, 2017). Thus, the lack of consistent change in escape behaviour of urban birds during the COVID-19 shutdowns may indicate an absence (or low influx) of generally shy, less tolerant individuals and species from rural or less disturbed areas into the cities…” (L307-314); your point 3) to our point (iii): “Urban birds might have been already habituated to or tolerant of variation in human presence, irrespective of the potential changes in human activity patterns” (L315-329). To distinguish between (ii) and (iii) or the two from (i), individually-marked birds and comprehensive genetic analyses are needed, which we now note in the Discussion (L330-348). Importantly, we also discuss that the lack of response might be due to relatively small changes in human activity (L253-292), which we unfortunately could not fully quantify.

      a) Even had these populations turned over as a result of a massive rural-to-urban dispersal event, it's not clear that the escape distance in those individuals would be different because this paper does not establish that these hypothetical rural birds have a different behavioral response which would be constant following dispersal. Thus the evidence gathered here is insufficient to tell us about possible relocations of the focal species.

      Thank you for this point. We address this point in the Introduction and Discussion (L92-101, 307-314). Rural bird populations/individuals are on average less tolerant of humans than urban birds (e.g. Díaz et al. 2013, PloS One 8:e64634; Tryjanowski et al. 2020, J Tropic Ecol 36:1-5; Mikula et al. 2023, Nat Commun 14:2146) and at the same time, bird individuals seem consistent in their escape responses (Carrete & Tella 2010, Biol Lett 23:167–170; Carrete & Tella 2013, Sci Rep 3:1–7).

      Additionally, the paper cites several papers that found no changes in abundance or movements of animals in response to lockdowns but ignore others that do. For example: (Wilmers et al. 2021), (Warrington et al. 2022) (though this may have been published after this was submitted...), and (Schrimpf et al. 2021).

      We added the papers (L89-91). Thank you!

      There is a missed opportunity to consider the drivers of some of these results - the findings in this paper are interesting in light of studies that did observe changes in space use or abundance - i.e. changes in space use could arise precisely because responses to humans are non-plastic but the distribution and activities of humans changed.

      Thank you. Indeed, we now address this in the Discussion (L303-306): “However, some studies reported changes in the space use by wildlife (Schrimpf et al., 2021; Warrington et al., 2022; Wilmers et al., 2021). and these could arise, as our results indicate, from fixed and non-plastic animal responses to humans who changed their activities”.

      To wit, the primary finding here would imply that the reaction norm to human presence is apparently fixed over such timescales - however, and critically, the putative reduction in human activity/mobility combined with fixed responses at the individual level might then imply changes in avian abundance/movement/etc.

      Unfortunately, we have not measured changes in avian abundance or movements. But, please, note that the change in human mobility in sampled cities might be not as dramatic as initially thought and we consider this scenario to be most plausible in explaining no significant differences in avian escape responses before and during the COVID-19 shutdowns (see Fig. 4). Nevertheless, we add your point into the Discussion: If our findings imply that in birds the reaction norm to human presence is fixed over the studied temporal scale, the putative changes in human presence might then imply changes in avian abundance or movement (L293 and text below it).

      b) If this were the case, wouldn't this be then measurable as a function of some measure of urbanity (e.g. Human Footprint Index) that varies across the cities included here? Site accounted for ~15% of the total variation in escape distance but was treated as a random effect - perhaps controlling for the nature of the urban environment using some e.g. remotely sensed variable would provide additional context here.

      Urbanity mirrors the long-term level of human presence in cities whereas we were interested mainly in the rather short-term effects of potential changes of human presence on bird behaviour. Thus, we are not sure how adding such variable will help elucidating the current results. Please, also note that we added the country-specific analysis. Site indeed accounted for considerable amount the total variance in escape distance and that is why it was included as random intercept, which controls for non-independents of data points from each city. This could partly help us to control for difference in habitat type (e.g. urbanization level) within cities.

      c) Because it's not clear the extent to which the populations tested had turned over between years, the paper could do with a bit more caution in interpreting these results as behavioral. This study spans several years so any response (or non-response) is not necessarily a measure of behavioral change because the sample at each time point could (likely does) represent different individuals. In fact, there may be an opportunity here to leverage the one site where pre-pandemic measures were taken several years prior to the pandemic. How much variance in the change in escape distance is observed when the gap between time points far exceeds the lifetime of the focal taxa versus measures taken close in time?

      We believe the initial Fig S4, now Figure 2, addresses this point. The between years temporal variation in FIDs exceeds the variation due to lockdowns. This is true both for measures taken in consecutive years, as well as for measures taken far apart.

      d) Finally, I think there are a few other potential explanations not sufficiently accounted for here:

      i) These behaviors might indeed be plastic, but not over the timescales observed here.

      We agree and have added this point (L301-303). Thank you.

      ii) Time of year - this study took place during the breeding season. The focal behavior here varies with the time of year, for example, escape distance for many of these species could be tied up in nest defense behaviors, tradeoffs between self-preservation and e.g. nest provisioning, etc.

      Please, note that we controlled for the date in our analyses. Date was used as a proxy for the progress in the breeding season (L463-464 and Fig. 1 caption). Note that we collected data only from foraging or resting individuals, and data were neither collected near the nest sites nor from individuals showing warning behaviours, which we now note (L400-401).

      iii) Escape behaviors from humans are adaptively evolved, strongly heritable, and not context dependent - thus we would only expect these behaviors to change on evolutionary timescales.

      We discussed this at L307-308 and 381-383. Escape behaviors from humans are highly consistent for individuals, populations, and species (Carrete & Tella 2010, Biol Lett 23:167–170; Díaz et al. 2013, PloS One 8:e64634; Mikula et al. 2023, Nat Commun 14:2146). Whether such behavior is consistent across contexts is less clear (e.g. Diamant et al. 2023, Proc Royal Soc B, in press; but see, e.g. Radkovic et al. 2019, J Ecotourism 18:100-106; Gnanapragasam et al. 2021, Am Nat 198:653-659). Escape distance is often not measured simultaneously, for example, with human presence. In other words, whereas general level of human presence may have no effect on escape distance, the day-to-day or hour-to-hour variations might. We need studies on fine temporal scales (day-to-day or hour-to-hour) using marked individual to elucidate this phenomenon.

      iv) See point one above - it's possible that the lockdown didn't modify human activity sufficiently to trigger a behavioral response or that the reaction norm to human behavior is non-linear (e.g. a threshold effect).

      We agree, now use also Google Mobility Reports and # of humans data to elucidated this phenomenon and have added such interpretations to L253-292 and, e.g. Fig. 4.

      LITERATURE CITED Geng DC, Innes J, Wu W, Wang G. 2021. Impacts of COVID-19 pandemic on urban park visitation: a global analysis. J For Res 32:553-567. doi:10.1007/s11676-020-01249-w

      Noi E, Rudolph A, Dodge S. 2022. Assessing COVID-induced changes in spatiotemporal structure of mobility in the United States in 2020: a multi-source analytical framework. Int J Geogr Inf Sci.

      Schrimpf MB, Des Brisay PG, Johnston A, Smith AC, Sánchez-Jasso J, Robinson BG, Warrington MH, Mahony NA, Horn AG, Strimas-Mackey M, Fahrig L, Koper N. 2021. Reduced human activity during COVID-19 alters avian land use across North America. Sci Adv 7:eabf5073. doi:10.1126/sciadv.abf5073

      Warrington MH, Schrimpf MB, Des Brisay P, Taylor ME, Koper N. 2022. Avian behaviour changes in response to human activity during the COVID-19 lockdown in the United Kingdom. Proc Biol Sci 289:20212740. doi:10.1098/rspb.2021.2740

      Wilmers CC, Nisi AC, Ranc N. 2021. COVID-19 suppression of human mobility releases mountain lions from a landscape of fear. Curr Biol 31:3952-3955.e3. doi:10.1016/j.cub.2021.06.050

      Reviewer #2 (Public Review):

      Mikula et al. have a large experience studying the escape distances of birds as a proxy of behavioral adaptation to urban environments. They profited from the exceptional conditions of social distance and reduced mobility during the covid-19 pandemic to continue sampling urban populations of birds under exceptional circumstances of low human disturbance. Their aim was to compare these new data with data from previous "normal" years and check whether bird behavior shifted or not as a consequence of people's lockdown. Therefore, this study would add to the growing body of literature assessing the effect of the covid-19 shutdown on animals. In this sense, this is not a novel study. However, the authors provide an interesting conclusion: birds have not changed their behavior during the pandemic shutdown. This lack of effects disagrees with most of the previously published studies on the topic. I think that the authors cannot claim that urban birds were unaffected by the covid-19 shutdown. I think that the authors should claim that they did not find evidence of covid-19-shutdown effects. This point of view is based on some concerns about data collection and analyses, as well as on evolutionary and ecological rationale used by the authors both in their hypotheses and results interpretation. I will explain my criticisms point by point:

      We are grateful for your positive appraisal of our manuscript, as well as for your helpful critical comments. We toned down the discussion to claim, as suggested by you, that we did not find evidence for effects of covid-19-shutdowns on escape behaviour of birds in urban settings (see Results and Discussion sections). In general, we attempted to provide a more nuanced discussion and reporting of our findings. We also changed the manuscript title to “Urban birds' tolerance towards humans was largely unaffected by the COVID-19 shutdowns” and added validation using Google Mobility Reports (Fig. 5 & S6, Table S3a and S5) and the actual number of humans (Fig. 6 and S6; Table S3b-e and S6). Note however that there is only a single robust study on the topic of shutdown and animal escape distances (Diamant et al. 2023, Proc Royal Soc B, in press), i.e. the topic is largely unexplored (e.g. L99-101), whereas we discuss our finding in light of shutdown influences on other behaviours (L293-329).

      1) The authors used ambivalent, sometimes contradictory, reasoning in their predictions and results interpretation. Some examples:

      We tried to clarify our reasoning and increased consistency in our claims in the Introduction. Please, note that we simplified the Introduction and now provide one main expectation: FIDs of urban birds should increase with decreased human presence. This pattern is robustly empirically documented, regardless of the mechanism involved (e.g. Díaz et al. 2013, PloS One 8:e64634; Tryjanowski et al. 2020, J Tropic Ecol 36:1-5; Mikula et al. 2023, Nat Commun 14:2146). Please, see our revised Discussion for a more comprehensive discussion of mechanisms which could explain the patterns described in our study.

      1.1) The authors claimed that urban birds perceive humans as harmless (L224), but birds actually escape from us, when we approach them... Furthermore, they escape usually 5 to 20 m away. This is more distance that would be necessary just to be not trampled.

      We agree and have deleted mentions that humans are perceived as harmless.

      1.2) If we are harmless, why birds should spend time monitoring us as a potential threat (L102)? Indeed, I disagree with the second prediction of the authors. I could argue that reduced human activity should increase animal vigilance because real bird predators (e.g. raptors) may increase their occurrence or activity in empty cities. If birds should increase their vigilance because the invisible shield of human fear of their predators is no longer available, then I would expect longer escape distances.

      Thank you for this comment. We deleted this prediction and largely rewrote Introduction based on your comments and comments from the other reviewers.

      1.3) To justify the same escape behavior shown by birds in pre- and pandemic conditions from an adaptive point of view, the authors argued a lack of plasticity and a strong genetic determination of such behavior. This contravenes the plasticity proposed in the previous point or the expected effect of the stringency index (L112).

      We now attempted to write this more clearly while incorporating your suggestions. In the Discussion, we now propose various hypothesis that can, but need not be mutually exclusive. Please, note that we simplified the Introduction and now provide one main hypothesis: FIDs of urban birds should increase with decreased human presence.

      In my opinion, some degree of plasticity in the escape behavior would be really favorable for individuals from an adaptive perspective, as they may face quite different fear landscapes during their lives. Looking at the figures, one can see notable differences in the escape distance of the same species between sites in the same city. As I can hardly imagine great genetic differences between birds sampled in a park or a cemetery in Rovaniemi, for instance, I would expect a major role of plasticity to explain the observed variability. Furthermore, if escape behavior would not be plastic, I would not expect date or hour effects. By including them in their models, the authors are accepting implicitly some degree of plasticity.

      We regret being unclear. We do accept some degree of plasticity. Yet, our study design prohibits the assessment of the degree of individual plasticity because sampled birds were not individually marked and approached repeatedly. We tried to soften the statements in our Discussion to not fully dismiss a possibility that urban birds have some degree of plasticity in their antipredator behaviour (L293-329). Note however, that while our data collection was not designed to test how hour-to-hour changes in human numbers influence escape distance, the effect of the number of humans (i.e. hour-to-hour variation in human numbers) in our sample was tiny.

      The date and hour effect simply control for the particularities of the given day and hour (e.g. warm vs cold times or the time until sunset). In other words, the within species differences (even from the same park) may have little to do with individual plasticity, but instead may reflect between individual differences. We now add this issue to Methods (L471-476): “This approach enabled us to control for spatial and temporal heterogeneity and specificity in escape behaviour of birds (e.g. species-specific responses, changes in escape distances with the progress in the breeding season, spatial and temporal variation in compliance with government restrictions or particularities of the given day and hour)....”

      2) Looking at the figures I do not see the immense stochasticity (L156, Fig. S3, S5) claimed by the authors. Instead, I can see that some species showed an obvious behavioral change during the shutdown. For instance, Motacilla alba, Larus ridibundus, or Passer domesticus clearly reduced their escape distances, while others like the Dendrocopos major, Passer montanus, or Turdus merula tended to increase it.

      At L138-141 and 327-329 we discussed the within and between genera and cross-country variation and stochasticity in response to the shutdowns (Fig. 2). The reference to species-specific plots was perhaps a little bit misleading. We think that the essential figure, that we now reference at this point, is Figure 2 that shows the temporal trends and/or stochasticity that seem to have little in common with lockdowns. Please, also look at Figure 3 and S3-S4. These show that in all selected genera/species, the trends did not significantly deviate from central regression line which indicates no change in FID before and during the COVID-19 shutdowns.

      On the other hand, birds in Poland tended to have larger escape distances during the shutdown for most species, while in Rovaniemi there was an apparent reduction of escape distances in most cases. The multispecies and multisite approach is a strength of this study, but it is an Achilles' heel at the same time. The huge heterogeneity in bird responses among species and sites counterbalanced and as a result, there was an apparent lack of shutdown effects overall. Furthermore, as most data comes from a few (European) species (i.e. Columba, Passer, Parus, Pica, Turdus, Motacilla) I would say that the overall results are heavily influenced (or biased) by them. The authors realize that results are often area- or species-specific (L203), therefore, does a whole approach make sense?

      We are grateful for this valuable comment. We believe the general approach makes sense as there is a general expectation about how birds should respond to changes in human presence. That is why we control for non-independence of data points in our sample. Thus, although lots of data come from a few European species, this is corrected for by the model. Note that given the sheer number of sampled species, some site- or species-specific trends may have occurred by chance. Importantly, we believe that Figure 2, with species-site specific temporal trends, reveals that the between year stochasticity in escape distances seems greater that any effects of lockdowns. Nevertheless, we have further dealt with this issue in the revised manuscript by running country-specific models which again clearly showed no significant effect of Period on escape behaviour of birds (including, no effects in Poland and Finland).

      3) The previous point is worsened by the heterogeneity of cities and periods sampled. For instance:

      3.1) I can hardly imagine any common feature between a small city in northern Finland (Rovaniemi) and a megacity in Australia (Melbourne). Thus, I would not be surprised to find different results between them.

      3.2) Prague baseline data was for 2014 and 2018, while for the rest of the study sites were for 2018 and 2019. If study sites used a different starting point, you cannot compare differences at the final point.

      We are slightly confused by these comments.

      3.1) The cities are expected to be different but (i) the difference may be smaller than imagined (e.g. park structures, managed grass cover, few shrubs and deciduous-dominated tree species) and (ii) we expect the effects of lockdowns to be similar across cities. Whether we have no people in Rovaniemi parks (which despite Rovaniemi’s small size are usually extremely well-visited) or no people in Melbourne parks should not make a difference in principle. Note however, that to avoid overconfident conclusions, we allow for different reaction norms within cities. Please, also note that we are now providing country-specific results which should identify whether shutdowns lead to different reaction in sampled countries. We found no strong effect of shutdowns in any of sampled countries/cities.

      3.2) Because of the possible between site differences at the starting point, we use study site as random intercept and control for the between site reaction norms by including the random slope of the period. In other words, such possible differences do not influence outcomes of our models. Regardless, our a priori expectation is that the human activity levels in a given park was similar prior to covid and hence in 2014, 2018, and 2019. Again, we are now providing country-specific results which identify whether shutdowns led to different reactions in sampled countries, which they mostly did not

      3.3) Due to the obvious seasonal differences between the northern and southern hemispheres, data collection in Australia began five months later than in the rest of the sites (Aug vs Mar 2020). There, urban birds faced already too many months of reduced human disturbances, while European birds were sampled just at the beginning of the lockdown.

      We agree that each city or even park within the city has its specific environmental conditions (here including the time point of lockdown). That is why we control for city and park location in the random structure of the model (see Method section). We now add results per country that shows no clear differences (e.g. Fig. 1).

      However, the aim of our study was to test for general, global effects of lockdowns, which are minimal. Note that we now specifically test for country-specific effects in separate models on each country (e.g. Fig. 1, Fig S6) but all country-specific effects are small and still centre around zero.

      3.4) Some cities were sampled by a single observer, while others by many of them. Even if all of them are skilled birders, they represent different observers from a statistical point of view and consequently, observer identity was an extra source of noise in your data that you did not account for.

      We agree. In Finland and Hungary, data were collected by two closely cooperating observers. In Poland, all data were collected by a single observer. In the Czech Republic and Australia, a single observer (P.M. and M.W., respectively) sampled 46 sites out of 56 and 32 sites out of 37, respectively. Each site was sampled by the same observer both before and during the shutdowns. We now clearly state it in the Methods (L352-356). In other words, our models already largely control for the possible observer confound by having site as a random intercept. Moreover, previous study showed that FID estimates do not vary significantly between trained observers (Guay et al. 2013, Wildlife Research, 40, 289-293).

      4) Although I liked the stringency index as a variable, I am not sure if it captured effectively the actual human activity every day. Even if restrictive measures were similar between countries, their actual accomplishment greatly depended on people's commitment and authorities' control and sanctions. I would suggest using a more realistic measure of human activity, such as google mobility reports.

      Thank you for this comment. We now validate the use of the stringency index with the Google Mobility Reports, showing that human mobility generally (albeit in some countries relatively weakly) decreases with the strength of governmental antipandemic measures. Please, note that our main research question is related to the general change in human outdoor activity and not to week-to-week, day-to-day or hour-to-hour changes captured by stringency index, Google Mobility or the number of humans during an escape trial data. Nevertheless, using Google Mobility and the number of humans as predictors led to the similar results as for stringency index and Period (Fig. 1 and S6). Please, see extended discussion on this topic in our manuscript (L270-292).

      5) The authors used escape trials from birds on the ground and perched birds. I think that they are not comparable, as birds on the ground probably perceive a greater risk than those placed some meters above the ground, i.e. I would expect shorter escape distances for perched birds. As this can be strongly dependent on the species preferences or sampling site (i.e, more or less available perches), I wonder how this mixture of observations from birds on the ground and perched birds could be affecting the results.

      We now added information that most birds were sampled when on the ground (79%). Importantly, previous studies have found that perch height has a minimum effect on FIDs (e.g. Bjørvik et al. 2015. J Ornithol 156:239–246; Kalb et al. 2019, Ethology 125:430-438; Ncube & Tarakini 2022, Afr J Ecol 60:533– 543; Sreekar et al. 2015,. Tropic Conserv Sci 8:505-512). We added this information to the Method section (L394-395).

      6) The authors did not sample the same location in the same breeding season to avoid repeated sampling of the same individuals (L331). This precaution may help, but it does not guarantee a lack of pseudoreplication. Birds are highly mobile organisms and the same individuals may be found in different places in the same city. This pseudoreplication seems particularly plausible for Rovaniemi, where sampling points must be necessarily close due to the modest size of this city.

      We appreciate your concern. We cannot fully exclude the possibility of sampling some individuals twice. However, we sampled during the breeding season within which most birds are territorial, active in the areas around the nests and hence an individual switching parks is unlikely. Also, most sampled birds in our study are passerines which have small territories (typically few hundred square meters). Some larger birds may have larger territories and move larger distance to forage (e.g. kestrels which often forage outside cities) but these birds represent a minority of our records and we have not sampled outside the cities.

      7) An intriguing result was that the authors collected data for 135 species during the shutdown, while they collected data only for 68 species before the pandemic. Such a two-fold increase in bird richness would not be expected with a 36% increase in sampling effort during 2020-21. I wonder if this could be reflecting an actual increase in bird richness in urban areas as a positive result of the shutdown and reduced human presence.

      There were 141 unique day-years during before COVID and 161 during COVID. So, the sampling effort as calculated by days does not explain the difference in species numbers. Whether the actual effort, which was 381 vs 463 h of sampling, explains the difference is unclear, which we now note in the Methods (L476-483). If not, your proposition is possible, but we would like to avoid any speculations on this topic in the manuscript as it is difficult to infer species diversity from FID sampling.

      8) The authors dismissed the multicollinearity problem of explanatory variables unjustifiably (L383). However, looking at fig. S1, I can see strong correlations between some of them. For instance, period and stringency index were virtually identical (r=0.95), while temperature and date were also strongly correlated.

      We are confused by this comment and think this reflects a misunderstanding. Period and stringency index are explanatory variables of interest that were never included in the same model and hence their correlation does not contribute to the within a model multicollinearity. To avoid further confusion, we note this within (Fig. S2) legend. However, we must be cautious when interpreting the results from the models on period, Google Mobility, # of humans and stringency index, as the four measure are similar.

      We discuss multicollinearity of explanatory variables within the manuscript (L458-538, 548-550) and noted that, with the exception of temperature and day within the breeding season (r = 0.48), the correlations among explanatory variables were minimal. We thus used only temperature as an explanatory variable (i.e. fixed factor; also because temperature reflects both season and variation in temperature across a season) whereas the day was included as a random intercept to control for pseudoreplication within day. Collinearity between all other predictors was low (|r| <0.36).

      9) The random structure of the models is a key element of the statistical analyses but those random factors are poorly explained and justified. I needed to look up the supplementary tables to fully understand the complex architecture of the random part of the models. To the best of my knowledge, random variables aim to account for undesirable correlations in the covariance matrix, which is expected in hierarchical designs, such as the present one. However, the theoretical violation of data independence may happen or not. As the random structure is usually of little interest, you should keep it as simple as necessary, otherwise random factors may be catching part of data variability that you would like to explain by fixed variables. I think that this is what is happening (at least, in part) here, as the authors included a too-complex random structure. For instance, if you include the year as a random factor, I think that you are leaving little room for the period effect. The authors simplified the random structure of the models (L387), but they did not explain how. Nevertheless, this model selection was not important at all, as the authors showed the results for several models. I assume, consequently, that the authors are considering all these models equally valid. This approach seems quite contradictory.

      The random structure of the model controls for possible pseudoreplication in the data, that is for the cases where we have multiple data points that may not be independent and hence technically represent one. Apart from that, random structure tells us about where the variance in the data lies. This is often of interest and your previous questions about city, site or species specificities can be answered with the random part of the model. To follow up on your example, year is included in the model because data from a single year are not independent (for example because of delayed breeding season in one year vs. in another).

      We regret being unclear about the model specification and have attempted to clarify the methods (L466-476). We first specified a model with an ideal random structure that necessarily was complex (perhaps too complex). We then showed that using models with simpler random structures did not influence the outcomes. We now use a simpler model within the main text, but do keep the alternative models to show that the results are not dependent on the random structure of the model (Fig. S1 and Table S2).

      Reviewer #3 (Public Review):

      This study examined the changes in fear response, as measured by the flight initiation distances (FID), of birds living in urban areas. The authors examined the FIDs of birds during the pandemic (COVID-19 lockdown restrictions) compared to FIDs measured before the pandemic (mostly in 2018 & 2019). The main study justification was that human presence changed drastically during the pandemic lockdowns and the change in human presence might have influenced the fear response of birds as a result of changing the "landscape of fear". Human presence was quantified using a 'stringency' index (government-mandated restrictions). Urban areas were selected from within five different cities, which included four European cities (Czech Republic - Prague, Finland - Rovaniemi, Hungary - Budapest, Poland - Poznan), and one city in the global south (Australia - Melbourne). Using 6369 flight initiation distances across 147 different bird species, the authors found that FIDs were not significantly different before the pandemic versus during the pandemic, nor was the variation in FID explained by the level of 'stringency'.

      Major strengths: There are several strengths to this study that allows for understanding the variety of factors that influence a bird's response to fear (measured as flight initiation distances). This study also demonstrates that FIDs are highly variable between species and regions.

      Specifically,

      1) One of the major strengths of this paper is the focus on birds living in urban areas, a habitat type that is hypothesized to have changed drastically in the 'landscape of fear' experienced by animals during the pandemic lockdown restrictions (due to the presumed decrease in human presence and densities). Maintaining the focus on urban birds allowed for a deeper examination of the effect of human behaviour changes on bird behaviour in urban habitats, which are at the interface of human-wildlife interactions.

      2) This study accounted for several variables that are predicted to influence flight initiation distances in birds including species, genus, region (country), variability between years, pandemic year (pre- versus during), the strictness of government-mandated lockdown measures, and ecological factors such as the human observer starting distance, flock size, species-specific body size, ambient air temperature (also a proxy of the timing during the breeding season), time of day, date of data collection (timing within the regional [Europe or Australia] breeding season), and categorization of urban site type (e.g. park, cemetery, city centre).

      3) This study examined FIDs in two years previous to the pandemic (mostly 2018 and 2019, one site was 2014) which would account for some of the within- and between-year FID variation exhibited prior to the pandemic.

      4) This study uses strong statistical approaches (mixed effect models) which allows for repeat sampling, and a post hoc analysis testing for a phylogenetic signal.

      Thank you for your supportive and positive comments.

      Major weaknesses: The authors used government 'stringency' as a proxy for human presence and densities, however, this may not have been an accurate measure of actual human presence at the study sites and during measurements of FIDs. Furthermore, although the authors accounted for many factors that are predicted to influence fear response and FIDs in birds, there are several other factors that may have contributed to the high level of variation and patterns in FIDS observed during this study, thus resulting in the authors' conclusion that FIDs did not vary between pre- and during pandemic years.

      Thank you for your suggestions. We agree. To capture the general human presence in parks, we now incorporated an analysis using Google Mobility Reports (Fig S6b) that directly measures human mobility in each of sampled cities and specifically in urban parks where most our data were collected, and also address your further concerns that you detail below. Albeit not the main interest of our study, we now also incorporated an analysis using actual # of humans during an escape trial (Fig. S6c).

      Moreover, we think that including further possible confounds should not influence our conclusions. In other words, including further confounds will decrease the variance that can be explained by shutdowns and thus such shutdown effects (if any) would be tiny and hence likely not biologically meaningful.

      Specifically,

      1) The authors used "government stringency" as a measure of change in human activity, which makes the assumption that the higher the level of 'stringency', the fewer humans in urban areas where birds are living. However, the association between "stringency" and actual human presence at the study sites was not measured, nor was 'stringency' compared to other measures of human presence such as human mobility.

      Thank you for this essential comment. Initially, we viewed Oxford Stringency Index as the best available index for our purposes. However, we now further acknowledge its limitations (L) and validate the Oxford Stringency Index with the Google Mobility Reports data, showing that both indices are generally negatively (albeit sometimes weakly) correlated across sampled cities (i.e. human mobility decreases with the increasing stringency index). Although other human presence indices were used in the past, e.g. Cuebiq, Descartes Labs and Maryland Uni index, Apple (see Noi et al. 2022, Int J Geograph Info Sci, 36, 585-616), we used only the Google Mobility index because (a) it is publicly available, (b) is available also for territories outside US, and (c) provides data for urban parks within each city included in our dataset. Note however that Google Mobility data are inappropriate to answer our primary question, i.e. whether changes in human presence outdoors due to the COVID-19 shutdowns had any effect on avian tolerance towards humans. First, Google Mobility was available only for 2020-22, i.e. the baseline pre-COVID-19 data for 2018-2019 were unavailable. Thus, there was no way to check whether the human activity levels really changed during the COVID-19 years. Second, Google Mobility data are calculated as a change from 2020 January–February baseline for each day of the week for each city and its location (here we used parks). In other words, the data are not comparable between days and cities, albeit we attempted to correct for this within the random structure of the mixed model. Also, the data may be influenced by extreme events within the 2020 Jan–Feb baseline period (see here). Third, the Google Mobility varies greatly between days and across season (see Fig 4 & S5 or the first figure in these responses), likely more than the possible change due to shutdowns. Nevertheless, we found that results based on Google Mobility are qualitatively very similar to results based on stringency index. Moreover, we showed that the relationships between # of humans and both Google Mobility or Stringency index (Figure 6) are weak and noise with 95%CIs widely overlapping zero (Table S3b-e). Also, similarly to other predictors of human presence, # of humans only poorly predicted changes in avian escape distances. We added details on the new analysis into the Methods and Results and Supplement (L134-165 and associated figures and tables, L415-535).

      2) There was considerable variation in FID measurements, which can be seen in the figures, indicating that most of the variation in FID was not accounted for in the authors' models.

      We are confused by this statement. The fact that the FIDs varied does not translate directly to that our models did not account for the variation. Nevertheless, we do control for most of the discussed confounds (see further answers below). Importantly, it is unclear how including further possible confounds should influence our conclusions, unless the lockdowns effects are tiny, in which case those might not be biologically meaningful.

      Factors that may have contributed to variation in FIDs that were not accounted for in this study are as follows:

      a. The authors accounted for the date of data collection using the 'day' since the start of the general region's breeding season (Europe: Day 1 = 1 April; Australia: Day 1 = 15 August). Using 'day' since the breeding season started probably was an attempt to quantify the effect of the breeding stage (e.g. territory establishment, nest young, fledgling) on FIDs. However, breeding stages vary both within- and between species, as well as between sub-regions (e.g. Finland vs. Hungary). As different species respond to predation or human presence differently depending on the stage during their breeding cycle, more specificity in the breeding cycle stage may allow for explaining the observed variation and patterns in FID.

      We agree. Although we don’t have a precise city-specific information on the timing of breeding stages in sampled populations of birds, we partly control for these effects by including a random intercept of day within each year and species. This random factor explained relatively high portion of the variance in our data (see Table S1 and S2) - perhaps something you expected.

      b. Variation in species-specific FIDs may also vary with habitat features within urban sites, such as the proximity of trees and other protective structures (e.g. perches and cover), the openness of the area, and the level of stressors present (e.g. noise pollution, distance to roads). Perhaps accounting for this habitat heterogeneity would account for the FID variation measured in this study.

      We agree. We don’t have such fine-scale data, but we included site identity (typically within a particular park or cemetery) which should account for the habitat heterogeneity among localities. Depending on the model, site explained relatively little variance (1-6%), indicating low heterogeneity between localities in these undescribed characteristics. Also note that park structure may be quite similar both within and between cities, i.e. managed green grass areas, with only a few shrubs and deciduous trees. Therefore, the possible minor habitat heterogeneity should not have any great impacts on our results.

      c. The authors accounted for species and genus within their models, however, FIDs may vary with other species-specific (or even specific populations of a species) characteristics such as whether the species/population is neophobic versus neophilic, precocial versus altricial, and the level of behavioural plasticity exhibited. These variables were not accounted for in the analysis.

      We agree that FIDs can be correlated with many possible factors. Here, we were interested in general patterns, while controlling for FID differences between species, as well as for possible species-specific reaction norms to lockdowns. Whether neophobic vs neophilic population or precocial versus altricial species react differently to lockdowns might be of interest, but it is beyond the scope of this study. However, that population and population specific reaction norms explain little variation (Table S2a, 0-6% of variation) so such a confound should not substantially influence our conclusion much. We do not have fine-scale data on the level of neophobia, but the effects of lockdowns seem similar for precocial (see Anas, Larus, Cygnus) and altricial (the remaining, mostly passerine) species in our dataset (see Fig. 3 and S3-S4). Please, note that we sampled mainly adults (L386). Moreover, the effects for clades, which may differ in their cognitive skills, are also similar (e.g. Corvids vs. Anas or Cygnus; Fig. 3).

      d. Three different methods of measuring the distances between flight and the observer location were used, and FIDs were only measured once per bird, such that there were no measures of repeatability for a test subject. Thus, variation surrounding the measurement of FIDs would have contributed to the variation in FIDs seen during this study.

      While all observers were trained, the three methods may add some noise to the FID estimates. However, the FID estimates from a single method may still slightly differ between observers (so do well standardized morphology measurements; Wang, et al. 2019, PLoS Biology, 17, e3000156). Importantly, FID estimates are highly replicable among skilled observers (Guay et al. 2013, Wildlife Research 40:289-293), and we previously validated this approach and showed that distance measured by counting steps did not differ from distance measured by a rangefinder (Mikula 2014, Ardea 102:53-60), which we now explicitly state (L391-394). Importantly, we control for observer bias by specifying locality as a random intercept (see further details in our response to the Editor). Moreover, each site was sampled by the same observer both before and during the shutdowns.

      3) The sample design of this study may have influenced the FID variability associated with specific species, and specific populations of species. A different number of species were sampled across the time periods of interest; 68 species were sampled before the pandemic versus 135 species after the pandemic. However, the authors do not appear to have directly compared the FIDs for the same species before the pandemic compared to during the pandemic (e.g. the FIDs of Eurasian blackbirds before the pandemic versus during the pandemic). Furthermore, within the same country-city, it is unclear whether the species observed before the pandemic were observed at the same location (e.g. same habitat type such as the same park) during the pandemic. As a species' FID response may be influenced by population characteristics and features specific to each site (e.g. habitat openness), these factors may have influenced the variability in FID measurements in this study.

      We regret being unclear in our methods. Our full model uses all data, but alternative models (see e.g. Fig. S1) used data with ≥5 as well as ≥10 observations before and during lockdowns for a given species. Importantly, Figure 2 and 3 depict data for species sampled at specific sites. We now clarify this within the Methods (L460-483) and the Results (L125-133 and associated figures) and in the figure legends (Fig. S1).

      4) The models in this study accounted for many factors predicted to affect FIDs (see the section on major strengths), however, the number of fixed and random factors are large in number compared to the total sample size (N =6369), such that models may have been over-extended.

      The number of predictors and random effects is well within the limits for the given sample size (Korner-Nievergelt et al. 2015. Bayesian Data Analysis in Ecology Using Linear Models with R, BUGS, and Stan). Importantly, simpler models give similar results as the more complex ones (Fig. S1) and the visual (model free) representations of our raw and aggregated data confirm our model results. This, we suggest, makes our findings robust and convincing.

      Overarching main conclusion

      Overall, this study examines factors influencing FIDs in a variety of bird species and concludes that FIDs did not differ during the pandemic lockdowns compared to before the pandemic (2019 and earlier). Furthermore, FIDs were not influenced by the strictness of government-mandated restrictions. Although the authors accounted for many factors influencing the measurement of FIDs in birds, the authors did not achieve their aim of disentangling the effects of pandemic-specific ecological effects from ecological effects unrelated to the pandemic (such as habitat heterogeneity).

      We find this statement confusing. We accounted for most relevant confounding factors and found little evidence for the strong effect of pandemic. Moreover, we now added country-specific analyses that confirm the lack of evidence, highlight the Figure 3 that shows no clear shutdown effect and also explore how levels of human presence changed over and within the years. Adding more possible confounds (albeit note that not many are left to add) might only further reduce the variation that could be explained by pandemic and hence such hypothetical effects of pandemic will be if anything small and thus likely not biologically meaningful.

      Their findings indicate that FIDs are highly variable both within- and between- species, but do not strongly support the conclusion that FIDs did not change in urban species during the pandemic lockdown. Therefore, this study is of limited impact on our understanding of how a drastic change in human behaviour may impact bird behaviour in urban habitats.

      It is unclear why you think our study lacks support for the conclusion that FIDs changed little during pandemic, if all results show no such effects. However, we toned down our Discussion and highlighted also potential issues linked to our approach (e.g. that sampled individuals were not marked and hence we cannot distinguish between various mechanisms that might explain the described pattern (L293-329) or that human presence may not have changed (L253-269). For further details see our previous response.

      Overall, the study demonstrates the challenges in using FIDs as a general fear response in birds, even during a pandemic lockdown when fewer humans are presumably present, and this study illustrates the large degree of variation in FIDs in response to a human observer.

      We appreciate and agree that our study demonstrates the challenges in quantifying human activity to understand bird escape distance and we added a paragraph on this topic to the discussion (L270-292).

      Nevertheless, we hope that our above responses clarify and address most of the issues you had with our manuscript. We tried to show that (a) most of your proposed controls are indeed included in our study design, models, and visualisations, and that (b) multiple evidence (from models and visualisation of raw and aggregated data) support the no overall effect conclusion. We further emphasize the temporal and between- and within-species variability in FIDs in the Results and now specifically indicate that lockdowns did not influenced FIDs above such variability (Fig. 2-3, Fig. S3). In other words, the natural (e.g. temporal) variation in FIDs seems far greater that potential effects of lockdowns (Fig. 2). We believe that even if lockdowns would have tiny effects that could have been detected with more. stringent experimental design (e.g. individually tagged birds) or even more complex models, such effects would be far from being biologically meaningful.

    2. Reviewer #1 (Public Review):

      This paper uses a series of flight initiation "challenges" conducted both prior to and during COVID-19-related restrictions on human movement to estimate the degree to which avian escape responses to humans changed during the "anthropause". This technique is suitable for understanding avian behavioral responses with a high degree of repeatability. The study collects an impressive dataset over multiple years across five cities on two continents. Overall the study finds no effect of lockdown on avian escape distance (the distance at which the "target" individual flees the approaching observer). The study considers the variable of interest as both binary (during lockdown or prior to lockdown) and continuous, using the Oxford Stringency Index (with neither apparently affecting escape distance).

      Overall this paper presents interesting results which may suggest that behavioral responses to humans are rather inflexible over "short" (~2 year) timespans. The anthropause represents a unique opportunity to disentangle the mechanistic drivers of myriad hypothesized impacts humans have on the behavior, distribution, and abundance of animals. Indeed, this finding would provide important context to the larger body of literature aimed at these ends. However, the paper could do more to carefully fit this finding into the broader literature and, in so doing, be a bit more careful about the conclusions they are able to draw given the study design and the measures used. Taking some of these points (in no particular order):

      1) Oxford Stringency Index is a useful measure of governmental responses to the pandemic and it's true that in some scenarios (including the (Geng et al. 2021) study cited by this paper) it can correlate with human mobility. However, it is far from a direct measure of human mobility (even in the Geng study, to my reading, the index only explained a minority of the variation). Moreover, particular sub-components of the index are wholly unrelated to human mobility (e.g., would changes to a country's public information campaign lead to concomitant changes in urban human mobility?). Finally, compliance with government restrictions can vary geographically and over time (i.e., we might expect lower compliance in 2021 than in 2020) and the index is calculated at the scale of entire countries and may not be very reflective of local conditions. Overall this paper could do more to address the potential shortcomings of the Oxford Stringency Index as a measure of human mobility including attempting to validate the effect on human mobility using other datasets (e.g., the google dataset and/or those discussed in (Noi et al. 2022). This is of critical importance since the fundamental logic of the experimental design relies on the assumption that stringency ~ mobility.

      2) The interpretation of the primary finding (that behavioral responses to humans are inflexible) could use a bit more contextualization within the literature. Specifically, the study offers three potential explanations for the observed invariance in escape response: 1) these behaviors are consistent within individuals and this study provides evidence that there was no population turnover as a result of lockdowns; 2) escape response is linked to other urban adaptations such that to be an urban-dwelling species dictates escape response; and/or 3) these populations already exhibit maximum habituation and the reduction in human mobility would only have increased that habituation but that trait is already at a boundary condition. Some comments on each of these respectively:

      a) Even had these populations turned over as a result of a massive rural-to-urban dispersal event, it's not clear that the escape distance in those individuals would be different because this paper does not establish that these hypothetical rural birds have a different behavioral response which would be constant following dispersal. Thus the evidence gathered here is insufficient to tell us about possible relocations of the focal species. Additionally, the paper cites several papers that found no changes in abundance or movements of animals in response to lockdowns but ignore others that do. For example: (Wilmers et al. 2021), (Warrington et al. 2022) (though this may have been published after this was submitted...), and (Schrimpf et al. 2021). There is a missed opportunity to consider the drivers of some of these results - the findings in this paper are interesting in light of studies that *did* observe changes in space use or abundance - i.e., changes in space use could arise precisely *because* responses to humans are non-plastic but the distribution and activities of humans changed. To wit, the primary finding here would imply that the reaction norm to human presence is apparently fixed over such timescales - however, and critically, the putative reduction in human activity/mobility combined with fixed responses at the individual level might then imply changes in avian abundance/movement/etc.

      b) If this were the case, wouldn't this be then measurable as a function of some measure of urbanity (e.g. Human Footprint Index) that varies across the cities included here? Site accounted for ~15% of the total variation in escape distance but was treated as a random effect - perhaps controlling for the nature of the urban environment using some e.g., remotely sensed variable would provide additional context here.

      c) Because it's not clear the extent to which the populations tested had turned over between years, the paper could do with a bit more caution in interpreting these results as behavioral. This study spans several years so any response (or non-response) is not necessarily a measure of behavioral change because the sample at each time point could (likely does) represent different individuals. In fact, there may be an opportunity here to leverage the one site where pre-pandemic measures were taken several years prior to the pandemic. How much variance in the change in escape distance is observed when the gap between time points far exceeds the lifetime of the focal taxa versus measures taken close in time?

      d) Finally, I think there are a few other potential explanations not sufficiently accounted for here:

      i) These behaviors might indeed be plastic, but not over the timescales observed here.<br /> ii) Time of year - this study took place during the breeding season. The focal behavior here varies with the time of year, for example, escape distance for many of these species could be tied up in nest defense behaviors, tradeoffs between self-preservation and e.g., nest provisioning, etc.<br /> iii) Escape behaviors from humans are adaptively evolved, strongly heritable, and not context dependent - thus we would only expect these behaviors to change on evolutionary timescales.<br /> iv) See point one above - it's possible that the lockdown didn't modify human activity sufficiently to trigger a behavioral response or that the reaction norm to human behavior is non-linear (e.g. a threshold effect).

      LITERATURE CITED<br /> Geng DC, Innes J, Wu W, Wang G. 2021. Impacts of COVID-19 pandemic on urban park visitation: a global analysis. J For Res 32:553-567. doi:10.1007/s11676-020-01249-w

      Noi E, Rudolph A, Dodge S. 2022. Assessing COVID-induced changes in spatiotemporal structure of mobility in the United States in 2020: a multi-source analytical framework. Int J Geogr Inf Sci.

      Schrimpf MB, Des Brisay PG, Johnston A, Smith AC, Sánchez-Jasso J, Robinson BG, Warrington MH, Mahony NA, Horn AG, Strimas-Mackey M, Fahrig L, Koper N. 2021. Reduced human activity during COVID-19 alters avian land use across North America. Sci Adv 7:eabf5073. doi:10.1126/sciadv.abf5073

      Warrington MH, Schrimpf MB, Des Brisay P, Taylor ME, Koper N. 2022. Avian behaviour changes in response to human activity during the COVID-19 lockdown in the United Kingdom. Proc Biol Sci 289:20212740. doi:10.1098/rspb.2021.2740

      Wilmers CC, Nisi AC, Ranc N. 2021. COVID-19 suppression of human mobility releases mountain lions from a landscape of fear. Curr Biol 31:3952-3955.e3. doi:10.1016/j.cub.2021.06.050

    1. Reviewer #2 (Public Review):

      The paper "Polymerization cycle of actin homolog MreB from a Gram-positive bacterium" by Mao et al. provides the second biochemical study of a gram-positive MreB, but importantly, the first study examines how gram-positive MreB filaments bind to membranes. They also show the first crystal structure of a MreB from a Gram-positive bacterium - in two nucleotide-bound forms, finally solving structures that have been missing for too long. They also elucidate what residues in Geobacillus MreB are required for membrane associations. Also, the QCM-D approach to monitoring MreB membrane associations is a direct and elegant assay.

      While the above findings are novel and important, this paper also makes a series of conclusions that run counter to multiple in vitro studies of MreBs from different organisms and other polymers with the actin fold. Overall, they propose that Geobacillus MreB contains biochemical properties that are quite different than not only the other MreBs examined so far but also eukaryotic actin and every actin homolog that has been characterized in vitro. As the conclusions proposed here would place the biochemical properties of Geobacillus MreB as the sole exception to all other actin fold polymers, further supporting experiments are needed to bolster these contrasting conclusions and their overall model.

      1. (Difference 1) - The predominant concern about the in vitro studies that makes it difficult to evaluate many of their results (much less compare them to other MreB/s and actin homologs) is the use of a highly unconventional polymerization buffer containing 500(!) mM KCL. As has been demonstrated with actin and other polymers, the high KCl concentration used here (500mM) is certain to affect the polymerization equilibria, as increasing salt increases the hydrophobic effect and inhibits salt bridges, and therefore will affect the affinity between monomers and filaments. For example, past work has shown that high salt greatly changes actin polymerization, causing: a decreased critical concentration, increased bundling, and a greatly increased filament stiffness(Kang et al., 2013, 2012). Similarly, with AlfA, increased salt concentrations have been shown to increase the critical concentration, decrease the polymerization kinetics, and inhibit the bundling of AlfA filaments (Polka et al., 2009). A more closely related example comes from the previous observation that increasing salt concentrations increasingly slow the polymerization kinetics of B. subtilis MreB (Mayer and Amann, 2009). Lastly, These high salt concentrations might also change the interactions of MreB(Gs) with the membrane by screening charges and/or increasing the hydrophobic effect.

      Given that 500mM KCl was used throughout this paper, many (if not all) of the key experiments should be repeated in more standard salt concentration (~100mM), similar to those used in most previous in vitro studies of polymers. This would test if the many divergent properties of MreB(Gs) reported here arise from some difference in MreB(Gs) relative to other MreBs (and actin homologs), or if they arise from the 400mM difference in salt concentration between the studies. Critically, it would also allow direct comparisons to be made relative to previous studies of MreB (and other actin homologs) that used much lower salt, thereby allowing them to definitively demonstrate whether MreB(Gs) is indeed an outlier relative to other MreB and actin homologs. I would suggest using 100mM KCL, as historically, all polymerization assays of actin and numerous actin homologs have used 50-100mM KCL: 50mM KCl (for actin in F buffer) or 100mM KCl for multiple prokaryotic actin homologs and MreB (Deng et al., 2016; Ent et al., 2014; Esue et al., 2006, 2005; Garner et al., 2004; Polka et al., 2009; Rivera et al., 2011; Salje et al., 2011) Likewise, similar salt concentrations are standard for tubulin (80 mM K-Pipes) and FtsZ (100 mM KCl or 100mM KAc in HMK100 buffer).

      2. (Difference 2) - One of the most important differences claimed in this paper is that MreB(Gs) filaments are straight, a result that runs counter to the curved T. Maritima and C. crescentus filaments detailed by the Löwe group (Ent et al., 2014; Salje et al., 2011). Importantly, this difference could also arise from the difference in salt concentrations used in each study (500mM here vs. 100mM in the Löwe studies), and thus one cannot currently draw any direct comparisons between the two studies.

      One example of how high salt could be causing differences in filament geometry: high salts are known to greatly increase the bending stiffness of actin filaments, making them more rigid (Kang et al., 2013). Likewise, increasing salt is known to change the rigidity of membranes. As the ability of filaments to A) bend the membrane or B) Deform to the membrane depends on the stiffness of filaments relative to the stiffness of the membrane, the observed difference in the "straight vs. curved" conformation of MreB filaments might simply arise from different salt concentrations.

      Thus, in order to draw several direct comparisons between their findings and those of other MreB orthologs (as done here), the studies of MreB(GS) confirmations on lipids should be repeated at the same buffer conditions as used in the Löwe papers, then allowing them to be directly compared.

      3. (Difference 3) - The next important difference between MreB(Gs) and other MreBs is the claim that MreB polymers do not form in the absence of membranes.

      A) This is surprising relative to other MreBs, as MreBs from 1) T. maritime (multiple studies), E.coli (Nurse and Marians, 2013), and C. crescentus (Ent et al., 2014) have been shown to form polymers in solution (without lipids) with electron microscopy, light scattering, and time-resolved multi-angle light scattering. Notably, the Esue work was able to observe the first phase of polymer formation and a subsequent phase of polymer bundling (Esue et al., 2006) of MreB in solution. 2) Similarly, (Mayer and Amann, 2009) demonstrated B. subtilis MreB forms polymers in the absence of membranes using light scattering.

      B) The results shown in figure 5A also go against this conclusion, as there is only a 2-fold increase in the phosphate release from MreB(Gs) in the presence of membranes relative to the absence of membranes. Thus, if their model is correct, and MreB(Gs) polymers form only on membranes, this would require the unpolymerized MreB monomers to hydrolyze ATP at 1/2 the rate of MreB in filaments. This high relative rate of hydrolysis of monomers compared to filaments is unprecedented. For all polymers examined so far, the rate of monomer hydrolysis is several orders of magnitude less than that of the filament. For example, actin monomers are known to hydrolyze ATP 430,000X slower than the monomers inside filaments (Blanchoin and Pollard, 2002; Rould et al., 2006).

      C) Thus, there is a strong possibility that MreB(Gs) polymers are indeed forming in solution in addition to those on the membrane, and these "solution polymers" may not be captured by their electron microscopy assay. For example, high salt could be interfering with the absorption of filaments to glow discharged lacking lipids.<br /> In order to definitively prove that MreB(Gs) does not have polymers in solution, the authors should:

      i) conduct orthogonal experiments to test for polymers in solution. The simplest test of polymerization might be conducting pelleting assays of MreB(Gs) with and without lipids, sweeping through the concentration range as done in 2B and 5a.

      ii) They also could examine if they see MreB filaments in the absence of lipids at 100mM salt (as was seen in both Löwe studies), as the high salt used here might block the charges on glow discharged grids, making it difficult for the polymer to adhere.

      iii) Likewise, the claim that MreB lacking the amino-terminus and the α2β7 hydrophobic loop "is required for polymerization" is questionable as if deleting these resides blocks membrane binding, the lack of polymers on the membrane on the grid is not unexpected, as these filaments that cannot bind the membrane would not be observable. Given these mutants cannot bind the membrane, mutant polymers could still indeed exist in solution, and thus pelleting assays should be used to test if non-membrane associated filaments composed of these mutants do or do not exist.

      A final note, the results shown in "Figure 1 - figure supplement 2, panel C" appear to directly refute the claim that MreB(Gs) requires lipids to polymerize. As currently written, it appears they can observe MreB(Gs) filaments on EM grids without lipids. If these experiments were done in the presence of lipids, the figure legend should be updated to indicate that. If these experiments were done in the absence of lipids, the claim that membrane association is required for MreB polymerizations should be revised.

      4. (Difference 4) - The next difference between this study and previous studies of MreB and actin homologs is the conclusion that MreB(Gs) must hydrolyze ATP in order to polymerize. This conclusion is surprising, given the fact that both T. Maritima (Salje · 2011, Bean 2008) and B. subtilis MreB (Mayer 2009) have been shown to polymerize in the presence of ATP as well as AMP-PNP. Likewise, MreB polymerization has been shown to lag ATP hydrolysis in not only T. maritima MreB (Esue 2005), eukaryotic actin, and all other prokaryotic actin homologs whose polymerization and phosphate release have been directly compared: MamK (Deng et al., 2016), AlfA (Polka et al., 2009), and two divergent ParM homologs (Garner et al., 2004; Rivera et al., 2011).

      Currently, the only piece of evidence supporting the idea that MreB(Gs) must hydrolyze ATP in order to polymerize comes from 2 observations: 1) using electron microscopy, they cannot see filaments of MreB(Gs) on membranes in the presence of AMP-PNP or ApCpp, and 2) no appreciable signal increase appears testing AMPPNP- MreB(Gs) using QCM-D. This evidence is by no means conclusive enough to support this bold claim: While their competition experiment does indicate AMPPNP binds to MreB(Gs), it is possible that MreB(Gs) cannot polymerize when bound to AMPPNP. For example, it has been shown that different actin homologs respond differently to different non-hydrolysable analogs: Some, like actin, can hydrolyze one ATP analog but not the other, while others are able to bind to many different ATP analogs but only polymerize with some of one of them. Thus, to further verify their "hydrolysis is needed for polymerization" conclusion, they should:<br /> A. Test if a hydrolysis deficient MreB(Gs) mutant (such as D158A) is also unable to polymerize by EM.<br /> B. They also should conduct an orthogonal assay of MreB polymerization aside from EM (pelleting assays might be the easiest). They should test if polymers of ATP, AMP-PNP, and MreB(Gs)(D158A) form in solution (without membranes) by conducting pelleting assays. These could also be conducted with and without lipids, thereby also addressing the points noted above in point 3.<br /> C. Polymers may indeed form with ATP-gamma-S, and this non-hydrolysable ATP analog should be tested.<br /> D. They could also test how the ADP-Phosphate bound MreB(Gs) polymerizes in bulk and on membranes, using beryllium phosphate to trap MreB in the ADP-Pi state. This might allow them to further refine their model.<br /> E. Importantly, the Mayer study of B. subtilis MreB found the same results in regard to nucleotides, "In polymerization buffer, MreB produced phosphate in the presence of ATP and GTP, but not in ADP, AMP, GDP or AMP-PNP, or without the readdition of any nucleotide". Thus this paper should be referenced and discussed

      5. (Difference 5) - The introduction states (lines 128-130) "However, the need for nucleotide binding and hydrolysis in polymerization remains unclear due to conflicting results, in vivo and in vitro, including the ability of MreB to polymerize or not in the presence of ADP or the non-hydrolyzable ATP analog AMP-PNP."

      A) While this is a great way to introduce the problem, the statement is a bit vague and should be clarified, detaining the conflicting results and appropriate references. For example, what conflicting in vivo results are they referring to? Regarding "MreB polymerization in AMP-PNP", multiple groups have shown the polymerization of MreB(Tm) in the presence of AMP-PNP, but it is not clear what papers found opposing results.

      B) The statement "However, the need for nucleotide binding and hydrolysis in polymerization remains unclear due to conflicting results, in vivo and in vitro, including the ability of MreB to polymerize or not in the presence of ADP or the non-hydrolyzable ATP analog AMP-PNP" is technically incorrect and should be rephrased or further tested.

      i. For all actin (or tubulin) family proteins, it is not that a given filament "cannot polymerize" in the presence of ADP but rather that the ADP-bound form has a higher critical concentration for polymer formation relative to the ATP-bound form. This means that the ADP polymers can indeed polymerize, but only when the total protein exceeds the ADP critical concentration. For example, many actin-family proteins do indeed polymerize in ADP: ADP actin has a 10-fold higher critical concentration than ATP actin, (Pollard, 1984) and the ADP critical concentrations of AlfA and ParM are 5X and 50X fold higher (respectively) than their ATP-bound forms(Garner et al., 2004; Polka et al., 2009)

      ii. Likewise, (Mayer and Amann, 2009) have already demonstrated that B. subtilis MreB can polymerize in the presence of ADP, with a slightly higher critical concentration relative to the ATP-bound form.

      Thus, to prove that MreB(Gs) polymers do not form in the presence of ADP would require one to test a large concentration range of ADP-bound MreB(Gs). They should test if ADP- MreB(Gs) polymerizes at the highest MreB(Gs) concentrations that can be assayed. Even if this fails, it may be the MreB(Gs) ADP polymerizes at higher concentrations than is possible with their protein preps (13uM). An even more simple fix would be to simply state MreB(Gs)-ADP filaments do not form beneath a given MreB(Gs) concentration.

      Other Points to address:

      1. There are several points in this paper where the work by Mayer and Amann is ignored, not cited, or readily dismissed as "hampered by aggregation" without any explanation or supporting evidence of that fact.

      A) Lines 100-101 - While the irregular 3-D formations seen formed by MreB in the Dersch 2020 paper could be interpreted as aggregates, stating that the results from specifically the Gaballah and Meyer papers (and not others) were "hampered by aggregation" is currently an arbitrary statement, with no evidence or backing provided. Overall, these lines (and others in the paper) dismiss these two works without giving any evidence to that point. Thus, they should provide evidence for why they believe all these papers are aggregation, or remove these (and other) dismissive statements.

      One important note - There are 2 points indicating that dismissing the Meyer and Amann work as aggregation is incorrect: 1) the Meyer work on B. subtilis MreB shows both an ATP and a slightly higher ADP critical concentration. As the emergence of a critical concentration is a steady-state phenomenon arising from the association/dissociation of monomers (and a kinetically limiting nucleation barrier), an emergent critical concentration cannot arise from protein aggregation, critical concentrations only arise from a dynamic equilibrium between monomer and polymer. 2) Furthermore, Meyer observed that increased salt slowed and reduced B. subtilis MreB light scattering, the opposite of what one would expect if their "polymerization signal" was only protein aggregation, as higher salts should increase the rate of aggregation by increasing the hydrophobic effect.

      B) Lines 113-137 -The authors reference many different studies of MreB, including both MreB on membranes and MreB polymerized in solution (which formed bundles). However, they again neglect to mention or reference the findings of Meyer and Amann (Mayer and Amann, 2009), as it was dismissed as "aggregation". As B. subtilis is also a gram-positive organism, the Meyer results should be discussed.

      2. Lines 387-391 state the rates of phosphate release relative to past MreB findings: "These rates of Pi release upon ATP hydrolysis (~ 1 Pi/MreB in 6 min at 53{degree sign}C) are comparable to those observed for MreBTm and MreB(Ec) in vitro". While the measurements of Pi release AND ATP hydrolysis have indeed been measured for actin, this statement does not apply to MreB and should be corrected: All MreB papers thus far have only measured Pi release alone, not ATP hydrolysis at the same time. Thus, it is inaccurate to state "rates of Pi release upon ATP hydrolysis" for any MreB study, as to accurately determine the rate of Pi release, one must measure: 1. The rate of polymer over time, 2) the rate of ATP hydrolysis, and 3) the rate of phosphate release. For MreB, no one has, so far, even measured the rates of ATP hydrolysis and phosphate release with the same sample.

      3. The interpretation of the interactions between monomers in the MreB crystal should be more carefully stated to avoid confusion. While likely not their intention, the discussions of the crystal packing contacts of MreB can appear to assume that the monomer-monomer contacts they see in crystals represent the contacts within actual protofilaments. One cannot automatically assume the observations of monomer-monomer contacts within a crystal reflect those that arise in the actual filament (or protofilament).

      A) They state, "the apo form of MreBGs forms less stable protofilaments than its G- homologs ." Given filaments of the Apo form of MreB(GS) or b. subtilis have never been observed in solution, this statement is not accurate: while the contacts in the crystal may change with and without nucleotide, if the protein does not form polymers in solution in the apo state, then there are no "real" apo protofilaments, and any statements about their stability become moot. Thus this statement should be rephrased or appropriately qualified.

      B) Another example: while they may see that in the apo MreB crystal, the loop of domain IB makes a *single* salt bridge with IIA and none with IIB. This contrasts with every actin, MreB, and actin homolog studied so far, where domain IB interacts with IIB. This might reflect the real contacts of MreB(Gs) in the solution, or it may be simply a crystal-packing artifact. Thus, the authors should be careful in their claims, making it clear to the reader that the contacts in the crystal may not necessarily be present in polymerized filaments.

      4. lines 201-202 - "Polymers were only observed at a concentration of MreB above 0.55 μM (0.02 mg/mL)". Given this concentration dependence of filament formation, which appears the same throughout the paper, the authors could state that 0.55 μM is the critical concentration of MreB on membranes under their buffer conditions. Given the lack of critical concentration measurement in most of the MreB literature, this could be an important point to make in the field.

      5. Both mg/ml and uM are used in the text and figures to refer to protein concentration. They should stick to one convention, preferably uM, as is standard in the polymer field.

      6. Lines 77-78 - (Teeffelen et al., 2011) should be referenced as well in regard to cell wall synthesis driving MreB motion.

      7. Line 90 - "Do they exhibit turnover (treadmill) like actin filaments?". This phrase should be modified, as turnover and treadmilling are two very different things. Turnover is the lifetime of monomers in filaments, while treadmilling entails monomer addition at one end and loss at the other. While treadmilling filaments cause turnover, there are also numerous examples of non-treadmilling filaments undergoing turnover: microtubules, intermediate filaments, and ParM. Likewise, an antiparallel filament cannot directionally treadmill, as there is no difference between the two filament ends to confer directional polarity.

      8. Throughout the paper, the term aggregation is used occasionally to describe the polymerization shown in many previous MreB studies, almost all of which very clearly showed "bundled" filaments, very distinct entities from aggregates, as a bundle of polymers cannot form without the filaments first polymerizing on their own. Evidence to this point, polymerization has been shown to precede the bundling of MreB(Tm) by (Esue et al., 2005).

      9. lines 106-108 mention that "The N-terminal amphipathic helix of E. coli MreB (MreBEc) was found to be necessary for membrane binding. " This is not accurate, as Salje observed that one single helix could not cause MreB to mind to the membrane, but rather, multiple amphipathic helices were required for membrane association (Salje et al., 2011). The Salje results imply that dimers (or further assemblies) of MreB drive membrane association, a point that should be discussed in regard to the question "What prompts the assembly of MreB on the inner leaflet of the cytoplasmic membrane?" posed on lines 86-87.

      10. On lines 414-415, it is stated, "The requirement of the membrane for polymerization is consistent with the observation that MreB polymeric assemblies in vivo are membrane-associated only." While I agree with this hypothesis, it must be noted that the presence or absence of MreB polymers in the cytoplasm has not been directly tested, as short filaments in the cytoplasm would diffuse very quickly, requiring very short exposures (<5ms) to resolve them relative to their rate of diffusion. Thus, cytoplasmic polymers might still exist but have not been tested.

      11. lines 429-431 state, "but polymerization in the presence of ADP was in most cases concluded from light scattering experiments alone, so the possibility that aggregation rather than ordered polymerization occurred in the process cannot be excluded."

      A) If an increased light scattering signal is initiated by the addition of ADP (or any nucleotide), that signal must come from polymerization or multimerization. What the authors imply is that there must be some ADP-dependent "aggregation" of MreB, which has not been seen thus far for any polymer. Furthermore, why would the addition of ADP initiate aggregation?

      B) Likewise, the statement "Differences in the purity of the nucleotide stocks used in these studies could also explain some of the discrepancies" is unexplained and confusing. How could an impurity in a nucleotide stock affect the past MreB results, and what is the precedent for this claim?

      12. lines 467-469 state, "Thus, for both MreB and actin, despite hydrolyzing ATP before and after polymerization, respectively, the ADP-Pi-MreB intermediate would be the long-lived intermediate state within the filaments."

      A) For MreB, this statement is extremely speculative and unbiased, as no one has measured 1) polymerization, 2) ATP hydrolysis, and 3) phosphate release. For example, it could be that ATP hydrolysis is slow, while phosphate release is fast, as is seen in the actin from Saccharomyces cerevisiae.

      B) For actin, the statement of hydrolysis of ATP of monomer occurring "before polymerization" is functionally irrelevant, as the rate of ATP hydrolysis of actin monomers is 430,000 times slower than that of actin monomers inside filaments(Blanchoin and Pollard, 2002; Rould et al., 2006).

      13. Lines 442-444. "On the basis of our data and the existing literature, we propose that the requirement for ATP (or GTP) hydrolysis for polymerization may be conserved for most MreBs." Again, this statement both here (and in the prior text) is an extremely bold claim, one that runs contrary to a large amount of past work on not just MreB, but also eukaryotic actin and every actin homolog studied so far. They come to this model based on 1) one piece of suggestive data (the behavior of MreB(GS) bound to 2 non-hydrolysable ATP analogs in 500mM KCL), and 2) the dismissal (throughout the paper) of many peer-reviewed MreB papers that run counter to their model as "aggregation" or "contaminated ATP stocks ." If they want to make this bold claim that their finding invalidates the work of many labs, they must back it up with further validating experiments.

      References cited.

      Blanchoin L, Pollard TD. 2002. Hydrolysis of ATP by Polymerized Actin Depends on the Bound Divalent Cation but Not Profilin †. Biochemistry-us 41:597-602. doi:10.1021/bi011214b

      Deng A, Lin W, Shi N, Wu J, Sun Z, Sun Q, Bai H, Pan Y, Wen T. 2016. In vitro assembly of the bacterial actin protein MamK from 'Candidatus Magnetobacterium casensis' in the phylum Nitrospirae. Protein Cell 7:267-280. doi:10.1007/s13238-016-0253-x

      Dersch S, Reimold C, Stoll J, Breddermann H, Heimerl T, Soufo HJD, Graumann PL. 2020. Polymerization of Bacillus subtilis MreB on a lipid membrane reveals lateral co-polymerization of MreB paralogs and strong effects of cations on filament formation. Bmc Mol Cell Biology 21:76. doi:10.1186/s12860-020-00319-5

      Ent F van den, Izoré T, Bharat TA, Johnson CM, Lowe J. 2014. Bacterial actin MreB forms antiparallel double filaments. eLife 3:e02634. doi:10.7554/elife.02634

      Esue O, Cordero M, Wirtz D, Tseng Y. 2005. The Assembly of MreB, a Prokaryotic Homolog of Actin. J Biol Chem 280:2628-2635. doi:10.1074/jbc.m410298200

      Esue O, Wirtz D, Tseng Y. 2006. GTPase Activity, Structure, and Mechanical Properties of Filaments Assembled from Bacterial Cytoskeleton Protein MreB. J Bacteriol 188:968-976. doi:10.1128/jb.188.3.968-976.2006

      Garner EC, Campbell CS, Mullins RD. 2004. Dynamic instability in a DNA-segregating prokaryotic actin homolog. Science (New York, NY) 306:1021-1025. doi:10.1126/science.1101313

      Kang H, Bradley MJ, Elam WA, De La Cruz EM. 2013. Regulation of Actin by Ion-Linked Equilibria. Biophys J 105:2621-2628. doi:10.1016/j.bpj.2013.10.032

      Kang H, Bradley MJ, McCullough BR, Pierre A, Grintsevich EE, Reisler E, Cruz EMDL. 2012. Identification of cation-binding sites on actin that drive polymerization and modulate bending stiffness. Proc National Acad Sci 109:16923-16927. doi:10.1073/pnas.1211078109

      Mayer JA, Amann KJ. 2009. Assembly properties of the Bacillus subtilis actin, MreB. Cell Motil Cytoskel 66:109-118. doi:10.1002/cm.20332

      Nurse P, Marians KJ. 2013. Purification and Characterization of Escherichia coli MreB Protein. J Biol Chem 288:3469-3475. doi:10.1074/jbc.m112.413708

      Polka JK, Kollman JM, Agard DA, Mullins RD. 2009. The Structure and Assembly Dynamics of Plasmid Actin AlfA Imply a Novel Mechanism of DNA Segregation. J Bacteriol 191:6219-6230. doi:10.1128/jb.00676-09

      Pollard TD. 1984. Polymerization of ADP-actin. J Cell Biology 99:769-777. doi:10.1083/jcb.99.3.769

      Rivera CR, Kollman JM, Polka JK, Agard DA, Mullins RD. 2011. Architecture and assembly of a divergent member of the ParM family of bacterial actin-like proteins. The Journal of biological chemistry 286:14282-14290. doi:10.1074/jbc.m110.203828

      Rould MA, Wan Q, Joel PB, Lowey S, Trybus KM. 2006. Crystal Structures of Expressed Non-polymerizable Monomeric Actin in the ADP and ATP States*. J Biol Chem 281:31909-31919. doi:10.1016/s0021-9258(19)84105-4

      Salje J, van den Ent F, de Boer P, Löwe J. 2011. Direct Membrane Binding by Bacterial Actin MreB. Mol Cell 43:478-487. doi:10.1016/j.molcel.2011.07.008

      Teeffelen S van, Wang S, Furchtgott L, Huang KC, Wingreen NS, Shaevitz JW, Gitai Z. 2011. The bacterial actin MreB rotates, and rotation depends on cell-wall assembly. Proceedings of the National Academy of Sciences of the United States of America 108:15822-15827. doi:10.1073/pnas.1108999108

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1*. This is a good paper dealing with gap of our knowledge in understanding reason of ICB failures. Subject being difficult it is expected that the design and content of such experiment will be complex.But the authors forget practicality of readers attention and making paper apear interesting. They need to organise and may be classify the varied information in such a way that reader can find a rhythm in excavating data more easily. It appears confusing at time, so they may try to make it more simple. In this way they may concentrate more on methods and classify results too. A thorough revision is suggested, to make it consize. *

      __Authors’ answer: __We thank the Reviewer for his positive evaluation and constructive feedback. We appreciate the complexity of single-cell RNA-sequencing analyses. In order to simplify our manuscript, our revised manuscript now focuses on the transitional states of tumor-resident and circulating T cells found in ovarian cancer patients. Our study is timely as it is the first to report the developmental relationship of TILs in ovarian cancer. We substantially edited our manuscript to make it clear that our findings suggest a gradual acquisition of the exhaustion program initiated by effector-like cells (cluster CD8_GZMH) that eventually gives rise to more terminal states with features of tissue residency and chemotaxis (clusters CD8_CCL4, CD8_XCL1, and CD8_CXCL13). We also include new analyses revealing the presence and proportion of these T cell states in different cancer patients (New Fig. 4A-B), and how these T cell states associate with clinical responses to immune checkpoint blockade (ICB). We hope the Reviewer will find our revised manuscript easier to read.

      Reviewer #2. I think the first half of the article, in which the GZMH-CD8 cluster is considered to be in an intermediate state of transition to exhaustion, is interesting, and I feel that the single-cell seq and TCR data are well analyzed to make the point. On the other hand, I feel that the latter part of the paper may not be anything more than a hypothesis. In particular, the part claiming that it is related to prognosis or applicable to the prediction of the effect of ICB is insufficient, since their gene signature is not described in detail and the contents of the Figure are not mentioned in the manuscript. In the latter part, the effects of GPR184 and 25-HC, or the effects of IL21, would require experiments to verify (to verify whether the addition of chemokine or the inhibition of the receptor changes the specific CD8 population).

      Author’s answer: Thank you for discussing the limitation of the signature employed. We agree with the reviewer’s comment. Old Figure 5 has been removed from the revised manuscript.

      Reviewer #2. Minor point: In particular, there is little mention of Figure 5 in the text, making it difficult to understand.

      Author’s answer: Thank you for your comment. As we previously discussed, we have removed Figure 5 from the revised manuscript. The method used to generate the signature was found to be inappropriate.

      Reviewer #2. The latter part is difficult to understand. To begin with, it is already known that ovarian cancer does not contribute much to ICB, so what does it mean to analyze the CD8 population, which is known as a marker of ICB response in other carcinomas, as an indicator? Especially for clinicians like us, it is hard to imagine that the results will lead to clinical trials that will attempt to sort out the population that ICB is favored in.

      Author’s answer: Although immune checkpoint blockade has demonstrated limited effectiveness against ovarian cancer, subset analyses suggest superior efficacy for some patients and according to subtype. Combination anti-PD-1/CTLA-4 therapy for instance achieved response rates up to 31% (Zamarin et al., 2020), and superior benefit for single agent PD-1 blockade has been reported in clear cell ovarian cancer. Moreover, encouraging clinical results have recently been reported in studies exploring combinations with PARP and VEGF inhibitors. As example, interim analysis of the phase 3 DUO-O trial (NCT03737643) showed a statistically significant and clinically meaningful improvement in PFS in patients with newly diagnosed advanced ovarian cancer without a BRCA1/2 mutation (Harter et al., 2023).

      Our study aimed to better understand how ovarian tumor-infiltrating T cells acquire their exhaustion program after migrating from the periphery and whether these mechanisms are unique or shared amongst cancer types. Recent studies in other cancer types had shown the dynamics of T cells and demonstrated the clonal replacement of intratumoral T cells after ICB and emphasized the role of peripheral clones in this process (Wu et al., 2020; Yost et al., 2019). In lung cancer, it has been proposed a transitional state between precursor and terminally differentially cells (Gueguen et al., 2021). Our study demonstrates, for the first time in ovarian cancer, the presence of similar transitional states of CD8 T cells. Our revised manuscript also now includes new data revealing that pre-effector GZMK- and intermediary GZMH-expressing CD8 cells are better biomarkers of ICB response than terminally differentiated XCL1 and CXCL13 expressing CD8 T cells (New Figure 4). Altogether, our study provides important and novel insights on the development of tumor-infiltrating T cells in ovarian cancer patients, which may serve to better select ovarian cancer patients for ICB therapy.

      Reviewer #2. Since the first half of the study is very interesting, we feel that it is more important to confirm the mechanism of exhaustion from the blood via the intermediate (GZMH_CD8), including functional experiments. Also, as a clinician, we are very interested in the perspective of whether some of the fractions identified in this study are different in proportion in different patients and whether they correlate with the clinical course of the disease since the study only analyzed a sample of 5 patients.

      Author’s answer: We thank the reviewer for proposing to extend our analysis. As suggested, our revised manuscript now includes new analyses which reveals the different proportions of our identified T cells states in different cancer patients (New Figure 4). We further investigated whether these T cell states associate with clinical responses and observed that pre-effector GZMK- and intermediary GZMH-expressing CD8 T cells are better biomarkers of ICB response than terminally exhausted XCL1- and CXCL13-expressing CD8 T cells (New Figure 4).

      Reviewer #3. Question 1: Whether the distribution patterns of CD4+ and CD8+ T cell clusters in Figure 1B were comparable among the 5 patient samples? Whether the proportion of five types of clones in Figure 3C are comparable among the 5 patient samples?

      Author’s answer: Thank you for the question. We included the results to answer these questions in the supplementary material (fig. S1C-D). For each patient, we calculated the proportion of a cluster among T cells in the blood or tumor. As observed in the boxplot (fig. S1C), the proportion of some subsets were higher in certain patients, such as the higher proportion of CD8_GZMK in the tumor of patient p09454. A recent study classified patients’ tumors based on the spatial distribution of CD8 T cells and performed scRNA-seq to identified cell subsets enriched in the groups inflamed/infiltrated (characterized by the distribution of CD8 T cells within the tumor epithelium), excluded (infiltrating CD8 T cells are restricted to the tumor stroma) or desert (T cells are not present or have low frequency) (Hornburg et al., 2021). Interestingly, this subset of CD8_GZMK cells were enriched in desert tumors, suggesting that the difference we observed in our dataset might reflect the spatial distribution of CD8 T cells in patient p09454. Regarding the TCR-seq data, the frequency of the five types of clones was different among patients. To show this data, we included a barplot (fig. S2D), showing for example, a higher proportion of tumor-expanded clones in patient p10329.

      Reviewer #3. Question 2: In Figure S2C, only a very small number of cells in the CD8-GZMK K-22 population. Are these cells representative? Do they generally exist in multiple samples or only in one sample?

      Author’s answer: Thank you for your comment. The subcluster k_22 indeed has a smaller number of cells compared to other subclusters. Nevertheless, the K_22 cluster was found in every patient and in every healthy donor. To clarify, we edited our revised manuscript to include a statement that cluster k_22 was composed of fewer cells compared to other clusters.

      Reviewer #3. Question 3: In the Fig.S6 legend, the authors stated "Our results suggest the differentiation of cluster CD8-GZMK into the effector-like subset CD8-GZMH." However, there seems to be no corresponding analysis in the main text to support this conclusion.

      Author’s answer: We appreciate your attention to this statement. We agree the results of our study doesn’t sustain this statement and so we have excluded it in the revised manuscript.

      Reviewer #3____. Question 4: Is there more detailed clinical information that can be provided for the 5 patients included in the study? Per the methods all patients were receiving debulking surgery and were treatment naïve, but did they differ in stage, age, comorbidities, etc.?

      Author’s answer: Thank you for your comment on this. We have included a table with clinical information on the stage, age, and menopause status of the five patients.

      Reviewer #3. Question 5: Were any cells included for sequencing from adjacent 'normal' tissue uninvolved with tumor (these samples are from surgical debulking of primary tumors, which may include such areas of non-involved tissue.) While shared TCR clonotypes between blood and intratumoral T cells strongly suggests the tumor-resident populations are recruited from the blood, the degree of sharing with normal tissue-resident T cells would be of interest as well.

      Author’s answer: Thank you for your comment. Samples were provided for sc-RNA-seq after pathology review and validation of tumor histology. We did not perform sc-RNA-seq on normal adjacent tissue (NAT) We agree this would be interesting as a follow up study, since in other cancer types (renal, colon and lung) it has been demonstrated that T clones expanded in the tumor and NAT are also present in peripheral blood (Wu et al., 2020).

      Reviewer #3. Question 6: Very little is discussed about HGSOC itself in the main text (eg clinical background, prior literature on the composition of infiltrating immune populations and potential reasons for at best modest poor responses to IO) until the first sentence of the discussion. As the entirety of the new data produced in this study is from HGSOC tumors there should be more focus on this tumor type and conversation with the prior literature on it (mainly from prior studies on the immune environment of HGSOC). Further, how distinct do the authors suspect the cell populations found in their study to be to ovarian as opposed to other epithelial tumor types?

      Author’s answer: Thank you for the suggestion. We now included more background information on immunotherapy of HGSOC. Specifically, we added the following paragraph in our introduction: “In ovarian cancer, the presence of both T and B cells improves patients' survival (Nelson, 2015; Nielsen et al, 2012). They are usually organized in lymphoid aggregates ranging from a small group of cells to a well-organized TLS (Kroeger et al, 2016). Organized TLSs correlate with better survival, such as observed in patients treated with ICB. Although immunotherapy has demonstrated limited effectiveness against ovarian cancer, subsets of patients may thus benefit from ICB. In support of this, combination anti-PD-1/CTLA-4 therapy can achieve response rates above 30% (Zamarin et al., 2020), and encouraging clinical results have recently been reported when combining ICB with with PARP and VEGF inhibitors (Harter et al., 2023)”.

      Reviewer #3. Question 7: Were the signature genes used for analysis in figure 5 remove chosen in a formal, unbiased manner, or simply hand-picked as representative of the respective cell types? This information is not provided in the supplement.

      Author’s answer: Another reviewer has also expressed similar concerns. The genes selected to represent cell types were chosen manually, which we acknowledge is not the best method for defining a signature. As a result, we have decided to exclude Figure 5 from the manuscript under review. We believe an unbiased approach is more suitable for characterizing the cell network proposed in our study.

      Reviewer #3. Question 8: While the NicheNet analysis of potential interactions among lymphocyte populations raises some strong hypotheses, it would be interesting to extend the interaction analysis to all CD45+ populations, given the sequencing was done on CD45+ immune cells.

      Author’s answer: Thank you for suggesting analysis. We have included the results of cell interaction including all CD45+ cells (fig. S3). We observed CD40L as one of the top predicted ligands highly expressed in CD4_CXCL13 subset mediating a response in subsets of antigen-presenting cells, such as B cells (cluster B), plasma cells (cluster PC_2), and plasmacytoid dendritic cells (cluster pDC). Interestingly, this result also support the hypothesis of Tfh-like cells (cluster CD4_CXCL13) coordinating the action of intratumoral immune cells involved in the antitumor immune response.

      Reviewer #3. Question 9: A sample size of 5 patients is relatively small for current single cell RNAseq studies of human tumor patients.

      Author’s answer: We agree with the reviewer that a sample size of 5 patients is relatively small. Thus, to validate our results in other patients, we included in the reviewed manuscript the analysis of scRNA-seq of 47 patients across10 cancer types (dataset from (Zheng et al., 2021). As demonstrated in figure 3 and figure 5, we could identify subsets of CD8 and CD4 T cells from our ovarian cancer patients in those 10 cancer types dataset.

      Reviewer #3.____ Minor

      *1. In lines 96-97, "CD8-GZMB" was mentioned twice in the description. *

      2. In line 126, this section did not discuss residency markers, yet a conclusion about residency was made in this sentence.

      Author’s answer: We appreciate you bringing these errors to our attention. We fixed them in the updated version of the manuscript.

      References:

      Gueguen, P., Metoikidou, C., Dupic, T., Lawand, M., Goudot, C., Baulande, S., … Amigorena, S. (2021). Contribution of resident and circulating precursors to tumor-infiltrating CD8 T cell populations in lung cancer. Science Immunology, Vol. 6, p. eabd5778. doi:10.1126/sciimmunol.abd5778

      Harter, P., Trillsch, F., Okamoto, A., Reuss, A., Kim, J.-W., Rubio-Pérez, M. J., … Aghajanian, C. (2023). Durvalumab with paclitaxel/carboplatin (PC) and bevacizumab (bev), followed by maintenance durvalumab, bev, and olaparib in patients (pts) with newly diagnosed advanced ovarian cancer (AOC) without a tumor BRCA1/2 mutation (non-tBRCAm): Results from the randomized, placebo (pbo)-controlled phase III DUO-O trial. Journal of Clinical Orthodontics: JCO, 41(17_suppl), LBA5506–LBA5506.

      Hornburg, M., Desbois, M., Lu, S., Guan, Y., Lo, A. A., Kaufman, S., … Wang, Y. (2021). Single-cell dissection of cellular components and interactions shaping the tumor immune phenotypes in ovarian cancer. Cancer Cell. doi:10.1016/j.ccell.2021.04.004

      Wu, T. D., Madireddi, S., de Almeida, P. E., Banchereau, R., Chen, Y.-J. J., Chitre, A. S., … Grogan, J. L. (2020). Peripheral T cell expansion predicts tumour infiltration and clinical response. Nature. doi:10.1038/s41586-020-2056-8

      Yost, K. E., Satpathy, A. T., Wells, D. K., Qi, Y., Wang, C., Kageyama, R., … Chang, H. Y. (2019). Clonal replacement of tumor-specific T cells following PD-1 blockade. Nature Medicine. doi:10.1038/s41591-019-0522-3

      Zamarin, D., Burger, R. A., Sill, M. W., Powell, D. J., Jr, Lankes, H. A., Feldman, M. D., … Aghajanian, C. (2020). Randomized Phase II Trial of Nivolumab Versus Nivolumab and Ipilimumab for Recurrent or Persistent Ovarian Cancer: An NRG Oncology Study. Journal of Clinical Oncology: Official Journal of the American Society of Clinical Oncology, 38(16), 1814–1823.

      Zheng, L., Qin, S., Si, W., Wang, A., Xing, B., Gao, R., … Zhang, Z. (2021). Pan-cancer single-cell landscape of tumor-infiltrating T cells. Science, 374(6574), abe6474.

  12. inst-fs-iad-prod.inscloudgate.net inst-fs-iad-prod.inscloudgate.net
    1. EN G L I S H G RAMMAR4 .£9 3 }H O W T O T E A C H I T ;D ESIGN ED AS A TEX T-B OOK F OR COMMON SC HO O LS, AN DF OR T H E P R I MARY , IN TERMED IATE, AN D G RAMMARD EPARTMEN TS OF GRAD ED SCHOO LS.H E N RY BOLTWOOD , A.M. ,MAS TE R O F P RI NC E TON H I G H S C HO O L .C H I C A G O

      metadata 3

    1. Reviewer #1 (Public Review):

      This is an interesting paper that shows disruption of thalamocortical communication in anesthesia, and enhancement under 5-MeO-DMT in an animal model, combined with a model to establish that these changes can be understood as a displacement from a critical point of a neural mass model. Overall, these results are exciting as they constitute evidence that very different brain states can be understood as two different points of a continuum of states, with a critical transition point in the middle.

      These are my main detailed comments about this manuscript:

      1. Psychedelic drug dosage: 5 mg/kg is possibly a low dose of 5-MeO-DMT, which exhibits nonlinear pharmacokinetics presenting a transition in drug serum concentration between 2 mg/kg and 10 mg/kg. (Shen, H. W., Jiang, X. L., & Yu, A. M. (2011). Nonlinear pharmacokinetics of 5-methoxy-N, N-dimethyltryptamine in mice. Drug Metabolism and Disposition, 39(7), 1227-1234.)

      2. Novelty of the neural mass approach to establish critical dynamics. The neural mass model is interesting but it is also well established that the features of LFPs during anesthesia can be captured using these kinds of models, including phenomenology such as burst suppression, emergence of high amplitude synchronized oscillations, etc.; see for instance Kuhlmann, L., Freestone, D. R., Manton, J. H., Heyse, B., Vereecke, H. E., Lipping, T., ... & Liley, D. T. (2016). Neural mass model-based tracking of anesthetic brain states. NeuroImage, 133, 438-456.). The same applies to the modeling of wakefulness LPF using neural masses to show that alpha oscillations emerge in thalamocortical systems at the edge of a dynamic phase transition, which can be reproduced by the dynamics of a Hopf bifurcation.

      3. Is it possible that some of the results in the essential tremor group were influenced by the disease and its effects on the LPF dynamics, as it is known that tremors and seizures are associated by themselves with departures from critical dynamics?

      4. Table 1 and other parts of the manuscript: multiple independent tests were conducted, does this require a correction for multiple comparisons to avoid the reporting of false positive results or its control by FDR or related approaches?

    1. Author Response

      The following is the authors’ response to the original reviews.

      Review 1

      Public Review

      The authors set out to develop an organoid model of the junction between early telencephalic and ocular tissues to model RGC development and pathfinding in a human model. The authors have succeeded in developing a robust model of optic stalk(OS) and optic disc(OD) tissue with innervating retinal ganglion cells. The OS and OD have a robust pattern with distinct developmental and functional borders that allow for a distinct pathway for pathfinding RGC neurites.

      This study falls short on a thorough analysis of their single cell transcriptomics (scRNAseq). From the scRNAseq it is unclear the quality and quantity of the targeted cell types that exist in the model. A comparative analysis of the scRNAseq profiles of their cell-types with existing organoid protocols, to determine a technical improvement, or with fetal tissue, to determine fidelity to target cells, would greatly improve the description of this model and determine its utility. This is especially necessary for the RGCs developed in this protocol as they recommend this as an improved model to study RGCs.

      Future work targeting RGC neurite outgrowth mechanisms will be exciting.

      We are grateful to Reviewer 1 for these constructive comments. We added plots for quality control in supp. Fig. S5 and quantification of cell clusters in Tab. 1. We compared the transcriptomes between CONCEPT organoids, Gabriel et al.’s brain/optic organoids (Gabriel et al., 2021; PMID: 34407456), and human fetal retinas HGW9 (Lu et al., 2020; PMID: 32386599), which strongly support our findings (Figs. 5, 6; see responses below for details). Besides FGFs/FGFR signaling, scRNA-seq identified additional candidate molecules that may provide axon guidance functions, and these candidate molecules are the focus of our future study.

      Recommendations For The Authors

      This study falls short on a thorough analysis of their single cell transcriptomics (scRNAseq).

      The scRNAseq figure needs to be better presented to allow for an adequate assessment of the model. As written the classification of the different clusters is hard to follow. A representative labeling of the suspected identity of the clusters in an infographic would aid the figure. Since it is hard to follow it is difficult to determine how well clusters correlate with designated cell types. PAX2 expression designating optic stalk seems to correlate well with the group 2 and the designation of the Optic disk, however PAX2 expression for the optic stalk is half in group 4 and half in group 9. what are group 4 and 9? It is also not clear how the thresholding for the given clusters was reached.

      To present the scRNA-seq dataset in a clearer way, we added dotted red lines in Fig. 4C to delineate eye (mostly retinal), telencephalic, and mixed cell populations. In Tab. 1, we showed assigned cell types, counts, and percentage for each cluster.

      PAX2+ VSX2- optic stalk cells were at edges of clusters 4, 8, 9 that had dorsal telencephalic identities. Clusters 4, 8, 9 were largely segregated along cell cycle phases (Fig. 4A, B, F), and these clusters differentially expressed gene markers SOX3, FGFR2, PRRX1, EDNRB, and FOXG1 (supp. Fig. S7A-S7D; Fig. 4C). In E14.5 mouse embryos, mouse orthologs of SOX3, FGFR2, PRRX1, and EDNRB were specifically expressed in dorsal telencephalon (Fig. S8AS8E); Foxg1 was specifically expressed in both dorsal and ventral telencephalon. Therefore, clusters 4, 8, and 9 have dorsal telencephalic identities, and PAX2+ VSX2- optic stalk cells are at edges of these telencephalic clusters. Lines 259-261; 297-298.

      Thresholding of cell clusters were determined by cell clustering parameters, which is described in Materials and Methods: FindVariableFeatures (selection.method = "vst", nfeatures = 2000), ScaleData, RunPCA, ElbowPlot, FindNeighbors (dims = 1:17), FindClusters (resolution = 0.5), and RunUMAP(dims = 1:17). Lines 717-721.

      The authors should make an attempt to calculate which different cell types are present and in what proportions. They should also discuss groups that are confounding. Since this is the first description of this technique it is critical to know how much of the model represents mature welldefined cells of interest.

      We assigned cell types to clusters and calculated cell counts and proportions of each cluster (Tab.1). The only undetermined cell cluster was cluster 13, which was the smallest one. We described top DEGs of cluster 13 and discussed the cluster. Lines 266-268.

      Concerning the focus on RGC isolation. It is interesting that CNTN2 can be used for an effective isolation however, there are many protocols for generating RGCs. Is CNTN2 expression unique to this protocol? If the authors claim that this protocol could be used for studying glaucoma, how does this protocol improve on the quality of RGCs compared to other protocols?

      RGC-specific CNTN2 expression was not unique to CONCEPT organoids. We isolated RGCs via CNTN2 from both CONCEPT organoids and 3-D retinal organoids in suspension. Indeed, isolated RGCs shown in the manuscript were from 3-D retinal organoids (see Materials and Methods for details). Importantly, our single cell RNA sequencing analysis demonstrated that CNTN2 was also differentially expressed in early RGCs from human fetal retinas (Fig. 5L, 5M). Therefore, isolation of human early RGCs via CNTN2 should be applicable widely.

      In CONCEPT organoids, RGC differentiation and directional axon growth were very efficient. Our study supports a model that FGFs from optic disc cells efficiently induce RGC differentiation and directional axon growth in adjacent retinal progenitor cells, as FGFR inhibitions drastically decreased the number of RGC somas and directional axon growth (Fig. 9). Therefore, CONCEPT organoids are useful in studying axon guidance cues in humans, which knowledge is much needed for axon regrowth from RGCs that are damaged in glaucoma. Notably, juvenile glaucoma gene CYP1B1 was found in assigned optic disc cells in both CONCEPT organoids and human fetal retinas (Fig. 4I, 5D), making CONCEPT organoids a testable model in studying the functions of CYP1B1 in human cells.

      A comparative analysis of the scRNAseq profiles of their model with existing organoid protocols, to determine a technical improvement, or with fetal tissue, to determine fidelity to target cells, would greatly improve the description of this model and determine its utility.

      In the revised manuscript, we compared the transcriptomes between CONCEPT organoids, Gabriel et al.’s brain/optic organoids (Gabriel et al., 2021; PMID: 34407456), and human fetal retinas HGW9 (Lu et al., 2020; PMID: 32386599). Gabriel et al. (2021) report “axon-like” projections in their “optic vesicle-containing brain organoids”. We found that PAX2+ optic disc, PAX2+ optic stalk, FOXG1+ telencephalic, and VSX2+ neuroretinal cell clusters that were found in CONCEPT organoids did not exist in Gabriel et al.’s organoids (supp. Fig. S12), indicating striking differences between Gabriel et al.’s organoids and our CONCEPT telencephalon-eye organoids.

      On the other hand, CONCEPT organoids and human fetal retinas HGW9 had similar expression signatures (Fig. 5). First, we identified a PAX2+ cell cluster in the human retinas HGW9. 64/113 DEGs in the PAX2+ cluster from human fetal retinas HGW9 were also DEGs of cluster 2 (assigned PAX2+ optic disc cells) from CONCEPT organoids. Second, CNTN2 was also differentially expressed in early RGCs of human fetal retinas. Third, when cells in cluster 18 and retinal progenitor clusters from the HGW9 dataset were combined with cells in clusters 2, 4, 5, 7 from the CONCEPT dataset for Seurat anchor-based clustering, cells in cluster 18 from HGW9 (H18) were grouped with cluster 2 from CONCEPT organoids (C2, assigned optic disc; N), and these cells expressed both PAX2 and VSX2 (arrowheads in Fig. 5N-5R). A small portion of H18 cells were grouped with cluster 4 from CONCEPT organoids (C4, assigned optic stalk; N), and these cells expressed PAX2 but not VSX2 (arrows in Fig. 5N-5R). Fourth, CONCEPT organoids and human fetal retinas shared many enriched GO terms in DEGs of assigned optic disc cells (Fig. 6).

      Collectively, transcriptomic comparisons support that our CONCEPT organoids are innovative and similar to human fetal retinas. Lines 325-392.

      Not clear what reporting on Lens cells in Figure 3 adds to the focus of the manuscript. The figure seems out of place with the flow of the manuscript.

      Lens cells were obvious in CONCEPT organoids. The presence of lens cells indicates that cysts have the developmental potential for both neural and non-neural anterior ectodermal cells. For a better flow, we added a transitional sentence at the beginning of the lens section. Lines 207208.

      Reviewer #2

      Public Review

      The study by Liu et al. reports on the establishment and characterization of telencephalon eye structures that spontaneously form from human pluripotent stem cells. The reported structures are generated from embryonic cysts that self-form concentric zones (centroids) of telencephaliclike cells surrounded by ocular cell types. Interestingly, the cells in the outer zone of these concentric structures give rise to retinal ganglion cells (RGCs) based on the expression of several markers, and their neuronal morphology and electrophysiological activity. Single-cell analysis of these brain-eye centroids provides detailed transcriptomic information on the different cell types within them. The single-cell analysis led to the identification of a unique cellsurface marker (CNTN2) for the human ganglion cells. Use of this marker allowed the team to isolate the stem cell-derived RGCs.

      Overall, the manuscript describes a method for generating self-forming structures of brain-eye lineages that mimic some of the early patterning events, possibly including the guidance cues that direct axonal growth of the RGCs. There are previous reports on brain-eye organoids with optic nerve-like connectivity; thus, the novel aspect of this study is the self-formation capacity of the centroids, including neurons with some RGC features. Notably, the manuscript further reports on cell-surface markers and an approach to generating and isolating human RGCs.

      Recommendations For The Authors

      The following significant issues, however, need to be addressed:

      The authors show RGC-like cells that grow axons toward the Pax2+ cells, suggesting that this is a model for RGC axon pathfinding. Is there support from transcriptomic data on the expression of guidance molecules? In addition, the authors need to characterize Pax2+ cells further. Do some give rise to astrocyte-like cells?

      We assessed the expression of known axon guidance genes in CONCEPT organoids. FGF8 and FGF9 trigger axon outgrowth in motor neuron column explants (Shirasaki et al., 2006). In CONCEPT organoids, FGF8 and FGF9 were differentially expressed in assigned optic disc cells; FGFR inhibition drastically decreased the number of RGC soma and directional axon growth (Fig. 9). In addition, SEMA5a and EFNB1 were expressed in both assigned optic disc and stalk cells, EFNB2 was highly expressed in assigned optic disc cells, and NTN1 was mostly expressed in assigned optic cells (supp. Fig. S12). Lines 307-310.

      We compared the transcriptomes between CONCEPT organoids, Gabriel et al.’s brain/optic organoids (Gabriel et al., 2021; PMID: 34407456), and human fetal retinas HGW9 (Lu et al., 2020; PMID: 32386599). Gabriel et al. (2021) report “axon-like” projections in their “optic vesicle-containing brain organoids”. We found that PAX2+ optic disc, PAX2+ optic stalk, FOXG1+ telencephalic, and VSX2+ neuroretinal cell clusters that were found in CONCEPT organoids did not exist in Gabriel et al.’s organoids (supp. Fig. S12), indicating striking differences between Gabriel et al.’s organoids and our CONCEPT telencephalon-eye organoids. Lines 327-345.

      To authenticate PAX2+ cells in CONCEPT organoids, we analyzed a single-cell RNA-seq dataset of human fetal retinas HGW9 and identified a similar PAX2+ cell population, cluster 18 (Fig. 5). Expression signatures of PAX2+ cells between CONCEPT organoids and human fetal retinas HGW9 were similar. Notably, cluster 18 differentially expressed PAX2, COL9A3, CYP1B1, SEMA5A, and FGF9 (Fig. 5B-5F), which were top DEGs of cluster 2 in CONCEPT organoids (Fig. 4F, 4G, 4I, 4K; SEMA5A was shown in supp. Fig. S12A). Overall, 64/113 DEGs of cluster 18 in human fetal retinas HGW9 were also DEGs of cluster 2 in CONCEPT organoids. In both HGW9 and CONCEPT organoids, expression of OLIG2, CD44, and GFAP was undetectable (supp. Fig. S14), indicating that astrocytes had not been generated yet at these stages.

      When cells in cluster 18 and retinal progenitor clusters from the HGW9 dataset were combined with cells in clusters 2, 4, 5, 7 from the CONCEPT dataset for Seurat anchor-based clustering, cells in cluster 18 from HGW9 (H18) were grouped with cluster 2 from CONCEPT organoids (C2, assigned optic disc; N), and these cells expressed both PAX2 and VSX2 (arrowheads in Fig. 5N-5R). A small portion of H18 cells were grouped with cluster 4 from CONCEPT organoids (C4, assigned optic stalk; N), and these cells expressed PAX2 but not VSX2 (arrows in Fig. 5N5R).

      We then compared functional annotations of DEGs (top 200 genes) of cluster 2 in CONCEPT organoids and DEGs (113 genes) of cluster 18 in human fetal retinas HGW9. Top GO terms in GO:MF, GO:CC, and GO:BP are shown (Fig. 6). For DEGs of cluster 2 in CONCEPT organoids, top enriched GO terms in GO:MF, GO:CC, and GO:BP were extracellular matrix structural constituent, collagen-containing extracellular matrix, and system development, respectively. Additional interesting GO:BP terms included axon development, astrocyte development, eye development, response to growth factor, cell adhesion, cell motility, neuron projection development, glial cell differentiation, and signal transduction. For DEGs of cluster 18 in human fetal retinas HGW9, top enriched GO terms in GO:MF, GO:CC, and GO:BP were cell adhesion molecule binding, extracellular space, and developmental process, respectively. Many GO terms were enriched in both samples, further indicating transcriptomic similarities in PAX2+ optic disc cells between CONCEPT organoids and human fetal retinas. Notably, GO terms astrocyte differentiation, neuron projection development, and glial cell differentiation were enriched in the DEGs of assigned optic disc cells for both CONCEPT organoids and human fetal retinas, consistent with expectations.

      Transcriptomic comparisons between CONCEPT organoids and human fetal retinas are found in lines 346-392.

      The Vsx2+Pax2+ population is not typically detected in vivo in the developing mouse eye. The authors claim that they detected them in vivo, but the data supporting this statement are lacking.

      We demonstrate that assigned optic disc cells expressed both VSX2 and PAX2, and this statement is trued for CONCEPT organoids and human fetal retinas HGW9 (Fig. 5N-5R). Please see the underlined sentence in the response to the comment above.

      Do the RGCs express subtype-specific markers? Do they detect markers of other retinal neurons typically born early in development-cones, amacrine cells, horizontal cells? The authors need to compare the transcriptome of different clusters to the published datasets from human and mouse retinae.

      The stage of CONCEPT organoids for scRNA-seq was at an early stage. In this dataset, subtypes of RGCs were undetectable. Isolated RGCs via CNTN2 were at more advanced stages. Distinct expression of POU4F2, ISL1, RBPMS, and SNCG indicate multiple subtypes of RGCs (Fig. 7L-7P).

      We did find other early retinal neurons in the scRNA-seq dataset: photoreceptor cells, amacrine/horizontal cells in CONCEPT organoids (Fig. 4U-4X), and these cells were also in cluster 11 in which RGCs were found.

      We performed transcriptomic comparisons between CONCEPT organoids, brain/optic organoids, and human fetal retinas. We found that PAX2+ optic disc, PAX2+ optic stalk, FOXG1+ telencephalic, and VSX2+ neuroretinal cell clusters that were found in CONCEPT organoids did not exist in Gabriel et al.’s organoids, indicating striking differences between Gabriel et al.’s organoids and our CONCEPT telencephalon-eye organoids (supp. Fig. S13). On the other hand, we found that expression signatures of CONCEPT organoids and human fetal retinas are similar (Figs. 5, 6).

      Transcriptomic comparisons are found in lines 325-392.

      Fig. 3: where are the "lens like" cells located? The structures in panels B and D look very different. Are these lens-cells toward the periphery or scattered throughout?

      Lens cells were dispersed in the zone in which neural retinal cells are located, which is shown in a low-magnification image (Fig. 3K). Panel B and D in Figure 3 were at different stages. At early stages, lens clusters were small (Fig. 3B). At later stages, lens clusters became bigger (Fig. 3D).

      Fig. 3K and L, TEM images: how do the authors know that these are lens cells?

      Western blot of these transparent cell clusters demonstrated that they were lens cells (Fig. 3L).

      Fig. 5: The authors claim that a reduced number of Pax2+ cells is associated with entry of the axons. It is not clear if this is just due to physical barriers or to active axon guidance.

      We believe that Reviewer 2 referred to the gap region of PAX2 expression in Fig. 7A, 7F. RGC axons grew toward and along adjacent PAX2+ VSX2+ cells. Since PAX2+ VSX2+ cells grossly formed a circular shape, RGC axons followed this circular shape. In a gap region of PAX2 expression, RGC axons exited the circle. The association of RGC axon growth with PAX2+ VSX2+ cells was very robust. Besides PAX2+ cell populations, we did not find any other cell populations that directed RGC axon growth.

      Fig. 5K: The authors refer to ALDH1A3 expression in the optic disk, but the presented section does not include the optic disk. In addition, ALDH1A3 is expressed in other regions of the developing retina (Fig. 5K, ref 71).

      We are sorry we did not make it clear. We referred to Li et al.’s (2000) paper (Mech Dev 95, 283-289) for Aldh1a3 expression in the optic stalk. Figure 7K was used to shown Aldh1a3 expression in peripheral retinas on sections.

      Line 263, Reference 68: The authors claim that col13A1 is specific to the human optic disk. However, col13A1 is expressed in many additional eye lineages (PMID: 10865988).

      We are sorry we did not make it clear. We meant that Col13A1 is prominently expressed in the optic disc, which is clearly shown in the referred paper (Figure 3D in the paper PMID: 10865988).

      The authors show that inhibiting FgfR results in fewer RGCs and loss of directed axonal growth. The number of cells is drastically reduced; thus, the relevance of the finding directly to axon guidance is not resolved.

      FGFR inhibitions drastically the number of RGC somas (Fig. 9F-9K). Additionally, remaining RGCs nearly did not grow directional axons (arrowheads in Fig. 9K), and a few remaining axons wandered around (arrow in Fig. 9K), indicating the role of FGF/FGFR signaling in RGC differentiation and directional axon growth.

      Fig. 1H and J: Vsx2 is outside the centroid in panels H and I, but inside the centroid in panels J and K. It is not clear what part of the centroid is shown. This needs to be clarified by adding a scheme.

      We are sorry we did not make it clear. We added separate-channel images showing VSX2 and PAX6 expression (supp. Figs. S1, S2) and a new diagram (left panel in Fig. 1B). Overall, FOXG1, VSX2, and PAX6 expression at days 15-17 formed three concentric zones spanning from the center to the periphery. At days 22-26, VSX2 expression expanded peripherally, largely overlapping PAX6 expression (supp. Figs. S1, S2).

      Pax6 should be in all cells, also on day 17. Show the separate channels, including DAPI.

      We added separate-channel images (supp. Figs. S1, S2). In cysts, PAX6 was expressed in all cells. After cysts attached to the culture surface and grew as colonies, distinct levels of PAX6 expression emerged in concentric zones. At days 17 and 26, PAX6 expression at the central zone (which cells expressed FOXG1) became lower, which is obvious in separate-channel images (supp. Figs. S1, S2). Consistently, PAX6 expression was low in FOXG1+ telencephalic cells in the scRNA-seq (Fig. 4C, 4D).

      Lines 27-30: this is a long and complex sentence which needs to be clarified.

      We broke it into a few sentences to make it clearer.

      Line 43: fix "Retina" to "Retinal"

      We fixed it.

      Lines 376-377: repeated "mechanisms of".

      We fixed it.

    1. Author Response

      The following is the authors’ response to the original reviews.

      Reviewer 1

      Question 1: While the CTD human brain organoids show a decrease in Cr (in absence of Cr in the culture medium) as compared to control organoids (4 times less), they are not devoid of Cr. Do these organoids express the two enzymes allowing Cr synthesis (AGAT and GAMT), and in which brain cell types? If yes, how to explain the decrease in Cr in the CTD organoids?

      There is a lack of functional CRT in the CTD human brain organoids. The basal level of creatine in CTD human brain organoid is significantly lower than in healthy human brain organoids. The intracerebral creatine synthesis is due to different expression of the AGAT and GAMT enzymes and relies on functional CRT for the transport of the GAA intermediate. Literature pointed out that both enzymes are rarely co-expressed (Braissant et al., 2001, PMID: 11165387) meaning that GAA intermediate needs to be transported by CRT to neurons for complete creatine synthesis. Even if we evidenced a slight mRNA expression of AGAT and GAMT enzymes, the creatine synthesis is not effective since the GAA intermediate could not be transported in cell expressing GAMT due to the non-functional creatine transporter in the CTD human brain organoids.

      Question 2. The rescue experiment, re-establishing a functional Cr transporter (CRT or SLC6A8) in the CTD human brain organoids, is very interesting, as this may help the design and development of new treatments for CTD. However, authors claim that the functional CRT expressed in the rescued CTD organoids was expressed in each cell. This may be a difficulty in the development of new CTD treatments, as CRT should be expressed in neurons and oligodendrocytes, but not in astrocytes. Authors may want to comment on this point.

      As shown in Figure S2C, the whole brain organoid in the rescue experiment shows the expression of the GFP protein, thus also the co-expressed wild-type CRT. In these experiments, we did not make a detailed cellular characterization of the rescued organoids, and this may be the aim of a separate study that will carry out experiments for an exact characterization of the cell-specific CRT expression and function in the rescued brain organoids. Accordingly, we corrected in the revised version of manuscript the statement on page 6 to the following: “SLC6A8 expressing brain organoids showed GFP fluorescence in the whole area of the organoid (Fig S2C).”

      Reviewer #1 (Recommendations for The Authors):

      • Authors may cite the recent review by Fernandes-Pires (2022) exposing the challenges to treat CTD (introduction, lines 57-58 for example).

      Reference has been added, lines 57-58 of the revised version

      • Authors may precise in their introduction (lines 60-61) that, while creatine (Cr) supplementation is not effective to treat CTD male patients, a proportion of female CTD patients is responsive to Cr supplementation (due to the differential inactivation of one of the X chromosome depending on the cells).

      Treating CTD appears simple: transport creatine into the brain cells. In individuals with creatine synthesis disorders, increasing brain creatine levels thanks to oral supplementation of creatine monohydrate and/or precursors improves neurodevelopmental outcomes. This task has proven more daunting than expected in CTD since oral creatine supplementation does not increase brain creatine concentrations. Literature and more specially data reported by Van de Kamp “X-linked creatine transporter deficiency: clinical aspects and pathophysiology. J Inhert Metab Dis 37 (5):715-733) describes 3 females CTD patients without improvement of clinical outcomes. Bruun et al., 2018 “Treatment outcome of creatine transporter deficiency: international restrospective cohort study: Metab. Brain Dis: 33:875-884 reports 2/3 CTD females with improvement of clinical outcome. Taken together the sentence has been modified in the revised version of the manuscript as follows: “Several combinations of nutritional supplements or Cr precursors l-arginine and l-glycine, have been studied as therapeutic approaches for CTD, but they have shown limited success (Bruun et al., 2018, Valayannopoulos et al., 2013) (lines 61-63, Page 4)

      • When comparing their new in vitro CTD model of human brain organoids with existing in vivo rodent models, authors may add the citation of the rat model of Duran-Trio et al (2021 & 2022), in particular for its description of CNS tissue alterations (dendritic spines density for example).

      The reference Duran-Trio et al (2021) has been added (page 4, line 70). The reference Duran-Trio et al (2022) has been added (page 11) and the sentence has been modified in the revised version of the manuscript as follows: “Reduced cortical spine density and reductions in protein levels of several synaptic markers have been observed in the brains of Slc6a8-/y mice and rats (Chen et al., 2021; Duran-Trio et al., 2022)”.

      Reviewer #2 (Recommendations For The Authors):

      There are only minor suggestions for improvement in this manuscript. The authors strongly link creatine uptake, the GSK3β pathway, and intellectual disability. Enhancing this claim with data on phosphorylation differences between organoids derived from healthy individuals and those from CTD patients could solidify this foundation and facilitate a more holistic understanding of the disease. In addition, the in vitro model based on organoids might be closer than other experimental setups; however, proving that those differences are also present in vivo would greatly benefit the story.

      As shown in Fig 6A-B, GSK3β is less phosphorylated on Ser9 in CTD brain organoids compared to healthy organoids, indicating that GSK3β is more active in organoids with reduced creatine levels. Studying the level of GSK3β phosphorylation in the mouse brain could be part of next experiments and another story.

      There is also some uncertainty around the rescue experiment using the exogenous SLC6A8 gene. Could the difference in creatine uptake between the rescue iPSCs and the healthy control be due to CRT overexpression? Higher levels of the transporter may explain the elevated levels of intracellular creatine. Thus, a comparison using Western blotting experiments could be a valuable addition to evaluating the expression levels of this protein.

      For the rescue experiment, we used a vector where SLC6A8 and eGFP were connected by an IRES2 sequence, providing simultaneous, but independent expression of the two proteins. CTD-rescue iPSC clones were selected based on high eGFP fluorescence. These clones probably have several copies of transgene in their genome, which could result in a higher abundance of SLC6A8 compared with healthy iPSCs. The difference in creatine uptake between the CTD-rescue iPSCs and the healthy control is probably due to CRT overexpression. However, there are no satisfactory anti-SLC6A8 antibodies commercially available to quantify CRT by western-blot. We would like to add that, although creatine uptake is higher in CTD-rescue iPSCs than in healthy control, the basal level of creatine (which corresponds to culture conditions for the rest of the experiments) is similar.

      Overall, this study provides valuable insights into CTD and potential therapeutic targets. It enriches our understanding of CTD and opens up new avenues for future research in this field.

      We thank the reviewer for their kind words and hope this study will be useful for other researchers in the CTD field.

    1. This make s it possible to transcribe, into j u r i d i c a l t e r m s , discontinuous obligations and tax records, b ut not to code continuous surveillance; it is a theory that makes it possible to found absolute power around and on the basis of the physical existence of the sovereign, but not continuous and permanent systems of surveillance.

    1. Author Response

      The following is the authors’ response to the original reviews.

      In response to the eLife assessment that “the analysis of the data is inadequate”, we strongly disagree and we to point out that in fact we follow the latest IUPHAR community guidelines on bias identification and quantification (Kolb et al, 2022). These protocols are not yet being used in the RTK and FGF fields, and thus the reviewer is not familiar with them, or with the concept of ligand bias. Our responses to the technical comments start at the bottom of page 7 of this document.

      We have edited the paper by adding a scaling step-by-step protocol in the Supplementary Data. We have also expanded the Discussion to help readers understand what is measured and how it is very novel. We have also changed the title of the manuscript. The edits in the Manuscript are marked in yellow. Our response to the reviewer is given below.

      Question/comment: 1. Previous studies have demonstrated that the variability of signal transduction stimulated by different FGF family members originates from their preferential activation of different members of the FGFR family (Ornitz et al., 1996). For example, it was previously shown that members of the FGF8 subfamily preferentially activate FGFR3c, whereas members of the FGF4 subfamily activate FGFR1c more potently than other FGFs. Moreover, it was shown that FGF18, a member of the FGF8 subfamily, preferentially binds to and activates the FGFR3c isoform. Indeed, this can be seen in the data shown in Figure 3 in this manuscript, where maximum levels of FGFR1 pY653/4 and pFRS2 are reached at different concentrations when stimulated with increasing concentrations of each ligand in HEK293T cells.

      The reviewer is correct that there are differences in the signaling of the different FGFRs, however these differences are not relevant for this work. This paper is only about FGFR1c, as this is the only FGF-receptor which is expressed in the mesenchyme of the developing limb bud (early limb bud stage, before the onset of mesenchymal condensations) and encounters different FGF ligands. In the article, we analyze the mechanism by which one FGFR recognizers and responds to three different FGFs.

      The reviewer also correctly points out that differences in our work “can be seen in the data shown in Figure 3 in this manuscript, where maximum levels of FGFR1 pY653/4 and pFRS2 are reached at different concentrations when stimulated with increasing concentrations of each ligand in HEK293T cells”. This is correct, but this is a statement about the potencies of the ligands, which is just one of three characteristics we explore here, namely potencies, efficacies, and bias. To determine if ligand bias exists or not, we need to compare two ligands and two responses (such as growth arrest and ECM degradation, or pY653/4 and pFRS2 phosphorylation). Ours is the first report of ligand bias in FGFR1 signaling, and the presence of bias goes far beyond simply differences in potencies (Kolb et al, 2022). Ligand bias in FGFR1 has never been demonstrated before. In part, this is because there have been no cell lines that give us the opportunity to compare two functional responses to FGF stimulus, via just one endogenously expressed FGFR variant. Notice that the paper that the reviewer is citing, (Ornitz et al., 1996), compares only 1 (one) type of response, when induced by different ligands, i.e. proliferation, and thus cannot answer the question if ligand bias exists or not. We have edited the Discussion to emphasize this fact. We have also changed the title.

      Two studies meant to characterize FGF binding to the FGFRs (Ornitz et al., 1996; Zhang et al., 2006) have defined the main rules of the FGF-FGFR interaction, such as exclusivity of the FGF3 subfamily (FGF3, FGF7, FGF10) for the ‘b’ variants of the FGFR1 and FGFR2. These studies however do not measure ligand binding. These studies were carried-out in BAF/3 cells, where the transfected FGFRs are treated with exogenous FGFs, to cause cell proliferation. As such, the studies have several limitations. In BAF/3 cells, the cell proliferation is used as a surrogate for FGF binding on FGFR. The FGFRs activate cell proliferation via RAS-ERK MAP kinase pathway. However, many other pathways of downstream signaling are initiated by FGFRs, regulating cell differentiation, migration, metabolism and apoptosis, in biological contexts. Using single cellular response (cell proliferation) as a surrogate for FGF binding to their receptors will favor FGF ligands causing cell proliferation. FGFs which have preference for other responses will incorrectly appear weakly binding and weakly activating in BAF/3 cells. Further, an FGF ligand binding with high affinity to the receptor but inducing a lower proliferative response will be recognized as a less ‘preferential’ for the particular receptor in the BAF/3 assay. Second, the significant diversity of signaling of 18 FGFs through seven FGFR variants in mammalian development suggests that many previously unappreciated nodules of FGF-FGFR signaling exist, including the recently discovered FGF signaling towards primary cilia, or interaction with insulin receptor system (Kunova Bosakova et al., 2019; Neugebauer et al., 2009; Nies et al., 2022). This diversity is not reflected in BAF/3 assay, which respond to FGFs with only one phenotype. This is why we have used the RCS cells in the manuscript. In RCS cells, at least two qualitatively different cell responses can be induced by the FGF signaling, making the cell model ideal for elucidating biased signaling.

      The so called ‘binding preferences’ based on the Ornitz articles are not binding measurements and should not be used universally to describe the FGF interactions with FGFRs, because we do not know what the term really means, nor what is it based on; the molecular basis of the FGFR signaling BAF/3 is poorly characterized. In our article, we model the processes occurring in every developing mammalian limb, where three FGF ligands (FGF4, FGF8, FGF9), released by the ectoderm at the surface of the limb bud, signal to the underlying mesenchymal cell expressing just one FGF-receptor, the FGFR1c (Mariani and Martin, 2003; Tabin and Wolpert, 2007). Unlike the BAF/3 cells engineered to ectopically express one FGFR and treated by recombinant FGFs in the lab, all three FGFs are recognized by cells expressing FGFR1c, and each of the three FGFs delivers unique morphogenetic information. The mechanisms underlying differential signaling of multiple FGFs via one FGFR are poorly defined, as the term ‘preferential signaling’ does not provide mechanistic explanation. Our article is a step towards understanding the complex processes of FGF ligand recognition and response. In our article, we evaluate the potency, the efficacy, the FGFinduced FGFR1c oligomerization and downregulation, and conformation of the active FGFR1c dimers in response to FGF4, FGF8 and FGF9. We show that FGF4, FGF8, and FGF9 are biased ligands, and that bias can explain differences in FGF4, FGF8 and FGF9-mediated cellular responses in development.

      References

      Kolb P, Kenakin T, Alexander SPH, Bermudez M, et al. Community guidelines for GPCR ligand bias: IUPHAR review 32. Br J Pharmacol. 2022;179, 3651-3674.

      Kunova Bosakova M, Nita A, Gregor T, Varecha M, et al. Fibroblast growth factor receptor influences primary cilium length through an interaction with intestinal cell kinase. Proc Natl Acad Sci U S A. 2019;116(10):4316-4325.

      Mariani FV, Martin GR. Deciphering skeletal patterning: clues from the limb. Nature. 2003;423(6937):319-25.

      Nies VJM, Struik D, Liu S, Liu W, et al. Autocrine FGF1 signaling promotes glucose uptake in adipocytes. Proc Natl Acad Sci U S A. 2022;119(40):e2122382119.

      Neugebauer JM, Amack JD, Peterson AG, Bisgrove BW, Yost HJ. FGF signalling during embryo development regulates cilia length in diverse epithelia. Nature. 2009;458(7238):651-4.

      Ornitz DM, Xu J, Colvin JS, McEwen DG, et al. Receptor specificity of the fibroblast growth factor family. J Biol Chem. 1996;271(25):15292-7.

      Tabin C, Wolpert L. Rethinking the proximodistal axis of the vertebrate limb in the molecular era. Genes Dev. 2007;21(12):1433-42.

      Zhang X, Ibrahimi OA, Olsen SK, Umemori H, Mohammadi M, Ornitz DM. Receptor specificity of the fibroblast growth factor family. The complete mammalian FGF family. J Biol Chem. 2006;281(23):15694-700.

      Question/comment: In order to be sure that the 'biased agonist' described in this manuscript for FGF8 binding is not caused by binding preference towards different FGFR members, the authors should present data comparing cell signaling via FGFR3c stimulated by FGF4, FGF8, and FGF9.

      Here, we study signaling by FGFR1, which is the only receptor that is expressed in the mesenchyme of the developing limb bud. FGFR3 is not expressed there, and thus we do not study FGFR3 in this paper. FGFR3 is important regulator of skeletal development, but is not involved in the early stages like FGFR1. When the bones are formed, FGFR3 regulates chondrocyte proliferation and differentiation in the growth plate cartilage (Colvin et al., 1996). In fact, we are currently performing experiments with FGFR3 and multiple FGF ligands, and we see that it also engages in biased signaling. However, these FGFR3 studies have no relevance to the current work and will be published separately.

      The so called ‘binding preferences towards different FGFR members’, based on the Ornitz articles (Ornitz et al., 1996; Zhang et al., 2006) provides no mechanistic explanation about differential FGF signaling via the activation of a single FGFR. Our article is a step forward towards the mechanism, by demonstration, for the first time, that ‘ligand bias’ may explain differential signaling by FGF4, FGF8 and FGF9 via FGFR1c.

      References

      Colvin JS, Bohne BA, Harding GW, McEwen DG, Ornitz DM. Skeletal overgrowth and deafness in mice lacking fibroblast growth factor receptor 3. Nat Genet. 1996;12(4):390-7.

      Ornitz DM, Xu J, Colvin JS, McEwen DG, MacArthur CA, Coulier F, Gao G, Goldfarb M. Receptor specificity of the fibroblast growth factor family. J Biol Chem. 1996;271(25):15292-7.

      Zhang X, Ibrahimi OA, Olsen SK, Umemori H, Mohammadi M, Ornitz DM. Receptor specificity of the fibroblast growth factor family. The complete mammalian FGF family. J Biol Chem. 2006;281(23):15694-700.

      Question/comment: 2. It is well-established that FGFR signaling by canonical FGF family members including FGF4, FGF8, and FGF9 is dependent on interactions of heparin or heparan sulfate proteoglycans (HSPG) to the ligand the receptors. Differential contributions of heparin to cell signaling mediated by FGF4, FGF8, and FGF9 binding and activation of different FGFRs expressed in RCS cells as this cell express endogenous HSPG molecules. This question should be addressed by comparing cell signaling via FGFRs ectopically expressed in BAF/3 cells (which do not possess endogenous FGFRs and HSPG) stimulated by FGF4, FGF8, and FGF9 in the absence or presence of different heparin concentrations. This approach has been applied many times in the past to explore and establish the role of heparin in control of ligand induced FGFR activation.

      The work cannot be done with BAF/3 cells, since the topic of the study is ligand bias so we need to compare at least two measurable responses. In RCS cells, the two functional responses are growth arrest and extracellular matrix degradation. In BAF/3 cells, ligand stimulation leads to one single response: proliferation.

      The HSPG and other sulphated proteoglycans work as low affinity FGF co-receptors. They stabilize the FGF secondary structure, present the FGFs to the FGFRs, and participate in FGFFGFR interactions (Yayon et al., 1991; Schlessinger et al., 2000; Zakrzewska et al., 2009). In the FGF field, the FGF-FGFR interaction is commonly supported by addition of exogenous heparin, which is highly sulphated glycosaminoglycan capable of full substitution of the cell-bound HSPGs in their function as low affinity FGF co-receptors.

      Most cells produce proteoglycans, including BAF/3 cells. The analysis of expression of FGFR overexpressed in BAF/3 cells demonstrated that FGFR1, FGFR2 and FGFR3 migrate as proteins of approximately 130-150 kDa (Ornitz et al., 1996; Fig. 1A), which implies extensive glycosylation in Golgi. For instance, the full-length amino acid sequence for human FGFR3 is 806 residues, which on acrylamide gel migrates as a band of approximately 85 kDa; heavier FGFR3 variants are Golgi-glycosylated proteins. The treatment with de-glycosylation enzymes reduces the molecular weight to the one expected from the amino acid sequence.

      To carry-out the BAF/3 experiment with FGF4, FGF8, and FGF9 in the absence or presence of different heparin concentrations, as the referee suggests, makes no sense. In BAF/3 cells, all FGF stimulations were done in the presence of 2 g/ml heparin (Ornitz et al., 1996; Zhang et al., 2006), because without heparin there would be no signaling. Even if the BAF/3 cells produce ample HSPGs, the heparin would still have to be used, because without it many of the FGFs would likely cause no response, regardless of the FGFR variant expressed. We and other have demonstrated, that most of the FGFs require stabilization by heparin to elicit signaling in cells expressing abundant amounts of HSPG (Buchtova et al., 2015; Chen et al., 2012).

      Why should we compare the FGF signaling in BAF/3 transfected with FGFR1, with the RCS cells which express endogenous FGFR1? In RCS cells, several cellular phenotypes caused by FGF signaling can be easily detected and quantified, in comparison with BAF/3 cells, which only respond to the FGF signaling by proliferation. No bias in signaling can be established in cells with display only single type of response. The RCS cells used in our paper represent one of the most tractable cellular models of FGFR signaling. There are more than 40 articles exploring the mechanisms of FGF-FGFR signaling in RCS cells, including mechanisms of FGF signal transduction, FGF regulation of cell cycle, cell proliferation, differentiation, premature senescence, loss of extracellular matrix, interaction of FGF signaling with WNT, cytokine and natriuretic peptide signaling, and others (Raucci et al., 2004; Priore et al., 2006; Kamemura et al., 2017; Kolupaeva et al., 2013; Krejci et al., 2005; Krejci et al., 2007; Krejci et al., 2010; Dailey et al., 2003; Rozenblatt-Rosen et al., 2002; Fafilek et al., 2008). In addition, the three treatments to inhibit pathological FGFR signaling which are now in human trials (RBM007, meclozine) or FDAapproved (vosoritide), were initially developed in RCS cells, benefiting from the well characterized molecular mechanisms of FGF signaling (Krejci et al., 2005; Wendt et al., 2015; Kimura et al., 2021; Matsushita et al., 2013). In comparison with RCS cells, very little is known about the mechanisms of the FGF signaling in BAF/3 cells, as the BAF/3 proliferation assay is used mostly to evaluate FGFR agonists and antagonists (Yamada et al., 2020; Kamatkar et al., 2019; Motomura et al., 2008). We have edited this information to the revised Discussion.

      References

      Buchtova M, Oralova V, Aklian A, Masek J, et al. Fibroblast growth factor and canonical WNT/βcatenin signaling cooperate in suppression of chondrocyte differentiation in experimental models of FGFR signaling in cartilage. Biochim Biophys Acta. 2015 May;1852(5):839-50.

      Buchtova M, Chaloupkova R, Zakrzewska M, Vesela I, et al. Instability restricts signaling of multiple fibroblast growth factors. Cell Mol Life Sci. 2015 Jun;72(12):2445-59.

      Chen G, Gulbranson DR, Yu P, Hou Z, Thomson JA. Thermal stability of fibroblast growth factor protein is a determinant factor in regulating self-renewal, differentiation, and reprogramming in human pluripotent stem cells. Stem Cells. 2012 Apr;30(4):623-30.

      Fafilek B, Balek L, Bosakova MK, Varecha M, et al. The inositol phosphatase SHIP2 enables sustained ERK activation downstream of FGF receptors by recruiting Src kinases. Sci Signal. 2018 Sep 18;11(548):eaap8608.

      Kamemura N, Murakami S, Komatsu H, Sawanoi M, et al. Biochem Biophys Res Commun. 2017 Jan 29;483(1):82-87.

      Kamatkar N, Levy M, Hébert JM. Development of a Monomeric Inhibitory RNA Aptamer Specific for FGFR3 that Acts as an Activator When Dimerized. Mol Ther Nucleic Acids. 2019 Sep 6;17:530-539.

      Kimura T, Bosakova M, Nonaka Y, Hruba E, Yasuda K, et al. An RNA aptamer restores defective bone growth in FGFR3-related skeletal dysplasia in mice. Sci Transl Med. 2021 ;13(592):eaba4226.

      Kolupaeva V, Daempfling L, Basilico C. The B55α regulatory subunit of protein phosphatase 2A mediates fibroblast growth factor-induced p107 dephosphorylation and growth arrest in chondrocytes. Mol Cell Biol. 2013 Aug;33(15):2865-78.

      Krejci P, Masri B, Salazar L, Farrington-Rock C, et al. Bisindolylmaleimide I suppresses fibroblast growth factor-mediated activation of Erk MAP kinase in chondrocytes by preventing Shp2 association with the Frs2 and Gab1 adaptor proteins. J Biol Chem. 2007;282(5):2929-36.

      Krejci P, Masri B, Fontaine V, Mekikian PB, et al. Interaction of fibroblast growth factor and C-natriuretic peptide signaling in regulation of chondrocyte proliferation and extracellular matrix homeostasis. J Cell Sci. 2005 Nov 1;118(Pt 21):5089-100.

      Krejci P, Prochazkova J, Smutny J, Chlebova K, et al. FGFR3 signaling induces a reversible senescence phenotype in chondrocytes similar to oncogene-induced premature senescence. Bone. 2010;47(1):102-10.

      Matsushita M, Kitoh H, Ohkawara B, Mishima K, et al. Meclozine facilitates proliferation and differentiation of chondrocytes by attenuating abnormally activated FGFR3 signaling in achondroplasia. PLoS One. 2013;8(12):e81569.

      Motomura K, Hagiwara A, Komi-Kuramochi A, Hanyu Y, et al. An FGF1:FGF2 chimeric growth factor exhibits universal FGF receptor specificity, enhanced stability and augmented activity useful for epithelial proliferation and radioprotection. Biochim Biophys Acta. 2008 Dec;1780(12):1432-40.

      Ornitz DM, Xu J, Colvin JS, McEwen DG, MacArthur CA, Coulier F, Gao G, Goldfarb M. Receptor specificity of the fibroblast growth factor family. J Biol Chem. 1996;271(25):15292-7.

      Priore R, Dailey L, Basilico C. Downregulation of Akt activity contributes to the growth arrest induced by FGF in chondrocytes. J Cell Physiol. 2006 Jun;207(3):800-8.

      Raucci A, Laplantine E, Mansukhani A, Basilico C. Activation of the ERK1/2 and p38 mitogen-activated protein kinase pathways mediates fibroblast growth factor-induced growth arrest of chondrocytes. J Biol Chem. 2004;279(3):1747-56.

      Robinson JW, Egbert JR, Davydova J, Schmidt H, et al. Dephosphorylation is the mechanism of fibroblast growth factor inhibition of guanylyl cyclase-B. Cell Signal. 2017;40:222229.

      Rozenblatt-Rosen O, Mosonego-Ornan E, Sadot E, Madar-Shapiro L, et al. Induction of chondrocyte growth arrest by FGF: transcriptional and cytoskeletal alterations. J Cell Sci. 2002 Feb 1;115(Pt 3):553-62.

      Schlessinger J, Plotnikov AN, Ibrahimi OA, Eliseenkova AV, et al. Crystal structure of a ternary FGF-FGFR-heparin complex reveals a dual role for heparin in FGFR binding and dimerization. Mol Cell. 2000 Sep;6(3):743-50.

      Wendt DJ, Dvorak-Ewell M, Bullens S, Lorget F, et al. Neutral endopeptidase-resistant Ctype natriuretic peptide variant represents a new therapeutic approach for treatment of fibroblast growth factor receptor 3-related dwarfism. J Pharmacol Exp Ther. 2015 Apr;353(1):132-49.

      Yamada R, Fukumoto R, Noyama C, Fujisawa A, et al. An epidermis-permeable dipeptide is a potential cosmetic ingredient with partial agonist/antagonist activity toward fibroblast growth factor receptors. J Cosmet Dermatol. 2020 Feb;19(2):477-484.

      Yayon A, Klagsbrun M, Esko JD, Leder P, Ornitz DM. Cell surface, heparin-like molecules are required for binding of basic fibroblast growth factor to its high affinity receptor. Cell. 1991 Feb 22;64(4):841-8.

      Zakrzewska M, Wiedlocha A, Szlachcic A, Krowarsch D, et al. Increased protein stability of FGF1 can compensate for its reduced affinity for heparin. J Biol Chem. 2009 Sep 11;284(37):25388-403. doi: 10.1074/jbc.M109.001289.

      Zhang X, Ibrahimi OA, Olsen SK, Umemori H, Mohammadi M, Ornitz DM. Receptor specificity of the fibroblast growth factor family. The complete mammalian FGF family. J Biol Chem. 2006;281(23):15694-700.

      Question/comment: It is impossible to interpret the FGFR binding characteristics and cellular activates of FGF4, FGF8, and FGF9 in the absence of information about the role of heparin in their binding and activation.

      We do not measure ligand binding to FGFR1 in this study. We record biological responses when we treat with FGF different ligands, and thus we measure the efficacy and the potency of each ligand to induce a response, and then we compare 2 ligands and 2 responses to determine if bias exists or not. We do not ask questions about the role of heparin, as it is always there no matter if we treat with FGF4, FGF8, or FGF9.

      Why it is not possible to interpret our cellular data? In our article, the RCS cells were treated with FGFs in the presence of 1 g/ml heparin, as clearly stated in Methods section. Using heparin at 1 or more μg/ml, to stabilize FGFs and negate the effect of endogenous HSPG, is a standard approach in the FGF field. This includes the two articles, which the whole field have used for more than 20 years as a basic reference for FGF-FGFR interactions (Ornitz et al., 1996; Zhang et al., 2006). In these studies, 2 μg/ml of heparin along with FGFs was used to treat BAF/3 cells; no experiments were conducted without heparin, as is does not make sense. Most likely, without heparin the obtained FGF-FGFR ‘preferences’ would, in fact, be the differences in FGF thermal stability, as we clearly demonstrate in our previous study (Buchtova et al., 2015). The latter article gives a detailed information about the role of heparin in the signaling of multiple FGFs in RCS cells.

      References

      Buchtova M, Chaloupkova R, Zakrzewska M, Vesela I, Cela P, Barathova J, Gudernova I, Zajickova R, Trantirek L, Martin J, Kostas M, Otlewski J, Damborsky J, Kozubik A, Wiedlocha A, Krejci P. Instability restricts signaling of multiple fibroblast growth factors. Cell Mol Life Sci. 2015 Jun;72(12):2445-59.

      Ornitz DM, Xu J, Colvin JS, McEwen DG, MacArthur CA, Coulier F, Gao G, Goldfarb M. Receptor specificity of the fibroblast growth factor family. J Biol Chem. 1996;271(25):15292-7. <br /> Zhang X, Ibrahimi OA, Olsen SK, Umemori H, Mohammadi M, Ornitz DM. Receptor specificity of the fibroblast growth factor family. The complete mammalian FGF family. J Biol Chem. 2006;281(23):15694-700.

      Technical Comments/Answers

      Question/comment: 3. It is not clear how some of the experimental data were analyzed. Blots in Figures 3A and 3B should include controls (total FGFR1 for pY653/4 and total FRS for pFRS2). How are the data shown in Figure 3C normalized? It does look like the level of phosphorylation was all normalized against the strongest signals irrespective of which ligand was used. Each data representing each ligand should be separately normalized.

      The reviewer is correct that most often in the RTK literature “each data representing each ligand is separately normalized”. But this approach will eliminate all the information about ligand efficacies and about ligand bias; it will only yield information about the potencies. Here we are not only interested in the potencies, as we are also interested to determine if bias exists or not. As such, we follow scaling protocols that have been established and are currently recommended for ligand bias studies (Kolb et al, 2022).

      One way to explain why the scaling that the reviewer is recommending is not correct for this work is to look at equation 2. What the reviewer is suggestion is to set all values of Etop to 1. In this case, the bias coefficient will depend only on the measured potencies, EC50. But this contradicts the very definition of bias, as it is NOT a difference in potencies only. In the literature, differences in potencies are called “quantitative differences”, while ligand bias describes differences which are called “qualitative” or “fundamental” (Kenakin, 2019).

      To eliminate confusion, we have added a scaling protocol to the Supplement of the paper.

      References

      Kolb P, Kenakin T, Alexander SPH, Bermudez M, et al. Community guidelines for GPCR ligand bias: IUPHAR review 32. Br J Pharmacol. 2022;179, 3651-3674.

      Kenakin T. Biased Receptor Signaling in Drug Discovery. Pharmacol Rev 2019;71, 267315.

      Question/comment: 4. In page 6, authors used the plot shown in Figure 3 for 'FGFR downregulation' to conclude that "the effect of FGF4 on FGFR1 downregulation is smaller when compared to the effects of FGF8 and FGF9. However, it is unclear how the data shown in the plot was normalized - none of the data seem to reach "1.0". Moreover, the plot seems to suggest that FGF4 can strongly downregulate FGFR as it can downregulate FGFR with higher potency.

      The Western blots assessing FGFR1 expression are easy to scale, as the value in the absence of ligand is set to 1. The expression decreases as a function of the ligand concentration. We plot FGFR1 downregulation, so we subtract 1 from the scaled FGFR1 band intensities. The total amount of FGFR1 never becomes undetectable (i.e. zero), as the ligand concentration is increased. Thus, a value of 1 in the downregulation curve is never obtained.

      We have added a protocol for this scaling in the Supplement.

      Question/comment: 5. The structural basis of FGFR1 ligand bias and the different dimeric configurations and interactions between the kinase domain of FGFR1 dimers are not warranted (Figure 6). In the absence of any structural experimental data of different forms of FGFR dimers stimulated by FGF ligands the model presents in the manuscript is speculative and misleading.

      This statement about Figure 6 is not fully correct because Figure 6A and B show experimental data. These are FRET experiments which show that the biased ligand, FGF8, induces different FGFR1 transmembrane domain conformation, as compared to FGF4 and FGF9.

      The rest of the panels in Figure 6 show modeling using PyRosetta. These are indeed not experimental data, but to the best of our knowledge this is the very first time PyRosetta has been used to predict kinase-kinase interfaces.

    1. Author Response

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public Review):

      “First, I agree with the authors of this manuscript that conformational changes in the XFEL structures with 2.8 A resolution are not reliable enough for demonstrating the subtle changes in the electron transfer events in this bacterial photosynthesis system. Actually, the data statistics in the paper by Dods et al. showed that the high-resolution range of some of the XFEL datasets may include pretty high noise (low CC1/2 and high Rsplit) so the comparison of the subtle conformational changes of the structures is problematic.

      The manuscript by Gai Nishikawa investigated time-dependent changes in the energetics of the electron transfer pathway based on the structures by Dods et al. by calculating redox potential of the active and inactive branches in the structures and found no clear link between the time-dependent structural changes and the electron transfer events in the XFEL structures published by Dods, R.et al. (2021). This study provided validation for the interpretation of the structures of those electron-transferring proteins.

      The paper was well prepared.”

      Thank you very much for your positive and insightful comment. We greatly appreciate your suggestion regarding the high noise levels of the XFEL structures, as indicated by the low CC1/2 and high Rsplit values reported by Dods et al. Including this information in the Introduction section will draw readers’ attention to the concerns about the reliability of these XFEL structures. We have incorporated the following sentences into the Introduction section:

      “Furthermore, the data statistics provided by Dods et al. indicate that the high-resolution range of some XFEL datasets exhibit high levels of noise, as evidenced by low CC1/2 and high Rsplit values. These observations raise concerns about the reliable comparison of subtle conformational changes among these structures. Hence, caution must be exercised when interpreting these XFEL structures in terms of their ability to accurately capture relevant conformational changes.”

      The following sentences have also been added to the Conclusions section:

      “Hence, it is crucial to exercise caution when interpreting time-dependent XFEL structures, especially in the absence of comprehensive evaluations of the energetics and accompanying structural changes. This cautionary note should serve as a counterargument in the future, highlighting the potential pitfalls associated with presenting time-dependent XFEL structures of insufficient quality and drawing conclusive interpretations of protein structural changes that may not be distinguishable from significant experimental errors.”

      Recommendations for the authors

      “Figure 1 needs clear labels or detailed notes in the figure legend for the labels such as M, L, Pm, Pl, etc.”

      In Figure 1, we have increased the size of the labels to improve visibility. Additionally, we have expanded the figure legend to include detailed explanations of the abbreviations used, such as M, L, PM, PL, etc. We believe that these modifications have significantly improved the clarity and comprehensibility of Figure 1.

      Reviewer #2 (Public Review):

      “The manuscript by Nishikawa et al. addresses time-dependent changes in the electron transfer energetics in the photosynthetic reaction center from Blastochloris viridis, whose time-dependent structural changes upon light illumination were recently demonstrated by time-resolved serial femtosecond crystallography (SFX) using X-ray free-electron laser (XFEL) (Dods et al., Nature, 2021). Based on the redox potential Em values of bacteriopheophytin in the electron transfer active branch (BL) by solving the linear Poisson-Boltzmann equation, the authors found that Em(HL) values in the charge-separated 5-ps structure obtained by XFEL are not clearly changed, suggesting that the P+HL- state is not stabilized owing to protein reorganization. Furthermore, chlorin ring deformation upon HL- formation, which was expected from their QM/MM calculation, is not recognized in the 5-ps XFEL structure. Then the authors concluded that the structural changes in the XFEL structures are not related to the actual time course of charge separation. They argued that their calculated changes in Em and chlorin ring deformations using the XEFL structures may reflect the experimental errors rather than the real structural changes; they mentioned this problem is due to the fact that the XFEL structures were obtained at not high resolutions (mostly at 2.8 Å). I consider that their systematic calculations may suggest a useful theoretical interpretation of the XFEL study. However, the present manuscript insists as a whole negatively that the experimental errors may hamper to provide the actual structural changes relevant to the electron transfer events. My concerns are the following two points:

      Is the premise of the authors for the electron transfer energetics obviously valid?

      Could the authors find any positive aspect(s) in the XFEL study?

      The authors' argument is certainly due to their premise "Em(HL) is expected to be exclusively higher in the 5-ps and 20-ps structures than in the other XFEL structures due to the stabilization of the [PLPM]•+HL•- state by protein reorganization" as noted in the Results and Discussion (p. 12, lines 180-182); however, it is unknown whether this premise can be applied to the ps-timescale electron transfer events. The above premise is surely based on the Marcus theory, as the authors also noted in the Introduction "The anionic state formation induces not only reorganization of the protein environment (ref. 5: Marcus and Sutin, 1985) but also out-of-plane distortion of the chlorin ring (ref. 6: two of the authors, Saito and Ishikita, co-authored, 2012)"; however, it is unknown whether protein reorganization can follow the ps-timescale electron transfer events. Indeed, Dods et al. mentioned in the Nature paper (2021) "The primary electron-transfer step from SP (special pair PLPM) to BPhL (HL) occurs in 2.8 {plus minus} 0.2 ps across a distance of 10 Å by means of a two-step hopping mechanism via the monomeric BChL molecule and is more rapid than conventional Marcus theory". It was also mentioned, "By contrast, the 9 Å electron-transfer step from BPhL to QA has a single exponential decay time of 230 {plus minus} 30 ps, which is consistent with conventional Marcus theory". As for the primary electron-transfer step from PLPM to HL, Wang et al. (2007, Science 316, 747; cited as ref. 8 in the Nature paper 2021) reported, by monitoring tryptophan absorbance changes in various reaction centers in which the driving forces (namely, the Em gaps between PLPM and HL) are different, that the protein relaxation kinetics is independent of the charge separation kinetics on the picosecond timescale. On the other hand, in the EPR study cited by the authors as ref. 7 (Muh et al. (1998) Biochemistry 37, 13066), although the authors described "two distinct conformations of HL- were reported in spectroscopic studies" (p. 3, lines 44-45), it should be noted that conformation of HL- was formed by 1 or 45 s illumination prior to freezing, and hence the second-order reorganized conformations may differ from picosecond-order conformations observed by the XFEL study (Nature, 2021) and/or the transient absorption spectroscopy (Science, 2007).

      Therefore, I consider there is a possibility that the authors' findings may reflect not experimental errors but the actual ps-timescale phenomena presented by the first-time XFEL study on the timescale of the primary charge-separation reactions of photosynthesis. Thus I would like to suggest that the authors reconsider the premise for the electron transfer energetics on the picosecond timescale.

      In any case, to discuss the experimental errors in the XFEL study, it is better to calculate the Em(QA) changes in the 300-ps and 8-us XFEL structures, which showed distinctive structural changes even at the 2.8 Å resolution as discussed by Dods et al. Then, if the Em(QA) values are changed as expected from theoretical calculations, such calculated results may suggest a useful theoretical interpretation of the XFEL study as a positive aspect. If the Em(QA) values are not higher in the 300-ps and 8-us structures than in the other structures, it may be argued that the experimental errors would be so large that the XFEL structures are irrelevant to the electron transfer events expected from theoretical calculations.”

      We appreciate the reviewer's constructive suggestions, which significantly contributed to the improvement of our manuscript. We have performed additional calculations to address the reviewer's suggestion. We calculated the changes in Em(QA) in the XFEL structures. The Em(QA) values in the 300-ps and 8-μs structures were not significantly higher than those in the other structures (Figure 8).

      These findings align with the scenario proposed by the reviewer, suggesting that the experimental errors are substantial, rendering the XFEL structures irrelevant to the electron transfer events. The results further reinforce our argument that the observed structural changes in the XFEL structures are not directly linked to the expected changes in electron transfer events.

      We have incorporated these important points into the revised version as follows:

      “One might argue that the loss of the link between the formation of the charge-separated state and the Em(HL) change (Figure 5) is not due to experimental errors but rather represents the actual ps-timescale phenomena during the primary charge-separation reactions (e.g., Dods et al. noted that “the primary electron-transfer step to HL is more rapid than conventional Marcus theory” 8). However, even if this were the case, this hypothesis regarding the relevance of the XFEL structures to the electron-transfer events can be further explored by examining the changes in Em(QA) among the XFEL structures, considering the relatively slow electron-transfer step to QA that allows sufficient protein relaxation to occur (e.g., Dods et al. stated that “the electron-transfer step to QA has a single exponential decay time of 230 ± 30 ps, consistent with conventional Marcus theory” 8). That is, if the Em(QA) values are not higher in the 300-ps and 8-μs structures than in the other structures, it suggests that significant experimental errors exist, rendering the XFEL structures irrelevant to the electron transfer events. Consistent with this perspective, the present results demonstrate that the Em(QA) values in the 300-ps and 8-μs structures are not significantly higher than those in the other structures, including the dark state structure (Figure 8). Consequently, the lack of a clear relationship between the charge separated state and the changes in Em(QA) at 300 ps and 8-μs further strengthens the argument that the XFEL structures are irrelevant to the electron transfer events.”

      Recommendations for the authors

      “In addition to my main concerns, the following points should also be taken into consideration:

      The authors presented from QM/MM calculations out-plane distortion of HL (and HM) induced upon the reduction using the dark structure for dataset a (Table 5). However, to compare with the XFEL structures corresponding to the charge-separated state [PLPM]+HL-, positive charge should be located at the special pair (or, either PL or PM). In the present work, it is noted that counter ions were added to neutralize the entire system (in Methods: p. 6, lines104-105), but the location(s) of the positive charge is unclear.”

      We appreciate the valuable suggestion provided by the reviewer. To address this concern, we have calculated out-of-plane distortion of HL•– in the presence of PL•+. The results have been included in Table 5. Note that the results obtained in the presence of PL•+ are substantially the same as those obtained in PL0 (Table 5).

      For clarity, we have rephrased the sentence referring to counter ions as follows:

      “To neutralize the entire system, counter ions were added randomly around the protein using the Autoionize plugin in VMD 22.”

      “In relation to the calculations, the authors showed the induced out-plane distortion of HM for dataset a; however, the results for HM seem not to be mentioned anywhere. Instead, the calculations for HL of the dark structure for dataset b should be useful, especially for comparing with the time-dependent changes in the dataset b XFEL structures as shown in Figure 7.”

      We have made Table 6 to present the results for dataset b. The results are consistent with those for dataset a (Table 5).

    1. Author Response

      We express our gratitude to the editors for acknowledging the significance of our findings and facilitating the review process. We would also like to thank the reviewers for dedicating their time to thoroughly read the manuscript and provide valuable insights.

      During the revision process, we will address the raised issues and concerns, confident that our revisions will enhance the clarity and strength of the paper.

      In response to the reviewers' feedback, we acknowledge that some of the relevant information was previously presented in our published papers (Meng, Dev Cell. 2017; Xia, Elife. 2021). However, we recognize that in the current version of the manuscript, we may not have expounded on these details as clearly as needed. We will rectify this shortcoming in the revised version to provide a more comprehensive account of our research.

      We also explain our perspective on why the discovery of MYRF controlling lin-4 upregulation is crucial in addressing unanswered key questions in developmental biology.

      The Loss of Function Characteristics of myrf-1(ju1121 G274R)

      We would like to present the evidence supporting the characteristics of myrf-1(ju1121) as a loss-of-function mutation affecting both myrf-1 and myrf-2. In our initial paper (Meng, Dev Cell. 2017), the nature of this mutation was a significant focus of our research.

      Our investigation involved analyzing multiple alleles (tm, ok, gk alleles from CGC, and indel alleles made in-house) of myrf-1 and myrf-2, as well as their double mutants. Here is a summary of our current understanding based on these analyses:

      1. myrf-1 single loss-of-function (l.f.) mutants exhibit penetrant arrest at the end of L1 or early L2 stages. However, they only display very mild deficiency in DD synpatic remodeling at 21 hours, primarily caused by a delay.

      2. myrf-2 single l.f. mutants behave similarly to the wild type, exhibiting no significant developmental abnormalities, including synpatic remodeling.

      3. myrf-1 and myrf-2 double l.f. mutants exhibit penetrant arrest during L2, occurring approximately half a stage later than in myrf-1 single mutants.

      4. Remarkably, myrf-1 and myrf-2 double l.f. mutants exhibit severe blockage in synaptic remodeling, indicating that both genes act collaboratively to drive this essential process (Meng, Figure 5).

      5. The myrf-1(ju1121 G274R) mutation exhibits severe synaptic remodeling blockage and arrest during L2, closely resembling myrf-1 myrf-2 double mutants (Meng, Figure 1 and 2).

      Therefore, despite myrf-1's more significant role in development based on the arrest phenotype, synaptic remodeling requires the combined function of myrf-1 and myrf-2. This redundancy is further supported by the analysis of the new set of specific myrf-1 mutants (Xia, Figure 6).

      Both myrf-1 and myrf-2 are broadly expressed (Meng, Figure 3 and S5), and they undergo developmentally regulated cell-membrane to nucleus translocation (Xia, Figure 4 and Supplement 1). Overexpressing N-MYRF-1 and full-length MYRF-2 in DD neurons leads to precocious synaptic remodeling (Meng, Figure 4 and 5). Interestingly, overexpressing full-length myrf-1 does not have the same effect, indicating potential regulatory differences between these two factors.

      The myrf-1(ju1121 G274R) mutation is located in the N-terminal region of the Ig-fold type DNA-binding domain, specifically within the loop between a and b Ig-fold strands. This site is conserved across all metazoan MYRFs (Meng, Figure 1D and 6A). The mutant myrf-1(G274R) loses its DNA binding ability, as demonstrated by a gel mobility shift assay using the counterpart residue mutation in mammalian MYRF (Meng, Figure 6B).

      MYRF-1(ju1121 G274R) mutant interfering with normal MYRF’s function has been supported by molecular genetics experiments (Meng, Figure 6C-E) and biochemical analysis. In essence, the MYRF-1(G274R) mutant does not impact MYRF trimerization or MYRF-1-MYRF-2 interaction, but blocks DNA binding. Substantial evidence has confirmed the physical binding of MYRF-1 and MYRF-2 both in vitro and in vivo (Meng, Figure 5G and S6; Xia, Figure 1A). Importantly, MYRF- 1(ju1121 G274R) is still able to bind to MYRF-2, as supported by coIP analysis (Meng, Figure S7), indicating that the G274R mutation does not disrupt the MYRF-1-MYRF-2 interaction. This observation is consistent with the characteristics of the MYRF structure (PMID: 28160598; PMID: 34345217). The critical interface of the MYRF trimer is located in the alpha-helix upstream of the ICE domain, the beta sheets of the ICE, and the beta-helix of the bridge region between ICE and DBD. Therefore, since MYRF-1(ju1121 G274R) is not situated in this critical interface of the MYRF trimer, it is unlikely that the mutation affects MYRF trimerization.

      With all available evidence, we propose a reasonable model where myrf-1(ju1121) has two effects: rendering myrf-1 defective in DNA binding and negatively interfering with MYRF-2 by forming a non-functional trimer consisting of monomer MYRF-1(ju1121) and wild-type MYRF-2.

      Regarding the potential neomorphic function of myrf-1(ju1121), the myrf-1(ju1121)/+ individuals appear superficially wild type and show no defects in synaptic remodeling. Furthermore, we have generated a myrf-1 minigene array that results in a complete rescue of the developmental phenotype in myrf-1(ju1121) (Meng, Figure 3A-D). Notably, the transgene is expected to be low copy numbered, as it was generated by injecting at a very low concentration of 0.1 ng/μl. The complete rescue of the phenotype strongly suggests that any potential aberrant effects caused by myrf-1(ju1121) mutants are minimal.

      In summary, myrf-1(ju1121) behaves similarly to myrf-1 myrf-2 double mutants, and we utilized this allele for the convenience of analysis.

      Due to the essential role of MYRF-controlled processes in larval development and the lack of detectable phenotypic effects in myrf-2 single loss-of-function mutants, it is evident that myrf-2 plays a minor role in these developmental events. Considering that development regulation rarely follows a simple linear or accumulative fashion, deciphering the relative contributions of each myrf-1 and myrf-2 in specific developmental events may not be straightforward. Consequently, our primary focus remains on investigating the functions of myrf-1.

      Nevertheless, we concur that providing a clear description of the impact of myrf-1 and myrf-2 single mutants on lin-4 expression is crucial. We are actively conducting ongoing analyses, and the new findings will be incorporated in the revised version of our manuscript.

      Characterizing myrf-1(syb1313, 1-700) as a Hyperactive Allele of myrf-1

      The cleavage and release of N-MYRF are developmentally regulated and occur in late L1. We have substantial evidence supporting the interaction between the non-cytoplasmic region of MYRF and another transmembrane protein, PAN-1, which is crucial for delivering MYRF onto the cell membrane (Xia, Figure 1, 7, 8, 10, 11 and 13). The myrf-1(syb1313, 1-700) mutant lacks the non-cytoplasmic region of MYRF, which is the interaction site for PAN-1. Initial analyses revealed that in the mutants, MYRF-1(syb1313) remains in the cytoplasmic, ER-like structure, resulting in larval arrest during L2 (Xia, Figure 8).

      However, a more careful analysis unveiled that a small amount of N-MYRF is processed and enters the nucleus, but this process is not dependent on the normal developmental timing and may take place during early-mid L1. Consequently, this leads to precocious yet discordant DD synaptic remodeling and M-cell lineage division (Xia, Figure 6 and 9). Considering the precocious development, the low quantity of nuclear N-MYRF, and the overall larval arrest phenotype observed in the mutants, we conclude that myrf-1(syb1313) represents an inconsistent, weak hyperactive form of MYRF-1. Moreover, the hyperactive function may be context-dependent, for instance, presence of myrf-1(syb1313) may be sufficient for certain needs in neurons but insufficient for epidermis. Our ongoing research to identify the downstream targets of MYRF also supports this notion.

      Given that the myrf-1(syb1313) mutant has been thoroughly characterized and published, it is the most suitable option for use in our current investigations on lin-4 expression.

      Furthermore, we employed the MYRF-1(delete 601-650) deletion mutant construct, which is a significantly more effective hyperactive MYRF-1 mutant when overexpressed. This reagent stems from our ongoing study, which is dedicated to identifying the self-inhibitory mechanisms of MYRF cleavage. The extensive volume of data that led to this discovery makes it impractical to include in the current manuscript. However, we are eager to share the substantial effects of MYRF-1(delete 601-650) mutants in activating lin-4 expression, which strengthens the role of MYRF in regulating lin-4. We will take care to revise this section to provide clearer references.

      The lin-4p::nls::mScarlet(umn84) knock-in reporter is loss-of-function for lin-4; however, lin-4 mature microRNA does not affect lin-4 expression.

      Indeed, the lin-4 knock-in reporter umn84 removes lin-4 coding sequence. As a result, the homozygous reporter strain is also lin-4 null mutants. Since both lin-4 and myrf-1 are located on Chr II and are less than 4 m.u. apart, the constructed strain is myrf-1 lin-4(umn84) / mIn1 (balanced by mIn1). Consequently, the myrf-1 homozygous animal is also lin-4 reporter homozygous.

      Regarding the endogenous function of the "auto-regulating element," we are aware of the follow-up paper by Frank Slack's group, in which they concluded that the previously reported sequence is dispensable for lin-4 expression, and the loss of lin-4 does not affect the expression of its primary transcript (PMID: 29324872). To avoid confusion, we will remove or revise the introductory sentences as necessary to accurately reflect this information.

      Additionally, besides analyzing the expression of the knock-in reporter of lin-4 (umn84), we also conducted a thorough analysis of mature microRNA expression using targeted qPCR and genomic analysis via microRNA sequencing. Both sets of results indicate severely defective upregulation of lin-4 mature microRNA in myrf-1(ju1121).

      No evidence indicates that the 2.4 kb reporter of Plin-4-gfp (maIs134) is an inappropriate reporter for lin-4 transcription.

      maIs134 is originated from the Ambros lab, and to date, there is no single evidence demonstrating that maIs134 cannot be regarded as a reliable transcription reporter for lin-4 expression. The Stec et al. (Curr Biol 2021. PMID: 33357451) paper suggests that the PCE or CEA site (at ~ -2.8 kb) outside the 2.4 kb region confers enhancing effects for lin-4 transcription, but no other published paper has studied lin-4 transcription and cited this finding.

      While the Stec et al. paper provides elaborate mechanistic descriptions, the basic characterization of the importance of CE-A and blmp-1 to lin-4 expression is lacking. Deletion of CE-A in the lin-4 promoter reporter using an Ex array transgene resulted in highly variable reporter expression (Stec, Figure 4D). Notably, two high expression data points indicated that a transgene reporter without CE-A can be highly expressed, suggesting that CE-A is unnecessary for lin-4 transcription. Only when both CE-A and CE-D (within 2.4 kb) were deleted, the reporter expression was significantly decreased. Moreover, deletion of CE-C (proximal region) alone caused severe loss of reporter activity, supporting that proximal CE-C is the essential element, while CE-A is not.

      It is important to note that the effect of CE-A on lin-4 expression has not been analyzed using stable transgenes or genetic deletions in the endogenous lin-4 region. Furthermore, there is no data on how blmp-1 mutants affect the expression of the wild-type lin-4 promoter reporter, CEA deletion reporter, or lin-4 mature microRNA, despite the paper’s main claim that blmp-1 boosts lin-4 expression. While CE-A can confer an enhancing effect in epidermal expression when fused to the gst-5 promoter, there is no data showing that CE-A is sufficient to drive lin-4 transcription by itself.

      In summary, there is currently insufficient evidence to establish whether CE-A is necessary or sufficient for regulating lin-4 expression. In fact, the data presented in Stec et al. (Curr Biol 2021) suggest that CE-A is unnecessary for lin-4 expression. As such, I do not see any reason to consider the 2.4 kb reporter in maIs134 as inappropriate for analyzing lin-4 transcription. Furthermore, our presented data using the knock-in reporter of lin-4 (umn84) demonstrated that its regulation by myrf is essentially consistent with the observations drawn from the maIs134 analysis.

      The Significance of the Finding: MYRF Regulating lin-4 Upregulation

      We are grateful that the editors find our results valuable for those interested in lin-4 expression. However, we acknowledge that the editors may not share the same enthusiasm as we do, seeing this as a landmark discovery in understanding postembryonic development, a fundamental question in the field of developmental biology.

      Importance of Understanding lin-4 Upregulation in Development

      The foundation of developmental biology has been built on the principles derived from studying embryonic development in model organisms like Drosophila, exemplified by the Nobel laureates Lewis, Nusslein-Volhard, and Wieschaus. These principles explain what occurs during embryonic development, including patern formation, morphogenesis, and differentiation. However, these existing principles do not fully explain the phenomena of postembryonic development, including growth. For instance, during C. elegans development in L1, it remains unclear what controls the initiation of P cell division. If we may exclude dividing cells from the discussion, numerous stage-specific changes occur in non-dividing cells, including neurons. The extensive, systematic expression studies of transcription factors in C. elegans have failed to provide evidence that such developmental progression is driven by sequential activation of transcriptional cascades, as commonly observed during embryonic differentiation. A different approach to ask a similar question is to inquire how developmental timing is controlled, e.g., "why does it take a boy 12 years to reach adolescence?" This perspective highlights the need to identify potential unidentified checkpoints that control postembryonic stages (An example of insightful review: The Systemic Control of Growth. Cold Spring Harb Perspect Biol. 2015. PMID: 26261282)

      The upregulation of lin-4 represents a system’s checkpoint during postembryonic development. Deciphering the mechanism controlling lin-4 expression is instrumental in understanding the principles of postembryonic development, even extending to adult development, including life span control.

      Importance of the Finding: MYRF's Control of lin-4 Upregulation

      To date, no other essential, positive regulator of lin-4 transcription has been identified, although several negative regulators have been reported. A landmark paper by Victor Ambros identified FLYWCH as a repressor of lin-4 expression during embryogenesis (PMID: 18794349). FLYWCH mutants fail to progress to normal hatched larvae, implying that FLYWCH is crucial. The paper indeed suggested that FLYWCH has additional functions beyond suppressing lin-4, although these functions have not been thoroughly characterized. The significance of the FLYWCH finding lies in the elaborate control during the transition from embryo to larval development, where lin- 4 is actively suppressed. This control may ensure the robustness of subsequent lin-4 activation. The process during the embryo-to-larvae transition, as well as the counterpart process in mammalian development perinatally, remains poorly understood.

      Another negative regulator of lin-4 is lin-42, as reported in three papers in 2014 (PMID: 25319259; PMID: 24699545; PMID: 25032706). Lin-42 negatively regulates lin-4 expression, despite the main focus of the papers being lin-42's repression of let-7. However, the precise mechanisms by which this repression is achieved are not fully understood.

      Amy Pasquinelli's lab conducted a genome-wide screen to identify factors responsible for driving lin-4 upregulation but did not identify a critical factor that promotes lin-4 transcription (PMID: 20937268).

      In the recent paper by Stec et al. (Curr Biol 2021. PMID: 33357451), they reported blmp-1's role in enhancing lin-4 expression. However, the significance of blmp-1 in regulating lin-4 remains vaguely described, despite a large amount of data describing elaborate epigenetic controls. The paper did not provide data on how endogenous lin-4 expression is affected in blmp-1 mutants, nor did it demonstrate how full-length reporter expression is affected in blmp-1 mutants. The only relevant data appears to be on the CE-A-gst-5 promoter reporter in blmp-1 mutants. As a result, it remains unclear how blmp-1 affects lin-4 transcription.

      In summary, no single factor has been identified, the loss of which leads to significant deficiencies in lin-4 upregulation. MYRF is the first and a critical factor identified in this context. This finding represents a significant advancement in our understanding of lin-4 regulation and its crucial role in development.

    1. Reviewer #1 (Public Review):

      This work describes the induction of SIV-specific NAb responses in rhesus macaques infected with SIVmac239, a neutralization-resistant virus. Typically, host NAb responses are not detected in animals infected with SIVmac239. In this work, seventy SIVmac239-infected macaques were retrospectively screened for NAb responses and a subset of nine animals were identified as NAb-inducers. The viral genomes from 7/9 animals that induced NAb responses were found to encode nonsynonymous mutation in the Nef gene (amino acid G63E). In contrast, Nef G63E mutation was found only in 2/19 NAb non-inducers - implicating that the Nef G63E mutation is selected in NAb inducers. Measurement of Nef G63E frequencies in plasma viruses suggested that Nef G63E selection preceded NAb induction. Nef G63E mutation was found to mediate escape from Nef-specific CD8+ T-cell responses. To examine the functional phenotype of Nef G63E mutant, its effect on downmodulation of Nef-interacting host proteins was examined. Infection of rhesus and cynomolgus macaque CD4+ T cell lines with WT or Nef G63E mutant SIV suggested that Nef mutant reduces S473 phosphorylation of AKT. Using flow cytometry-based proximity ligation assay, it was shown that Nef G63E mutation reduced binding of Nef to PI3K p85/p110 and mTORC2 GβL/mLST8 and MTOR components - kinase complex responsible AKT-S473 phosphorylation. In vitro B-cell Nef invasion and in vivo imaging/flow cytometry-based assays were employed to suggest that Nef from infected cells can target Env-specific B cells. Lastly, it was determined that NAb inducers have significantly higher Env-specific B-cells responses after Nef G63E selection when compared to NAb non-inducers. Finally, a corollary was drawn between the Nef G63E-associated B-cell/NAb induction phenotype and activated PI3K delta syndrome (APDS), which is caused by activating GOF mutations in PI3K, to suggest that Nef G63E-meidated induction of NAb response is reciprocal to APDS.

      Strengths:<br /> This study aims to understand the viral-host interaction that governs NAb induction in SIVmac239-infected macaques - this could enable identification of determinants important for induction of NAb responses against hard-to-neutralize tier-2/3 HIV variants. The finding that SIV-specific B-cell responses are induced following Nef G63E CD8+ T-cell escape mutant selection argue for an evolutionary trade-off between CTL escape and NAb induction. Exploitation of such a cellular-humoral immune axis could be important for HIV/AIDS vaccine efforts.

      Although more validation and mechanistic basis are needed, the corollary between PI3K hyperactive signaling during autoimmune disorders and Nef-mediated abrogated PI3K signaling could help identify novel targets and modalities for targeting immune disorders and viral infections.

      Weaknesses:<br /> Although the paper does have strengths in principle, the weaknesses of the paper are that the mechanistic basis of Nef-mediated induction of NAb responses are not directly examined. For example, it remains unclear whether SIVmac239 with engineered G63E mutation in Nef would induce faster and potent NAb responses. A macaque challenge study is needed to address this point.

      As presented, the central premise of the paper involves infected cell-generated Nef (WT or G63E mutant) being targeted to adjacent Env-specific B cells. However, it remains unclear how this is transfer takes place. A direct evidence demonstrating CD4+ T cell-associated and/or cell-free Nef being transferred to B-cell is needed to address this concern.

      The interaction between Nef and PI3K signaling components (p85, p110, GβL/mLST8, and MTOR) has been explored using PLA assay, however, this requires validation using additional biochemical and/or immunoprecipitation-based approaches. For example, is Nef (WT or mutant form) sufficient to affect PI3K-induced phosphorylation of Akt in an in vitro kinase assay? Moreover, the details regarding the binding events of WT vs mutant Nef with PI3K signaling components is lacking in this study. Lastly, it is unclear whether the interaction of Nef with PI3K signaling components is a conserved function of all primate lentiviruses or is this SIV-specific phenotype.

      It has been previously reported that the region of Nef encoding glycine at position 63 is not conserved in HIV-1 (Schindler et al, Journal of Virology 2004). Thus, does HIV-1 Nef also function in induction of NAb responses in humans? or the observed phenotype specific to SIV?

    1. Reviewer #2 (Public Review):

      While the question of 'are AlphaFold-predicted structures useful for drug design' has largely seen comparisons of AF versus experimental protein structures, this paper takes a less explored (but perhaps more practically important) angle of 'are AlphaFold-predicted structures any better than the previous generation of homology modeling tools' to the protein-ligand (rigid) docking problem. The conclusions of this work will be of largest interest to the audience less familiar with the precision required for successful rigid docking, while the expert crowd might find them obvious, yet a good summary of results previously shown in the literature. Further work, understanding the structural objectives/metrics that should be placed on future AlphaFold-like models for better pose prediction performance, would greatly expand the practicality of the observations made here.

      The main conclusion of the paper, that structural accuracy (expressed as RMSD) of the protein model is not a good predictor of the accuracy the model will show in rigid docking protein-ligand pose prediction, is a good reminder of the well-appreciated need for high-quality side chain placements in docking. The expected phenomenon of AlphaFold predicting 'more apo-like structures' is often discussed in the field, and readers should be cautious about drawing conclusions from the rigid (rather than flexible, as in some previous works) docking done here.

      The authors have very clearly communicated that the use of AlphaFold-generated structures in traditional docking might not be a good idea, and motivated that the time of a molecular designer might be better spent preparing a high-quality homology model. The visual presentation of the conclusions is very clear but might leave the reader wanting a more in-depth discussion of which structural elements of the AF models lead to bad docking outcomes. For example, Fig. 3 presents an example of a very accurate AlphaFold prediction leading to the ligand being docked completely outside of the binding pocket. Close inspection of the Figure suggests a clash of the ligand with the slightly displaced tryptophan residue in the AF model that might be to blame, as can be confirmed by comparison of the model and PDB structure by the reader themselves but has not been discussed by the authors. Only a few examples of the systems used are shown even visually, leaving the reader unable to study more interesting cases in depth without re-doing the work themselves.

      The authors acknowledged that several recent studies exist in this space. They point out two advancements made in their work, worthy of further review. Similarly, it's important to evaluate the novelty of this work's claims vs previously available results, and the diversity of information made available to the reader.

      "First, we use structural models generated without any use of known structures of the target protein. For machine learning methods, this requires ensuring that no structure of the target protein was used to train the method." This is done by limiting the scope of the work to GPCRs whose structures became available only after the training date of AlphaFold (April 30, 2018), as well as not using templates available after that date during prediction. The use of a time limit seems less preferable than the approach taken in Ref. 1 of discarding templates above a sequence identity cutoff. On the other hand, the 'ablation test' performed in Ref. 2 showed no loss in accuracy when no templates were used at all. Authors should discuss in more detail whether these modeling choices could change anything in their conclusions and why they made their choices compared to those in previous work.

      "Second, we perform a systematic comparison that takes into account the variation between experimentally determined structures of the same protein when bound to different ligands." Cross-docking is indeed a more appropriate comparison than self-docking (as done in previous works), and the observation that the accuracy of AF models is similar to that between different holo structures of the same protein is interesting. Previous literature on cross-docking should however be discussed, and the well-known conclusions from it that small variations in side-chain positions, in otherwise highly similar structures, can lead to large changes in docked poses. It is important to realize that AlphaFold models are 'just another structure' - if previous literature is sufficient to show the sensitivity of rigid docking, doing it again on AF structures does not add to our understanding. Further, Ref. 3 might have already addressed the question of correlation between binding site RMSD and docking pose prediction accuracy - see e.g. Supplementary Figure 3 there (also Figure S15 in Ref. 2).

      Further, the authors should discuss the commonly brought up problem of AlphaFold generating 'more apo-like structures' - are the models used here actually 'holo-like' because of the low RMSDs? (what RMSD differences are to be expected between apo and holo structures of these systems?) How are the volumes of the pockets affected? The position on this problem taken by previous works is worth mentioning - "much higher rmsd values are found when using the AF2 models (...), which reflect the difficulties in performing docking into apo-like structures" in Ref. 1 and "computational model structures were predicted without consideration of binding ligands and resulted in apo structures" in Ref. 2.

      Because of this 'apo problem', Ref. 2 assumed that rigid docking (as done here) would not succeed and used flexible docking where "two sidechains at the binding site were set to be flexible". In fact, the reader of this new paper will be left to wonder if it is not simply presenting a subset of the results already seen in Ref. 2, where "the success ratios dropped significantly for them because misoriented sidechains prevented a ligand from docking (Figure S14)". While this conclusion is not made as clear in Ref. 2 as it is here, a comparison of Figures 4 and S14 there will lead the reader to the same conclusion, and more -- that flexible docking meaningfully improves the performance of AF models, and more so than homology models.

      Finally, certain data analyses present in previous works but not here should be necessary to make this work more informative to the readers:<br /> a) Consideration of multiple top poses, e.g., in Ref. 2, Figures 4 and S14 mentioned before, comparison of success rates in top 1 and top 3 docked poses add much context.<br /> b) Notes on the structural features preventing successful docking, see e.g., in Ref. 1, Table 2 or in Ref. 4, Tables 2 and 4.

      This work has the potential to become an important piece of the puzzle, if deeper insights into the reasons for AF model failures are drawn by the authors. These could include a discussion of the problematic structural elements (clashes of side chain with ligands, missing interactions/waters, etc.), potential solutions with some preliminary data (flexible docking, softening interactions, etc.), or proposals for metrics better than RMSD to score the soundness of pockets generated by AF for docking.

      References:<br /> 1. Díaz-Rovira, A. M., Martín, H., Beuming, T., Díaz, L., Guallar, V., & Ray, S. S. (2023). Are Deep Learning Structural Models Sufficiently Accurate for Virtual Screening? Application of Docking Algorithms to AlphaFold2 Predicted Structures. Journal of Chemical Information and Modeling, 63(6), 1668-1674. https://doi.org/10.1021/acs.jcim.2c01270<br /> 2. Heo, L., & Feig, M. (2022). Multi-state modeling of G-protein coupled receptors at experimental accuracy. Proteins: Structure, Function, and Bioinformatics, 90(11), 1873-1885. https://doi.org/10.1002/prot.26382<br /> 3. Beuming, T., & Sherman, W. (2012). Current assessment of docking into GPCR crystal structures and homology models: Successes, challenges, and guidelines. Journal of Chemical Information and Modeling, 52(12), 3263-3277. https://doi.org/10.1021/ci300411b<br /> 4. Scardino, V., Di Filippo, J. I., & Cavasotto, C. (2022). How good are AlphaFold models for docking-based virtual screening? [Preprint]. Chemistry. https://doi.org/10.26434/chemrxiv-2022-sgj8c

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The manuscript describes that simultaneous inhibition of LOXL2 and BRD4 reduces proliferation of TNBC in vitro and reduces growth in vivo.

      This observation is followed by extensive mechanistic studies that suggest physical interaction between LOXL2 and short isoform of BRD4-MED1. Inferences from Chip-seq analyses suggest that this interaction is involved in regulation of multiple transcriptional programs. Authors focus on differential activation of DREAM complex, to claim that this interaction "is fundamental for proliferation of TNBC". The manuscript is very well written and mechanistic inferences are based on a set of sophisticated epigenetic analyses and bioinformatical inferences. The phenotypic effects from LoxL2 inhibition by itself, or in combination with BRD4 inhibition are relatively modest. These modest effects, as well as many of the reported changes in gene expression are clearly inconsistent with the frequently used adjectives as "dramatic", "fundamental", "deeply affected", "drastically hampered" etc. Given the modest phenotypic effects, many of the key claims and conclusions are not supported by the data.

      We thank the reviewer for appreciating our work, defining the manuscript as well-written, and saying that it comprises extensive mechanistic studies as well as sophisticated epigenetic analysis.

      We apologize if some of our statements seemed exaggerated. In this revised version, we revisited some of our conclusion to moderate them.

      Moreover, we took the reviewer's criticism as an opportunity to strengthen our findings. In the revised version of the manuscript, we included an additional TNBC PDX (PDX-127), and results from this experiment clearly reinforce our claims (Fig. 6D and Fig. EV9E-F). In this new in vivo experiment, we selected a PDX model in which the expression of BRD4L is not detectable, while BRD4S is clearly expressed. Therefore, the treatment with JQ1 would specifically affect the activity of BRD4S, making the treatment selective. Additionally, we reduced by half the dose of JQ1 administrated to limit the effect of BRD4S inhibition alone on tumor growth. The combinatorial treatment (JQ1+PXS) induced a clear superior effect in this setting as compared with single-agent treatments. In addition to this, we discarded that the observed growth reduction is not the result of the sole inhibition of LOXL2, which could affect FAK/Src activity or extracellular Collagen crosslinking. In conclusion, our data show that the combinatorial inhibition of LOXL2 and BRD4S is effective in reducing tumor proliferation in TNBC in vivo models, independently of the inhibition of BRD4S and of other pathways known to be regulated by LOXL2.

      Specifically:

      1) It is unclear why authors generalize their conclusions to TNBC. Figure 1B demonstrates synergy for 1/3 cell lines, which is chosen for the follow up study. Even for MDA231, the synergy is confined to low concentrations of BRD4i (S1c). While MDA231 cell line is frequently used in experimental studies of TNBC, it is quite dissimilar to majority of clinical TNBC, and contains mutant RAS, which is rare in this disease.

      The synergistic effect is observed in MDA-MB-231 cells because only this cell line expresses both BRD4S and LOXL2. Indeed, in Fig. 1C we show that MDA-MB-468 cells do not express LOXL2, while BT549 only express minimal BRD4 levels.

      To corroborate this hypothesis, in the revised version of the manuscript we added:

      1. A new cell line (Cal51) expressing the same LOXL2 and BRD4 levels (Fig. EV8C) but showing greater resistance to JQ1 than MDA-MB-231 (Fig. EV8D). Also, in this cell line, we could show that the combinatorial treatment had a superior effect on cell viability than the single agents’ treatment (Fig. EV8E).
      2. A western blot panel of different TNBC PDXs shows that the majority of them express medium to high levels of both BRD4S and LOXL2 proteins, as is the case of MDA-MB-231 (Fig. EV9E) and Cal51 (Fig. EV8C). This result suggests that the combinatorial treatment could be used in the majority of TNBC patients as they are expected to express both BRD4S and LOXL2.
      3. Finally, as explained above, we performed another in vivo choosing a PDX that expresses BRD4S (but not BRD4L) and LOXL2 (PDX-127) (Fig. 6D and Fig. EV9E-F). Also, in this new model, we could observe that the combinatorial inhibition had a superior effect than single treatments.

        2) In vivo, the effect appears to be modest even in the MDA231 model, selected for evidence of synergy in vitro. In vivo, the combination appears to have an additive effect. Tumor growth rates are reduced, but no shrinkage is occurring. In the PDX model, LOXL2i does not have an effect as a monotherapy, while modestly enhancing the impact of BRD4i. These results are at odds with the claim of the interaction being fundamental for proliferation.

      We agree with the reviewer that the combinatorial inhibition appears to have an additive effect in vivo using the MDA-MB-231 model.

      1. For that reason, we have now performed the in vivo PDX experiment mentioned above (PDX-127; Fig. 6D and Fig. EV9E-F) in which we decreased the dose of JQ1 by half to avoid strong tumor growth effect due to BRD4 inhibition alone. In this new experiment, the synergistic effect is evident. While single-agent treatment showed a very moderate effect (0% or 20% tumor growth reduction for LOXL2 and JQ1, respectively), the combinatorial treatment showed a 50% reduction in tumor volume, further supporting our conclusions.
      2. We also performed either BRD4 or MED1 pull-down experiments in the presence of PXS and JQ1. We show that upon PXS treatment, the interaction between LOXL2 and BRD4S is maintained while the interaction with MED1 is reduced (Fig. 5A-C). However, in the presence of JQ1, the interaction between LOXL2 and MED1 is maintained while BRD4S-LOXL2 and BRD4S-MED1 interactions are impaired (Fig. 5D-F). These new results explain why monotherapy does not have a sufficient effect in vivo and set the rationale for the use of the combinatorial treatment. We believe that these new results corroborate our initial findings and we hope to have been able to satisfy the reviewer comments.

      3) No analysis of cell proliferation was shown in vivo. Authors should have performed BrdU or KI67 staining to support the claim. For in vitro analyses, authors also used indirect assays for proliferation. PI staining by itself does not have sufficient resolution to clearly capture modest effects that authors demonstrate. BrdU-PI double staining would have been much more useful.

      We appreciate the reviewer’s comment. In the revised manuscript we have added Ki67 and H3S10p staining in the tumor samples for the new in vivo PDX experiment (Fig. 6E and Fig. EV10A-C). We show that the combinatorial treatment significantly induces a reduction of both proliferation markers, which is in agreement with a reduced tumor volume. Regarding the in vitro analysis, we did not only use PI staining to show a reduced proliferation state but also H3S10p staining (Fig. 4B) and an SLBP1 fluorescent reporter MDA-MB-231 cell line (Fig. 4D, Fig. EV6B, E, and Movie EV). In the revised version of the manuscript, we included a new FACS-PI analysis (Fig. 4A, C) to better represent the effects we see on the cell cycle.

      Minor points:

      Dose dependent decrease in phosphorylated H3 is not at all obvious from eyeballing the data in S1A; the only effect that I see is a modest reduction at the highest concentration of the inhibitor. Authors need to quantify the results to support the claim.

      We agree with the reviewer and we apologize for the misinterpretation. We have changed the revised manuscript as follows: “The selective LOXL2 inhibitor PXS-538224 (hereafter, PXS) efficiently reduced the levels of oxidized histone H3 (H3K4ox) in MDA-MB-231 cells at 40 μM (Fig. EV6C), indicating an efficient inhibition of LOXL2 catalytic activity in the nucleus.”

      Most of breast cancer cell lines are derived from metastatic disease, including pleural effusion, thus the point that because MDA231 cell line is derived from pleural effusion, it is metastatic does not have sufficient logical foundation.

      Many publications have shown the high metastatic capacity of MDA-MB-231 (e.g. https://doi.org/10.1016/j.bbabio.2011.04.015, doi: 10.1038/s41467-017-01829-1), which are therefore used as TNBC metastatic model. The scope of the analysis reported in Fig. 6C was just to show whether any of the used treatments could reduce the metastatic capacity of this cell line. We believe we do not overstate the results but just report them as they are.

      How is loss of cell-cell junction in vitro consistent with LOXL2 role in modulating ECM? There is no evidence of ECM production in MDA231 in vitro. On the other hand, this loss is associated with EMT.

      We thank the reviewer for identifying this mistake. In the revised manuscript we changed the text as follows: “Gene set enrichment analysis (GSEA) revealed that LOXL2 KD induced upregulation of processes involved in cell morphology, secretion, membrane trafficking, and cell differentiation, with cell-cell junction being one of the most significantly affected pathways (Fig. EV5E). These results agree with the role of LOXL2 in regulating epithelial-to-mesenchymal transition, corroborating the high quality of our dataset.”

      Reviewer #1 (Significance (Required)):

      Discovery and characterization of LOXL2-BRD4 interaction is advancing the ever-deepening understanding of molecular mechanisms of regulation of gene expression. The studies and analyses appear to be sufficiently rigorous and reported with clarity, and the claimed discovery of the biological interaction between LOXL2 and BRD4 is well supported. However, given the magnitude of the reported (rather than claimed) effects of this interaction, and concerns about generalizability of authors conclusions, it is not clear how these results are promising for the development of new therapies in TNBC. Moreover, in contrast to luminal BC, there is no clear evidence for utility of cytostatic drugs in constraining TNBC. Therefore, biological and clinical significance of the authors discovery is unclear and claims in this regard appear to be overblown

      We thank the reviewer for stating that our analysis is rigorous and reported with clarity. We really took the criticisms as an opportunity to strengthen our findings, as explained above.

      For the newly presented in vivo PDX model, we performed immunohistochemistry of Ki67, H3S10p and Cleaved Caspase 3 to check whether the reduction of tumor volume observed in the combinatorial treatment was a result of a cytotoxic and/or a cytostatic effect (Fig. 6E and Fig. EV10A-C). As shown in the figure, the combination of the two inhibitors induced a superior decrease of Ki67, H3S10p, and a clear increase of Cleaved Caspase 3. Therefore, these new data indicate that the combinatorial treatment does not only have a cytostatic effect but also cytotoxic, suggesting a clinical exploitability for the treatment of TNBC patients.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In their study, Pascual-Reguant et al. show that combined inhibition of BRD4 and LOXL2 can synergize to restrict triple-negative breast cancer (TNBC) proliferation. BRD4 and LOXL2 are transcription regulators that can read and write epigenetic information, respectively. The authors employ three distinct breast cancer cell lines and mouse models with cell line-derived xenografts, and they show that combined inhibition of BRD4 and LOXL2 can be superior to single BRD4/LOXL2 inhibition in these model systems. In an attempt to identify a connection between BRD4 and LOXL2, the authors find that the two proteins can bind to each other. The authors performed most of the experiments in the breast cancer cell line MDA-MB-231. To assess the impact of LOXL2-inhibition on transcription, the authors assessed changes of the transcriptome in MDA-MB-231 cells following LOXL2 knockdown. They found that genes related to cell differentiation and morphology were upregulated, while genes related to the cell cycle were downregulated. ChIP-seq data of BRD4 showed that BRD4 can bind to cell cycle gene promoters and that this binding was enhanced upon loss of LOXL2. The authors found that LOXL2 and BRD4 interacted with the transcriptional cell cycle regulators B-MYB, FOXM1, and LIN9, which are components of the MYB-MuvB-FOXM1 (MMB-FOXM1) complex that is known to promote the expression of late cell cycle genes with important functions during mitosis. The authors conclude that LOXL2/BRD4 interact with each other and with the MMB-FOXM1 complex to drive the expression of cell cycle genes and cell proliferations. Vice versa, they conclude that inhibition of LOXL2/BRD4 reduces cell proliferation through inhibiting the expression of cell cycle genes.

      Major:

      • The data and methods are presented well. The experiments are adequately replicated and analyzed. However, except for the first section, all experiments were performed using only one cell line. It is important to validate key findings in at least a second cell line.

      We thank the reviewer for valuing our work.

      To address the reviewer’s comment, in the revised manuscript we added an additional cell line (Cal-51), that expresses similar levels of LOXL2 and BRD4 as compared to MDA-MB-231 (Fig. EV8C). Even though this cell line is clearly more resistant to JQ1 than the MDA-MB-231 cell line (Fig. EV8D), the combinatorial treatment is significantly more effective as compared with single agents’ treatment (Fig. EV8E).

      Moreover, we have also performed an additional in vivo experiment using another TNBC PDX (PDX-127) that expresses LOXL2 and BRD4S, but not BRD4L. Given that JQ1 can inhibit both BRD4 isoforms, this in vivo system allowed us to demonstrate that the tumor antiproliferative capacity of the combinatorial treatment is due to the simultaneous inhibition of LOXL2 and BRD4S (rather than BRD4S and L) (Fig. 6D and Fig. EV9E-F).

      • There appears to be a misunderstanding of the concept of cell cycle-dependent gene regulation by the DREAM complex and its related factors. Early (G1/S) cell cycle genes contain E2F promoter motifs, while late (G2/M) cell cycle genes contain CHR promoter motifs. The DREAM complex can bind both, while RB-E2F and MuvB recognize only E2F and CHR motifs, respectively. B-MYB and FOXM1 bind to MuvB and regulate late cell cycle genes, but they do not bind to early cell cycle genes. Given this concept, the authors' rationale to connect BRD4/LOXL2 through MuvB/B-MYB/FOXM1 with E2F promoter sequences and early cell cycle genes and the subsequent conclusions must be corrected.

      We thank the reviewer for their expert explanation. We corrected our conclusion in the revised version of the manuscript following the reviewer’s comment.

      • I felt that the suggested functional connection between LOXL2/BRD4 and DREAM is not strongly supported by the authors' data. Figure S6E: A similarity score of Fig. EV6E: We agree with the reviewer that a similarity score of Fig. 4E: We thank the reviewer for this comment. The performed pulldown showed that BRD4S, LOXL2, and MED1 interact with Lin9 and B-Myb, but not with FOXM1, thus FOXM1 itself is an internal negative control of the pulldown. Additionally, BRD4L does not show the same interaction pattern as BRD4S, LOXL2, and MED1, again acting as an internal negative control. We, therefore, believe that the pulldown is properly controlled and that the observed interaction is trustful. We furthermore agree with the reviewer that it would be interesting to characterize the interactions between the DREAM complex and BRD4S, LOXL2, and MED1. However, we believe that the dissection of these interactions at the mechanistic levels would require a deeper study, which can be a project in itself that we aim to explore in the future. For example, it would be interesting to investigate whether either the inhibition or the downregulation of LOXL2 and/or BRD4S specifically impairs the formation of the DREAM complex or the recruitment of specific DREAM complex subunits, as well as how these effects impair the DREAM complex chromatin binding. We are afraid that the suggested pulldowns would not be sufficient to answer these questions, which would require extensive cross-interaction studies in either BRD4/LOXL2 and BRD4+LOXL2 inhibition or downregulation followed by ChIP-seq and transcriptomics for all the conditions. We believe that the provided data, together with the functional characterization (both, in vitro and in vivo), of the phenotypes triggered by BRD4S and LOXL2 inhibition make a strong case for our manuscript and leave out of scope the suggested experiments. We hope the reviewer will understand our explanation and will appreciate that we are planning to pursue this further in the future.

      Fig. 3: We thank the reviewer for this important comment. The ChIP-seq technique very often does not provide exhaustive results due to sequencing depth limits and antibody performance. We believe that the fraction of DREAM target genes found in our dataset as bound by BRD4S is not exhaustive and that the analysis proposed by the reviewer would not lead to clear conclusive results. However, we understand the importance of verifying that DREAM target genes whose promoter is bound by BRD4 are indeed downregulated when LOXL2 is inhibited. To give an answer to this question, in the revised manuscript we added gene expression analysis of selected DREAM target genes upon treatment with JQ1, PXS their combination. We could successfully show that both JQ1 and PXS treatment impairs the transcription of the selected DREAM target genes, however, the combinatorial treatment almost shut down their expression, in agreement with our hypothesis (Fig. 5J).

      • The authors state that it is surprising to find that LOXL2 can promote target gene transcription because it is rather known as a transcriptional repressor. To this point, the authors should perform standard analyses using their RNA-seq and ChIP-seq data. Compare differential expression of genes that are bound by BRD4S/L/S+L and genes not bound by BRD4. Perform motif search and enrichment analyses for transcription factor and co-factor binding data (public ChIP-seq repositories). Such analyses may suggest what gene sets are up- and downregulated by LOXL2 through BRD4S/L and what other factors could be involved in LOXL2-dependent up- and downregulation of gene transcription.

      We thank the reviewer for this valuable comment that certainly provides the rationale for a follow-up project. However, we believe that the proposed study goes beyond the scope of our work at this moment.

      Minor:

      • I felt that background information on the BRD4 isoforms was missing. The short and long isoforms of BRD4 should be introduced briefly.

      We agree with the reviewer. In the revised manuscript, we addressed this by presenting BRD4 isoforms in the introduction part of the manuscript.

      • Given that BRD4 inhibition is known to activate p53 (e.g., PMID 23317504 and 33431824) and p21 (PMID 31265875), the authors should discuss the p53 status of their cell lines (largely mutant). In general, I felt that the authors could better cite and discuss the current literature on BRD4 and LOXL2.

      We appreciate the comment of the reviewer regarding p53. Given the fact that p53 is mutant in MDA-MB-231, we believe that the proliferation defect observed with the combinatorial treatment may be due to the activation of alternative cytostatic or cytotoxic signaling cascades, independently of P53 activation. We have now briefly mentioned this point in the manuscript discussion.

      • It was unclear to me why the authors did not actually test experimentally whether their predicted interaction models 2 or 4 are likely true (Figure 2E+G).

      We understand the reviewer’s comment. The fact that JQ1 treatment almost abrogates the interaction between LOXL2 and BRD4S strongly suggests that models 1 and 3 are likely wrong, therefore pointing towards models 2 and 4 as the correct ones. To test whether models 2 and 4 are indeed the correct models we are now performing extensive mutagenesis studies, which are producing preliminary results suggesting indeed that models 2 and 4 are correct. The reason why we did not include this study in the current manuscript, is that we started a parallel line of investigation aimed at identifying residues fundamental for the interaction that can be exploited in compound screening campaigns to identify molecules able to block the described interaction and thus cancer proliferation. Publishing these preliminary results at this stage could jeopardize the drug discovery campaign and we hope that the reviewer will understand our constraints.

      • The transcription of cell cycle genes depends on the cell cycle (i.e., reduced cell cycle entry correlates with reduced cell cycle gene expression). Given that the authors showed LOXL2 inhibition reduce MDA-MB-231 cell proliferation, they should note that reduced expression of cell cycle-related genes is expected upon LOXL2 knockdown.

      We understand the reviewer’s comment. We believe that we provide sufficient data supporting our hypothesis that LOXL2 controls the expression of cell cycle genes at the transcriptional level together with BRD4S. In addition, the sole inhibition of LOXL2 has practically no effect on tumor proliferation in vivo but largely enhances the antiproliferative effect of low-dose JQ1 (Fig. 6D). We hope these clarifications would satisfy the reviewer.

      • The authors specify in their discussion that their data show a function of LOXL2/BRD4 in the cell cycle interphase, while there were no experiments that support that specific conclusion. At least it is unclear to me why the authors rule out a function in mitosis?

      We thank the reviewer for this comment. We referred to interphase genes because these are the early cell cycle genes, while mitotic genes are the late ones. We do not discard a possible function for BRD4S and LOX2 regulating mitotic progression, however, we believe this would be a consequence of dysregulated G1-S-G2 gene expression, rather than a direct transcriptional effect. This conclusion derives from the fact that while we observe interactions between LOXL2, BRD4S, and MED1 with Lin9 and B-Myb, these are not fully conserved with FOXM1, which is typically required for the transcription of mitotic genes. To avoid confusion, we have now anyway removed the word “interphase” from the text.

      • I felt that the first part of the manuscript (combination of BRD4 and LOXL2 inhibitors in TNBC) was a bit uncoupled from the functional studies on LOXL2 and its connection to BRD4. The transition between these parts and the final discussion on why the joint control of cell cycle genes by LOXL2/BRD4 may be important for the synergistic effect of LOXL2/BRD4 inhibitors. To this point, the authors' model was not clear to me.

      We really appreciate the reviewer’s comment. To better connect the functional studies with the clinical significance of the proposed combinatorial treatment, we restructured the manuscript. In the revised version, the use of the combinatorial treatment is shown in Figure 6. Moreover, to better explain why we focused all the studies on BRD4 and LOXL2, we also included data from the Cancer Cell Line Encyclopedia (CCLE)-associated chemotherapeutics sensitivity (Fig. 1A and Fig. EV1) showing that LOXL2 expression levels can predict the response to BRD4 inhibition, suggesting a functional interaction between BRD4 and LOXL2 and the possibility to exploit it for therapeutical purposes. We believe that these data set the rationale to further explore the connection between LOXL2 and BRD4, both at the mechanistic and functional levels.

      Reviewer #2 (Significance (Required)):

      The study by Pascual-Reguant et al. shows that inhibitors of BRD4 and LOXL2 can be combined to achieve better efficacy in reducing proliferation of breast cancer cell lines and breast tumor growth in xenograft models. They provide strong evidence for a functional interaction between LOXL2 and BRD4 and investigate their common transcriptional targets. Intriguingly, some evidence points towards a direct regulation of the DREAM complex and its cell cycle gene targets.

      The findings are novel and can be the basis for further research on TNBC combination therapy using BRD4 and LOXL2 inhibitors. The link to the DREAM complex is preliminary.

      The study is of interest for a basic research audience with some translational aspects.

      I reviewed this manuscript as a researcher in gene regulatory mechanisms, with cell cycle genes as one focus area. I have no expertise in the computational modeling of protein-protein interactions and I am no expert for breast cancer.

      We thank the reviewer for the positive comments. We also would really like to thank the reviewer for their criticism, which, we believe, contributed to a new and improved manuscript version.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this manuscript, Laura Pascual-Reguant et al. identified a novel role of the LOXL2 oxidase in sustaining cell cycle progression through a so far uncharacterized gene-activating function is mediated by the BRD4S epigenetic reader and exerted on key DREAM-target genes in TNBC. Moreover, the authors showed that combinatorial treatment of TNBC with LOXL2- and BRD4-specific inhibitors result in a tremendous anti-tumorigenic effect. For all findings, they leveraged in vitro and in vivo settings as well as high-throughput sequencing approaches. However, the following points should be addressed and explained.

      Major points:

      -The authors on their working hypothesis propose that dual inhibition of BRD4 and LOXL2 is a novel strategy for curing TNBC. For my taste, just because both targets are quite promising for TNBC, the jump to this combinatorial treatment is kind of abrupt. Knowing the difficulty and time-/financial- investment, authors could optionally perform a mass spectrometry analysis on nuclei lysates with LOXL2 pull down to identify physical interactors. Due to the augmented resources and analysis of raw data, authors may necessitate a generous revision period (approx. 4 months for starters). By that, this can provide a more unbiased approached to look at nucleus-specific gene-regulatory functions and particularly at epigenetic readers. It would be also interesting to see if LOXL2 interacts with other members of the BRD family. Selecting BRD4 and no other members of the bromodomain family cannot be the only choice given that other BRD members can also interact with several of these mediator subunits.

      We thank the reviewer for the suggestion and we agree with the fact that the rationale for combining BRD4 and LOXL2 inhibitors was not sufficiently argued in the first version of the manuscript. For that reason, in the revised manuscript, we added new data to explain why we explored this topic. In particular, to better explain why we focused all the studies on BRD4 and LOXL2, we included data from the Cancer Cell Line Encyclopedia (CCLE)-associated chemotherapeutics sensitivity (Fig. 1A and Fig. EV1) showing that LOXL2 expression levels can predict the response to BRD4 inhibition (but not to other approved chemotherapeutic drug), suggesting a functional interaction between BRD4 and LOXL2 and the possibility to exploit it for therapeutical purposes. Moreover, we restructured the manuscript to make the story more linear, explaining first the functionality of BRD4S-LOXL2 interaction at the molecular and cellular levels, and then presenting the in vivo systems in the last part of the manuscript.

      We agree with the reviewer that it may be interesting to explore whether LOXL2 interacts with other BRD family members. However, given the prominent role of BRD4 in promoting cancer proliferation, we believe that understanding the relevance of BRD4S-LOXL2 interaction in TNBC is, per se, of great interest and provide a novel mechanistic understanding of how TNBC proliferation is controlled at the transcription level. In the specific case of TNBC, it has been shown that BRD4S has an oncogenic effect, while BRD4L is an oncosuppressor. In the manuscript, we now showed that LOXL2 downregulation sensitizes cells to JQ1 treatment (Fig. 1D). Additionally, while the downregulation of BRD4L does not have any additional effect on cell treated with PXS, the downregulation of BRD4S sensitize them to LOXL2 inhibition (Fig. EV8B). These results, once again, indicate the relevance of studying the functional interaction between BRD4S and LOXL2.

      -LOXL enzymes have been shown to promote collagen and fibronectin assembly, thereby sustaining the pro-survival effect of the ITG5A/FN1/FAK/SRC signaling cascade and shielding TNBC cells against chemotherapy treatment (32415208). Did authors observe if LOXL2 loss or inhibition decreased the active status of FAK and SRC, which are well known to promote G1-S transition (25381661)?

      Probably the cell cycle defects upon LOXL2 loss may also partially arise from the impairment of this cascade.

      We really appreciate the reviewer’s suggestions. In the revised version of the manuscript, we checked FAK and Src activation status in tumor samples from one of our in vivo experiments (Fig. EV10D). We did not observe any difference in phospho-FAK or phospho-Src upon treatment either with PXS, JQ1, or their combinations, suggesting that alterations in the activity of these factors were not driving the observed proliferation defects.

      -Authors exclusively use JQ1 as a BRD4 inhibitor. As JQ1 may have an unspecific effect on BRD2 as well, authors should consider reproducing key experiments with siControl- and siBRD4-treated cells and increasing doses of PSX as well as repeating the JQ1 dose response assay in Figure 1B using siRNA-mediated silencing of LOXL2. Given that both players are part of the same complex, silencing of one and inhibition of the other should sensitize cells compared to their control counterparts.

      We agree with the reviewer and we addressed this comment in the revised manuscript. In particular, we have added two additional experiments:

      • We transduced MDA-MB-231 cells with isoform-specific shBRD4s (shBRD4L and shBRD4S) (Fig. EV5H) and checked cell sensitivity to PXS treatment (Fig. EV8B). As explained also above, we observed that only when the short isoform of BRD4 was downregulated cells displayed higher sensitivity to PXS treatment. This result corroborates that BRD4S and LOXL2 are required for TNBC proliferation.

      • We transduced MDA-MB-231 cells with shLOXL2 and assessed JQ1 sensitivity (Fig. 1D). We showed that upon LOXL2 downregulation, cells became more sensitive to JQ1 treatment, again corroborating the fact that TNBC proliferation requires BRD4S and LOXL2.

      -Moreover, in Figures 1G and S3D the differential sensitivity of low and high LOXL2 cell lines is unclear. Do authors know if any of these growth kinetic lines represent one of the tested cell lines in Figure 1A-B? Authors should provide respective legends. In addition, authors should take advantage of their homemade data given that they have already selected a panel of TNBC cell lines with various LOXL2 expression at basal state (Figure 1A) for which dose response assays have been performed (Figure 1B). Therefore, I would perform an IC50 graph for JQ1 (without PSX treatment) using the existing data from Figure 1B.

      We apologize if our representation was confusing. In the revised manuscript we have changed the sensitivity plots (Fig. 1A and Fig. EV1) to make them easier to grasp. Additionally, in Figure 1A we included the analysis of CCLE cell lines stratified based on their LOXL2 expression levels. This analysis showed that LOXL2 expression levels could overall predict the response to BETi treatment. As suggested by the reviewer, we also plotted the IC50 of the 3 cell lines tested. However, their JQ1 sensitivity curves did not show any difference that could be attributed to their different LOXL2 levels. Our speculation is that only 3 cell lines do not provide a sufficient size to reach a meaningful conclusion, which, in contrast, can be achieved by comparing the CCLE BETi sensitivity.

      -In Figure 2D, the pull-down assay is inconclusive, as the molecular weight for each construct is not mentioned. I would probably add this information also in all performed western blots. Also, the overexpression of the BD1/BD2-mutated and especially the BD1/BD2-lacking construct is unclear if it still interacts with LOXL2, probably because of the lack of molecular weight reference of each band. Therefore, the authors should make this pull-down assay more descriptive regarding the size of the bands. Also, BD1 mutagenesis at N140 was shown to dislodge the binding of JQ1 to BRD4 (24497639), which implies that BD1 mutagenesis or overexpression of the BD1-deficient construct should abrogate the interaction of LOXL2 with BRD4, reminiscent to the abrogated interaction of BRD4/LOXL2 upon JQ1 that binds to both BDs (Figure 2F). And, what happens if a BD2-deficient construct is expressed?

      We thank the reviewer for spotting this distraction. We apologize for this and in the revised version of the manuscript we included molecular weights for all western blots.

      We acknowledge that BD1 mutagenesis displaces JQ1 binding, however, we respectfully disagree that because of this BD1-N140 mutant should not bind to LOXL2. Our docking analysis indeed showed that none of the poses is impaired either by BD1 or BD2 mutagenesis (Fig. EV4D). The fact that JQ1 disrupts the interaction between BRD4S and LOXL2 (Fig. 2F, G) is not due to the fact that they compete for the same binding residue, but rather for the space occupied by JQ1 inside the AcK binding pocket of either BD1 or BD2, which impedes proper binding to LOXL2. Our pulldown data indeed showed that mutant BD1 and BD2 retain the ability to bind to LOXL2 (Fig. 2C), as predicted by the docking.

      We did not try to express constructs either lacking BD1 or BD2 and we cannot speculate what could happen to the BRD4S-LOXL2 interaction in this scenario. Even though this experiment could help dissect the interaction between LOXL2 and BRD4S, we decided to rather perform mutagenesis of specific residues that have been predicted to be important for the interaction. The reason why we did not include this study in the current manuscript, is that we started a parallel line of investigation aimed at identifying residues fundamental for the interaction that can be exploited in compound screening campaigns to identify molecules able to block the described interaction and thus cancer proliferation. Publishing these preliminary results at this stage could jeopardize the drug discovery campaign and we hope that the reviewer will understand our constraints.

      -If authors support that BRD4S is the predominant isoform driving the expression of DREAM-targets, this means that DREAM-targets are mainly bound by BRD4S, relying on Figure 3E-F. However, based on the author's ChIPseq tracks in Figure 3H, DREAM targets such as EZH2 and HMGB2 are co-occupied by both BRD4 isoforms at the basal state on their promoter region. Also, especially for EZH2 and PLK4, authors should set to 'group auto-scale' both conditions in a smaller scale range for ChIPseq- and RNAseq tracks, although I do not these two genes as good candidates representing your analysis. Therefore, authors should initially show all genes (e.g in a table format) that enrich the 'DREAM-targets' signature and select for a greater panel of genes (like for AURKB and HMGB2) demonstrating a preferential occupancy of the BRD4S at their promoter region. Finally, authors are recommended to perform a ChIP-qPCR on these genomic regions at basal state (no LOXL2 silencing) to validate the predominant occupancy of BRD4S and the low/absent occupancy of BRD4L at these genomic sites.

      We apologize for the confusion. To make the figure more understandable, we now scaled all the panels to the same scale and highlighted in grey the promoter region of each selected DREAM target gene. As the reviewer can appreciate, none of these genes is bound by BRD4L in basal conditions (Fig. 3F).

      To better characterize the differential binding, following the reviewer’s suggestion, we performed ChIP-qPCR using Ab2 (which recognizes both BRD4 isoforms), in cells either downregulated for BRD4L or BRD4S with isoform-specific shRNAs (Fig. EV5H). Results showed that only the downregulation of BRD4S reduced the binding of Ab2 to the promoter of the selected DREAM target genes (Fig. 3D), corroborating our hypothesis and validating our ChIPseq strategy.

      -Authors in Figure 3G should select an equal-sized population of randomly chosen non-DREAM-target genes, otherwise, the comparison of log2FC difference between these two gene cohorts is unreliable and difficult to make. Mann-Whitney test should also be performed.

      We thank the reviewer for this suggestion, which was added to the revised version of the manuscript (Fig. 3E, lower panel).

      -Authors should repeat the cell cycle analysis (Figure 4A) as the number of cells subjected to flow cytometry is quite discrepant between the conditions. Also, it is not clear if the experiment was performed in at least biological triplicates (although in the respective legend, it is stated so). If performed in biological triplicates, authors should make a new graph where each cell cycle phase cell population differs between the two conditions. Moreover, the difference in cell cycle defects in LOXL2-inhibited cells (Figure 4C) is indifferent compared to their control counterpart. Therefore, authors should address these inconsistencies.

      We thank the reviewer for the suggestion. In the revised version of the manuscript, we represent the cell cycle also as a bar plot with statistical analysis (Fig. 4A, C). Even though the number of cells was the same across conditions, the sub-G1 population of the LOXL2 KD cells may have distorted the profile of the cell cycle. To avoid misinterpretations, we repeated the analysis in the revised version of the manuscript. Statistical analysis supports that LOXL2 inhibition or downregulation has a significant effect on cell cycle progression (Fig. 4A, C, right panel).

      -Furthermore, authors should explain what was the rational selecting a mediator subunit and specifically MED1 as a possible interacting partner of LOXL2 and BRD4s since MED12 and MED24 were also highly essential (Figure 4F).

      We selected MED1 as a Mediator Complex proxy. In our essentiality analysis MED 1, 9, 10, 12, 15, 16, 19, 23, 24, 25 score as significant, suggesting a functional interaction between LOXL2 and the Mediator Complex, rather than a specific subunit. MED1 has been previously described as a BRD4 partner and it is often used in immunofluorescence to visualize transcriptional foci, which made it the best candidate for follow-up study in our project.

      -Moreover, do authors also observe this functional relationship of LOXL2 and BRD4S in cell cycle progression in other breast cancer subtypes presenting a high proliferation index e.g HER2+?

      Presumably, the author's proposed mechanism applies to a wide panel of breast cancer entities, for which, only key experiments could be performed.

      We thank the reviewer for the suggestion. We hypothesized that other cancer types expressing LOXL2 and BRD4S could also benefit from the combinatorial treatment. Indeed, the CCLE drug sensitivity panel in Fig. 1A comprises cancer cell lines of different origins, not just TNBC, and corroborates that the relationship between LOXL2 expression levels and BRD4 sensitivity exist also beyond TNBC. Even though it is important to experimentally verify this hypothesis, we decided to pursue it in the future to broaden the applicability of the proposed strategy in preclinical settings.

      -Authors in Figure 5H represent LOXL2 and BRD4s as integral chromatin looping factors together with MED1 at promoter and enhancer regions. However, this illustration is an overrepresentation of their finding because authors did not address the differential occupancy of BRD4S upon LOXL2 loss in DREAM-target-specific enhancer regions. If they wish to do so, they may use the RANK ORDERING OF SUPER-ENHANCERS (ROSE) package to call for super-enhancer regions in the proximity of DREAM-targets and confirm similar results as for their TSS-proximal sites.

      We thank the reviewer for the useful suggestion. In the new version of the manuscript, we have simplified the representation, which now does not show super-enhancers. However, following the reviewer’s suggestion, we performed super enhancer analysis using ROSE. Results showed that BRD4S binds to super-enhancers more than BRD4L, including DREAM target gene super-enhancers. Additionally, while LOXL2 KD did not alter the binding of LOXL2 to DREAM target gene super-enhancers, it decreased the binding of BRD4S to them (Fig. EV7D, E). Overall, these data are in agreement with our hypothesis that BRD4S together with LOXL2 controls the expression of DREAM target genes.

      -In the current manuscript, authors did not address the translational relevance of their proposed mechanism in the context of conventional therapies. Knowing that several BRD-specific compounds currently undergo clinical trials, authors should address if LOXL2 low (MDAMB468) and high (BT549) cells demonstrate a differential sensitivity to increasing doses of chemotherapy, in the presence or absence of BRD4. By doing that, LOXL2 apart from being a therapeutic target could be also used as a prognostic marker to stratify patients and achieve better response to standard therapies.

      We really appreciate the reviewer’s suggestion and we think this is a fundamental point. In the new version of the manuscript, we have performed further analysis using a greater panel of chemotherapeutic agents from the CCLE sensitivity database. We now show that LOXL2 low-expressing cells show significantly more sensitivity to BETi treatments, but not to conventional chemotherapeutic agents (e.g. doxorubicin, Olaparib, 5-fluorouracil, paclitaxel, etc.) (Fig. 1A and Fig. EV1), which set the rationale to further explore the functional relationship between BRD4 and LOXL2.

      Minor points:

      -In Figure 1D, the authors should convert the y-axis to a logarithmic scale to better represent the differences between JQ1, PXS, and combo. Also, One-way Anova should be performed between JQ1, PXS and combo.

      We don’t understand the reviewer’s suggestion since Fig. 1D (Fig. 6B, right panel in the revised version) is a tumor picture for which the y-axis cannot be converted to a logarithmic scale.

      -In Figure S6F, authors did not show the sensitivity of LOXL2 low and high cell lines for BRD4 KO. If LOXL2-proficient cells are less sensitive to JQ1, based on Figure 1B, authors should consider showing something similar from the gene essentiality database.

      We agree with the reviewer and we apologize for this mistake. We have included the sensitivity of LOXL2 low and high cell lines for BRD4 KO and also for MYC KO (Fig. EV6G).

      -Authors failed to discuss the work from Ozge Saatci et al (PMID: 32415208) regarding LOXL2 in TNBC and ECM reorganization as well as in other cancer entities (PMID: 35428659) in the context of ECM remodeling. Authors should realize that these published works and the current ones are not conflicting but complement each other.

      We thank the reviewer for the suggestion. In the revised version of the manuscript, we discussed this work.

      Reviewer #3 (Significance (Required)):

      SIGNIFICANCE

      The conception and findings are of enlightening significance for TNBC therapy, especially given the lack of targeted therapies in this particularly aggressive breast cancer subtype. Hence, I posit this work as highly relevant for the cancer epigenetics research community interested in characterizing unknown factors that facilitate the gene-activating function of epigenetic readers in health and disease.

      My field of expertise is to uncover epigenetic vulnerabilities responsible for transcriptional plasticity driving drug tolerance in aggressive forms of breast cancer.

      We would like to take the opportunity to thank the reviewer for the relevant suggestions. We strongly believe the revised version of the manuscript has been substantially improved by addressing the comments the reviewer made.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In their study, Pascual-Reguant et al. show that combined inhibition of BRD4 and LOXL2 can synergize to restrict triple-negative breast cancer (TNBC) proliferation. BRD4 and LOXL2 are transcription regulators that can read and write epigenetic information, respectively. The authors employ three distinct breast cancer cell lines and mouse models with cell line-derived xenografts, and they show that combined inhibition of BRD4 and LOXL2 can be superior to single BRD4/LOXL2 inhibition in these model systems. In an attempt to identify a connection between BRD4 and LOXL2, the authors find that the two proteins can bind to each other. The authors performed most of the experiments in the breast cancer cell line MDA-MB-231. To assess the impact of LOXL2-inhibition on transcription, the authors assessed changes of the transcriptome in MDA-MB-231 cells following LOXL2 knockdown. They found that genes related to cell differentiation and morphology were upregulated, while genes related to the cell cycle were downregulated. ChIP-seq data of BRD4 showed that BRD4 can bind to cell cycle gene promoters and that this binding was enhanced upon loss of LOXL2. The authors found that LOXL2 and BRD4 interacted with the transcriptional cell cycle regulators B-MYB, FOXM1, and LIN9, which are components of the MYB-MuvB-FOXM1 (MMB-FOXM1) complex that is known to promote the expression of late cell cycle genes with important functions during mitosis. The authors conclude that LOXL2/BRD4 interact with each other and with the MMB-FOXM1 complex to drive the expression of cell cycle genes and cell proliferations. Vice versa, they conclude that inhibition of LOXL2/BRD4 reduces cell proliferation through inhibiting the expression of cell cycle genes.

      Major:

      • The data and methods are presented well. The experiments are adequately replicated and analyzed. However, except for the first section, all experiments were performed using only one cell line. It is important to validate key findings in at least a second cell line.
      • There appears to be a misunderstanding of the concept of cell cycle-dependent gene regulation by the DREAM complex and its related factors. Early (G1/S) cell cycle genes contain E2F promoter motifs, while late (G2/M) cell cycle genes contain CHR promoter motifs. The DREAM complex can bind both, while RB-E2F and MuvB recognize only E2F and CHR motifs, respectively. B-MYB and FOXM1 bind to MuvB and regulate late cell cycle genes, but they do not bind to early cell cycle genes. Given this concept, the authors' rationale to connect BRD4/LOXL2 through MuvB/B-MYB/FOXM1 with E2F promoter sequences and early cell cycle genes and the subsequent conclusions must be corrected.
      • I felt that the suggested functional connection between LOXL2/BRD4 and DREAM is not strongly supported by the authors' data. Figure S6E: A similarity score of <0.7 is poor support for a 'consensus E2F sequence' and indicates very limited specificity. Figure 4E: IP with BRD4 and LOXL2 is missing as important control. A chromatin-binding control is missing that does not bind to DREAM/LOXL2/BRD4. To test for binding to the actual DREAM complex, the authors should include E2F4 and p130 in their IPs and western blots, perhaps following LOXL2 inhibition/knockdown. Figure 3: The authors' ChIP-seq data indicate that only a fraction of DREAM targets is bound by BRD4. To provide more evidence that LOXL2/BRD4 may be directly involved in regulating DREAM targets, the authors should compare the differential regulation of BRD4-bound DREAM targets upon LOXL2 knockdown with DREAM targets which are not bound by BRD4. If LOXL2/BRD4 acted in a direct manner on those targets, one would expect that loss of LOXL2 affected their transcription more strongly than the other DREAM targets which are affected only indirectly. Such an analysis can be performed readily using the available data.
      • The authors state that it is surprising to find that LOXL2 can promote target gene transcription because it is rather known as a transcriptional repressor. To this point, the authors should perform standard analyses using their RNA-seq and ChIP-seq data. Compare differential expression of genes that are bound by BRD4S/L/S+L and genes not bound by BRD4. Perform motif search and enrichment analyses for transcription factor and co-factor binding data (public ChIP-seq repositories). Such analyses may suggest what gene sets are up- and downregulated by LOXL2 through BRD4S/L and what other factors could be involved in LOXL2-dependent up- and downregulation of gene transcription.

      Minor:

      • I felt that background information on the BRD4 isoforms was missing. The short and long isoforms of BRD4 should be introduced briefly.
      • Given that BRD4 inhibition is known to activate p53 (e.g., PMID 23317504 and 33431824) and p21 (PMID 31265875), the authors should discuss the p53 status of their cell lines (largely mutant). In general, I felt that the authors could better cite and discuss the current literature on BRD4 and LOXL2.
      • It was unclear to me why the authors did not actually test experimentally whether their predicted interaction models 2 or 4 are likely true (Figure 2E+G).
      • The transcription of cell cycle genes depends on the cell cycle (i.e., reduced cell cycle entry correlates with reduced cell cycle gene expression). Given that the authors showed LOXL2 inhibition reduce MDA-MB-231 cell proliferation, they should note that reduced expression of cell cycle-related genes is expected upon LOXL2 knockdown.
      • The authors specify in their discussion that their data show a function of LOXL2/BRD4 in the cell cycle interphase, while there were no experiments that support that specific conclusion. At least it is unclear to me why the authors rule out a function in mitosis?
      • I felt that the first part of the manuscript (combination of BRD4 and LOXL2 inhibitors in TNBC) was a bit uncoupled from the functional studies on LOXL2 and its connection to BRD4. The transition between these parts and the final discussion on why the joint control of cell cycle genes by LOXL2/BRD4 may be important for the synergistic effect of LOXL2/BRD4 inhibitors. To this point, the authors' model was not clear to me.

      Significance

      The study by Pascual-Reguant et al. shows that inhibitors of BRD4 and LOXL2 can be combined to achieve better efficacy in reducing proliferation of breast cancer cell lines and breast tumor growth in xenograft models. They provide strong evidence for a functional interaction between LOXL2 and BRD4 and investigate their common transcriptional targets. Intriguingly, some evidence points towards a direct regulation of the DREAM complex and its cell cycle gene targets.

      The findings are novel and can be the basis for further research on TNBC combination therapy using BRD4 and LOXL2 inhibitors. The link to the DREAM complex is preliminary.

      The study is of interest for a basic research audience with some translational aspects.

      I reviewed this manuscript as a researcher in gene regulatory mechanisms, with cell cycle genes as one focus area. I have no expertise in the computational modeling of protein-protein interactions and I am no expert for breast cancer.