7,181 Matching Annotations
  1. Oct 2022
    1. Underlining Keyterms and Index Bloat .t3_y1akec._2FCtq-QzlfuN-SwVMUZMM3 { --postTitle-VisitedLinkColor: #9b9b9b; --postTitleLink-VisitedLinkColor: #9b9b9b; --postBodyLink-VisitedLinkColor: #989898; }

      Hello u/sscheper,

      Let me start by thanking you for introducing me to Zettelkasten. I have been writing notes for a week now and it's great that I'm able to retain more info and relate pieces of knowledge better through this method.

      I recently came to notice that there is redundancy in my index entries.

      I have two entries for Number Line. I have two branches in my Math category that deals with arithmetic, and so far I have "Addition" and "Subtraction". In those two branches I talk about visualizing ways of doing that, and both of those make use of and underline the term Number Line. So now the two entries in my index are "Number Line (Under Addition)" and "Number Line (Under Subtraction)". In those notes I elaborate how exactly each operation is done on a number line and the insights that can be derived from it. If this continues, I will have Number Line entries for "Multiplication" and "Division". I will also have to point to these entries if I want to link a main note for "Number Line".

      Is this alright? Am I underlining appropriately? When do I not underline keyterms? I know that I do these to increase my chances of relating to those notes when I get to reach the concept of Number Lines as I go through the index but I feel like I'm overdoing it, and it's probably bloating it.

      I get "Communication (under Info. Theory): '4212/1'" in the beginning because that is one aspect of Communication itself. But for something like the number line, it's very closely associated with arithmetic operations, and maybe I need to rethink how I populate my index.

      Presuming, since you're here, that you're creating a more Luhmann-esque inspired zettelkasten as opposed to the commonplace book (and usually more heavily indexed) inspired version, here are some things to think about:<br /> - Aren't your various versions of number line card behind each other or at least very near each other within your system to begin with? (And if not, why not?) If they are, then you can get away with indexing only one and know that the others will automatically be nearby in the tree. <br /> - Rather than indexing each, why not cross-index the cards themselves (if they happen to be far away from each other) so that the link to Number Line (Subtraction) appears on Number Line (Addition) and vice-versa? As long as you can find one, you'll be able to find them all, if necessary.

      If you look at Luhmann's online example index, you'll see that each index term only has one or two cross references, in part because future/new ideas close to the first one will naturally be installed close to the first instance. You won't find thousands of index entries in his system for things like "sociology" or "systems theory" because there would be so many that the index term would be useless. Instead, over time, he built huge blocks of cards on these topics and was thus able to focus more on the narrow/niche topics, which is usually where you're going to be doing most of your direct (and interesting) work.

      Your case sounds, and I see it with many, is that your thinking process is going from the bottom up, but that you're attempting to wedge it into a top down process and create an artificial hierarchy based on it. Resist this urge. Approaching things after-the-fact, we might place information theory as a sub-category of mathematics with overlaps in physics, engineering, computer science, and even the humanities in areas like sociology, psychology, and anthropology, but where you put your work on it may depend on your approach. If you're a physicist, you'll center it within your physics work and then branch out from there. You'd then have some of the psychology related parts of information theory and communications branching off of your physics work, but who cares if it's there and not in a dramatically separate section with the top level labeled humanities? It's all interdisciplinary anyway, so don't worry and place things closest in your system to where you think they fit for you and your work. If you had five different people studying information theory who were respectively a physicist, a mathematician, a computer scientist, an engineer, and an anthropologist, they could ostensibly have all the same material on their cards, but the branching structures and locations of them all would be dramatically different and unique, if nothing else based on the time ordered way in which they came across all the distinct pieces. This is fine. You're building this for yourself, not for a mass public that will be using the Dewey Decimal System to track it all down—researchers and librarians can do that on behalf of your estate. (Of course, if you're a musician, it bears noting that you'd be totally fine building your information theory section within the area of "bands" as a subsection on "The Bandwagon". 😁)

      If you overthink things and attempt to keep them too separate in their own prefigured categorical bins, you might, for example, have "chocolate" filed historically under the Olmec and might have "peanut butter" filed with Marcellus Gilmore Edson under chemistry or pharmacy. If you're a professional pastry chef this could be devastating as it will be much harder for the true "foodie" in your zettelkasten to creatively and more serendipitously link the two together to make peanut butter cups, something which may have otherwise fallen out much more quickly and easily if you'd taken a multi-disciplinary (bottom up) and certainly more natural approach to begin with. (Apologies for the length and potential overreach on your context here, but my two line response expanded because of other lines of thought I've been working on, and it was just easier for me to continue on writing while I had the "muse". Rather than edit it back down, I'll leave it as it may be of potential use to others coming with no context at all. In other words, consider most of this response a selfish one for me and my own slip box than as responsive to the OP.)

    1. Author Response

      Reviewer #1 (Public Review):

      Figures 2 through 6. There is no description of the relationship between the findings and the anatomical location of the electrodes (other than distal versus local). Perhaps the non-uniform distribution of electrodes makes these analyses more complicated and such questions might have minimal if any statistical power. But how should we think about the claims in Figures 2-6 in relationship to the hippocampus, amygdala, entorhinal cortex, and parahippocampal gyrus? As one example question out of many, is Figure 2C revealing results for local pairs in all medial temporal lobe areas or any one area in particular? I won't spell out every single anatomical question. But essentially every figure is associated with an anatomical question that is not described in the results.

      To address the reviewer’s point we now report the distribution of spike-LFP pairs across anatomical regions for each Figure 2-6. The results split by anatomical regions are reported in Figure 2 – figure supplement 7, Figure 3 – figure supplement 7, Figure 4 – figure supplement 1, Figure 5 – figure supplement 2, and Figure 6 – figure supplement 3. We also calculated a non-parametric Kruskal-Wallis Test to statistically examine the effect of anatomical regions on the results shown in each figure. Generally, these new results show that the effects are similar across regions, apart from two exceptions (i.e. Figure 4 – supplement 1; and Figure 5 – supplement 2). However, we would like to stress that these results should be taken with a huge grain of salt because the electrodes were not evenly distributed across regions (i.e. ~75% of observations pertain to the hippocampus), and patients as the reviewer correctly points out. This leads to sometimes very low numbers of observations per region and it is difficult to disentangle whether any apparent differences are driven by regional differences, or differences between patients. Detailed results are reported below.

      Manuscript lines 207-212: “In the above analysis all MTL regions were pooled together to allow for sufficient statistical power. Results separated by anatomical region are reported in Figure 2 – figure supplement 7 for the interested reader. However, these results should be interpreted with caution because electrodes were not evenly distributed across regions and patients making it difficult to disentangle whether any apparent differences are driven by actual anatomical differences, or idiosyncratic differences between patients.”

      Manuscript lines 255-258: “Finally, we report the distal spike-LFP results separated by anatomical region in Figure 3 – figure supplement 7, which did not reveal any apparent differences in the memory related modulation of theta spike-LFP coupling between regions.”

      Manuscript lines 264-266: “PSI results separated by anatomical regions are reported in Figure 4 – figure supplement 1, which revealed that the PSI results were mostly driven by within regional coupling.”

      Manuscript lines 399-303: “We also analyzed whether the memory-dependent effects of cross-frequency coupling differ between anatomical regions (see Figure 5 – figure supplement 2). This analysis revealed that the results were mostly driven by the hippocampus, however we urge caution in interpreting this effect due to the large sampling imbalance across regions.”

      Manuscript lines 343-346: “As for the above analysis we also investigated any apparent differences in co-firing between anatomical regions. These results are reported in Figure 6 – figure supplement 3 and show that the earlier co-firing for hits compared to misses was approximately equivalent across regions.”

      Figure 1

      1A. I assume that image positions are randomized during a cued recall?

      Yes, that was the case. We now added that information in the methods section.

      Manuscript lines 526: “Image positions on the screen were randomized for each trial.”

      What was the correlation between subjects' indication of how many images they thought they remembered and their actual performance?

      We did not log how many images the patients thought they remembered. Specifically, if the patients answered that they remembered at least one image, then they were shown the selection screen where they could select the appropriate images. Therefore, we cannot perform this analysis. We report this now in the methods section. However, albeit interesting, the results of such an analysis would not affect the main conclusions of our manuscript.

      Manuscript lines 523-524: “The experimental script did not log how many images the patient indicated that they thought to remember.”

      1B. Chance is shown for hits but not misses. I assume that hits are defined as both images correct and misses as either 0 or 1 image correct. Then a chance for misses is 1-chance for hits = 5/6. It would be nice to mark this in the figure.

      Done as suggested (see Figure 1).

      The authors report that both incorrect was 11.9%. By chance, both incorrect should be the same as both correct, hence also 1/6 probability, hence the probability of both incorrect seems quite close to chance levels, right?

      Yes, that is correct, however, across sessions the proportion of full misses (i.e. both incorrect) was significantly below chance (t(39)=-1.9214; p<0.05). Nevertheless, the proportion of fully forgotten trials appears to be higher than expected purely by chance. This is likely driven by a tendency of participants to either fully remember an episode, or completely forget it, as demonstrated previously in behavioural work (Joensen et al., 2020; JEP Gen.). We report this now in the manuscript.

      Manuscript lines 132-136: “Across sessions the proportion of full misses (i.e. both incorrect) was significantly below chance (t39=-1.92; p<0.05). However, the proportion of fully forgotten trials appears to be higher than expected purely by chance. This is likely driven by a tendency of participants to either fully remember an episode, or completely forget it, as demonstrated previously in behavioral work (25).”

      1C. How does the number of electrodes relate to the number of units recorded in each area?

      The distribution of neurons per region is shown in the new Figure 1D (see above). It approximately matches the distribution of electrodes per region, except for the Amygdala where slightly more neurons where recorded. This is because of one patient (P08) who had two electrodes in the left and right Amygdala and who contributed at lot of sessions (i.e. 9 sessions, comparing to an average of 4.44 per patient).

      Line 152. The authors state that neural firing during encoding was not modulated by memory for the time window of interest. This is slightly surprising given that other studies have shown a correlation between firing rates and memory performance (see Zheng et al Nature Neuroscience 2022 for a recent example). The task here is different from those in other studies, but is there any speculation as to potential differences? What makes firing rates during encoding correlate with subsequent memory in one task and not in another? And why is the interval from 2-3 seconds more interesting than the intervals after 3 seconds where the authors do report changes in firing rates associated with subsequent performance? Is there any reason to think that the interval from 2-3 seconds is where memories are encoded as opposed to the interval after 3 seconds?

      Zheng et al. used a movie-based memory paradigm where they manipulated transitions between scenes to identify event cells and boundary cells. They show that boundary cells, which made up 7.24% of all recorded MTL cells, but not event cells (6.2% of all MTL cells) modulate their firing rate around an event depending on later memory. There are quite a few differences between Zheng et al’s study and our study that need to be considered. Most importantly, we did not perform a complex movie-based memory paradigm as in Zheng et al. and therefore cannot identify boundary cells, which would be expected to show the memory dependent firing rate modulation. This alone could contribute to the fact that no significant differences in firing rates in the first second following stimulus onset were observed. Such an absence of a difference of neural firing depending on later memory is not unprecedented. In their seminal paper, Rutishauser et al. (2010; Nature) report no significant differences in firing rates (0-1 seconds after stimulus onset, which is similar to our 2-3 seconds time window) between later remembered or later forgotten images. This finding is also in line to Jutras & Buffalo (2009; J Neurosci) who also show no significant difference in firing rates of hippocampal neurons during encoding of remembered and forgotten images.

      The 2-3 seconds time interval, which corresponds to 0-1 seconds after the onset of the two associate images, is special because it marks the earliest time point where memory formation can start, therefore allowing us to investigate these very early neural processes that set the stage for later memory-forming processes. While speculative, these early processes likely capture the initial sweep of information transfer into the MTL memory system which arguably is reflected in the timing of spikes relative to LFPs. It is conceivable that these initial network dynamics reflect attentional processes, which act as a gate keeper to the hippocampus (Moscovitch, 2008; Can J Exp Psychol) and thereby set the stage for later memory forming processes. This interpretation would be consistent with studies in macaques showing that attention increases spike-LFP coupling, whilst not affecting firing rates (Fries et al., 2004; Science). We modified the discussion section to address this issue.

      Manuscript lines 468-474: “Interestingly, these early modulations of neural synchronization by memory encoding were observed in the absence of modulations of firing rates, which is consistent with previous results in humans (16) and macaques (12), but contrasts with (43). Studies in macaques showed that attention increases spike-LFP coupling whilst not affecting firing rates (44). It is therefore conceivable that these initial network dynamics reflect attentional processes, which act as a gate keeper to the hippocampus and thereby set the stage for later memory forming processes (45).”

      Lines 154-157 and relationship to the subsequent analyses. These lines mention in passing differences in power in low-frequency bands and high-frequency bands. To what extent are subsequent results (especially Figures 3 and 4) related to this observation? That is, are the changes in spike-field coherence, correlated with, or perhaps even dictated by, the changes in power in the corresponding frequency bands?

      To address this question we repeated the analysis that we performed for SFC for Power in those channels whose LFP was locally coupled to spikes in gamma, and distally coupled to spikes in theta. Furthermore, we correlated the difference in peak frequency between hits and misses between Power and SFC. If power would dictate the effects seen in SFC then we would expect similar effects of memory in power as in SFC, that is an increase of peak frequency for hits compared to misses for gamma and theta. Furthermore, we would expect to find a correlation between the peak frequency differences in power and SFC. None of these scenarios were confirmed by the data. These results are now reported in Figure 2 – figure supplement 5 for gamma, and Figure 3 – figure supplement 5 for theta.

      Manuscript lines 195-199: “We also tested whether a similar shift in peak gamma frequency as observed for spike-LFP coupling is present in LFP power, and whether memory-related differences in peak gamma spike-LFP are correlated with differences in peak gamma power (Figure 2 – figure supplement 5). Both analyses showed no effects, suggesting that the effects in spike-LFP coupling were not coupled to, or driven by similar changes in LFP power.”

      Manuscript lines 248-253: “As for gamma, we also tested whether a similar shift in peak theta frequency is present in LFP power, and whether there is a correlation between the memory-related differences in peak theta spike-LFP and peak theta power (Figure 3 – figure supplement 5). Both analyses showed no effects, suggesting that the effects in spike-LFP coupling were not coupled to, or driven by similar changes in LFP power.”

      Do local interactions include spike-field coherence measurements from the same microwire (i.e., spikes and LFPs from the same microwire)?

      Yes, they do. Out of the 53 local spike-SFC couplings found for the gamma frequency range, 11 (20.75%) were from pairs where the spikes and LFPs were measured on the same microwire. We assume that the reviewer is asking this question because of a concern that spike interpolation may introduce artifacts which may influence the spectrograms and consequently the spike-LFP coupling measures. This was also pointed out by Reviewer #2. To address this concern, we split the data based on whether the spike and LFP providing channels were the same or different. The results show that (i) the spectrogram of SFC is highly similar between the two datasets, with a prominent gamma peak present in both and no significant differences between the two; (ii) restricting the analysis to those data where the LFP and spike providing channels are different replicated the main finding of faster gamma peak frequencies for hits compared to misses; and (iii) limiting the SFC analysis further to only ‘silent’ channels, i.e. channels where no SUA/MUA activity was present at all also replicated the main finding of faster gamma peak frequencies for hits compared to misses.

      These analyses suggest that the SFC results were not driven by spike interpolation artefacts.

      Manuscript lines 199-203: “To rule out concerns about possible artifacts introduced by spike interpolation we repeated the above analysis for spike-LFP pairs where the spike and LFP providing channels are the same or different, and for ‘silent’ LFP channels (i.e. channels were no SUA/MUA activity was detected (see Figure 2 – figure supplement 6). “

      60 Hz. It has always troubled me deeply when results peak at 60 Hz. This is seen in multiple places in the manuscript; e.g., Figures 2B, 2E. What are the odds that engineers choosing the frequency for AC currents would choose the exact same frequency that evolution dictated for interactions of brain signals? This is certainly not the only study that reports interesting observations peaking at 60 Hz. One strong line of argument to suggest that this is not line noise is the difference between conditions. For example, in Figure 2B, there is a difference between local and distal interactions. It is hard for me to imagine why line noise would reveal any such difference. Still ...

      The frequency for AC currents in Europe is 50 Hz, not 60 Hz as in the US. Therefore, our SFC effects are well outside the range of the notch.

      Figure 6. I was very excited about Figure 6, which is one of the most novel aspects of this study. In addition to the anatomical questions about this figure noted above, I would like to know more. What is the width of the Gaussian envelope?

      The width of the Gaussian Window used in the original results was 25ms. We chose this time window because in our view it represents a good balance between integrating over a long-enough time window and thus allowing for some jitter in neural firing between pairs of neurons, whilst still being temporally specific. Finding the right balance here is not trivial because a too short time window underestimates co-firing, and a too long time window may not provide the temporal specificity necessary to detect co-firing lags (Cohen & Kohn, 2011; Nat Neurosci). To test whether this choice critically affected our results, we repeated the analysis for different window sizes, i.e. 15, 35, and 45 ms. The results show that the pattern of results did not change, with hits showing earlier peaks in co-firing compared to misses. Critically, the difference in co-firing peaks was significant for all window sizes, except for the shortest one which presumably is due to the increase in noise because of the smaller time window over which spikes are integrated. These issues are now mentioned in the methods section, and the results for the different window sizes are reported in Figure 6 – figure supplement 4.

      Manuscript lines 346-347: “The co-firing analyses were replicated with different smoothing parameters (see Figure 6 – figure supplement 4).”

      Manuscript lines 894-898: “We chose this time window because it should represent a good balance between integrating over a long-enough time window and thus allowing for some jitter in neural firing between pairs of neurons, whilst still being temporally specific (57). To test whether this choice critically affected our results, we repeated the analysis for different window sizes, i.e. 15, 35, and 45 ms (see Figure 6 – figure supplement 4).”

      Are these units on the same or different microwires?

      All units used for the analysis shown in Figure 6 come from different microwires. This was naturally the case because the putative up-stream neuron was distally coupled to the theta LFP, and the putative down-stream neuron was locally coupled to gamma at this same theta LFP electrode. This information is listed in Figure 6 – source data 1 which lists the locations and electrode IDs for all neuron pairs shown in figure 6.

      How do the spike latencies reported here depend on the firing rates of the two units?

      To address this question we first tested whether firing rates (averaged across the putative up-stream and down-stream neurons) differ between hits and misses. If they do, this would be suggestive of a dependency of the spike latency differences between hits and misses on firing rates. No such difference was observed (p>0.3). Second, we correlated the differences between hits and misses in Co-firing peak latencies with the differences in firing rates. Again, no significant correlation was observed (R=-0.06; p>0.7), suggesting that firing rates had no influence on the observed differences in co-firing latencies. These control analyses are now reported in the main text.

      Manuscript lines 347-350: “No significant differences in firing rates between hits and misses were found (p>0.3), and on correlations between firing rates and the co-firing latencies were obtained (R=-0.06; p>0.7), suggesting that firing rates had no influence on the observed co-firing differences between hits and misses.”

      What do these results look like for other pairs that are not putative upstream/downstream pairs?

      As we reported in the original manuscript in lines 352-355 we did not find a memory dependent effect on co-firing latencies if we select neuron pairs solely on the basis of distal theta SFC. Within this analysis the distally theta coupled neuron would be the up-stream neuron and the neuron recorded at the site where the theta LFP is coupled would be the down-stream neuron. This null-result suggests that in order for the memory dependent difference in co-firing lags to emerge, the down-stream neurons need to be coupled to a local gamma rhythm in order for the memory effect on co-firing latencies to emerge. However, within this previous analysis there is still a notion of up-stream and down-stream neurons because neuron pairs were selected based on distal theta phase coupling. We therefore repeated this analysis for all pairs of neurons in a completely unconstrained fashion such that all possible pairs of neurons that were recorded from different electrodes were entered into the co-firing analysis. This analysis also revealed no difference in co-firing lags, neither for positive lags nor for negative lags. Instead, what this analysis showed is tendency for hits to show a higher occurrence of simultaneous or near simultaneous firing, which is in line with Hebbian learning. These results are now reported in Figure 6 – figure supplement 1.

      Manuscript lines 333-335: “In addition, a completely unconstrained co-firing analysis where all pairs possible pairings of units were considered also showed no systematic difference in co-firing lags between hits and misses (Figure 6 – figure supplement 1).”

      Reviewer #2 (Public Review):

      Roux et al. investigated the temporal relationship between spike field coherence (SFC) of locally and distally coupled units in the hippocampus of epilepsy patients to successful and unsuccessful memory encoding and retrieval. They show that SFC to faster theta and gamma oscillations accompany hits (successful memory encoding and retrieval) and that the timing of the SFC between local and distal units for hits comports well with synaptic plasticity rules. The task and data analyses appear to be rigorously done.

      Strengths: The manuscript extends previous work in the human medial temporal lobe which has shown that greater SFC accompanies improved memory strength. The cross-regional analyses are interesting and necessary to invoke plasticity mechanisms. They deploy a number of contemporary analyses to disentangle the question they are addressing. Furthermore, their analyses address limitations or confound that can arise from various sources like sample size, firing rates, and signal processing issues.

      Weaknesses:

      Methodological:

      The SFC coherence measures are dependent in part on extracting LFPs derived from the same or potentially other electrodes that are contaminated by spikes, as well as multiunit activity. In the methods, they cite a spike removal approach. Firstly, the incomplete removal or substitution of a signal with a signal that has a semblance to what might have been there if no spike was present can introduce broadband signal time-locked to the spike and create spurious SFC. Can the authors confirm that such an artifact is not present in their analyses? Secondly, how did they deal with the removal of the multiunit activity? It would be suspected that the removal of such activity in light of refractory period violation might be more difficult than well-isolated units, and introduce artifacts and broadband power, again which would spuriously elevate SFC. Conversely, the lack of removal of multiunit activity would seem to for a surety introduce significant broadband power. One way around this might be that since it is uncommon to have units on all 8 of the BF microwires, to exclude the microwire(s) with the units when extracting the LFP to avoid the need to perform spike removal.

      The reviewer raises a valid concern which we address as follows. Firstly, an artefact introduced into SFC by linear interpolation would be a problem for those local SFCs where the spike providing channel and the LFP providing channel are identical. Out of the 53 local spike-SFC couplings found for the gamma frequency range, only 11 (20.75%) were from pairs where the spikes and LFPs come from the identical microwire. It is unlikely that this minority of data would have driven the results. Furthermore, it is unlikely that the interpolation would introduce a frequency shift of SFC that is memory dependent, because the interpolation is more likely to cause a general increase in broadband SFC (as opposed to having a frequency band specific effect). However, to address this concern, we split the data based on whether the spike and LFP providing channels were the same or different. The results show that (i) the spectrogram of SFC is highly similar between the two datasets, with a prominent gamma peak present in both and no significant differences between the two; (ii) restricting the analysis to those data where the LFP and spike providing channels are different replicated the main finding of faster gamma peak frequencies for hits compared to misses.

      Secondly, we followed the reviewer’s suggestion and repeated the SFC analysis for ‘silent’ microwires, i.e. microwires where no single or multi-units were detected. This analysis replicated the same memory effects as observed in the analysis with all microwires. Specifically, we found an increase in the local gamma peak SFC frequency for hits compared to misses, as well as an increase in distal theta peak SFC frequency for hits compared to misses. These results are reported in the main manuscript and in Figure 2 – figure supplement 6 for gamma, and figure 3 – figure supplement 6 for theta.

      Manuscript lines 199-203: “To rule out concerns about possible artifacts introduced by spike interpolation we repeated the above analysis for spike-LFP pairs where the spike and LFP providing channels are the same or different, and for ‘silent’ LFP channels (i.e. channels were no SUA/MUA activity was detected (see Figure 2 – figure supplement 6).”

      Manuscript lines 253-255: “We also repeated the above analysis for spike-LFP pairs by only using ‘silent’ LFP channels (i.e. channels were no SUA/MUA activity was detected (see Figure 3 – figure supplement 6) to address possible concerns about artefacts introduced by spike interpolation.”

      In a number of analyses the spike train is convolved with a Gaussian in places with a window length of 250ms and in others 25ms. It is suspected that windows of varying lengths would induce "oscillations" of different frequencies, and would thus generate results biased towards the window length used. Can the authors justify their choices where these values are used, and/or provide some sensitivity analyses to show that the results are somewhat independent of the window length of the Gaussian used to convolve with the times series.

      The different choices in window length for the Gaussian convolution reflect the different needs of the two analyses where these convolutions were applied. In one analysis we wanted to get a smooth estimate of spike densities that we can average across trials, similar to a peri-stimulus spike histogram. For this analysis we used a window length of 250 ms which we found appropriate to yield a good balance between retaining smooth time courses whilst still being temporally sensitive. Importantly, for the statistical analysis of the firing rates, spike densities were averaged in much larger time windows than 250 ms (i.e. 1 – 2 seconds) therefore our choice of window length for spike densities would not have any bearing on the averaged firing rate analysis.

      In the other analysis, which is more central for our manuscript, we used a cross-correlation between spike trains to estimate co-firing lags in the range of milliseconds. Therefore, this analysis necessitated a much higher temporal precision. We used a Gaussian Window with a width of 25ms because it represents a good balance between integrating over a long-enough time window and thus allowing for some jitter in neural firing between pairs of neurons, whilst still being temporally specific. Finding the right balance here is not trivial because a too short time window would be prone to noise and underestimates co-firing, whereas a too long time window may not provide the temporal specificity necessary to detect co-firing lags (Cohen and Kohn, 2013; Nat Neurosci). To test whether this choice critically affected our results, we repeated the analysis for different window sizes, i.e. 15, 35, and 45 ms. The results show that the basic pattern of results did not change, with hits showing earlier peaks in co-firing compared to misses. Critically, the difference in co-firing peaks was significant for all window sizes, except for the shortest one which is likely due to the increase in noise because of the smaller time window over which spikes are integrated. These issues are now mentioned in the methods section, and the results for the different window sizes are reported in Figure 6 – figure supplement 4.

      Manuscript lines 346-347: “The co-firing analyses were replicated with different smoothing parameters (see Figure 6 – figure supplement 4).”

      Manuscript lines 894-898: “We chose this time window because it should represent a good balance between integrating over a long-enough time window and thus allowing for some jitter in neural firing between pairs of neurons, whilst still being temporally specific (57). To test whether this choice critically affected our results, we repeated the analysis for different window sizes, i.e. 15, 35, and 45 ms (see Figure 6 – figure supplement 4).”

      Conceptual:

      The co-firing analyses are very interesting and novel. In table S1 are listed locally and distally coupled neurons. There are some pairs for example where the distally coupled neuron is in EC and the downstream one in the hippo, and then there is a pair that is the opposite of this (dist: hippo, local EC). There appear to be a number of such "reversal", despite the delay between these two regions one would assume them to be similar in sign and magnitude given the units are in the same two regions. It seems surprising that in two identical regions of the hippo the flow of information or "causality", could be reversed, when/if one assumes information flows through the system from EC to hippo. This seems unusual and hard to reconcile given what is known about how information flows through the MTL system.

      The reviewer is correct that the spike co-firing analysis suggests a bi-directional flow of information between the hippocampus and surrounding MTL regions (e.g. entorhinal cortex; see Figure 6 – figure supplement 3). However, this bi-directional flow of information is not incompatible with neuroanatomy and the memory literature. The entorhinal cortex serves as an interface between the hippocampus and the neocortex with superficial layers providing input into the hippocampus (via the perforant pathway), and the deeper layers receiving output from the hippocampus (van Strien et al., 2009; Nat Rev Neurosci). Therefore, on a purely anatomical basis we can expect to see a bi-directional flow of information between the hippocampus and the entorhinal cortex, albeit in different layers. Importantly, reversals as shown in our Figure 6 – source data 1 involved different microwires and therefore different neurons (i.e. the entorhinal unit in row 1 was recorded from microwire 3, whereas the entorhinal unit in row 2 was recorded from microwire 8). It is conceivable that these two neurons correspond to different layers of the entorhinal cortex and therefore reflect input vs. output paths. Moreover, studies in humans demonstrated that successful encoding of memories depends not only on the input from the entorhinal cortex into the hippocampus, but also on the output of the hippocampal system into the entorhinal cortex, and indeed on the dynamic recurrent interaction between these input and output paths (Maass et al. 2014; Nat Comms; Koster et al., 2018; Neuron). Our bi-directional couplings between hippocampal and surrounding MTL regions (such as the EC) are in line with these findings. We have added a discussion of this issue in the discussion section.

      Manuscript lines 447-452: “Notably, the neural co-firing analysis indicates a bidirectional flow of information between the hippocampus and surrounding MTL areas, such as the entorhinal cortex (see Figure 6 – figure supplement 3; Figure 6 – source data 1). This result parallels other studies in humans showing that successful encoding of memories depends not only on the input from surrounding MTL areas into the hippocampus, but also on the output of the hippocampal system into those areas, and indeed on the dynamic recurrent interaction between these input and output paths (43, 44).”

    1. Author Response

      Reviewer #3 (Public Review):

      This paper is based on digital reconstruction of a serial EM stack of a larva of the annelid Platynereis and presents a complete 3D map of all desmosomes between somatic muscle cells and their attachment partners, including muscle cells, glia, ciliary band cells, epidermal cells and specialized epidermal cells that anchor cuticular chaetae (chaetal follicle cells) and aciculae (acicular follicle cells). The rationale is that the spatial patterning of desmosomes determines the direction of forces exerted by muscular contraction on the body wall and its appendages will determine movement of these structures, which in turn results in propulsion of the body as part of specific behaviors.

      To go a step further, if connecting this desmosome connectome with the (previously published) synaptic connectome, one may gain insight into how a specific spatio-temporal pattern of motor neuron activity will lead, via a resulting pattern of forces caused by muscles, to a specific behavior. In the authors' words: "By combining desmosomal and synaptic connectomes we can infer the impact of motoneuron activation on tissue movements".This is an interesting idea which has the potential to make progress towards understanding in a "holistic" way how a complex neural circuitry controls an equally complex behavior. The analysis of the EM data appears solid; the authors can show convincingly that desmosomes can be resolved in their EM dataset; and the technology used to plot and analyze the data is clearly up to the task. My main concern is with the way in which the desmosome pattern is entered in the analysis, which I think makes it very difficult to extract enough relevant information from the analysis that would reach the stated goal.

      1) The context of how different structures of the Platynereis larval body, by changing their position, move the body needs much more introduction than the short paragraph given at the end of the Introduction.

      -My understanding is that the larval body is segmented, and contraction of the segments can cause a certain type crawling or swimming: does it? Do the longitudinal muscles, for example, insert at segment boundaries, and alternating contraction left-right cause some sort of "wiggling" or peristalsis?

      Longitudinal muscles do not insert only at segment boundaries, but have desmosomal connections along the entire length of the cell. Individual longitudinal muscle cells can span up to 3 segments. However the cells are staggered in such a way that all longitudinal muscle cells with somas in one segment can collectively cover up to 4 segments. Longitudinal muscles are involved in turning when swimming (Randel et al., 2014). The undulatory trunk movements and parapodial walking movements are due to the contraction of oblique and parapodial muscles. The longitudinal muscles provide support during crawling (via desmosomal links) but it is unlikely that these muscles contract segmentally. Disentangling the distinct contributions of 53 types of muscles during crawling will require further studies.

      -In addition, there are segmental processes (parapodia, neuropodia), and embedded in them are long chitinous hairs (Chaetae, Acicula). Do certain types of the muscles described in the study insert at the base of the parapodia/neuropodia (coming from different angles), such that contraction would move the entire process, including the chaetae/acicula embedded in their tips?

      Yes, acicular muscles insert at the proximal base of the acicula, and by moving the acicula they move the entire noto-/neuropodia. We have presented the anatomy of all acicular and chaetal muscles types in the figures and videos.

      -Or is it that only these chaetae/acicula move, by means of muscles inserting at their base (the latter is clearly part of the story)? Or does both happen at the same time: parapodium moves relative to the trunk, and chaeta/acicula moves relative to the parapodium? How would these movements lead to different kind of behaviors?

      -Diagrams should be provided that shed light on these issues.

      We have extended Video2 to show individual muscles and their relation to the aciculae in one of the parapodia. We also clarified this in the text:

      “Several acicular muscles attach on one end to the proximal base of the aciculae and on the other end to the paratrochs and epidermal cells. Oblique muscles attach to the basal lamina, epidermal and midline cells at their proximal end, run along the anterior edge of parapodia and attach to epidermal and chaetal follicle cells at their distal tips. Both of these muscle groups are involved in moving the entire parapodium. Acicular muscles move the proximal tips of the aciculae, while oblique muscles move the parapodium by moving the tissue around the chaetae and the aciculae. All acicular movements also correspond to parapodial movements. Chaetae are embedded in the parapodium and therefore move with it, but the chaetal sac muscles can also independently retract the chaetae into the parapodium or protract them and make them fan out.”

      2) The main problem I have with the analysis is the way a muscle cell is treated, namely as a "one dimensional" node, rather than a vector.

      -In the current state of the analysis, the authors have mapped all desmosomes of a given muscle cell to its attached "target" cell. But how is that helpful? The principal way a muscle cell acts is by contracting, thereby pulling the cells it attaches to at its two end closer together. As the authors state (p.4) "...desmosomes..are enriched at the ends of muscle cells indicating that these adhesive structures transmit force upon muscle-cell contraction."

      At the level of the current analysis our data reveal which cells may be moved by the contractions of the individual muscle cells. The reviewer is right that treating a muscle as a vector (or set of vectors) would be a more accurate description, which would potentially also open up the possibility of computational modelling. We have provided such a vectorised dataset in the revised version, where each muscle-cell skeleton is subdivided into short linear segments (Figure2–source–data 2). This dataset may be useful to approach the problem with a three dimensional approach, which is beyond the scope of the current analysis. We also included an additional video (Video 7) showing examples of muscles and their partners where the cells and the desmosomes connecting them are highlighted. This reveals that the desmosomes connecting two cells are often at the very end of the muscle cell.

      -for that reason, the desmosomes at the muscle tips have to be treated as (2) special sets. Aside from these tip desmosomes there are other desmosomes (inbetween muscles, for example), but they (I would presume) have a very different function; maybe to coordinate muscle fiber contraction? Augment the force caused by contraction?

      Desmosomes between muscles only occur between muscles of different types, not for homotypic connections. There are other types of junctions (adhaerens-like junctions) that connect individual cells of a muscle bundle together (not analysed here). We clarified this in the text.

      • As far as I understand for (all of) the desmosome connectome plots, there is no differentiation made between desmosome subsets located at different positions within the muscle fiber. I therefore don't see how the plots are helpful to shed light on how the multiplicity of muscles represented in the graphs cause specific types of neurons.

      We would like to point out that the cells and structures that muscles connect to via desmosomes are very likely the parts of the body that will move during the contraction of the muscle or will provide structural support (e.g. basal lamina) for the muscle cell to contract. This is most evident in the parapodial complex. The majority of muscles in the body connect to the aciuclar folliclecells and the aciculae are the most actively moving parts in the body during crawling (see Video 4). In any case, since we provide all skeleton reconstructions and the xyz coordinates of all desmosomes, the data could be further analysed following these suggestions by the reviewer.

      • As it stands these plots "merely" help to classify muscles, based on their position and what cell type they target: but that (certainly useful) map could have probably also be achieved by light microscopic analysis.

      This has never been achieved by light microscopy analysis in the hundreds of papers on invertebrate muscle anatomy (e.g. by phalloidin staining). For an LM analysis, it would not be sufficient to label the muscle fibres, but one would also need to label the desmosomes and a multitude of non-muscle cell types including the extent of their cytoplasm. This is technically very challenging (we would nevertheless be happy to hear specific suggestions for markers etc. from the Reviewer). Currently, only EM provides the required depth of structural information and resolution. This is why we believe that our dataset and analysis is unique, despite over a century of research in invertebrate anatomy.

      3) Section "Local connectivity and modular structure of the desmosomal connectome" p.4-7" undertakes an analysis of the structure of the desmosome network, comparing it with other networks.

      -What is the rationale here? How do the conclusions help to understand how the spatial pattern of muscles and their contraction move the body?

      We hope that our analysis may also be of interest to the community of network scientists and we believe that the reconstruction of a quite large and novel type of biological network warrants a more quantitative network analysis, using the standard methods and measures of network science – as we presented e.g. in Figure 4 – even if these mathematical analyses may not directly reveal how muscles move the body. We hope that some readers with an interest in quantitative analyses will also appreciate the broader picture here.

      -Isn't, on the one hand (given that position of the desmosome was apparently not considered), the finding that desmosome networks stand out (from random networks) by their high level of connectivity ("with all cells only connecting to cells in their immediate neighbourhood forming local cliques") completely expected?

      We disagree that the result was completely expected. Even if this was the case, we think it is quite different to say that a result is expected or to thoroughly quantify certain parameters and mathematically characterise key properties of the desmosomal graph (as we have done). These network analyses help to conceptualise our findings and to think about the muscle system in more global, whole-body terms.

      -On the other hand, does this reflect the reality, given that (many?) muscle cells are quite long, connecting for example the anterior border of a segment with the posterior border.

      Indeed, a quantitative analysis helped us to identify cases where the reality deviated somewhat from what was completely expected, and we thank the reviewer for these comments. As we explain in the revised version, some longitudinal muscles show an unexpected position in the force-field layout of the graph, due to their long-range connections. We have added extra clarifications to the text: “To analyse how closely the force-field-based layout of the desmosomal connectome reflects anatomy, we coloured the nodes in the graph based on body regions (Figure 5). In the force-field layout, nodes are segregated by body side and body segment. Exceptions include the dorsolateral longitudinal muscles (MUSlongD) in segment-0. These cells connect to dorsal epidermal cells that also form desmosomes with segment-1 and segment-2 MUSlongD cells. These connections pull the MUSlongD_sg0 cells down to segment-2 in the force-field layout (Figure 5D).”

      1. In the section "Acicular movements and the unit muscle contractions that drive them" the authors record movement of the acicula and correlate it with activity (Ca imaging) of specific muscle types. This study gives insightful data, and could be extended to all movements of the larva.

      -The fact that a certain muscle is active when the acicula moves in a certain direction can be explained (in part) by the "connectivity": as shown in Fig.7L, the muscle inserts at a acicular follicle cell on the one side, and to an epithelial (epidermal?) cell and the basal lamina on the other side. But how meaningful is a description at this "cell type level" of resolution? The direction of acicula deflection depends on where (relative to the acicula base) the epithelial cell (or point in the basal lamina) is located. This information is not given in the part of the connectome network shown in Fig.7L, or any of the other graphs.

      This information is indeed not shown in the graphs, where each cell is treated as a node. However, we provide this information in the detailed anatomical figures in Figure 6 – figure supplement 1-3 and Video 7, where the individual acicular and oblique muscle types are visualised. In principle, one could subdivide aciculae into e.g. proximal and distal halves and derive a more detailed network. We have not done this but since all the EM, anatomical rendering and connectivity data are available in our public CATMAID server (https://catmaid.jekelylab.ex.ac.uk/), we hope that the interested readers will be able to further analyse the data.

      We renamed ‘epithelial’ cells to ‘epidermal’ cells.

    1. When files are rendered on a computer screen a user witnesses something akin to the performance of a play. The underlying data in a file is interpreted and rendered through software for a user to interact with in much the same way that the script of a play is interpreted and performed by a cast on a stage. In each case, while the underlying script or files remains the same, a given performance of a file or a play is going to look and sound different. For some kinds of research questions those differences do not matter, however, it is necessary in either case to be aware of the differences.

      Because seeing a play is such a fleeting event, writing a play review may be a thrilling, though tough, effort. You must be both a spectator watching and appreciating the performance as well as a critical analyst of the production itself. You must be able to offer a quick overview of the play, as well as a close objective analysis of the performance you attended, as well as an interpretation and review of the full ensemble of staging, acting, directing, and so on. Couldn't the same be said about a file that is rendered on a computer screen, it has the ability to disappear at any point, so should we not read it carefully and think of why this specific work was digitized. What does this work really look like in its original form? How was this artifact written?

    1. “Acknowledging that they have that sovereignty over the material, that it is indeed not yours [the institution’s], is one of the key things we’re trying to promote in the work that we’re doing with the archival community in general,”

      I think this approach to the matter is a fantastic step forward to such a sensitive issue. The items being archived are no more the archivist's property than they are the institutions property. These records belong to a culture that we should aim to preserve independently of our own, and we cannot truly attempt such a feat if we try to claim ownership over every piece we host. After all, in essence, these records are knowledge that local indigenous communities are offering to preserve for us as opposed to it being lost to time. Some of it may never be shared with outsiders of that community, yet some of it may be shared, and surely that value alone would be worth the cost of the preservation programs.

      To give an analogy to this idea, if you could prevent the library of Alexandria from burning, even though you may never personally access it, but others might, would you? or would you let it burn and lose an unknown amount of knowledge and history in the process.

    2. They even refer to deaths. Indeed, for some families, these records may be the only existing documents detailing the fates of their children.

      I think that it is important to keep these records of our nation's past as a reminder of where we came from and what horrid mistakes we made. Without these records, future generations may be doomed to repeat the same atrocities committed at residential schools, as well as other places. I also think that the ability to digitize and spread these documents allows the families of those affected to finally find closure in what has happened to their families.

    1. We utilized a between ‐ subjects design in which we compared two types of feedback: Antisocial feedback and prosocial feedback, with no feedback as a control condition. In the antisocial feedback condition, keeping tokens to the self (i.e., maximizing one's own outcome) received many thumbs up, whereas in the prosocial feedback condition, donations to the group received many thumbs up. The no feedback control condition was similar to the feedback conditions in the sense that participants were informed that a spectator group would evaluate their decisions, so participants anticipated the possibility of feedback. The only difference in the no feedback control condition was that after making their decisions, participants were not shown any feedback

      I feel like if I was a part of this experiment that this would effect my decisions. If I knew that someone was watching and judging my decisions I would subconsciously change my original answers to answers that I think the people watching would approve of. It may just be something that is wired into our minds, that we have to accommodate our answers/actions based upon who is watching.

    2. they may also be instrumental in prompting adolescents to adopt other types of behavior, such as prosocial behavior

      I never really thought of it this way, the fact that adolescents may be picking up these bad behaviors because they're told to do the opposite. I believe that our hearts are not as pure as we think they are, when Adam and Eve ate of the forbidden fruit and brought sin into this world, we have all now been born with sin, therefore I believe that there is something within us that desires to go against good... there is evil within us that wants to succeed, and maybe that's where this rebellion comes from that we see being mentioned here.

    1. Author Response

      Reviewer #1 (Public Review):

      Using a large neonatal dataset from the developmental Human Connectome project, Li and colleagues find that cortical morphological measurements including cortical thickness are affected by postnatal experience whereas cortical myelination and overall functional connectivity of ventral cortex developed significantly were not influenced by postnatal time. The authors suggest that early postnatal experience and time spent inside the womb differentially shape the structural and functional development of the visual cortex.

      The use of large data set is a major strength of this study, furthermore an attempt to examine both structural and functional measures, and connectivity analysis and separating these analyses based on the pre-and full-term infants is impressive and strengthens the claims made in the paper. While I find this work theoretically well-motivated and the use of the large dHCP dataset very exciting, there are some concerns, that need to be addressed.

      There is a bit of confusion if the authors really compared the structural-functional measures in the final analysis. If the authors wish to make claims about the relationship, then there must be a compelling analysis detailing these findings.

      Thanks for the suggestions. We have added analysis to directly investigate the relationship between the development of homotopic connection and corresponding structural measurements in the area V1 (Page 13 Line 5-16):

      “The above results revealed that structural and functional properties of the ventral visual cortex both developed with PMA, but were differently influenced by the in-utero and external environment (Table 1). We further investigated the relationship between structural and functional development based on area V1, which showed a strong developmental effect in both structural and functional analyses. Mediation analysis was employed to see whether the development (GA or PT) of the homotopic connection between bilateral V1 was mediated by the structural properties (CT or CM). We found that the PT had a significant direct effect on the homotopic function that was not mediated by CT or CM (Fig 6a-b). In contrast, the direct effect of GA on the homotopic connection was not significant but the indirect effect of GA through CM on the connection was significant (Fig 6c-d).”

      There is also a bit of confusion in the terminology used in the study regarding ages; the gestational age, premenstrual age, and postnatal time. I think clarifying and simplifying it down to GA and postnatal time will help the reader and avoid confusion.

      Thank you for the suggestion. We have made extensive revision regarding the terminology throughout the paper and simplified it down to GA and PT. Please see the response to the 1st major concern in the Essential Revisions (for the authors) section above.

      *Reviewer #2 (Public Review):

      The authors utilize the publicly available dHCP dataset to ask an interesting question: how does postnatal experience and prenatal maturation influence the development of the visual system. The authors report that experience and prenatal maturation differentially contribute to different aspects of development. Namely, the authors quantify cortical thickness, myelination, and lateral symmetry of function as three different metrics of development. The homotopy and preterm infant analyses are strengths that, on their own, could have justified reporting. However, I have concerns about the analytic approaches that were used and the conclusions that were drawn. Below I list my major concerns with the manuscript.

      PMA vs. GA vs. PT

      The authors seek to understand the contribution of experience and prenatal development, yet I am unsure why the authors focused on the variables they did. There are three variables of interest used throughout this study: Gestational age at birth (GA), postnatal time (PT), and postmenstrual age at the time of scan (PMA). The last metric, PMA, is straightforwardly related to GA and PT since PMA = GA + PT. In most (but not all) of the manuscript, the authors use PMA and PT, with GA used without justification in some cases but not in others.

      It is unclear why PMA is used at all: PMA is necessarily related to PT and GA, making these variables non-independent. Indeed, the authors show that PMA and PT are highly correlated. The authors even say that "the contribution of postnatal experience to the development was not clarified because PMA reflects both prenatal endogenous effect and postnatal experience." So, why not use GA at birth instead of PMA? Clearly, GA is appropriate in some cases (e.g., Figure S4 or in some of the ANOVA applications), and to me, it seems to isolate the effect the authors care about (i.e., duration of prenatal development). Perhaps there is some theoretical justification for using PMA, but if so, I am unaware.

      That said, I expect that replacing all analyses involving PMA with GA will substantially change the results. I do not see this as a bad thing as I think it will make the conclusions stronger. As is, I am left unsure about what the key takeaways of this paper are.

      We appreciate the suggestions, and we have replaced the related analyses involving PMA with GA in the manuscript. Please see the Response to the 1st major concern in the Essential Revisions (for the authors) section above for more detail.

      Using GA instead of PMA will have several benefits: 1) It will be much simpler to think of these two variables since they contrast the duration of fetal maturation and time postnatally. 2) This will help the partial correlation analyses performed since the variance between the variables is more independent. It will also mean that the negative relationships observed between PT and cortical thickness when controlling for PMA (e.g., Figure 2h) might disappear (reversed signs for partial correlations are common when two covariates are correlated). 3) this will allow the authors to replace Figure 1a with a more informative plot. Namely, they could use a scatter of GA and PT, giving insight into the descriptive statistics of both dimensions.

      We have revised the manuscript throughoutly following the reviewer’s suggestion. However, we thought it would be necessary to show the overall development of CT and CM across the general age (PMA) in Figure 1. Therefore, we didn’t replace the figure 1a but added a scatter figure between GA and PT in Figure 2-figure supplement 1 and added descriptive statistics of them in the manuscripts: “The mean GA of the neonates was 39.93 weeks (SD = 1.26) and the mean PT was 1.21 weeks (SD = 1.25), the correlation between them was not significant (r = - 0.08, p > 0.1; Figure 2-figure supplement 1).” Moreover, the negative relationships between PT and CT when controlling for PMA disappeared in the revised results as the reviewer’s predicted.

      I suspect that one motivation for the use of PMA over GA is for the analysis in Figure 6. In this analysis, the authors pick a group of term infants with a PMA equal to the preterm infants. Since PMA is the same, the only difference between the groups (according to the authors) is the amount of postnatal experience. However, this is not the only difference between the groups since they also vary in GA (and now PT and GA are negatively correlated almost perfectly). I don't know how to interpret this analysis since both the amount of prenatal maturation and postnatal experience vary between the groups.

      We appreciate the reviewer’s opinion that both GA and PT were different between preterm and term-born neonates. Then any of the differences between the two groups might came from the combined effect of GA and PT in our results, and unfortunately, we might not able to separate them in this analysis. However, the preceding results indicated that the CT was significantly influenced by PT and GA while CM was significantly influenced by GA, which So we discuss the preterm and term-born comparison in the context of these findings (Page 19 Line 26-29 and Page 20 Line 1-5): “We found CT in the ventral cortex was generally lower in the term-born than preterm-born infants, while the CM showed the opposite trend in the two groups. Since the preterm babies have longer PT but shorter GA compared to full-term infants at the same PMA, this result supported the above analysis that CT was preferably influenced by PT while CM was largely dependent on GA during the neonatal period”. Furthermore, we added a description in the limitation section to stress the caveat (Page 20 Line17-19): “Meantime, both GA and PT were different between preterm and term-born neonates. Then any of the differences between the two groups might came from the combined effect of GA and PT, and unfortunately, we were not able to separate them in this study.”

      Justification of conclusions and statistical considerations

      I had concerns about some of the statistical tests and conclusions that the authors made. I refer to some of these in other sections (e.g., the homotopy analyses), but I raise several here.

      I am not sure what evidence the authors are using to make this claim: "we found that the cortical myelination and overall functional connectivity of ventral cortex developed significantly with the PMA but was not directly influenced by postnatal time." Postnatal time is significantly correlated with cortical myelination, as shown in Figures 2g, 2h, 3b, 3c, and postnatal time is significantly correlated with functional connectivity, as shown in Figures 4h, 5c, 5d, and 5e. Hence, this general claim that "the development of CT was considerably modulated by the postnatal experience while the CM was heavily influenced by prenatal duration" doesn't seem to be supported: both myelination and thickness are affected by postnatal experience and prenatal duration (as measured by PMA). A similar sentiment is expressed in the abstract. Perhaps the authors suggest different patterns in the strength of change for PMA vs. PT across these metrics, but if so, then statistical tests need to support that conclusion, and the claims need to reflect that sentiment.

      Interestingly, Figure S4 presents a compelling ANOVA that does support this conclusion. Still, this result is relegated to the supplement, and it also uses GA, rather than PMA, making it hard to reconcile with the other claims made in the main text. Moreover, it uses ANOVAs, which dichotomizes a continuous variable. Here and elsewhere in the manuscript (e.g., Figures 3d, 3e), the authors split the infants into quartiles and compare them with ANOVAs. Their use for visualization is helpful, but it is unclear what the statistical motivation for this is rather than treating these as continuous variables like is possible with linear mixed-effects models. Moreover, it is unclear why the authors excluded half the data from the study (i.e., quartiles 2 and 3) in this ANOVA when all four quartiles could be used as factors.

      We appreciate the reviewer’s comments. We have clarified our results and conclusion in the revised manuscript based on the new analyses that replaced PMA with PT and GA (See the response to the 1st major concern in the Essential Revisions). The previous claims have been changed as following:” the postnatal time could modulate the cortical thickness in ventral visual cortex and the functional circuit between bilateral primary visual cortices. But the cortical myelination, particularly that of the high-order visual cortex, developed without significant influence of postnatal time in such early period” (Page 2, Lines 8-12). This claims could be supported by the results in figure 2. Moreover, to support the claims about the comparison of the influence between GA and PT on structural development, we replaced the ANOVA analysis with a linear mixed-effect model as the reviewer mentioned.

      1) To compare the influence of GA against PT on the structural development in the whole ventral visual cortex (Page 7 Line 15-19), “We applied a linear mixed-effect model to test whether the CT (or CM) of the whole ventral cortex were differently influenced by the GA vs. PT, and found that the GA had a significantly stronger effect on the CM than PT (interaction between GA and PT, p < 0.05) but no significant difference was found of the effect on the CT between the ages (p > 0.6).”

      2) To compare the influence of GA against PT on the structural development in the area V1 and VOTC, we applied a similar linear mixed-effect model analysis for the two ROIs (Page 8 Line 17-18 and Page 9 Line 1-4): “Moreover, we applied a linear mixed-effect model to test the developmental influence of GA vs. PT on the cortical structure , and the results showed that the CT in two ROIs showed non-significantly different influences from GA against PT (p > 0.3), but CM showed at least marginally significant results in both two ROIs (V1: p < 0.01 and VOTC: p < 0.09).”

      It is unclear what the evidence is to support the following claim: "Both CT and CM show higher correlation with PMA in the posterior than anterior region, and higher correlation in the medial than lateral part within the anatomical mask (Figure 2a and Figure S2b-c [sic])" From Figure 2 or Figure S2, I don't see a gradient. From Figure S3, there might be a trend in some plots, but it is hard to interpret since it is non-monotonic. More generally, is there a statistical test to support this claim?

      We added a correlation analysis between the diction (x: lateral to medial; y: posterior to anterior) and measurements (CT and CM) in the ventral visual cortex, and the resulting coefficient was all significant (r = 0.7/-0.8 for CT along x/y axis, and r = 0.91/-0.83 for CM along x/y axis; p < 0.001). See Figure 1-figure supplement 2. However, the consideration provided by the reviewer still exists that such significance was driven by part of the areas and the gradient was non-monotonic. Therefore, we replaced the original claim with the following sentence (Page 6 Line 3-8): “In addition, we found distinct spatial variation along ventral cortex, e.g. posterior-anterior and medial-lateral directions (Figure 1-figure supplement 2a-b). Generally, both CT and CM showed higher correlation with PMA in the posterior than anterior region (r = -0.8 and -0.83; p < 0.001), and higher correlation in the medial than lateral part within the ventral visual cortex (r = 0.7 and 0.91; p < 0.001; Figure 1-figure supplement 2c-d).”.

      "and the interaction [sic] was more prominent in CM (simple effect: t = 10.98, p < 10-9) that in than CT (t = 2.07, p < 0.05)." Does 'more prominent' mean it is 'significantly stronger'? If not, then the authors should adjust this claim

      The claim ‘more prominent’ did express ‘significantly stronger’ since we found that the interaction between CM and CT along PMA or PT was significant in the ANOVA analysis. This analysis has been removed because we thought that the comparison between two structural measurements is not very relevant to the conclusion of the paper. We now applied a linear mixed-effect model to compare the influence of GA against PT on specific structural development. So this result and claim have been removed from the new manuscript.

      Are the authors Fisher Z transforming their correlations? In numerous places, correlation values seem to be added together or used as the input to other correlation analyses. It is unclear from the methods whether the authors are transforming their correlation values to make that use appropriate.

      We are sorry for the confusion. All the statistical analyses involving correlation coefficients were Fisher-Z transformed. We have added a clear description in the manuscripts involving the Fisher-Z transformation (Page 25 Line 16-18).

      Homotopy analyses

      The homotopy section is a strength of the paper, but I have doubts about the approach taken to analyze this data and some of the conclusions drawn. I don't expect any of my suggestions to change the takeaway of this section, but I do think they are essential criticisms to address.

      I do not think that the non-homotopic control condition is appropriate. In Arcaro & Livingstone (2017), the authors had 3 categories for this analysis: homotopic pairs (e.g., left V1 vs. right V1), adjacent pairs (e.g., left V1 vs. right V2), and distal pairs (e.g., left V1 vs. right PHA1). In the homotopy analysis performed by Li and colleagues, they compare homotopic pairs with all other pairs. I don't think that is generous to the test since non-homotopic pairs include adjacent pairs that should be similar and distal pairs that shouldn't be similar. This may explain why some non-homotopic distribution overlaps with the homotopic distribution in Figure 4c.

      Thanks for these suggestions. In the revised manuscript, we reanalyzed the data by dividing the connections into three groups for each subject. See Page 26 Line 24-29: “For each subject, Pearson correlations were carried out on the ROI-averaged time series within and across the left and right ventral cortex. The resulting connections were divided into three groups, namely the homotopic connection (the connection between two paired areas in two hemispheres. e.g. right and left V1), adjacent connection (e.g., right V1 and left V2 since V1 and V2 are adjacent) and distant connections (two areas that were not the paired or adjacent)”.

      Regardless of this decision, I think the authors should reconsider their statistical test. I think the authors are using a between samples t-test to compare the 34 homotopic pairs with the hundreds of non-homotopic pairs. This is statistically inappropriate since the items are not independent (i.e., left V1 vs. right V1 is not independent of left V1 vs. right V2, which is also not independent of left V3 vs. right V2). This means the actual degrees of freedom are much lower than what is used. Moreover, I am unsure how the authors do this analysis across participants since this test can be done within participants. The authors should clarify what they did for this analysis and justify its appropriateness.

      Thank you for the suggestion. In the previous manuscript, we first averaged the connection matrix across subjects and then calculated the homotopic (or non-homotopic) connections between areas, and therefore, statistical analysis could not be performed. In the revised paper, we calculated the three groups of connections for each subject before the average. We applied a non-parameter statistical analysis (Wilcoxon signed-rank) to address the issue of the independent comparison among the connections, and found the homotopic connections were significantly stronger than the adjacent or distant connections.

      See (Page 26 Line 29 and Page 27 Line 1-3): “Independent-sample T-test was used to test whether the homotopic correlation was significantly greater than zero across subjects. To compare the correlation among the three types of connections, we applied a non-parameter statistical analysis (Wilcoxon signed-rank) across subjects”.

      The results showed that (Page 9 Line 17-21) “the homotopic connections in all ROIs of ventral cortex were significant (mean r = 0.13– 0.43, t > 12.87, s < 10-9; Fig 4a-b), and were significantly higher than adjacent connections (0.29 ± 0.12 vs. 0.19 ± 0.10, Wilcoxon signed rank test on the Fisher-Z transformed r value: z = 16.32, p < 10-9) and distal connections (0.04 ± 0.06, z = 16.32, p < 10-9; Fig. 4c)”.

      Could the authors speculate on why the correlations in homotopic regions are so much lower than what Arcaro and Livingstone (2017) found. I can think of a few possibilities: higher motion in infants, less rfMRI data per participant, different sleep/wake states, and different parcellation strategies. Regarding the last explanation, I think this is a real possibility: the bilateral correlation may be reduced if the Glasser atlas combines functionally heterogeneous patches of the cortex. Hence, the authors should consider this and other possible explanations.

      Thank you for the suggestion. The neonates included in this study were all under natural sleep during the scan, so sleep/wake states would not be one of the causes. We added some possible reasons for this difference following the related results (Page 19 Line 9-13): “However, the present homotopic connections in the human neonates were lower than those in neonate macaca mulattas (Arcaro and Livingstone, 2017). This difference might relate to the higher motion in human infants, less r-fMRI data in the present study, coarser parcellation in the visual cortex used in this work, and the developmental difference between primates and humans in the neonatal period.”

      The authors assume that the homotopic analyses mean that there are lateral connections between hemispheres (e.g., "Furthermore, the connections among the ventral visual cortex have developed during this early stage. Specifically, the homotopic connections between bilateral V1 and between bilateral VOTC both increased with GA, indicating an increased degree of functional distinction"). While this might be true, it doesn't need to be. Functional connectivity can be observed between regions that lack anatomical connectivity. Instead, two regions could both be driven by another region. In this case, the thalamus might drive symmetrical activity in the visual cortex.

      We agree with the reviewer’s view that the development of functional connectivity might be driven by other regions like thalamus. So we added this interpretation in the discussion section (Page 19 Line 23-25): “It is worth noting that the increased homotopic connection can be direct or indirect, e.g., the effect might be driven external regions with enhanced connection to both of the areas (e.g. thalamus)”.

      Miscellaneous

      I am not sure what the motivation of this line is: "Moreover, those studies did not fully control the visual experience in the first few weeks of the subjects, thus cannot give a clear conclusion whether the innate functional connectivity is unrelated to postnatal visual experience." Arcaro, Schade, Vincent, Ponce, & Livingstone (2017) did control the visual experience of subjects. Moreover, the research here doesn't control infant experience in the way this sentence implies: it implies an experiment manipulation (i.e., fully control) rather than a statistical control that is done here. Consider rephrasing

      We have rephrased this sentence in the introduction section (Page 5 Line 2-5): “Moreover, the human infants participating in a previous study (Kamps et al., 2020) were around one month old (mean age: 27 d; range from 6 to 57 d), who might already acquire some visual experience, and thus this study could not exclude postnatal visual experience on the innate functional connectivity”.

      I am not sure why this claim is made: "Area V1 was selected because this region is the most basic region for visual processing and probably is the most experience-dependent area during early development". Is there evidence supporting this claim? Plasticity is found throughout the visual cortex, and I think which region is most plastic depends on the definition of plasticity. For instance, most people have the same tuning properties to gabor gratings (e.g., a cardinality bias), but there is enormous variability in face tuning across cultures.

      We have removed this claim in the manuscript.

      The abstract says 783 infants were included in this study, but far fewer are actually used. The authors should report the 407 number in the abstract if any number at all.

      We have revised the number accordingly.

      Any comparisons of preterms and terms ought to be given the caveat that the preterm environment can be very different than the term environment: whereas a term infant goes home and sees friends and family without restriction, the preterm environment can be heavily regulated if they are in a NICU. Authors should either provide details about the environments of the preterms in their study, or they should consider how differences in the richness of visual experience - regardless of quantity - may affect visual development.

      We agree with the reviewer’s concern, and added a paragraph in the limitation section to stress the caveat (Page 20 Line 12-16): “One limitation of this study is the comparison between preterm and term-born infants did not consider the different visual experience in these infants. The preterm-born neonates may experience very different environment than those of the term-born, e.g. the preterm environment can be heavily regulated if they were in a NICU, but we didn’t have detailed information about the postnatal environment to control for it.”

      Reviewer #3 (Public Review):

      The authors use a large neonatal dataset to examine how development may occur differently based on whether on not the neonate spent that time in gestation or out of the womb accruing potentially accruing visual experience. In this manner, the authors hope to tease apart those aspects of development that are biologically programmed versus those that occur in response to experience within the visual cortex. They show structurally that cortical thickness is affected by postnatal experience while cortical myelination is not, and functionally they find regional differentiation present between visual areas at birth and that their connectivity changes with development and postnatal experience. The conclusions seem well supported by the data and analyses and provide some insight into which aspects of brain structure at birth are sculpted more by postnatal experience and which are more determined by endogenous developmental timelines.

      The analyses are based on a large sample of infants, and the authors were careful to statistically separate which aspects of an infant's age, gestational or postnatal, are driving brain development, providing a deeper picture of infant brain development than previous publications. Overall, the findings seem well supported by the data as the analyses are relatively straightforward.

      Visualization of the data and findings could be improved, as a few figures are difficult to interpret without having to read the methods.

      We have extensively revised the figures in the manuscript to improve the readability. See updated Figures 2-7.

      The acronyms regarding gestation, postnatal, and post-menstrual time are a little distracting. Please consider explicitly writing "gestational time" etc when referring to these numbers to improve readability.

      We have replaced the analyses involving PMA with gestational age (GA) or postnatal time (PT) in the revised manuscript to simplify the terminology. Please see the Response to the 1st major concern in the Essential Revisions (for the authors) section above. We believe this change makes the paper easier to follow even with the abbreviations.

      Because the cortical ribbon of infants is so thin at birth, there seems to be a possibility that partial-volume effects could be more prevalent in less-developed infants and impact myelin metrics. If not modeled or estimated, it should at least be discussed.

      In fact, the cortical thickness of the neonatal brain is not thinner than that of the adult. Particularly, the average cortical thickness of infants aged 0-5 months is around 2-2.5 mm (Wang et al., 2019), which is similar to adults (Fjell et al., 2015). Therefore, the partial-volume effect for cortical gray matter is not a special concern for infants.

      Nevertheless, we agree that the partial-volume effects might have different influences on infants of different ages. We added this consideration in the limitation section (Page 20 Line 20-24). “Another concern was about the partial-volume effect on the cortical measurements. The changing thickness of cortical ribbon during development may changes the degree of partial-volume effect, and thus may affect the cortical myelination measurement and may contribute to the myelination difference observed between preterm and term-born groups.”

      Structural and functional development could be more formally compared using quantitative models if the authors want those points more strongly related; the two are only qualitatively discussed at present.

      We have added a formal analysis to investigate the relationship between structural and functional development. Please see the Response to the 1st concern of Reviewer 1 (public review).

    1. Author Response

      Reviewer 2 (Public Review):

      1) The hypothesis that the genes responsible for the Mendelian traits are also the causal genes for the cognate complex traits does not seem to hold, given the prior work and the data shown in the study. For example, if this hypothesis is true, it is unexplained why the candidate genes were not even enriched in the GWAS regions for height and breast cancer.

      Following the removal of a data artifact from our breast cancer analysis and the inclusion of Backman et al.’s larger list of genes implicated in height, every phenotype in our analysis displays enrichment in proximity to GWAS peaks. Enrichment is present not only in genes selected based on cognate Mendelian phenotypes, but also on those from Backman et al., which examined the same complex trait phenotypes that were used for GWAS. In that work, the enrichment GWAS signal near of genes selected on coding variants was as high as 59.3-fold.

      Our use of Mendelian-trait-causing genes is not dependent on GWAS. Short of large-scale experimental work, we do not know any better way to confirm the genes’ broad relevance to GWAS phenotypes than their enrichment near peaks. This enrichment has been persuasively demonstrated by previous research. Freund et al. (2019) tested the enrichment of 20 Mendelian disorder gene sets against 62 complex phenotypes. Though there was no statistically significant overlap of phenotypically non-matched Mendelian genes and GWAS peaks (2% matched), the overlap of matched Mendelian genes and GWAS peaks was significant (54% matched).

      We have included additional evidence and references for this relationship in Supp. Note 1.

      2) The only evidence supporting their hypothesis appears to be the enrichment of the candidate genes in the GWAS regions for seven out of the nine traits. However, significant enrichment of the candidate genes in the GWAS regions does not necessarily mean that a large proportion of the candidate genes are the causal genes responsible for the GWAS signals. Analogously, we cannot use the strong enrichment of eQTLs in GWAS regions as evidence to claim that a large proportion of the GWAS signals are driven by eQTLs.

      Our gene sets were selected by considering two criteria: whether they are relevant to each complex trait, and whether they are biologically interpretable.

      The genes identified in Backman et al. have a strong case for relevance. They are evaluated for association, not with cognate Mendelian phenotypes, but with the exact same complex traits used for GWAS.

      Our genes, selected based on cognate Mendelian traits, are less obviously relevant, but have advantages for interpretation. Many have well-understood biological roles and are part of pathways that have been studied in great detail. Because most of these genes can cause dramatic phenotypic changes with one variant, the direction of effect is easier to understand than genes identified through burden testing. In fact, loss-of-function coding variants that cause autosomal dominant traits can be thought of as large-effect, context-independent eQTLs—they cause phenotypic change by decreasing gene expression roughly 50% across cell types, developmental stages, etc.

      Ideal genes for our analysis would combine the advantages of both sets. They would have individual coding variants that could be tied to complex traits using exome sequences. However, natural selection creates tradeoffs between variant frequencies and variant effect sizes. Large-effect variants (such as those responsible for Mendelian traits) are generally too rare to be detected in population sequencing. Coding variants that reach frequencies detectable in databases such as UK Biobank typically have smaller effect sizes, requiring them to be aggregated in order to implicate genes.

      We believe that our original gene set is plausible both because of its collective enrichment in GWAS signal and because each gene is individually known to cause cognate phenotypes. Enrichment is not proof, but can serve as strong evidence when backed up by known biology. Though selection precludes a perfect gene set, the enrichment in both our Mendelian gene set and the set from Backman et al. addresses each criterion—interpretability and relevance—individually, and, taken together, provides an argument for the relevance of genes selected based on coding variants.

      3) Considering the large numbers of GWAS signals, we would expect a substantial number of genes in the GWAS regions by chance. It would be interesting to quantify the number of genes in the GWAS regions if the 143 genes are randomly selected. Correcting the observed number of genes for that expected by chance (e.g., subtracting the observed number by that expected by chance), the proportion of the candidate genes in the GWAS regions would be small.

      The proportion of the candidate genes whose eQTL signals were colocalized with the GWAS signals or in close physical proximity with the fine-mapped GWAS hits was small. However, I would not be surprised if they are significantly enriched, compared with that expected by chance (e.g., quantified by repeated sampling of the 143 genes at random).

      Taking random sets of genes, or the entire set of non-putatively-causative genes shows that, given the size of our gene set, we would expect 43 randomly selected genes to fall within 1 Mb of a peak (95% confidence interval: 31.5-54.5). Instead, we find 147 peak-adjacent genes. When looking closer to genes, the enrichment increases. At a distance of 100 kb, we find 104 putatively causative genes, but the null model predicts only 11 (95% CI 4.5-17.0), a roughly ten-fold difference.

      Enrichment remains significant even when using a more conservative null. It may be that genes like ours, with importance to phenotype, are more likely than random genes to fall near GWAS peaks, even if their phenotype does not correspond to the GWAS phenotype. In this case, we might see enrichment even in the absence of a relationship between our Mendelian and complex traits. To account for this, we also tested significance by testing genes sets against different phenotypes (e.g. testing our LDL genes with a UC GWAS, and our height genes with a T2D GWAS). The results of this permutation are visible in Supp. Fig. 1, and further confirm the enrichment.

      Finally, non-expression based analysis found that Mendelian genes had large enrichments in heritability. As in our study, they included Mendelian genes for diabetes and LDL—the Mendelian diabetes genes were enriched 65-fold for common-variant heritability and the Mendelian LDL genes were enriched 212-fold (Weiner et al. 2022).

      Though it is true that the number of colocalizations and TWAS hits likely represents a statistically significant enrichment over all genes, we feel that this does not affect the conclusions of the paper. The model that noncoding variants identified by GWAS act as eQTLs certainly has some truth—colocalization and TWAS studies have found, in total, many associations. But the model’s success has not lived up to its expectations. This has been suggested, albeit inconclusively, by the failure of most GWAS peaks to colocalize. By evaluating, not the portion of loci that can be tied to a gene, but the portion of already-implicated genes that can be tied to a locus, we believe the model’s deficiencies are both more clear and more puzzling.

      4) It is unclear how the authors selected the breast cancer genes. If the genes were selected based on tumor somatic mutations, it is a problem because there is no evidence supporting that somatic mutation target genes are also cancer germline risk genes.

      Genes for breast cancer were selected using the MutPanning method (Dietlein et al. 2020), which takes somatic mutations found in tumors, and evaluates them in the context of known mutation patterns. The relationship between somatic and germline variants in cancer is little studied. We believe it is meaningful that, as explained in our response to overall comment 2ii, we do now find an enrichment of our breast cancer genes near GWAS peaks. Though these genes are very unlikely to be a perfect set, the conclusions of our paper remain true with or without the inclusion of this phenotype.

      5) The authors observed no enrichment of the candidate genes in height and breast cancer GWAS regions. In this case, should these traits and the corresponding genes be removed from the subsequent analyses?

      The reviewers’ notes about enrichment—and its absence in height and BC—prompted us to review our analysis of it. The enrichment for five of our phenotypes remained significant, and the lack of enrichment for breast cancer genes proved artifactual. After accounting for the artifact, the enrichment of breast cancer genes displays the same pattern as most other phenotypes, displaying highly significant enrichment as compared to the genomic background and a permutation analysis. Supplementary figure 1 has been updated to reflect this change, and to add the enrichments found in Backman et al.

      Because our original analysis of height has nominal, but not corrected, significance for enrichment, the problem may be one of power. The set of height genes identified by Backman et al. is larger than our original set and displays a significant enrichment in proximity to GWAS signal. This enrichment is also present when the two gene sets are combined, as shown in the updated Supp. Figure 1.

      Reviewer 3 (Public Review):

      1) The positive results are substantially reduced when restricting the analyses to a set of selected tissues of relevance to the trait. Isn't it implicated that the selection of relevant tissues in this study is not comprehensive, and further, tissue specificity is common in mediating genetic effects by gene expression? First, it seems some apparently relevant tissues are not selected (Table 2), such as bone for height (Finucane et al. 2015 NG). One approach to assess the relevant tissues for the predefined set of putatively causative genes is to see if these genes are enriched in the differentially expressed gene sets for those tissues. Second, among 84 putatively causative genes overlapped with GWAS signals, they identified 39 genes by TWAS, 11 genes by fine mapping with linear distance to chromatin modification features, and 41 genes by fine mapping with ChromHMM enhancer annotations, but these numbers reduced substantially to 9, 5 and 27 when restricting the same analysis to the selected tissues for each trait. If genes function only in the relevant tissues, I think using bulk expression data would lose power but is unlikely to give false positives. Thus, it is possible that for the traits analysed, not all relevant tissues are selected so that only a fraction of genes identified in bulk expression analysis can be replicated in the tissue-specific analysis. This appears to me a notable piece of evidence to support the hypothesis of biological context that the authors tend to have reservations in discussion.

      Testing for colocalizations or TWAS hits in all tissues may increase power for several reasons. First, it is possible that some GTEx tissues have unrealized relevance to our phenotypes. Secondly, in the event that a tissue is not present in GTEx, we may still detect relevant eQTLs in a tissue that is not itself involved in the trait, but which has similar patterns of expression. Finally, some tissues may be correct, but underpowered due to their small sample size. In this case, we may better detect the colocalization in tissues that are “irrelevant,” but are well-powered and have correlated expression.

      However, this creates problems of interpretation. Say we find, for example, a colocalization of an APOE eQTL with an LDL GWAS peak in skin tissue. Does this mean that skin tissue contributes to LDL levels? Is it simply because skin tissue has more samples than liver? Are we uncovering a strange, unexpected pleiotropy?

      We believe we can achieve both objectives—power and interpretability—with our use of MASH (Urbut et al. 2019) as described in response 3 of the first section. Briefly, MASH is a Bayesian tool that we use to update the estimates of eQTLs in GTEx data. Each tissue is adjusted to incorporate signals detected in other tissues with similar expression. This mitigates the danger of ignoring the correct tissue, and increases the power of tissues with small sample sizes. Its benefit is demonstrated by the substantial increase in the number of expression-GWAS colocalizations identified by coloc—however, the number of genes identified that fall within our putatively causative gene sets remains strikingly small.

      2) How much do both LD differences between GWAS and eQTL samples and the presence of allelic heterogeneity contribute to the observed low colocalization rate? One of their main findings is the low colocalization between trait-associated variants and eQTL in non-coding regions, which accounts for only 7% of the putatively causative genes. In discussion, the authors believe that this finding cannot be explained by lack of statistical power and is directly supported by a Bayesian analysis which reported high posterior probabilities of distinct signals for GWAS and eQTL. I agree that power is probably not a big issue. However, my concern is that given the large difference in sample size between GWAS and GTEx datasets, any small differences in LD between the two samples might cause a statistical separation of the signals even when trait phenotype and gene expression truly share a causal variant. Moreover, the presence of more than one causal variant with allelic heterogeneity in the locus may also play a part in the failure of colocalization. Consider two causal variants for the complex trait, one regulating the target gene and the other regulating another gene in co-expression. Potentially, the presence of the second causal variant would diminish the colocalization probability at the target gene.

      The ability of our statistical tools to actually find colocalizations is a critical one in this project. Small sample size increases the variance of the LD matrix, but is one of only many factors that influence power, which include LD differences between study populations and eQTL effect sizes.

      Though we restricted both GWAS and GTEx samples to subjects with European ancestry and used PCs as covariates, reviewers are correct that there are likely to be LD differences between samples, due to both slight variations in populations and the smaller sample sizes of GTEx. Analysis of colocalization tools in cases of mismatched LD have shown that decreases in power are small. Chun et al. (2017) tested JLIM in simulated conditions of modest population mismatch, using CEU haplotypes to create the GWAS, and haplotypes from all non-Finnish Europeans for eQTL associations. They then attempted to distinguish shared vs. distinct causative variants for GWAS and eQTL, finding no decrease in sensitivity or specificity (Supp. Fig. 6 of Chun et al. 2017).

      The case in which two genes are co-regulated by nearby variants, both causative for the GWAS trait, creates a condition of allelic heterogeneity for the GWAS trait (as opposed to the expression trait). Chun et al. evaluated JLIM’s loss of power as a result of AH, and found that the power loss is small, except in cases in which the two variants have equal effects (Supp. Fig. 10). Testing cases in which the AH occurs for the expression trait returned a similar result (Supp. Fig. 9).

      Hukku et al. (2021) performed similar analyses on coloc, eCAVIAR, and fastENLOC. Allelic heterogeneity was found to damage the power of coloc (by about a factor of 2). Testing on different pairs of populations, they conclude that extreme LD mismatches (e.g. Finnish vs. Yoruban samples) can lead to substantial power loss, but moderate LD mismatches (e.g. Finnish vs. British samples) do not. Though a factor of two is substantial, it would not change the qualitative conclusions of this paper. Overall, given the variety of methods we employ (including those, such as JLIM, more robust to AH), we are confident that they have, when taken together, been shown to be robust to the concerns raised.

      Finally, TWAS should, by design, be less vulnerable to LD differences and allelic heterogeneity. This can result in false positives, when genes with correlated expression are identified together, despite only one being causative. It can also result in non-causative genes being prioritized over causative ones, however, generally both genes will be identified (Wainberg et al. 2019).

      3) Perhaps the authors can perform some simulations to quantify the influence of tissue-specific expression effects, LD differences between eQTL and well-powered GWAS, and allelic heterogeneity, as discussed above, on their analyses. I understand that the authors may not be willing to do as it would involve a lot of work. But I'd like to see at least some discussion on how these questions can be better addressed in the future research.

      These are nuanced technical questions, and to address them by simulation in our paper would, as noted, involve a lot of work. We have summarized previous work that evaluated the effects of LD differences and AH in our response to essential revision 4. We discuss our concerns about the possibility of an overly broad tissue search in essential revisions 3 and 5, and our decision to address this question using MASH in essential revision 3.

      4) It looks quite striking that only 6% of the putatively causative genes are identified by TWAS with the correct effect direction. But I think this number is slightly misleading as one may interpret it as only 6% of the functionally relevant genes are regulated by trait-associated variants. In fact, 46% of the genes are detected by TWAS but only 11% are confirmed in their selected tissues, among which about half (5/9) have correct effect direction. First, the result could be limited by the selection of relevant tissues, as discussed above. Second, the fact that half of the genes do not show correct effect direction may reflect a nonlinear relationship between expression and trait, or the presence of cell-type heterogeneity within a tissue. These may not necessarily overturn the assumption that these genes are regulated by trait-associated variants in the causal tissues or cell types.

      In our initial submission, we had been reluctant to expand the list of tissues for two reasons. First, increasing from the small number of tissues with known biological relevance to all tissues (or all non-brain tissues) increases the multiple-testing correction burden. Second, and, in our eyes, more important, colocalizations in tissues without clear biological relevance are not biologically interprable. Such hits can be results of complicated genetic architecture (e.g. shared eQTLs), power differences in tissues with correlated expression, or biology not directly related to the trait in question.

      That said, the tissue data we have access to are incomplete, and we are without question missing some relevant tissues. Additionally, some relevant tissues have lower sample sizes, and thus lower power, than tissues that are not relevant but may still share eQTLs. To overcome these problems, we applied Multivariate Adaptive Shrinkage (MASH), a Bayesian method that detects correlations between different (in this case tissues) and uses them to produce posterior estimates of summary statistics in each tissue (Urbut et al. 2019). Unlike meta-analysis, which produces one result, the effect size estimates for each tissue are distinct, though informed by one another.

      Using MASH has a pronounced effect on colocalization results. The number of non-putatively causative genes colocalizing increases from 389 to 489, while the number of putatively causative genes in our Mendelian set is unchanged, remaining at 2. The number of genes from the Backman et al. set increases from 2 to 5. Though this is a proportionally large increase, it still represents a small fraction of genes. We have updated our paper to use these results—which should be less dependent on the tissues we selected—but the message has not changed.

      5) While they highlight the roles of alternative regulatory mechanisms, few testable hypotheses are put forward for the field, which is somewhat disappointing but understandable given how little we know about the human genome at the mechanistic level.

      We have added a set of models that may explain the “missing heritability” to Table 4 in the discussion. Though we do not propose experiments, we have included citations for research relevant to confirming or disproving these models.

    2. Reviewer #3 (Public Review):

      Connally et al investigated a central question in complex trait genomics - what's the main mechanism that mediates the effects of trait-associated variants in non-coding regions, which harbour most of the signals identified by genome-wide association studies (GWAS). It is widely perceived that these variants affect trait phenotypes by regulating expression of genes in cis that are functionally relevant to the trait. The authors argue that this is not true because they find limited evidence of linking the trait-associated non-coding variants to a set of putatively causative genes that are known to cause the severe form of the complex trait. The authors discussed four possible explanations to their observations. They argue that incorrect assumptions and lack of statistical power are not likely to be critical, withhold their judgment on the biological context, and claim that the most convincible explanation is the existence of alternative regulatory mechanisms. This conclusion is very important and sobering if it is true because it will inform where to invest the most efforts in the future GWAS.

      It is an interesting idea of using genes of known roles in the "Mendelian forms" of the cognate complex traits as true positives to investigate the biology of non-coding variants. The analyses are done carefully. The discussion of the results is sharp, stands high, and provides lots of food for thought. My major comments lie in the strength of support of their results for the conclusion of "missing regulation" likely attributed to alternative regulatory mechanisms. The results presented seem to also support the biological context hypothesis that non-coding variants regulate gene expression in a tissue or cell type-specific manner.

      Major comments:

      The positive results are substantially reduced when restricting the analyses to a set of selected tissues of relevance to the trait. Isn't it implicated that the selection of relevant tissues in this study is not comprehensive, and further, tissue specificity is common in mediating genetic effects by gene expression?<br /> First, it seems some apparently relevant tissues are not selected (Table 2), such as bone for height (Finucane et al. 2015 NG). One approach to assess the relevant tissues for the predefined set of putatively causative genes is to see if these genes are enriched in the differentially expressed gene sets for those tissues. Second, among 84 putatively causative genes overlapped with GWAS signals, they identified 39 genes by TWAS, 11 genes by fine mapping with linear distance to chromatin modification features, and 41 genes by fine mapping with ChromHMM enhancer annotations, but these numbers reduced substantially to 9, 5 and 27 when restricting the same analysis to the selected tissues for each trait. If genes function only in the relevant tissues, I think using bulk expression data would lose power but is unlikely to give false positives. Thus, it is possible that for the traits analysed, not all relevant tissues are selected so that only a fraction of genes identified in bulk expression analysis can be replicated in the tissue-specific analysis. This appears to me a notable piece of evidence to support the hypothesis of biological context that the authors tend to have reservations in discussion.

      How much do both LD differences between GWAS and eQTL samples and the presence of allelic heterogeneity contribute to the observed low colocalization rate?<br /> One of their main findings is the low colocalization between trait-associated variants and eQTL in non-coding regions, which accounts for only 7% of the putatively causative genes. In discussion, the authors believe that this finding cannot be explained by lack of statistical power and is directly supported by a Bayesian analysis which reported high posterior probabilities of distinct signals for GWAS and eQTL. I agree that power is probably not a big issue. However, my concern is that given the large difference in sample size between GWAS and GTEx datasets, any small differences in LD between the two samples might cause a statistical separation of the signals even when trait phenotype and gene expression truly share a causal variant. Moreover, the presence of more than one causal variant with allelic heterogeneity in the locus may also play a part in the failure of colocalization. Consider two causal variants for the complex trait, one regulating the target gene and the other regulating another gene in co-expression. Potentially, the presence of the second causal variant would diminish the colocalization probability at the target gene.

      Perhaps the authors can perform some simulations to quantify the influence of tissue-specific expression effects, LD differences between eQTL and well-powered GWAS, and allelic heterogeneity, as discussed above, on their analyses. I understand that the authors may not be willing to do as it would involve a lot of work. But I'd like to see at least some discussion on how these questions can be better addressed in the future research.

      It looks quite striking that only 6% of the putatively causative genes are identified by TWAS with the correct effect direction. But I think this number is slightly misleading as one may interpret it as only 6% of the functionally relevant genes are regulated by trait-associated variants. In fact, 46% of the genes are detected by TWAS but only 11% are confirmed in their selected tissues, among which about half (5/9) have correct effect direction. First, the result could be limited by the selection of relevant tissues, as discussed above. Second, the fact that half of the genes do not show correct effect direction may reflect a nonlinear relationship between expression and trait, or the presence of cell-type heterogeneity within a tissue. These may not necessarily overturn the assumption that these genes are regulated by trait-associated variants in the causal tissues or cell types.

      While they highlight the roles of alternative regulatory mechanisms, few testable hypotheses are put forward for the field, which is somewhat disappointing but understandable given how little we know about the human genome at the mechanistic level.

    1. Reviewer #1 (Public Review):

      As an m6A reader, YTHDC1 is known to affect the processing of RNA post-transcriptionally and this article attempted to relate this function in splicing and nuclear export to defects in muscle regeneration after acute injury using LACE-seq. Mechanistically, they provided evidence on m6A-YTHDC1 participation in modulating splicing and target export in myoblast. Additionally, the authors preliminarily confirmed the interaction of YTHDC1 with several key RNA processing factors such as hnRNPG1 to suggest a possible mechanism for m6A-YTHDC1 regulating splicing. Overall it provides new insight into YTHDC1 function in regulating SC activation/proliferation, although some of the data could be improved to fully support the conclusions.

      1. The title "Nuclear m6A Reader YTHDC1 Promotes Muscle Stem Cell Activation/Proliferation by Regulating mRNA Splicing and Nuclear Export" seems a bit overstated. Their data are not sufficient to show YTHDC1 regulating nuclear export. From figure 6 we could see some mRNAs export was inhibited upon YTHDC1 loss but intron retention also occurs on these mRNAs, for example, Dnajc14. Since intron retention could lead to mRNA nuclear retention, the mRNA export inhibition may be caused by splicing deficiency. From the data they provided we could not draw the conclusion that YTHDC1 directly affects mRNA export. I think they should not emphasize this point in the title.

      2. The mechanism of YTHDC1 promoting muscle stem cell activation/proliferation is not solidified. The authors could strengthen their evidence through bioinformatics analysis or give more discussion. Besides, the previous work done by Zhao and colleagues (Zhao et al., Nature 542, 475-478 (2017).) reported another m6A reader Ythdf2 promotes m6A-dependent maternal mRNA clearance to facilitate zebrafish maternal-to-zygotic transition. Does YTHDC1 regulate mRNA clearance during SC activation/proliferation? The authors should explore this possibility by deep-seq data analysis and provide some discussion.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Major #1

      This study primarily uses the genetic mouse model in which LSD1 gene is inactivated after tamoxifen injection in 8 weeks old mice, as shown in supplemental figure 1 B and C. 8 weeks after birth postnatal growth of muscle is not complete and the contribution of satellite cells to muscle growth is still significant. Therefore the timing of tamoxifen injection used cannot discriminate if the observed phenotype involves the function of LSD1 during the post-natal growth of the muscle or in the muscle fibers or both. One way to demonstrate the real contribution of LSD1 in the maintenance of muscle fibers plasticity under environmental stress would be to inject Tamoxifen later (around 10-12 weeks of age), in order to remove a possible bias caused by the contribution of satellite cells during the post-natal growth. At least key findings should be confirmed at this later stage.

      In this study, we used ACTA1-CreERT mice to conditionally knockout LSD1 in the skeletal muscle. The ACTA1 promoter is derived from human muscle actin gene, which is not expressed in the satellite cells, and has been widely used for the transgene expression in myofibers (Stantzou et al. Development 2017). Thus, the inactivation of LSD1 occurs in the existing myofibers, and alterations in satellite cell function, if any, would be indirect effects of the loss of LSD1 in mature myocytes or differentiating myoblasts.

      To test whether postnatal muscle growth was affected in our LSD1-mKO mice, we administrated tamoxifen (4OHT) to pre-weaning mice (11 days old). LSD1 depletion did not affect the expression of muscle fiber genes, when muscle tissues were isolated from mice 11 days after the start of 4OHT (Additional Data).

      These evidences exclude the contribution of satellite cells in the phenotypes observed in the LSD1-mKO mice. __Additional Data __will be added in the revised manuscript.

      Major #2

      LSD1 m-KO muscles seem to have more type I and IIA fibers than WT, even without DEX treatment. Is it possible to quantify the results in supplemental figure 4C?

      As suggested, we quantitatively analyzed the fiber type compositions in Supplemental Fig. 4C using the data from WT (n=4) and LSD1-mKO (n=5) mice (Additional Data). We did not find a significant difference between these mice, confirming our finding that the loss of LSD1 accelerates the Dex-driven phenotypic changes. __Additional Data__will be added in the revised manuscript.

      Major #3

      The effect on fiber type is convincing, while variations in gene expression are of quite low amplitude. However, the atrophy should be induced by other means to ensure that the effects are specific to GC/nuclear receptors pathways; Denervation? Starvation? Not all the experiments need to be repeated, just key results such as, for example, exacerbation of atrophy in LSD1 m-KO, Foxk1 increase.

      We agree that testing alternative atrophy models is important for generalizing our findings. For this, we employed a model for diabetes-related muscle atrophy. A pro-diabetic agent streptozotocin (STZ) disturbs the function of pancreatic islet leading to fast-fiber atrophy (O’Neill et al. Diabetes 2019). LSD1-mKO did not affect the muscle weight in STZ-treated mice (Additional Data). Consistently, there were no major difference in the expression of atrophy genes in STZ-treated WT and LSD1-mKO mice (Additional Data). These results suggest that the LSD1 function depends on the source of atrophy-inducing stress, and that the loss of LSD1 sensitized the muscle to GC-mediate signaling. Additional Data will be added in the revised manuscript.

      Major #4

      Autophagy data: the effect on the LC3I/LC3II ratio are modest. The autophagy part should be removed or completed with additional data to convincingly show that autophagy is affected. Links between LSD1 and mTOR have been published, so the mTOR pathway could be investigated in the model (S6k, S6 and 4EBP1 phosphorylation). Given AKT levels and phosphorylation are affected by the absence of LSD1 + DEX, it can be predicted that mTOR activity will change.

      We have analyzed the expression of additional autophagy markers p62 and phosphorylated 4EBP1. Consistent with the upregulated expression of atrophy genes and increased LC3I/II ratio, LSD1-mKO mice had elevated levels of p62 and phosphorylated 4EBP1 (Additional Data). Altogether, the data suggest that Dex-induced muscle atrophy was exacerbated by the loss of LSD1. Additional Data will be added in the revised manuscript.

      Major #5 The ability of LSD1 to retain FOXK1 in the nucleus is an important information that should be better supported experimentally. In the absence of such information, no mechanism can be proposed for the effect of LSD1 of FOXK1. The immunofluorescence images provided are not convincing and moreover they could be interpreted by a reduction in the level of FOXK1 protein (degradation?) rather than by a nuclear exclusion in the presence of DEX. This point should be addressed, authors could include western blot of nuclear and cytoplasmic fractions to better quantify the nuclear level of FOXK1 in absence of LSD1.

      We agree that performing the suggested experiment would further enhance the quality of our study.

      Major #6 The absence of centralized nuclei indicates that there is no fiber regeneration but it does not exclude the possibility that satellite cells were recruited to existing fibers and thus participated to hypertrophy. To eliminate this possibility, the average nuclei/cytoplasm volume should decrease if hypertrophy results from increased protein synthesis and not myonuclei accretion. This should be checked.

      We histologically analyzed the sections of Gas muscles after Dex treatment and found that there is no evidence of central nuclei in either WT or KO mice (Supplemental Fig. 4D).

      As mentioned above (Major #1), it is unlikely that the satellite cell function was responsible for the enhanced atrophic phenotype.

      Major #7 The upregulation of ERR____g in the absence of LSD1 is convincing in the VWR conditions. ERR____g level should be evaluated in the sedentary LSD1 KO mice.

      We have analyzed the expression of ERRg in sedentary mice, and found no significant difference between WT and KO mice (Additional Data). This suggests that the loss of LSD1 in combination with VWR training led to the increased expression of ERRg. Additional Data will be added in the revised manuscript.

      Minor #1

      There is a clear difference in the number of mouse replicates between treated (Dex or VWR) and non-treated mice, regardless the genotype. Experiments with non-treated mice lack adequate numbers to make a definitive conclusion. For example, there is a huge spread in the data in Figure 1 B and 4 D. If the number of animals would have been increased, would the absence of difference hold up?

      We increased the number of non-treated animals in Figures 1B and 4B as suggested. Nonetheless, we did not find any significant differences in the muscle weight (Additional Data). These changes will be reflected on original Figures 1B and 4B.

      Minor #2 The authors claim that: "Consistent with the results of the augmented endurance capacity, the Sol muscle in the KO mice showed enhanced succinate dehydrogenase (SDH) staining, indicating that the number of oxidative fibers increased (Figure 4F and Supplemental Figure 8F)". However, supplemental figure 8 D indicates that the number of type I fibers does not change compared to WT. Authors should clarify this statement.

      Indeed, we found that the area of type I fiber but not the number was increased in the LSD1-KO Sol (Fig. 4D and Supplemental Fig. 8D). Because SDH staining reflects the OXPHOS capacity in all fiber types, it is possible that the OXPHOS capacity in the fibers other than type I had been augmented by LSD1-KO. Thus, for clarification, we will change the statement as follows: OXPHOS capacity of Sol was enhanced by the loss of LSD1.

      Reviewer #2

      __Methods

      1__

      The authors used the Cre-lox system with tamoxifen to generate skeletal muscle-specific LSD1 KO mice. It is clear that both the mRNA and protein levels of LSD1 in various muscles were dramatically reduced, but there is still some LSD1 expressed in skeletal muscle, especially in Sol muscle (Supplemental Figure 1C). The author needs to think about whether it is appropriate to use the term "LSD1 knockout" or "LSD1 deficiency".

      We thank the reviewer for this comment. In this study, we crossed LSD1-floxed mice with ACTA1-creERT mice. This enables the deletion of critical exons of LSD1 in mature myocytes and myogenic precursors that have initiated the differentiation program. LSD1 is a ubiquitously expressed gene, and it is known that immature myogenic cells (e.g., satellite cells, Tosic et al. Nat Commun. 2018) and other non-myogenic cells such as hematopoietic and vascular cells abundantly express LSD1 (Kerenyi et al. Elife 2013, Yuan et al. Biochem Pharmacol. 2022). Thus, it is likely that LSD1 expression by these cell types were detected in our whole muscle western blots. We will add these statements in the text for clarification.

      __Results

      2__

      To identify the transcriptional regulators that mediate the regulation of atrophy-associated genes by LSD1, the authors performed motif analyses on the promotor regions of upregulated genes in LSD1-mKO Gas. Based on the results and other reports, they focused on Foxk1 and proved LSD1 and Foxk1 cooperatively regulate the atrophy transcriptome in the presence of Dex. However, Figure 3C showed that a transcription factor Nfatc1 is also reduced in Sol muscle similar to Foxk1. Also, other studies demonstrated that the transcription factor NFATc1 controls fiber type composition and is required for fast-to-slow fiber type switching in response to exercise in vivo. More specifically, NFATc1 inhibits MyoD-dependent fast fiber gene promoters by physically interacting with the N-terminal activation domain of MyoD and blocking recruitment of the essential transcriptional coactivator p300 (Cell Rep. 2014 Sep 25; 8(6): 1639-1648). Furthermore, it has been reported that LSD1 Controls Timely MyoD Expression via MyoD Core Enhancer Transcription (Cell Rep. 2017 Feb 21;18(8):1996-2006. doi: 10.1016/j.celrep.2017.01.078). It is unclear how the authors exclude Nfatc1 for the LSD1-mediated effects in different muscle fibers. Further experiments may be necessary to exclude Nfatc1.

      We thank the reviewer for an insightful comment. In addition to Foxk1, we tested the involvement of NFATc1 in the gene regulation under LSD1-depleted state. We treated C2C12 with an LSD1 inhibitor S2101 in combination with a calcium ionophore that promotes the transcriptional function of NFATc1 by inducing its nuclear localization (Meissner et al. J Cell Physiol. 2007). While LSD1 inhibition promoted the expression of Pgc1a and Myh7, ionophore treatment had no additive effects (Additional Data). Because we found a physical association of Foxk1 with LSD1, we focused on the functional involvement of Foxk1 in LSD1-mediated repression of atrophy genes. We recently performed an ATAC-seq analysis in Dex-treated muscle, and found that the Foxk1 motif but not the NFATc1 motif was enriched in the LSD1-KO-specific open chromatin regions. This data further suggests the significant contribution of Foxk1 in the transcriptional regulation under LSD1 depletion.

      #3

      In figure 3D, only merged images were colored. It would be better to show colored images for Foxk1 and DAPI.

      We will replace the images with the colored ones.

      #4

      Immunofluorescence analysis in C2C12 myotubes showed that Dex exposure reduced the nuclear retention of Foxk1, which was further promoted by the addition of T-3775440, an LSD1 inhibitor (Figure 3D). The author also used Foxk1-KO C2C12 myotubes to prove LSD1 and Foxk1 cooperation to regulate the expression of type I /IIA fiber and atrophy genes in Foxk1-KO cells. Are the effects of LSD1 dependent on Foxk1 or synergistically acting with Foxk1? The treatment of LSD1 inhibitor in Foxk1-KO C2C12 may be helpful to answer this question.

      As suggested, we will examine the combination effect of LSD1 inhibition and Foxk1-KO. In addition, we will analyze chromatin association of LSD1 in Foxk1-KO cells by ChIP experiments, to test whether the function of LSD1 depends on Foxk1.

      #5

      In supplementary figure 2, body weight in the mKO+Dex group was reduced in comparison to that of WT+Dex. How about the body weight of mKO mice without Dex injection compared to that of WT? This data will be helpful to understand the effect of muscle-specific LSD1 deficiency on whole-body energy balance.

      We measured the body weight of untreated mice, and found that there is no genotype effect (Additional Data). Thus, we think that LSD1-mKO alone does not influence the whole-body energy balance. We will include this data in the revised version.

      #6

      The authors analyzed the size distribution of myofibers and mentioned that large type I and type IIA fibers preferentially increased in the LSD1-mKO muscle, whereas large type IIB + IIX fibers decreased (Supplemental Figure 4, B, E, and F). It is better to show the results of statistics. If no significance were found, it should be mentioned in the result section.

      We have performed statistical analyses on Supplemental Fig. 4E and F, and found that a fraction of large type I fibers was significantly larger in KO mice. This result will be added in the next version.

      #7

      Page 11, To reveal the genes regulated by LSD1 under the VWR condition, the authors performed additional RNA-seq analysis using Sol muscle. The non-hierarchical clustering analysis was informative and showed signaling pathways related to ‘mitochondrion’, ‘mitochondrion organization’, and ‘oxidative phosphorylation’ were altered in the Sol muscle deficient in LSD1 under the VWR condition (Figure 5B). However, it is unclear why they focus on Err-gamma to explain LSD1-KO phenotypes in Sol muscle. Is this gene also derived from RNA seq? It is better to show whether Err-gamma expression is also significantly altered based on RNA seq data.

      Indeed, ERRg was upregulated by LSD1-KO+VWR and was included in the Cluster 6 genes together with the OXPHOS and mitochondria-related genes (Additional Data and Fig. 5A). These data prompted us to focus on ERRg as a potential factor that explains the LSD1-KO phenotype. Additional Data will be included in the revised version.

      #8

      The authors claim that LSD1 serves as an "epigenetic barrier" that optimizes fiber type-specific responses and muscle mass under stress conditions. This claim is derived from the loss of function studies. To generalize the functions of LSD1, the gain of function studies will be also necessary. Adding the characteristics of LSD1 overexpression in C2C12 cells will further improve the quality of the manuscript.

      We agree that the gain of function studies will further strengthen the quality of our manuscript. As suggested, we will perform an LSD1 overexpression experiment using C2C12 cells and analyze the expression of atrophy and fast fiber related genes. Because Esrrg is completely silenced in C2C12 cells, it is difficult to monitor ERRg-mediated gene regulation in these cells. To overcome this, we will use a cardiomyocyte cell line, in which ERRg is functionally involved in differentiation (Sakamoto et al. Nat Commun 2022). We will overexpress LSD1 in these cells and examine whether the expression of ERRg and its downstream targets are altered.

      __Discussion

      9__

      The authors mentioned supplementary figure 10 only at the end of the manuscript of the discussion section (page 15) without a specific explanation of the figures in the result section. The data are important in that LSD1 expression in human muscles declined with age and showed a negative correlation with the expression of the atrophy gene. It should be presented in the result section with a more detailed description.

      We agree that these data are important and need further explanations. We will describe the details in the Results section and move the entire figure to the main figure.

      #10

      There are other studies to examine LSD1 and muscle regeneration or functions (e.g. Nat Commun 9, 366 (2018). ____https://doi.org/10.1038/s41467-017-02740-5____). More discussion to compare the current study and other studies will be necessary.

      We thank the reviewer for this comment. We will add the discussion accordingly.

    1. We think of the key, each in his prison Thinking of the key, each confirms a prison

      Humans are social animals for a reason. It is due to others that we have a sense of ourselves. We compare ourselves to others, and by knowing the differences between us and others, we gradually gain a sense of identity. If we are just alone, without any comparison between ourselves and other people, we can never know the defining characteristics of ourselves. There will be no sun without shadows. And by forming a prison around ourselves, we create a sense of “otherness” that forms a wall between a body of our own and our surroundings. However, as the objective world holds unlimited amounts of truths for us to perceive, Bradley argued that there is fundamentally “no difference between the inner and the outer”. All humans have access to the same amount of information, but what makes us distinct is not “any difference of kind, but only of degree”. In other words, there is an extent to which we perceive the surrounding world. There may be overlaps between the perceptions of mine and that of others, but ultimately, it is the chain of every single person that makes up the whole world. Eliot may have mentioned Bradley’s argument about self-identity to further his opinion on the continued existence of the self after death. The world is made up of a chain of identities, where as one goes away, another one spawns and fills up the spot. There may be millions and millions of overlapping areas, but they are not necessarily the same due to tiny nuances. There is that sense of continuity that transcends bodily boundaries as well.

    2. He who was living is now dead We who were living are now dying

      This section of the Waste Land may be read as a commentary on religion that works on multiple levels, created via allusion to the books of the Bible. The two key lines of this section are ‘He who was living is now dead/ We who were living are now dying’. The first line has much Biblical precedent. In the Book of Revelations, Jesus says ‘I am the first and the last. I am he that liveth, and was dead; and behold, I am alive for evermore, Amen; and have the keys of hell and of death’. The reason that Jesus used to be dead but is now living is because of the Reïncarnation. Eliot’s ‘He’ – an obvious nod to God – is a reversal. It can be read as a Jesus that was never reïncarnated, as he went simply from being alive to being dead, without the final stage, or as having been reïncarnated – so ‘living’ again – but then somehow expired following that. Whichever way we read it, however, present an anti-Biblical view, one in which Jesus is no longer ‘living’. The second line – ‘we who were living are now dying’ – casts ‘us’ (the question arising: is the reader included amongst the ‘we’?) as being in the long, drawn-out, almost timeless process of ‘dying’, but still somehow alive. Therefore: ‘we’ have outlived Jesus. This may be a commentary on religion itself – that is, Jesus does not exist in the modern world – or on failing attitudes towards religion, with the question on the minds of many at the time being ‘how is it possible for both Christian love to exist in the world and such deep suffering?’. Additionally, John 11.25 states ‘he that believeth in me, though he were dead, yet shall he live.’ According to Eliot, however, ‘we’ are dying, perhaps representing the decline of belief. To take this yet further, Psalm 63 begins ‘O God, thou art my God; early will I seek thee: my soul thirsteth for thee’. Therefore, God is here presented as a life-nourishing water. In Eliot’s Waste Land, ‘there is no water’. By contrast, to take the other interpretation: if we, however, have outlived Jesus, then what happens now with the ‘keys to hell and to death’. Are Hell and Death flung open? Is ‘Hell empty, and all the Devils here’ (another potential reference to the Tempest…)? Is tha perhaps why such suffering permeates the world?

      An additional note: few of Eliot’s contemporary readers could have read the line ‘we who were living are now dying’ without thinking of John McCrae’s famous ‘In Flanders Fields, especially those most poignant and emotive of lines: ‘We are the Dead. Short days ago/ We lived, felt dawn, saw sunset glow,/ Loved and were loved, and now we lie,/ In Flanders fields.’ To think of these lines allows us to perhaps transcend the meta-religious interpretation of this passage, to forget God, salvation, belief and faith, but merely to focus on the human: the suffering, the pain, the love lost – the Dead.

    1. The work that we make, McGann tells us, “is not the achievement of one’s desire: it is the shadow of that desire.”[2

      I strongly agree. Often, especially in the industries where art is concerned, what is actually made is just a "watered down" version of many desires. What people may perceive that piece of work as is not all there is to it. For example, persons may look at a painting and say "It's very pretty" while the artist himself viewed it as an entire storyline while trying to put it into object form.

    2. The distance between our wish and our object is often so great

      This is very true noting that our imaginations can often run wild and create the literal most. However, creating these wishes into an object may prove to be challenging because of many factors such as processes that need to be done

    1. Fake news” was actual false news: stories that were blatantly made up, written and shared by people in the US who were economically or politically motivated. Or, in some cases, by Macedonians seeking a paycheck. While the motives may vary, the product is the same: fictional stories.

      This statement was particularly interesting because it makes one think what the true motive of creating false news truly is? As stated, for some it may be a paycheck but there must be a deeper reason and unfortunately we may never know that answer.

    1. General comments:

      This study carefully delineates the role of magnesium in cell division versus cell elongation. The results are really important specifically for rod-shaped bacteria and also an important contribution to the broader field of understanding cell shape. Specifically, I love that they are distinguishing between labile and non-labile intracellular magnesium pools, as well as extracellular magnesium! These three pools are really challenging to separate but I commend them on engaging with this topic and using it to provide alternative explanations for their observations!

      A major contribution to prior findings on the effects of magnesium is the author’s ability to visualize the number of septa in the elongating cells in the absence of magnesium. This is novel information and I think the field will benefit from the microscopy data shown here.

      I completely agree with the authors that we need to be more careful when using rich media such as LB. It is particularly sad that we may be missing really interesting biology because of that! It’s worth moving away from such media or at least being more careful about batch to batch variability. Batch to batch variability is not as well appreciated in microbiology as it is for growing other cell types (for example, mammalian cells and insect cells).

      For me, the most exciting finding was that a large part of the cell length changes within the first 10min after adding magnesium. The authors do speculate in the discussion that this is likely happening because of biophysical or enzymatic effects, and I hope they explore this further in the future!

      I love how the paper reads like a novel! Congratulations on a very well-written paper!

      Kudos to the authors for providing many alternative explanations for their results. It demonstrates critical thinking and an open-mind to finding the truth.

      Specific comments:

      Figure 2C → please include indication of statistical significance

      Figure 3C → please include indication of statistical significance

      Figure 6A → please include indication of statistical significance

      Figure 8B → please include indication of statistical significance

      Figure S1B → please include indication of statistical significance

      Figure S3B → please include indication of statistical significance

      For your overexpression experiments, do the overexpressed proteins have a tag? It would be helpful to have Western blot data showing that the particular proteins are actually being overexpressed. I think the phenotypes that you observe are very compelling so I don’t doubt the conclusions. Western blot data would just provide some additional confirmation that you are actually achieving overexpression of UppS, MraY, and BcrC.

      Questions:

      Based on your data, there are definitely differences in gene expression when you compare cells grown in media with and without magnesium. Because the majority in cell length increase occurs in such a short time though (the first 10min), I was wondering if you think that some or most of it is not due to gene expression? Do you have any hypotheses what is most likely to be affected by magnesium? Do you think if the membrane may be affected?

      Why do you think less magnesium activates this program of less division and more elongation? Additionally why is abundant magnesium activating a program of increased cell division and less elongation? Do you think there is some evolutionary advantage, especially considering how important magnesium is for ATP production?

      Related to this previous question, I also wonder if this magnesium-dependent phenotype would extend to other unicellular organisms, may be protists or algae? That would be a really exciting direction to explore!

      Regarding the zinc and manganese experiments, why do you think they lead to additional phenotypes compared to magnesium? Do you have any hypotheses?

      Regarding your results that Lipid I availability may be a major a problem for the cell division in the absence of magnesium, do you think that is due to effects magnesium has on the enzymes directly, or do you think magnesium affects the substrate availability/conformation by coordinating the phosphate groups? Or something else, may be membrane conformation?

    1. Author Response

      Reviewer #1 (Public Review):

      The authors took advantage of an existing protein-trap resource in zebrafish to identify genes important for normal pacemaker function in adults. They generated a collection of lines with mutation in genes that expressed at reasonably high levels in the heart and assess their ECG. They identified 3 candidates with increased incidence of sinus arrest and focused on validation of dnajb6b. The dnjb6b mutant fish display other defects including enhanced response to atropine and carbacol and bradycardia. They show that dnajb6b is expressed in a subset of cells in the sinus node in zebrafish. In mouse sinus node, DNAJB6 expressing cells have low expression of TBX3 and its target HCN4. In addition, Dnajb6b+/- mice also display similar phenotypes. Analysis of pacemaker function in ex vivo mouse hearts by high-resolution fluorescent optical mapping of action potentials revealed that the number of leading pacemakers in Dnajb6b+/- hearts is decreased in the sinus node, with a concomitant increase in the auxiliary pacemakers. RNAseq analysis of the right atrial tissues detected expression changes in ion channels and genes involved in Ca2+ handling and Wnt signaling. Overall, the results support the conclusion that DNAJB6 is important for proper sinus node function, thus adding it to the short list of sick sinus syndrome genes. However, the manuscript has several weaknesses.

      Weakness:

      The manuscript does not address the mechanism by which decreased DNAJB6B causes sick sinus syndrome. For example, it is unknown if DNAJB6B functions cell autonomously or non-cell autonomously in the sinus node. The RNAseq analysis identified changes in ion channels in the right atrial tissues of 1-year old mice, cellular electrophysiology of the sinus node cells was not assessed.

      The main goal of this research is to prove the feasibility of discovering novel SSS genes in adults via a forward genetic approach in zebrafish. Thus, the major hallmark would be to prove causality and specificity of the candidate genes identified from this screen, such as Dnajb6. Comprehensive mechanistic study would be a focus for future studies.

      Nevertheless, we carried out the following experiments to address the mechanisms. Based on these data, a new section was added to the discussion section (Lines 424-465).

      (1) In mice, we did more antibody immunostaining and confirmed a negative correlation in terms of expression intensity between the Dnajb6 and Tbx3 proteins. We further detected a significantly increased Tbx3 immunostaining signal in the SAN tissues of Dnajb6 heterozygous mice compared to WT controls (new Figure 3D-F).

      (2) In zebrafish, we compared expression patterns of the sqET33-mi59B conduction system reporter line between the GBT411/dnajb6b heterozygous and homozygous mutants. We found the atrio-ventricular canal (AVC) signal became diffused in GBT411/dnajb6b homozygous adult hearts. In addition, the ring-like structure usually seen in the SAN region of WT controls and in the GBT411/dnajb6 heterozygous was largely lost in 3 out of 9 GBT411/dnajb6b homozygous adult hearts examined (new Figure 2).

      Together with the ectopic pacemaker activity detected in the Dnajb6 heterozygous mice (new Figure 5A and 5B), we speculate that Dnajb6 might act as a suppressor of Tbx3 transcription factor in defining cell fate specification into SAN pacemaker myocytes. Since Tbx3 was reported to suppress chamber myocardial differentiation (Mommersteeg et al., Circ Res. 2007;100(3):354-62), upregulation of Tbx3 may thus contribute to enhanced atrial ectopic activity in Dnajb6 heterozygous mice.

      Furthermore, TBX3 has been recently identified as a component of the Wnt/β-catenin-dependent transcriptional complex (Zimmerli et al., eLife. 2020;9:e58123), which is significantly affected in Dnajb6 heterozygous mice (see new Figure 7B-C). This further supports a possible role of TBX3 in both SAN and atrial remodeling.

      (3) Finally, in collaboration with Drs. Grandi, Morotti, and Ni from University of California Davis, we utilized a population-based computational modeling approach to determine the cellular/ionic mechanisms that could underlie the ex vivo observed SSS phenotype in the Dnajb6 heterozygous mice (new Figure 6). We used our previously published model of the mouse SAN myocyte (Morotti et al. Int J Mol Sci. 2021; 22(11):5645) and enhanced it with addition of both sympathetic and parasympathetic stimulations to model the effects of isoproterenol- and carbachol-induced changes in pacemaker activity (i.e., firing rate), respectively. We generated a population of 10,000 mouse SAN myocyte models by random modification of selected model parameters describing maximum ion channel conductances and ion transport rates from the baseline model and assessed isoproterenol- and carbachol-induced effects on each model variant. We then separated this population of models in two subpopulations representing the WT and Dnajb6+/- mice phenotypes: namely, we extracted the model variants that recapitulate changes observed in Dnajb6+/- vs. WT mice, including a reduced firing rate at baseline, an increased response to isoproterenol, and a decreased response to carbachol administration (new Figure 6). This filtering process resulted in n=438 models that correspond to the Dnajb6+/- mice phenotype and n=6,995 models that correspond to the WT phenotype. We analyzed the parameter value differences in these two subgroups to revealed several crucial parameters that are significantly correlated with the observed electrophysiological changes. The analysis revealed a significant decrease in the maximal conductances of the fast (Nav1.5) sodium current, the L-type Ca2+ current (ICa,L), the transient outward, sustained, and acetylcholine-activated K+ currents, the background Na+ and Ca2+ currents, as well as the ryanodine receptor maximal release flux of the Dnajb6+/- vs. WT model variants. We also found a significant increase in the Na+/Ca2+ exchanger (NCX) maximal transport rate, and conductances of the T-type Ca2+ current and the slowly-activating delayed rectifier K+ current. These new studies provide some novel mechanistic insights into the observed SSS phenotype in Dnajb6+/- mice. Importantly, these new in silico experiments add another conceptual level to the phenotype-based screening approach introduced in the current study to identify new genetic factors associated with SAN dysfunction. Direct testing of these mechanisms would require a substantial amount of single SAN cell patch clamp and confocal microscopy experiments which are out of scope of the current manuscript and will be pursued in a follow-up study.

      The manuscript does not address why the zebrafish homozygous mutants are adult viable while the mouse homozygotes are embryonic lethal. The insertion of the GBT411 disrupt dnajb6b(L) but not dnajb6b(S), while the mouse mutation deletes the entire gene. Does this difference partially explain the difference?

      Indeed, the difference between zebrafish and mouse can be partially explained by the fact that only the long isoform of dnajb6b gene, dnajb6b(L), was disrupted in the GBT411 mutant, while both the long-Dnajb6(L) and short-Dnajb6(S) isoforms of Dnajb6 gene was largely deleted in the Dnajb6 knockout mice. However, we think the main reason is probably that functional redundancy in zebrafish but not mouse: zebrafish has two dnajb6 homologues, dnajb6b and dnajb6a, while mouse has only one Dnajb6 homologue. We added these points to the paper (Lines 377-379).

      Reviewer #2 (Public Review):

      In this manuscript, the authors expand upon previous work describing development of a protein trap library made with the gene-break transposon. This library was screened to identify lines displaying gene trap expression in the heart (zebrafish insertional cardiac mutant collection). A pilot screen of these lines using adult ECG phenotypes identifies dnajb6b as a new gene important for cardiac rhythm. Using the GBT/dnajb6b zebrafish line, Ding et al. find a proportion of aged homozygous mutant fish (1.5-2 years) present sinus arrest episodes and reduced heart rate. Treating GBT411/dnajb6b mutant adults with compounds revealed aberrant responses to autonomic stimuli, and sinus arrest episodes were induced following verapamil exposure, providing evidence that GBT411/dnajb6b as an arrhythmia mutant. This conclusion could be better supported by presenting specific ECG parameters to characterize the conduction defect more thoroughly. The authors then report that Dnajb6+/- adult mice recapitulate some of the phenotypes observed in zebrafish, including sinus arrest and AV blocks, as well as impaired (although different) responses to autonomic stimuli. The authors describe that these are features of sick sinus syndrome in the absence of cardiomyopathy phenotypes in either the zebrafish or mouse lines. However, overall cardiac morphology is not well described for either the GBT411/dnajb6b or Dnajb6+/- models.

      We carried out more experiments to examine left ventricular (LV) structure in Dnajb6 heterozygous mice at 1 year of age, using H&E staining, Masson’s trichrome staining, and transmission electron microscopy (TEM) analysis. We now show clearly that there are no significant myocardium structural changes in the LV as well as atrial and SAN tissues of Dnajb6 heterozygous mice (new Supplemental Figures 3 and 5), when the SSS phenotype was already noticeable. However, in the GBT411/dnajb6b heterozygous mutant at ~2 years of age, we detected severe sarcomere structural abnormality in 1 out of 3 fish hearts examined (see Response-only Figure 1). In addition, in a previous publication (Ding et al., Circ Res, 2013:112(40:606-17), we reported evident cardiac remodeling phenotypes in the GBT411/dnajb6b homozygous fish at 12 months of age.

      Together, we have obtained more experimental evidence to strengthen the claim that arrhythmia is not due to cardiomyopathy/structural remodeling in the Dnajb6+/- mice. However, the evidence from fish remains weak. Therefore, we removed the claim that “when structural remodeling/cardiac dysfunction have not yet occurred” in fish and modified our statement in mice accordingly (Lines 372-377, 385-386).

      To further support a role for Dnajb6 in sinoatrial node dysfunction, the authors performed optical mapping of action potentials from isolated mouse atrial tissue. These data reveal that Dnajb6+/- cultures exhibit ectopic pacemakers outside of the sinoatrial node, including within the atrial wall and inter-atrial septum. These data also show prolongation of SAN recovery time at baseline and following autonomic stimulation, further suggesting SAN dysfunction. RNA-sequencing experiments of DNAjb6+/- adult right atrial tissue showed differentially expressed genes encoding Ca2+ handling related proteins, ion channels, and WNT pathway related proteins. As these genes are involved in the cardiac conduction system, the authors suggest these pathways as molecular mechanisms underlying SSS phenotypes in Dnajb6 models.

      Sick sinus syndrome is a relatively rare arrhythmia most commonly found in older populations. Therefore, it has been challenging to establish clinically relevant models and there is a limited understanding of mechanisms of SSS pathogenesis. One particular strength of this manuscript is the ECG phenotype-based forward screen of the gene-breaking transposon (GBT)-based gene trap library in aged animals. This pilot study provides proof-of-concept that this screening approach is well suited to identify regulators of cardiac function in adults and genes linked to adult diseases like SSS.

      Thank you very much for recognizing the major strength of our manuscript!

    1. Why are Georgians Nostalgic about the USSR? Part 1 Several surveys in recent years suggest that close to half of the Georgian public considers the dissolution of the USSR a bad thing. After nearly 30 years since gaining independence, why do so many Georgians look back with nostalgia towards the Soviet Union? Reasons for Soviet nostalgia in other contexts are usually associated with how people experienced transition from state socialism to capitalism. The economic hypothesis explaining nostalgia argues that a perception of being part either “a winner” or “a loser” of the transition is associated with nostalgic feelings towards the Soviet Union. Other hypotheses introduce politics into the equation. According to this explanation, those who reject democracy on ideological grounds are more likely to be nostalgic as are those who think that democratic institutions are too feeble in delivering state services. Are these explanations true for Georgian Ostalgie? This series of blog posts explores these and other potential explanations to Soviet nostalgia.The 2019 Caucasus Barometer survey asked respondents whether the dissolution of the USSR was a good or a bad thing, as well as the reasons why. Respondents were considered nostalgic if they reported that the dissolution was a bad thing. However, it is worth keeping in mind the exact wording of the question when reading the analysis. Overall, 42% of the public think that the dissolution of the USSR was a bad thing, and a statistically indistinguishable share (41%) report it was good, leaving about 16% who were not sure.When it comes to why it was a bad thing, by far, the most common reason is that respondents believe that people’s economic situation has worsened. And they’re not necessarily wrong.Georgia had a particularly difficult economic transition during independence. Overall purchasing power is much higher today than before the transition, however, it only recovered to pre-transition levels in 2006 according to World Bank data.At the same time, average purchasing power hides the high levels of economic inequality in Georgia. Inequality increased from an estimated GINI of 0.313 in 1988 to 41.3 in 1998. In 2018, it stood at 37.9 according to the World Bank data. Concomitantly social services were cut.This likely explains why a majority of respondents that are nostalgic report that the economic situation has worsened to explain why they think the dissolution of the Soviet Union was a bad thing. The fact that some respondents directly cite a lower number of workplaces as a reason for believing that the dissolution was a negative thing, attests to this. The second most common reason is related to the conflicts that followed independence and the lost territories.What sets nostalgic Georgians apart? A logistic regression model looking at attitudes towards democracy, Russia, political party preferences, and a number of demographic measures suggests a number of characteristics. Age is an important predictor, with older people being considerably more nostalgic.Education also appears important, as individuals with more education are less likely to be nostalgic. Wealth has a less clear role, appearing only slightly relevant for overall attitudes, and more relevant when we look at those citing economic reasons for their attitude. This suggests that those who regret the dissolution of the USSR are those who suffered the most during the transition. This also suggests that as the economy improves and newer generations come of age, nostalgia towards the USSR may decline.While age, education, and wealth are relevant, they are not the only factors. Attitudes towards democracy and towards Georgia’s orientation to Russia also seem to separate nostalgics from non-nostalgics. Those who believe that Georgia should forego NATO and EU membership in favor of closer ties to Russia as well as those who think that Georgia is not a democracy and that democracy is not necessarily the best form of government, are more likely to also believe that the dissolution of the USSR was a negative thing.Similar patterns emerge when disaggregating the reasons for nostalgia, with wealth being more relevant for those who mentioned the worse economy as a reason for nostalgia. Interestingly, feeling close to a particular political party does not seem to be relevant for these attitudes, once other factors are held constant. One exception is when looking at identity-related responses for the attitudes. Respondents who feel close to pro-western opposition parties are less likely to believe that the dissolution of the USSR was a bad thing because ties with other nationalities became less common, travel to other former Soviet Republics became harder, or for people judging each other because of their identity. Ethnic minorities in Georgia are more likely to report these reasons than ethnic Georgians.Nostalgia towards the USSR seems to be primarily related to an individual’s experience of the transition, and their current attitudes towards democracy and Russia. This connection might suggest that skepticism towards democracy and the West is related to individuals’ experiences of the transition. However, more direct analysis of attitudes towards democracy is needed to test this idea.
      აღნიშნული ბლოგი არის იმის შესახებ, თუ რატომ არიან ქართველები ნოსტალგიურად განწყობილნი საბჭოთა კავშირის მიმართ. ავტორი გვთავაზობს რამდენიმე მიზეზს სსრკსადმი ქართველი ერის დადებითი დამოკიდებულების საილუსტრაციოდ. უპირველესი მიზეზი ამ კეთილგანწყობის არის ის, თუ როგორ გამოსცადა ქართველმა ხალხმა სოციალისტური  წყობილებიდან კაპიტალისტურში გადასვლა. ბლოგში აღნიშნულია, რომ ქართველებმა ძალზედ განიცადეს საბჭოთა კავშირის დაშლა, ვინაიდან მათი ეკონომიკური მდგომარეობა გაუარესდა. 
        მეორე მიზეზი კი არის ის, რომ საბჭოთა კავშირის დაშლამ უარყოფითად იმოქმედა საქართველოს შიდაპოლიტიკურ ცხოვრებაზე. ბლოგში ნახსენებია ის კონფლიქტები, რაც მოჰყვა დამოუკიდებლობის მოპოვებასა და სახელმწიფო ტერიტორიების დაკარგვას. საყურადღებოა ისიც, რომ სსრკ-ს მონატრებას გრძნობს ძირითადად ძველი თაობა. ახალი თაობა კი განათლების ძალით ხვდება, თუ რატომ არ არის საბჭოთა კავშირში ნოსტალგიის სამართლიანი საფუძველი. 
         ჩემი დამოკიდებულება ამ საკითხის მიმართ აშკარაა. ვინაიდან და რადგანაც, მე მივეკუთვნები საქართველოს იმ ახალგაზრდა თაობას, რომელიც დაიბადა და ცხოვრობს დამოუკიდებელ საქართველოში, იოტისოდენა სურვილიც არ მაქვს მენატრებოდეს საბჭოთა კავშირი. ის ბოროტების იმპერია, რომელიც ტოტალიტარულად მართავდა მცირე ერებს, ქვეყნებს. მიკვირს, როგორ შეიძლება მისტიროდე იმ დესპოტურ რეჟიმს, რომელიც ხალხს მუდმივ ტერორში ამყოფებდა და თავისუფალი აზრის ნებისმიერ გამოვლინებას სასტიკად უდგებოდა? ვფიქეობ, განათლების როლი ამ საკითხში ყველაზე დიდია. ერუდირებული ადამიანი მოვლენებს სწორად აფასებს და შესაბამისად, ნაკლები შანსია მისი მხრიდან სსრკ-სადმი ნოსტალგიის.
      
    1. the reason is that a perception 00:10:38 is kind of perceptual in structure and the buddhist world encodes this by arguing that the internal um sense the the manus venana is a sense faculty just like external faculties 00:10:52 and so just as our external faculties present us with a world that just seems to us even though we know it's not to be just as it is that we see it just as it is 00:11:03 it's tempting to think that we've got this apparent object distinct from our sensory apprehension of it but is but an object that's presented by a completely veritable process 00:11:15 because as i say perception just feels like it presents the world to us as it is i look at a red apple and i think damn i know exactly what that apple smells like looks like tastes like and 00:11:27 feels like forgetting that all i have is the apple as it's mediated by the peculiar perceptual system that i have and by all of the conceptual resources through which i filtered my perception 00:11:41 so in the same way a perception or introspective awareness just feels like it presents our own cognitive affective and perceptual states to us just as they are 00:11:53 independent of that appreceptive system and those conceptual categories so just as external perception gives us the illusion that we're just detectors of the world as it is inner perception can give us the illusion that we are just 00:12:06 detectors of our inner um our inner world just as it is so even when we remind ourselves as i'm reminding you right now of this 00:12:18 extremely complex mediation of our perceptual encounter with external objects we find ourselves in constantly experiencing our own experience as though 00:12:31 we've got the world just as it is and then we sometimes say okay maybe we're not getting the world just as it is but at least i'm getting my sensory experiences just as they are the apple might not be red but the redness i 00:12:42 experience is exactly the redness that i think i experience the sweetness that i introspect must be the sweetness just as it is and so forth so even if we give up for a moment and it's hard to give it up 00:12:54 for more than that the notion of immediacy with regard to external perception we often retreat to thinking that that's mediated but my awareness of my own inner episodes is the immediate 00:13:06 awareness that mediates my knowledge of the external world and i think that in the sense of that perception that sense of immediacy is even greater it's really hard for us to be convinced that our inner experience 00:13:20 could possibly be deceptive we seem to think that if i think that i believe something i must believe it if i think that i'm feeling something i must be feeling it and that feeling and that believing grab my inner 00:13:33 reality just as it is and so part of the problem that arises is that the mediation of our introspective awareness by our introspective faculty becomes 00:13:46 cognitively invisible to us just as what i'm seeing the world my visual faculty is invisible and it just delivers a visible world to me and i have to really think to to understand 00:13:58 what my own visual faculty visual organ and visual consciousness are contributing i think i experience my introspective faculty as just giving me inner objects and i have to think and remind myself 00:14:11 that actually my inner sense faculty is also a fallible instrument and that i may be misusing that instrument or that instrument might be intrinsically deceptive and that's a hard thing to get one's mind around 00:14:25 as a consequence we've become seduced by this idea that even if our knowledge of some things is mediated that mediation can't go all the way down we get seduced by the idea that there's got to be a 00:14:38 basic foundational level of experience to which we can have some kind of immediate access and to which when we know it we know it absolutely veritically in the theory of knowledge that leads us to foundationalism in the 00:14:51 philosophy of mind it leads us to sense datum theory um and i find that in a lot of buddhist situations a lot of buddhist practitioners take it to be this idea of an infallibility of an immediate kind of 00:15:03 experience if i'm sitting on the cushion just right so with all of that in play um i want to move to exercising that myth of the given that i've been characterizing 00:15:16 and to show that buddhist philosophy offers us powerful ways of doing that and i'm going to begin by talking about first person knowledge through the lens of the madhyamaka tradition

      Jay emphasizes the compelling sense of this allure of immediacy. We believe that our perceptual and our introspective faculties give us an infallible representation of reality, and never question that it could be fallible.

      This is very much aligned with the research on Umwelt by Jakob Von Uexkull.

      Aperception, the introspection and awareness of our inner space is just as alluring.

      So in summary: perception gives us the feeling that we are sensing the way the external world actually is and aperception gives us the feeling that we are aware of the inner world as it is. However, both are relative, the first to our peculiar sense faculties and the second to our linguistic and conceptual modeling of reality. Both are specific filters that create the specific situated interpretation of reality as a human being.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01536

      Corresponding author(s): Michael Glotzer

      [Please use this template only if the submitted manuscript should be considered by the affiliate journal as a full revision in response to the points raised by the reviewers.

      If you wish to submit a preliminary revision with a revision plan, please use our "Revision Plan" template. It is important to use the appropriate template to clearly inform the editors of your intentions.]

      1. General Statements We thank the reviewers for their thoughtful and helpful comments. In general, the reviews were highly positive, although their reviews indicated parts of the manuscript that needed further clarification. We have made extensive changes that improve the clarity and rigor of this submission. We have performed several additional experiments which have extended our analysis in several ways detailed below. None of the conclusions have changed.

      The following is a list of eight major changes implemented during the revisions. Point-by-point responses to the reviewers comments follow on subsequent pages.

      1. The reviews made clear that we needed to more explicitly discuss the AIR-1 depletion phenotype. This phenotype is complex, it does not result in a complete loss of asymmetry, unlike, for example, depletion of the centrosome component SPD-5. This is because, in AIR-1 depleted embryos, a PAR-2 and cortical flow-dependent pathway induces PAR-2 accumulation at both anterior and posterior poles that induces flows from each pole to the lateral region (Reich 2019, Kapoor 2019, Zhao 2019, Klinkert 2019; PMIDs 31155349, 31636075, 30861375, 30801250). These flows also modulate ECT-2 localization. To clarify this point which came up in multiple reviews, we now include an explanation of the complexity of the AIR-1 phenotype and we present an analysis of ECT-2 localization in embryos depleted of both AIR-1 and PAR-2.

      In addition to the 95% confidence intervals that were present on our graphs, we now include indications of the results of statistical tests of significance to the results of different treatments.

      We have revised the analysis ECT-2 accumulation in two ways. First, in the previous draft, we assessed the anterior accumulation over the anterior 40% and the posterior 15% of the embryo. We have revised this analysis comparing the anterior and posterior 20% of the cortex, respectively. This is simpler and more logical in contexts where embryos are symmetric. In addition, we altered the measurements of the length of the posterior boundary. Previously we used a common threshold value, below which we counted pixels to assess boundary length. During the revisions, we noticed that this value was not appropriate for our mutant transgenes which accumulated to higher levels. Therefore, we revised our analysis pipeline such that, for each embryo, we measure the average intensity of the cortex in the anterior 60% of the embryo. We set a threshold of 0.85* this average anterior intensity value. As before, cortical positions below this threshold contribute to the boundary length. This is a more robust and simpler means of evaluating the size of the posterior domain. Neither of these changes affect any of our conclusions, but they are simpler and more rigorous.

      Most of our figures include quantification of the degree of ECT-2 asymmetry as well as the average anterior and posterior accumulation of ECT-2 as a function of time. While the images show the intensity profiles across the embryo, previously, we did not explicitly show a quantification of the average intensity of ECT-2 as a function of position along the embryo. A new graph, Figure 2Bv, shows this for control embryos and embryos in which tubulin is depleted and depolymerized. This shows that the MT depolymerization results in lower accumulation at the posterior of the embryo and higher accumulation at the anterior.

      We provide documentary and quantitative evidence that ZYG-9 depletion induces potent cortical flows (Figure 3c and Figure 3, supplement 3), further bolstering the central role of cortical flows in inducing ECT-2 asymmetry.

      As requested by reviewer 2 (R2b), we have included the analysis of ECT-2 distribution in Gα depleted embryos. As expected due to the lack of spindle elongation, the displacement of ECT-2 from the posterior cortex is greatly attenuated.

      As requested by reviewer 2 (R2d), we now show that ECT-2C fragments accumulate on the cortex in embryos depleted of ECT-2.

      One other important point raised by several reviewers concerns the behavior of the ECT-2 T634E allele. This allele, due to the substitution of a phosphomimetic residue, accumulates on the cortex at about 50% the level of the wild-type version. To investigate the possibility that this quantitative difference was the cause of the phenotype, we depleted both the wild-type and mutant ECT-2 constructs by RNAi (these are the sole sources of ECT-2 in the animals). First, we find that wild-type ECT-2 can be depleted to 20% of wild type levels with only a 13% rate of cytokinesis failure (when T634E is depleted to 20%, embryos fail more than 50% of the time). Thus the two-fold reduction in cortical ECT-2 seen in T634E not likely highly significant (ECT-2 is not haploinsufficient). In addition, embryos with ECT-2 T634E initiate ingression in a timely manner, but the furrows ingress more slowly than wild-type. In contrast, depletion of ECT-2 to 20% results in a delay in furrow initiation, but once these furrows form, they ingress at rates similar rates to wild-type. Thus, the T634E variant exhibits a behavior that is quite distinct from that resulting from a (strong) reduction in the levels of wild-type ECT-2.

      Point-by-point description of the revisions

      This section is mandatory. Please insert a point-by-point reply describing the revisions that were already carried out and included* in the transferred manuscript. *

      (Reviewer comments: italicized 9 pt font, author response: plain text 10 pt font. Numbers have been added to the reviewer comments e.g. R2c=Reviewer 2, third comment)

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary

      R1a* In this study the authors addressed how Ect2 localization is controlled during polarization and cytokinesis in the one-cell C. elegans embryo. Ect2 is a central regulator of cortical contractility and its spatial and temporal regulation is of uttermost importance. After fertilization, the centrosome induces removal of Ect2 from the posterior plasma membrane. During cytokinesis Ect2 activity is expected to be high at the cell equator and low at the cell poles. Similarly to polarization, the centrosome provides an inhibitory signal during cytokinesis that clears contractile ring components from the cell poles. Whether and how the centrosomes regulate Ect2 localization is not know and investigated in the study. *

      This is an accurate summary of the goals of this study.

      R1b *The authors start by filming endogenously-tagged Ect2 and find that Ect2 localizes asymmetrically, with high anterior and low posterior membrane levels during polarization and cytokinesis. They reveal that the centrosome together with myosin-dependent flows results in asymmetric Ect2 localization. Previous studies had suggested that Air1, clears Ect2 from the posterior during polarization and the authors expand those finding by showing that Air1 function is also required to displace Ect2 from the posterior membrane during cytokinesis. *

      *To elucidate if Ect2 displacement is induced by phosphorylation of Ect2 by Air1, the authors investigate the localization of a C-terminal Ect2 fragment containing the membrane binding PH domain. When the predicted Air1 phosphorylation sites are mutated to alanine, the Ect2 fragment still localizes asymmetrically but exhibits increased membrane accumulation. *

      *Finally, they investigate the functional role of Air-1 during furrow ingression. They demonstrate that embryos deficient of Air1 and NOP1 have impaired furrow ingression. Lastly, the authors sought to confirm that there is a direct effect of Air1 on Ect2 function by generating a phosphomimetic point mutation of Ect2 using Crispr. They find that the membrane localization of phosphomimetic Ect2 is reduced and consequently furrow ingression is impaired. *

      This is an accurate summary of our results.

      Major comments

      R1c *It is not convincing that the six putative phosphorylation sites are targeted by the Air1. If Air1 phosphorylation displaces Ect2 from the membrane, a reduction in Ant/Post Ect2 ratio is expected in the phosphodeficient mutants, like after air1 RNAi. However this is not observed for cytokinesis or polarization (Fig. 5D(i); E). This suggests that phosphorylation of those sites is not essential for the asymmetric Ect2 localization. *

      In otherwise wild-type embryos, phosphorylation of these sites is not required for asymmetric ECT-2 localization. Non-phosphorylatable ECT-2 variants exhibit asymmetric localization because these proteins relocalize due to myosin-directed flows. To test the role of phosphorylation, we examine the distribution of ECT-2 and ECT-2C fragments in myosin-depleted embryos in which the flows are blocked, under these conditions, transient local depletion is observed with the phosphorylatable variants, Fig 5E.

      While AIR-1 promotes normal polarity establishment, as shown in several recent papers, cortical changes nevertheless occur in the absence of AIR-1. Specifically, a parallel PAR-2 dependent pathway induces weaker flows from both poles toward the equator. To further substantiate the effect of PAR-2 accumulation on ECT-2 accumulation in AIR-1 depleted embryos, we assayed ECT-2 accumulation in air-1(RNAi); par-2(RNAi) embryos (Figure 4, supplement 2). These results show that ECT-2 is nearly symmetric in these double depleted embryos. In addition we have edited the text to describe the unusual bi-polar PAR-2 accumulation that occurs in AIR-1 depleted embryos.

      R1d *The authors aim to demonstrate that phosphorylation of the identified sites is important for cytokinesis. For this they investigate contractile ring ingression in the phosphomimetic point mutation. Since ring ingression is slower and fails in nop1 mutant they authors conclude that this demonstrates a functional importance of this site. I am not surprised that embryos ingress slower in this mutant since Ect2 localization to the membrane is reduced. This however does not show that this phosphorylation site is the target of the centrosome signal. Importantly, authors would need to demonstrate that Rho signaling and thus Ect2 activity, is increased at the poles, when phosphodeficient Ect2 is the only Ect2 in the embryo. *

      The fact that a phosphomimetic residue at this site leads to reduced membrane localization is highly relevant, as we suggest that phosphorylation of this site contributes to the mechanism by which AIR-1 generates asymmetric ECT-2. Given the role of AIR-1 in regulating polarity, a version of ECT-2 that can not be phosphorylated would be predicted to be dominant lethal, necessitating a conditional expression strategy which does not currently exist in the early C. elegans embryo system (indeed we were unable to recover a T-> A allele at this site, despite extensive efforts). To avoid this issue, we used a viable, fertile, hypomorphic allele that is predicted to be less responsive to AIR-1 activity. The goal of this experiment was to evaluate whether the putative AIR-1 sites affect not only the NOP-1 pathway for furrow ingression, but also impact furrowing that is centralspindlin-dependent.

      To complement this finding have performed experiments in which ECT-2 was partially depleted We used RNAi to partially deplete ECT-2 and ECT-2 T634E and measured the total embryo fluorescence of each ECT-2 variant and the kinetics of furrow ingression. Partial depletion of wt ECT-2, to ~ 20% of control levels leads to delay in furrow formation and all but 2/18 (11%) of embryos complete cell division. In contrast, a similar depletion of ECT-2T634E depletion results in a failure of furrow ingression in ~52 % of embryos. Furthermore, while ECT-2T634E embryos initiate furrowing with normal kinetics, they exhibit a slower rate of furrow ingression, in contrast, partial depletion of WT ECT-2 results in a delay in furrow initiation, but once initiated, the rate of furrow ingression is not significantly affected. These results demonstrate that ECT-2T634E behavior can not simply be explained by a modest reduction in membrane binding.

      R1e *The authors use the Aurora A inhibitor MLN8237: It was shown prior (De Groot et al., 2015) that this inhibitor is not highly specific for Aurora A, and that it also inhibits Aurora B. Thus experiments need to be repeated with MK5108 or MK8745. They should also be conducted during polarization. Why does Aurora A inhibition not abolish asymmetry? That would be expected? *

      The role of AIR-1 in symmetry breaking during polarization is previously published, including with chemical inhibitors (Reich 2019, Kapoor 2019, Zhao 2019, Klinkert 2019, PMID 31155349, 31636075, 30861375, 30801250). ECT-2 localization depends on both the spatial regulation of AIR-1 activity and the distribution of cortical factors that contribute to ECT-2 cortical association, as a result of cortical flows. During acute, chemical perturbation of AIR-1 it is likely that these factors, which were polarized prior to drug treatment, remain polarized, allowing the residual cortical ECT-2 to remain asymmetric. The reviewer is correct about the specificity of MLN8237 and we do not rely on it alone to demonstrate the role of AIR-1. Rather this experiment is a complement to our AIR-1 depletion studies, which are sufficient to establish specificity. We present this experiment merely to show that AIR-1 acutely regulates ECT-2 during cytokinesis in embryos that were entirely unperturbed during polarization.

      R1f *There is no statistical analysis of the results in the entire study. For all claims stating a change in Ant/Post Ect2 ratio or Ect2 membrane localization selected time points should be statistically compared: for example the main point of Fig.1 is that Ect2 becomes more asymmetric during anaphase. Thus a statistical analysis of the Ect2 ratio at anaphase onset (t=0s) and eg. t=90 s after anaphase onset should be performed; or Fig. 3A nop-1 mutant Ant/Post Ect2 ratio during polarization: again statistical analysis of control and nop-1 mutant embryos is needed at a particular time point. *

      All of the graphs were presented with the mean of ~10 embryos per condition and included the 95% confidence intervals. In the revised manuscript, we have included tests of statistical significance, at each time point. While non-overlapping confidence intervals generally suggest statistical significance, we include these analyses on the graphs as it can be difficult to assess statistical significance when the confidence intervals overlap.

      R1g *The aim of Fig. 2B is to demonstrate that Ect2 localization is independent of microtubules, however they still observe some microtubules with the Cherry-tubulin marker and those are even very close to the membrane and therefore could very well influence Ect2 on the membrane. Therefore I am not convinced that this experiment rules out that microtubules have no role in regulating Ect2 localization. *

      We do not exclude that microtubules play a contributing role in ECT-2 phosphoregulation, but rather we conclude that the primary cue is the centrosome. Indeed, microtubules can play an important role in controlling spindle positioning which affects the proximity of the centrosome to the cortex.

      The manuscript states, “Despite significant depletion of tubulin and near complete depolymerization of microtubules (Figure 2B, insets), we observed strong displacement of ECT-2 from a broad region of the posterior cortex during anaphase (Figure 2B).” Thus, despite dramatic reductions in microtubules, not only does ECT-2 become polarized, it becomes hyperpolarized. In contrast, were microtubules directly involved in ECT-2 displacement, one would expect a reduction in polarization as a result microtubule depolymerization. Conversely, though SPD-5 depleted embryos contain far more microtubules than embryos in which microtubule assembly is suppressed, ECT-2 is not polarized in SPD-5 depleted embryos. Thus in the manuscript, we conclude, “Collectively, these studies suggest that ECT-2 asymmetry during anaphase is centrosome-directed.” This conclusion is well supported by the results shown.

      R1h *Throughout the paper the authors should tone down their statement that Air1 breaks symmetry by phosphorylating Ect2, since phosphorylation of Ect2 by Air2 is not shown. *

      We agree with this comment and will make the necessary edits to the text. Indeed, this is the reason why we had included the final section in our original draft, “Limitations of this study” which makes this point explicitly.

      R1i *I understand that the establishment of Ect2 asymmetry is important for polarization. However, how does asymmetric Ect2 localization result in more active Ect2 at the cell equator, which is required for the formation of the active RhoA zone? Would we not expect an accumulation of Ect2 at the cell equator, or if that is not the case more active Ect2 at the equator versus the poles? *

      The pseudocleavage furrow forms as a result of the anterior enrichment of active RHO-1 and its downstream effectors. There is no evidence for a local accumulation of active RHO-1 specifically at the site of the pseudocleavage furrow. Rather, this furrow forms at the boundary between the portion of the embryo where RHO-1 is active and the posterior of the embryo where RHO-1 is far less active (Figure 1 Supplement 2). We suggest that aster-directed furrowing during cytokinesis likewise results from asymmetric accumulation of the same components, without them necessarily being specifically enriched solely at the furrow.

      While cytokinesis generally involves an equatorial contractile ring, furrow formation can be driven by an asymmetric - i.e. non-equatorial - accumulation of actomyosin. This behavior is exemplified during pseudocleavage during which the entire anterior cortex is enriched for actomyosin and the posterior is depleted of myosin (Figure 1 Supplement 2). Several published studies provide evidence that the asymmetric pattern of myosin accumulation contributes to cytokinesis (PMID 22918944, 17669650).

      Minor comments

      R1j *Can the authors explain why the quantification of Ant/Post Ect2 ratio in control embryos differs in different figures? For example: in Fig. 1D i) a slight increase of Ect2 asymmetry ratio is seen at around 80 s after anaphase onset. In comparison, in Fig. 2C (i) this increase is not obvious. Are those different genetic backgrounds? *

      In figure 1 D, time 0 begins at anaphase onset, whereas in 2C, time 0 is specified at the time of nuclear envelope breakdown (NEBD). The duration between NEBD and anaphase onset is ~130 sec and an increase in ECT-2 polarization is observed at 220 s post NEBD, ie 90 sec post anaphase onset comparable to that seen in Fig 1D.

      R1k *One key point of the paper is that myosin-dependent cortical flows amplify Ect2 asymmetry during polarization and cytokinesis. During polarization the data is convincing, however during cytokinesis Ect2 ratio is only slightly decreased after nmy-2 depletion, again is this decrease even significant? *

      Figure 3 supplement 1 shows a significant difference in ECT-2 asymmetry between control and myosin-depleted embryos.

      R1l *In the introduction: "Centralspindlin both induces relief of ECT-2 auto-inhibition and promotes Ect2 recruitment to the plasma membrane" it should be added 'Equatorial' membrane, since Ect2 membrane binding is, to my knowledge, not compromised in centralspindlin mutants or in Ect2 mutants that cannot bind centralspindlin. *

      Generally speaking, the reviewer is correct that cortical accumulation of ECT-2 globally is centralspindlin independent. However, as seen in e.g. ZYG-9 depleted embryos, ECT-2 is recruited to the posterior cortex in a centralspindlin-dependent manner. Thus centralspindlin can promote ECT-2 accumulation to the cortex and the site of that accumulation will be dictated by the position of the spindle midzone.

      R1m *Labels in the figures are often very small eg Fig. 1 ii-v) and difficult to read. In addition it is easier for the reader if the proteins shown in the fluorescent images is also labeled in the figure (eg Fig. 2B add NG-Ect2). *

      These useful suggestions have been incorporated.

      R1n *Material and methods it should be mentioned which IPTG concentration was used. *

      The IPTG concentration (1 mM) has been added to the revised text.

      R1o *The authors speculate that the Air1 phosphorylation sites in Ect2 PH domain prevent binding to phospholipid due the negative charge. At the same time, the authors propose that the PH domain binds to a more stable protein on the membrane, which is swept along with the cortical flows and they propose anillin could be that additional binding partner. I might miss something, but do the authors suggest Ect2 has two binding partners: anillin and the phospholipids? It would be necessary to explain this better. *

      *The authors should test if anillin represents the suggested myosin II dependent Ect2 anchor. For this they should check if Ect2 localization to the membrane is altered upon on anillin RNAi. *

      This summary of our model is largely correct, though we do not know the identity of the more stable cortical anchor(s). While we suspect the PH domain binds to a phospholipid, ECT-2 cortical localization also requires ~100 residues C-terminal to the PH domain. It is likely that this domain interacts with a cortical component.

      In preliminary experiments, ECT-2 accumulation is not strictly anillin-dependent. However, functional redundancy may obscure a contribution of anillin. Anillin was mentioned simply because of the evidence for a physical interaction between ECT-2 and anillin (Frenete PMID 22514687). In the revised manuscript we also include the possibility that ECT-2 accumulations involves one or more anterior PAR proteins. The identity of the cortical anchor(s) is an interesting question for future studies. We consider this question beyond the scope of the current manuscript.

      R1p *The title of fig. 3 does not fit the statement the authors want to make, since the key point is how Ect2 polarization is affected and not membrane localization in general. *

      Thank you for this suggestion. The title has been changed to “Cortical flows contribute to asymmetric cortical accumulation of ECT-2”

      R1q *In Fig 4A/C. After air1 depletion the authors observe a reduction in Ect2 asymmetry. Why are the centrosomes not marked in the figures? Because they cannot be detected? The authors would also need to show that the mitotic spindle and centrosomes are no altered by air1 RNAi in the zyg9 mutant. Otherwise the observed effect might be indirect. *

      Centrosomes are perturbed by depletion of AIR-1 (Hannak, PMID 11748251), but they are still detectable and their positions will be added to figure 4. As has been extensively demonstrated, AIR-1 depletion does lead to attenuated spindles and defects in spindle assembly, some of which are also seen TPXL-1 depleted embryos. These consequences of AIR-1 depletion does complicate the analysis, but this is typical of factors that regulate many processes. This is one of the key reasons why we used ZYG-9 depletion in combination with AIR-1 depletion to overcome these indirect effects.

      R1r *The authors state that tpxl-1 depletion attenuates Ect2 asymmetry, this is not seen in the quantification ((Fig. 4B(i)). The main phenotype they observe is that Ect2 levels on the membrane increase (Fig. 4 (ii) and (iii). They go on testing the function of tpxl1 by depleting tpxl1 in the zyg9 mutant, where the centrosomes are close to the posterior cortex. Here they see no effect on Ect2 asymmetry. Based on that they conclude that tpxl1 has no role in this process. To me this finding is not surprising since the centrosome is close the cortex in zyg9 mutant embryos. Therefore sufficient amounts of active Air1 could reach the membrane and displace Ect2. Thus an amplification of the inhibitory signal by tpxl1 on astral microtubules might not be required. The authors need to mention this possibility and tone down their statment (also in the discussion) that tpxl1 is not required for this process. *

      In the text, we state, “Cortical ECT-2 accumulation is enhanced by TPXL-1 depletion, though the degree of ECT-2 asymmetry is unaffected (Figure 4B).… we observed robust depletion of ECT-2 at the posterior pole in zyg-9 embryos depleted of TPXL-1, but not AIR-1 (Figure 4C). We conclude that while AIR-1 is a major regulator of the asymmetric accumulation of ECT-2, the TPXL-1/AIR-1 complex does not play a central role in this process.” We consider this to be an accurate description of the results. In sum, we have found no evidence that TPXL-1 contributes to generating ECT-2 asymmetry, beyond its well established role in regulating spindle length and position. The are several other processes that are known to be AIR-1 dependent and TPXL-1 independent; these primarily involve the centrosome (Ozlu, PMID 16054030). Given that TPXL-1 associates with astral microtubules, the fact that microtubule depletion can enhance ECT-2 asymmetry also argues against a requirement for TPXL-1.

      R1s *It was shown that the C-terminus of Ect2 is sufficient and the PH domain is required for Ect2 membrane localization in C. elegans (Chan and Nance, 2013; Gomez-Cavazos et al., 2020). Papers should be cited. *

      Thank you for this helpful comment. Chan and Nance 2013 indeed shows that the ECT-2 C-term is sufficient to localize to the cell cortex. In contrast, the Gomez-Cavasos paper (PMID 32619481) shows in figure S2 that the PH domain is required for cortical localization of ECT-2; this paper does not focus extensively on cortical accumulation of ECT-2. We have cited Chan and Nance in the revised manuscript.

      R1t *The authors find that nmy-2 depletion results in loss of asymmetry for the Ect2 C-term and Ect2 3A fragment during polarization. Why is the same experiment not shown for cytokinesis? *

      Strong depletion of NMY-2 prevents polarity establishment, resulting in symmetric spindles, which in turn results in symmetric ECT-2 accumulation. Thus, the requested experiment would not provide significant additional information.

      R1u *Air1 is targeted to GFP-C-term Ect2 fragment via GFP-binding to determine the influence on GFP-C-term Ect2 localization (Fig. 5F). They state that they see a reduction of Ect2 C-term but not of C-term 3A after targeting. The reader has to compare Fig. 5D with F. Since the differences are not big, they need to compare the Ect2 C-term and Ect2 C-term 3A with and without Air1 targeting in the same graph (plus statistics). Otherwise this statement is not convincing. *

      It is not straightforward to directly compare ECT-2C in the presence and absence of GBP-mCherry-AIR-1, because the GBP:AIR-1 fusion protein recruits a large fraction of ECT-2C to the centrosome. For this reason we think it is best to compare the behavior over time of ECT-2C and ECT-2C3A in the presence of GBP-mCherry-AIR-1. At the onset of anaphase, these two fragments localize similarly, but they then diverge over time.

      R1v *In Fig. 6A the authors determine the contribution of air1 to furrowing. For this they deplete air1 in the nop1 mutant. According to previous studies, air1 mutants have a monopolar spindle. How can the authors analyze the function of air1 in cytokinesis when the spindle is monopolar? Did the authors do partial air1 depletion? They authors need to show that there is not major effect on the spindle and centrosome for their conditions. For comparison air1(RNAi) alone has to be included, otherwise the experiment is not conclusive. *

      AIR-1 depletion does not result in a monopolar spindle in C. elegans embryos, though the spindle is attenuated and disorganized (PMID 9778499). TPXL-1 depletion also results in short, well organized spindles (PMID 19889842). The concerns are the reason we performed the ZYG-9 depletion experiments in Figure 4C to ensure the centrosomes are proximal to the cortex.

      R1w *Upon air1(RNAi) in the nop1 mutant NMY2 intensity seems decreased and not increased. Can the authors comment on that, since that is opposite of what is expected. *

      This is expected as previous studies have shown that NOP-1 contributes to RHO-1 activation during polarization and cytokinesis (Tse, PMID 22918944). (NOP stands for No Pseudocleavage).

      R1x *In Fig 6B they introduce a phosphomimetic point mutation in S634 [sic, T634] in the endogenous Ect2 locus. It not clear why the authors chose this site out of the six putative sites and why they only chose one and not 3 or 6 sites? This needs some explanation. *

      In our early work with ECT-2 transgenes, we found that a T634E mutation strongly affected cortical ECT-2C, so we decided to assess its affect on the function and localization of endogenous ECT-2. While we were able to recover a T634E variant, we were not able to recover a T634A variant, despite considerable effort. Based on these experiences, we anticipated that we would be unable to recover a mutant version of ECT-2 in which all sites were changed to phosphomimetic.

      R1y *In the model (fig. 7) no astral microtubules are shown during pronuclear meeting and metaphase. Astral microtubules are present at this stage and should be added to the schematic. *

      MTs will be added to the figure.

      Reviewer #1 (Significance (Required)):

      R1z *The centrosomes inhibit cortical contractility during polarization and cytokinesis in the one-cell C. elegans embryo. Centrosome localized Air1 was proposed to be part of this inhibitory signal, however the phosphorylation target of Air1 is not known. The identification of Ect2 as a phosphorylation target of Air1 would be a great advancement in the field. However, the presented manuscript lacks convincing data that Ect2 is the phosphorylation target of Air1 during polarization and cytokinesis. *

      We explicitly acknowledge that we have not directly shown that AIR-1 phosphorylates ECT-2. However, we have shown that (i) AIR-1 inhibits cortical ECT-2 localization, (ii) the negative regulator of AIR-1, SAPS-1, promotes AIR-1 cortical accumulation, (iii) that the cortical localization domain of ECT-2 has putative AIR-1 sites, which, when mutated to non-phosphorylatable residues leads to increased cortical accumulation of ECT-2 (and (iv) phosphomimetic residues reduce its cortical accumulation), and (v) that these AIR-1 sites are required to render GFP-ECT-2C responsive to GBP-AIR-1. For these reasons we feel that our data makes a strong, albeit indirect, case that AIR-1 regulates ECT-2, even though we clearly acknowledge that we do not directly show that AIR-1 directly phosphorylates ECT-2.

      Direct proof would require the demonstration that AIR-1 phosphorylates ECT-2 in vivo. This would be difficult to show as ECT-2 phosphorylation is likely transient, it likely affects only a subset of the total ECT-2 pool, and it likely results in loss of membrane association of ECT-2. As it it not possible to synchronize C. elegans embryos, biochemical analysis would be very difficult. Even a phosphospecific antibody for the putative ECT-2 phosphosites might not be particularly informative, as it would be predicted to give a diffuse cytoplasmic signal.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      R2a* In this work, Longhini and Glotzer investigate the localization of an essential regulator of polarity and cytokinesis, RhoGEF ECT-2, in the one-cell C. elegans embryo. The authors show that centrosome localized Aurora A kinase (AIR-1 in C. elegans) and myosin-dependent cortical flows are critical in asymmetric ECT-2 accumulation at the membrane. Since membrane interaction of ECT-2 is dependent on the Pleckstrin homology domain present at the C-terminus of ECT-2, they further analyzed the importance of putative AIR-1 consensus sites present in this domain. The authors linked the relevance of these sites in controlling ECT-2 localization and its significance on cytokinesis. The manuscript is well written, the work is interesting, and the data quality is high. *

      We thank the reviewer for their critique.

      Major comments:

      R2b - In Fig. 2, the authors claim that the centrosomes and the position of the mitotic spindle are critical in regulating the asymmetric enrichment of ECT-2 at the membrane. To test the relevance of spindle positioning on ECT-2 localization, the authors depleted PAR-3 and PAR-2. The authors observed that the ECT-2 asymmetry is affected in these settings. However, PAR-3 or PAR-2 depletion impacts polarity, which is critical for many cellular processes, including spindle positioning. Can the authors try to specifically misposition the spindle without affecting polarity? For instance, by depleting Galpha/GPR-1/2 and assessing the impact of such depletion on ECT-2 localization.

      Thank reviewer for good suggestion. We have performed the suggested experiment (presented in Figure 2, supplement 2). As one might predict, ECT-2 starts out polarized as Gα is not required for polarity establishment. During anaphase, ECT-2 becomes more symmetric in Gα depleted embryos as compared to wild-type.

      R2c *-I wonder why the intensity of ECT-2 at the anterior and posterior membrane decreases in air-1(RNAi) post anaphase onset (Fig. 4A)? Moreover, I fail to observe a significant asymmetric distribution of ECT-2 in embryos depleted for PERM-1. Therefore it appears that the difference between DMSO and MLN8237-treated embryos is not substantial (at least in the images)? *

      We do not have a complete or rigorous explanation for all the changes in cortical ECT-2, but they are highly reproducible. We speculate that there are cell cycle regulated changes in ECT-2 accumulation, in addition to its regulation by AIR-1. For example, in figure 1, a strong reduction in both anterior and posterior cortical ECT-2 is evident beginning at approximately -350 sec, which may reflect the initial stages of Cdk1 activation. This may result from cell cycle regulated modulation of ECT-2, as there is evidence that mammalian ECT-2 is subject to a very potent inhibition membrane association by Cdk1 (PMID 22172673). Alternatively, there could be cell cycle modulation of the cortical factor that serves as the “co-anchor” of ECT-2. The ability of GBP-AIR-1 to induce GFP-ECT-2C dissociation also appears cell cycle regulated.

      Consistent with a cell cycle regulated component, note that NEBD is delayed in AIR-1 depleted embryos (PMID 17669650, 17419991, 30861375). This delay results in a shorter interval between NEBD and e.g. the peak in Cdk1 activation, explaining the earlier decrease in AIR-1(RNAi) embryos vs. control, relative to NEBD.

      Our quantitative analysis indicates a significant increase of cortical ECT-2 upon treatment with MLN8237. In addition, the quantitation in the previous version did show a significant polarization of ECT-2 in PERM-1-depleted embryos prior to treatment. We have revised this figure to simply show an acute increase in cortical ECT-2 upon drug treatment, as the focus of this experiment was solely to show that ECT-2 cortical accumulation is acutely responsive to chemical inhibition during cytokinesis in otherwise normal embryos.

      *-The data in Fig. 5 and 6 are exciting but raise a few concerns: *

      R2d *a). The authors show that ECT-2C localization mimics the localization of endogenous tagged ECT-2. However, all these analyses with ECT-2C and various mutants are performed in the presence of endogenous ECT-2. Can the author check the localization of these mutant strains in conditions where the endogenous proteins are depleted? I understand that the cortical flow would be perturbed in conditions where endogenous ECT-2 is depleted. However, I suspect that one can analyze the anaphase-specific distribution. *

      We have examined ECT-2C localization in embryos depleted of ECT-2. Cortical localization of ECT-2C is not dependent upon endogenous ECT-2. This result is now shown in figure 5 supplement 1. However, as the reviewer suggested, embryos depleted of ECT-2 do not show a high degree of ECT-2C asymmetry as ECT-2 is required for the cortical flows that amplify the symmetry breaking during polarization. During cytokinesis, ECT-2C does show a modest change in localization at the poles; the extent of the polar reduction is limited and the changes are symmetric as ECT-2 displacement causes spindles to be symmetrically positioned and limits their elongation during anaphase.

      R2e *b). Can the author comment on why ECT-2C does not accumulate at a similar level as ECT-2C(3A or 6A) at the cell membrane when AIR-1 is depleted (compare Fig. 5D with Supplemental Fig. 5)? *

      When ECT-2C(3A or 6A) are expressed in otherwise wild-type embryos, embryo polarization occurs, resulting in anterior-directed flows that concentrate the factor(s) that enables the anterior enrichment of ECT-2 (and ECT-2C 3A/6A). By contrast, when AIR-1 is depleted, most embryos exhibit a “bipolar” phenotype in which PAR-2 is recruited to both anterior and posterior poles, and the actomyosin network becomes somewhat concentrated laterally (PMID 30801250, 30861375, 31636075). The differential positioning of the actomyosin network in AIR-1 depleted embryos is likely responsible for the interesting difference that the reviewer points out. This section of the results states. “Nevertheless, these variants accumulated in an asymmetric manner. ECT-2C asymmetry temporally correlated with anteriorly-directed cortical flows (Figure 5 D,E), raising the possibility that asymmetric accumulation of endogenous ECT-2 drives flows that cause asymmetry of the transgene, irrespective of its phosphorylation status.”

      R2f *c). Does the cortical localization of the ECT-2C(6A) mutant become symmetric upon further depletion of AIR-1? Of course, if the asymmetric distribution of ECT-2C(6A) is dependent on the presence of endogenous protein in the cellular milieu, the point raised earlier will help address this concern. *

      We have not performed this exact experiment with ECT-2C-3A though we have performed it with a longer ECT-2 C-terminal fragment (aa 559-924). As expected, due to the considerations described above, the asymmetry of ECT-2C-3A is reduced when AIR-1 is depleted. Likewise, ECT-2C-6A is becomes symmetric when endogenous ECT-2 is depleted due to the dependence of its asymmetry on cortical flows, as discussed above.

      In the revised manuscript, we provide additional explanation of the AIR-1 depletion phenotype which will explain the origin of the asymmetric distribution of ECT-2.

      R2g *d). The authors predict that the AIR-1 mediated phosphorylation delocalizes ECT-2 from the polar region of the cell cortex. Since the posterior spindle pole is much closer to the posterior cortical region, the delocalization is much more robust at the posterior cell membrane. I wonder why targetting AIR-1 at the membrane (GBP-mCherry-AIR-1) does not entirely abolish GFP-ECT-2C membrane localization? Can the author include the localization of GBP-mCherry-AIR-1 in the data? Also, do we know for sure if GBP-mCherry-AIR-1 is kinase active? *

      The GBP-mCherry-AIR-1 transgene was obtained from the Gönczy lab which demonstrated that it has some activity (PMID 30801250). Given that centrosomal AIR-1 (as compared to astral AIR-1) is the primary pool of AIR-1 responsible for modulating cortical ECT-2 levels, it is a not clear that the GBP-fused form of AIR-1 is as active as the centrosomal pool of AIR-1; indeed we suspect it is significantly less active, similar to the manner in which TPXL-1/AIR-1 appears less active towards ECT-2 than centrosomal AIR-1. Indeed as the reviewer suggests, were this pool of AIR-1 highly active, we would expect that its cortical recruitment would preclude embryo polarization, and this transgene would cause lethality when expressed with a GFP-tagged cortical protein. These concerns notwithstanding, we do observe a specific reduction in the anterior accumulation of ECT-2C as compared to ECT-2C3A, suggesting that this form of the kinase has some ability to modulate ECT-2C.

      Co-expression of GFP-ECT-2C with GBP-mCherry-AIR-1 induces the centrosomal/astral accumulation of GFP-ECT-2C, which is highly visible in the figure and not seen in the absence of GBP-mCherry-AIR-1. Not surprisingly, the co-expression also induces a cortical pool of GBP-mCherry-AIR-1 that is not seen in the absence of GFP-ECT-2C. These redistributions indicate formation of the complex between GFP-ECT-2C and GBP-mCherry-AIR-1. The mCherry-AIR-1 images could be added as insets to the figure, but in our opinion, they would not make a substantive contribution, given the dramatic accumulation of centrosomal GFP-ECT-2C.

      R2h *e). The authors show that centrosomal enriched AIR-1 [spd-5(RNAi)], but not the astral microtubules localized AIR-1 [tpxl-1(RNAi)], is vital for ECT-2 membrane localization. Interestingly, the authors showed that AIR-1 acts in the centralspindlin-directed furrowing pathway (Fig. 6A). I wonder if the authors can combine NOP-1 depletion with TPXL-1 depletion? I guess this will further help to exclude the function of TPXL-1 in the centralspindlin-directed furrowing pathway. *

      We would like to clarify that our data indicates that AIR-1 acts on both the centralspindlin-independent furrowing (e.g. the anterior furrow in 4C), as well as centralspindlin-dependent furrowing (Figure 6).

      While the experiment the reviewer proposes appears simple in theory, the interpretation is potentially a bit more complex, due to the role of TPXL-1 in spindle elongation, which can affect centralspindlin-directed furrowing. That said, there are two published experiments and one experiment in the manuscript that indicate that centralspindlin dependent furrowing can occur in TPXL-1 depleted embryos. First, Lewellyn et. al. showed that while tpxl-1(RNAi) embryos furrow, tpxl-1(RNAi); zen-4(RNAi) embryos do not, suggesting centralspindlin can function in the absence of TPXL-1. Second, the same paper shows that embryos doubly depleted of TPXL-1 and GPR-1/2 exhibit multiple furrows. Our previous work has shown that furrowing in Galpha-depleted embryos is centralspindlin dependent (Dechant and Glotzer). Furthermore, in the current manuscript we found that embryos depleted of both TPXL-1 and ZYG-9 form posterior furrows (8/8 embryos, 6/8 furrows were strong furrows) although the appearance of these furrows is delayed, presumably due to the reduction in spindle elongation due to TPXL-1-depletion. As described in the manuscript, these posterior furrows have been previously shown to be centralspindlin dependent and NOP-1 independent.

      In accordance with these results, and in direct response to the reviewer’s specific suggestion, we do observe furrowing in nop-1(it142); TPXL-1(RNAi) embryos (10/10 embryos furrow, 9/10 complete cytokinesis) . Thus, all of the available results indicate that TPXL-1 is largely dispensable for centralspindlin dependent furrowing. However, the role of TPXL-1 in centralspindlin-dependent furrowing is not a focus of the manuscript, thus we do not favor including this result, as it distracts from the primary focus of the study.

      R2i *f). Why do NMY-2-GFP cortical levels appear lower in 30% of the embryos that show various degrees of cytokinesis defects (Fig. 6A)? *

      There are a number of possible origins of the variability. As shown in (Reich 2019, Kapoor 2019, Zhao 2019, Klinkert 2019, PMID 31155349, 31636075, 30861375, 30801250), AIR-1 depletion results in variable polarization (unpolarized PAR-2, bipolarized PAR-2, anterior PAR-2, posterior PAR-2). Furthermore, spindles in AIR-1 depleted embryos exhibit somewhat variable positioning. While we were unable to correlate these sources of variability with furrow formation, these results demonstrate that AIR-1 depletion impairs furrowing directed by centralspindlin, which was not entirely expected, given that (i) AIR-1 depletion potently suppresses NOP-1 dependent flows of cortical myosin, as evidenced by the loss of an anterior furrow in AIR-1(RNAi); nop-1(it142) embryos and (ii) centralspindlin directed furrowing can occur in the posterior in ZYG-9 depleted embryos both in the presence or absence of AIR-1 (Figure 4C).

      R2j *g). The authors report that phosphomimetic mutation at the phospho-acceptor residue in ECT-2 impacts its cortical accumulation. This strain, together with NOP-1 depletion, affects furrow ingression. One explanation for this phenotype is that phosphomimetic mutant weakly accumulates at the membrane. However, one interesting observation is that ECT-2T634E enriches at the central spindle (Fig. 6B, panel 120 sec), which somehow I could not find in the text. Could this additional localization of ECT2 at the central spindle contribute to the cytokinesis defects that the authors have observed? The microscopy images the authors have included show that ECT-2T634E significantly localizes at the equator at the time of furrow initiation. Can the authors add the localization of ECT2 wild-type and ECT-2T634E in NOP-1 depleted conditions where they see an apparent impact on the cytokinesis? Similarly, if the authors include the localization of NMY-2 in these conditions-it will further add more weightage to the data. *

      We regularly detect trace amounts of ECT-2 on the central spindle and this is slightly enhanced at in the ECT-2T634E mutant. However, given the large cytoplasmic pool of ECT-2, it seems unlikely that the slight enrichment of ECT-2 on the central spindle significantly affects the cortical pool of ECT-2, though the reduction in cortical ECT-2 may facilitate its enrichment on the central spindle.

      As shown in figure 3B, depletion of NOP-1 does not dramatically affect cortical ECT-2 levels in wild-type embryos. Likewise, we did not observe a significant effect of NOP-1 depletion in ECT-2 T634E, thus we decided not to include this negative result.

      As discussed in general point 8, we suggest the modest reduction in the membrane pool of ECT-2 is unlikely to be the primary cause of the T634E, but rather the ability of AIR-1 to modulate induce its relocalization. Consistent with this interpretation, the embryos that failed ingression tended to have more symmetric spindles, which could limit the residual cortical flows that facilitate furrow ingression.

      Minor comments:

      R2k -An explanation of how the timing of NEBD was analyzed in multiple settings would be helpful.

      Depending on the experiment, we used either ECT-2:mNG fluorescence (it is excluded from the nucleus until NEBD) and/or the Nomarski images to score NEBD.

      R2l ____-*The authors mentioned on p. 6-'Despite significant depletion of tubulin.....during anaphase'. These experiments are performed in the near complete depolymerization of microtubules; thus, regular anaphase will not establish. I understand that the authors are monitoring localization wrt the timing similar to anaphase in the non-perturbed condition, and thus a bit of change in the sentence is required. *

      Thank you for highlighting this point. We have substituted “following mitotic exit” for “anaphase”. In these images, mitotic exit can be scored by the emergence of contractility.

      R2m*-After testing the relevance of SPD-5 (that primarily acts on PCM and not on centrioles)-the authors write on p. 6 that 'two classes of explanation...early embryo'. I did not understand the importance of this sentence here. *

      To clarify, we deleted the words “classes of” from the sentence in question and following that sentence we added the word, “first” indicating that we were explaining the first of the two possible explanations

      R2n*-The observed impact of spd-5 (RNAi) on ECT-2 localization could be because of the effects of SPD-5 depletion on centrosomal AIR-1? The authors can link the impact of SPD-5 depletion not only with the centrosome but also with AIR-1 in the discussion. *

      Indeed, it is well established that SPD-5 is required for centrosomal AIR-1 (Hamill DR, et. Al Dev Cell (2002). The revised discussion now states, “Specifically, during both processes, ECT-2 displacement requires the core centrosomal component SPD-5, which is required to recruit AIR-1 to centrosomes{Hamill et al., 2002, #1201}, but ECT-2 displacement is not inhibited by depolymerization of microtubules and it does not require the AIR-1 activator TPXL-1 (see below).”

      R2o-In the various Figure legends, sometimes the authors mention time '0' as anaphase, and other time as anaphase onset.

      In all cases, anaphase onset was intended and the legends will be corrected.

      Reviewer #2 (Significance (Required)):

      R2p *The manuscript is well written, the work is interesting, and the data quality is of good quality. *

      We thank the reviewer for their encouragement as well as for their thoughtful critique!

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      R3a* Symmetry breaking is the process by which uniformity of the system is broken. Many biological systems, such as the body axes establishment and cell divisions in embryos, undergo symmetry breaking to pattern cellular interior design. C. elegans zygote has been a classic model system to study the molecular mechanism of symmetry breaking. Previous studies demonstrated critical roles of centrosomes and microtubules in breaking symmetry in the actin cytoskeleton during anterior-posterior polarization and cytokinesis. It, however, remains elusive how centrosomes and/or microtubules regulate the assembly and contractility of the actin cytoskeleton. Recent reports identified Aurora-A AIR-1 as the key centrosomal kinase that suppresses the function of the actin cytoskeleton, but little is known about a substrate of the kinase during symmetry breaking events. *

      Longhini and Glotzer proposed in this manuscript that RhoGEF ECT-2 plays a critical role in symmetry breaking of the actin cytoskeleton under the control of AIR-1 kinase. Kapoor and Kotak (2019) previously proposed the same GEF as a downstream effector of centrosomes, but this work did not provide direct evidence for ECT-2 as the AIR-1 effector. This manuscript identified three putative phospho-acceptor sites in the PH domain of ECT-2 that render ECT-2 responsive to inhibition by AIR-1. Although this manuscript lacks direct in vivo and in vitro evidence for phosphorylation of ECT-2 by AIR-1 kinase, the above findings reasonably support a model where in AIR-1 promotes the local inhibition of ECT-2 on the cortex. Design of the experiments, the quality of images, and data analysis are reasonable, and the main text was written very well. The main conclusion of this work will attract many readers in cell and developmental biology fields. I basically support its publication in the journals supported by Review Commons with minor revisions (see below).

      We thank the reviewer for their encouraging remarks and helpful comments.

      Minor comments

      R3b 1) In Figures 2A and 2B, the authors claimed apparent correlation between spindle rocking and ECT-2 displacement. However, because both MTs and ECT-2 in Fig2AB images are blur, I cannot convince myself whether ECT-2 intensities on the cortex showed negative correlation with the distance between the posterior centrosome and the cortex. The authors may want to provide quantitative data set and use a statistical test to support this conclusion.

      Only figure 2A focuses on the rocking. The important structure to assess is the position of the centrosome, as the astral arrays of microtubules are largely radially symmetric (except towards the spindle midzone). As this point in the manuscript were were not discriminating between the astral microtubules and the centrosomes, rather focusing on the overall position of the aster as a whole. Figures 2B, 2D, Fig 2 Supplements 1 and 2, Fig 3C, and Fig 4B, summarized in figure 7A provide quantitive evidence that the centrosome-cortex distance is an important determinant of ECT-2 cortical accumulation.

      R3c *2) Figure 2D would [sic; presumably should] show a ratio between the anterior/posterior pole and the lateral cortex. *

      The reviewer is presumably noticing that the lateral cortex is brighter than the poles when PAR-3 is depleted. While we agree with this assessment, the point of this experiment was to evaluate whether both centrosomes are equally capable of regulating cortical ECT-2 at the respective poles. It appears to us that comparing the anterior and posterior poles is the appropriate measurement to make to address this point and comparison of the poles to the lateral cortex in par-3(RNAi) vs control would be confusing to readers.

      R3d *3) In Figure 3D, the authors need to clarify why they measured ECT-2 dynamics only within the "anterior pole". It would be reasonable to measure ECT-2 dynamics by FRAP and cortical high-speed live imaging on the posterior and the lateral cortex during symmetry breaking. *

      We measured ECT-2 recovery at a variety of sites with similar recovery kinetics. The comparison of ECT-2 dynamics on anterior and posterior furrows were shown in order to compare ECT-2 dynamics on centralspindlin-dependent and -independent furrows.

      We now provide additional supplemental data on ECT-2 dynamics during symmetry breaking. When ECT-2 is polarized, the residual signal is too low to obtain a measure of its recovery.

      R3e 4) In Figure 4 supplement, a difference between with or without ML8237 seems marginal. The authors need to show a statistical test to claim "rapid enhancement of cortical ECT-2 after ML8237 treatment".

      We will provide a statistical analysis. As the inhibitor affects ECT-2 globally, the anterior/posterior ratio doesn’t change significantly. To avoid confusion, we now present total cortical ECT-2 levels upon anaphase onset in this experiment as this is the most relevant parameter.

      R3f *5) I would strongly suggest the authors to clearly state in the first paragraph of discussion that "this working hypothesis is not supported by direct evidence for phosphorylation of ECT-2 by AIR-1 kinase in vitro and in vivo." It should be reasonable to weaken the statement "by Aurora A-dependent phosphorylation of the ECT-2 PH domain" in p13. *

      We agree with the underlying sentiment (as indicated by the “limitations” section that was present in the original version) and we have revised these sentences accordingly: “Our studies suggest that asymmetric, posteriorly-shifted, spindle triggers an initial focal displacement of ECT-2 from the posterior cortex by Aurora A-dependent phosphorylation of the ECT-2 PH domain, though the evidence for this phosphorylation event is indirect.”

      Reviewer #3 (Significance (Required)):

      *See the second paragraph of the Evidence, Reproducibility, and Clarity section. *

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This paper presents an investigation of the mechanisms of how chitin is synthesized in Drosophila by investigating the chitin synthetase Kkv and two proteins related/redundant proteins that are required for chitin production Exp and Reb.

      The authors show that synthesis of nascent chitin polymers is separable from the secretion of chitin and that Ex/Reb is specifically required for chitin translocation/secretion. To understand the functions of Exp/Reb, the authors perform structure/function analyses and examine the localization of the proteins. They find that Na-MH2 domain in Exp/Reb is required for chitin translocation, and that a motif the authors name CM2 is required for Exp localization. For Kkv, they show the WGTRE domain is required for ER exit and that a coiled-coiled domain is required for KKV localization and full Kkv activity. By using live imaging and mutations that disrupt membrane trafficking, the authors show that Kkv, which is a transmembrane protein, cycles to the membrane, and like most membrane proteins, is endocytosed and transits through the endocytic system and is returned to the apical surface. Interestingly, despite being dynamically moved around the cell, chitin synthesis produces highly organized extracellular matrixes. Considering that constitutive production of chitin by Kkv everywhere in the cell would create a mess, these results underscore that regulated organized secretion/translocation of chitin is central to generating patterned extracellular matrixes (as the saying goes, "location, location, location"). Consistent with Exp/Reb being important regulators in extracellular matrix patterning, Exp/Reb not only are required for export of chitin, in the absence of Exp/Reb, the pattern of Kkv localization at the apical surface is altered. Unexpectedly however, by using super resolution microscopy the authors show that Kkv and Exp/Reb have complementary rather than matching localizations. Thus, while it is not clear exactly how Exp/Reb are regulating Kkv, they are doing something very interesting.<br /> Overall, this paper will be of broad interest to the cell biology and developmental biology communities, and to the translational community working to develop chitin as a commercial biopolymer. It is also generally clearly written, although I think there are some inaccuracies in the how some points are phrased. The experiments are well done, and subject to the revisions out lined below.

      Major concerns:

      • A major conclusion of the paper is that Exp/Reb are not required for chitin synthesis. On the most basic level this statement is well supported, because chitin grains are made in the cytoplasm in the Exp/Reb mutants. However, I think the field would be better served with a more nuanced consideration or the role of Reb/Exp. From the data presented, it seems that in the absence of Reb/Exp, the total amount of chitin produced is greatly reduced. I think it would be worth considering Exp/Reb, or the synthesis process in general, as having processivity or duty cycle or quality control such that in the absence of Exp/Reb while Kkv may make short chitin polymers, or occasional long polymers, the major production of chitin doesn't get going without Exp/Reb. Thinking of Reb/Exp as processivity factors in addition to export factors dramatically changes how one thinks of the proteins and the process of chitin synthesis. While these considerations can be handed with some discussion, it would be very interesting to look at the length of the chitin polymers in the Reb/Exp mutants and see if the average chain length is much reduced. This would help distinguish between Exp/Reb reving up the total number of Kkv molecules that produce chitin and Exp/Reb allowing the same number of Kkv molecules to stay active and produce much longer chitin chains. A caveat here is that I have no idea how hard this is to do, so I won't put this at the level of a required revision, but this result would significantly deepen the analysis in the paper.
      • In looking that the subcellular localization of the Kkv and Reb in regular and super resolution, the authors I think the authors missed an important, but straight forward way to gain insight into the apparent complementary distribution of Kkv and Exp/Reb. In stage 16 WT embryos, Kkv has a distinct ringed pattern that corresponds to the tanedial ridges (e.g. clearly visible in Fig. 6A and 6G). How those ridges are set up is unclear, although there are some interesting Turing-pattern models out there. One prediction might be that Exp/Reb should be in between the Kkv rings. If so, maybe Exp/Reb are key components of patterning chitin secretion to make this 3D patterned matrix? Alternatively, maybe Exp/Reb act on a smaller length scale and will match the Kkv ring pattern, just not overlapping with Kkv at the very fine scale. These are straightforward experiments and again could provide key insights into the function of Exp/Reb.
      • In general, most of the figures do not include WT or a control for comparison. This makes it hard for non-experts to assess what the effect of a mutation or condition is. For example, there are no examples of WT or Df(exp reb) in Figures 1-4. I realize this would increase the number of panels, but the paper would be more accessible if comparisons were within figures instead of comparing between main and supplementary figures and other papers.
      • To bolster the case the Exp/Reb directly regulate Kkv distribution, the authors should examine the distribution of Kkv in a catalytically null Kkv mutant, or drugs that block Kkv, or mutations in other genes required for Kkv activity to show that the altered distribution of Kkv in Exp/Reb mutants is a direct consequence of the lack of Exp/Reb rather than in indirect consequence of lack of extracellular chitin, which causes gross perturbations in the trachea. Also, are there differences in the distributions of Kkv in salivary glands with or without the presence of Exp/Reb? If Exp/Reb change the distribution of Kkv in the salivary glands, which normally do not express Kkv and presumably many other components of the chitin ECM system, this would be a powerful argument that there is a direct effect.

      Minor concerns.

      • Page 5 "These intracellular chitin punctae disappeared from stage 14, when chitin is then deposited extracellularly (Fig 1B')." Fig. 1B' is stage 15 embryos.
      • Page 5 "lead to tracheal morphogenetic defects". It would be helpful to the reader if the text or legend told the reader what they were looking for? Broken tubes? Inflated tubes? Variable tubes?
      • Fig. 1H. Main text says "co-expression of Kkv and expMH2/rebMH2 did not lead to tracheal morphogenetic defects (Fig 1H, ...". The tracheal dorsal trunk in Fig. 1H does not look WT. The legend does not state the stage, but the DT looks to have an enlarged diameter and it might be too long. Please present measurements on stage 16 trachea to confirm that there is no effect on tracheal morphology.
      • Fig. 3E there is a lot of GFP-Kkv that is not in co-localized with the KDEL marker. Can the authors clarify what compartment all the other staining is? ER?
      • Section 3.1. The authors imply that the WGTRE domain is specifically required for ER exit. However, an alternative is that absent the WGTRE domain, the protein just does not fold correctly, which would also preclude ER exit, but would be a different problem for the protein to make chitin if it isn't folded.
      • Page 15. I disagree with statement "At stage 16, control embryos showed a highly homogeneous apical distribution of Kkv in stripes, corresponding to the taenidial folds, and Kkv vesicles were largely absent (Fig 6G)." In Fig. 6G, the tandeal ring pattern is clearly visible, as are the fusion cells. If Kkv distribution were "highly homogeneous" these structures/pattern would not be visible.
      • Page 15. I also disagree with the characterization of the apical Kkv distribution in st 15 embryos. "In control embryos we detected a very uniform and homogenous pattern of apical Kkv (Fig 6I).". To my eye, the pattern is punctate and random for the clumps of stain, with the underlying beginnings of the tanidial pattern starting to be visible. The pattern appears neither uniform nor homogenous.
      • P16. The degree of order in the distribution of Kkv is overstated. The authors state that "The results of this analysis, showed that Kkv on the apical membrane, is evenly distributed following a regular pattern (Fig. 6L,L',L',M)." However, given that there is barely a visibly perceptible difference between the actual distribution of Kkv in 6L' and a calculated random distribution in 6L", and that the pattern is neither visibly even or regular, it would be more representative to say something to the effect that the analysis shows there is "underlying order" or "some degree of order" or a "non-random pattern". Visually, the key difference between 6L ' and L" is that there are fewer closely clustered Kkv dots. You could still have an uneven distribution of Kkv that maintains minimum spacing, which is a kind of ordered organization, but not one that would be assumed from the description. It would be helpful if the authors instead of just saying a "regular pattern" also stated the nature of the pattern they observe, i.e. Grid? Stripes? Minimum spacing?
      • Discussion. Another model for the role of Exp/Reb could be to bind and neutralize an inhibitor of Kkv activity. This would account complementary distribution of Kkv and Exp/Reb.
      • Fig. 6L. what tissue is being analyzed? Presumably trachea, but this should be specified as salivary glands are also mentioned in the legend.
      • Fig. 7 C models. I believe that the super resolution data is not accurately accounted for in the models. In both model 1 and model 2, Kkv and Exp/Reb are shown to be in close proximity, but the super resolution data suggests that most Kkv and Exp/Reb are separated hundreds of nanometers. Further, showing Kkv and Exp/Reb as touching was not supported by the coIP experiments, which failed to detect an interaction. It is possible that only a small fraction of Exp/Reb that is in close proximity to Kkv is active, but if so, this should be explicitly mentioned in the models to reconcile the data showing that Kkv and Exp/Reb are mostly not anywhere near each other.
      • -Image analysis. Please detail the criteria for "apical" and "basal" regions were the basis for freehand segmentation. What was counted as apical and what was basal?
      • Abstract and Introduction: The authors state that "We find that Kkv activity in chitin translocation, but not in polymerization, requires the activity of Exp/Reb, and in particular of its conserved Na-MH2 domain.", but then follow that with the statement that "Furthermore, we find that Kkv and Exp/Reb display a largely complementary pattern at the apical domain, and that Exp/Reb activity regulates the topological distribution of Kkv at the apical membrane." Many readers, will find the use of "furthermore" confusing because they will take furthermore as the about to be described data logically following the previous data, but then run headlong into the fact the Kkv and Exp/Reb show a complementary distribution, which does not obviously follow from Kkv activity requiring Exp/Reb. The authors could clarify this and highlight the interesting, unexpected and exciting nature of their results by replacing "Furthermore" with "Unexpectedly" or "Surprisingly", and emphasizing the important role of Exp/Reb in Kkv organization. Maybe something like: Unexpectedly, we find that although Kkv and Exp/Reb display largely complementary patterns at the apical domain, Exp/Reb activity nonetheless regulates the topological distribution of Kkv at the apical membrane.

      Significance

      The topic is interesting from the aspect of cell biology in terms of how a long polymer is created intracellularly, secreted and spatially organized to create a sophisticated extracellular matrix. The topic is also of general interest because chitin is central to the body plan of all insects, crustaceans and many other species, and chitin is of increasing interest as a biopolymer that could have extensive commercial uses.

      In addition to an informative structure/function analysis of the Kvv and Exp/Reb, the results identify what is, to my knowledge, the first regulator of the spatial organization of chitin sythase in insects and it unexpectedly shows a complementary pattern to the the synthase. This highlights just how little we understand about how complex extracellular matrixes are synthesized.

    1. The question the world’s scientists are tackling is to what extent human-caused global heating is to blame for a particular extreme weather event as opposed to natural variability in weather patterns.

      I appreciate the author bringing up this point to acknowledge that there may be other sources which lead to the extreme weather that occurs today. It is effective in avoiding biases and also raising the question to the readers of whether there may be more than one factor which goes into the extreme weather we see today (i'm not saying I think global warming is not the cause).

  2. docdrop.org docdrop.org
    1. t is obvious that the b_ackgrounds of students conrribute to the uneven-ness of opportunities for academic success

      As talked about in Duncan and Murname's article, there is a significant difference in academic success based on the children's socioeconomic status. Another factor, such as the structure of the school may have an impact on the student's academic success as well. In high school, I remember it was pretty diverse, but the students still separated among themselves into racial groups. I think it is inevitable for these groups to not be created based on specific traits since we are naturally attracted to others that have similar traits.

    1. Author Response

      Reviewer #1 (Public Review):

      This report describes evidence that the main driving force for stimulation of glycolysis in cultured DGC neurons by electrical activity comes from influx of Na+ including Na+ exchanging into the cell for Ca2+. The findings are presented very clearly and the authors' interpretations seem reasonable. This is important and impactful because it identifies the major energy demand in excited neurons that stimulates glycolysis to supply more ATP.

      Strengths are the highly rigorous use of fluorescent probes to directly monitor the concentrations of NADH/NAD+, Ca2+ and Na+. The strategies directly test the roles of Na+ and Ca2+.

      A weakness is an ambiguity about the effects of ouabain to inhibit the Na+/K+ ATPase directly and the absence of biochemical controls to validate the interpretation of the ouabain experiment.

      We appreciate the reviewer's comments about the work. While we can not rule out non-specific effects of ouabain at the concentrations needed to block Na+/K+ ATPase in these experiments, we do think that we can rely on the prior biochemical work characterizing the multiple components of ouabain binding in fresh mouse brain tissue, which is a close match to the acute mouse brain slice tissue used here.

      Reviewer #2 (Public Review):

      This study seeks to determine how neuronal glycolysis is coupled to electrical activity. Previous studies had found that glycolytic enzymes cluster within nerve terminals (in C. elegans) during activity. Furthermore, the glucose transporter GLUT4 is recruited to synaptic surface during activity. The authors previously showed that Ca2+ does not stimulate glycolysis in active neurons. Here, the authors show that the cytosolic Na+, not Ca2+, and the activity of the Na+/K+ pump drive glycolysis. However, it is important to note that in this study, glycolysis was examined in the soma, not nerve terminals, where some of the previous studies were conducted. A few other caveats in the interpretation of the findings are listed below:

      1) The NADH/NAD+ ratio is used throughout as the only measurement reflecting glycolytic flux.

      In this and previous work, we have validated that increased cytosolic NADH production (whose major sources are related to glycolysis), rather than altered NADH reoxidation, produces the changes in NADH/NAD+ ratio.

      2) It has been hypothesized that the close association of glycolytic enzymes with ion transporters (such as the Na+/K+ pump) is meant to provide localized ATP to power these pumps. How does bulk glycolysis (monitored with NADH/NAD+ ratio) relate to localized/compartmentalized glycolysis?

      Even if glycolysis is indeed localized to the plasma membrane (an interesting and difficult-to-address hypothesis), we believe that because the mitochondrial shuttles are the main pathway for NADH re-oxidation, and most mitochondria are not localized to the plasma membrane, changes in glycolytic NADH production are likely to be reflected in changes of the bulk cytosolic NADH/NAD+.

      3) Related to point 2, most of the Peredox measurements in the paper have been made at baseline, in the absence of electrical activity. Therefore, it is not clear how the findings relate to activity-driven glycolysis.

      The ion exchange experiments and even the faster Ca2+ puff experiments can mimic but indeed cannot match the speed of activity-driven changes in ion concentrations. Unfortunately, it is impossible to induce normal electrical activity in neurons in the absence of extracellular Na+. We believe that the complete inability of Ca2+ elevation alone (without Na+-Ca2+ exchange) to stimulate glycolysis, combined with the substantial Ca2+ contribution to activity-driven glycolysis, makes a good argument that Ca2+ entering during activity is likely to stimulate glycolysis via Na+ entry and the Na+/K+ ATPase.

      4) The finding that inhibition of SERCA during stimulation actually elevates cytosolic NADH level argues against Na+ being the only ion that regulates glycolysis.

      The ability of SERCA inhibition to produce a small increase in activity-driven glycolysis is consistent with the simple argument that reduced SERCA-driven uptake of Ca2+ into ER results in additional Ca+ removal via Na+/Ca2+ exchange (which can then affect glycolysis via Na+ levels).

      5) The finding that "SBFI ΔF/F transients were longer in duration than the RCaMP LT transient" does not necessarily mean that Na+ elevation lasts longer than Ca2+ in the cell. This could be an artefact of the SBFI on/off rate relative to RCaMP. In fact, prolonged elevation of cytosolic Na+ would make neurons refractive to depolarization in AP trains.

      The rates of Na+ binding and unbinding to SBFI are likely to occur on the microsecond timescale (based on the known properties of crown ether molecules), much faster than the observed transient duration of approximately one minute. Prolonged elevation of cytosolic Na+ alone (to the levels seen here) should not cause neurons to be refractory to firing; refractoriness typically occurs in the setting of prolonged depolarization and consequent inactivation of NaV channels.

      Reviewer #3 (Public Review):

      Meyer et al have studied the mechanisms of glycolysis activation in the hippocampus during neuronal activity. The study is logically laid out, uses sophisticated fluorescence lifetime imaging technology and smart experimental designs. The support for intracellular [Na+] vs [Ca2+] rise driving glycolysis is strong. The evidence for the direct involvement of the Na+/K+ pump is based only on pharmacology using ouabain but the Na+/K+ pump is admittedly not an easy subject for specific perturbations. I still think that the Authors should strengthen the support for the pathway.

      We are happy that the reviewer feels that the evidence for Na+ rather than Ca2+ as the effector of glycolysis is strong. The tools for investigating the role of the Na+/K+ pump (NKA) are indeed limited to pharmacology, because (as the reviewer says) there are not many other options. The requirement for Na+ elevation (which stimulates NKA activity) to trigger glycolysis and the ability of ouabain, a specific NKA inhibitor, to prevent this seem like strong implication of NKA in the mechanism of glycolysis activation. Genetic manipulation of the NKA may be unable to change the level of pump activity, because of compensation by altered expression of other subunits (PMID 17234593); it also is unclear how any chronic manipulation would shed light on the role of NKA in triggering glycolysis. But perhaps future studies of knock-in mice in which the α1 isoform of NKA has made more sensitive to ouabain (PMIDs 15485817; 34129092) might allow the identification of the NKA as the target of ouabain in this situation to be made even more secure.

      Also, there is a long list of publications on the connection between the Na+/K+ pump and glycolysis. It might be useful to highlight the role of the NCX- Na+/K+ pump coupling in the activation of glycolysis in the title.

    1. Author Response

      Reviewer #1 (Public Review):

      Dotov et al. took joint drumming as a model of human collective dynamics. They tested interpersonal synchronization across progressively larger groups composed of 1, 2, 4 and 8 individuals. They conducted several analyses, generally showing that the stability of group coordination increases with group numerosity. They also propose a model that nicely mirrors some of the results.

      The manuscript is very clear and very well written. The introduction covers a lot of relevant literature, including animal models that are very relevant in this field but often ignored by human studies. The methods cover a wide range of distinct analyses, including modelling, giving a comprehensive overview of the data. There are a few small technical differences across the experiments conducted with small vs. large groups, but I think this is to some extent unavoidable (yet, future studies might attempt to improve this). Furthermore, the currently adopted model accounts well for behaviors where all individuals produce a similar output and therefore are "equally important". However, it might be interesting to test to what extent this can be generalized to situations where each individual produces a distinct sound (as in a small orchestra) and therefore might selectively adapt to (more clearly) distinguishable individuals.

      We agree that this is important. We discuss this in a new section (4.1) at the end of the discussion. We suggest that heterogeneity makes it possible for other modes of organization to compete with the attractive tendency towards the global average. We also point out that factors such as individual skill, task difficulty, delays, and selective attention enable such heterogeneity in the ensemble.

      Similarly, it would be interesting to test to what extent the current results (and model) can be generalized to interactions that more strongly rely on predictive behavior (as there is not much to predict here given that all participants have to drum at a stable, non-changing tempo).

      We can only speculate that the present results are less relevant to interactions that rely strongly on predicitive behavior, as behaviour in our simple task could be modeled well by our hybrid single oscillator Kuromoto model. We inserted the idea that the presence of a group rhythm can diminish the demands for individuals to predict each other’s notes, the end of paragraph 1, page 27.

      An important implication of this study is that some well-known behaviors typically studied in dyadic interaction might be less prominent when group numerosity increases. I am specifically referring to "speeding up" (also termed "joint rushing") and "tap-by-tap error correction" (Wolf et al., 2019 and Konvalinka et al., 2010, also cited in the manuscript, are two recent examples). I am not sure whether this depends on how the data is analyzed (e.g. averaging the behavior of multiple drummers), yet this might be an important take-home message.

      Thank you for the suggestion. We edited to emphasize that the relevant part of the analysis of the drumming data was performed at the individual level and using the same methods as typically done in dyadic tapping (first sentences of Section 2.7.2). Speeding up was the only variable where we used group-averages. For consistency, and to avoid confusion, in the present version we re-did the stats (the changed statistical parameters are highlighted) and figures using the individual data points and we did not observe major changes.

      I am confident that this study will have a significant impact on the field, bringing more researchers close to the study of large groups, and generally bridging the gap between human and animal studies of collective behavior.

      Reviewer #2 (Public Review):

      In this manuscript Dotov et al. study how individuals in a group adjust their rhythms and maintain synchrony while drumming. The authors recognize correctly that most investigation of rhythm interaction examines pairs (dyads) rather than larger groups despite the ubiquity of group situations and interactions in human as well as non-human animals. Their study is both empirical, using human drummers, and modeling, evaluating how well variations of the Kuramoto coupled-oscillator describe timing of grouped drummers. Based on temporal analyses of drumming in groups of different sizes, it is concluded that this coupled oscillator model provides a 'good fit' to the data and that each individual in a group responds to the collective stimulus generated by all neighbors, the 'mean field'.

      I have concerns about 1) the overall analysis and testing in the study and about 2) specific aspects of the model and how it relates to human cognition. Because the study is largely empirical, it would be most critical for the authors to propose two - or more - alternative hypotheses for achieving and maintaining synchrony in a group. Ideally, these alternatives would have different predictions, which could be tested by appropriate analyses of drummer timing. For example, in non-human animals, where the problem of rhythm interaction in groups has been examined more thoroughly than in humans, many acoustic species organize their timing by attending largely to a few nearby neighbors and ignoring the rest. Such 'selective attention' is known to occur in species where dyads (and triads) keep time with a Kuramoto oscillator, but the overall timing of the group does not arise from individual responses to the mean field. Can this alternative be evaluated in the drumming data ? Would this alternative fit the drumming data as well as, or better than , the mean field, 'wisdom of the crowd' model ?

      These are very important points. The present paper is restricted to a simple task where participants are instructed to synchronize with each other. However, we now more explicitly acknowledge the limitations of our study and include a new section, “Beyond the group average” at the end of the Discussion that is dedicated to this issue and discussed other organizing tendencies that are particularly relevant in larger and more diverse ensembles. In the context of the present task, the relative difference between local and global interactions was likely negligible because of the small differences in timing, from 4 to 16 ms, between the closest and most distant pairs.

      It will be interesting in future studies to introduce acoustic heterogeneity by varying the timbre of the instruments, for example. In the present study, the instruments had the same timbre with narrowly varying fundamental frequencies (117-129 Hz in the duets/quartets and 249-284 Hz in the octets), a situation that encourages integration of all the acoustic information. We do point out that the present approach needs to be expanded to be able to account for competitive pressure and selective attention.

      The well-known Vicsek model (discussed briefly in paragraph 2, page 15), related to the Kuramoto under certain assumptions, can account for a variety of dynamic behaviors in flocking animals. The ability for selective attention in the form of a heterogeneous coupling matrix, combined with the existence of competitive pressure in the form of negative coupling terms can result in spontaneous formation of clusters and spatiotemporal patterns of movement. This is consistent with prior research in chorusing animals (insects and anurans). Large musical ensembles also involve groupings of instruments such as separate sections that change their relative loudness across time. Typically these are not spontaneous but composed and conducted, yet they may satisfy the same constraints.

      We also pointed out that we see these as complementary organizing principles. Even in the Vicsek model, there is a notion of a ‘local order parameter’ whereby individuals are coupled to a group average within a narrow interaction radius. The relative importance of other organization tendencies depends on the layout of the acoustic environment and the competitive and collaborative aspects of the task. Hence, parameters such as delay and individual heterogeneity could act as symmetry breaking terms that enable different stabilities from the basic global group synchrony.

      A second concern arises from relying on a hybrid, continuous - pulsed version of the Kuramoto coupled oscillator. If the human drummers in the test could only hear but not see their neighbors, this hybrid model would seem appropriate: Each drummer only receives sensory input at the exact moment when a neighbor's drumstick strikes the drum. But the drummers see as well as hear their neighbors, and they may be receiving a considerable amount of information on their neighbors' rhythms throughout the drum cycle. Can this potential problem be addressed? In general, more attention should be paid to the cognitive aspects of the experiment: What exactly do the individual drummers perceive, and how might they perceive the 'mean field' ?

      This is all very relevant. We instructed participants to focus on X’s in the centers of their drums and not look at their peers (edited to mention that in at the end of Section 2.4, page 9). Additionally, the pattern of results for tempo change, cross-correlations, and variability in the dyadic condition was consistent with previous studies that involved purely auditory tapping tasks (emphasized in the begging of paragraph 2, page 26). The best way to address this limitation would be to repeat the study and block the visual contact among participants, as well as include a condition emphasizing visual contact.

      It is beyond the scope of the present paper to make model-based predictions of effects of coupling and information availability, but this should be done in future work. For the present paper, we now include a simulation involving continuous coupling (end of section 2.9.2, page 16) and Supplementary Figure 8A) which fails to reproduce the results for variability, results that are well captured by the hybrid continuous-pulsed model we developed, see the Supplementary Materials.

      Reviewer #3 (Public Review):

      The contribution provides approaches to understanding group behaviour using drumming as a case of collective dynamics. The experimental design is interestingly complemented with the novel application of several methods established in different disciplines. The key strengths of the contribution seem to be concentrated in 1) the combination of theoretical and methodological elements brought from the application of methods from neurosciences and psychology and 2) the methodological diversity and creative debate brought to the study of musical performance, including here the object of study, which looks at group drumming as a cultural trait in many societies.

      Even though the experimental design and object of study do not represent an original approach, the proposed procedures and the analytical approaches shed light on elements poorly addressed in music studies. The performers' relationships, feedbacks, differences between solo and ensemble performance and interpersonal organization convey novel ideas to the field and most probably new insights to the methodological part.

      It must be mentioned that the authors accepted the challenge of leaving the nauseatic no-frills dyadic tests and tapping experiments in the direction of more culturally comprehensive (and complex) setups. This represents a very important strength of the paper and greatly improves the communication with performers and music studies, which have been affected by the poor impact of predictable non-musical experimental tasks (that can easily generate statistical significant measurements). More specifically, the originality of the experiment-analysis approach provided a novel framework to observe how the axis from individual to collective unfolds in interaction patterns. In special, the emergence of mutual prediction in large groups is quite interesting, although similar results might be found elsewhere.

      Thank you for these comments.

      On another side, important issues regarding the literature review, experimental design and assumptions should be addressed.

      I miss an important part of the literature that reports similar experiments under the thematic framework of musical expressivity/expression, groove, microtiming and timing studies. From the participatory discrepancies proposed in 1980's Keil (1987) to the work of Benadon et al (2018), Guy Madison, colleagues and others, this literature presents formidable studies that could help understand how timing and interactions are structured and conceptualized in the music studies and by musicians and experts. (I declare that I have no recent collaborations with the authors I mentioned throughout the text and that I don't feel comfortable suggesting my own contributions to the field). This is important because there are important ontological concerns in applying methods from sciences to cultural performances.

      Thank you for the suggestions. We included a brief discussion in the newly added “Beyond the group average” section at the end of the Discussion, specifically the first paragraph, pages 27-8. We think that expressive timing naturally fits in continuation with the other reviewers’ concerns about how much the idea of the group average generalizes to real musical situations. By design and instruction, we stripped individual expression from the present task. Specific cultural contexts and performance styles may want to escape or at least expressively tackle this constraint of our task, and we believe that now that we have established the mean field as one factor affecting group behaviour, further studies can take on the challenge of developing models that make predictions in more complex situations closer to real musical interactions – and testing those models empirically.

      One ontological issue that different cultural phenomena differ from, for example, animal behaviour. For example, the authors consider timing and synchrony in a way that does not comply with cultural concepts: p.4 "Here we consider a musical task in which timing consistency and synchrony is crucial". A large part of the literature mentioned above and evidence found in ethnographic literature indicate that the ability to modulate timing and synchrony-asynchrony elements are part of explicit cultural processes of meaning formation (see, for example, Lucas, Glaura and Clayton, Martin and Leante, Laura (2011) 'Inter-group entrainment in Afro-Brazilian Congado ritual.', Empirical musicology review., 6 (2). pp. 75-102.). Without these idiosyncrasies, what you listen to can't be considered a musical task in context and lacks basic expressivity elements that represent musical meaning on different levels (see, for example, the Swanwick's work about layers/levels of musical discourse formation).

      Indeed, this is an important issue. We often use cultural phenomena merely as a motivation but do not dive in the relevant details. Here, in addition to the previous discussion, we now reiterate that the tendency towards the group average is one organizing tendency but there are additional ones, enabled by individual heterogeneity and context. For example, marching bands and chanting crowds probably impose different constraints than individual artistic expression by skillful musicians.

      Such plain ideas about the ontology of musical activities (e.g. that musical practice is oriented by precision or synchrony) generate superficial constructs such as precision priority, dance synchrony, imaginary internal oscillators, strict predictive motor planning that are not present in cultural reports, excepting some cultures of classical European music based on notation and shaped by industrial models. The lack of proper cultural framing of the drumming task might also have induced the authors to instruct the participants to minimize "temporal variability" (musical timing) and maintain the rate of the stimulus (musical tempo), even though these limiting tasks mostly take part of musical training in some societies (examples of social drumming in non-western societies barely represent isochronous tempo or timing in any linguistic or conceptual way). The authors should examine how this instruction impacts the validity of results that describe the variability since it was affected by imposed conditions and might have limited the observed behaviour. The reporting of the results in the graphs must also allow the diagnosis of the effect of timing in such small time frame windows of action.

      We agree totally. We made changes and tried to be more specific about the cultural framing, delineating contexts where the present ideas are more relevant and where they are less relevant, or at least incomplete (the bottom of page 3, and pages 27-8).

    1. Author Response

      Reviewer #1 (Public Review):

      This paper primarily assessed the host/phage interactions for bacteria in the order of Cornyebacteriales to identify novel host factors necessary for phage infection, in regards to genes responsible for bacterial envelope assembly. Bacteria in this order, such as Mycobacterium tuberculosis and Corynebacterium diphtheriae have unique, complex envelopes composed of peptidoglycan, arabinogalactan, and mycolic acids. This barrier is a potent protector against the therapeutic effects of antibiotics. Phages can be used to discover novel aspects of this bacterial envelope assembly because they engage with cell surface receptors. To uncover new factors, the researchers challenged a high-density transposon library of Corynebacterium glutamicum (called Cglu in the paper) with phages, Cog, and CL31. Results by transposon sequencing identified loci that were interrupted, leading to phage resistance. This study implicated the importance of Cglu genes, ppgS, cgp_0658, cgp_0391, and cgp_0393. They also identified a new gene called cgp_0396 necessary for arabinogalactan modification and recognized a conserved host factor called Ahfa (Cpg_0475) that plays a crucial role in Cglu mycolic acid synthesis. Ultimately, this work implicated the importance of mycomembrane porins, arabinogalactan, and mycolic acid synthesis pathways in the assembly of the Cornyebacteriales envelope.

      Strengths of the research:

      • Language choice: A major strength of the paper is that this could easily be given to an undergraduate student with introductory knowledge of biology and they would still be able to get the gist of this paper. The language is written in a clear, concise fashion with explanations of terms not everyone would immediately know unless they worked in the field specifically.

      • These figures are generally explained in a direct manner, clearly stating the major conclusions the reader should get after carefully analyzing the presented data

      We thank the reviewer for the enthusiasm for our work and our description of it.

      How the research could be strengthened:

      • It could be worthwhile to describe some of your results mathematically. For example, the differences you see in your phage infections relating to the differences in logs, etc. Bar graphs also should be described in mathematical terms, when "something is lower compared to the WT," how much is lower, etc?

      To keep the text streamlined, we refrained from adding descriptions of the results mathematically in the text. The reader can refer to the figures to get the magnitudes of any changes observed.

      • There were no p values relating to the statistical significance of any of the data presented, which should be changed for the final manuscript implicating the importance of this work.

      We added the p-values as requested.

      • Figure 8 was not entirely supported by the data, especially Figure 8A which either could be improved with better images that support the author's claims, etc.

      We do not understand why the reviewer believes that Figure 8A does not support our conclusions. The mutant cells do not label with the 6-TMR-Tre dye whereas the WT control does. The dye labels mycolic acid such that our conclusion that AhfA is involved in mycolic acid synthesis is valid. In any case, we have included an additional supplementary source data file of the uncropped image of the 6-TMR-Tre treated cells to show a larger number of mutant cells that fail to stain, further supporting our conclusion.

      Reviewer #2 (Public Review):

      In this manuscript, McKitterick and Bernhardt use genetic approaches to investigate genes in Corynebacterium glutamicum that are required for efficient phage infection. They make use of a high-density transposon library that was generated in the Bernhardt lab recently. They challenged the library with two phages, CL31 and Cog. Importantly, they elegantly adapted the phages to the laboratory strain MB001 before. The MB001 strain is ideal for genetic experiments since all prophage elements were removed in this strain. The evolved phages are likely a very useful tool for further investigations aiming to understand host/virus interactions in this model. The phage-infected libraries were plated and the collected colonies were sequenced. Genes involved in efficient phage infection had multiple transposon insertions. Using this method the authors identified specific genes required for infection with Cog and CL31. The Cog phage needs apparently the porin proteins in the mycolic acid membrane for efficient infection and the authors speculate that the porins may act as auxiliary receptors for phage adsorption. Furthermore, genes involved in putative arabinogalactan modification were found to be important. Mutants in these genes did not abolish phage adsorption and thus play a role in viral genome injection. For phage CL31 the authors show that in particular genes involved in mycolic acid synthesis are essential. The genes identified include one coding for a protein involved in protein mycoloylation. A candidate for such a lipidation is the porin protein complex PorAH. The trehalose-6-phosphate synthase OtsA was also identified as important for phage infection. Also strictly required for the establishment of the myco membrane, otsA deletions are viable in C. glutamicum. As part of their analysis, they also identified an unknown factor in mycolic acid synthesis in C. glutamicum. Analysis of a spontaneous resistant mutant to CL31 revealed a mutation in cg_0475 (renamed ahfA). Deletion of ahfA drastically reduced mycolic acid production. This was proven by thin layer chromatography and fluorescent staining. Interestingly, deletion of ahfA also results in a cell morphology defect, indicating the importance of a correct mycolic acid layer for cell shape.

      In summary, the authors provide an excellent paper that is clearly written and experiments are conducted nicely.

      We thank the reviewer for their kind words and enthusiasm for the work.

      Reviewer #3 (Public Review):

      In their manuscript, McKitterick and Bernhardt perform a screen to determine host factors, such as receptors, which are important for bacterial viruses (phages) to infect Corynebacterium glutamicum., an organism that shares the unique membrane of mycobacteria (mycomembrane), with M. tuberculosis. To do so, they challenge a previously described Tn-seq library with a high MOI of 2 phages - Cgl and Cog. The surviving strains are those in which genes important for phage infection (such as receptors) are disrupted. The authors' screen is successful, and the authors identify and validate several factors important for the infection of each phage, providing the first such screen in Corynebacterium. Moreover, the authors perform a suppressor screen to identify additional factors and experimentally follow up several genes of interest. Finally, the authors use the newly determined host specificity of te phages to implicate new genes in mycolic acid synthesis. As a whole, this is a strong work that paves the way to a deeper understanding of Corynebacterial and (by extension) Mycobacterial phages and should be of broad interest.

      Below, we suggest additional analyses, context, and elaboration that will help the ms. elaboration to fully realize its impact.

      Major points:

      1. Although the authors' experimental design is fundamentally sound, I am worried about the possibility of "jackpotting" in shaping their results, particularly in the uninfected control experiment. If the authors' Tn-seq library is ~200,000 strains, and they don't plate at least 10-100x times that many colonies then any given strain (regardless of its phenotype) may or may not be represented in the output of the experiment, causing false phenotypes to be ascribed to genes based on chance. This is particularly a problem for the uninfected control, where the authors choose to dilute the culture 1000fold to mimic the number of colonies that survive infection. They may be better served by plating the whole culture on the plates, to ensure adequate representation of the library. Part of the reason for this concern is that an overwhelming majority of statistically significant hits (something like 80-90%) appear to confer susceptibility rather than resistance (source data Fig 2) - something the authors' experimental design should not be able to measure. The lack of accurate representation of distributions of strains in the starting culture also calls into question the quantitative differences they present in the results

      We thank the reviewer for their thorough analysis of our experimental design. The Tn-Seq experiments were repeated with the uninfected controls plated at a density that maintains the representation of the original library. The overall results are largely unchanged because we maintain our focus on hits that become greatly enriched following phage infection not those that become depleted. The vast majority of these hits were validated for their involvement by constructing mutant strains, indicating the robustness of the current and previous analyses. With respect to the depletion of insertion mutants, we mentioned in the original submission that they are unlikely to be biologically meaningful.

      a. L138. Where the authors describe their initial experimental design it would be helpful to add more details. What is the size of the Tn library? What is the coverage in their experiment? Approximately how many colonies are recovered on the plates after phage infection and in the uninfected control?

      This information has been added (Fig. 2 table supplement 1).

      b. it is important to know how the number of colonies on the plates compares to the number of reads in the experiment. In the analysis of most HT screens, one implicitly assumes that each read corresponds to 1 cell, hence each read can be treated as statistically independent. This assumption is critical to the statistical methods used to analyze this data. By scraping a plate of colonies (which may be required for efficient phage infection), the authors potentially violate this assumption (since the number of cells → number of colonies, which are the actual statistically independent entities in the experiment). Does this assumption hold (or approximately hold) for the screen? If not, a different statistical method should be used to determine p-values.

      We respectfully disagree with the reviewer on this point. In our view, a slurry of colonies from a plate is no different than a culture. Both contain a mixture of cells containing an array of different transposon mutants each represented multiple times in the population due to replication of the original mutant. We do not think there is any meaningful difference to the analysis whether this replication occurs in liquid or on a plate. In both cases, a read corresponds to a single cell/molecule of purified genomic DNA from the population.

      1. The authors' Tn-seq methodology is different from previously published HT-phage screens (e.g. Mutalik et al., 2020 and Rousset et al., 2018). Based on my knowledge of classical phage biology, I agree that plating the infected cells has advantages. However, the rationale will not be clear for most people performing such experiments. Please explain the rationale for the experimental protocol.

      Although the authors in the Mutalik et al paper did do competition experiments in liquid over several infection cycles, they also made use of a solid platebased assay in which they adsorbed their phages to the library cells for 15 minutes before plating. These plates were incubated overnight and resistant colonies were scraped, pelleted, and DNA prepped in a similar manner to the approach we took.

      We prefer plating over liquid growth because colony formation is an easy way to ensure that the mutant population has undergone numerous rounds of doubling under a given condition before the analysis is performed.

      a. Why did the authors plate the cultures after initial phage absorption instead of remaining in liquid?

      We were concerned that some potential receptor-related mutants would be less fit and would therefore be lost in a competition experiment. As such, plating after phage adsorption would decrease the competition between phage survivors. Furthermore, we thought that plating would additionally ensure that the bacteria that are sequenced are true survivors and not just reflect remnant DNA in the culture.

      b. How reproducible are the authors' Tn-seq results? The SRA ascension shows multiple replicates but this is not described in the manuscript nor reflected in the supplementary data. Given the potential for bottleneck and jackpotting effects in this assay, some measure of reproducibility is important for interpreting the results (see point 1).

      We performed completely new Tn-seq experiments for each phage in duplicate. The hit lists remained largely unchanged from our initial analysis and those that were investigated further were enriched for insertions in both new data sets. Thus, the results are highly reproducible.

      c. L587 "Significant hits with fewer than 10 insertions on each strand were manually removed." Why did the authors choose this criterion? Almost all of the genes they removed have very asymmetric distributions (e.g. in the Cog experiment, cgp3051 has 47853 fwd reads and 6 rev reads. Asymmetric distribution of insertions suggests that overexpression of downstream genes has an important (positive or negative) effect. This is a worthwhile pursuit, and many automated analysis pipelines can disambiguate these effects, including those developed in the Walker Lab (e.g. doi: 10.1038/s41589018-0041-4). These genes shouldn't be thrown away when they are arguably some of the most informative hits!

      We have updated the criteria we used for selecting the most impactful insertion enrichments. Our concern in this report was to investigate mutants that affect phage infection when inactivated. We will pursue genes that affect phage infection when overexpressed (as indicated by asymmetric insertion orientation distributions) in a follow-on study. We think such a study would best be carried out with a different transposon containing a strong outward facing promoter.

      1. There is a somewhat extensive phylogeny of M. smegmatis phages (phagesdb.org). Are the phages that the authors work on related to any of these phages? If so, what cluster do they map to? What is the host range of other phages in that cluster? If not, may be worthwhile to mention that these are quite distinct from other studied phages.

      We agree that the phylogenetic history of corynephages is quite interesting. Very few phages that infect Cglu have been isolated and sequenced, let alone studied. Neither Cog nor CL31 share significant nucleotide identity with other sequenced phages, thus they do not have assigned clusters at the moment.

      1. Given that cgp_0475 was a strong hit in the Tn-seq, why was it not identified in the previous chemical genomics experiments from the lab (https://doi.org/10.7554/eLife.54761) ?

      We appreciate the reviewer’s interest in previous work from the lab. In the prior phenotypic analysis, cgp_0475 was identified as having severe fitness defects across many conditions. However, it was not possible to correlate its phenotype with other genes involved in mycolic acid synthesis like pks and fadD2 because they were found to be so sick in the phenotypic outgrowth that they were classified as essential.

      1. Is there any relationship between the growth-rate of the mutants and their phage susceptibility? This can be analyzed using the authors' previous studies of this library.

      While some of the phage resistant mutants are associated with poor fitness (namely those involved in mycolic acid synthesis), not all were associated with decreased growth. For example, there were minimal fitness defects associated with deletions of either porAH or the genes involved GalN decoration. However, loss of these genes greatly inhibited the ability of Cog to infect.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In the manuscript entitled "Long-term mitotic DNA damage promotes chromokinesin-mediated missegregation of polar chromosomes in cancer cells," the authors propose that DNA damage on mitotic chromosomes causes chromokinesin-mediated polar chromosomes, which eventually results in missegregation and micronuclei formation. They first performed screening of compounds that cause DNA damage on mitotic chromosomes and found that DNA damage delayed mitosis in the nocodazole wash-out experiment. The authors found that several DNA damage-inducing compounds all caused an increase of asymmetric Mad1 localization on polar chromosomes. Using photoactivatable GFP-a-tubulin, the authors showed that a-tubulin stabilizes after Etoposide treatment. They finally showed that chromokinesin Kid and Kif4a knockdown rescues the asymmetric Mad1 localization.

      Major comments:

      1. Page 6, line 155: the authors claim that "In contrast, among other defects, treatment with any of the DNA-damaging compounds caused a significant mitotic delay due to the presence of misaligned chromosomes near the spindle poles." Although Figure 2A shows a representative image of polar chromosomes, I do not find quantitative data that analyze %polar chromosomes in mitosis treated with DNA-damaging compounds. I also do not find the data supporting the claim that polar chromosomes caused a mitotic delay. Because most subsequent analyses were performed based on this result, the quantitative data should be provided here. For the latter, I suggest showing "time in mitosis (Fig 2B)" separately with or without polar chromosomes.
      2. According to Figure 2C, the ratio of "Exit with micronuclei (from misaligned chromosome(s))" is relatively low compared to other phenotypes such as "Mitotic arrest" or "Cell death." I wonder if polar chromosome phenotype is also correlated with these other cell fates. Please clarify which fate is correlated with polar chromosome formation after DNA damage.
      3. In Figure 3, the authors used Nocodazole-treated background to assess the involvement of SAC in DNA-damaging compound-induced mitotic delay. However, as shown in Figure 2B, DNA-damaging compounds cause a minor delay in mitosis, which might be challenging to analyze in the presence of Nocodazole. There is also a possibility that DNA damage response (DDR) works independently and adjunctly to delay mitosis. Because one of the major claims of the authors is that "the SAC is the only mechanism that is required to delay mitosis in the presence of long-term mitotic DNA damage (page 10, line278)", I recommend Nocodazole wash-out (as in Figure 2B) to examine the effect of MPS1-IN-1 (and ideally an inhibitor of the DDR pathway, such as ATMi) on mitotic delay induced by DNA-damaging compounds.
      4. Line 226, (our unpublished observations): because the authors claim that "the formation of polar chromosomes due to the stabilization of kinetochore-microtubule attachments upon long-term mitotic DNA damage is likely exclusive to cancer cells," the authors should present data on RPE-1 cells at least for %polar chromosome formation (as suggested in comment 1) and Mad1 localization. Plus, even though the data is provided, the statement "exclusive to cancer cells (page 8, line 230)" is speculative and should be toned down. Mad1 localization data is also important because the authors claim that "long-term mitotic NA damage specifically stabilized kinetochore-microtubule attachments in cancer cells (page 10, line 288)" in the discussion.
      5. For the Mad1 assay, such as in Fig. 4A, the authors analyzed the CENP-C pair with two or one Mad1 foci formation. However, in some representative pictures, for example, Fig S4A-Etoposide, I found pairs of CENP-C signals on the polar chromosome without any Mad1 foci (the one next to the pairs shown in the square). As the authors argue, these kinetochores may represent polar chromosomes that eventually satisfy SAC and may be important. I, therefore, wonder why those kinetochores are omitted from the assay. Please explain this point in the manuscript if there is any reason.

      Minor comments:

      1. Page 7, line 168: the authors claim that "regardless of the type of DNA lesion, long-term mitotic DNA damage persists throughout mitosis and promotes micronuclei formation from polar chromosomes." However, the former claim is not fully supported by Figure S3, which addressed the effect of Etoposide only; the latter claim is not fully supported by Figure 2C, which lacks clarity (as pointed out in comment 2) and statistical analysis. Please revise this sentence.
      2. Line 182: it would be helpful for readers to explain why MG132 was used.
      3. Line 210: it would be helpful for readers to explain briefly what PA-GFP means and how the assay works.
      4. Figure 6E-G: I wonder whether siKid+siKif4a affected %polar chromosomes or not.
      5. Page 10, line 287: the authors claim that "we show that long-term mitotic DNA damage..., causing the missegregation of polar chromosomes due to the action of arm-ejection forces by chromokinesisns,...." However, only Mad1 localization data is provided in Figure 6E-G, and whether siKid + siKif4a rescues the missegregation of polar chromosomes is not clear. The authors should either provide supporting evidence or revise this sentence for clarity.
      6. Figure 1E: some color codes for each compound are difficult to distinguish. I also found it challenging to locate some lines on the graph. I recommend separating this graph, for example, by types of DNA lesions caused by compounds, and color codes that are easy to distinguish should be used.

      Referees cross-commenting

      I generally agree with other reviewers' comments and confirmed that they raised similar concerns.

      Significance

      It has been described previously that mitotic arrest induces DNA damage and that the DDR pathway during mitosis is attenuated. The data presented in this manuscript provide a potentially novel cellular response against DNA damage during mitosis. The manuscript will be of interest to those in the field of the cell cycle (especially mitosis), the DDR, and tumor chemotherapies. While the finding that DNA damage during mitosis causes polar chromosomes is potentially interesting, the manuscript is still rather descriptive, and data that address the molecular mechanism is insufficient for the level that the authors conclude. Although the data quality is high, I think some essential data supporting their conclusion and clarity of the description are missing from the manuscript, which can be addressed before publication.

    1. this is supposed to be science fiction. escapism! exploring the boundaries that we can’t explore in polite shitty society. and not one character in your entire novel is trans, gay, ace, or queer? you may be excused, old dead white dude, for being born in the early twentieth century when your very exposure to such ideas would have been oppressively policed. like ianthe says, i can respect that but i can’t admire it fade into obsolesence pls kthx

      j’adore.

      I wonder when the last time was that I read a book written by a man. Maybe the Adventure Zone comics? Ah, and Yeats. But I feel a bit as though… I am willing to humble myself before some authors to see what I must expand my view to understand. But for white dudes I do not extend very much Benefit Of Doubt unless I have a lot of social context telling me they are worth it. This has been working out pretty okay. I still read about cis dudes and they aren’t always boring, so I think it’s possible to keep tuning an approach until you’re finding stuff that works along multiple dimensions; it isn’t entirely zero-sum.

    1. Further, with the hierarchical powerdynamic neutralized clients have the freedom to provide input and tailortreatment goals. They may even feel safe to correct or interject if practitionersare off-course to ensure the accuracy of information. Finally, practitionerhumility, flexibility, and openness allow for adjustments during treatment tooptimize outcomes. The end result is motivated clients who are heard,validated, and empowered

      Some clients will actually fight you to put you in a power position over them. There is something which they may find relieving or conforting in surrendering that empowerment, or perhaps they just believe they don't have the right to expect a non-heirarchical relationship with their counselor. Often when we are asked for advise as counselors it is actually a bid for the permission for the client to surrender their will, intentions and decisiveness to you. That may have characterised familiar abusive relationships for them in the past. Or it may be a way for them to avoid looking inside or taking up their power. But if you give advice too readilly - you might end up responsible if things don't turn out right. What if things don't turn out right for them because it wasn't what they wanted and they get hurt- even if they don't blame you for this- you might actually be to blame. Cultural humility is partly having the ability to recognize that we can accompany and help people quite a lot, but we can not rescue them or think for them, or want for them or know better for them. We are actually fighting for a capacity within the therapeutic relationship- to build connections of equality and empowerment.

    1. All this is true, but all this is not all the truth.  What the older scientific men did not see -- what Newton did not see, as he looked to the perfect order of the heavens -- what Cuvier did not see, when he dwelt so fondly on the teleology seen in every part of the animal structure -- what Paley did not see, when he pointed out the design in every bone, in every joint and muscle -- what Chalmers did not see, when in his astronomical discourses he sought to reconcile the perfection of the heavens with the need of God's providing a Saviour for men -- has been forced on our notice, as naturalists have been searching into animal life, with its struggles and its sufferings

      Again, I think McCosh is highlighting that all of these scientific men in varying fields have found answers to certain questions we have about the universe and ourselves. They may have uncovered some truths through the practice of research, experimentation, and other scientific tools, but they still cannot account for the larger aspect of their existence, who created them and how were they made from the start? Only a supreme being could be responsible. So, he's saying, yes some of what you're saying/ found, identified, uncovered is true, but it's not the entire truth. You still can't account for the innate, and there is where we find God.

  3. Sep 2022
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      See cover letter for more details.

      Summary of response to reviewers:

      We were immensely pleased that the reviewers considered our conclusions “well supported” and our study “beautifully executed”. Reviewers also recognized the significance of our work. Reviewer 1 stated that “building a model that describes one of these pathways will allow us to begin to test therapies to treat or prevent scoliosis” then noted that we “help to build a larger model of normal spine morphogenesis” and that this is “important”. Reviewer 2 called our work an “exciting advance in our understanding of one of the essential signaling pathways that help regulate body axis straightening and spine morphogenesis in zebrafish” and mentioned that our work “may also help to further our understanding of the etiology and pathophysiology of multiple forms of neuromuscular scoliosis in humans”. Reviewer 3 agreed, stating that our work “adds important information on the role of urotensin signaling in spine formation” and noted that it is timely: “findings are of special significance in the light of recent reports that mutations in UTS2R3 show association with spinal curvature in patients with adolescent idiopathic scoliosis”.

      We thank the three reviewers for reading our research and providing feedback. In all cases, we have incorporated (or plan to incorporate) their suggestions, and we believe this has (will) make our manuscript much stronger. Indeed, reviewers had only a small number of “major points”, and all are easily addressed as summarized below. We have already addressed some of those “major points”, as well as the majority of “minor points” raised by reviewers, in our current draft. We expect that all comments can be fully addressed within around 1 month.

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are plannedto address the points raised by the referees.

      • *

      We have divided our responses by whether the reviewers considered their points major or minor. All points have already been, or will soon be, fully addressed.


      Major points


      Reviewer 1

      • *

      The key conclusions are well supported, see below for my two major issues.

      Please don't call this lordosis. Lordosis or hyperlordosis effects lumbar vertebra. The curve in the lumbar region shifts body weight so that human gait is more efficient that that in the great apes, or so the story goes. Zebrafish do not have lumbar vertebra equivalents or a natural curve in the caudal region. Similarly, fish do not have the equivalent vertebra to generate kyphosis, which is again a hyper flexion of a normal human spinal curve. Instead zebrafish have Weberian, precaudal and caudal vertebra. It would be so much more useful for the field if the authors used these terms and specified ranges, i.e. numbered vertebrae, that are effected so we can directly and accurately compare regions of defects between zebrafish mutants. It would help to make the point that the uts2r3 mutant has more caudally located curves than urp1/2 double mutants. We appreciate this point and agree with the reviewer. Lordosis (or hyperlordosis) is indeed the accentuation of a curve which naturally exists in humans but not zebrafish. We called the phenotype of Urotensin pathway mutants ‘lordosis’ or ‘lordosis-like’ because of the position of the curves — in caudal vertebrae, which are evolutionarily and positionally equivalent to lumbar vertebrae, though they are structurally different to human lumbar vertebrae. To address this comment, we will no longer refer to the phenotype as lordosis in our Introduction or Results sections and we will expand our Discussion to include this point raised by the reviewer.

      1. The observation that urp1/2 double mutants have curves only in the D/V plane and almost completely lack side-to-side curves is noteworthy. Does the urp1-/-urp2-/- mutant uncouple two systems for posture? If this separate a DV from side-to-side postural control system, that would be very interesting. It is particularly important to describe how penetrant the phenotype is and how many times it was observed. See 9 minor comments. It would help the reader if the authors explicitly described the features that they see in the cfap298 mutant that constitute lateral curves and that are lacking in urp1/2 (e.g. in figure 4E).

      We plan to expand the figure and analysis describing D/V curves and M/L curves. While our first draft included only cfap298 and urp1-∆P;urp2-∆P mutants, our next draft will also incorporate uts2r3 and pkd2l1 mutants. We have already scanned cohorts of all mutant fish, and so the remaining work to render and quantify the degree of lateral curvature will not take long. This will allow us to conclusively determine whether these different mutations indeed uncouple two systems controlling posture in different directions. As the reviewer requests, we will include all fish analyzed in either main or supplementary figures, include numbers in figure legends, and quantify the penetrance of M/L and D/V curves.

      We have also generated cfap298;urp1-∆P;urp2-∆P triple mutants and are currently scanning them to reveal skeletal form. Preliminary data suggests triple mutants have three-dimensional curves but D/V curves are more severe in triple mutants than in cfap298 mutants alone. This makes sense if Urp1/Urp2 are important for controlling D/V spinal shape and, as our qPCR shows, Urp1/Urp2 are downregulated but not lost completely in cfap298 mutants. It also furthers the notion that cilia motility controls D/V and M/L curves by separable mechanisms. * *

      • *

      Reviewer 2

      Need to show that the CRISPANT targeting was effective for mutagenesis at each loci screened in the work presented in Figure 1E. In Figure 1E, we presented the phenotypes of crispant embryos (i.e. embryos injected with four gRNAs targeting a specific gene alongside relatively high doses of Cas9 protein; see schematic in Figure 1G). In positive controls (cfap298 and sspo), crispants showed the expected phenotype in all cases (Figure 1E and see Figure 1H for quantitation). As for germline mutants, urp1 and urp2 crispants showed no early axial phenotypes (Figure 1E and 1H). As such, the reviewer requests that we perform molecular assays to determine whether mutagenesis was successful in these embryos. To do so, we will perform either T7 assays or next-generation/Sanger sequencing of mutated loci. This will allow us to determine and quantify the effectiveness of our mutagenesis. Results will be shared in a new supplementary figure. These assays are straightforward and we expect they will not take very long to complete. Indeed, we have performed these assays previously for other genes (e.g. Grimes et al., 2019 and several unpublished genes). We have achieved high levels of mutagenesis in all cases, making us very confident that we will achieve similarly high levels of mutagenesis in this case.

      Reviewer 3


      The addition of the F0 crispant experiment to show that the pro-peptide of urp1/2 does not have a function and is responsible for the difference between the observed morpholino and the crispr phenotype was important. However, since no phenotype was observed in crispants it is important to add evidence of induced cuts for all guide RNAs used in the crispant experiment. These control experiments might have been done already. If not, they can easily be done in a short period of time by performance of T7 assays on injected fish and would not require additional reagents. This is the same point raised by reviewer 2 and so we refer to the response above. In summary, we agree with the reviewer and we are currently performing these suggested experiments which are straightforward and working well.

      The authors claim that there were no structural defects observed in urp1/2 double mutants. However, the hemal arch in figure 3 E seems to be deformed. This could be normal variance or a phenotype. This can be addressed by simple reinspection of the scans.

      We believe there are no major vertebral structural defects that could be attributed to causing the spinal curves because vertebrae are well-formed in mutants and we see no defects in the initial patterning of vertebrae in our calcein experiments. However, since urp1-∆P;urp2-∆P and uts2r3 mutant spines are curved, the vertebrae are a little misshapen. We plan two revisions, one textual and one analytical.

      First, we will make clear in our textual edits that some vertebrae are slightly misshapen, as occurs in non-congenital forms of human spinal curve disease (in congenital forms, the shape defects are more striking and likely causative in the curvature). We agree with the reviewer that stating that there is a lack of vertebral structural defects lacked nuance, so we will expand on this in our next draft.

      Second, we will quantify vertebral shapes in spinal curve mutants and report these data in our next draft. After reinspection of the scans, as the reviewer suggested, we believe it would be informative for our readers to see quantitation of vertebral shape. We expect these data to more rigorously back up our statements about ‘minor structural differences’ of vertebrae between uncurved and curved individuals. We have already begun this work, and completing it should only take a few more weeks. As an example, we have measured the shape of centra by calculating aspect ratios in wild-type and urp1-∆P;urp2-∆P double mutants in curved regions of the spine:

      These preliminary data already make clear that there are indeed subtle morphological differences between vertebrae in mutants and wild-type, as occurs in human spinal curve deformities. We will present completed versions of these data (several parameters that describe vertebral shape) in our next draft and provide comments about whether such changes could be causative in spinal curve etiology as occurs in congenital-type scoliosis.

      Minor points


      Reviewer 1

      Supplementary FigS3B How to measure the Cobb Angle is unclear. Why is the first curve not counted? I count 3 curves. First a ventral displacement, then a dorsal to ventral return, then a sharp flex before the tail. How to measure Cobb angle might be easier to explain if the figure is expanded into steps. Identify the apical vertebra, then showing how the lines are drawn parallel to those vertebrae, then where the measured angle forms between the lines perpendicular to the drawn parallel lines.

      We will more thoroughly explain how Cobb angle is measured in our next draft.

      5a. I think we (zebrafish biologists) need be explicit about what we mean with "without vertebral defects." What do we count as defects? Vertebrae can be fused, bent, shortened or the growing edges can be slanted. In Figure 3E, and movie7, it is clear that the highlighted mutant vertebrae are shorter than WT. The growing ends of normal vertebra are perpendicular to the long axis of the vertebra. In the mutants the ends are slanted. Please define in the text what you consider a relevant vertebral defect, because these vertebrae have defects. Or are you only considering the calcein stained centra at 10dpf?

      We strongly agree with the reviewer. As described more thoroughly above in response to Major Comment – Reviewer 3, we plan both textual edits and new quantitation of vertebral shape to address this comment. Our quantitation indeed shows some vertebrae are shorter in mutants as the reviewer noticed. We also plan a new paragraph in the Discussion section which will speak about the issue of what zebrafish biologists might mean by “without vertebral defects”.

      5b. Do you want to base your patterning conclusion on primarily the calcein data as these are closer to the notochord patterning time window. Please anchor this conclusion to a specific time or standard length e.g. 10dpf/5.6mm.

      When we edit our descriptions of vertebral defects, and include new quantitative data on the shape of vertebrae, we will be clear that the vertebrae are slightly structurally malformed. In addition, when we speak of the calcein data, we will anchor those conclusions to the specific timepoint best studied by this method, as the reviewer suggests.

      "At 30 dpf... several mutants exhibited a significant curve in the pre-caudal vertebrae, in addition to a caudal curve (Fig. 3D and S3C). Since pre-caudal curves were rare in mutants at 3-months, this suggested that curve location is dynamic".The frequency of this observation is important. Does it effect all or a fraction of mutants? Can you provide some numbers to anchor these observations? Maybe fractions e.g.. 3 of 4 fish had precaudal curves at 30pdf, and 0 of 10 fish had precaudal curves by 3 mpf?

      In our next draft, we will provide numbers of fish examined at 30 dpf and also show graphical summaries of curve position (as we did for younger fish). Last, all scans will be included in a new supplementary figure.

      The description of the pkd2l1 mutant, instead of terming it kyphosis can you tell the reader the vertebra number at the peak of the curve. The authors say the pkd2l1 mutant is highly distinct from urp1/urp2-/-, but the reader needs to hear exactly what is distinct. For example, does this mutant have both lateral and D/V curves?

      We have now scanned several pkd2l1 mutant fish and we will include images of pkd2l1 mutants at two different timepoints together with quantitation of curve position. Our results agreed with those previously published for this mutant line (Sternberg et al., 2018) but we believe it is important for our readers to see side-by-side images and quantitation so they can see the distinctions.

      At 3-months of age, pkd2l1 mutants essentially appear wild-type but by around 12-months they have developed a D/V curve in the pre-caudal vertebrae. They do not exhibit M/L curves; we will quantify this and include these data in our Figure about M/L deviation.

      We called the phenotype displayed by pkd2l1 mutants “kyphosis” to be in line with a previous publication describing these mutants (Sternberg et al., 2018). We will add new wording in the Discussion about whether or not zebrafish can truly model kyphosis and lordosis (see response to Reviewer 1 major comment above), and we make clear in our Results that the phenotype has “been argued to model kyphosis (Sternberg et al., 2018)” rather than “is kyphosis”.

      It is intriguing that pkd2l1 mutants do not exhibit any curves until much later in life than urp1-∆P;urp2-∆P and uts2r3mutants. Inspired by this finding, we aged urp1-∆P and urp2-∆P single mutants and found that they go on to develop D/V curves by 12-months i.e.

      • *

      • *3-months 12-months Position of curve

      urp1-∆P no curves mild D/V curves Mostly caudal

      urp2-∆P mild D/V curves intermediate D/V curves Mostly caudal

      urp1-∆P;urp2-∆P severe D/V curves severe D/V curves Mostly caudal

      uts2r3 severe D/V curves severe D/V curves Mostly caudal

      cfap298 severe 3D curves severe 3D curves Caudal and pre-caudal

      pkd2l1 no curves mild D/V curves Mostly pre-caudal

      Phenotypes in urp1-∆P and urp2-∆P single mutants upon aging shows: 1) Urp1 and Urp2 are not entirely redundant in long-term spine maintenance and 2) proper Urp1/Urp2 dose is essential. We will include these new data in our next draft.

      Does uts2r3-/- have no /minimal side-to-side curves like urp1/urp2-/-?

      This is an interesting question. To address it, we will add images of uts2r3 mutant spines from the dorsal aspect and include them with our new quantitation of lateral curvature. To sum, the reviewer’s suggestion is correct – there are minimal side-to-side curves in uts2r3 mutants.

      One finding that deserves more discussion is the observation that urp1/urp2 double mutants have almost no side-to-side defects and all the obvious bends are in the D/V plane. Does this uncouple two systems for posture? Please consider the following paper. It shows a proprioception system that maintains normal side-to-side posture. A spinal organ of proprioception for integrated motor action feedback. Picton LD, Bertuzzi M, Pallucchi I, Fontanel P, Dahlberg E, Björnfors ER, Iacoviello F, Shearing PR, El Manira A. Neuron. 2021 Apr 7;109(7):1188-1201.e7. doi: 10.1016/j.neuron.2021.01.018. Epub 2021 Feb 11. PMID: 33577748

      Thank you for pointing out this manuscript. We will include it in our expanded Discussion.

      Reviewer 2

      Fig 3F: might be improved by making the images black and white and possibly inverted. It is not easy to clearly see the vertebrae as is. * *

      Thanks for the suggestion, we will make this change.

      • *

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Minor points


      Reviewer 1

      • *

      Figure 1D legend says urp1 is expressed in dorsal while urp2 is express in all CSF-cNeurons, but the image for urp1 shows only ventral cells in WT, while the image for urp2 shows the same cells ...and more dorsal cells. Please replace image with one that matches the text. Apologies for this, we have now corrected it. The image was correct but we accidentally wrote “dorsal” instead of “ventral” when describing the CSF-cN sub-population harboring urp1 transcripts.

      In Figure 2H, the position of curve apex graphic, how many fish were examined? In 2f it looks like n=8 and n=9. Can this info be added to the figure?

      We have now included the number examined in the legend.

      I did not find legends for the movies. The first call to the movies calls movies 1-3 without explaining what each shows. The labels on the downloaded files are not informative.

      Apologies for forgetting to submit these. We have now added informative Movie legends.

      Reviewer 3

      • *

      It would be helpful to the reader to add a little more information on urp1 and upr2 proteins and their processing to make it clear while only the 3' region of the protein was targeted to induce mutations. We have incorporated textual edits to make this more clear. We now state in the second sentence of the Results section:

      Urp1 and Urp2 are encoded by 5-exon genes with the final exon coding for the 10-amino acid peptides that are released by cleavage from the pro-domain (Fig. 1A).

      Together with Fig. 1A and Supplementary Fig. 1, we hope it is now clear to readers how Urp1 and Urp2 are generated from a 5-exon gene encoding the pro-domain and the peptide, which are separated by cleavage.

      It would also be helpful to the reader to have a schematic indicating the guide target sites (they could be added to figure S1 C + D) in the protein to be able to interpret the result more easily.

      Done!

      Figure 5: Addition of a square to H would help understand were the pictures in D-F were taken.

      Done!

      4. Description of analyses that authors prefer not to carry out

      N/A. We are performing all experiments/analyses requested by reviewers.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01574

      Corresponding author(s): Casey, Greene

      1. General Statements [optional] We thank the reviewers for their thorough feedback. We have addressed all the points raised, revised the manuscript accordingly, and explained our changes below. To aid readability, the reviewers’ comments have been converted to italics, and our responses have been bolded.

      Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The authors systematically evaluate the performance of linear and non-linear ML methods for making predictions from gene expression data. The results are interesting and timely, and the experiments are well designed.

      I have a few minor comments:

      - It was hard for me to understand Figure 1B. I think a figure like this would be very helpful however. What do the numbers represent? If sample ID, then I am not sure why x-axis label is also "samples"

      - For analysis of GTEx data, not sure what "studywise splitting" would mean, since the GTEx dataset is one study? Do you leave out the same individuals from all tissues for evaluation?

      We thank the reviewer for their input on these two points. To make Figure 1B clearer and to elaborate on our stratified splitting methods, we have amended its description to “We stratify the samples into cross-validation folds based on their study (in Recount3) or donor (in GTEx). We also evaluate the effects of sample-wise splitting and pretraining (B).”

      - I found the sample size on x-axis of Fig 2a confusing. If I understand correctly, GTEx has a total of ~1000 subjects. So in some sense, effective sample size can not be bigger than 1000. If you are counting subjects x tissue as sample, then it can be misleading in terms of the effective sample size.

      We thank the reviewer for this point. To incorporate it into the manuscript, we’ve added the following text to the description of Fig. 2: “It is worth noting that "Sample Count" in these figures refers to the total number of RNA-seq samples, some of which share donors. As a result, the effective sample size may be lower than the sample count. “

      - Would be interesting to assess out-of-sample generalizability of linear and non-linear models. Have you tried training on GTEx and predicting on Recount3 or vice versa?

      This question intrigued us. We reran the tissue prediction experiments from the manuscript on a subset of the GTEx and Recount3 datasets in which we performed an intersection over tissues and genes. We found that in the out-of-sample domain the logistic regression model and the three layer neural network performed similarly, while the five layer net generally had a lower accuracy despite having similar accuracy in the training domain. We also found (consistent with our results in the paper) that GTEx predictions are an easier task than their Recount counterparts. Below are plots demonstrating these findings:

      [These plots appear in the PDF but do not appear to work in the ReviewCommons Form].

      Reviewer #1 (Significance (Required)):

      Important and timely study, evaluating linear vs non-linear methods for predicting phenotype from gene expression datasets.

      We appreciate the reviewer’s positive comments on the timeliness of our manuscript.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary

      The authors want to assess the presence of non-linear signal in gene expression values in the task of tissue and sex classification. They use logisitic regression classifiers and two types of neural networks, with 3 and 5 layers, and assess classification performance on two large expression datasets from Recount3 and GTEX and three simulated datasets.

      The authors carefully construct their learning setup in such a way that one can reason about the removal of linear signal from the expression features. The interesting conclusion is, that although the linear approach works well on both datasets, and sometimes even better than the more complex models. The authors convingly show, that there is a significant non-linearity in the gene expression data. However, just because it is "there" does not imply that any non-linear methods performs better.

      Major comments:

      - Are the key conclusions convincing?

      The authors did a good job in showing, that there is non-linear signal in gene expression features for the classification problems studied.

      We thank the reviewer for their positive feedback.

      - Should the authors qualify some of their claims as preliminary or speculative, or

      remove them altogether?

      The overall claims of the authors are justified, the discussion may be improved.

      We appreciate the reviewer’s support for our overall claims and we have adjusted the manuscript as noted point by point below.

      - Would additional experiments be essential to support the claims of the paper?

      No, additional experiments are not essential. But the authors did not compare to other non-linear methods such as SVM or knn-classifiers in the resulst or conclusion section. It is unlikely that the main conclusion would change if those methods were tried. But it is possible that other "simpler" non-linear methods, such as knn for example, are able to outperform the logistic regression classifier on the GTEX and Recount3 data set. Thus, the authors should at least mention this as part of the conclusion and could extend their discussion on the implications of their study concerning other tasks or models.

      We agree that there should be more discussion of other models in the conclusion section. We have updated the fifth paragraph of the conclusion accordingly:

      “We are also unable to make claims about all problem domains or model classes. There are many potential transcriptomic prediction tasks and many datasets to perform them on. While we show that non-linear signal is not always helpful in tissue or sex prediction, and others have shown the same for various disease prediction tasks, there may be problems where non-linear signal is more important. It is also possible that other classes of models, be they simpler nonlinear models or different neural network topologies are more capable of taking advantage of the nonlinear signal present in the data.”

      - Are the suggested experiments realistic in terms of time and resources?

      Not applicable.

      - Are the data and the methods presented in such a way that they can be reproduced?

      There is a separate github repo which has the code to reproduce the analyses. This is good. However, would be nice to explain in more detail in the manuscript how the limma function was used for removing the linear signal, as they mention the "removeBatchEffect" function was used, but it would be good to tell the reader how that works, as this is their way for assessing the effect of linear-signal removal. Are there any limitations for the assessment of signal removal in this way?

      We thank the reviewer for their input, and have updated the model training section on signal removal to read: “We also used Limma[24] to remove linear signal associated with tissues in the data. We ran the ‘removeBatchEffect’ function on the training and validation sets separately, using the tissue labels as batch labels. This function fits a linear model that learns to predict the training data from the batch labels, and uses that model to regress out the linear signal within the training data that is predictive of the batch labels.”

      We have also elaborated on the limitations of signal removal by updating the sentence “This experiment supported our decision to perform signal removal on the training and validation sets separately, as removing the linear signal in the full dataset induced predictive signal (supp. fig. 6)” to read “This experiment supported our decision to perform signal removal on the training and validation sets separately. One potential failure state when using the signal removal method would be if it induced new signal as it removed the old. This state can be seen when removing the linear signal in the full dataset(supp. fig. 6).”

      - Are the experiments adequately replicated and statistical analysis adequate?

      Yes

      Minor comments:

      - Specific experimental issues that are easily addressable.

      no

      - Are prior studies referenced appropriately?

      Yes

      - Are the text and figures clear and accurate?

      *Also, they conducted 3 different experiments in Figure 3. It would be useful to separate the figure into 3) A, 3) B, and 3) C and link that specifically in the text. Figure 4 is an extended version of Figure 2, just with the additional results of the signal removed performances. *

      We appreciate the feedback. To make the figure and the text more clear, we have added A, B, and C subheadings to figure 3, and updated the subfigure’s references within the text accordingly.

      First, the pairwise results in 4B are hard to read as the differences in colors and line type are difficult to see as some lines are short. Second, we did not find it helpful to reproduce the full signal approach in Figure 4. We would suggest to make Figure 4 as Figure 2, and simply only talk about the Full signal mode in the beginning, how it is in the text.

      We agree. We have made Figure 4 our new Figure 2 and updated the references in the text.

      Further, it would be nice to give better names in the legends of these plots. Pytorch_lr is not a nice name.

      We thank the reviewer for pointing this out. We have updated the names in the legends to be “Five Layer Network”, “Three Layer Network”, and “Logistic Regression”

      - Do you have suggestions that would help the authors improve the presentation of

      their data and conclusions?

      As the Recount3 dataset is different in quality and complexity it would be reasonable to show the results of the binary classifcation also in the main paper. In particular, as this behaves different to the GTEX binary classification.

      We have now moved the Recount binary classification figure from the supplement to join the GTEx binary classification data as the new figure 4.

      -The title is somewhat unprecise. It may induce the impression that the paper is about expression-prediction, although that is not the case. Further, in the abstract they don't mention what prediction problem they solve and that these are classification problems. After reading the paper it is clear why the authors choose that, but we are suggesting an alternative title that the authors may consider:

      The effect of nonlinear signal in classification problems using gene expression values

      We agree with the reviewer’s comment and have updated our title to “The effect of non-linear signal in classification problems using gene expression”

      Further, they should give more details on the problem learned in the abstract.

      We thank the reviewer for their feedback, and have added details to the abstract about the problem domains. The relevant sentence now reads “We verified the presence of non-linear signal when predicting tissue and metadata sex labels from expression data by removing the predictive linear signal with Limma, and showed the removal ablated the performance of linear methods but not non-linear ones.”

      *-In addition, the conclusion section, which may be title as Disucssion and Conclusion, could contain additional points concerning the topology and training of the neural networks. *

      We have updated the heading of the final section to Discussion and Conclusion. To expand on the potential drawbacks of our neural network topologies, we have also updated the limitation portion of Discussion and Conclusion to read “We are also unable to make claims about all problem domains or model classes. There are many potential transcriptomic prediction tasks and many datasets to perform them on. While we show that non-linear signal is not always helpful in tissue or sex prediction, and others have shown the same for various disease prediction tasks, there may be problems where non-linear signal is more important. It is also possible that other classes of models, be they simpler nonlinear models or different neural network topologies are more capable of taking advantage of the nonlinear signal present in the data.”

      Obviously, it is possible that other simpler or more complex neural networks have a better performance on the GTEX and Recount3 data sets compared to logistic regression. In fact, the results from Figure4 suggest that, as there is clearly useful non-linear signal in those datasets for the classification problems studied. However, optimizing a non-linear model is inherently more complex and time-consuming, and thus may not be done thoroughly in previously published papers. Compared to a linear model that is easier and faster to optimize, this may be one reason why studies find that, despite non-linear signal, the linear model performs better. Other factors such as the samples size, which the authors already mention, of course also plays a big role, and if hundreds of thousands of datasets would be there , e.g. from single cell measurements, non-linear methods may have a better chance of outcompeting linear models.

      We agree, which is why we consider the signal removal experiment to be so important. By demonstrating that the non-linear methods we used were in fact learning non-linear signal we were able to show that there was something that non-linear models were able to learn that logistic regression was unable to. That is to say that while the presence of non-linearity in the decision boundary is necessary for non-linear models to outperform linear ones, it is not by itself sufficient. Perhaps with more data or a different model non-linear methods would perform better, but there is certainly a class of models and problems where logistic regression is preferable.

      Reviewer #2 (Significance (Required)):

      The submitted manuscript adds to the discussion of the necessity of non-linear models when solving classification problems using gene expression data. The significance is mostly technically, as a comparison of logistic regression and two neural network topologies that are being compared on two large expression datasets. However, there is also a conceptual part of the contribution, which is with regards to the implications of their experiments.

      Interested audience would be computer scientists and bioinformaticians or others, that are involved in creating or interpreting these or similar prediction models.

      Our field of expertise is in the creation of machine learning models using different types of OMICs data. All aspects of the work could be assessed.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, the authors discuss an interesting problem regarding the comparative performance of linear and non-linear machine learning models. The main conclusion is that logistic regression (linear model) and neural networks (non-linear model) have comparable performance if the data contain both linear and non-linear relations between the features (X) and the prediction target (Y), however, if the linear component in the X-Y relation is removed (e.g. regressed out) the neural networks will outperform logistic regression. This conclusion implies that linear models such as logistic regression mainly relies on the linearity in the X-Y relation.

      However, whether X-Y relation has a linear component and whether the data (e.g. for different Y classes) are linearly separable are two different questions. For example, consider a data generating mechanism, y=x^2+x and label the data points using two classes (y1). Clearly, the data is linearly separable, and any machine learning algorithm should perform very well on this problem. Now remove the linear component form the X-Y relation and use y=x^2 to generate the data. The data is still linearly separable, and the performance of logistic regression should not be affected.

      We agree that there is a difference between optimal linear decision boundaries and linear relationships between elements in the training data. Our use of the term “relationship” in place of “decision boundary” was imprecise. To make this more clear, we have made the following changes:

      Introduction:

      “Unlike purely linear models such as logistic regression, non-linear models should learn more sophisticated representations of the relationships between expression and phenotype.” -> “Unlike purely linear models such as logistic regression, non-linear models can learn non-linear decision boundaries to differentiate phenotypes.”

      “However, upon removing the linear signals relating the phenotype to gene expression we find non-linear signal in the data even when the linear models outperform the non-linear ones.” -> “However, when we remove any linear separability from the data, we find non-linear models are still able to make useful predictions even when the linear models previously outperformed the nonlinear ones.”

      Discussion and conclusion:

      We removed the following paragraph: “Given that non-linear signal is present in our problem domains, why doesn’t that signal allow non-linear models to make better predictions? Perhaps the signal is simply drowned out. Recent work has shown that only a fraction of a percent of gene-gene relationships have strong non-linear correlation despite a weak linear one [23].”

      The point is that the performance of linear models is mainly dependent on whether the data are linearly separable instead of the linearity in X-Y relation as the manuscript suggests.

      We agree that this is the key point and appreciate the reviewer for helping us to more carefully hone the language to convey this point.

      Reviewer #3 (Significance (Required)):

      The performance comparison between linear and non-linear machine learning models is important.

      We appreciate the reviewer’s recognition of the significance of the work.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to the Reviewers

      We thank the reviewers for their excellent suggestions and constructive comments. We now added new data on PE15/PPE20 binding to Ca2+, the PDIM status of mutant strains, additional controls, added to the discussion, added detail to the Methods, and provide all RNA-seq data. Please see replies to the comments in detail below:

      Reviewer 1:

      Major points

      1. Cellular localization:
      2. “The authors do not describe the cellular fractionation method…”, “The authors show some Western blot data in Fig. S3, though the legend is superficial (abbreviations not explained) and the controls with markers for cellular localization appear to be lacking”. “Further, the authors do not prove that FLAG-tagged PE20 is functional.”

      We included a description of the fractionation method in Materials and Methods (lines 475-485). We also added detail to the legend of Fig. 4A to explain the abbreviations and controls used. The same cell fractions were used in Fig. 4A and Fig. S3A, as mentioned in the Figure S3 legend (“The same cell fractions as in Fig. 4A were used, see controls therein”). We know that the FLAG-tagged PPE20 is functional because the strain used in this experiment is the same we used for genetic complementation experiments in which FLAG-tagged PPE20 functionally complements ppe20 deletion in all three assays (ATP consumption, biofilm, Ca2+ influx, Fig.4 B,C,D,G).

      • “The authors should extend discussion part of the manuscript. Several proteomic studies.” “Did authors analyze culture filtrate fraction by Western?

      We thank the reviewer for the references and extended the Discussion to include results from existing proteomic studies on PE15/PPE20 (lines 229-234). We did not test for PE15/PPE20 in culture filtrate, and previous proteomic results are contradictory. Several PE/PPE proteins, including PE15/PPE20 have been detected in the cell wall and in the CFP, but not consistently. The functional significance of this dual localization is unclear.

      1. Is PE15/PPE20 a channel?

      2. “PPE20 purified alone from the cytosol of E. coli?”

      We did not purify either protein by itself. As the reviewer correctly notes, PE/PPE proteins are refractory to individual purification. We clarified that we purified and used the complex for experiments even if only PPE20 is shown, as in Figure 3C,D, and E (Lines 124-127). See also Methods line 382 ff.

      • “…a positive control of a mutant that is indeed deficient in Mg2+ import (and thus showing a phenotype) is lacking.”

      Lacking a specific Mg2+ import mutant, and because it is a relatively minor point, we removed the statements about selectivity.

      1. Thermal melting assay

      2. It is surprising to see that the thermal melting assays was done for PPE20 and PE15 as separately purified proteins.

      We co-purified PE15 and PPE20 for all biochemical experiments. We clarified that point (see also point 2 above).

      • “the thermal melting assay only seemed to give some results for PPE20 alone, and not for PE15”

      PE15 did not produce interpretable results in this assay, as mentioned in line 144. We clarified in the Fig. 3 legend that the complex was used although only PPE20 is detected by Western blot and shown in Figure 3C.

      • “…the results are counter-intuitive… How can the authors be sure that the presence of Ca2+ does not simply lead to more protein precipitation (via rather unspecific interactions) at elevated temperatures? Some positive controls with bona fide calcium binding protein in the same thermal melting setup would have helped to clarify this.”

      The effect of Ca2+ on PPE20 is somewhat counterintuitive, although not unprecedented. Proteins can be stabilized or destabilized by ligand binding, and a recent proteome-wide study on the basis of thermal shift analysis showed that ~17% of proteins were destabilized by ligand (ATP). For a channel in particular, ligand binding might be expected to be coupled to protein relaxation in the process of channel opening, which could well translate to lower thermal stability. We added the positive control showing the behavior of a known Ca2+ binding protein (new Fig. S2A). In addition, we included a negative control showing that Ca2+ does not generally increase protein denaturation (Fig. S2B). We think that this control addresses the reviewer’s concern more directly.

      • If the authors want to stick to their claims regarding Ca2+ binding to PE15/PPE20, they have to perform additional assays (e.g. equilibrium dialysis or ITC) with the entire PE15/PPE20 complex. Further, they have to show that PE15/PPE20 forms a proper oligomeric protein that is membrane bound and reasonably behaved on size exclusion chromatography, when expressed in and purified from E. coli.

      Detecting Ca2+ binding to proteins is not trivial, and we thank the reviewer for suggesting equilibrium dialysis as another, orthogonal assay. We now show an equilibrium dialysis experiment that confirms Ca2+ binding by the PE15/PPE20 complex. Please see the new Fig. 3F. and G. and lines 146-152 (Results) and 429-443 (Methods).

      The PE/PPE proteins are generally difficult to express and purify recombinantly, likely due to the typically large unstructured regions. Also, the yield of PE15/PPE20 when expressed in E. coli was very low so that we were not able to detect the complex by SEC. However, data in Fig. 3 conclusively show that PE15 and PPE20 bind.

      1. RNA-seq data

      2. The authors should include a table with all other identified genes that are potentially involved in calcium homeostasis

      We provided all other significant differentially expressed genes in the new Table S1.

      Minor points:

      1. “what is the binding affinity of the Ca sensor?”

      We added the Ca2+ binding affinity of Twitch-2B (KD: 200nM) in line 176.

      1. Figure 4D: “one would expect a drop in FRET signal after EGTA addition… Can the authors explain?”

      We do see a clear drop in FRET signal after EGTA addition, in particular in 7H9 medium (black versus red line, Fig. 5B). Given the high affinity of Twitch-2B for Ca2+ (200nM), however, it is not surprising that the drop is not more pronounced, as intracellular Ca2+ is expected to be tightly bound to Twitch.

      1. The experiments showing outer membrane localization of PE15/PPE20 are very important, but results of these experiments (western-blot and FRET) are shown in supplementary figures. They should be transferred/integrated into the main Figures.

      We agree and moved Figure S3A to the main Figures as Figure 4A.

      1. Line 166: the authors claim that the assay did not work in 7H9 due to low Ca2+ concentration in this medium. Why did the authors not just add a bit more calcium to show whether this claim holds true?

      7H9 is not a suitable medium for these experiments because the baseline Ca2+ concentration is too high, not too low (see Fig. 5B, grey versus black line). Adding more Ca2+ to 7H9 medium resulted in precipitation, probably due to its interaction with phosphates. Our use of “low” in this context was confusing, we changed the wording of this sentence (line 180-181).

      1. Line 183: more detailed description on cellular fractionation and subsequent anti-FLAG Western needed here.

      We added more detail in the Materials section (lines 475 ff).

      Reviewer 2:

      • A major concern regarding the importance of the data: there are considerable technical challenges in generating Ca2+ depleted media. This is clear in that M. tuberculosis seems to be unaffected by Ca2+ in the medium - similar growth seems in Ca2+-free media to media with up to 10mM Ca2+ (Fig. S1). This raises a concern about the physiological relevance of the data (mammalian cells have intracellular Ca2+ of 0.01-0.1mM, extracellular free Ca2+ is around 1mM).

      If we correctly understand this comment, the reviewer is unconvinced that we fully and reproducibly depleted Ca2+ from medium based on a lack of an effect of Ca2+ on in vitro growth. We tested for baseline Ca2+ levels and depletion in media by inductively coupled plasma optical emission spectrometry and added these data showing precise quantitation of Ca2+ in medium (see new Fig. S1B).

      • The role of PE15/PPE20 in Ca2+ acquisition may be clearer if the authors ensure that the PDIM layer is intact. Specifically, there is a technical issue: The authors use Tween80 as a detergent. Tween-80 partially strips the outer cell wall of M. tuberculosis resulting in shedding of PDIM and PE/PPE proteins. Tyloxapol is a somewhat milder detergent. Some of the experiments would possibly show clearer phenotypes by use of Tyloxapol.

      We share the concern about PDIM, as PDIM loss is common in in vitro culture. We analyzed the total lipids by thin layer chromatography and confirmed the presence of PDIM in all three strains (Fig S3C, lines 198-201). We repeated experiments with Tyloxapol and did not see differences to Tween-80. We nonetheless now show the Tyloxapol data (Fig 5D).

      • The authors could increase the impact of their work be exploring the role of PE15/PPE20 during pathogenesis of resting versus activated bone marrow macrophages where Ca2+ fluxes of the host cell play a role in host responses.

      We agree. In vivo or macrophage experiments are a logical next step to fully characterize the function of PE15/PPE20, but we think it is beyond the scope of this manuscript. The main contribution of this paper is the identification of channel function of a PE/PPE protein pair that extends the novel channel paradigm for these proteins. These data support that transport might indeed be a shared function of the entire PE/PPE family with 169 members.

      Minor:

      • The authors should consider citing Sharma et al (2021)

      We cited the paper.

      • Are there Ca2+ binding motifs in PPE20?

      We did not detect canonical Ca2+ binding motifs in PPE20.

      • RNAseq data may need to be deposited in a public database.

      RNA-seq data have been deposited to NCBI - GEO accession GSE214266

      Link: https://urldefense.com/v3/https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE214266;!!NuzbfyPwt6ZyPHQ!tCf4MS_HRKJFn6qV2orkDAkXTWvx9IIU11fAV7TguYE2ietoMBpBgRC7rvfnM9bsoiVdIvDBUHdPmHZliDP2o5sRZR2ziK4$

      Token: cvmhakcgbpmbfuz

      • In its current state, the work is somewhat incremental

      The function of the large PE/PPE protein family of Mtb has been one of the most longstanding and perplexing puzzles in Mtb biology. For more than 20 years, speculation about their potential role, for example in antigenic variation, abounded but no conclusive evidence for this or another shared function emerged. A recent landmark paper then conclusively showed that a subset of the PE/PPE proteins function as nutrient channels (Wang et al., Science 2020). However, whether transporter function is a general function of the family of 169 PE/PPE proteins remains untested. Our PE/PPE pair is associated with a different type VII secretion system (Esx-3) and belongs to a different subfamily than the previous examples, suggesting a shared function across families and perhaps even all of these proteins. Given the intense interest and many false leads that have plagued the identification of PE/PPE function in the last 20 years, the difficulty of working with them biochemically, as well as the almost complete absence of understanding of Ca2+ homeostasis in Mtb, we do not consider our work incremental.

      Reviewer 3

      • My only slight concern is the meaning attached to the "biofilm" assays. It is never very clear to me that this is anything more than formation of a surface pellicle and general hydrophobicity of the mycobacterial cells.

      We fully agree that Mtb biofilms remain poorly defined. However, the term biofilm as used in our study has already found its way into the literature and we would rather not cause confusion by calling the same phenomenon by a different name. Whatever the term used, we do not suggest any other relevance other than it being a Ca2+-dependent phenotype that serves as one of several tests to parse PE15/PPE20’s role in Ca2+ homeostasis.

      Cross-consultation comments:

      • We agree with the concerns of reviewer#2 that the role of PDIM and use of detergent should be looked at more closely.

      We tested the roles of PDIM and detergent, see reviewer 2.

      • Likewise, the paper would strongly benefit from some further insights into the potential physiological role of PPE20/PE15 in calcium homeostasis.

      We show PE15/PPE20 function in the transport of Ca2+ and the first Ca2+-related cellular phenotypes in Mtb. Testing the role of the complex in an infection model is outside of the scope of this manuscript and mouse infection experiments would take many months and would likely be intractable because of the expected extensive redundancy among the 169 PE/PPE proteins.

    1. Or, take the case of unemployment as described by sociologist C. WrightMills:When, in a city of 100,000, only one man is unemployed, that is his per-sonal trouble, and for its relief we properly look to the character of theman, his skills, and his immediate opportunities. But when in a nation of50 million employees, 15 million men are unemployed, that is an issue, and

      we may not hope to find its solution within the range of opportunities open to any one individual. The very structure of opportunities has collapsed. Both the correct statement of the problem and the range of possible solutions require us to consider the economic and political institutions of the society, and not merely the personal situation and character of a scatter of individuals.16

      1. C. Wright Mills, The Sociological Imagination (New York: Oxford University Press, 1959), p. 9.

      I love this quote and it's interesting food for thought.

      Framing problems from the perspectives of a single individual versus a majority of people can be a powerful tool.

      The idea of the "welfare queen" was possibly too powerful because it singled out an imaginary individual rather than focusing on millions of people with a variety of backgrounds and diversity. Compare this with the fundraisers for impoverished children in Sally Stuther's Christian Children's Fund (aka ChildFund) which, while they show thousands of people in trouble, quite often focus on one individual child. This helps to personalize the plea and the charity actually assigned each donor a particular child they were helping out.

      How might this set up be used in reverse to change the perspective and opinions of those who think the "welfare queen" is a real thing instead of a problematic trope?

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This paper demonstrates a link between oxidative stress, lipid biosynthesis, and targeted histone acetylation in fission yeast. In mutant cells with defects in lipid synthesis (cbf11, mga2 lacking transcription factors, and cut6 lacking acetyl-CoA carboxylase), transcripts of a number of genes implicated in resistance to oxidative stress are increased. This is associated with higher levels of H3K9 acetylation and increased tolerance to oxidative stress. These effects are mediated through Sty1, a stress-activated MAP kinase and the transcription factor Atf1.

      It is also shown that H3K9 acetylation levels in the promoter region and just downstream of the transcriptional start site are increased in cbf11 mutants (Fig. 5A).

      By mutational analysis, the authors implicate the acetyl transferases Mst1 and Gcn5 in this transcriptional effect. Other related acetyl transferases, Hat1, Elp3, Mst2, Rtt109 have been ruled out as main contributors to the dysregulation in unstressed cbf11 mutants. That specific acetyl transferases have been shown to be required is a strength of the investigation.

      Major comments:

      The hypothesis is put forward in the manuscript that altered acetyl-CoA levels in cbf1 mutants would underlie the dysregulation of genes induced by oxidative stress. Histone acetyl transferases compete for acetyl-CoA with lipid biosynthesis, and so with increased demand for acetyl-CoA underacetylation in the concerned promoters would result - specifically at H3K9. These results do not directly support the hypothesis, on the other hand they are not sufficient to rule it out.

      Actually, we view this phenomenon the other way round: We primarily focus on exponentially growing cells, which have substantial demand for fatty acid (FA) production (= high acetyl-CoA consumption). So the level of promoter histone acetylation under these conditions is our baseline, or “normal” state. When FA production is decreased (cbf11 or cut6 mutants; inhibition of FA synthase by cerulenin…), stress gene promoters get *hyper*acetylated. We do not have any data on (or claims about) histone underacetylation compared to the baseline. Nevertheless, we now show that overexpression of Cut6/ACC results in decreased resistance to oxidative stress (Fig. 5C), which is compatible with the notion that increased acetyl-CoA consumption would result in insufficient histone acetylation at stress gene promoters during stress.

      Acetyl-CoA levels were measured only in undisturbed cells, and the possibility remains that under oxidative stress there would be changes in acetyl-CoA pools that could explain this apparent contradiction - why did not the authors examine that?

      Under oxidative stress, the Sty1 stress MAPK is activated, leading to a massive Atf1-dependent transcription wave, which is also associated with increased SAGA-dependent H3K9 acetylation (PMID: 21515633). This well-studied cellular response, however, is not the main focus of our study. Rather, we found a novel connection between perturbed lipid metabolism and increased expression of stress genes in cells *not challenged* by oxidative stress (i.e. Sty1-Atf1 are not hyperactivated). This is why we only measured acetyl-CoA concentrations in untreated cells.

      The authors argue that although the global acetyl-CoA levels are not increased, local concentrations might be altered in a way to permit higher H3K9 acetylation levels at selected promoters. Although a formal possibility, this is rather far-fetched as a small and freely diffusible molecule like acetyl-CoA should quickly equilibrate within one cellular compartment. I think that although the overall relationships that the authors have established between oxidative stress, H3K9 acetylation levels with increased expression, and lipid biosynthesis, are compelling, the role of acetyl-CoA concentrations is not clear and should be de-emphasized.

      Interestingly, acetyl-CoA production in the nucleus has been published by several studies (reviewed in PMID: 29174173), suggesting that local acetyl-CoA concentrations (microgradients) within the cell are functionally relevant. We agree that acetyl-CoA is a small molecule which, in theory, should diffuse quickly throughout the nucleocytoplasmic space. However, empirical evidence shows that the lipid synthesis in the cytosol and histone acetylation in the nucleus may not access a uniform nuclear-cytosolic pool of acetyl-CoA (PMID: 28099844, PMID: 28552616). This is related to the fact that the acetyl-CoA sink is large and acetyl-CoA may react with many proteins (i.e. any extra amounts will be consumed rapidly).

      Even though we provide strong evidence that HAT activity is critical for the crosstalk between FA synthesis and stress gene expression, we do agree that we have not conclusively established the role of acetyl-CoA in the process. However, we still feel that it is justified to point out acetyl-CoA is a “possible” mediator molecule for the crosstalk in the Results and Discussion sections.

      Minor comments:

      In many of the bar diagrams, only a borderline statistical significance is indicated (p ~ 0.05) despite seemingly large numerical differences between the means. In the legends it is stated that one-sided Mann-Whitney U tests were used. This is a non-parametric test with low power - would it not have been better to use a t test?

      We do agree that the non-parametric Mann-Whitney U test is rather conservative and, therefore, less sensitive for small sample sizes, such as n = 3. Our reason for using this particular test instead of the parametric t-test is that qPCR fold-change values come from a log-normal distribution, which is incompatible with t-test (requires normal distribution of data). Importantly, using conservative statistical testing does not invalidate our conclusions.

      What do the error bars in the diagram show, SEM? If a non-parametric test is used, a parametric measure of variability is irrelevant.

      The error bars represent standard deviation (SD). We do not see an issue here as, in our opinion, the visual style of numeric data presentation is independent from any chosen statistical testing methods.

      It would be helpful to the reader to indicate directly in the diagram panels what is actually shown, not just "fold change vs ..." In Fig. 1, 2, 4 D and 5 we see mRNA levels, in Fig. 3 chromatin IP.

      Done

      Reviewer #1 (Significance (Required)):

      The paper represents conceptual advances for our understanding of how stress responses, metabolism and transcriptional regulation are linked, although one of the links (acetyl-CoA levels in this case) is tenuous.

      This manuscript belongs in a rich literature on stress responses on the gene expression level, mostly from studies in yeast. Potentially, it adds entirely new information on how cellular stress may be mechanistially linked to stress responses.

      These results are potentially general and of broad interest to the biological community.

      This reviewer is familiar with yeast genetics, stress responses, and quantification of gene expression.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      As more and more metabolic intermediates are found to also serve as co-factors for epigenetic modifications, it has been widely accepted that regulating the levels of these key metabolites can be an effective way to control nutrient related gene expression. Acetyl-CoA is one of those early examples. Increased acetyl-CoA was shown to promote local acetylation at growth genes (Mol Cell 2011 PMID: 21596309), and ACC deletion funnels more Acetyl-CoA towards histone acetylation reactions and causes global hyperacetylation (Ref 17). However, whether those increased metabolite/co-factor can exert signal-specific effects remains elusive. For instance, although increased acetyl-CoA stimulates the SAGA complex enzymatic activity, it is not clear whether it also causes SAGA to be targeted to new sites without external cues to induce new transcription factor binding. Does increased acetyl-CoA cause broad hyperacetylation at all inducible genes which are the primary targets for those HAT complexes?

      In this manuscript, Princová et al. found that deletion of fatty acid synthesis transcriptional factors Cbf11 and Mga2 increases cell survival under H2O2 induced oxidative stress in S. pombe. They further showed that several stress-related genes increased upon Cbf11 deletion, and H3K9 acetylation at their promotor regions were elevated. They argued that FA-TF deletion may indirectly regulate stress-related genes potentially through influencing Acetyl-CoA level, although they failed to detect significant changes of global Acetyl-CoA levels. While it's interesting to see yet another example of metabolite-mediated gene expression regulation, the current manuscript only made incremental advance towards mechanistic principles of how these co-factors finetune specific gene expression program.

      Specific comments:

      1. This work showed convincingly that deletion of CBF11 or MGA2 leads to resistance to oxidative stress. However, it provides little mechanistic insight into how deletion of Cbf11 increased the expression of stress response genes and why some HATs are involved but others not (Figure EV5).

      We respectfully disagree with the notion that we only provide “little mechanistic insight” into the process whereby FA metabolism affects stress gene expression.

      • First, we show that not only deletion of cbf11, but also a very specific manipulation of the rate-limiting FA-producing enzyme (Cut6/ACC; Fig. 4D), or chemical inhibition of FA synthase by cerulenin (new Fig. 4F) all lead to increased stress gene expression. On the other hand, overproduction of Cut6/ACC results in decreased stress gene expression and lower resistance to ox. stress (new Fig. 5B-C). These findings clearly show the specific and tight mutual relationship between FA synthesis and expression of stress genes.

      • Second, we show that the DNA-binding activity of Cbf11 is critical for affecting stress gene expression levels, yet Cbf11 does not act as a stress gene repressor.

      • Third, we show that, compared to e.g. peroxide treatment, stress gene mRNA levels are only moderately increased upon downregulation of FA synthesis. So the situation can be called stress gene “derepression”. At the same time the major stress-response regulators (Sty1-Atf1, Fig. 2A-C; Pap1, new Fig. 2D-E) are required for the derepression, but, importantly, neither of them shows increased activation compared to unstressed WT cells (Fig. 3A-C). These data suggest a qualitative difference between the two phenomena (canonical stress response vs dysregulation of FA synthesis). Furthermore, they hint at an important role of the chromatin environment.

      • Fourth, we show that Gcn5/SAGA and Mst1, but not 4 other HATs, mediate the connection between FA metabolism and stress gene expression (Fig. 5D-E), and we show clear and specific H3K9 hyperacetylation of stress gene promoters in FA metabolism mutants (Fig. 5A), arguing that this is not a general acetylome issue.

      • Fifth, we show that the stress genes affected by changes in FA metabolism show unusually high nucleosome (H3) occupancy in their transcribed regions (even in unperturbed WT cells; Fig. 5A bottom panels), which could dictate the observed specificity in regulation.

      While we agree that our understanding is not yet complete, we have already described many mechanistic aspects of the link between FA metabolism and stress gene expression.

      1. Although in Cbf11 deletion cells, increased resistance to H2O2 is relied upon the Sty1/Atf1 pathway, the authors did not establish a link between lipid synthesis and Atf1 activity because Cbf11 deletion does not affect the phosphorylation of Atf1.

      Sty1 and/or Atf1 show non-zero activity even in normal, healthy, unstressed cells. Importantly, Atf1 is bound to many target promoters even in the absence of stress (Fig. 3B; PMID: 20661279, PMID: 28652406). Moreover, Sty1 is actually needed for orderly cell cycle progression (sty1KO cells are elongated, a result of postponed mitotic entry; e.g. PMID:7501024), which we now mention in the Introduction and Discussion. Our point is that Sty1-Atf1 are not hyperactivated under normal conditions - this only happens during major stress insults. Thus, in unstressed cbf11KO cells, stress gene promoters are hyperacetylated, which may facilitate their (Sty1-Atf1 and Pap1-dependent) transcription, without the need for hyperactivation of the stress response regulators. Such increased transcriptional competence of stress promoters is consistent with our findings that upon peroxide treatment stress gene mRNA levels in cbf11KO exceed those in WT (Fig. 1B). We have amended the corresponding section of the Discussion to more clearly explain our conclusions and hypotheses.

      1. Cbf11 deletion causes elevated H3K9 acetylation at the promotor regions of a number of stress respond genes, the author did not mention whether demonstrate how lipid synthesis defect causes the hyperacetylation at these promoters.

      As discussed in our manuscript, we suggest that following downregulation of FA synthesis, the surplus acetyl-CoA is used by Gcn5 and Mst1 HATs to hyperacetylate stress gene promoters.

      1. As all lipid-metabolism mutants show increased stress response, it would helpful to examine whether H2O2 induction of WT cells influence lipid synthesis, thus establish physiological links between FA synthesis and stress response.

      We now mention in the Discussion section that, curiously, cut6/ACC mRNA levels are downregulated upon peroxide treatment. However, the significance of this finding is unclear as FA metabolism is strongly regulated at the post-translational level (PMID: 12529438). Unfortunately, we are not in a position to measure changes in metabolic fluxes upon stress. In any case, we believe that such experiments would be outside the scope of the current study.

      Beside, fatty acid may be beneficial to fight oxidative stress because they maintain the integrity of cell membrane. What is the potential effect of CBF11 deletion in this aspect? The author may want to discuss it.

      The reviewer suggests that higher production of FA would result in higher resistance to oxidative stress. However, our data do not indicate this - we show that under low FA synthesis the stress resistance is actually higher. Nevertheless, we acknowledge in the Discussion that the scenario suggested by the reviewer can occur, for example, in cancer cells which become more resistant to oxidative stress following increased lipid biosynthesis/storage.

      1. Since H2O2 treatment also causes change in glucose metabolism including upregulation of glucose transporter Ght5 (PMID: 30782292), it would be enlightening to see if there is a crosstalk between the lipid and glucose metabolisms. Does Ght5 expression increase upon H2O2 treatment in CBF11 deletion strain?

      While the topic is interesting, we strongly believe that the relationship between glucose metabolism and stress gene expression is outside the scope of this study.

      According to our data used in Fig. 4A, ght5 expression in cbf11KO at 60 min after 0.74 mM H2O2 treatment is downregulated 3-fold.

      5 Different H2O2 concentration causes different stress response in pombe: Pap1 and Sty1 mediate responses for low and high H2O2, respectively. For fully activated Sty1 response, the concentration of H2O2, needs to reach 1mM (PMID: 17043891). In this study, the H2O2 concentration ranges from 0.5-1.5mM and Pap1 regulated Ctt1 does show increase upon H2O2 treatment. To test if suppressed lipid synthesis facilitates Sty1 dependent activation, it would be helpful to examine the activation of Pap1 (its nuclear translocation) to eliminate other influences.

      We agree with the reviewer. We have now included data on the role of Pap1 in the crosstalk between lipid metabolism and stress gene expression. We show that Pap1 is required for increased expression of gst2 and ctt1 in untreated cbf11KO cells (Fig. 2D). We note that ctt1 is coregulated by both Pap1 and Atf1 (Fig. 2B, D). Also, Pap1 is partially required for H2O2 resistance of cbf11KO cells (Fig. 2E). Importantly, similar to Sty1-Atf, Pap1 is not hyperactivated (no nuclear accumulation) by 10 or 60 min of cerulenin treatment (Fig. 3C), while stress gene expression is upregulated at 60 min in cerulenin (Fig. 4F) and keeps increasing after 120 min (data not shown). These data collectively support our hypothesis that upon decreased FA synthesis, stress gene promoters become more transcription-competent without the requirement for hyperactivation of the corresponding stress gene regulators.

      Reviewer #2 (Significance (Required)):

      see above

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This study examines the intriguing phenomenon that perturbation of fatty acid biosynthesis induces expression of stress-response genes by increased intracellular levels of acetyl-CoA and hyperacetylation of histones at the promoters of these genes. Loss of the CSL transcription factor Cbf11 results in induced expression of a subset of stress-response genes in unperturbed conditions and resistance to H2O2. These stress-response genes are not direct targets of Cbf11, but their upregulation is dependent on the Sty1-Atf1 pathway. Similar effects in upregulation of stress-response genes were observed in the cut6 hypomorph and mga2 deletion strain, however no change in global levels of acetyl-Co-A in the former as well as in the cbf11 deletion was detected. The upregulated stress-response genes appear to be linked to increased H3K9 acetylation in their promoters and dependent on the Gcn5 and Mst1 HATs.

      The authors present good supportive evidence linking fatty acid biosynthesis to epigenetic regulation of stress response genes potentially mediated by intracellular levels of acetyl-CoA. This is an exciting area and the fission yeast model system is ideal to elucidate the molecular mechanisms behind this process. This is a substantial body of work with state-of-the art functional genomics approaches and LC-MS analysis. The data is of high quality and the manuscript is well written and relatively easy to read. Below are my comments for the manuscript.

      It was determined that increased expression of stress-response genes in the cbf11 deletion is dependent on the presence of Sty1, and partially dependent on Atf1. How about Pap1 (or Prr1) - would this transcription factor that is also regulated by Sty1 be involved in the upregulation of the stress-response genes in the cbf11 deletion? Activation of Sty1 and Atf1 by phosphorylation was not observed in unperturbed cbf11 deletion cells which would be expected in the proposed model. This discrepancy was not well explained. Could activation of Sty1/Atf1/Pap1 in unperturbed cbf11 cells be assayed in a different way such as nuclear localization?

      As these concerns were also raised by Reviewer 2, to avoid duplication, we kindly ask you to read our detailed responses above. Briefly, we have now included new data clarifying the role of Pap1 in the increased expression of selected stress genes in cbf11KO cells (or when FA synthesis is chemically inhibited) - comment #5 of Reviewer 2 above. Also, we explain why Sty1-Atf1 and/or Pap1 hyperactivation (i.e. above their activity level in untreated WT) is actually not needed in order for decreased FA synthesis to trigger a mild/moderate increase in stress gene expression - comment #2 of Reviewer 2 above. We have now also clarified this issue in the Discussion section.

      As for the use of alternative methods for measuring the activation status of Sty1-Atf, we have already provided data from multiple independent and very sensitive methods (western blot, ChIP-qPCR; Fig. 3A-B). Also, it is questionable whether microscopy would be more sensitive than our current methods. Moreover, our H2O2-sensitive reporter does not indicate an increasingly oxidative environment inside cbf11KO cells, quite on the contrary (Fig. 1D).

      It would strengthen the model that perturbation of fatty biosynthesis induces expression of stress-response genes and H2O2 resistance if more mutant strains other than cut6 and two of its known regulators were tested. Does the proposed model apply to any deficiency in fatty acid synthesis in general or only those that result in increased levels of acetyl-CoA? For example, would deletion strains of fas1, fas2, lsd90, lcf1, lcf2 or the4 show the same stress response as cut6, mga2, and cbf11 mutants?

      The roles of lsd90, lcf1, lcf2 and the4 have been only poorly characterized so far, making it potentially difficult to interpret any stress-related phenotypes of these mutants. However, the role of the fatty acid synthase Fas1/Fas2 complex in FA production is well established. We have therefore inhibited FAS using cerulenin and found that this treatment also leads to increased stress gene expression (Fig. 5F), without causing Pap1 hyperactivation (Fig. 3C). Importantly, fas1/fas2 are not Cbf11 target genes, and FAS inhibition by cerulenin represents an acute intervention, very different from the long-term effects in cbf11/mga2/cut6 mutants.

      Also, does overexpression of cut6+ confer sensitivity to H2O2?

      Yes, our new data show that ~2-fold overexpression of cut6 both partially abolished the derepression of stress genes in cbf11KO cells (Fig. 5B), and increased sensitivity to H2O2 of WT cells (new Fig. 5C).

      The authors hypothesize that induced expression of stress-response genes in the cbf11 deletion and cut6 hypomorph is due to H3K9 hyperacetylation because of increased acetyl-CoA abundance in the cell. However, LC-MS analysis showed no change in global abundance of acetyl-CoA in the cbf11 deletion and cut6 hypomorph although differential levels of acetyl-CoA in the nucleus relative to the rest of the cell cannot be ruled out. The authors mentioned that ppc1-537 and ssp2 null are known to have lower abundance of acetyl-CoA and the latter could suppress the cbf11 deletion-induced gene expression for two of three genes tested by qPCR. Can ppc1-537 also suppress the cbf11 deletion-induced gene expression? Are ppc1-537 and the ssp2 null sensitive to H2O2?

      The ppc1-537 mutant is sick and has a growth defect, making it difficult to interpret any findings regarding its survival/resistance phenotype (see a similar issue with the cut6-621 mutant in Fig. 4E). Ssp2/AMPK has a pleiotropic role in the cell and its activity is actually controlled by Sty1-Atf1 under some stress conditions (PMID: 28515144) and the ssp2KO is resistant to osmotic stress (PMID: 28600551). All this makes it potentially difficult to derive reliable conclusions about ppc1 and ssp2. However, our current data on cut6 (ts hypomorph, Pcut6MUT, overexpression) and FAS/cerulenin are derived from precisely targeted and specific interventions, and support the proposed connection between FA synthesis and stress gene expression, and are consistent with the suggested role of acetyl-CoA (and its microgradients) in mediating the connection.

      I think Rtt109 is H3K56 specific.

      Indeed, H3K56 is the characterized specificity of Rtt109, and we indicate this explicitly in the manuscript. We wanted to make our HAT screen comprehensive since we could not presume which histone or even non-histone acetylation target(s) is involved in lipid metabolism-mediated stress gene expression. Even though we have observed increased H3K9ac (Gcn5/SAGA target), other modifications are likely involved since Mst1 affects stress gene expression in lipid mutants, but Mst1 is not known to target H3K9.

      Reviewer #3 (Significance (Required)):

      The authors present good supportive evidence linking fatty acid biosynthesis to epigenetic regulation of stress response genes potentially mediated by intracellular levels of acetyl-CoA. This is an exciting area and not all the molecular details have been elucidated in this process. S. pombe is ideal to study this fundamental process and discoveries would be applicable to other eukaryotic study organisms.

      My expertise is in eukaryotic gene regulation, molecular genetics and functional genomics, so I am quite qualified to critically review this paper.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This paper demonstrates a link between oxidative stress, lipid biosynthesis, and targeted histone acetylation in fission yeast. In mutant cells with defects in lipid synthesis (cbf11, mga2 lacking transcription factors, and cut6 lacking acetyl-CoA carboxylase), transcripts of a number of genes implicated in resistance to oxidative stress are increased. This is associated with higher levels of H3K9 acetylation and increased tolerance to oxidative stress. These effects are mediated through Sty1, a stress-activated MAP kinase and the transcription factor Atf1.

      It is also shown that H3K9 acetylation levels in the promoter region and just downstream of the transcriptional start site are increased in cbf11 mutants (Fig. 5A).

      By mutational analysis, the authors implicate the acetyl transferases Mst1 and Gcn5 in this transcriptional effect. Other related acetyl transferases, Hat1, Elp3, Mst2, Rtt109 have been ruled out as main contributors to the dysregulation in unstressed cbf11 mutants. That specific acetyl transferases have been shown to be required is a strength of the investigation.

      Major comments:

      The hypothesis is put forward in the manuscript that altered acetyl-CoA levels in cbf1 mutants would underlie the dysregulation of genes induced by oxidative stress. Histone acetyl transferases compete for acetyl-CoA with lipid biosynthesis, and so with increased demand for acetyl-CoA underacetylation in the concerned promoters would result - specifically at H3K9.

      These results do not directly support the hypothesis, on the other hand they are not sufficient to rule it out. Acetyl-CoA levels were measured only in undisturbed cells, and the possibility remains that under oxidative stress there would be changes in acetyl-CoA pools that could explain this apparent contradiction - why did not the authors examine that?

      The authors argue that although the global acetyl-CoA levels are not increased, local concentrations might be altered in a way to permit higher H3K9 acetylation levels at selected promoters. Although a formal possibility, this is rather far-fetched as a small and freely diffusible molecule like acetyl-CoA should quickly equilibrate within one cellular compartment. I think that although the overall relationships that the authors have established between oxidative stress, H3K9 acetylation levels with increased expression, and lipid biosynthesis, are compelling, the role of acetyl-CoA concentrations is not clear and should be de-emphasized.

      Minor comments:

      In many of the bar diagrams, only a borderline statistical significance is indicated (p ~ 0.05) despite seemingly large numerical differences between the means. In the legends it is stated that one-sided Mann-Whitney U tests were used. This is a non-parametric test with low power - would it not have been better to use a t test? What do the error bars in the diagram show, SEM? If a non-parametric test is used, a parametric measure of variability is irrelevant.

      It would be helpful to the reader to indicate directly in the diagram panels what is actually shown, not just "fold change vs ..." In Fig. 1, 2, 4 D and 5 we see mRNA levels, in Fig. 3 chromatin IP.

      Significance

      The paper represents conceptual advances for our understanding of how stress responses, metabolism and transcriptional regulation are linked, although one of the links (acetyl-CoA levels in this case) is tenuous.

      This manuscript belongs in a rich literature on stress responses on the gene expression level, mostly from studies in yeast. Potentially, it adds entirely new information on how cellular stress may be mechanistially linked to stress responses.

      These results are potentially general and of broad interest to the biological community.

      This reviewer is familiar with yeast genetics, stress responses, and quantification of gene expression.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This study presents a first structural insight on formin mDia bound to actin filaments in physiological conditions. Based mainly negative stain EM, the authors use 2D and 3D class averaging to describe two main configuration of the formin at the filament barbed end. The two configurations support the previously proposed stair-stepping model, which was based on crystal structures, with an open state where the formin binds two actin monomers and a closed state where three monomers are bound. Because the majority of the structures fall in the first, open state, this supports the existence of this intermediate. The authors also show that the orientation of the free FH2 in this open state is somewhat flexible, as several sub-classes with different angles can be distinguished. Finally, they identify, for the first time, formin densities bound along the length of the filament.

      The data is well presented and I don't have any major issue. The only point is that the information that all the initial structural data comes from negative stain EM comes should be put upfront. One gets the feeling that cryoEM is used throughout until one reads the section on cryoEM. Given that the methodology is now also established for cryoEM, it is regrettable that data was not collected with a 300kV microscope, which may have revealed more details of the conformations, but I understand microscope time is hard to come by, and the authors did a remarkable job from negative-stain EM.

      The finding of formin densities binding along the length of the actin filament is very interesting. Besides the previous cited finding, it also reminds of the observations made in yeast where Bni1 (in S. cerevisiae; PMID 17344480) and For3 (in S. pombe; PMID 16782006) where shown to exhibit retrograde movement with polymerizing actin cables in vivo. This would be interesting to consider in the discussion.

      Reviewer #1 (Significance (Required)):

      This study extends our understanding of the mechanism of formin-mediated actin assembly, by providing a first structural observation in physiological conditions. While confirmatory of previously proposed model, but also excludes an alternative model, and offers novel observations of flexibility and binding along the actin filament length. It will be of great interest to researchers on the actin cytoskeleton.

      My expertise is in the actin cytoskeleton and formins, but I am no expert in EM structural analysis.

      We thank reviewer 1 for the very positive comments and for pointing out the relevance of our study for the actin cytoskeleton field. As advised, we now specify upfront in the abstract and in the introduction that most of the presented results were obtained from negative stain electron microscopy. Following the reviewer’s advice, we have enriched the discussion to highlight the retrograde movements of formins in actin cables observed in vivo.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Maufront et al. have used EM to study the conformation of mDia1 at the barbed end and the core of actin filaments to explain the molecular mechanism of the FH2 dimer processivity at these sites. Based on modelled structural data they tried to describe how the conformational changes in FH2 dimer lead to its partial dissociation, and then association with filaments during the process of translocation coupled to subunit addition at actin filaments barbed ends. This supports a previous study (Otomo et al. 2005, Nature), in which using X-ray crystallography structural data were used to propose a stair-stepping model for Bni1p translocation at the barbed ends during actin polymerization. The model for mDia1 binding to core filaments is also given. Moreover, using EM structure and the previously reported structures of actin (PDB: 5OOE), and actin with formin FH2 dimer (PDB: 1Y64), authors explained the dynamic nature of FH2 dimer at barbed ends of the filaments using the flapping model. But due to the low resolution of their structures ~ 26-29A0, the finer details of actin and the FH2 dimer structure at barbed ends could not be resolved, leaving open questions about the orientation of actin helical twist at this end during elongation. The authors tried several conditions to get high density barbed-end filaments, but that did not collect adequate number of particles, resulting in low number of particles selected for structure modelling purposes. However, to attain more physiologically relevant structure they used cryo-EM, but were successful in capturing only the open conformation structure of FH2 dimer (at low resolution). Thus, due to low resolution of structures the key findings have not added much to what we already know about the mechanism of FH2 dimer translocation during actin polymerization, except that their studies support the stair-stepping model (Otomo et al. 2005, Nature) and not that of "stepping second" model ( Paul and Pollard. 2008, Curr. Bio.). Thus, this manuscript does not merit publication in this journal.

      We thank reviewer 2 for taking the time to read and review our study. However, we respectfully disagree with the statement that our findings “have not added much to what we already know about the mechanism of FH2 dimer translocation during actin polymerization”. As mentioned in our report, collecting EM data for formins in physiological conditions (at the barbed ends of growing filaments), as we do here for the first time, entails limitations on the number of particles one can observe and on the resulting resolution. Despite this rather low resolution, our data allow us to discriminate between two proposed models accounting for the processivity of formin FH2 domains at filament barbed ends. Being able to determine which of two competing models is valid (as the reviewer says we do) does add a lot to what we already know.

      Major comments:

      1. Present study does not provide any new insight about the conformation of the actin dimer at the barbed ends of actin filaments when FH2 domains of formin are bound. This study appears to be more like an extension of previous research (Otomo et al. 2005, Nature), in which the authors used X-ray crystallography data to propose a model for actin filaments elongation by formin bound at the barbed ends.

      As mentioned above, we respectfully disagree with this remark. First, in Otomo et al. 2005, formins are arranged in a crystal into a non-physiological “daisy chain” arrangement around a non-canonical tetramethyl rhodamine-actin filament. Our observations were made in physiological conditions displaying a single formin dimer at the barbed end of a polymerizing filament. Second, the stair stepping model originating from Otomo et al. was only inferred and extrapolated from the crystal structure and not directly observed. Both the open and the closed conformations were speculations, that had never been observed up to now. In our current report we directly visualize these two conformations. Third, the observations of Otomo et al. were obtained using formin Bni1p from yeast, not the mammalian formin mDia1, for which there is little (PDB 1V9B) structural data available describing the structure of a truncated mDia1 in the absence of actin. Finally, in addition to validating the stair-stepping model experimentally, we make unexpected observations that are totally absent from the model derived from Otomo et al. and subsequent studies.

      The low resolution of structures is a major concern.

      As mentioned above, the limited resolution is the price we had to pay for being in physiological conditions, with formins interacting with the barbed ends of growing actin filaments. Nonetheless, this resolution is sufficient to discriminate between the two previously existing models, and to make new observations, beyond these models.

      Given the low resolution of data, how can the authors decide on the number (4) of classes of FH2 domain (in open state) and present them as "continuum of conformations". They stated "details featured in class 4 do not appear as sharp as in class 2". What was the basis of deciding on the sharpness level?

      We agree that this point was unclear, and we thank the reviewer for pointing it out. The choice of the number of sub-classes for the open state is a trade-off between the sharpness (ie signal-to-noise ratio) of the resulting image, which is a direct consequence of the number of particles within each sub-class, and the internal variability within each sub-class. Class 4 might appear more “blurry” because it gathers particles displaying a range of angles. When increasing the number of generated classes in the 2D processing, we observe angular variations of the FH2 domains intermediate to the ones displayed in Figure 3. However, because increasing the number of classes results in averaging less particles per class, the generated classes appeared more noisy or “blurry” and not as “sharp”, as mentioned in the manuscript. Hence, we chose the number of displayed classes so that the signal-to-noise would remain satisfactory and sufficient to be able to determine the relative angle between the two FH2 domains. To make things clearer, “do not appear as sharp” was replaced by “displayed a lower signal-to-noise ratio and thus looked noisier”. The expression “sharp” was replaced by “enough contrast”.

      The authors showed 30Å structure of FH2 domain encircling actin filaments towards their pointed ends, but said nothing about the kind of decoration it could be, a "daisy-chain" or "concentric circle"? Also, they did not mention anything about the orientation of actin helical twist and specific sites of binding. These information would provide new in-depth understanding of how formins binds while diffusing along the filaments.

      The quality is sufficient to distinguish isolated FH2 dimers along the core of actin.

      Accordingly, the FH2 dimers we observed along the core of our actin filaments adopt a conformation similar to that observed at the barbed end, as mentioned in the text (‘concentric circle’). This observation differs from the reported for INF2 which accumulated along filaments and may interact in a ‘daisy-chain‘ manner (Gurel et al, 2014 ; Sharma et al, 2014). From our data, we can thus assume that formins interact with F-actin along the core of filaments similarly to the way they do at the barbed ends, and might translocate in a two-step manner alongside the actin filament. As stated in the manuscript, the actin helical twist could not be deciphered. For docking the crystal structures within our EM envelope, we used the formin-actin contacts described previously in Otomo et al.

      The author stated - "The leading FH2 domain likely provides a first docking intermediate for actin monomers that would help their orientation relative to the barbed end, resulting in a higher actin monomer on-rate". This statement was made on the basis of observing 79% times FH2 in the open state in their data set. This seems like an overstatement because they don't have any direct structural data to support such claim.

      We agree with the reviewer that our statement, taken from the discussion section, is speculative, and we apologize if this was unclear. Our purpose was to propose a plausible mechanism, based on our structural data, since the FH2 domain stands in front of the barbed end in the “open conformation” and since it likely interacts with actin monomers. We have now rephrased our sentence to state more clearly that is a hypothetical mechanism : “We propose that… could provide…”.

      In the Discussion they mentioned "the FH2 dimer would then be "lagging" behind the elongating barbed end if actin twisting back to 180{degree sign} occurs before the addition of actin monomer and this explains the diffusing along the actin filaments". Did authors encounter filaments with two formins bounds to them in their negative stain images? What is their view on this? In current data, they showed structure in which only one FH2 dimer is bound to the pointed ends of actin filaments. Have they tried increasing the concentration of formins to obtain structures with more than one formin is bound towards the pointed ends of actin filaments?

      Following the recommendations from reviewer 2, we have performed an additional analysis and we now show typical examples of filaments observed with a formin along their core, including cases where two formins are observed on the same filament (Supplementary Figure 12). As we now explain in the discussion section, five different mechanisms (including lagging) can be invoked to explain how a formin can be located along the core of the filament. These five mechanisms can all account for the possibility to have more than one formin on the same filament.

      The lagging mechanism, however, is the only one where we would expect that the filaments with a formin along their core are less likely to also have a formin at their barbed end (because the formin at the core spontaneously departed the bare barbed, that was left bare and with a shorter time to load another formin before fixation of the sample). A simple statistical analysis of our data leads to the estimation that 48 ± 7% (n=50) of actin filaments with a formin within their core also display a formin at their barbed ends. This is significantly less than for the global filament population, where 77 ±0.4% (n=10,461) of barbed ends are decorated with formins. This supports the lagging scenario as a likely mechanism putting formins along the core of the filament.

      Regarding the specific suggestion to increase the formin concentration: We did screen different formin concentrations, but with higher concentrations the level of noise due to unbound formins was significantly increased in the image background and impeded a proper analysis. This is why we consistently used 100 nM formins.

      To increase the density of short filaments for sample preparation, the authors used additional actin binding proteins "shown in supplementary Figure 2.C". There is no supplementary Figure 2.C. Moreover, it would be nice if the concentrations of these proteins are mentioned in the text.

      We apologize for this mistake. Supplementary Figure 2.C has now been added and the protein concentrations have been added in the main text.

      Minor comments:

      1. Figure 1 legend needs editing. E is missing in the legend.

      Thanks for noticing this. We have added the missing legend for 1.E. 2. There is no supplementary Figure 2.C.

      We apologize for this mistake. We have now added supplementary Figure 2.C.

      It is recommended that the authors report the number of particle used during 2D and RELION 3D classifications in the figures. This would help in better understanding of the probability of the conformations mentioned in the text.

      It was mentioned in the text. We have now made this information clearer to the reader.

      Reviewer #2 (Significance (Required)):

      This is the first direct study showing the two (open and closed) conformations of mDia1 FH2 domain at the barbed ends of actin filaments using EM and cryoEM. The study supports the proposed molecular mechanism of FH2 processivity at the barbed ends during filaments elongation using stair-stepping model reported earlier (Otomo et al. 2005, Nature). For the first time, FH2 has been shown to fluctuate between various angles with respect to static actin filaments, and on this basis they propose a flapping model (Fig 5). They explained the whole mechanism using structural proof, but the low resolution of data raises a question about their quality sufficiency to propose this mechanism. The overall novelty of this manuscripts is insufficient for the publication in this journal. Audience having understanding of the actin and actin binding proteins will be interested in this study. Additionally, researcher from the field of structural biology (EM and CryoEM) will be interested. I have been working in the field of actin and actin binding proteins for past 4 years. Over 10 years' experience in protein biochemistry, structural biology and molecular biology.

      We do not fully understand why, on one hand, reviewer 2 indicates that “for the first time, FH2 has been shown to fluctuate between various angles…” and that “Audience having understanding of the actin and actin binding proteins will be interested in this study. Additionally, researcher from the field of structural biology (EM and CryoEM) will be interested.”. On another hand, reviewer 2 states that “The overall novelty of this manuscripts is insufficient for the publication in this journal.”, which seems contradictory with the above statements and comments.

      Regarding novelty, we insist on the fact that we have achieved for the first time the direct observation of FH2 formin domains at a resolution sufficient to discriminate between two distinct models at the barbed ends, as well as to observe the presence of formin mDia1 along the core of actin filaments in conditions where nobody has proposed that this could happen.

      In addition, we have not specified any specific journal within the possible ones from “review commons”, up to now.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this manuscript, Julien et al. use negative stain electron microscopy and cryo-EM to show two conformations of the FH2 domain for the formin mDia1 bound to the barbed end of an actin filament. These conformations support the "stair-stepping" model of FH2 domain movement with an elongating actin filament, as previously postulated by Otomo et al. (reference 1). The two states observe correspond to the "open" (~79%) and closed (~21%). The authors also show the conformational variability of the open state suggesting flexibility in this state. Finally, the authors observe FH2 domains encircling the actin filament at a distance from the barbed end, and suggest that the FH2 can diffuse from the barbed end down the filament.

      Major comments:

      1) Novel insights into formin function derived from this structure would raise impact. Issues that could be addressed include the following. Simply adding some lines to the discussion would not really add impact, but additional experimental/modeling work would.

      We agree that comparing the binding mode of different formins on actin filaments, testing the impact of profilin, and assaying FH2 domains in the absence of FH1, as proposed below, would provide a broad set of interesting additional data. However, without claiming that our results can be generalized to all formins in all conditions, we believe that our findings are novel and should be of interest to a large community. The proposed additional experiments/modeling represent an impressive amount of work, and will be carried out in future investigations. We answer these comments in more details below.

      1. Whether this model really holds true for all FH2 domains. Formin FH2 dimerization and processive filament barbed end elongation are widespread features of formins, which have been evidenced for many organisms from metazoan to plants. Since we could dock the FH2 from yeast formin Bni1p to account for mammalian mDia1, we think the FH2 domain conformations may be conserved enough among species to display similar translocation mechanisms at the barbed ends of actin filaments, using a two-state mechanism. We chose to use the crystal structure from Bni1 formin (PDB 1Y64) because this structure was obtained in the presence of an actin filaments and brings some insights about the formin-actin contacts.

      In order to convince reviewer 3, we superimposed the existing crystal structure of the FH2 mDia1 domain (PDB: 1V9D) with our model and reconstruction and show (Supplementary Figure 12) that the differences are minor. The mDia1 FH2 domains (atomic structures in red, PDB : 1V9D) are aligned with Bni1p FH2 domains (atomic structures in green and blue, PDB : 1Y64) previously fitted into the electron microscopy envelope of a barbed end capped by a formin in the « open state ». The FH2 domains are well aligned with a slight discrepancy in the knob/actin contact regions (blue arrows). This discrepancy most likely results from the absence of actin partners in the crystals obtained with mDia1 FH2 domains. The Bni1p structure thereby most accurately represents the knob/actin contact region. In addition, the folding of the lasso domain around the post domain is resolved in the Bni1p structure. Note here that the Bni1p lasso domains wrap equally well around the Bni1p post domain and the mDia1 post domain (green arrows).

      1. Whether the % time spent in the open and closed states might dictate the vastly different elongation rates mediated by different formins. For example, mDia1 is considered one of the 'faster' elongators (equivalent to actin alone in the absence of profilin), while fission yeast Cdc12 essentially caps filaments in the absence of profilin. We have discussed this aspect thoroughly in the discussion section to conclude that:” Our direct assessment of the open state occupancy rate thus provides important information on the molecular nature of the formin-barbed end conformations which could not be directly inferred from kinetic measurements, with or without mechanical tension, so far. Considering a gating factor of 0.9 and considering that formin mDia1 spends 79% of the time in the open state, we can compute that the on-rate for monomers would be slightly higher (14% higher) for an mDia1-bearing barbed end in the open state, than for a bare barbed end.”

      We agree that repeating our set of EM experiments and analysis with other formins, like fission yeast Cdc12, would be interesting. However, this would take a long time, and falls out of the scope of our paper.

      1. Whether the % time spent in the open and closed states varies if filaments are actively elongating in the presence or absence of profilin. We have chosen not to include profilin in our experiments, and to limit the concentration of G-actin, in order to reduce the background in our EM micrographs. Also, a rapid filament elongation would increase the amount of F-actin per barbed end, while a dense population of short filaments is key to obtain accurate data (as we explain in the discussion, paragraph 1, p9).

      We speculate that, by providing a link between the FH1 domains and the filament barbed end, profilin might very well alter the percentage of time spent in the open state, and mitigate lagging as mentioned in the discussion section. Properly addressing the impact of profilin with our EM experiments is very challenging, for the reasons we have explained. It would require further investigations, beyond the scope of this study.

      1. How this model impacts the interactions of formins with other proteins at the barbed end. For example, capping proteins. We did not include capping proteins (or other additional proteins) because we wanted to avoid increasing the number of particles from diverse nature per field of view, as they constitute a background that is detrimental for the analysis of EM micrographs. We would have add to sort out additional populations in the course of image analysis. We thus only mixed actin and formin in our assays.

      2. Do these results relate to formin function in disease? Because formins regulate actin polymerization, their malfunction is linked to a variety of diseases. We therefore expect our findings to be useful to researchers in the medical field. However, our study remains in the scope of basic research and primarily aims at understanding the mechanisms of formin-assisted actin polymerization.

      2) The observation that formin FH2 domains can bind filament sides has been made several times. In particular, a structural model of the FH2 domain of the INF2 formin along the side of an actin filament (Gurel et al 2014, PMID 24915113). This publication also references other papers showing other formins binding to filament sides. There are two points to this comment:

      1. The model in Gurel et al is that the FH2 domain does not slide down the filament from the barbed end. Rather, the FH2 dimer has an appreciable dissociation rate, enabling it to encircle the filament without having to slide. This FH2 dissociation has been observed for another formin that has been shown to bind filament sides, FMNL1 (called FRL1 in the listed publication), in Harris et al 2006 (PMID 16556604). The authors must explain their reasoning for thinking that mDia1's FH2 can slide down the filament from the barbed end. One possibility is to make observations of this FH2 population in filaments that were not sonicated. What is the average distance of FH2s from the barbed end? We thank the reviewer for pointing our attention to this report from Gurel et al. which we now cite. Following this comment, as well as point 6 of reviewer 2, we now discuss the different mechanisms that could lead to our observation of mDia1 along the core of the filament. We provide a new analysis of our data (discussion section), arguing in favor of the lagging mechanism (i.e. ‘sliding down’ from the barbed end), without excluding the competing scenarios. Briefly, we compute that 48 ± 7% (n=50) of actin filaments with a formin within their core also display a formin at their barbed ends. This is significantly less than for the global filament population, where 77 ±0.4% (n=10,461) of barbed ends are decorated with formins. This supports the lagging scenario, which is the only one where a filament with a formin along its core should be less likely to also have a formin at its barbed end.

      The distance of FH2s from the barbed end would provide additional information. However, it is difficult to estimate, since we often to not see the entire filament, and since we do not know which end is the barbed end.

      1. Interestingly, in some of the works studying formin binding to filament sides, mDia1 was shown to be rather poor in this property. It would be useful to get an idea of what % of the observed FH2s are in the filament core, as opposed to at the barbed end. Along with the additional analysis mentioned in the previous point, we have now estimated that about 8% of actin filaments display a formin within their core. We have added this number in the manuscript (end of the Results section). As a comparison, in our assays, 77% of filament barbed ends bear a formin.

      2. The authors must reference the past works showing FH2 binding to filament sides, particularly the structural work. At present, no mention of prior work on FH2 side binding is mentioned. As advised, we have now added additional references and more particularly Gurel et al, 2014.

      3) My major technical concern in this manuscript is that the authors use the FH1-FH2-DAD domain of mDia1 for the imaging, but use FH2 structure of Bni1p for 3D characterization (Otomo et al.). Even though Bni1p has been used for functional and structural analysis, mDia1 and Bni1p FH2 domains share low sequence homology. In addition, mDia1 only partially complements loss of Bni1 function in vivo (Moseley et al., 2004 PMID 14657240). Can the authors use the partial structural information of the mDia1 FH2 from Shimada et al 2004 (PDB 1V9D, PMID 14992721)? Alternately, the authors could have used FH2 domain of Bni1p for imaging. At the very least, the authors should explain clearly why they used different proteins for imaging and modeling.

      As mentioned above (please see our response to point 1.a), we chose to use the crystal structure from formin Bni1 (PDB 1Y64) because this structure was obtained in the presence of an actin monomers, and it thus brings some insights about the formin-actin contacts. The existing structures obtained from formin mDia1 does not include actin (full length by EM: Maiti et al, 2012; crystal structure of subdomains (without FH1): Otomo et al., 2010 PLoS one). It thus seems relevant, in the context of our investigations, to use a structure where formin-actin contacts could be at least partially inferred.

      Further, we superimposed the existing crystal structure of the FH2 mDia1 domain (PDB: 1V9D) with our model and reconstruction and show that the differences are minor (please see the figure in our response to point 1.a, above).

      4) The open and closed states are observed from negative staining data. However, the authors can only find one of the states (open) by cryo-EM, which decreases the confidence level of the paper's conclusions. It would be useful for the authors do a little more to try to find the closed conformation by cryo-EM.

      Using Cryo-EM we can already recover the most abundant open conformation.

      Unfortunately, as pointed out here, the number of particles obtained was too low to enable high resolution and reveal the two observed conformations. Indeed, considering a density of ~ 5 barbed ends par micrograph, the collection of tens of thousands of images would have been necessary, which was not realistic regarding the access we have to latest generation microscopes.

      5) It is unclear whether there are additional effects of using FH1-FH2-DAD protein (not FH2 only) for the imaging, as it shows long protrusion at the tip of actin barbed end. To avoid those concerns the authors could use only FH2 domain of mDia1. Also the authors have to note that they used Bni1p structure because there are no published structures of mDia1 so far.

      We had indeed tried to use a construct deprived of the flexible FH1 domain but the lower purity of this construct and the presence of aggregates led to the collection of lower quality EM micrographs. As profilin was not included in our assay, FH1 domains were not involved in actin polymerization at the barbed end and thus remain very flexible and unstructured. Consistently, we did not detect any additional electronic density that could result from the FH1 domains.

      We indeed point out (p5) that “We used the crystal structure from yeast Bni1p FH2 domains in interactions with an actin filament, rather than the existing one from mammalian mDia1 formin FH2 dimer in isolation (PDB 1V9D), because actin-formin contacts are described in the Bni1p structure.” Minor comments:

      1) Figure 1: It would be interesting if imaging is provided for mDia1 bound to filaments which it has nucleated. Would it be possible that binding to pre-formed filaments is different to that for mDia1-nucleated filaments?

      This is a good suggestion for further investigations but it extends beyond the scope of this study: as we explain, our attempts to nucleate filaments from mDia1 lead to lower quality micrographs, and the sonication of preformed filaments was our best option. However, we do not expect the translocation mechanism of FH2 to differ, as a function of the nucleation history of the filament, since the formin interacts with a filament whose elongation it has assisted over several subunits.

      2) Supplementary figure 2: Numbers of things in the S2 is unclear and poorly described in both results and methods. In particular, figure S2A, the definitions of the black and gray lines (steady state actin) is not clear. Are they containing 5% pyrene actin? Is that actin in polymerization buffer or in monomer-actin buffer? Is that actin incubated with actin polymerization buffer for a certain time before measurement of fluorescent intensity? In figure S2B, how the authors calculate the monomer actin concentration? The authors should provide the information in either results or methods part.

      We apologize for the lack of information. Since this is a standard assay, we have now added more details in the Methods section (rather than in the Results section).

      All curves shown in figure S2 were obtained with 5% pyrene actin. The gray curve shows the pyrene fluorescence intensity baseline from 1 µM G-actin monomers, obtained in G-buffer. The black curve is the fluorescence intensity at steady-state of 1 µM actin in polymerizing conditions, (after 1 hour of incubation at room temperature, at 5 µM, the sample was diluted without sonication and left for another hour before measuring the fluorescence intensity).

      The monomeric actin concentrations shown in figure S2B are derived from the intensity level of pyrene at any time point during the experiment, using the simple equations we now present in the Methods section.

      3) Supplementary figure 2 C: The figure and legend are missing in the manuscript. Furthermore, the authors describe that they used Gc-globulin to sequester monomeric actin in solution. Is gc-globulin widely used for actin monomer sequestration?

      Thank you for noticing the missing panel which is now back in place. Indeed, Gc globulin is known to sequester G-actin (Van Baelen, H., R. Bouillon, and P. DeMoor. 1980. “Vitamin D binding protein (Gc-globulin) binds actin”. J. Biol. Chem. 255:2270-2272). This is why we have attempted to use it. We could see a slight effect but we did not want to increase the noise within our images with additional proteins that would have made the analysis more complicated.

      CROSS-CONSULTATION COMMENTS Reviewer #1 mentions that the authors identify formin densities bound along the actin filament for the first time. I agree that the imaging of the mDia1 along the actin filament using electron microscopy is novel, but the concept of formin binding has already been found and studied well with other formins (PMID 16556604, PMID 24915113) and even mDia1 has poor binding activity compared to other formins. It was really nice of the authors to show the mDia1 side filament binding, but I don't think it is a striking finding.

      I have no comment for Reviewer #2.

      Reviewer #3 (Significance (Required)):

      If the EM refinements and 3D rendering techniques are conducted rigorously (which this reviewer is unable to judge), the data support an existing theory of how FH2 domains interact with the actin barbed end. Overall, the data will be of interest in formin field. However, as written the paper confirms an existing model, and does not represent new insight. Impact would be raised by providing insights from these findings that impact formin function or disease.

      We have answered this concern above. The existing models were speculative and not based on direct observations. They relied on data obtained in non-physiological conditions.

      Here, we directly observe two distinct conformations in our structural data, and clearly validate one model over the other. This provides a major advancement in our understanding of formin interaction with actin filaments. In addition, we uncovered an unexpected behavior of formin mDia1, which can readily be found along the core of the filament without the aid of additional proteins, and we propose a mechanism based on our data to account for this observation.

      Another main point is that the observation of FH2 domains bound along an actin filament, while interesting, is not novel. Others have found this for other formins, but those papers are not referenced here.

      The direct binding of formins to the sides of actin filaments is thought to be specific to some particular formins (we now cite additional references in our manuscript, to discuss this point). Formin mDia1, which is a ubiquitous and widely studied mammalian formin (perhaps the most studied), has only been described to diffuse along actin filaments when a capping protein dislodges it from the barbed end (Bombardier et al. Nat Com 2015). Here, we show that formin mDia1 can be found encircling the core of actin filaments, in the absence of any capping protein. This behavior is novel and unexpected. It should open new avenues for research on formin mDia1, as well as on other formins.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): ____ *A significant criticism of the paper is an assumption that readers will be familiar with all of the findings in the author's previous 2016 paper and the PGL-1 papers by Aoki et al. Minimal context is given for each approach. *

      To address this concern, we have added a paragraph in the Introduction section of the revised manuscript.

      *Some conclusions are not well supported and require further analysis, proper controls, and more extensive descriptions of the experiments performed. *

      We have addressed the reviewer’s concerns as detailed below.

      Most importantly, the central conclusion and title of the paper is that composition can buffer the dynamics of individual proteins within liquid-like condensates. In other words, in vitro condensation assays often do not recapitulate LLPS behavior in vivo. That said, the findings in this study would be significantly strengthened and complemented by observing endogenously tagged PGL-3 and PGL-3 mutants in living worms, considering the efficiency of using CRISPR in C. elegans to insert tags and make precise mutations.

      The original manuscript already contained data where we microinjected wild-type PGL-3 and mutant PGL-3 proteins (recombinantly purified) into adult C. elegans gonads to assay how the P granule phase supports diffusion of these proteins.

      In the revised version, we now include additional data which shows “dynamics buffering” in transgenic worms generated using CRISPR/Cas9 technology. Briefly, we used CRISPR/Cas9 to generate transgenic C. elegans which expresses PGL-3-mEGFP or PGL-3(D425-452)-mEGFP from the native pgl-3 locus. In vitro, wild-type PGL-3-mEGFP protein generates liquid-like condensates. On the other hand, the recombinantly purified PGL-3(D425-452)-mEGFP protein generates condensates that are non-dynamic. In contrast to these observations in vitro, both wild-type PGL-3-mEGFP and PGL-3(D425-452)-mEGFP show similar dynamics (half-time of FRAP recovery) within P granules in vivo.

      *To improve readability, the introduction to P granules should be expanded, and include the reasons for looking at the nematode-specific PGL-3 protein among all the other known P granule proteins. A recap of previous findings on PGL-3 phase separation, in vivo and in vitro, is warranted, starting with the significant results of Saha et al 2016. Setting up the investigative questions in the context of recent work on PGL-1 (Aoki, et al) is also necessary. *

      To address this concern, we have added a paragraph in the Introduction section of the revised manuscript.

      The physiological concentration of PGL-3 should be more transparent, including why some experiments in this study are done at physiological concentrations while others are not. Describing why salt concentrations, crowding agents, and protein abundance are similar or different for each experiment is necessary and relevant. For example, after showing in Figure 1 that PGL-3 protein phase separates, the paragraph starting on line 161 says that it was previously shown that PGL-3 doesn't phase separate at physiological concentrations without RNA. One has to go back to Figure 1 to realize it was done differently than Figure 2 and Saha 2016.

      The concentrations of PGL-3 protein and use of crowding agents (if any) have already been specified within figures or figure legends. Salt concentrations used are specified within figure legends or materials and methods section.

      We have added the following paragraph to the materials and methods section of the revised manuscript.

      “Saha et al. 2016 showed that at physiological concentrations (approx. 1 mM), the PGL-3 protein is unable to phase separate into condensates. At these concentrations, mRNA promotes phase separation of PGL-3. To assay for mRNA-dependence of condensate assembly, it is therefore essential to use physiological concentrations of the PGL-3 protein or mutants (e.g. Figure 2). However, these condensates are generally too small to assay rate of internal rearrangement of PGL-3 molecules within condensates using fluorescence recovery after photobleaching experiments. Therefore, to generate large condensates for measuring internal rearrangement of PGL-3 or mutant molecules, we primarily used higher concentrations of these proteins where binding to RNA is not essential for phase separation. However, to mimic the in vivo P granule phase as closely as possible, we generally added constituent proteins in proportion to their in vivo abundance estimated in Saha et al. 2016.”

      The added paragraph in the Introduction section of the revised manuscript may be helpful to the readers. * *

      *Statements in the same paragraph like "in contrast to full-length PGL-3, mRNA does not support phase separation..." should be qualified by stating the concentration observed, with or without salts or other crowding agents. Similarly, line 230 "suggests that interactions involving the disordered C-terminal region of PGL-3 are not essential for the fast dynamics" and should be qualified with "at non-physiological concentrations and with XX crowding agents or salt concentration." It would be more consistent if physiological concentrations were consistent from figure to figure, as extra variables weaken some of the stated conclusions. *

      We thank the reviewer for this suggestion. However, we feel the statements (without full experimental details within main text) help convey the conceptual essence of the findings better. Of course, all these statements contain reference to figures or prior publications which provide relevant details about experimental conditions.

      *The 2010 review reference stating that there are 40 P granule enriched proteins is outdated. More recent reviews put the number much higher. This is relevant because the approach to put PGL-3 in a more physiological environment by including just PGL-1, GLH-1 and mRNA with the condensate assays, out of ~100 P granule enriched proteins, may not be sufficient to conclude "that the influence of complex composition on dynamics is modest" (line 223), or imply that the multicomponent nature of the P granule is reconstituted by adding these components (line 355). *

      We revised the text to indicate that P granules contain approx. 70 proteins and added appropriate references.

      • *

      Based on current information of constitutive P granule components (PGL-1, PGL-3, GLH-1, GLH-2, GLH-3, GLH-4, DEPS-1, MIP-1 and mRNA), (Kawasaki et al, 1998, 2004; Spike et al, 2008a, 2008b; Price et al, 2021; Cipriani et al, 2021; Phillips & Updike, 2022) we reconstituted P granule-like phase in vitro with mRNA, PGL- and GLH- proteins that likely constitute the most abundant components within P granules in vivo (based on concentration estimates in Saha et al. 2016).

      We do appreciate the reviewer’s comment that more components can be added to our in vitro reconstitution in addition to the limited set of components used in our study. However, we feel it is interesting to observe that a limited set of components can support dynamics buffering (the main message of the paper). Further, the complementary in vivo experiments show that the P granule phase can also support dynamics buffering.

      *Figure 1C needs to include PGL-3(370-693) in the analysis. Figure 1E is also incomplete without a comparison of FRAP recovery between PGL-3(1-452) and full PGL-3 as the control.

      *

      Fig. 1c already includes data with PGL-3 (370-693) [top row, central panel]. FRAP recovery data with full-length PGL-3 is already available in Supplementary Fig. 2c, g.

      *Figure 4C is missing an essential control where PGL-3 and S1 FRAP is performed without PGL-1, GLH-1, and mRNA. *

      In the revised version, we have added Supplementary Fig. 5f, where FRAP recovery of the following condensates are plotted together: 1) PGL-3 alone, 2) S1 alone, 3) PGL-3 + PGL-1, GLH-1 and mRNA, 4) S1 + PGL-1, GLH-1 and mRNA.

      *It would also help show sup Fig4A in the main figure to show concentration dependence. *

      We revised Fig. 4 to address the reviewer’s suggestion.

      Consider adding subtitles to supplementary figures.

      We considered the suggestion but felt it may not be essential.

      *M&M should include an explanation for statistical analysis *

      We added a paragraph describing statistical analysis within the Materials and Methods section.

      *CROSS-CONSULTATION COMMENTS I am also in agreement with the comments and critiques of reviewers 2 and 3.

      * Reviewer #1 (Significance (Required)): The paper by Saha and colleagues investigate the in vitro liquid-liquid phase separation propensity of a P granule protein PGL-3 and its structural domains. The findings largely replicate and support the phase-separation properties of a paralogous protein called PGL-1, as recently described by Aoki et al. 2021. Furthermore, they show that the dynamics demonstrated by recombinant PGL-3 may be maintained or buffered by the complex composition of P granules.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      *Jelenic et al. describe the effect of partner proteins on the FRAP dynamics of recombinant PGL-3 protein and variants in in vitro condensates and C elegans p-granules. The study shows that the N terminal a-helical dimerization domains is required for condensate formation and modulate of it alters aggregation and the FRAP dynamics of its condensates. Interestingly, a construct including the entire IDR region (370-693) by itself does not phase separate on its own at these conditions. The K126E K129E mutant (known previously to disrupt dimerization) and the deletion mutant abrogate llps. A mutant construct that shuffles the sequence in the region 423-453 called S1 here reduces the helicity and the condensate FRAP dynamics but recovered in the presence of a few P granule components. Also, the reduced dynamics of partially unfolded PGL-3 condensates are also rescued by the p-granule components to a certain degree of the unfolded PGL3 concentrations. This threshold concentration for recovering the condensate dynamics is further reduced in the helix reducing S1 mutant, which is also dependent on the number and the nature of P granule components.

      Overall, the study aims to probe how "composition can buffer protein dynamics within liquid-like condensates" - yet several underlying aspects of the study do not fully support that conclusion. The introduction does not sufficiently introduce the known structural information of the two dimerization domains in C elegans PGL proteins for which structures are known. The region is discussed as "alpha helical" but really there are two evolutionarily conserved independently folding dimerization domains (referring to the mutants as "reduced alpha helicity" is not helpful - these are mutations that destabilize a folded domain).*

      To address this concern, we have added a paragraph in the Introduction section of the revised manuscript.

      *Additionally, the abstract and introduction ignore the aspects of aggregation (touched on in discussion) - this is likely what the disruption to the helical region in residue 450 region is doing (the helix is not on the dimer interface based on homology / sequence identity to the crystal structure of PGL-1 central dimerization domain. *

      We think elucidating the molecular mechanism of apparent aggregation of PGL-3 (D425-452) could be an interesting direction for future investigation. Here, we focused our analysis predominantly on the mutant S1 since it generates liquid-like condensates with ~20- fold slower dynamics (compared to wild-type) in contrast to non-dynamic condensates/aggregates. Therefore, influence of other P granule components on the dynamics of PGL-3 in liquid-like condensates is easier to address using the mutant S1 rather than PGL-3 (D425-452). We didn’t find evidence that S1 aggregates as we did not detect aggregates of S1 molecules using fluorescence confocal microscopy and the slow dynamics in condensates of S1 does not change significantly over 24 h (Supplementary Fig. 3f).

      However, in the revised version, we now include additional in vivo data with C. elegans expressing the aggregation-prone PGL-3 (D425-452)-mEGFP. Briefly, we used CRISPR/Cas9 to generate transgenic C. elegans which expresses PGL-3-mEGFP or PGL-3(D425-452)-mEGFP from the native pgl-3 locus. In vitro, wild-type PGL-3-mEGFP protein generates liquid-like condensates. On the other hand, the recombinantly purified PGL-3(D425-452)-mEGFP protein generates condensates that are non-dynamic. In contrast to these observations in vitro, both wild-type PGL-3-mEGFP and PGL-3(D425-452)-mEGFP show similar dynamics (half-time of FRAP recovery) within P granules in vivo.

      Finally, the "dynamics buffering" is not really clearly established and could also be explained as small concentrations of aggregated proteins act like clients while increasing the concentration results in aggregation and "cross linking" in the entire droplet - and this concentration is never achieved in the in worm experiments so it is not clear. In other words, the change in FRAP dynamics not observed in worms is perhaps not surprising if small amount of recombinant proteins are incorporated into the granules. *

      *

      Data with the S1 mutant establishes that dynamics buffering can be observed in condensates with different sets of additives both in vitro (Fig. 5a, b) and in vivo (Fig. 4a, b). Further, data with condensates of S1 containing the additives PGL-3 (K126E K129E) or S1 (K126E K129E) demonstrate that dynamics (half-time of FRAP recovery) within S1 condensates, and in turn “dynamics buffering” depend on inter-molecular interactions. With respect to the hypothesis proposed by the reviewer, we did not detect aggregates within S1 condensates using confocal fluorescence microscopy.

      In contrast to S1 condensates, condensates containing partially unfolded PGL-3-mEGFP together with PGL-1, GLH-1 and mRNA showed spatial inhomogeneities in fluorescence signal throughout the condensate (Fig. 4g). We have not tested if areas with higher fluorescence signal represent aggregates. It is a possibility that the partially unfolded PGL-3-mEGFP fluorescence signal becomes more homogeneous if higher concentrations of additives (PGL-1, GLH-1 and mRNA) are used. However, the presented data demonstrate the significant effect of the P granule components (PGL-1, GLH-1 and mRNA) on the FRAP recovery rate of partially unfolded PGL-3-mEGFP in condensates (compare figures Fig. 3e and Fig. 4g).

      However, consistent with dynamics buffering, the P granule phase in vivo supports wild-type dynamics of different PGL-3 constructs over a range of concentrations - PGL-3(D425-452)-mEGFP at physiological concentration (CRISPR transgenic strain, Fig. 4e) or at higher concentrations (microinjected S1 and partially unfolded PGL-3-mEGFP, Fig. 4b).

      • *

      *It is also not clear what the mechanism of the changes is - is the protein driven to fold more properly (despite S1 disruption of its conserved sequence) inside the condensate? Does it still self interact and act as a dimerization domain? Does this change disrupt interactions? *

      We agree with the reviewer that identifying the precise structural changes of the S1 protein within the condensate vs. dilute phase could be an interesting direction for future investigation. However, we have already discussed the issues raised by the reviewer in the original manuscript.

      “Our data is consistent with the model that other regions of S1 molecules cooperate with residues 425-452 (shuffled) to generate stronger inter-molecular interactions. For instance, addition of the mutant S1 (K126E K129E) enhances dynamics of S1 within condensates in contrast to maintaining the slower dynamics observed within condensates of S1 alone. This suggests that the interactions disrupted by the mutations K126E and K129E also contribute to slow S1 dynamics. One possibility is that interactions involving the residues K126 and K129 favor S1 conformations that enhance 425-452 (shuffled)-dependent interactions. Indeed, the mutations K126E K129E have been reported to interfere with interactions among N-termini of PGL-3 molecules (Aoki et al, 2021). While two self-association domains within the α-helical N-terminus of PGL-3 have been mapped (Aoki et al, 2021, 2016), structural insights into those associations are limited. However, PGL-3 shares significant sequence similarity with another protein PGL-1. Crystal structures are available for fragments of the PGL-1 protein that show the two self-association domains at the N-terminus are predominantly α-helical and globular in nature (Aoki et al, 2016, 2021). Therefore, one possibility is that shuffling the sequence 425-452 of PGL-3 or heat-induced unfolding of PGL-3 exposes hydrophobic residues that become available to participate in inter-molecular interactions.”

      What is the real mechanism by which PGL-3 phase separates if not via the disordered domains? *

      *

      We agree with the reviewer that elucidating the detailed mechanism of phase separation of PGL-3 is an interesting direction for future investigation. However, we feel this is not required to support the main message of this manuscript.

      Throughout the manuscript, the term "dynamics" is used to indicate FRAP, but it would be better to define what is meant (diffusion of PGL-3 in condensates) instead of using dynamics a term that could mean many things. Secondly, FRAP cannot directly measure liquidity etc (see recent critiques by McSwiggen elife 2019, etc) so it is better to be cautious in the claims. Finally, discussing "dyanmics buffering" adds more terminology where it is not needed - perhaps say "changes to diffusion of PGL-3 in condensates".

      We feel it is useful to introduce a term that describes our observation. To our knowledge, our observation is novel and therefore requires a new term to describe it.

      However, we do appreciate the concern raised by the reviewer. We used a more generic term “dynamics buffering” in contrast to the more specific “diffusion buffering” since we did not directly estimate diffusion behavior at the ‘single-molecule’ level. However, we already described what we mean by “dynamics buffering” in the text as follows.

      “We used condensates of similar size for our analysis (average ± 1 SD of diameter of condensates are 6.4 ± 1.7 mm (Fig. 5a) and 5.9 ± 0.4 mm (Fig. 5b)). Therefore, dynamics buffering here is likely to represent similar diffusion rates of S1 within condensates.”

      • *

      *The "N-terminus" is not 65% of the protein. One could define this as the N-terminal domain, but again there are two clear folded domains in the first 65% of the protein and this needs to be described better. *

      We revised the text to replace the terms “N-terminus” and “N-terminal domain” to “N-terminal fragment”.

      *The description of "stickers" and the references to tau and hnRNPA1 are confusing as this is a predominantly ordered domain while those are IDRs. *

      • *

      We feel this is important as it aids discussing our work in the context of current literature describing the mechanisms of macromolecular phase separation.

      The suggestion in the discussion that "P granule components support dynamics by participating in intermolecular interactions wth PGL-3-mEGFP molecules" is not well supported because no interaction assays are performed and no mutaitons are made that disrupt these interactions to test this.

      Indeed, we have not conducted interaction assays or mutational analysis to directly test this. However, our detailed analysis with the S1 mutant supports this suggestion.

      While partially unfolded PGL-3-mEGFP molecules lose 30% of a-helicity, the a-helicity of the S1 mutant is reduced by 15% compared to wild-type PGL-3. Data with S1 and partially unfolded PGL-3-mEGFP molecules show that loss of a-helicity correlates with slower diffusion of protein molecules within condensates. Using the mutants PGL-3 (K126E K129E) and S1 (K126E K129E), we show that diffusion rate of S1 molecules within condensates depend on inter-molecular interactions, and presence of other P granule components support faster diffusion rate of S1 molecules within condensates. Therefore, we feel it is safe to speculate that intermolecular interactions with P granule components can support dynamics of a “more unfolded” (compared to S1) version of PGL-3 molecule. * *

      *More detailed analysis of some of the claims: Claim 1: An a-helical region mediates the phase separation of PGL-3, and the C-terminal disordered region by itself does not phase separate. The N-terminal dimerization is essential for LLPS. The C-terminal IDR interactions with mRNA facilitate the LLPS. Comments: The authors show sufficient experimental data using microscopy and FRAP on truncated constructs with the N-terminal and C-terminal regions - but see above regarding how these are described - a proper domain structure with the folded domains shown and the RGG motifs highlighted should be added and integrated throughout the discussion. *

      In the revised version of the manuscript, we described the predicted PGL-3 domains within a paragraph in the introduction: “The interactions that support phase separation of the PGL-3 protein remains unclear. Structural studies on the orthologous PGL-1 protein revealed two dimerization domains. This raises the possibility that PGL-3 also contains similar dimerization domains, and phase separation depends on interactions involving these domains.”

      Our Fig. 1a already includes the schematic representation of PGL-3 with predicted N-terminal and Central Dimerization domains and RGG repeats.

      *They show that the N-terminus is necessary and adequate for LLPS, and the C-terminus by itself does not phase separate. But, how does the N-terminal domains phase separate? This is not explained - what are the interactions? *

      • *

      Also, a di-mutant (K126E K129E) that is known, and also authors use SEC-MALS to show their N-terminal construct is consistent with the published results. Disrupting the n-terminal dimerization prevents phase separation, suggesting the importance of these residues in the N-terminus for self-assembly and LLPS. The Microscopy data backs the claim that the mRNA-mediated LLPS is facilitated by binding with C-terminus. However, the m-RNA binding to IDR is not sufficient for LLPS. Yet, the authors do not explain how higher salt prevents phase separation - again the mechanism of phase separation is unclear. Is it multivalent interaction of the two dimerization domains? A basic model (that is tested) would be important.

      We agree with the reviewer that elucidating the detailed mechanism of phase separation of PGL-3 is an interesting direction for future investigation. However, we feel this is not required to support the main message of this manuscript.

      However, our manuscript already provides some relevant insights as follows.

      “To investigate the underlying mechanism further, we began by testing if the N-terminal α-helical region of PGL-3 can self-associate. Our analysis using size exclusion chromatography followed by multi-angle light scattering (SEC-MALS) showed that this PGL-3 fragment 1-452 forms a dimer (Supplementary Fig. 2f). Mutation of two residues (K126E K129E) have been shown to interfere with interactions among the N-termini of PGL-3 molecules (Aoki et al, 2021). We mutated these two residues within the full-length PGL-3 protein (K126E K129E) (Fig. 1a) and found that this mutant PGL-3 (K126E K129E) protein cannot phase separate even at high protein concentrations up to ~130 µM (Fig. 1b, c). Addition of mRNA does not trigger phase separation of this protein at physiological concentrations either (Fig. 2a, b). Taken together, our data is consistent with a model where association among folded N-termini of PGL-3 molecules is essential for phase separation.”

      A likely possibility is that phase separation of PGL-3 depends on electrostatic inter-molecular interactions among the folded N-terminal fragment of PGL-3 molecules. Therefore, high salt prevents phase separation.

      Are the tags removed to ensure that phase separation is not caused by tags or remaining linker regions? Is the protein purified to be without nucleic acid contamination or other purity metrics?

      Most of the experiments were done with only 5% of total protein tagged with 6x-His-mEGFP. No additional tags were present on the constructs. For recombinant expression and purification, proteins were cloned such that it is possible to remove the 6xHis-mEGFP tag following treatment with TEV protease. Following removal of the 6xHis-mEGFP tag, the residual linker is just two amino acid residues long. We used 100% tagged-protein for our experiments only in very few cases (indicated in the figure legends).

      To demonstrate purity of recombinant proteins, SDS-PAGE gels with all protein constructs used in this study are shown in Supplementary Fig. 1.

      To minimize contamination of nucleic acids, we treated samples with Benzonase during the course of purification.

      To assess the extent of nucleic acid contamination, the ratio of absorbance at 260 nm and 280 nm (A260/A280) was monitored. In exceptional cases with high A260/A280 values, we analyzed samples further by purifying RNA from the sample using RNA purification kit (Qiagen) and found that RNA represented 1% or less of the sample mass.* *

      Claim2: The N-terminal a-helical region modulates the dynamics within condensates. The IDR region has minimal effect on the fast dynamics of PGL-3. Comments: The authors show that the full-length PGL-3 condensates have modest influence of components by comparing the FRAP half times with or without the P granule components, including mRNA. However, have the authors tried this in the presence of mRNAs for the constructs lacking the IDRs as they have several RGG domains and bind with mRNA and are likely to change the dynamics.

      We thank the reviewer for this suggestion. However, this experiment is not essential to support the claim made in the context of homotypic condensates of PGL-3 : “The N-terminal a-helical region modulates the dynamics within condensates. The IDR region has minimal effect on the fast dynamics of PGL-3.”

      *The authors report the importance of the N-terminal a-helical region by making a construct that lacks/disrupts a part of the helices lowers the thermal stability and significantly lowers the dynamics of the condensates. Also unfolding of helices is shown to reduce the dynamics. One primary concern is whether these "rescued" protein dynamics imply protein functionality. *

      An assay of “functionality” e.g. an enzymatic activity of the PGL-3 protein is not available.

      However, we compared the fecundity of C. elegans worms expressing from the native pgl-3 locus, PGL-3-mEGFP or the mutant protein PGL-3(D425-452)-mEGFP, to assay the functionality of P granules in these strains. We found that worms of both genotypes produced similar number of offspring (Fig. 4d). This suggests that deletion of residues 425-452 of PGL-3 does not result in significant loss of function of P granules.

      Are these semi denatured proteins refolded in the presence of P-granule components?

      We feel that identifying the precise structural changes of the semi-denatured PGL-3 proteins within the condensate vs. dilute phase could be an interesting direction for future investigation.

      Finally, it is not clear why the authors chose to disrupt folding of the central dimerization domain?

      The manuscript included a paragraph to describe the rationale.

      “This suggests that interactions involving the disordered C-terminal region of PGL-3 are not essential for the fast dynamics within condensates. Therefore, we addressed the role of the N-terminal α-helical region (1-452) in driving dynamics. In order to avoid engineering mutations that result in significant misfolding of PGL-3 and concomitant loss of its ability to phase separate, we focused our mutational analysis close to the junction of the folded N-terminus and the disordered C-terminus of PGL-3. Surprisingly, we found that a full-length PGL-3 construct (D425-452) that lacks only 27 residues phase separates into condensates that are non-dynamic (Fig. 3a, c). Sequence analysis of the PGL-3 protein predicts that this region 425-452 spans two α-helices (one complete helix and fraction of a second helix) (Supplementary Fig. 3d). We generated a PGL-3 construct (hereafter called ‘S1’) (Fig. 3a) in which the sequence in the region, 425-452, is shuffled while keeping the overall amino acid composition unchanged. We found that S1 phase separates into condensates that are 20- fold less dynamic than with wild-type PGL-3 (Fig. 3d, Supplementary Fig. 3c).”

      Saying that "reduced alpha-helicity of PGL-3 correlates with slower dynamics in condensates" may be factual in these assays but "correlation" should be expanded upon to include mechanism and to me it seems that the statement should read "aggregation of PGL-3 causes slower dynamics in condensates" (both the partially destabilized mutant and the fully unfolded WT show similar effects perhaps to different degrees).

      We feel that identifying the precise structural changes of the semi-denatured PGL-3 proteins within the condensate vs. dilute phase could be an interesting direction for future investigation.

      We did not use the term "aggregation" since we did not detect aggregates of S1 molecules using fluorescence confocal microscopy.

      *CROSS-CONSULTATION COMMENTS I agree with the other reviewer's comments and critiques, I have concerns about the biological relevance and also the biophysical mechanisms. Reflecting on the other reviewers' comments, the papers could provide more depth in one or both of these areas to come to firm conclusions that are either revealing about PGL biology or elucidate a (possible) general biophysical mechanism. *

      In the revised version, we now include additional data which shows “dynamics buffering” in transgenic worms generated using CRISPR/Cas9 technology. Briefly, we used CRISPR/Cas9 to generate transgenic C. elegans which expresses PGL-3-mEGFP or PGL-3(D425-452)-mEGFP from the native pgl-3 locus. In vitro, wild-type PGL-3-mEGFP protein generates liquid-like condensates. On the other hand, the recombinantly purified PGL-3(D425-452)-mEGFP protein generates condensates that are non-dynamic. In contrast to these observations in vitro, both wild-type PGL-3-mEGFP and PGL-3(D425-452)-mEGFP show similar dynamics (half-time of FRAP recovery) within P granules in vivo.

      Reviewer #2 (Significance (Required)): *Hence, although the authors shows how inclusion of other components can alter the one protein component phase separation, this is done with entirely artificial means of destabilizing the fold of one of the domains which likely leads to aggregation. So the true impact of the work is hard to understand because the mutations impact on the basic biophysical properties of the domain (stability, interaction) are not completely characterized and the reason for disrupting this folding is not clear. *

      A major impact of our work is elucidation of a novel “dynamics buffering” property within biomolecular condensates in vitro. Our in vivo data is consistent with this finding.

      • *

      We have chosen two orthogonal ways of perturbing the PGL-3 protein (i.e. mutations and temperature-dependent unfolding) to assay the effect on diffusion rate against different levels of perturbation (e.g. 30% loss of a-helicity in heat-denatured PGL-3-mEGFP vs. 15% loss of a-helicity in the S1 mutant, compared to wild-type PGL-3). Studying the phase separation behavior of these “artificially-generated” constructs provided the understanding that dynamics of PGL-3 in condensates depends on inter-molecular interactions, and slower dynamics generally correlate with stronger inter-molecular interactions. Further, interactions among two or more P granule components can buffer against large change in dynamics / aggregation within the P granule phase. These insights may lay the groundwork for addressing how more “natural” modifications (e.g., post-translational modifications, high local concentration of “sticky” molecules) may influence dynamics within biomolecular condensates in vivo.

      Based on current knowledge of P granule composition, chaperone proteins (e.g. heat-shock family proteins) do not show abundant concentration within P granules. However, it is unclear if chaperone proteins are completely excluded from the P granule phase. Therefore, we speculate that weak interactions among two or more non-chaperone proteins contribute significantly to “dynamics buffering” within the P granule phase in vivo.

      In the discussion section of the manuscript, we had speculated that “dynamics buffering” may potentially explain observations reported in the nucleolus: “Similarly, interactions among components could be a potential mechanism of storage of misfolding-prone proteins in non-aggregated state within the liquid-like nucleolus under stress in vivo (Frottin et al, 2019).”

      Our finding is also relevant in the context of synthetic biology with applications that require steady diffusion rate of macromolecules during biochemical reactions within biomolecular condensates.

      • *

      My field of expertise is protein phase separation and protein structure. * *

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: P granules are liquid condensates found in the developing germlines and embryos of C. elegans. Prior work by the authors and others have established P granules as a tractable model to investigate the basic biophysical properties of liquid condensates. Much of the prior published work focused on specific P granule scaffold proteins, PGL-1 and PGL-3. How attributes of these PGL proteins and the effect of other P granule components affect condensate properties is not fully understood. Here, Jelenic, et al. probe the biophysical properties of PGL-3. Using recombinant protein, they show that an N-terminal, alpha-helical region of PGL-3 is sufficient for liquid condensate formation and that N-terminal assembly is required for this formation. Creation of a scrambled alpha-helical region in PGL-3 and heat treatment affects PGL-3 fluidity. This fluidity can be "rescued" in vivo and in vitro with the inclusion of other P granule factors, including wildtype PGL-3, PGL-1, GLH-1 and mRNA. The authors note an inverse correlation between fluidity and mutant PGL-3 fluorescent intensity. They propose a model that heterotypic compositions of condensates can buffer their fluidity against components with stronger multivalent interactions. *

      MAJOR: 1. PGL-3 is a fantastic model to study the biophysical properties of a liquid condensate. But as the authors address in their discussion, the S1 mutant will likely affect the central domain folding, at its minimum causing exposure of a hydrophobic surface not typically exposed in biology. These helices are found at the terminal portion of the domain determined in the crystal structure and as depicted in the authors' Figure 1A. While the cause of S1's enhanced molecular interactions does not affect the in vitro work presented in this manuscript, it does affect how the conclusions connect to the biological nature of P granules and liquid condensates more generally. *

      We have chosen two orthogonal ways of perturbing the PGL-3 protein (i.e. mutations and temperature-dependent unfolding) to assay the effect on diffusion rate against different levels of perturbation (e.g. 30% loss of a-helicity in heat-denatured PGL-3-mEGFP vs. 15% loss of a-helicity in the S1 mutant, compared to wild-type PGL-3). Studying the phase separation behavior of these “artificial” constructs provided the understanding that dynamics of PGL-3 in condensates depends on inter-molecular interactions, and slower dynamics generally correlate with stronger inter-molecular interactions. Further, interactions among two or more P granule components can buffer against large change in dynamics / aggregation within the P granule phase. These insights may lay the groundwork for addressing how more “natural” modifications (e.g., post-translational modifications, high local concentration of “sticky” molecules) may influence dynamics within biomolecular condensates in vivo.

      Based on current knowledge of P granule composition, chaperone proteins (e.g. heat-shock family proteins) do not show abundant concentration within P granules. However, it is unclear if chaperone proteins are completely excluded from the P granule phase. Therefore, we speculate that weak interactions among two or more non-chaperone proteins contribute significantly to “dynamics buffering” within the P granule phase in vivo.

      In the discussion section of the manuscript, we had speculated that “dynamics buffering” may potentially explain observations reported in the nucleolus: “Similarly, interactions among components could be a potential mechanism of storage of misfolding-prone proteins in non-aggregated state within the liquid-like nucleolus under stress in vivo (Frottin et al, 2019).”

      Our finding is also relevant in the context of synthetic biology with applications that require steady diffusion rate of macromolecules during biochemical reactions within biomolecular condensates.

      • Recombinant PGL-3 experiments added PGL-1, GLH-1 and mRNA simultaneously and measured fluidity. It will be interesting to know which components contribute to fluidity and whether fluidity enhancement of each component is dependent on one another. Addition experiments with each component should be included and/or at least discussed in the main text. *

      Our data with S1-mEGFP or PGL-3-mEGFP (pre-heated at 50°C) proteins microinjected into C. elegans gonads, and the transgenic strain expressing PGL-3(D425-452)-mEGFP from the pgl-3 locus showed that the P granule phase can support fast dynamics of these mutant PGL-3 constructs. Since P granules have a complex composition, one possibility is that fast dynamics of these constructs is supported by interactions involving many P granule components. We found that using only a limited set of P granule components (PGL-1, GLH-1 and mRNA) can buffer dynamics of S1 in condensates in vitro.

      In absence of a systematic analysis investigating the individual role of approx. 70 P granule proteins in buffering S1 dynamics in condensates in vitro, we have claimed in the text that dynamics-buffering of S1 in condensates is supported by interactions among two or more components. However, we do appreciate the reviewer’s comment and feel it would be interesting to investigate the contribution of individual P granule components towards fluidity in future studies. We have discussed this in the ‘Discussion’ section of the manuscript.

      • The biological relevance of PGL-1, GLH-1, and mRNA were not discussed in the main text. How these factors contribute to P granule assembly and function should be mentioned in the Introduction or Results. *

      To address this concern, we have added a paragraph in the Introduction section of the revised manuscript.

      *MINOR: 1. Line 20, "most non-membrane-bound compartments...have complex composition": Are there examples of condensates that do not have complex composition? *

      Not all non-membrane-bound compartments may have been characterized. To accommodate this possibility, we refrained from making a more general statement, but stated “most non-membrane-bound compartments…”.

      • Lines 40-43, RNA interactions driving LLPS: Please include citations from the Parker Lab (e.g. Van Treeck and Parker, Cell. 2018 doi: 10.1016/j.cell.2018.07.023) *

      We added the reference suggested by the reviewer.

      • *

      • Line 60, condensates contain hundreds of different proteins and RNA: Please cite at least a few examples of condensates with their components identified. *

      We added some references following suggestion by the reviewer.

      • Lines 82-84, PGL-3 drives assembly: Please cite Kawasaki, et al. Genetics 2004 for the discovery of PGL-3. *

      We added the reference suggested by the reviewer.

      • Lines 88-89, PGL-3 N-terminal fragment predominantly alpha-helical: The PGL domain structures should be cited here as supporting evidence that these regions are composed primarily of alpha helices (Aoki, et al 2016, 2021) *

      • *

      To address this concern, we have added a paragraph in the Introduction section of the revised manuscript.

      • Lines 158-159, driving forces for phase separation: This statement should be removed or expanded. The authors point regarding the protein concentrations is not clear here but clarified in the Discussion (Lines 691-693). Recommend removing due to its speculative nature. *

      We retained the speculative comment in the results section. We feel that this prepares the readers for the discussion later in the manuscript.

      • Lines 210: Add commas before and after "PGL-1 and GLH-1"*

      We addressed the reviewer’s suggestion.

      • Lines 218-219: add "and" instead of comma between PGL-1 and GLH-1 *

      We addressed the reviewer’s suggestion.

      • Lines 238-239, alpha-helices: The PGL CDD structure should also be referenced here (Aoki, et al 2016). *

      To address this concern, we have added a paragraph in the Introduction section of the revised manuscript.

      • Lines 680-682, MEG proteins: Please cite accordingly. *

      We added the reference suggested by the reviewer.

      • Lines 694-695, heterotypic interactions: Please cite Saha, et al. 2016. *

      We added the reference suggested by the reviewer.

      • Figure 1: Add space between 1 and mM DTT *

      We addressed the reviewer’s suggestion.

      • Figure 2b: Please provide statistics between condensate numbers. *

      We provide statistics between condensate numbers in Fig. 2b.

      • Figure 4A: The region of the germline imaged and analyzed should be mentioned in the caption or the main text. *

      We revised the Figure legend of Fig. 4a to address this issue.

      • Figure 4B,C: Please include statistics between the FRAP curves. *

      We have included statistics comparing FRAP curves in Supplementary Fig. 4a-c.

      • Figure 4D: It will be helpful to compare this curve to Figure S4A in the same graph. Please also include graph statistics. *

      We have revised Fig. 4 to address the reviewer’s suggestion.

      • Figure 5: The data points are difficult to resolve. Recommend use of color.*

      We considered the suggestion, but felt it works better in the original form.

      • Figure 6: This is a very general model that does not highlight the extensive experimental work performed by the authors. Recommend incorporating PGL-3, mutants and P granule factors into this model. *

      We thank the reviewer for appreciating our extensive work. However, we retained the original Fig. 6 for the sake of simplicity.

      • Methods, Line 939, C. elegans section: What worms were used? TH623? Please describe the genotype. *

      We have included a table listing the strains used in the study and their genotype. * CROSS-CONSULTATION COMMENTS While my review was arguably the more favorable of the three, I agree with the other reviewers' comments and evaluation, particularly with Reviewer #1. As written in my review, my primary concern was the biological relevance of the work.*

      Reviewer #3 (Significance (Required)):

      Overall, the in vitro work presented investigating the biophysical properties of this minimal P granule system was thorough and well-analyzed, and the manuscript was clearly written. Additional citations and statistics will improve the manuscript and the strength of the conclusions, respectively. The biological relevance of this study to P granule form and function in vivo, and to condensates in vivo, is debatable. This work will interest those who study condensate biology, the biophysics of protein-protein and protein-RNA interactions, and RNA biochemists more generally.

      A major impact of our work is elucidation of a novel “dynamics buffering” property within biomolecular condensates in vitro. Our in vivo data is consistent with this finding.

      We have chosen two orthogonal ways of perturbing the PGL-3 protein (i.e. mutations and temperature-dependent unfolding) to assay the effect on diffusion rate against different levels of perturbation (e.g. 30% loss of a-helicity in heat-denatured PGL-3-mEGFP vs. 15% loss of a-helicity in the S1 mutant, compared to wild-type PGL-3). Studying the phase separation behavior of these “artificially-generated” constructs provided the understanding that dynamics of PGL-3 in condensates depends on inter-molecular interactions, and slower dynamics generally correlate with stronger inter-molecular interactions. Further, interactions among two or more P granule components can buffer against large change in dynamics / aggregation within the P granule phase. These insights may lay the groundwork for addressing how more “natural” modifications (e.g., post-translational modifications, high local concentration of “sticky” molecules) may influence dynamics within biomolecular condensates in vivo.

      • *

      Based on current knowledge of P granule composition, chaperone proteins (e.g. heat-shock family proteins) do not show abundant concentration within P granules. However, it is unclear if chaperone proteins are completely excluded from the P granule phase. Therefore, we speculate that weak interactions among two or more non-chaperone proteins contribute significantly to “dynamics buffering” within the P granule phase in vivo.

      In the discussion section of the manuscript, we had speculated that “dynamics buffering” may potentially explain observations reported in the nucleolus: “Similarly, interactions among components could be a potential mechanism of storage of misfolding-prone proteins in non-aggregated state within the liquid-like nucleolus under stress in vivo (Frottin et al, 2019).”

      Our finding is also relevant in the context of synthetic biology with applications that require steady diffusion rate of macromolecules during biochemical reactions within biomolecular condensates.

      *I have expertise in P granules, protein/RNA biochemistry, condensate assembly, and C. elegans. *

      References

      Aoki ST, Kershner AM, Bingman CA, Wickens M & Kimble J (2016) PGL germ granule assembly protein is a base-specific, single-stranded RNase. Proceedings of the National Academy of Sciences of the United States of America

      Aoki ST, Lynch TR, Crittenden SL, Bingman CA, Wickens M & Kimble J (2021) C. elegans germ granules require both assembly and localized regulators for mRNA repression. Nat Commun 12: 996

      Cipriani PG, Bay O, Zinno J, Gutwein M, Gan HH, Mayya VK, Chung G, Chen J-X, Fahs H, Guan Y, et al (2021) Novel LOTUS-domain proteins are organizational hubs that recruit C. elegans Vasa to germ granules. Elife 10: e60833

      Frottin F, Schueder F, Tiwary S, Gupta R, Körner R, Schlichthaerle T, Cox J, Jungmann R, Hartl FU & Hipp MS (2019) The nucleolus functions as a phase-separated protein quality control compartment. Science 365: 342–347

      Kawasaki I, Amiri A, Fan Y, Meyer N, Dunkelbarger S, Motohashi T, Karashima T, Bossinger O & Strome S (2004) The PGL family proteins associate with germ granules and function redundantly in Caenorhabditis elegans germline development. Genetics 167: 645–661

      Kawasaki I, Shim YH, Kirchner J, Kaminker J, Wood WB & Strome S (1998) PGL-1, a predicted RNA-binding component of germ granules, is essential for fertility in C. elegans. Cell 94: 635–645

      Phillips CM & Updike DL (2022) Germ granules and gene regulation in the Caenorhabditis elegans germline. Genetics 220: iyab195

      Price IF, Hertz HL, Pastore B, Wagner J & Tang W (2021) Proximity labeling identifies LOTUS domain proteins that promote the formation of perinuclear germ granules in C. elegans. Elife 10: e72276

      Saha S, Weber CA, Nousch M, Adame-Arana O, Hoege C, Hein MY, Osborne Nishimura E, Mahamid J, Jahnel M, Jawerth L, et al (2016) Polar Positioning of Phase-Separated Liquid Compartments in Cells Regulated by an mRNA Competition Mechanism. Cell 166: 1572-1584.e16

      Spike C, Meyer N, Racen E, Orsborn A, Kirchner J, Kuznicki K, Yee C, Bennett K & Strome S (2008a) Genetic analysis of the Caenorhabditis elegans GLH family of P-granule proteins. Genetics 178: 1973–1987

      Spike CA, Bader J, Reinke V & Strome S (2008b) DEPS-1 promotes P-granule assembly and RNA interference in C. elegans germ cells. Development (Cambridge, England) 135: 983–993

    1. Author Response

      Reviewer #1 (Public Review):

      Several questions have remained regarding the characteristics of these cells:

      1) Based on the transcriptome data in Figure 2, the authors inferred that thymic macrophages are "specialized in lysosome degradation of phagocytosed material and antigen presentation" yet did not show functional data to support these claims. Functional assays such as phagocytosis and antigen presentation are desirable, especially in comparison to other well characterized macrophage populations.

      We agree with the reviewer that additional functional characterization of thymic macrophages will strengthen the conclusions of our manuscript. We have performed antigen presentation assay and in vitro phagocytosis assay to functionally characterize the thymic macrophages. Indeed, thymic macrophages seem to be quite good antigen presenting cells – not as good as thymic DCs, but much better than peritoneal macrophages. This is documented in Fig. 3A and B. They were also good phagocytes both in vitro and in vivo as demonstrated in Fig. 3C-G. Surprisingly, peritoneal macrophages were better in the in vitro phagocytosis assay. We attribute this result to thymic macrophages’ poor survival during the sorting and in vitro culture.

      2) Do transcriptomes of CX3CR1+ thymic macrophages in old mice significantly differ from those of young mice?

      This is a very interesting question that we plan to explore in the future, but we feel it is beyond the scope of the current manuscript.

      3) It would be helpful to better graphically show the compositions (both cell number and cell ratio) of thymic macrophage subsets (TIM4+, CX3CR1+, and others) in mice at different ages (1 week, 6 weeks, and 4 months old). It is not straightforward to deduce all the information based on the current data presentation.

      We thank the reviewer for the suggestion! Plotting the cell numbers did reveal a peak in young age and then significant decline in the number of Tim4+ cells and a trend for accumulation of Tim4+ cells with age. Unfortunately, older mice show great variability in thymus size, which prevented the Tim4- result from being statistically significant. We have added these data to Fig. 8F.

      4) The description of the gating strategy of thymic macrophages for Figure 1 is quite verbose. Adding a step-wise gating strategy of thymic macrophages as a figure panel would be helpful for readers to follow the experimental details.

      We thank the reviewer for the suggestion. The description of the gating strategy has been stripped to 2 panels that capture its essence (Fig. 1B).

      Reviewer #2 (Public Review):

      This work provides by far the most thorough characterization of thymic macrophages. The authors used bulk RNA-seq, single-cell seq and fate mapping animal models to demonstrate the phenotype, origin and diversity of thymic macrophages. Overall the manuscript is well written and the conclusions of the paper are mostly well supported by data.

      Some aspects of data acquisition and data analysis need to be clarified.

      1) the authors should state what does row min row max in figure2 b,d refer to. is this expression value on log scale? In figure 2d, the authors compared their own RNAseq data with ImmGen seq data, what kind of normalization did the authors apply?

      We appologize for not making this clear. The values in Fig. 2b and d (current Fig. 2A and C) are expression values on log scale. We have included this information in the figure.

      Our data is part of the IMMGEN dataset. We sorted the cells and sent them to the US for RNA sequencing. That is why we referred to it as “our” data. However, to avoid confusion we changed the wording to clearly reflect that the data are from IMMGEN.

      2)The authors used immunofluorescent to identify the localization of two populations of macrophages, where they used merTK staining to indicate all macrophages. However, MerTK expression may not restrict to immune cells. The authors are encouraged to confirm that MerTK only labels macrophages in thymus by co-staining with F4/80 or CD45. Tim4 can also be used in immunofluorescence.

      We agree that staining with additional macrophage markers will strengthen our conclusions about ThyMacs localization. We have performed staining with CD64 together with MerTK or Tim4. CD64 and MerTK almost completely overlapped and so did CD64 and Tim4 in the cortex. We could not stain MerTK and Tim4 together because the antibodies are raised in the same species (rat). Additional evidence for the specificity of these markers for thymic macrophages comes from Fig. 3E and F showing the high degree of co-localization of apoptotic cells (TUNEL+) with MerTK or Tim4. Finally, Fig. 4 figure supplement 1 also clearly shows the distribution of TIM4 and CD64 in the whole thymus.

      3) The data of Cx3cr1+ cells accumulation with age in thymus is very interesting, and as the author has discussed, might indicate their contribution to thymus involution. However, the authors only showed change of percentage. As the total macrophages numbers decreased with age, it is not clear whether these cells actually "accumulate" with age. It will help us to assess if this increased percentage of Cx3Cr1+ cells is an actual increase of "influx" or due to the decrease of the self-maintain Tim4+ macrophage subsets.

      The reviewer is raising a very important point. As the changes in the Tim4+ and Tim4- thymic macrophages proportions with age occur at the background of thymic involution, it is difficult to judge whether Tim4+ cells self-maintain and whether Tim4- cells accumulate. Plotting the cell numbers revealed a peak in young age and then significant decline in the number of Tim4+ cells and a trend for accumulation of Tim4+ cells with age. Unfortunately, older mice show great variability in thymus size, which prevented the Tim4- result from being statistically significant. We have added these data to Fig. 8F.

      Reviewer #3 (Public Review):

      This study by Zhou et al. focuses on thymic macrophages and shows that two populations can be distinguished with different identities, localization and origin. Authors use several murine reporter and fate-mapping models, coupled with flow cytometry and transcriptomics approach to support their claims.

      Overall, the question tackled by this study is interesting, thymic macrophages having a bit being forgotten in the last decade which has seen many studies similar to the one presented here in other organs. So, the stated aim to closing this gap is relevant. But the actual version of the study suffers from many defects, more or less severe, which affect the clarity and the persuasiveness of it.

      • About the plan, authors study the origin of the thymic population and provide data in fig 2, 3 & 4 assuming that thymic macs form a homogeneous population. But from fig 5, they distinguish 2 populations and study them separately. So the end of the paper renders obsolete the beginning, that asks for a revision of the whole plan.

      We agree with the reviewer that there is more than one way to tell this story and we have been agonizing over our plan. However, we respectfully disagree that the beginning of the paper is made obsolete by the ending for several reasons:

      1) The initial figures in our manuscript contain very fundamental characterizaition of ThyMacs. Just as the revelation of a heterogeneity in liver macrophages or lung macrophages (ref) does not render all prior research on these cells obsolete, the initial figures in our manuscript are an essential part of the story. Such data are available for all other studied tissue resident macrophage populations. Removing them will be a disservice to the community.

      2) Another reviewer asked for deeper characterization of ThyMacs based on the data in Fig. 2. Accommodating this request will be very difficult if we remove this part.

      Nevertheless, we agree that ThyMacs heterogeneity is the central claim of the manuscript and should be introduced earlier. Now, the original figure 5 (current Fig. 4) that described the heterogeneity has been moved before the original figures 3 and 4 (current Fig. 5 and 6). Additional analyses distinguishing Tim4+ and Tim4- ThyMacs has been incorporated in current Fig. 5 and 6.

      • The figure 1 is not very clear. The backgating should be added in 1a. Or why not using the color map axis mode from FlowJo to show 3 parameters at a glance? The gating strategy should be more clearly displayed on the figure. On fig 1S3, there are clearly 2 pops in the CX3CR1-GFP mice. Why not starting from this to introduce the two populations?

      We thank the reviewer for the suggestion. We have included a color map axis to show MerTK, CD64, and F4/80 in one plot. The description of the gating strategy has been stripped to 2 panels that capture its essence. \We agree that there are several indications for heterogeneity among thymic macrophages, starting with Fig. 1E – the expression of Tim4, and Fig S4c – the expression of CX3CR1-GFP. We have added extra text at the beginning of the paragraph describing current Fig. 4 to point out these facts.

      • The figure 2 could be revised also. First, the panel 2a is useless and should be removed. A PC analysis of all the macs would be more useful here. Also, the color code used for the genes is confusing. Why genes up in ThyMacs are red in 2b but only half of them in 2d? Info can be found in the legend but it should be more clear on a graphical point of view.

      We have revised Fig. 2 according to the reviewer’s suggestions. The PCA analysis is consistent with the hierarchical clustering and shows that splenic and liver macrophages are most closesly related to ThyMacs. We agree that the presence of red in both heatmaps is confusing and we have changed the color code – color was removed from current Fig. 2A but retained in Fig. 2C.

      • For figure 3, what is the timepoint of the panel 3b? Here, authors should show microglia and ThyMacs for both timepoints and conclude based on the comparison. If ThyMacs are as stable as the microglia, no replacement. If not, replacement. For the panel 3f, n=3 is too low to be convinced notably with the standard variation here. And displaying the dot plot with 11% of blood mono from donor while the median being around 20 is not fair, authors should present the most representative plot. For the panel 3h, there are more GFP (in term of MFI) for TEC and ThyMacs than for total cells. How is it possible? TECs and ThyMacs should be in the total cells? Or the gating is not clear enough?

      We thank the reviewer for pointing our omissions. Fig. 3b (current Fig. 5B) is from E19.5 and we have added this information to the figure. We also agree that in Fig. 3f (current Fig. 5F) the sample number is too small and the variation too large to make solid conclusions. That is why we have repeated the partial chimeras experiment trying to irradiate as much as possible of the mice without affecting the thymus. We have substituted the data in the Fig. 3e and 3f with the new data. For Fig. 3h, we appologize for not labeling the data clearly. The panels labeled “single, live cells” should be labeled as “thymocytes” as they were obtained without enzymatic digestion that is essential for both TECs and ThyMacs. However, we found an important caveat in the thymus transplant experiment. It appeared that some of the thymus macrophages were GFP positive not because they express GFP but because they have engulfed GFP+ cells. As a result our experiments with embryonic GFP+ thymus transplants overestimate the percentage of donor-derived ThyMacs (all of them were GFP+). We have repeated the thymus transplantation experiments with congenically marked thymuses (CD45.2 donor and CD45.1 host). While this set up did not allow us to use the thymic epithelial cells as positive control because they are CD45-, we did identify host-derived ThyMacs, consistent with Tim4- cells originating from adult HSCs. Thus, we have replaced the previous data in Fig. 3H and 3I with current figures 5H and 5I.

      • For figure 4, the EdU staining (4e) is not convincing at all. The signal is very low (as compared to 4c for example.

      We agree that signal after 21d chase is a lot weaker than after 2 h (Fig. 4c) or 21d (Fig. 4e) of EdU pulse. The reason we decided to keep this data is that: 1) the thymocytes also have much lower EdU staining after 21d chase compared to 2h and 21d of EdU pulse; 2) The results from EdU staining are very consistent with the data from Ki67 staining, cell cycle analysis, and scRNA-Seq revealing a small population (~5%) of cycling ThyMacs.

      • For figure 7, the interpretation of the data and the way to present them are not clear. Authors use an inducible fate-mapping model. The fact that Tim4- loose their signal with time argue for a replacement by non-labelled cells (blood monocytes) whereas Tim4+ ones are stable meaning they self-maintain. It is what authors claim. But how it fits with previous data where they say that Tim4+ derived form CX3CR1+? The explanation that is a bit subtended here but not enough clearly shown is that CX3CR1+ give rise to Tim4+ during embryonic development but is stops after, Tim4 self-renew independently, and CX3CR1+ are slowly replaced by monocytes. As this is the central claim of the paper, it should be most clearly reported and for this, a substantial change of the whole plan is required.

      We thank the reviewer for pointing out the need for better explanation. The maintenance of the different populations of ThyMacs is indeed complex and proceeds in different ways in the different periods of life. We have added some extra data to Fig. 7 (current Fig. 8) that we hope will add some clarity to the maintenance of thymic macrophages with age. The new Fig. 8F shows the dynamics of the cell numbers of Tim4+ and Tim4- macrophages with age. Tim4+ cells reach a peak in young mice and decline significantly as mice age. So, we do not think that they are self-maintaining but instead, undergo slow attrition with very limited replacement. These results are consistent with Fig. 6I showing low levels of Mki67 in Tim4+ cells. Tim4- are a different story: they progressively accumulate with age. Although the variability in thymus size and Tim4- macrophages in very old mice is too great for the data to reach significance, the trend is clear.

      As for the dynamics of the populations in the embryonic period, we added data formally demonstrating that TIM4+CX3XR1- are derived from CX3CR1+ cells by fate mapping (Fig. 7E-G). We induced re-combination in pregnant ROSA26LSL-GFP mice pregnant from Cx3cr1CreER males at E15.5 when almost all ThyMacs are Cx3cr1+ (Fig. 7A). Just before birth, at E19.5, we could find a substantial proportion of TIM4+CX3CR1- cells among the fate mapped GFP+ macrophages, indicating that Cx3cr1+ cells, indeed, give rise to TIM4+CX3CR1- cells. As pointed out before, this pathway gets exhausted by the first week after birth – at d7 all ThyMacs are TIM4+.

    1. we must acknowledge that our styles of teaching may need to change. Let's face it: most of us were taught in classrooms where styles of teachings reflected the hotion of a single norm of thought and experience, which we were encouraged to believe was universal.

      I totally agree with this statement, because different people think differently, and people's brain work and learn in different ways in different stage of growth. It is really surprising that we as a student, all learn from the same method and experiences. And clearly the style of teaching should be change,

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The authors have assembled an enormous amount of statistical data on the genomes and phylogeny of Arctic algae, including the genomes of four new species that they sequenced for this study. Their main finding is that horizontal gene transfer has led to convergent evolution in distantly related microalgae.

      **Major comments**

      Reviewer #1__: The purpose of the study is not clearly stated in the abstract or the introduction. The authors say (line 93) "Defining the genetic adaptations underpinning these small algal species is crucial as a baseline to understand their response to anthropogenic global change (Notz & Stroeve,2016)." Is this their goal? Or are they just quoting another study? The authors state (line 103) "We extend by sequencing the genomes of four distantly related microalgae...". This is not really a question or a hypothesis. I am sure the authors can provide a more compelling reason to embark on such a labor-intensive study.__

      Reply: We agree that the aim was lost in the details and the Introduction is now focused towards the original goal of the study, which was to investigate convergent evolution in a biogeographically isolated ocean. Additional references on the formation and history of the Arctic Basin have been added to the Introduction to provide context. “An ocean has been present at the pole since the beginning of the Cretaceous. Shaped by tectonic processes (Nikishin et al., 2021) the Arctic Ocean has been a relatively closed basin since the Masstrichtian at the end of the late Cretaceous epoch (ca. 70 million years before present), with episodic sea-ice cover since that time (Niezgodzki et al., 2019). This long history suggests limited gene flow from the global ocean over vast time scales and Arctic marine species including microalgae could well have unique adaptations to cold arctic conditions.” Line 78-83.

      And following this we provide a clear hypothesis “The potential for lineages of ancient Arctic origin and the episodic input of outside species led us to our hypothesis that Arctic microalgae convergently evolved traits or adaptations aiding survival in an ice-influenced ocean. Line 112-117.

      We also discuss both the adaptive and distinct physical environment of the Arctic, and its topographical separation from other ocean regions as dispersal limitation would enhance the Arctic-specific genomic signatures. We now cite the recent paper by Sommeria-Kline et al. (2020), which puts eukaryotic plankton biogeography into a global context (Line 72)

      Reveiwer #1__: The most prominent shared trait that the authors found are genes for ice-binding proteins. However, in view of their importance, little information is given about their different types and possible functions.__

      Reply: We appreciate the comment and have added information on relevant ice binding proteins found in the Arctic Algae. In addition, we discuss how the functional and secretory diversity of IBP would enhance the survivability of pelagic taxa. Lines 534 to 564.

      Although ice binding proteins from multicellular animals and plants are outside the scope of this study, there is a recent review; Bar Doley, Braslavsky and Davies 2016 Annual review of Biochemisty 85: 515-542.

      .

      Reviewer #1__: The HGT of ice-binding proteins is a major focus of this study, but little is said about what previous studies have said about this. What are the previous studies, what are their findings and how do the present findings contribute to this?__

      Reply: We agree that this aspect should have been more visible. We incorporated new data to characterize IBPs drawn from MMETSP transcriptomes, and environmental Tara Ocean metagenomes, as well as our Arctic strains. We note that as we take a PFAM-based approach, the IBPs treated are DUF3494/PF11999 domain, which are type 1 IBPs / algal IBPs (Raymond and Remia 2019). As an example of novelty, we identify the position of IBPs from dinoflagellates, within a larger Arctic Clade that included CCMP2293, CCMP2436 and CCMP2097 and Arctic TARA IBP, rendering this a pan-algal IBD clade.

      In addition, we were able to resolve the position of anomalous F. cylindrus IBP that fell between two Arctic associated clades (A and B, in our Fig 4). This finding is consistent with F. cylindrus originating in the Arctic as previously suggested and subsequently invading the Southern Ocean.

      The recurrent acquisition of multiple diverse IBP isoforms in individual species through HGT events has not been previously reported, and the extent of isoforms in the Arctic was surprising. See for example multiple different IBP forms with separate origins in Pavlovales CCMP2436 (Fig 4). The previous studies are referred to in the context of the phylogeny of the IBD within the results section: Lines 322- 413, and Lines 534-585.

      Reviewer #1: Figure 5 on HGT of ice-binding proteins is difficult to follow. It would be clearer if each panel could be described separately, clearly stating its main finding. I doubt that a reader could look at this figure and explain to a colleague what it shows.

      Reply: We have revised rearranged the figure (now Fig 4) with Arctic A, B, C and D clearly indicated as well as the two Antarctic dominated clades. The upper schematic includes the deepest phylogeny of algal IBDs to date, incorporating all of UniRef, MMETSP and TARA Oceans. The fasta files underlying the tree and the nexus file used are provided the S1 Data Folder, which is an excel folder with information on the analysis of the data. The callout and order of the clades has been revised to facilitate interpretation of the phylogenies more clearly. The entire section has been completely rewritten.

      Reviewer #1: This is also a problem with many of the other figures. For each figure, what is the question being asked and what is its take-home message?

      Reply: We agree that the message was lost and have now focused on our original question in our accepted proposal to JGI. “Is there a convergence among arctic microalgae at the genomic level?”. We found some genome properties were common among the Arctic isolates (more unknown PFAMS and several expanded PFAMs). The importance of ice binding proteins in Arctic Isolates and the widespread inter-algal HGT of this important protein among the Arctic strains. The IBP biogeography and phylogeny strongly indicate that the Arctic microalga have acquired IBP locally and that the Antarctic strains have acquired additional isoforms independently from Antarctic bacteria and fungi (Lines 565-585).

      Reviewer ____#1____: ____The paper has more data than a reader can absorb. It could be strengthened by reducing the number of figures, simplifying them if possible, and more clearly stating the value of the remaining figures.

      Reply. As suggested, we have refocused the paper, removing more speculative statistics based analysis and associated figures. The main conclusions are supported by the 5 main figures. We are now present 5 main figures and 11 supplementary figures (previously 23 downloadable supplementary figures and 40 on-line only figures supporting the support figures). We agree with the reviewer, and we feel the revised version is a more transparent synthesis. Briefly the Figures illustrate the following points. Fig. 1. The multigene tree of available algal genomes and transcriptomes provides a clear framework for judging the divergence of subsequent individual gene and PFAMs phylogenies. Fig. 2 (originally Fig. 3). Indicates the convergence of PFAM domains in the Arctic strains, in contrast to strains from elsewhere. Fig. 3 (originally Figure 4) shows Arctic specific expansions and contraction of PFAM domains, again demonstrating convergent evolution in the Arctic. The figure identifies specific PFAMs that contribute to the within-Arctic convergence. This figure is based on statistical methods independent of Fig 2. Figure 4 is the most extensive IBP phylogeny to date and has been discussed above. Figure 5, which was supplementary in our non-peer reviewed version, shows the biogeographic distribution of IBP, and can be compared to the distributions of the 18S rRNA genes from the four Arctic algae provided as supplementary (S6 Fig.)

      **Minor comments**Reviewer #1

      1. The figure citations are confusing. E.g., what does "Fig.1- Figure supplement 1" refer to? Does this refer to 1 or 2 figures? Apparently, it refers only to Fig. S1, so many readers will be confused when they look at Fig. 1.

      Reply: We apologize for the confusing format; the manuscript had been formatted for the online journal eLife. Our revision follows the more traditional style of PLoS Biology and other Review Commons journals.

      .

      Multiple citations should be in order of publication date, not alphabetical order.

      Reply ; We agree that date of publications is quite standard and recognizes priority of publication. Several on line journals no longer follow this rule and citation order will follow the specific style used by our accepting journal.

      Reviewer #1 (Significance (Required)): It is well known that useful genes tend to be shared among microorganisms. The present study strengthens previous studies in showing that gene transfer is an important process in polar regions.

      Reply: We thank the reviewer for recognizing the importance of our study.


      Reviewer #2 ____(Evidence, reproducibility, and clarity (Required)):

      This manuscript is the result of a large international collaborative effort, including the US Department of Energy Joint Genome Institute. Its focus is comparative genomics of eukaryotic Arctic algae. The primary data described in the ms are four new genome and transcriptome sequences from diverse Arctic algae, represented by a cryptomonad, a haptophyte, a chrysophyte, and a pelagophyte.

      The authors compare these new data to previously published genomic/transcriptomic data from eukaryotic algae with the goal of understanding genome evolution in the Artic. The results of the paper are a series large-scale comparative genomic bioinformatics analyses, including the associated statistical analyses. The key findings center on statistically significant features of Arctic genomes, features that stand out as compared to the genomes of algae that are not primarily found in the Arctic. Together, these findings allow the authors to make various hypotheses and suggestions about genetic adaptations to polar environments.

      By far the most significant finding is that the genomes of Arctic algae are enriched in genes encoding proteins with an ice-binding domain, paralleling findings from Antarctic algae. These genes appear to have spread among Arctic algal genomes via horizontal gene transfer, which raises a series of interesting questions. In my opinion, the major conclusions of this paper are supported by the data. Listed below are a few comments that may improve the ms:

      Reviewer #2.

      1) In today's post-genomics era, everyone seems to be sequencing nuclear genomes. Often what distinguishes high-impact and low-impact genome papers is the number of genomes presented and the quality of the genome assembly. I may have missed it, but reading the main text, the figures/tables, and the supplementary data I was not able to get a sense of the quality of the four genome assemblies from which the main findings are based. I was eventually able to find this information from PhycoCosm (note: some of the links to this site are not working in the ms). My quick scan of the PhycoCosm summary info for the four genomes indicates that the assemblies are highly fragmented, likely because they are based on short-read Illumina sequencing rather than a combination of short and long reads. I think it is important to briefly discuss (and or present) the quality of the assemblies in the ms and to highlight the potential limitations/drawbacks of employing highly fragmented assemblies when carrying out large-scale comparative genomics.

      Reply: We agree and the data concerning the genome quality assemblies has been moved to the main text Table 1. The comparison with other paired related strains is provided in an excel folder designated S2 Data Folder.

      Reviewer #2.

      2) Horizontal gene transfer is undeniably a major driving force in evolution, and one that has shaped genomic architecture across the Tree of Life. I believe the data presented here support a role for HGT in the genome of evolution of Arctic algae, particularly with respect to genes encoding proteins with an ice-binding domain. However, we can all think of numerous instances when authors of genome papers were too quick to point to HGT. Thus, I would urge more caution and balance when presenting the HGT data, including some discussion about factors that could incorrectly lead researchers to conclude a significant role for HGT, such as contamination, gene duplication, mis-assemblies, etc. I'm not suggesting that you change the main conclusions, but just tone down the language in places (e.g., "we reveal remarkable convergence in the coding content ... ").

      Reply: We understand the reviewers concerns and now more clearly outline the pipeline we have used to identify HGTs. This included: filtering each genome to remove all possible contaminant sequences first, considering both contig co-presence of vertical- and horizontally-derived genes, and reciprocal and independent annotations of gene sequences in both genome sequences and MMETSP transcriptomes. Retained genes were subjected to simultaneous BLAST analysis and manually curated phylogenies using decontaminated reference datasets. The most parsimonious explanation for our final IBP domain microbial algal clusters (Fig 4) is HGT. On the side of caution, we removed the entire section that identified potential arctic HGT based primarily on a less targeted broad statistical analysis. The focus is now on 3 genes that have clearly identifiable utility in the Arctic, were found to be enriched in Arctic genomes via a separate analysis and had homologs in the Tara Ocean Polar circle data. In addition, we describe more clearly the role of expansion and enrichment of PFAMs and the high proportion genes without an identifiable PFAMs in the Arctic strains as evidence for Arctic convergence separate from potential HGT.

      Reviewer #2.

      3) The downside of studying protists (as compared to multicellular animals, for instance) is that most are not widely known by the scientific community and even fewer scientists can picture what they actually look like (e.g., Pavlovales sp. CCMP2436). A few more details about the four Arctic algae that make up the focus of this paper might be helpful for the casual reader. My sense is that if at the next departmental meeting I asked my colleagues what a pelagophyte was most would look at me with a blank stare. Moreover, am I right to assume that all four algae are psychrotolerant rather than psychrophilic (Supplement Fig. 1 makes me think otherwise). It might be good to point out the difference in the text.

      Reply: High resolution images of each strain are available on the JGI home page for each alga, given the multiple figures we feel photos would not add information.

      Reviewer #2

      4) I don't think Supp. Table 1 (the Pan-algal dataset) got uploaded correctly during the manuscript submission stage. The first link I click on gives me Supp. Table 2.

      Reply: We apologize for this, the format was incorrect for the file designation and there were lost links. We now more actually refer to these as Data Folders as they are excel folders containing multiple sheets, All supplementary links will be verified again on final submission.

      .

      Reviewer #2 (Significance (Required)):

      By far the most significant finding from this paper is that the genomes of Arctic algae are enriched in genes encoding proteins with an ice-binding domain, paralleling findings from Antarctic algae. These genes appear to have spread among Arctic algal genomes via horizontal gene transfer, which raises a series of interesting questions. This is not the first paper to present these types of ideas, but it is arguably the broadest analysis yet, at least with respect to eukaryotic algae. This work will be of great interest to polar scientists, phycologists, protistologists, and the genomics community. I am genome scientist studying protists, including algae.

      Reply. We thank the reviewer for their insightful comments.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      This manuscript is focused on Arctic microalgae, an important yet understudied community in permanently cold ecosystems. By sequencing the genomes of four phylogenetically diverse and uncharacterized polar algae, the authors seek to elucidate genomic features and protein families that are similar in polar species (and differ from their relatives from temperate environments) This work used high-throughput genomic sequencing and computational analysis to demonstrate significant horizontal gene transfer (HGT) in several gene families, including ice-binding proteins. The authors suggest that this HGT is an effector of environmental adaptation to Arctic environments.

      **Major comments and experiment suggestions:**

      The authors conclude that HGT between arctic species is a driver of polar adaptation. The authors strongly support the claim that HGT is present more frequently in the polar algae examined here. Whether this is adaptive should be further explored though. For instance, ice-binding domains were one PFAM group found at significantly higher frequencies in the polar species - but are all of these species associated with ice? What would be the benefit of IBDs in an alga that is found in the open ocean. Similar with the other domains (Lns 333-335), its not clear whether these are truly adaptive features. ____This is more speculative.

      Reply: We agree that detail was lacking and have considerably expanded our introduction on the character of the Arctic Ocean and have stated the goals and underlying hypothesis. Briefly, all surface water organisms that live in the Arctic encounter ice during the year as the ocean freezes in winter, and surface waters reman around negative 1.7 °C for much of the year. This information has been added to the introduction. We have also expanded the discussion on the multiple effects of different IBPs that would be ecologically beneficial for plankton as well as ice-algae and cite relevant experimental studies and reviews.

      Reviewer #3) ____HGT was a major conclusion of this study, putting this in a wider perspective would strengthen the conclusion, especially in the context of HGT from prokaryotes. Are there insights on whether IBDs are present in Arctic prokaryotes?

      Reply: This is a good question, and we now point out that there were 91 Arctic bacterial and archaeal IBP sequences in our comparative dataset. In contrast to the Antarctic clades, none were closely related to the Arctic strain IBPs (Fig 4). Line 336.

      Reviewer #3) ____The data obtained from the genomic works supports the conclusions stronger that ones from transcriptomes, where what genes/domains are present would depend largely on the sampling conditions. This should be emphasized.

      Reply: The main rational for using transcriptomes was that more of these are available and enabled us to detect convergences and HGT across a broader taxonomic range than would be possible with genome-only data, where we had access to a total of only 21 microalgal genomes. In general transcriptome studies are aimed at identifying responses under different conditions and rely on comparative expression data, usually 2-fold differences in up or down expression under different growth conditions, see for example Freyria et al. 2022 (Communications Biology). Unlike a transcriptome expression study, our data mining detected any (constitutive or regulated) expression in these unicellular haploid cells, we would have detected genes used under any condition that an algal happened to be growing. IBD was not detected in any of the temperate genomes, and only detected in transcriptomes of Arctic and Arctic-Boreal groups. However, we agree that there may be some limitation of transcriptomes only studies and mention this. Lines 522-528.

      Reviewer #3) ____An experiment to determine whether the species are cold extremophiles (psychrophiles) would be useful here to strongly support the data in Figure 1. The authors state that their species can not survive >6C but this is based on experiments done on older studies. Considering the cultures have been maintained as a continuous culture for decades, confirming that they still have psychrophilic characteristic would be useful. This is a straightforward and low cost experiment that requires simply measuring growth rates at several temperatures to define the optimal and confirm that the cells are not viable above 6C.

      Reply: These are interesting points, and the broad “background” statements in the original manuscript would require a separate study,and have been deleted. Temperature tolerance experiments are not so simple for cold adapted algae with slow growth rates. Such experiments require specialized incubators to maintain low temperatures. Temperature experiments have been carried out on the cultures in the context of other studies, see for example, Daugberg et al. 2018, J. Phycol. But this is not within the scope of the present study.

      We now restrict our conclusions to the specific question of convergence among Arctic strains. We apologize for the misunderstanding on the history of the cultures. They have not been in “continuous culture” but are cryopreserved. We now simply indicate that they grow below 6 °C, which is sufficient to assume that they are likely cryophiles, our experience is that they do not grow well or at all at higher temperatures, our efforts have been to maintain the cultures that are otherwise easily lost. We now make no claims about optimality or limits. Here we simply examined genomes and available transcriptomes that were generated from algae growing at 4-6 °C.

      Reviewer #3) ____**Minor comments:**

      Defining the species used here as psychrophiles would put the study in context better. The authors relate their finding to Antarctic species (HGT, ice-binding domains, large genomes) all of which are confirmed psychrophiles.

      Reply: The temperature definition of psychrophiles is surprisingly high (optimal growth below 15 °C) and this definition of psychrophiles is now given in the introduction. The point is really that there are few isolates from cold surface waters that have been well studied. We now add. “A handful of polar algal genomes have been extensively studied, with 4 of these from around Antarctica and classified as psychrophiles (not being able to grow above 15 °C (Feller & Gerday, 2003)”. Lines 103-107.

      Reviewer #3) ____A short rationale on why these species at all would be useful - are they representative of their classes? Do they have psychrophilic characteristics that might make them useful models in the future? Are they widely used now?

      Reply: We appreciate the point as the definition of utility in discovery-based science is an open dialog.

      We agree that the study requires context and have added our rational for selecting the species for genome sequencing to the introduction. “To address questions on genetic adaptations to this ice-influenced environment, we sequenced 4 phylogenetically divergent microalgae, from 4 algal classes belonging to 3 algal phyla: Cryptophyceae (Cryptophyta), Pavlovophyceae (Haptophyta), Chrysophyceae and Pelagophyceae (both in the Ochrophyta) isolated from the ca. 77 °N, where surface ice flow persists through June (Mei et al., 2002). The four isolates were selected as representatives of different water and ice conditions and phylogeny from available strains collected in April and June 1998 during the North Water Polynya study”.

      Reviewer #3) ____Starting algal cultures were maintained in a continuous culture since 1998 and under continuous light since at least 2015, have the authors confirmed that these algae retain their physiological features even after this long time? The accumulation of mutations is a possibility here.

      Reply: We apologize for the misunderstanding of the timeline; the history of the cultures was not given in the manuscript and the inferred history is not quite correct. The 2015 date was the year of publication for the MMETSP data. Our continuous light statement is a record of our standard culture conditions. We now elaborate on the material used in the current study. The cultures were deposited in the Bigelow culture collection (now NCMA) in 2002 and cryopreserved once they had been verified and given a culture designation. We obtained fresh cultures in 2005 and these were used for the MMETSP project. We obtained fresh cultures again in 2011, specifically for the JGI genome project. These algae do not grow fast and most of the DNA was sent to JGI in 2012 for most of the isolates. This history is rather long and not relevant, since one would speculate that over the years the algae would tend to lose the ice associated functionality, e.g. they were not frozen in seawater every year for 4 to 6 months or subject to sudden freshwater exposure, when ice melts. We would encourage other researchers to order the cultures and run experiments. We note that many of the 40 or so algae isolated from the same campaign have been used by others for specific studies and at least 8 are in the MMETSP data set. The presence of 18S rRNA and phylogenetic position of the IBP sequences compared to Tara Arctic circle data confirms long-term Arctic presence of each species and the IBP domains in the Arctic without marked changes over the last 20 years.

      Reviewer #3) ____Ln381 - The culture collection IDs for each sequenced species should be included here

      Reply: we have added the culture IDs throughout.

      Reviewer #3) ____Ln. 389 - Algal cells are harvested and used for nucleic acid extraction, the nucleic acids themselves are not harvested

      Reply: we agree and corrected the wording

      Reviewer #3 (Significance (Required)):

      This study is well places in the current state of research on polar alga and represents a significant and very valuable addition to the current knowledge pool. Algae in general are lagging behind other groups of photosynthetic organisms in the number of sequenced and analyzed genomes, despite algae being one of the main primary producers globally. This is even more strongly felt in polar research, where only 4 species have been sequenced, most of which are restricted to Antarctica. There is a true gap in our knowledge when it comes to Arctic species, and this study fills this gap. As the authors correctly state, we need more knowledge on polar environments and the primary producers that support these important ecosystems in light of current climate change trends.

      Reply: we appreciate the succinct summary of our study and thank the reviewer for insights and suggestions that have improved the manuscript.

      Reviewer field of expertise: Polar algae, stress responses, plant and algal energetics, cell signalling

      Reply: We appreciate the incites and perspective steming from the reviewer's expertise.

      Relevant key references cited in the reply:

      Daugbjerg N, Norlin A, Lovejoy C. Baffinella frigidus gen. et sp. nov. (Baffinellaceae fam. nov., Cryptophyceae) from Baffin Bay: Morphology, pigment profile, phylogeny, and growth rate response to three abiotic factors. Journal of Phycology. 2018;54(5):665-80

      Feller, G. and Gerday, C. (2003) Psychrophilic enzymes: Hot topics in cold adaptation. Nat Rev Microbiol, 1, 200-208.

      Freyria NJ, Kuo A, Chovatia M, Johnson J, Lipzen A, Barry KW, et al. Salinity tolerance mechanisms of an Arctic Pelagophyte using comparative transcriptomic and gene expression analysis. Communications Biology. 2022;5(1). doi: 10.1038/s42003-022-03461-2

      Mei, Z. P., Legendre, L., Gratton, Y., Tremblay, J. E., Leblanc, B., Mundy, C. J., Klein, B., Gosselin, M., Larouche, P., Papakyriakou, T. N., Lovejoy, C. and Von Quillfeldt, C. H. (2002) Physical control of spring-summer phytoplankton dynamics in the North Water, April-July 1998. Deep-Sea Research Part Ii-Topical Studies in Oceanography, 49, 4959-4982.

      Niezgodzki, I., Tyszka, J., Knorr, G. and Lohmann, G. (2019) Was the Arctic Ocean ice free during the latest Cretaceous? The role of CO2 and gateway configurations. Global and Planetary Change, 177, 201-212.

      Nikishin, A. M., Petrov, E. I., Cloetingh, S., Freiman, S. I., Malyshev, N. A., Morozov, A. F., Posamentier, H. W., Verzhbitsky, V. E., Zhukov, N. N. and Startseva, K. (2021) Arctic Ocean Mega Project: Paper 3-Mesozoic to Cenozoic geological evolution. Earth-Science Reviews, 217.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript is the result of a large international collaborative effort, including the US Department of Energy Joint Genome Institute. Its focus is comparative genomics of eukaryotic Arctic algae. The primary data described in the ms are four new genome and transcriptome sequences from diverse Arctic algae, represented by a cryptomonad, a haptophyte, a chrysophyte, and a pelagophyte.

      The authors compare these new data to previously published genomic/transcriptomic data from eukaryotic algae with the goal of understanding genome evolution in the Artic. The results of the paper are a series large-scale comparative genomic bioinformatics analyses, including the associated statistical analyses. The key findings center on statistically significant features of Arctic genomes, features that stand out as compared to the genomes of algae that are not primarily found in the Arctic. Together, these findings allow the authors to make various hypotheses and suggestions about genetic adaptations to polar environments.

      By far the most significant finding is that the genomes of Arctic algae are enriched in genes encoding proteins with an ice-binding domain, paralleling findings from Antarctic algae. These genes appear to have spread among Arctic algal genomes via horizontal gene transfer, which raises a series of interesting questions. In my opinion, the major conclusions of this paper are supported by the data. Listed below are a few comments that may improve the ms:

      1) In today's post-genomics era, everyone seems to be sequencing nuclear genomes. Often what distinguishes high-impact and low-impact genome papers is the number of genomes presented and the quality of the genome assembly. I may have missed it, but reading the main text, the figures/tables, and the supplementary data I was not able to get a sense of the quality of the four genome assemblies from which the main findings are based. I was eventually able to find this information from PhycoCosm (note: some of the links to this site are not working in the ms). My quick scan of the PhycoCosm summary info for the four genomes indicates that the assemblies are highly fragmented, likely because they are based on short-read Illumina sequencing rather than a combination of short and long reads. I think it is important to briefly discuss (and or present) the quality of the assemblies in the ms and to highlight the potential limitations/drawbacks of employing highly fragmented assemblies when carrying out large-scale comparative genomics.

      2) Horizontal gene transfer is undeniably a major driving force in evolution, and one that has shaped genomic architecture across the Tree of Life. I believe the data presented here support a role for HGT in the genome of evolution of Arctic algae, particularly with respect to genes encoding proteins with an ice-binding domain. However, we can all think of numerous instances when authors of genome papers were too quick to point to HGT. Thus, I would urge more caution and balance when presenting the HGT data, including some discussion about factors that could incorrectly lead researchers to conclude a significant role for HGT, such as contamination, gene duplication, mis-assemblies, etc. I'm not suggesting that you change the main conclusions, but just tone down the language in places (e.g., "we reveal remarkable convergence in the coding content ... ").

      3) The downside of studying protists (as compared to multicellular animals, for instance) is that most are not widely known by the scientific community and even fewer scientists can picture what they actually look like (e.g., Pavlovales sp. CCMP2436). A few more details about the four Arctic algae that make up the focus of this paper might be helpful for the casual reader. My sense is that if at the next departmental meeting I asked my colleagues what a pelagophyte was most would look at me with a blank stare. Moreover, am I right to assume that all four algae are psychrotolerant rather than psychrophilic (Supplement Fig. 1 makes me think otherwise). It might be good to point out the difference in the text.

      4) I don't think Supp. Table 1 (the Pan-algal dataset) got uploaded correctly during the manuscript submission stage. The first link I click on gives me Supp. Table 2.

      Significance

      By far the most significant finding from this paper is that the genomes of Arctic algae are enriched in genes encoding proteins with an ice-binding domain, paralleling findings from Antarctic algae. These genes appear to have spread among Arctic algal genomes via horizontal gene transfer, which raises a series of interesting questions. This is not the first paper to present these types of ideas, but it is arguably the broadest analysis yet, at least with respect to eukaryotic algae. This work will be of great interest to polar scientists, phycologists, protistologists, and the genomics community. I am genome scientist studying protists, including algae.

    1. Consolidated peer review report (23 September 2022)

      GENERAL ASSESSMENT

      In this manuscript, Tiemann, J., et al. take on a large-scale exploration of how mutations associated with disease impact calculated stability and conservation scores across the entire membrane proteome. The aim was to gain mechanistic insight into the causes of pathogenicity of missense mutations of human membrane proteins and verify whether, as is the case for soluble proteins, mutational destabilisation of membrane proteins can explain disease. To do so, the authors use a framework they previously developed, using measures of stability change (ΔΔG) and sequence conservation (ΔΔE, the GEMME score) to predict fitness effects of mutations with large-scale mutational data (Høie et al., 2022).

      By conducting a proteome-wide analysis of missense variants in human membrane proteins, the authors find decisively that pathogenic mutations are heavily enriched within the transmembrane region of membrane proteins. In addition, they report that they can sometimes use their calculated properties to classify residues based on their potential roles in stability or function, and that stability appears to be a major determinant of conservation and likely pathogenicity for GPCRs. 

      The authors thus make meaningful strides towards explaining the clinical impact of variants within membrane proteins, a currently under-characterized yet important category of proteins. The analyses have been conducted in a rigorous way, and the data and protocols are openly available. This work will be of interest to researchers working on membrane proteins as well as those applying computational methods to biophysical systems.

      On the other hand, the choices made by the authors in terms of presentation make the identification of the main conclusions of the paper challenging. In part, this is likely due to fundamental technical challenges associated with calculating biophysical properties for membrane proteins. In addition, although the analysis was performed at the scale of the proteome, due to the decision to only consider X-ray crystallography structures, the number of proteins analyzed is rather small (15). It thus remains unclear how the findings are transferable to other membrane proteins and how robust the comparison between the different functional classes is. 

      RECOMMENDATIONS

      Revisions essential for endorsement:

      1.     The authors are careful with what they claim, to the point where it becomes difficult to interpret the major messages. It appears there are many contributing factors to noise within these assays, resulting in complex figures that make it hard to interpret the data. The goal of presenting the data without overinterpreting it is noble, and the difficulty of digesting and presenting the comparisons in this work should be emphasized, but the complexity of the results made it difficult for reviewers to interpret without more robust processing. Further, we were not always certain how each result fits into the overall argument, which from our reading is whether the performance of predictors for classifying pathogenic mutations based on conservation and stability calculations provides insight into the mechanisms underlying membrane protein disease. Overall, we feel that clarifying the unifying argument of the manuscript and simplifying the figures would greatly improve the comprehensibility of this work. This could be achieved with one of the following approaches, although we leave the final choice to the authors: 

      • The manuscript could attempt to answer the following question: “Can existing methods be used to computationally determine whether pathogenic mutations are due to stability?” It would then explore why this question can or cannot be answered with the current analysis pipeline and existing tools. The answer is likely that the current tools are insufficient and the manuscript would thus point towards a future area of growth to be able to address the question.

      • The manuscript could focus on presenting the dataset. The results would be presented as preliminary examples of the kind of information that can be extracted and the type of analysis that may be done. In this case, claims such as “stability causes x% of pathogenic mutations” should be avoided, and the most important aspect of the manuscript would be that it accompanies a well-curated and openly available dataset, and provides links to it. In that context, the authors should mention whether there are existing curated and/or established databases of (human) membrane proteins, and how the dataset of putative membrane proteins compares with these resources.

      • The manuscript could focus on presenting the “computational approach”, which consists of mapping ddG-ddE, combined with an analysis of the localization of pathogenic (and non-pathogenic) mutations and the types of mutation (conservative, non-conservative etc.). Revisions would be needed to present results as examples of the kind of information this approach may provide.

      • The manuscript could possibly make a clear and compelling case for the idea that mutations of membrane proteins cause disease either because they destabilize the protein or because they occur at sites that are directly involved in function. This would require major revisions of the results and a systematic, clear and robust combined analysis of quadrant-location, protein-region-location, and amino-acid-type substitution.

      Related to the above, it would be useful to clarify in the introduction what is expected from the study upfront: did the authors expect that the picture that would emerge would indeed be the same for membrane proteins as for soluble proteins? Are there different degradation pathways for these two classes of proteins and is a loss of stability expected to have different consequences or not? In the end, the role of destabilization is rationalized in terms of buriedness and amount of physico-chemical change upon mutations. Hence, are the results of the study saying something about the mechanisms of disease variants or simply about the physico-chemical composition and topology of membrane proteins? To answer this point, we suggest contextualizing the study more by expanding on the published literature. This would also clarify that the membrane protein folding field is very far behind the soluble protein folding field, and, as a result, that we cannot expect the methods that work for soluble proteins to work for membrane proteins, or even if methods will mature to the point that they do yield predictive results for membrane proteins. 

      2.     In general, uncertainties need to be better quantified and discussed and statistical tests included. For example:

      • The low correlation of Rosetta estimates of ΔΔG and experimental ΔΔG is 0.47, which means less than 25% of ΔΔG is accounted for by Rosetta. This uncertainty needs to be considered more carefully: it will likely affect the AUC (i.e. is AUC(ΔΔG) < AUC(ΔΔE) because not all mutations are pathogenic due to stability, or is this a mere consequence of the uncertainty of ΔΔG estimates?) and the number of points in the different quadrants (how many of the points in a quadrant are false-positives or false-negatives, etc., and can we guess which they are by using other information such as the protein region, aa-type change, ΔΔE value, etc?). 

      • A variant may fall in the “wrong” ΔΔG-ΔΔE quadrant because of the mentioned (large) ΔΔG error, but also because of ΔΔE errors. This needs to be considered. Some estimate of the ΔΔE error needs to be made (e.g. by bootstrapping the alignment). Even in an ideal case in which ΔΔE is dependent only on ΔΔG, i.e. that both ΔΔG_Rosetta and ΔΔE are estimates of a “true” ΔΔG, not all points would fall in a y = x line in the ddE-ddG plane. How many points would there be in each of the quadrants because of mere estimation errors?

      • As the authors state, quadrant IV has few points. But it also seems that there are more blue points than red points in regions further away from the axes. Could the author comment on this observation? Is there a tendency for the ΔΔG measure to “over predict” pathogenicity ?

      • Within the manuscript the authors widely compare different groupings to drive their narrative. For example, on line 115 the authors discuss the enrichment of pathogenic mutations within the transmembrane domains, which then leads to many subsequent explorations of why TMs may be involved in disease. For this comparison, there is a large and visible significant difference, thus there may not be a need for a statistical test for significance. However, there are many other comparisons that are harder to interpret due to multiple different groupings, complex data representation, and at its core a fundamentally complex study. In these cases, we would like to see more robust statistical tests. For example, on line 184, after breaking up data in 2B based on ΔΔG and ΔΔE cutoffs, the authors write “...only a few variants (14.2%) falling in the quadrant of low ΔΔE and ΔΔG…” – it is unclear what a few means or if this is a significant reduction in variants compared to other quadrants. 

      3.     Regarding the performance of Rosetta to measure ΔΔGs:

      • The authors state that pathogenic mutations causing loss of stability are more often located in the interior of the protein (buried), implying bigger physico-chemical property changes. Isn’t that expected from Rosetta design? Indeed, while the analysis of the distribution of variants among protein regions (buried, etc.) and mutation-type (hydrophobic-to-hydrophobic, etc.) does add additional information to support the hypothesis that in some cases stability loss causes disease, it is important to recognize that this is not completely independent evidence because any ΔΔG predictor should somehow capture the observed patterns. 

      • ROC curves are used to determine how well ΔΔG guides pathogenicity, as a follow up to the observations that pathogenic mutations are enriched in TM regions of membrane proteins. The intuition here is that deleterious mutations within TMs are likely disrupting folding and therefore a ΔΔG-based predictor should do relatively well. However, the authors find that Rosetta-based ΔΔG calculations do not do well in all membrane proteins with benign-like and pathogenic mutations (Figure 2A) and solved crystal structures. In contrast, ΔΔG works quite well when trained solely on GPCRs (Figure 3A). The interpretation of this could be that stability is not a major driver of membrane protein disease – however, in many cases it is, such as Rhodopsin and CFTR. In contrast, another explanation is that Rosetta doesn’t predict stability well for mammalian membrane proteins, and in fact the authors discuss this at length in the limitations of the study section, explaining this is because Rosetta is trained on many bacterial beta barrel membrane proteins. We appreciated this section but would have preferred more of this discussion earlier on as it could aid in understanding why the ΔΔG predictors don’t perform accurately, as presented in Figure 2A. 

      • Could the authors clarify what they mean by “where the Rosetta energy function suggested a potential incompatibility between the experimental structure and the Rosetta energy function”? 

      4.     Regarding ΔΔE, in the present work, there is an implicit assumption that the constraints that operate during evolution of the aligned sequences, across species, as captured by GEMME, are the same constraints that affect the variants within a population, and therefore determine whether a variant will be pathological/non-pathological. This is a major assumption that needs to be spelled out and discussed. Mentioning this will help interpret “misplaced” points of the ΔΔE-ΔΔG map.

      Additional suggestions for the authors to consider:

      1.     The comparison of pathogenic/non-pathogenic mutations should consistently be made across the various sections of the paper. In too many cases in the present version of the paper this comparison is not emphasized. In some cases, the distribution of variants is described, without clearly differentiating pathogenic from non-pathogenic. In other cases, only pathogenic variants are considered, without comparing with the non-pathogenic cases.

      2.     Moving the section on the two specific proteins to the end of results would likely improve the flow of the paper. The A/B x ΔΔE-ΔΔG plane analysis would be presented first, then the A/B x ΔΔE-ΔΔG x “protein regions” analysis, and finally the A/B x ΔΔE-ΔΔG x regions x “aa-type” analysis before ending with examples.

      3.     The choice to restrict the analysis to X-ray crystallography structures from the PDB is not obviously well suited. Indeed, the coverage of membrane proteins by the PDB is rather low, and the authors found that less than 30% of all annotated human membrane proteins have at least some part resolved. One of the potential advantages of the AlphaFold database is to improve this coverage, and the analyses presented by the authors would thus benefit from considering predicted models displaying high confidence values.

      4.     In Figure 2, the authors define two classes of variants in their dataset, group A (pathogenic variants) and group B (benign or non-pathogenic with an allele frequency > 9.9 · 10^-5). Then they tested their models’ ability to distinguish between groups A and B by constructing ROC curves for Rosetta ΔΔG and GEMME ΔΔE. To visualize variant effects and further classify variants, they plotted individual variants along a ΔΔG vs. ΔΔE plot. They then use this plot to further classify variants based on their combined ΔΔG and ΔΔE values. The allele frequency cutoff is so important for generating group B that all downstream analysis is dependent on this. But because these residues are coming from a much more limited set of proteins, we think it would be useful to include a comparison showing that the gnomad allele frequency > 9.9 x 10^-5 cutoff remains informative for differentiating between benign and pathogenic residues.

      5.     In Figure 3, the authors apply their analysis to variants across all GPCRs, as well as just GPCR transmembrane regions. The AUC curves in panel A are much more accurate when applied to just this protein family, as also seen in panel B where variants fall into very clear subpopulations within each quadrant. The illustration and category definitions on the left of panel C are a helpful guide for the discussion of different variant types and their relevance to stability of the protein versus function in a unique way, however the plot on the right of panels C and D is confusing and not immediately intuitive making it difficult to consider comparisons that are discussed within the text. Indeed, the authors state that “Pathogenic variants in GPCRs, especially in the transmembrane region, lose function mostly by loss of stability”. Comparing these two panels, it is concluded that the pathogenic variants that do not lose stability are more often found in the TM regions of GPCRs compared to all datasets. This is somewhat confusing and the numbers supporting this affirmation in Fig 3C seem quite low.

      6.     The authors do not extensively discuss their results in the context of the membrane protein field nor the specific membrane proteins they highlight such as Rhodopsin and GTR1 (Figure 4). For Rhodopsin, at least, there has been extensive work done on its folding by Johnathan Schlebach’s lab and others, including a mutational scan. It could be useful to at least contextualize and contrast results here with previously published work. 

      7.     In Figure 5, the authors consider whether the identities of the starting and mutant residues correlate with their overall quadrants. Panel A is extremely difficult to interpret. We are  also unsure how robust any differences are likely to be, given the uneven sampling and the small number of samples in some of the boxes. Narrowing the comparisons (changed vs. unchanged property, A vs B) would likely improve comprehension and may be more meaningful. Panel B is, on the other hand, a wonderful example of how to clearly display complex, multidimensional data in a comprehensible way. The well-demonstrated association of hydrophobicity and transmembrane stability is beautifully demonstrated directly from the data, and the potential discordance with evolutionary conservation as well. We find this correlation even more striking given that the hydrophobicity scale used here was explicitly determined in the context of transmembrane regions, but the variants are drawn from all regions of the targets. We were curious to know what percentage of these are drawn from the transmembrane vs. soluble regions of the targets.

      REVIEWING TEAM

      Reviewed by:

      Willow Coyote-Maestas Paper Discussion Group, UCSF, USA: membrane proteins; high throughput experimental variant screening; developing assays for measuring how mutations break membrane proteins in order to explore how mutations alter folding, trafficking, and function of membrane proteins (see Appendix for group members).

      Julian Echave, Professor, Universidad Nacional de San Martín, Argentina: theoretical and computational study of biophysical aspects of protein evolution.

      Elodie Laine, Associate Professor, Sorbonne Université, France: development of methods for predicting the effects of missense mutations using evolutionary information extracted from protein sequences and/or structural information coming from molecular dynamics simulations.

      Curated by:

      Lucie Delemotte, KTH Royal Institute of Technology, Sweden

      APPENDIX

      Willow Coyote-Maestas Paper Discussion Group:

      Feedback was generated in a meeting of the journal club involving:

      Willow Coyote-Maestas

      Christian Macdonald

      Donovan Trinidad

      Patrick Rockefeller Grimes

      Matthew Howard

      Arthur Melo

      (This consolidated report is a result of peer review conducted by Biophysics Colab on version 1 of this preprint. Minor corrections and presentational issues have been omitted for brevity.)

    1. Artykuł przedstawia podłoże rozwoju metod rozpoznawania dokumentów oraz wyszukiwania informacji do 1939 roku, czyli do momentu, w którym Vannevar Bush napisał artykuł „As We May Think”, opublikowane potem w 1945 roku.

      Artykuł przekonuje do tego, że pomysł Busha nie był ani tak oryginalny, ani tak rewolucyjny, jak się go przedstawia. Autor przedstawia także stanowiska innych badaczy czy wynalazców, którzy mieli zarzuty względem projektu Memeksu.

      Autor skupia się przede wszystkim na osobie Emanuela Goldberga i jego wynalazku wyszukiwarki mikrofilmów. Przedstawia także powody, które spowodowały, że jego wynalazek był pomijany i zapomniany.

    1. However, we can also stimulate growth by capitalizing on existing strengths.

      I think that this is very important to understand. If you know where or with what your students succeed then you can use those strengths to help them in other areas where they may not feel as confident.

    1. His team already has more findings that support the cerebellum’s contribution to addictive behavior, and in particular to the solidifying of a neurally stimulating behavior such as drug use. Such memory-making may render some individuals more susceptible to addictions. “One of the biggest problems is that those who are addicts [or former addicts] can be weaned from their addiction but if there is a new stress, the person is very susceptible to relapse,” Khodakhah says. “We think the reason is that there is a signature of the memory within the cerebellum. . . . If we understand that better we might be able to provide pharmacological or other therapeutic interventions to help these individuals.”

      orienting statements - I feel like this gives the article a clear end and conclusion which ultimately reveals the direction of the essay

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We are very grateful about the thorough reading and deep understanding of the work that these 4 reviewers have provided. Although it is evident that they still request an improved profiling of some aspects, it is very encouraging that all four think the work is very interesting, original, insightful and adds a new layer of knowledge to the regulation of DNA damage sensing and repair. It is also very rewarding that the four reviewers estimate that this work will sew connections between different fields and interest a broad readership. This is why we have designed here a very deep revision, tailored to satisfy all the raised concerns except one, and this just for technical reasons.

      Please find below the original reviewers’ comments and our answers to them preceded by the symbol “>”:

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): Ovejero et al. report an increase in lipid droplet (LD) abundance after long (from 120' on) exposure of budding yeast cells to DNA damaging agents zeocin and camptothecin (CPT). Next, they analyze DNA damage signaling in yeast mutants that impair triacylglycerol (TAGs) or sterol (STEs) esterification. They observe a slight anticipation in Rad53/CHK2 phosphorylation (indicative of DDR signaling) in yeast stem mutants, as well as in yeast cells or human cells lines pre-treated with oleate upon zeocin treatment. Yeast stem mutants are sensitive to zeocin and captothecin, but only confer sensitivity to hydroxyurea upon combination with tagD mutations. Authors relate these phenotypes to a somewhat decreases DSB resection in yeh2D mutants (expected to have reduced steryl esters pools) and RPA-foci in steD yeast cells. Next, a reduction in single strand annealing recombination repair events upon zeocin treatment is reported using a genetic reporter in steD mutants and oleate-treated cells. From these data they conclude that inability to process sterols in response to DSBs leads to an exacerbated DDR and prevents DNA repair. Next, it is shown that Flag-tagged Tel1 distinctly interacts with mono-phosphate phosphoinositides, including PI(4)P. An interaction in vivo is also inferred through Proximity Ligation Assays (PLA) using anti-PI(4)P and anti-ATM antibodies in human cell lines, which was moderately downregulated upon treatment with MMS or zeocin. Over-expression of the Osh4/OSBP1 transporter, which consumes PI(4)P, increased the number of Tel1 (nuclear) foci upon zeocin treatment. Conversely Sac1 ablation, in which accumulation of PI(4)P is expected, abrogated nuclear Tel1 foci formation and reduced telomere length (a phenotype related to lack of Tel1 function). From these results authors conclude that Tel1 availability in the nucleus is influenced by PI(4)P availability. Lastly, treatment with an OSBP1 inhibitor led to a cell line and damaging agent -variable reduction of ATM phosphorylation and a mostly non-significant reduction of DNA resection, measured by native BrdU detection, in response to CPT treatment. Overall, authors conclude that i) biding of Tel1/ATM to PI(4)P modulates its functional availability in the nucleus, and that ii) DNA damage elicits the esterification and storage of sterols toward LDs, which contributes to tritate Tel1/ATM away from the nucleus dampening the DDR and affecting long-range resection.

      Major comments: While the conclusion that Tel1/ATM binds PI(4)P and this interaction modulates Tel1/ATM functional availability at the nucleus is convincing, the conclusion that DSBs elicit a change in the metabolism of this lipid to "control" Tel1/ATM function is not demonstrated. The notion that sterol processing occurs in response to DSBs is not sufficiently supported by the data presented, as the increase in LD numbers is observed much after activation of the DDR (Rad53 phosphorylation) in Zeozin-treated yeast cells.

      We are afraid that we have not been clear enough in explaining the kinetics giving rise to our model. As indicated by the reviewer, our work shows, through kinetic studies, that the storage of sterols within LD occurs at later stages than the activation of the DDR by Tel1 and Rad53 phosphorylation. Tel1 foci decline is necessary for subsequent engagement of downstream DNA long-range resection. Since we propose that sterol storage within LD is a means to attenuate Tel1 engagement at DSBs, it is thus logical (and thus compatible with the data we show) that LD number increase occurs simultaneously with Tel1 foci decrease, at late stages of the reactionWe will include this explanation and graph in the revised version of the work.

      In addition, evidence is not provided on the mechanisms by which PI(4)P metabolism would be controlled, which would be expected to be DDR-independent as they are placed upstream of this signaling pathway in the author's model.

      The key mechanism through which, in the end, PI(4)P metabolism will be controlled, is the esterification of sterols within LD. Given that, as clarified above, LD formation in response to DSBs occurs “late” (i.e., after 120 min), it is not excluded that the DDR itself can instruct, through phosphorylation of some effector(s), LD formation. In other words, by ordering LD formation, the DDR would be launching a self-limiting mechanism. In support, we now know, although we do not show in this work, that eliminating key DDR proteins prevents the formation of LD in response to DNA damage. Because of this, we have undertaken an educated-guess approach and chosen critical or rate-limiting enzymes in LD biology either possessing an S/T-Q cluster domain (predicted to be a phosphorylation substrate for the DNA Damage Response kinases (1), and/or retrieved in phospho-proteomic screens as specific DDR targets (2,3). This adds up to 28 proteins in S. cerevisiae and 45 proteins in Homo sapiens. Importantly, the emergent candidates fall into two identical categories in both organisms. To provide initial support for their pertinence, we have generated a point mutant in the putative S/T-Q cluster of one of the yeast candidates. Of high relevance, we find that the concerned mutant is impaired in correctly triggering LD formation in response to DNA damage, and we have now obtained a specific funding to pursue this characterization that, as such, constitutes a different work from the one presented in this manuscript. We hope that the reviewer is now convinced yet that she/he agrees in keeping this information for subsequent manuscript(s).

      The damaging agents used have been suggested to alter the redox metabolism and even lipid peroxidation (Kitanovic 2009, Mizumoto 1993, Krol 2015, Todorova 2015, Ren 2019, Singh 2014). Hence it is possible that PI(4)P changes are not due to DSBs, but an indirect though relevant effect. In absence of direct evidence supporting an active regulation of PI(4)P dynamics in response to DNA breaks, this conclusion remains speculative and this should be noted in the manuscript.

      We fully agree with the reviewer that the used genotoxins are triggering a myriad of effects which could elicit the same phenomenon by indirect means. Yet, we want to stress that the use of camptothecin, which elicits a very robust LD formation phenotype (Figure 1C), is very likely specific, as it is proven as a potent and direct trapper of Top1 onto DNA after having cleaved it. Nevertheless, we propose in the next paragraph two specific experiments to dismiss this problem, please see immediately below.

      Authors conclude that LD is specific to DSB induction. This seems an overstatement as they just reported LD increases in response to two agents that also induce other kinds of DNA damage. To also strengthen the link between DSBs and PI(4)P modulation of Tel1 function, authors should analyze LD numbers, Rad53 phosphorylation and Tel1 nuclear re-localization in response to HO-induced DNA breaks (e.g., using the system employed in Figure 3C).

      We humbly think that enzymatically-induced DNA breaks will both activate Rad53 phosphorylation and Tel1 nuclear concentration, as this has already been established, thus requiring no further exploration. Yet, it is very important to assess the reviewer’s suggestion concerning whether enzymatically-induced DNA breaks also trigger the formation of LD. To this end, we will perform two complementary studies in which, instead of using HO, which cuts only a few times in the genome, we will:

      1. a) exploit the naturally DSB-accumulating mutant rad3-102, which we previously characterized in the past (4), and which we already exploit in this work for recombination analyses (Figure S4A), to evaluate whether it endogenously harbors more LD in comparison with the WT.
      2. b) we have recently created a tool in which gRNAs targeted to different subsets of transposons in the genome can drive Cas9 to create DSB in a dose-dependent manner ((9), under revision in Genetics). We will use this system to monitor the LD formation in response to Cas9-triggered cuts. In addition, on figure 5A, significant differences in GFP-Tel1 foci abundance between WT and steD or yeh2D cells are only observed after 210', way after the slight effect on Rad53 phosphorylation is observed. This is at odds with the conclusion that Tel1 association to STEs modulates DDR signaling.

      We are afraid that we have not been clear enough in explaining the kinetics giving rise to our model. As indicated by the reviewer, our work shows, through kinetic studies, that the storage of sterols within LD occurs at later stages than the activation of the DDR by Tel1 and Rad53 phosphorylation. Tel1 foci decline is necessary for subsequent engagement of downstream DNA long-range resection. Since we propose that sterol storage within LD is a means to attenuate Tel1 engagement at DSBs, it is thus logical (and thus compatible with the data we show) that LD number increase occurs simultaneously with Tel1 foci decrease, at late stages of the reactionWe will include this explanation and graph in the revised version of the work.

      Minor comments:

      Figure S1D and E, experiments should be carried out to include time points in which LD accumulation and cell cycle arrest are observed upon zeocin treatment (i.e., up to 210' as in Figure 1A)

      We will provide cytometry profiles of cells at 210 min. These data exist already in our laboratory.

      How do authors explain increased single strand annealing recombination frequencies in steD and oleate-treated wild type cells (Figure 4A). Should it not be expected that increased STEs also impair recombination induced by endogenous damage?

      Only ste∆ (and not +oleate) indeed manifests an increase in basal recombination frequencies, likely arising from endogenous damage. Although the increase is observed, it is not significant. We agree anyway with the reviewer that, was the experiment to be repeated more times, the increase may be found significantly different. We do not have any honest proposal to explain this.

      Data presented in figure 4B and 4C are not fully convincing. Performing time course experiments might help concluding if the differences observed represent a relevant defect in DSB processing.

      We will perform a Pulsed Field Gel Electrophoresis (PFGE) kinetcis in response to zeocin with or without oleate pre-loading to reinforce the conclusion.

      Is Figure 5B referring to Flag-tagged Tel1 or GFP-tagged Tel1 as stated in the figure legend?

      There is a misunderstanding here, as the mentioned Figure 5B corresponds to P-ATM immunofluorescences in human cells, not to any tagged Tel1 experiment.

      Treatment with the ATM inhibitor AZD0156 increased PI(4)P-ATM PLA signals. From these authors conclude that "association of ATM and PI(4)P inversely correlated with the need for ATM within the nucleus. Do they imply that treatment with ATM-inhibitors reduces the requirement for ATM function in the nucleus? The interpretation of this result should be further elaborated to sustain this conclusion.

      We may have conveyed a wrong notion at this point. We do not imply at all that ATM inhibitors reduce the need for ATM in the nucleus. Instead, we imply that, by reinforcing ATM attachment to Golgi-resident PI(4)P, ATM inhibitors end up titrating ATM away from the nucleus. We will clarify our explanation to avoid misunderstandings.

      An increase of GFP-Tel1 foci upon OSH4 overexpression is described on Figure 7B. These are described as nuclear in the results, but no reference is made in the figure or legend as to how nucleus positions are addressed in these experiments. This should be clarified.

      We systematically combine the tagging of a nucleoplasmic protein (mCherry-Pus1) with the detection of GFP-Tel1 foci, as to unambiguously assess the nuclear position of Tel1 foci. We will include this explanation and the corresponding mCherry-Pus1 channel to clarify this.

      Also, WT controls and quantifications should be included in the experiments shown on Figure 7C.

      These experiments are quantified from the moment we did them, although we did not include such quantifications in the present version for the sake of space. We will do so in the revised version.

      Reviewer #1 (Significance (Required)):

      While the conclusion of lipid metabolism responding to DSBs is not convincing, the observation that Tel1/ATM function is modulated by PI(4)P biding is significant and advances the understanding on the function and regulation of this key kinase in promoting genome integrity maintenance. This is an unanticipated result which is highly novel and has implications for the modulation of Tel1/ATM function through pharmacological manipulation of lipid metabolism. This finding would be of broad interest for scientists working on the response to DNA damage and the maintenance of genome integrity. This reviewer belongs to that group and has limited expertise to evaluate the lipid metabolism genetic manipulation in the manuscript.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The authors show that cytoplasmic PI4P have a regulatory role on ATM response to DNA double strand breaks. The process involves a balance between exchange of PI4P between Golgi and ER in exchange of esterified sterols. The study is of interest, however provides indirect evidences to support their conclusions.

      Major comments : 1). Since the major conclusion relates to PI4P association with ATM in basal conditions to keep ATM outside nucleus and known presence of PI4P, ATM in other organelles of a cell, further experiments such as cell fractionation experimental that show golgi specific interaction would support the main conclusion.

      In continuation of 1st comment, since PI4P in substrate of PI4 phosphoinositol kinases, is there a competition between PI4kinases and ATM for PI4P binding should be addressed through immunoprecipitation studies.

      First of all, we need to specify here that PI4kinases will phosphorylated PI4 to create PI(4)P. Thus, PI(4)P is the product, and not the substrate, of PI4kinases. We therefore do not expect any competition between such kinases and ATM.

      Second, we take good note of the reviewer’s concern that the pool of PI(4)P at the Golgi may be shared, and also that it would be important to reinforce the notion of the relative subcellular localization of ATM under different treatments. To this end, we will perform the following integrative experiment:

      Immunoprecipitation of PI(4)P could theoretically be done using our specific antibody, yet the IP efficiency of a lipid cannot be verified by western blot. Further, there are PI(4)P pools elsewhere in the cell that would mess up with interpretations. We therefore dismiss the use of anti-PI(4)P as a tool to perform immunoprecipitations.

      Instead, to explore the impact of PI(4)P levels on ATM both at the Golgi and within the nucleus, we will split our cultures in two to either immunoprecipitate specific cytoplasmic Trans-Golgi Network-associated proteins (we will test separately TGN46 and GOLPH3); or the nuclear ATM-interacting factor MRE11 from nuclei, then blot for co-immunoprecipitated ATM. The relative co-immunoprecipitated ATM is expected to vary under different treatments to which the cells will be exposed, namely:

      • untreated
      • zeocin, to trigger ATM need in the nucleus
      • OSBP inhibition (+/- zeocin), to stabilize PI(4)P at the Golgi
      • PIK93, an inhibitor of PI4 kinases that prevents PI(4)P synthesis

      2). The authors claim that the ATM retention is the main function of PI4P in Golgi. The authors should rule out the possibility that the phenotype observed on DNA damage response is not due to non availability of PI4P substrate for PI4P kinases, that have recently been shown to participate in genome integrity maintenance.

      We want to explain that we do not intend to say that PI(4)P main function at the Golgi is ATM retention, as PI(4)P is a molecule binding and modulating multiple proteins, as for example the aforementioned GOLPH3. We will first revise our text to correct it, in case we have conveyed this incorrect notion, as it stems from the reviewer’s comment.

      Second, the reviewer evokes the notion that PI(4)P can be the substrate of a second phosphorylation, which could give rise to PI(3,4)P or to PI(4,5)P, which could still undergo remodeling into PI(3)P, for example. Recent work by Dr Michael Sheetz’s lab demonstrated that this set of phosphoinositides serves to drive the nucleation and activation of the ATR-Chk1 branch of the DNA Damage Response upon genotoxic stress, yet was completely inert with respect to the ATM-Chk2 branch (5). To rule out the possibility, as evoked by the reviewer, that the oleate-induced DDR phenomena we describe relate to these other events, we have now explored the response of the ATR-Chk1 branch when comparing the response of zeocin-treated cells that have been pre-loaded or not with oleate. We observe that the ATR-Chk1 branch is unaltered by oleate loading. Thus, we can now propose that the PI(4)P branch exclusively modulates the ATM-Chk2 axis.

      3). Does Oleate treatment influences Rad53 protein levels in addition to its phosphorylation that affect DNA damage response may be addressed.

      Exponential cultures from three different WT, three different ste∆ and three different yeh2∆ strains have now been taken and pre-loaded for 2 hours with 0.05% oleate, then total levels of Rad53 (without induction of DNA damage) assessed. We can now formally say that basal levels of Rad53 protein are not altered by this incubation. We will include this control in the revised manuscript.

      4). Does Yeh2 deletion reduces LDS should be checked.

      We frequently use yeh2∆ cells in our studies. In particular, we have recently published work characterizing the phenotype of this strain with respect to the formation of lipid droplets in the nucleus (6). We are currently exploiting those same sets of data to quantify the total number of LD in order to satisfy the reviewer’s concern.

      5). Figure 4D representation should show % of phospho reduction of initial activation and a better western blot image should be shown that show equal loading of samples.

      We are currently repeating these gels and blots for the sake of clarity, as requested.

      6). In immunoprecipitation experiments, kindly include isotypee IgG controls as well to rule out non-specificity.

      Of course, this important control will be included every time.

      Minor points: 1). Figure S1F do not show oleate treatment as presented in results section.

      We will revise the accurate naming.

      2). A better gel for S4B should be presented with ponceau of the same gel.

      We are currently repeating this gel and associated blot for the sake of clarity, as requested.

      3). Nuclear PI4Ps has also been previously reported, an explanation to the specific interaction of ATM and PI4P in the Golgi should be addressed/discussed.

      We take it that the reviewer is referring here to the recent work by Fáberová et al (7) in which PI(4)P and PI(4,5)P were described as very dynamic in the nucleus, and mostly related then to mRNA transcription, splicing and export. We will reinforce the connection of our phenomenon to the Golgi-associated pool of PI(4)P thanks to the co-immunoprecipitation experiments proposed above, and will timely contextualize these in light of the paper by Fáberová and co-workers in the revised version. Thank you for reminding us of this work.

      Reviewer #2 (Significance (Required)):

      The current work definitely adds a layer in our understanding to ATM regulation and cross-talk between different PIKK family of kinases. ATM localisation in extra nuclear regions of a cell has been described earlier with significant impact on cell physiology such as mitochondria etc., ATM retention at golgi and limiting nuclear ATM levels is significant advance at ATM activity regulation, while signifying non canonical function of PI4P.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this manuscript, the authors propose that ATM/Tel1 signaling is regulated in a spatiotemporal manner during genotoxic stress both in yeast and mammalian cells. They show that Lipid droplets accumulate in response to genotoxic stress. As a consequence, there is a decrease of exchange of PI4P from the Golgi to ER, thus dampening ATM/Tel1 signaling by sequestering this kinase into the Golgi. The authors combined findings in yeast and mammals showing that this mechanism is conserved throughout eukaryotes. For this purpose, they use a vast number of techniques that support their proposed model.

      Major comments:

      The conclusions were made based on evidence combining yeast genetics, immunofluorescence, DNA end resection analysis and pharmacological interventions. The hypothesis that ATM is kept away from the nucleus by physically interacting with PI4P at the Golgi, thus allowing processive repair is bold and contributes for a better understanding of the choreography of the DDR kinases during DSB repair. However, many of the experiments in yeast and mammals show only mild phenotypes and there is no evidence that this mode of ATM dampening impact cell viability in mammals.

      We agree with the reviewer that the effects associated to the reported phenomenon are indeed mild. This is a fact. We would like to remind that the metabolism of sterols is finely controlled, and at many different levels, in a very complex manner. For example, sterol increases in the cell will immediately be compensated by reduced synthesis, while synthesis inhibition will immediately promote uptake from the medium, and/or release from stores (for example, see (8)). As a natural consequence, the window of manipulation and, more importantly, the strength of the phenotypes we can uncover are small.

      Therefore, I have some comments and suggestions of experiments that I think could improve the quality of the manuscript. I believe that most of these new experiments does not require much time and resources.

      • Does oleate treatment in RPE-1/Huh-7 cells induce loss of viability? An experiment showing loss of viability like MT-assay or decreased cell proliferation would reinforce the importance of the mechanism proposed.

      This experiment was already included in the previous version, yet it may have escaped the attention of the reviewer. We show in Figure S2E that oleate treatment restricts viability in Huh-7 cells alone, and also worsens their tolerance to zeocin. Perhaps we should reconsider moving this result to the main figures so that it does not go unnoticed.

      • In yeast there is evidence that a ste delta strain show sensitivity to zeocin/CPT, but there is no experiment showing the same effect on cells lacking Yeh2. Since both strains share similar phenotypes, it would be interesting to show that increased kinetics of Rad53 signaling leads to sensitivity to genotoxins.

      We have now performed this experiment, we will include the matching information for yeh2∆ cells, which agrees with the predictions.

      • The conclusion that ste delta cells exposed to zeocin leads to unproductive events due to defects in DNA-end resection could be reinforced by a decrease in Rad52 foci. It has been previously shown by the group of Dr. Marcus Smolka, that inhibition of DNA-end resection decreases Rad52 foci (https://doi.org/10.1083/jcb.201607031). Since the authors were able to monitor Rad52-YFP (Figure S1A), it shouldn't consume time and resources.

      The reviewer is right that this experiment should not be time- or resources-consuming. We will evaluate the accumulation of Rad52 foci in response to the concerned genotoxin in ste∆ cells.

      • Since the authors propose that there is a DNA repair defect due to inhibition of long-range DNA-end resection, it would be important to monitor gamma-H2A(X) signal either in yeast or mammals.

      Taking into consideration the reviewer’s suggestion, we have now performed anti-yH2AX immunofluorescence of all the implied conditions (genotoxins +/- oleate pre-load) and will quantify them to answer the concern.

      • How do the authors exclude the possibility that yeast mutants or oleate treatment in yeast/mammalian cells change membrane permeability allowing an increase in genotoxin concentration?

      Although this is a very reasonable criticism, we want to remind the data we present in Figure S4A in which we use the naturally DSB-bearing rad3-102 cells for recombination analyses, showing that, in the absence of any genotoxin, the same phenotype also applies. Yet, we want to reinforce the notion that LD formation in response to DSB can also occur when the breaks are not chemically, but physically, induced. To this end, and also to match a related request by Reviewer 1, we will:

      1. a) exploit the naturally DSB-accumulating mutant rad3-102 (4) to evaluate whether it endogenously harbors more LD in comparison with the WT.
      2. b) we have recently created a tool in which gRNAs targeted to different subsets of transposons in the genome can drive Cas9 to create DSB in a dose-dependent manner ((9), under revision in Genetics). We will use this system to monitor the LD formation in response to Cas9-triggered cuts. In addition, on figure 5A, significant differences in GFP-Tel1 foci abundance between WT and steD or yeh2D cells are only observed after 210', way after the slight effect on Rad53 phosphorylation is observed. This is at odds with the conclusion that Tel1 association to STEs modulates DDR signaling.

      We are afraid that we have not been clear enough in explaining the kinetics giving rise to our model. As indicated by the reviewer, our work shows, through kinetic studies, that the storage of sterols within LD occurs at later stages than the activation of the DDR by Tel1 and Rad53 phosphorylation. Tel1 foci decline is necessary for subsequent engagement of downstream DNA long-range resection. Since we propose that sterol storage within LD is a means to attenuate Tel1 engagement at DSBs, it is thus logical (and thus compatible with the data we show) that LD number increase occurs simultaneously with Tel1 foci decrease, at late stages of the reactionWe will include this explanation and graph in the revised version of the work.

      • It would be interesting to investigate genetic interactions between ste delta (or yeh2delta) and yeast mutants with DNA-end resection problems (exo1delta; sae2delta). For instance, it has been shown that Sae2 antagonizes checkpoint signaling by competing with Rad9 to DSB sites (https://doi.org/10.1073/pnas.1816539115). Also, cells lacking Sae2 show an increase in Rad53 signaling due to increased Tel1 Signaling. Therefore, an epistatic effect between these two pathways would reinforce the hypothesis of the manuscript.

      we will build the double mutant sae2∆ yeh2∆ and assess the potential epistatic behavior they may display with respect to some key phenotypes (Tel1 foci formation, Rad53 phosphorylation…).

      • The authors showed that Tel1-GFP does not accumulate in the nucleus in cells lacking Sac1 (Figure 7C). Tel1 is important to cope with increased DSBs in the absence of Mec1, thus avoiding genomic instability. Cells lacking both Mec1 and Tel1 show a sick phenotype with an exponential increase in gross chromosomal rearrangements and sensitivity to genotoxins. Therefore, does deletion of Mec1 (and Sml1) in sac1 delta phenocopies a mec1tel1 delta? Alternatively, does pharmacological inhibition of ATR in the presence of the OSBP1 inhibitor causes loss of viability or chromosomal aberrations?

      We will delete SAC1 in mec1∆ sml1∆ and compare the fitness, through growth drop assays, with respect to the mutant tel1∆ mec1∆ sml1∆.

      We will expose cells either to OSBP1 inhibitor, ATR inhibitor, or both, and assess the phosphorylation of their downstream common effector H2AX. Additionally, we will assess the effect on cell growth of the combination of ATRi and OSBP1i using synergy matrices. We will determine if the combination of both drugs synergizes or not to impair cell proliferation and reduce cell viability.

      • Finally, it seems strange to me that ATR/Mec1 signaling is not mentioned throughout the entire manuscript. Does PI4P pathway affect only ATM/Tel1? In Figure 2D, an antibody against phospho-CHK1 could be used to monitor ATR signaling. In line with that, I would like to see in the discussion how these new findings are in line with evidence from a 2019 paper showing that phophoinositides PIP2 and PIP3, but not PI4P are important for ATR signaling (DOI: 10.1038/s41467-017-01805-9). They showed that a nuclear pool of PIP2 increases upon DNA damage induction and rapidly accumulates at DNA lesions. This event is important for the recruitment of ATR. Since PI4P is substrate for PIP2 synthesis and there is a nuclear pool of PI4P and PIP2, I think it is important to discuss if the results presented here are in line with these previous findings.

      The reviewer evokes recent work by Dr Michael Sheetz’s lab demonstrating that a different set of phosphoinositides serves to drive the nucleation and activation of the ATR-Chk1 branch of the DNA Damage Response upon genotoxic stress, yet was completely inert with respect to the ATM-Chk2 branch (5). We have now explored, also to satisfy a similar concerned raised by Reviewer 2, the response of the ATR-Chk1 branch when comparing the response of zeocin-treated cells that have been pre-loaded or not with oleate. We observe that the ATR-Chk1 branch is unaltered by oleate loading. Thus, we can now propose that the PI(4)P branch exclusively modulates the ATM-Chk2 axis.

      We will of course give the needed credit to this work and contextualize our findings accordingly.

      Minor comments:

      • Line 124: The correct is Figure S1E, lower panel and not Figure S1F -Lines 127-128: Figure S2A does not show zeocin treatment

      Both minor mistakes will be corrected.

      Reviewer #3 (Significance (Required)):

      Together, these new findings, if corroborated by others, might be important to open new lines of investigation in basic and translational research regarding human diseases as explored in the discussion section. I believe this paper will attract attention not only from the DDR field but also from other areas of research such as nutrient and lipid signaling both in yeast and mammals. I hope I was able to collaborate in this review, since my main expertise is in the area of DNA damage signaling using budding yeast as an organism model.

      Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      This is a very interesting study where Sara et al. demonstrated a link between lipid metabolism with DNA repair response (DDR). In this study, they have proposed ATM as a novel PI4P-effector. The sterol deposition into lipid droplets impacts the Golgi PI4P level due to lipid exchange machinery facilitated by OSBP1, therefore regulating the cytosolic retention of ATM due to PI4P binding. Although how lipid droplets in the cytosol sense the DNA damage and control the initiation of DDR by regulating ATM is still unclear, this study linked lipid biology/PI signaling to DNA damage repair and showed the evolutionary conservation of PI signaling and DNA repair machinery from yeast to humans. The experiments are well designed, nicely controlled, with a high quality of data presentation. With some improvements, this work could be a very interesting study attracting a broad readership.

      In their model, ATM is PI4P-bound and sequestered inside the cytosol under basal conditions. Upon genotoxic stress, activation of OSBP1 removes PI4P and free PI4P-bound ATM for nuclear translocation of DNA repair. This could also be interpreted as genotoxic stress-induced PIP-kinase activity, where PI4P is processed into PIP2 or PIP3, somehow redirecting ATM into the nucleus to initiate its activation for DDR. Those aspects should be discussed and improved.

      Both Reviewers 2 and 3 have somehow evoked a similar concern. More precisely, the work by Dr Michael Sheetz’s lab demonstrating that a different set of phosphoinositides serves to drive the nucleation and activation of the ATR-Chk1 branch of the DNA Damage Response upon genotoxic stress, yet was completely inert with respect to the ATM-Chk2 branch (5). We have now explored, to satisfy all reviewers’ concerns, the response of the ATR-Chk1 branch when comparing the response of zeocin-treated cells that have been pre-loaded or not with oleate. We observe that the ATR-Chk1 branch is unaltered by oleate loading. Thus, we can now propose that the PI(4)P branch exclusively modulates the ATM-Chk2 axis.

      Additionally, we will of course give the needed credit to this work and contextualize our findings accordingly.

      Upon stress, there is nuclear activation of p53-phosphoinositide (PI) signalosomes and PIP-kinases. Also, there is a significant PIP2 pool inside the nucleus with an involvement in DNA damage repair. Those papers and their relevance to the current study need to be discussed. If ATM is a novel PI4P-effector, there is also nuclear PI4P formation or nuclear PI4P accumulation upon stresses based on recent studies; how the ATM interacts with PIPn in the nucleus upon translocation? A know ATM substrate p53 is PIP2/PIP3 bound in the nucleus based on recent studies. Will ATM prefer to interact with other PIPn-bound proteins in the nucleus or PIPn regulate their interaction needs to be discussed.

      These additional notions are in line with the previous paragraph presented by the reviewer, and our answers too. We will provide a constructive overview of all these ideas in the revised version of the manuscript.

      Major points: 1. The PI4P-ATM complex is supported only by PLA and PIP strips. Need more robust biochemical characterization of the interaction: co-IP, lipid binding, and/or in vitro constitution.

      We agree with the need to perform assays in which PI(4)P is embedded in a bilayer, as to confidently assess whether Tel1 can bind it in that context. We have now performed a pilot experiment in which we have confronted purified FLAG-Tel1 to liposomes harboring PI(4)P. Western blot analysis using anti-FLAG antibody shows the encouraging result that FLAG-Tel1 can be found there. As a control, we have performed the same process but in the absence of any liposomes. We observe that a residual fraction of FLAG-Tel1 can nevertheless be found in this control, most probably because the buffer used during the liposome assay makes part of FLAG-Tel1 precipitate.To avoid this type of background and to increase our trust in the results, we propose to perform the liposome assay but on a discontinuous density gradient, so that liposomes will be retrieved in the top layer (and bound FLAG-Tel1 with them, if that is the case), while any precipitated FLAG-Tel1 will be in the bottom fraction (liposome floatation assay). As a further control, we will include the same liposomes but lacking PI(4)P. We expect to be successful in the floatation assays. If we are not, we will repeat the experiment presented above to be confident that the observed increase is reproducible.

      1. The use of drug inhibitors only in the final figure is problematic. KD or KO experiments should be performed to confirm that ATM and the exchanger are the relevant targets.

      We have now used siRNAs against the exchanger protein, OSBP1, with a very high silencing rate success. We have next monitored the activation status of the chromatin-associated ATM target KAP1, in order to monitor the predicted decrease of ATM activity specifically inside the nucleus. Our results confirm the role of OSBP1, by KD experiments as requested by the reviewer, in attenuating ATM nuclear participation.

      1. Poor quality of some WBs (e.g Fig. S1F).

      We have now repeated the Western Blot to detect Rad53-P in response to 20 mM HU in WT versus ste∆cells.

      1. Lack of statistical analyses for some data (e.g. Fig. 1B-E)

      We had already included, in the previous version, the complete statistical analyses corresponding to Figures 1B to E and evoked here by the reviewer. They were indeed included in Figure S1C, and our brief reference to them in the text may have escaped her/his attention. We will make a clear reference to this in the revised version.

      Additional clarification points:

      Figure 1: No representative images were shown for quantifications in Figure 1C, D, E.

      If the reviewer / editor estimates it pertinent, we can of course include them. Yet, they will be very redundant with the images displayed in Figure 1A.

      Line 121: Should be Figure S1E, upper panel. Line 124: Should be Figure S1E, lower panel. Figure 2D-E, please show the quantification of the ratio of pCHK2/CHK2 with an N=3

      We will correct / include the requested changes.

      Figure S2B: needs quantification of NileRed staining to conclude induction in LD formation

      We will quantify the LD as requested.

      Figure 3C, to show the selectivity of ATM-binding toward PI4P, PLA of ATM with other PIPn species should be assessed, such as PI3P, PI4,5P2, and PI3,4,5P3.

      We have provided an overview of the binding preferences of ATM with respect to the full battery of phosphoinositides in the strip-binding assay shown in Figures S5C and 6B. Other than that, we are afraid that PLA studies as the ones we develop in the current manuscript for PI(4)P are not feasible, since no reliable antibodies exist for most of the phosphoinositide species evoked by the reviewer.

      Figure S6A, PI4P level could be assessed by IF staining using PI4P antibody besides using PI4P sensor.

      We will use our PI(4)P antibody to monitor by immunofluorescence the behavior of this molecule in response to either MMS or zeocin, as suggested.

      References

      1. Cheung HC, San Lucas FA, Hicks S, Chang K, Bertuch AA, Ribes-Zamora A. An S/T-Q cluster domain census unveils new putative targets under Tel1/Mec1 control. BMC Genomics. 2012;
      2. Bensimon A, Schmidt A, Ziv Y, Elkon R, Wang SY, Chen DJ, et al. ATM-dependent and -independent dynamics of the nuclear phosphoproteome after DNA damage. Sci Signal. 2010;
      3. BastosdeOliveira FM, Kim D, Cussiol JR, Das J, Jeong MC, Doerfler L, et al. Phosphoproteomics Reveals Distinct Modes of Mec1/ATR Signaling during DNA Replication. Mol Cell. 2015;
      4. Moriel-Carretero M, Aguilera A. A Postincision-Deficient TFIIH Causes Replication Fork Breakage and Uncovers Alternative Rad51- or Pol32-Mediated Restart Mechanisms. Mol Cell. 2010;37(5):690–701.
      5. Wang YH, Hariharan A, Bastianello G, Toyama Y, Shivashankar G V., Foiani M, et al. DNA damage causes rapid accumulation of phosphoinositides for ATR signaling. Nat Commun. 2017;
      6. Kumanski S, Forey R, Cazevieille C, Moriel-Carretero M. Nuclear Lipid Droplet Birth during Replicative Stress. Cells. 2022;11(1390).
      7. Fáberová V, Kalasová I, Krausová A, Hozák P. Super-Resolution Localisation of Nuclear PI(4)P and Identification of Its Interacting Proteome. Cells. 2020;9(5):1–17.
      8. Luo J, Yang H, Song BL. Mechanisms and regulation of cholesterol homeostasis. Nat Rev Mol Cell Biol [Internet]. 2020;21(4):225–45. Available from: http://dx.doi.org/10.1038/s41580-019-0190-7
      9. Coiffard J, Santt O, Kumanski S, Pardo B, Moriel-Carretero M. A CRISPR-Cas9-based system for the dose-dependent study of 4 DNA double strand breaks sensing and repair 5 6. bioRxiv [Internet]. 2021;1–37. Available from: https://doi.org/10.1101/2021.10.21.465387.
    1. gammon

      I want to focus this annotation on one word in particular, that at first I overlooked: gammon. Taken at face value, gammon is a British term for a smoked or cured ham. Thus, it would be easy to not think much of the line: “Well, that Sunday Albert was home, they had a hot gammon, / And they asked me in to dinner, to get the beauty of it hot.” However, gammon also has two other definitions: 1. To defeat an opponent in backgammon, another board game which shares similarities to chess. and 2. To hoax or deceive. First, alluding to backgammon within “A Game of Chess” provides interesting parallels and reflections on what it means to be within a game. From Middleton’s play, we know that chess is strongly affiliated with seduction and lust. While this may be a stretch, I believe that backgammon acts as a contrast to chess as a representation of what society was before the War and deterioration of creativity and individualism that Eliot constantly references within The Wasteland. Chess is the younger of the two, and has a belligerent connotation (possibly in reference to The Great War), in comparison to the meditative nature of backgammon. To be engaged in chess is a cerebral battle, and in Eliot’s mind, England is losing. Moreover, in Sukhbir Singh’s journal article, “Gloss on "Gammon" in "The Waste Land", II, Line 166”, he mentions the importance of the characterization of the gammon as “hot.” Singh deems the gammon aphrodisiac, and believes that “hot” refers to the Duke’s “flaming appetite” and “hot lust.” Singh’s opinion fits nicely into the second alternate definition of gammon, which is to hoax or deceit. In this case, the unnamed woman in the poem has most likely fallen into her “flaming appetite” and participated in an affair. Singh believes that this lack of love is Eliot’s reflection of societal deterioration.

    1. Yet when all this is admitted I still feel that the considerations which I have urged should have a wide influence upon the type of psychology which is to be developed in the future. What we need to do is to start work upon psychology, making behavior, not consciousness, the objective point of our attack. Certainly there are enough problems in the control of behavior to keep us all working many lifetimes without ever allowing us time to think of consciousness an sich. Once launched in the undertaking, we will find ourselves in a short time as far divorced from an introspective psychology as the psychology of the present time is divorced from faculty psychology

      Watson acknowledges some arguments that may come from his views, but still believes psychology to stop focusing on consciousness but rather on the behavior.

    2. But on the other hand, since it does respond to thermal, tactual and organic stimuli, its conscious content must be made up largely of these sensations; and we usually add, to protect ourselves against the reproach of being anthropomorphic, 'if it has any consciousness'. Surely this doctrine which calls for an anological interpretation of all behavior data may be shown to be false: the position that the standing of an observation upon behavior is determined by its fruitfulness in yielding results which are interpretable only in the narrow realm of (really human) consciousness

      Anthropomorphism is when you think about animals or objects as if they were human. For instance, pet owners might observe human-like qualities in their pets, believing that their pet is experiencing an emotional state similar to what a human feels. https://psychcentral.com/health/why-do-we-anthropomorphize#anthropomorphism

    1. Learning (defined as actionable knowledge) can reside outside of ourselves (within an organization or a database), is focused on connecting specialized information sets

      I haven't really explored this idea before, but it makes complete sense! When we take time to reflect on what we already know, what we have just learned, and ask questions about what else these ideas may relate to, we get the big picture. I think this idea could apply to connecting ideas and it could also be about connecting people (like in out Twitter chats) so that we have more resources or better support to continue the learning process.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      In our work, we quantified the abundance and positions of major kinetochore proteins within the metaphase kinetochore in budding yeast using single-molecule localization microscopy. Based on these measures, we revised the current model of the kinetochore and provided a nanoscale view of the complex.

      We now revised our manuscript according to reviewers’ points. We performed new analyses to quantify the measurement errors and to justify our data analysis workflows. We further exploited the correlation-based analysis and found a correlation between the spreads of kinetochore proteins perpendicular to the spindle axis and their positions along the axis. We also discussed the potential non-centromeric pools and revised our model of the kinetochore. Further information on our analyses was now provided to improve the clarity. Changes to the text were implemented to better reflect our data. Information from relevant works was incorporated to better connect this work to the field.

      We thank the reviewers for their points, which help us show the rigorousness of our analyses, further demonstrate the potential of our work, and improve clarity.

      2. Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The authors have developed a rigorous methodology for using single-molecule imaging of exogenously labeled kinetochore proteins to count and estimate their copy numbers and the average distance from the kinetochore protein Spc105. Although the method is technically sound, its application to the kinetochore raises some crucial questions below. My biggest concern is the effect of non-centromeric pools of the centromeric proteins Cse4, Cep3, and Ctf19 on the estimated copy number per kinetochore. The authors should be able to address most, if not all, questions by presenting a more in-depth data analysis.

      Major points

      1. Accounting for tilt of the yeast spindle relative to the image plane: It is not clear to me how the authors ascertain whether the spindle being imaged is nearly parallel to the image plane. In the companion fission yeast study, spindle poles are used for this purpose, but this study seems to rely only on the labeled kinetochore proteins. The criteria used to select the in-plane spindles should be clearly defined.

      We thank the reviewer for pointing this out. We selected the in-plane spindles based on their average PSF size, which informs the z positions of the center of the kinetochore cluster (for simplicity, now all ’half-spindle’ was changed to ‘kinetochore cluster’). To calibrate the z position of kinetochore clusters, we first measured the width of the kinetochore cluster by fitting a cylindrical distribution. Overall, the kinetochores are likely symmetrically distributed around the spindle axes. Therefore, the height and the width of a kinetochore cluster should be the same. We then calibrated the z positions of the PSF size based on fluorescent bead data. Next, we plugged in the cylindrical distribution to the calibration curve to correlate the mean PSF size and position of the kinetochore cluster. We only took the kinetochore clusters with a mean PSF size

      1. The effects of PSF depth on counting kinetochore proteins: The authors use a well-characterized nuclear pore protein as the reference to estimate kinetochore protein counts per half-spindle. Although this method appears rigorous in principle, I am unsure about the effect of the spatial distribution of kinetochores on the accuracy of the estimated number. Nuclear pore proteins are all localized within an 100 nm away from the focal plane even when the spindle is perfectly parallel to the focal plane. A discussion of this possibility, its effect on the protein count/distance estimates, and any mitigating factors is essential to highlight the caveats associated with the conclusions.

      Based on the cylindrical distribution (see please the reply to point 1) of kinetochore clusters and their positions in z, we calculated the upper and lower boundaries of the distribution of kinetochore proteins in z, given a specific mean PSF size cutoff of a kinetochore cluster. Regardless of how stringent the cutoff is (130 and 135 nm), we made sure the boundaries do not exceed the imaging depth defined by our choice of the PSF size filtering (

      1. Presentation of the cross-correlation analysis: The authors use cross-correlation for an unbiased calculation of the axial separation between a protein of interest and Cse4, but I am curious about the structure of the underlying data, and the intensity image in Figure 1 is not easy to examine. It will be helpful to include more analysis of the underlying data for at least a subset of the proteins (e.g., proteins at short, intermediate, and long distances from Cse4) as supplementary data.

      2. The authors should include X and Y projections of the cross-correlation function.

      3. Do the widths of cross-correlation functions (i.e., their spread perpendicular to the spindle axis) match across all proteins and experiments? This should be an almost invariant characteristic of the measurements, assuming that proteins within each kinetochore tightly cluster around the 25 nm microtubule. This line of thinking makes the large width of the cross-correlation shown in Figure 1 somewhat surprising.

      4. It will also be interesting to test if the correlation between the positions of Spc105 molecules, especially perpendicular to the spindle axis, is comparable to the known separations between adjacent microtubules in the yeast spindle (the authors could use Winey et al. 1995 for serial-section EM of yeast spindles for comparison).

      The reviewer is interested in the spread, or the size of the distribution, of a protein in a kinetochore along and perpendicular to the spindle axis. This is an interesting idea and can be done practically. However, the information can be more easily obtained based on auto-correlation instead of cross-correlation, due to its better signal-to-noise ratio along the dimension perpendicular to the spindle axis. Cross-correlations in that dimension are convoluted with background localizations and different localization precisions of the two channels. These factors are hard to interpret and disentangled. In auto-correlations, although the background is still present, it can be modeled and then removed easily, as now mentioned on page 15 lines 500-516.

      Accordingly, we performed auto-correlation analysis on all the proteins and compared them to simulations representing different sizes. We find that the size of the distribution correlates to the position of the protein along the spindle axis. The results are now included as the new Fig. S5 and discussed on page 6 lines 169-176.

      The cross-correlation analysis was based on only the position of the maximum value, not the projections. To keep the figure concise, we decided not to include the projections. However, the auto-correlation analysis was indeed based on projections, which we now included in Fig. S5.

      Regarding the correlation between the positions of Spc105 molecules, we believe the reviewer actually refers to the correlation between the positions of kinetochores. Auto-/cross-correlations contain the information of the cluster sizes, based on the first peak (as shown in Fig. S5), and the relative distance (if the pattern is periodic). Unfortunately, the positions of kinetochores perpendicular to the spindle axis are not periodically distributed. Therefore, we cannot comment on the separations between adjacent microtubules.

      1. Cse4 count (4 per kinetochore) and the model presented: One of the surprising conclusions of the study is that there are two nucleosomes associated with each microtubule attachment, with Mif2/CENP-C potentially interacting with both nucleosomes. There are two critical issues that the authors must consider.

      (1) Fluorescent protein chimeras of Cse4 and CBF3 and COMA complex members do not exclusively localize to kinetochores. Biochemical studies show that both Cse4 and CBF3 proteins interact with non-centromeric DNA, e.g., see work from the Biggins lab regarding Cse4 over-expression and also from the Henikoff group that used ChIP-seq. I can't think of a similar reference for the CBF3 complex, but the DNA-binding proteins are also likely to interact with other parts of the genome. The non-centromeric protein is visible as a significant background fluorescence in wide-field microscopy, e.g., see Cep3 localization here: https://images.yeastrc.org/imagerepo/viewExperiment.do?id=202308&experimentGroupOffset=3&experimentOffset=0&experimentGroupSize=3

      Similar background fluorescence can be detected for Cse4 and Ctf19. This extra-centromeric localization of Cse4, Cep3, and Ctf19 makes it possible that the protein counts included by the authors are "contaminated" to some extent by the extra-centromeric protein. The authors should discuss this possibility and how it might affect their counts.

      After consideration, we agree with the reviewer that, specifically, a fraction of counted Cse4 molecules should be considered non-centromeric. We agree that the previous data is certainly sufficient to conclude it. The reviewer made a similar suggestion about COMA and CBF3 subcomplexes. In recent years a substantial portion of inner kinetochore components has been reconstituted. In Harrison et al. 2019, the Ctf19 complex structure has been solved. Two copies of the complex were observed. Therefore, the non-centromeric pool of COMA is certainly possible and we now made the adjustments to the text (page 8, lines 219-225) and Fig. 4. Accordingly, we now also modified the abstract (page 1, lines 26-27) and restructured the sections (page 10) to accommodate the different possibility of Cse4 copy numbers. While, fluorescence imaging of CBF3 presents a signal throughout the nuclear region we observed only four copies of Cep3 (part of CBF3). A CBF3 structure also has been resolved by Yan et al. 2018, in which the complex was proposed to exist as a dimer. This translates into four copies of Cep3. Therefore, we find it more suitable to leave all observed Cep3 (CBF3) molecules within a kinetochore model.

      (2) The model drawn in Figure 4 makes explicit assumptions about the positioning of the four Cse4 molecules (or two nucleosomes) in each kinetochore relative to the rest of the kinetochore components. Yet, the data shown do not justify this specific arrangement. Lawrimore et al. 2011 claim that the non-centromeric Cse4 nucleosomes must be randomly distributed in the pericentromeric chromatin to evade detection in biochemical tests. Therefore, the nearest-neighbor analysis suggested above will be valuable for gaining new insights into the relative positioning of the centromeric- and non-centromeric Cse4 nucleosomes. A similar analysis for Cep3 and Ctf19 will also be helpful. If stereotypical positioning of these molecules cannot be detected, then the model should be revised accordingly (alternative models that are also consistent with the data can be included).

      The reviewer has pointed out that Lawrimore et al. 2011 proposed and justified the existence of a non-centromeric Cse4 pool. This arrangement, also potentially along other inner kinetochore components, makes sense and our data did not indicate it otherwise. Therefore, we now revised our model accordingly by applying changes in the main text on page 10 lines 302-305 __as well as in __Fig. 4.

      (3) I suggest one experiment that can help the authors better understand protein organization in one kinetochore. Joglekar et al. 2006 used a dicentric chromosome to isolate single kinetochores on the spindle axis to test the assumption that each kinetochore consists of approximately the same number of molecules of kinetochore proteins. The strains are easy to construct (transform existing strains with a linearized plasmid). Single kinetochores can be seen with a low but reasonable frequency. I leave the decision to perform the experiment to the authors' discretion depending on whether the experiment will be worth the effort in strengthening or enhancing their conclusions.

      We performed the suggested experiment using the strain published in Joglekar et al. 2006 (kindly provided by Prof. Kerry Bloom) with Cse4 additionally tagged with mMaple. However, we always observed several super-resolved Cse4 clusters (likely of several kinetochores) overlapping with Nuf2-GFP diffraction-limited signal, therefore unable to assign a single isolated kinetochore to the lagging centromere.

      1. Information regarding the degree of correction applied to calculate protein count per half-spindle: It will be helpful to include data regarding the degree of correction applied to the expected and measured numbers of NPC protein as supplementary data so that the readers can see the magnitude of this correction relative to the measured counts.

      We would like to clarify that we did not correct the data. Instead, we calibrate the copy number, given that the copy number of Nup188 per NPC is known. We assume the same ratio between localization and copy number applies to both Nup188 and the kinetochore proteins. We now include a new Table S4 listing calibration factors of all experiments shown in Fig. 3.

      Minor points:

      1. McIntosh et al. JCB 2013 used microtubule plus-ends in serial section electron micrographs of yeast spindles to align the centromeric region and found a disk-shaped structure that roughly corresponds to the size of a single nucleosome ~ 80 nm away from the tip of the microtubule and centered the microtubule axis. The authors should refer to this finding in their discussion of the model that they present with two nucleosomes. In my opinion, this is compelling evidence for a nucleosome-like structure serving as the kinetochore foundation.

      We agree with this reviewer's comment. The study, among others, present compelling evidence for a point-centromere. We now included the finding in the discussion on page 10, lines 293-294.

      1. As discussed by the authors, the number of Cse4 molecules per kinetochore has been the subject of some controversy. Biochemical data from the Biggins group and ChIPseq data from the Westermann group (Altunkaya et al. 2016 Current Biology) strongly suggest that Cse4 molecules can only be found centered on the centromeric sequence. The latter reference should be included in the discussion.

      Thank you for pointing this out. Indeed, this is important. We have now added the relevant reference in the discussion on__ page 10 lines 291-292__.

      1. Although microscopy-based methods have estimated anywhere from 1, 2, to 6 Cse4 molecules per kinetochore, these studies generally agree on the stoichiometry between Cse4 and the rest of the kinetochore proteins, e.g., Ndc80 complex proteins are ~ 4-fold more abundant that Cse4, etc. The present study seems to disagree with protein stoichiometry. The authors may find it worthwhile to note this feature of their data.

      We now discuss the stoichiometry difference between our results and others on page 11 lines 322-324.

      1. Omission of the Dam1 complex from this study is disappointing to me personally, but I am sure that the authors have good reasons for this. They should briefly comment on the absence of the Dam1 complex in this study.

      To provide information on the Dam1 complex, we imaged Ask1, a component of the complex. The measured positioning and copy number of the protein are now included in Fig. 2 and Fig. 3 respectively, and described and discussed in respective parts of the manuscript.

      Reviewer #1 (Significance (Required)):

      Cieslinski and colleagues present a single-molecule localization-based study to define the copy numbers and relative organization of kinetochore proteins in budding yeast. These numbers confirm and significantly refine prior measurements of the same aspects of the kinetochore. They also raise new questions and point to new research directions. The measurements also reveal a model of the protein organization of the budding yeast kinetochore in metaphase. For these reasons, the manuscript is of significant interest to the cell division field.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this study, Cielinski and colleagues have applied single molecule localization microscopy to map the positions of proteins in the yeast kinetochore. This has not been reported previously and this study is both well-conducted and the data appear solid. They also use a modification of this technique to assess the stoichiometry of kinetochore proteins. The results that they obtain are broadly in line with several previous studies that use other methodology. There may be an improvement in accuracy using this new approach that has not been obtained previously and there are some important novel conclusions from this work. I would like the authors to address the following concerns prior to publication:

      Major points

      1. One interesting finding is that there is a discrepancy in the length of both the MIND and NDC80 complexes (from crystallographic data) with their relative positions. The authors suggest that the outer complexes could be twisted or rotated in respect of the spindle axis. It would be great if the authors could illustrate this in their model (or discuss it in the text), to demonstrate the required angle of twist/rotation of both complexes to account for the discrepancy. A twisted filament structure to the outer kinetochore does have some implications for its response to tension - a key determinant of kinetochore-microtubule attachment. It also may provide some flexibility to the structure under tension.

      The discussion about this discrepancy has now been incorporated in the main text, page 9 lines 263-267. For clarity, we only partially reflect this in our schematic model (Fig. 4A; the MIND complex) but we already reflected this in the illustrative structural model in Fig. 4B.

      1. For the experiment with cycloheximide, the authors state "Although we observed minor changes in copy numbers, the overall effect of CHX was small." For some proteins, Cse4i for example, there appears to be a significant decrease in intensity (30-40%) after cycloheximide treatment, see Figure S3. While the conclusion that tag maturation does not affect copy number measurements is sound, I suggest modifying this section to reflect the data.

      We now modified the section accordingly by pointing out that Cse4i under CHX measurements led to reduction of the signal. The modification can be found on page 8 lines 207-211.

      1. Page 5. The statement "These data agree reasonably well with previous diffraction-limited dual-color microscopy studies ..." provides readers with little ability to compare the data. I would like to see a supplementary figure comparing these new data with previous studies, especially those of Joglekar et al 2009, see Figure 3 in this paper.

      We thank the reviewer for suggesting such a table. This will allow readers a direct comparison of the data between our study and Joglekar at al. 2009. The comparison can be found in new Table S1 __and __Fig. S4, which are now mentioned on page 5.

      1. In terms of the distances quoted, are they in one dimension (as per Jogelkar et al 2009) or in three? The results section is entitled "...positions of kinetochore proteins along the metaphase spindle axis", which suggests a single dimension. Please make this very clear in the results section. In the discussion, is the statement "we mapped the relative positions of 15 kinetochore proteins along the kinetochore axis", which is not entirely clear. It seems from the methods that this is one dimension "...we determined the average distance between the two proteins along the spindle axis. “I suggest clarifying the results section briefly and clearly to indicate that this is a single dimension being measured and also using consistent wording of the axis measured throughout the text.

      We agree the previous description may not be clear to the viewers. We now changed the text accordingly in the results section, page 5 lines 129-130.

      Minor points:

      Abstract: I would drop "all" from "For all major kinetochore proteins...", since full characterisation was performed on 14 proteins (9 in terms of copy number).

      We now deleted “all” in the abstract as the reviewer suggested__.__

      Page 2: "trough" to through.

      Corrected.

      Page 2 "S. cerevisiae" to italics

      Corrected.

      Methods p11. How do the MKY strains relate to common yeast genetic backgrounds? (e.g. are they S288C?).

      MKY strains are derivative of S288C. The information was now updated in the Methods section and in Table S2.

      Reviewer #2 (Significance (Required)):

      This manuscript, together with an accompanying one from Virat et al., are nice complementary studies that provide the first single molecule localization studies of the yeast kinetochore. Although other labs have used super-resolution methods to study individual kinetochore proteins; both of these new studies map distances between many proteins at the kinetochore and thus are able to produce maps of the overall kinetochore structure. Like the previous study using standard resolution methods (Joglekar et al, 2009. Current Biology 19, 694-699); these studies will likely provide a benchmark for future studies on eukaryotic kinetochore architecture, including those in mammalian systems. Additionally, this work will appeal to super-resolution microscopists.

      My expertise is as a yeast kinetochore cell biologist.

    1. Author Response

      Reviewer #1 (Public Review):

      I'm curious about whether the microscopy provided any information about when secretory vesicles leave the TGN. Do they leave throughout the lifetime of a TGN structure, or do they leave in a burst when a TGN structure disperses as marked by loss of Sec7? This information might take us a step closer to understanding how secretory vesicles are made.

      Given the limitations of our current imaging set-up with regards to high-speed 3D two-color microscopy, we were unable to capture a large number of these events and therefore cannot make concrete statements about this, however, the quantified events did not appear to be preceded or followed by additional events, suggesting some temporal separation.

      Reviewer #2 (Public Review):

      The authors are encouraged to integrate their data together better with published biochemistry and structural work into more complete mechanisms for vesicle trafficking, tethering and fusion. The manuscript would be improved by a clearer model(s) of how these factors come together to carry out exocytosis.

      This suggestion has been addressed by the addition of a new model figure (Figure 9).

      Moreover, many conclusions (especially as they appear in the Results and Figures) are written as if they are well supported by the data (or others' data), when they are often speculative, or reasonable alternative explanations exist. The authors should be clear about which conclusions are well supported, and which are hypotheses. (e.g. Fig 6I, which is a terrific figure, but some of the "conclusions/statements" are speculations).

      We have made textual changes to make clearer distinctions between conclusions that are supported by the data, and which are more speculative.

      The mechanistic and experimental definitions for the start/end of "tethering" and "fusion" are not clearly stated in the main text, which leads to confusion when examining the arrival of different factors (and seems to lead to circular arguments about what is defining what). Are these definitions well supported by the previously published and current data? E.g. is the disappearance of GFP-Sec4 really equal to the fusion event? Without data showing membrane-merger or content delivery, this needs to be described as an assumption that is being made.

      Early in the results, we now define precisely what we interpret as the start of tethering and time of fusion. Unfortunately, thus far, all attempts at designing a cargo marker suitable for defining membrane fusion have not succeeded, however, we believe the observations in Figure 4 strongly support assumption that loss of GFP-Sec4 signal coincides with fusion.

      The Sro7 results and conclusions are complicated, and not always carefully supported, for several reasons: there is a functionally redundant paralog Sro77, and data shows Sro7 can bind to Sec4, Sec9 and Exo84 in exocyst (Brennwald, Novick and Guo labs). The authors should be clearer, as they seem to pick and choose which interactions they think are relevant for different observations.

      We did not intend to “pick-and-choose” relevant interactions and now more clearly state what our Sro7 results mean.

      The assumption that yeast Sec1 behaves similarly to other Sec1/Munc18 proteins for "templating" SNARE complex assembly, e.g. Vps33 in Baker et al, is unlikely, given the binding studies from a number of labs (Carr, McNew, Jantti). Furthermore, the evidence for Sec1 interaction with exocyst suggests that they may work together (Novick, Munson labs). Previous data from the Guo lab (Yue et al 2017) and new BioRxiv data from the Munson/Yoon labs suggest that exocyst may play key roles in SNARE complex assembly and fusion.

      We did not mean to imply that the exocyst does not play a meaningful and critical role in SNARE complex assembly and fusion. This was an unintentional omission, which we have now addressed in the text. Our interpretation of the published meaning of SM-protein “templating” is that SM’s facilitate the alignment of the critical zero-layer ionic residues in the SNARE motifs, which may be possible regardless of affinity to single SNARE motifs. Indeed, for Sec1 specifically, it may be possible that this exact function is of lower importance relative to, perhaps, the stabilization and protection of trans-SNARE complexes prior to membrane fusion. Future studies may clarify this.

      There is concern that the number of molecules of each of the factors measured is accurate, and how the authors really know that they are visualizing single vesicle events (especially with data showing that "hot-spots" may exist). For example, why is the number of molecules of exocyst is ~double or more than that previously observed (Picco et al; Ahmed et al with mammalian exocyst).

      Estimating the numbers of molecules is subject to some variation due to fluorescent tags used and to some extent where the protein is tagged. Since different tags were used in the earlier studies, being within a factor of two is not that surprising.

      For puncta of exocyst subunits in the mother or moving towards the plasma membrane, what is the evidence that they are actually on vesicles? The clearest argument seems to be the velocity at which they move, but this could be due to the direct interaction of exocyst with the myosin (which is a tighter interaction in vitro than exocyst-Sec4 binding), rather than being on vesicles. Furthermore, do all the exocyst complexes in the cell show this behavior, or could these be newly synthesized/assembled complexes?

      Transport of the exocyst by myosin alone without a vesicle seems very unlikely, as this myosin V needs to be activated by binding vesicle-associated Sec4 (Donovan et al., 2012, 2015). Moreover, transport of just two exocyst complexes by a myosin dimer would be very hard to detect. Nonetheless, we have added an additional supplementary figure (Figure 1 Supplement 5C) illustrating a clear example of exocyst complex colocalization with a secretory vesicle in the mother cell which we hope will quell fears that the exocyst complex is indeed on secretory vesicles, albeit in small numbers, during this stage of transport.

      With regard to the exocyst octamer leaving at the time of "fusion," the authors should discuss Ahmed et al.'s finding of Sec3 leaving prematurely in mammalian cells, as well as data from the Toomre lab.

      We did reference this earlier work in mammalian cells and indicate that it differs from the situation in yeast. We don't have anything insightful to be drawn from these differences.

      Reviewer #3 (Public Review):

      In this context, it is notable that dual-channel imaging appears to be made by sequential, not simultaneous, acquisition, which deserves a currently missing comment. Moreover, given the weight that image acquisition plays in this project, it might be described and justified better.

      As noted above, we have expanded our description of the microscopy. We took two-color images sequentially as our microscope is not configured with a beam-splitter for simultaneous imaging.

      This referee could not fully understand the routine of image acquisition, specifically, the continuous movement of the stage in the Z-axis as images are streamed (to the RAM or to the disk? the latter takes time, line 177); does it mean that Z-stepping is solely governed by the exposure time? The CCD camera penalizes pixel size (16 µm) at the expense of achieving outstanding quantum efficiency. The optical path includes a 100x objective and a 2x magnification lens to compensate for the large camera pixel size, thereby achieving 0.085 µm/pixel, but these lenses 'waste' part of the fluorescent signal. One wonders if the CMOS camera (6.5 µm pixel size) coupled with a 63x objective wouldn't be appropriate? A brief discussion on this choice would be helpful for readers.

      We now discuss the microscopy in more detail and why we use an EMCCD rather than aCMOS camera.

      It is remarkable that Sec2 and Sec4 are recruited to membranes even before a vesicle is formed (Fig 6I). I find somewhat weak the evidence that RAB11s 'mark' the TGN, and disturbing the fact that RAB11 reaches the PM (does GFP tagging prevent GAP accession?). I should like to recommend strongly that the authors integrate into the introduction/discussion information on the late steps of exocytosis available for Aspergillus nidulans, another ascomycete that is particularly well suited for studying this process. Here RAB11 is not a late Golgi resident but is transiently (20 s) recruited to TGN cisternae in the late stages of their 120 s maturation cycle to drive the transition between Golgi and post-Golgi (Pantazopoulou MBoC, 2014). Recruitment of RAB11 to the TGN is preceded by the arrival of its TRAPPII GEF (Pinar, PNAS 2015; Pinar PLOS Gen 2019), a huge complex that is incorporated en bloc to the TGN (Pinar JoCS, 2020). Upon RAB11 acquisition RAB11 membranes engage molecular motors (Penalva, MBoC 2017) to undertake a several-micron journey that transports them to a vesicle supply center located underneath the apex (review, Pinar & Penalva, 2021). Here is where Sec4 is located, strongly indicating that there is a division of work between two Rabs each mediating one of the two stages between the TGN and the membrane (Pantazopoulou, 2014, MBoC).

      In the general comments above, we discuss the possible artifact of tagged Ypt31 on the PM. In the Discussion, we now compare our results in S. cerevisiae with the findingss in A. nidulans.

    1. Reviewer #1 (Public Review):

      Grande et al report the results of a series of functional connectivity experiments that build upon and extend results reported in Maass et al. (2015). The authors conducted three separate but interrelated analyses with a primary aim of characterising entorhinal-hippocampal processing pathways in the human brain.

      The first analysis served to identify subregions within the entorhinal cortex (EC) that preferentially connect with the retrosplenial cortex (RSC), posterior parahippocampal cortex (PHC) and perirhinal areas 35 (A35) and 36 (A36). The results of this analysis revealed that the RSC and PHC preferentially connect with the anterior medial EC and posterior medial EC respectively while A35 and A36 preferentially connect with the anterior lateral EC and posterior lateral EC respectively. In a second analysis, the authors evaluated patterns of functional connectivity between the four entorhinal subregions identified in Analysis 1 and specific subfields of the hippocampus, namely the subiculum and CA1. The authors provide evidence that each EC subregion preferentially connects with specific regions along the transverse (medial-lateral) axis of the subiculum and CA1.

      In a third analysis, the authors investigated whether 'object' and 'scene' information is differentially processed within EC subregions and along the transverse axis of the subiculum and CA1. Results revealed that the posterior medial EC and distal (medial) subiculum were preferentially engaged by 'scene' stimuli. In contrast, anterior regions of the EC and the CA1/subiculum border were equally engaged by 'object' and 'scene' stimuli. The authors propose that the posterior medial EC and distal subiculum may represent a unique route for scene/contextual information flow while anterior regions of the EC and the CA1/subiculum border may be involved in integrating both 'scene' and 'object' information.

      Overall, the study was well-motivated, well-designed and appropriately analysed to address the research questions. The conclusions of the paper are well supported by the data.

      The primary novelty of these results relate to the characterisation of how the RSC, PHC, A35 and A36 functionally connect with different portions of the EC and how, in turn, these EC subregions preferentially connect along the medial-lateral axis of the subiculum and CA1. These new and detailed insights will have an impact on and advance current theoretical models of entorhinal-hippocampal functional organisation in the human brain with implications for our understanding of human memory processing and its dysfunction.

      The study also provides new evidence regarding the functional organisation of EC-hippocampal circuitry as it relates to 'object' and 'scene' processing. Results of this component of the analysis support accumulating evidence that medial portions of the hippocampus and EC are preferentially engaged during scene-based cognition.

      Taken together, the results of this study inform and extend current theoretical models of entorhinal-hippocampal information processing pathways in the human brain.

      A major strength of the study is the detailed approach used to investigate each cortical region of interest (ROI), to characterise their functional connectivity with subregions of the EC and, in turn, how these EC subregions functionally relate to hippocampal subfields. The authors take advantage of the rich dataset acquired at 7T to gain new insights into entorhinal-hippocampal functional interactions.

      While the detailed approach noted above is a major strength of the study, it is also the source of some weaknesses. For example, when manually segmenting small ROIs (such as hippocampal subfields), quality assurance measures are important to give the reader confidence that the ROI masks are, as accurately as possible, measuring what we think they are measuring. A weakness of this study in its current form is that no quality assurance measures have been presented for the ROIs. The authors provide no metrics relating to intra- or inter-rater reliability (e.g., DICE metrics) for the manually segmented ROIs. Also, it can be difficult to warp small ROIs such as hippocampal subfields to EPI images with sufficient accuracy. No data is presented to assure readers that the ROIs (manually segmented on structural images and then warped to EPI space) were well aligned with the EPI images.

      It is also important to note that the subiculum mask used in this study appears to encompass the entire 'subicular complex' inclusive of the subiculum, presubiculum and parasubiculum. Importantly, the pre- and parasubiculum are located on the medial most aspect of the 'subicular complex' but this region is referred to throughout the current study as the 'distal subiculum'. Therefore, results attributed to the distal subiculum likely also reflect functional activation of the pre- and parasubiculum. Indeed, this makes sense considering accumulating evidence that the pre- and parasubiculum are preferentially engaged during scene-based cognition. Interpretation of results relating to the 'distal subiculum' should, therefore, be interpreted with this in mind.

    1. Author Response

      Reviewer #1 (Public Review):

      This well-written paper combines a novel method for assaying ubiquitin-proteasome system (UPS) activity with a yeast genetic cross to study genetic variation in this system. Many loci are mapped, and a few genes and causal polymorphism are identified. A connection between UPS variation and protein abundance is made for one gene, demonstrating that variation in this system may affect phenotypic variation.

      The major strength of the study is the power of yeast genetics which makes it possible to dissect quantitative traits down to the nucleotide level. The weakness is that is not clear whether the observed UBS variation matters on any level, however, the claims are suitable to moderate, and generally supported.

      We agree with the reviewer that understanding how causal variants for ubiquitin-proteasome system (UPS) activity affect other molecular, cellular, and organismal phenotypes is an important area of future research.

      The paper provides a nice example of how it is possible to genetically dissect an "endo-phenotype", and learn some new biology. It also represents a welcome attempt to put the function of a mechanism that is heavily studied in molecular cell biology in a broader context.

      We thank the reviewer for these kind words.

      Reviewer #2 (Public Review):

      In this manuscript, the authors developed an elegant quantitative reporter assay to identify quantitative trait loci that regulates N-end rule pathway, a major quality control mechanism in eukaryotes. By crossing two yeast species with divergent proteostasis activity, they generated a population that showed broad variation in proteostasis activity. By sequencing and mapping the underlying loci, they have identified several genes that regulate N-end rule activity. They then verified them using precise genetic tools, validating the power of their approach.

      Overall, it is a very solid manuscript that would be highly interesting for the quality control field.

      In general, I really liked this manuscript for these reasons:

      • Uses fluorescent timers elegantly to quantitatively measure protein degradation.

      • Validates the approach in depth, showing the readers how the tool works.

      • Uses the power of yeast genetics and bulk segregant analysis to map loci that may have small effects.

      • Validates the mapped loci using precise genetic tools.

      In a field that is dominated by biochemistry, this manuscript will be a fresh breath of air…

      We thank the reviewer for their thoughtful evaluation of our work and these kind words.

      Reviewer #3 (Public Review):

      This manuscript, "Variation in Ubiquitin System Genes Creates Substrate-Specific Effects on Proteasomal Protein Degradation" studies the genetic basis of differences in protein degradation. The authors do so by screening natural genetic variation in two yeast strains, finding several genes and often several variants within each gene that can affect protein degradation efficiency by the Ubiquitin-Proteasome system (UPS). Many of these variants have "substrate-specific effects" meaning they only affect the degradation of specific proteins (those with specific degrons). Also, many variants located within the same genes have conflicting effects, some of which are larger than others and can mask others. Overall, this study reveals a complex genetic basis for protein degradation.

      Strengths: Revealing the genetic basis for any complex trait, such as protein degradation, is a major goal of biology. The results of this paper make a significant step towards the goal of mapping the genes and variants involved in this specific trait. Fine mapping methods are used to home in on the specific variants involved and to measure their effects. This is very nicely done and provides a detailed view of the genetic basis of protein degradation. Further, the GFP/RFP system used to quantify the efficiency of the protein degradation system is a very elegant system. Also, the completeness of the analysis, meaning that all 20 N-degrons were studied, is impressive and leads to very detailed findings. It is interesting that some genetic variants have larger and opposite effects on the degradation of different N-degrons.

      We thank the reviewer for these positive comments.

      Weaknesses: Some of the results discussed in this paper are not surprising. For example, the finding that both large effect and small effect genetic variants contribute to this complex trait is not at all surprising. This is true of many complex traits.

      We agree with the reviewer that the number and patterns of QTLs we observe are perhaps not unexpected given that most traits are genetically complex. However, we also note that our results stand in stark contrast to previous efforts to understand how natural genetic variation affects the UPS, which have focused almost exclusively on large-effect mutations in UPS genes that cause rare Mendelian disorders. We have therefore chosen to retain our discussion of the complex genetic architecture of the UPS.

      The discussion of human disease is also a bit extensive given this study was performed on yeast. It might be more productive to use these findings to understand the UPS better on a mechanistic level. Why does the same genetic variant have opposite effects on the degradation of different degrons, even in cases where those degrons are of the same type?

      Following the reviewer’s suggestion we have removed multiple references to human disease from the introduction. We retained paragraph 3 of the introduction (previously, lines 43-55, pg. 2, para. 2 in the revised manuscript), which discusses disease-causing mutations in UPS genes, because the examples presented highlight two important motivations for our work: (1) individual genetic differences create variation in UPS activity and (2) much of our knowledge of how natural genetic variation affects the UPS comes from these rare, limited examples. However, we have re-written the paragraph to focus on these points and removed descriptions of the clinical manifestations of the disorders mentioned.

      We agree with the reviewer that understanding the mechanistic basis of substrate-specific variant effects on distinct N-degrons is important. However, doing so would require additional experiments that we argue are outside the scope of the current study.

      Overall, this manuscript excels at mapping the genetic basis of variation in the UPS system. It demonstrates a very complex mapping from genotype to phenotype that begs for further mechanistic explanation. These results are important to the UPS field because they may help researchers interrogate this highly conserved essential system. The manuscript is weaker when it comes to the broader conclusions drawn about the relative importance of large vs. small effects variants on complex traits, the amount of heritability explained, and the effects of genetic variation on protein abundance vs transcript abundance. Though in the case of protein vs transcript, I feel the cursory examination of the trends is perhaps at an appropriate level for the study, as it is mainly meant to show these things differ rather than to show exactly how and why they differ.

      We state that the distribution of QTL effect sizes for UPS activity consists of many QTLs with small effects and few QTLs of large effects. While this result is similar to patterns observed for other complex traits, it differs dramatically from the results of previous studies of genetic influences on the UPS, which have been largely confined to large-effect variants. Given these differences, we think it is appropriate and worthwhile to emphasize the complex genetic architecture of UPS activity.

      We agree that estimating the fraction of heritability explained by our QTLs and variants would be valuable. However, as noted in our response to Reviewer 1, the QTL mapping method we used does not permit ready calculation of heritability estimates due to its pooled nature.

      The reviewer is correct in noting that the primary goal of our RNA-seq and proteomics experiments was to provide an initial exploration of the effects of causal variants for UPS activity on global gene expression at the protein and mRNA levels. While a comprehensive dissection of the effects of this and other causal variants is an important area of future work, our results here show broad changes in global gene expression and establish that the causal UBR1 variant affects gene expression at the protein and mRNA levels.

      Reviewer #4 (Public Review):

      Overall the paper is clear and well-written. The experimental design is elegant and powerful, and it's a stimulating read. Most QTL mapping has focused on directly measurable phenotypes such as expression or drug response; I really like this paper's distinctive approach of placing bespoke functional assays for a specific molecular mechanism into the classical QTL framework.

      We thank the reviewer for their thoughtful evaluation of the work and positive comments.

    1. I thought I should have sunk down at last, and never got out; but I may say, as in Psalm 94.18, “When my foot slipped, thy mercy, O Lord, held me up.” Going along, having indeed my life, but little spirit, Philip, who was in the company, came up and took me by the hand, and said, two weeks more and you shall be mistress again. I asked him, if he spake true? He answered, “Yes, and quickly you shall come to your master again; who had been gone from us three weeks.” After many weary steps we came to Wachusett, where he was: and glad I was to see him. He asked me, when I washed me? I told him not this month. Then he fetched me some water himself, and bid me wash, and gave me the glass to see how I looked; and bid his squaw give me something to eat. So she gave me a mess of beans and meat, and a little ground nut cake. I was wonderfully revived with this favor showed me: “He made them also to be pitied of all those that carried them captives” (Psalm 106.46).

      I think this example of her reference to religion demonstrates its significance amidst her hardships.

    1. Oftentimes they even refered to one another.

      An explicit reference in 1931 in a section on note taking to cross links between entries in accounting ledgers. This linking process is a a precursor to larger database processes seen in digital computing.

      Were there other earlier references that are this explicit within either note making or accounting contexts? Surely... (See also: Beatrice Webb's scientific note taking)


      Just the word "digital" computing defines that there must have been an "analog' computing which preceded it. However we think of digital computing in much broader terms than we may have of the analog process.

      Human thinking is heavily influenced by associative links, so it's only natural that we should want to link our notes together on paper as we've done for tens of thousands of years (at least.)

    1. Rain, shine, and seasons aside, passengers scheduling rides are instructed by call center operators to be outside our pick-up location at our scheduled pick-up time, even though our ride may be nowhere near at that time. We are also instructed to be prepared to wait up to 30 minutes for our drivers in case of traffic or delays. Drivers who arrive within that “30-minute window” are still considered to be on time, even though the passenger may have been outside for up to half an hour at that point. Those 30-minute delays may actually turn into hours-long waits for many customers, as drivers must follow predetermined routes that lengthen trips and exacerbate travel conditions. Drivers, on the other hand, are instructed to give late passengers only a five-minute grace period. Drivers are also encouraged to call passengers if they do not see us when they arrive, but such calls are considered a courtesy, not a requirement.

      one of the issues. I think this is especially jarring to the reader because most of us have used an Uber before, or other forms of public transportation and these "terms" are very different.

    1. Author Response

      Reviewer #1 (Public Review):

      The manuscript shows that bone is resorbed during the early steps of limb regeneration in urodeles, and osteoclasts are required for this process. In case of impaired resorption, integration of newly-formed tissue with the original bone shaft is compromised. The manuscript further shows that wound epithelium is required for bone resorption and suggests that it induces osteoclastogenesis or migration of osteoclasts. Furthermore, the authors showed that the formation of novel skeletal elements is initiated while the resorption of the old one is still actively ongoing.

      The study is well designed, conclusions are relatively well supported, and data are presented in a clear way. Two new models of transgenic axolotls have been created. The strongest and most important finding is that partial bone resorption is required for tissue reintegration. My main concern is the novelty of this study, which is quite limited in my opinion.

      Specifically, resorption of bone stump during limb regeneration has been shown before in various model organisms.

      The role of osteoclasts in this process has not been well characterized in urodeles but has been shown during the regeneration of a mouse digit.

      It is reasonable to anticipate that similarly, osteoclasts are resorbing bone in salamanders, especially since this is the only cell type known for bone resorption.

      Thus, this observation, despite being nicely and thoroughly done, is of limited interest.

      The role of wound epithelium in bone histolysis is well demonstrated via skin flap experiments in this manuscript. However, upon skin flap surgery no limb regeneration occurs, implying wound epithelium is a key tissue triggering all the processes of limb regeneration. Accordingly, the absence of bone histolysis in such conditions can be secondary to the absence of any other part of the regenerative process, e.g., blastema formation, macrophage M1 to M2 transition, reinnervation, etc. The proposed link between wound epithelium and osteoclastogenesis (i.e., Sphk1, Ccl4, Mdka) is very superficial and very suggestive.

      No functional evidence was provided to confirm these connections. Finally, the authors showed that new bone formation occurs while resorption of the bone stump is still ongoing. This is a nice observation, but again, rather indirect as it is based on the dynamics of bone resorption and bone formation in different animals. Due to high variability among animals, direct evidence, like double staining for osteoclasts and blastema markers would address this point more precisely.

      We consider that our work provides evidence, for the first time, that skeletal resorption in early stages of regeneration has a durable impact by affecting tissue integration. We show that this process occurs in a short and conserved time, which provides a window of interest for comparative research with other models, and interventional therapies. To our knowledge, limb regeneration is studied mainly in amphibians, as they are the only established lab model with this ability. Some lizards, geckos and possibly iguanas, have been reported to regrow an appendage albeit lacking the regenerative fidelity amphibians have. In an established regeneration lab model, such as the axolotl, the study of regeneration-induced resorption has been scarce.

      During murine digit tip, osteoclasts are recruited to the amputation site and resorb the bone in a similar time frame as we show here in the axolotl. Ablating osteoclasts delays the regeneration time, however, no study has been conducted on the impact of tissue integration. Additionally, a key difference between mouse digit and adult axolotl limb regeneration is that the new skeletal elements are built fundamentally different: direct ossification (bone on top of bone) in mouse, versus endochondral ossification (cartilage on top of osteo-cartilage elements) in the axolotl limb. The tissue integration of the latter may present different challenges worth exploring to understand its regulation. What this work adds, is a characterization of the temporal and cellular dynamic of regeneration-induced resorption, the interaction of osteoclasts with skeletal cells and lastly, the impact on tissue integration.

      Based on previous studies in mammals, it is reasonable to anticipate the presence and role of osteoclasts in salamanders. However, the growing body of work in the field, as well as our own work in the axolotl, have shown that extrapolations of mammalian skeletal biology to other species come with their risks.

      We agree that the role of the wound epithelium (WE) in skeletal histolysis will require further and extensive work. The evidence shown here, provides a glimpse of the complex response and crosstalk of the WE with the tissue underneath, and we hypothesize this response is tailored to the tissue composition exposed during the injury.

      Finally, following the reviewer’s advice, we have conducted new experiments to prove the temporal connection between skeletal resorption and regeneration, showing that these processes occur simultaneously.

      Reviewer #3 (Public Review):

      This study outlines the role of osteoclast-mediated resorption in integrating the skeletal elements during limb regeneration, using axolotls that can regenerate the entire limb upon amputation. Using calcium-binding vital dyes (calcein and alizarin red), the authors first demonstrated that a large portion of amputated skeletal elements is resorbed prior to blastema formation. They further show that 1) inhibiting bone resorption by zoledronic acid impairs proper integration of the pre-existing and regenerating skeletal elements, 2) removing the wound epithelium using the full skin flap surgery inhibits bone resorption, and 3) bone resorption and blastema formation are correlated. The authors reached the major conclusion that bone resorption is essential for successful skeletal regeneration. Notably, this study applies a well-established and elegant axolotl limb regeneration model and transgenic reporter strains to reveal the potential roles of resorption in limb regeneration.

      Strengths:

      1. The authors utilized a well-established axolotl limb regeneration model and applied elegant vital mineral dyes and transgenic reporter lines for sequential in vivo imaging. The authors also provided quantitative assessment by examining multiple animals, particularly in the early sections, ensuring the rigor and the reproducibility of the study.

      2. The authors further performed important interventions that can impinge upon successful limb regeneration, including inhibition of bone resorption by zoledronic acid and impairment of the wound epithelium by full skin flap surgery. These procedures gave rise to useful insights into the relationship between bone resorption and successful limb regeneration.

      3. The imaging presented in this manuscript is of exceptionally high quality.

      Weaknesses:

      1. Despite the high quality of the work, many analyses in this study are incomplete, making it insufficient to support the major conclusion. For example, in Figure 4, the authors did not provide any quantitative assessment to show how zol affects the integration of the skeletal elements (angulation?), which seems to be essential for supporting the conclusion. Likewise in Figure 7, the analyses of EdU+ cells and Sox9 reporter expression were not included in zol-treated animals. Similarly in Figure 5, quantification of osteoclasts was not performed with the full skin flap surgery group. Analyses of only normally regenerated animals are not sufficient to support many of the conclusions.

      2. The phenotype of zol-treated animals in limb regeneration is somewhat disappointing. Although zol-treated animals show decreased blastema formation and unresorbed pre-existing skeletal elements, limb regeneration still occurs and the only phenotype is a relatively minor defect in skeletal integration. It is possible that zol-induced defect in blastema formation is not directly linked to the failure of integration at a later stage. I find this “weakness” a bit subjective.

      3. As an integration failure of the newly formed skeleton still occurs in untreated animals, it is not entirely clear how the authors can attribute this defect to a lack of bone resorption. More quantitative analyses would be necessary to demonstrate the correlation between zol treatment and lack of integration.

      Taking into consideration the reviewer’s concerns, we have improved our analysis of integration phenotype. The assessment of integration success was carried out using a score matrix and with it, we correlated the extent of resorption with integration efficiency more accurately. We believe our results provide sufficient evidence to support this correlation.

      When we first saw the phenotype of zol-treated animals, we were far from disappointed, we were actually intrigued that we could observe a significant failure in tissue integration after removing the function of osteoclasts in an early phase of regeneration. All or nothing results are exciting, subtle results on the other hand, could prove more informative, and we think this is the case here. Our treatment does not inhibit regeneration, but disrupts tissue integration, opening another fascinating aspect of regeneration: how old tissue is capable of functionally integrate newly-formed tissue?

      The integration phenotypes observed in the un-resorbed limbs does not resemble anything reported in the field so far. Moreover, the range of phenotypes observed led us to better determine its correlation with resorption. Importantly, the presence of integration failures in untreated animals allowed us to look into ECM organization at this old-new tissue interphase, while highlighting the normal occurrence of imperfect regeneration in the axolotl limb.

      Finally, we have included new results to complement the conclusions presented at the end of our work. Albeit we observed differences in blastema size in zol-treated animals, we did not observe difference in the amount of EdU+ cells, which reveals that the skeleton cannot be used as a reference for assessing blastema location. This conclusion is complemented with our in vivo assays in which we observed condensation of cartilage despite resorption still occurring. We consider our conclusions to be justified and supported by the assays presented in our work.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01594

      Corresponding authors: Hidehiko Kawai and Hiroyuki Kamiya

      1. General Statements [optional]

      We would like to extend our gratitude to the Editor and both Reviewers for their constructive and insightful comments to our manuscript. We deeply appreciate the Reviewers’ careful consideration of our work, in result of which we think the paper has greatly improved. Below, we have responded to all points raised by the Reviewers.

      2. Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The analysis of mutations in mammalian, including human, genomes has been of interest for many decades. Early DNA sequencing technologies enabled direct identification of mutations in target genes provided that the mutant genes could be readily isolated. This requirement stimulated the development of shuttle vector plasmids that carried a mutation marker gene and could replicate in both mammalian and bacterial cells. These were used in experiments in which the plasmids, treated with a mutagen, would be passaged through mammalian host cells after which the progeny plasmids were introduced into an indicator bacterial strain. Colonies with mutant marker genes could be distinguished by color or survival, the plasmids recovered, and the sequence of the mutant gene determined. The shuttle vector plasmid that became the most widely used contained as the marker the supF amber suppressor tyrosyl tRNA gene positioned in the plasmid such that deletion mutations associated with mammalian cell transfection were selected against. Although various improvements have been introduced since its introduction in the mid-1980s, including bar codes to distinguish independent from sibling mutations (in the early 1990s), the basics of the system have been maintained, and it and variations are still in use. The Kamiya group has made several adjustments to the supF shuttle vectors, including the construction of indicator bacterial strains based on survival of bacteria containing mutant supF genes (the initial system relied on colony color). They have published many studies of mutagenesis by various agents, error prone polymerases, etc. In the current submission they describe a comprehensive approach to identifying mutations in the supF gene that exploits Next Generation Sequencing technology that can identify the full spectrum of mutations including those that escape detection in phenotypic screens. The study is exhaustive and presents a methodical validation of each component of their approach. They report UV induced mutations, the mechanism of which has been well characterized in previous literature. They also describe a category of multiple mutations, which had been observed in the early work with the supF plasmids, and whose relationship to UV photoproducts is most likely indirect.

      *We thank the Reviewer for their very insightful feedback to our manuscript and their positive assessment. We have added some discussion points based on the essential references mentioned in the Reviewer’s comments, which we believe made the explanation of our study more complete. *

      Major comments: This manuscript presents a technical advance on the use of the supF mutation reporter system. The extent of the validation of each component of the system, including the bar code is rigorous. Their data on the nature and location of UV induced mutations are in very good agreement with previous studies with supF and other reporter genes, a further validation of their approach. Their discussion of the mechanism of the UV induced mutations is in accord with prior work from other laboratories. However, their interpretation of the multiple mutations, although reasonable in invoking a role for APOBEC deamination of cytosines (see eLife. 2014; 3: e02001 for another discussion of this issue), overlooks a much earlier study on the same topic that showed that nicks in the vicinity of the marker gene are mutagenic and can induce multiple mutations (Proc Natl Acad Sci 1987 84:4944-8). It would be useful for the authors to consider their data on the multiple mutations in the light of the earlier analysis. Furthermore, a check to verify the covalently closed circular integrity of the plasmid preparations would be an important quality control and could reduce the mutagenesis observed in 0 UV controls.

      We thank the Reviewer for the valuable comments that made our manuscript clearer and more emphatic. We are hereby addressing all of the Reviewer’s concerns. The available data accumulated from previous studies have proved the high sensitivity of the supF assay as a mutagenesis assay, which now has been clearly supported by the results in the current study. We believe that this NGS assay will be able to fulfil the data requirements to tackle many questions related to mutagenesis, thanks to the simplicity and cost-effectiveness of the procedure. However, to meet the experimental objectives, the preparation and analysis of the library are crucially important procedures in the stages of initial setting up of the assay. The covalently closed circular integrity of the vector library is definitely one of the important points we should pay attention to when performing this assay. After the construction of the BC12-library, we have to check the quality of the library by agarose gel electrophoresis. The background mutation frequency and the sequence of the library itself (uploaded as described in the DATA AVAILABILITY section of this manuscript) also needs to be analyzed by NGS before the experiment. We are also routinely constructing the double-stranded shuttle vector from a single-stranded circular DNA with a variety of site-specific damaged oligonucleotides. The treatment with T5 exonuclease followed by purification is absolutely essential to decrease the background mutation frequency. Without the treatment with the exonuclease, cluster mutations may be increased under specific experimental conditions. For this study, we carried out the conventional supF assay using the BC12-library purified after T5 exonuclease treatment. However, in this case the process of purification slightly increased the mutant frequency of the BC12-library to about 2 x10-4 (corresponding to 1x10-6/bp).Therefore, when setting up the essay, we have to consider the background control that we will need for the data analysis. In response to the Reviewer’s comments, we have now added the following paragraph in the DISCUSSION section:

      Page 16, line 25:

      ”5) For the supF assay, spontaneous cluster mutations at TC:GA sites were often observed, and it was well illustrated in an earlier study that a nick in the shuttle vector was a trigger for these asymmetric cluster mutations (54). Therefore, we need to be aware of the quality of each library and how it affects the outcome of each analysis, especially for detection of very low levels of mutations. Depending on the purpose of the experiments, in the preparation of covalently closed circular vector libraries it is essential to eliminate the background level of mutations. In fact, the in vitro construction of the library of double-stranded shuttle vectors from single-stranded circular DNA requires the process of treatment with T5 Exonuclease, which drastically decreases background mutations.”

      Minor points The authors state that only 30% of the base sequence of the supF gene can be "used for dual-antibiotic selection on the indicator E. coli". An earlier review (Mutation Res 220: 61,1989) indicated that within the mature tRNA region single or tandem mutations had been reported at 87% of sites, using the colony color assay. The direct NGS analyses would be indifferent to phenotype, and one would expect the maximum number of mutable sites would be recovered from this approach. It would be helpful for an explicit statement regarding the number of mutant sites to be in the Discussion, as this should strengthen the case for the NGS strategy.

      We thank the Reviewer for the helpful comment. These are important points we should indeed mention. This method will complement previous data, and especially the data from titer plates will provide us with non-biased mutation spectra for the whole analyzed region. We have now explained in detail about the coverage of mutation spectra in the DISSCUSSION section.

      Page 14, line 14:

      The mutation spectra of single or tandem base-substitutions for inactive supF genes identified by using the blue-white colony color assays were comprehensively summarized in an earlier review article, and it was noted that the mutations were detected at 86 sites within a 158-bp region covering the supF gene (54%) and at 74 sites within the 85-bp mature tRNA region (87%), thus demonstrating the great sensitivity of the supF assay system for analysis of mutation spectra (19). However, obtaining reliable datasets by the conventional supF assay requires skill and experience, especially for studies where the mutations of interest are induced with low frequency. The method has been advanced by the construction of indicator bacterial strains with different supF reporter genes which allow selection based on survival of bacteria containing mutant supF genes. However, the fact that the supF phenotypic selection process relies on the structure and function of transfer RNAs that may be differently affected by different mutations means that the improvement of the efficiency of the selection process may cause loss of coverage of the mutation spectra, as it is under our experimental conditions, where the coverage is about 30% (19,20).”

      Page 15, line 4:

      From this point of view, we believe that we can secure a sufficient number of experiments to improve the accuracy of the analysis and to confirm the reproducibility of the experiments. Furthermore, the data from colonies grown on titer plates provides us, at least in principle, and with the exception of large deletions and insertions, with non-biased mutation spectra for the whole analyzed region.

      Supplementary Figure 1 shows the organization of 8 supF reporter plasmids. Were these discussed in the text and employed in the experiments? It was not clear in the text.

      We thank the Reviewer for the helpful comment. It was indeed not clear which vectors we used and why we constructed a series of vectors. Now, we have added the vectors we used for the constructions of the library and each experiment in the RESULTS and MATERIALS AND METHODS sections. Since this is quite important for us and, we believe, the readers, we also added the explanations in the DISCUSSION section, detailing why we have constructed a series of shuttle vectors, as follows:

      Page 19, line 36:

      Mutational signatures identified in cancer cells are emerging as valuable markers for cancer diagnosis and therapeutics. Innumerable physical, chemical and biological mutagens, including anticancer drugs, induce characteristic mutations in genomic DNA via specific mutagenic processes. The mutation spectra obtained here by using the presented advanced method were in good agreement with accumulated data from previous papers where the conventional method had been used, with the advantage that our method provided less-biased mutation spectra data. As described above, the datasets presented here highlighted novel mutational signatures and also cluster mutations with a strand-bias, which could be associated with the processes of replication, transcription, or repair of DNA-damage, including a single strand break (a nick). In this study, eight series of supF shuttle vector plasmids were constructed, as presented in Supplementary Figure S1; however, the analysis was carried out using N12-BC libraries prepared from either pNGS2-K1 (Figures 1-4) or pNGS2-K3 (Figures 5-10). The pNGS2-K1/-A1/-K4/-A4 and pNGS2-K2/-A2/-K3/-A3 vector series contain an M13 intergenic region with opposite orientations relative to the supF gene, which allows us to incorporate specific types of DNA-damage at specific sites in the opposite strand of the vector library. Also, the pNGS2-K1/-A1/-K3/-A3 and pNGS2-K2/-A2/-K4/-A4 vector series contain the SV40 replication origin, which enables bidirectional replication and transcription, at opposite sides of the supF gene. Although this is still preliminary data, it is notable that the spontaneously induced mutations for the different vectors in U2OS cells were not significantly different. Therefore, the here presented mutagenesis assay with NGS, by using these series of libraries, can be applied in many different types of experiments to address both quantitative and qualitative features of mutagenesis. It is possible to design series of libraries containing DNA lesions or sequences suitable for the investigation of specific molecular mechanisms, such as TLS, template switching, and asymmetric cluster mutations.”

      CROSS-CONSULTATION COMMENTS Comment on the issue raised by Reviewer #2 regarding plasmids with unrepaired DNA damage introduced into E. coli after passage through U2OS cells: treatment of the plasmid harvest with Dpn1 eliminates un-replicated plasmid DNA. Also, SV40 T antigen drives run away replication of the plasmids, which contain the SV40 origin of replication. This greatly dilutes plasmids with remaining UV photoproducts.

      Reviewer #1 (Significance (Required)):

      Significance This is a comprehensive description of a technical advance for the analysis of mutations based on the most widely used system for reporting mutations in mammalian, including human, cells. As costs for NGS decline it is likely to become the approach of choice.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, the authors developed a novel mutagenesis assay by combining the conventional supF forward mutagenesis assay with NGS technology. The manuscript is well written, providing design, methods, and results of the experimental system in very much details, which this reviewer highly evaluates. However, the manuscript may be too long and could be more concise. In addition, this reviewer is afraid that main figures seem difficult to fit printed pages (especially multi-paneled figures of large size, such as Fig. 5 through 8). The authors should re-organize the figures by reducing size and/or moving partly to supplementary information.

      We thank the Reviewer for the helpful comments to our manuscript. It is true that the multi-paneled figures were too large, and we have now re-analyzed and optimized most of the figures by reducing size, transferring to Supplementary Figures, and separating one figure into two. Although the number of Figures and Supplementary Figures have now increased, we believe that it has become easy to follow for readers and to fit printed pages. *We considered carefully the Reviewer’s remark about the length of the manuscript, but we feel that the text was already as concise as we could make it, and we have already left out some more detailed explanations. *

      1. Some UV-induced DNA damage (typically CPD) is repaired only slowly in human cells, so that the replicated plasmid DNAs recovered from U2OS cells may still contain damage and possibly induce mutations in E. coli after transfection. As the result of high sensitivity of NGS analysis, it is worried that such mutations could be also included in the results. To obtain even more accurate mutational characteristics in mammalian cells, the authors could consider to treat the DNA samples with photolyases before transformation of E. coli. The authors could consider to discuss on this point.

      *We thank the Reviewer for the helpful comment, indeed Dpn I treatment is one of the very important procedures for avoiding analysis bias. We have now expanded the explanation why the libraries have to be treated with Dpn I, as follows: *

      Page 11, line 4:

      the libraries were extracted from the cells, and treated with dam-GmATC-methylated DNA specific restriction enzyme Dpn I to digest un-replicated DNAs that contain UV-photoproducts.”

      1. It is quite intriguing that multiple mutations in a single BC clone tend to occur in the same DNA strand. Is there any trend in a distance between the mutated sites? Considering participation of TLS polymerases in the first round of replication, it may be interesting if multiple DNA lesions occur in relatively close positions so that TLS polymerases elongate the DNA strand without switching back to replicative polymerases.

      We thank the Reviewer for the valuable and insightful suggestions for this assay. We have analyzed the positions of SNSs in multiple-mutations shown in Supplementary Figures S11 and S12. As the reviewer mentioned, we may be able to address the mechanisms of TLS switching in mammalian cells by using this assay. In this study, the obtained non-biased mutation spectra of multiple mutations may not be enough for the static analysis, but our results indicate that multiple mutations were induced at relatively close positions. It would be interesting if we could address the mechanisms of TLS polymerase switching. We believe that the accumulation of large numbers of non-biased mutation spectra will provide us with growing opportunities to address more questions in mutagenesis. We have now added the Supplementary Figures S11 and S12, as well as the following discussion points:

      Page 14, line 6:

      5) The distance between two SNSs in multiple mutations induced by UV irradiation was relatively shorter than the theoretically expected based on the sequence (Supplementary Figures S11 and S12).”

      Page 18, line 27:

      “In addition, the positions of SNSs in the multiple mutations were closer to each other compared to the theoretically expected positions (Supplementary Figures S11 and S12), which may reflect switching events involving TLS polymerases. It should be noted that the presented data for the distance between two SNSs in the multiple mutations was analyzed from the data from selection plates in order to secure a sufficient number of mutations, and therefore, there may be a bias due to hot spots associated with the selection process. However, the results from the limited number of mutations from the titer plates are similar to these from the selection plates. It can be proposed that this assay may also be applied for analysis of TLS polymerases in mammalian cells.”

      1. This reviewer is wondering whether the results of mammalian cells are influenced by transcription-coupled repair in this experimental system. Because the SV40 replication origin functions as bidirectional promoters, the supF region may be transcribed in U2OS cells so that DNA damage on transcribed strands may be removed more efficiently than non-transcribed strands. Please comment on this, if relevant.

      *We thank the Reviewer for the insightful comments. This issue is also very important and interesting, and should be addressed in the mutagenesis research. That is exactly the reason why we presented series of vectors for the assay in this paper. The SV40 replication origin has an effect on the background mutations, which this is also dependent on the experimental conditions. However, this needs to be confirmed by further studies. We hope the idea for these constructions will be helpful for many laboratories. We have now added the following parts in the DISCUSSION section. *

      Page 18, line 36:

      Mutational signatures identified in cancer cells are emerging as valuable markers for cancer diagnosis and therapeutics. Innumerable physical, chemical and biological mutagens, including anticancer drugs, induce characteristic mutations in genomic DNA via specific mutagenic processes. The mutation spectra obtained here by using the presented advanced method were in good agreement with accumulated data from previous papers where the conventional method had been used, with the advantage that our method provided less-biased mutation spectra data. As described above, the datasets presented here highlighted novel mutational signatures and also cluster mutations with a strand-bias, which could be associated with the processes of replication, transcription, or repair of DNA-damage, including a single strand break (a nick). In this study, eight series of supF shuttle vector plasmids were constructed, as presented in Supplementary Figure S1; however, the analysis was carried out using N12-BC libraries prepared from either pNGS2-K1 (Figures 1-4) or pNGS2-K3 (Figures 5-10). The pNGS2-K1/-A1/-K4/-A4 and pNGS2-K2/-A2/-K3/-A3 vector series contain an M13 intergenic region with opposite orientations relative to the supF gene, which allows us to incorporate specific types of DNA-damage at specific sites in the opposite strand of the vector library. Also, the pNGS2-K1/-A1/-K3/-A3 and pNGS2-K2/-A2/-K4/-A4 vector series contain the SV40 replication origin, which enables bidirectional replication and transcription, at opposite sides of the supF gene. Although this is still preliminary data, it is notable that the spontaneously induced mutations for the different vectors in U2OS cells were not significantly different. Therefore, the here presented mutagenesis assay with NGS, by using these series of libraries, can be applied in many different types of experiments to address both quantitative and qualitative features of mutagenesis. It is possible to design series of libraries containing DNA lesions or sequences suitable for the investigation of specific molecular mechanisms, such as TLS, template switching, and asymmetric cluster mutations.”

      1. page 13: Please check whether the description of Fig. 9C is correct (6th line, graph on top; 9th line, bottom graph).

      We thank the Reviewer for carefully checking our manuscript, it was mislabeled in the text. Now, following the Reviewer’s comments, most figures have been changed from the figures in the previous submission. We appreciate the careful review.

      CROSS-CONSULTATION COMMENTS Reviewer #1 gives quite relevant comments as an expert of the mutagenesis field. It would improve this manuscript greatly for the authors to make appropriate modifications according to his/her suggestions.

      Reviewer #2 (Significance (Required)):

      It is quite convincing that this method has a great potential to give much more extensive information on mutational characteristics, most importantly, by eliminating the bias caused by phenotypic selection. Therefore, this work certainly must be worth being published in an appropriate journal.

    1. Aur ore ara, ~ spu8 dur ‘parapamur skoq Aj paysitres 10 peap uarpyi Ay “vanowdH © SRPOTL “SPDaq9 SAPPARLL [SILL ‘suodurey, -ouasdy, ‘asedmpooy

      This really illustrates the disconnect between what provokes stress or despair between the two worlds. Privilege check. SO THE BEGINNING PARTS ARE CREATING DISTANCE. I think the later parts will make us see that while we live two different lives, we live right next to each other. And nothing is as far as it may seem.

    1. ust as some friendly debunkerswere able to build connections with believers, addressing the issues of false conspiracy theoriesmay require us to see our similarities and common concerns instead of focusing solely on ourdifferences in belief. This challenge also requires us to see the problem from a socio-technicalperspective by treating the technologies and the social relationships on the internet together as anorganic whole. We hope our research contribute to the understanding of conspiracy believers andtheir belief changing process, and shed light on how we may better facilitate people in makingsense of online information

      Made me think of flat-earth documentary on Netflix.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1:

      Review of "Identifying novel regulators of placental development using time series transcriptomic data and network analyses."

      The authors present a detailed bioinformatic assessment of mouse developmental time series of the placenta. They apply current data mining and analysis methods to identify protein-centred networks that are likely enriched to specific cell types of the placenta. They then translate these findings to humans using statistical comparisons of human single-cell sequencing data of the placenta. Lastly, they use knock-down experiments to validate the conserved functional importance of the hub genes in the mouse protein networks in human cells.

      The strengths of this paper are the rigorous data mining methods and the functional translation to humans from mice. There are no critical weaknesses to the article. There is a blend of statistical analysis with anecdotal or hand curation from databases and the literature, but it is unclear if these curated finings are circumstantial or statistically meaningful. In the end, the hypothesis seems to hold in that 4/4 gene knocked down in the human cells gave a migration phenotype.

      Comments, questions, critique:

      1. Given the translational aims of the paper, more introduction/discussion material on the comparative aspects of mice and humans are needed. Are giant cells and EVT the same? What are the cell equivalents that you are discovering? The Soncin et al. paper is cited, but I think underused. This publication contains time series data on mice and humans and could be used as external validation of clusters, networks, and other analyses. Other publications to consider for context are

      2. Cox B, et al. Mol Syst Biol 5: 279.

      3. Silva JF, Serakides R. 2016. Cell Adhes Migr 10: 88-110. (specifically discusses migration difference between the species placentae)

      We thank the reviewer for the comment and valuable resources. We agree that more information about the similarities and differences between the migratory cells needs to be provided. We have added the following details in the introduction of the manuscript:

      “Although there are certain differences between the mouse and human placenta (Hemberger, Hanna, and Dean 2020; Soncin, Natale, and Parast 2015), they do express common genes during gestation, including common regulators and signaling pathways involved in placental development (Cox et al. 2009; Soncin et al. 2018; Soncin, Natale, and Parast 2015; Watson and Cross 2005). For example, Ascl2/ASCL2 and Tfap2c/TFAP2C are required for the trophoblast (TB) cell lineage in both mouse and human models (Guillemot et al. 1994; Kuckenberg, Kubaczka, and Schorle 2012; Varberg et al. 2021). Another example is the HIF signaling pathway, which regulates TB differentiation in both mouse and human placenta (Soncin, Natale, and Parast 2015).”

      “Although the structure of the placenta is not identical between mouse and human, certain mouse placental cell types are thought to be equivalent to human placental cell types (Soncin, Natale, and Parast 2015). For example, parietal TGCs and glycogen TBs have been described as equivalent to human extravillous trophoblasts (EVTs) (Soncin, Natale, and Parast 2015). Mouse TGCs are not as invasive as human EVTs (Soncin, Natale, and Parast 2015), and they have different levels of polyploidy and copy number variation (Morey et al. 2021); however, both EVTs and TGCs are able to degrade extracellular matrix to enable TB migration into the decidua (Silva and Serakides 2016).”

      Added to discussion:

      “These genes were selected primarily based on the network analyses, but also based on expression data from human cells to account for possible differences between mouse and human placental gene expression.”

      As the reviewer suggested, we used the Soncin et al., 2015 data for validation. Only 6,317 of the 11,713 protein-coding genes used for hierarchical clustering were detected in the mouse dataset in Soncin et al., 2015. This issue could be because the Soncin data was generated using microarrays.

      Nevertheless, we still compared our e7.5 and e9.5 hierarchical groups with: (1) Soncin et al. gene clusters in mouse that were downregulated over time, had highest expression from e9.5-12.5, or were upregulated over time; and (2) Soncin et al. gene clusters in human that were best correlated with mouse clusters and were either downregulated over time or upregulated over time. We observed a general consensus that our e7.5-hierarchical group had the highest percent of agreement with Soncin et al. gene groups that are downregulated over time, and our e9.5-hierarchical group had the highest percent of agreement with Soncin et al. gene groups that either have highest expression at e9.5-e12.5 or genes that are upregulated over time. This data is added below, described in the results section 1, and included in Supplementary Table S1.

      Comparison with Soncin et al. mouse data:

      Having expression > 0 (in Soncin et al.) and being in any hierarchical clusters

      E7.5-hierarchical genes (down-regulation trend)

      E9.5-hierarchical genes (up-regulation trend)

      Cluster 2, 3 and 7 (Soncin et al., downregulation trend)

      1009

      800 (79.3%)

      279 (27.7%)

      Cluster 6 (Soncin et al., highest at e9.5 – e12.5)

      120

      51 (42.5%)

      110 (91.7%)

      Cluster 1, 4 and 5 (Soncin et al., upregulation trend)

      1019

      415 (40.7%)

      881 (86.5%)

      Comparison with Soncin et al. human data:

      Having expression > 0 (in Soncin et al.) and being in any hierarchical clusters

      E7.5-hierarchical genes (down-regulation trend)

      E9.5-hierarchical genes (up-regulation trend)

      HS Cluster 5 (Soncin et al., downregulation trend)

      164

      92 (56.1%)

      52 (31.7%)

      HS Cluster 2 and 4 (Soncin et al., upregulation trend)

      111

      44 (39.6%)

      72 (64.9%)

      The following statement was added to the result section:

      “Second, we compared our hierarchical groups with previously published mouse and human placental microarray time course data from Soncin et al., 2015 (Soncin, Natale, and Parast 2015). Despite the technical differences between the datasets, we observed a consensus that our e7.5 hierarchical cluster had the highest percent of overlap with Soncin et al. gene groups that are downregulated over time, and our e9.5 hierarchical cluster had the highest percent of overlap with Soncin et al. gene groups that either have highest expression at e9.5 - e12.5 or genes that are upregulated over time (Supplementary Table S1).”

      Clustering represented in Figure 1B, was this a supervised model? Why only three clusters?) Did you specify that there would be three models and force each gene profile into one of the categories? How robust are the fits? A fitted model might be a better approach as you can specify the ideal models (early high, late high and mid-high), then determine each gene profile that fits each model and only assess those genes with a significant fit to the model. Forcing clustering to the three-model fit likely gives many poorly fitting profiles. While in the end, this works out, it may be due to applying other post hoc methods for gene enrichment, where noise distributes randomly.

      We carried out unsupervised transcript clustering using hierarchical clustering (agglomerative approach using Euclidean distance and complete linkage). The resulting dendrogram was cut at the second highest level to obtain three clusters. We have added additional validation with different numbers of clusters (k = 3, 4 and 5) and quantification of agreement between different clustering methods to show the robustness of the hierarchical clusters. We acknowledge that hierarchical clustering could be sensitive to noise and could result in poorly fitted transcripts in each group; however, it was a necessary first step for us to identify genes relevant to the distinct placental processes at the three timepoints. Acknowledging this disadvantage, we only focused the analyses on genes that are differentially expressed over time and were present in the timepoint hierarchical groups.

      We added the additional analysis as Supplementary Figure S1, and the following statements were added in the results section:

      "First, we used three different algorithms, K-means clustering, self-organizing maps, and spectral clustering, to validate the trends of the expression levels in hierarchical groups, as well as the number of transcript groups (k = 3, 4 and 5). Only with k = 3 did we obtain groups with median expression level trends consistent in all four algorithms (Supplementary Figure S1). Moreover, with k = 3, the maximum percent of agreement (see Materials and Methods) between hierarchical clusters and clusters obtained using each of the different algorithms was 70.34-87.26% (Supplementary Figure S1), while the maximum percent of agreement between hierarchical clusters and clusters obtained from other algorithms decreases to between 55.67-65.72% with k = 4 and 54.81-59.19% with k = 5.”

      We agree model-based clustering could be an alternative approach and have added it to the discussion section:

      “Combining hierarchical clustering with differential expression analysis, we were able to identify gene groups using an unsupervised approach. It has also been shown that for times-series analyses with fewer than eight timepoints, pairwise differential expression analysis combined with additional methods identifies a more robust set of genes (Spies et al. 2019). Alternatively, model-based clustering using RNA-seq profiles (Si et al. 2014) could also be useful for gene group identification. However, it is still important to evaluate the robustness and functional relevance of the fitted models by carrying out additional downstream analyses.”

      Several statements are made about the conservation of importance between mouse and human hub genes. For example, "We predict these highly expressed genes to be generally important for TB function and processes such as cell migration, a term associated with multiple timepoint specific networks (Figure 2A)." While your knock-down assay of migration results shows these hub genes to be necessary to humans, what do they mean to the mouse? You did not use mouse TSC to assess functional importance concurrently. You note a small number of genes as of known importance, "127 hub genes of which 16 have been annotated as having a role in placental development". Were the others knocked out but lack a developmental phenotype or not assessed? Are these functionally redundant in the mouse or not involved in the same processes between the species?

      To assess the possible role of hub genes in mouse development more comprehensively, we extended our search for gene functions on the Mouse Genome Informatics (MGI) database to include not only placenta related GO and MGI phenotype terms (defined as “genes with known roles”), but also embryo related GO and MGI phenotype terms (defined as “genes with possible roles”). We included embryo related terms as “genes with possible roles” because embryonic lethal mouse knockout lines frequently have placentation defects, and because defects in placental development can be associated with the development of other embryonic tissues (Brown and Hay 2016; Perez-Garcia et al. 2018; Woods, Perez-garcia, and Hemberger 2018). This change resulted in an increase in the number of genes with relevant functions in mouse, including several annotated as embryonic lethal or with abnormal embryonic growth (see Supplementary Table S6). With the additional annotations:

      • 6 out of 17 hub genes of e7.5 networks have known/possible roles.
      • 17 out of 28 hub genes of e8.5 networks have known/possible roles.
      • 48 out of 127 hub genes of e9.5 networks have known/possible roles. We also carried out randomization tests to determine if the number of known/possible genes we identified were significant. Randomization tests were carried out with the following procedure: for each timepoint, from the respective timepoint-specific groups, we sampled 10,000 gene sets of the same number as the hub gene numbers. Then we counted the number of known/possible genes in each random set. A p-value is calculated as the number of times a random gene set has ≥ known/possible genes than the observed number, divided by 10,000. We found that the number of genes with known/possible roles at each time point are statistically significant (Supplementary Figure S3). This result indicates that the gene sets we identified are significantly associated with relevant phenotypes in mouse.

      The remaining hub genes are unannotated as related to placental or embryonic functions in the MGI database. Based on that, it is difficult to determine if they lack a relevant phenotype, or if there has not been a detailed assessment of the placenta.

      Added to section 2 of the result section:

      “Briefly, genes annotated under any GO or MGI phenotype terms related to placenta, TB cells, TE and the chorion layer are considered as having a “known” role in the placenta. Genes annotated under terms related to embryo are considered as having a “possible” role in the placenta, because embryonic lethal mouse knockout lines frequently have placentation defects, and because defects in placental development can be associated with the development of other embryonic tissues (Brown and Hay 2016; Perez-Garcia et al. 2018; Woods, Perez-garcia, and Hemberger 2018). Hereafter, such genes are referred to as “known/possible genes”. In the e7.5 networks, there were 17 hub genes in which six genes were known/possible. The number of hub genes that are labelled as known/possible is statistically significant when comparing to random gene sets selected from the e7.5 timepoint-specific group (Supplementary Figure S3). In the e8.5 and e9.5 networks, 17 out of 28 and 48 out of 127 hub genes were known/possible, respectively. Similar to e7.5, the number of hub genes labelled as known/possible in e8.5 networks and e9.5 networks were both statistically significant when comparing to random gene sets selected from the corresponding timepoint-specific groups (Supplementary Figure S3). These results indicate that the gene sets we identified are significantly associated with relevant phenotypes in the mouse.”

      For the four genes that we tested in HTR-8/SVneo cells, we also added more information about the current known role of the gene in mouse.

      Added to the discussion section:

      “We identified hub genes and their immediate neighboring genes which could regulate placental development and confirmed the roles of four novel genes (Mtdh, Siah2, Hnrnpk and Ncor2) in regulating cell migration in the HTR-8/SVneo cell line. These genes were selected primarily based on the network analyses, but also based on expression data from human cells to account for possible differences between mouse and human placental gene expression. Previous studies suggested these four candidates are functionally important in mouse. Mtdh has been suggested to regulate cell proliferation in mouse fetal development (Jeon et al. 2010). The Siah gene family is important for several functions (Qi et al. 2013). Of relevance to the placenta, Siah2 is an important regulator of HIF1α during hypoxia both in vitro and in vivo (Qi et al. 2008). Moreover, while Siah2 null mice exhibited normal phenotypes, combined knockouts of Siah2 and Siah1a showed enhanced lethality rates, suggesting the two genes have overlapping modulating roles (Frew et al. 2003). Hnrnpk-/- mice were embryonic lethal, and Hnrnpk+/- mice had dysfunctions in neonatal survival and development (Gallardo et al. 2015) . Ncor2-/- mice were embryonic lethal before e16.5 due to heart defects (Jepsen et al. 2007). According to the International Mouse Phenotyping Consortium database (Dickinson et al. 2016), Ncor2 null mice also showed abnormal placental morphology at e15.5. However, none of these genes have been studied in TB migration function.”

      In determining conservation between mouse and human networks, were only 1:1 orthologs examined or did you consider more complex 1:many mapping conditions between the two species?

      In this work, we used only one-to-one orthology between mouse and human avoid duplication while sampling in the enrichment tests. We added this detail in the method section. However, as found in Cox et al., 2009, genes with one-to-many orthologs could be highly intriguing and should be investigated in future studies.

      Should the migration assay be normalized to survival/adhesion? If 70,000 cells were seeded but had 50% cell death (or reduced adhesion), then it may appear to be poor migration. Should the migration be evaluated as a ratio of top to bottom cell densities to control for poor adhesion or survival?

      We thank the reviewer for bringing up this important point. Unfortunately, with the method we used we cannot quantify the densities on top, because the cells on top need to be scraped off prior to measuring the cells at the bottom (the two densities cannot be measured separately). To help with this concern, in a separate experiment we instead counted cell numbers 48-hours post-transfection for cells treated with target gene siRNA and cells treated with negative control siRNA to determine if apoptosis or changes in proliferation rate could be leading to changes in the observed migration. From this data, we determined that none of the siRNA knockdowns resulted in a significant change of cell counts (p-value > 0.05). We do note that Siah2 siRNA #1 has some decrease in counts (p-value = 0.081) and Ncor2 siRNA #1 and #2 have some increase in cell counts (p-value = 0.081 and p-value = 0.077) (Supplementary Figure S7). Additional follow up experiments we have performed with our targets of interest, which are out of the scope of this paper, demonstrate that different pathways and processes could be involved in the resulting decrease in migration we observed (we are following up experimentally in more detail for each gene). Proliferation and other assays could also be used to further examine the increase in Ncor2 cell counts that were observed. We have added the cell count results and additional text to the discussion.

      Added to results, section 4:

      “When comparing the number of cells 48 hours post-transfection for cells treated with target gene siRNA to cells treated with negative control siRNA, we determined that none of the target gene siRNA treatments resulted in significant changes in cell counts. We do note that Siah2 siRNA #1 has some decrease in cell counts (p-value = 0.081), and Ncor2 siRNA #1 and Ncor2 siRNA #2 have some increase in cell counts (p-value = 0.081 and p-value = 0.077) compared to negative control treated samples (Supplementary Figure S7). This provides evidence that, in general, the reduction in cell migration capacity was likely not due to the target gene impacting the rate of cell death.”

      To the discussion:

      “Moreover, we observed that cell counts generally were not decreased upon target gene knockdown compared to negative control knockdown. However, more detailed analysis and process specific assays are needed. For example, future studies assessing each gene’s role in cell adhesion, cell-cell fusion, cell proliferation and cell apoptosis can be done to better understand their roles in placental development.”

      Reviewer #1 (Significance (Required)):

      This significantly advances previous publications on this topic by functionally testing the discovered genes.

      This highlights an excellent data mining strategy for a developmental disease using mice and translating to humans.

      The audience is likely developmental biologists and reproductive specialists.

      My expertise is bioinformatics and developmental biology.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The authors used RNA-seq data from mouse fetal placenta at e7.5, e8.5, and e9.5 to create timepoint-specific gene expression interaction networks to find genes that they predicted would regulate placental development. They confirmed four novel candidate genes and showed that in the transfected human trophoblast HTR-8/SVneo cell line, these four candidates reduced cell migration capacity. Additionally, the authors show that bulk RNA-seq data can be used to infer cell-type composition and when used with single-cell RNA-seq, can be a powerful tool to study the biological processes that involve multiple cell-types.

      Overall, the authors are rigorous in their analyses, their conclusions appear sound, and the work could be an asset to the broader placental biology field. However, although the authors present an approach that future studies might find useful to replicate and their work has produced numerous novel transcripts/genes that warrant further investigation, the approach is not entirely novel, and could be expanded/improved (as suggested by the authors in the discussion), particularly with regard to validation of the genes/networks identified. Major and minor comments are listed below.

      Major comments:

      1) The authors used clustering and differential expression analysis to define sets of timepoint-specific genes. However, it was not clear to me the benefits of this approach. Why would using this approach be better than differential expression analysis alone such as in a typical ANOVA?

      We have added more discussion on this matter to explain our approach. We believe using hierarchical clustering and pairwise differential expression analysis can help identify gene lists with higher confidence. These are the new details we added to the discussion section:

      “Combining hierarchical clustering with differential expression analysis, we were able to identify gene groups using an unsupervised approach. It has also been shown that for times-series analyses with fewer than eight timepoints, pairwise differential expression analysis combined with additional methods identifies a more robust set of genes (Spies et al. 2019). Alternatively, model-based clustering using RNA-seq profiles (Si et al. 2014) could also be useful for gene group identification. However, it is still important to evaluate the robustness and functional relevance of the fitted models by carrying out additional downstream analyses.”

      2) Related to number 1 above, although the authors are interested in timepoint-specific transcripts, the author's methods would filter out possibly interesting transcripts that turn on and off during development. The authors might want to check to see if there are transcripts that are up in e7.5 and then down in e8.5 but then up again in e9.5. Also, the author's methods seem to include transcripts that are not exclusive to one timepoint (i.e. are up in e7.5 and e8.5 but not e9.5). It might be interesting to differentiate transcripts that are exclusive to one timepoint from those that are in more than one timepoint.

      We thank the reviewer for their valuable comment. We agree genes that turn on and off during the time course could be very interesting. In performing this analysis, we found that the number of such genes is rather small (38 genes that are up-regulated at e7.5 compared to e8.5 and up-regulated at e9.5 compared to e8.5). These genes were not enriched for processes that we observed with timepoint-specific gene groups, such as “trophoblast giant cell differentiation” (e7.5-specific genes), “labyrinthine layer development” (e8.5- and e9.5-specific genes), "blood vessel development” (e7.5- and e9.5-specific genes) and “response to nutrient” (e9.5-specific genes) (Supplementary Table S3). They are generally enriched for processes related to cytokine production and regulation of secretion.

      We also agree that it is interesting to differentiate transcripts that are exclusive to one time point from those that are in more than one time point. In the revised manuscript, we added additional analysis for genes that belong to multiple timepoint groups due to different transcripts of the same gene being annotated as timepoint-specific, and genes unique to each timepoint (Added to results section 1):

      “It is possible that timepoint-specific groups share genes that have timepoint-specific transcripts. Indeed, we identified 37 genes shared between e7.5 and e8.5, 5 genes shared between e7.5 and e9.5, and 109 genes shared between e8.5 and e9.5 (Supplementary Table S3). We found that genes only present at one timepoint (timepoint-unique genes) were generally enriched for similar terms as the full group of timepoint-specific genes (Supplementary Table S3). However, terms related to the development of labyrinth layer like “labyrinthine layer morphogenesis” and “labyrinthine layer blood vessel development” were only enriched when using all e8.5-specific genes but not when using e8.5 timepoint-unique genes. Moreover, we found that, unlike genes shared between e9.5 and e7.5, genes shared between e9.5 and e8.5 were enriched for processes such as “blood vessel development” and “insulin receptor signaling pathway”. This observation may indicate that different transcripts of the same genes could be expressed at different timepoints for the continuation of certain biological processes.”

      3) In the network analysis it would be interesting and helpful to the reader to highlight, if any, nodes or terms that were found to be significant (i.e. hubs or genes that have a high centrality metric etc.) in both the STRING and GENIE3 networks or overlap the networks created by the two different algorithms to compare them. This might help readers better rank genes when using these data to decide what genes are most important at each timepoint.

      We observed only one hub gene shared among networks inferred by the two methods (Vegfa in the e9.5 networks). However, hub genes of networks inferred by one method could be nodes in networks inferred by the other method. Hence, we have added lists of such genes in section 2. Interestingly, many of these genes have known roles in placental development. In terms of biological functions shared between the networks at the same timepoints, there were multiple interesting processes such as “positive regulation of cell migration”, “epithelium migration” and “vasculature development”, which we highlighted in Figure 2A.

      In the revised manuscript, we have added the following details in different paragraphs of section 2 of the results:

      “Although the networks inferred by the two methods did not share any hub genes, hub genes identified with one method could be members of the other method’s networks. These hub genes are Mmp9 (e7.5_1_STRING), Frk, Hmox1, and Nr2f2 (e7.5_2_GENIE3) (Table 1). This observation strengthens the potential roles of Frk gene in placental development.”

      “Hub genes identified with one method and present in the other method’s networks are Hsp90aa1, Akt1, and Mapk14 (e8.5_1_STRING), Dvl3 and Msx2 (e8.5_2_GENIE3) (Table 1).”

      “Hub genes identified with one method and present in the other method’s networks include important genes such as Rb1 (Sun et al. 2006), Yap1 (Meinhardt et al. 2020) (e9.5_1_GENIE3) and Vegfa (e9.5_2_STRING) (Table 1). Notably, Vegfa is the only hub gene identified with both of the network inference methods.”

      4) The author's conclusion that network analysis can be used to identify genes more likely associated with specific placental cell types is very likely true, but I think that the conclusion would be more impactful if the authors reported how the method compares to simply taking a list of differentially expressed genes and looking for cell type enrichments using their favorite enrichment software. For example, if a gene is highly connected in a particular network that has been identified as SCT-specific, but that gene isn't considered an SCT "marker" by the placental biology research community, it would be interesting to highlight that it is prevalent in a previously published scRNA-seq dataset or a dataset that has isolated that particular cell type to show the advantages of using networks to find placental cell type specific genes.

      We completely agree with the reviewer’s point and have now added a randomization analysis to compare the enrichment using PlacentaCellEnrich (PCE) with genes in networks and random genes (Supplementary Figure S6). We randomly sampled 10,000 gene sets with the same sizes as the subnetworks from their corresponding hierarchical groups and carried out PCE analysis. These tests showed that the enrichments of cell type-specific genes were only significant with the subnetwork genes but not the random genes. The randomization tests added a valuable highlight that the network genes are highly relevant to cell type-specific genes in the human placenta, and therefore provided more confidence in the gene lists obtained from the network analyses.

      We also further checked the expression of the hub genes in other independent data in order to identify hub genes that are potentially cell type specific markers. For example, we observed that Dvl3 (e8.5_2_GENIE3) and Olr1 (e9.5_3_STRING) have been shown to be differentially expressed in SCT compared to other TB subtypes (human trophoblast stem cells, EVT (Sheridan et al. 2021) or endovascular TB (Gormley et al. 2021)).

      We added the following detail in the results, section 3:

      “Importantly, randomization tests showed that the enrichment of cell type-specific genes were only significant in these subnetworks but not in random gene sets selected from corresponding timepoint hierarchical groups (Supplementary Figure S6), which highlights the biological relevance of the gene network modules.”

      Added to the discussion section:

      “Moreover, hub genes could be used to identify potential novel markers for the cell types corresponding to their subnetworks. For example, hub genes of subnetworks enriched for SCT-specific genes such as Dvl3 (e8.5_2_GENIE3) and Olr1 (e9.5_3_STRING) are not established SCT marker genes, but are in fact differentially expressed in SCT compared to human trophoblast stem cells, EVT (Sheridan et al. 2021) or endovascular TB (Gormley et al. 2021). In general, combining network analysis with existing gene expression data from single cell or pure cell populations will allow identification of novel cell-specific marker genes to help future studies focused on different TB populations.”

      5) While the selection of genes for validation was limited by the model system available for testing, the authors should recognize that the genes/networks identified here should first and foremost be validated in a mouse model (by knockdown/overexpression studies using mouse trophoblast stem cells or by evaluation of placenta/embryo in a KO/transgenic mouse model). Whether or not the data are relevant to human placentation is (at least initially) irrelevant. While we recognize that these are difficult studies requiring significant time and resources, as is, the data and results will have significantly less impact than if even a limited amount of such validation could be performed.

      We thank the reviewer for this valuable comment. Based on this comment and the suggestions from reviewer #1, we have added the following points to the manuscript to discuss the relevance of the genes in the mouse models, and further explain our gene choices:

      To assess the possible role of hub genes in mouse development more comprehensively, we extended our search for gene functions on the Mouse Genome Informatics (MGI) database to include not only placenta related GO and MGI phenotype terms (defined as “genes with known roles”), but also embryo related GO and MGI phenotype terms (defined as “genes with possible roles”). We included embryo related terms as “genes with possible roles” because embryonic lethal mouse knockout lines frequently have placentation defects, and because defects in placental development can be associated with the development of other embryonic tissues (Brown and Hay 2016; Perez-Garcia et al. 2018; Woods, Perez-garcia, and Hemberger 2018). This change resulted in an increase in the number of genes with relevant functions in mouse, including several annotated as embryonic lethal or with abnormal embryonic growth (see Supplementary Table S6). With the additional annotations:

      • 6 out of 17 hub genes of e7.5 networks have known/possible roles.
      • 17 out of 28 hub genes of e8.5 networks have known/possible roles.
      • 48 out of 127 hub genes of e9.5 networks have known/possible roles. We also carried out randomization tests to determine if the number of known/possible genes we identified were significant. Randomization tests were carried out with the following procedure: for each timepoint, from the respective timepoint-specific groups, we sampled 10,000 gene sets of the same number as the hub gene numbers. Then we counted the number of known/possible genes in each random set. A p-value is calculated as the number of times a random gene set has ≥ known/possible genes than the observed number, divided by 10,000. We found that the number of genes with known/possible roles at each time point are statistically significant (Supplementary Figure S3). This result indicates that the gene sets we identified are significantly associated with relevant phenotypes in mouse.

      The remaining hub genes are unannotated as related to placental or embryonic functions in the MGI database. Based on that, it is difficult to determine if they lack a relevant phenotype, or if there has not been a detailed assessment of the placenta.

      Added to section 2 of the result section:

      “Briefly, genes annotated under any GO or MGI phenotype terms related to placenta, TB cells, TE and the chorion layer are considered as having a “known” role in the placenta. Genes annotated under terms related to embryo are considered as having a “possible” role in the placenta, because embryonic lethal mouse knockout lines frequently have placentation defects, and because defects in placental development can be associated with the development of other embryonic tissues (Brown and Hay 2016; Perez-Garcia et al. 2018; Woods, Perez-garcia, and Hemberger 2018). Hereafter, such genes are referred to as “known/possible genes”. In the e7.5 networks, there were 17 hub genes in which six genes were known/possible. The number of hub genes that are labelled as known/possible is statistically significant when comparing to random gene sets selected from the e7.5 timepoint-specific group (Supplementary Figure S3). In the e8.5 and e9.5 networks, 17 out of 28 and 48 out of 127 hub genes were known/possible, respectively. Similar to e7.5, the number of hub genes labelled as known/possible in e8.5 networks and e9.5 networks were both statistically significant when comparing to random gene sets selected from the corresponding timepoint-specific groups (Supplementary Figure S3). These results indicate that the gene sets we identified are significantly associated with relevant phenotypes in the mouse.”

      For the four genes that we tested in HTR-8/SVneo cells, we also added more information about the current known role of the gene in mouse.

      Added to the discussion section:

      “We identified hub genes and their immediate neighboring genes which could regulate placental development and confirmed the roles of four novel genes (Mtdh, Siah2, Hnrnpk and Ncor2) in regulating cell migration in the HTR-8/SVneo cell line. These genes were selected primarily based on the network analyses, but also based on expression data from human cells to account for possible differences between mouse and human placental gene expression. Previous studies suggested these four candidates are functionally important in mouse. Mtdh has been suggested to regulate cell proliferation in mouse fetal development (Jeon et al. 2010). The Siah gene family is important for several functions (Qi et al. 2013). Of relevance to the placenta, Siah2 is an important regulator of HIF1α during hypoxia both in vitro and in vivo (Qi et al. 2008). Moreover, while Siah2 null mice exhibited normal phenotypes, combined knockouts of Siah2 and Siah1a showed enhanced lethality rates, suggesting the two genes have overlapping modulating roles (Frew et al. 2003). Hnrnpk-/- mice were embryonic lethal, and Hnrnpk+/- mice had dysfunctions in neonatal survival and development (Gallardo et al. 2015) . Ncor2-/- mice were embryonic lethal before e16.5 due to heart defects (Jepsen et al. 2007). According to the International Mouse Phenotyping Consortium database (Dickinson et al. 2016), Ncor2 null mice also showed abnormal placental morphology at e15.5. However, none of these genes have been studied in the context of TB migration.”

      Minor comments:

      1) In the GO analysis, why not use a combination of hypergeometric and binomial distribution for enrichment decisions?

      We used hypergeometric tests as in the default setting of ClusterProfiler. GO enrichment with hypergeometric test for differentially expressed genes was also suggested in Rivals et al., 2007 (Rivals et al. 2007). Combination of hypergeometric and binomial tests will be of great use when carrying out enrichment for cis-regulatory domains where there is a higher chance of sampling a gene randomly (McLean et al. 2010).

      We have added this detail in the method section to make the analysis clearer.

      2) In Figure 2B, are there any genes that are both hub nodes (diamonds) and annotated as having placental functions (squares)? If so, it might be good to show that in some way.

      We agree this is necessary and have altered the presentation in Figure 2. In the revised manuscript, we have added an additional list of hub genes as genes with possible roles. The figure now shows hub genes with known placental functions (diamonds), hub genes with possible functions (hexagons) and hub genes without related annotation (rounded squares). Non-hub genes are now not shown to avoid crowdedness.

      3) It might improve the deconvolution analysis to employ more than one method and recent reports have shown that the cell-type signature data is the most important parameter with the main factors influencing performance being biological (such as where the sample was taken) rather than technical (https://doi.org/10.1038/s41467-022-28655-4).

      We agree the conclusion would have been further confirmed if we could employ another deconvolution method. Upon literature search, we found another tool, CAM (N. Wang et al. 2016), that had similar approaches to LinSeed which aims to infer cell proportions without reference. However, the tool has been taken down from Bioconductor and is not currently maintained. As a result, to the best of our knowledge, LinSeed is the only deconvolution tool that is completely reference-free.

      We also tried carrying out the deconvolution analysis with another method, DSA (Zhong et al. 2013), with a limited number of marker genes obtained through literature review. However, when the marker genes are highly correlated in multiple cell types, the models failed to infer meaningful proportions.

      We acknowledge that we need additional single cell RNA-seq data or marker genes obtained from pure cell populations to make more concrete conclusions for the deconvolution analysis. We hope with future studies, there will be more evidence supporting our observations.

      We have added this acknowledgement in the results section:

      “The identification of these cell groups could have resulted from noise introduced by both biological and technical variation, which is challenging to overcome when using a small sample size or analyzing without prior knowledge in the deconvolution analysis.”

      Added to the discussion section:

      “Nevertheless, we acknowledge that our deconvolution analysis and cell type annotations were limited due to the absence of matching scRNA-seq data, data from pure cell populations, or extensive cell marker lists. As these types of information are available, deconvolution analysis can be used to identify species-specific cell types or correcting for confounding effects prior to DEA (Sutton et al. 2022).”

      4) The above report also shows that there are ways to correct for cell-type composition differences in DEA which might be interesting to look when using bulk data from different timepoints in future studies when focusing on different biological processes and not timepoint-specific transcripts.

      We agree correcting for cell proportion prior to differential expression analysis will be interesting for future studies. When single cell RNA-seq data or more extensive marker gene lists are available, deconvolution analysis will be of great use for this purpose.

      We have added this in the discussion section (also mentioned in point #3):

      “Nevertheless, we acknowledge that our deconvolution analysis and cell type annotations were limited due to the absence of matching scRNA-seq data, data from pure cells, or extensive cell marker lists. As these types of information become more available, deconvolution analysis can be used to identify species-specific cell types or correcting for confounding effects prior to DEA (Sutton et al. 2022).”

      5) Could the authors speculate as to possible reason(s) that an siRNA knockdown would give variable results functionally, while the actual gene expression appears to be consistently and sufficiently downregulated? Did the authors evaluate protein levels following siRNA knockdown?

      Following the reviewer’s comment, we have evaluated protein levels for each target gene and each siRNA. For the genes that gave variable results between siRNAs (MTDH and NCOR2), we did not observe a change in their ability to reduce protein levels (Supplementary Figure S7). It is therefore possible that there are off-target effects for one of the siRNAs. We considered this possibility in designing the project, which is why we tested two siRNAs per target gene. Although siRNA off-target effects may be present, visual inspection of the migration experiments indicate that transfection with each of the siRNAs reduces migration capacity. We have added the possibility of off-target effects in the discussion section:

      “We observed that while all siRNAs were able to decrease cell migration capacity, there was variability in the amount of decrease, even when comparing two siRNAs targeting the same gene. This observation did not seem to be associated with differences in transcript or protein knockdown levels and could be due to different off-target effects for different siRNAs.”

      6) As mentioned in the discussion, finding genes that have timepoint dependent isoforms would an interesting and novel addition to the manuscript.

      Protein isoforms would be interesting to study. Here we focused on different mRNA transcripts. We carried out additional GO analysis on the genes unique to each timepoint and genes shared among timepoints. This was also done in response to major comment 2:

      In the revised manuscript, we added additional analysis for genes that belong to multiple timepoint groups due to different transcripts of the same gene being annotated as timepoint-specific, and genes unique to each timepoint (Added to results section 1):

      “It is possible that timepoint-specific groups share genes that have timepoint-specific transcripts. Indeed, we identified 37 genes shared between e7.5 and e8.5, 5 genes shared between e7.5 and e9.5, and 109 genes shared between e8.5 and e9.5 (Supplementary Table S3). We found that genes only present at one timepoint (timepoint-unique genes) were generally enriched for similar terms as the full group of timepoint-specific genes (Supplementary Table S3). However, terms related to the development of labyrinth layer like “labyrinthine layer morphogenesis” and “labyrinthine layer blood vessel development” were only enriched when using all e8.5-specific genes but not when using e8.5 timepoint-unique genes. Moreover, we found that, unlike genes shared between e9.5 and e7.5, genes shared between e9.5 and e8.5 were enriched for processes such as “blood vessel development” and “insulin receptor signaling pathway”. This observation may indicate that different transcripts of the same genes could be expressed at different timepoints for the continuation of certain biological processes.”

      7) Although outside the scope of this manuscript, it might be interesting to look at the effects of knocking down network genes on the networks themselves and in combination with a phenotypic readout such as a migration assay. With numerous knockouts and migration assay readouts, one could possibly find a better method to rank the genes within the networks.

      We agree with this comment. Upon literature search, we realized this approach has been used in previous studies on other biological contexts such as virus entry (A. Wang et al. 2010; A. Wang, Ren, and Li 2011) and cancer cell growth (Paul et al. 2021). Although these studies used different network inference strategies from ours, their in silico gene knockouts proved to be effective for the candidate selection. However, the knockout process (both computationally and experimentally) may not be trivial; therefore, we agree the approach will be useful for future studies.

      CROSS-CONSULTATION COMMENTS

      I mostly agree with the other two reviewers.

      It is not clear to me that additional KD experiments (i.e. ones that might affect fusion, proliferation, apoptosis), as proposed by Reviewer #3, would be that much more informative. There are many differences between mouse and human placentation, and these model systems (HTR8 and BeWo) are not truly representative of either. The additional data mining/computational work would be more useful and enhance data interpretation.

      Reviewer #2 (Significance (Required)):

      The authors use RNA-seq of mouse placenta at e7.5, e8.5, and e9.5 to show that timepoint-specific expression patterns are highly correlated with certain biological processes and point to the existence of certain cell types in the sample. While focused on early post-implantation mouse placental development, the author's methods could be transferrable to other timepoints, species, and organs. Furthermore, with their method they uncover what appears to be several novel, early placental, developmentally important genes and their results might be of interest to those in the field studying placental development.

      Reviewer #3:

      Summary:

      This paper is an analysis of RNA-seq data from the mouse human placenta at embryonic day from 7.5 to 9.5 days. Bioinformatics was used to pinpoint genes networks, and tentatively connect with human cell populations. Wet experiments were performed on the HTR8/SV neo trophoblast cell model.

      The introduction clearly posits the reasons why mouse models were chosen, and presents some examples of genes that are conserved between human and mouse placentas, before presenting the major steps of mouse placental development at the crucial periods analyzed.

      The results are divided into four parts:

      1. Identification of genes that are specific of fetal tissues at the three days studied
      2. A network analysis of the genes using classical bioinformatics tools (String, Genie3) to identify gene modules
      3. A connection with the human placenta at the level of cell-specific expression profile is then analyzed
      4. A in vitro validation on a trophoblast cell model using siRNA to Knockdown genes identified in the in silico part of the paper. Three clustering methods were used to classify the genes according to their profile (at which time point they have the highest level). The function associated are dispatched into three logical physiological events (7.5: proliferation and ectoplacental cone development, 8.5 attachment of the placenta -chorioallantoidian at this stage- , and 9.5: syncytiotrophoblast constitution and labyrinth development, structures essential for growth and exchange).

      Mostly minor comments:

      Quality of the transcriptomics data: 6 replicates per condition (some being pools at E7.5 and 8.5) is a lot, and I congratulate the authors to have make such effort. This says a lot about the technical quality of their results. Nevertheless, there is no comment on the exclusion of two samples in the further analysis based upon the PCA. Could the authors comment upon the reasons why these two samples behave so differently from the others?

      We thank the reviewer for the comment. We reviewed the RNA concentration and quality prior to sequencing, and did not observe that the outliers were of lower quality. After sequencing, quality control metrics (obtained with FastQC), also did not indicate that the two outliers were of poor quality. Based on the PCA, it is also unlikely that two samples were swapped. One possibility is that the tissues obtained for these samples were diseased in some way. However, this is difficult to confirm, so we did not want to speculate about this in the manuscript. We did exclude the two samples to ensure the accuracy of our downstream analyses.

      Rq: at this stage some statistics of the degree of enrichment in keyword should be provided (such as Enrichment Scores, normalized or not, and False Discovery Rates, to be able to evaluate the actual robustness of the genes network identified. In addition, it seems that the authors supervised the 'keywords' and 'ontologies' toward placental function. A more agnostic approach could be very relevant, such as identifying the ontologies associated to for instance the set of genes that are highest at 8.5 days, by comparing them with preliminary datasets accessible via the GSEA platform of the BROAD institute or similar sites such as Webgestalt. This does not mean that the placental-targeted approach is not useful, but to have a more global overview is in my opinion indispensable.

      We agree and this is a good point. We have now added a stringent approach to determine if the placenta-targeted terms are truly relevant to the gene networks. We performed randomization tests using random gene sets sampled from hierarchical groups of the same time point. These tests showed that the selected terms are significant in the networks when compared to gene groups of the same size from the timepoint specific hierarchical groups (Supplementary Figure S3). Moreover, we have added the specific -log10(q-value) of some highlighted enriched terms in the main text, so together with Figure 2A, the degree of enrichment of these terms can be shown in a clearer way.

      We have added this detail in the result section:

      “Compared to e8.5 and e9.5 networks, e7.5 networks had a higher rank or fold change and were significantly enriched for the GO terms “inflammatory response” (e7.5_1_STRING: -log10(q-value) = 22.82 and e7.5_2_GENIE3: -log10(q-value) = 3.95) and “female pregnancy” (e7.5_2_GENIE3: -log10(q-value) = 4.1) (Figure 2A, Supplementary Table S5). The term “morphogenesis of a branching structure”, which can be expected following chorioallantoic attachment around e8.5, was not enriched at e7.5, but was enriched in multiple e8.5 and e9.5 networks (e8.5_1_STRING: -log10(q-value) = 1.73, e8.5_2_GENIE3: -log10(q-value) = 1.72, e9.5_1_STRING: -log10(q-value) = 4.01, e9.5_1_GENIE3: -log10(q-value) = 1.54, e9.5_2_STRING: -log10(q-value) = 14.33, and e9.5_2_GENIE3: -log10(q-value) = 2.2). After chorioallantoic attachment finishes, nutrient transport is being established. Accordingly, we observed the following enrichments: “endothelial cell proliferation” (highest ranked in e9.5_2_STRING: -log10(q-value) = 15.91), “lipid biosynthetic process” (only significant after e7.5, highest ranked in e9.5_3_STRING: -log10(q-value) = 17.63), “cholesterol metabolic process” (only significant after e7.5, highest ranked in e9.5_2_GENIE3: -log10(q-value) = 2.76 and e9.5_3_STRING: -log10(q-value) = 7.79), and “response to insulin” (only significant after e7.5, highest ranked in e9.5_1_GENIE3: -log10(q-value) = 1.67).”

      “Using randomization tests, we observed the majority of these GO terms (10 out of 11 terms) were significantly enriched when using the network genes but not random gene sets (significance level of 0.05; the term “vasculature development” having p-value = 0.0549 and 0.0575 in with subnetwork e9.5_1_GENIE3 and e9.5_3_GENIE3, respectively) (see Materials and Methods, Supplementary Figure S3). This analysis demonstrates that the network genes were highly relevant to the biological functions of interest. Moreover, the observed GO terms strongly aligned with the processes enriched when using the full lists of timepoint-specific genes (Supplementary Table S3), indicating the representative characteristics of the network genes. While the current analysis focuses on the biological processes related to placental development, there are other terms significantly enriched, which can be found in Supplementary Table S5.”

      This is partially done in the part 2 of the results, but it would be relevant to do it on the group of highly expressed genes and not only on the clusters found by the algorithm of sting and genie3.

      We have added GO analysis for timepoint-specific genes and also observed highly relevant processes being enriched (Supplementary Table S3). This additional analysis has also helped strengthen the relevance of the network genes, as the observed terms with network genes aligned well with the terms enriched with the full lists of genes.

      Rq: in the second part of the results, everything is descriptive but no hierarchy is given to facilitate the understanding and to try to generate a few 'take-home messages' for the reader.

      We agree with the comment and have adjusted the writing accordingly. We have added the following statements in section 2 of the result section:

      “In summary, we identified 18 subnetworks across three timepoints for downstream analyses, some of which were enriched, according to GO analysis and randomization tests, for specific terms relating to placental development (Figure 2A).”

      “These results indicate that the gene sets we identified are functionally relevant in the mouse models.”

      “In summary, we have identified hub genes in networks at each timepoint. Analyzing the annotations of hub genes using the MGI database demonstrated that the hub genes are biologically relevant to mouse development and will be strong candidates for future investigation.”

      The network analysis is well presented in Figure 2. I wonder whether the author could add systematically besides the three examples that are given the network analysis for the other enrichment network that are described (the four at e7.5, the 6 at e8.5 and the 8 at e9.5).

      We have added the additional figures in Supplementary Figure S3.

      The deconvolution of the 3rd part of the results to try to connect the mouse results to the human cell situation is interesting. I suspect that given the terms of the mouse placentas used, it would be relevant to focus on 1st trimester human placental cells.

      The reference dataset we used in the PlacentaCellEnrich analysis was from human 1st trimester placenta samples. For the Placenta Ontology analysis, we were limited to the provided database from (Naismith and Cox 2021); however, it will be interesting to revisit the analysis when the database is extended.

      We have specified that the reference data in PlacentaCellEnrich analysis was from human 1st trimester placenta in the methods section:

      “For PlacentaCellEnrich, cell-type specific groups were based on the single-cell transcriptome data of first trimester human maternal-fetal interface from Vento-Tormo et al.”

      As previously mentioned, this is a highly descriptive paragraph, and two or three sentences at the end of each paragraph of the results would be in my opinion indispensable to present the most important observations of the results in an intelligible way. Overall, the data presented by the authors, are not obviously 'raw data', but an effort of interpretation should be done by the authors to underline the importance of their results, and to stress among these results which are the most important, and which are the most relevant for placental development and human health.

      We agree with the comment and have adjusted the writing accordingly. We have added this summary paragraph at the end of section 3 of the result section:

      “In summary, we have demonstrated that the identification of timepoint-specific gene groups and densely connected network modules can be used to infer the cellular composition of bulk RNA-seq samples. We used independent human datasets from different sources to annotate the cell types in each timepoint’s samples. As a result, from the bulk RNA-seq data we were able to observe that at e7.5 and e8.5, there was a high proportion of different TB populations, whereas at e9.5, the placental tissues consisted of multiple cell types such as TB, endothelial and fibroblast cells.”

      In the last part, which is very important in this type of paper, four genes were selected. A choice of highly expressed genes was made (which can in fact be discussed, some transcriptional factors may have a crucial importance with relatively low levels of expression). The efficiency of the siRNA was overall excellent. The authors showed that each of these siRNA is efficient to inhibit cell migration in the HTR8/SVneo model.

      The migration assays are quantified, but there is a inherent limit of the cell model: the authors analyzed only cell migration, but not other very important parameters. One of them is trophoblast fusion, an issue that can be studied in another trophoblast cell model, the BeWo cells, which are induced to fuse under forskolin. It would be highly relevant to test the siRNA identified in this respect, since fusion is a very conspicuous feature of trophoblast cells in mice as well as in humans. Other relevant endpoints such as proliferation markers, apoptosis markers, oxidative stress markers could be studied in the KD cell models. Alternatively, it would have been interesting to evaluate the overall effect of the siRNA by transcriptomics and check whether the modified gene expression leads to specific profiles characteristic of a certain moment of placental development in mice, or proportion of various cells in the human placentas. Without asking for further experiments the authors should mention these limits in their discussion.

      We completely agree with this comment and are investigating each of our candidate genes in more detail in ongoing studies. As we have already learned that each gene is involved in different processes and pathways, we feel that these studies are out of the scope of the current paper. However, we have added this point to our discussion section:

      “However, more detailed analysis and process specific assays are needed. For example, future studies assessing each gene’s role in cell adhesion, cell-cell fusion, cell proliferation and cell apoptosis can be done to better understand their roles in placental development.”

      In sum, I feel that this paper provides an excellent dataset, but that the authors should make an additional effort of redaction to extract the most important conclusions of their paper. This would increase its impact for a wider public.

      Thank you. We have attempted to do so in the revised version.

      Reviewer #3 (Significance (Required)):

      The context is well introduced, but explanatory and synthesis sentences are missing at the end of each paragraph. I am relatively competent in bioinformatics methods, including deconvolution, and rather expert in cell biology. Therefore I feel comfortable to evaluate this paper.

      References:

      Brown, Laura D., and William W. Hay. 2016. “Impact of Placental Insufficiency on Fetal Skeletal Muscle Growth.” Molecular and cellular endocrinology 435: 69. /pmc/articles/PMC5014698/ (August 24, 2022).

      Cox, Brian et al. 2009. “Comparative Systems Biology of Human and Mouse as a Tool to Guide the Modeling of Human Placental Pathology.” Molecular Systems Biology 5: 279. /pmc/articles/PMC2710868/ (July 20, 2022).

      Dickinson, Mary E. et al. 2016. “High-Throughput Discovery of Novel Developmental Phenotypes.” Nature 2016 537:7621 537(7621): 508–14. https://www.nature.com/articles/nature19356 (July 20, 2022).

      Frew, Ian J. et al. 2003. “Generation and Analysis of Siah2 Mutant Mice.” Molecular and Cellular Biology 23(24): 9150. /pmc/articles/PMC309644/ (July 27, 2022).

      Gallardo, Miguel et al. 2015. “HnRNP K Is a Haploinsufficient Tumor Suppressor That Regulates Proliferation and Differentiation Programs in Hematologic Malignancies.” Cancer Cell 28(4): 486–99. http://www.cell.com/article/S1535610815003050/fulltext (August 24, 2022).

      Gormley, Matthew et al. 2021. “RNA Profiling of Laser Microdissected Human Trophoblast Subtypes at Mid-Gestation Reveals a Role for Cannabinoid Signaling in Invasion.” Development (Cambridge, England) 148(20). https://pubmed.ncbi.nlm.nih.gov/34557907/ (August 15, 2022).

      Guillemot, François et al. 1994. “Essential Role of Mash-2 in Extraembryonic Development.” Nature 371(6495): 333–36. https://www.nature.com/articles/371333a0 (December 21, 2021).

      Hemberger, Myriam, Courtney W. Hanna, and Wendy Dean. 2020. “Mechanisms of Early Placental Development in Mouse and Humans.” Nature Reviews Genetics 21(1): 27–43. http://dx.doi.org/10.1038/s41576-019-0169-4.

      Jeon, Hyun Yong et al. 2010. “Expression Patterns of Astrocyte Elevated Gene-1 (AEG-1) during Development of the Mouse Embryo.” Gene expression patterns : GEP 10(7–8): 361. /pmc/articles/PMC3165053/ (July 27, 2022).

      Jepsen, Kristen et al. 2007. “SMRT-Mediated Repression of an H3K27 Demethylase in Progression from Neural Stem Cell to Neuron.” Nature 450(7168): 415–19. https://www.nature.com/articles/nature06270 (July 27, 2022).

      Kuckenberg, Peter, Caroline Kubaczka, and Hubert Schorle. 2012. “The Role of Transcription Factor Tcfap2c/TFAP2C in Trophectoderm Development.” Reproductive BioMedicine Online 25(1): 12–20. http://www.rbmojournal.com/article/S1472648312001010/fulltext (December 21, 2021).

      McLean, Cory Y. et al. 2010. “GREAT Improves Functional Interpretation of Cis-Regulatory Regions.” Nature Biotechnology 28(5): 495–501. http://dx.doi.org/10.1038/nbt.1630.

      Meinhardt, Gudrun et al. 2020. “Pivotal Role of the Transcriptional Co-Activator YAP in Trophoblast Stemness of the Developing Human Placenta.” Proceedings of the National Academy of Sciences of the United States of America 117(24): 13562–70. https://www.ncbi.nlm.nih.gov/geo/ (April 8, 2022).

      Morey, Robert et al. 2021. “Transcriptomic Drivers of Differentiation, Maturation, and Polyploidy in Human Extravillous Trophoblast.” Frontiers in Cell and Developmental Biology 9: 2269.

      Naismith, Kendra, and Brian Cox. 2021. “Human Placental Gene Sets Improve Analysis of Placental Pathologies and Link Trophoblast and Cancer Invasion Genes.” Placenta 112: 9–15.

      Paul, Abhijit et al. 2021. “Exploring Gene Knockout Strategies to Identify Potential Drug Targets Using Genome-Scale Metabolic Models.” Scientific Reports 2021 11:1 11(1): 1–13. https://www.nature.com/articles/s41598-020-80561-1 (July 27, 2022).

      Perez-Garcia, Vicente et al. 2018. “Placentation Defects Are Highly Prevalent in Embryonic Lethal Mouse Mutants.” Nature 555(7697): 463. /pmc/articles/PMC5866719/ (August 11, 2022).

      Qi, Jianfei et al. 2008. “The Ubiquitin Ligase Siah2 Regulates Tumorigenesis and Metastasis by HIF-Dependent and -Independent Pathways.” Proceedings of the National Academy of Sciences of the United States of America 105(43): 16713. /pmc/articles/PMC2575485/ (September 20, 2021).

      Qi, Jianfei, Hyungsoo Kim, Marzia Scortegagna, and Ze’ev A. Ronai. 2013. “Regulators and Effectors of Siah Ubiquitin Ligases.” Cell biochemistry and biophysics 67(1): 15. /pmc/articles/PMC3758783/ (July 27, 2022).

      Rivals, Isabelle, Lé On Personnaz, Lieng Taing, and Marie-Claude Potier. 2007. “Databases and Ontologies Enrichment or Depletion of a GO Category within a Class of Genes: Which Test?” Bioinformatics 23(4): 401–7.

      Sheridan, Megan A. et al. 2021. “Characterization of Primary Models of Human Trophoblast.” Development (Cambridge, England) 148(21). /pmc/articles/PMC8602945/ (August 15, 2022).

      Si, Yaqing, Peng Liu, Pinghua Li, and Thomas P. Brutnell. 2014. “Model-Based Clustering for RNA-Seq Data.” Bioinformatics 30(2): 197–205. https://academic.oup.com/bioinformatics/article/30/2/197/217752 (July 18, 2022).

      Silva, Juneo F., and Rogéria Serakides. 2016. “Intrauterine Trophoblast Migration: A Comparative View of Humans and Rodents.” Cell Adhesion and Migration 10(1–2): 88–110. http://dx.doi.org/10.1080/19336918.2015.1120397.

      Soncin, Francesca et al. 2018. “Comparative Analysis of Mouse and Human Placentae across Gestation Reveals Species-Specific Regulators of Placental Development.” Development (Cambridge) 145(2).

      Soncin, Francesca, David Natale, and Mana M. Parast. 2015. “Signaling Pathways in Mouse and Human Trophoblast Differentiation: A Comparative Review.” Cellular and Molecular Life Sciences 72(7): 1291–1302.

      Spies, Daniel, Peter F. Renz, Tobias A. Beyer, and Constance Ciaudo. 2019. “Comparative Analysis of Differential Gene Expression Tools for RNA Sequencing Time Course Data.” Briefings in Bioinformatics 20(1): 288. /pmc/articles/PMC6357553/ (July 18, 2022).

      Sun, Huifang et al. 2006. “An E2F Binding-Deficient Rb1 Protein Partially Rescues Developmental Defects Associated with Rb1 Nullizygosity.” Molecular and Cellular Biology 26(4): 1527. /pmc/articles/PMC1367194/ (February 6, 2022).

      Sutton, Gavin J. et al. 2022. “Comprehensive Evaluation of Deconvolution Methods for Human Brain Gene Expression.” Nature Communications 2022 13:1 13(1): 1–18. https://www.nature.com/articles/s41467-022-28655-4 (July 27, 2022).

      Varberg, Kaela M. et al. 2021. “ASCL2 Reciprocally Controls Key Trophoblast Lineage Decisions during Hemochorial Placenta Development.” Proceedings of the National Academy of Sciences of the United States of America 118(10). https://www.pnas.org/content/118/10/e2016517118 (December 21, 2021).

      Wang, Anyou, S. Claiborne Johnston, Joyce Chou, and Deborah Dean. 2010. “A Systemic Network for Chlamydia Pneumoniae Entry into Human Cells.” Journal of Bacteriology 192(11): 2809–15. https://journals.asm.org/doi/10.1128/JB.01462-09 (July 27, 2022).

      Wang, Anyou, Li Ren, and Hong Li. 2011. “A Systemic Network Triggered by Human Cytomegalovirus Entry.” Advances in Virology 2011.

      Wang, Niya et al. 2016. “Mathematical Modelling of Transcriptional Heterogeneity Identifies Novel Markers and Subpopulations in Complex Tissues.” Scientific Reports 2016 6:1 6(1): 1–12. https://www.nature.com/articles/srep18909 (July 27, 2022).

      Watson, Erica D., and James C. Cross. 2005. “Development of Structures and Transport Functions in the Mouse Placenta.” Physiology 20(3): 180–93.

      Woods, Laura, Vicente Perez-garcia, and Myriam Hemberger. 2018. “Regulation of Placental Development and Its Impact on Fetal Growth — New Insights From Mouse Models.” Frontiers in Endocrinology 9(September): 1–18.

      Zhong, Yi et al. 2013. “Digital Sorting of Complex Tissues for Cell Type-Specific Gene Expression Profiles.” BMC Bioinformatics 14(1): 1–10. https://bmcbioinformatics.biomedcentral.com/articles/10.1186/1471-2105-14-89 (July 27, 2022).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Review of "Identifying novel regulators of placental development using time series transcriptomic data and network analyses."

      The authors present a detailed bioinformatic assessment of mouse developmental time series of the placenta. They apply current data mining and analysis methods to identify protein-centred networks that are likely enriched to specific cell types of the placenta. They then translate these findings to humans using statistical comparisons of human single-cell sequencing data of the placenta. Lastly, they use knock-down experiments to validate the conserved functional importance of the hub genes in the mouse protein networks in human cells. The strengths of this paper are the rigorous data mining methods and the functional translation to humans from mice. There are no critical weaknesses to the article. There is a blend of statistical analysis with anecdotal or hand curation from databases and the literature, but it is unclear if these curated finings are circumstantial or statistically meaningful. In the end, the hypothesis seems to hold in that 4/4 gene knocked down in the human cells gave a migration phenotype.

      Comments, questions, critique

      1. Given the translational aims of the paper, more introduction/discussion material on the comparative aspects of mice and humans are needed. Are giant cells and EVT the same? What are the cell equivalents that you are discovering? The Soncin et al. paper is cited, but I think underused. This publication contains time series data on mice and humans and could be used as external validation of clusters, networks, and other analyses. Other publications to consider for context are

      a) Cox B, et al. Mol Syst Biol 5: 279.

      b) Silva JF, Serakides R. 2016. Cell Adhes Migr 10: 88-110. (specifically discusses migration difference between the species placenta)

      1. Clustering represented in Figure 1B, was this a supervised model? Why only three clusters?) Did you specify that there would be three models and force each gene profile into one of the categories? How robust are the fits? A fitted model might be a better approach as you can specify the ideal models (early high, late high and mid-high), then determine each gene profile that fits each model and only assess those genes with a significant fit to the model. Forcing clustering to the three-model fit likely gives many poorly fitting profiles. While in the end, this works out, it may be due to applying other post hoc methods for gene enrichment, where noise distributes randomly.

      2. Several statements are made about the conservation of importance between mouse and human hub genes. For example, "We predict these highly expressed genes to be generally important for TB function and processes such as cell migration, a term associated with multiple timepoint specific networks (Figure 2A)." While your knock-down assay of migration results shows these hub genes to be necessary to humans, what do they mean to the mouse? You did not use mouse TSC to assess functional importance concurrently. You note a small number of genes as of known importance, "127 hub genes of which 16 have been annotated as having a role in placental development". Were the others knocked out but lack a developmental phenotype or not assessed? Are these functionally redundant in the mouse or not involved in the same processes between the species?

      3. In determining conservation between mouse and human networks, were only 1:1 orthologs examined or did you consider more complex 1:many mapping conditions between the two species?

      4. Should the migration assay be normalized to survival/adhesion? If 70,000 cells were seeded but had 50% cell death (or reduced adhesion), then it may appear to be poor migration. Should the migration be evaluated as a ratio of top to bottom cell densities to control for poor adhesion or survival?

      Significance

      This significantly advances previous publications on this topic by functionally testing the discovered genes.

      This highlights an excellent data mining strategy for a developmental disease using mice and translating to humans.

      The audience is likely developmental biologists and reproductive specialists.

      My expertise is bioinformatics and developmental biology.

    1. But it’s with these weather worries that these manipulative scientists really give the game away. Urging us to use more wind power but complaining about all the hurricanes we keep having? They got us all to convert to solar power decades ago but keep whining about prolonged sunny spells? MAKE YOUR MINDS UP! Some of them even go so far as to say it’s climate change that’s causing forced migration of millions of people. But that’s clearly because everyone has solar cars and jetpacks and matter transporters now, so why would they stay in one place, with or without devastating environmental damage spurring them on. It’s all a bit convenient, isn’t it, all this palaver over climate change? Weird how 99.9999% of all scientists purportedly agree that it’s definitely happening and our most powerful quantum computers are certain to over a million decimal places that it’s our fault? Weird how they’re saying this now, at exactly the same time when they need all the volunteers they can get for the moon and Mars colonies. What’s more likely; that human industrial activity actually does lead to climate change, or that it’s all a massive meticulous centuries-long ruse to convince people that leaving Earth is a good idea? Obviously, it’s the latter. These scientists have no shame or respect. I can’t say I’m not tempted to go myself, though. I’d rather live on another planet, than on one where every aspect of your life is subject to rigorous scientific control. Nobody should have to put up with that crap.

      Overall it seems that climate change has affected the author or they are worried for others but others may say different. I truly don't think humans take full accountability for this but may play some part in it.

    1. An epistemic bubble is a social structure where insiders aren’t exposed to views on the outside. Despite the superficial similarity, epistemic bubbles and echo chambers work through entirely different mechanisms. In an echo chamber, inside members may have plenty of exposure to outside views, but outside voices have been undermined. Epistemic bubbles are structures of bad connectivity; echo chambers are structures of manipulated credence. In an epistemic bubble, outside voices aren’t heard; in an echo chamber, outside voices have been systematically discredited. Importantly, I’ve argued, many communities with problematic belief systems have been misdiagnosed as epistemic bubbles. But actually, they are mostly the result of echo chambers. It isn’t that climate change deniers, for example, are simply unaware of what climate change scientist think, or the standard publicly available arguments for climate change. They are, for the most part, quite well acquainted with those arguments and conclusions. It is that they think that the institutions of climate change science have been systematically corrupted and are untrustworthy. This helps to explain the intractability of climate change denialists. Since an epistemic bubble works through simply omitting outside voices, we should be able to shatter one simply by exposing an insider to more voices and more viewpoints. We should expect epistemic bubbles to go down with the first contact with the missing evidence. But echo chamber members are pre-prepared for encounters with external viewpoints and armed with explanatory mechanisms to dismiss those other voices. Echo chambers are far more robust.

      Epistemic bubble vs. echo chamber

    1. His excursions may be more enjoyable if he can reacquire the privilege of forgetting the manifold things he does not need to have immediately at hand, with some assurance that he can find them again if they prove important.

      hey that's what I said at the beginning of this piece

    2. if the user inserted 5000 pages of material a day it would take him hundreds of years to fill the repository, so he can be profligate and enter material freely.

      It's interesting that this machine focuses on retrieval of a person's personal memories, whereas we're more concerned with retrieving other people's ideas from the internet and from archives

    3. With machines for advanced analysis no such situation existed; for there was and is no extensive market

      machines for advanced analysis forced their way into the extensive market by becoming more familiar and user-friendly, but they are still essentially the same machine

    4. relegated to the machine.

      this is interesting in thinking about current ideas about what should be relegated to machines-- thinking about the debates around whether or not AI can really create art

    5. Mere compression, of course, is not enough; one needs not only to make and store a record but also be able to consult it

      now our problem is navigating the sheer amount of things compressed and actually being able to effectively consult them

    6. The investigator is staggered by the findings and conclusions of thousands of other workers—conclusions which he cannot find time to grasp, much less to remember, as they appear. Yet specialization becomes increasingly necessary for progress, and the effort to bridge between disciplines is correspondingly superficial.

      reminds me of our discussion about the move away from the 'solitary genius' of western tradition to a more collaborative approach towards knowledge-building in which everyone specializes in something different

    7. burying their old professional competition in the demand of a common cause, have shared greatly and learned much. It has been exhilarating to work in effective partnership.

      interesting that Bush characterizes the war as little more than an 'exhilarating' blip in the careers of professional scientists

    8. It is an enlarged intimate supplement to his memory.”

      This is a really interesting way to think about smartphones/tablets; there's less of a burden on us to carry a ton much information in our minds because it's so easy to pull out our phones and reference almost anything in the world.

    1. Web technologies do give us access to a larger society than is possible in face-to-face interaction, but over a century ago, a prominent author pointed out the double-edged nature of big societies as follows:

      I read online somewhere that humans were originally only supposed to live in small groups of 25 people or less, and that we are not built to handle the thousands of people we see on social media in a day. I do not know if the first part is accurate, but I believe we are no meant to see as much as we do. While we can interact much more than just face to face allows, we don't need to, and it may be too much. The amount of tragedy we witness everyday online certainly isn't healthy and I don't think our brains are supposed to handle all of that at once. So I agree that access to a larger society is definitely a double-edged sword.

    2. The tracks that you leave online are sometimes referred to as digital footprint, and they include your profile information, things you post, what you share, who you follow, what you like, etc. A majority of employers now will do some level of web searching (either via search engines or social media sites) to check on the digital footprints of people they are considering hiring. This means that people will be searching for you, and what they find may have an impact on your professional life.

      I remember in middle school every year for our digital literacy class we had to do a lesson about our digital footprint. And it really got instilled in us that you need to be careful what you put online because it will never go away. I remember we would be told stories of people losing their job because of something they posted about years before and even if they didn't use that platform anymore they got fired because their company didn't want to be associated with someone like that. I think this idea should be instilled into anyone who uses the internet because it is really helpful.

    1. Author Response

      Reviewer #2 (Public Review):

      Klein et al. have developed a high-throughput tracker to evaluate operant conditioning in Drosophila larvae. Employing this device, they train larvae to prefer bending towards one specific side (left or right), by using as unconditioned stimulus (US) the optogenetic activation of dopaminergic and serotoninergic neurons, demonstrating that larvae are able to perform this behaviour. Furthermore, they show that serotoninergic neurons alone are sufficient to mediate the reward signal, and that specifically serotoninergic neurons in the VNC are required for this behaviour. However, they do not show whether serotoninergic VNC neurons are sufficient. The results are interesting and novel. Operant conditioning had been shown for Drosophila adult. Furthermore, the existence of VNC circuits sufficient for operant conditioning had been shown for other species, as the authors point out in the discussion. Nonetheless, the genetic dissection to identify serotonine expressing neurons as mediators of operant conditioning in the Drosophila larva, and the identification of VNC serotonine cells as necessary are new. Furthermore, given the experimental advantages of the Drosophila larva, including genetic accessibility and a full connectome, the findings open the door to future research into the circuit mechanisms of operant conditioning. I have some comments that I think would be important to address.

      The high-throughput tracker is impressive. However, there is no sufficient documentation to ensure that an expert would be able to easily reproduce it. All of the hardware assembly files, the list of materials, as well as the electronic circuit maps and all of the required software needs to be appropriately documented and uploaded onto a public repository. This is a basic requirement when publishing new hardware/software, particularly in an open journal such as eLife.

      We have now included all the documentation and CAD files for the high-throughput tracker. The software is publicly available in the following Github repository (https://github.com/ZlaticLab/multi-larva-tracker-scripts-public). The CAD files are available in the Supplementary materials of the paper.

      • The differences observed in the results of operant conditioning are very subtle (see for example figure 3c), which means that it is extremely important that statistic analyses are correctly made. The sample number (n) for these experiments is really high (n>100) and for what I understood is not equivalent to the number of animals, because the same animal can generate n >1, eg. n = 2 or n =3 if it collides one or two times, as each time it collides a new identity is given to the larvae. This means that the datapoints collected are not independent, and I think in that case a Wilcoxon rank-sum test is not the appropriate test to take. I recommend the authors and eLife editors to consult with an expert in this type of statistics. Alternatively, the authors could, for each experiment, take into account only the data from larvae that did not collide, and for those that collide only take into account the data before the collision. This can be calculated easily as they just need to exclude from their analysis in each experiment all of the larval IDs where the ID is larger than the initial number of larvae identified by the software.

      We apologise if we did not clarify sufficiently that we only took into account (for each time bin) larvae that did not collide. Within the Materials and methods, we describe how objects retained for analysis had to satisfy several criteria. The first criterion is that the object needed to be detected in every frame of the given 60 s bin. In this way, the object identity is stable throughout the bin - a reflection that the object did not collide with another object. In other words, within a single time bin, the same animal only contributes once. Text has been added to the Materials and methods to clarify that this first criterion is selecting for larvae that did not collide.

      The reviewer mentions that Wilcoxon rank-sum test is not the appropriate nonparametric test for dependent samples. We agree. In accordance with this, the test used for within-bin comparisons was Wilcoxon signed-rank, which is also nonparametric but is for dependent samples. We believe, then, that there is no need to reconsider the statistical tests used.

      -The finding that serotoninergic neurons in the VNC, which with the line they used amount to only 2 neurons per VNC hemisegment, are required for operant conditioning is very interesting. It would be great if they could also test whether they are sufficient. It seems that they would just need to make two split Gal4 lines one for tsh and one for tph, so the experiment does not seem too difficult and would significantly add to their findings.

      Generating new intersections is beyond the scope of this already large study which has been significantly impacted by the pandemic. We have therefore added the following sections below explaining that we have identified candidate serotonergic neurons that are required for operant learning and that identifying specific single neuron types that may be sufficient would be an exciting avenue for future follow-up work.

      In the Results section entitled, “Serotonergic VNC neurons may play role in operant conditioning of bend direction” we have added:

      “The Tph-Gal4 expression pattern contains two neurons per VNC hemisegment (with the exception of a single neuron in each A8 abdominal hemisegment, Huser2012). Future experiments exclusively targeting a single serotonergic neuron per VNC hemisegment could be valuable in determining whether they are sufficient for operant learning.”

      In the Discussion section entitled: “Automated operant conditioning of Drosophila larvae”

      “Furthermore, developing sparser lines that target single serotonergic and dopaminergic neuron types will enable the identification of the smallest subsets of neurons that are sufficient for providing the operant learning signal. Behavioural experiments with these genetic lines may have the added benefit of mitigating conflicting or non-specific reinforcement signalling.”

    1. Author Response

      Reviewer #1 (Public Review):

      The manuscript is clear and well-written and provides a novel and interesting explanation of different illusions in visual numerosity perception. However, the model used in the manuscript is very similar to Dehaene and Changeux (1993) and the manuscript does not clearly identify novel computational principles underlying the number sense, as the title would suggest. Thus, while we were all enthusiastic about the topic and the overall findings, the paper currently reads as a bit of a replication of the influential Dehaene & Changeux (1993)-model, and the authors need to do more to compare/contrast to bring out the main results that they think are novel.

      Major concerns:

      1) The model presented in the current manuscript is very similar to the Dehaene and Changeux 1993 model. The main difference is in the implementation of lateral inhibition in the DoG layer where the 1993 model used a recurrent implementation, and the current model uses divisive normalization (see minor concern #1). The lateral inhibition was also identified as a critical component of numerosity estimation in the 1993 model, so the novelty in elucidating the computational principles underlying the number sense in the current manuscript is not evident.

      If the authors hypothesize that the particular implementation of lateral inhibition used here is more relevant and critical for the number sense than the forms used in previous work (e.g., the recurrent implementation of the 1993 model or the local response normalization of the more recent models), then a direct comparison of the effects of the different forms is necessary to show this. If not, then the focus of the manuscript should be shifted (e.g., changing the title) to the novel aspects of the manuscript such as the use of the model to explain various visual illusions and adaptation and context effects.

      Thank you for bringing up these issues. We acknowledge that there was a lack of clear explanations for the key differences between the proposed model and that of Dehaene & Changeux (hereafter D&C). Please see our revisions below where we: 1) explain the D&C model and its limitations in more in detail; 2) our critical changes to the D&C model; and 3) how those critical changes allow a novel way to explain numerosity perception.

      The paragraph in the Introduction where we first introduce D&C is modified to read:

      “The computational model of Dehaene and Changeux (1993) explains numerosity detection based on several neurocomputational principles. That model (hereafter D&C) assumes a one-dimensional linear retina (each dot is a line segment), and responses are normalized across dot size via a convolution layer that represents combinations of two attributes: 1) dot size, as captured by difference-of-Gaussian contrast filters of different widths; and 2) location, by centering filters at different positions. In the convolution layer, the filter that matches the size of each dot dominates the neuronal activity at the location of the dot owing to a winner-take-all lateral inhibition process. To indicate numerosity, a summation layer pools the total activity over all the units in the convolution layer. While the D&C model provided a proof of concept for numerosity detection, it has several limitations as outlined in the discussion. Of these, the most notable is that strong winner-take-all in the convolution layer discretizes visual information (e.g., discrete locations and discrete sizes yielding a literal count of dots), which is implausible for early vision. As a result, the output of the model is completely insensitive to anything other than number in all situations, which is inconsistent with empirical data (Park et al., 2021).”

      The revised Discussion describes our critical modifications to D&C and their consequences.

      “At first blush, the current model might be considered an extension of Dehaene and Changeux (1993). However, there are four ways in which the current model differs qualitatively from the D&C model. First, the D&C model is one-dimensional, simulating a linear retina, whereas we model a two-dimensional retina feeding into center-surround filters, allowing application to the two-dimensional images used in numerosity experiments (Fig. 1A). Second, extreme winner-take-all normalization in the convolution layer of the D&C model implausibly limits visual precision by discretizing the visual response. For example, the convolution layer in the D&C model only knows which of 9 possible sizes and 50 possible locations occurred. In contrast, by using divisive normalization in the current model, each dot produces activity at many locations and many filter sizes despite normalization, and a population could be used to determine exact location and size. Third, extreme winner-take-all normalization also eliminates all information other than dot size and location. By using divisive normalization, the current model represents other attributes such edges and groupings of dots (Fig. 1B) and these other attributes provide a different explanation of number sensitivity as compared to D&C. For example, the D&C model as applied to the spacing effect between two small dots (Fig. 4A) would represent the dots as existing discretely at two close locations versus two far locations, with the total summed response being two in either case. In contrast, the current model gives the same total response for a different reason. Although the small filters are less active for closely spaced dots, the closely spaced dots look like a group as captured by a larger filter, with this addition for the larger filter offsetting the loss for the smaller filter. Similarly, as applied to the dot size effect (Fig. 4B), the D&C model would only represent the larger dots using larger filters. In contrast, the current model represents larger dots with larger filters and with smaller filters that capture the edges of the larger dots, and yet the summed response remains the same in each case owing to divisive normalization (again, there are offsetting factors across different filter sizes). The final difference is that the D&C model does not include temporal normalization, which we show to be critical for explaining adaptation and context effects.”

      In sum, the current model explains a wider range of effects by using representations and processes that more closely reflect early vision. The change to two-dimensions allows application to real images. The inclusion of temporal normalization allows application to temporal effects. The change from winner-take-all to divisive normalization might appear to be a parameter setting, but it’s one that produces qualitatively different results and explanations (e.g., representations of edges and groupings that are part of the explanation of selective sensitivity to number). These behaviors are consistent with empirical data and are qualitatively different from that of the D&C model. Now that we’ve highlighted the ways in which this model differs qualitatively from the D&C model, we hope that our original title still works.

      Reviewer #2 (Public Review):

      This is a very interesting and novel model of numerosity perception, based on known computational principles of the visual system: center-surround mechanisms at various scales, combined with divisive normalization (over space and time). The model explains, at least qualitatively, several of the important aspects of numerosity perception.

      Firstly, the model makes major and minor predictions. Major: the effect of adaptation, at least 30%, as well as impendence of several densities and dot size; minor: tiny effects like irregularity, around 6%. I think it would make sense to separate these. To my knowledge, it is the first to account for adaptation, which was the major effect that brought numerosity into the realm of psychophysics: and it explains it effortlessly, using an intrinsic component of the model (divisive normalization), not with an ad-hoc add-on. This should be highlighted more. And perhaps, the fit can be more quantitative. Murphy and Burr (who they cite) showed that the adaptation is rapid. How does this fit the model? Very well, I would have thought.

      Thanks for the positive evaluation of our work. In the revised manuscript, we followed the reviewer’s suggestion to highlight the novelty of the model in its explanation of numerosity adaptation. As the reviewer says, one significant aspect of our work is that the model can explain a relatively large effect of numerosity adaptation with minimal effort. To be clear, even though we call it “numerosity” adaptation, the model does not know number in any explicit way. One way to highlight this aspect, we thought, is to compare the current adaptation results to a simulation where the adaptor and target are defined along the dimensions of size or spacing. In such cases (which are now reported in Fig. S6 and S7), no reliable under- or over-estimation was observed. These results suggest that numerosity adaptation is a natural byproduct of divisive normalization working across space and time.

      The question about the rapidity of adaptation is indeed an interesting one. However, the current model is not designed to simulate the effect of exposure duration on neural activity. More specifically, the current model operates across trials and stimuli (e.g., one response per stimulus), using a single parameter that captures the temporal gradient of divisive normalization from prior trials (e.g., the influence of two trials ago as compared to one trial ago). As currently formulated, the model does not address adaptation at the level of milliseconds, as would be necessary to model adaptor duration. To model adaptation at the millisecond level requires a dynamic model that not only specifies the rate of adaptation but also the rate of recovery from adaptation, such as in the visual orientation adaptation model of Jacob, Potter, and Huber (2021), which includes the dynamics of synaptic depression and synaptic recovery. In future work we hope to make such modifications to the model to expand the range of explained effects. Nevertheless, a dynamic version of the model should encompass this simpler trial-by-trial version of the model as a special case. Our goal in this study was a clear demonstration of the neural mechanisms underlying numerosity in early vision and so we have attempted to keep the model as simple as possible while still capturing neural behavior.

      We have elected not to fit data and instead we explored the behavior model in a qualitative way, asking whether the commonly observed numerosity effects emerge from the model in the qualitatively correct direction regardless of its parameter values (e.g., as reported in Fig S2). This approach follows from our central aim, which is to explain the neurocomputational principles of the number sense rather than produce a detailed model with specific parameters values fit to data. Our aim was to show that the correct qualitative behaviors naturally emerge from these principles without requiring specific parameter values (and more importantly, to show how these behaviors emerge from these principles).

      Jacob, L. P., Potter, K. W., & Huber, D. E. (2021). A neural habituation account of the negative compatibility effect. Journal of Experimental Psychology: General, 150(12), 2567.

      Among the tiny predicted effects (visually indistinguishable bar graphs) is the connectedness effect. But this is in fact large, up to 20%. I would say they fail here, by predicting only 6%. And I would say this is to be expected, as the illusion relies on higher-order properties (grouping), which would not immediately result from normalization. Furthermore, the illusion varies with individual personality traits (Pomè et al, JAD, 2021). The fact that it works with very thin lines suggests that it is not the physical energy of the lines that normalizes, but the perceptual grouping effect. I would either drop it, or give it as an example of where the predictions are in the right direction, but clearly fall short quantitatively. No shame in saying that they cannot explain everything with low-level mechanisms. A future revised model could incorporate grouping phenomena.

      Thank you for the suggestion. We agree that trying to explain the connectedness illusion with center-surround filters is not ideal. As the reviewer says, the main driver of the connectedness illusion is likely to be groupings of dots. The current model captures groupings of dots, but it does so in a circularly symmetric way, which is not ideal for capturing the oblong groupings (barbells) that are likely to play a role in the connectedness illusion. It is probably because of this mismatch (between the shape of the groupings and shape of the filters) that the model produces a smaller magnitude connectedness illusion. If the model included a subsequent convolution layer in which the filters were oriented lines of different sizes, it would likely produce a larger connectedness illusion. Following the reviewer’s suggestion, we have placed the connectedness illusion in the supplementary materials and only refer to this in the future directions section of the discussion, writing:

      “Another line of possible future work concerns divisive normalization in higher cortical levels involving neurons with more complex receptive fields. While the current normalization model with center-surround filters successfully explained visual illusions caused by regularity, grouping, and heterogeneity, other numerosity phenomena such as topological invariants and statistical pairing (He et al., 2015; Zhao and Yu, 2016) may require the action of neurons with receptive fields that are more complex than center-surround filters. For example, another well-known visual illusion is the effect of connectedness, whereby an array with dots connected pairwise with thin lines is underestimated (by up to 20%) compared to the same array without the lines connected (Franconeri et al., 2009). This underestimation effect likely arises from barbell-shaped pairwise groupings of dots, rather than the circularly symmetric groupings of dots that are captured with center-surround filters. Nonetheless, a small magnitude (6%) connectedness illusion emerges with center-surround filters (Fig. S10). Augmenting the current model with a subsequent convolution layer containing oriented line filters and oriented normalization neighborhoods of different sizes might increase the predicted magnitude of the illusion.”

      In short, I like the model very much, but think the manuscript could be packaged better. Bring out the large effects more, especially those that have never been explained previously (like adaptation). And try to be more quantitative.

      Thank you. We now highlight the novel computational demonstrations of adaptation to a greater degree and—as also suggested by Reviewer 1—provide more quantitative reports of the illusory effects that the model naturally produces.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      In this study we reveal that, in both mice and humans, the metabolic benefits of caloric restriction (CR) are sex- and age-dependent. Through a systematic review of the literature, we show that sex differences have been largely overlooked by previous CR research, a finding that Reviewer 1 highlights as “an important point”. Our results have critical implications for understanding the fundamental biology linking diet and health outcomes, as well as translational strategies to leverage the therapeutic benefits of CR in humans.

      We thank the reviewers for their helpful appraisal of our manuscript, which Reviewer 2 highlights as “a very well written paper”. Reviewer 1 emphasised the translational relevance of our findings and commented on the “systematic” nature of our study. They noted that it “was well performed”, ”is a valuable and important contribution to the field”, and “will elicit great interest in the scientific and public readership.” Indeed, the importance of sex as a biological variable is the focus of a September 2022 news feature in Nature (https://www.nature.com/articles/d41586-022-02919-x), underscoring the timeliness and relevance of our findings. Our response to the reviewers comments is outlined below, including the changes we have incorporated in a revised version of our manuscript.

      Reviewer 1 – Major Comments:

      __A) The clinical part is definitely the weak spot in the study. I don't think that the data should be omitted, but the authors should be very careful in interpreting the data. Obvious limitations apply to this part, which need to be more directly addressed in the abstract and discussion. It feels like the data from the small-scale clinical trial is exaggerated. __The clinical study was conducted by Prof Alex Johnstone’s group at the Rowett Institute of Nutrition and Health, University of Aberdeen. Her group are experts in the study of dietary interventions for weight loss. The study was conducted to a high standard and therefore we have the utmost confidence in the conclusions drawn from our analysis of this data.

      As we discuss in response to the reviewer’s other points below, the clinical study was primarily designed to address other outcomes and we analysed the data retrospectively to investigate if sex and age affect CR-induced weight and fat loss. This explains some of the limitations that the reviewer mentions, e.g. the relatively low numbers of younger males, and the focus on overweight and obese subjects. As requested, we have now addressed these limitations as follows:

      1. Updated the abstract to clarify that the data are from overweight and obese subjects.
      2. Updated the results to emphasise that we did a retrospective analysis of CR in overweight and obese subjects (lines 396-398).
      3. Performed an additional ANCOVA analysis to test if baseline adiposity or BMI contribute to the sex differences in body mass, fat mass or fat-free mass (new Supplementary Figure 11); see Reviewer 1 Major Point D below.
      4. Updated the ‘Limitations’ section of the Discussion to highlight the retrospective nature of the human study (lines 746-748).
      5. Updated the Methods to again clarify the retrospective nature of the analysis (lines 884-885). __B) It is important to mention in the abstract and the discussion that the human data came from obese participants. This might well influence the findings from human data. __The human subjects were overweight or obese; this was previously stated in the methods section (line 885) and in the discussion (lines 509-511). To further clarify this, we now also mention it in the Abstract (lines 52-53) and have reiterated it in the Discussion (line 744). Importantly, the fact that humans still show age-dependent sex differences in fat loss, even when overweight and obese, supports our conclusion that this age effect in mice is not simply a consequence of aged mice being fatter than younger mice. We refer to this as the ‘baseline adiposity’ hypothesis (lines 500-518 of the Discussion). In response to point D below, we have also analysed if the loss of fat mass or fat-free mass is influenced by adiposity or BMI at baseline (pre-CR). Our analyses show that neither of these parameters explain the sex differences in loss of fat mass or fat-free mass (see new Supplementary Figure 11).

      __C) It is very important to calculate the % calorie restriction of the human participants achieved throughout the CR study. This is crucial information to compare it to other studies. __We have updated the Methods (lines 906-909) to explain the basis for the weight loss diet, as follows: “Participants had their basal energy requirements determined and each participant was then fed an individualised diet with a caloric content equivalent to 100% of their resting metabolic rate (Table 3). This approach was taken to standardise the diet to account for individual energy requirements and energy restriction.” We have also updated Table 3 to show the caloric intake for males and females. Note that RMR accounts for ~60-70% of total daily energy expenditure (TDEE) in adults (Martin et al., 2022), so the diet in our study would give a daily caloric deficit of around 30-40% from baseline TDEE.

      __D) Since there is quite a wide range in the BMIs of the participants, can the authors also stratify against BMI? __We have done this against both baseline BMI and against baseline fat mass (the latter to further test the ‘baseline adiposity’ hypothesis). We present this data in an updated Supplementary Figure 11. We find that, in males but not in females, baseline BMI or fat mass are significantly associated with the changes in fat mass or fat-free mass: surprisingly, individuals with higher baseline fat mass or BMI show less fat loss and a greater loss of fat-free mass during CR. Importantly, males and females do not significantly differ in the relationships between baseline fat mass (or BMI) and loss of fat mass or fat-free mass. This further supports our conclusion that the sex differences in fat loss are unrelated to differences in baseline adiposity. We report this in lines 409-411 of the Results and lines 513-515 of the Discussion.

      __E) There is no mention of any pre-study registration online of the clinical part (e.g. _gov_). Was this done? __This study was done before pre-registration was a requirement for clinical trials. We retrospectively analysed the study data to investigate if sex and/or age influence the outcomes. In the updated manuscript we now state this on lines 884-885 of the Methods, as well as in the Results (line 396) and Discussion (lines 746-748).

      __F) In the methods section the authors write "Participants were informed that the study was funded by an external commercial sponsor...". This is important information, and this is not mentioned anywhere else in the paper. Can the authors clarify this point? A commercial sponsor would, in my view, qualify for a conflict of interest that needs to be mentioned. __We have updated the Declaration of Interests section to clarify this as follows: “The human weight loss study was funded by a food retailer; however, the company had no role in the data analysis, interpretation or conclusions presented in this paper.”

      __G) How did the authors determine the group sizes for the clinical part? I have some doubts about the sub-group sizes. It would be valuable information if the authors had a statistical analysis plan prior conducting the study. It appears a bit, like the sub-groups were chosen at random, to match findings of the mouse data. Otherwise, there should have been a better allocation within the sub-groups (especially age). __We agree that larger group sizes would have been preferable. This limitation reflects that the study was not originally designed to test age and sex effects on CR outcomes, but instead was analysed retrospectively to investigate the impact of these variables. As mentioned above, we have updated the text of the manuscript to highlight the retrospective nature of the analyses. In the Discussion, under ‘Limitations’, we also highlight the fact that relatively few younger subjects are included in the human study (lines 744-745).

      __H) *There's a big problem with the age stratification of the male participants in the clinical data. If I'm correct there are only 5 males 45 groupings.

      __I) The applied protocol for CR in mice is known to provoke long fasting phases and probably elicits some effects through fasting alone, rather than the caloric deficit. There are some papers out addressing this (e.g. by deCabo, Lamming). The authors should not dismiss this fact and at least address it in their discussion. Also, given this fact, it would be thoughtful to include a database-search - not only regarding CR - but also regarding various types of intermittent fasting protocols in humans and animal studies (similar to what the authors did in the supplemental figure). __We agree on the importance of highlighting recent studies demonstrating that prolonged daily fasting contributes to the outcomes of typical ‘single-ration’ CR protocols. We have added a new paragraph to the Discussion to address this (Lines 710-719).

      Regarding the second point, we feel that including a new literature search that addresses not only CR, but also intermittent fasting, is beyond the scope of the current manuscript. However, this is a very good idea and would be worth addressing in a future standalone review article. We have also updated our source data to include all data from our literature reviews, to help if other researchers wish to analyse according to fasting duration or other variables.

      __J) Did the authors monitor the eating time of the mice? __We have since done this in new cohorts of mice fed using the same CR protocol. We find that the mice consume their food within 2-3 hours, consistent with other CR studies. We have now mentioned this in the Methods section (lines 867-868).

      __K) While CR certainly has a lot of health benefits in rodents and humans, it should be advised to raise the cautious note that it may not be beneficial for everyone in the general population. For some groups of people and in some cases (e.g. infectious diseases, pregnancy) even CR with adequate nutritional intake of micro/macronutrients might be disadvantageous. This should be mentioned clearly, as the topic gets more and more "hyped" in public media and online. __We now highlight this important point in the opening paragraph of the introduction (lines 65-67).

      __L) There is no indication of how the authors dealt with missing data. Statistically this can be very important, especially in cases with a low number of data points. __In the Methods section we previously explained (lines 846-848) that “Mice were excluded from the final analysis only if there were confounding technical issues or pathologies discovered at necropsy.” No data had to be excluded from our human study and we have now stated this in the Methods (lines 897-898). For analyses involving paired or repeated-measures data (e.g. time courses of body mass or blood glucose), if data points were missing or had to be excluded for some mice then we used mixed models for the statistical analysis. We have now updated this information in the ‘Statistical analysis’ section of the Methods (lines 1047-1048). Because of the large numbers of mice used in our studies, analyses remain sufficiently well powered even if some data points were missing or had to be excluded.

      __M) Key data from qPCR should be followed up by western blots or other means. If this was done and there was no effect, the authors should report this. Also, is there any evidence or the possibility to support these findings regarding pck1 and ppara in human samples? __As requested, we will next use Western Blotting to assess the expression of proteins encoded by the transcripts that show sex and/or diet differences within the liver (Fig. 6A). These data will be reported in our fully revised manuscript.

      Regarding effects of CR on PCK1 and PPARA expression in human liver samples, no human CR studies have taken liver biopsies for downstream molecular analysis. Recent studies of the GTEx database confirm that hepatic gene expression in humans is highly sexually dimorphic (Oliva et al., 2020). We checked PCK1 and PPARA in the GTEx database and found that, in the liver, each of these transcripts is expressed more highly in females than in males (https://www.gtexportal.org/home/gene/PCK1 & https://www.gtexportal.org/home/gene/PPARA). While this is the opposite to what we observe in our ad libitum mice (Fig. 6A), it demonstrates that sex differences in these genes’ hepatic expression do occur in humans. The effect of CR on their hepatic expression, and whether this differs between males and females, remains to be addressed.

      N): I think it would be very valuable to analyse the sex-differences in lipolysis directly in fat tissues. The authors concentrated on differences in hepatic mRNA profiles, but there's an obvious possibility and gap in their story. ____We agree that this would be informative. In the Discussion we cite previous research identifying sex differences in adipose lipolysis and lipogenesis and explain how this fits with our findings (lines 567-574). Since submitting our manuscript, we have begun experiments to investigate sex differences in the effects of CR on lipid metabolism and molecular pathways in adipose tissue. However, these analyses are extensive and ongoing, so we feel strongly that attempting to include them in our present paper would not only substantially delay publication, but also overload what is already a very extensive paper. Therefore, we plan to report our findings in future publications.

      __O) Given the relatively low n and sometimes small effect sizes I fear that some of their findings won't be reproduced by other labs. Were the (mouse) data collected all at once in one cohort or did the authors pool data from different cohorts/repeats? __We presume the reviewer means ‘relatively high n’, as most of our mouse analyses used large group sizes. The mouse data were pooled from across multiple cohorts, with ANOVA confirming that the same sex-dependent CR effects were observed within each cohort. This reproducibility across multiple cohorts is a clear strength of our study because it demonstrates the robustness of our findings. Importantly, the sex differences in fat loss, weight loss and glucose homeostasis were still observed in our much-smaller cohort of evening-fed mice (Fig. S5-S6) (n = 5-6), demonstrating that large sample sizes are not needed for other researchers to detect these effects.

      Reviewer 1 – Minor comments:

      __a) The discussion is very extensive, and I suggest compressing the information presented there to make it more easily readable. __We have removed some text that was more speculative, such as the paragraph discussing a possible role for ERalpha. We have also revised wording elsewhere to state things more succinctly. However, given the scope of our study we feel we cannot substantially cut down the Discussion without compromising the interpretation of our findings. We note the Reviewer two’s comment that “This is a very well written paper” and feel that attempting to compress the extensive information in the Discussion would compromise, rather than help, the readability.

      __b) There is some confusion present in the literature regarding the nomenclature of CR/fasting interventions. Recently some reviews have summarized the different forms (e.g. Longo Nature Aging, Hofer Embo Mol Med, ...) and the authors should address this briefly. Especially the applied CR intervention in ____mice overlaps with intermittent fasting. __We have updated the Discussion (lines 710-719) to explain how our single-ration CR protocol also incurs a prolonged intermittent fast, and how this fast per se may contribute to metabolic effects.

      c): The order of the subpanels in Figure 9 (and other figures where B is below A and so on) is confusing. Please rearrange or indicate in a visual way which panels belong to each other.

      We disagree that the order of subpanels is confusing: the panels are clearly labelled, and we find it most logical to have the absolute values shown in the top row (panels A, C and E), with the corresponding graphs of fold changes shown beneath each of these (panels B, D and F). This allows the reader to quickly compare the absolute vs fold-change data for each readout. If we had panels A-C on the top row and D-F on the second row, then the connection between graphs 9C and 9D would be less clear and comparable.

      d): Did the authors also measure cardiovascular (e.g. blood pressure) parameters? There is some evidence out there that there is an age/sex dependency during fasting/CR. This would be a nice add-on to the rather small clinical data here.

      We did measure various cardiovascular parameters for our mice but find, unlike for the metabolic outcomes, these generally don’t show sex or age differences. In our human study we measured blood pressure and heart rate before starting CR and at weeks 3 and 4 post-CR. For this response to reviewers we have summarized these human data in Figure R1. The data show that CR decreases blood pressure and heart rate in males and females (Figs. R1A-E). In the younger age group (We have decided to not include these data in the current study because we feel it is already extensive and is focused on metabolic outcomes. We instead plan to report the cardiovascular outcomes (from both humans and mice) in a separate paper.

      __e) What was the decision basis for stratifying the human data into 45 years? __We used 45 years as the cutoff point because this is the age when, in women, oestrogen levels begin to decline (this point was stated in lines 491-492 of the Discussion, and we now reiterate it in lines 414-415 of the Results).

      __f) The part on aging starting in Figure 7 comes quite surprising and it is not clearly linked to the data before. A suggestion here would be to smooth the transition in the text and the authors could again perform a literature search regarding age-of-onset for CR/fasting interventions in mice and humans. __We have added a sentence to smooth the transition to these studies (lines 363-364). We had previously done a literature search to identify the age of onset of CR interventions in mice and humans. We summarise the findings of this search in lines 452-470 and 484-495 of the Discussion. We have also updated the source data so that it includes the our review of the CR literature, allowing other researchers to interrogate this data.

      g) At the first mention of HOMA and Matsuda indices, the effect direction should be put into physiological context.

      We now mention this in lines 231-232 of the Results.

      h) There is no mention of how the PCA analyses were conducted.

      We have updated the Methods to explain that the PCA analyses were done using R. We have updated the source data to include the outputs from these analyses, as well as the underlying code. These data and code are now available here https://doi.org/10.7488/ds/3758.

      i) Were the mice aged in-house in the authors' facility or bought pre-aged from a vendor? Is it known how they were raised? If bought pre-aged, were female and male animals comparable?

      We bred and aged all mice in house. Males and females were littermates from across several cohorts. Therefore, there are no concerns about lack of comparability resulting from environmental differences.

      j) Very minor note: I think that "focussed" has become very rarely used, even in British English. I don't know about the journal's language standards, but I would switch to the much more common "focused".

      We have updated to ‘focused’ as requested.

      k) Figure 6B/F (PCAs) should indicate the % difference of each dimension.

      We have updated the figures to show the % variance accounted for by each principal component. We have also updated the figure legend to specify this.

      l) Limitations section: Maybe tone down on "world-leading mass spec facility". This sounds like an excuse and this statement is unsupported and doesn't add anything valuable to the section. Other limitations would include the low n, as mentioned above and the mono-centric fashion of the mouse and human experiments.

      We have addressed these points as follows:

      • Toned down the description of our mass spec facility (they are renowned for expertise in steroid hormone analysis, so we our original text was intended to highlight that our facility are not novices for this).
      • Regarding the low n for some of the human groups, we now highlight this on lines 744-745 of the Discussion.
      • We have added a new paragraph to the Discussion (lines 710-719) explaining the limitations of our CR protocol, i.e. that includes elements of both CR and intermittent fasting. Reviewer 2:

      __Point 1: This is a very well written paper. __We thank the reviewer for this kind comment.

      __Point 2: Since the authors fed the animals in the morning, this is likely the reason for energy expenditure to be different in the CR vs ad lib groups. Although the authors do study the effects of night v day feeding and saw no change in the outcomes regarding weight, this fact I think should be mentioned somewhere. Also, figure 4A is expressed a W while all the other graphs are in kJ. I think it would be nice to see it all consistent. __Regarding the first point, we agree that time of feeding can influence when energy expenditure is altered, but most studies show that CR decreases overall energy expenditure regardless of time of feeding. For example, Dionne et al studied the effects of CR on energy expenditure, administering the CR diet during the night phase (Dionne et al., 2016). They found that CR mice have lower energy expenditure in the day but not in the night (Figure 3C in their paper), which is the opposite to our findings (Figure 4C). However, total energy expenditure in their study remains decreased with CR. This goes against the reviewer’s suggestion that feeding the animals in the morning “is likely the reason for energy expenditure to be different in the CR vs ad lib groups”. We have updated our manuscript (Lines 576-581) to clarify this.

      Regarding the second point, we have updated Figure 4A to express the data in kJ (showing the average kJ, per hour, at each time point). The figure legend has been updated to reflect this.

      __Point 3: For all the graphs, can you make the CR groups bold and not filled as it is hard to see the lighter colours. __We have updated the graphs so that the CR groups are represented by solid lines, rather than dashed lines.

      __Point 4: I know many investigators use them, but I am not sure how relevant HOMA-IR and the Matsuda index are in mice since they were specifically designed for humans. __The issue of whether it is ‘correct’ to use HOMA-IR and/or Matsuda index in mice is often debated in the metabolism field. Importantly, we are not using the absolute values for HOMA-IR or Matsuda in the same way that they are used in humans; instead, we are comparing the relative values between groups because these are still physiologically meaningful. We discussed this with Dr Sam Virtue, an expert in mouse metabolic phenotyping (Virtue and Vidal-Puig, 2021), who agrees on their usefulness in this way.

      __Point 5: Something also to note is the fact that all the glucose uptake data is under basal conditions. Just because there are no differences in the basal state does not mean that there are no differences after a meal/during an insulin stimulation. I think that this needs to be discussed and the muscle and fat not completely discounted as a player in the differences seen. __We agree that CR can enhance insulin-stimulated glucose uptake but our OGTT data suggest that it is effects on fasting glucose, rather than insulin-stimulated glucose uptake, that contribute to the sex differences we observe. We have now updated the Discussion (lines 608-613) as follows, “CR enhances insulin-stimulated glucose uptake (82) and it is possible that this effect differs between the sexes. However, our second relevant finding is that, during an OGTT, CR decreases the tAUC but not the iAUC, highlighting decreases in fasting glucose, rather than insulin-stimulated glucose disposal, as the main driver of the improvements in glucose tolerance.”

      References cited in Response to Reviewers:

      Dionne, D.A., Skovso, S., Templeman, N.M., Clee, S.M., and Johnson, J.D. (2016). Caloric Restriction Paradoxically Increases Adiposity in Mice With Genetically Reduced Insulin. Endocrinology 157, 2724-2734. 10.1210/en.2016-1102.

      Martin, A., Fox, D., Murphy, C.A., Hofmann, H., and Koehler, K. (2022). Tissue losses and metabolic adaptations both contribute to the reduction in resting metabolic rate following weight loss. Int. J. Obes. 46, 1168-1175. 10.1038/s41366-022-01090-7.

      Oliva, M., Muñoz-Aguirre, M., Kim-Hellmuth, S., Wucher, V., Gewirtz, A.D.H., Cotter, D.J., Parsana, P., Kasela, S., Balliu, B., Viñuela, A., et al. (2020). The impact of sex on gene expression across human tissues. Science 369, eaba3066. 10.1126/science.aba3066.

      Virtue, S., and Vidal-Puig, A. (2021). GTTs and ITTs in mice: simple tests, complex answers. Nat Metab 3, 883-886. 10.1038/s42255-021-00414-7.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We thank the reviewer for stating that “The detailed analysis uses many state of the art techniques to address the role of ROR1 and is of great interest to a large audience including basic researchers in the field of cancer biology and oncologists in the clinic” and we appreciate the reviewer’s constructive suggestions. We have substantially revised our manuscript and plan to perform new experiments based on these valuable comments.

      1. Description of the planned revisions

      Three main points: (1) The importance of AURKB as a downstream effector of ROR1 [Reviewer #1: major #2] Based on these suggestions, we plan to perform a colony formation assay using AURKB-overexpressing cells with ROR1-knockdown. We will clarify this point in the revised manuscript.

      (2) The link between ROR1 expression and YAP/BRD4 [Reviewer #1: major #5 and Reviewer #3: major #1] Based on the suggestion, we plan to perform the luciferase reporter assay. We will clearly describe this experiment in the revised manuscript.

      (3) Single-cell analysis using other models to validate tumor heterogeneity [Reviewer #2: major #1 and Reviewer #3: major #2] Based on your suggestion, we plan to analyze primary human tumors (public data: for example, GSE155698, CRA001160) and examine PDO#1 xenografts (in-house data). We will clearly state this information in the revised manuscript.

      For the two minor points suggested by Reviewer #2, we plan to (1) reanalyze TCGA data. (2) perform the organoid or colony formation assay to validate that the siRNA model functionally recapitulates the ROR1low vs. ROR1high phenotype.

      Please see the “Authors’ responses to the reviewers' comments” for more details.

      1. Description of the revisions that have already been incorporated in the transferred manuscript

      As suggested by the reviewer, we have substantially revised our manuscript, and the changes are shown in red. • Reviewer #1: major comments #2, #3, #4, and #5; minor comments #1 and #2 • Reviewer #2: major comments #2, #3, and #4; minor comments #2, #3, #4, #8, and #10 • Reviewer #3: minor comments #1 and #2

      Please see the “Authors’ responses to the reviewers' comments” for more details.

      1. Description of analyses that authors prefer not to carry out

      Authors’ responses to the reviewers' comments

      Reviewer #1

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this manuscript the authors analyzed the role of ROR1 in pancreatic cancer progression and metastasis. They found that ROR1 expression is specifically increased in an partial EMT cell cluster upon scRNA-Seq of tumor cells derived from an orthotopic mouse PDAC model. Moreover, the ROR1 high population in tumors specifies cells with high proliferation and tumor initiation capacities, increased metastatic propensity and chemoresistance, since knockdown of ROR1 shows reduction of these features in vivo. By comparing transcriptomes from several in vivo models the authors identified that ROR1 acts through AURKB and that its expression is regulated by an upstream enhancer that is bound by YAP/TAZ and BRD4 complexes. With this study the authors identified a new targetable pathway that promotes tumor progression and metastasis in PDAC. The detailed analysis uses many state of the art techniques to address the role of ROR1 and is of great interest to a large audience including basic researchers in the field of cancer biology and oncologists in the clinic. However, some of the findings are a bit preliminary and the drawn conclusions are not sufficiently supported by the experimental data. Moreover, some findings seem a bit out of context and do not really help to bring the story forward. At other instances experimental details are missing to mechanistically demonstrate the role of ROR1. In particular it remains elusive how ROR1 is regulated, i.e. which signaling events are crucial to generate ROR1 high vs. low cells. I listed my specific comments below.

      [Response] We thank the reviewer for stating that “The detailed analysis uses many state of the art techniques to address the role of ROR1 and is of great interest to a large audience including basic researchers in the field of cancer biology and oncologists in the clinic” and we appreciate the reviewer’s constructive suggestions. We have substantially revised our manuscript and plan to perform new experiments based on these valuable comments.

      1. The authors' initial finding is that in the partial EMT cluster ROR1, but also other RTKs (out of 56) are specifically increased. What about the other RTKs? Why was ROR1 chosen to analyze more thoroughly?

      [Response 1] We are thankful for the reviewer’s suggestion to clarify why ROR1 was selected. (1) Seven candidate genes (EPHA4, EPHA7, ERBB4, FGFR1, JAK3, LYN, and ROR1) were chosen as surface markers in the partial EMT cluster. (2) The genes were sorted in order of high expression. (3) ROR1 is reported to promote metastasis in breast cancer (Cui et al, 2013). The induction of metastasis is one of the functions of tumor-initiating cells. FGFR1 is already known to enhance the CSC-like phenotype in non-small cell lung cancer (Ji et al, 2016). (4) The antibody against ROR1 was marketed as available for cell sorting using FACS. Therefore, we focused on ROR1 as a potential new marker for tumor-initiating cells with a partial EMT signature.

      References Cui B, Zhang S, Chen L, Yu J, Widhopf GF 2nd, Fecteau JF, Rassenti LZ, Kipps TJ. Targeting ROR1 inhibits epithelial-mesenchymal transition and metastasis. Cancer Res. 2013 Jun 15;73(12):3649-60. doi: 10.1158/0008-5472.CAN-12-3832. PMID: 23771907; PMCID: PMC3832210. Ji W, Yu Y, Li Z, Wang G, Li F, Xia W, Lu S. FGFR1 promotes the stem cell-like phenotype of FGFR1-amplified non-small cell lung cancer cells through the Hedgehog pathway. Oncotarget. 2016 Mar 22;7(12):15118-34. doi: 10.18632/oncotarget.7701. PMID: 26936993; PMCID: PMC4924774.

      1. The finding of AURKB as crucial target of ROR1 is very weak and needs more in-depth analyses. It is not clear why AURKB was chosen over the other candidates. Is AURKB expression directly regulated by ROR1? Are the two genes directly linked? Can ROR1 deficiency be compensated by AURKB overexpression? Especially the decrease in AURKB protein level in Fig. 4K is not very convincing to account for the different phenotypes in ROR1 high and low cells. Is AURKB and ROR1 expression correlated in TCGA samples (like Fig. 8B)? In Fig. 4L the readout was changed from colony numbers to colony diameter. If AURKB is the crucial player downstream of ROR1, then colony formation efficiency should be affected at first. This needs to be shown. The statement in lines 223,224 that AURKB is a direct downstream target of ROR1 was not shown!

      [Response 2-1: changed] We thank the reviewer for noting this issue. We have performed additional experiments to assess the hypothesis that AURKB is a crucial downstream target of ROR1. ROR1-knockdown not only suppressed AKT phosphorylation (Supplemental Figure 9A) but also decreased c-Myc protein levels and the expression of c-Myc target genes (CDK4, CCND1, CDK2, and CCNE1), leading to a reduction in RB phosphorylation (new Supplemental Figure 9B and 9C). Based on these results, ROR1 regulates c-Myc expression through AKT signaling, leading to the activation of the E2F network (new Supplemental Figure 9D). We added some figures and descriptions to the preliminary revision manuscript (new Supplemental Figure 9B–9D, lines 357–363, lines 649–651).

      [Response 2-2: the planned revisions] We also plan to perform new experiments with a colony formation assay to determine whether ROR1 deficiency is compensated by AURKB overexpression. We agree that this experiment will confirm that AURKB is an important downstream target of ROR1 in PDAC proliferation.

      [Response 2-3] In TCGA-PAAD dataset, AURKB expression was not correlated with ROR1 expression. Since the ROR1high cluster is a minor population in the tumor, a downstream analysis of specific clusters with results from a bulk study such as this TCGA dataset is difficult to perform.

      [Response 2-4: changed] We have added a new graph of organoid formation efficiency (new Figure 4L) and changed some descriptions in the preliminary revision manuscript (line 227).

      1. Fig. 4 A-E: The ROR1 KD was induced in vitro but not continued in vitro. The transient KD has a strong impact on tumor forming capacity, even though recovery of expression is likely within the first days in vivo. This is very interesting and underscores the role of ROR1 in tumor initiation and presumably independent of differences in proliferation. Would the results be different, if the DOX treatment would start with injection of the cells and continued in vivo? Is then tumor initiation not affected and maybe only tumor growth?

      [Response 3: changed] We apologize for the confusing description in the original manuscript. In Fig. 4A–E, we used PDAC cells with stable expression of doxycycline-inducible shROR1. ROR1-knockdown was maintained in vivo by adding doxycycline to the drinking water. Continuous ROR1-knockdown suppressed tumor growth (Fig. 4C–E). Several statements we made were more ambiguous than intended, and we have adjusted the text and the figures for clarity in the preliminary revision manuscript (new Figure 4A and B, lines 203–204).

      1. In Fig. 5 the authors show that ROR1 is highly expressed in tumors after gemcitabine treatment and conclude that the ROR1 high cells are a resistant population. However, this statement is too strong, since gemcitabine treatment could also lead to an upregulation of ROR1 in "low" cells during acquisition of chemoresistence. Together with our knowledge on the role of EMT in driving therapy resistance and therapy-mediated induction of EMT, such a scenario is equally likely. Similarly, the statement in lines 370-372 is not supported by experimental evidence.

      [Response 4: changed] We appreciate the reviewer’s critical comments. As suggested, we have not clearly determined whether (1) the ROR1high cells survived gemcitabine treatment and/or (2) the ROR1low cells increased ROR1 expression upon exposure to this treatment. We have carefully changed some descriptions in the preliminary revision manuscript (lines 241–242, 382–383).

      1. In order to understand how ROR1 is regulated, the authors use ATAC-Seq and cut and run and identified a putative upstream enhancer element (Fig. 7). Although this element increases the activity of the promoter fragment in a reporter construct, the experiments do not help to understand how ROR1 activity is increased specifically in the "high" cells. Are peaks of YAP1 and BRD4 also changed between hi/lo cells? Is YAP OE and KD (BRD4 OE and KD) or the use of the inhibotor JQ1 altering the activity of the reporter constructs (i.e. only of the enhancer-promoter combination but not of the promoter only construct)? This would help to strengthen a direct link between ROR1, YAP and BRD4. Is YAP activity different in ROR1 high vs. low cells?

      [Response 5-1: changed] We thank the reviewer for this important comment. We have shown differences in chromatin accessibility and histone modification of the ROR1 enhancer between ROR1high and ROR1low cells using ATAC-seq and CUT&RUN assays (Fig. 7B). Very few ROR1high/low cells are present in xenograft. We were not successful in experiments examining the binding of YAP and BRD4 to enhancers in ROR1high/low cells because of the technical limitations in the ChIP and CUT&RUN assays. Instead, we used public data to examine YAP and BRD4 occupancy at the ROR1 enhancer region of cell lines with low ROR1 expression. In T-47D and MCF7 cells (breast cancer cells, low ROR1 expression), YAP and BRD4 did not bind to the ROR1 enhancer region (new Figure 8D and 8I). We have added figures and some descriptions to the preliminary revision manuscript (new Figure 8D and 8I, lines 304–309, line 768).

      [Response 5-2: the planned revisions] We plan to perform new experiments with the reporter assay you suggested. We agree that this experiment will help strengthen the direct link between ROR1, YAP and BRD4.

      [Response 5-3] As shown in Figure 8C, GSEA revealed that ROR1high cells in both S2-VP10 xenografts and PDO#1 xenografts expressed higher levels of YAP-regulated genes than ROR1low cells in these xenografts. We have added a description of this result as follows: “Thus, ROR1high cells have higher YAP activity than ROR1low cells.” (lines 304–305).

      1. In Fig. 8A the authors identified 202 antigens that match the H3 monomethylation / acetylation pattern. How was YAP etc. chosen?

      [Response 6] We apologize for the poor description in the original manuscript. We chose YAP and BRD4 based on the following criteria: (1) these antigens are expressed in S2-VP10 cells and PDO#1 and (2) bind to the ROR1 enhancer region (based on an analysis of public data).

      Minor: 1. Fig. 2D,E: What is actually shown here? Is there an overlap between the genes that define ROR1 high vs. low cells in both approaches? The gene list should be provided.

      [Response: changed] We apologize for the poor description in the original manuscript. We have added this information to the preliminary revision manuscript (new Supplemental Table 3).

      1. Fig. 3G: I suggest to include the images of the tumors from the ROR1 low cells in the main figure as well.

      [Response: changed] We appreciate the reviewer’s suggestion. We have moved this information from the supplementary information to the main figure in the preliminary revision manuscript (new Figure 3G, lines 186–189).

      Reviewer #1 (Significance (Required)):

      PDAC is a very aggressive desease with very low 5-year survival rates. Understanding of the pathobiology is of keen interest. The findings of the authors are of high significance and extremely relevant as they provide a mechanism that can also be targeted by specific drug combinations, i.e. standard care gemcitabine with specific ROR1 inhibition. The findings are of great interest to a large audience including basic researchers in the field of cancer biology and oncologists in the clinic.

      [Response] We greatly appreciate the reviewer’s comments.

      Reviewer #2

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this work Yamazaki and colleagues performed single cell RNA sequencing of one xenograft tumor formed by the S2-VP10 PDAC cell line to explore PDAC intratumor heterogeneity. Using this model they identified ROR1 as heterogeneously expressed in neoplastic cells. Using further in vivo and in vitro models they show that ROR1high cells have higher tumor initiation capacity than ROR1low. By histone and ATAC-seq analyses, they identify a ROR1 enhancer upstream the promoter and show that YAP and BRD4 bind to this genomic region and that BRD4 inhibition by JQ1 reduces ROR1 expression and organoid formation. The data, figures and methods are nicely and clearly presented.

      [Response] We thank the reviewer for stating that “The data, figures and methods are nicely and clearly presented”, and we appreciate the reviewer’s constructive suggestions. We have substantially revised our manuscript and plan to perform new experiments based on these valuable comments.

      Major comments

      1. The authors use one xenograft tumor as starting model and all conclusions are derived from the data generated with this model. To support the existence of identifie heterogeneity in the PDAC neoplastic compartment, I would strongly suggest to validate the existence of the partial EMT population and the ROR1 heterogeneity in single cell data bases generated from primary human tumors.

      [Response 1: the planned revisions] We thank the reviewer for the positive suggestion. We plan to perform a new analysis of available public single-cell data from human PDAC tumors. In addition, we also launched a single-cell analysis of PDO#1 xenografts.

      1. In Fig. 3G, it is mentioned that tumors grown from ROR1high cells recapitulate the original PDOx histology thus suggesting that ROR1high cells in the tumor are the actual TICs. ROR1low cells could also grow tumors, just with lower incidence. Are these tumors any different to the ROR1high derived ones? Is it just a lower tumor initiation capacity (TIC) or they can not recapitulate the tumor as the ROR1high cell? Can they also give rise to differentiated progeny cells? This should appear in the main text and not only in the discussion. I would suggest to move panel 3G to supplementary figure.

      [Response 2: changed] We thank the reviewer for noting this issue and apologize for the confusing description in the original manuscript. ROR1low cells generated tumors at a low frequency, and these tumors showed a hierarchical histology mimicking the original tumor. As suggested, we have added this information to the main text (new Figure 3G, lines 186–189).

      1. In line 160 you mention that known CSC markers such as CD44, PROM1 and DCLK1 are not differentially expressed between ROR1 high and low populations. Then, in figure 3H,I you analyze the expression of CD44v6 together with ROR1. I would try to put this information together in the text, or at least in fig. 3 start with something like "we had seen that both ROR1high and low express CD44, however...". In any case, I feel that the experiment with CD44 could be obviated (or at least moved to supplementary), as it brings the question of weather this is also true for DCLK1 or CD133.

      [Response 3: changed] We appreciate and agree with the reviewer's comment on this point. Accordingly, we have moved this figure to the supplementary information and changed the description (new Supplemental Figure 5C and 5D, lines 191–196).

      1. JQ1 has been described to inhibit PDAC growth by downregulation of MYC. To unequivocally link the effect of JQ1 in the downregulation of ROR1 (Fig. 8M) as discussed in the text it would be important to exclude that other mechanisms such as MYC downregulation are taking place. For example, does JQ1 treatment of ROR1low cells also reduce their colony formation capacity (in an experiment such as the one in fig. 3C). Or does ROR1 re-expression in Fig. 8M rescue the JQ1 effect? These or other experiments could help to establish a stronger link between (BRD4/JQ1) and ROR1.

      [Response 4: changed] We thank the reviewer for this important comment. As mentioned in the response to Reviewer #1-major comment #2, we newly found that ROR1 regulates c-Myc expression through AKT signaling, leading to the activation of the E2F network (new Supplemental Figures 9B–9D, lines 357–363).

      Minor comments 1. The data are nicely presented (text and figures) and the conclusions are clear. My suggestion to make the story more "catchy" at the beginning would be, if possible, to start from the observation done in primary human data and then move to the PDX model to explore ROR1 as a TIC marker in PDAC. For this, you could use available public single cell data of human PDAC tumors. If this doesn't work (it is of course possible that by unsupervised analysis you don't get the same clusters as in the PDX with the partial EMT cluster popping up), it would be nice if some primary tumor data came early in the story (currently the first figure showing heterogeneity in primary samples is in supplem fig. 4A).

      [Response: the planned revisions] We thank the reviewer for these excellent comments. As suggested, we plan to perform several new analyses (please see the previous comment for details: Reviewer #2-major comment #1).

      1. It is not clear if the xenografts were subcutaneous or orthotopic. It would be good to include this information in the main text (line 102) and the methods so that the reader knows what is the exact model that has been used.

      [Response: changed] We thank the reviewer for this comment and apologize for the poor description in the original manuscript. As suggested, we have added this information to the preliminary revision manuscript (line 101).

      1. In Fig. 2F and 2G I would highlight the EMT pathway to help the reader.

      [Response: changed] We thank the reviewer for this comment. As suggested, we have changed the relevant figures in the preliminary revision manuscript (new Figure 2F and 2G).

      1. In Supp Fig 4B it would be nice to have an amplified view of the staining as in panel C of the same figure.

      [Response: changed] We thank the reviewer for this comment. As suggested, we have added high-magnification images of the staining in the preliminary revision manuscript (new Supplemental Figure 4A and 4B).

      1. In the same figure (Fig. 4A-D) ROR1 shows an apical staining pattern that doesn't seem to resemble the staining in patient samples. I am not an expert in pathology evaluation but I would recommend a pathologist to give her/his opinion. Possibly, during the PDX process, few cells from the original patient tumor are selected giving a different staining pattern.

      [Response] We appreciate the reviewer's comment on this point. Dr. Ito, a coauthor of this paper, is a pathologist. We have changed some images of staining in patient samples (new Supplemental Figure 4A). We agree that ROR1 shows an apical staining pattern in PDX samples. However, some sites show similar apical staining patterns in patient samples (Patient #2 and Patient #4 in the new Supplemental Figure 4A). We propose that PDX mimics the original patient tissue because it has heterogeneity of ROR1 expression and morphological features indicative of a luminal structure.

      1. In the analyses of TCGA data, be aware that only 150 from the original dataset are actual PDAC tumors. The dataset contains otherwise data from cell lines, PDX, normal tissue, etc that should be removed for a proper analysis (see DOI: 10.3390/cancers11010126)

      [Response: the planned revisions] We thank the reviewer for the careful review of this issue. We are currently reconsidering with the pathologist whether the samples are appropriate based on TCGA data (diagnosis and pathology sections) and the paper you presented. The current data (Figures 3A, 4J, and 8B) were analyzed for samples excluding cell lines, PDX, and normal tissue in the TCGA-PAAD dataset.

      1. Does ROR1 correlate with RFS? This would nicely fit with the concept of TIC and metastasis.

      [Response] We thank the reviewer for noting this issue. Unfortunately, no correlation was observed between ROR1 expression and RFS.

      1. Line 219: ROR1 is not "depleted" in the lines as it is a downregulation model. "ROR1-downregulated" would be more correct.

      [Response: changed] We thank the reviewer for this suggestion and agree with your comment. We have corrected this term accordingly in the preliminary revision manuscript (line 223).

      1. It would be good to have a supplem figure showing that siROR1 cells show reduction organoid formation, to validate that the siRNA model functionally recapitulates the ROR1low vs high phenotype.

      [Response: the planned revisions] We thank the reviewer for this suggestion. We plan to perform a colony formation assay.

      1. Some of the supplemental figures are only referred in the discussion although they appear earlier than other in the main text. This is a bit confusing when going through the figures.

      [Response] We apologize for the poor description in the original manuscript. We have adjusted the order of the supplemental figures in the preliminary revision manuscript.

      CROSS-CONSULTATION COMMENTS I agree with the importance of addressing points 2 (link to AURKB), 4 (selection vs acquisition), 5 (mechanism in high vs low cells) raised by Reviewer 1, and the comments from Reviewer 3. I think that the study of other RTKs (point 1 from Reviewer 1) is not the focus of the story. It would be nice if the authors can comment on why they chose ROR1 but the fact that are other differentially expressed genes does not exclude the validity of the current story. I fell that the in vivo sustained KD experiment (point 3 from Reviewer 1) although interesting, it is not mandatory for a revision of this manuscript in case the adaptation of the animal protocol represents a long process. The experiment provided already in the current version is the best approach to address the role of ROR1 at the early initiation phase.

      [Response] We thank the reviewer for these positive comments. As suggested, we have substantially revised our manuscript.

      Reviewer #2 (Significance (Required)):

      Significance: This is a neat and interesting work with potential implications for the clinical field of pancreatic cancer as the authors identified a new subpopulation with enhanced tumor initiating cell capacity. However, the use of JQ1 for pancreatic cancer has been previously discussed mainly linked to MYC inhibition, but also to stromal reprogramming or DNA damage induction. I missed some discussion in this regard in the discussion section. What is adding the work to the field of JQ1 treatment in PDAC? IN a way, how do the authors foresee that the discovery of ROR1high cells and the regulation of ROR1 by BRD4 and YAP will be beneficial when considering JQ1 in the clinics? Maybe by stratifying patients? Or by following ROR1 upregulation upon initial chemotherapy? These questions are just suggestions. In general, some discussion to put the work into the context of previous works using JQ1 in PDAC would be nice.

      [Response: changed] We thank the reviewer for this comment. As you suggested, we have added a description of the proposed use of JQ1 and BRD4 inhibitors in ROR1high PDAC treatment to the Discussion section (lines 412–416).

      I believe that this work would be interesting not only to the pancreatic cancer community but also to a more general public working on cancer and/or stemmness as it touches several interesting points in that regard that can be applicable to other systems. My own work is focused on pancreatic cancer, patient heterogeneity and stromal interactions. I am not an expert on histone or ATACseq analyses.

      [Response] We greatly appreciate the reviewer’s comments.

      Reviewer #3

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary Yamazaki et al investigate partial EMT in pancreatic cancer and provide data that ROR1 marks pancreatic tumor cells that are capable of initiating tumors. The authors exploit scRNAseq of pancreatic tumor xenografts to identify a cluster of cells showing a partial EMT phenotype. The found 7 RTKs expressed more highly in this partial EMT cluster and focus their attention on ROR1, an 'orphan' receptor that has been implicated in WNT signaling and EMT previously. Validation experiments using ROR1-high vs low cells support that ROR1 expression correlates with EMT, poor outcome in human PDA patients, tumor forming and colony forming capacity. They also show that ROR1 high cells form tumors that recapitulate parental tumor histology. The authors show that ROR1 expression is associated with EF2 transcription factor activity, elevated expression of multiple targets including AURKB. Pharmacologic inhibition of AURKB reduces colony formation and genetic loss of ROR1 combined with chemotherapy (gemcitabine) has potent anti-tumor activity in vivo. The authors show that ROR1 expression is elevated in metastatic lesions and identify a novel enhancer element that putatively drives ROR1 expression in tumor cells. They provide evidence that this element is engaged by YAP/BRD4 and show that BRD4 inhibition reduces tumor cell colony formation. The manuscript is a solid combination of techniques with adequate controls and statistics.

      [Response] We thank the reviewer for stating that “The manuscript is a solid combination of techniques with adequate controls and statistics”, and we appreciate the reviewer’s constructive suggestions. We have substantially revised our manuscript and plan to perform new experiments based on these valuable comments.

      Major Comments: The overall conclusion that ROR1 expression marks a subset of pancreatic cancer cells that have the ability to initiate tumors is supported by the data provided. The correlative data are strong and the demonstration that loss of ROR1 reduces colony formation, reduces metastatic lesions and enhances the efficacy of chemotherapy are compelling. Additionally, the demonstration that ROR1 expression is elevated in metastatic lesions is consistent with many other drivers/markers of EMT in pancreatic cancer.

      The conclusion that ROR1 expression is driven by YAP/BRD4 is interesting and provides important mechanistic depth to the study. However, this conclusion could be strengthened by use of a suitable rescue experiment. For instance does overexpression of ROR1 rescue the effect of BRD4 inhibition or loss of YAP?

      [Response 1: the planned revisions] We thank the reviewer for this comment. We completely agree with the reviewer’s suggestion. However, the suggested examination to determine whether overexpression of ROR1 rescues the effect of BRD4 inhibition or loss of YAP may not be suitable because BRD4 and YAP act as transcriptional coregulators of various target genes. Instead, as mentioned in response to Reviewer #1-major comments 5-2, we plan to perform new experiments using a reporter assay.

      A challenge with the data presented in Figure 1, the scRNA-seq data that lead them to ROR1, is that it is not stated how many tumors are used to generate the scRNA-seq data and the overall number of tumor cells analyzed is relatively low (993). The authors should provide the number of tumors used for the initial scRNA-seq. A general concern with any scRNA-seq data is batch effect, this is mitigated to a degree by the follow on studies that provide functional validation of ROR1 in multiple cell lines.

      [Response 2: changed and the planned revisions] We appreciate the reviewer’s comments. As suggested, we have added this information to the preliminary revision manuscript (line 104). In addition, as mentioned in response to Reviewer #2 major comment #1, we plan to perform a new single-cell analysis of PDO xenografts (in-house data) and human PDAC tumors (available public data).

      The data and methods are provided in an adequate manner. Reproduction of the experiments is likely. The authors use multiple cell lines and tools that are generally available. The authors note a limitation of the study is that only human tumor xenografts were exploited.

      [Response] We thank the reviewer for the positive comment.

      Minor comments: Figure 1E and text page 9. The text identifies MERB3 as a gene that marks the partial EMT cluster, I believe this is a type and the gene should actually be MSRB3.

      [Response: changed] We apologize for the typo. We have corrected this error accordingly (line 114).

      Please provide the dose of gemcitabine in the legend for figure 5

      [Response: changed] We apologize for the poor description in the original manuscript. We have added this information.

      CROSS-CONSULTATION COMMENTS I think the comments from Referee #2 are pretty reasonable - have no additions

      Reviewer #3 (Significance (Required)):

      Intratumor heterogeneity is a major challenge for the treatment of many cancers, including pancreatic cancer. The data provided support that ROR1 marks a subset of cancer cells in pancreatic tumors that have the capacity to drive intratumor heterogeneity. If supported these data have the potential to drive significant impact. Identification of a marker and a targetable pathway that supports tumor initiation in pancreatic cancer has the potential to nominate companion therapies that enhance the efficacy of standard of care approaches. Further, identification of a pathway that drives partial EMT in pancreatic cancer provides a substantial increase in baseline knowledge of intratumor heterogeneity.

      These data would be broadly interesting to scientists interested in the tumor microenvironment, metastasis, therapy resistance and tumor progression. In addition, oncologists focused on drug development and combinatorial therapy will find this manuscript of interest.

      [Response] We greatly appreciate the reviewer’s comments.

    1. We confuse popularity with quality and end up copying the behavior we observe.

      This sentence is extremely important. Not only do we tend to associate popularity with quality, but also with credibility. I see time and time again Tik Toks and videos on Twitter, Instagram, and Facebook will go viral of someone going on a rant about some issues and making very bold claims without siting anything. In return, as the video grows in popularity, people will quote and cite the video as a credible source. However, a person's opinion that has becoming well-liked does not equate truth or fact. I will also see quality information go unnoticed as it lacks audience engagement. I think of times where I come across a Tik Tok that I like but it has almost no likes or comments. I first I find this odd. However, then I realize that the algorithm is testing this video to see if people will like it and if it gets likes, then it will show it on more feeds and then more feeds. However I will easily like a video that already has hundreds of thousands of likes because I think "why not?", when I could be adding to the spread of misinformation. Bias is inevitable and something we may never be able to fully avoid. However, I think the awareness of your own bias and knowing that you are bias is already helpful enough. Unfortunately too many claim they have no bias and only base on fact, which we all know is not true.

    1. it appearsto me that he was much too rash in dismissing the genre as too rigidto adapt itself to the changing conditions of reality and unsuitable as agenre to be able to reflect critically upon the social and material tensionsthat constitute our beleaguered modern and postmodern sensibilities.

      I agree with this claim and think that while society and science may suppress the ability for us to imagine fairy tales, I think they have a much larger impact that just speaking to our imagination. Instead, they speak to what we believe as people and to what we want in life. While fairy tales may often occur in mythical lands with mythical creatures, I think the reason they appeal so much to people is because they portray a world that is less stressful and more like the world they knew in their childhood, not because people necessarily want to ride on unicorns.

    2. As a metaphorical mode of representation, whether it may be oral, iconic, or written, the fairy tale effective ydraws our attention to relevant information that will enable us to knowmore about our real life situations, and through its symbolical code anflexible structure, it allows for personal and public, individual and co-lective interpretations

      I think this an interesting light to see fairytales in. It almost reminds me of the notion of "taking what you need" and our brains allowing us to focus on what resonates with us and taking out the morals that we can particularly relate to.

    1. McConnell said it’s up to the Republican candidates in various Senate battleground races to explain how they view the hot-button issue.   (function () { try { var event = new CustomEvent( "nsDfpSlotRendered", { detail: { id: 'acm-ad-tag-mr2_ab-mr2_ab' } } ); window.dispatchEvent(event); } catch (err) {} })(); “I think every Republican senator running this year in these contested races has an answer as to how they feel about the issue and it may be different in different states. So I leave it up to our candidates who are quite capable of handling this issue to determine for them what their response is,” he said.

      Context: Lindsey Graham had just proposed a bill for a nationwide abortion ban after 15 weeks of pregnancy.

      McConnell's position seems to be one that choice about abolition is an option, but one which is reserved for white men of power over others. This is painful because that choice is being left to people without any of the information and nuance about specific circumstances versus the pregnant women themselves potentially in consultation with their doctors who have broad specific training and experience in the topics and issues at hand. Why are these leaders attempting to make decisions based on possibilities rather than realities, particularly when they've not properly studied or are generally aware of any of the realities?

      If this is McConnell's true position, then why not punt the decision and choices down to the people directly impacted? And isn't this a long running tenet of the Republican Party to allow greater individual freedoms? Isn't their broad philosophy: individual > state government > national government? (At least with respect to internal, domestic matters; in international matters the opposite relationships seem to dominate.)

      tl;dr:<br /> Mitch McConnell believes in choice, just not in your choice.

      Here's the actual audio from a similar NPR story:<br /> https://ondemand.npr.org/anon.npr-mp3/npr/me/2022/09/20220914_me_gop_sen_lindsey_graham_introduces_15-week_abortion_ban_in_the_senate.mp3#t=206


      McConnell is also practicing the Republican party game of "do as I say and not as I do" on Graham directly. He's practicing this sort of hypocrisy because as leadership, he's desperately worried that this move will decimate the Republican Party in the midterm elections.

      There's also another reading of McConnell's statement. Viewed as a statement from leadership, there's a form of omerta or silent threat being communicated here to the general Republican Party membership: you better fall in line on the party line here because otherwise we run the risk of losing power. He's saying he's leaving it up to them individually, but in reality, as the owner of the purse strings, he's not.


      Thesis:<br /> The broadest distinction between American political parties right now seems to be that the Republican Party wants to practice fascistic forms of "power over" while the Democratic Party wants to practice more democratic forms of "power with".

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      This is a revision plan, the manuscript has not been modified yet as it is being transferred to a journal.

      *------------------------------------------------------------------------------ Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This study proposes (and uses) an elegant model of bacteria evolution to study how division of labor can emerge through the interaction between non-random mutations (occurring at some specific ``fragile' genomic sites) and genome architecture. The study is very interesting and the results are convincing. My main concerns are about the presentation of the model and results. Although I am confident about the results, some elements should be clarified for a better understanding and for a correct interpretation of the results. Two points in particular (detailed below as major comments) require clarification.

      Major comments:

      • the notion of telomere/centromere is used all throughout the paper but I think it is used in a misleading way. First, it seems that here there is only one telomere (but this is actually a detail of the model). More importantly, as long as I know, it is well known that in S. coelicolor the sequence degenerates more rapidly when getting closer to the telomeres (but telomeres are defined independently from this property). But here, the notion of telomere is precisely directly determined by its mutational instability (respectively, the centromere is defined by its stability). Although this is reasonable given the objective of the model, it forbid the use of sentences like "we observed that the genome of the evolved colony founded had two distinct regions: a telomeric [...] and a centromeric [...]" (line 234) or "When bacteria divide, mutations induced at fragile sites lead to the deletion of the part of the genome distal to them, causing large telometic deletions" (line 239 - this is not a result but a hidden description of the model) as this distinction between the two regions is not an outcome of the simulation but rather given a priori as a coded property of the fragile sites that all lead to deletions on the same -- called telomeric -- side (of course, formally if the genome contains no fragile site, there is no distinction but still). Please clarify this in the main text and in the methods. *

      Authors response (AR, in the following): we agree with the reviewer that the directionality of the deletions determines centromere and telomere in our model (and the reviewer is correct that we only consider one arm of the chromosome). We will explicitly state both in the main text and in the methods that the model does not include any explicit centromeric and telomeric structure, and that the polarity of the genetic information (and thus centromere and telomere) depends on the choice of directionality of the deletions.

      - In most part of the paper (methods, results, figures, sup mat...) antibiotics are considered to have a concentration (or a high/low production) but at least twice in the text (lines 165 and 488) it is said that only the presence/absence of antibiotics is modelled. I was not able to understand how the continuous values are transformed into presence/absence (is there a threshold?) but more importantly, I strongly suspect that this choice has a strong influence on the outcome. For instance, with a diffusion radius equals to 10, it means that an antibiotics producing cell is able to protect 2*\pi*10=~60 replicating cells. Hence, one could conjecture that the fraction of antibiotic-producing mutants should a little more than 2%... which is what is observed by the authors. So (1) please clarify this point (2) discuss (or experiments) the consequences of this choice on the conclusion.

      AR: the reviewer is correct that antibiotics are modelled as presence/absence – this was done for computational efficiency. However, the probability that a bacterium deposits an antibiotic at a site within the deposition radius is a continuous number, as it depends on the number of antibiotic genes and growth genes. We will make this clear in the main text and in the methods.

      Secondly, we show the effect of varying the deposition radius for the evolutionary dynamics in Supplementary Section S17. We will make this clear in the main text. For the area covered by different radius of antibiotic deposition, please see below.

      * Minor comments: - line 262: "We conclude that genome architecture is a key prerequisit for the maintenance of mutation-driven division of labor". Given the model hypotheses you cannot be so affirmative (it is a key prerequisit... in this model!) *

      AR: we will modify the statement as suggested. * *

      - line 286: "cannot" is probably too strong. It has not been observed...

      AR: we will modify the statement as suggested.

      - line 288 and following: you seem to consider that there is "selection for diversity". Given the large number of possible antibiotics and given that cells are "automatically" resistant to the antibiotics they produce, could it be simply drift? There is a clear selection pressure to limit the number of growth-promoting genes but no such pressure exist for antibiotics. Hence their number could simply drift (note that figs 2 and SF1 both use a log scale; random variations due to drift could be hidden by the log. Fig. SF2 does use a log scale and shows a dynamics that---to my eyes---claims for drift rather than for selection of diversity).

      AR: we agree with the reviewer that drift might contribute to the overall antibiotic diversity. This might be especially true for the antibiotic genes residing downstream of the fragile sites, which have low probability of expression in the wild-type (because of the many growth genes) and are deleted in the mutants. Duplications, deletions and modifications of these genes are effectively neutral, and are therefore likley subject to drift. We will include this discussion in the main text. However, bacteria are highly susceptible to the diverse antibiotics produced by other colonies (i.e. those produced – largely – by the mutants). These antibiotics and their diversity drives colony invasion and is thus selective. The overall number and diversity of antibiotics is therefore, at least in part, under selection.

      - line 340: "ends" should be "end" when discussing the model - line 345: "a telomeric region" should be "telomeric regions" when discussing the bacteria - line 359: "S. ambofaciens" should be italic - line 365: same for "Streptomyces"

      AR: we will modify the statement as suggested (and thank the reviewer for carefully reading the text).

      - line 245 states that colonies begin clonally but methods (lines 434-438) don't support this. Colonies don't begin clonally but they begin without antibiotic-producing spores (see also line 618)

      AR: we agree with the reviewer that colonies are not specifically initialised as clonal. We will modify the sentence as: By this process colonies eventually evolve to become functionally differentiated throughout the growth cycle.

      - line 442: "their" should be "its" - line 446: "hotspot for recombination" no, for "deletion" - line 449: please remove brackets around the reference.

      AR: we will modify the statement as suggested.

      - line 458: if I understood it correctly, there is no explicit competition in the model. Competition simply comes from the asynchronous replication. Am I true? Could you clarify that point?

      AR: The reviewer is correct that through asynchronous updating only one focal lattice site is update at a time. However, if a site is empty, the bacteria surrounding it are competing based on their replication rate kreplication. Dividing by the neighbourhood size (eta) simply ensures that a bacterium surrounded by a completely empty neighborhood replicates on average alpha_g times (alpha_g being the max growth rate). We will mention this in the methods.

      - line 490: "the antibiotic deposited is chosen randomly and uniformly among them". This is not fully clear. I suppose the bacteria is still resistant to all the antibiotics it \it{can} produce?

      AR: Yes. This is mentioned in the methods section “Replication”.

      - figure SF1: please use the same scales as in figure 2 such that the two plots can be easily compared

      AR: we will modify the x-axis to include the number of growth cycles.

      - section S3 and figure SF4: What is to be understood from the figure is not clear to me. Seems that WTs win only if generalists produce less AB or replicate slower (?) Is it true?

      AR: The reviewer is correct. In other words: when the artificial generalist has the same replication rate and the same antibiotic production rate as the WT, then the competition experiment ends with a near draw (the generalist still wins, but slowly). This means that the fitness cost associated to division of labor, i.e. to having two cell types doing the same work as one generalist – is small.

      We will include this description in the section.

      The figure is unfortunately complicated by the fact that we do not know a-priori how high the effective antibiotic production rate is (because antibiotics are spatially distributed by the stochastically generated mutants) – and so we had to make a large parameter screen to figure out the parameter values for which the competition experiment made most sense.

      - I found it very difficult to draw conclusion from section S4, S5 and S6. These experiments should be analyzed with the help of mathematical analyses of the equations. Moreover, the understanding of these results are rendered difficult due to the lack of clarity regarding the discrete (or not) nature of the antibiotic production/action/diffusion

      AR: We hope that we have clarified the distinction between antibiotic production rate and antibiotic presence/absence in the lattice.

      The model is not amenable to analytical tractability, which makes it difficult to make exact statements based on the equations that govern it. However, we can check that the model is robust, and identify regions of parameter space where the model behaves in a qualitatively similar way to main text results.

      Sections S4, S5 and S6 are essentially parameter screens to verify that the model reproduces the results reported in the main text for a broad range of parameters. The primary conclusion that can be drawn is that the model is robust to parameter changes.

      Section S4 explores the model robustness to changes in two key parameters of the model: the antibiotic inhibition due to growth genes beta_g and the parameter h_g, which is the number of growth genes that produces half-maximum growth rate. Section S5 further analyses the relation between these parameters, and how they together determine the strength of the trade-off. Section S6, finally, shows that a strong trade-off is not a necessary requirement for evolution of division of labor as the division also depends (in a counterintuitive way) on the parameter alpha_g, the maximum antibiotic production rate.

      We will include and expand these summarizing statements in each section, to make clear what each section achieves.

      - S7 and fig SF9. It is unclear to me why the fraction of mutants decrease along time elapsed in the cycle. Please explain.

      AR: The reason is that not all mutants are born with the same number of antibiotic genes (Fig. 3A). A mutant with fewer antibiotic genes might be susceptible to some of the antibiotics produced by another mutant, and could be killed by these antibiotics. Once a mutant is killed in the inner colony, a wt will replicate to fill the spot, and likely a wt offspring will take that site rather than another mutant. Thus there is a decline in overall mutant population.

      We will include this discussion in Section S7.

      - Figure SF14: what are the tin lines? if they correspond to the five repeats, how can it be that the bold line be the median?

      AR: we realise that the caption should be clearer. Each of the five lines (both bold and thin) in each pane represents the median number of genetic elements over time. The bold line just highlights one randomly chosen simulation (the same for each genetic element), to better guide the eye.

      We will clarify the caption of the figure.

      - S13 and figure SF15: given that AB concentration is ON/OFF, is this result really surprising? This also questions about the accumulation of AB genes in the original model. Although the authors regularly claim that this is due to selection for diversity, drift could also be at play (see above)

      AR: As mentioned above, we agree with the reviewer and we will mention that drift may co-determine antibiotic gene accumulation.

      - S17: for radius 1, 2 and 3, the aliasing is likely to be strong. Hence, the results cannot be interpreted with this sole information. Please give e.g. how many cells are "protected" for each radius (e.g. for r_{alpha}=1, this value can vary between 1 and 9!)

      AR: for radius=1, 2, 3, 5 ,8, 10 the area covered by antibiotic production is respectively 5 ,13, 29, 81, 197, 317. We will include this information in the figure.

      - L742: "matching the antibiotic bitstring with the bitstring of the antibiotic". True and actually elegant but simpler formulation could ease the reading...

      AR: We will change the sentence as follows: “Both antibiotics and antibiotic genes are characterised by a bitstring, which determines their type. Antibiotic resistance in the model is determined by matching these two strings.”

      - lines 746-751 and figure SF21: There again, could it be a consequence of the AB ON/OFF diffusion model?

      AR: we agree with the reviewer that a continuous diffusion model could affect resistance to antibiotics. We expect that the main effect will come from some antibiotics antibiotics having different concentrations. For instance, we could have a situation in which many deleterious antibiotics are produced in small amount, but have a compounding effect on the susceptible bacterium. This finer model of antibiotic production, diffusion and killing was not included in the model to limit the computational load.

      - S18-S19-S20: what should the reader understand from these results? Please better comment the figures.

      AR: we agree that figures in Section S18,19 and 20 could have more descriptive captions. Sections S18, 19 and 20 are parameter screen to check that the model is robust to changes in the mutation rates affecting fragile sites activation and de-novo formation. The primary result of Section S18 is that that division of labor evolves over a broad range of fragile site activation rates and de-novo fragile site formation rates (and even when these parameters are decreased by one order of magnitude).

      Section S19 shows how these combination of parameters result in quantitative changes in genome composition.

      Section S20 shows that the de-novo fragile site formation rate can be zero: as long as the system is initialised genomes that can divide labor, the fragile sites will persist even though no new ones are generated.* *

      • CROSS-CONSULTATION COMMENTS Sorry about the confusion about the computation of the number of cells protected by a single AB-producing cell. Of course it is of the order 10*\pi^2 !!! The global argument still holds but the number of cells protected is of course larger than 60 (note that, due to aliasing at the periphery the exact number of cells in the protected area is difficult to determine). *

      Author response: We hope the clarifications mentioned above answer the reviewer’s comment.

      * Reviewer #1 (Significance (Required)):

      First, an very importantly, I must say that I am no familiar with the biological model (Streptomyces coelicolor). So I am not fully able to judge the biological significance of this research (i.e. whether the way division of labor is achieved here enlights---or not---the biology of this bacteria). However, on the computational side, the model and the results (as they are summarized in the conclusion) are very interesting on their own and deserve publication.

      Remark: a lots of supplementary results are added to the paper that are not not fully explained or analysed. Please, better discuss all these results and their significance. *

      AR: we will extensively check and add detail to the supplementary material, ensuring that results are fully explained (see also response to reviewer 1).*

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The manuscript "Evolution of genome fragility enables microbial division of labor" presents a model of genetically-based division of labour in bacterial colonies. It is postulated that two essential processes, growth and the important for elimination of competitors production of antibiotics, are poorly compatible in a single cell. The beneficial for a colony cell specialization is assumed to be determined only by genetic differences that appear via deletions of growth- promoting loci. These deletions and production of various antibiotics are mediated by a rather elaborate genetic architecture, which includes position-sensitive "fragile" sites, mutable antibiotic and growth-promoting genes. The model produces rather predictable results that under sufficiently strong incompatibility between growth and antibiotic production, the long-term evolution results in formation of mosaic of colonies, each specialized in production of its specific set of antibiotics. Such production is facilitated by evolving rapidly mutable genomes that constantly generate non-reproducing antibiotic-pumping cells.

      The model appears very thoroughly developed and analyzed, and all major conclusion are intuitively appealing. Overall, the manuscript reads as a well-written quantitative proof of the principle of genetically-based division of labour between bacterial cells. The only part of the model that I'm a bit sceptical about is the unwarranted complexity of the genetic architecture. Unless the introduction of "fragile" sites and the directional ordering of genes is strongly justified by empirical data, a simpler and more clear assumption about mutational incapacitation of growth genes would suffice to reproduce the predicted phenomenology. So adding such empirical evidence would boost the relevance of the genetical part of the model. In the present form, all observed adaptations are inevitable simply because the expected division of labour will not evolve without each of them due to the design of the model. *

      AR: We agree with the reviewer that a simpler model with a predetermined effect of mutations, such as to incapacitate the growth genes, would suffice to reproduce the phenomenology of the mutation-driven division of labor observed in Streptomyces. Adding the complexity of a genome architecture introduces one more hypothesis: that genome fragility can evolve to organize the division of labor. This hypothesis, supported by the results presented here, can be tested experimentally.

      However, there is already some empirical support for our modelling choices: 1) mutation rates along the genome of Streptomyces are highly heterogeneous, 2) the genetic content is partitioned along the chromosome so that some genes are preferentially located in the mutationally quiet centromere, and others are in the mutationally active (sub)telomeric regions, 3) some cis genetic elements in Steptomyces’ genomes readily recombine to produce large-scale duplications and deletions (which we heavily simplified in the model as deletion-inducing fragile sites).

      We will extend the introduction to include the references for the empirical support to our model.

      * A couple of minor comments...

      217 This is achieved when fewer growth-promoting genes are required to inhibit antibiotic 218 production (i.e. lower βg). Shouldn't it be "larger \beta_g"? *

      AR: yes. Thanks for catching this!

      * Whether in the main text or Supplementary materials, it woud help to add a complete population dynamics equation with all gain and loss terms. *

      AR: we agree with the reviewer that it would be interesting to obtain a comprehensive population dynamics equation that captures the spatial dynamics of replication, mutation, and antibiotic production, causing colony formation and between-colony competition. However, deriving such equation would be a very big effort in itself, and we suspect that it would not be analytically tractable. Because of this, we prefer the “procedural” model description we gave – which also mirrors the model implementation (see github repository at github.com/escolizzi/strepto2).

      * Strikingly, we find the opposite: division of labor evolves when 224 bacteria produce fewer overall antibiotics (lower αa), under shallow trade-off conditions 225 (hgβg = 5; see Suppl. Section S6).

      I don't see why it is"striking". It seems perfectly explicable that a smaller \alpha requires more dedication to antibiotic production, thus favouring specialization. *

      * *AR: we agree that we have not conveyed why we found this result surprising. We have set the trade-off shallow enough (h_g beta_g =5) that the generalist wins when alpha_g =1. In addition, lowering alpha_a makes the benefit of creating a mutant smaller, because a highly specialised mutant with zero growth genes makes fewer antibiotics. A generalist is proportionally less affected. Intuitively, we have compunded two benefits for the generalist.

      But division of labor evolves, outcompeting the generalist – which surprised us.

      We will modify the paragraph to better explain what we expected, and we will tone down the wording, removing the word “strikingly”.

      *Reviewer #2 (Significance (Required)):

      Due to my relative lack of familiarity with the literature on evolution of genetically-based division of labour, I would rather not comment on the degree of innovation of the manuscript.

      The text is well written and is accessible to a wide readership, so it could be recommended to a general biological or evolutionary journal.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: In this manuscript the authors explore the co-evolution of genomic architecture and division of labour in antibiotic production, in a model inspired by the bacterium Streptomyces coelicolor. In the model a genetic trade-off is implemented where the having a large number of growths promoting genes (and thus fast growth) leads to a low production of antibiotics. On the other hand, having fewer growth promoting genes allows for a higher production of antibiotics. This trade-off selects for a division of labour, where one sub population specializes in antibiotic production and another sub population specializes in reproduction. This division of labour is achieved by evolving the genome structure, so that growth promoting genes are clustered together, separated from the rest of the genome by several fragile sites (sites that allow for large deletions). This allows a single mutational event to delete a large number of growth-promoting genes, which creates a cell, lacking growth genes and that thus has a high antibiotic production (cell specializing in antibiotic production). In other words, the genome structure evolves to shape evolvability, so as to allow cells with a high growth rate to rapidly and repeatably evolve/mutate into cells with a high antibiotic production. This creates a division of labour where a part of the population specializes in growth/reproduction and another part specializes in antibiotics production. This model provides a tangible mechanism to explain a similar division of labour observed in S. coelicolor. This mechanism also fits well with the large deletions observed in antibiotic-hyperproducing S. coelicolor cells, which are also repeatably generated during colony growth.

      Major comments: -Line 69, It would be good to give a bit more information here on the (number of) different types of antibiotics produced by S. coelicolor, to help the reader understand some of the modelling choices later on, such as allowing for the evolution of a large number (16 or higher if I understand correctly) of different antibiotics and a cell automatically being resistant to all antibiotics it produces (instead of having separate resistance genes). *

      AR: we agree with the reviewer that adding this information would put the model more in focus. The total number of antibiotics that can be produced by the genus Streptomyces has been estimated to be of the order of 100000 (ten to the fifth, [Watve et al., 2001]). Although we use S. Coelicolor as reference model organism for our computational model, we simulate long-term evolutionary dynamics that diversify the antibiotic repertoire. Each antibiotic is represented by a 16 bits string, meaning that there are 2^16 (= 65536) possible antibiotics in the system – consistent with the number of possible antibiotics in the genus.

      This being said, our model genomes evolve to have many more antibiotic genes than typical Streptomyces. Each species in the genus has up to 30 biosynthetic gene clusters [Genilloud, O. (2014)], a fraction of which make antibiotics. We discuss this discrepancy and propose solutions for this in the Discussion (also see below).

      Regarding the possibility of separating antibiotic resistance from antibiotic synthesis: we (and most literature on the eco-evolutionary dynamics of antibiotic-producing bacteria) simplified antibiotic production as depending on individual “antibiotic biosynthetic genes”. In reality several genes in a cluster must be expressed to synthesize an antibiotic. A typical biosynthetic gene cluster also encodes resistance genes for the cognate antibiotic, to prevent cell suicide [Mak et al., 2014] – hence antibiotic genes providing resistance in the model. This being said, Streptomyces genomes also host resistance genes to antibiotics for which they have no biosynthetic pathway themselves, including efflux pumps that give some nonspecific resistance [Nag et al 2021].

      Modelling antibiotic synthesis in more detail would allow to make a better model of antibiotic evolution, as well as to enrich the social dynamics of the model – because “cheaters” could evolve that are resistant but do not contribute to the antibiotics in the colony. These questions are certainly interesing, but would further complexify the model. They are exciting venues for future model expansions.

      We will include the literature mentioned above in the introduction, and use these references to better motivate the model.

      * -Lines 127-129 It is mentioned here fragile sites in the genome might represent transposable elements or long inverted repeats. Would both of these types of fragile sites behave the same? Has it been shown that both transposable elements and long inverted repeats can lead to large deletions from a linear chromosome? It would be nice to have a bit more background on how fragile sites might work or what they might look like in an empirical context. I am a bit unsure on this, but depending on their exact empirical nature, should fragile sites not also lead to increased rates of gene duplication near themselves? *

      AR: we see that we have not made a clear connection between the introduction, where we introduce the mutational dynamics of Streptomyces, and the methods, where we introduce fragile sites.

      Briefly, both duplications and deletions occur in Streptomyces, as well as circularization of the linear chromosome, conjugation, etc. [Hoff et al 2018,Tidjani et al 2019]. However, the outcome of all these mutations is biased towards deletion [hoff et al 2018, Zhang et al, 2020, Zhang et al, 2022]. There are many mechanisms involved in producing these mutations, forming the mutational hotspots, handling DNA breaks, and in the horizontal transfer of genetic material [Tidjani et al 2019; Lorentzi et al, 2021]. As the reviewer suggests – they do not behave all in the same way. To construct the model, we simplified all these mutational mechanisms into one genetic element, the “fragile site”, and assumed that they are solely responsible for the chromosomal-scale mutations that produce deletions.

      We will add this information to the introduction (see also response to reviewer 2), and refer to it in the methods.

      * -Line 160 As alluded to before, given the introduction provided, two assumptions come about here (lines 160-166) that lack a bit of justification/background/context. First, why does one allow the evolution of such a relatively large number of antibiotics? A bit more empirical in the introduction background would go a long way to making this assumption seem more justified. As far as I can see the genomic architecture leading to division of labour is only demonstrated for values of v that are 6 (i.e. 64 antibiotics) or above. Perhaps it is because I lack empirical background here, but this still seems to be a relatively large antibiotic space. Does the model also work with v=2? Perhaps it would be good to show a simulation with v=2 in supplementary material S16 as well. *

      AR: Hopefully the previous comment on the number of possible antibiotics also clarifies this point.

      We will carry out a simulation with v=2.

      * -Line 166 The assumption is made that if a bacterium produces a certain antibiotic, it is automatically resistant to this antibiotic. Now it could be that this assumption is empirically rooted, in which case it would be good to allude to this empirical justification. I wonder how would the results be impacted if the resistance genes were separated from the antibiotic production genes? (I do not think additional simulations are in any way necessary on this point, but some more context/thoughts on this matter would be helpful, perhaps near lines 306-309) *

      AR: Please see response to major comment on the possibility of separating antibiotic resistance from antibiotic synthesis. We will add the discussion there in the Discussion session.

      * -Figure 1 In the subscript it becomes evident that the probability of large deletions due to fragile sites is much higher (10 fold) than single gene duplications, it seems to me this should be the other way around, single gene duplications and deletions could be much more probable than fragile site induced large deletions. Would the model still produce the same results if the values for mu-d and mu-f were switched around? (Again, I do not think additional simulations are per se required, some justification for this assumption would already be plenty). *

      AR: We chose these parameter values because, empirically, large scale chromosomal rearrangements (deletions) occur more frequently than single gene duplication/deletion in Streptomyces – as they are the primary mechanism for Streptomyces development and division of labor. We now mention this in the caption of Fig. 1.

      Still, would we expect results to be affected if mu_d > mu_f? We do not think so, for the following reason: mu_d and mu_f are per-gene probabilities, so the genomic probability of duplication/deletion and of fragile site activation will depend on the evolved number of genes.

      in Fig. 5 we show that mu_f can be decreased by more than one order of magnitude and results do not change qualitatively. To compensate for a smaller per-gene deletion rate (mu_f), the evolved number of fragile sites per genome becomes larger (Suppl. Section S19, Fig. SF23). A similar compensatory increase of fragile sites could happen if duplications and deletions rate per gene were larger.

      * Minor comments: -Line 36, perhaps replace "must" with " can" as there are other ways to achieve a division of labour that do not hinge on genomic architecture such as those listed in the next sentence. This sentence seems at odds with the next one, which lists ways to achieve cell differentiation that do not per se completely rely on genomic architecture such as gene regulation. Maybe consider moving this sentence to be on line 40 (after "...organized at the genome level remains unclear") *

      AR: we will modify the text as suggested by the reviewer

      * -Line 48, perhaps remove "disposable" as there is no particular reason the somatic tissue is disposable, furthermore it invokes the disposable soma theory of aging which is not relevant here *

      AR: we will remove “disposable”.

      * -Line 147-148 Why these particular relationships, as a reader I do not understand how these functions were constructed and how they might influence the results, a bit more justification might be helpful. Perhaps later on (results/discussion) also address what might happen if you were to use different functions? *

      AR: we agree that these functions could use a little more explanation. The probability of replication is a function that increases with the number of growth genes. We assume that the function saturates, as growth cannot be arbitrarily large even if the genome hosts many growth genes. So we need at least two parameters: one for the maximum growth rate (alpha_g), and another that controls the curvature of the function (h_g). A simple choice is a Hill function, but other saturating functions would likely work just as well (e.g. an exponential function with a form alpha_g*(1-exp(-g/h_g)). Similarly, antibiotic synthesis inhibition from growth genes should tend to zero for larger numbers of growth genes, hence the exponential (but we expect that a hyperbolic form e.g 1/(1+g/beta_g) would work just the same).

      As this discussion is rather technical, we will include it in the methods section.

      * -I am clearly biased on this matter, since I work on evolvability. So, the authors should feel free to ignore this comment. Regardless, I think the authors have shown a wonderful example of the evolution of evolvability. Perhaps it would be nice to add a little bit of an evolvability angle in the discussion. In particular thinking about how fragile sites shape evolvability. *

      AR: we agree with the reviewer that the work is a clear form of evolution of evolvability. We now explicitly mention this in the discussion.

      * -Lines 404-411 It is great to see that the authors consider the wider applicability of their findings. It would be nice to add something here about the broader applicability in bacteria. As a large number of bacteria have circular chromosomes, how would these findings be impacted if circular chromosomes were at play? (I suspect they would largely still work in the same way, but keen to hear what the authors think). Referring to the work of Yona et al. 2012 on transient chromosomal duplications in yeast due to heat stress might also be good here, to show the more general applicability of the authors findings, this is another example where genomic architecture shapes evolvability. Yona AH, Manor YS, Herbst RH, Romano GH, Mitchell A, Kupiec M, Pilpel Y, Dahan O. Chromosomal duplication is a transient evolutionary solution to stress. Proc Natl Acad Sci U S A. 2012 Dec 18;109(51):21010-5. doi: 10.1073/pnas.1211150109. Epub 2012 Nov 29. PMID: 23197825; PMCID: PMC3529009. *

      AR: Bacteria show many forms of targeted mutational dynamics (we do already mention CRISPR and HGT). It recently came to our attention that many bacterial and archea genomes host so-called Diversity-Generating Retroelements (DGR) [Macadangdang et al, 2022]. DGRs accelerate microbial evolution at specific sites and generate functional diversity. We will include this reference in the discussion.

      We thank the reviewer for pointing us to the work on chromosomal duplication in yeast – we will also incorporate this “dramatic” form of duplication in the discussion.

      * -Lines 412 -419 I agree with the authors that in practice the cells specializing in antibiotic production look somewhat like soma, however I would consider not using this term here as strictly speaking the antibiotics producing cells can still reproduce (be it at an extremely low rate, which leads to their loss). *

      AR: We tone down both mentions of soma, as follows: “This gives rise to a division of labor driven by mutation, reminiscent of the division between germ and soma in multicellular eukaryotes.”

      And, in the last sentence, we write: “...mutant cells *effectively* function as soma by enhancing...”

      - Lines 434-438 If I understand correctly authors did not explicitly model the sporulation process (instead selecting random cells from the end of a cycle). I think this is a very good modelling choice that should not be changed; however, I do wonder how the results would be affected if sporulation was more explicitly modelled (for example by adding genes for sporulation, creating a 3 way trade-off between growth, sporulation and antibiotic production). Perhaps something that could be mentioned in the discussion.

      AR: we agree with the reviewer that more complex evolutionary problem could be implemented in the system, e.g. through a gene type required for sporulation. They would likely have interesting outcomes. For instance, some bacteria may decide never to sporulate, while others could enhance their antibiotic resistance by turning into spores. Moreover, including additional functions together with an evolvable gene regulation could better capture the developmental dynamics observed through the life cycle of Streptomyces.

      * I hope this review is of some use and helps the improvement of this manuscript. *

      * Yours sincerely,

      Timo van Eldijk

      Reviewer #3 (Significance (Required)):

      Significance: This study provides a clear conceptual advance by showing and studying how genome structure can evolve to create a division of labor. Thereby mechanistically explaining the division of labor in antibiotic production observed in S. coelicolor. It seems evident to me that whilst this study mainly focuses on S. coelicolor, the mechanism likely plays an important role in microbial evolution in general. Though others have previously theoretically explored such mechanisms, this study provides the first exploration modelled closely after an empirical system and hence provides a significant advance. In a more general sense, the evolution of genome architecture likely governs evolvability not just in microbes but in all life on earth. Therefore, I believe that this paper would be interesting for a general audience interested evolution. It would be of particular interest to those studying microbial evolution. My expertise lies in evolutionary biology, theoretical biology, microbial evolution and palaeontology. *

    1. Social workers treat each person in a caring and respectful fashion, mindful of individual differences and cultural and ethnic diversity. Social workers promote clients’ socially responsible self-determination. Social workers seek to enhance clients’ capacity and opportunity to change and to address their own needs.

      This specific area of the Code of Ethics is relevant to a situation that I have experienced in my field work for various reasons. First off, I encountered a situation when working with a client, with my supervisor present, that required me to keep this value and ethical principle in mind. In this instance, the client was expressing to me the way her family dynamic is run based on her cultural views. Although, my family dynamic may not be run the same way as my client's, in correspondence with the ethical principle of respecting her inherent dignity and worth, I used this information to be mindful of her individual differences and understand how this might play a role in her situation. This is an example of treating my client not only with respect but seeing how culturally her family operates their household. Furthermore, I could identify the emotional impact expressing this situation had on my client so I handled her feelings with empathy and compassion, as requested of us as social workers. As the session progressed, the client expressed difficulty in fully finding her drive when completing tasks in her daily routine. Due to this, I prepared self motivating activities and strategies for the client to visualize ways to push herself to complete these tasks. Specifically, we worked together on a time management chart where the client mapped out her schedule. In this, I was attempting to promote her socially responsible self-determination. By helping the client find her inner drive, I was attempting to instill in her that she was not only contributing to her personal self-determination but also to the social community around her as well. I hoped that helping her visualize this, it would make her want to act in positive ways and see how her inner self is a wonderful piece to the puzzle of her community around her. Moreover, this falls into enhancing her capacity and opportunity to change. My client expressed to my supervisor and I that it was difficult for her to pinpoint exactly what areas she wanted to work on and she felt like she needed assistance with. Through our discussion of her strengths, her interests, her hobbies, and especially her goals, I took note that the client began to open up more as the discussion went on and find her way of speaking on her struggles. This is a relevant example of respecting the dignity and worth of my client because in acknowledging her struggles, I am showing that I am not only a listener to her but a guide to her as well that seeks to help her find her inner desires. I constantly used phrases like, "I can absolutely see how you feel that way.." as a means of reassuring the client she is heard and respected. In my discussion with my client, the piece of the Code of Ethics that states, "seek to enhance clients capacity and opportunity to change" I think is especially relevant. This is because in my session with the client I aimed to have her strive to seek out just how much potential she truly possesses through conservation and self-reflection activities I provided. By giving my client strategies towards positive change, I felt as though this was representative of my client being able to pinpoint her needs while also understanding just the positive strategies she can use to meet those needs. In terms of the opportunity to change, in our session we talked a lot on this change towards a positive mindset and the ways she can do that on her own time as well, which I think also falls into this ethical principle.

    1. Signals of respect and disrespect Conflict resolution Restorative practices Ways of working Ways of handling emotion Response to trauma

      I like the way the article connects a tree with ourselves. Our outer selves are usually what people can see physically, the trunk is more internal thoughts of whatever you may have going on or how certain things in life have affected one, and then the roots are the more inner, mental type of thinking. This makes me think about how angry people are angry for their own respected reasons, why others can be more sensitive, and a different group might be more outgoing and laid back. It really makes you think about how everyone lives a different life, and how we all have different things going on that eventually make us unique in our own ways but also the same.

    1. The telephone and the phonograph, which already have done what seems to be almost miraculous work, may in time be made the means of conveying a message directly from the telegraph instrument to the person to whom it is addressed. But, until this is accomplished, we must acknowledge our dependence on the messenger-boys and fairly recognize them as person of business. 

      Its is crazy to think all of this had to be done to get to where we are today when it comes to technology.

    1. Author Response

      Reviewer #1 (Public Review):

      Anopheles is an important disease vector and the efforts to characterize the extent of genetic variation in the system are welcome. In this piece, the authors propose a Variational Autoencoders method to assign species boundaries in a large sample of Anopheles mosquitoes using a panel of 62 nuclear amplicons. Overall, the method performs well as it can assign samples to an acceptable granularity. The main advantage of the method is that it takes reduced representation genome sampling which should cut costs in genotyping. The authors do not compare the effectiveness of their amplicon panel with other approaches to do reduced representation sequencing, or the computational method with other previously published methods. Additionally, the manuscript does not clearly state what is the importance of species assignments and the findings/method are -by definition- limited to a single biological system.

      It is important to draw the reviewer’s attention to the fact that this is a two part approach – the reviewer seems to have overlooked the Nearest Neighbour component of the work. The approach is not solely a VAE – the VAE only comes into play at the species complex level. The higher level assignments are done using NN approaches.

      The manuscript has three main limitations. First, there is no explicit test of the performance of ANOSPP compared to other methods of low-dimensional sampling. While the authors state that the ANOSPP panel will lead to genotyping for low cost (justifiably so), there is no direct comparison to other low-representation methods (e.g., RAD-Seq, MSG).

      The key advantage of ANOSPP is that it works on the entire Anopheles genus; while the other suggested sequencing methods are more applicable to a group of specimens of the same or closely related species. The purpose of the panel is to do species identification for the whole genus; so it really is an alternative to the current methods of species identification, which commonly consists of morphological identification of the species complex, followed by complex-specific PCR amplification of a single species-diagnostic locus. The only other species identification method for Anopheles that is not limited to a single species complex, that we are aware of, is a mass spectrometry approach (Nabet et al. Malar J, 2021); however, they only investigate three different species and reach a classification accuracy of at most 67.5%.

      The main advantage of ANOSPP over other reduced representation sequencing methods, like MSG and RAD-Seq, is that it is specifically designed to work for the entire Anopheles genus to support genus-wide species identification. In a genus comprising an estimated 100 million years of divergence, a sequencing approach that relies on restriction enzymes is likely to introduce a lot of variability in which parts of the genome are sequenced for different species. Moreover, both MSG and RAD-Seq typically map the reads to a reference genome; any choice of reference genome will likely introduce considerable bias when dealing with such diverged species. In general, the sequence data generated by those sequencing methods require more complicated and labour intensive processing. And lastly, the costs per sample for library preparation and sequencing are substantially lower with ANOSPP than with MSG and RAD-Seq: for library prep <1 USD (ANOSPP) versus 5 USD (RAD-Seq) (Meek and Larson, Mol Ecol Resour, 2019) and with 768 samples (ANOSPP), 384 samples (MSG; Andolfatto et al, Genome Res., 2011) and 96 samples (RAD-Seq; Meek and Larson, Mol Ecol Resour, 2019) per run.

      Second, and on a related vein, the authors present NNoVAE as a novel solution to determine species boundaries in Anopheles. Perusing the very references the authors cite, it is clear that VAEs have been used before to delimit species boundaries which diminishes the novelty of the approach on its own.

      The VAE is only a part of the method presented in this manuscript. We believe a substantial amount of the value of NNoVAE lies in its ability to perform assignments for the entire Anopheles genus comprising over 100 MY of divergence - the closest analogous approach would be COI or ITS2 DNA barcoding, neither of which is robust for species complexes. Using NNoVAE, samples are first assigned to their relevant groups, and in many cases to their species, by the Nearest Neighbour method. Only those samples that are identified by the Nearest Neighbour method as members of the An. gambiae complex and cannot be unambiguously assigned to a single species, are passed through the VAE assignment method.

      Indeed, in (Derkarabetian et al, Mol Phylogenet Evol, 2019) VAEs are used to delimit species boundaries in an arachnid genus. However, this study works with ultra conserved elements, incorporating a total of 76kB of sequence, which is much more data than the approximately 10kB we get for all amplicons combined. Moreover, a crucial difference is that the referenced work uses SNP calls, based on alignment to one of their sequenced samples, as input for the VAE, where our VAE takes k-mer based inputs. This is also an important consideration in working with a large number of highly diverged species.

      Perhaps more importantly, the manuscript does not present a comparison with other methods of species delimitation (SPEDEStem, UML -this approach is cited in the paper though-), or even of assessment of population differentiation, such as STRUCTURE, ADMIXTURE, or ASTRAL concordance factors (to mention a few among many). The absence of this comparative framework makes it unclear how this method compares to other tools already available.

      NNoVAE is primarily a method for species assignment rather than for species delimitation. SPEDEStem addresses the question whether different groups of samples are separate species or not; different groups can be defined by e.g. described races, described subspecies, different morphotypes or different collection locations. The aim of ANOSPP and NNoVAE is to remove the necessity of any prior sorting of samples into groups – all that needs to be known is that the sample is an Anopheline. This avoids the issues associated with morphological identification and single marker molecular barcodes. So to perform species assignment with SPEDEStem, we’d have to run many replicates, each time asking whether a single sample is of the same species as one of the species represented in our reference database. For example, for the 2218 samples presented in the case studies, we would have to run SPEDEStem more than 130,000 times, to check for each of these samples whether they are any of the 62 species represented in the reference dataset NNv1.

      However, we agree that it would be good to check that the species-groups in the reference database, NNv1, are indeed supported as separate species. We attempted to run SPEDEStem, but the web browser no longer exists, and we were not able to install the command line application, which runs on Python 2. Moreover, the example files provided in the tutorial are not complete. Therefore, we were unable to even carry out this basic comparison.

      UML (unsupervised machine learning) approaches comprise quite a wide range of methods, including VAE. We have conducted a comparison between the VAE assignments and assignments based on UMAP, for the discussion see below and page 20 in the manuscript and newly added supplementary information section 4.

      As requested by the reviewer, we have compared our assignment approach to ADMIXTURE on the Anopheles gambiae complex training set (see Supplementary information section 5). It is a good sanity check to compare the structure revealed by ADMIXTURE to the structure revealed by the VAE. We found that ADMIXTURE does not satisfyingly differentiate between the species in the complex that are only represented by a handful of samples, while the VAE suffers much less from the differences in group sizes in the training set. Moreover, we want to point out that ADMIXTURE is a tool for assessing population differentiation, not for species assignment. To use it as an assignment method, there are two options: either infer the allele frequencies in the ancestral populations from the training set and use those to compute the maximum likelihood of ancestry frequencies for the test set; or run ADMIXTURE on the training and test sets combined and use the labels from the training set to label ancestral populations. A major drawback from the former approach is that it is tricky to discover cryptic taxa or outliers in the test set; while with the second approach we create a dependency of the training set results on the test set it is combined with during the run. But more importantly, ADMIXTURE performs worse than the VAE on the An. gambiae complex training set by itself; and identifies only two to three different groups among the five diverged species (An. melas, An. merus, An. quadriannulatus, An. bwambae and An. fontenillei). For more information, see page 20 in the manuscript and newly added supplementary information section 5

      One important use case of our method is to identify interesting samples, e.g. potential hybrids or cryptic taxa, for subsequent whole genome sequencing. After selection and whole genome sequencing of interesting samples detected by ANOSPP+NNoVAE, ADMIXTURE may be useful as one of the tools to investigate such samples.

      A final concern is less methodological and more related to the biology of the system. I am curious about the possibility of ascertainment bias induced by the amplicon panel. In particular, the authors conclusively demonstrate they can do species assignment with species that are already known. Nonetheless, there is the possibility of unsampled species and/or cryptic species. This later issue is brought up in passing the 'Gambiae complex classifier datasets' section but I think the possibility deserves a formal treatment. This is particularly important because the system shows such high levels of hybridization that the possibility of speciation by admixture is not trivial.

      We appreciate the reviewer’s concern regarding ascertainment bias in the amplicon panel. The targets have been selected based on multiple sequence alignments of all Anopheles reference genomes at the time (Makunin et al. Mol Ecol Resour, 2022). Using sequenced species from four different subgenera, the species span a considerable amount of evolutionary time in the Anopheles genus. For all species we have since tested the panel on, we find that at least half of the targets get amplified.

      We share the reviewer’s concern regarding species which are not (yet) represented in the reference database. This is one of the main advantages of the Nearest Neighbour method: it works on three levels of increasing granularity. So for samples that cannot be assigned at species level, we are often able to identify the group of species from the reference database it is closest to. In particular, the situation of a test sample whose species is not represented in the reference database, is mimicked in the drop-out experiment by the species-groups which contain only one sample. On page 16 in the manuscript, we explain how NNoVAE deals with such samples and we show that in the majority of cases NNoVAE assigns the sample to a group of closely related species rather than misclassifying it more specifically to the wrong species.

      In summary, the main limitation of the manuscript is that the authors do not really elaborate on the need for this method. The manuscript does show that the method is feasible but it is not forthcoming on why this is of importance, especially when there is the possibility of generating full genome sequences.

      ANOSPP and NNoVAE are specifically designed for high throughput accurate species identification across the entire Anopheles genus – WGS is important to address many questions, but is complete overkill for doing species identification. ANOSPP costs only a small fraction of whole genome sequencing, which makes it possible to monitor mosquito populations at much larger scale (e.g., in partnership with our vector biologist collaborators in Africa, we have already generated ANOSPP data for approximately 10,000 mosquitoes and will be running 500,000 over the next few years). Moreover, for most analyses using whole genome sequencing, a reference genome of a sufficiently similar species is required. While we are in a position of privilege having reference genomes for more than 20 species in Anopheles, we have a long way to go before we have 100s of reference genomes covering the true diversity of the genus.

      NNoVAE can also be used to select interesting samples (e.g. species that have not been through the panel before, divergent populations, potential hybrids), which can be submitted for whole genome sequencing subsequently.

      Since Anopheles is arguably one of the most important insects to characterize genetically, the ANOSPP panel is certainly important but I am not completely sure the method of species assignment is novel or groundbreaking .

      Reviewer #2 (Public Review):

      The medically important mosquito genus Anopheles contains many species that are difficult or impossible to distinguish morphologically, even for trained entomologists. Building on prior work on amplicon sequencing, Boddé et al. present a novel set of tools for in silico identification of anopheline mosquitoes. Briefly, they decompose haplotypes generated with amplicon sequencing into kmers to facilitate the process of finding similar sequences; then, using the closest sequence or sequences ("nearest neighbors") to a target, they predict taxonomic identity by the frequency of the neighbor sequences in all groups present in a reference database. In the An. gambiae species complex, which is well-known for its historical and ongoing introgression between closely-related species, this approach cannot distinguish species. Therefore, they also apply a deep learning method, variational autoencoders, to predict species identity. The nearest neighbor method achieves high accuracy for species outside the gambiae complex, and the variational autoencoder method achieves high accuracy for species within the complex.

      The main strength of this method (along with the associated methods in the paper on which this work builds) is its ability to speed up the identification of anopheline mosquitoes, therefore facilitating larger sample sizes for a wide breadth of questions in vector biology and beyond. This technique has the added advantage over many existing molecular identification protocols of being non-destructive. This high-throughput identification protocol that relies on a relatively straightforward amplicon sequencing procedure may be especially useful for the understudied species outside the well-resourced gambiae complex.

      An additional and intriguing strength of this method is that, when a species label cannot be predicted, some basic taxonomic predictions may still be made in some cases. Indeed, even in the case of known species, the authors find possible cryptic variation within An. hyrcanus and An. nili, demonstrating how useful this new tool can be.

      The main weakness of this method is that, as the authors note, accuracy is dependent on the quality and breadth of the reference database (which in turn relies on the expertise of entomologists). A substantial portion of the current reference database, NNv1, comes from one species complex, An. gambiae. This is reasonable given the complex's medical importance and long history of study; however, for that same reason, robust molecular and computational tools for identifying species in this complex already exist. The deep learning portion of this manuscript is a valuable development that can eventually be applied to other species complexes, but building up a sufficient database of specimens is non-trivial. For that reason, the nearest neighbor method may be the more immediately impactful portion of this paper; however, its usefulness will depend on good sampling and coverage outside the gambiae complex.

      Another potential caveat of this method is its portability. It is not clear from either the manuscript or the code repository how easy it would be for other researchers to use this method, and whether they would need to regenerate the reference database themselves. The authors clearly have expansive and immediate plans for this workflow; however, as many researchers will read this manuscript with an eye towards using these methods themselves, clarifying this point would be valuable.

      This is an important point; currently the amplicon panel is only run on specialised robots, but we are working to adapt the protocol so that it can be run in any basic molecular lab. We have now clarified this in the conclusion. Furthermore, there is never a need to regenerate the reference databases – this is fully publicly available at github.com/mariloubodde/NNoVAE and version controlled. As we obtain ANOSPP data from additional samples, representing new species or new within-species diversity, we will add these to the reference database and create an updated openly available version.

      The authors present data suggesting that their method is highly accurate in most of the species or groups tested. While the usefulness of this method will depend on the reference database, two points ameliorate this potential concern: it is already accurate on a wide breadth of species, including the understudied ones outside the An. gambiae complex; additionally, even when a specific species identification cannot be made, the specimen may be able to be placed in a higher taxonomic group.

      Overall, these new methods offer an additional avenue for identifying anopheline species; given their high-throughput nature, they will be most useful to researchers doing bulk collections or surveillance, especially where multiple morphologically similar species are common. These methods have the potential to speed up vector surveillance and the generation of many new insights into anopheline biology, genetics, and phylogeny.

    1. #stylez--3fKJu styles--3sKVw "> #_I_have_ "other ideas" that are related to our concentration here; and I really thinksomeone in a position like yours would benefit greatly from working on the branch of crypto related to "free communioation."I want to build an open social network ["protocol"] that combines "what email, facebook, reddit and ... wikipedia to enable "commenting on anything" the light of ...# "hey ma, where did all the online newspaper comments disappear to?"I know what has to go into it, I'm looking at things like hypothes.is, tableland.xyz and ... https://lnkd.in/gNbBAewt ... and I think it would be simple to put something together that will intrigue people; i know the software and infrastructure can offer us a bulletproof check on censorship that we need now more than ever before in history; and I'm having trouble figuring out why more people aren't interested in helping me ensure that we have a safe happy future free from "un america n th i ngz" like "no newspaper" and no recourse against insurance and credit fraud/problems; which is what I'm staring at in full blown disbelief.</textarea></div></div></label></div><div class="styles--2mJeY"><div class="styles--YBb-N styles--3IYUq"><div--

      stylez--3fKJu styles--3sKVw "> #I_have "other ideas" that are related to our concentration here; and I really think

      someone in a position like yours would benefit greatly from working on the branch of crypto related to "free communioation." I want to build an open social network ["protocol"] that combines "what email, facebook, reddit and ... wikipedia to enable " commenting on anything" the light of ...

      "hey ma, where did all the online newspaper comments disappear to?"

      I know what has to go into it, I'm looking at things like hypothes.is, tableland.xyz and ... https://lnkd.in/gNbBAewt ... and I think it would be simple to put something together that will intrigue people; i know the software and infrastructure can offer us a bulletproof check on censorship that we need now more than ever before in history; and I'm having trouble figuring out why more people aren't interested in helping me ensure that we have a safe happy future free from "un america n th i ngz" like "no newspaper" and no recourse against insurance and credit fraud/problems; which is what I'm staring at in full blown disbelief.</textarea></div></div></label></div><div class="styles--2mJeY"><div class="styles--YBb-N styles--3IYUq"><div -- The importance of seeing that it's an open "DeFi-inspir[ing/edu]" protocol that will work with existing service and interfaces like LinkedIn and Facebook and Mastodon and diasp.org is ... without question a necessary part of understanding the vision. I think we will see great leaps and bounds in interface design that make the "second small step" Dissenter/Unity and #hypothes eze ... have almost brought to the forefront of the "right venue." https://web.hypothes.is/sponsors/ Seeing #hypothesisontableland and having it work is the first "glaringly bright flash" that will ensure that we never again watch commenting and sites like discus and reddit and facebook turn from the light of social "what's on fire?" to the darkness of shadow ... "throttling" of the presentation of the world changing that somehow has missed our tongues and hopefully not our eyes.<br /> Hopefully once we start talking and getting more involved it will be clear how easy it was and is to make the world a better place just by ... "dropping in your two cents" or BTC as the case may be. --

      We've got to get serious about caring "of things like ourselves" for the truth and health and happiness that some of us probably take for granted as I do;

      still you can see me smiling when i know full well it's a little early for that--maybe you can help shift the timeline.

    1. On the top are slanting translucentscreens, on which material can be projected for convenientreading. There is a keyboard, and sets of buttons and levers.Otherwise it looks like an ordinary desk.

      I think it's interesting that Bush associates the memex with something the size of a desk. This may be due to the fact that he naturally assumed that something this powerful and containing this much information would need to be housed in a large apparatus. This is obviously not the case in modern day as we see smart devices hundreds of times smaller than a desk with all of the capabilities that Bush lists.

    2. Now, says Dr. Bush, instruments are at hand which, if properlydeveloped, will give man access to and command over the inheritedknowledge of the ages. The perfection of these pacific instrumentsshould be the first objective of our scientists as they emerge from theirwar work.

      During the time of World War II many scientific innovations were made. One view that many may not think about everyday while interacting with information. Moreover, its a view that is one of the most important as storing knowledge and accessibility of it. This in fact has opened the doors for the current digital age and provides a perspective on how far we have come and the amount of knowledge we have accessible to us currently.

    1. Children may never come to see fractions asbeing fundamentally different from whole num-bers and thus may fail to understand fractionoperations.

      This is why we use methods such as partitioning and iterating in order to see the fractions as fractions that can be divided or multiplied, instead of seeing the numerator and denominator of a fraction as separate whole numbers. This can be very confusing for students to think about this way.

    2. Improper fractions may be nonsensical to chil-dren because they may think that a quantity thatis more than the original amount is impossible.For example, 3/2 thought of as 3 out of 2 thingsis problematic, prompting the child to ask howshe can take three things when she has only twothings total.

      iterating is a process that makes improper fractions a lot more understandable. Using iterating, we can see that we have 3 equal parts of 1/2.

    1. Why is this important in this history of psychology?

      "The present work will, I venture to think, prove that I both saw at the time the value and scope of the law which I had discovered, and have since been able to apply it to some purpose in a few original lines of investigation. But here my claims cease. I have felt all my life, and I still feel, the most sincere satisfaction that Mr. Darwin had been at work long before me, and that it was not left for me to attempt to write 'The Origin of Species.' I have long since measured my own strength, and know well that it would be quite unequal to that task. Far abler men than myself may confess that they have not that untiring patience in accumulating and that wonderful skill in using large masses of facts of the most varied kinds, -- that wide and accurate physiological knowledge, -- that acuteness in devising, and skill in carrying out, experiments, and that admirable style of composition, at once clear, persuasive, and judicial, -- qualities which, in their harmonious combination, mark out Mr. Darwin as the man, perhaps of all men now living, best fitted for the great work he has undertaken and accomplished." This comes from the Classics in the History of Psychology Limits of Natural Selection By Chauncey Wright (1870). This shows us the importamce of the limits including in theories like this one. Natural selection indicates that the strongest will be the ones that will survive and there for will be the ones that will be able to have offsprings and make their generation endure. But thjis has a limit due to the sexual selection because it shows that the natural selection can not be impossed to people in any way or form. I see this working in psychology in a very big way because now that we are in a generation that is so ruled out by the social media this concept wants to persist and endure no matter what. I can see natural selecetion slowly decreasing amd really another type of selection evolving with the next future generations.

      Angela Cruz Cubero (Christian Cruz Cubero)

    1. #stylez--3fKJu styles--3sKVw "> #_I_have_ "other ideas" that are related to our concentration here; and I really thinksomeone in a position like yours would benefit greatly from working on the branch of crypto related to "free communioation."I want to build an open social network ["protocol"] that combines "what email, facebook, reddit and ... wikipedia to enable "commenting on anything" the light of ...# "hey ma, where did all the online newspaper comments disappear to?"I know what has to go into it, I'm looking at things like hypothes.is, tableland.xyz and ... https://lnkd.in/gNbBAewt ... and I think it would be simple to put something together that will intrigue people; i know the software and infrastructure can offer us a bulletproof check on censorship that we need now more than ever before in history; and I'm having trouble figuring out why more people aren't interested in helping me ensure that we have a safe happy future free from "un america n th i ngz" like "no newspaper" and no recourse against insurance and credit fraud/problems; which is what I'm staring at in full blown disbelief.</textarea></div></div></label></div><div class="styles--2mJeY"><div class="styles--YBb-N styles--3IYUq"><div

      The importance of seeing that it's an open "DeFi-inspir[ing/edu]" protocol that will work with existing service and interfaces like LinkedIn and Facebook and Mastodon and diasp.org is ... without question a necessary part of understanding the vision. I think we will see great leaps and bounds in interface design that make the "second small step" Dissenter/Unity and #hypothes eze ... have almost brought to the forefront of the "right venue."

      https://web.hypothes.is/sponsors/

      Seeing #hypothesisontableland and having it work is the first "glaringly bright flash" that will ensure that we never again watch commenting and sites like discus and reddit and facebook turn from the light of social "what's on fire?" to the darkness of shadow ... "throttling" of the presentation of the world changing that somehow has missed our tongues and hopefully not our eyes.

      Hopefully once we start talking and getting more involved it will be clear how easy it was and is to make the world a better place just by ... "dropping in your two cents" or BTC as the case may be.

    1. #stylez--3fKJu styles--3sKVw "> #_I_have_ "other ideas" that are related to our concentration here; and I really thinksomeone in a position like yours would benefit greatly from working on the branch of crypto related to "free communioation."I want to build an open social network ["protocol"] that combines "what email, facebook, reddit and ... wikipedia to enable "commenting on anything" the light of ...# "hey ma, where did all the online newspaper comments disappear to?"I know what has to go into it, I'm looking at things like hypothes.is, tableland.xyz and ... https://lnkd.in/gNbBAewt ... and I think it would be simple to put something together that will intrigue people; i know the software and infrastructure can offer us a bulletproof check on censorship that we need now more than ever before in history; and I'm having trouble figuring out why more people aren't interested in helping me ensure that we have a safe happy future free from "un america n th i ngz" like "no newspaper" and no recourse against insurance and credit fraud/problems; which is what I'm staring at in full blown disbelief.</textarea></div></div></label></div><div class="styles--2mJeY"><div class="styles--YBb-N styles--3IYUq"><div

      The importance of seeing that it's an open "DeFi-inspir[ing/edu]" protocol that will work with existing service and interfaces like LinkedIn and Facebook and Mastodon and diasp.org is ... without question a necessary part of understanding the vision. I think we will see great leaps and bounds in interface design that make the "second small step" Dissenter/Unity and #hypothes eze ... have almost brought to the forefront of the "right venue."

      https://web.hypothes.is/sponsors/

      Seeing #hypothesisontableland and having it work is the first "glaringly bright flash" that will ensure that we never again watch commenting and sites like discus and reddit and facebook turn from the light of social "what's on fire?" to the darkness of shadow ... "throttling" of the presentation of the world changing that somehow has missed our tongues and hopefully not our eyes.

      Hopefully once we start talking and getting more involved it will be clear how easy it was and is to make the world a better place just by ... "dropping in your two cents" or BTC as the case may be.

    1. The Andrew W. Mellon Foundation $752,000 1 January, 2014 The Andrew W. Mellon Foundation awarded Hypothesis a multi-year grant to support the development of annotation services for digital scholarly materials, including support for the I Annotate annual conference, I Annotate 2014: Annotato Ergo Sum.

      The importance of seeing that it's an open "DeFi-inspir[ing/edu]" protocol that will work with existing service and interfaces like LinkedIn and Facebook and Mastodon and diasp.org is ... without question a necessary part of understanding the vision. I think we will see great leaps and bounds in interface design that make the "second small step" Dissenter/Unity and #hypothes eze ... have almost brought to the forefront of the "right venue."

      https://web.hypothes.is/sponsors/

      Seeing #hypothesisontableland and having it work is the first "glaringly bright flash" that will ensure that we never again watch commenting and sites like discus and reddit and facebook turn from the light of social "what's on fire?" to the darkness of shadow ... "throttling" of the presentation of the world changing that somehow has missed our tongues and hopefully not our eyes.

      Hopefully once we start talking and getting more involved it will be clear how easy it was and is to make the world a better place just by ... "dropping in your two cents" or BTC as the case may be.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary: GlmS, the glucosamine-6-phosphate synthetase in E. coli and related bacteria, is essential, required for synthesis of both peptidoglycan and LPS. It is regulated at various levels, including positive regulation of GlmS translation by the Hfq-binding sRNA GlmZ. GlmZ activation of translation is regulated, indirectly, by the levels of GlcN6P, the product of GlmS. The components of the sensing and regulatory cascade have previously been defined, via genetics, biochemical and molecular biology studies. GlmZ is cleaved by Rnase E, becoming inactive, when GlcN6P levels are high, dependent upon the binding of GlmZ to RapZ. RapZ has been found to directly sense GlcN6P levels; another regulatory RNA, GlmY, also binds RapZ in the absence of GlcN6P, protecting GlmZ from RapZ-mediated processing. The authors of this manuscript performed cryoEM to study the structure of two important complexes in this sensing cascade, RapZ/GlmZ and RapZ/GlmZ/RNase E-NTD, with the aim of clarifying how the RNA binding protein RapZ causes the cleavage of sRNA GlmZ by RNaseE. Some of the predictions for critical residues in the RapZ/GlmZ binary complex structure were investigated by mutagenesis RapZ to define essential resiudes for GlmZ cleavage; the results are consistent with the structure.

      Major comments:

      • Are the key conclusions convincing? 1) Given that this is basically a structural paper, the major questions would be whether the cryoEM reconstructions are accurate (appear to be consistent with general expectations) and whether there is clear evidence to support the physiological relevance of the structure. The tests of function are of two sorts: a) Effect of RapZ mutants in Fig. 3b-d. These tests show loss of RapZ function with various alleles, likely consistent with model (but as noted below, very difficult for the reader to identify on the structures in 3a). The implication is that these will interfere with GlmZ binding. Possibly direct tests of a couple of these mutants for GlmZ binding (or pull down of GlmZ from in vivo expressed protein) would further support the model. I note that the text says T248A was unaffected in cleavage, but seems to be much reduced in Fig. 3b, even if fusion activity is good.

      Our reply. We have made further tests of the mutations for GlmZ binding. Using electrophoretic mobility shift assays, we observe reduced GlmZ binding affinities for RapZ mutants K170A, H190A, C247A, T248A (figure below). We also tested the activity of RapZ variant with 4 substitutions at the proposed RapZ/NTD interface (right lanes in figure below).

      We followed the recommendation of the reviewer and performed co-purification experiments (“pull-down”) using StrepTactin affinity chromatography and Strep-tagged RapZ variants as baits. Eluates were assessed for RapZ protein content and co-eluting GlmZ and processed GlmZ* sRNAs using Northern blotting. These new results, which have been incorporated in Fig. S7c, show that all tested RapZ variants except for the wild-type protein are not capable to pull-down GlmZ or GlmZ* in cell extracts. This includes the RapZ-T248A variant, which as noted by the referee is nonetheless still capable to decrease full-length GlmZ to some extent, albeit processed GlmZ* is hardly detectable (Fig. 3b, lanes 23, 24). To address this issue further, we purified the RapZ-T248A variant and some additional variants for comparison and performed EMSA. Globally, the EMSAs confirm the co-purification experiments, i.e. they demonstrate strongly reduced GlmZ binding activity for most tested RapZ variants, but also show that the RapZ-T248 variant kept some residual binding activity. This may explain the weak signal for processed GlmZ in the Northern blot (Fig. 3b) as processed GlmZ* likely binds to RapZ for stabilization. Similar effects were previously seen for the RapZquad and the RapZ 1-279 variants in Durica-Mitic et al. 2020 (Fig. 5). Accordingly, we also changed our wording concerning the RapZ-T248A variant in the text. We have not incorporated the EMSA figure into the updated manuscript.

      b) The ternary complex was tested primarily by the BACTH assay of some RapZ mutants (Fig. S11), that show a reduced interaction. This is not a particularly convincing assay for a number of reasons: 1) the effects are relatively modest (2x down, in an assay that is probably not very linear with interaction, 2) some with reduced interaction (S239A, T248A) had good activity (at least all those with full interaction seem to be functional); 3) Ternary complex suggests that RapZ mediates this interaction; this could be tested by deleting glmZ (and maybe glmY as well) from this BACTH strain. 4) the authors suggest that there are also important protein-protein interactions, based on some observed interactions, and support this with similarly difficult to interpret BACTH data from a previous paper for Rnase E-RapZ interaction. Here, too, that is not the most compelling data (is this interaction RNA-independent?).

      Our reply: Previous work already indicated that formation of the ternary complex involves multiple interactions – direct protein-protein contacts but also indirect interactions mediated by sRNA GlmZ. For instance, in vitro pull-down signals (RapZ = prey; RNase E = bait) become reduced but not abolished when RNA-free protein preparations are used (Durica-Mitic et al., 2020; Fig. 2E). BACTH signals are reduced 2-fold when using RNase E and RapZ variants that are strongly impaired in their RNA-binding capabilities, respectively (Durica-Mitic et al., 2020; Fig. 2C). As the BACTH assays and in vitro pull-down approaches yield similar trends, we suggest that BACTH experiments represent a useful approach to clarify the questions under study.

      Point b1: To demonstrate that removal of multiple interactions is required to disrupt the ternary complex, we combined substitutions of residues making contact to the sRNA as well as residues directly contacting RNase E. According to the structure of the ternary complex presented here, residues T161, Y240, N271 and Q273 in RapZ are proposed to contact RNase E directly. Upon substitution of these four latter residues, resulting in the RapZ variant named RapZ-4 subst., the BACTH signal decreases two-fold – similar to what is observed for the RapZ variants that carry Ala substitutions of residues involved in sRNA-binding, such as H190 or R253. Importantly, when the latter two substitutions are introduced into the RapZ-4 subst. variant – either alone or in combination, the BACTH signal is reduced to almost back-ground levels. These results are in agreement with the features of the ternary complex proposed here and also with data obtained previously: They show that protein-protein and protein-RNA contacts must be concomitantly removed to disrupt the complex completely. We integrated the latter data as Fig. S7a in the revised manuscript and discuss the data at the appropriate positions in the text.

      Point b2: In our opinion, the data reporting regulatory activity of the individual RapZ variants (Fig. 3 b-d) correlate well with the BACTH data (Fig. S7a): RapZ variants carrying substitutions of residues I175 and N236 retain regulatory activity and concomitantly a high RNase E interaction potential indistinguishable from the wild-type is observed. In contrast, RapZ variants carrying substitutions affecting sRNA-binding, i.e. H190A, C247A, C247S, T248D, G249W, R253A loose activity completely and concomitantly show a 2-fold decrease in the BACTH signal. The remaining BACTH signal is explained by the remaining (protein-protein) contacts as discussed above (point b1). Therefore, these variants are likely uncapable to present GlmZ in a correct manner to RNase E even though interaction is retained to some degree.

      Only the RapZ mutants with exchanges H171A, S239A and T248A do not follow either of these two scenarios: albeit they exhibit reduced interaction with RNase E according to BACTH, they retain the ability to regulate the chromosomal glmS’-lacZ fusion, at least when produced from a plasmid (Fig. 3d). However, inspection of the GlmZ Northern blot signals (Fig. 3b) reveals that full-length GlmZ is decreased as expected, but that processed GlmZ* becomes either not visible or is much reduced when compared to wild-type RapZ. This explains by a reduced sRNA binding affinity, as pointed out above (point 1a), which also provides a rationale for the decreased BACTH signal.

      Point b3: We agree that deletion of glmZ in the BACTH strain would be an ideal approach to dissect the contributions of protein-protein and sRNA-protein mediated interactions for formation of the ternary complex in vivo. Unfortunately, construction of the strain is not straight-forward. In our hands, the BACTH reporter strain BTH101 is not amenable to chromosomal manipulations by using engineered recombination tools such as the phage lambda-derived Red system. This may be explained by regulatory elements used by the l Red system that depend on cAMP, which cannot be synthesized in this strain.

      __Point b4: __We have addressed this query in the response to point b1.

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? Possibly the importance of RNAse E-RapZ direct interaction, without further proof that this actually is needed for function.

      __Our reply: __We partially addressed this issue already in our response to point b1. Additionally, we also tested activity of the RapZ-4 subst variant that lacks the residues making direct contact to RNase E in our structure (Fig. 3b-d, last two lanes/columns). The results that are now described in the last paragraph of the results section show that this variant retains regulatory activity. Interestingly, the level of processed GlmZ* is strongly reduced in this case, similar to what is observed with the RapZ-S239A and RapZ-T248A variants discussed above. Therefore, these direct protein-protein contacts might have a role for GlmZ* decay in a manner that remains to be addressed.

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation. As noted above, further tests of RapZ mutants for RNA binding would be useful; if this has been done previously, needs to be presented.

      Our reply.

      This has been addressed in the response above.

      Are there Rnase E residues that would be predicted by the model to be critical for the RapZ or GlmZ interaction but are not otherwise needed for activity? Would these disrupt either the BACTH results or activity in vivo?

      Our reply.

      Please see response to this point above.

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments. Yes, they are. They are generally extrapolations from what is already in the paper or in previous studies by these groups.
      • Are the data and the methods presented in such a way that they can be reproduced? Yes.
      • Are the experiments adequately replicated and statistical analysis adequate? Yes.

      Minor comments: - Specific experimental issues that are easily addressable. None noted. - Are prior studies referenced appropriately? Yes, they are. However, the paper could more clearly outline what is already known at the level of interactions of the molecules under study here.

      Our reply. We have changed the text to better introduce information from previous studies: interprotomer contacts, properties of the isolated RapZ domains, conclusions from the truncation analyses, requirements for interaction for RNase E and for sRNA-binding, stabilization of processed GlmZ through RapZ binding (Göpel et al., 2013; Gonzalez et al 2017; Durica-Mitic and Görke, 2019; Durica-Mitic et al., 2020).

      • Are the text and figures clear and accurate?
      • In a number of places, the text and figure order/numbers are not correct. See Fig. S1 (p. 4), S2 (legends vs. figure panels).

      Our reply. We have corrected these in the revised text.

      Better labeling in many figures is needed. Clarify what is shown in Fig. S2d, and make the labels readable. Label the particle types in S3. Use schematics more (as in Fig. 4 and S8) to make it easier for reader to follow structure (for Fig. 2, for instance). It is very difficult to discern RapZ tetramer here. Fig. 3a, it is very difficult to see the residue numbers on the structures. Clearly identify the fructokinase-like domains. Label lanes in Fig. 3b, c, d. Indicate active site for RNase E. in Fig. 4, in schematic at least.

      Our reply. We have also corrected these in the revised text.

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions? Overall, clarify and highlight better how the structures here fit with what is already known about important sequences/regions of RapZ, GlmZ, and Rnase E, maybe color-coding parts of GlmZ shown to be important for RapZ recognition, etc.

      Our reply. We have added a sequence alignment for RapZ in the supplementary materials section, indicating important residues (Fig. S12).

      Page 12, the second last row. Text after 'In this model...' can be simplified or removed because it is just a hypothesis.

      Our reply. We have simplified the text.

      Our reply:

      We believe that the discussion section should also give room for novel ideas and hypotheses. Therefore, we wish to keep the paragraph.

      Reviewer #1 (Significance (Required)):

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field. Rnase E is a major essential endonuclease in bacteria such as E. coli. How accessory proteins lead to its recognition and cleavage of regulatory RNAs such as GlmZ is not well understood at the structural level, and these structures provide important insight into that process. In addition, the GlmZ/RapZ regulatory circuit plays an important role in bacterial growth and pathogenesis, and understanding it at this level of detail will certainly open up possibilities for targeting this process in the future.

      • Place the work in the context of the existing literature (provide references, where appropriate). The components that go into the current structures have been studied previously, with publications on RapZ structure, analysis of critical regions within the GlmZ RNA, and demonstration of the domain of Rnase E involved in interactions with RapZ (Durica-Mitic et al, 2020; Khan et al, 2020, Gonzalez et al, 2017, among others), exactly how these fit together has not been known. Other RNA binding proteins that affect degradation have been reported, but are not fully understood, and ways in which the ribonuclease binds complex RNAs is not fully understood either.

      • State what audience might be interested in and influenced by the reported findings. This work should be of broad interested to the field of RNA-based regulation and RNA degradation, with particular interest for those working on these processes in bacteria.

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate. Our expertise is in RNA-based regulation and microbial genetics; we are not able to critically evaluate the cryoEM analysis itself.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary:

      Islam et al present their characterization of the E. coli RapZ-GlmZ-RNase E ternary complex in this manuscript under review. In E. coli, the RNA binding protein RapZ facilitates cleavage of GlmZ sRNA by RNase E when intracellular concentrations of GlcN-6P are high; when GlcN-6P levels are low RapZ is titrated by GlmY sRNA and GlmZ sRNA promotes an increase in the translation and stability of the mRNA encoding GlcN-6P synthase, GlmS. Via Cryo-EM, the authors of this manuscript solve the structure of the binary RapZ:GlmZ (Fig. 2) and ternary RapZ:GlmZ:RNase Y (Fig. 4) complexes. Based on the apparent RapZ-sRNA binding sites in the solved structure of the binary complex, the authors make substitutions in residues suspected to be involved in RNA binding and measure the impact of these substitutions on cleavage of GlmZ and GlmZ-mediated activation of GlmS expression (Fig. 3). The authors find that some of the residues predicted to be involved in RNA binding based on their structural studies are also important for the cleavage of GlmZ, presumably by RNase E. Finally, the authors show via bacterial two-hybrid assays that some residues of RapZ necessary for GlmZ cleavage are also important for its interaction with RNase E (Fig. S11). I would suggest that the authors measure co-immunoprecipitation of GlmZ with tagged-RapZ with or without substitutions in the proposed RNA binding residues to resolve this issue. Alternatively, EMSAs could be performed.

      Our reply. Please see the response above to reviewer 1. We have included results from EMSAs with selected RapZ mutants and for multiple mutations in the BACTH analysis.

      Major comments:

      Overall, the structural studies our impressive and provide considerable insight into the recognition of substrates by RapZ and RNase E. Given the dearth of solved structures of RNAs with their cognate RNA binding proteins, these results are very significant.

      A limitation in this work is the lack of experiments directly testing whether or not the residues of RapZ that appear to be important for its interaction with the GlmZ sRNA in the authors' Cryo-EM structures actually have a significant role in RNA binding. In lieu of measuring GlmZ binding by RapZ, the authors measure GlmZ cleavage in strains expressing RapZ or particular variants harboring substitutions in residues that appear to play a role in sRNA binding (Fig. 3b); however, it is impossible for the authors to determine whether impairment of GlmZ cleavage by RNase E in their assays is due to lack of GlmZ binding to RapZ, extraordinarily tight binding of GlmZ to RapZ, changes in the orientation of GlmZ bound to RapZ, or conformational changes in RapZ that lead to disruption of direct RapZ-RNase E contacts. The lack of this empirical data supporting their structural studies becomes more salient as the authors attempt to test whether RapZ binding of GlmZ is important for its interaction with RNase E via a bacterial two-hybrid assay. Since the authors have not directly examined the importance of particular RapZ residues on GlmZ binding, the authors' interpretation of their results from these assays is very speculative.

      Our reply: Reviewer 1 raised a similar point to which we replied above. The role of candidate residues in RapZ for binding GlmZ has been addressed by more direct assays (Pull-down/EMSA).

      The authors state on page 7 that "the interaction of RapZ:GlmZ with RNase E does not involve conformational rearrangement of either RapZ or GlmZ". However, the arrangement of SLII relative to SLI appears different between the RapZ:GlmZ and RapZ:GlmZ:RNase E structures presented. Additionally, SLII appears entirely bound by RapZ in the binary complex (Fig. 2b), whereas in the structure of the ternary complex, SLII appears less associated with RapZ (Fig. S4b). A supplementary figure showing side-by-side the structure of GlmZ bound to RapZ solved in the presence or absence of RNase E may make clear whether any differences that exist in the conformation of RapZ and GlmZ between the binary and ternary complex structures.

      Our reply: In the revised manuscript, we have included a supplementary figure showing side-by-side comparisons of the structures.

      Minor comments: Figure S1 legend. Change "inactivate" to "inactive" or "inactivated"

      Figure S2 legend. The description for "(d)" is for S2c and the text for "(c)" refers to the image in S2d.

      Figure legend S5a and S9a. If resolution in the key is in angstroms, then it should be indicated.

      Our reply: We have now corrected the above points in the revised text.

      Figure 1. The model appears to indicate that the apo-form of RapZ binds GlmZ and GlmY, whereas the GlcN-6P bound form does not. Moreover, in the discussion, the authors indicate that GlcN-6P interferes with GlmZ binding to RapZ. How does RapZ bind and cleave GlmZ when GlcN-6P levels are high, if GlcN-6P interferes with GlmZ binding? It would be useful for the authors to address this conundrum in their discussion.

      Our reply. We thank the reviewer for pointing out this paradox. Our unpublished work indicates that RapZ may have phosphatase activity for GlcN6P, and we added a comment to this in the discussion section.

      Fig. S3B and C. While panels in Fig. S3B and C seemed well aligned, numbering of lanes would provide additional clarity.

      We will provide lane numbers, accordingly.

      Many bacterial species including Bacillus subtilis, Streptococcus pyogenes, and Clostridium botulinum have RapZ homologs that bear a tyrosine instead of a histidine residue at the position corresponding to H190 in E. coli RapZ. Would you expect this change to reduce GlmZ binding by RapZ or lead to change in RNA specificity based on your structural data? This may be useful to discuss in the manuscript.

      We believe that the is more behind this question. Likely, the referee (by inspecting a RapZ sequence alignment) realized that almost all residues proposed to be involved in binding GlmZ are also conserved in RapZ homologs in Gram-positive bacteria, unless His190 and His171, which are replaced by tyrosines in some of these species. However, no RNA-binding activity has been reported for the Gram-positive RapZ homologs. If true, the question arises what is making the difference here? In principle, this could be due to the lacking histidine residues, which are replaced by tyrosines in Gram-positive RapZs. Alternatively, we consider that the positively charged residues at the far C-terminus (K270, K281, R282, K283), which were identified previously to be required for sRNA binding (Göpel et al., 2013; Durica-Mitic et al., 2020), and which could not be resolved in the current structures, are additionally required to obtain RNA-binding activity.

      Fig. S10. It is confusing to me that the yellow chain in the structure of RNase E is labeled as the DNase I-domain in the apo structure, whereas in the structure with RprA or GlmZ bound, this colored region is labeled as the 5' sensing domain.

      We have changed the figure to make it clearer.

      On page 12, the authors appear to indicate that their structural studies of the RapZ-GlmZ-RNase E ternary complex could be informative with regards to how KH domain proteins in Gram-positive bacteria could present their substrates to RNase E. First of all, these bacteria lack RNase E and instead encode an evolutionarily distinct endoribonuclease (RNase Y). Secondly, I think that it is overreaching to state that these structural studies will inform us on how KH domain proteins such as KhpA/KhpB, which may or may not have a chaperoning function akin to Hfq in Gram-positive bacteria, present substrates to RNase Y. Regardless, if this statement is to remain, the authors should make clear that is RNase Y and not RNase E that they are referring to.

      We have changed the text to make clear that a different RNase is employed in this case.

      Reviewer #2 (Significance (Required)):

      In my opinion, the significance of this work is in the achievement of high-resolution structures of the complexes of the RNA binding protein RapZ and the endoribonuclease RNase Y with RNA substrate bound. There are very few structures solved of RNA binding proteins or RNases with their cognate substrates. This is likely due to the difficult in obtaining high resolution data for the bound RNA that may have a large degree of flexibility or many alternative conformations. More structures like this are needed to advance our understanding of RNA-protein interactions.

      I believe that these findings would not only be of great interest to those that study small regulatory RNAs, such as myself, but also others more generally interested in RNA binding proteins, RNases, or protein-RNA interactions.

      Field of expertise: small regulatory RNAs, RNA chaperones, RNases

      **Referees cross-commenting**

      1. I agree with Reviewer #1 that the results of the bacterial two-hybrid assay would be more informative, if the authors tested the impact of deletion of glmZ on the ability of the wild type and mutant RapZ proteins to interact with RNase Y by this assay.

      As both reviewer #1 and I indicated, I think that it would be useful for the authors to directly assess the effect of key substitutions in RapZ on GlmY binding by a more direct measure of interaction, e.g., CoIP or EMSA.

      I do think that it would be nice at some point for the authors to actually provide evidence that GlcN6P binds to the site that they predict as reviewer 3 suggested but this may be be beyond the scope of this manuscript and may be better addressed in another manuscript in which the authors solve the structure of RapZ with GlcN6P bound. In the meantime, the authors could limit their speculation.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: The biogenesis of the bacterial cell envelope relies on glucosamine-6-phosphate (GlcN6P), which is mediated by GlmZ and the sRNA-binding protein RapZ. GlmZ stimulates translation of the GlcN6P synthetase. When the levels of the GlcN6P are sufficiently high, RapZ will presents GlmZ to the endoribonuclease RNase E for cleavage and thereby silencing synthesis of the GlcN6P synthetase. However, how RapZ recruit RNase E to GlmZ for degradation is still unsolved. This paper reports the cryoEM structure of the binary complex of RapZ: GlmZ and the ternary complex of the RNase E catalytic domain (RNase E-NTD), RapZ and GlmZ. RapZ interacts with SLI and SLII of GlmZ through complementarity in shape and electrostatic charge to the phosphodiester backbone of the sRNA and presents the sRNA by alignning its SSR comprising the cleavage site into the RNase E active center. This paper suggests a general RNase E recognition pathway for complex substrates, which will help to understand the mechanisms that other RNA chaperones such as Hfq might work in an analogous assembly to present base-paired sRNA/mRNA pairs for cleavage. In total, this is an excellent work. I will support the publication of it until these following points are presented.

      Major comments: 1. It was mentioned on Page 5 that "Sulphate and malonate ions were previously seen at these positions in crystal structures of apo RapZ" and pn Page 11 that " Interestingly, the phosphate groups of the RNA backbone occupy positions in RapZ that were previously observed to bind sulphate or malonate ions in the crystal structure of apo-RapZ, suggesting that this pocket could be the binding site for a charged metabolite such as GlcN6P". Is there any following experiments to investigate it further? If possible, I suggest the author to confirm that weather RapZ has the binding activity with GlcN6P or not.

      Binding of GlcN6P by the RapZ-CTD was demonstrated previously by SPR as well as by metabolomics of metabolites copurifying with RapZ (Khan et al., 2020), although evidence that the “sulphate/malonate binding sites” in RapZ also bind GlcN6P is still lacking. Crystallization of RapZ+GlcN6P is not straight forward as bound GlcN6P is apparently hydrolyzed over time.

      "The kinase-like N-terminal domain of RapZ (NTD) makes only a few interactions with the RNA, and the path of the RNA does not encounter the Walker A or B motifs (Figure 2b). It is possible that this domain could act as an allosteric switch, whereby the binding of an as yet unknown ligand triggers quaternary structural changes that affect RapZ functions." Is there any more structural information supporting it? If the domain act as an allosteric switch, is it possible to make some deletion or substitution to test it?

      The properties of the separated NTD and CTD of RapZ were assessed in previous work.

      Is there any results to compare the binding affinity of GlmY and GlmZ with RapZ?

      Affinities were determined previously using complimentary techniques:

      Göpel et al., 2013/EMSA: KD GlmY ~ 30 nM; KD GlmZ ~ 75 nM

      Gonzalez et al., 2017/biolayer interferometry: ~ 50 nM for both GlmY/GlmZ (full-length)

      Minor comments: 1. Page 8, is it "stabilised" or "stabilized", please check it.

      We have changed the spelling to “stabilized”.

      The legends for Figure S2 c and d are reversed.

      This has now been corrected.

      It was suggested to show the RNA molecules in Figure S1a.

      We have changed the figure to include single-stranded RNA substrate.

      Reviewer #3 (Significance (Required)):

      This paper suggests a general RNase E recognition pathway for complex substrates, which will help to understand the mechanisms that other RNA chaperones such as Hfq might work in an analogous assembly to present base-paired sRNA/mRNA pairs for cleavage. In total, this is an excellent work.

    1. Despite such claims against CAI, many of the researchers of the decade produced empirical evidence showing the significant benefits of CAI. Kinzie et al. found “a strong positive effect of computers on continuing motivation” (1989 p. 12), while Tennyson et al. (1980) showed how computers can aid and empower learners in taking control of meeting their own learning needs. This was similar to Dalton et al. (1987), who claimed that computers aid instructors and practitioners in providing personalized learning experiences to students. Yet the research of the decade continued to be rife with conflicting opinions as researchers sought to understand and define the role of technology, specifically computers, in education.

      This paragraph, and reading in general, makes me think about the positives and negatives of learning almost solely with computers during the midst of the pandemic we are in. Computers are a technological device I can’t imagine not being able to use in my high school and college careers. When reflecting on my past zoom classes I have taken, I do believe that it is a valid point to bring up the statement that suggests that the technology only benefits students if the teaching is implemented in a well thought and beneficial way. When we made the sift to online school it was not only an adjustment for students, but teachers as well. This class, and material within, really makes me think about how the sudden switch to the use of technology to learn has impacted my college experience. I can definitely pick out certain classes that gave me more meaningful experiences due to the way that the teacher was able to utilize the technology at hand. Lastly, it makes me think about what educational technology may look like in school settings when I plan to become a teacher myself in the next 5+ years.

    2. Access to mobile devices or computers is essential for students to participate in “flipped classrooms,” a model which grew in popularity during the 2010s. With flipped classrooms, what was “previously class content (teacher led instruction)” is replaced with “what was previously homework (assigned activities to complete) now taking place within the class” (O’Flaherty & Phillips, 2015, p. 85). This method of instruction emerged in the 2010s in response to increased access to technology and understanding of its benefits.

      I found this section interesting as I believe its a tactic used more toward those in high school / college students. Growing up, I think children needed more 'step-by step / how to' instruction when it came to our education as we were learning things we've never encountered prior but as you get older and move onto more advanced studies I believe that we're more so relating and understanding concepts to build onto base knowledge we already have. So although I do find this tactic valuable currently at this age, I don't know how beneficial this would be to younger kids as they may not even be able to accomplish their task if they lack the instruction they need when doing their assignment online without an educator in their direct presence.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Spinal cord injury (SCI) is a damage to the spinal cord, that causes temporary or permanent changes in its function. While in mammals the regeneration process are very limited zebrafish are able to repair the spinal cord. Based on the hypothesis, that the vascular response might affect the regeneration capacity, the paper by Ribeiro et al addresses the structure and injury response of the spinal cord vasculature. As the growth of zebrafish larvae and juveniles depends a lot on the individual response to the environment, the authors first established comparable body measurement parameters (other than age) and observed the natural spinal cord vascularization process, starting from 6mm body length of the animals. Using transgenic lines the authors describe the formation and patterning of endothelial cells and pericytes up to 9mm length, when a more developed vascular network was present. They observe the processes of vascular regeneration after a contusion based SCI model at different time points (days post injury (dpi)) and in correlation with glial and axonal regrowth, also observing BSCB barrier integrity, angiogenesis, pericyte recruitment and the dependence on Vegf signaling.

      The study is interesting and novel, vascular structures in the zebrafish adult spinal cord have not been reported yet and neither has the vascular response to SCI. Currently the study remains very descriptive, although the authors tried to add functional data, by inhibiting Vegf signaling.

      Major points for revision: The authors fail to establish whether there is any relationship between spinal cord regeneration and vessel regeneration. While I do very well understand the challenges and limitations the authors should put more effort into functional analyses.

      For example: the authors address EC proliferation as a marker for angiogenesis, but do not analyse whether or how much EC proliferation is required for revascularization and regeneration. Pharmacological inhibition of proliferation should be possible and used. From a vascular point of view it would also be interesting whether there is a differential influence of tip or stalk cell proliferation.

      Although we agree that it would be interesting to inhibit EC proliferation to assess its role in spinal cord regeneration, the use of proliferation inhibiting drugs would likely have a widespread effect on the lesioned spinal cord, since many cell types proliferate in response to injury. Therefore, a pharmacological approach would not allow us to dissect the specific role of endothelial proliferation.

      The same is true for pericyte recruitment: the role of pericytes for the vascular repair or the spinal cord regeneration is not clear. The authors could use use mutants with impaired pericyte development or e.g. nitroreductase mediated ablation of pericytes.

      These experiments have been performed in larvae by Tsata et al. (2021). Although it would be interesting to repeat in adults, we believe that these experiments are beyond the focus of our study.

      The statements regarding the role of Vegf are too bold. The problem lies in the limitations of assessing the efficiency of Vegf inhibition. The heatshock promotor has been shown to induce transcription for up to 4 hours, depending on the efficiency of heatshock. There are no data on the stability of dnVegfaa protein. Likewise the pharmacological inhibition could be far from complete. A full inhibition of Vegf signaling is expected to stop vessel growth or angiogenesis. While it is a sign of good practice, that the authors combined a genetic model with a pharmacological one, both leave the same unresolved issue. However if we believe a very limites requirement of Vegf-signaling, it would be interesting to look for other signaling pathways, like cxcl, IL, or FGF to regulates regenerative angiogenesis.

      We agree that our data does not allow us assess the level of inhibition of the Vegf pathway. Since we are unable to confirm this at the moment, we will be excluding the Vegf inhibition data and make this a descriptive study.

      Minor issues

      The correlation with spinal cord repair could be stated more clearly throughout the manuscript. For the uninformed reader it is less clear when exactly the spinal cord is functional again.

      We will include in Fig. 3 a plot of the swimming capacity in contusion-injured fish until 90 dpi and will explain in the text how the vascular response correlates with the functional recovery.

      While I find the model in figure 8 very helpful, it gives 5 to 30 days, for the neuronal regeneration. Maybe a more detailed timeline of EC regeneration and remodeling correlating with neuronal repair would help.

      We will update the model in Fig. 8 with a more detailed timeline and a better description of structures important for regeneration (glial bridge, axonal regrowth).

      In line with that in figure 4 it is not clear whether the images of different time points are indeed one individual animal at the different time points or representative animals for the stage (also figure 4 lacks panel labels, in my copy I can see A, K and L, but no other letters).

      We will detail in the figure legend that the images are of different animals that are representative for each stage.

      For understanding the (re)vascularization, the direction of blood flow might be helpful.

      We will perform an additional experiment to characterise the direction of blood flow in uninjured fish. For this we will use juvenile fish with a body size of 7-9 mm, in which we expect to be able to perform live imaging. We will use a lighsheet microscope to image circulating cells in the spinal cord blood vessels in fish with labelled thrombocytes (Tg(-6.0itga2b:EGFP); Lin et al., 2005) and endothelial cells (Tg(kdrl:ras-mCherry)). These transgenic lines are already available in our fish facility. Even though the vascular network has not yet reached its mature stage at these body sizes, we expect to have enough intraspinal vessels to describe the blood flow circuit.

      Especially for the connection between spinal cord regeneration and vessel regeneration. Does blood flow regulate vessel pruning after 14 dpi?

      Although we agree with the reviewer that it would be interesting to understand how blood flow direction is reestablished in repaired vessels and how blood flow levels correlate with vessel remodelling and pruning, this would be difficult to assess in this system. This could be examined using live imaging, but this technique is challenging in adult zebrafish and has only been carried out in more superficial organs than the spinal cord, such as skin (Castranova et al., 2022) and superficial brain structures (Barbosa et al., 2015; Castranova et al., 2021). In addition, SC-injured fish are more sensitive to external conditions and would probably not survive the long-term/repeated anaesthesia required for imaging.

      This analysis could be performed in fixed samples, for example using the the Golgi complex position in relation to the endothelial nuclei as a proxy for blood flow direction (Kwon et al., 2016), however: (1) this would require a new transgenic line (Tg(fli1a: B4GALT1-mCherry)) that would take time to import and establish in the lab; (2) the identification of regressing vessels is not straightforward in fixed samples and is usually studied in very well established vascular models, such as the mouse retina and zebrafish ISVs (Franco et al., 2015).

      For these reasons, we will not address this question by reviewer 1.

      The combined Vegfaa DN and PTK treatment data looks like it could be inhibiting endothelial cell proliferation (Figure7I).However, Supplementary Figure 8B shows endothelial proliferation does not change. Does it mean the number of endothelial cells is same but the volume of endothelial cells decrees?

      We will not be addressing the changes in endothelial density in the presence of dn-vegfaa and PTK787, since we will be removing the figures related to Vegf inhibition.

      There are also some remaining grammatical errors, for example (but NOT limited to) line 133 to 135.

      We will review grammatical errors in the text.

      As a personal interest I think evaluating the role of Notch in the SCI model would also be very interesting, especially with regard to the vasculature, however that might be out of the scope of the manuscript.

      We agree that Notch signalling may be a player during spinal cord revascularisation. However, mutants for dll4 (the Notch ligand involved in angiogenesis) die between 7-14 dpf and cannot be used for this study. In addition, the use of Notch-inhibiting drugs would likely have pleiotropic effects, since the Notch pathway is also involved in other aspects of spinal cord regeneration, namely in the regulation of regenerative neurogenesis (Dias et al., 2012). To our knowledge, tools that allow the endothelial-specific inhibition of the Notch pathway have not been developed, and therefore we will not be able to address this question.

      Reviewer #1 (Significance (Required)):

      The study is partially descriptive, but very novel as the aspects of vascularisation in a spinal cord injury model have not been described before. If the major revisions regarding functionality are addressed fully, I would wholeheartedly recommend publication and expect an interest for a broad audience. The presented images and their analyses are of very high quality, and therefore also enhance the impact of the study.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The study by Ribeiro et al. investigates the formation of new blood vessels after spinal cord injury in adult zebrafish. The authors initially characterize the extend of spinal cord vascularization during the development of juvenile zebrafish and investigate the association of pericytes with the newly forming vasculature. They then injure the spinal cord and describe the subsequent regeneration of blood vessels. They perform assays to analyze the functionality of the newly forming blood vessels and show that initially blood vessels are leaky. Through EdU labelling the authors show that endothelial cells proliferate. Pericytes similarly increased in numbers. Lastly, the authors inhibited VEGF signaling, which only mildly affected vascular regeneration.

      Together, this manuscript describes the re-vascularization of the regenerating spinal cord in adult zebrafish and addresses how blood vessels mature during this process through pericyte recruitment and decrease in leakiness. The manuscript provides some interesting initial insights into spinal cord vascularization, but is mainly descriptive and unfortunately remains superficial in this regard, as specified below:

      1. The authors only refer to "blood vessels" without specifying the type of blood vessels they observe (are these veins, arteries, capillaries)? A wealth of markers and transgenic zebrafish lines are available to better characterize spinal cord vessels. This is not necessary in case the authors solely refer to "blood vessels" as they do, but it greatly limits the insights into spinal cord vascularization. For instance, Wild et al. (2017) showed that new vessels apparently sprout from veins in the spinal cord. Is this also true during regeneration?

      We will perform RNA in situ hybridisation using probes for arterial and venous markers. We will assay the expression of arterial markers (dll4, dlc, flt1 and efnb2a) and venous markers (flt4 and ephb4a) in uninjured spinal cord (to characterise vessel identity in homeostasis) and in 3 and 7 dpi spinal cords (to investigate the identify of angiogenic vessels during regeneration).

      1. The authors state that their characterization revealed a "stereotypic organization of blood vessels". However, the organization does not appear to be stereotypic (as I understand this term as looking the same in each fish) at all. Can the authors compare e.g. 3 or 5 wildtype fish and extract features that all fish share and those that differ between fish? This would greatly enhance our understanding of the vascular variability within the wildtype population.

      We will provide an additional figure comparing the spinal cord vasculature in different fish.

      1. The authors show an interesting metameric organization of the vasculature with regions of high vascularization interspersed with sparsely vascularized areas. Are there any morphological landmarks that would precipitate these differences?

      We will acquire light sheet images of adult spinal cords without removing the vertebrae. This will allow us to determine if the metameric organisation is correlated with the vertebral distribution.

      Can the authors check whether they induce a lesion in a highly or poorly vascularized area? This might greatly influence the degree of re-vascularization.

      We always perform the spinal cord injury in the region between neural arches (Dietrich et al., 2021). Once we determine how the vasculature is organised in relation to the vertebrae, we will be able to determine if the lesions are performed in a region of high or low vascularisation.

      1. The same superficial characterization unfortunately also applies to the cell population the authors refer to as "pericytes". Traditionally, pericytes are characterized as being associated with capillaries and sharing a basement membrane with the endothelium. Is this the case here?

      We will further characterise the association between Tg(pdgfrß:citrine)-positive cells and blood vessels using an anti-laminin antibody (#L9393, Sigma) to label the basement membrane. Preliminary results recently acquired indicate that Tg(pdgfrß:citrine)-positive perivascular cells and endothelial cells are both enveloped by the basement membrane, supporting the identity of Tg(pdgfrß:citrine)-positive cells as pericytes. Moreover, pericytes are generally described as solitary mural cells associated with small diameter blood vessels (the type of distribution we observe for Tg(pdgfrß:citrine)-positive cells), whereas vascular smooth muscle cells (vSMCs) form concentric layers around larger blood vessels (a distribution we do not detect with this transgene) (Hellström et al., 1999). For these reasons we believe that this transgene is labelling pericytes. We will explain more clearly in the text how the morphology, localisation and density of Tg(pdgfrß:citrine)-positive cells suggests these cells are pericytes.

      In addition, pdgfrb is hardly specific for pericytes, as it also labels a multitude of other cell types (refer to e.g. Tsata et al. (2021)).

      The different cell types labelled by the pdgfrb reporter line used in the Tsata et al., 2021 paper were identified not by the use of different cell markers, but by their localisation: perivascular cells (the same cell type that we also detect), myoseptal cells (which we would not expect to detect, since we are only analysing the spinal cord tissue and not the adjacent muscle) and floor plate cells (a reporter distribution that the authors show is lost after 3 dpf and is not present in the adult spinal cord). Moreover, the Tsata et al., 2021 paper also includes a supplementary figure (S1, panel N) showing a restricted perivascular pdgfrb:GFP distribution in the wholemount adult spinal cord, in agreement with our characterisation. By their morphology and density, these perivascular cells are likely pericytes, as argued above.

      It is also not clear why the transgenic pdgfrb line the authors use only labels cells next to blood vessels. Tsata et al. show a much broader labelling. The authors need to validate their transgenic line using in situ hybridization showing where pdgfrb is being expressed endogenously and how this overlaps with the fluorescent protein expression of the pdgfrb transgenic line.

      We will perform ISH for pdgfrb to confirm if the Tg(pdgfrß:citrine) reporter reproduces the endogenous expression in the uninjured spinal cord and at 3 and 7dpi. The 3-7 dpi period is approximately equivalent to the 1-2 days post-lesion in larvae and, if the non-perivascular pdgfrb:GFP cells observed in the larval spinal cord are present in the adult, we expect to detect them by ISH during this phase of regeneration.

      There are also several transgenic lines available that allow for the distinction between smooth muscle cells and pericytes (e.g. Shih,..., Lawson, Development 2021 and Whitesell,..., Childs, Plos ONE 2014). As for the vasculature, this more detailed characterization is not necessary in case the authors refer to the cells as "cells labelled by the pdgfrb transgene and reside next to endothelial cells". However, this would not be reflective of the level of detail currently present in the field.

      As we explain above, the morphology and density of the pdgfrb:Citrine-positive cells suggests that these cells are pericytes and not smooth muscle cells (SMCs). To confirm this we will compare the expression of pdgfrb with markers of SMCs (i.e, 𝛼-smooth muscle actin and desmin) using immunohistochemistry and/or ISH.

      The reviewer also suggests the characterisation of pericyte subtypes using the lines described by Shih et al., 2021. Although this would be interesting, we do not consider it is essential for our study. It would be very demanding to import the reporter lines and it is not certain that these subtypes are present in the spinal cord.

      1. The authors state that "New blood vessels rapidly attracted pericytes, formed through proliferation and possibly migration of existing pericytes". This statement is not supported by the data, as the authors do not perform lineage tracing of pre-existing pericytes. The authors need to specifically label existing pericytes and then follow whether these pre-labelled cells can be found on newly forming blood vessels. Tsata et al. provide some evidence for this in zebrafish larvae, but they also conclude that pdgfrb expressing tenocytes contribute to new mural cells.

      We will reformulate the sentence to clarify that we detect pericyte proliferation, but pdgfrb-lineage tracing would be needed to provide evidence that existing pericytes contribute to the generation of mural cells associated to new blood vessels. However, we will not perform the lineage tracing experiment for the revision, as we are unable to currently import this line.

      1. The findings that new blood vessel growth only marginally depended on VEGFA signaling is striking. However, it might also point towards an inefficient inhibition of VEGFA signaling. In particular, other publications, for instance Cattin et al. 2015 have shown that inhibiting VEGFA signaling prevents new blood vessel growth during peripheral nerve regeneration in mouse. It will therefore be important that the authors demonstrate that their approach leads to successful inhibition of VEGFA signaling. VEGFAB mutants appear to be homozygous viable and important for spinal cord vascularization (Matsuoka et al., 2017). In addition, heterozygous VEGFAA mutants already have some vascular phenotypes, but are also viable. Can the authors combine these mutants with their inhibitor treatments to achieve a greater reduction in VEGFA signaling?

      Since we are unable to confirm the level of inhibition of the Vegf pathway and we are unable to import the suggested lines at the moment, we will be excluding the Vegf inhibition data.

      Reviewer #2 (Significance (Required)):

      Together, this publication is the first to describe to some extend the regenerating vasculature after spinal cord injury in adult zebrafish. However, both the vascular and regeneration fields are much more advanced than what the authors cover. Both blood vessels and perivascular cells can be characterized in much more detail, as outlined above. Also, studies on nerve regeneration and its dependence on the vasculature, e.g. during peripheral nerve regeneration in mouse have been carried out with a wealth of functional data available. Therefore, the impact of the present study in its current form will be limited. I am an expert on zebrafish blood vessel development.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).

      Ribeiro et al. described vascular development in the spinal cord from larval to adult stages in zebrafish, and found the dependence of vessel length on body-size. Then, the authors depicted the vascular regeneration process after spinal cord injury (SCI), which includes initial vascularization, angiogenesis, pericyte recruitment, and blood-spinal cord barrier establishment. Although the molecules or signaling pathways that drive the re-vascularization remain unidentified, this study illustrates the cellular processes of spinal cord vascular development and regeneration from the descriptive level, which may facilitate further understandings of mechanisms underlying vascular regeneration in the spinal cord.

      Major comments: - Are the key conclusions convincing? The descriptions of spinal cord vascularization during development and vascular regeneration after SCI are convincing. However, inhibition of Vegfaa and Vegfr2 is nearly ineffective. The author might not conclude that the Vegfr2 signaling plays any role.

      Since we are unable to confirm the level of inhibition of the Vegf pathway, we will be excluding the Vegf inhibition data.

      Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      Major comments: 1) In Figure 3, the exact injured site on the spinal cord is not clear. Please include a schematic illustration of full spinal cord to show where is the injured site. Are all the injury experiments in this study done at the same site? If not, is there any site difference regarding the regenerative capability.

      We will include a scheme of the injury site in the spinal cord in Fig.3. All the injury were performed in the same position and this will be clarified in the methods.

      2) Figure 2E showed a segmented pattern of spinal cord vasculature. Is this pattern correlated with the position of vertebra?

      We will acquire light sheet images of adult spinal cords without removing the vertebrae. This will allow us to determine if the metameric organisation is correlated with the vertebral distribution.

      3) In Figure 3, during vascular regeneration after SCI, the author only showed partial regeneration at 30 dpi. Why not show the stage of complete regeneration? At that stage, how about the behaviors of the regenerated animals?

      We will add an additional timepoint (90 dpi) to the characterisation of the revascularisation. Moreover, we will include in Fig. 3 a plot of the swimming capacity in contusion-injured fish until 90 dpi and will explain in the text how the vascular response correlates with the functional recovery.

      4) Only EdU data is not sufficient to conclude that new vessels come from proliferation of remaining endothelial cells. For example, these new vessels might come from transdifferentiation of lymphatic vessels, or immune cells, or glial cells, in the meantime proliferate. This could also explain why the inhibition of Vegfr2 signaling is ineffective on new vessel formation. Cre/loxP-mediated lineage tracings need to be performed to exactly identify where these new vessels originate.

      We will clarify in the text that while the detection of endothelial proliferation suggests existing endothelial cells contribute to new vessels, we cannot exclude that other cell types also give rise to endothelial cells. However, regarding the transdifferentiation of immune and glial cells into endothelial cells, to our knowledge few examples have been described in the literature and generally associated with cancers or in in vitro conditions (Fernandez Pujol et al., 2000; Li et al., 2011; Soda et al., 2011). For this reason we do not expect this rare process to occur during spinal cord repair.

      A cell type that has been associated with transdifferentiation into ECs are lymphatic cells (Das et al., 2022). However, we have analysed the expression of a lymphatic marker (Tg(lyve1b:DsRed)) and were only able to detect very few lyve1b:DsRed-positive cells before or after injury, suggesting that any possible lymphatic contribution would likely be very limited. We plan to include these data in the revised submission.

      5) To confirm the Tg(hsp70l:dn-vegfaa) did work in this study, the authors need a positive control. For example, the effects on vasculogenesis or angiogenesis during embryonic development after heat shock. If the transgene works, the vascular development at early stages should be blocked (Marín-Juez et al., 2016).

      We will be removing the vegf inhibition data, therefore we will not address this question.

      Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments. The suggested experiments are realistic in terms of time and resources.

      • Are the data and the methods presented in such a way that they can be reproduced? In the method, the author should describe how to identify the Tg(hsp70l:dn-vegfaa) in more details, because there is no fluorescence before and after heat shock.

      We will be removing the vegf inhibition data, therefore we will not address this question.

      Are the experiments adequately replicated and statistical analysis adequate? Yes.

      Minor comments: - Specific experimental issues that are easily addressable. In Figure 6, from 30 dpi to 90 dpi, the number of pericytes decreased. Did these pericytes undergo apoptosis from 30 dpi on?

      We have not investigated pericyte apoptosis during vessel remodelling. However, this experiment would require the acquisition of long-term samples (between 60 and 90 dpi) and we would prefer not to address this question.

      Are prior studies referenced appropriately? Yes.

      • Are the text and figures clear and accurate? Please clearly labeled the injured region in Figure 6.

      We will identify more clearly the site of the injury in Fig.6.

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions? The number of proliferating ECs at 3 dpi is more than those at 5 dpi (Figure 5G). But the number of total EdU+ cells at 3 dpi is less than those at 5 dpi (Figure 5A-D). These data are consistent with Figure S3, which showed ECs were the leading cell type to enter the lesioned site, then were the axons and glial cells at later stages. Please explain and discuss whether the regeneration of other cell types is dependent on the accomplishment of vascular regeneration.

      As the reviewer points out, our data suggest that endothelial cells display an earlier peak of proliferation than spinal cord cells in general and colonise the lesioned tissue before new axons and glial cells. Although these observations could point to a role for ECs in the regeneration of other cell types, we would need to inhibit vascular repair to assess this possibility, which we were unable to do using Vegf inhibition. In our discussion we already mention some possible roles for ECs in stem cell proliferation, neurogenesis and axonal regrowth, but can expand this discussion if necessary.

      Reviewer #3 (Significance (Required)):

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      Although this study has characterized the development and regeneration of spinal cord vasculature in details, the significance of the advance needs to be improved due to lack of mechanisms. Obviously Vegfa is not essential for the vascular regeneration after SCI. It is better for the authors to identify one or two factors required for this process, in addition to identify cell origins of new vessels. With those, the significance of this study will be improved because the cell origins and required factors will provide potential therapeutic targets after SCI.

      • Place the work in the context of the existing literature (provide references, where appropriate).
      • State what audience might be interested in and influenced by the reported findings. The audience includes people who are interested in vascular development and regeneration, and spinal cord clinicians.

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate. My field of expertise includes brain vascular regeneration, digestive organ development and regeneration. This study reported spinal cord vascular development and regeneration, which fit my expertise.

    1. Micah ReddingChristian Transhumanist AssociationMicah ReddingFavorites  · oSorpetdsnftf0t30m861u2c10ag1g6mi4ffaial25il81h5hu7gc4t6571t  · Shared with Public groupA few years ago, a friend called his young daughter over, and said to her, “Did you know that Micah thinks we’re all going to become immortal cyborgs, and that’s how we’ll usher in the kingdom of heaven?”I laughed. That wasn’t something I had said, but I think it was his attempt at “reading between the lines”.Is that what I think? The question raises other questions. Could the relationship between God and humanity in the person of Christ be described as “cyborg”? What about the spiritual body of 1 Corinthians 15? Some people have indeed described things this way. After all, it is the “bizarre and unnatural” juxtaposition of God’s spirit and human flesh that saves us and transforms us into constituents of the kingdom of heaven. This combination remains just as repulsive to many people as a robotic eye or an exoskeleton.But I think most people would probably be wondering about something else:Is it our technology that ultimately saves us?I think this question reflects confusion around cause and effect, faith and works, salvation and renewal.In the book of Revelation, we see a glorious city, descending from heaven to earth. This city is organic, human, and technological—a “cyborg city” just like our cities today. All the glory of the nations is brought into it—every good thing created or discovered has a place there. And this city is actively producing new and unprecedented means to heal and renew the outside world.I think this is a picture of what Paul calls “the body of Christ”, the community and ecosystem that Paul says will one day fill the universe. Salvation means being included in this community, both now and into the indefinite future. As part of this community, we are bound into a network of relationships that continually renews and sustains our life. As part of this community, we have gifts and works to do (Eph 2:10), which are the works of creation and healing that renew the world.Salvation consists of relationships, and relationships are always expressed in gifts and acts and works of creation.For several years now, I’ve been arguing that we can’t make sense of Genesis 1-2 (let alone Hebrews 2, 1 Cor 15, or Romans 8;) unless we understand science and technology to be part of these God-given works of creation and renewal.So is it our technology that saves us? No, rather our technology is a gift, a work, a byproduct of salvation overflowing into renewal and creation. Becoming immortal cyborgs won’t save us—but being saved may turn us into immortal cyborgs.

      Micah Redding -> Christian Transhumanist Association · oSorpetdsnftf0t30m86 1 u2c10ag1g6mi4ffaial25il81h5 h u7gc4t6571t · A few years ago, a friend called his young daughter over, and said to her, “Did you know that Micah thinks we’re all going to become immortal cyborgs, and that’s how we’ll usher in the kingdom of heaven?” I laughed. That wasn’t something I had said, but I think it was his attempt at “reading between the lines”. Is that what I think? The question raises other questions. Could the relationship between God and humanity in the person of Christ be described as “cyborg”? What about the spiritual body of 1 Corinthians 15? Some people have indeed described things this way. After all, it is the “bizarre and unnatural” juxtaposition of God’s spirit and human flesh that saves us and transforms us into constituents of the kingdom of heaven. This combination remains just as repulsive to many people as a robotic eye or an exoskeleton. But I think most people would probably be wondering about something else: Is it our technology that ultimately saves us? I think this question reflects confusion around cause and effect, faith and works, salvation and renewal. In the book of Revelation, we see a glorious city, descending from heaven to earth. This city is organic, human, and technological—a “cyborg city” just like our cities today. All the glory of the nations is brought into it—every good thing created or discovered has a place there. And this city is actively producing new and unprecedented means to heal and renew the outside world. I think this is a picture of what Paul calls “the body of Christ”, the community and ecosystem that Paul says will one day fill the universe. Salvation means being included in this community, both now and into the indefinite future. As part of this community, we are bound into a network of relationships that continually renews and sustains our life. As part of this community, we have gifts and works to do (Eph 2:10), which are the works of creation and healing that renew the world. Salvation consists of relationships, and relationships are always expressed in gifts and acts and works of creation. For several years now, I’ve been arguing that we can’t make sense of Genesis 1-2 (let alone Hebrews 2, 1 Cor 15, or Romans 8;) unless we understand science and technology to be part of these God-given works of creation and renewal. So is it our technology that saves us? No, rather our technology is a gift, a work, a byproduct of salvation overflowing into renewal and creation.

      Becoming immortal cyborgs won’t save us—but being saved may turn us into immortal cyborgs.

    1. Author Response

      Reviewer #2 (Public Review):

      Suggestions to improve the paper:

      Major Issues

      1) I do not think that the introduction accurately reflects the state of the field with respect to single cell omics and nerve injury. The CCI model is different than the SNI model, which has been used in most previous studies, in terms of the nature of the injury, and the resolution of pain after the injury. I do not think it is accurate to claim that the CCI model is somehow more relevant clinically, because both models are just that. It is also not really true that co-mingling, un-injured neurons have not been profiled before. The Renthal paper did this, but using a different model. There is value in what the authors have done here, but they can state it more clearly in the introduction. In particular, most published studies have only used male mice, so the sex differences aspect of this work is important. In that regard, the authors did not cite any of the growing literature on sex differences in neuropathic pain mechanisms.

      We revised the introduction and discussion to address the comments. Specifically, we revised the related information about animal models (Page 4-5). Although Renthal et al. examined co-mingling, “un-injured” neurons using a sciatic crush injury model, they did not find cell-type specific changes in uninjured neurons. The reason for this is unclear, but we speculate that it may be partially due to differences in the techniques (e.g., tissue processing, cell sorting, sequencing depth) and animal models (CCI versus crush injury). Compared to sciatic CCI induced by loose ligation of the sciatic nerve, crush injury would injure most nerve fibers (~50% of L3-5 DRG neurons are axotomized in this model). Therefore, the remaining “uninjured’ neurons for sequencing may be much less than that in the CCI model. In addition, we used Pirt-EGFPf mice to establish a highly efficient purification approach to enrich neurons for scRNA-seq and therefore largely increased the number of genes detected in DRG neurons. Comparatively, the neuronal selectivity and number of genes detected were lower in the previous study, which may have resulted in fewer DEGs and decreased ability to detect aforementioned changes. We include a brief discussion (Page 24).

      We appreciate the reviewer’s good suggestion, and cited sex differences studies in neuropathic pain mechanisms (Pages 5, 25). Although our findings suggest that peripheral neuronal mechanisms may also underlie sexual dimorphisms in neuropathic pain, Renthal et al. reported no differences in subtype distributions or injury-induced transcriptional changes between males and females after sciatic nerve crush injury (Renthal et al., 2020). We also discussed the differences between current findings and previous work and also emphasized the sex differences aspect of this work in the discussion (Page 25).

      2) I am curious about the choice to only use samples from 7 days after CCI. One of the advantages of the CCI model is that pain resolves at about 35-60 days, depending on how the ligations are done, and this allows one to look at how transcriptional programs change in DRG neurons after pain resolves. This would give some new insight, at least in comparison to the very comprehensive profiling done in the sciatic nerve crush model by Renthal and colleagues.

      We thank the reviewer for this comment. We provided the rationale for day 7 post-CCI (Page 22). It is the time point when neuropathic pain-like behavior is fully developed in most animals, and the post-injury time point examined in many previous studies. The reviewer is correct, an advantage of the CCI model is that pain resolves at about 35-60 days. Although meaningful, it was not our intention to conduct a time course study to fully characterize time-dependent transcriptional changes using scRNA-seq, which is costly and requires a great effort for data analysis, etc., and is beyond the scope of the current study. We will address this in a future study, and provided a brief discussion (Page 22).

      3) An alternative interpretation of the ATF3 expression is that the dissociation protocol causes this upregulation. ATF3 induction may be rapid and could occur due to the technique the authors chose to use. This could be acknowledged.

      We agree and acknowledged this in our original discussion (Page 22).

      4) I think the authors are a bit over-confident in their call of "injured" and "un-injured" neurons based on Sprr1a expression. This is really the only grounds for calling these neurons injured or uninjured. The fact is that the CCI model does not provide a clear way to determine injured and uninjured neurons contributing to neuropathic pain. This is an advantage of the SNL model, as shown in many classic papers from the Chung lab.

      We included a brief discussion about Sprr1a (Page 22). Although Atf3 is a classic marker of injured neurons in some previous studies, a recent study suggested that Sprr1a may be a better standard to define “injured” neurons (Nguyen et al., 2017). Although injured and uninjured neurons can be readily separated in the SNL model, they are mostly from different DRGs, but not intermingled in the same DRG. Since glia-neuron interaction and neuron-neuron interaction may occur between cells within the same DRG after injury, these interactions may profoundly affect neuronal excitability and gene expression. Accordingly, we choose the CCI model for the current study to determine whether injured and uninjured neurons contribute to neuropathic pain. We included a brief discussion (Page 5, 23, 24).

      5) There are now two papers on human DRG neurons that are available. One was recently published in eLife, and the other is available on Biorxiv, and has been there since Feb 2021. I expected the authors to make some comparisons of cell types that are changing in CCI with populations that are found in humans. Would similar effects be expected? Are these cell types represented in the human DRG?

      Study of human DRG is important, and recent studies elegantly characterized neurochemical and physiological properties. Previous findings have suggested some notable difference between human and rodent DRGs. Importantly, many markers and methods used for classifying subpopulations of rodent DRG neurons do not apply well to human DRG neurons. In addition, data from human DRG came from patients with different etiologies, but not due to peripheral nerve injury as in the animal study. Due to these differences, we feel that it is difficult to make direct compassion of cell types that are changing in CCI with corresponding human DRG neurons.

      Minor Issues

      1) Does the 40 um cell strainer eliminate some larger diameter cells from the analysis?

      We think this is unlikely, as large-diameter cells such as NF1 and NF2 clusters were also observed in our dataset. Importantly, we examined the cell strainer by washing it out inversely and did not find single cells. In addition, all subtypes identified in other studies were also found in our study. Nevertheless, an underrepresentation of the amount of NF neurons may be a result of the fact that not all NF neurons are GFP-positive in Pirt-EGFPf mice. In Pirt-EGFPf mice, expression of the knockin EGFPf was under the control of the endogenous Pirt promoter. Anti-GFP antibody staining revealed that GFP is widely expressed in 83.9% of all neurons. However, Pirt-negative neurons are mainly NF200+ and have large-diameter cell bodies. In addition, compared to small neurons, large neurons are also easier to lose during FACS sorting. We included a brief discussion of this potential limitation, as the NF population may be underrepresented in our sample set (Page 21).

    1. Author Response

      Reviewer #2 (Public Review):

      Zhong et al conducted a scRNA-seq analysis to uncover the features in multiple myeloma (MM) based on the Revised International Staging System (R-ISS) stage. They contributed 11 scRNA-seq datasets, including 9 MM samples and 2 healthy BMMC. And validated their findings using the deconvolution method in large cohorts.

      In addition, the newly identified and validated a subset of GZMA+ cytotoxic multiple myeloma cells. The experiments were nicely conducted and the datasets generated in this study might benefit many other studies. Major comments:

      1) Several studies on scRNA-seq in MM have been reported, but different from that reported in this study. The authors might discuss the insight gained from their study.

      Thanks for your comments. Several studies on scRNA-seq in MM have been disclosed some heterogeneity of MM. For example, Jang JS et al identified the molecular pathways during MM progression (MGUS, SMM, NDMM, and RRMM) [Blood Cancer J. 2019 Jan 3;9(1):2.]. Jean Fan et al devised a computational approach called HoneyBADGER to identify copy number variation and loss of heterozygosity in individual cells from single-cell RNA-sequencing data [Genome Res. 2018 Aug; 28(8):1217-1227.]. These studies verified the high heterogeneities existed in MM. But the specific the mechanism was not clear. Furthermore, these studies didn’t specify the heterogeneity among different stages in R-ISS staging system, which has been an international wide used prognostic stratification system. Therefore, we focused on the specific cluster, marker, and cross-talk pattern among the three stages of MM to reveal the potential mechanism of heterogeneity.

      2) The author claimed Proliferating plasma cells were increased in EBV-positive MM patients. It would be interesting to examine the abundance of EBV RNA levels in the scRNA-seq datasets. Several tools, such as viral-track or PathogenTrack, might be used to conduct such analysis.

      Thanks for the reviewer’s great suggestions and comments. According to your suggestion, we used PathogenTrack to identify pathogens in MM patients and added this analysis results in the file ‘Data for reviewers-1(PathogenTrack).xlsx’. However, the algorithm did not identify EBV reads in the scRNA-seq datasets. In order to verify our conclusion, we collected more MM patients’ samples and examined EBV, MKI67, and PCNA. Our result showed that EBV positive samples had significantly higher MKI67 and PCNA expression, compared with EBV negative samples on Lines 193 to 195, Page 6 (in Figure 5B and 5C).

      3) Methods used for deconvolution are missing.

      We thank the reviewer’s comments and suggestions. In our study, we didn’t use an analytical tool named CIBERSORT, thus we didn’t use deconvolution either in the manuscript. It may cause you a misunderstanding because of our unclear description.

      Reviewer #3 (Public Review):

      The authors constructed a single-cell transcriptome atlas of bone marrow in normal and R-ISS-staged MM patients. A group of malignant PC populations with high proliferation capability (proliferating PCs) was identified. Some intercellular ligand receptors and potential immunotargets such as SIRPA-CD47 and TIGIT-NECTIN3 were discovered by cell-cell communication. A small set of GZMA+ cytotoxic PCs was reported and validated using public data.

      For scRNA-seq data analysis, the authors did QC and filtering and removed low quality cells, including some doublets and followed by batch effect correction. Malignant PC populations were identified using the copy number analysis tool "inferCNV".

      The authors have done lots of analysis. But I think the results can be improved if they can do more analyses. I would recommend to 1) analyze doublets; 2) remove cell cycle effect; 3) GO and pathway analysis for genes with copy number change; 4) do cell-cell communication with more cell type/clusters.

      Thanks for your suggestion and comment.

      1) We applied Scrublet to computationally infer and remove doublets in each sample individually, with an expected doublet rate of 0.06 and default parameters used otherwise. The doublet score threshold was set by visual inspection of the histogram in combination with automatic detection. Information about this description was added to material and methods section as ‘We applied Scrublet [74] to computationally infer and remove doublets in each sample individually, with an expected doublet rate of 0.06 and default parameters used otherwise. The doublet score threshold was set by visual inspection of the histogram in combination with automatic detection.’ accordingly in Lines 731-734, Page 27.

      2) As we focused on the differences in proliferative capacity of myeloma cells, the cell cycle could reflect the difference well. Therefore, the cell cycle data was provided accordingly. Information about this description was added into main text as ‘Next, we analysed the cell cycle of six PC clusters, and distinguished them from other clusters, PCs in cluster 6 (PCC6) were presumably enriched in G2/M stage (Figure. 3B)’ in Lines 142-144, Page 5.

      3) We have analyzed the GO and pathway analysis for genes with copy number changes, and provided the file ‘Data for reviewers-2 and 3 (InferCNV for PCC4 and PCC6)’. Based on this, we found that oxidative phosphorylation was the most significant enriched pathways for PCC4 and PCC6, respectively. Cell-cell communication with more cell type/clusters was provided with the supplementary data in the file ‘Data for reviewers-3 (Overall T cells interaction ligand-receptor pairs dotplot, Overall T cells interaction ligand-receptor, Overall T cells interaction map)’.

      Data analysis of public data was sufficient to prove the small set of GZMA+ cytotoxic PCs. More data analysis or wet experiment proof is required.

      Thanks for your suggestion. The subset of cytotoxic PCs was identified in this study. These PCs exhibited NKG7 and GZMA. Furthermore, NKG7 showed the higher expression level than NKG7. Therefore, we validated it using Multi-parameter Flow Cytometry (MFC) and Immunofluorescence in MM samples. We identified a new subset of NKG7+ cytotoxic PCs and found that the percentage of NKG7+ PCs displayed obvious diversities among stage I, II and III groups. Information about this description was added in the main text as ‘In another MM single-cell dataset focusing on PC heterogeneity of symptomatic and asymptomatic myeloma (dataset GSE117156) [19], one cluster, C21, exclusively expressing NKG7 corresponded to PC18 in our dataset (Fig 2C-2D). In GSE117156 of all 42 samples, the cell proportion varied from 0% to 30.95% of all PCs, with an average percentage of 4.28% (Figure. 2E).Next, immunofluorescence confirmed the expression of NKG7 in cytoplasm of PCs (CD138 positive) from patients with MM (Figure. 2F). Finally, twenty MM patients (stage I: three patients, stage II: 10 patients and stage III: seven patients) were enrolled for multi-parameter flow cytometric (MFC) analysis. The results showed that the percentage of NKG7+ PCs displayed obvious diversities among stage I, II and III groups (Figure. 2G and Figure. S2). The average percentage of NKG7+ population was 2.73% in stage I, 8.89% in stage II and 0.58% in stage III (Figure. 2G and Figure. S3). In summary, we characterized a NKG7+ PC population (PC18), which may provide a novel perspective for the cytotherapy of MM.’ in Figure 2 and S3 and Lines 118-130, Page 4-5.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1.

      Reviewer #1 summary:

      In this manuscript by Lu et al., the authors describe some CRISPR screens and protein-protein interaction screens to identify novel regulators of wild-type p53 and mutant p53 function and stability. Besides generating a wealth of data, they discover FBXO42-CCDC6 as positive regulators of the some p53 hot-spot mutants, including R273H mutant p53, but not of all p53 mutants tested and also not of wild-type, indicating selectivity. Furthermore, the found C16orf72(TAPR1) as a negative regulator of p53 stability.

      Mechanistically, the authors claim a direct interaction between FBXO42 and CCDC6 and p53, but the importance of these interactions has not been shown. On the other hand the authors suggest that the FBXO42/CCDC6 regulate p53 via destabilization of USP28, but also the mechanism has not been worked out. For C16orf72, they show that it interacts with USP7, but no relevance of this interaction is shown either.

      Response: We sincerely thank the reviewer for the constructive and thorough review. We have incorporated most of the suggestions into our planned revision, with our major focus on the molecular mechanistic follow-up.

      Reviewer #1, major points.

      1. One very important point for me is that the authors do not show the levels of expression of p53 in the p53-mClover stable cell lines. It is known that overexpressed p53 is usualy more stable than endogenous levels of wt-p53. Therefore, I think it is necessary that the authors show the levels of the p53-mClover fusion proteins in the stably transduced cell lines compared to endogenous p53 levels in the parental RPE1 cells and also compared to the endogenous levels of R273H mutant in the PANC-1 cells.

      Response: We fully agree that the levels of overexpressed p53s are often more than the endogenous ones, due in part to increased expression and stability. In designing the reporter, we first tried to avoid the stabilisation of p53-GFP due to GFP aggregation by using the monomeric mClover-variant. Further, we titrated the WT and R273H clones (similar to our recent work in PMID: 35439056), to select clones with p53 levels closer to endogenous protein, and exhibiting high dynamic response to Nutlin-3a treatment.

      In the revised submission, we will include Western blotting comparing the levels of p53-mClover (WT and R273H) expression to the endogenous p53s in RPE1 (WT) and PANC1 (R273H) cell lines, in the presence or absence of Nutlin-3a.

      Also the functionality of the wild-type p53-mClover fusion is questionable, at least not shown. One would expect that the overexpression of a functional wt-p53 in p53-KO cells will affect the survival of the RPE1 cells. In Figure 5A the authors show that depletion of MDM2 or C16ORF72 is toxic for the RPE1 cells in a p53-dependent manner, indicating that elevated levels of p53 cannot be handled by these cells. So, experiment(s) showing that the wt-p53/mClover fusion is functional is needed.

      Response: We agree that it will be an important point to benchmark the reporter design. The ectopically expressed WTp53 is often observed to have reduced functionality compared to the endogenous WTp53. The WTp53-reporter line behaves similarly to the RPE1 line (p53-proficient), where both chemical (e.g. Nutlin) or genetic perturbation (e.g. depletion of MDM2/C16orf72) would be toxic in a p53-depedent manner. In line with this data, we have observed that the WTp53-reporter line is able to induce a p53 response as demonstrated by induction of p53-target genes such as p21, which is not observed in p53 null RPE cells, albeit the p21 induction is not as dramatic as in RPE1 cells with endogenous WTp53. Together, these data indicate that our WTp53-reporter is functional albeit with a somewhat reduced activity.

      In the revised submission, we will better demonstrate the functionality of the WTp53-mClover fusion by probing WTp53 target (e.g. p21), in the presence and absence of Nutlin. This is also performed as a part of the experiment addressing Point #1 above.

      A second important point is that the 'verification' of the hits from the screens is only done in one cancer cell line, PANC-1, with mutant p53. I would have like to see at least one other cell line with another p53 mutant endogenously expressed that is also regulated by FBXO42/CCDC6.

      Response: we will include validation of the hits (FBXO42, CCDC6) in other 1-2 tumour lines with confirmed R273H endogenous mutation (e.g. MB-MDA-468, etc).

      For many of the p53-mutants, a bimodal expression is observed. In the FBXO42- and CCDC6-depleted cells, the equilibrium shifts towards more negative cells but the levels in the two populations itself don’t change (while for example for USP28 depletion also the right peak shifts further up, Fig S4E). Is there any correlation with the cell cycle and p53 expression? And can the authors exclude that FBXO42 and CCDC6 are involved in cell cycle progression and hereby influence p53 indirectly (by combining PI staining with Clover-p53 for example).

      Response: we have indeed observed that the “bimodal” levels in the reporters of several mutants, which are also observed in other studies probing the endogenous p53 level (PMID: 29653964); while the population equilibrium shifts, the location of each peak (as a proxy of the level of p53s) are more stable.

      Regarding the relation between p53-level and cell cycle stage, indeed, both the authors in the paper above and we have probed this possibility, but were unable to establish a direct connection.

      In the revised submission, we will add flow cytometry analysis of the p53-mClover level, and the cell cycle position using Hoechst 33342 (live-cell permeable DNA staining).

      The authors claim that the FBXO42-CCDC6 axis regulates stability specifically some p53-mutants, including R273H-mutant, in a manner involving USP28. But USP28 regulates all forms of p53, not just some mutants version. How can the authors reconcile this apparent contradiction?

      Response: we thank the reviewer for this critical observation. From our screen (Supplemental Table 1A), we have indeed noticed a pronounced effects (|Z score| >=3) of FBXO42 on R273H and R248Q stability, and a marginal effect on wild-type p53. Similarly, USP28 had pronounced effects on R273H and R248Q and WTp53.

      In the discussion of the paper, we noted that USP28 was shown to regulate p53 levels through distinct mechanisms:

      ‘USP28 was originally implicated as a protective deubiquitinating enzyme counteracting the proteasomal degradation of p53, TP53BP1, CHCK2, and additional proteins68-71. USP28 regulates wild-type p53 via TP53BP1-dependent and -independent mechanisms. Concordantly, our data shows that USP28 and TP53BP1 are strong positive regulators of wild-type p53. However, while USP28 was also a strong hit in the mutant R273H p53 screen, TP53BP1 was not, indicating that the effects we see upon loss of USP28 on R273H p53 are independent of TP53BP1.’

      Together, this indicates that the R273H-mutant is regulated by a FBXO42-CCDC6-USP28 axis while wild-type p53 is regulated mainly via a USP28-TP53BP1 axis. We will attempt to address and discuss it in the revision.

      On a similar note, the authors show that FBXO42 and CCDC6 interact with p53, but not USP28. Do FBXO42 and CCDC6 interact with each other and with USP28? And is the interaction with p53 specific for the R273H version? This part of the mechanism is very poorly defined and the Co-IPs are not very convincing or relevant for the proposed model.

      Response: This comment will be more extensively addressed in the revision. We have indeed observed the interaction between FBXO42 and CCDC6 (via BioID and APMS); however, we failed to recover USP28 as an interactor of either FBXO42 or CCDC6. The interaction between CCDC6/FBXO42 is not specific to R273H; although we were able to IP endogenous R273H with CCDC6 in PANC-1 line, the WTp53 (as in HEK 293 TRex BioID line) was also picked up in the BioID preys of CCDC6/FBXO42. In addition, we have new data to show that FBXO42 directly interacts with WTp53.

      In the revised submission, we will improve the molecular underpinning of the FBXO42-CCDC6-USP28-p53 axis we propose. We will specifically address the following.

      (1.1.) Biochemically, further support that CCDC6 and FBXO42 regulate p53 via regulating USP28 stability: We will address this by established biochemical assays, e.g. cycloheximide-chase/MG132 experiment. While USP28 is an established WTp53 regulator, little is known about the mechanism, and the “upstream” regulation of USP28; we will attempt to fill this gap:

      (1.2.) And to an unbiased systematic approach, how R273H interactome changes upon the loss of CCDC6 or FBXO42.

      We will perform R273H-BioID upon loss of CCDC6 and FBXO42 and USP28.

      (1.3.) Furthermore, we will specifically exam the interaction of USP28-p53R273H with or without the genetic perturbation of FBXO42/CCDC6.

      Through these efforts, we hope to gain further mechanistic insights into this regulatory axis, but hope that the editors and reviewers will agree that a fully annotated mechanistic understanding is probably beyond the scope of this paper.

      Reviewer #1, minor points.

      The mechanisms of p53 regulation may vary greatly in different cell lines. Can the authors discuss why they choose to do the screen with different mutants, rather than with different cell lines expressing these same mutant endogenously?

      Response: While it is certainly very interesting to assess how WT and mutant p53 is regulated in different cell lines, such an approach is confounded by the ‘genetic make-up’ of the respective tested cell lines. For example, TP53BP1 might be a regulator in one cell line but not in another for the simple reason that the later cell line harbors a TP53BP1 deletion or mutation or expression levels. In addition, while working with endogenous p53 mutations certainly has many advantages, comparing different mutants in different cell lines is again very much confounded by the ‘genetic make-up’ of the respective tested cell lines.

      Our focus was slightly different, and we wanted to set out and specifically ask what the difference between p53 hotspot mutations are. Are they all the same or are there differences and importantly, are there differences between mutants and WT p53 and this can only be achieved when working in the same cellular background. In designing the screen, we have thus tried to optimise the inclusion of different hotspot mutants in an isogenic screening system. As such, we first depleted the endogenous WTp53 to minimise its interference and built the current isogenic system in the non-transformed RPE1 (“normal”) line.

      However, as discussed above, we agree that the screen results will be validated in more cell lines carrying respective endogenous mutants.

      Figure 1: Typo in the legends : Nultin ipv Nutlin

      Response: We apologise for the typos. This is addressed in the current submission, along with improved figure legends to improve readability.

      Figure 1b,1c : Show basal and Nutlin-3 induced MDM2 levels and in the overexpression cell lines; if WT-p53 is functional, MDM2 levels should be higher in WT-transduced cells compared to control or mt-p53 expressing cells.

      Response: In the revised submission, we will include Western blotting probing MDM2 levels (antibody permitting); this is a part of the experiment proposed for Points 1 and 2.

      Authors should explain which they name USP7 a negative regulator of p53, since it is supposed to de-ubiquitinate p53?!

      Response: The effects of USP7 on WTp53 have indeed been difficult to elucidate (by Prof. Vogelstein PMID: 15118411, and PMID: 15058298, and seemingly opposite by Prof. Gu Wei, PMID: 15053880, and PMID: 11923872). However, consistent with Prof. Vogelstein group, the inhibition of USP7 (either by inhibitor or genetically via CRISPR in our studies), has resulted in elevated p53 level.

      Figure 2E: the effect of MG132 on p53 seems to be very minimal on this Western blot; it would need quantification to be convincing...Quality of the blot is also not great.The fact that in control cells the levels of p53 R273H are not affected by MG132 treatment fits with Suppl Figure 2E, indicating that the proteasome has no effect on p53 R273H.

      Response: We indeed noticed that while the proteasome pathway is largely implicated in the WTp53 screen, it has much reduced effects on R273H. Interestingly, the treatment of MG 132 also has limited effects using PANC-1 line (with endogenous R273H). We will repeat this experiment and provide quantifications and modify the text accordingly.

      Suppl figure 3b, 3c, 3d:

      Somehow, I have the feeling that the results from the western blots and the FACS do not match fully, although not all the time-points are shown in the various experiments.

      For example, the FACS analysis (3b) suggests that in control-transduced cells after 16hr p53 is still increased. However, that is not clear at all in the Western blot (3c)

      Is Suppl Figure 3d the quantification of 3c experiment? If so, in the blot also the 24 hrs should be shown.

      The blot shown in Suppl Figure 3c suggests that CCDC6 expression increased upon irradiation. Do the authors agree with that? Would that explain why depletion of CCDC6 has more effect upon irradiation?

      Suppl Figure S3E: if I am right, this is essentially the same type of experiment as shown in figure 2e, but analysis of p53-expression by Western blot. In that blot no real effect of MG132 on p53 levels could be seen. But here, in the FACS analysis, MG132 clearly increases the p53-Clover fusion levels; for me again that Western blot and FACS data do not neccesarily match.

      Response: We apologise for the confusion. In the revised submission, we will improve the figure legends for better readability. Furthermore, in anticipation to the multiple cell lines involved in the revision, we will also clarify the cell lines in the figure.

      With regards to the difference between the flow cytometry and WB data, we have generally observed the flow cytometry bimodal shifting to be more sensitive than the WB, e.g. a 50% shift in population (FACS) is reflected by a 15% reduction in WB (which may be partially explained as WB is a measurement across the cell population and FACS determines the p53-GFP levels of every cell and thus the shift of cells between peaks). Similarly, we noticed flow-cytometry based quantification by antibody staining the endogenous p53 yielded similar sensitivity (PMID: 29653964). As such, we will ensure the validation of hits is performed in two modes. For WB experiment, we will do so in two cell lines carrying the endogenous mutants as suggested by Reviewers #1 and 2.

      Figure 3B: In the CCDC6 IP a very small amount of p53 can be found. I don't know how much input lysate compared to amount of lysate for IP is used, but the percentage of p53 found interacting with CCDC6 seems so marginal that is difficult to explain the effect of KO of CCDC6 in PANC1 cells.

      And, the authors called it a 'reciprocal IP' (Suppl Figure 4a) after transfection of V5-tagged CCDC6 into PANC1 cells, but it actually is the same type of IP. Did the authors try to IP p53 and blot for CCDC6? That would be a reciprocal IP.

      Response: We apologise for the confusion. In the revised submission, we will specify the portion of the lysates used for pre-IP (5% lysate) and IP (1 mg). As for the IP, we will also include the true reciprocal IP (IP p53, and blot for CCDC6).

      Figure 3H: how can authors explain that basal levels of USP28 in control and CCDC6-KO cells transfected with control plasmid are more or less the same and not reduced in the CCDC6-KO cells?

      Response: We will provide a better blot and quantification for this observation. In the current Fig 3H, the CCDC6-KO lane is slightly overladed as seen by the H3 loading control.

      Figure 3I: Essentially the whole blot here is of low quality; especially the FBXO42 blot; is deletion of USP28 increasing FBXO42 protein levels, or is it just the quality of the blot? All in all it seems that FBXO42 is very low expressed in the used cell lines.

      Response: We apologise for the confusion. In the revised submission, we will repeat and try to include higher quality WB, with more optimised condition for using the FBXO42 antibody.

      FBXO42 messenger level is readily detected using qRT.

      Figure 4B: I find it a bit surprising that USP7 is also found in the synthetic viability screen, since it has been shown that USP7 has many more essential targets and KO of p53 only partially rescues the development of USP7-KO mouse embryo's.

      Response: We thank the reviewer for this critical observation. While the double p53-USP7 knockout line is viable, we acknowledge that it is amongst the top scored hits due to the large differential viabilities between WT and p53-null lines. In the revised submission, we will further clarify the screen analysis and the associated interpretation.

      Figure 5: the authors nowhere show the efficacy of the guides targeting c16orf72. A Western blot showing the expression and the reduction upon expressing the guide-RNAs is essential.

      Response: We thank the Reviewer for this suggestion. The efficacy of each guide has been verified using ICE (at the genomic level), and in the revised submission, we will include this critical information as part of the Figure S2F.

      Figure 5E: First, here probably parental RPE1 cells have been used, but that is not stated. Second, the authors state 'only a slight increase in p53 levels upon siHUWE1'; I would say none compared to scrambled.

      I know HUWE1 is a very huge protein, but the blot of HUWE1 is not convincing. I seem to be able to conclude that siMDM2 and siUSP7 reduces HUWE1 levels?

      Response: We apologise for the confusion. In the revised submission, we will be specific of the cell line information on the figure, to improve the readability.

      We agree with the reviewers that assessment of large protein by WB is often difficult but given that this band almost completely disappears upon HUWE1 knock-down, strongly argues that we are indeed assessing the endogenous HUWE1. We also agree that it is an interesting observation that the levels of HUWE1 seem to be slightly reduced upon knock-down of MDM2 and USP7. We will repeat this experiments and provide quantitative data for HUWE1 and p53. Of note, in the screen, HUWE1 also scored as a negative regulator of wt-p53 and did not quite reach statistical significance for the p53 mutants.

      Regarding the relationship between C16orf72 and HUWE1, a newly published work (PMID: 35776542) seems to suggest that siHUWE1 has resulted in an increased C16orf72 level (termed HAPSTR1 in the paper), while siC16orf72 seemed to have no effect on HUWE1 level, although the stability of such a large protein by WB is often difficult to conclude.

      Figure 5F, in relation to figure 5D. Here the author overexpress both c16orf72 and USP7, and find an interaction. The implication of that is not clear. If they want to make point of this interaction, they should have looked at endogenous proteins.

      Response: We acknowledge the many concerns associated with coIP with ectopically, and especially overexpressed proteins in large quantity. In the revised submission, we will attempt to perform endogenous-based IP experiment (antibody permitting).

      It is worrying that USP7 apparently was not one of the hits in the Mass-spec experiment of which results are shown in Figure 5D. Also in that experiment c16orf72 was overexpressed, and USP7 is very highly expressed in essentially all cell lines, so do the authors have an explanation?

      Response: We indeed acknowledge this discrepancy. In the revised submission, we will attempt the coIP/IP using endogenous proteins (antibody permitting, or at least using endogenous target for one of the two partners). We also acknowledge that the limitation associated with the APMS for the detection of interactors.

      Suppl. figure 5D is missing

      Response: We apologise for the confusion. The Figure S5D was inconveniently placed at the top of the figure panel due to space limitation. In the revised submission, we will address this as a part of the overall readability improvement.

      Reviewer #1, Significance.

      The topic of the paper is of high interest given the relevance of p53 and its gain-of-function mutants in oncology, and the screens are well executed and clearly presented. In terms of novelty, FBXO42 has been linked to p53-degradation before, and c16orf72 was recently shown to be able to destabilize p53. However, the link between CCDC6 and p53 is novel and of interest, since they are both substrates of USP7 and are both regulators of the cell cycle.

      We think the manuscript has potential to add something to the field, but would benefit greatly from a better understanding of the molecular underpinnings of their newly described mechanisms, as well as the conditions in which the mechanism is active.

      Therefore, it might be advisable to shorten the manuscript, and go more in-depth in finding the mechanisms of regulation.

      Response: We sincerely thank the reviewer for all the constructive critiques. We will incorporate them in to our revision.

      Reviewer #2.

      Reviewer #2 summary:

      The paper describes several genome-wide CRISPR screens designed to identify regulators of p53 stability. The authors use a system in which p53 levels are marked by mClover expression, using RFP expression to normalise for gene expression changes.

      Reviewer #2, major points.

      1. The bimodal distribution of p53 expression levels in some reporter cell lines (G245S, R248Q, R248W and R273H) hampers the implementation of a robust readout and makes correct interpretation of the results challenging. While it is possible that the bimodal distribution indicates dynamic changes in p53 levels within one population, it also seems possible that a subclone of these cells have acquired additional alterations affecting p53 stability, and that the authors are screening a mixed population of two intrinsically different cell populations. This would make it difficult to interpret the results of the screen in these cell lines and may be a challenge when trying to identify something that has not already been highlighted on depmap.

      Response: We thank the reviewer for this critical observation. We strongly believe that this bimodal distribution is actually an inherent property of the p53 mutants in these cells for the following reasons: (1) The observation of the similar bimodal appearance in cell lines harbouring corresponding endogenous mutant p53s (PMID: 29653964) suggest that these two populations are of biological significance. (2) We have established 5-10 clonal lines each from the G245S, R248Q, R248W and R273H p53 reporter line and all of them exhibit a bimodal distribution, making it very unlikely that these populations are all through stochastic outgrowth of sub-populations with spontaneous mutations/alterations. (3) The bimodal distribution is stable over several months to years in culture. If it were a spontaneous mutations giving rise to a clone with higher mutant p53 levels, we would likely expect that over time this clone takes over the population. (4) We observed that such a pool of bimodal cells could be “synchronised” (e.g. by Nutlin, or MDM2 knockout) to one population, and later return to and repopulate the other (e.g. Nutlin washoff, Figure 1B). (5) When we sort out a single cells from the upper or the lower peak, expand them, we obtain again populations of cells with the same bimodal distribution, indicating that this is a dynamic process. Thus, we believe that these two populations were rather intrinsic, such that a cell in the population may assume both states.

      We also acknowledge the difficulties of screening using a bimodal population; however, we took advantage of these “bimodal” mutants and using FACS assessed the state of a single cell in relation to a genetic perturbation. Each guide has an equal chance of entering a cell that belongs to one of the two populations. If a gene knock-out really affects p53 levels, the cells with the respective guides enrich in one and deplete in the other population and the analysis comparing the guide abundances from these two peaks ensures the experiment are being perfectly internally controlled.

      While many of the top scored hits from the resulting screens are known regulators, it is critical that we validate our hits in an independent system, such as the cell lines harbouring endogenous p53 mutations, echoed by both Reviewers #1 and 2.

      The coverage of the sgRNA library (200x) is rather low for a negative selection screen, where a coverage of 500x would be more desirable. The FDR threshold is also rather lenient, a more stringent FDR threshold would seem more appropriate and shorten the list of potential hits.

      Response: We thank the reviewer for this constructive suggestion. A higher coverage, along with a more stringent FDR, will ensure an even stronger confidence for the remaining individual hits. The present reporter-based enrichment screen and the synthetical viability drop-out screen used four guides per gene, and with 200x coverage for each guide.

      In determining the coverage, we tried to reference recent successful screenings and apply earlier titration result for the 200x coverage (e.g. PMID: 26627737, PMID: 33465779, and reviewed in Nat Rev Methods Primers 2, 8 (2022). https://doi.org/10.1038/s43586-021-00093-4). While the threshold of FDR was often arbitrary, we fully agree that a more stringent FDR, which results in shortened hits list, may further boost the confidence of the hits, though also at the cost of losing potential hits due to collateral effects (e.g. guide efficiency).

      We agree with this reviewer that a higher FDR, esp. at the hits that result in p53 stabilization, would make sense as any gene whose loss causes cellular or genotoxic stress, would likely lead at least in part to p53 stabilization. In the revised submission, we will adjust the FDR accordingly.

      Although the study is focused on the regulation of p53 stability, there are no experiments to show that any of the manipulations alter the ubiquitination or degradation (half-life) of p53. The rescue of expression by proteasome inhibition is very modest (Figure 2E), suggesting the loss of expression may not be a reflection of degradation. A role for endogenous FBXO42 and C16orf72 in regulating the ubiquitination and half-life of endogenous p53 should be confirmed

      Response: We thank the reviewer for this suggestion. In the revised submission, we will monitor the ubiquitination status and also degradation (cycloheximide-chase) experiments for R273H cells, with or without the genetic alteration of CCDC6/FBXO42/C16orf72.

      Many p53 mutants are used for the initial screens, but very little validation is carried out to show that the apparent differences in factors regulating their stability persists in cells naturally expressing these mutants. For example, FBXO42 is identified as a protein required to maintain the stability of R273H, 248W and R248Q, but not R175H, G245S and R337H. While the authors show an association of CCDC6 and p53 in PANC1 cells (expressing 273H), it would be important to show a panel of R273H, 248W and R248Q expressing tumor cells and the response of p53 to FBXO42 and CCDC6 depletion, compared to similar experiments in a panel of R175H, G245S and R337H expressing tumor cells. Again, it would be important to show that any changes in protein levels are due to changes in protein stability.

      Response: We thank the reviewer for this suggestion. In the revised submission, we will include validations in more cell lines carrying endogenous mutant p53s, with a focus on the R273H mutant. We will also try to involve a line with an endogenous p53 mutation that does not respond to FBXO42/CCDC6 alteration.

      The potential hits should also be tested in wild type p53 expressing cells to confirm the specificity to mutant p53s.

      Response: In the revised submission, we will include WB for WT lines (e.g. RPE1) upon genetic alteration of CCDC6 and FBXO42. This was already performed for C16orf72 (Figure 6D).

      (6A) The role of C16orf72 in restraining p53 activity has been reported previously, as has the interaction with HUWE1 (including a new publication PMID: 35776542). The authors suggest an interaction between C16orf72 and USP7, although this should be shown with endogenous proteins. The relative importance of USP7 and HUWE1 binding is not explored. (6B) The effect of C16orf72 overexpression in promoting mammary tumors is impressive, although maybe the more interesting question is whether inhibition of C16orf72 expression can limit tumor development in this system.

      Response to 6A: we are excited about the independent observations by other group(s) confirming similar results! As a part of our improvement for mechanistic work-up, in the revised submission, we will attempt to address, whether C16orf72’ regulation of p53 is dependent on USP7 and/or HUWE1, or other known E3s, such as MDM2.

      (1) Whether the interaction of C16orf72 and HUWE1 or USP7 is required for the C16orf72 regulation of p53. Specifically, for example, we will perform epistasis experiments to test USP7’ or HUWE1’ ability to rescue the p53 levels in reporters upon ∆C16orf72. Due to the toxicity/lethality in WTp53 lines induced by the loss of C16orf72, we intend to test using R273H-reporter, or RPE1-line with ∆CDKN1A (p21) that is a synthetic viable rescue for ∆*C16orf72. *

      (2) In the revised submission, we will attempt to perform endogenous-based C16orf72-USP7 IP experiment (antibody permitting).

      6B. The effect of C16orf72 overexpression in promoting mammary tumors is impressive, although maybe the more interesting question is whether inhibition of C16orf72 expression can limit tumor development in this system.

      Response: We are also equally excited about the in vivo result supporting the idea that C16orf72 overexpression in tumour-prone mice (Pik3caH1047R) mice harbouring WTp53 may accelerate tumour formations. In the revised submission, we will further support that this effect is specific to WTp53/C16orf72, by including data of the control cohort with p53-null background (LSL-Pi3kH1047R; p53Flox/Flox).

      In regard to the effects of C16orf72-depletion in controlling tumour growth - we agree that this would be a very exciting avenue. Conditional C16orf72 mice are being made at the moment and these mice will allow us to comprehensively address this question. However, it will take several more month to generate and validate this line, and then another 2 breeding rounds to generate homozygous C16orf72fl/fl; Pik3caH1047R mice. In addition, the long time required to form tumours in the control mice with WTp53 (~250 days), it becomes not feasible for us to test whether the inhibition of C16orf72 could limit the tumour development, given the revision timeline. As such we respectfully believe that this would be beyond the scope of this manuscript.

      Reviewer #2, Minor comments.

      Figure 1b: The nutlin concentration stated in the methods section is wrong. Should be 10 µM instead of 10 nM (correct in figure legend).

      Figure 6b: y-axis label is missing.

      Figure 1e/f Legend: Should be FDR 0.5.

      Response: We apologise for typos. The current submission has incorporated the corrections.

      Figure 1c: Include results for a mutant that is not regulated by MDM2, such as R175H. Otherwise, as a standalone experiment, this figure doesn't add much.

      Response: We thank the reviewer for this suggestion. In the revised submission, we will include R175H/R337H.

      Figure 1h: While an UpSet plot is an elegant way to present unique and overlapping hits between different screens, Venn diagrams might be more 'accessible' to many readers and easier to understand.

      Response: We thank the reviewer for this feedback. The choice of UpSet blot was largely motivated by the different categories involved, which made the area representation and the intersection of the conventional Venn diagram no longer feasible.

      In the revised submission, we will improve our figure legend for the UpSet blot, to improve the readability.

      Might be worth stating that mClover is an eGFP variant and can therefore be targeted by eGFP sgRNAs so that it is easier to understand the following:

      o Page 5, paragraph 1: "We used the TKOv3 sgRNA library, which contains [...] 142 control sgRNAs targeting EGFP, LacZ and luciferase"

      o Page 5, paragraph 2: "As expected, sgRNAs targeting p53 and mClover were the most depleted sgRNAs, [...]

      Response: We thank the reviewer for this suggestion. We believe this will also improve the readability and have incorporated this into our current submission.

      Reviewer #2, Significance.

      Reviewer #2 (Significance (Required)):

      This is an interesting concept and the results could provide a useful resource for groups interested in the regulation of p53. The authors chose to focus on candidate genes that could have been identified by looking for the top 30 p53 co-dependent genes on depmap (C16orf72 is #24 in this list and FBXO42 is #28, most of the other genes ranking above are already known as p53 regulators). While this validates the screen, it would have been interesting if the authors had identified and validated new regulators of p53 that were not apparent from previously published work.

      Response: We thank the reviewer for all the thorough and constructive comments! In relation to the DepMap dataset, we are excited that many of the top hits from our screens are indeed top WTp53-correlators/anti-correlators (e.g. MDM2, USP28)!

      While the DepMap dataset used cell fitness/viability to construct the genetic relation score, this assay may not effectively rule out the many regulators that could otherwise elicit their regulation of p53 via regulating the general cell response to cell cycle, stress, etc. In our screen systems (i.e. protein stability and synthetic viability screens), we attempted to focus on the regulators of p53-stability (post-translational), and further coupled it with the synthetic viability screens to concentrate on hits that have a more direct role in p53 regulation (e.g. MDM2, C16orf72).

      One other difficulty to fully couple our screens to the DepMap dataset is due to the limited cell lines harbouring endogenous mutant p53s, e.g. R337H. This may also contribute to the uniqueness of the identified R337H-reporter specific hits (where cell lines harbouring R337H have not yet been included in the DepMap dataset), e.g. several Aminoacyl tRNA synthetases (SARS, YARS, etc) were identified as R337H unique regulators and subsequently verified using different guides in the reporter line, but could not be obtained via DepMap.

      We largely see this paper as a resource for the p53 field and would like to publish it as soon as possible. In fact, when we started working on C16orf72 or CCDC6/FBXO42, these hits were not known for their ability to regulate p53. We will work up several other hits, but this would be beyond the scope of this paper and the first author’s Ph.D. thesis that needs to be completed under a timeline.

      Reviewer #3.

      Reviewer #3 summary:

      The manuscript by Lu and coworkers performed genome wide CRISPR screens to search for genes that when knocked out, lead to p53 accumulation or degradation. Wt p53 and a panel of p53 hotspot mutants were chosen as reporter for the screen. The approach reassuringly identified many previously described regulators of p53 degradation, and also found a large set of new hits that many appear to be indirectly affecting p53 level.

      A key step of this approach is the follow up functional and mechanistic study of the hits. To this end, the authors chose FBXO42 as a top hit that blocks mutant p53 degradation, and C16orf72 as a top hit that promotes wt/mutant p53 degradation.

      Overall the functional data for FBXO42 is disappointing. FBXO42 knockout has quite modest effect on mutant p53 level (~50% reduction). The knockout also showed some effect on p53 mRNA level (~25% reduction), making the determination of mechanism difficult. It does not appear to be a promising targeting for reducing mutant p53 level and gain of function activity in tumor cells.

      We thank the reviewer for this constructive comment! We will address this in the revision, as proposed in Point #3.

      The C16orf72 finding unfortunately lost some novelty because it was independently identified as a p53 regulator in a recent study using CRISPR screening (PMID: 33660365). However, the repeated identification is reassuring and the current work provides more convincing functional data, showing C16orf72 knockout increase wt p53 level, inhibits cell proliferation specifically in p53+/+ cells, and overexpression of C16orf72 reduce wt p53 level and accelerates progression of a breast tumor mouse model. Their results suggest C16orf72 is a biologically relevant regulator of p53 in cancer development. In order to provide a reasonable amount of new information and set it further apart from the published study, some biochemical analysis looking into the mechanism of C16orf72 will be helpful.

      Reviewer #3 Major and Minor comments:

      Specific comments:

      1. There appears to be a mix up in the figure legend for Fig.1A describing line 1 and 2.

      Response: We sincerely apologise for the mix up in the figure legend! In the current submission, this has been fixed.

      Fig.2. Data for some p53 mutants mentioned in the text cannot be found in the main figure 2D and supplemental figure S3A.

      Response: We apologise for having not included the R175H and R337H mutants in Supplemental Figure S3A. In the revised version, we will include these two mutants.

      Fig.2 E-F. The effects of FBXO42 and CCDC6 KO on endogenous mutant p53 level is small (~50% decrease). Given that mutant p53 accumulates at high levels, whether a 50% decrease has meaningful effect on its gain of function activities is questionable. The knockouts also caused a ~25% decrease in p53 mRNA (FigS3F) which makes the mechanism quite difficult to investigate further.

      Response: We agree with the reviewer that the current data makes it difficult to conclude the mechanism. Given the design of our reporter, we still believe that the regulations could largely be at the post-translational level. In our revised version, we plan to exam the ubiquitination status of p53 upon losses of CCDC6/FBXO42, and also monitor the p53 degradation via cycloheximide chase.

      To further address whether this reduced level of mutp53 has biological impacts, we plan to test it in the tumour cell context. Given the difference in migration capability observed between PANC-1 and PANC-1-∆p53 line (e.g. PMID: 35439056), we plan to also evaluate the migration pattern of PANC-1, with the presence and absence of FBXO42/CCDC6 (controlled by similar FBXO42/CCDC6 loss in PANC-1- ∆p53 background). Furthermore, in tissue culture, although there is only marginal to no difference in cell growth rate between many mutant p53 lines (e.g. PANC-1) and their ∆p53 line, we plan to test whether a reduced serum or nutrient level could exacerbate the difference, and hence further be used to monitor the difference resulted from the loss of FBXO42/CCDC6.

      Fig.3B. The IP experiment using p53 shRNA and control shRNA should be done by IP of p53 followed by CCDC6 western blot. If CCDC6 IP is used as in the figure, then a CCDC6 shRNA knockdown sample should be compared to control shRNA. The current data does not rule out the possibility that CCDC6 antibody can nonspecifically pull down some p53.

      Response: We apologise for the confusion. In the revised version, we will include the proper reciprocal IP, with IP of endogenous p53 (R273H) followed by blotting of CCDC6.

      Fig.3D. The in vitro pull down experiment needs specificity controls such as non affected R175H p53 core domain. The data presented would suggest that MBP-FBXO42c captured more than 1:1 molar ratio of R273H core domain, which is unusual for specific binding unless there is aggregation of p53.

      Response: We thank the reviewer for this constructive comment! In the revised version, we will incorporate this, by repeating the in vitro pull-down assay including a non-p53 control protein.

      To increase the impact of the current study, the authors could provide more mechanism insight on how C16orf72 regulates p53 level, which was also missing in the other published study. For example, addressing whether C16orf72 effect is dependent on MDM2. Does it cooperate with MDM2 to ubiquitinate p53. Does it promote p53 ubiquitination in the absence of MDM2, since it interacts with HUWE1. Does it act by recruiting usp7 to stabilize MDM2.

      Response: we thank the reviewer for this very constructive and thorough comment! In our revised version, we will attempt these assays and incorporate them into the submission.

      Together with our response to Reviewer #2, Point #6, in the revised submission, we will attempt to address if C16orf72 regulation of p53 is dependent on MDM2 or HUWE1.

      (1) Whether the interaction of C16orf72 and HUWE1, or C16orf72 and USP7 is required for the C16orf72 regulation of p53. Specifically, for example, we will perform epistasis experiments to test HUWE1’ or USP7’s ability to rescue the p53 levels in reporters upon the loss of C16orf72 (∆C16orf72). Due to the toxicity/lethality in WTp53 lines induced by the loss of C16orf72, we intend to test using the R273H-reporter, or RPE1-line with ∆CDKN1A (p21) that is a synthetic viable rescue for ∆*C16orf72. *

      (2) Whether C16orf72 dependent upon or cooperate with MDM2 in regulating p53.

      We will first probe whether C16orf72 overexpression increased the p53 ubiquitination, and then decide whether overexpression of C16orf72 has additive effects to MDM2 overexpression in regulating p53 levels.

      We previously observed that overexpressing C16orf72 could not rescue the R273H level resulted from losing MDM2 (using flow-cytometry in R273H-reporter-∆MDM2), and as such, we plan to test the C16orf72-MDM2 relation in the MDM2-proficient context.

      The manuscript is in a form extremely unfriendly to review, text, figures and legends are all split up at multiple locations, the pdf figures are very sluggish to scroll.

      Response: We sincerely apologise for the inconvenience. In the current submission, we have split the submission into three separate files, (1) main text, (2) main figures, and (3) supplemental figures, along with (4) supplemental tables as individual EXCELs. We will also reduce the resolution of a few images, so the overall higher resolution is retained, while still fitting into the file size limit.

      Reviewer #3 (Significance (Required)):

      The work is significant in identifying a functionally relevant regulator of p53 stability.

      Response: we thank the reviewer again for the very constructive feedback!

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript by Lu et al., the authors describe some CRISPR screens and protein-protein interaction screens to identify novel regulators of wild-type p53 and mutant p53 function and stability. Besides generating a wealth of data, they discover FBXO42-CCDC6 as positive regulators of the some p53 hot-spot mutants, including R273H mutant p53, but not of all p53 mutants tested and also not of wild-type, indicating selectivity. Furthermore, the found C16orf72(TAPR1) as a negative regulator of p53 stability. Mechanistically, the authors claim a direct interaction between FBXO42 and CCDC6 and p53, but the importance of these interactions has not been shown. On the other hand the authors suggest that the FBXO42/CCDC6 regulate p53 via destabilization of USP28, but also the mechanism has not been worked out. For c16orf72, they show that it interacts with USP7, but no relevance of this interaction is shown either.

      Major points

      One very important point for me is that the authors do not show the levels of expression of p53 in the p53-mClover stable cell lines. It is known that overexpressed p53 is usualy more stable than endogenous levels of wt-p53. Therefore, I think it is necessary that the authors show the levels of the p53-mClover fusion proteins in the stably transduced cell lines compared to endogenous p53 levels in the parental RPE1 cells and also compared to the endogenous levels of R273H mutant in the PANC-1 cells.

      Also the functionality of the wild-type p53-mClover fusion is questionable, at least not shown. One would expect that the overexpression of a functional wt-p53 in p53-KO cells will affect the survival of the RPE1 cells. In Figure 5A the authors show that depletion of MDM2 or C16ORF72 is toxic for the RPE1 cells in a p53-dependent manner, indicating that elevated levels of p53 cannot be handled by these cells. So, experiment(s) showing that the wt-p53/mClover fusion is functional is needed.

      A second important point is that the 'verification' of the hits from the screens is only done in one cancer cell line, PANC-1, with mutant p53. I would have like to see at least one other cell line with another p53 mutant endogenously expressed that is also regulated by FBXO42/CCDC6.

      For many of the p53-mutants, a bimodal expression is observed. In the FBXO42- and CCDC6-depleted cells, the equilibrium shifts towards more negative cells but the levels in the two populations itself don't change (while for example for USP28 depletion also the right peak shifts further up, Fig S4E). Is there any correlation with the cell cycle and p53 expression? And can the authors exclude that FBXO42 and CCDC6 are involved in cell cycle progression and hereby influence p53 indirectly (by combining PI staining with Clover-p53 for example).

      • The authors claim that the FBXO42-CCDC6 axis regulates stability specifically some p53-mutants, including R273H-mutant, in a manner involving USP28. But USP28 regulates all forms of p53, not just some mutants version. How can the authors reconcile this apparent contradiction?

      On a similar note, the authors show that FBXO42 and CCDC6 interact with p53, but not USP28. Do FBXO42 and CCDC6 interact with each other and with USP28? And is the interaction with p53 specific for the R273H version? This part of the mechanism is very poorly defined and the Co-IPs are not very convincing or relevant for the proposed model.

      Minor points

      The mechanisms of p53 regulation may vary greatly in different cell lines. Can the authors discuss why they choose to do the screen with different mutants, rather than with different cell lines expressing these same mutant endogenously? .

      Figure 1: Typo in the legends : Nultin ipv Nutlin

      Figure 1b,1c : Show basal and Nutlin-3 induced MDM2 levels and in the overexpression cell lines; if WT-p53 is functional, MDM2 levels should be higher in WT-transduced cells compared to control or mt-p53 expressing cells. Authors should explain which they name USP7 a negative regulator of p53, since it is supposed to de-ubiquitinate p53?!

      Figure 2E: the effect of MG132 on p53 seems to be very minimal on this Western blot; it would need quantification to be convincing...Quality of the blot is also not great. The fact that in control cells the levels of p53 R273H are not affected by MG132 treatment fits with Suppl Figure 2E, indicating that the proteasome has no effect on p53 R273H.

      Suppl figure 3b, 3c, 3d:

      Somehow, I have the feeling that the results from the western blots and the FACS do not match fully, although not all the time-points are shown in the various experiments. For example, the FACS analysis (3b) suggests that in control-transduced cells after 16 hr p53 is still increased. However, that is not clear at all in theWestern blot (3c) Is Suppl Figure 3d the quantification of 3c experiment? If so, in the blot also the 24 hrs should be shown. The blot shown in Suppl Figure 3c suggests that CCDC6 expression increased upon irradiation. Do the authors agree with that? Would that explain why depletion of CCDC6 has more effect upon irradiation? Suppl Figure S3E: if I am right, this is essentially the same type of experiment as shown in figure 2e, but analysis of p53-expression by Western blot. In that blot no real effect of MG132 on p53 levels could be seen. But here, in the FACS analysis, MG132 clearly increases the p53-Clover fusion levels; for me again that Western blot and FACS data do not neccesarily match.

      Figure 3B: In the CCDC6 IP a very small amount of p53 can be found. I don't know how much input lysate compared to amount of lysate for IP is used, but the percentage of p53 found interacting with CCDC6 seems so marginal that is is difficult to explain the effect of KO of CCDC6 in PANC1 cells. And, the authors called it a 'reciprocal IP' (Suppl Figure 4a) after transfection of V5-tagged CCDC6 into PANC1 cells,but it actually is the same type of IP. Did the authors try to IP p53 and blot for CCDC6? That would be a reciprocal IP.

      Figure 3H: how can authors explain that basal levels of USP28 in control and CCDC6-KO cells transfected with control plasmid are more or less the same and not reduced in the CCDC6-KO cells?

      Figure 3I: Essentially the whole blot here is of low quality; especially the FBXO42 blot; is deletion of USP28 increasing FBXO42 protein levels, or is it just the quality of the blot? All in all it seems that FBXO42 is very low expressed in the used cell lines.

      Figure 4B: I find it a bit surprising that USP7 is also found in the synthetic viability screen, since it has been shown that USP7 has many more essential targets and KO of p53 only partially rescues the development of USP7-KO mouse embryo's.

      Figure 5: the authors nowhere show the efficacy of the guides targeting c16orf72. A Western blot showing the expression and the reduction upon expressing the guide-RNAs is essential. Figure 5E: First, here probably parental RPE1 cells have been used, but that is not stated. Second, the authors state 'only a slight increase in p53 levels upon siHUWE1'; I would say none compared to scrambled. I know HUWE1 is a very huge protein, but the blot of HUWE1 is not convincing. I seem to be able to conclude that siMDM2 and siUSP7 reduces HUWE1 levels? Figure 5F, in relation to figure 5D. Here the author overexpress both c16orf72 and USP7, and find an interaction. The implication of that is not clear. If they want to make point of this interaction, they should have looked at endogenous proteins. It is worrying that USP7 apparently was not one of the hits in de Mass-spec experiment of which results are shown in Figure 5D. Also in that experiment c16orf72was overexpressed, and USP7 is very highly expressed in essentially all cell lines, so do the authors have an explanation?

      Suppl. figure 5D is missing

      Referees cross-commenting

      I agree essentially with all comments of Reviewer #2. Especially the major points 3 and 4. The use of more cell lines expressing endogenous mutant p53 is very important. In addition, I can agree with almost all comments of Reviewer #3. The effects especially of FBXO42 ablation are rather minimal, so relevance is questionable.

      Significance

      Nature and Significance

      Compare to existing literature

      The topic of the paper is of high interest given the relevance of p53 and its gain-of-function mutants in oncology, and the screens are well executed and clearly presented. In terms of novelty, FBXO42 has been linked to p53-degradation before, and c16orf72 was recently shown to be able to destabilize p53. However, the link between CCDC6 and p53 is novel and of interest, since they are both substrates of USP7 and are both regulators of the cell cycle.

      We think the manuscript has potential to add something to the field, but would benefit greatly from a better understanding of the molecular underpinnings of their newly described mechanisms, as well as the conditions in which the mechanism is active.

      Therefore, it might be advisable to shorten the manuscript, and go more in-depth in finding the mechanisms of regulation.

    1. If we think carefully of the may find out what our human companions are thinking, we can not fail to be struck by the fact that our only method for obtaining such information is to be had by observing their conduct.

      Watson (1907) is pointing out one of the greatest ways to learn about human/animal behavior. This is done through observation. In 1907 this may not be as clear and simple due to the fact that psychology was not something relevant during those times.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2021-01204R

      Corresponding author(s): Alexander, Aulehla

      *Reviewer #1 (Evidence, reproducibility and clarity (Required)): *

      *The paper by Miyazawa and colleagues addresses a key question: How is changed metabolic activity sensed and to induce changes in developmental programs. In recent years, there is more and more indication that metabolism is not only a dull workhorse synthesizing the building blocks for new cells and providing chemical energy, but that metabolic activity itself has also a regulatory role. How this precisely works is largely unknown and even also unexplored in higher cells. From early insights obtained in microbes, it seems that certain metabolites - possibly reflecting metabolic activity (i.e. flux) - could be metabolic signals that feedback into cellular regulation. *

      *The current paper takes this idea now to developmental processes, where the authors found that the glycolytic metabolite fructose-1,6-bisphosphate is a flux-dependent signal that interferes with developmental processes. This is a very exciting finding, as it indicates that this metabolite not only has a regulatory function in microbes but also in mouse during mesoderm development. *

      *Answering the question how such a flux-dependent metabolite mechanistically interferes with the developmental processes is an enormously difficult. Compared to other mechanistic studies, where deleting genes, modifying genes, and changing protein expressions will usually do the trick, here, perturbing metabolite levels is extremely challenging, particularly if such perturbations need to be carried out in a way that nothing else is perturbed. Researchers, who are not overly familiar with metabolism, usually underestimate the difficulty with targeted and insightful perturbation of metabolism. *

      *To this end, the authors of this paper need to be congratulated for a very well carried out study with very solid data, and excellent control experiments. The authors open up a new path towards understanding how embryo mesoderm development is regulated by metabolic activity. In particular, they show that that glycolytic flux, FBP and important developmental phenotypes as well as protein localization changes are linked. As normal with a complex metabolism-based story as this one, there is always more that could be done. Yet, the results are highly important to be reported now such that the field as a whole can build on these interesting results and to explore the exciting path further that has been opened by the authors. Thus, I strongly recommend publishing these findings: The data generated by the authors are accompanied by the required control experiments. The conclusions drawn are very solid. I do not have any major concerns but just a number of minor suggestions that the authors could consider in a revised version of the manuscript. *

      *Minor: *

        • At the end of the introduction, the authors stated their original goal. As it is phrased, it is unclear whether this goal has been obtained or not. They might want to consider replacing the last introductory sentence by a sentence stating what the reader can find in this paper.*

      1. We agree with the reviewer and have rephrased accordingly (line 112–117):

      “In this study, our goal was therefore to first determine in vivo sentinel metabolites during mouse embryo PSM development. We then combined genetic, metabolomic and proteomic approaches to investigate how altered glycolytic flux and metabolite levels impact developmental signaling and patterning processes.”

      • Data from Fig 3: If you plot the lactate secretion vs the FBP levels of the controls and the overexpression experiment, would the control and the overexpression data lie on one line (maybe if combined with the data shown in Fig 1A)?*

      2. As the reviewer suggested, it is of great interest to check whether lactate secretion and FBP levels show a similar correlation in control and cytoPfkfb3 embryos, considering that cytoPfkfb3 overexpression lifts the upper limit of glycolytic capacity and FBP levels (revised Figure 3B, 3E). As the reviewer suggested, we plotted FBP levels against lactate secretion and fitted a linear regression line onto control samples (please see the Figure R1 below). The new plot shows that lactate secretion and FBP levels in cytoPfkfb3 embryos lie on the linear regression line derived from wild-type samples, highlighting that a correlation between lactate secretion and FBP levels is maintained even in cytoPfkfb3 embryos. We now included this new plot in the revised Figure S4C and modified the text accordingly (line 474-477):

      “In addition, FBP levels showed a linear correlation with lactate secretion in control explants, and such a correlation was maintained even in cytoPfkfb3 explants (Figure S4C).”

      Figure R1. Correlation between lactate secretion and FBP levels in PSM explants. Linear regression line (a grey line) was derived from the data of control samples cultured in 0.5–25 mM glucose (black circles; from Figure 1A and 3E). The data from cytoPfkfb3 embryos cultured in 2.0–10 mM glucose (from Figure 3B and 3E) are shown as red rectangles.

      • Maybe the authors could attempt an experiment like the following one: Chose the strongest phenotype observed and test a combination of overexpressing cytoPfkfb3 and reducing extracellular glucose level at the same time? *

      3. We agree this suggested experiment is important to show that the phenotype in cytoPfkfb3 embryos is indeed dependent on glycolytic flux and have already addressed this specific point in our manuscript, see results in Figure 4B and 5A in our original manuscript. The results show that the phenotypes in cytoPfkfb3 explants, i.e. reduction in somite formation and downregulation of Msgn mRNA expression occur in a glucose dose-dependent manner. Since in this embryonic context, we show that glucose concentration impacts glycolytic flux (see increased lactate production upon glucose titration in Figure 3B), our findings support the conclusion that the effect of cytoPfkfb3 overexpression is flux-dependent and not due to the overexpression per se. Based on the reviewer's feedback, we have modified the text to clarify and highlight this critical point (line 339–345):

      “Combined, these results show that cytoPfkfb3 overexpression results in reduced segment formation, arrest of the segmentation clock oscillations and downregulation of Wnt signaling, in a glucose-dose dependent manner. As glucose concentration impacts, in turn, glycolytic flux (Figure 1A, 3B), these findings suggest that these phenotypes are flux-dependent and are not a mere result of cytoPfkfb3 overexpression.”

      • Can the proteomics experiments shown in Fig. 6 be repeated with high and low extracellular glucose? High glucose should yield high FBP levels and one would then expect to see the same as with the experiment where at 2 mM glucose 20 mM extracellular FBP were added. Is this the case? *

      4. We agree with the reviewer that based on the findings, one would expect the phenotype, i.e. in this case translocation of proteins, to correlate with FBP levels. Two of our results are of note in this regard.

      First, our data indicates that in order to see the effect on protein localization, high levels of FBP have to be reached. Accordingly, we find that Pfkl becomes depleted from the nuclear-cytoskeletal fraction in cytoPfkfb3 explants when cultured in 10 mM glucose but not (visibly) in 2.0 mM glucose (Figure 7D). Corresponding to this, FBP levels in cytoPfkfb3 explants show a significant increase (about 3-fold) from 2.0 to 10 mM glucose conditions (revised Figure 3E).

      Second, in control samples, FBP levels saturate in high glucose conditions. FBP levels in control samples do not further increase when glucose concentration is increased from 10mM to 25mM, and thus it does not become as high as in cytoPfkfb3 embryos cultured in 10 mM glucose (revised Figure 3E).

      Therefore, in order to reveal the translocation, it requires an experimental strategy that leads to significantly increased FBP levels, such as in cytoPfkfb3 explants with high glucose condition, or alternatively, direct supplementation of FBP.

      As also pointed out by the other reviewers, we are experimentally generating controlled conditions that exceed the physiological range which the embryo is exposed to. Accordingly, our data does not constitute evidence that under physiological conditions an alteration of protein localization in response to change in glycolytic flux and FBP levels occurs, at a smaller scale.

      We regard our approach as a first step to reveal potential mechanisms and so far hidden possible responses to changes in metabolic flux. In order to see minor changes in translocation upon small changes in glycolytic-flux/FBP levels, more quantitative approaches, such as live-imaging of tagged proteins, will need to be developed. We hence decided to include these discussion in our revised manuscript (line 657-666):

      “Of note, the translocation of proteins was observed only when high levels of FBP were reached upon direct FBP supplementation or cytoPfkfb3 overexpression with high glucose (Figure 6, 7). Future studies hence need to investigate whether flux-dependent change in protein localization occurs upon moderate and more physiological changes in glycolytic-flux/FBP levels. To this end, the development of more quantitative approaches, such as live-imaging of tagged enzymes and the development of metabolite biosensors, are needed.”

      • While the authors quantified proteins in different compartments, I was wondering whether they also looked for whole-embryo protein expression changes? *

      5. We have not done protein expression analysis using whole embryos, or other isolated tissues in this study. This is indeed a potentially interesting future experimental comparison.

      • Throughout the manuscript, the authors state the glucose levels or cytoPfkfb3 changes the glycolytic flux. While I tend to agree with this, it is important to note that the authors have not directly measured glycolytic flux, but use the amount of accumulated lactate as a proxy. I think it is important to add this disclaimer at important points in the manuscript, such that readers are aware of this point. *

      6. We fully agree with the reviewer and now have added the following sentence in the first result section to make this point clearer to the reader (line 126-128):

      "Throughout this study, we used quantification of secreted lactate as a proxy for glycolytic flux due to the inability to directly measure flux in embryonic tissues."

      Another aspect for changing FBP levels could be connected on what was found in yeast, where the FBP levels were found to oscillate with the cell cycle (https://pubmed.ncbi.nlm.nih.gov/31885198/). Could this be connected with the pattern formation here?

      7. This is indeed an interesting aspect to discuss; in the absence of experimental evidence connecting the observed pattern formation and cell cycle (though some classic work had suggested its existence) we have decided to omit the discussion of this potential link.

      • Line 606: The mentioned review article also covers yeast. As such, maybe the authors should replace the term "bacteria" with "microbes"? *

      8. We modified our manuscript accordingly.

      Reviewer #1 (Significance (Required)):

      **Referees cross-commenting**

      As I mentioned in my comment, targeted metabolic perturbations are extremely difficult. Perturbing a metabolite level without at the same time perturbing the flux through this pathways is difficult (of not impossible). Also, the opposite is the case.

      I am not sure whether experiments as the one suggested by reviewer 2 (comment 1) will really lead to results from which further conclusions can be drawn. Furthermore, there does not need to be a linear correlation between the extracellular glucose concentration and metabolic flux/FBP levels (as my reviewer colleague implies). Thus, I am not sure whether doing this experiment makes sense, or would lead to strengthened conclusions.

      Reviewer 2 also states "The lack of proven mechanism for the activity of FBP might restrict the real general impact of this work." I agree that we do not know the downstream targets of FBP, but finding them would likely require many years of additional work. Such work will not be initiated if this paper is not published, and it would be a pity if it would be further delayed. I feel that the evidence is strong enough that FBP has an important role and with this paper published, it will motivate others to look for the downstream targets.

      Reviewer 3 makes the point: "Given that FBP levels are highly correlated with extracellular glucose levels (which impact glycolytic flux )(TeSlaa and Teitell, 2014) the authors should elaborate on why progressive increase in extracellular glucose does not affect PSM patterning, in the same way that increasing FBP levels does. " Here, I feel my reviewer colleague might be overlooking that in biochemistry molecular interactions typically reach a saturation at some point. The correlation between extracellular glucose and glycolytic flux has likely only a range where these two measures linearly correlate. Similarily, the correlation between glycolytic flxu and FBP likely also exists only within a certain range, and finally FBP levels and the downstream targets likely also only linearly interact within bounds. Thus, the absence of a correlation at "extremes" does by no mean mean that what the authors propose is incorrect. In fact, it just shows what you expect from biomolecular interactions that there a limits to linear correlations.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      *Summary. *

      *The work described in this paper first searches for potential sentinel metabolites of glycolytic flux, focusing on the process of somitogenesis during mouse embryonic development. By measuring the levels of different metabolites in the presomitic mesoderm (PSM) of E10.5 mouse embryos cultured in the presence of three different glucose concentrations, the authors identify 14 metabolites whose concentration rises with increasing glucose concentration in the culture medium. Among them, they selected fructose 1,6-bisphosphate (FBP) for further analyses, as it showed the highest linear correlation with extracellular glucose concentrations. They then show that addition of FBP to the incubation medium of cultured embryo tails interfere with somitogenesis and tail extension in a concentration-dependent fashion. In addition, they show that this effect is exacerbated when extracellular glucose levels are increased. By analyzing specific targets of Wnt and Fgf signaling, the authors also show that addition of FBP down-regulates both signaling pathways in the PSM. They then use a genetic trick (ubiquitous overexpression of cytoPfkfb3) to increase FBP levels by allosteric activation of Pfk (the enzyme that produces FBP) in developing embryos. When tails from these transgenic embryos were cultured in vitro and exposed to various glucose concentrations somitogenesis was affected in a way resembling the effects of FBP on cultured tails from wild type embryos. The authors then go on to determine the subcellular localization of different proteins in tails incubated in the presence of various FBP concentrations to identify that some enzymes involved in the glycolytic pathway (and they specifically focus on Pfkl and Aldoa) are excluded from nuclear fractions at high FBP concentrations. The authors conclude that FBP functions as a flux-signaling metabolite connecting glycolysis and PSM patterning, potentially through modulating subcellular protein localization. *

      *Major comments *

      *I think that in general the work described in this manuscript has been performed to the highest technical standards. However, I do not think that I can agree with the authors' conclusions (that FBP connects glycolysis with PSM patterning and that subcellular localization of glycolytic enzymes play a role in this process), which in my opinion go way beyond what can be proven by the data provided. *

      *1- Explants incubated with external glucose concentrations up to 25 mM have no obvious defects on somitogenesis or on the segmentation clock as determined by LuVeLu cycling activity. Under these conditions, explants are expected to contain very high FBP levels if this metabolite keeps its linear relationship with external glucose (in this work it was not measured beyond 10 mM glucose in the medium, where FBP concentration was already very high). This contrasts with the phenotypes observed upon exogenous supplementation of FBP, which affects somitogenesis already at 2 mM glucose. These latter results are at odds not only with the lack of phenotypic alterations under high glucose conditions, but also with the observation that exogenous addition of fructose 6-phosphate (F6P), the substrate of Pfk enzymes to generate FBP, does not alter somitogenesis. The authors take the absence of effects by incubation with F6P as a control of the specificity of FBP. However, as F6P is the natural substrate of Pfk, it is possible that supplementation of F6P also leads to an increase of FBP but in a way closer to a physiological condition. Therefore, I find it essential to determine FBP levels in tails incubated in the presence of increasing amounts of F6P, as if it increases FBP levels, similarly to what the authors described for the tails incubated with increasing glucose concentrations, it will have important implications to the interpretation of the work presented in this manuscript. *

      9. We agree with the reviewer and to directly address this central point, we have performed an extended, additional experiment, collecting 375 embryos to quantify FBP levels under five conditions with three biological replicates.

      There are two major results that we highlight here: First, we found that addition of F6P did not lead to increased FBP levels compared to control samples cultured in 10 mM glucose, which is in stark contrast to cytoPfkfb3 embryos cultured in 10 mM glucose (revised Figure 3E). Second, while increasing glucose concentration is mirrored by elevated FBP levels as we reported, we find clear evidence of saturation above a concentration of 10mM glucose: increasing glucose to 25mM does not increase FBP levels further (revised Figure 3E).

      This saturation effect seen in glucose titration, but also the absence of elevated FBP upon F6P addition, might be expected outcomes because, as also the reviewer 1 pointed out in the response, Pfk is commonly considered to be a rate-limiting enzyme in the glycolytic pathway. We now have the direct experimental data supporting this hypothesis and thank the reviewers to have initiated this additional (very involved..) experiment.

      This new data allows us to conclude more firmly on the correlation between FBP levels and phenotype: at high FBP levels, which are seen in cytoPfkfb3 samples, we observe PSM patterning defects. These high levels are not reached even at 25mM glucose or upon F6P addition, due to the saturation at the level of PFK enzymatic step. Hence, while glucose titration does elevate FBP significantly until this saturation, FBP levels are not as high as in cytoPfkfb3 samples. As a correlative finding, we see that only those conditions with very high FBP levels, or the direct addition of high levels of FBP, cause the arrest of segmentation clock activity. At moderately elevated FBP levels, observed in control explants with high glucose or in cytoPfkfb3 explants with low glucose, clock activity continues and we find a quantitative effect at the level of gene expression, i.e. Wnt signaling target downregulation (Figure S3, 5A).

      The new data has been included in the revised manuscript and the text has been adjusted accordingly:

      • (Result Part, line 245–254) "Consistently, we found that cytoPfkfb3 overexpression lifted the upper limit of FBP levels in PSM cells (Figure 3E, S4B, S4C). In control explants, FBP levels did not increase further when glucose concentration was increased from 10 mM to 25 mM. It was also the case when control explants were cultured in 20 mM of F6P (Figure 3E). These results indicate that the Pfk reaction carries a (rate-)limiting role for glycolytic flux and FBP levels, and that cytoPfkfb3 overexpression hinders the flux-regulation function of Pfk."

      • (Discussion Part, line 551–573) “Our findings suggest that flux-regulation at the level of Pfk is critical to keep FBP steady state levels within a range compatible with proper PSM patterning and segmentation. In agreement with such a rate-limiting function for Pfk, we found in glucose titration experiments that FBP levels saturated and did not further increase at glucose levels above 10 mM (Figure 3E). Along similar lines, the supplementation of high concentrations of the Pfk substrate F6P did not result in a significant increase of FBP levels, again compatible with a rate-limiting function at the level of Pfk (Figure 3E). The upper limit of glycolytic flux and FBP levels can be experimentally increased by cytoPfkfb3 overexpression (Figure 3B, 3E). We interpret the data as evidence that cytoPfkfb3 overexpression compromises the flux-control function of Pfk and hence much higher FBP (and secreted lactate) levels are reached. Such a drastic increase in glycolytic flux and FBP levels correlates with a severe PSM patterning phenotype (Figure 4), which resembles the phenotype induced by supplementation of high dose of FBP (Figure 2). Our results in mouse embryos hence provides evidence that flux regulation by Pfk, an evolutionary conserved role present from bacteria to humans, serves to maintain FBP levels below a critical threshold.”

      *The main difference between the experiments involving FBP supplementation and those involving high glucose concentrations or exogenous F6P addition is that in the later two cases increase in FBP would be restricted to the tissue(s) expressing Pfk, whereas upon FBP supplementation this metabolite would hit any tissue, regardless of whether or not it would ever be physiologically exposed to this molecule. In the case of the PSM, this might be relevant because it has been shown that there is a gradient of glycolysis, being high at the caudal tip and becoming lower at more anterior regions of the PSM, most likely mirroring the distribution of Pfk activity. Exogenous administration of FBP would flatten the gradient, which could lead to alterations in PSM patterning, whereas glucose (and eventually F6P) would not as they would increase FBP locally in the area where it is normally activated, keeping the natural gradient. *

      *On the basis of these arguments, to which extent does FBP connect glycolysis and somitogenesis under physiological conditions? *

      10. First, we would like to clarify that while indeed glycolytic activity is graded along the PSM, as other and we reported previously (reported in Bulusu et al., 2017 and Oginuma et al., 2017), the baseline expression of the entire glycolytic machinery (from glucose transport to lactate production) is very high, in all PSM cells. Hence, we see that cells all along the entire PSM have very active glycolysis, the posterior PSM being even more active.

      For this and related reasons, our interpretation about the difference seen between glucose titration/F6P addition on one side, and FBP addition/cytoPfkfb3 addition on the other side, is based on the role of Pfk in controlling either flux levels or dynamics in all PSM cells.

      Hence, while we agree that we generate experimental conditions that allow FBP levels to surpass those found in control embryos, we would like to highlight the fact that even moderate changes in flux does result in very robust functional consequences on gene expression (Figure S3, 5), as we show in this work.

      We can currently not fully address the first point raised, i.e. the role of graded flux/graded metabolite levels, due to the experimental limitations. Such a study requires, for instance, the generation of metabolite biosensor reporter lines in order to be able to monitor these changes dynamically, in space and time.

      *ESSENTIAL ADDITIONAL EXPERIMENT related to point #1: Measure FBP from PSM explants incubated under various exogenous concentrations of F6P. *

      11. We have performed this suggested experiment, which required the collection of n=375 embryos cultured under the various conditions and analysis by LC-MS to quantify metabolites. The outcome was indeed very informative (please refer to our response #9).

      *ANOTHER EXPERIMENT THAT COULD BE INFORMATIVE: measure FBP levels in PSM incubated under different glucose concentrations but instead of using the whole PSM together, dividing the PSM in posterior, medium and anterior parts (similarly to what was done in Oginuma et al, 2017, reference in the manuscript) to see if there is a gradient in FBP activation. *

      12. While in principle we agree that this experiment could be informative, we consider the proposed experiment beyond the scope of this work and technically very challenging (although possible). With a similar motivation, the development of metabolite biosensors is an alternative route that we are pursuing for future studies (for the detail, please refer to our response #10).

      *2- A similar argument could be presented for the results with the cytoPfkfb3 transgenics, as they are based on global artificial overactivation of Pfk, in addition to other possible effects of the ectopic activity of cytoPfkfb3, which were not controlled. Also, while the phenotypic alterations in the PSM in vitro, most particularly in the experiments involving incubation of the tails, are rather strong, the reported effects on somitogenesis in vivo are minor, also questioning the contribution of the in vitro conditions to the final phenotypic effects observed throughout the manuscript. *

      13. First of all, we would like to emphasize that the phenotype seen in cytoPfkfb3 embryos, i.e. the reduction of segmentation and downregulation of Wnt-target gene expression, occurs in a glucose dose dependent manner (Figure 4B and 5A). Hence, it is not the overexpression of cytoPfkfb3 per se that can account for the effects seen. But rather, increased glycolytic flux caused by the combination of transgene expression with high glucose results in functional consequences.

      In addition, ‘other possible effects’ that the reviewer is referring to should be evident in all transgenic embryos, irrespective of glucose dose. To the contrary, transgenic embryos cultured in low glucose conditions appear unaltered to control embryos.

      Second, we agree that we need to distinguish between strong phenotypes, visible at the level of clock arrest, and milder phenotypes, visible at the level of quantitative gene expression changes. It is important to note that the moderate phenotype, i.e. the quantitative gene expression changes seen in posterior PSM, are seen upon the addition of FBP at moderate levels and upon in glucose titration within the physiological concentration range, as well as in cytoPfkfb3 embryos. We take this as evidence that the effects seen in cytoPfkfb3 transgenic embryos reflect a common response also seen under physiological conditions.

      To extend this argument to the in vivo setting, we have performed additional experiments using a genetic mouse model for diabetes. As shown in our previous submission, cytoPfkfb3 transgenic animals do not exhibit a drastic in vivo phenotype when dissected at embryonic day 10.5. One interpretation of this finding is that since the cytoPfkfb3 phenotype is glucose and flux-dependent, the in vivo flux is low, reflecting low glucose concentrations described in vivo. To test the effect of increased flux in cytoPfkfb3 embryos in vivo, we therefore crossed the transgenic mice into a diabetic model called Akita, in which a point mutation in the Insulin2 gene causes high maternal glucose levels (Yoshioka et al., 1997; Wang et al., 1999). Using this experimental setup, we tested whether transgenic embryos in Akita diabetic females would manifest in vivo phenotypes.

      Indeed, we found that cytoPfkfb3 transgenic embryos developing in Akita diabetic females showed significantly increased cases of neural tube closure defects (50% of cytoPfkfb3 embryos) and developmental delay (control: 38 somites vs. cytoPfkfb3: 34 somites at E10.5), defects not seen in transgenic cytoPfkfb3 embryos from control females (please refer to Figure R2 below). This dependency of the in vivo phenotype on maternal glucose conditions again highlights that the defects observed in cytoPfkfb3 embryos are not due to the expression of cytoPfkfb3 per se, but are rather directly linked to increased/unregulated glycolytic flux.

      We included the new in vivo data in the revised Figure S5D-E and modified the text accordingly.

      Figure R2. In vivo phenotype of cytoPfkfb3 embryos grown in diabetic Akita females. (A) The number of somites in control (Ctrl) and cytoPfkfb3 (Tg) E10.5 embryos grown in diabetic Akita females. (B) In situ hybridization of Msgn, Uncx4.1, and Shh mRNAs in Ctrl and Tg E10.5 embryos grown in diabetic Akita females (ss, somite stage; scale bar, 500 µm).

      In conclusion, combining the arguments in the two previous comments, to which extent the results from the addition of FBP or from the transgenic activation of Pfk are not artefactual phenotypes without real physiological relevance?

      14. In our view, two main conclusions, both in vivo and in vitro, can be drawn based on the result we obtained:

      First, we find that a moderate increase in glycolytic flux, within the physiological range, leads to a quantitative and consistent change in gene expression, such as downregulation of Wnt target genes (Figure S3, 5). Such a phenotype was the result of either glucose titration or culturing cytoPfkfb3-transgenic embryos in low glucose concentration.

      In these conditions, while overall PSM patterning is qualitatively normal, we do find consistent changes at quantitative level, i.e. gene expression changes, which are also mirrored by a reduced rate of segmentation (Figure 4B). A detailed analysis of the quantitative changes at the level of segmentation clock dynamics is being carried out and will be presented in a dedicated follow up study.

      Second, we find that a very significant increase in FBP levels, i.e. when cytoPfkfb3 transgenic animals are cultured in high glucose conditions or when samples are cultured in high levels of FBP, PSM patterning is qualitatively altered and segmentation clock ceases to oscillate. In this case, we agree that it is not a physiological condition, as such high levels of flux and FBP are not reached in control samples which have intact flux regulation by Pfk. Nevertheless, such an experimental condition can be insightful, as it very clearly reveals the potential link between glycolysis, clock activity, PSM patterning and the Wnt signaling pathway.

      It is the combination between the moderate and the more severe effects, observed both in vitro, and now also in vivo using the Akita model (see above), that we take as evidence for an intrinsic, physiological link between glycolytic activity, PSM patterning and signaling.

      *3- The authors seem to give a strong functional meaning to the absence of Pfkl and Aldoa from the nuclear fraction in tails incubated with exogenous FBP, suggesting a "moonlighting" function of these enzymes under FBP regulation. In addition to the purely speculative nature of this interpretation (there is no proof for such activity or even an attempt to test it), the data provided is also difficult to interpret for various reasons. *

      15. We fully agree that we do not show a functional role for either the nuclear localization of enzymes or their dynamic change in sub-cellular localization and have tried to express this clearly in the original manuscript:

      • (Result Part, line 382-388) “While we have not been able to address the functional consequence of specific changes in subcellular localization, such as the nuclear depletion of Pfkl or Aldoa when glycolytic flux is increased, these results pave the way for future investigations on the mechanistic underpinning of how metabolic state is linked to cellular signaling and functions.”

      • (Discussion Part, line 575-577): “While future studies will need to reveal if nuclear localization of glycolytic enzymes is linked to their moonlighting functions or metabolic compartmentalization…”

      Based on this comment by the reviewer, we have further emphasised this point in the revised manuscript(line 635-639):

      “While we do not have any direct functional evidence so far for a functional role of nuclear localized glycolytic enzymes, our findings do raise the question whether their subcellular compartmentalization is linked to a non-metabolic, moonlighting function.”

      The protein levels in nuclear fractions are clearly much lower than those in the cytoplasm (this is best seen in the blots of Figure 6D). Does this represent similar subcellular distribution of these enzymes throughout the tissue or the different levels result from the presence of the enzymes in the nucleus of only a subset of the cells? This might be of importance to understand the possible relevance of the subcellular distribution of those enzymes. All the analyses were done on bulk tissue and, therefore, it is not possible to distinguishing between these possibilities. As the authors have antibodies for these enzymes, they could try to perform immunofluorescence analyses, which would provide spatial data.

      16: We agree that a spatially resolved analysis of the subcellular localization of these various enzymes is needed. Unfortunately, the immunofluorescence experiments that we performed did not yield clear, reliable results and hence we can’t provide the answer at this time.

      *In addition to this, it would be important to determine Pfkl and Aldoa subcellular localization in explants incubated with different external concentrations of glucose, which in a way reproduces better possible physiological effects (see point 1), to see if under those conditions high FBP also affects subcellular distribution of those enzymes. *

      17: Please find our response under #4 (attached below), as this important point was also raised by the reviewer 1.

      *(Our response #4) *

      *#4. We agree with the reviewer that based on the findings, one would expect the phenotype, i.e. in this case translocation of proteins, to correlate with FBP levels. Two of our results are of note in this regard. *

      *First, our data indicates that in order to see the effect on protein localization, high levels of FBP have to be reached. Accordingly, we find that Pfkl becomes depleted from the nuclear-cytoskeletal fraction in cytoPfkfb3 explants when cultured in 10 mM glucose but not (visibly) in 2.0 mM glucose (Figure 7D). Corresponding to this, FBP levels in cytoPfkfb3 explants show a significant increase (about 3-fold) from 2.0 to 10 mM glucose conditions (revised Figure 3E). *

      *Second, in control samples, FBP levels saturate in high glucose conditions. FBP levels in control samples do not further increase when glucose concentration is increased from 10mM to 25mM, and thus it does not become as high as in cytoPfkfb3 embryos cultured in 10 mM glucose (revised Figure 3E). *

      *Therefore, in order to reveal the translocation, it requires an experimental strategy that leads to significantly increased FBP levels, such as in cytoPfkfb3 explants with high glucose condition, or alternatively, direct supplementation of FBP. *

      As also pointed out by the other reviewers, we are experimentally generating controlled conditions that exceed the physiological range which the embryo is exposed to. Accordingly, our data does not constitute evidence that under physiological conditions an alteration of protein localization in response to change in glycolytic flux and FBP levels occurs, at a smaller scale.

      We regard our approach as a first step to reveal potential mechanisms and so far hidden possible responses to changes in metabolic flux. In order to see minor changes in translocation upon small changes in glycolytic-flux/FBP levels, more quantitative approaches, such as live-imaging of tagged proteins, will need to be developed. We hence decided to include these discussion in our revised manuscript (line 657-666):

      “Of note, the translocation of proteins was observed only when high levels of FBP were reached upon direct FBP supplementation or cytoPfkfb3 overexpression with high glucose (Figure 6, 7). Future studies hence need to investigate whether flux-dependent change in protein localization occurs upon moderate and more physiological changes in glycolytic-flux/FBP levels. To this end, the development of more quantitative approaches, such as live-imaging of tagged enzymes and the development of metabolite biosensors, are needed.”

      SUGGESTED ADDITIONAL EXPERIMENTS related to point #3:

      *3a- Analysis of subcellular localization of Pfkl and Aldoa by Immunofluorescence. This analysis is not limited by the amount of biological material available, so it could be applied to different experimental conditions. *

      18. We addressed this point in our response #15.

      *3b- Subcellular distribution of Pfkl and Aldoa in explants exposed to different exogenous glucose concentrations. As this involves wild type embryos, it can be done following similar protocols as in figures 6 and 7 of the manuscript. *

      19. We addressed this point in our response #16.

      4- The results from the work presented in this manuscript would indirectly indicate a negative relationship between glycolysis and somitogenesis. This contrasts with previous reports indicating the essential role of aerobic glycolysis for the same process. There is no explanation for this apparent (and important) contradiction (the authors only comment the discrepancy between the data provided in this paper and previous reports in what concerns the relationship between glycolysis and Wnt signalling, although they also do not provide an explanation).

      19. We cannot resolve this discrepancy, but now offer a more detailed discussion, also based on the additional data we obtained.

      First, it is important to point out that we have performed additional experiments to substantiate this part of the work, i.e. a transcriptome analysis with control and cytoPfkfb3 explants cultured in 10 mM glucose. We decided to focus on an early time point, i.e. three-hour after incubation, in order to increase the chance to score the primary response of PSM cells upon changes in glycolytic flux. In addition, our nanostring data in Figure S3 shows that glucose titration can change the expression levels of some Wnt-targets in both directions, i.e. decreasing glucose upregulates their expressions while increasing glucose downregulates their expressions. Again, this analysis was done at short time-scales to score the immediate effect.

      One possible explanation regarding the difference to Oginuma et al. could indeed be the late time point of analysis in their study, i.e. 16-hour after culture. This difference in sampling time, i.e. 3-hour vs. 16-hour after culture, is of particular importance given the dynamic nature of metabolic and signaling responses.

      We have added a sentence to explain this point in more detail (line 608-617):

      “This discrepancy could relate to the time point of analysis: while Oginuma et al. mainly focused on analyzing samples 16-hour after metabolic changes, we chose to score the effects of altered glycolytic flux/FBP levels already after a three-hour incubation, with the goal to capture the primary response of PSM cells. Whether the difference in sampling time underlies the observed difference is yet unknown, but both studies highlight that Wnt signaling is responsive to glycolytic flux, supporting a tight link between metabolism and PSM development.”

      Minor comments.

      *It was not specified the tissue used for the Western blot analyses (was it the PSM alone, the whole tails including somites, etc). This is of relevance to comment #3. *

      20. PSM explants without somites were cultured for one/three-hour and were subjected to subcellular protein fractionation. This information is now included in the revised method section.

      Reviewer #2 (Significance (Required)):

      -The work described in this manuscript identifies FBP as a sentinel metabolite for the glycolytic flux. This, itself has the potential to be important for different processes in which differences in glycolysis makes a difference, although I do not think that this will be relevant for the developmental process on which the authors focused their study (see major comments #1 and 2). Indeed, the lethality of global transgenic cytoPfkfb3 expression (although it was not analyzed if it was during development of in postnatal stages, or the cause of this lethality) but with very minor effects on somitogenesis in vivo supports this conclusion.

      21. Please see our detailed comments also based on the newly added in vivo experiments done with the Akita diabetic mouse model in our responses #9–14.

      *- The potential moonlighting activity of Pfk (connected with specific subcellular localization), is an interesting idea but so far does not go beyond pure speculation. This is prone to the typical double edged effect of stimulating research in that direction but also the potential negative effect of being taken for granted without rigorous proof. *

      22: We have added a statement to highlight the nature of this finding and the requirement for follow up studies both in this and other contexts. Please refer to our response #15 for the details.

      • The importance of metabolism in general and glycolysis in particular for somitogenesis and axial extension has been recently reported (the relevant papers are cited in the manuscript) and therefore the work described in this manuscript extends those studies. Also, the recent observations that metabolic process can influence cell activity beyond their participation on the classical pathways in which they are involved, including processes apparently as distant as epigenetic regulation of gene activity (see for instance Tarazona and Pourquie, 2020, Dev Cell 54, 282-292), is opening new perspectives to the study of the influence of metabolism on physiological and pathological processes (championed by cancer and immunological response). It also provides a link between control mechanisms across large scale phylogeny, from procaryotes to eukaryotes.

      -In principle, the potential audience for this work could be wide, as the interest in understanding the involvement of metabolism in the regulation of physiological and pathological processes has been growing over the last years. However, the lack of proven mechanism for the activity of FBP might restrict the real general impact of this work. In this regard, the suggestion that it might control some type of still unknown moonlighting activity of Pfk is so far totally speculative.

      • I am a developmental biologist with strong focus on mechanisms of somitogenesis and axial extension in vertebrate embryos. There is no part of this work for which I do not feel competent to evaluate.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      *Summary - *

      *In the present manuscript, Miyazawa and colleagues explore the role of glycolytic flux on embryonic development by using presomitic mesoderm (PSM) patterning as a model. *

      *First, the authors examined the steady-state levels of central carbon metabolism metabolites in PSM explants. Explants were cultured in various concentrations of glucose and subjected to gas chromatography mass spectrometry (GC-MS). These experiments allowed the identification of metabolites (such as lactate, 3PG, and FBP) that exhibit a linear correlation with glucose levels and can therefore serve as sentinel metabolites for glycolytic flux in PSM cells. Among the metabolites identified, fructose 1,6-bisphosphate (FBP) showed the strongest linear correlation with glucose levels and was used to inform the design of subsequent experiments. *

      *Second, to elucidate the functional role of FBP on PSM patterning, the authors supplement the media used to culture PSM explants with various concentrations of FBP and: *

      *- analyze the dynamics of Notch signaling (a critical player in mesoderm segmentation during embryogenesis) using real-time imaging of the LuVeLu reporter; *

      *- assess gene expression patterns using in situ hybridization of candidate genes. *

      *The authors find that supplementation with FBP, but not F6P or 3PG, impairs mesoderm segmentation and disrupts the activity of the segmentation clock in the posterior PSM. Furthermore, FBP supplementation led to the reduced expression of FGF- and WNT-target genes Dusp4 and Msgn, respectively. *

      *Third, the authors generate a conditional cytoPfkfb3 transgenic mouse line in which a cytoplasmic form of the Pfkfb3 enzyme is overexpressed. Pfkfb3 can promote glycolysis, and more importantly, leads to increased levels of FBP in a glucose-dependent manner. The authors find that cytoPfkfb3 transgenic PSM explants contain higher levels of FBP and secrete lactate at higher levels when compared to control explants. Importantly, cytoPfkfb3 transgenic PSM explants exhibit impaired somite formation and reduced expression of Msgn (but not Dusp4) in a glucose-dependent manner when compared to control explants. *

      Finally, the authors investigate changes in protein subcellular localization in their pharmacological and genetic models of FBP-driven glycolytic flux activation. This was prompted by previous reports on the changes in subcellular localization of glycolytic enzymes (Hu et al., 2016). To this end, the authors perform proteome-wide cell-fractionation analyses in drug-treated and cytoPfkfb3 transgenic PSM explants and find that certain glycolytic proteins exhibit altered subcellular localization in both cases (albeit in different fractions).

      *Major concerns: *

      *- (Re: Results from Fig. 2 and Fig. S1.) *

      *o Given that FBP levels are highly correlated with extracellular glucose levels (which impact glycolytic flux )(TeSlaa and Teitell, 2014) the authors should elaborate on why progressive increase in extracellular glucose does not affect PSM patterning, in the same way that increasing FBP levels does. This is especially important given the claim that FBP is a sentinel metabolite of glycolytic flux. *

      23. This important point was also addressed by the reviewer 2, so please see our responses that are also listed under #9, #10, #14 (attached below).

      *(Our response #9) *

      *We agree with the reviewer and to directly address this central point, we have performed an extended, additional experiment, collecting 375 embryos to quantify FBP levels under five conditions with three biological replicates. *

      *There are two major results that we highlight here: First, we found that addition of F6P did not lead to increased FBP levels compared to control samples cultured in 10 mM glucose, which is in stark contrast to cytoPfkfb3 embryos cultured in 10 mM glucose (revised Figure 3E). Second, while increasing glucose concentration is mirrored by elevated FBP levels as we reported, we find clear evidence of saturation above a concentration of 10mM glucose: increasing glucose to 25mM does not increase FBP levels further (revised Figure 3E). *

      This saturation effect seen in glucose titration, but also the absence of elevated FBP upon F6P addition, might be expected outcomes because, as also the reviewer 1 pointed out in the response, Pfk is commonly considered to be a rate-limiting enzyme in the glycolytic pathway. We now have the direct experimental data supporting this hypothesis and thank the reviewers to have initiated this additional (very involved..) experiment.

      *This new data allows us to conclude more firmly on the correlation between FBP levels and phenotype: at high FBP levels, which are seen in cytoPfkfb3 samples, we observe PSM patterning defects. These high levels are not reached even at 25mM glucose or upon F6P addition, due to the saturation at the level of PFK enzymatic step. Hence, while glucose titration does elevate FBP significantly until this saturation, FBP levels are not as high as in cytoPfkfb3 samples. As a correlative finding, we see that only those conditions with very high FBP levels, or the direct addition of high levels of FBP, cause the arrest of segmentation clock activity. At moderately elevated FBP levels, observed in control explants with high glucose or in cytoPfkfb3 explants with low glucose, clock activity continues and we find a quantitative effect at the level of gene expression, i.e. Wnt signaling target downregulation (Figure 5A, S3). *

      The new data has been included in the revised manuscript and the text has been adjusted accordingly:

      - (Result Part, line 245–254) "Consistently, we found that cytoPfkfb3 overexpression lifted the upper limit of FBP levels in PSM cells (Figure 3E, S4B, S4C). In control explants, FBP levels did not increase further when glucose concentration was increased from 10 mM to 25 mM. It was also the case when control explants were cultured in 20 mM of F6P (Figure 3E). These results indicate that the Pfk reaction carries a (rate-)limiting role for glycolytic flux and FBP levels, and that cytoPfkfb3 overexpression hinders the flux-regulation function of Pfk."

      - (Discussion Part, line 551–573) “Our findings suggest that flux-regulation at the level of Pfk is critical to keep FBP steady state levels within a range compatible with proper PSM patterning and segmentation. In agreement with such a rate-limiting function for Pfk, we found in glucose titration experiments that FBP levels saturated and did not further increase at glucose levels above 10 mM (Figure 3E). Along similar lines, the supplementation of high concentrations of the Pfk substrate F6P did not result in a significant increase of FBP levels, again compatible with a rate-limiting function at the level of Pfk (Figure 3E). The upper limit of glycolytic flux and FBP levels can be experimentally increased by cytoPfkfb3 overexpression (Figure 3B, 3E). We interpret the data as evidence that cytoPfkfb3 overexpression compromises the flux-control function of Pfk and hence much higher FBP (and secreted lactate) levels are reached. Such a drastic increase in glycolytic flux and FBP levels correlates with a severe PSM patterning phenotype (Figure 4), which resembles the phenotype induced by supplementation of high dose of FBP (Figure 2). Our results in mouse embryos hence provides evidence that flux regulation by Pfk, an evolutionary conserved role present from bacteria to humans, serves to maintain FBP levels below a critical threshold.”

      (Our response #10)

      *#10. First, we would like to clarify that while indeed glycolytic activity is graded along the PSM, as other and we reported previously (reported in Bulusu et al., 2017 and Oginuma et al., 2017), the baseline expression of the entire glycolytic machinery (from glucose transport to lactate production) is very high, in all PSM cells. Hence, we see that cells all along the entire PSM have very active glycolysis, the posterior PSM being even more active. *

      *For this and related reasons, our interpretation about the difference seen between glucose titration/F6P addition on one side, and FBP addition/cytoPfkfb3 addition on the other side, is based on the role of Pfk in controlling either flux levels or dynamics in all PSM cells. *

      Hence, while we agree that we generate experimental conditions that allow FBP levels to surpass those found in control embryos, we would like to highlight the fact that even moderate changes in flux does result in very robust functional consequences on gene expression (Figure S3, 5), as we show in this work.

      *We can currently not fully address the first point raised, i.e. the role of graded flux/graded metabolite levels, due to the experimental limitations. Such a study requires, for instance, the generation of metabolite biosensor reporter lines in order to be able to monitor these changes dynamically, in space and time. *

      (Our response #14)

      *In our view, two main conclusions, both in vivo and in vitro, can be drawn based on the result we obtained: *

      *First, we find that a moderate increase in glycolytic flux, within the physiological range, leads to a quantitative and consistent change in gene expression, such as downregulation of Wnt target genes (Figure S3, 5). Such a phenotype was the result of either glucose titration or culturing cytoPfkfb3-transgenic embryos in low glucose concentration. *

      In these conditions, while overall PSM patterning is qualitatively normal, we do find consistent changes at quantitative level, i.e. gene expression changes, which are also mirrored by a reduced rate of segmentation (Figure 4B). A detailed analysis of the quantitative changes at the level of segmentation clock dynamics is being carried out and will be presented in a dedicated follow up study.

      *Second, we find that a very significant increase in FBP levels, i.e. when cytoPfkfb3 transgenic animals are cultured in high glucose conditions or when samples are cultured in high levels of FBP, PSM patterning is qualitatively altered and segmentation clock ceases to oscillate. In this case, we agree that it is not a physiological condition, as such high levels of flux and FBP are not reached in control samples which have intact flux regulation by Pfk. Nevertheless, such an experimental condition can be insightful, as it very clearly reveals the potential link between glycolysis, clock activity, PSM patterning and the Wnt signaling pathway. *

      *It is the combination between the moderate and the more severe effects, observed both in vitro, and now also in vivo using the Akita model (see above), that we take as evidence for an intrinsic, physiological link between glycolytic activity, PSM patterning and signaling. *

      - (Re: Fig. 2A and Fig. 2B)

      *o The authors should be consistent with the glucose concentrations for the experiments where they assess the dynamics of Notch signaling (Figure 2A) and gene expression (Figure 2B) or otherwise elaborate on why different concentrations are used for these assays. *

      24: We agree that ideally the experimental parameters should be as consistent as possible. In regards to the control glucose concentration used in this study, both 0.5 mM and 2.0 mM glucose were used. It reflects that over the years, minor adjustments in the experimental protocol were made, i.e. we now use 2.0 mM glucose as standard setting for all experiments, while previously, 0.5 mM glucose was used (see Bulusu et al., 2017). This change is based on the observation of a slightly improved culture outcome, in terms of reporter gene expression. We have confirmed that the developmental outcome and also effects seen upon addition of FBP are consistent at 0.5 mM and at 2.0 mM glucose. We made a note in the methods section to explain this point (line 1082-1084):

      “Basal culture condition was 0.5 mM glucose at the beginning of this study but was later switched to 2.0 mM glucose which yields a slightly improved reporter gene expression. No major difference was observed in the effects of FBP between these glucose conditions.”

      *- (Re: Results from pharmacological and genetic models of increased FBP levels) *

      *o The authors state that FBP-driven impairment of mesoderm segmentation is most pronounced in the undifferentiated PSM cells (in the posterior-most end of the explants) and is, therefore, unlikely to be due to a toxic effect that might otherwise affect the whole explant. While this is a reasonable assumption, it does not discount the possibility that the spatial specificity of the effect of FBP could be driven primarily by increased cell death in the posterior end of the explant. Thus, the authors should test whether cell death underlies the mesoderm patterning defects seen in PSM explants subjected to increased FBP levels. *

      25. We have performed immunostaining of active caspase-3 in explants cultured for three-hour in medium containing 0.5 mM glucose and 20 mM FBP and found no difference between control and FBP-treated explants (please refer to the Figure R2 below). This qualitative result does not indicate a major effect via cell death in the tail bud region (i.e. posterior PSM) as the underlying reason for the observed phenotype. We included the new data in the revised Figure S2C and adjusted the text accordingly.

      Figure R3. Immunostaining of active caspase-3 in PSM explants. Explants were cultured for three hours in the presence or absence of 20 mM of FBP. Neural tubes were outlined by white dotted lines.

      *- (Re: Gene expression experiments/analyses) *

      *o This study would benefit greatly from transcriptomic analysis of wt and cytoPfkfb3 transgenic PSM explants (and/or transcriptomic characterization of FBP-treated vs. control PSM explants). The candidate approach used to assess gene expression (through in situ hybridization) may not be sufficient to conclude that cytoPfkfb3 over-expression leads to the downregulation of Wnt signaling (a claim the authors make at the beginning of the manuscript). *

      26. We fully agree with the reviewer’s comment. We have now performed RNA-sequencing (RNAseq) analysis using control and cytoPfkfb3 explants cultured in 10 mM glucose, importantly after three hours of incubation in order to score early effects at transcriptome level (please refer to Figure R4).

      We found clear evidence that many Wnt-target genes (i.e. Axin2, Cdx4, Dact1, Dkk1, Mixl1, Msgn1, Sp5, Sp8, T) were significantly downregulated in cytoPfkfb3 explants, supporting the conclusion that Wnt signaling activity is downregulated in cytoPfkfb3 explants under high glucose condition.

      Furthermore, in order to examine similarities between the effects of cytoPfkfb3 overexpression and FBP supplementation, we also performed RNAseq analysis with explants treated with high dose of FBP or F6P. FBP supplementation resulted in downregulation of Wnt target gene expression (i.e. Dact1, Dkk1, Mixl1, Lef1, Sp5, T, Tbx6), mirroring the effects seen in cytoPfkfb3 samples. Such a response was not detected in F6P-treated explants.

      Combined, these new data significantly strengthen our conclusion that an increase in glycolytic flux and FBP levels leads to downregulation of Wnt signaling activity. The new data is now included in the revised Figure 5C–E and adjusted the texts accordingly.

      Figure R4. Transcriptome analysis of control (Ctrl) and cytoPfkfb3 (TG) PSM explants. PSM explants were cultured for three hours under different culture conditions. (A) Effects of cytoPfkfb3 overexpression on gene expression under 10 mM glucose condition. (B, C) Effects of 20 mM FBP (B) or F6P (C) on gene expression under 2.0 mM glucose condition. Wnt-target genes that were significantly downregulated in cytoPfkfb3 or FBP/F6P-treated explants are highlighted in blue.

      *- (Re: Results related to the neural tube closure defects in cytoPfkfb3 transgenic embryos) *

      *o The section of the manuscript describing the neural tube closure defects in cytoPfkfb3 transgenic embryos is superficial, lacks detail, and distracts from the focus of the study. Perhaps the data and text on neural tube closure defects should be included as supplemental information. *

      27: We agree with the reviewer that in the previous version, this data appeared isolated. It also connects with the point raised by the reviewer 2 about the in vivo significance of our findings. To address both these points, we have now performed additional in vivo experiments using a diabetic mouse model (Akita) to directly test the in vivo consequence of cytoPfkfb3, which interestingly links to the previous findings of neural tube defects. Please see our response #13 for the details (attached below):

      (Our response #13)

      *First of all, we would like to emphasize that the phenotype seen in cytoPfkfb3 embryos, i.e. the reduction of segmentation and downregulation of Wnt-target gene expression, occurs in a glucose dose dependent manner (Figure 4B and 5A). Hence, it is not the overexpression of cytoPfkfb3 per se that can account for the effects seen. But rather, increased glycolytic flux caused by the combination of transgene expression with high glucose results in functional consequences. *

      In addition, ‘other possible effects’ that the reviewer is referring to should be evident in all transgenic embryos, irrespective of glucose dose. To the contrary, transgenic embryos cultured in low glucose conditions appear unaltered to control embryos.

      *Second, we agree that we need to distinguish between strong phenotypes, visible at the level of clock arrest, and milder phenotypes, visible at the level of quantitative gene expression changes. It is important to note that the moderate phenotype, i.e. the quantitative gene expression changes seen in posterior PSM, are seen upon the addition of FBP at moderate levels and upon in glucose titration within the physiological concentration range, as well as in cytoPfkfb3 embryos. We take this as evidence that the effects seen in cytoPfkfb3 transgenic embryos reflect a common response also seen under physiological conditions. *

      *To extend this argument to the in vivo setting, we have performed additional experiments using a genetic mouse model for diabetes. As shown in our previous submission, cytoPfkfb3 transgenic animals do not exhibit a drastic in vivo phenotype when dissected at embryonic day 10.5. One interpretation of this finding is that since the cytoPfkfb3 phenotype is glucose and flux-dependent, the in vivo flux is low, reflecting low glucose concentrations described in vivo. To test the effect of increased flux in cytoPfkfb3 embryos in vivo, we therefore crossed the transgenic mice into a diabetic model called Akita, in which a point mutation in the Insulin2 gene causes high maternal glucose levels (Yoshioka et al., 1997; Wang et al., 1999). Using this experimental setup, we tested whether transgenic embryos in Akita diabetic females would manifest in vivo phenotypes. *

      Indeed, we found that cytoPfkfb3 transgenic embryos developing in Akita diabetic females showed significantly increased cases of neural tube closure defects (50% of cytoPfkfb3 embryos) and developmental delay (control: 38 somites vs. cytoPfkfb3: 34 somites at E10.5), defects not seen in transgenic cytoPfkfb3 embryos from control females (please refer to Figure R2 below). This dependency of the in vivo phenotype on maternal glucose conditions again highlights that the defects observed in cytoPfkfb3 embryos are not due to the expression of cytoPfkfb3 per se, but are rather directly linked to increased/unregulated glycolytic flux.

      We included the new in vivo data in the revised Figure S5D-E and modified the text accordingly.

      *Figure R2. In vivo phenotype of cytoPfkfb3 embryos grown in diabetic Akita females. (A) The number of somites in control (Ctrl) and cytoPfkfb3 (Tg) E10.5 embryos grown in diabetic Akita females. (B) In situ hybridization of Msgn, Uncx4.1, and Shh mRNAs in Ctrl and Tg E10.5 embryos grown in diabetic Akita females (ss, somite stage; scale bar, 500 µm). *

      • (Re: Conclusions of the study)

      o A previous study by Oginuma et al., 2020 provided strong evidence for a mechanism underlying the positive regulation of Wnt signaling by glycolysis (initiated by the elevation of intracellular pH) in the chick embryo tailbud. As mentioned in the discussion, the results of the present study are not consistent with this mode - and this contradiction is not sufficiently resolved. This is a concern, given that the evidence that cytoPfkfb3 inhibits Wnt signaling is sparse (see above).

      28: This important point was also raised by the reviewer 2, please see our response as listed under #19 (attached below).

      (Our response #19)

      *We cannot resolve this discrepancy, but now offer a more detailed discussion, also based on the additional data we obtained. *

      *First, it is important to point out that we have performed additional experiments to substantiate this part of the work, i.e. a transcriptome analysis with control and cytoPfkfb3 explants cultured in 10 mM glucose. We decided to focus on an early time point, i.e. three-hour after incubation, in order to increase the chance to score the primary response of PSM cells upon changes in glycolytic flux. In addition, our nanostring data in Figure S3 shows that glucose titration can change the expression levels of some Wnt-targets in both directions, i.e. decreasing glucose upregulates their expressions while increasing glucose downregulates their expressions. Again, this analysis was done at short time-scales to score the immediate effect. *

      *One possible explanation regarding the difference to Oginuma et al. could indeed be the late time point of analysis in their study, i.e. 16-hour after culture. This difference in sampling time, i.e. 3-hour vs. 16-hour after culture, is of particular importance given the dynamic nature of metabolic and signaling responses. *

      We have added a sentence to explain this point in more detail (line 608-617):

      “This discrepancy could relate to the time point of analysis: while Oginuma et al. mainly focused on analyzing samples 16-hour after metabolic changes, we chose to score the effects of altered glycolytic flux/FBP levels already after a three-hour incubation, with the goal to capture the primary response of PSM cells. Whether the difference in sampling time underlies the observed difference is yet unknown, but both studies highlight that Wnt signaling is responsive to glycolytic flux, supporting a tight link between metabolism and PSM development.”

      *o Another discrepancy lies in the lack of an observable phenotype when culturing mouse PSM explants at very low glucose concentrations (e.g., 0.5 mM in Fig. 2A). Oginuma et al. observed clear disruptions to embryonic elongation and somite formation at a glucose concentration equal to 0.83 mM. Would this be due to species-specific mechanisms? Furthermore, while the authors focus on sentinel metabolites (such as FBP), experiments involving direct manipulation in glycolysis could resolve some of these inconsistencies. *

      29: Indeed species specific differences in the requirement for glucose are to be expected. Our extensive analysis shows that at 0.5mM glucose, segmentation and elongation proceeds (Bulusu et al., 2017).

      Regarding the second point, we have outlined several strategies to directly perturb glycolysis, i.e. glucose titration (mirrored by increase in lactate secretion) and by genetic targeting of the rate-limiting enzyme, Pfk. Glucose titration in wild-type embryos corresponds to the experiment the reviewer suggested, and we again found that higher glucose (i.e. higher flux) leads to down regulation of several Wnt-target genes (Figure S3). Of note, also in cytoPfkfb3 explants the effects are glucose-dose dependent (again mirrored by increase of lactate secretion), clearly indicating that we successfully and directly controlled glycolysis.

      *References - *

        • Hu, Hai, et al. "Phosphoinositide 3-kinase regulates glycolysis through mobilization of aldolase from the actin cytoskeleton." Cell 164.3 (2016): 433-446. *
        • TeSlaa, Tara, and Michael A. Teitell. "Techniques to monitor glycolysis." Methods in enzymology 542 (2014): 91-114. *
        • Oginuma, Masayuki, et al. "Intracellular pH controls WNT downstream of glycolysis in amniote embryos." Nature584.7819 (2020): 98-101. * *Reviewer #3 (Significance (Required)): *

      The experimental results reported in this study enhance our understanding of how cellular metabolic states regulate cellular behaviors during embryonic development. The study provides insight into how PSM elongation is controlled by morphogenetic mechanisms that are modulated by glycolytic flux. One of the strengths of the study is the use of an interdisciplinary approach that includes GC-MS, in vivo imaging and mouse transgenic lines. It should be noted that some of the conclusions of the study diverge from previous papers that examine the role of metabolism in developmental patterning (e.g., Oginuma et al., 2020).

    1. The geography theory postulates that prosperity and poverty of a country are caused by its geography especially tropical versus temperate climate which may influence the attitude of people, the diseases that can impact health, tropical soil which is not very conducive to agriculture as well as the flora/fauna of the place. I think there is another element of the geography theory that we can evaluate is the availability of broader natural resources. Logically the country with higher natural resources should be growing faster than those with lesser. However, in their paper, “Natural resource abundance and economic growth” (published in National Bureau of Economic Research, 1995 https://www.nber.org/system/files/working_papers/w5398/w5398.pdf), Jeffery Sachs and Andrew Warner from Harvard institute of international development, conclude that natural resource-rich countries tend to grow slower than those with scare resources in their study of 97 developing countries over two decades (1970-1989). The key hypothesis validated by them was that resource-rich countries tend to focus their labor on extracting natural resources thereby leaving few resources and investments into manufacturing and value-added industries. In addition, they practice protectionist state-led development policies which lead to lower investment and hence lower growth. So in many ways, riches by themselves become a curse versus a boon.

    1. Author Response

      Reviewer #2 (Public Review):

      The manuscript presents an interesting study on a timely topic (hyperacusis). The study was carried out in awake animals using modern approaches in neurosciences (calcium imaging, optogenetic). The amount of data is impressive, the study is very ambitious, and overall its quality is indisputable. However, I have some general comments and questions on some concepts that are critical for the study, and also on the interpretation of the data, in particular the behavioral data.

      We appreciate Reviewer 2’s overall positive evaluation as well as their more specific critiques, which we address below.

      The first point I want to mention is the concept of 'homeostatic plasticity'. I am not sure we agree on its definition. My understanding of it is that the AVERAGE of central activity will remain constant around a set point value. In case of a reduction of sensory inputs (hearing loss), the neurons' sensitivity will be enhanced in such a way that the averaged activity will be preserved. So, neural hyperactivity after partial or sensory deprivation is not 'maladaptive': it is a collateral effect, 'the price to pay' for maintaining neural activity stable around a given value. In my opinion, this point is crucial. The authors should also mention and cite the model's paper from Schaette et al.

      “Homeostasis” is a term used widely in physiology to describe a negative feedback process in which an internal adjustment compensates for an external perturbation to return a given system (temperature, pH, etc.) to a set point. To the reviewer’s point, homeostatic processes – broadly defined – can work at many different biological scales including perhaps large, distributed systems like the example s/he gave of neurons throughout the central auditory pathway. By contrast, “homeostatic plasticity” is a mechanism studied by dozens of laboratories in hundreds of papers by which neurons (typically studied in cortical neurons) adjust their synaptic and intrinsic excitability to maintain their activity around a set point range. A key feature of homeostatic plasticity is that neurons “sense” deviations from their set point and initiate a compensatory process to offset this deviation. Up to this point, it seems that we are on the same page as the reviewer.

      The first point of possible disagreement lies in the interpretation of how excess neural activity relates to homeostatic plasticity. The reviewer mentioned modeling papers by Schaette and Kempter (2006, 2007, 2012) on the cochlear nucleus, which are also based on homeostatic plasticity and their work is now cited in the revised text (see line 71). The reviewer is correct that there is a difference in how the term is used and interpreted, but the difference is fairly subtle. Their work and our work propose that homeostatic plasticity processes are applied within a single neuron to offset the reduced afferent input that accompanies cochlear damage. As the reviewer recalled, they describe hyperactivity as a consequence of this compensation, as we do as well. The only difference is that they and the reviewer describe hyperactivity as the byproduct of the normal, successful implementation of homeostatic plasticity, which it unequivocally is not because – by definition – homeostatic plasticity is a stabilizing process that maintains activity at a predetermined set point range.

      The second point of disagreement lies in the reviewer’s statement that “neural hyperactivity after partial or sensory deprivation is not 'maladaptive': it is a collateral effect, 'the price to pay' for maintaining neural activity stable around a given value.” We disagree. Hyperactivity can be both a collateral and maladaptive effect. Hyperactivity and hypersynchrony are understood to be the basis of tinnitus, which is a maladaptive, disordered state. The reviewer’s comment implies that there is no alternative for compensating for sensory deprivation but to make cortical neurons hyperactive. We see no reason why this must be so. In fact, stabilization of activity rates after sensory deprivation has been demonstrated in hundreds of studies in the developing visual system. In the adult auditory system, activity in cortical neurons is initially depressed after injury before rebounding to exceed baseline levels (see Resnik Polley 2017 eLife, Asokan 2018 Nat Comm., Resnik Polley 2021 Neuron). It is not obligatory for cortical activity rates to pass through the set point range and continue into hyperactivity, nor is it obligatory for cortical activity rates to remain elevated above baseline many days after the injury. Additional evidence for this point comes from Figures 4, 6, and 8, which show that some cortical neurons actually do homeostatically regulate their activity back to baseline (i.e., show stable gain). This raises the intriguing question of why some neurons recover to their homeostatic activity set point while others do not. Figure 8 provides new insight into this question by showing that that their baseline response properties can account for 40% of the variability in gain stabilization after peripheral insult.

      A third point of disagreement related to the reviewer’s statement that “My understanding of it is that the AVERAGE of central activity will remain constant around a set point value. In case of a reduction of sensory inputs (hearing loss), the neurons' sensitivity will be enhanced in such a way that the averaged activity will be preserved”. We agree that homeostatic plasticity processes are influenced by activity propagating through distributed neural networks. However, the biological implementation of the process is programmed into individual neurons. The activity set point is neuron-specific, the error signal that encodes a deviation from the set point is neuron-specific, and the transcriptional/translational changes deployed to stabilize the activity rate are neuron-specific. As an analogy, home climate control systems work autonomously for each house, because the sensors (thermostat) and actuators (heating/cooling) are sensitive to fluctuations in that home, not across other houses in the town. The heating and cooling systems for each house in town may be driven by a distributed, common source (e.g., a hot day) but the mechanisms that bring the ambient temperature back to the set point for each house are autonomous and reflect the particular thermostat programming for each house. The widely studied homeostatic plasticity mechanisms mentioned in our manuscript (e.g., excitatory synaptic scaling) are not sensitive to and do not target the averaged neural activity among millions of neurons distributed throughout the sensory neuroaxis.

      As a final point on this statement, there is no demonstration that we are aware of that average central activity remains constant after a reduction of sensory inputs. This would require recording from many neurons across multiple stages of the sensory pathway in a single animal to show that the increased gain at later stages in the system exactly offsets the reduced responsiveness at earlier stages of the system. So, the reviewer’s definition of homeostatic plasticity is based on a general supposition about a distributed process that has never been empirically demonstrated whereas the definition we use is consistent with the mechanisms and terminology used throughout the neuroscience literature (albeit often incorrectly in the hearing loss literature).

      The second point is that a lot is built on the behavioral procedure and d'. I am not convinced by the behavioral procedure (and the d') is a convincing measurement of loudness (and therefore loudness hyperacusis). So, in my opinion, the title may be changed and more importantly the entire spirit of the paper should be modified.

      The reviewer’s critique as well as comments from other reviewers helped us realize that we had used the terms “hyperacusis” and “loudness” imprecisely. We think that is part of the confusion. What we have studied here is auditory hypersensitivity after sensorineural hearing loss, which may or may not be a model of why persons with hyperacusis can exhibit loudness hypersensitivity.

      Once “hyperacusis” and “loudness” have been stripped away from the behavior, we contend that we have a behavioral assay for auditory hypersensitivity, which is the main point of our study. To be clear, the behavioral readout most commonly employed in the animal literature to model hyperacusis is reaction time, which has a less direct relationship to hypersensitivity than does d’. D-prime is widely used as the sensitivity index in detection behaviors. The main advantage of d’ is that it controls for differences in response bias either between subjects or after noise exposure. We used the d’ metric to show that mice can more reliably detect tone levels near their sensation threshold and can more reliably detect direct stimulation of thalamocortical projection neurons after acoustic trauma. These observations provide the framework for all of the neural measurements that follow.

      On the balance, the reviewer was correct that our imprecise use of hyperacusis and loudness was confusing and contradictory. The terms “hyperacusis” and “loudness” now only appear in the manuscript to describe other published findings or to describe what our study does not address. This resulted in several small text changes throughout the manuscript as well as a direct statement about the relationship between our work, loudness, and hyperacusis on Pg. 14, Lns 448-466.

      “While the findings presented here support an association between sensorineural peripheral injury, excess cortical gain, and behavioral hypersensitivity, they should not be interpreted as providing strong evidence for these factors in clinical conditions such as tinnitus or hyperacusis. Our data have nothing to say about tinnitus one way or the other, simply because we never studied a behavior that would indicate phantom sound perception. If anything, one might expect that mice experiencing a chronic phantom sound corresponding in frequency to the region of steeply sloping hearing loss would instead exhibit an increase in false alarms on high-frequency detection blocks after acoustic trauma, but this was not something we observed. Hyperacusis describes a spectrum of aversive auditory qualities including increased perceived loudness of moderate intensity sounds, a decrease in loudness tolerance, discomfort, pain, and even fear of sounds (Pienkowski et al., 2014a). The affective components of hyperacusis are more challenging to index in animals, particularly using head-fixed behaviors, though progress is being made with active avoidance paradigms in freely moving animals (Manohar et al., 2017). Our noise-induced high-frequency sensorineural hearing loss and Go-NoGo operant detection behavior were not designed to model hyperacusis. Hearing loss is not strongly associated with hyperacusis, where many individuals have normal hearing or have a pattern of mild hearing loss that does not correspond to the frequency dependence of their auditory sensitivity (Sheldrake et al., 2015). While the excess central gain and behavioral hypersensitivity we describe here may be related to the sensory component of hyperacusis, this connection is tentative because it was elicited by acoustic trauma and because the detection behavior provides a measure of stimulus salience, but not the perceptual quality of loudness, per se.”

      A lot is derived/interpreted from the results, but I believe there is a lot of over-interpretation. I would suggest the authors be more cautious and moderate in their speculations and conclusions. I would reconfigure the manuscript, and simplify it.

      We believe that the changes mentioned above and in the response to their specific comments below reduce over-interpretation and simplify the manuscript.

      As an example of a change made to moderate the conclusions from our work, we added the following to Pg. 14, Lns 442-447

      “Further, while the perceptual salience (Figure 2) and neural decoding of spared, 8kHz tones (Figure 5) were both enhanced after high-frequency sensorineural hearing loss, these measurements were not performed in the same animals (and therefore not at the same time). Definitive proof that increased cortical gain is the neural substrate for auditory hypersensitivity after hearing loss would require concurrent monitoring and manipulations of cortical activity, which would be an important goal for future experiments.”

      Reviewer #3 (Public Review):

      The study uses a mouse animal model of sensorineural hearing loss after sound overexposure at high frequencies that mimics ageing sensorineural hearing loss in humans. Those mice present behavioural hypersensitivity to mid-frequency tones stimuli that can be recreated with optogenetic stimulation of thalamocortical terminals in the auditory cortex. Calcium chronic imaging in pyramidal neurons in layers 2-3 of the auditory cortex shows reorganization of the tonotopic maps and changes in sound intensity coding in line with the loudness hypersensitivity showed behaviourally. After an initial state of neural diffuse hyperactivity and high correlation between cells in the auditory cortex, changes concentrate in the deafferented high-frequency edge by day 3, especially when using mid-frequency tones as sound stimuli. Those neurons can show homeostatic gain control or non-homeostatic excess gain depending on their previous baseline spontaneous activity, suggesting a specific set of cortical neurons prompt to develop hyperactivity following acoustic trauma.

      This study is excellent in the combination of techniques, especially behaviour and calcium chronic imaging. Neural hyperactivity, increase in synchrony, and reorganization of the tonotopic maps in the auditory cortex following peripheral insult in the cochlea has been shown in seminal papers by Jos Eggermont or Dexter Irvine among others, although intensity level changes are a new addition. More importantly, the authors show data that suggest a close association between loudness hypersensitivity perception and an excess of cortical gain after cochlear sensorineural damage, which is the main message of the study.

      The problem is that not all the high-frequency sensorineural hearing loss in humans present hyperacusis and/or tinnitus as co-morbidities, in the same manner that not all animal models of sensorineural hearing loss present combined tinnitus and/or hyperacusis. In fact, among different studies on the topic, there is a consensus that about 2/3rds or 70% of animals with hearing loss develop tinnitus too, but not all of them. A similar scenario may happen with hearing loss and hyperacusis. Therefore, we need to ask whether all the animals in this study develop hyperacusis and tinnitus with the hearing loss or not, and if not, what are the differences in the neural activity between the cases that presented only hearing loss and the cases that presented hearing loss and hyperacusis and/or tinnitus. It could be possible that the proportion of cells showing non-homeostatic excess gain were higher in those cases where tinnitus and hyperacusis were combined with hearing loss.

      We thank the reviewer for her/his careful reading of the original manuscript and many helpful suggestions and critiques that have been addressed in the revision. Both Reviewer 2 and Reviewer 3 understood that we were presenting our high-frequency sensorineural hearing loss manipulation as a way to model the clinical phenomenon of hyperacusis. This was not our intent, and we regret the wording of the original manuscript communicated this point. In fact, the clinical literature shows that hyperacusis does not have a strong association with hearing loss and moreover our behavioral and neural outcome measures were not designed to index the core phenotype of hyperacusis (a spectrum of sound-evoked distress, disproportionate scaling of loudness with sound level, and sound-evoked pain). Our study addresses the neural and behavioral signatures of auditory hypersensitivity, which is an “upstream” condition that may (or may not) be related to the presentation of clinical phenomena like hyperacusis and tinnitus.

      The reviewer mentions a litmus test for animal models of tinnitus, in which the utility of an animal model for tinnitus would be evaluated in part based on whether a controlled insult only produced a behavioral change suggestive of a chronic phantom percept in a fraction of animals. That may be so, but our study is clearly not modeling tinnitus and we make no claims to this effect in the original or revised manuscript. The Reviewer then goes on to say that “a similar scenario may happen with hearing loss and hyperacusis”. “May” is the operative word here because the association between sensorineural hearing loss and the clinical presentation hyperacusis is quite weak overall in human subjects but no study (that we are aware of) has attempted to document the probabilistic appearance of hyperacusis before and after acoustic trauma. So, we really don’t know whether hyperacusis has a probabilistic appearance like tinnitus or is more deterministic like cochlear threshold shift. But, again, the main point is that our experiments make no direct claim about hyperacusis one way or the other, which we now clarify and discuss throughout the revised text, as detailed below.

      We do contend that our experiments allow us to study auditory hypersensitivity, though again there is no precedent or consensus in the literature for expecting auditory hypersensitivity to present probabilistically or deterministically across mice after a controlled insult. Regardless, we agree with the reviewer that it is a very good idea to provide the individual animal data to the reader. We added new panels to Figure 2C to show that an increase in the 8kHz d’ slope after noise exposure (i.e., a change > 1) was observed in 7/7 mice that underwent acoustic trauma but 1/6 mice in the sham exposure group, suggesting a deterministic, binary behavioral effect found in every mouse with noise-induced high-frequency sensorineural damage. On the other hand, within the acoustic trauma cohort, 3 mice showed marked increases in the d’ growth slope (> 2) while 4 showed more subtle changes, suggesting a more graded or probabilistic effect. By providing the individual animal data as per the Reviewer’s request, the reader can now make a more informed determination about the reliability of auditory hypersensitivity within the acoustic trauma cohort.

      Regarding the relationship between the peripheral/cortical/perceptual auditory hypersensitivity we report here and the clinical conditions of tinnitus and hyperacusis, we revised the text such that the word “hyperacusis” only appears in the context of other publications and have added the following text (Pg. 14, Lns 448-466).

      “While the findings presented here support an association between sensorineural peripheral injury, excess cortical gain, and behavioral hypersensitivity, they should not be interpreted as providing strong evidence for these factors in clinical conditions such as tinnitus or hyperacusis. Our data have nothing to say about tinnitus one way or the other, simply because we never studied a behavior that would indicate phantom sound perception. If anything, one might expect that mice experiencing a chronic phantom sound corresponding in frequency to the region of steeply sloping hearing loss would instead exhibit an increase in false alarms on high-frequency detection blocks after acoustic trauma, but this was not something we observed. Hyperacusis describes a spectrum of aversive auditory qualities including increased perceived loudness of moderate intensity sounds, a decrease in loudness tolerance, discomfort, pain, and even fear of sounds (Pienkowski et al., 2014a). The affective components of hyperacusis are more challenging to index in animals, particularly using head-fixed behaviors, though progress is being made with active avoidance paradigms in freely moving animals (Manohar et al., 2017). Our noise-induced high-frequency sensorineural hearing loss and Go-NoGo operant detection behavior were not designed to model hyperacusis. Hearing loss is not strongly associated with hyperacusis, where many individuals have normal hearing or have a pattern of mild hearing loss that does not correspond to the frequency dependence of their auditory sensitivity (Sheldrake et al., 2015). While the excess central gain and behavioral hypersensitivity we describe here may be related to the sensory component of hyperacusis, this connection is tentative because it was elicited by acoustic trauma and because the detection behavior provides a measure of stimulus salience, but not the perceptual quality of loudness, per se.”

    1. Author Resonse

      Reviewer #1 (Public Review):

      The authors trained rats to self-initiated a trial by poking into a nose poke, and to make a sequence of 8 licks in the nose poke after a visual cue. Trials were considered valid (called "timely") only if rats waited for more than 2.5 sec after the end of the previous trial. An attempt to initiate a trial (nose poking) before the 2.5 sec criterion was regarded as "premature". The authors recorded from the dorsal striatum while rats performed in this task. The authors first show that some neurons exhibited a phasic activation around the time of port entry detected using an infrared detector ("Entry cell"), as well as port exit ("Exit cell). Some neurons showed activation at both entry and exit ("Entry and Exit cell") or between these two events ("Inside-port cell"). Fractions of neurons that fall into these four categories are roughly the same (Fig. 3C). The main conclusions drawn from this study are that (1) the activity preceding a port entry was positively correlated with the latency to initiate a trial (or "waiting time"; Fig. 4E), which appear to reflect the value upcoming reward, and that (2) in adolescent rats, the activity rose more steeply with the latency to trial initiation (Fig. 7J).

      These observations are potentially interesting, in particular, the possible difference between adult and adolescent rats is intriguing. However, this study does not examine whether this brain region actually plays a role in the task. Some of the conclusions appear to be premature.

      1) Previous studies have found correlations between the activity of neurons in the striatum and the latency to trial initiation (e.g. Wang et al., Nat. Neurosci., 2013) or action initiation more generally (e.g. Kunimatsu et al., eLife, 2018). In the former study, the trial initiation was self-generated, similar to the present study, and was modulated by the overall reward value (state value). In the latter study, the latency was instructed by a cue. Furthermore, there are many studies that showed correlations between striatal activity and future rewards (e.g. Samejima et al., Science, 2005; Lau and Glimcher, 2008). Many of these studies varied the value of upcoming reward (e.g. amount or probability). Although some details are different, the basic concepts have been demonstrated in previous studies.

      Although there are other studies linking striatal activity to trial/action initiation and reward probability, here the striatal activity preceding the execution of a learned sequence is dependent on the internal representation of the time waited. Elapsed time is the only cue the animal has regarding the possible outcome until it is too late and the trial has already been initiated. Although a light cue then tells the rat if the timing was correct or not, providing an opportunity to stop the behavior, the behavior released during premature trials resembles very closely that observed during unrewarded timely trials. This remarkable similarity between premature trials and timely unrewarded trials allowed comparing very advantageously the effect of wait time-based modulation of anticipatory striatal activity. Moreover, we have compared striatal activity between adult and adolescent rats finding a steeper wait time-based modulation of striatal activity in adolescent animals that correlates with a more impulsive behavior in these animals.

      2) The authors conclude that "in this task, the firing rate modulation preceding trial initiation discriminates between premature and timely trials and does not predict the speed, regularity, structure, value or vigor of the subsequently released action sequence". This conclusion is based on the observation that premature and timely trials did not differ in terms of kinematic parameters as measured using accelerometer. Although the result supports that the difference in activity between premature and timely cannot be explained by the kinematic variables, it does not exclude the possibility that the activity is modulated by some kinematic variables in a way orthogonal to these trial types.

      While our accelerometer data do not support that differences in movement initiation time or velocity could explain the differences in striatal activity between adolescent and adult rats, we can not rule out that kinematic variables not captured by the head accelerometer recordings could explain some of the results. This is acknowledged in the main text, results section, page 8, line 180.

      3) The firing rate plot shown in Figure 4D should be replotted by aligning trials by movement initiation (presumably available from accelometer or video recording). Is it possible that the activity rise similarly between trials types but the activity is cut off depending on when the animal enters the port at different latency from the movement initiation? In any case, the port entry is a little indirect measure of "trial initiation".

      Unfortunately, we have not systematically obtained video recordings of the sessions and only have accelerometer recordings of a few of the animals that provided the neuronal data, which precludes replotting the data as suggested. Accelerometer recordings are available from two of adult and two adolescent rats. Latency from movement initiation to port entry do not differ between premature and timely trials at both ages. This is now reported on page 8 line 175 for adult rats, and page 15 line 341 for adolescent rats. These results appear to be at odds with the idea that decreased neuronal activity in premature trials is the result of a cut-off of the response.

      4) The difference between adult and adolescent rats are not particularly big, with the data from the adolescent rats showing a noisy trace.

      New data from two adolescent rats reduced the variability and confirmed the behavioral and physiological differences with adult rats. All panels from figure 7 now include the data from 5 adolescent animals instead of 3. The number of neurons analyzed in the adolescent group passed from 552 to 876. The inclusion of these new data allowed us to perform new statistical comparisons. We adjusted a logistic function to accumulated trial initiation timing data (Fig.7N) and found that the rate of accumulation is higher in adolescent rats. Importantly, this is observed not only in the part of the curve corresponding to premature responding but also during timely responding, indicating that adolescent rats' premature responding is a manifestation of a more general behavioral trait that makes them self-initiate trials faster than adults (Fig. 7N). The noisy trace of curves showing the amplitude modulation of anticipatory activity as a function of waiting time was partly due to the relatively low number of premature trials that demanded using relatively long time bins. With more data available we have been able to replot these curves using a smaller bin size for the short waiting times (Fig. 7M). We have adjusted a logistic function to these data and observed a higher rate of increase of this activity modulation in adolescent rats, paralleling the behavioral data. Moreover, we report a significant correlation between the behavioral and neurophysiological data (a steeper rate of trial initiation times curve correlates with a steeper wait modulation of anticipatory activity, Fig. 7O). These new findings are reported in the results section, from page 17 line 405 to page 18 line 417.

      Reviewer #2 (Public Review):

      The authors conduct an ambitious set of experiments to study how neural activity in the dorsal striatum relates to how animals can wait to perform an action sequence for reward. There are a lot of interesting studies on striatal encoding of actions/skills, and additionally evidence that striatal activity can help control response timing and time-related response selection. The authors bridge these issues here in an impressive effort. Recordings were made in the dorsal striatum on several tasks, and activity was assessed with respect to action initiation, completion, and outcome processing with respect to whether animals could wait appropriately or could not wait and responded prematurely. Conducting recordings of this sort in this task, particularly in some adolescent animals, is technically advanced. I think there is a very timely and potentially very interesting set of results here. However, I have some concerns that I hope can be addressed:

      It seems like the recordings were made throughout the dorsal striatum (histology map), including some recordings near/in the DLS. Is this accurate? The manuscript is written as though only the DMS was recorded.

      We acknowledge that our recordings are spread along the medial and central regions of the dorsal striatum. Although we are not sure that there is a consensus regarding the limits of the DMS and DLS, we believe that none of our recordings are clearly located within the DLS. Following your suggestion, we have modified the text and refer to the location of our recordings as “dorsal striatum”. We believe that, as there is a lot of work on the roles of the DLS and DMS in reward learning, it is still important to refer to this work in the Introduction section and to discuss our findings in its context, particularly, since we find that most task-related activity is concentrated at the beginning and end of the task as shown in several studies focused in the DLS.

      If I understand correctly, the rats must lick 8 times to get the water. If this is true, one strategy is to just keep licking until the water comes. Therefore, the rats may not have learned an 8-lick action sequence. The authors should clarify this possibility, and if it is, to consider avoiding using phrases like "automatized action sequence" since no real action sequence might have been learned. In short, I am not convinced the animals have learned an action pattern rather than to just keep licking once a waiting period has elapsed.

      We acknowledge that the experiments do not allow us to establish if the rats know what the exact number of licks needed is; when the skill is acquired, licking becomes highly stereotyped and the rats might as well be learning a time after which continuous licking leads to reward. We still believe that the stereotyped performance, the inability to stop the behavior when the absence of the light cue unequivocally indicates that no reward will be obtained in premature trials, and the rapid decrease of lick rate after the eighth lick was emitted and no reward was obtained, support that the behavior is automatic until the time of expected reward delivery. A representative raster plot showing lick sequences during a whole session in a trained adult rat is presented in Fig. 1I and Figure 7 – supplement 1H shows an example of the licks of an adolescent rat.

      The number of subjects per group is very low. This is fine for analysis of within-animal neural activity. However, comparing the behavior between these groups of animals does not seem appropriate unless the Ns are substantially increased.

      The revised version of the manuscript includes a higher number of adolescent rats from which striatal activity and behavior were recorded, which allowed us to perform a more detailed statistical analysis of the correlations between these measures. In addition, we now include new behavioral data from an independent sample of non-implanted 6 adults and 6 adolescent rats that confirms the results obtained with the implanted animals (presented in Figure 7 – supplement 4).

      I found the manuscript difficult to decipher. There are many groups. If I understand correctly, there are the following:

      -ITI 2.5s experiment

      -ITI 5 s experiment

      -ITI2.5-5s experiment

      -ITI 2.5 s experiment (adolescent)

      -Two accelerometer animals (unclear which experiment)

      -Two animals in ITI 2.5 sec without recordings (unclear how incorporated into analyses)

      Within each group, there are multiple categories of behavioral performance. This produces a large list of variables. In some parts of the results, these groups are separated and compared, but not all groups are compared in those such sections. In other sections the different groups (all or just some?) appear to be combined for analysis, but it is not clearly described. Another consequence of mixing the groups and conditions together in analysis as they do is that some of the statements in the results are very hard to follow (E.g., line 305 "...similar behavior observed in 8-lick prematurely released and timely unrewarded trials...").

      To clarify the experimental groups, we now include a table (Table 1) summarizing which tasks were used and how many animals were trained in each task.

      Generally, it is difficult to understand the results without first understanding the details of the different tasks, the different groups of animals, and the different epochs of comparison for neural analysis. It took me a long time to work through the methods and I am still not sure I completely understand it. On this point, some sentences are very long and should be broken up into smaller, clearer sentences. There are a lot of phrases that only someone familiar with the cited articles might understand what they mean (e.g., even one paragraph starting with line 39 includes all of the following terms: automaticity in behavior; behavioral unit or chunk; reward expectancy; reward prediction errors and trial outcomes; explore-exploit; cost-benefit; speed-accuracy tradeoffs; tolerance to delayed rewards; internal urgency states). It is very hard to follow how each of these processes are to be understood in terms of behavioral measures used to study them and how they do or do not relate to the hypothesis of the present study. The discussion similarly uses a lot of different phrases to discuss the task and neural responses in a way that makes it hard to understand exactly what the author's interpretation of the data are. Is there maybe a 'most likely' interpretation that can be stated for some of the responses?

      Our main aim is to disclose the mechanisms underlying differences between adult and adolescent rats relating to impulsivity. We hope that this will become clearer in this version of the manuscript after deepening the analysis of the differences between them. We believe that our data do not allow us to unequivocally determine what is the ultimate cognitive process producing the striatal activity differences between adult and adolescent rats, i.e., differences in internal urgency states, time perception, tolerance to delayed rewards, and tried to reflect that fairly in the Discussion.

      The data set is extremely rich; there are lot of data here. As a result it can be hard to understand how all of the data relate to the main hypothesis of the article. It often reads as an exploratory set of results section rather than a series of hypothesis tests.

      We have tried to improve the overall clarity of the text.

      Reviewer #3 (Public Review):

      Cecilia-Martinez et al., implement a task that allows the study of premature versus timely actions in rats. First, they show that rats can learn this task. Next, they record the activity in the DMS showing start/stop signals in the cells recorded, next they propose that the activity detected before the release of actions sequences discriminate the premature vs the timely initiations showing a relationship between the waiting time and the activity of cells recorded, furthermore they show that it could be the expectancy of reward what could be encoded in the activity before entering the port. Last they show that adolescent rats show more premature starts than adult rats documenting a difference in activity modulation of DMS cells in the relation between waiting time and firing rate (although above the premature threshold, see comments below).

      Overall the paper is well presented describing a well-developed set of experiments and deserves publication attending only minor comments.

      1) I understand rats learn to execute sequences of <8licks or 8 licks, although diagrams are presented, no examples of the individual trials with 8 licks, neither distributions of bouts of these licks are presented.

      Rats learn to execute a lick sequence to obtain the reward. The experiments do not allow us to establish if they know what the exact number of licks needed is; when the skill is acquired, licking becomes highly stereotyped and the rats might as well be learning a time after which continuous licking leads to reward. A representative raster plot showing lick sequences in a session in a trained adult rat is presented in Figure 1I and Figure 7 - supplement 1H shows an example of the licks of an adolescent rat.

      2) Relevant to the statement: "in this task, the firing rate modulation preceding trial initiation discriminates between premature and timely trials and does not predict the speed, regularity, structure, value or vigor of the subsequently released action sequence"... It is not clear if the latency to first lick (plot 2D) and the inter-lick interval (2E) is only from the 8Lick sequences or not. If that is not the case, it is important to compare only the ones with 8Licks.

      The data are from 8 lick sequences, this is now indicated in the figure legend.

      3) Related to the implications of the previous statement, there seems to be a tendency for longer latency to first lick in timely vs premature trials in Figure 2D (timely-trials-Late vs premature-trials-late)? Again here it is important to compare the 8licks sequences only.

      Only 8-lick sequences are compared and the two-way ANOVA showed a significant effect of the training stage without significant effects of trial timing (premature versus timely) and a non-significant interaction. The average ± SEM latencies to the first lick (of the eighth lick sequence) were 0.717 s ± 0.063 for timely trials late and 0.805 s ± 0.086 for premature trials late.

      4) I could not find in the main text whether the individual points in Fig.2 (e.g. 2B-E) are individual animals. Please specify that.

      In this figure panels every individual point corresponds to the mean of a session, the data correspond to 5 adult animals (2-5 sessions per animal and timing condition). Whether the data correspond to animals or sessions is now clarified in all figure legends.

      5) Although very elegant the argument presented in Figure 4C and 6C, I wonder if the head acceleration may lose differences in movements outside the head in the two kinds of trials. If that is the case please acknowledge it.

      We acknowledge in the main text, results section, page 8, line 180, that the accelerometer does not allow us to determine if the movements of other body parts differ between trial types.

      6) Also in 4C, small separations between timely vs premature signals are seen before 0. Is there a way to know if animals in timely vs premature trials approached the entry port in the same way? This request is pertinent in order to rule out motor contribution to the differences in Figure 4A-B.

      Although it is not possible to completely rule out small movement differences between premature and timely trials, no evident behavioral differences can be detected by trained observers or by analyzing video recordings taken during some sessions. The available accelerometer recordings also suggest that a similar motor pattern is displayed in premature and timely trials (Figure 4C).

      7) when saying: "Similar results were obtained in rats trained with a longer waiting interval (Supplementary Figure 5)", "is hard to see the similarity in the premature range, while in the 2.5 seconds task there is a positive relationship in the 5 seconds task it is not.

      Please note that a positive relationship is observed for the two bins preceding trial initiation, which are about 2.75s and 1s before port entry. The bin that seems to not fit is centered 4s before port entry (1s after exiting the port in the previous trial). Because of the longer waiting time, in the 5 s task behavior becomes less organized during the first seconds after port exit, however, the modulation of activity is still observed in the bins that are close to port entry.

      8) The data showing that the waiting modulation of reward anticipation grows at a faster rate in adolescent rats is clear, however, it is not clear how it could be related to the data showing that the adolescent rats were more impulsive.

      We acknowledge that the data do not provide a causal link with behavior. After adding two new adolescent rats we have been able to study in more detail the relationship between the waiting modulation of neuronal activity and the accumulation of trial initiations (depicted in figures 7M and 7N respectively) by adjusting logistic functions to the data. The new results are explained on page 17,line 384. There is a striking parallel between the growth rate of both curves, and the curves of adolescent rats are significantly steeper than those of adult rats. Moreover, there is a significant correlation between the coefficients that mark the rate of growth of the behavioral and neurophysiological data (Fig. 7O).

      9) Related to the sentence: "the strength of anticipatory activity increased with the time waited before response release and was higher in the more impulsive adolescent rats"....One may expect to see a difference in the range of the premature time however the differences were observed in the range >2.5 seconds. Please explain how to reconcile this finding with the fact that the adolescent rats were more impulsive.

      Please, note that the more impulsive behavior of adolescent rats (and the faster growth of the wait modulation of anticipatory activity) is observed along waiting times that exceed the 2.5s criterion wait time; we added a phrase in the Results section (page 18, lines 413) and in the Discussion section (page 19, line 443) to emphasize this point. Regarding the premature trials, a related issue was raised by reviewer #1, concern 4. The addition of new data from adolescent animals allowed us to used smaller bins to better discriminate what happens at short waiting times and included an inset in Figure 7M that allows to better appreciate what happens at these intervals.

    1. Nothing gets people’s attention like something startling. Surprise, a simple emotion, hijacks a person’s mind and body and focuses them on a source of possible danger (Simons, 1996). When there’s a loud, unexpected crash, people stop, freeze, and orient to the source of the noise. Their minds are wiped clean—after something startling, people usually can’t remember what they had been talking about—and attention is focused on what just happened. By focusing all the body’s resources on the unexpected event, surprise helps people respond quickly

      It's interesting to see the the emotion of surprise no matter how composed, calm or worried you are, the feeling of surprise always affects everyone the same because you lose all that feeling of readiness when it hits you. On the other hand, surprises can sometimes show one's best moments as your whole body is reacting and focusing to the surprise, your reaction, thinking can also temporally be enhanced for that moment. I said in the last lecture that because we are different there are different results but i think this time for the emotion of surprise the background event and how unique it is what determines what emotion of surprise the person may feel.

    1. We may justly expect American men tobe as willing to grant to the women of the United States as generous consideration as those of GreatBritain have done

      This shows examples that women need change in society and their rights. I believe what this tells us is that this is what people think about women in general.

    1. Author Response

      Reviewer #2 (Public Review):

      McCoy et al. has developed a new urban tree species database from existing city tree inventories. They designed procedures to collect and clean a large amount of data, i.e., more than five million trees from 63 US cities. They found that urban trees were significantly clustered by species in 93% of cities using the compiled data. They also showed that climate significantly shaped both nativity and tree diversity. Also, they identified the homogenization effect of the non-native species. The interest in patterns of urban biodiversity and its driving mechanism has been rising recently. This paper provides an important data source for addressing research questions on this topic. The finding presented by the authors exemplified its potential. Strengths Compared to the existing urban tree database, such as the one developed by Ossola et al.(Global Ecology and Biogeography 2020), the new database added information on spatial location, nativity statuses, and tree health conditions besides occurrences. The new information expands data usability and saves valuable time for researchers. The authors also make the tools available so others can use them to process their own data sets. Because of the added information, various analyses of the diversity pattern of urban trees and the potential driving mechanism could be conducted. The authors found that individual species nonrandomly clustered urban trees. This finding corroborates the existing knowledge that some common species dominate urban trees. Nevertheless, the authors showed that the dominance was apparent in the spatial dimension. The preliminary finding that the native status of a tree had no apparent impact on tree health is interesting. It can potentially contribute to the debate on native vs. exotic in urban tree species selection, which the author mentioned in the paper.

      Thank you for the feedback!

      Weakness

      While the new database and the analysis based on it has strengths, some aspects of the concepts and data analysis need to be clarified and extended.

      We appreciate these helpful comments and have made many changes in response, detailed below.

      First, the authors need to define several critical concepts used in the paper, including city trees, urban forests, biodiversity, and species diversity. The authors used city trees and urban forests interchangeably throughout the paper. Nevertheless, a widely accepted definition of the urban forest is:"All woody and associated vegetation in and around dense human settlements." Konijnendijk et al. had a good discussion on the terminology used in urban forestry (Urban Forestry & Urban Greening, 2006). Similarly, biodiversity is different from species diversity. Effective species number is a diversity indicator. Therefore, it is challenging to accept conclusions being drawn on biodiversity in urban forests without clear definitions.

      We appreciate these clarifications– we have clarified our terminology throughout and added these important definitions.

      • “...urban forests, which are the woody and associated vegetation in and around dense human settlements (Konijnendijk et al., 2006).”

      • “City tree communities, an essential component of urban forests, provide many services.”

      We replaced the term “biodiversity” throughout the text where really we meant to say “tree species diversity” or just “diversity.”

      Second, the tree inventories varied significantly regarding the number of records (214~720,140). The variation can be due to the actual variation of tree abundance in studied cities or incomplete inventories. Biases can be introduced into the findings when comparing these inventories without adjusting the unequal sample sizes. The authors did not detail how they dealt with this issue when conducting the analysis.

      We redid all of our relevant analyses and applied Chao’s rarefaction and extrapolation techniques throughout the manuscript. The (substantial) changes are fully described above in the “Essential Revisions” section. We also copy them here.

      First, we redid all of our diversity calculations applying Chao’s rarefaction and extrapolation techniques through the R package iNext. Therefore, our summary datasheet now has many new columns to include the following values for each city:

      ○ Effective species number:

      ■ Raw effective species number

      ■ Asymptotic estimate of effective species number with confidence interval

      ■ Estimate of effective species number for a given population size (37,000 trees– the median population size rounded to the nearest 1,000) with confidence interval

      ○ Species richness:

      ■ Raw species richness (number of species)

      ■ Asymptotic estimate of number of species with confidence interval

      ■ Estimate of number of species for a given population size (37,000 trees– the median population size rounded to the nearest 1,000) with confidence interval

      ○ The same for the native-only population of trees in each city (e.g., not just raw number of effective number of native species but also the iNext estimates and confidence intervals)

      ○ Whether or not each of the values above was calculated using extrapolation or interpolation

      ○ Sample coverage estimates

      Second, we re-ran our models testing for significant correlations between species diversity in a city and other factors (including climate), where we used the extrapolated / interpolated effective species numbers from iNext. Specifically, we found the best fit model, which included the following predictors: environmental PCA1, environmental PCA1:environmental PCA2, and whether or not a city was designated as a Tree City USA. Then, we ran this model under six sensitivity conditions, varying the independent variable and/or which cities we included based on completeness of their sample. Climate was still a significant correlate of diversity.

      ○ first, with independent variable = effective species as calculated for a given population of 37,000 trees ("effective species for a standardized population size");

      ○ second, independent variable = the asymptotic estimate of the effective species number for that city as calculated using iNext;

      ○ third, the raw effective species number;

      ○ fourth, excluding cities with fewer than 10,000 trees;

      ○ fifth, excluding cities with <50% spatial coverage;

      ○ sixth, excluding cities with <0.995 sample coverage as calculated by iNext.

      ○ For the fourth, fifth, and sixth models, the independent variable was effective species for a standardized population size of 37,000 trees.

      Third, we redid our comparisons of tree populations in parks versus those in urban areas. Parks were still more diverse than urban areas.

      ○ Specifically, we used iNext to calculate diversity metrics based on the smaller of the two population sizes (park vs urban) to enable fair comparison for each city.

      ○ We reported comparison results for (i) raw effective species number, (ii) asymptotic estimate, and (iii) estimate for a given population.

      ○ In doing so, we eliminated Milwaukee from the comparison (it had only 28 trees recorded as being in an urban setting).

      Fourth, we redid our pairwise comparisons of tree community composition between cities in order to account for different population sizes and sampling efforts. To do so, we randomly subsampled the larger city to make its population equal to the smaller city, calculated comparison metrics, and repeated this process 50 times. We report the average comparison metrics.

      Our new Methods text is copied here for your convenience:

      ○ “Throughout our analyses, it was necessary to control for different sample sizes (and different, but unknown, sampling efforts across cities). To do so, we relied on the rarefaction / extrapolation methods developed by Chao and colleagues (Chao et al., 2015, 2014; Chao & Jost, 2012) and implemented through the R software package iNext (Hsieh et al., 2016). In short, these methods use statistical rarefaction and/or extrapolation to generate comparable estimates of diversity across populations with different sampling efforts or population sizes, alongside confidence intervals for these diversity estimates. iNext performs these tasks for Hill numbers of orders q = 0, 1, and 2. We used two techniques in iNext to allow for comparisons across cities (and between parks and urban areas within cities). First, we generated asymptotic diversity estimates for each; second, we generated diversity estimates for a given standardized population size. For our diversity analyses, the standardized population size we used was 37,000 trees (the rounded median of all cities). For analyses of the diversity of native trees, we used a standardized population size of 10,000 trees. For comparisons of the diversity between park and urban areas in a city, we used the smaller of the two population sizes (park or urban). In all cases we also recorded confidence estimates, and plotted rarefaction/extrapolation curves.

      ○ To control for variation in how uniformly trees were sampled across a city’s geographic range, we developed a procedure to score each city’s spatial coverage (see section Spatial Structure below).

      ○ We identified the best-fitting model, and then repeated our analysis under six sensitivity conditions to control for differences in population size, sampling effort, spatial coverage, and sample coverage. Our sensitivity analyses were as follows: first, with independent variable = effective species as calculated for a given population of 37,000 trees ("effective species for a standardized population size"); second, independent variable = the asymptotic estimate of the effective species number for that city as calculated using iNext; third, the raw effective species number; fourth, excluding cities with fewer than 10,000 trees; fifth, excluding cities with <50% spatial coverage; sixth, excluding cities with <0.995 sample coverage as calculated by iNext. For the fourth, fifth, and sixth models, the independent variable was effective species for a standardized population size of 37,000 trees.”

      Reviewer #3 (Public Review):

      This paper's strength is in the utility of the assembled datasets and some interesting and creative proof of concept analyses. This is an amazing resource for comparative analysis. However the paper felt a little sparse in the conceptual and methodological underpinnings of the questions asked to demonstrate the utility of the analysis. Specifically, I suggest:

      A) More substance in the introduction (currently only two short paragraphs) and a clear statement of research questions.

      We have added text to frame our goals and hypotheses:

      ○ “In particular, we wanted to know whether local climatic conditions are associated with the species diversity of city tree communities, how species diversity was distributed in space within cities, and whether introduced tree species contribute to biotic homogenization among urban ecosystems.”

      B) Add data on the extent to which each dataset represents a complete sample of each city's trees. I know are complete inventories, but some consist of 720 trees and cannot be a complete sample. A column in the meta data indicating effort and if there were any bias in where sampling occurred if the dataset is not complete are needed for others to use this data appropriately. For example, we know tree cover/diversity increases with wealth (which the author rightly cites). Let's say in City X, trees were only inventoried in one wealthy neighborhood. They would not be a representative sample of the city and dataset users need to be aware of this before they draw incorrect conclusions about City X where the sample was biased compared to city Y where the inventory was complete, including a sampling of all affluent and poor areas. This is also needed to support the research questions throughout the paper.

      We completely agree, and have made two major changes in response.

      First, we redid all of our diversity analyses after applying Chao’s rarefaction and extrapolation methods to permit comparison between populations of different sizes and sampling efforts. We added new columns to our datasheet with sample coverage estimates, asymptotic estimates of diversity, and diversity estimates for a given population size.

      Second, we also examined spatial coverage in a city because of the valid concern you raised that trees may only be sampled from particular neighborhoods or areas. In short, we divided each city into grid cells, counted trees per grid cell, and calculated metrics of coverage (adjusted number of trees per grid cell, and proportion grid cells that were empty) and bias (skew, kurtosis of number trees in occupied grid cells). These factors are presented in Spatial_Coverage_Supplement.zip. AS you can see even just from a glance at the spatial coverage plots, some cities are indeed extremely biased! Therefore, we ran a sensitivity analysis where we excluded cities with <50% spatial coverage.

      C) The authors chose to use effective species counts as their alpha diversity metric of choice. They explain why: "effective species counts (a measure that allows comparison between cities of different sizes)" (Ln 109). While effective species number is an excellent metric with much better behavior and attributes in linear modeling, I believe it is still strongly dependent on both city area and the number of individual trees sampled and so the above statement and all of the comparisons that flow out of it in the manuscript are currently unsupported. Just as species richness needs to be rarified or extrapolated to be compared at an equivalent # of individuals or area to be accurate so too does EFN (effective species count). Fortunately there is an R package (iNext) based on Chao's method (citation below) that makes it very easy to create effective species accumulation curves for each city by tree individuals sampled.

      a. Chao, Anne, Nicholas J. Gotelli, T. C. Hsieh, Elizabeth L. Sander, K. H. Ma, Robert K. Colwell, and Aaron M. Ellison. 2014. "Rarefaction and extrapolation with Hill numbers: a framework for sampling and estimation in species diversity studies." Ecological Monographs 84 (1): 45-67. https://doi.org/https://doi.org/10.1890/13-0133.1.

      b. The standardization (rarefaction/extrapolation) of EFN or richness for # individual trees sampled needs to be made for all analyses that make claims to compare diversity metrics across cities or between groups like urban and park areas (i.e. Fig 2a,b,c; Fig 3b; Fig 5a,b, S1a, S2a, S5, Table S2)

      c. If the authors have an argument for why diversity/area or diversity/sampling effort relationships do not apply for a particular question, then they should make that case instead.

      We very much appreciate this suggestion. Indeed, as described above, we applied Chao’s method to all of our analyses.

      D) The question posed by the Beta diversity analysis is fascinating (i.e. is it non-native species that are driving biotic homogenization across species. However, while frequency (which I assume is relative abundance but maybe it is incidence data- please define) is used to deal with different sample sizes consider whether it makes sense to include incomplete, or very small city datasets in the analysis even with frequency data. For example one city only has ~720 trees listed. If this is an incomplete dataset which seems likely, it will probably be much more differentiated (overlap less) from another city with small numbers simply due to incomplete sampling. Diversity analysis in cities always requires tradeoffs and cannot be identical to methods used in "natural" forested ecosystems, but I encourage the authors to explore this a bit. Perhaps a sensitivity analysis could help where incomplete or small sample sizes are dropped or datasets are resampled via random draw to equalize sizes? The latter would handle incomplete samples but would not deal with bias in which neighborhoods were sampled (see point B above).

      Great suggestion. We redid this analysis using a random drawn approach, as you suggested, to equalize sizes. The new analysis found the same results as our old analysis, with slightly different values. The new method is described here:

      ○ “How similar are species compositions across cities? For N = 1953 city-city comparisons of street tree communities, we could calculate weighted measures of similarity because we had frequency data. We calculated similarity scores for the entire tree population, the naturally-occurring trees only, and the introduced trees only. We used chi-square distance metrics on species frequency data, and we controlled for different population sizes (and potentially, sampling efforts) between cities by sub-sampling the larger city 50 times to match the smaller city’s tree population size and calculating average metrics. In this manner we controlled for differences in sample size.”

      E) Additional context/conceptual underpinning the clustering analysis would be great.

      a. The authors state in Line 390-395:"For city trees, which are often organized along grids or the underlying street layout of a city, this method can more meaningfully cluster trees than merely calculating the meters between trees and identifying nearest neighbors (which may be close as the crow flies but separated from each other by tall buildings)."- I very much agree with this sentiment and it is biologically meaningful for animal and plant dispersal, but as written it is unclear to me how the method described in the text "knows" that a tall building or elevation or some sort of feature exists to separate clusters rather than empty space or a ball field. Please clarify.

      We appreciate these comments, and we have added text and references for the interested reader. Here is the new description in full:

      ○ “We wanted to quantify the degree to which trees were spatially clustered by species within a city (rather than randomly arranged). To do so, we first clustered all trees within each city using hierarchical density based spatial clustering through the hdbscan library in Python (McInnes et al., 2017). HDBSCAN, unlike typical methods such as “k nearest neighbors”, takes into account the underlying spatial structure of the dataset and allows the user to modify parameters in order to find biologically meaningful clusters. For city trees, which are often organized along grids or the underlying street layout of a city, this method can more meaningfully cluster trees than merely calculating the meters between trees and identifying nearest neighbors (which may be close as the crow flies but separated from each other by tall buildings). In particular, using the Manhattan metric rather than Euclidean metrics improves clustering analysis in cities (which tend to be organized along city blocks). For further discussion of why hbdscan is preferable to other clustering metrics, see (Berba, 2020; Leland McInnes et al., 2016; McInnes et al., 2017).”

      b. Would you ever expect composition to be truly random either in a city or a natural forest given environmental conditions etc.? In some sense, the ones closest to random are the most surprising. Can you dive into one to give an example of what is going on in that city?

      c. It seems like there are two metrics here- the size of the cluster and then the observed/expected EFN per cluster. The latter is analyzed in this paper but is there any important information in the former? It seems like an interesting structural measurement of the city and possibly useful in its own right.

      d. Are there any target levels of randomness? Could the authors suggest how this might be determined moving forward with their datasets to illustrate this for foresters?

      Great points. We have given a lot of thought to your comments– these are large and interesting questions!! In the end, I think these questions fall mostly beyond the scope of this study, but we added a substantial amount of text to address your comments:

      ○ “Clustering by species is not necessarily a negative, nor indeed should we necessarily expect trees to be randomly arranged (see suggestions for further research in “Future Analyses” section below). Here, we take a first step toward making spatial clustering a metric of interest in city tree planning.”

      ○ “Researchers could also use this dataset to perform more refined analysis of clustering. For example, what is the biological significance of variation in cluster size (as determined by the hdbscan clustering algorithms)? The size and arrangement of the clusters themselves may be useful metrics. How clustered should we expect trees to be in both wild and urban settings? That is, what our are null expectations? Further, researchers could apply network theory to predict how pest species would proliferate through each of these cities (depending on the spatial arrangement of pest-sensitive trees).”

      F) The statement that this dataset enables "the design of rich heterogenous ecosystems built around urban forests" (Ln 72) seems strange. To my mind this tool will enable a more nuanced evaluation of the urban forests that already exist and suggest ways to target future plantings for increased resilience to climate, pest resistance, biodiversity support etc. I don't understand what ecosystem you would build around and not in the urban forest. If this is what is meant please elaborate. For example, do you mean non-tree installations?

      We agree with you and have changed the text as follows:

      ○ “With these tools, we may evaluate existing city tree communities with more nuance and design future plantings to maximize resistance to pests and climate change. We depend on city trees.”

    1. Put simply, conservatives hope that Twitter will now become a more willing vehicle for right-wing propaganda. Even if the platform tilts further in their direction, they will be motivated to continue to insist they are being censored—their criticisms likely exempting Musk himself in favor of attacking Twitter’s white-collar workers, whom conservatives paradoxically perceive as the “elite” while praising their billionaire bosses as populist heroes.

      This claim takes into account that those on the right wing are viewing the purchase of Twitter as a new way to push their narrative on this social media platform. However, they will still push the narrative that they are being censored. This is all a big scheme to make their followers view them as an oppressed group who are being silenced. I think it is important for platforms like this to implement free speech but we have to fact check sources and this is something I feel conservatives may not be taking into consideration.

  4. Aug 2022
    1. For instance, the emissions saved from living car-free may be lower than we calculated if public transit replaces car travel instead of biking or walking (living car free represents all the emissions associated with the life cycle of owning a car in our methodology).

      This contributes to the imperative idea that we talked about on the first day of classes. The imperatives that we have to keep in mind do not only think about the planet and what is best in the long run, but rather the needs of humans and the lives we are currently accustomed to as well. By substituting rather than completely cutting out certain routine actions, it would still result in emissions, but significantly less than those previous to the substitution

    1. Wouldn’t we all love to agree on a comprehensive worldview, consistent with science, that tells us how to behave individually and collectively?

      Personally I had this view about language before, wouldn't it be nice if we all spoke the same language. As I've grown, I realized that even though it may be convenient, I think that's the beautiful part about humans and society, that we coexist despite our differences. I think it would make us more of a dystopia than a utopia if we were all the same.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this paper, Staneva et al describe a novel complex found at RNA PolII promoters that they term the SPARC. The manuscript focuses on defining the core components of the complex and the pivotal role of SET27 in defining its function, and role in PolII transcription. This manuscript is a logical follow on from an initial paper (Staneva et al, 2021) by the same authors where they systematically analyzed chromatin factors, and their role in both transcription start and termination. What is also very clear, is that this complex is one made of histone readers and writers which suggests its function is to change the chromatin structure around a PolII promoters. The authors show that this complex is necessary for the correct positioning of PolII and directionality of transcription.

      This was a well-designed study and well written and clear manuscript that provides fascinating insight transcription control in bloodstream form parasites.

      I have no major comments only a few minor ones.

      1) Localisation of the different SPARC components appears to be either nuclear or nuclear and cytoplasmic. - Both SET27 and CRD1 show a nuclear and cytoplasmic localisation in the bloodstream form IFA (Supplementary Fig 1B), but only a nuclear localisation procyclic form.

      Did the authors attempt C terminally tagging SET27, CRD1 to see if this resulted in a change in the pattern?

      We have not tagged either protein at the C terminus, however SET27 (Tb927.9.13470) has been tagged both N- and C-terminally in procyclic form (PF) cells as part of the TrypTag project (http://tryptag.org). In both cases, SET27 localized to the nucleus, suggesting that the differences in localization we observe for SET27 depend on the life cycle stage, and not on the position of the tag. One caveat is that in the TrypTag project proteins are tagged with mNeonGreen whereas in our study proteins were tagged with YFP. Based on our images, CRD1 appears to be predominantly nuclear in both bloodstream form (BF) and PF parasites. CRD1 (Tb927.7.4540) has been tagged only N-terminally in PF cells as part of the TrypTag project where it has also been classified as mostly nuclear with only 10% of cells showing cytoplasmic localization for CRD1.

      We are well aware that tags can alter the behaviour of a protein. Absolute confirmation of location will require the generation of antibodies that detect untagged proteins. However, this is a longer-term undertaking. We have added the following statement to the Results section to address the point raised:

      “We tagged the proteins on their N termini to preserve 3′ UTR sequences involved in regulating mRNA stability (Clayton, 2019). We note, however, that the presence of the YFP tag and/or its position (N- or C-terminal) might affect protein expression and localization patterns”.

      • The point is made that JBP2 shows a 'distinct cytoplasmic localisation' in PF cells. by this logic, the SET27 localisation in BF is also distinctly cytoplasmic and a nuclear enrichment is not clear.

      Indeed the reviewer is correct - we have inadvertently over accentuated the significance of this difference in the text. We had emphasized the predominantly cytoplasmic localization of JBP2 in PF trypanosomes as potentially related to its weaker association with other (predominantly nuclear) SPARC components in the mass spectrometry experiments. The presence of SET27 in the nuclei of both BF and PF cells is confirmed by a positive ChIP signal. We have revised the manuscript text by changing “distinct cytoplasmic” to “predominantly cytoplasmic” to describe JBP2 localization in PF cells. We hope that this resolves the issue.

      • Why would the localisation pattern change between life cycle stages? Surely PolII transcription should remain the same?

      Although our analysis suggests that there may be some shift in SET27 and JBP2 localization between BF and PF stages, sufficient amounts of these proteins may be present in the nucleus for proper SPARC assembly and RNAPII transcription regulation in both life cycle forms. The proportion of SET27 and JBP2 proteins that localizes to the cytoplasm may have functions unrelated to transcription.

      2) Several of the images in Supplementary Fig 1B seem to show foci in the nucleus (CSD1, PWWP1, CRD1). Do you see foci throughout the cell cycle or just in G1/S phase cells as shown here?

      We have not systematically investigated protein localization at different cell cycle stages, so we do not have microscopy images for all proteins at all stages of the cell cycle. However, the images we did collect suggest the punctate pattern is preserved for CRD1 in the G2 phase in both BF and PF cells (see below) as we showed in Supplemental Figure S1B for cells with 1 kinetoplast and 1 nucleus (G1/S phase cells). The significance of these puncta remains to be determined.

      3) In Figure 6, what does 'TE' stand for?

      TE denotes transposable elements. We have added this to the figure legend.

      4) The authors show this interesting link between SPARC complex and subtelomeric VSG gene silencing. -In the CRD1 ChIP or RBP1 ChIP, are there any other peaks in telomere adjacent regions in the WT cells similar to that seen on chromosome 9A? And does the sequence at this point resemble a PolII promoter?

      Apart from peaks located on Chromosome 9_3A, there are other CRD1 and RPB1 ChIP peaks in chromosomal regions adjacent to telomeres in WT cells. We observed broadening of RPB1 distribution in these regions upon SET27 deletion, similar to what we show for Chromosome 9_3A. In particular, wider RPB1 distribution on Chromosome 8_5A coincides with upregulation of 10 VSG transcripts. These two loci explain most of the differentially expessed genes (DEGs) detected, but other subtelomeric regions show a similar pattern. We have added the following statement to the Results section to highlight that the phenotype shown for Chromosome 9_3A is not unique:

      “We also observed a similar phenotype at other subtelomeric regions, such as Chromosome 8_5A where 10 VSGs and a gene encoding a hypothetical protein were upregulated upon SET27 deletion (Supplemental Table S3)”.

      Cordon-Obras et al. (2022) have recently defined key sequence elements present at one RNAPII promoter. We searched for similar sequence motifs but failed to identify them as underlying CRD1 and RPB1 ChIP peaks, highlighting the likely sequence heterogeneity amongst trypanosome RNAPII promoters. To address this point, we have added the following sentence to the Discussion:

      “Sequence-specific elements have recently been found to drive RNAPII transcription from a T. brucei promoter (Cordon-Obras et al., 2022), however, we were unable to identify similar motifs underlying CRD1 or RPB1 ChIP-seq peaks, suggesting that T. brucei promoters are perhaps heterogeneous in composition”.

      -In the FLAG-CRD1 IP (Figure 3B), the VSG's seen here are not represented (as far as I can tell) in Figure 6B and C. If my reading is correct could, is this a difference in the FC cut off for what is significant in these experiments?

      The VSGs detected in the FLAG-CRD1 IP from set27D/D cells are indeed different from the ones shown in Figure 6 (even after setting the same fold change cutoffs). We have highlighted this by adding the following statement to the Results section: “Gene ontology analysis of the upregulated mRNA set revealed strong enrichment for normally silent VSG genes (Figure 6B-D) which were distinct from the VSG proteins detected in the FLAG-CRD1 immunoprecipitations from set27D/D cells (Figure 3B)”.

      The VSGs in the mass spectrometry experiments likely represent unspecific interactors of FLAG-CRD1. To clarify this, we have added the following statement to the Results section: ”Instead, several VSG proteins were detected as being associated with FLAG-CRD1 in set27D/D cells, though it is likely that these represent unspecific interactions”.

      Reviewer #1 (Significance (Required)):

      Trypanosomes are unusual in the way that they transcribe protein coding genes. Recent advances have defined the chromatin composition at the TSS and TTS, and the recent publication of a PolII promoter sequence(s) further adds to our understanding of how transcription here is regulated. Defining the SPARC complex now add to this understanding and highlights the role of potential histone readers and writers. I think that this will be of interest to the kinetoplastid community especially those working on control of gene expression.

      Our lab studies gene expression and antigenic variation in T. brucei.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, the authors identify a six-membered chromatin-associated protein complex termed SPARC that localizes to Transcription Start Regions (TSRs) and co-localizes with and (directly or indirectly) interacts with RNA polymerase II subunits. Careful deletion studies of one of its components, SET27, convincingly show the functional importance of this complex for the genomic localization, accuracy, and directionality of transcription initiation. Overall, the experiments are well and logically designed and executed, the results are well presented, and the manuscript is easy to read.

      There are a few minor points that would benefit from clarification and/or from a more detailed discussion:

      1) The concomitant expression of many VSGs (37) in a SET27 deletion strain is remarkable and has important implications for their normally monoallelic expression. It is well established that VSG expression in wild-type T. brucei can only occur from one of ~15 subtelomeric bloodstream expression sites, which include the ESAGs. This result implies that VSG genes are also transcribed from "archival VSG sites" in the genome, not only from expression sites. Are there VSGs from the silent BESs among the upregulated VSGs? Is there precedence in the literature for the expression of VSGs from chromosomal regions besides the subtelomeric expression sites?

      Our analysis of differentially expressed genes (DEGs) revealed that 43 VSG genes (37 of which are subtelomeric) and 2 ESAG genes are upregulated in the absence of SET27. Both ESAGs but none of the upregulated VSGs in set27D/D cells are annotated as located in BES regions. While it is possible that recombination events have resulted in gene rearrangements between the reference strain and our laboratory’s strain, at least some of the upregulated VSGs are likely to be transcribed from non-BES archival sites. VSG transcript upregulation from non-BES regions was also recently described by López-Escobar et al (2022).

      We note that the upregulated mRNAs in set27D/D are still relatively lowly expressed (Figure 6C). This is presumably insufficient to coat the surface of T. brucei, and expression from BES sites instead may be required to achieve this. We have revised the manuscript Discussion section to make these points more clear:

      “Bloodstream form trypanosomes normally express only a single VSG gene from 1 of ~15 telomere-adjacent bloodstream expression sites (BESs). In contrast, in set27D/D cells we detected upregulation of 43 VSG transcripts, none of which were annotated as located in BES regions. Recently, López-Escobar et al (2022) have also observed VSG mRNA upregulation from non-BES locations, suggesting that VSGs might sometimes be transcribed from other regions of the genome. However, the VSG transcripts we detect as upregulated in set27D/D were relatively lowly expressed (Figure 6C) and may not be translated to protein or be translated at low levels compared to a VSG transcribed from a BES site”.

      2) The role of SPARC in defining transcription initiation is compelling. It's less clear to the reviewer if the observed transcriptional silencing within subtelomeric regions can also ascribed to SPARC. Have the authors considered the possibility that some components of the SPARC may be shared by other chromatin complexes, which could be responsible for the transcriptional activation of silent genes in SET27 deletion mutants?

      We cannot rule out indirect effects through the participation of some SPARC components in other complexes operating independently of SPARC. Indeed, the transcriptional defect within the main body of chromosomes appears to be somewhat different from that observed at subtelomeric regions, particularly with respect to distance from SPARC. We have added a statement in the Discussion section to highlight the possibility raised by the reviewer:

      “However, an alternative possibility is that transcriptional repression in subtelomeric regions is mediated by different protein complexes which share some of their subunits with SPARC, or whose activity is influenced by it”.

      3) The authors mention that the observed interaction of FLAG-CRD1 with VSGs in the immunoprecipitations (Fig. 3B) is evidence for the actual expression of normally silent VSGs on the protein level. This is true, but it should be spelled out that this interaction is nevertheless likely an artifact, at least the physiological relevance of these interactions is questionable.

      We agree that these are likely background associations and have added the following statement to the Results section to clarify this point:

      “Instead, several VSG proteins were detected as associated with FLAG-CRD1 in set27D/D cells, though it is likely that these represent unspecific interactions”.

      To avoid unnecessary confusion we have also removed the following sentence from the revised Discussion:

      “The interactions of FLAG-CRD1 with VSGs in the affinity selections from set27Δ/Δ cells indicate that some of the normally silent VSG genes are also translated into proteins in the absence of SET27”.

      4) "ophistokont" is misspelled in the introduction

      Thanks for noticing. We have corrected it to “Opisthokonta”.

      Reviewer #2 (Significance (Required)):

      The manuscript by Staneva et al. addresses the fundamental regulatory mechanism of gene transcription in the protozoan parasite Trypanosoma brucei, a highly divergent eukaryotic organism that is renowned for unusual features and mechanisms in gene regulation, metabolism, and other cellular processes. While post-transcriptional regulation is prevalent and relatively well established in T. brucei, much less is known about the mechanism of transcription initiation and transcriptional control, in part due to the general paucity of well-defined conventional promoter regions in this organism (only very few have been identified thus far). In this context, the work by Staneva et al. is highly significant and represents an important contribution to the field of gene regulation and chromatin biology in T. brucei and other related kinetoplastid parasites.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewers 1 and 2 are very positive about our manuscript, while reviewer 3 is surprisingly critical.

      However, except for the first observation, most of reviewer 3´s comments are based on incorrect interpretations of our results.

      We have integrated the useful comment into our revised version, and we will discuss in the following sections why reviewer 3’s remaining criticisms should be disregarded.

      Reviewer 1:

      Reviewer 1 has only minor suggestions and is satisfied that we prove convincingly our claims. The reviewer also finds our results reinforce our previously proposed hypothesis that the glands and the trachea evolved from common metamerically repeated ancient primordia.

      We have introduced the following changes to the text to accommodate Reviewer’s 1 minor suggestions.

      Main suggestion: Insert a paragraph in discussion explaining the relevance of new insights to more basal insects that do not form a ring gland.

      RESPONSE:We have introduced at the end of Discussion the following paragraph:

      “Our analysis of snail activation in the CA and PG shows that these glands and the trachea share similar upstream regulators, reinforcing the hypothesis that both diverged from an ancient segmentally repeated organ. In Drosophila melanogaster the CA and the PG primordia experiment a very active migration after which they fuse to the corpora cardiaca forming the ring gland (Sanchez-Higueras and Hombria, 2016). This differs from more basal insects where the CA fuses to the corpora cardiaca but not to the PG, and from the Crustacea where the three equivalent glands are independent of each other (Chang and O'Connor, 1977; Laufer et al., 1987; Nijhout, 1994; Wigglesworth, 1954). As the mechanisms we here describe relate to the early specification of the glandular primordia in Drosophila, it will be interesting to investigate if the equivalent genes are also involved in the endocrine gland specification of more distant arthropods”.

      Additional comment 1: Introduction, pg 3, a paragraph starting with "In comparison to the extensive knowledge we have of ..." - consider omitting or greatly shortening, this text breaks a flow as it is focused on tracheal development. I understand the authors' logic, but this information distracts from the main focus on CA and PG. RESPONSE:We agree that the trachea description paragraph breaks the flow of the introduction to gland development. As suggested by the reviewer, we have deleted most of the descriptive text on trachea development but left all the references so that interested readers can find the information.

      Additional comment 2: Beginning of discussion, pg 11: - change 2nd sentence to: " Our results indicate that the HH and the Wnt pathways act indirectly to negatively regulate the spatial activation ..." - the following sentence, starting with "Engrailed activation off hh transcription ...." is way too long and hard to follow, consider breaking into two sentences. RESPONSE:We have changed both sentences as suggested

      Additional comment 3: In Fig 4B, mx and lb segments should be labeled so this panel is consistent with labeling in 4A. RESPONSE:We have changed Fig.4B labels to be consistent with 4A

      Additional comment 4: In Fig 6, reduce a font size for labels on right-hand side (A1, A2, A1+A2 proximal, etc), so that they are visualy distinct from panel labels on left-hand side (A, B, C,..).

      RESPONSE:We have changed Fig.6 Font size as suggested

      Reviewer 2

      The reviewer is positive and agrees that the results we present in “this paper add to our understanding of how the CA and PG primordia are specified and highlights important similarities with the specification of the tracheal primordia”. The reviewer’s comments focus specially on the activation vs. maintenance of sna.

      Specific Comment a): Referring to Fig 1G-J, the reviewer says: It is not clear to me from either this figure or from the text whether the initial pattern of expression of the sna-rg reporter in stage 11 embryos is WT and then disappears at stage 12, or whether it is always defective. In trying to understand the activation process, I think it would be important to know for sure whether rg enhancer activity during the initiation phase in stage 11 is WT or not.

      RESPONSE: As suggested by the reviewer, we have included st11 embryos in Fig. 1 as panels G,J which illustrate that early sna-rg activation occurs normally in snaΔrgR2embryos prior to apoptosis kicking in. To make space for these images, we have taken out the st12 embryos that we had in our previous submitted version. This does not affect the manuscript’s message, as st12 phenotypes are similar to those at st13 which are presented in Fig. 1H,J.

      Moreover, in this revised version, the embryos in Fig. 1G-J have also been double stained with the apoptosis marker DCP1 to highlight the cell death observed in the gland primordia of snaΔrgR2 embryos (Fig. 1G’-J’).

      Specific Comment b) The authors argue that the rg deletion removes the only region driving sna expression in CA/PG. I'm not convinced that necessity necessarily implies sufficiency with respect to the requirements for rescue. While the sna-rg reporter is expressed in a pattern that seems to mimic the endogenous gene, do we know that a rg-sna transgene would fully rescue the rg deletion mutant?

      RESPONSE: In our previous paper (Sanchez-Higueras 2014) we presented evidence that in sna null embryos, a Snail BAC gene lacking the sna-rg CRM can fully rescue the mesoderm phenotypes but not the ring gland ones. This proved that in the BAC transgene there was no shadow CRM capable of rescuing the gland formation in the absence of sna-rg. In the current paper we show that deleting the endogenous sna-rg CRM in the sna locus results in the absence of sna transcription from the gland primordia.

      Making a sna-rg- construct expressing sna to test if this rescues the snaΔrgR2 homozygous mutants could be done, but it will delay this publication without adding much to the paper: we already know that sna-rg is sufficient to drive activation in all the CA and the PG cells (Sanchez-Higueras 2014 Fig 2J-M) and it would be expected to rescue the gland formation in snaΔrgR2 homozygous mutants.

      Having said that, we have changed the wording in the manuscript to one that may be acceptable to the reviewer.

      Instead of:

      “These results prove that snaΔrgR2 deletes the only regulatory region driving sna expression in the CA and PG gland primordia…”

      We now say:

      “These results prove that the snaΔrgR2 deletes mutation inactivates the only regulatory region driving sna expression in the CA and PG gland primordia…”

      Specific Comment c) is Sna required for maintaining sna expression?

      RESPONSE:This experiment is relevant to the maintenance mechanism of sna expression in the ring gland, and not to its activation which is the main focus of this paper.

      The search for the maintenance mechanisms is currently been followed in the laboratory and we prefer not deal with it in this paper. Providing a negative answer to this question would not be satisfactory, as we would need to search for the factors controlling sna’s maintenance.

      Specific comment d) The authors show that there is an expansion in the number of sna-rg reporter expressing cells along the AP axis when upd is ectopically expressed using a sal-Gal4 driver. Though not mentioned in the text at this juncture, sal is expressed in the PG primordia, while seven-up (svp) is expressed in the CA primordia. I assume that the upd induced expansion is only observed for the PG primorida (LB) and not the CA primordia (Mx)-at least this is what the figure looks like. (…) How about svp driven upd-assuming there is a svp-Gal4 driver-does it cause an expansion of Ca but not PG.

      RESPONSE: As the reviewer has noticed, there is a stronger expansion of sna-rg-GFP expression in the labial segment than in the maxillary segment. This is not due to the use of the sal-Gal4 line. We see the same effect with arm-Gal4 which drives similar expression on the maxilla and the labium. To illustrate this point, we have included two new panels (Fig.5D-E) where the ectopic expression of Upd has been induced with arm-Gal4. These embryos have been stained with anti-Sal to label the PG. This experiment shows clearly that the PG has expanded much more than the CA.

      There are several reasons why expansion of the glands could be more efficient in the labium than in the maxilla. One possible reason is the temporal response to Upd activation. Upd induction by the arm-Gal4 and sal-Gal4 lines may occur after the cells in the maxilla are no longer capable of activating sna-rg but still capable of activating it in the labium. This temporal hypothesis is based on our results showing that the CA expresses more transiently the upd gene and that STAT activation lasts for longer in the labium than in the maxilla (Fig. 4A-D)].

      A second possibility, that we favour, is the existence of dorso-ventral repressor genes modulating sna-rg expression intrasegmentally. Some of our results point towards the sna-rg CRM receiving repressor inputs that modulate intrasegmental spatial expression in the dorso-vental axis. When we delete the A2 distal region of the sna-rg enhancer, its expression in the labium expands ventrally (Fig. 6E,G and Sup.Fig. 4D). If a similar repressor was also modulating sna-rg in the maxilla it could be blocking its expansion. However, at this stage we have no solid data to support any of these hypotheses. As explained before for the maintenance mechanisms of sna-rg expression, our ongoing work aims to isolate and characterize further elements controlling the ring gland gene network, including these negative regulators.

      In the revised manuscript we now describe the different effects of Upd ectopic activation on the expression of sna-rg in the maxilla and the labium (underlined text is new to this revised version):

      “To test if generalised Upd expression in the maxilla and labium can activate sna-rg expression independently of other upstream positive or negative inputs, we induced UAS-upd with either the sal-Gal4 or the arm-Gal4 lines. We observe that, these embryos have expanded sna-rg expression along the antero-posterior axis in the maxillary and labial segments (Fig. 5C). Analysis of Sal expression, which labels the PG primordium (Sanchez-Higueras et al., 2014), shows that Upd ectopic expression induces a moderate expansion of the CA primordia while resulting a much larger increase of the PG primordium (Fig. 5D-E). This expansion occurs mostly in the anterior and posterior axis from cells where the Hh and the Wnt pathways are normally blocking sna-rg expression, while expansion is less noticeable in the dorso-ventral axis. This indicates that most of the antero-posterior intrasegmental inputs provided by the segment polarity genes converge on Upd transcription but that the dorso-ventral information is registered downstream of Upd.”

      The differential response of sna-rg to Upd activation in the maxillary and labial segments is also mentioned in Fig. 5 legend. (see Continuation comment d).

      * Continuation comment d) “It looks to me also like the vvl domain is expanding as well. This information should be clarified.*

      RESPONSE: Yes, ectopic upd expression also expands vvl1+2 expression. We have previously published that vvl1+2 is a direct target of JAK/STAT signalling in the trachea (Sanchez-Higueras 2019 and Sotillos et al. 2010 Dev.Biol). Although vvl1+2 expands dorsally in the Mx, those cells do not activate sna-rg dorsally. The ventral restriction of sna-rg in the maxilla is controlled by Dfd while in the labium its dorsal expression depends on Scr. We explain this in Fig.5’s figure legend where we now say (underlined text is new to this revised version):

      (C) Ectopic Upd expression driven with sal-Gal4 induces ectopic sna-rg and vvl1+2 expression in the gnathal segments, which for sna-rg is more pronounced in the labium than in the maxilla. Note that in the maxillary segment Upd can induce ectopic dorsal vvl1+2 but not sna-rg expression, this is expected as Dfd only induces sna-rg ventrally in the maxilla. (D-E) sna-rg-GFP embryos stained with anti-GFP (green) and anti-Sal (red). In control embryos (D) Sal labels the PG primordium but not the CA. In arm-Gal4 embryos ectopically expressing Upd, the PG is more expanded than the CA as shown by number of cells co-expressing Sal and GFP.

      Specific Comment e) The authors note a difference between CA and PG in the requirement for STAT binding sites in the enhancers. Is that related to the fact that svp is expressed in CA and sal is expressed in PG? Would driving svp expression using the sal-Gal4 driver maintain sna-rg expression.

      RESPONSE: During our preliminary ongoing experiments on sna maintenance mechanisms we looked in svp mutants and did not notice a change in sna-rg expression, thus it is unlikely that Svp is responsible for the difference. As said above, we continue looking for genes involved in gland formation. Sal could be involved in the maintenance of sna in the PG, but as Sal is expressed in the maxilla and labial segments before gland formation, it is difficult to disentangle if Sal is required for sna activation or maintenance (or both).

      Specific Comment f) Do svp or sal have a role in initiating sna expression when upd is present or maintaining sna expression after upd disappears? Presumably there is already published data that would answer these questions.

      RESPONSE: As explained above we did not find any effect of svp on activation of sna-rg, however we find that in sal mutants the labium does not express sna-rg. This shows that sal is likely to be another positive input. As in sal mutants both trh and Ubx become ectopically expressed in the Lb (Casanova1989 Roux's archives of developmental biology 198: 137-140; Castelli-Gair 1998 IJDB42:437-444) we have done the experiment in sal trh double mutants and in sal Ubx,abdA,Abd-B mutants. In both cases we still see a failure of sna activation in the Lb reinforcing the idea that Sal is an additional positive input. However, we prefer not to add the sal experiments as they would complicate the paper which currently focuses on the similar requirement of the Wnt, Hh and JAK/STAT signalling pathways.

      Reviewer 3

      Reviewer is very critical. We accept some of the points raised and have modified the manuscript accordingly. However, as we detail below, the most serious criticisms are incorrect and do not affect the conclusions reached by our work.

      We agree with the following comment:

      “In the Dfd Scr double mutant, both the CA and PG expression of the snail-rg-GFP reporter is still there - admittedly, the gland cells look abnormal at late stages, but this reporter that is supposed to function as a proxy for gland induction is still expressed. That either means that expression of sna-rg-GFP is not a proxy or that the glands are still being specified in the absence of the Hox genes that are proposed to specify these organs. The reporter should not be expressed if these Hox genes are what specify these endocrine organs.”

      RESPONSE: The reviewer has made a good observation. The expression of sna-rg-GFP is not completely absent in Dfd Scr mutant embryos (Fig. 5F in this revised version), which indicates that although the Hox genes are required to activate upd in the maxilla and labium and in their absence the gland primordia become apoptotic, there must be other positive inputs to the enhancer. However, this does not mean the Hox gene input is irrelevant for gland specification. Not only the Hox genes are required to keep normal levels of upd expression in the Mx and Lb primordia and gland viability, but previously we also showed that cephalic Hox genes influence the dorso-ventral position inside the vvl1+2 expressing cells where the sna-rg enhancer is activated: in the maxilla Dfd induces the ventral vvl1+2 expressing cells to activate sna-rg, while in the labium Scr induces the dorsal vvl1+2 cells to activate sna-rg (Sanchez Higueras 2014). The data presented in this paper indicate that the input of both Dfd and Scr over sna-rg CRM activation are indirect.

      As a result of the reviewer’s criticism, we have tested if the additional positive input could be provided by Ci. In our previous submitted version, we showed that the repressor form of Ci blocks sna-rg activation. In this revised version, we have tested what is the effect of expressing the activator form of Ci. In embryos overexpressing the activator CiPKA isoform, we have observed that the expression of sna-rg and upd are expanded, indicating that Ci can provide the additional Hox-independent positive input. In the revised version we present these new results as Fig.3G and Fig. 4I. We have modified accordingly the scheme that appears in panel 3I to include this. In the main text we describe the result in the Hh regulation section where we have added:

      “Although the above results indicate Ci is not absolutely required for sna-rg expression, we observed that overexpression of CiPKA, the active form of Ci, causes a non-fully penetrant expansion of sna-rg expression (Fig. 3G) suggesting the possibility that sna-rg may be responsive to Ci and to a second activator.”

      … and in the “Regulation of Upd ligand expression by the Wg and Hh pathways” section

      where we say:

      “We also found that ectopic expression of the activator Ci protein results in a non-fully penetrant expansion of upd expression in stage 10 embryos (Fig. 4H-I).”

      We have also modified the final scheme in Fig. 7 to mention that Dfd and Scr prevent the apoptosis of the gland primordia, and that there must be an additional positive input controlling upd activation besides the Hox input. However, in the figure we do not define Ci as the activating input as we would like to have additional evidence before making such claim.

      To clarify that the Hox input is not absolutely required we have modified the text in several places. Where we said:

      “Expression of the sna-rg reporter in the maxilla and the labium requires Dfd and Scr function …”

      We now say:

      “Development of the CA and PG and normal expression of the sna-rg reporter in the maxilla and the labium require Dfd and Scr function …”

      We also mention this in Fig. 5 legend where we have added:

      “In Dfd Scr mutant embryos (F), although the gland primordia become apoptotic, residual GFP expression indicates that there must exist Hox independent inputs activating the sna-rg enhancer.”

      As a result of reviewer 3’s comment, we have noticed a further example of similarity between the gland and the trachea specification, which we have commented in the revised discussion where we added the following paragraph:

      “Another interesting similarity between glands and trachea is that, although ectopic Hox gene expression can ectopically induce sna-rg and trh outside their normal domain, the lack of Hox expression does not completely abolish their endogenous expression, indicating that in both cases a second positive input can compensate for the absence of Hox mediated activation. Our results suggest that, in the glands, this redundant input could be provided by the activating Ci form (Figs. 3G and 4I), but further analysis to confirm this possibility and discard alternative sna-rg activators should be performed.”

      We disagree with the following comments:

      The finding that the CA and PGs form in slightly different DV positions from each other and slightly different DV positions from the trachea (based on the vvl1+2 mCherry reporter staining combined with that of the sna-rg-GFP reporter staining in Figure 5A, where staining does not overlap except where the CA cells have started to migrate over the vvl1+2 mCherry expressing cells) argues pretty strongly against the CA and PG being homologous to each other or absolutely homologous to the trachea primordia

      RESPONSE: This erroneous claim was based on Fig. 5A, that showed a double stained embryo where co-expression is difficult to appreciate without separating the channels. Co-expression of these two reporter lines in the ring gland has been previously documented beyond doubt in our 2014 publication, cited throughout the manuscript, where we presented eight different panels of glands clearly co-expressing both markers at various developmental stages (Current Biology 2014 Fig.2B-I). To prevent any readers reaching the same conclusion as the reviewer, we have modified Fig. 5A to show a double stained sna-rg-GFP vvl1+2-mCherry embryo alongside with the two separate channels (panels 5A’ and A’’) to make the co-expression evident.

      Although we are not including it in this manuscript, the reviewer will also be able to find images in the same 2014 Current Biology publication (Fig.3), where the ectopic activation of Dfd in the trunk leads to the activation of the sna-rg-GFP reporter in the vvl1+2 tracheal cells, proving that the glands and the trachea are formed at homologous positions.

      Having made clear that sna-rg activation in both the CA and the PG occurs in vvl1+2 expressing cells, we now refute a second criticism: The reviewer is puzzled that despite the glands being formed at different dorso-ventral positions in the vvl1+2 expressing patch of cells, we claim both groups of cells are homologous to the trachea.

      We are not saying that the CA are formed at homologous positions to those giving rise to the PG. What we say is that both the CA and the PG are formed at positions homologous to those giving rise to the trachea in the trunk segments.

      To make this clear in the revised version, we have changed the wording of a sentence in the Introduction section that might have originated the confusion.

      Instead of saying:

      “First, the CA, the PG and the traqueal primordia are specified in the lateral ectoderm at homologous positions”.

      Now, it reads:

      “First, the CA and the PG are specified in the cephalic lateral ectoderm at homologous positions to those forming the tracheal primordia in more posterior trunk segments.”

      It has been shown that each tracheal primordium (which are labelled by vvl1+2-mCherry) gives rise to different tracheal branches depending on the positions where they are specified: the dorsal cells give rise to the dorsal tracheal branches, the ventral cells to the ganglionic branches, the medial cells to the dorsal trunk etc. (for illustration see Fig.12 in Manning and Krasnow 1993). Each of these tracheal branches have a different shape and migrate to different positions. We believe that a similar positional specification occurs in the vvl1+2 cells in the maxilla and the labium. In the maxilla only the vvl1+2 ventral cells activate sna and svp (among other genes) to give rise to the CA. In the labium vvl1+2 dorsal cells activate sna, sal, phm (among other genes) to give rise to the PG. This regionalization is similar to what happens during tracheal branch specification, with the only difference that the interaction with Dfd and with Scr is what makes the positional outcome in the maxilla and the labium different (see in our Current Biology 2014 publication Fig. 3E-F and H-J). Thus, when the reviewer considers the equivalence between the CA/PG/trachea homology with that of the wing/haltere or that of the thoracic leg1/2/3 saying: “Indeed, the situation with these endocrine glands and the trachea is completely unlike the situation with the wing and haltere, wherein both structures arise from the same DV position in adjacent segments, or with legs 1, 2 and 3, which arise from the same DV position in adjacent segments

      …the reviewer should think about the coxa and the tarsi in the legs. The coxa in T1 is not homologous to the tarsi in T2 or T3, but when considering the leg structure as a whole, the coxa and the tarsi form part of the same homologous structure in T1, T2 and T3 despite being formed at different positions inside the leg primordia.

      The reviewer also doubts that the activation of upd occurs in the sna-rg primordium when saying: “Likewise, the STAT10X-GFP staining does not overlap with the sna-rg-mCherry staining (I see red cells and I see green cells - there are no yellow cells). If activation of snail is through Upd activation of STAT signaling, we should see that the snail reporter expression is within the domain of STAT10X-GFP expression.”

      RESPONSE: This is due to the fact that upd activation in the CA is extremely transient, leading to the loss of the x10STAT-GFP expression before the sna-rg-mCherry levels are robust enough in the maxilla. This criticism does not apply to the PG where due to upd expression lasting longer, co-expression of sna-rg-mCherry and x10STAT-GFP in panel 4B should be evident to the reviewer.

      To try to sort the CA co-expression problem, we are currently repeating the experiment but instead of analysing sna-rg-mCherry activation with the RFP antibody, we will do an mcherry RNA in situ. We hope that the mcherry transcript will be detectable earlier than the protein and the co-expression will be evident.

      We strongly disagree when the reviewer says: “This paper provides a strong basis for arguing that the CA and PG are induced independently of Jak/Stat signaling, whereas trachea require this signaling pathway.”

      RESPONSE: When making this claim, the reviewer is ignoring a large number of experiments presented in the manuscript. If the CA and the PG are induced independently of JAK/STAT signalling:

      (1) Why sna-rg expression disappears from the glands in mutants lacking the Upd ligands (Fig. 5B and 6K)?

      (2) Why deleting the region containing the putative STAT binding sites in the sna-rg enhancer causes the loss of enhancer expression (Fig. 6C)?

      (3) Why the smaller enhancer mentioned in point (2) recovers gland expression when adding a STAT binding site from an unrelated gene (Fig. 6G)?

      (4) Why the regained expression of the construct mentioned in (3) is lost by the mutation of two bases affecting this single STAT site (Fig. 6H)?

      The reviewer’s conclusion rests on giving an excessive importance to his reservations to CA co-expression in panel 4A while, surprisingly, disregarding the co-expression in the PG shown in panel 4B and all the experiments presented in Fig. 5 and Fig.6.

      Reviewer 3 Minor comments: RESPONSE: Both comments have been taken into account in the revised version.

      In summary, in this revised version we have answered most queries raised by reviewers 1 and 2. Moreover, reviewers 1 and 2 agree that the results presented in this manuscript reinforce the hypothesis that the CA and the PG glands and the trachea derive from the divergent evolution of a metamerically repeated homologous organ.

      Reviewer 3 has made a good point that we have taken into account and has improved the revised submission.

      However, reviewer 3 is wrong when concluding:

      This paper provides a strong basis for arguing that the CA, PG and trachea are not homologous structures, and when saying: the CA and PG are induced independently of Jak/Stat signaling, whereas trachea require this signaling pathway”.

      As we argue above, these conclusions are erroneous because:

      (1) Are based on the incorrect interpretation of Fig 5A and ignore previous published evidence cited throughout the manuscript.

      (2) It does not take into account key experiments presented in this work, while giving too much weigh to a result that can be easily interpreted.

      (3) It misinterprets the arguments justifying the positional homology between the CA/PG glands and trachea primordia.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      This paper focuses on the specification of two endocrine glands that form from head ectoderm, the corpora allata (CA), which forms in the maxillary segment and secretes Juvenile hormone, and the prothoracic glands (PG), which form in the labial segment and secrete Ecdysone. Secretion of both hormones results in a larval molt. Secretion of only Ecdysone induces metamorphosis, the transition of the larvae into the adult forms. Both the CA and PGs form in positions homologous to the tracheal primordia (approximately) and previous reports indicate that ectopic expression of the appropriate Hox genes can result in homeotic transformations of the glands into tracheal primordia and of tracheal primordia into glands. Using a GFP reporter construct for the snail gene as a proxy for gland specification, the authors show that CA and PG formation is regulated by two segment polarity genes: Hh and Wnt, with Hh signaling activating reporter gene expression and Wnt signaling inhibiting reporter gene expression. They also suggest that their endocrine gland GFP reporter is regulated by the two Hox proteins expressed in those segments: Dfd (maxillary) and Scr (labial) (although figure 5D,E argue against this conclusion). They presumably show that reporter gene regulation by Wnt signaling and Hh signaling is indirect and through localized transcriptional activation of the JAK/STAT signaling pathway ligand gene upd (however, the STAT reporter and the snail reporter are expressed in different cells (fig 4B) - so I'm not so convinced of this conclusion). The authors also find that the CA and PG primordia form at slightly different dorsal ventral positions and that DV positional information is controlled downstream of upd JAK/STAT signaling.

      Major comments:

      The paper is well written and makes for a nice story, but the corresponding data are not supportive of most of the conclusions drawn by the authors.

      First, in the Dfd Scr double mutant, both the CA and PG expression of the snail-rg-GFP reporter is still there - admittedly, the gland cells look abnormal at late stages, but this reporter that is supposed to function as a proxy for gland induction is still expressed. That either means that expression of sna-rg-GFP is not a proxy or that the glands are still being specified in the absence of the Hox genes that are proposed to specify these organs. The reporter should not be expressed if these Hox genes are what specify these endocrine organs. This finding might explain why mutating the Hox consensus binding sites had no effect on expression of the smaller snail reporters.

      The finding that the CA and PGs form in slightly different DV positions from each other and slightly different DV positions from the trachea (based on the vvl1+2 mCherry reporter staining combined with that of the sna-rg-GFP reporter staining in Figure 5A, where staining does not overlap except where the CA cells have started to migrate over the vvl1+2 mCherry expressing cells) argues pretty strongly against the CA and PG being homologous to each other or absolutely homologous to the trachea primordia. Likewise, the STAT10X-GFP staining does not overlap with the sna-rg-mCherry staining (I see red cells and I see green cells - there are no yellow cells). If activation of snail is through Upd activation of STAT signaling, we should see that the snail reporter expression is within the domain of STAT10X-GFP expression. This would be consistent with observing a loss of upd mRNA in the maxillary and labial segments with loss of Dfd and Scr, but not seeing a loss of the sna-rg-GFP reporter. This would also argue against the proposed homology between the glands and the trachea. Indeed, the situation with these endocrine glands and the trachea is completely unlike the situation with the wing and haltere, wherein both structures arise from the same DV position in adjacent segments, or with legs 1, 2 and 3, which arise from the same DV position in adjacent segments. This paper provides a strong basis for arguing that the CA, PG and trachea are not homologous structures and that the CA and PG are induced independently of Jak/Stat signaling, whereas trachea require this signaling pathway.

      Minor comments:

      Page 3: tracheal is misspelled in the first paragraph, line 3.

      Page 5, end of first sentence in first full paragraph: "lethal" should be changed to "non-viable". I think the authors mean that homozygous embryos die, not that they cause the death of other life forms.

      Significance

      Nature of significance of advance:

      I think the significant finding is that the CA, PG, and trachea are not homologous structures. But that is not what the authors are concluding. The only findings consistent with the data provided are that Wg signaling represses expression of the snail reporter and Hh signaling activates its expression (Figures 1 - 3). Most of the other conclusions do not seem to be sufficiently supported by the data.

      Context of the work:

      These authors have published that the CA and PG are structures specified in homologous positions to the trachea. It has already been published that CA, PG and trachea primordia express the Vvl transcription factor - although I did not go back to see how that was determined. It has already been published that ectopic expression of specific Hox genes can transform the gland primordia into trachea and vice versa (these experiments may also warrant a closer look). So, idea that CA, PG and TR arose from divergent evolution of a segmentally repeated ancient structure has been proposed.

      Best target audience:

      With the findings that are consistent with the story line (figures 1 - 3), Drosophila embryologists working on the formation of these glands would be interested.

      My field of expertise:

      Drosophila development.

    1. Indie sites can’t complete with that. And what good is hosting and controlling your own content if no one else looks at it? I’m driven by self-satisfaction and a lifelong archivist mindset, but others may not be similarly inclined. The payoffs here aren’t obvious in the short-term, and that’s part of the problem. It will only be when Big Social makes some extremely unpopular decision or some other mass exodus occurs that people lament about having no where else to go, no other place to exist. IndieWeb is an interesting movement, but it’s hard to find mentions of it outside of hippie tech circles. I think even just the way their “Getting Started” page is presented is an enormous barrier. A layperson’s eyes will 100% glaze over before they need to scroll. There is a lot of weird jargon and in-joking. I don’t know how to fix that either. Even as someone with a reasonably technical background, there are a lot of components of IndieWeb that intimidate me. No matter the barriers we tear down, it will always be easier to just install some app made by a centralised platform.
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary: Klein and colleagues generate an ES cell model system with inducible FACT depletion to understand how loss of FACT affects gene regulation in ES cells. They find that FACT is critical for ES cell maintenance through multiple mechanisms including direct regulation of key pluripotency transcription factors (Sox2, Oct4, and Nanog), maintaining open chromatin at enhancers, and regulated enhancer RNA transcription. The paper is well-written, the experiments are generally well-controlled and appropriately interpreted and placed within the context of the field.

      We appreciate the Reviewer’s support of this manuscript.

      Major comments: 1. In general, the ChIP-seq and CUT&RUN data are not that similar. Although correlation seems reasonable (S2A), looking at the heatmaps in S2B/C these seem pretty different. It's not very clear if this is a case where CUT&RUN has higher specificity (and signal-to-noise, which is very clear from example tracks) or if these two methods are picking up biologically different sites. Could the authors include some overlap analysis of peaks and comment on these discrepancies. Looking at the example tracks in Figure 2B, it seems likely that prior SPT16 and SSRP1 ChIP-seq were relatively high-noise.

      We have identified overlapping peaks between the two techniques, and while CUT&RUN identified substantially more peaks overall, percentage of peaks shared between datasets were relatively consistent (1-6% of total) between the individual ChIP-seq datasets and the CUT&RUN dataset (Response Figure 1). We note that the biological classes identified through all datasets were remarkably consistent (Fig. 2D), and therefore attribute the discrepancies to the greater number of reproducible peaks called from CUT&RUN data. As discussed in the paper, peak calling algorithms designed for the specific data types were used, and therefore peak calling could also contribute to differences.

      Response Figure 1. ChIP-seq and CUT&RUN peak overlap. Pie chart depicting the unique and overlapping peaks called from V5-SPT16 CUT&RUN data and FACT ChIP-seq data. These data are included in the revised manuscript (as a new Figure panel 2E). Peaks must have been identified in at least two technical or biological replicates.

      Are motifs described in Figure 2E CUT&RUN only, and do prior ChIP-seq experiments also identify these motifs?

      The motifs shown in Figure 2E (now 2F) are indeed CUT&RUN peaks only. We were unable to confidently assign enriched motifs to the ChIP-seq datasets (the most enriched motifs were approximately p = 10-18). By analyzing all SPT16 ChIP-seq peaks, rather than only intersected SPT16 ChIP-seq peaks, we were able to identify motifs recognized by two of the top three CUT&RUN motif hits (SOX2 and OCT4/SOX2/TCF/NANOG); however, enrichment was quite poor (p = 10-3). By limiting the analysis to intergenic regions, we were able to identify strong enrichment for motifs recognized by CTCF and BORIS (p = 10-58 and 10-51, respectively). As validation, we also called motifs from peak files published as supplementary material to the original Tessarz lab manuscript but were still unable to confidently call motifs (all p > 10-7 for SPT16 peaks, p > 10-15 for SSRP1 peaks). Related to major comment 1, we suspect that the weak motif enrichment is due to high background in ChIP-seq datasets compared to CUT&RUN datasets.

      The authors state that FACT depletion affects eRNA transcription and measured this using TT-seq. The analysis in Figure 3B seems to be all the different types of sites looked at together (genes, PROMPTs, etc). Is there evidence that eRNAs specifically are regulated by FACT loss.

      We apologize for the confusion and have clarified that Figure 3B (now 3A) is referring to mRNAs only in the text and figure. Our analysis of eRNA regulation by FACT is predominantly contained within Fig. 4B (TT-seq from DHSs, but no histone mark overlap assessment), Supp Fig. S4 (as in Fig 4B, but at DHSs overlapping H3K27ac or H3K4me1), Fig. 5E (FACT localization to putative enhancers, defined as in S4), and Fig. 6D (ATAC-seq demonstrating loss of accessibility at putative enhancers upon FACT depletion). Based on these results, we believe there are many eRNAs specifically misregulated by FACT loss and that potential direct targets (based on change in depletion and containing FACT binding) are in Fig 5E.

      Could these be compared to DHS sites that lack FACT binding to support a direct role for FACT at these sites?

      We appreciate the suggestion and have performed this analysis (see Response Figure 2). Relatedly, we analyzed putative silencers, defined as DHSs marked by H3K27me3, for FACT binding and expression changes (measured by TT-seq) following FACT depletion (Supp Fig. S7). As expected, FACT does not bind these putative silencer DHSs and transcription does not markedly increase or decrease from these regions after FACT depletion. Complicating the matter, FACT binds at many DHSs, even those that did not to meet our stringent peak-calling criteria (see Response Figure 2, middle cluster).

      __Response Figure 2. Overlap between FACT binding sites and gene-distal DHSs. __Individual clusters are sorted by V5-SPT16 binding. Clusters were assigned based on direct overlap between called V5-SPT16 peaks and assigned gene-distal DHSs. Overall, 17.6% of DHSs overlapped a FACT peak identified in at least one CUT&RUN replicate (8.5% of DHSs overlapped a peak present in multiple replicates).

      One mechanism proposed for how FACT regulates enhancers is that it is required for maintaining a nucleosome free area, and when FACT is depleted nucleosomes invade the site (Figure 7). It wasn't clear if they compared distal DHS sites were FACT normal bound to those without FACT binding in the MNase experiments, which could help support the direct role or specificity of FACT in regulating those enhancers (or a subset of them).

      We have subset the V5-SPT16 CUT&RUN peaks and distal DHSs into groups and have identified increased nucleosome occupancy after depletion at both FACT-bound and FACT-unbound DHSs suggesting both direct and indirect regulation (Fig. 6A, D). There is disruption to nucleosome arrays at non-FACT-bound DHSs (although more modest relative to the FACT bound locations), and therefore we speculate that a nucleosome remodeler is involved downstream of FACT (possibly CHD1, per recent work out of Patrick Cramer and François Robert’s labs, among others).

      1. Data quality for nucleosome occupancy was a little strange (Figure 7F), where the two clones had very different MNase patterns at TSS sites. Could the authors comment on why there is such a strong difference between clones here.

      We agree that the trends identified by visualizing differential MNase-seq signal near TSSs do not fully replicate; however, in examining the nondifferential MNase-seq heatmaps, we see a more expected distribution (see new Figure 7A). Per our newly-added Supp Fig. S9B, all MNase-seq replicates had a pairwise Pearson correlation value of at least 0.73 (SPT16-depleted clone 1/rep 1 vs untagged rep 3), and the vast majority of samples had pairwise correlations of above 0.85, suggesting that these discrepancies are not due to strong differences in sequencing depth or MNase-protected regions. We therefore suspect that the clonal distinctions are a result of different background occupancy of nucleosomes near the TSS, resulting in an array with increased occupancy in one clone and more generalized increased occupancy in the other clone. We also added the MNase-seq data over TSSs in a non-differential form in Fig 7A, and believe the difference between the clones is due to the differential analysis, and have commented accordingly in the revised manuscript.

      More details on some of the analysis steps would be really helpful in evaluating the experiments. Specifically, was any normalization done other than depth normalization? I ask this because the baseline levels for many samples in metaplots look quite different. For example, see Figure 7B where either clone 1 has a globally elevated (at least out 2kb) ratio of nucleosome in the IAA samples relative to the EtOH, or there is some technical difference in MNase. One suggestion is to look at methods in the CSAW R package to allow TMM based normalization strategies which may help.

      We appreciate the suggestion – we have expanded our explanation of normalization methodology in the paper. We initially used quartile and RPGC normalizations to attempt to mitigate technical differences in MNase-seq data. Size distribution plots did not suggest differences in MNase digestion between samples, and neither quartile/RPGC nor TMM-based normalization fully resolved this issue. Because our ATAC-seq datasets agree with the general trends identified by MNase-seq (which are consistent, despite technical differences between clones), we do not believe that the differences constitute true biological difference, but rather experimental noise.

      1. I appreciated the speculation section, and the possible relationship between FACT and paused RNAPII is interesting. While further experiments may be outside the scope of this work and I am not suggesting they do them, I am wondering if others have information on locations of paused RNAPII in ESC that would allow them to test if genes with paused RNAPII have a special requirement for FACT that they could use their current data to assess.

      We agree that experiments to test the relationship between paused RNAPII and FACT are an intriguing next step, and plan to dissect those in the near future.

      Minor comments: 1. When describing the peaks found in the text related to Figure 2 they refer to 'nonunique' peaks. Does this mean the intersection of the independent peak calls? Could they clarify this.

      We apologize for the confusion and have clarified in the text that nonunique peaks does indeed refer to the intersection of independent peak calls (now specified on manuscript page 8, line 15).

      In the text they refer to H3K56ac data in S2D and I don't see that panel. The color scheme for the 1D heatmaps (Figure 5A) is tough to appreciate the differences. I'd suggest something more linear rather than this spectral one might be easier to see.

      We apologize for the confusion and removed the remaining H3K56ac-related data and references in the text. We appreciate the suggestion regarding the 1D heatmap color scheme and have adjusted the colors to a linear (white à red) scheme.

      For the 2D heatmaps of binding, could they include the number of elements they are looking at for each group?

      We appreciate the suggestion and have included numbers of elements visualized wherever applicable in the figure panels and legends.

      1. Also for 2D heatmaps, I think the scale is Log2 (IAA/EtoH), but could they confirm that and include it in the figure?

      We apologize for the confusion; the only heatmaps displaying log2(IAA:EtOH) are those in Fig. 6; for those panels, we have clarified the scale in the figure and legend.

      Reviewer #1 (Significance (Required)):

      • The use of degrader based approaches to depleting a protein allows refined kinetic and temporal assays which I think are important. Several papers showed a rapid invasion of nucleosomes after SWI/SNF loss using these kinds of approaches and revealed surprisingly fast replacement of SWI/SNF. This paper is consistent with those models, showing that another remodeler behaves the same, suggesting there may be general requirements for active chromatin remodeling to maintain the expression of these genes. It also highlights a key gap in how specificity works to target these enzymes remains somewhat unknown.

      • This work will be of interest to those studying detailed mechanisms of gene regulation. Compared to some other chromatin regulators, FACT is understudied and so this work will allow comparison between different chromatin remodeling complexes.

      • My experience: chromatin, gene regulation, cancer, genomics

      We appreciate the thorough review and hope that we have sufficiently addressed your concerns.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The authors propose that the FACT complex can regulate pluripotency factors along with their regulatory targets through non-genic locations. They find that acute depletion of FACT leads to a "reduction" in pluripotency in mouse embryonic stem cell by disrupting transcription of master regulators of pluripotency. They also show FACT depletion affected the transcription of gene distal regulatory sites, but not silencers. They also stated that SPT16 depletion resulted in both, a reduction of chromatin accessibility and increase of nucleosome occupancy over FACT bound sites.

      Overall the study appears technically well executed. The use of an Auxin induced depletion system is a good model to study the acute effects of FACT depletion. However, I have a number of concerns relating to specificity and interpretation of the results that need to be addressed. We appreciate the careful review and have addressed your comments below:

      Major points o Authors claimed that depletion of the FACT complex "triggers a reduction in pluripotency". As evidence supporting this statement they present images of alkaline phosphatase assays of a time course performed upon depletion of FACT. These experiments indeed show that ESCs are destabilized in the absence of SPT16. However, some key questions regarding the phenotype remain unresolved: o What is are the kinetics of expression of selected naïve pluripotency and early differentiation markers? Are differentiation markers upregulated, consistent with normal differentiation upon FACT depletion?

      We appreciate the suggestion and have emphasized the decrease in pluripotency factor expression, accompanied by an increase in differentiation marker expression across all three germ layers. We graphed 7 pluripotency factors and 7 differentiation markers for each germ layer; generally speaking, pluripotency factors are decreased while differentiation markers are increased (Response Figure 4; pluripotency factors are included in the new Fig. 3B, while differentiation markers are included in the new Supp Fig. S3 F-H).

      We have also performed an immunocytochemistry (ICC) timecourse, per Reviewer 3’s suggestion. This ICC timecourse allows us to orthogonally assess decreased pluripotency factor expression, to pair with the OCT4 Western blot shown in Supp Fig. S1B. These new ICC data are shown in the new Fig. 1D and included here for convenience (Response Figure 5). In addition, we have added alkaline phosphatase staining at 12 hours of depletion to Fig. 1C.

      __Response Figure 4. Plots of DESeq2 analysis across experimental timecourse. __Shown are lineage markers denoting: A. Pluripotency B. Endoderm C. Mesoderm and D. Ectoderm. Generally, expression of pluripotency factors decrease over time, while differentiation markers of each lineage increase over time. These data are shown in Figure 3B and Supplemental Figure S3F-H.

      __Response Figure 5. Immunocytochemistry timecourse depicting DAPI staining (left panels, blue) and OCT4 immunofluorescence (right panels, green). __Images are representative of plate-wide immunofluorescence changes.

      O Is only ESC identity affected or does loss of FACT impair viability also of cells that have exited pluripotency? To address this, growth curves and/or cell cycle analysis upon FACT depletion could be performed. Alternatively, the authors could utilize surface markers to distinguish naïve pluripotent form differentiated cells in the cell cycle analysis experiments to identify a potential differential response of pluripotent and differentiated cells to FACT depletion.

      We have performed a growth curve with FACT depletion as suggested; as the two points are related, we will explain further below:

      o Another key question is whether it is only the metastable pluripotent state of ESCs in heterogeneous FCS/LIF conditions which is affected by FACT loss, and whether cells cultured in the more homogeneous and more robust 2i-LIF conditions can tolerate FACT removal. If that is indeed the case it would enable the authors to address one main concern I have with this manuscript, which is that it is nearly impossible to distinguish the direct effect of FACT loss from differences induced by differentiation (and maybe cell death, see comment above). This is a critical concern that needs to be addressed and discussed appropriately.

      We apologize for the confusion – all original experiments for this project were performed in the presence of LIF as well as GSK and MEK inhibitors CHIR99021 and PD0325091, respectively (2i+LIF conditions). To address the reviewers question, we have now performed a timecourse growth assay under both LIF-only and 2i+LIF conditions (Response Figure 6 and new Supp Fig S1F), and as suggested by the reviewer, observe a stronger effect of FACT depletion on cell viability in LIF-alone (FACT-depletion results in ~90% death within ~24 hours, with differences in growth observed by 12 hours) than in 2i+LIF (FACT-depletion results ~80% death within 48 hours, with differences in growth observed starting around 18 hours). Overall, ES cells in LIF alone are indeed more sensitive to FACT loss, supporting our decision to perform the experiments throughout the manuscript in 2i+LIF conditions.

      LIF alone LIF + 2i

      Response Figure 6. __Growth assays in LIF (left) and 2i+LIF (right) conditions. __Cells were treated with either EtOH or 3-IAA and counted at the indicated times. Viability was assessed using trypan blue exclusion. Error bars indicate standard deviation for biological triplicate experiments.

      o A further major concern is about the specificity of the effect of FACT depletion. The authors claim that FACT is required to maintain pluripotency. From the data presented this is unclear. FACT appears to be part of the general transcription machinery in ESCs. It appears generally associated with active promoters and active genes, according to the data in this manuscript. Whether there is any specific link to pluripotency remains to be shown. It is unclear how enrichment analyses have been performed. If they haven't been performed using a background list of genes actively transcribed in ES cells, they will obviously show enrichment of ESC specific GO categories, because ESCs express ESC specific genes robustly expressed in ESCs?

      We apologize for the confusion and have updated our methods section to include more comprehensive details on our pathway enrichment analyses. We have confirmed that pluripotency-related categories are still highly enriched in FACT-regulated DEGs, even when using a background dataset of all transcribed genes, per our TT-seq datasets (baseMean ≥ 1 in DESeq2 output).

      In line with this: the authors show that FACT bound loci well overlap with Oct4 bound regions. But which proportion of FACT targets loci are actually Oct4 bound too?Is FACT binding exclusive to Oct4 regulated enhancers and promoters? In other words, will FACT be recruited to all actively transcribed genes in ES cells? In that case, a specific effect on pluripotency network regulation cannot be claimed.

      We appreciate the suggestion, and have added the number of OCT4/SOX2/NANOG-bound FACT peaks and vice versa in the text and legend of Fig 3E-F. We have also summarized this information in Response Table 1, below (and included these data as Table 2 in the revised manuscript).

      OCT4 peaks

      Sox2 Peaks

      Nanog Peaks

      Any of OSN

      V5 Peaks

      8,544

      5,948

      5,307

      9,682

      OSN Peaks

      45,476

      19,211

      16,817

      52,899

      % of OSN peaks bound by FACT

      18.33%

      30.72%

      31.40%

      17.91%

      % of V5 peaks bound by pluripotency factor(s)

      52.41%

      36.85%

      32.94%

      59.63%

      V5-bound promoters

      4,261

      2,719

      2,327

      4,452

      OSN-bound promoters

      6,550

      1,542

      666

      6,948

      V5- and OSN-bound promoters

      2,040

      801

      343

      2,202

      OSN-bound gene-distal peaks

      38,926

      17,669

      16,151

      45,938

      V5-bound gene-distal OSN peaks

      6,504

      5,147

      4,964

      7,480

      __Response Table 1. Overlapping CUT&RUN and ChIP-seq peaks shared between OCT4, SOX2, NANOG, and V5-SPT16 under various stratifications. __Shown are numbers or percentages of peaks overlapping between V5 and OSN. The last column are peaks containing any of OCT4, SOX2, and/or NANOG. The first four rows include all peaks, regardless of location, and the last five rows are broken down by promoter (as defined by an annotated mRNA) or gene-distal location (defined by a minimum of +/- 1kb from a gene).

      Of the 45,865 OCT4 peaks, 3,688 are located at promoters, and 1,209 of these peaks are bound by V5-SPT16 (32.8%). Inversely, 13,228 of 42,177 gene-distal OCT4 peaks are called as SPT16-V5 peaks in at least one CUT&RUN replicate (31.36%), suggesting a relationship between OCT4 binding and FACT binding, which has long been identified with genic transcription, but has roles extending beyond gene-proximal regulation. We observe similar trends with NANOG and SOX2.

      o It is disappointing that neither raw data (GEO submission set to private) nor any Supplemental Tables containing differentially expressed transcripts and ChIP or Cut and Run peaks and associated genes were made available. This strongly reduces the depth of review that can be performed.

      We apologize if the reviewer token in the cover letter was not accessible. The GEO datasets (including differentially expressed transcripts, raw fastq files, and analyzed datasets) will be made public upon publication; in the meantime, the GEO entry (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE181624) can still be accessed using the previously provided reviewer token: wvkvwmwynjeffux.

      o To what extent do FACT bound loci overlap with genes differentially expressed 24h after FACT depletion? This analysis would help determine the direct targets of FACS regulation.

      We appreciate the suggestion. This analysis can be found in the original Figure S6, broken down by FACT-repressed (expression increased upon FACT depletion), unchanged, and FACT-stimulated (expression decreased upon FACT depletion) DESeq2 results (ordered left-to-right, respectively). Figure S6A-C shows that V5-SPT16 binding is enriched, but not exclusive to, genes with FACT-regulated expression, while Fig. S6D-F shows TT-seq data for each group, sorted by log2-fold change assigned by DESeq2.

      o The paper mainly relies on NGS analysis. Therefore, it is crucial that authors show as Supplemental Material some basic QC of these data. PCA analyses to show congruency of replicates are the minimum requirement.

      We appreciate the suggestion and have included a new Supp. Fig S9, with pairwise comparative Pearson correlation scatterplots and heatmaps for replicates in each dataset, in addition to the scatterplots shown for CUT&RUN and ChIP-seq data in the original Supp Fig. S2A.

      o Did the authors perform any filtering for gene expression levels before analysis? Are genes in the analysis robustly expressed in at least one of the conditions?

      We apologize for the confusion. Due to the sensitive nature of TT-seq and the germ layer-inconsistent pattern of cell differentiation following FACT depletion, we did not perform filtering for gene expression prior to any analyses. For the vast majority of genes analyzed, however, we are able to identify transcription via TT-seq, even in those that do not significantly change expression upon FACT depletion (see Supp Fig S6E). As discussed above, we did include a cutoff for expressed genes in our revised pathway analysis.

      o Wherever p values were reported for enrichment analyses, adjusted p values should be used

      We apologize for the oversight; the p values were in fact adjusted p values and have updated the text and figures to make it explicit that the adjusted p values were used wherever applicable.

      o I cannot follow the logic used by the authors to explain discrepant results from Chen et al about the role of FACT in ESCs. Chen et al showed that FACT disruption by SSRP1 depletion is compatible with ESC survival and leads to ERV deregulation. The authors of the present study attribute these differences to potential FACT independent roles of SSRP1. However, I would assume that if there are indeed FACT independent roles of SSRP1, then the phenotype of SSRP1 KOs in which FACT and other processes should be dysfunctional should be even stronger than a plain FACT KO. This needs a proper and careful explanation.

      We apologize that our discussion of FACT-independent roles of SSRP1 was not clear and have clarified our wording in the text (page 4, line 49 – page 5, line 4)in the revised manuscript); we intended to reconcile the results of Chen et al. 2020 with Goswami et al. 2022 and Cao et al. 2003; despite SSRP1 knockout viability in embryonic stem cells, SSRP1 knockout is lethal in mice between 5-40 weeks and general SSRP1 knockout is lethal 3.5 days post-conception (per Goswami et al. 2022). We therefore posit that the general requirement for SSRP1 may be due to distinct roles from those carried out by the FACT complex in ES cells, as discussed by Spencer et al. 1999, Zeng et al. 2002, Li et al. 2007, and Marciano et al. 2018.

      We note that our findings are in agreement with papers from the Gurova lab and others in that depletion of mRNA or protein of SPT16 leads to concomitant loss of SSRP1; we therefore do not expect total SSRP1 loss to have a stronger effect than SPT16 depletion. We therefore expect, and confirmed via Western blotting (Figure 1B, Supplemental Figure 1), that depletion of SPT16 leads to loss of both FACT subunits, and therefore all FACT subunit activity, complex-dependent or -independent.

      Also, did the authors observe any evidence for ERV deregulation upon acute SPT16 depletion?

      We did indeed observe ERV deregulation upon SPT16 depletion. When reviewing our TT-seq datasets, 7.1% of ERVs were derepressed, while 2.4% decreased in expression upon 24h FACT depletion (mm10 ERVs sourced from gEVE, Nakagawa and Takahashi, 2016). Further, we identified increased chromatin accessibility after FACT depletion at annotated LTR elements, as shown in the table below (Response Table 2). Here we are displaying the calculated enrichment score for accessibility detected at these locations. A negative value indicates lower accessibility than expected by region size, while a positive score indicates that reads are more enriched than expected at the indicated region.

      ATAC-seq enrichment score for locations losing accessibility with FACT depletion

      3h

      6h

      12h

      24h

      LTR Enrichment

      -1.445

      -1.299

      -0.917

      -0.559

      Intergenic Enrichment

      -6.046

      -4.765

      -3.926

      -2.972

      Promoter Enrichment

      3.335

      2.789

      2.726

      2.233

      ATAC-seq enrichment score for locations gaining accessibility with FACT depletion

      3h

      6h

      12h

      24h

      LTR Enrichment

      -1

      -0.436

      1.103

      1.13

      Intergenic Enrichment

      -1

      0.134

      0.435

      0.236

      Promoter Enrichment

      -1

      -3.585

      1.171

      1.39

      __Response Table 2. Changes in ATAC-seq peak enrichment for selected regions, annotated via HOMER. __At regions differentially accessible between SPT16-depleted and SPT16-undepleted samples, regions were assigned to an annotated genomic feature using HOMER annotatePeaks.pl and assigned an enrichment score based on the ratio of ATAC-seq signal to region size. Over time, LTR elements become more enriched among the ATAC-seq peaks both gaining and losing accessibility, indicating a role for FACT in maintaining LTR accessibility.

      We do wish to note, however, that Lopez et al. 2016 identified SPT16-independent regulation of LEDGF/HIV-1 replication by SSRP1, and therefore cannot rule out effects on ERV dysregulation due to SSRP1 loss that accompanies SPT16 depletion.

      Minor points o Figure S2A is very small and resolution is low. Page 10: "...while all four Yamanaka factors (Pou5f1, Sox2, Klf4, and Myc) and Nanog were significantly 24 reduced after 24 hours (Fig. 3A, S3A-B)". No data for myc is being shown.

      We apologize for the figure resolution and have included a larger image. Because pairwise comparative scatterplots are not space-efficient, we opted to display the Pearson correlations for the datasets including more samples (TT-seq and ATAC-seq timecourses) as heatmaps in the new Supp Fig S9. We have added Myc labeling to the volcano plot (now in Fig. 3A) and included a trace of Myc expression over time to the new pluripotency factor graph in Fig. 3B.

      o Are the two bands in the middle in figure 1B is supposed to be a ladder? This should be clarified.

      We thank the reviewer for noticing this and apologize for the oversight.

      o Figure 3C- This Figure is complicated to read. Also, information appears redundant with the Table 1, I recommend to remove this panel.

      We have moved the panel to the supplement (now Supp Fig. S3A). While the information is somewhat redundant with Table 1, we chose to include the former panel 3C as a visual representation of the consistent deregulation over depletion time across transcript categories.

      o Figure 6 and figure 7 could be presented in one single figure since both aspects are complementary and target related aspects.

      While we thank the reviewer for this suggestion, we do not believe that the information contained in Figs. 6 and 7 can effectively be conveyed in a single figure. While both figures focus on chromatin accessibility and nucleosome occupancy, Fig. 6 is designed to address the changes in chromatin accessibility over time, while Fig.­­­ 7 is more relevant to the biological mechanism through which FACT co-regulates targets of the core pluripotency network (OCT4/SOX2/NANOG) after 24 hours of depletion.

      o Are the authors certain that the effects observed are directly linked to the FACT complex in contrast to FACT independent roles of SPT16, if any exist? The experiment to address this would be to deplete SSRP1 and investigate whether the effects are identical, which would be the hypothesis to be tested.

      We thank the reviewer for this suggestion. We did attempt to create additional SSRP1-AID-tagged lines; however, generating these lines proved to be technically challenging, and comparison of the FACT-dependent and -independent roles of the individual subunits is beyond the scope of this work. Further complicating the matter, SSRP1 is effectively depleted within 6 hours of 3-IAA addition in SPT16-AID lines due to the interdependence of FACT subunits. We again thank the reviewer for their suggestion and will consider this work for a future study.

      Reviewer #2 (Significance (Required)):

      My expertise is pluripotency and GRNs.

      I would judge the significance of the study as presented as low, mainly because at this moment it remains unclear what FACT indeed does concerning regulation of pluripotency.

      We respect the reviewer’s opinion and hope that our revisions have made more clear how the FACT complex prevents nonspecific differentiation from occurring, thereby maintaining pluripotency and self-renewal in embryonic stem cells. Importantly, neither untagged cells treated with 3-IAA nor tagged cells treated with vehicle display the growth defects, loss of pluripotency factor expression, increased differentiation marker expression, phenotypic evidence of differentiation, and reduced alkaline phosphatase staining that the FACT-depleted cells do, highlighting a key requirement for FACT in pluripotent cells. Beyond this, we believe the novel gene distal regulatory role we have identified for FACT presents an exciting new role for this complex in gene regulation.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Klein, et al. addressed function of FACT complex in mouse ESCs, using cut&run, TT-seq, ATAC-seq, MNase-seq, together with Auxin-mediated FACT degradation system. The authors first reported that efficient and acute depletion of SPT16 with the Auxin-mediated degradation system resulted in over 5,000 up- and 5,000 down-regulated genes within 24 hours, including down-regulation of pluripotent gens. Then, they demonstrated that many of FACT binding sites overlap with Oct4, Sox2, Nanog binding sites by Cut&Run, and those loci increase nucleosome occupancy 24 hour after removal of FACT.

      The Auxin-mediated degradation system seems to be working very well (while I would like to see an over exposed version of Western blotting), and efficient degradation might explain the different phenotypes from the previous reported phenotypes by shRNA and the chemical inhibitor, which might not deplete FACT function completely and/or might have off-target effects. The Cut&Run data also have much sharper peaks than previously reported SSRP1, SPT16 ChIP-seq data. Doing ATAC-seq, MNase-seq upon removal of FACT is excellent. WIth the excellnet degradation system, depletion of FACT resulted in loss and gain of gene expression and differentiation. However, unfortunately it was not very clear to me what was the direct consequences of FACT removal and its mechanisms, waht was consequence of differentiation.

      We appreciate the kind words regarding our choice and execution of techniques and the reviewer’s time spent on this manuscript. We have made a number of changes to the manuscript in order to clarify the direct role of FACT and the consequences of FACT loss on embryonic stem cells.

      Although we did not develop the blots for a longer period when we performed the Westerns, we have artificially overexposed our V5-SPT16 Western blot from Figure S1 (in Adobe Illustrator) to highlight the more subtle bands at later depletion timepoints; we hope that this helps to clarify the effectiveness of the degron system.

      Response Figure 7. V5-SPT16 Western blot with adjusted exposure. We manually adjusted the entire blots’ exposures using Adobe Illustrator. L indicates ladders, and the timecourse depletion is shown above the blot.

      In my opinion, doing many of the analysis 24 hours after FACT depletion, where differential expressed (coding) genes (DEGs) are >10,000 (Table 1)), is too late to understand what the direct consequences are. Seeing 214 up- and 174 down-regulated DEGs 6 hours after FACT depletion, I do agree that FCAT seems to do both suppression and activation of target genes. It could have been really interesting to investigate what % of FACT bindign sites change chromatin accesibility and nucleosome occupancy at that time point, if those loci are close to any of the up- or down-regualted DEGs.

      We appreciate the suggestion and have included more information regarding the percentage of FACT binding sites with altered chromatin accessibility, as well as included some analyses to address the directness of FACT’s contribution to DEGs at all timepoints (see Supp Figs S4, S6). We would like to note that, we performed the TT-seq and ATAC-seq experiments at 0, 3, 6, 12, and 24 hours post 3-IAA treatment in order for us to explore the progressive change in both the transcriptome and chromatin accessibility, with only the MNase-seq limited to 24 hours. As originally shown in our Sankey plot in Supp Fig 4, we see a progressive change in expression for a small subset of genes over our timecourse running from 0-24 hours, with the largest effect observed at 24 hours, once the FACT protein levels are almost entirely depleted. Similarly, we see a progressive change in ATAC-seq signal over the same regions, with the strongest effects over the same regions visible at 24 hours post-depletion. Due to our observation that SPT16 is not depleted at 3 or 6 hours, with significant depletion seen at 24 hours (see Response Figure 7) and because we intended to study the FACT complex’s role in preventing differentiation, we were most interested in the effects at 24 hours of depletion, which allow us to analyze both the disruption of pluripotency factor expression and the facilitation of differentiation marker expression across all three germ layers (see Response Figure 4).

      Followings are reasons of above my judgement and suggestions to improve the manuscript.

      Major points 1. Figure 1. ALP staining is not very sensitive way to evaluate ESC differentiation. I recommend Immunofluorescence for pluripotency genes (NANOG and/or SOX2) and quantification. Or present changes of pluripotency genes in graphs over time course from RNA-seq data.

      We appreciate the suggestions and have taken both into account. We have included a new panel in Figure 3 (new 3B) to display the changes of pluripotency factor expression over our timecourse. We have also included some data showing differentiation factors as part of a response to Reviewer 1, which can be found above (Response Figure 4). In addition, we performed immunocytochemistry to examine OCT4 abundance over a depletion timecourse and have added a 12-hour to our alkaline phosphatase assay to address the sensitivity of differentiation over time (Figure 1C, D and Response Figure 5).

      1. Fig 2A, 3E, 3F. How many transcription start sites are shown here? (Throughout the manuscript, it is hard to know how many loci are shown in the heatmaps. It should be described within the figures)

      We apologize for the omission and have added numbers of loci shown to relevant Figure panels throughout the paper.

      It is nice to see nascent transcription high sites have high FACT binding, but can you also show actual nascent transcription of these loci as a heatmap, before and after FACT depletion? These heatmaps should be shown in a descending order of FACT Cut&Run signalling, as FACT binding is important in this manuscript.

      We appreciate the suggestion and have plotted those data below (see Response Figure 8).

      Response Figure 8. Nascent transcription from sites with high FACT binding. Top: TPM-normalized TT-seq signal after 12-hour treatment, oriented to mRNA strand and plotted as entire mRNA length, ± 500 bp. Data are sorted by SPT16 CUT&RUN signal over 1kb upstream of annotated TSSs. n = 1 over 22,597 rows (RefSeq Select mRNAs). Bottom: TPM-normalized TT-seq signal after 24-hour treatment, oriented to mRNA strand and plotted as entire mRNA length, ± 500 bp. Data are sorted by SPT16 CUT&RUN signal over 1kb upstream of annotated TSSs. n = 3 (mean) over 22,597 rows (RefSeq Select mRNAs).

      Strong FACT binding sites have strong transcription. Is FACT really supressing transcription?

      We agree that it is very difficult to disentangle FACT function due to its binding correlation with transcription; however, we see a clear trend of FACT binding at promoters that are sensitive to FACT depletion (Supp Fig. S6A/D and C/F). Intriguingly, the genes that see the greatest derepression by DESeq2 analysis are those that are directly bound by FACT (per ChIP-seq and CUT&RUN; Supplemental Figure S6A/D), while the greatest decrease in expression occurs at genes that are less bound by FACT (Supp Fig S6C/F). In our opinion, this trend lends credence to both direct repression by FACT and distal gene regulation. We note that others (e.g., Kolundzic et al. 2018) have shown direct repression of gene expression by FACT, in line with that aspect of our data.

      1. Fig 3ABD. It is more important to show 3h, 6h 12 h time points. The same apply to Fig 4. What %, how many of DEGs (coding and non-coding) at each time point had FACT binding nearby in ESCs?

      We agree that the early timepoints are important and have added volcano plots to the supplemental material for earlier timepoints, with genes of interest specifically annotated. We have also examined pluripotency and differentiation markers at earlier timepoints, per other reviewers’ suggestions, and have included the percentage of DEGs with nearby FACT binding in the manuscript. Specifically, 2013 replicated V5 peaks (out of 16,054; 12.54%) occurred within 1000 bp of a RefSeq Select TSS.

      Timepoint

      Total DEGs (up)

      V5-bound DEGs (up)

      Total DEGs (down)

      V5-bound DEGs (down)

      3h

      58

      16 (27.59%)

      5

      1 (20%)

      6h

      214

      38 (17.76%)

      174

      31 (17.82%)

      12h

      1366

      123 (9.00%)

      1932

      281 (14.54%)

      24h

      5398

      431 (7.98%)

      5000

      663 (13.26%)

      __Response Table 3. Table of DESeq2-assigned DEGs that are bound by SPT16-V5. __To be defined as V5-SPT16-bound, a DEG must have SPT16-V5 binding within 1000 bp upstream of its RefSeq-select annotated TSS.

      We believe that these earliest depletion timepoints are in line with FACT-mediated gene regulation occurring distal to the regulated genes’ promoters.

      Fig 3EF. Interesting data and the overlap between SPT16 binding sites and pluripotency binding sites look very strong. But it is difficult to know what % is overlapping from these figures.

      We appreciate the difficulty in quantifying the overlap between pluripotency factor binding sites and FACT binding sites; we have added those data to the manuscript below Figure 3E for OCT4; for other pluripotency factors, these data can be found in Response Figure 9 and Response Table 1. Briefly, 18.33% of OCT4 ChIP-seq peaks are bound by V5-SPT16 and 52.41% of V5-SPT16 peaks are bound by OCT4. Interestingly, 34.6% of gene-distal OCT4 ChIP-seq peaks are bound by V5-SPT16, implying greater convergence between FACT and pluripotency factors at gene-distal sites, in line with known trends for OCT4 binding. Overall, 59.63% of V5-SPT16 peaks are co-bound by at least one of OCT4, SOX2, or NANOG.

      Can you show 1 heatmap split into 3 groups, a. SPT16-V5 unique, common between SPT16-V5 and Oct4 ChIP-seq, Oct4 ChIP-seq unique, with indication of numbers each group has? Also make the same figures for Sox2 and Nanog. (E is less important. If the authors want, they can use the published FACT ChIP-seq data in the same loci.)

      We appreciate the suggestion and have plotted V5-SPT16 CUT&RUN data and pluripotency factor ChIP-seq over unique and shared regions for OCT4 (top) SOX2 (middle) and NANOG (bottom). Interestingly, although some peaks in the non-overlapping cluster were not called as peaks by the algorithms’ threshold, one can observe that a subset do seem to have overlapping binding. We again appreciate the suggestion and think that this was an excellent way to display the data and have included these data as a new panel (Fig. 3E) but also show below in Response Figure 9.

      Fig. 5. Basic information what % (how many) of SPT16-V5 CUT&RUN peaks belong to this 'enhancer' category is missing.

      We apologize for the oversight and have added numbers to the figure and legend.

      I am not sure the meaning of separating enhancers and TSS of coding genes in the analyses, though. If majority of SPT16-V5 CUT&RUN peaks overlap with Oct4 binding sites, it is not surprising that SPT16-V5 CUT&RUN peaks overlaps with ATAC-seq signal and enhancer marks.

      We agree that it is unsurprising that V5-SPT16 overlaps with accessible chromatin and enhancers, given the extensive overlap with OCT4 ChIP-seq peaks. We wanted to emphasize our novel finding of gene-distal FACT binding, given the more established trend of binding at promoters.

      1. Fig 6A. I could not figure out what % of DHSs overlaps with FACT binding sites.

      We have added this percentage to Fig 5C and included an analysis of altered chromatin accessibility in a new Table 3 (page 20). Briefly, 11,234 replicated V5-SPT16 peaks (out of 16,043; 70%) directly overlap a gene distal DHS. Orthogonally, 11,234 DHSs (out of 132,555; 8.5%) directly overlap a V5-SPT16 peak.

      I do not see the point of showing DHSs which do not overlap with FACT binding sites.

      In agreement with Reviewer 1, we believe that it is important to include FACT-unbound DHSs for a clearer understanding of the direct vs indirect effects of FACT depletion. We have condensed some of these data into a single heatmap, clustered between FACT-bound DHSs, non-FACT-bound DHSs, and FACT-bound non-DHS sites to streamline the information (now shown in Fig 3E).

      Response Figure 9. Heatmaps of clustered SPT16 and OSN binding data. Shown are clustered heatmaps depicting V5-SPT16 CUT&RUN binding overlapping ChIP-seq peaks for OCT4 (top) SOX2 (middle) and NANOG (bottom). In each set of heatmaps the top cluster is pluripotency factor-unique, the middle cluster is shared, and the V5-unique cluster is on the bottom. Each cluster is sorted by descending strength of V5-SPT16 binding (CUT&RUN). Clusters were assigned by directly overlapping peaks.

      How ATAC-seq signal changes upon depletion of FACT at FACT binding sites (Fig 6B) is important. Can you explain why ATAC-seq signals increase at the FACT binding site flanking regions (across +/- 2kb) where FACT binding is strong (without changing the chromatin accessibility at the FACT binding sites)? Perhaps authors need to show actual ATAC-seq track with EtOH or 3-IAA treatment over ~10kb regions flanking FACT binding sites. It is difficult to understand what is happening seeing only the changes (ratio) of ATAC-seq read counts, how big the differences are.

      We agree that the local window and ratio of ATAC-seq signal somewhat muddles the true biological trends. We have plotted non-differential ATAC-seq signal for each SPT16-AID clone over V5 binding sites, ±10 kb, to more accurately depict the local chromatin status (shown below in Response Figure 10). There is an apparent trend at V5-SPT16 CUT&RUN peaks of accessible chromatin, and this high local accessibility very likely contributes to the high ATAC-seq signal immediately flanking V5 binding sites; over the binding sites themselves, however, FACT depletion consistently triggers decreased accessibility (see Fig. 6).

      Can you identify differentially open loci based on 3-IAA- and Et-OH treated ATAC-seq data at each time point, and then how many of them overlap with FACT binding sites? There are a few tools to identify differential open regions with ATAC-seq data. That could help to understand the direct roles of FACT binding.

      We appreciate the suggestion and have performed this analysis using a combination of PEPATAC and HOMER (see Response Tables 4-6 below). FACT depletion leads to the following accessibility changes:

      3-hour

      6-hour

      12-hour

      24-hour

      Decreased accessibility

      220 (0.35%)

      3,713 (5.99%)

      6,885 (11.11%)

      8,441 (13.62%)

      Increased accessibility

      2 (0.00%)

      12 (0.02%)

      276 (0.45%)

      6,031 (9.73%)

      Response Table 4. Accessibility changes over consensus ATAC-seq peaks. Consensus ATAC-seq peaks were defined per PEPATAC standards (peaks called by MACS2 in (n/2)+1 samples, irrespective of condition.

      3-hour

      6-hour

      12-hour

      24-hour

      Decreased accessibility

      848 (1.64%)

      1870 (3.51%)

      2525 (4.83%)

      4,092 (7.90%)

      Increased accessibility

      107 (0.21%)

      283 (0.55%)

      534 (1.03%)

      2,449 (4.73%)

      Response Table 5. Accessibility changes over regions bound by V5-SPT16.

      Response Figure 10. ATAC-seq data shown over a 20kb window. Heatmaps depicting non-differential ATAC-seq data over FACT binding sites for SPT16-AID clones 1 (top) and 2 (bottom). Data are sorted by V5-SPT16 binding strength.

      All

      3-hour

      6-hour

      12-hour

      24-hour

      Decreased accessibility

      3,294 (2.46%)

      3,175 (2.37%)

      3,636 (2.71%)

      7,018 (5.23%)

      Increased accessibility

      102 (0.08%)

      313 (0.23%)

      1,797 (1.34%)

      5,975 (4.45%)

      V5-bound DHSs (11,234 total)

      3-hour

      6-hour

      12-hour

      24-hour

      Decreased accessibility

      1 (0.01%)

      9 (0.08%)

      96 (0.85%)

      2006 (17.86%)

      Increased accessibility

      5 (0.04%)

      28 (0.25%)

      71 (0.63%)

      87 (0.77%)

      Response Table 6. Accessibility changes over gene-distal DHSs and over only FACT-bound gene-distal DHSs.

      Together with Fig 1A and Fig 6C, do they mean the more FACT binding, the more transcription (Fig 1A). Also the higher transcription rate, the more increased chromatin accessibility upon depletion of FACT (Fig 6C)?

      While we do see that FACT binding correlates with transcription and with FACT-dependent chromatin accessibility, we do not wish to make the argument that FACT binding alone is indicative of high transcription, nor that transcription is necessarily the deciding factor in FACT-depleted chromatin accessibility changes. We do want to note that transcriptional disruption is a likely contributor to increased chromatin accessibility in the absence of FACT as it pertains to paused RNAPII, as speculated in our discussion, but that experiments to truly test this hypothesis are beyond the scope of this work. That being said, in response to Reviewer 1, we did assess the potential correlation of FACT binding to locations with greater paused RNAPII (Response Figure 3) and see a connection. We are excited to explore this more in future work.

      Perhaps plotting nascent transcripts at 12hr, 24 hr of FACT depletion next to these heatmaps might show if it colleates with transcription changes as well?

      We appreciate the suggestion, and have included this plot in Response Figure 8, sorted by FACT binding to gene promoters; however, we find it difficult to visualize differences in transcription with non-differential heatmaps.

      Sites losing chromatin accessibility (bottom half of Fig 6C) seem not to have FACT binding (bottom half of Fig 1A), thus it is likely to be indirect effects. It is better to make figures focussing on 'direct effects'.

      We agree that there are sites with reduced chromatin accessibility upon FACT depletion that are not bound by FACT; however, given the extensive binding of FACT at gene-distal regulatory regions (F2D, F4A, F5, F6A/D), we would suggest that these “indirect” effects are possibly the result of FACT-dependent gene-distal regulation.

      Fig 1A and Fig 6C indicated that FACT binding sites (i.e. TSS) decrease chromatin accessibility. I thought it does not fit with the idea of increasing nucleosome occupancy. But actually the data (Fig 7F) shows that TSS does not show increased nucleosome occupancy unlike Fig 7A-E. In fact, Fig 6B showed that about bottom 50% of weaker V5 binding sites decreased chromatin accessibility at 24 hr, which fits with increased nucleosome occupancy in Fig 7A. But then if you looked at only top 50% of stronger V5 binding sites, which did not decrease chromatin accessibility, nucleosome occupancy did not change as well? Why don't you make heatmap of MNase-seq next to Fig 6B?

      We have added heatmaps of non-differential MNase-seq data to Fig. 7A to address both concerns. Regarding Figure 6B, we note that the V5-SPT16 peaks themselves invariantly show decreased chromatin accessibility, and that it is the surrounding chromatin, not the V5-SPT16 peak itself, that shifts from increased to decreased chromatin accessibility at 12-24 hours of depletion. We would also like to clarify that the original heatmaps in Fig 6B were sorted by change in chromatin accessibility at 24h, rather than V5 binding.

      We disagree that the TSSs do not show increased nucleosome occupancy in Fig. 7F, as there is an increase in signal above background directly over the TSS in both replicates, per the differential metaplot shown in Fig. 7B, that is specific to the AID-tagged lines. However, the two clones did show variable results. To address this, we have plotted the non-differential MNase-seq plots (Fig. 7A), which show more consistent trends; it appears that the transformation of the data into differential at this location was the cause of the slightly variable plots over TSSs.

      1. I could not follow based on which data the model in Fig 8 is made. Again it is better to focus in the direct effects.

      Thank you for the suggestion; we have updated our model to focus more on the direct effects.

      Minor points. 10. Line 1 page 5, Kolundzic paper did not have MEF reprograming data. They reported human fibroblast reprogramming was enhanced by FACT KD.

      We appreciate the correction and have clarified the language to specify that the work of Kolundzic et al. included human fibroblast reprogramming and Shen et al. performed MEF reprogramming.

      1. Line 3, I disagree with "these data establish FACT as essential in pluripotent cells". One paper said FACT KD increased proliferation of mESCs, the other paper said chemical inhibition of FACT was necessary for passaging ESCs, but not proliferation. Importance of FACT in pluripotent cells was very unclear to me.

      We have clarified our language to specify that pluripotent cells have a FACT dependency that differentiated cells do not. We note that we were unable to recapitulate a relationship between FACT and trypsinization/passaging of ES cells, suggesting a more nuanced role for FACT in pluripotent cells, in line with work from the Tessarz and Gurova labs.

      Line 7 Page 7, reference the paper with the ChIP-seq data.

      We apologize for the oversight and have added the reference.

      Line 16, Page 7. It doesn't seem the the Cut&run and previously published ChIP-seq data agree well.. >50% look different. It is nothing the authors can do, but can you show venn diagram of peak overlap?

      In response to Reviewer 1, we have generated Response Figure 1 where we display a pie chart of the overlap. In addition to displaying this again to the right in Response Figure 11 this, we have included another analysis below in Response Figure 11, to address this comment. Specifically, we have plotted peak overlaps as a Venn diagram to compare peaks identified in at least two experimental replicates from either the CUT&RUN or ChIP-seq data (left). We have also overlapped replicated peaks between the individual targets and displayed them as a pie chart (right; same as Response Figure 1). While the CUT&RUN data do display a greater signal:noise ratio and call far more peaks, we note that more peak conservation between experiments is relatively consistent (1-6%) between all datasets, including the ChIP-seq experiments profiling opposite factors.

      Overall, we see strongly reproducible trends (albeit with less sharp definition in the ChIP-seq), complemented by highly similar biological feature assignment in Fig. 2D and Pearson correlation values of between 0.76 and 0.78 between SPT16 ChIP-seq and V5-SPT16 CUT&RUN (Supp Fig. S2A).

      __Response Figure 11. Overlaps between SPT16-V5 CUT&RUN, SPT16 ChIP-seq, and SSRP1 ChIP-seq. __Called peaks were compared between V5-SPT16 CUT&RUN, SPT16 ChIP-seq, and SSRP1 ChIP-seq, using both our own analysis pipeline (left) and the peaks published with the original manuscript by Tessarz et al. (2018; right). While our ChIP-seq peak-calling appears to have applied more stringent thresholds, trends are generally agreeable.

      Line 12, 22 page 10. Fig.3AB is 24 hrs. Do not match with the text.

      We apologize for the error and have changed the references in the text to the new panel 3C.

      1. Line 23, 24, page 10, Highlight Klf4 and Myc in the volcano plot.

      We have added KLF4 and MYC annotation to the volcano plot in Fig. 3A, as well as plotted their log2FC over time in the new panel 3B.

      1. Line 18, 19, page 16. This is not accurate statement. Sample 2 increased the accessibility at 6 hours. Sample 1 decreased, but even the control did so.

      We apologize for the unclear wording; we intended to suggest that all timepoints after 6 hours (i.e., 12 and 24 hours) display decreased accessibility directly over the DHS. We have corrected the text.

      1. Line 48-50, page 16. Two replicates show very different patterns. Difficult to agree with the statement based on the figure.

      We agree that the differential replicate patterns are not ideal; however, both replicates display an increase in nucleosome-sized reads over the promoter region, consistent with our ATAC-seq results presented in Fig 6C. Size distribution plots did not suggest differences in MNase digestion between samples, and neither quartile/RPGC nor TMM-based normalization fully solved this issue. Because our ATAC-seq datasets agree with the general trends identified by MNase-seq (which are consistent, despite technical differences between clones), we do not believe that the differences constitute biological difference, but rather experimental noise. We have included a heatmap of non-differential MNase-seq signal around TSSs in Fig 7A to highlight the experimental reproducibility between replicates. Based on this analysis it appears that the transformation of the data into differential at this location was the cause of the slightly variable plots over TSSs.

      1. Line 15, page 19. Where does "1.5 times" come from? which is 1.5 times more, and is that different from the proportion of those?

      We apologize for the unclear reference to the altered transcripts in Table 1 and have changed our wording to be more precise.

      1. Line 32, page 19. Is Fig S2B correct figure?

      We appreciate the correction; the text should have referred to Fig. 4 and has been fixed.

      Line 35-39, page 21. I understand FACT does not bind to silenced loci. If FACT does not bind, it is not surprising that expression from those loci does not change upon FACT deletion. I do not understand what the authors said.

      We agree that a lack of binding and unchanged expression after FACT depletion at putative silencers are unsurprising; given FACT’s extensive genic and gene-distal binding, we wished to show a class of transcribed regions unbound by FACT as a control, to show that non-FACT-regulated transcription was not affected by FACT transcription. We have clarified our wording in the text to emphasize that a lack of change was expected at silencers.

      Reviewer #3 (Significance (Required)):

      Previously it has been shown that Oct4 physically interacts with the FAcilitates Chromatin Transactions (FACT) complex. Seemingly contradicting phenotypes have been reporting upon suppression of FACT function in the maintenance and induction of pluripotent cells. Mylonas has reported that knockdown of SSRP1, a component of FACT complex, increased ESC proliferation (2018). Shen has described that chemical inhibition of FACT complex affected passaging of ESCs, but proliferation was not affected without passaging. Kolundzic has found that both SSRP1 and SUPT16H, another component of FACT complex, enhance human fibroblast reprogramming into iPSCs (2018), while Shen has reported that chemical inhibition of FACT blocks mouse iPSC generation form MEFs.

      My expertise lies on pluripotent stem cells and transcriptional regulations. I did like the Auxin-mediated FACT degradation system these authors used and acute depletion of FACT is an excellent way of evaluating FACT function in ESC, compared to previously published shRNA based knockdown or use of a chemical inhibitor. However, as I described above, it was not very clear what could the direct effects and I feel looking at 24 hours after depletion might be to late to address this question.

      We appreciate the review and agree that acute depletion of FACT has great potential to understand the complex’s function in ES cells. We understand that the nature of gene-distal regulation does make it difficult to cleanly elucidate direct regulation, and hope that our revisions have clarified that our goal was to examine direct, gene-distal regulation, rather than indirect effects. We would like to note that we examined transcription and chromatin accessibility after 3, 6, 12, and 24 hours of 3-IAA treatment, with all these data included in the original manuscript, and saw minimal change (likely because FACT was not fully depleted until later timepoints); to capture the true biological effects of FACT depletion, we explored most thoroughly the 24 hour 3-IAA treatment to understand the downstream effects between FACT loss and cellular differentiation. However, we have expanded discussion and analyses of the earlier timepoints in this revised manuscript.

    1. Let us, therefore, turn to the experience itself. Upon a black cloth two squares of gray cardboard lie side by side. I am to judge whether or not they are of equal grayness. What is my experience? I can think of four different possibilities. (1) I see on a black surface one homogeneous gray oblong with a thin division line which organizes this oblong into two squares. For simplicity's sake we shall neglect this line, although it has varying aspects. (2) I see a pair of "brightness steps" ascending from left to right. This is a very definite experience with well-definable properties. just as in a real staircase the steps may have different heights, so my experience may be that of a steep or a moderate ascent. It may be well-balanced or ill-balanced, the latter e.g. when there is a middle gray on the left and a radiant white on the right. And it has two steps. This must be rightly understood. If I say a real stair has two steps, I do not say there is one plank below and another plank above. I may find out later that the steps are planks, but originally I saw no planks, but only steps. Just so in my brightness steps: I see the darker left and the brighter right not as separate and independent pieces of color, but as steps, and as steps ascending from left to right. What does this mean? A plank is a plank anywhere and in any position; a step is a step only in its proper position in a scale. Again, a sensation of gray, for traditional psychology, may be a sensation of gray anywhere, but a gray step is a gray step only in a series of brightnesses. Scientific thought, concerned as it is with real things, has centered around concepts like "plank" and has neglected concepts like "step."[7] Consequently the assertion has become true without qualification that a "step" is a "plank". Psychology, although it is [p. 541] concerned with experiences, has invariably taken over this mode of procedure. But since the inadequacy occasioned by the neglect of the step-concept is much more conspicuous in psychology than it is in physics, it is our science that first supplied the impulse to reconsider the case. And when we do reconsider, we see at once that the assertion "a sensation of gray is a sensation of gray anywhere" loses all meaning,[8] and that the assertion that a real step is a plank is true only with certain qualifications.
    1. this article may be the first time you’re reading about, and considering, the complexities of Asian Americans as racialized subjects

      For me this I really am a first timer in terms of learning about the Asian American experience. It is very interesting to learn about more cultures and think of how we all face some sort of inequality.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      • *

      The authors proposed that the stable and opened membrane neck that connects the bud to the cytoplasm may persist for a long time in the infected cell during active RNA production. The viral ring-shaped nsPs is supposed to have an important role of maintaining this stable high-curvature membrane neck. It is suggested that the nsP1 dodecamer may pull together the membrane inner surface in the neck region via electrostatic interactions. Namely the authors observed that in the absence of negatively charged membrane lipids nsP1 did not bind appreciably to the membrane. The presented experimental data and theoretical consideration suggest that the CHIKV spherule consists of a membrane bud filled with viral RNA, and has a macromolecular complex gating the opening of this bud to the cytoplasm.

      The presented results are interesting, the manuscript is well written and can be published after revision. The following comments are offered to the authors' consideration.

      We thank the reviewer for this positive overall assessment.

      1.Since there is no protein coating over the curved surface of the membrane bud, the authors concluded that the membrane neck must be stabilised by specific mechanism involving nsP1. It was further assumed that the viral protein nsP1 serves as a base for the assembly of of a larger protein complex at the neck of the membrane bud. In addition to suggested mechanism of the neck stabilization, thin highly curved membrane neck can be stabilised also by accumulation of the membrane components having the appropriate membrane curvature (i. E. negative intrinsic curvature or anisotropic intrinsic curvature), see Kralj-Iglic et al., Eur. Phys. J. B., 10: 5-8 (1999), https://doi.org/10.1007/s100510050822.

      Please discuss this issue in the manuscript.

      This is a good point, thank you for making it. In the revised manuscript we discuss both the possibility of lipid sorting into the neck region by nsP1 (lines 217-222), and the mentioned paper regarding anisotropic inclusions (lines 268-271).

      • *

      2.In Eq. (1) the Gaussian curvature term (appearing in Helfrich bending energy term) is not included. Usually this term is omitted in the case of closed membrane shapes (i.e. so-called spherical topology) due to validity of the Gauss-Bonnet theorem. In the present manuscript/work the shape equation was solved for the membrane patch. Can you therefore please explain shortly to the reader why you can omit the Gaussian curvature term from Eq.(1). For example due fixed inclination angle and foxed curvature at the boundary, .....

      Thanks for finding this omission. We have now revised the manuscript to describe why we can omit the Gaussian curvature term (lines 241-245).

      • *

      • *

      3.«Sigma« and »P« can be considered also as global Lagrange multipliers for the constraint of the fixed total membrane area of the bud (including the neck membrane) and the constraint of the fixed volume of the bud. If you then take into account separately also the equation for the fixed membrane area you could predict different shapes of the bud (by solving the shape equation) at fixed area of the bud, calculated for different values of the model parameters (and different boundary conditions) - in this case Sigma is the result of variational procedure (as well P if you consider also the constraint for the fixed volume of the bud). See for example Medical & Biological Engineering & Computing, vol. 37, pp. 125-129, 1999 and J. Phys. Condens. Matter, vol. 4, pp. 1647-1657, 1992. Can you please shortly discuss in the manuscript also this issue.

      This is an interesting point. We now discuss this and cite the mentioned papers at the end of the theory section in the supplementary information (lines 203-205) as well as briefly mentioning it when discussing Eq. 1 (lines 240-242).

      • *

      **Referees cross-commenting**

      I agree as well.

      • *

      Reviewer #1 (Significance (Required)):

      The presented experimental and theoretical results are interesting, the manuscript is well written and can be published after revision.

      We thank the reviewer for this appreciative comment.

      • *

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In their manuscript "Architecture of the chikungunya virus replication organelle" Laurent and colleagues show:

      • *

      - the 3D structure of the "neck complex" that forms the gateway between the Chikungunya virus replication/transcription organelle (termed "spherule") and the cytoplasm of infected cells. The structure was obtained by native electron cryo-tomography and sub-tomogram averaging of BHK cells infected with a single-cycle replicon system encoding all components of the viral replication machinery. The nominal resolution of the structure is 28 Å. The viral nsP1 protein, for which two high-resolution structures have previously been published, could unambiguously be located within the density of the neck complex.

      • *

      - nsP1 interaction with membranes relies on lipids with a single negative net charge, such as POPS, POPG and PI, whereas two different PIPs with a negative net charge greater than one support nsP1 binding less efficiently. These membrane determinants for nsP1 binding were elucidated using two complementary methods: multilamellar vesicle pulldown assays and confocal imaging of fluorescently labeled giant unilamellar vesicles in the presence of fluorescently labeled nsP1. Purified nsP1 was produced in E. coli.

      • *

      - nsP1 recruits nsP2 (another component of the neck complex) to membranes with suitable lipid composition. This observation was made using the same multilamellar vesicle pulldown assay.

      • *

      - the 3D organization of the viral genome within the spherule, demonstrating that each spherule contains one copy of the genome as a double-stranded RNA molecule. This analysis was carried out by segmentation of the same tomograms that were used to visualize the neck complex.

      • *

      - the force exerted by RNA polymerization within the spherules is sufficient to drive membrane remodeling. This is a theoretical argument based on mathematical modelling.

      • *

      Major comments:

      The article is written clearly and all major claims seem justified. The biochemical assays are presented in duplicates or triplicates, which is sufficient to derive the provided conclusions. The workflow for electron cryo-tomography analysis seems sound, even though the low number of individual particles (=64) for sub-tomogram averaging of the neck complex limits the resolution of its final structure. Given the strong competition in the field, and considering the high experimental workload that would be required for further improvement of the resolution, I do not recommend any additional benchwork for this paper.

      We thank the reviewer for this assessment, especially for recognising the challenge in obtaining a larger number of spherule subtomograms under the complex replicon particle protocol we had to use in order to study the BSL3 CHIKV under BSL2 conditions.

      • *

      My only concern is the accuracy of the experimental genome length measurements, which has important implications for their mechanistic interpretation. The type of tomograms that have been recorded here inherently suffers from anisotropy with respect to both resolution and contrast. This makes accurate tracing of tangled filaments very challenging, and in this light, I congratulate the authors for the impressively good agreement of their average experimentally determined genome length with the theoretical genome length (Figure 4C). As to be expected, however, the second supplementary video clearly shows multiple gaps in the traced genome, implying that there must necessarily be errors in the length measurements. Unless there is a possibility to confidently estimate the magnitude of these errors, my preferred interpretation would be that the vast majority of imaged spherules - regardless of their temporary volume in the moment of sample freezing - likely contains precisely one copy of the double-stranded RNA genome, and not fractions thereof as is suggested in the text (for example, line 305: "Analysis of the cryo-electron tomograms gave a clear answer to the question of the membrane bud contents: the lumen of full-size spherules consistently contains 0.8-0.9 copies."). I feel that this subject deserves more discussion in the manuscript. If the authors prefer to keep their original interpretation that the majority of spherules contains only fractions of full genomes, I invite them to provide an explanation for why their experimental genome length measurements are sufficiently accurate to favor this rather surprising conclusion over my more trivial interpretation. If I understand correctly, my preferred interpretation has implications for the mathematical model for membrane remodeling (Equation 2).

      This is a good point. In fact, we agree that our original manuscript and wording was unclear and we agree with the reviewer’s interpretation (“my preferred interpretation would be that the vast majority of imaged spherules - regardless of their temporary volume in the moment of sample freezing - likely contains precisely one copy of the double-stranded RNA genome”). We have now changed the text to reflect that we believe we have a 10-20% false negative rate in the filament tracing and that the most likely interpretation is indeed that each spherule has exactly one genome copy (lines 207-210). In addition, we looked at the possible consequences of the slight underestimation of the filament length for the mathematical model, and describe on lines 257-264 why this in fact would have no impact on the conclusions of the modeling.

      • *

      Minor comments:

      Virus taxa should be capitalized and written in italics wherever applicable. I recommend adhering to the following rules:

      https://talk.ictvonline.org/information/w/faq/386/how-to-write-virus-species-and-other-taxa-names

      Thank you for helping us clarify this. In response to this we have now italicized and capitalized all virus taxa.

      Figure 2I looks as if the pink cross-section of nsP1 has not been scaled correctly. Comparison to Figure 2H gives me the impression that the diameter of the pink nsP1 ring in Figure 2I should be scaled down relative to the greyscale neck complex.

      We would like to than the reviewer for their keen eye. There was indeed a scaling problem, which we have now solved in the updated Fig. 2.

      • *

      The caption of Figure 2 calls more panels than are provided in the figure. The caption "panel E" seems to be obsolete.

      Thanks for finding this mistake. We have now revised Fig. 2 and its legend.

      • *

      In the methods, centrifugation speed should be given in units of relative centrifugal force (rcf) instead of revolutions per minute (rpm), especially for the MLV pulldown assay where no rotor is indicated.

      We agree and have changed this on lines 482,490,524,531,543 and 597 of the manuscript

      • *

      In the methods for the MLV assay, the lipid:protein ratio is given with 500:1. It should be specified whether this is a mass ratio or a molar ratio.

      It was molar ratio which we have now specified on line 595.

      In the methods, the buffer composition for the mass photometry measurement should be indicated.

      Good point. We added this on lines 632-633.

      • *

      **Referees cross-commenting**

      • *

      I agree to the other reviewers' remarks.

      • *

      Reviewer #2 (Significance (Required)):

      • *

      Chikungunya virus is a very important human pathogen, and research on the architecture of its replication/transcription organelle holds great promise for the development of future therapies. Laurent and colleagues advanced this field by providing pioneering low-resolution 3D structures of the membrane-bound viral protein complex and the viral RNA content of this organelle in situ. In addition, they also assessed the lipid requirements for membrane interaction of the primary viral membrane anchor of this complex, nsP1, in vitro. Underlining the importance of these results, a competing group submitted a partially overlapping study to BioRXiv three months ahead (https://doi.org/10.1101/2022.04.08.487651). Whereas the competing group describes the structure of the neck complex at a much higher resolution, it neither analyzes the RNA content of the spherules nor does it address the lipid preferences of nsP1. The present study by Laurent and colleagues should therefore be of great interest to many virologists and cellular biologists.

      • *

      I am a structural virologist with a focus on envelope glycoproteins. Of relevance to this review, I have experience with cellular electron cryo-tomography and sub-tomogram averaging, as well as in-vitro protein/liposome interaction assays. I do not feel qualified to evaluate the details of the mathematical model for membrane remodeling that is used in the last results section of this manuscript.

      We thank reviewer 2 for this positive overall assessment of our work.

      • *

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      • *

      This is an interesting and well written paper describing the replication spherules generated by Chikungunya virus. Cryo-electron tomography was used to determine a low-resolution structure of the spherule, suggesting that nsP1 is located at the neck of the spherule. Segmentation of the tomograms combined with mathematical modeling was used to produce a structural model for RNA organization in the spherule, suggesting that each spherule contained approximately one copy of a full double-stranded RNA genome. I have a few minor comments:

      • *

      We are thankful for this positive overall assessment of our work.

      • *

      The structural studies were complemented with lipid binding assays, showing that nsP1 has an affinity for anionic lipids. While interesting, the connection of these experiments to the rest of the study seems tenuous. There is no further mention of them in the discussion or how they relate to the tomography and their replication model.

      We agree that those data were not as well integrated into the paper as they could have been, and are thankful that the reviewer pointed this out. To improve the integration of these data into the manuscript, we have expanded on two ways in which the reconstitution data relate to the rest of the paper: (i) the tomography led us to hypothesise that nsP1 recruits other nsPs to the membrane, which we could confirm with the reconstitution (lines 151-152, and throughout that parapgraph), and (ii) the lipid preferences of nsP1 that we could measure using the titrating pulldown experiments inform the possible models for how the spherule memebrane is remodeled since nsP1 binds lipids that cannot on their own stabilize a neck shape (lines 217-222). We have also slightly expanded the discussion of the biochemistry and its relation to other data in the paper (lines 307-311).

      • *

      It is a nice match between the calculated length of the RNA (assumed to be ds) and the length of the vector, but the segmentation of the RNA is not completely convincing based on the provided images. It is difficult to distinguish the RNA strands from the noise and other components in the spherule and, at least by eye, the segments do not seem very connected. Please provide some more details on the tracing algorithm. Has it been validated on a known system?

      We appreciate this comment and recognise that we did not sufficiently explain the tracing algorithm. This software was in fact custom written (by others, ca 10 years ago) for cryo-electron tomography and has since been used by others in several studies of cellular cryo-electron tomograms, e.g. to study actin cytoskeleton. We now mention this in the results (lines 195-196) and methods (lines 462-463).

      The tomogram video is nice, but it would be good to see a raw image as well, preferably covering a wider view that includes the whole cell, as well as a tomogram that represents the entire field of the reconstruction.

      This is a good suggestion. We unfortunately cannot provide images covering the entire cells since this is beyond the field of view of the electron microscope (and an image montage was not acquired at the time of data collection). However, we are now providing an additional supplementary movie that shows the entire field of view of the tomogram. In addition, we have uploaded two of the tomograms (including the uncropped tomogram from Figure 1) to EMDB where they will be downloadable by everyone after publication. We hope the reviewer appreciates that this is all that is technically possible at the moment.

      • *

      In figure 2, the panels are mislabeled relative to the legend, which refers to the color guide as its own panel.

      Thanks for pointing this out, we have rectified this in the revised Fig. 2 and its legend.

      Line 405: C36 symmetry? Why? Shouldn't it be C12 symmetry?

      36-fold symmetry was applied to the lipid membrane part to smoothen it further. The membrane part of the structure is simply outlining the neck shape and this is better visualised in this smoothened representation as also done e.g. in the study of the coronavirus neck complex (Wolff et al, Science 2020). We changed the methods text to make this more clear (line 449).

      • *

      Line 409: "fit" should be "fitted"

      Thanks, Corrected in the revised manuscript line 454.

      **Referees cross-commenting**

      • *

      I think we are all in good agreement, and I believe that the concerns raised can be addressed though a better explanation of the methods and improved discussion of their results.

      We also agree and believe we have addressed all of the remaining concerns in the revised manuscript.

      • *

      • *

      Reviewer #3 (Significance (Required)):

      • *

      This is a rather focused study, showing tomography data on the alphavirus replication complex. The main significance of the study is the description of the spherule's dimension and its relationship to the nature of the RNA, which provided a model for the replication process. While somewhat narrow in scope, the study should be of interest to people working in the virus replication and virus structure field. The lipid data are interesting, but does not seem well integrated with the rest of the study.

    1. LDST 200: Introduction to Leadership Studies and Applications Fall 2022 (Aug. 22-Oct. 14) Instructor Information: Jacob H. Stutzman, Ph.D Email: jhstutz@ku.edu Office Hours: by appointment (Links to an external site.) Required Materials Heifetz, R., Grashow, A., & Linsky, M. (2009). The practice of adaptive leadership: Tools and tactics for changing your organization and the world. Boston, MA: Harvard Business Press Institute for Leadership Studies. (2020). LDST 201: Introduction to Leadership Coursepack (5e) [provided on Canvas] Course Outcomes Upon completion of this course, students will examine and recall various theoretical approaches to leadership and leadership development; recall the four core leadership competencies and integrate each competency into their personal leadership development; explain and differentiate the role of ethics, diversity, and community development in leadership; theorize the ethical implications and applications of Adaptive Leadership and the four core leadership competencies; identify acts of Adaptive Leadership and distinguish between technical problems and adaptive challenges; distinguish between a learning/experimenting paradigm versus a problem/solution paradigm, along with contrasting the strengths and limitations of both; evaluate his/her own personal leadership strengths and challenges based on deliberate reflection; effectively communicate knowledge about and applications of leadership to others. How will we get from where we are to where we are hoping to go? Each week, you will work through a module that includes video lectures and readings (both from the assigned texts and some provided by the instructor) that will help you build a base of knowledge about leadership studies generally and Adaptive Leadership specifically. Each module will also include a quiz, a journal, and other writing assignments designed to help you put your knowledge to use by testing it and applying it to relevant scenarios. Most of the assignments will be completed individually, but there will be a limited amount of collaborative work. By thoughtfully and carefully completing each assignment, you will develop your knowledge of Adaptive Leader and explore the ways in which the principles of Adaptive Leadership can be useful in your own contexts. Assessments/Assignments Journals (7 @25 pts ea.) In each module (except for Module 8) there will be a prompt based on the material in the module. Responses should integrate the material from that week. The assignment expectations and sample journal entries are available on pp. 16-23 of the coursepack. Quizzes (6, 80 pts total) In each module (except for Module 8), there is a short quiz based on the material in the module. You may take each quiz twice, but the most recent score will always be the score that is recorded. Additional information is on p. 24 of the coursepack. Exams (2 @80 pts ea.) There will be two multiple choice exams in the course. These exams will cover material presented in the readings, webinars, supporting documents, and videos comprising the Modules. Exams will open and close with the Modules, so each exam must be taken before the Module deadline for that week. Exam dates are listed in the course schedule. Once you login to take the exam, you must complete it within 60 minutes. You will only have one chance to complete each exam. Study guides for each exam are available on Canvas throughout the semester. Reflection Paper (150 pts) You will complete a final reflection paper that draws on the material covered throughout the course. A full description, rubric, and sample paper are in the coursepack on pp. 38-50 of the coursepack. Application Project (3 parts @80 pts ea.) You will work through a three-part project to do the work of Adaptive Leadership in a community to which you belong. Each phase of the project will build on the work done before, with the first phase due in Module 3. TruTalent Assessment/Letter (10 pts for the results, 40 pts for the letter) Each of you will complete the TruTalent assessment through the University Career Center and upload your results when they are ready. In Module 5, there is also an assignment that calls on you to reflect on your results in the form of a letter. The specific assignment will be available in that module. Ethics Discussions (30 pts) In Modules 4, 5, & 6, you will respond to a prompt and then reply to your classmates' responses in an annotation assignment. Details will be available in Module 4. Ethics Paper (75 pts) Using the framework provided in Module 4, each of you will prepare an ethics case study on a situation of your choosing. Details are in Module 4. Self-Care Plan (40 pts) Each of you will also complete a self-care plan, because doing leadership is hard work, and you can't pour from an empty cup. Details for the assignment are in Module 7. Total points available: 1000 Grade Distribution           🦄 B+: 875-899 C+: 775-799 D+: 675-699 💩 A: 925-1000 B: 825-874 C: 725-774 D: 625-674 💩 A-: 900-924 B-: 800-824 C-: 700-724 D-: 600-624 F: 0-599 Schedule Module Date Open Date Due Items Due Course Information Module; Module 1--Introduction Aug. 22 Aug. 27 Pre-Course Survey; Values Worksheet; Journal Module 2--History of Leadership Theories Aug. 22 Sept. 3 Quiz; Journal Module 3--Introduction to Adaptive Leadership Aug. 28 Sept. 10 Quiz; Journal; TruTalent results; Application Phase 1 Module 4--Diversity and Ethics Sept. 4 Sept. 17 Quiz; Journal; Ethics Paper; Ethics Annotation: Exam 1 Module 5--Leadership and Personality Sept. 11 Sept. 24 Quiz; Journal; TruTalent Letter; Ethics Annotation Module 6--Manage Self and Energize Others Sept. 18 Oct. 1 Quiz; Journal; Application Phase 2, Ethics Annotation Module 7--Diagnose the Situation and Intervene Skillfully Sept. 25 Oct. 8 Quiz; Journal; Self-Care Plan Module 8--Celebrations of Knowledge Oct. 2 Oct. 15 Application Phase 3; Final Reflection Paper; Exam 2 Policies, Procedures, and the Like Canvas and Email This course will use Canvas for the dissemination of all lecture materials and reading assignments (other than the textbook), as well as the collection of all assessments. It is the student's responsibility to regularly check Canvas for updates and information. Emails sent through Canvas will go to your KU email address, so you must also check that email address regularly for information and communication. If you send an email from a non-university email address, I will reply to that address, but any emails I initiate will go to your university address. Assignments should not be submitted via email unless explicit, case-by-case arrangements are made. Incompletes In accordance with KU's policy on incompletesLinks to an external site., an I should only be assigned when some portion of the work for a course has not been done, for reasons beyond a student's control. Incompletes should be rare and will be assigned only in rare circumstances. If you believe such circumstances apply to your situation, please contact me as soon possible. Civility Each of us is an adult that has made the choice to be in this course. Recognizing that choice, each of us is expected to respect all points of view expressed in the classroom. Each person in this classroom should feel free to express her/his opinion and should feel an obligation to ensure that everyone else in the room feels the same freedom. Intolerance and incivility will not be tolerated, though disagreement and reasoned argument are strongly encouraged. Title IX of the Education Amendments of 1972 prohibits sex discrimination against any participant in an educational program or activity that receives federal funds, including federal loans and grants. Title IX also prohibits student‐to‐student sexual harassment. If you encounter unlawful sexual harassment, gender‐based discrimination, or other forms or prohibited harassment/discrimination, please talk with your professor or with the Office of Institutional Opportunity & Access at 785‐864-6414, or go to the Institutional Opportunity & AccessLinks to an external site. page for more information and reporting tools. Academic Integrity and Intellectual Property Academic misconduct of any kind is not tolerated in this class. Both the definition of academic misconduct and potential sanctions for it are defined by KU policyLinks to an external site.. Plagiarizing another's work. knowingly misrepresenting the source of any academic work, giving or receiving of unauthorized aid on assignments, and acting dishonestly in research are all subject to penalties. Similarly, submitting all or portions of an assignment completed in another class for a grade in this class is an act of academic misconduct. If you have outside work that you believe is appropriate and valuable to include in an assignment for this course, please speak with your instructor to establish appropriate guidelines. Additionally, all work produced for and in this course remains the intellectual property of the creator, including but not limited to: the textbooks, the lectures, and student assignments. No work may be reused, reproduced, or distributed without the express permission of the work's creator. This includes sharing notes or course materials to commercial or nonprofit services/databases. This policy does NOT include taking notes for personal use or a student volunteer taking notes for someone with a reasonable accommodation identified by the Student Access Center. Accessibility If you believe you need or would benefit from the accommodation of a disability, please contact the Student Access CenterLinks to an external site. to discuss accommodations. Since accommodations may require early planning and generally are not provided retroactively, please contact the Center as soon as possible.

      My name is Eden. I'm from the most haunted town in Kansas, Atchison! My major is Philosophy and my minor is in history. I think my goal for this class is to learn more about leadership in a conceptual way, so that I can apply it in more real-life situations. My walk up song would be Wake up by Young the giant!

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Neville et al addresses the link between the localization and the activity of the so-called "Pins complex" or "LGN complex", that has been shown to regulate mitotic spindle orientation in most animal cell types and tissues. In most cell types, the polarized localization of the complex in the mitotic cell (which can vary between apical and basolateral, depending on the context) localizes pulling forces to dictate the orientation. The authors reexplore the notion that this polarized localization of the complex is sufficient to dictate spindle orientation, and propose that an additional step of "activation" of the complex is necessary to refine positioning of the spindle.

      The experiments are performed in the follicular epithelium (FE), an epithelial sheet of cell that surrounds the drosophila developing oocyte and nurse cells in the ovarium. Like in many other epithelia, cell divisions in the FE are planar (the cell divides in the plane of the epithelium). The authors first confirm that planar divisions in this epithelium depends on the function of Pins and its partner mud, and that the interaction between the two partners is necessary, like in many other epithelial structures. Planar divisions are often associated with a lateral/basolateral "ring" of the Pins complex during mitosis. The authors show that in the FE, Pins is essentially apical in interphase and becomes enriched at the lateral cortex during mitosis, however a significant apical component remains, whereas mud is almost entirely absent from the apical cortex. Pins being "upstream" of mud in the complex, this is a first hint that the localization of Pins is not sufficient to dictate the localization of mud and of the pulling forces.

      The authors then replace wt Pins, whose cortical anchoring strongly relies on its interaction with Gai subunits, with a constitutively membrane anchored version (via a N-terminal myristylation). They show that the localization of myr-Pins mimics that of wt-Pins, with a lateral enrichment in mitosis, and a significant apical component. Since a Myr-RFP alone shows a similar distribution, they conclude that the restricted localization of Pins in mitosis is a consequence of general membrane characteristics in mitosis, rather than the result of a dedicated mechanism of Pins subcellular restriction. Remarkably, Myr-Pins also rescues Pins loss-of-function spindle orientation defects.

      They further show that the cortical localization of Pins does not require its interaction with Dlg (unlike what has been suggested in other epithelia). However, spindle orientation requires Dlg, and in particular it requires the direct Dlg/Pins interaction. The activity of Dlg in the FE appears to be independent from khc73 and Gukholder, two of its partners involved in its activity in microtubule capture and spindle orientation in other cell types. Based on all these observations, the authors propose that Dlg serves as an activator that controls Pins activity in a subregion of its localization domain (in this case, the lateral cortex of the mitotic FE cell). They propose to test this idea by relocalizing Pins at the apical cortex, using Inscuteable ectopic expression. With the tools that they use to drive Inscuteable expression, they obtain two populations of cells. One population has a stronger apical that basolateral Insc distribution, and the spindle is reoriented along the apical-basal axis; the other population has higher basolateral than apical levels of Insc distribution, and the spindle remains planar. The authors write that Pins localization is unchanged between the two subsets of cells (although I do not entirely agree with them on that point, see below), and that although Mud is modestly recruited to the apical cortex in the first population, it remains essentially basolateral in both. In this situation, the localization of Insc in the cell is therefore a better predictor of spindle orientation than that of Pins or Mud. Remarkably, removing Dlg in an Insc overexpression context leads to a dramatic shift towards apical-basal reorientation of the spindle, suggesting that loss of Dlg-dependent activation of the lateral Pins complex reveals an Insc-dependent apical activation of the complex. Overall, I find the demonstration convincing and the conclusion appropriate. One of the limitations of the study is the use of different drivers and reporters for the localization of Pins, which makes it hard to compare different situations, but not to the point that it would jeopardize the main conclusions. I do not have major remarks on the paper, only a few minor observations and suggestion of simple experiments that would complete the study.

      Minor:

      What happens to Pins and Mud in Dlg mutant cells that overexpress Insc and behave as InscA? Are they still essentially lateral, or are they more efficiently recruited to the apical cortex?

      This is a terrific question. Of course we would love to know and intend to find out.

      One way to do this (consistent with the manuscript) would be to generate flies that are Dlg[1P20], FRT19A/RFP-nls, hsflp, FRT19A; TJ-GAL4/+; Pins-Tom, GFP-Mud/UAS-Insc. (Note that these flies would only allow us to image Mud; we would have to repeat the experiment using GFP FRT19A; hsflp 38 to see Pins. This isn’t ideal given that we’d like to image both together). Generating these flies is a major technical challenge because of the number of transgenes and chromosomes involved.

      Our preferred way to do this would be to generate flies that are Dlg[1P20]/Dlg[2]; TJ-GAL4/+; Pins-Tom, GFP-Mud/UAS-Insc. So far, we’ve been unsuccessful. We are now undertaking a modified crossing scheme that we hope will solve the problem, though we aren’t overly optimistic about the outcome. We find that the temperature-sensitive mutation Dlg[2] presents an activation barrier; while we are able to generate flies that are Dlg[2] / FM7 in combination with transgenes and/or mutations on other chromosomes, we do not always recover the Dlg[2] / Y males (which must develop at 18degrees) from these complex genotypes.

      In the longer term (outside the scope of revision), we are working to develop more tools for imaging Mud and Pins that we hope will help answer this question.

      Regarding the competition between Pins and Insc for dictating the apical versus basolateral localization of Insc, the Insc-expression threshold model could be easily tested in Pins62/62 mutants, where it is expected that only InscA localization should be observed, even at 25{degree sign}C (unless Pins is required for the cortical recruitment of Insc, as it is the case in NBs - see Yu et al 2000 for example).

      This is another great experiment and one we’d love to carry out. Again, the genetics are currently challenging, only because both UAS-Inscuteable and FRT82B pinsp62 are on the third chromosome. (Right now we’re trying to hop UAS-Inscuteable to the second).

      However, we do have another idea for testing the threshold model, which is to repeat the experiment in which we express UAS-Insc in cells that are DlgIP20/IP20 at 25oC. Because the relevant cells (UAS-Insc OX in Dlg mitotic clones) are relatively rare, we have not yet been able to collect enough examples to make a firm conclusion. However, our preliminary results (only six cells so far!) suggest that more InscB cells are observed at the lower temperature, consistent with the threshold model.

      I do not agree with the authors on P.10 and Figure 6A-D, when they claim that the apical enrichment of Pins is equivalent in both InscA and InscB cells. The number of measured cells is very low, and the ratio of apical/lateral Pins differs between the two sets of cells. The number of cells should be increased and the ratios compared with a relevant statistic method.

      Totally fair. We are working to add more data to these panels (6B and 6D). The trend observed in 6D may be softening in agreement with the reviewer’s prediction, although we currently don’t yet have enough new data points to be confident in that conclusion. Therefore, we have not yet updated the manuscript, though we expect to do so during the revision period. We will also add a statistical comparison. Importantly, as the reviewer suggested, this does not alter our conclusions.

      A lot of the claims on Pins localization rely on overexpression (generally in a Pins null background) of tagged Pins expressed from different promoters or drivers, and fused to different fluorescent tags. Therefore, it is difficult to evaluate to which extent the localization reflects an endogenous expression level, and to compare the different situations. As the cortical localization of Pins relies on interaction with cortical partners (mostly GDP-bound Gai) which are themselves in limiting quantity in the cell, and in the case of Gai-GDP, regulated by Pins GDI activity, this poses a problem when comparing their distribution, because the expression level of Pins may contribute to its cortical/cytoplasmic ratio, but also to its lateral/apical distribution. Although I understand that the authors have been using tools that were already available for this study, I think it would be more convincing if all the Pins localization studies were performed with endogenously tagged Pins, even those with Myr localization sequences. In an age of CRISPR-Cas-dependent homologous recombination, I think the generation of such alleles should have been possible. Although this would probably not change the main claims of the paper, it would have made a more convincing case for the localization studies.

      We don’t disagree at all with this point. We did indeed try to stick with the published UAS-Pins-myr-GFP, not only for convenience but because it allows us to make comparisons to other studies using the same tool (Chanet et al Current Biology 2017 and Camuglia et al eLife 2022). Another consideration is that we used only one driver across our experiments (Traffic jam-GAL4). It is quite weak at the developmental stages that we examine, meaning that overexpression is not a major concern. (Indeed we have struggled with the opposite problem).

      We certainly take the reviewer’s comment seriously and we therefore described it in the manuscript. We are currently working to develop endogenous tools using CRISPR.

      Paragraph added to Discussion – Limitations of our Study:

      “Another technical consideration is that our work makes use of transgenes under the control of Traffic jam-GAL4. While this strategy allows us to compare our results with previous work employing the same or similar tools, a drawback is that we cannot guarantee that Traffic jam-GAL4 drives equivalent expression to the endogenous Pins promoter (Chanet et al., 2017, Camuglia et al., 2022). However, given that Traffic jam-GAL4 is fairly weak at the developmental stages examined, we are not especially concerned about overexpression effects.”

      The authors should indicate in the figure legends or in the methods that the spindle orientation measurements for controls for Pins62/62 are reused between figures 1, 3, 4, 5, 6 , and between figure 3, 4 and 5, respectively.

      Absolutely. Added to the Methods section.

      Reviewer #1 (Significance (Required)):

      Altogether, this study makes a convincing case that the localization of the core members of the pulling force complex, Pins and Mud, is not entirely sufficient to localize active force generation, and that the complex must be activated locally, at least in the FE. The notion of activation of the Pins/LGN complex has probably been on many people's mind for years: Pins/LGN works as a closed/open switch depending on the number of Gai subunits it interacts with, it must be phosphorylated, etc... suggesting that not all cortical Pins/LGN was active and involved in force generation. However the study presented here shows an interesting case where localization and activation are clearly disconnected. The authors show how Dlg plays this role in physiological conditions in the FE, and use ectopic expression of Insc to show that, at least in an artificial context, Insc can have the same "activating activity" (or at least an activating activity that is stronger than its apical recruitment capability and stronger than Dlg's activating activity). It is to my knowledge the first case of such a clear dissociation. In their discussion, the authors are careful not to generalize the observation to other tissues. Although I did not reexplore all that has been published on the Pins/LGN-NuMA/Mud complex over the last 20 years, my understanding is that despite interesting cases of distribution of the complex like that of Mud in the tricellular junction in the notum, the localization model can still explain most of the phenotypes that have been described without invoking an activation step. If it is the case, then the activation model is another variation (an interesting one!) on the regulation of the core machinery, which are plentiful as the authors indicate in their introduction, and is maybe specific to the FE; if not, then it would be interesting to push the discussion further by reexamining previous results in other systems, and pinpointing those phenotypes that could be better explained with an activation step.

      Overall, I find this is an elegant piece of work, which should be of interest to many cell and developmental biologists beyond the community of spindle orientation aficionados.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): Summary: The manuscript by Neville et al. addressed the mechanism how conserved spindle regulators (Pins/Mud/Gai/Dynein) control spindle orientation in the proliferating epithelia by revising "the canonical model", using the Drosophila follicular epithelium (FE). The authors examined the epistatic relationship among Pins, Mud and Dlg in FE and found that Pins controls the cortical localization of Mud by utilizing mutant analyses, and suggested their localization does not fully overlap using the newly generated knock-in allele. They also showed that Pins relocalization during mitosis depends on cortical remodeling, or passive model, where Pins localization changes with other membrane-anchored proteins. Their data further suggest that Pins cortical localization is not influenced by Dlg, but Pins-interacting domain of Dlg does affect spindle orientation. Based on these results, the authors propose that Dlg controls spindle orientation not by redistributing Pins, but by promoting (or "activating" from their definition) Pins-dependent spindle orientation. Interestingly, ectopic expression of Inscuteable (Insc) suggested that Insc localization, either apical or lateral, correlates with spindle orientation, and their localization is a dominant indicator of spindle orientation, compared to the localization of Pins and Mud, implicating potentially distinct roles of activation and localization of the spindle complex. Overall their genetic experiments are well-designed and provide stimulation for future research. However, their evidence is suggestive, but not conclusive for their proposal. I have several concerns about their conclusion and would like to request more detailed information as well as to propose additional experiments.

      Major concerns: 1. This report lacks technical and experimental details. As the typical fly paper, the authors need to show the exact genotypes of flies they used for experiments. This needs to be addressed for Figures 1-6, and Supplemental Figures. Especially, which Gal4 drivers were used for UAS-Pins wt or mutant constructs in Figure 4 with pins mutant background, Khc73, GUKH mutant backgrounds. Which exact flies were used for mutant clone experiments for Supplemental Figure 3? (A for typical mosaic, and B for MARCM). Without these details, it is impossible to evaluate results and reproduce by others.

      We take this concern very seriously!

      • We listed the GAL4 driver (Traffic jam-GAL4) in the first section of the Materials and Methods: Expression was driven by Traffic Jam-GAL4 (Olivieri et al., 2010). The transgene and relevant citation have been added to Table 1.
      • We explained the background stock for the MARCM experiment in the Materials and Methods: Mosaic Analysis with a Repressible Cell Marker (after the method of Lee and Luo) was carried out using GFP-mCD8 (under control of an actin promoter) as the marker. The transgene and relevant citation have been added to Table 1.
      • In line with other fly studies (eg. Nakajima et al., Nature 2013) and our own Drosophila work (Bergstralh et al Current Biology 2013, Bergstralh*, Lovegrove*, St Johnston NCB 2015, Bergstralh et al Development 2016, Finegan et al EMBO J 2019, Cammarota*, Finegan* et al Current Biology 2020) we were careful to show the relevant genotype components in each figure.
      • We included a fully referenced Supplementary Table (Table 1 – Drosophila genetics) listing every mutant allele or transgene with a citation and a note about availability. We have expanded this table in response to the author’s concern (see above).

        Related to the comment 1, how did the author perform "clonal expression of Ubi-Pin-YFP" in page 5? As far as I understand, Ubi-Pin-YFP is expressed ubiquitously by the ubiquitin promoter.

      The reviewer makes a good point. We regret that we did not make this experiment more clear. Ubi-Pins-YFP was recombined onto an FRT chromosome (FRT82B). We made mitotic clones.

      We have clarified this in the Methods section as follows:

      “Mitotic clones of Ubi-Pins-YFP were made by recombining the Ubi-Pins-YFP transgene onto the FRT82B chromosome”

      1. In page 6, if Pins relocalization is passive and is associated with membrane-anchored protein remodeling during mitosis, its relocalization can be suppressed by disrupting the process of mitotic remodeling (mitotic rounding). The authors should test this by either genetic disruption or pharmacological treatment for the actomyosin should cause defects in Pins relocalization, which bolster their conclusion.

      We agree that this is a cool experiment and are happy to give it another shot. However, we do note that interpretation could be difficult. We don’t know that mitotic rounding and membrane-anchored protein remodeling during mitosis are inextricably linked. Notably, the remodeling we describe reflects cell polarity; apical components are evidently moved to the lateral cortex. This is contrary to understanding of rounding, which reflects isotropic actomyosin activity (Chanet et al., (2017) Curr.Biol. & Rosa et al., (2015) Dev. Cell.). Therefore we don’t understand what a “negative” result would mean, or for that matter that a “positive” result would be safe to interpret.

      We have attempted many strategies to prevent cell rounding in the follicular epithelium, none of which have successfully prevented rounding. 1) We attempted to genetically knockdown Moesin in the FE and did not see an effect on cell rounding. However we couldn’t confirm knockdown and therefore are not confident in this manipulation. 2) It is difficult to interpret the result of genetically disrupting Myosin, because it causes pleiotropic effects, such as inhibition of the cell cycle, and disruption of monolayer architecture. 3) We treated egg chambers with Y-27632 (a Rok-inhibitor) and examined its effect on mitotic cell rounding and on cytokinesis, which are Rok-dependent processes. Our experiments were performed using manually-dissociated ovarioles treated for 45 minutes in Schneider Cell Medium supplemented with insulin. Even at our maximum concentration of 1mM Y-27632, several orders of magnitude above the Ki, we are unable to see any effect on mitotic cell shape or actin accumulation at the mitotic cortex and did not observe any evidence of defective cytokinesis. We also did not observe defects in spindle organization or orientation, as would be expected from failed rounding. We therefore do not believe that the inhibitor works in this tissue. One possible explanation is that the follicle cells are secretory, and likely to pass molecules taken up from the media quickly into the germline. Therefore, we do not anticipate that we can perform this experiment to our satisfaction.

      1. The critical message in this manuscript is that the core spindle complex mediated by Pins-Mud controls spindle orientation by "activation", but not localization. The findings that Pins and Mud localization is not influenced by Insc and that ecotpic Insc expression and genetic Mud depletion (Figure 6) might support their proposal, but these results just suggest their localization does not matter. I wonder how the authors could conclude and define "activation". What does this activation mean in the context of spindle orientation? Can the authors test activation by enzymatic activity or assess dynamics of spindle alignment?

      We intend for the critical message of the manuscript to be that “The spindle orienting machinery requires activation, not just localization.” We absolutely do not make the claim that localization is not important, only that it is not sufficient. The reviewer recognizes this point here and in a subsequent comment: “The authors showed that Pins and Mud localization themselves are not sufficient for the control of spindle orientation with genetic analyses.”

      We also do not claim that Pins and/or Mud localization are not impacted by Inscuteable. On the contrary, we plainly see and report that they are; the intensity profiles in Figure 6 are distinct from those in Figure 2, as discussed in the text.

      We appreciate the reviewer’s point about activation. Since we do not understand these proteins to be enzymes, we aren’t sure what enzymatic activity would be tested. The dynamics of spindle alignment in this slowly developing system are prohibitively difficult to measure: the mitotic index is very low (~2%) and only a very small fraction of those cells will be in a focal plane that permits accurate live imaging in the apical-basal axis. Alternative modes of activation include conformational change and/or a connection with other important molecules. The simplest possibility would be that Dlg allows Pins to bind Mud, but so far our data do not support it. We have added the following paragraph to our discussion:

      “The mechanism of activation remains unclear. While the most straightforward possibility is that Dlg promotes interaction between Pins and Mud, our results show that Mud is recruited to the cortex even when Dlg is disrupted (Figure 4D). Alternatively, Discs large may promote a conformational change in the spindle-orientation complex and/or a change in complex composition. Furthermore, the Inscuteable mechanism is not likely to work in the same way. Dlg binds to a conserved phosphosite in the central linker domain of Pins and should therefore allow for Pins to simultaneously interact with Mud (Johnston et al., 2009). Contrastingly, binding between Pins and Inscuteable is mediated by the TPR domains of Pins, meaning that Mud is excluded (Culurgioni et al., 2011; 2018). While a stable Pins-Inscuteable complex has been suggested to promote localization of a separate Pins-Mud-dynein complex, our work raises the possibility that it might also or instead promote activation.”

      1. In page 7-8, although Pins-S436D rescue spindle orientation, but Pins-S436A does not in pins null clones background, Pins localization is not influenced by Dlg. This questions how exactly Pins and Dlg can interact, and how Dlg affect Pins function. Related to this observation, in the embryonic Pins:Tom localization in dlg mutant does not provide strong evidence to support their conclusion given the experimental context is different from previous study (Chanet et al., 2017).

      We agree with the reviewer. Our data (this paper and previous papers) and the work of others indicate that this interaction is important for spindle orientation (Bergstralh et al., 2013a; Saadaoui et al., 2014; Chanet et al., 2017). However, we show here that Dlg doesn’t obviously impact Pins localization (as proposed in our earlier paper), but does impact the ability of the spindle orientation machinery to work (hence activity).

      The reviewer makes a very good point. Our experimental context is different from the previous study concerning Pins and Dlg in embryos: Chanet et al (2017) performed their work in the embryonic head, whereas we look at divisions in the ventral embryonic ectoderm. These are distinct mitotic zones (Foe et al. (1989) Development) and exhibit distinct epithelial morphologies. We show that Pins:Tom localizes at the mitotic cell cortex in Dlg[2]/Dlg[1P20] in cells in the ventral embryonic ectoderm. Our only conclusion from this experiment is that Pins:Tom can localize without the Dlg GUK domain in another cell type (outside the follicular epithelium). In the current preliminary revision we have softened our claim as follows:

      “We also examined the relationship between Pins and Dlg in the Drosophila embryo. A previous study showed that cortical localization of Pins in embryonic head epithelial cells is lost when Dlg mRNA is knocked down (Chanet et al., 2017). We find that Pins:Tom localizes to the cortex in the ventral ectoderm of early embryos from Dlg1P20/Dlg2 mothers, indicating that Pins localization in the ventral embryonic ectoderm epithelium does not require direct interaction with Dlg. We therefore speculate that Dlg plays an additional role in that tissue, upstream of Pins (Figure 4G).

      Our intention is to elaborate on our findings with additional data from embryos. To this end we have already acquired preliminary control data investigating the spindle angle with respect to the plane of the epithelium, and are in the process of examining spindle angles in dlg mutant embryonic tissue.

      In page 11, the authors state "... that activation of pulling in the FE requires Dlg". I was not convinced by anything related to "pulling". There is no evidence to support "pulling" or such dynamic in this paper, just showing Mud localization, correct?

      We appreciate the reviewer’s concern. The original sentence read that “We interpret [our data] to mean that interaction between Pins and Dlg, which is required for pulling, stabilizes the lateral pulling machinery even if Dlg is not a direct anchor.” This statement is based on work across multiple systems, including the C. elegans embryo (Grill et al Nature 2001), the Drosophila pupal notum (Bosveld et al, Nature 2016), and HeLa cells (Okumura et al eLife 2018), which shows that Mud/dynein-mediated pulling (on astral microtubules) orients/positions spindles. This is described in the introduction.

      To address the reviewer’s particular concern, we have replaced “pulling” with “spindle-orentation machinery,” so that this sentence now reads …“activation of the spindle-orientation machinery in the FE requires Dlg.”

      1. Ectopic expression of Insc (Figure 6) provided a new idea and hypothesis, but the conclusion is more complicated given that Insc is not expressed in normal FE. For example, the statement that "Inscuteable and Dlg mediate distinct and competitive mechanism for activation of the spindle-orienting machinery in follicle cells" is probably right, but it does not show anything meaningful since Insc does not exist in normal FE. Is Dlg in a competitive situation during mitosis of FE? If so, which molecules are competitive against Dlg? The important issue is to provide a new interpretation of how spindle orientation is controlled epithelial cells. I strongly recommend to add models in this manuscript for clarity.

      We considered the addition of model cartoons very carefully in preparing the original manuscript, and again after review. While we are certainly not going to “dig in” on this point, our concern is that model figures would obscure rather than clarify the message. As the reviewer points out, we do not understand how activation works, and as discussed in the manuscript we don’t think it’s likely to work the same way in follicle cells (Dlg) as it does in neuroblasts (Insc). Therefore model figure(s) are premature.

      We do not agree with the statement that "Inscuteable and Dlg mediate distinct and competitive mechanism for activation of the spindle-orienting machinery in follicle cells… does not show anything meaningful.” This is a remarkable finding because it suggests that there is more than one way to activate Pins. Given the critical importance of spindle orientation in the developing nervous system, and the evolutionary history of the Dlg-Pins interaction, we think that this finding supports a model in which the Dlg-Pins interaction evolved in basal organisms, and a second Inscuteable-Pins interaction evolved subsequently to support neural complexity. These ideas are raised in the Discussion.

      The reviewer also writes that “The important issue is to provide a new interpretation of how spindle orientation is controlled epithelial cells.” We find this concern perplexing, since the reviewer clearly recognizes that we have provided a new interpretation: Dlg is not a localization factor but rather a licensing factor for Pins-dependent spindle orientation.

      Minor comments: 8. Some sections were not written well in the manuscript. "It does not" in page 6. "These predictions are not met". I just couldn't understand what they stand for. Their writing has to be improved.

      Again, we are not going to dig in here, but we would prefer to retain the original language, which in our opinion is fairly clear. Our study is hypothesis-driven and based on assumptions made by the current model. We used direct language to help the reviewer understand what happened when we tested those assumptions.

      1. In page 9, Supplementary Figure 4 should be cited in the paragraph (A potential strategy for..), not Supplemental Figure 1A, and 1B.

      Good catch, thank you! We have corrected this.

      1. In page 10, the authors examine aPKC localization in Insc expressing context of FE. Does aPKC localization correlate with Insc localization (Insc dictates aPKC?)? aPKC is not involved in spindle orientation from the author's report (Bergstralh et al., 2013), so it does not likely provide any supportive evidence.

      I’m afraid we don’t entirely understand this comment. The interdependent relationship between aPKC and Inscuteable localization is long-established in the literature and was previously addressed in the follicle epithelium (Bergstralh et al. 2016). We do not make the claim here that aPKC governs spindle orientation. We are emphasizing that the difference between InscA and InscB cells extends to the relocalization of polarity components involved in Insc localization. As described in the manuscript, these data are provided to support our threshold model:

      “In agreement with interdependence between Inscuteable and the Par complex, we find that aPKC is stabilized at the apical cortex in InscA cells but enriched at the lateral cortex in InscB cells (Figure 6E). This finding is consistent with an Inscuteable-expression threshold model; below the threshold, Pins dictates lateral localization of Inscuteable and aPKC. Above the threshold, Inscuteable dictates apical localization of Pins and aPKC.”

      1. In Dicussion page 12, "In addition, we find that while the LGN S408D (Drosophila S436D) variant is reported to act as a phosphomimetic, expression of this variant has no obvious effect on division orientation (Johnston et al., 2012)". Where is the evidence for this? I interpret that this phosphomimetic form can rescue like wt-Pins not like unphospholatable mutant S436A, so it means that have an effect on spindle orientation, correct?

      The reviewer makes a good point. We regret the confusion. We mean to point out that the S436D variant is no different from the wild type. We have amended the text to clarify:

      “In addition, we find that while the LGN S408D (Drosophila 436D) variant is reported to act as a phosphomimetic, this variant does not cause an obvious mutant phenotype in the follicular epithelium (Johnston et al., 2012). What then is the purpose of this modification? Since the phosphosite is highly conserved through metazoans, one possibility to consider is that the phosphorylation regulates the spindle orientation role of Pins, whereas unphosphorylated Pins plays a different role (Schiller and Bergstralh, 2021).”

      Reviewer #2 (Significance (Required)):

      The authors showed that Pins and Mud localization themselves are not sufficient for the control of spindle orientation with genetic analyses. While the authors tried to challenge the concept of "canonical model", there is no clear demonstration of "activation" of the spindle complex. I appreciate their genetic evidence and new results, and understand the message that Pins and Mud effects are beyond localization, but there is no alternative mechanism to support their model. At the current stage, their evidence provides more hypothesis, not conclusion. Based on my expertise in Developmental and Cell biology, I suggest that the work has an interest in audience who studies spindle machinery, but for general audience.

      We think that the reviewer fundamentally shares our perspective on the study. Our work tests assumptions made by the canonical model and shows that they aren’t always met (meaning that the question of how spindle orientation works in epithelia at least is still unsolved), and that in the FE at least one component (Dlg) has been misunderstood. We reach a major conclusion, which is that localization of Pins is not enough to predict spindle orientation in the FE.

      It’s absolutely true that the precise molecular role of Dlg has not been solved by our study. This is a major question for the lab, and we are currently undertaking biochemical work to address it. It’s probably more work than we can (or should) do on our own, which is just one reason to share our current results with colleagues.

      One fundamental reason for undertaking this study is that 25 years of spindle orientation studies released into an environment in which “positive” conclusions are the bar for publication success may have burdened the field with claims that are overly-speculative. We appear to have contributed to this problem ourselves in 2013. With that in mind we contend that providing an alternative molecular mechanism at this point is premature and would impair rather than improve the literature.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Neville et al re-examine the role and regulation of Pins/LGN in Drosophila follicular epithelial cells. They argue that polar or bipolar enrichment of Pins localisation at the plasma membrane is not crucial for spindle orientation, and therefore propose that Pins must be somehow activated to function. These interpretations are not supported by the data. However, the data strongly suggest an alternative interpretation which is of major biological significance.

      As an initial point, we disagree with the summary above. We do not argue that enrichment of Pins is not crucial for spindle orientation. We argue that enrichment of Pins is not sufficient. This is why we titled the paper “The spindle orienting machinery requires activation, not just localization” instead of “The spindle orienting machinery requires activation, not localization.”

      Although we disagree with reviewer, we appreciate their criticism of our manuscript and are glad for the opportunity to clarify our findings. In our responses to the specific comments (below) we explain why our data contradict the reviewer’s model and what we will do to improve the manuscript.

      Comments:

      1. In the experiments on Dlg mutants (Fig 4D, S3) visualising Pins:Tom, the wild-type needs to be shown next to the Dlg mutant image, otherwise a comparison cannot be made. For example, Pins:Tom looks strongly enriched at the lateral membranes in the wild-type shown in Fig 2B&C, but much more weakly localised at the lateral membranes in Dlg1P20/2 mutants in Fig 4D. Thus, it looks like the Dlg GUK domain is required for full enrichment of Pins:Tom at lateral membranes, even if some low level of Pins can still bind to the plasma membrane in the absence of the Dlg GUK domain. Quantification would likely show a reduction in Pins:Tom lateral enrichment in the Dlg1P20/2 mutants - consistent with the spindle misorientation phenotype in these mutants.

      The reviewer raises a reasonable concern about Figure 4D. We noted the difficulty of imaging Pins:Tom, which is exceedingly faint, in our original manuscript. For technical reasons, only one copy of the transgene was imaged in the experiment presented in 4G (two copies were used in Figure 2B), and the lack of signal presented an even greater challenge. In the manuscript we went with the clearest image. To address the reviewer’s concern, we have added signal intensity plots to this figure showing that Pins:Tom and Pins-myr are both laterally enriched at mitosis in Dlg[1P20]/Dlg[2] mutants. These data have been added as a new panel (E) in Figure 4. We were also able to replace the pictures in 4D with new ones generated after review.

      1. In Fig 4E, the phosphomutant PinsS436A-GFP looks more strongly apical and less strongly lateral than the wildtype Pins-GFP, consistent with the spindle misorientation phenotype in S436A rescued pins mutants.

      The reviewer has an eagle eye! We did not detect a difference in localization across the three transgenes, though we were certainly looking for it (that’s why we generated these flies in the first place). Again, the strength of signal was a major challenge in these experiments, and we therefore went with the cleanest image. In response to the reviewer’s concern, we note that the S436A and S436D examples shown have equivalent apical signal, but only the S436A fails to rescue spindle orientation.

      Together, Reviewer Comments 1 and 2 suggest a model in which Dlg is required for lateral enrichment of Pins at mitosis. As described in the manuscript, this is the very model proposed in our own previous study (Bergstralh, Lovegrove, and St Johnston; 2013), and reiterated in a subsequent review article (Bergstralh, Dawney, and St Johnston; 2017). We point these publications out because the senior author of the current manuscript is not especially enthusiastic about showing himself to be wrong (twice!) in the literature. He therefore insisted on seeing multiple lines of evidence before making the counterargument:

      • The reviewer’s model (the 2013 model) is firstly challenged by work shown in Figure 3. We find that membrane-anchored proteins (even just myristoylated RFP!) demonstrate lateral enrichment at mitosis, regardless of whether or not they interact with the Dlg-GUK domain.
      • Even stronger evidence is shown in Figure 4F. Pins-myr-GFP is very plainly enriched at the lateral cortex in Dlg[IP20]/Dlg[2] mutant cells (now demonstrated with signal intensity plots in FIGURE 4E). However, the spindle doesn’t orient correctly (quantified in 4C). Since Dlg is impacting spindle orientation independently of Pins localization, these data support our “claim in the final sentence of the abstract ‘Local enrichment of Pins is not sufficient to determine spindle orientation; an activation step is also necessary’.”

        In the InscA examples, Pins:Tom looks apical. In the InscB examples, Pins:Tom looks more laterally localised, consistent with the spindle orientations in these experiments.

      These figures (6A-D) don’t only show/quantify Pins:Tom localization. They also show localization of GFP-Mud. Whereas Pins:Tom is certainly apically enriched in the InscA examples, the interesting finding is that GFP-Mud is not. In strong contrast, it instead shows a weak apical localization and a strong lateral enrichment. As described in the manuscript, this pattern of Mud localization predicts normal spindle orientation, which is not observed in these cells.

      Thus, these data appear to support the existing model that Pins enrichment at the plasma membrane is a key factor directing mitotic spindle orientation in these cells. The author's claim in the final sentence of the abstract "Local enrichment of Pins is not sufficient to determine spindle orientation; an activation step is also necessary" is not supported by the data.

      We are pleased that the reviewer shared this quote; our claim is that Pins localization is not sufficient, not that it is unnecessary (see above). We absolutely do not dispute that “Pins enrichment at the plasma membrane is a key factor directing mitotic spindle orientation.”

      The open question posed by the data is why GFP-Mud is excluded apically & basally during mitosis, while Pins:Tom is not. The simple alternative model is that Mud only localises to the plasma membrane where Pins is most strongly concentrated, such that Mud strongly amplifies any Pins asymmetry. Thus, even myr-Pins can still rescue a pins n mutant, because myr-Pins is still enriched laterally compared to apically (or basally).

      Thus, I would strongly suggest re-titling the manuscript to: "Mud/NuMA amplifies minor asymmetries in Pins localisation to orient the mitotic spindle".

      Well, that is a good-looking title, and we’re therefore sorry to decline the suggestion. However, as described above, Figure 4D shows that Pins enrichment does not always predict spindle orientation. More importantly, Figure 6A (cited by the reviewer in Comment 3) very plainly shows that Mud does not “only locali[ze] to the plasma membrane where Pins is most strongly concentrated.” In this picture – and across multiple InscA cells (Figure 6B) - Pins is strongly concentrated at the apical surface, whereas Mud is not.

      Mud/NuMA presumably achieves this amplification by binding to the plasma membrane only where Pins is concentrated above a critical threshold level. This would mean a non-linear model based on cooperativity among Pins monomers that increases the binding avidity to Mud above the threshold concentration of Pins monomers.

      This is essentially a minor revision of the standard model, which we expected would hold true in the FE. As described above, it is not supported by our data.

      Reviewer #3 (Significance (Required)):

      The manuscript is focused on the question of mitotic spindle orientation in epithelial cells, which is a fundamental unsolved problem in biology. The data reported are impressive and important, providing new insights into how the key spindle orientation factors Mud/NuMA and Pins/LGN localise during mitosis in epithelia. I recommend publication after major revisions.

      We are delighted that the reviewer finds our data impressive and important, and our experiments insightful. We understand that the “major revisions” requested are meant to bring the paper in line with their model (our own earlier model). Since the data in our original manuscript contradict that model, the revisions are instead focused on clarifying and strengthening our message.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank for reviewers for their feedback and were pleased they think that the manuscript is “of great interest to the scientific community”. The reviewers agree that the manuscript addresses an important question and that the identification of ASNS as a potential vulnerability of late-stage colorectal cancer is significant. The reviewers agree that our findings would be substantially strengthened by validation in state-of-the-art organoid model systems. We agree with this and are currently liaising with collaborators (Owen Sansom, Beatson Institute and Laura Thomas, Swansea University) to replicate our findings in both mouse and human colorectal organoid models. We will determine the sensitivity of colorectal organoid models to ASNS inhibition across a range of tumorigenicities and mutational profiles representing different stages of the adenoma-carcinoma progression. We believe these experiments will substantially strengthen the manuscript and lend weight to our finding that late-stage adenocarcinoma cells are vulnerable to ASNS inhibition.

      This is the predominant concern across reviewers, we are confident we can address this and all other, relatively minor, concerns as detailed below.

      Please find below a point-by-point reply to the reviewer’s comments. Reviewer comments are in italicized text and our responses follow.

      Reviewer #1

      • All of the findings in this manuscript are limited to in vitro observations, we know that most of the in vitro findings can not be translated in vivo. The manuscript would significantly benefit from in vivo experiments using the cells described in Fig.1 A and confirming the in vitro findings.*

      We agree that validation of our results in a more physiological context would significantly elevate our manuscript. In order to address this, we intend to use both human and mouse colorectal organoid models (please see detailed description of this in response to reviewer 2). We have decided to take this approach rather than conduct in vivoexperiments using the AA series (C1, SB, 10C and M) for two main reasons. Firstly, the C1 and SB cell lines do not form tumours in mice, consistent with them representing early colorectal adenoma cells. As such, we are not able to use the entire series in in vivo experiments. Secondly, we are keen to demonstrate replication of our findings in an alternative model. An organoid model would offer increased functional relevance, whilst allowing us to retain the ability to validate our observed metabolic dependencies across the adenoma to carcinoma sequence. We hope the reviewer agrees that these experiments would address their concerns.

      • The authors should provide proliferation data for the cell lines they used in this manuscript (C1, SB, 10C and M). In Fig. 1 B they show clear differences in EACR, can the authors provide data on glucose uptake differences in these analyzed cell lines.*

      We agree that proliferation and glucose uptake data would be a useful addition to the manuscript. We will provide doubling times for the cell lines used in this study and will measure glucose uptake by analysing extracellular glucose levels in the cell culture media from each of the cell lines.

      • In Figure 2 C the authors should provide isotope tracing data for the upper glycolysis (e.g. glucose and glucose-6-P) and alanine. In Figure 2 D the authors should provide the isotope tracing data for glutamine and glutamate.*

      We have data for glycolytic intermediates; glycerol-3-phosphate and dihydroxyacetone phosphate (DHAP) and alanine and will add them to the figures as requested.

      • Do the authors see any sign of reductive carboxylation in their U-13C glutamine experiments?*

      We observe only a low level of reductive carboxylation across the AA series cell lines (

      • Can the authors speculate how the C1, SB, 10C and M cell lines would react when glucose would be replaced with galactose in the culture environment and forcing the cells to increase oxidative phosphorylation (OXPHOS).*

      We would speculate that the cells would react similarly to our experiments in low glucose conditions displayed in Fig 3A-K. Given that M cells were shown to be the most flexible with regards to fuel source, we would expect them to be able to survive and proliferate more efficiently than the other cell lines in challenging culture conditions. Additionally, we would expect the C1s to survive well in galactose conditions, given that they rely less on glycolysis for ATP production and have significantly higher spare respiratory capacity compared to the more progressed cell lines.

      • Can the authors comment whether C1, SB, 10C and M cell lines show differences in coping with oxidative stress?*

      Again, we would speculate that the M cells would cope with exposure to oxidative stress best, given their metabolic flexibility. However, we would aim to test this by measuring the cellular response to hydrogen peroxide (which would induce oxidative stress) across all cell lines.

      • In the ASNS knockdown experiments do the authors detect an increase in glucose uptake in ASNS deficient cells.*

      This is an interesting question; we will address it by comparing extracellular glucose levels in C1 and M cells transfected with control and siRNA targeting ASNS.

      • Can the authors provide gene expression data that would explain the metabolic switch between early and late-stage adenocarcinoma? Do the authors detect any differences in mTORC1 activation among the C1, SB, 10C and M cell lines? ASNS is an ATF4 target, can the authors provide any expression data on ATF4 in their cell lines.*

      To address the first question, using our proteomics data, we have generated heatmaps showing protein abundance data from key metabolic pathways including glycolysis, the TCA cycle and the electron transport chain in the C1, SB and M cell lines. These data show an array of variation in protein expression of these pathways between the C1, SB and M cells, with no clear up or downregulation of these pathways as a whole, but rather more intricate regulation of clusters of proteins within the pathways. These data align well with the metabolomic data presented in Figure 2 and will allow us to investigate the mechanisms underlying the metabolic switch. These heat maps will be incorporated into the manuscript. Using the heatmaps we will identify and discuss key nodes we predict to explain the metabolic switch between early and late-stage adenocarcinoma. We will then determine whether manipulation of these nodes impact the metabolic phenotype of the cells experimentally. For example, the heat maps have highlighted that ENO3 or enolase 3 is strongly upregulated in the SB and M cells in comparison to the C1 cells and may be involved in driving the metabolic switch. Indeed, ENO3 has previously been found to promote colorectal cancer progression by enhancing glycolysis (Chen et al, Med Oncol, 2022), consistent with what we see here. To test this, we will knock down ENO3 across the cell line series and determine the impact on cellular phenotype and metabolism (using Seahorse extracellular flux analysis).

      With regards to mTORC1 activation, we have further analysed our proteomics data from C1, SB and M cells and have found that the M cells show significantly higher serine 235/236 phosphorylation of ribosomal S6 protein, a common readout for mTORC1 activation, compared to C1 and SB cells. Further, we aim to carry out immunoblotting across the four cell lines to analyse phospho-S6 (relative to total S6), 4E-BP1 and phospho-ULK-1 (relative to total ULK-1) levels.

      With regards to ATF4, using our proteomics data we have generated a heatmap of gene expression changes of ATF4 target genes in C1, SB and M cells that we will provide in supplementary material . These data suggest that there does not appear to be any clear pattern of enhanced or reduced ATF4 transcriptional activity across the cell lines, with different clusters of genes within this signature up or downregulated across the series. Moreover, Ingenuity Pathway Analysis (IPA) revealed that the ATF4 pathway showed an activation z-score of -0.41 (p=0.0134) in SB versus C1 cells, and 0.35 (p=0.00051) in M versus C1 cells (where a threshold of +/- 2 indicates activation/suppression of the pathway, respectively), confirming there is no clear regulation of this pathway between the cell lines. In addition, we will carry out immunoblotting for ATF4 expression levels across the cell line series.

      Reviewer #2

      *Major comments: *

      *Early CRC *

      *Molecular understanding of CRC is obviously of great interest and importance for the clinics. However, tumors of early stages are almost exclusively resected and not treated with systemic agents. Hence, the argument by the authors that the metabolic understanding of early CRC is of clinical relevance is somewhat misleading. Overall, it would have been much more clinically relevant to investigate the multiple steps of later stages during CRC progression. How about metabolic changes during metastasis. Deep mechanistic understanding of process during metastasis has striking clinical relevance. *

      We agree with the reviewer that understanding metastatic progression is of clinical relevance and should indeed be investigated in more detail. Using our model, we do shed light on a vulnerability of late-stage adenocarcinoma cells (sensitivity to asparagine synthetase (ASNS) inhibition). Indeed, we show that ASNS expression is elevated in both colorectal tumour and metastatic tissue in comparison to normal suggesting that our study may have revealed a vulnerability with utility for treating late stage (and potentially metastatic) tumours. The reviewer raises an important issue with the way we frame the utility of the model in the manuscript text. We will rewrite this to emphasise its utility in identifying late-stage vulnerabilities and the clinical value of this approach. We maintain that the molecular understanding of colorectal cancer across all stages of its progression will provide a valuable contribution to the field but agree that we should be more specific with regards to the clinical utility of our findings.

      *Model system *

      The cell lines used in this study are not state-of-the-art to investigate the complex process during CRC progression. The original paper is from 1993 in which the cell lines were generated does not allow understanding of the characteristics of these cell lines. Recently, multiple models have been established, for example in organoids, to investigate the progression of CRC much more reliably. There are systems that use CRISPR/CAS9 edited human organoids that follow the genetic alterations of CRC progression with accompanied phenotypes. Further, extensive biobanks of organoids from patients are available (also commercially) which better represent the stages of CRC. Similarly, the question raised above of how representative this progression cell line set is needs to addressed. The mutagen-induced progression could generate various alterations that are not detected in patients, hence create an artificial system. Overall, biological replicates are missing.

      We thank the reviewer for their critique and agree that our manuscript would be significantly strengthened if we were able to replicate our key findings in another model. We agree that the cell line series we have used here has limitations and we will make sure these are discussed by adding a ‘Limitations’ section to the ‘Discussion’. We maintain that the cell line series is a valuable tool in which to effectively identify metabolic vulnerabilities for further research. A key advantage of this system is that it is a human cell line series of the same lineage. In addition, we can easily conduct metabolomics and stable isotope tracer analysis allowing us to investigate cellular metabolic activity and manipulate any identified pathways easily. As such, the cell line series is an effective tool in which to identify potential vulnerabilities, but we agree that these vulnerabilities need to be validated in state-of-the-art organoid systems for the impact of the work to be clearer.

      To address this, in collaboration with Owen Sansom (Beatson Institute) and Laura Thomas (Swansea University), we aim to validate our identified metabolic dependency in mouse and human colorectal organoids respectively. We will determine the sensitivity of colorectal organoid models across a range of tumorigenicities and mutational profiles representing different stages of the adenoma-carcinoma progression to asparagine synthetase (ASNS) inhibition. We believe these experiments will substantially strengthen the manuscript and lend weight to our finding that late-stage adenocarcinoma cells are vulnerable to ASNS inhibition.

      *Gene Expression analysis *

      In Figure 5 C and D is the expression of ASNS to stages and overall survival from online available datasets correlated. Its unclear what the difference between tumor and metastatic in C means. The labelling in D is too small. Is the difference between the two groups significant? Are these patients only at a specific stage? It seems not that ASNS is a strong prognosticator; further stratification is needed to clarify the role of ASNS in CRC.

      The data displayed in Fig 5C and 5D are from separate datasets so are not correlated. In Fig 5C ‘Tumour’ refers to gene expression from the primary tumour site (in this case the colorectum), whereas ‘Metastatic’ refers to gene expression from a metastatic tumour (from which the primary tumour was of colorectal origin). We will make this clearer in the text and figure legend. We will also make the labelling on the survival plot in D clearer, indicating that the difference between the two groups is significant and displaying the p value clearly.

      The data included in the survival plots in 5D encompass all tumour stages. We have further analysed these data, adjusting for tumour stage. We found that high ASNS expression in later stage tumours (stage 3 and 4) is associated with poorer overall survival, whereas there is no significant difference in overall survival in earlier stage tumours (stage 1 and 2) in relation to ASNS expression. We plan to add this to the supplementary materials and discuss in the main text as it is consistent with our findings from the AA cell line series.

      *Western Blot controls *

      For the Western Blots in Figure 6 A and C the total S6 and ULK1 controls are missing what is needed to assess the effect on pS6 and pULK1 correctly.

      We will add total S6 and ULK1 controls to these figures.

      In the same panels, the KO efficacy is not very high in A (-ASN). However, this is crucial to make the conclusion that this cell line (C1) is not dependent on ASNS.

      The average knockdown efficiency in the C1 cells is 72% across n=3 experiments. Therefore, levels of ASNS are significantly reduced. However, to further validate this finding, we will use L-Albizziine, a competitive inhibitor of ASNS, at the same concentration in both C1 and M cells to eliminate any issues surrounding variation in knockdown efficiency and to replicate the results obtained using ASNS siRNA. These data will be included in supplementary material.

      *Minor comments: *

      *Statistical analysis of proliferation assays *

      The statistical significance for proliferation assays are missing.

      The statistical significance at the final timepoints of the proliferation assays are displayed on bar graphs in Supplementary Figure 5 (Figure S5B and C). We will add these to the proliferation curves in the main figure.

      Reviewer #3

      *A major concern is the model used in this study: *

      Sodium butyrate and the carcinogen N-methyl-N-nitro-nitrosoguanidine (MNNG) were used for the transformation. I believe this model was developed by one of the co-authors of the study, A.C. Williams in the 1990s. The relevance of the model for in vivo colon carcinogenesis is not entirely clear to me and I miss information why in particular sodium butyrate and MNNG were used. I am not an expert on colon carcinogenesis but I did not have the impression that this model has been widely adopted and I miss detailed information on the model itself as well as a critical discussion of its limitations.

      We thank the reviewer for raising these concerns and will include a ‘Limitations’ section in the manuscript ‘Discussion’ to elaborate on both the utility and the limitations of this model system. As described in response to concerns raised by reviewer #1 and reviewer #2, we plan to validate our findings in organoid models of colorectal tumourigenesis to strengthen the discoveries made using the AA cell line series.

      With regards to the use of sodium butyrate and MNNG for transformation of the C1 cells, justification was provided in the original paper describing generation of the cell line model series (Williams et al, Cancer Research. 1990). Sodium butyrate is naturally occurring in the gut and was used for the transformation of the C1 cells as it had been proposed to play a role in promoting colorectal tumorigenesis through upregulating carcinoembryonic antigen (CEA) expression and enhancing proliferation in adenoma cells able to resist growth arrest following treatment (Berry et al, Carcinogenesis. 1988). At the time of generating the cell line series, few reagents were known to induce transformation in human epithelial cells. However, MNNG was one of those and had been previously used to transform keratinocytes (Rhim et al, Science. 1986). Crucially, tumours formed in mice from xenografted 10C cells were found to be heterogeneous, displaying areas of differentiation with glandular organisation, the presence of functional goblet cells enabling mucin production, as well as areas of poorly or undifferentiated cells. Furthermore, cytogenetic analyses revealed that genetic changes in the cell line progression model such as chromosome 18q loss and KRAS activation replicate those seen in CRC patients (Williams et al, Oncogene. 1993). Together, these characteristics recapitulate human tumours in vivo, validating the use of sodium butyrate and MNNG in generating an in vitro CRC cell line model that represents human colorectal tumorigenesis.

      Figure 6: total levels of ribosomal S6 protein and ULK1 should be detected, quantified and used for normalization.

      We agree with the reviewer, we will add total S6 and ULK1 controls to these figures.

      Can you measure ASN upon inhibition of autophagy? Does it go down further?

      This is an interesting question, and we will address this experimentally by measuring ASN levels following treatment with chloroquine in the C1 and M cell lines. We will do this using stable isotope labelling and mass spectrometry and include the results in supplementary material.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      General Comments

      We thank all reviewers for providing very detailed, knowledgeable, and informative reviews.

      All reviewers were complementary about the data and the rigor of the study. Reviewers 2 and 3 commented on the significance of the work, and their assessments were complementary, specifically about the fact that it bridges several previous studies and links these to kinase-phosphatase regulation on the BUB complex. We agree that this a major strength of the work. That is why we also believe the comment by reviewer 1 that “most of the phenotypes/observations are consistent with the literature and not surprising” is actually a strength and not a weakness. Sometimes manuscripts that bring together various different findings into one conceptual model can be very powerful, even if each finding in isolation is not so surprising. In this case, the concept that a dual kinase-phosphatase module integrates two major mitotic processes will, we believe, prove to be a significant breakthrough that helps to explain how these processes are properly integrated at kinetochores.

      The main criticism of all reviewers related to the interpretations and writing style, which in general, we felt were valid. We will take on board all these comments, reword the manuscript during revision, and provide a detailed response to each of these points at resubmission.

      In terms of points requiring new experiments, there were 3 in total:

      1) Reviewers 1 and 3 raised an important issue about the feedback loop which will be addressed with new experiments to uncouple the feedback.

      2) Reviewer 2 made an important point about KNL1 levels, including a good suggestion to perform FRAP analysis to examine BUB complex dynamics when MELT numbers are increased. We will carry out this experiment prior to revision.

      3) Reviewer 1 had a second major comment regarding the modulation of MELT number and how this cannot be directly linked to PLK1/PP2A levels. We have 3 new experiments to add regarding this, performed already, which are discussed in the section below.

      All other comments were textual points that in most case we felt were valid. They showed that all reviewers had a very good grasp of the paper, the concepts, and the field in general. So, we finish by thanking all reviewers again for their thorough and detailed assessments of our manuscript. The comments they raised will help us to improve the manuscript after revision.

      Description of the planned revisions

      Three main points:

      1) The role of the feedback loop [reviewers 1 and 3]:

      The general issue is explained succinctly by reviewer 3’s comment:

      “The argument linking the negative feedback loop to biological functions is not straightforward. The authors provide evidence in Figure 1 for regulatory pathways between docked PLK1 and bound PP2A. However, their assays in Figure 2 bypass the feedback loop by directly modulating PP2A activity. These experiment supports an argument that the kinase/phosphatase activity balance is important, but do not address the feedback loop specifically (which could potentially be done using mutations that disrupt the feedback regulation). The claim that "a homeostatic feedback loop maintains an optimal balance of PLK1 and PP2A on the BUB complex" is too strong because there is no direct evidence connecting the feedback loop to optimal function.”

      This is a good point that we will address at revision. We demonstrate that the enzymes regulate each other on the BUB complex in figure 1 (PLK1 recruits PP2A, and PP2A removes PLK1), which balances their levels on the BUB complex. To determine consequences of upsetting this balance we locked either the kinase-bound or phosphatase-bound states (Figure 2). Importantly, this is required to assess direct phenotypes associated with each, but it does not directly demonstrate the role of the feedback loop. To do this we will generate mutants, as suggested, and analyse their phenotypes.

      We will mutate the PLK1 binding site (T620A) and recover the PLK1-regulated sites in the KARD motif to phospho-mimicking aspartates (S676D/T680D), analyzing effects on PLK1/PP2A recruitment, chromosome alignment and SAC strength. We predict that this will remove PLK1 and recover some PP2A, but to lower levels overall than the BUBR1-B56 fusion. In that case the phenotypes will probably be milder, but that would not change the overall conclusions.

      We maintain that locking PLK1 on its phospho-binding site (in BUBR1-DPP2A) is the ideal scenario to test direct PLK1 roles, but we will also now create alanine mutants of the PLK1 site (S676A/T680A) and the CDK1 site (S670A) to address the feedback loops controlled by CDK1 and PLK1. Our prediction is that these will skew the balance towards PLK1, without fully removing PP2A, again likely to produce milder intermediate phenotypes.

      It is definitely worth testing these predictions, because it would directly address the role of the feedback loop and it would avoid relying solely on “artificially high levels” as mentioned by reviewer 1. One final point on this however, the PLK1 recruitment in DPP2A cells is not artificial – it is PLK1 bound to its native phospho-motif when PP2A is unbound (without any feedback from PP2A this phospho-site and PLK1 binding increase to the observed maximal levels). The fusion of B56 is admittedly less optimal, but this does still lock the phosphatase-bound state in a set stoichiometry, crucially in the absence of kinase. This is required to assess direct phosphatase effects. These PLK1/PP2A levels may well be higher than observed physiologically on the BUB complex when considering the behavior of all BUBR1 molecules, since we doubt they ever reach 1:1 stoichiometry with either PLK1 or PP2A. However, the feedback loop is operating within individual molecules (figure 1), which may well individually flip between PLK1 or PP2A bound states. This may occur on certain molecules at specific times. Therefore, locking the PLK1.PP2A-bound state on all molecules is, in our opinion, still a valid and useful perturbation to assess function of these two states.

      2) The increase recruitment of BUB1-PLK1/PP2A when MELT numbers are increased [reviewer 2]

      "While in the 12x and 19x mutant conditions there are more molecules of BUBs per Knl1, the overall BUB levels are the same as in wild-type controls. Since the MELT repeat used throughout the paper is a consensus sequence that is likely optimal for BUB binding, it is possible that the phenotypes of the 12x and 19x mutants are explained because of an increase in the affinity of BUBs for Knl1 rather than overall levels. This would also help explain why Knl1 and BUBs are observed at the spindle midzone in the 19x mutant (Fig. S4)"

      The reviewer raises an important issue here, when stating that increasing MELT numbers decreases KNL1 kinetochore recruitment. This has the net effect of normalizing overall BUB1-PLK1/PP2A kinetochore levels, even though BUB1-PLK1/PP2A recruitment per KNL1 molecule is increased. That is why we were careful to state BUB1-PLK1/PP2A were increased “on each KNL1 molecule” and not “on kinetochores” when referring to the effect in the 12x/19x MELT mutant. However, this could easily be misinterpreted so this point will be clarified at revision.

      The issue of why the phenotype is so dependent on kinase/phosphatase level per KNL1 molecules is an important one however, which has puzzled us until now. We think the suggestion to look at turnover by FRAP is a good one, because enhanced binding strength could underlie the phenotype here, and potentially explain the lack of disassembly at anaphase. We will perform these experiments at revision to see if they can clarify the issue.

      3) The link between MELT number and PLK1/PP2A levels [Reviewer 1]

      “My second comment relates to the fact that the two parts of the paper are not directly linked although the authors try to do this. They nicely manipulate the MELT repeats on KNL1 to change the number of Bub complexes. However, they cannot directly link the data to changes in Plk1 and PP2A-B56 levels only as many other things are changing. By increasing MELT numbers Bub complex and Mad1/Mad2 levels increase as well as an example and this makes interpretations complicated. To me these experiments are not addressing the main conclusions of the paper.”

      We do not agree with this overall assessment, but there are two elements to this comment: the effect of modulating MELT number on SAC strength (and its link to PLK1) or on KT-MT stability (and link to P2A). We will therefore discuss each separately:

      For SAC regulation, we feel that the data is clear and the interpretations are justified, although we will add new data to support this point after revision. Increasing MELT number causes defects in MELT-BUB dissociation and SAC silencing (4a-c). Importantly, these phenotypes can be completely rescued by inhibiting PLK1 (4d-e). So, we do link the effects of high MELT number to PLK1 activity. Our interpretation is that when MELT numbers are increased the ability of PLK1 to phosphorylate these motifs and maintain the SAC platform is enhanced (when MPS1 is inhibited pharmacologically or upon KT-MT attachment). So, whilst it is true that many factors, such as the kinetochore levels of BUB/MAD1/MAD2, are crucial for the SAC, the ability of PLK1 to maintain these levels (via pMELT-BUB1) is crucial and that changes as MELT number increases. This contributes directly to the observe SAC silencing phenotype, as confirmed by the complete rescue of this phenotype after PLK1 inhibition.

      We did also explore the possibility that increased BUB1 activity could also contribute to SAC strengthening, for example, by enhancing Aurora B recruitment to centromeres. However, BUB1 inhibition did not alter SAC strength or MELT dephosphorylation kinetics. We will add this data after revision.

      We also evaluated the levels of phosphorylated MAD1-pT716, which is important for MCC assembly (Ji et al. 2017, Ji et al. 2018, Faesen, 2017). Our data show that WT and 19xMELT exhibit similar MAD1-pT716 levels during a nocodazole arrest and following MPS1 inhibition. In summary, the main changes we observe are elevated BUB1 levels due to MELT phosphorylation, and increased BUB1 phosphorylation on pT461 (as shown in Figure 4h). All this points towards a localized effect of PLK1 on/around the BUB complex. We will add this data and make this point clearly at revision.

      For KT-MT attachment regulation, we agree that we do not have a similar way to inhibit PP2A-B56 activity to rescue hyperstable microtubule attachment when MELT numbers are high. For this, we require a way to rapidly inhibit PP2A-B56 activity after attachments have formed, something that is not technical feasible at the present. We can also not say for certain that reduced MELT numbers destabilize microtubule due to lack of PP2A, however we feel this is the most like interpretation for the following reasons. The phenotype of removing PP2A from BUBR1 or removing the MELT from KNL1 (along with all associated factors), is identical: mutant cells have comparable chromosome misalignment due to unattached kinetochores (compare 2F-I with 5A-D). Therefore, the additional factors lost by removing the MELTs cannot be having such a strong impact in KT-MT attachment. The obvious factor that could affect attachment strength is again BUB1, via Aurora B recruitment to centromeres. However, loss of BUB1 (after MELT removal) is predicted to enhance attachment stability (reduced Aurora B) and not decrease it, as we observe. So, whilst we cannot definitely conclude that modulating MELT number affect attachment stability via PP2A, we feel that this is certainly the most likely explanation. We will state this clearly in the revised text.

      Description of analyses that authors prefer not to carry out

      “SAC strength of BubR1 WT, ΔC and B56γ was analysed in the presence of nocodazole + MPS1i. It would be interesting to see what the phenotypes are without MPS1i [Reviewer 1]”

      In the absence of MPS1i basal MELT phosphorylation increases (DC) or decreases (B56g) as predicted (Figure 2d; compare timepoint 0 all conditions). This does not cause any change to SAC strength when all kinetochores are unattached in nocodazole (not shown). The sensitize SAC assay (nocodazole + MPSi) has been used by many groups (originally Santaguida et al, 2011; Saurin et al, 2011), because it reduces SAC signals from all unattached kinetochores which would otherwise produce a saturated response. In this case, we specifically chose a dose of MPS1 inhibitor that gave a partial SAC response from which we could observe either strengthening or weakening – a key point of the assay. Indeed, this showed that the SAC was strengthened (DC) or weakened (B56g), as predicted (Figure 2E). The only other way to do this, which has been used by some in the literature, is to use a low dose of nocodazole which prevents all kinetochore from signaling to the SAC. We specifically wanted to avoid this situation because then you cannot untangle the effects on SAC and KT-MT attachment stability – this was crucial in our case.

    1. Who Can Name the Bigger Number?by Scott Aaronson [Author's blog] [This essay in Spanish] [This essay in French] [This essay in Chinese] In an old joke, two noblemen vie to name the bigger number. The first, after ruminating for hours, triumphantly announces "Eighty-three!" The second, mightily impressed, replies "You win." A biggest number contest is clearly pointless when the contestants take turns. But what if the contestants write down their numbers simultaneously, neither aware of the other’s? To introduce a talk on "Big Numbers," I invite two audience volunteers to try exactly this. I tell them the rules: You have fifteen seconds. Using standard math notation, English words, or both, name a single whole number—not an infinity—on a blank index card. Be precise enough for any reasonable modern mathematician to determine exactly what number you’ve named, by consulting only your card and, if necessary, the published literature. So contestants can’t say "the number of sand grains in the Sahara," because sand drifts in and out of the Sahara regularly. Nor can they say "my opponent’s number plus one," or "the biggest number anyone’s ever thought of plus one"—again, these are ill-defined, given what our reasonable mathematician has available. Within the rules, the contestant who names the bigger number wins. Are you ready? Get set. Go. The contest’s results are never quite what I’d hope. Once, a seventh-grade boy filled his card with a string of successive 9’s. Like many other big-number tyros, he sought to maximize his number by stuffing a 9 into every place value. Had he chosen easy-to-write 1’s rather than curvaceous 9’s, his number could have been millions of times bigger. He still would been decimated, though, by the girl he was up against, who wrote a string of 9’s followed by the superscript 999. Aha! An exponential: a number multiplied by itself 999 times. Noticing this innovation, I declared the girl’s victory without bothering to count the 9’s on the cards. And yet the girl’s number could have been much bigger still, had she stacked the mighty exponential more than once. Take , for example. This behemoth, equal to 9387,420,489, has 369,693,100 digits. By comparison, the number of elementary particles in the observable universe has a meager 85 digits, give or take. Three 9’s, when stacked exponentially, already lift us incomprehensibly beyond all the matter we can observe—by a factor of about 10369,693,015. And we’ve said nothing of or . Place value, exponentials, stacked exponentials: each can express boundlessly big numbers, and in this sense they’re all equivalent. But the notational systems differ dramatically in the numbers they can express concisely. That’s what the fifteen-second time limit illustrates. It takes the same amount of time to write 9999, 9999, and —yet the first number is quotidian, the second astronomical, and the third hyper-mega astronomical. The key to the biggest number contest is not swift penmanship, but rather a potent paradigm for concisely capturing the gargantuan. Such paradigms are historical rarities. We find a flurry in antiquity, another flurry in the twentieth century, and nothing much in between. But when a new way to express big numbers concisely does emerge, it’s often a byproduct of a major scientific revolution: systematized mathematics, formal logic, computer science. Revolutions this momentous, as any Kuhnian could tell you, only happen under the right social conditions. Thus is the story of big numbers a story of human progress. And herein lies a parallel with another mathematical story. In his remarkable and underappreciated book A History of π, Petr Beckmann argues that the ratio of circumference to diameter is "a quaint little mirror of the history of man." In the rare societies where science and reason found refuge—the early Athens of Anaxagoras and Hippias, the Alexandria of Eratosthenes and Euclid, the seventeenth-century England of Newton and Wallis—mathematicians made tremendous strides in calculating π. In Rome and medieval Europe, by contrast, knowledge of π stagnated. Crude approximations such as the Babylonians’ 25/8 held sway. This same pattern holds, I think, for big numbers. Curiosity and openness lead to fascination with big numbers, and to the buoyant view that no quantity, whether of the number of stars in the galaxy or the number of possible bridge hands, is too immense for the mind to enumerate. Conversely, ignorance and irrationality lead to fatalism concerning big numbers. Historian Ilan Vardi cites the ancient Greek term sand-hundred, colloquially meaning zillion; as well as a passage from Pindar’s Olympic Ode II asserting that "sand escapes counting." ¨ But sand doesn’t escape counting, as Archimedes recognized in the third century B.C. Here’s how he began The Sand-Reckoner, a sort of pop-science article addressed to the King of Syracuse: There are some ... who think that the number of the sand is infinite in multitude ... again there are some who, without regarding it as infinite, yet think that no number has been named which is great enough to exceed its multitude ... But I will try to show you [numbers that] exceed not only the number of the mass of sand equal in magnitude to the earth ... but also that of a mass equal in magnitude to the universe. This Archimedes proceeded to do, essentially by using the ancient Greek term myriad, meaning ten thousand, as a base for exponentials. Adopting a prescient cosmological model of Aristarchus, in which the "sphere of the fixed stars" is vastly greater than the sphere in which the Earth revolves around the sun, Archimedes obtained an upper bound of 1063 on the number of sand grains needed to fill the universe. (Supposedly 1063 is the biggest number with a lexicographically standard American name: vigintillion. But the staid vigintillion had better keep vigil lest it be encroached upon by the more whimsically-named googol, or 10100, and googolplex, or .) Vast though it was, of course, 1063 wasn’t to be enshrined as the all-time biggest number. Six centuries later, Diophantus developed a simpler notation for exponentials, allowing him to surpass . Then, in the Middle Ages, the rise of Arabic numerals and place value made it easy to stack exponentials higher still. But Archimedes’ paradigm for expressing big numbers wasn’t fundamentally surpassed until the twentieth century. And even today, exponentials dominate popular discussion of the immense. Consider, for example, the oft-repeated legend of the Grand Vizier in Persia who invented chess. The King, so the legend goes, was delighted with the new game, and invited the Vizier to name his own reward. The Vizier replied that, being a modest man, he desired only one grain of wheat on the first square of a chessboard, two grains on the second, four on the third, and so on, with twice as many grains on each square as on the last. The innumerate King agreed, not realizing that the total number of grains on all 64 squares would be 264-1, or 18.6 quintillion—equivalent to the world’s present wheat production for 150 years. Fittingly, this same exponential growth is what makes chess itself so difficult. There are only about 35 legal choices for each chess move, but the choices multiply exponentially to yield something like 1050 possible board positions—too many for even a computer to search exhaustively. That’s why it took until 1997 for a computer, Deep Blue, to defeat the human world chess champion. And in Go, which has a 19-by-19 board and over 10150 possible positions, even an amateur human can still rout the world’s top-ranked computer programs. Exponential growth plagues computers in other guises as well. The traveling salesman problem asks for the shortest route connecting a set of cities, given the distances between each pair of cities. The rub is that the number of possible routes grows exponentially with the number of cities. When there are, say, a hundred cities, there are about 10158 possible routes, and, although various shortcuts are possible, no known computer algorithm is fundamentally better than checking each route one by one. The traveling salesman problem belongs to a class called NP-complete, which includes hundreds of other problems of practical interest. (NP stands for the technical term ‘Nondeterministic Polynomial-Time.’) It’s known that if there’s an efficient algorithm for any NP-complete problem, then there are efficient algorithms for all of them. Here ‘efficient’ means using an amount of time proportional to at most the problem size raised to some fixed power—for example, the number of cities cubed. It’s conjectured, however, that no efficient algorithm for NP-complete problems exists. Proving this conjecture, called P¹ NP, has been a great unsolved problem of computer science for thirty years. Although computers will probably never solve NP-complete problems efficiently, there’s more hope for another grail of computer science: replicating human intelligence. The human brain has roughly a hundred billion neurons linked by a hundred trillion synapses. And though the function of an individual neuron is only partially understood, it’s thought that each neuron fires electrical impulses according to relatively simple rules up to a thousand times each second. So what we have is a highly interconnected computer capable of maybe 1014 operations per second; by comparison, the world’s fastest parallel supercomputer, the 9200-Pentium Pro teraflops machine at Sandia National Labs, can perform 1012 operations per second. Contrary to popular belief, gray mush is not only hard-wired for intelligence: it surpasses silicon even in raw computational power. But this is unlikely to remain true for long. The reason is Moore’s Law, which, in its 1990’s formulation, states that the amount of information storable on a silicon chip grows exponentially, doubling roughly once every two years. Moore’s Law will eventually play out, as microchip components reach the atomic scale and conventional lithography falters. But radical new technologies, such as optical computers, DNA computers, or even quantum computers, could conceivably usurp silicon’s place. Exponential growth in computing power can’t continue forever, but it may continue long enough for computers—at least in processing power—to surpass human brains. To prognosticators of artificial intelligence, Moore’s Law is a glorious herald of exponential growth. But exponentials have a drearier side as well. The human population recently passed six billion and is doubling about once every forty years. At this exponential rate, if an average person weighs seventy kilograms, then by the year 3750 the entire Earth will be composed of human flesh. But before you invest in deodorant, realize that the population will stop increasing long before this—either because of famine, epidemic disease, global warming, mass species extinctions, unbreathable air, or, entering the speculative realm, birth control. It’s not hard to fathom why physicist Albert Bartlett asserted "the greatest shortcoming of the human race" to be "our inability to understand the exponential function." Or why Carl Sagan advised us to "never underestimate an exponential." In his book Billions & Billions, Sagan gave some other depressing consequences of exponential growth. At an inflation rate of five percent a year, a dollar is worth only thirty-seven cents after twenty years. If a uranium nucleus emits two neutrons, both of which collide with other uranium nuclei, causing them to emit two neutrons, and so forth—well, did I mention nuclear holocaust as a possible end to population growth? ¨ Exponentials are familiar, relevant, intimately connected to the physical world and to human hopes and fears. Using the notational systems I’ll discuss next, we can concisely name numbers that make exponentials picayune by comparison, that subjectively speaking exceed as much as the latter exceeds 9. But these new systems may seem more abstruse than exponentials. In his essay "On Number Numbness," Douglas Hofstadter leads his readers to the precipice of these systems, but then avers: If we were to continue our discussion just one zillisecond longer, we would find ourselves smack-dab in the middle of the theory of recursive functions and algorithmic complexity, and that would be too abstract. So let’s drop the topic right here. But to drop the topic is to forfeit, not only the biggest number contest, but any hope of understanding how stronger paradigms lead to vaster numbers. And so we arrive in the early twentieth century, when a school of mathematicians called the formalists sought to place all of mathematics on a rigorous axiomatic basis. A key question for the formalists was what the word ‘computable’ means. That is, how do we tell whether a sequence of numbers can be listed by a definite, mechanical procedure? Some mathematicians thought that ‘computable’ coincided with a technical notion called ‘primitive recursive.’ But in 1928 Wilhelm Ackermann disproved them by constructing a sequence of numbers that’s clearly computable, yet grows too quickly to be primitive recursive. Ackermann’s idea was to create an endless procession of arithmetic operations, each more powerful than the last. First comes addition. Second comes multiplication, which we can think of as repeated addition: for example, 5´3 means 5 added to itself 3 times, or 5+5+5 = 15. Third comes exponentiation, which we can think of as repeated multiplication. Fourth comes ... what? Well, we have to invent a weird new operation, for repeated exponentiation. The mathematician Rudy Rucker calls it ‘tetration.’ For example, ‘5 tetrated to the 3’ means 5 raised to its own power 3 times, or , a number with 2,185 digits. We can go on. Fifth comes repeated tetration: shall we call it ‘pentation’? Sixth comes repeated pentation: ‘hexation’? The operations continue infinitely, with each one standing on its predecessor to peer even higher into the firmament of big numbers. If each operation were a candy flavor, then the Ackermann sequence would be the sampler pack, mixing one number of each flavor. First in the sequence is 1+1, or (don’t hold your breath) 2. Second is 2´2, or 4. Third is 3 raised to the 3rd power, or 27. Hey, these numbers aren’t so big! Fee. Fi. Fo. Fum. Fourth is 4 tetrated to the 4, or , which has 10154 digits. If you’re planning to write this number out, better start now. Fifth is 5 pentated to the 5, or with ‘5 pentated to the 4’ numerals in the stack. This number is too colossal to describe in any ordinary terms. And the numbers just get bigger from there. Wielding the Ackermann sequence, we can clobber unschooled opponents in the biggest-number contest. But we need to be careful, since there are several definitions of the Ackermann sequence, not all identical. Under the fifteen-second time limit, here’s what I might write to avoid ambiguity: A(111)—Ackermann seq—A(1)=1+1, A(2)=2´2, A(3)=33, etc Recondite as it seems, the Ackermann sequence does have some applications. A problem in an area called Ramsey theory asks for the minimum dimension of a hypercube satisfying a certain property. The true dimension is thought to be 6, but the lowest dimension anyone’s been able is prove is so huge that it can only be expressed using the same ‘weird arithmetic’ that underlies the Ackermann sequence. Indeed, the Guinness Book of World Records once listed this dimension as the biggest number ever used in a mathematical proof. (Another contender for the title once was Skewes’ number, about , which arises in the study of how prime numbers are distributed. The famous mathematician G. H. Hardy quipped that Skewes’ was "the largest number which has ever served any definite purpose in mathematics.") What’s more, Ackermann’s briskly-rising cavalcade performs an occasional cameo in computer science. For example, in the analysis of a data structure called ‘Union-Find,’ a term gets multiplied by the inverse of the Ackermann sequence—meaning, for each whole number X, the first number N such that the Nth Ackermann number is bigger than X. The inverse grows as slowly as Ackermann’s original sequence grows quickly; for all practical purposes, the inverse is at most 4. ¨ Ackermann numbers are pretty big, but they’re not yet big enough. The quest for still bigger numbers takes us back to the formalists. After Ackermann demonstrated that ‘primitive recursive’ isn’t what we mean by ‘computable,’ the question still stood: what do we mean by ‘computable’? In 1936, Alonzo Church and Alan Turing independently answered this question. While Church answered using a logical formalism called the lambda calculus, Turing answered using an idealized computing machine—the Turing machine—that, in essence, is equivalent to every Compaq, Dell, Macintosh, and Cray in the modern world. Turing’s paper describing his machine, "On Computable Numbers," is rightly celebrated as the founding document of computer science. "Computing," said Turing, is normally done by writing certain symbols on paper. We may suppose this paper to be divided into squares like a child’s arithmetic book. In elementary arithmetic the 2-dimensional character of the paper is sometimes used. But such use is always avoidable, and I think it will be agreed that the two-dimensional character of paper is no essential of computation. I assume then that the computation is carried out on one-dimensional paper, on a tape divided into squares. Turing continued to explicate his machine using ingenious reasoning from first principles. The tape, said Turing, extends infinitely in both directions, since a theoretical machine ought not be constrained by physical limits on resources. Furthermore, there’s a symbol written on each square of the tape, like the ‘1’s and ‘0’s in a modern computer’s memory. But how are the symbols manipulated? Well, there’s a ‘tape head’ moving back and forth along the tape, examining one square at a time, writing and erasing symbols according to definite rules. The rules are the tape head’s program: change them, and you change what the tape head does. Turing’s august insight was that we can program the tape head to carry out any computation. Turing machines can add, multiply, extract cube roots, sort, search, spell-check, parse, play Tic-Tac-Toe, list the Ackermann sequence. If we represented keyboard input, monitor output, and so forth as symbols on the tape, we could even run Windows on a Turing machine. But there’s a problem. Set a tape head loose on a sequence of symbols, and it might stop eventually, or it might run forever—like the fabled programmer who gets stuck in the shower because the instructions on the shampoo bottle read "lather, rinse, repeat." If the machine’s going to run forever, it’d be nice to know this in advance, so that we don’t spend an eternity waiting for it to finish. But how can we determine, in a finite amount of time, whether something will go on endlessly? If you bet a friend that your watch will never stop ticking, when could you declare victory? But maybe there’s some ingenious program that can examine other programs and tell us, infallibly, whether they’ll ever stop running. We just haven’t thought of it yet. Nope. Turing proved that this problem, called the Halting Problem, is unsolvable by Turing machines. The proof is a beautiful example of self-reference. It formalizes an old argument about why you can never have perfect introspection: because if you could, then you could determine what you were going to do ten seconds from now, and then do something else. Turing imagined that there was a special machine that could solve the Halting Problem. Then he showed how we could have this machine analyze itself, in such a way that it has to halt if it runs forever, and run forever if it halts. Like a hound that finally catches its tail and devours itself, the mythical machine vanishes in a fury of contradiction. (That’s the sort of thing you don’t say in a research paper.) ¨ "Very nice," you say (or perhaps you say, "not nice at all"). "But what does all this have to do with big numbers?" Aha! The connection wasn’t published until May of 1962. Then, in the Bell System Technical Journal, nestled between pragmatically-minded papers on "Multiport Structures" and "Waveguide Pressure Seals," appeared the modestly titled "On Non-Computable Functions" by Tibor Rado. In this paper, Rado introduced the biggest numbers anyone had ever imagined. His idea was simple. Just as we can classify words by how many letters they contain, we can classify Turing machines by how many rules they have in the tape head. Some machines have only one rule, others have two rules, still others have three rules, and so on. But for each fixed whole number N, just as there are only finitely many distinct words with N letters, so too are there only finitely many distinct machines with N rules. Among these machines, some halt and others run forever when started on a blank tape. Of the ones that halt, asked Rado, what’s the maximum number of steps that any machine takes before it halts? (Actually, Rado asked mainly about the maximum number of symbols any machine can write on the tape before halting. But the maximum number of steps, which Rado called S(n), has the same basic properties and is easier to reason about.) Rado called this maximum the Nth "Busy Beaver" number. (Ah yes, the early 1960’s were a more innocent age.) He visualized each Turing machine as a beaver bustling busily along the tape, writing and erasing symbols. The challenge, then, is to find the busiest beaver with exactly N rules, albeit not an infinitely busy one. We can interpret this challenge as one of finding the "most complicated" computer program N bits long: the one that does the most amount of stuff, but not an infinite amount. Now, suppose we knew the Nth Busy Beaver number, which we’ll call BB(N). Then we could decide whether any Turing machine with N rules halts on a blank tape. We’d just have to run the machine: if it halts, fine; but if it doesn’t halt within BB(N) steps, then we know it never will halt, since BB(N) is the maximum number of steps it could make before halting. Similarly, if you knew that all mortals died before age 200, then if Sally lived to be 200, you could conclude that Sally was immortal. So no Turing machine can list the Busy Beaver numbers—for if it could, it could solve the Halting Problem, which we already know is impossible. But here’s a curious fact. Suppose we could name a number greater than the Nth Busy Beaver number BB(N). Call this number D for dam, since like a beaver dam, it’s a roof for the Busy Beaver below. With D in hand, computing BB(N) itself becomes easy: we just need to simulate all the Turing machines with N rules. The ones that haven’t halted within D steps—the ones that bash through the dam’s roof—never will halt. So we can list exactly which machines halt, and among these, the maximum number of steps that any machine takes before it halts is BB(N). Conclusion? The sequence of Busy Beaver numbers, BB(1), BB(2), and so on, grows faster than any computable sequence. Faster than exponentials, stacked exponentials, the Ackermann sequence, you name it. Because if a Turing machine could compute a sequence that grows faster than Busy Beaver, then it could use that sequence to obtain the D‘s—the beaver dams. And with those D’s, it could list the Busy Beaver numbers, which (sound familiar?) we already know is impossible. The Busy Beaver sequence is non-computable, solely because it grows stupendously fast—too fast for any computer to keep up with it, even in principle. This means that no computer program could list all the Busy Beavers one by one. It doesn’t mean that specific Busy Beavers need remain eternally unknowable. And in fact, pinning them down has been a computer science pastime ever since Rado published his article. It’s easy to verify that BB(1), the first Busy Beaver number, is 1. That’s because if a one-rule Turing machine doesn’t halt after the very first step, it’ll just keep moving along the tape endlessly. There’s no room for any more complex behavior. With two rules we can do more, and a little grunt work will ascertain that BB(2) is 6. Six steps. What about the third Busy Beaver? In 1965 Rado, together with Shen Lin, proved that BB(3) is 21. The task was an arduous one, requiring human analysis of many machines to prove that they don’t halt—since, remember, there’s no algorithm for listing the Busy Beaver numbers. Next, in 1983, Allan Brady proved that BB(4) is 107. Unimpressed so far? Well, as with the Ackermann sequence, don’t be fooled by the first few numbers. In 1984, A.K. Dewdney devoted a Scientific American column to Busy Beavers, which inspired amateur mathematician George Uhing to build a special-purpose device for simulating Turing machines. The device, which cost Uhing less than $100, found a five-rule machine that runs for 2,133,492 steps before halting—establishing that BB(5) must be at least as high. Then, in 1989, Heiner Marxen and Jürgen Buntrock discovered that BB(5) is at least 47,176,870. To this day, BB(5) hasn’t been pinned down precisely, and it could turn out to be much higher still. As for BB(6), Marxen and Buntrock set another record in 1997 by proving that it’s at least 8,690,333,381,690,951. A formidable accomplishment, yet Marxen, Buntrock, and the other Busy Beaver hunters are merely wading along the shores of the unknowable. Humanity may never know the value of BB(6) for certain, let alone that of BB(7) or any higher number in the sequence. Indeed, already the top five and six-rule contenders elude us: we can’t explain how they ‘work’ in human terms. If creativity imbues their design, it’s not because humans put it there. One way to understand this is that even small Turing machines can encode profound mathematical problems. Take Goldbach’s conjecture, that every even number 4 or higher is a sum of two prime numbers: 10=7+3, 18=13+5. The conjecture has resisted proof since 1742. Yet we could design a Turing machine with, oh, let’s say 100 rules, that tests each even number to see whether it’s a sum of two primes, and halts when and if it finds a counterexample to the conjecture. Then knowing BB(100), we could in principle run this machine for BB(100) steps, decide whether it halts, and thereby resolve Goldbach’s conjecture. We need not venture far in the sequence to enter the lair of basilisks. But as Rado stressed, even if we can’t list the Busy Beaver numbers, they’re perfectly well-defined mathematically. If you ever challenge a friend to the biggest number contest, I suggest you write something like this: BB(11111)—Busy Beaver shift #—1, 6, 21, etc If your friend doesn’t know about Turing machines or anything similar, but only about, say, Ackermann numbers, then you’ll win the contest. You’ll still win even if you grant your friend a handicap, and allow him the entire lifetime of the universe to write his number. The key to the biggest number contest is a potent paradigm, and Turing’s theory of computation is potent indeed. ¨ But what if your friend knows about Turing machines as well? Is there a notational system for big numbers more powerful than even Busy Beavers? Suppose we could endow a Turing machine with a magical ability to solve the Halting Problem. What would we get? We’d get a ‘super Turing machine’: one with abilities beyond those of any ordinary machine. But now, how hard is it to decide whether a super machine halts? Hmm. It turns out that not even super machines can solve this ‘super Halting Problem’, for the same reason that ordinary machines can’t solve the ordinary Halting Problem. To solve the Halting Problem for super machines, we’d need an even more powerful machine: a ‘super duper machine.’ And to solve the Halting Problem for super duper machines, we’d need a ‘super duper pooper machine.’ And so on endlessly. This infinite hierarchy of ever more powerful machines was formalized by the logician Stephen Kleene in 1943 (although he didn’t use the term ‘super duper pooper’). Imagine a novel, which is imbedded in a longer novel, which itself is imbedded in an even longer novel, and so on ad infinitum. Within each novel, the characters can debate the literary merits of any of the sub-novels. But, by analogy with classes of machines that can’t analyze themselves, the characters can never critique the novel that they themselves are in. (This, I think, jibes with our ordinary experience of novels.) To fully understand some reality, we need to go outside of that reality. This is the essence of Kleene’s hierarchy: that to solve the Halting Problem for some class of machines, we need a yet more powerful class of machines. And there’s no escape. Suppose a Turing machine had a magical ability to solve the Halting Problem, and the super Halting Problem, and the super duper Halting Problem, and the super duper pooper Halting Problem, and so on endlessly. Surely this would be the Queen of Turing machines? Not quite. As soon as we want to decide whether a ‘Queen of Turing machines’ halts, we need a still more powerful machine: an ‘Empress of Turing machines.’ And Kleene’s hierarchy continues. But how’s this relevant to big numbers? Well, each level of Kleene’s hierarchy generates a faster-growing Busy Beaver sequence than do all the previous levels. Indeed, each level’s sequence grows so rapidly that it can only be computed by a higher level. For example, define BB2(N) to be the maximum number of steps a super machine with N rules can make before halting. If this super Busy Beaver sequence were computable by super machines, then those machines could solve the super Halting Problem, which we know is impossible. So the super Busy Beaver numbers grow too rapidly to be computed, even if we could compute the ordinary Busy Beaver numbers. You might think that now, in the biggest-number contest, you could obliterate even an opponent who uses the Busy Beaver sequence by writing something like this: BB2(11111). But not quite. The problem is that I’ve never seen these "higher-level Busy Beavers" defined anywhere, probably because, to people who know computability theory, they’re a fairly obvious extension of the ordinary Busy Beaver numbers. So our reasonable modern mathematician wouldn’t know what number you were naming. If you want to use higher-level Busy Beavers in the biggest number contest, here’s what I suggest. First, publish a paper formalizing the concept in some obscure, low-prestige journal. Then, during the contest, cite the paper on your index card. To exceed higher-level Busy Beavers, we’d presumably need some new computational model surpassing even Turing machines. I can’t imagine what such a model would look like. Yet somehow I doubt that the story of notational systems for big numbers is over. Perhaps someday humans will be able concisely to name numbers that make Busy Beaver 100 seem as puerile and amusingly small as our nobleman’s eighty-three. Or if we’ll never name such numbers, perhaps other civilizations will. Is a biggest number contest afoot throughout the galaxy? ¨ You might wonder why we can’t transcend the whole parade of paradigms, and name numbers by a system that encompasses and surpasses them all. Suppose you wrote the following in the biggest number contest: The biggest whole number nameable with 1,000 characters of English text Surely this number exists. Using 1,000 characters, we can name only finitely many numbers, and among these numbers there has to be a biggest. And yet we’ve made no reference to how the number’s named. The English text could invoke Ackermann numbers, or Busy Beavers, or higher-level Busy Beavers, or even some yet more sweeping concept that nobody’s thought of yet. So unless our opponent uses the same ploy, we’ve got him licked. What a brilliant idea! Why didn’t we think of this earlier? Unfortunately it doesn’t work. We might as well have written One plus the biggest whole number nameable with 1,000 characters of English text This number takes at least 1,001 characters to name. Yet we’ve just named it with only 80 characters! Like a snake that swallows itself whole, our colossal number dissolves in a tumult of contradiction. What gives? The paradox I’ve just described was first published by Bertrand Russell, who attributed it to a librarian named G. G. Berry. The Berry Paradox arises not from mathematics, but from the ambiguity inherent in the English language. There’s no surefire way to convert an English phrase into the number it names (or to decide whether it names a number at all), which is why I invoked a "reasonable modern mathematician" in the rules for the biggest number contest. To circumvent the Berry Paradox, we need to name numbers using a precise, mathematical notational system, such as Turing machines—which is exactly the idea behind the Busy Beaver sequence. So in short, there’s no wily language trick by which to surpass Archimedes, Ackermann, Turing, and Rado, no royal road to big numbers. You might also wonder why we can’t use infinity in the contest. The answer is, for the same reason why we can’t use a rocket car in a bike race. Infinity is fascinating and elegant, but it’s not a whole number. Nor can we ‘subtract from infinity’ to yield a whole number. Infinity minus 17 is still infinity, whereas infinity minus infinity is undefined: it could be 0, 38, or even infinity again. Actually I should speak of infinities, plural. For in the late nineteenth century, Georg Cantor proved that there are different levels of infinity: for example, the infinity of points on a line is greater than the infinity of whole numbers. What’s more, just as there’s no biggest number, so too is there no biggest infinity. But the quest for big infinities is more abstruse than the quest for big numbers. And it involves, not a succession of paradigms, but essentially one: Cantor’s. ¨ So here we are, at the frontier of big number knowledge. As Euclid’s disciple supposedly asked, "what is the use of all this?" We’ve seen that progress in notational systems for big numbers mirrors progress in broader realms: mathematics, logic, computer science. And yet, though a mirror reflects reality, it doesn’t necessarily influence it. Even within mathematics, big numbers are often considered trivialities, their study an idle amusement with no broader implications. I want to argue a contrary view: that understanding big numbers is a key to understanding the world. Imagine trying to explain the Turing machine to Archimedes. The genius of Syracuse listens patiently as you discuss the papyrus tape extending infinitely in both directions, the time steps, states, input and output sequences. At last he explodes. "Foolishness!" he declares (or the ancient Greek equivalent). "All you’ve given me is an elaborate definition, with no value outside of itself." How do you respond? Archimedes has never heard of computers, those cantankerous devices that, twenty-three centuries from his time, will transact the world’s affairs. So you can’t claim practical application. Nor can you appeal to Hilbert and the formalist program, since Archimedes hasn’t heard of those either. But then it hits you: the Busy Beaver sequence. You define the sequence for Archimedes, convince him that BB(1000) is more than his 1063 grains of sand filling the universe, more even than 1063 raised to its own power 1063 times. You defy him to name a bigger number without invoking Turing machines or some equivalent. And as he ponders this challenge, the power of the Turing machine concept dawns on him. Though his intuition may never apprehend the Busy Beaver numbers, his reason compels him to acknowledge their immensity. Big numbers have a way of imbuing abstract notions with reality. Indeed, one could define science as reason’s attempt to compensate for our inability to perceive big numbers. If we could run at 280,000,000 meters per second, there’d be no need for a special theory of relativity: it’d be obvious to everyone that the faster we go, the heavier and squatter we get, and the faster time elapses in the rest of the world. If we could live for 70,000,000 years, there’d be no theory of evolution, and certainly no creationism: we could watch speciation and adaptation with our eyes, instead of painstakingly reconstructing events from fossils and DNA. If we could bake bread at 20,000,000 degrees Kelvin, nuclear fusion would be not the esoteric domain of physicists but ordinary household knowledge. But we can’t do any of these things, and so we have science, to deduce about the gargantuan what we, with our infinitesimal faculties, will never sense. If people fear big numbers, is it any wonder that they fear science as well and turn for solace to the comforting smallness of mysticism? But do people fear big numbers? Certainly they do. I’ve met people who don’t know the difference between a million and a billion, and don’t care. We play a lottery with ‘six ways to win!,’ overlooking the twenty million ways to lose. We yawn at six billion tons of carbon dioxide released into the atmosphere each year, and speak of ‘sustainable development’ in the jaws of exponential growth. Such cases, it seems to me, transcend arithmetical ignorance and represent a basic unwillingness to grapple with the immense. Whence the cowering before big numbers, then? Does it have a biological origin? In 1999, a group led by neuropsychologist Stanislas Dehaene reported evidence in Science that two separate brain systems contribute to mathematical thinking. The group trained Russian-English bilinguals to solve a set of problems, including two-digit addition, base-eight addition, cube roots, and logarithms. Some subjects were trained in Russian, others in English. When the subjects were then asked to solve problems approximately—to choose the closer of two estimates—they performed equally well in both languages. But when asked to solve problems exactly, they performed better in the language of their training. What’s more, brain-imaging evidence showed that the subjects’ parietal lobes, involved in spatial reasoning, were more active during approximation problems; while the left inferior frontal lobes, involved in verbal reasoning, were more active during exact calculation problems. Studies of patients with brain lesions paint the same picture: those with parietal lesions sometimes can’t decide whether 9 is closer to 10 or to 5, but remember the multiplication table; whereas those with left-hemispheric lesions sometimes can’t decide whether 2+2 is 3 or 4, but know that the answer is closer to 3 than to 9. Dehaene et al. conjecture that humans represent numbers in two ways. For approximate reckoning we use a ‘mental number line,’ which evolved long ago and which we likely share with other animals. But for exact computation we use numerical symbols, which evolved recently and which, being language-dependent, are unique to humans. This hypothesis neatly explains the experiment’s findings: the reason subjects performed better in the language of their training for exact computation but not for approximation problems is that the former call upon the verbally-oriented left inferior frontal lobes, and the latter upon the spatially-oriented parietal lobes. If Dehaene et al.’s hypothesis is correct, then which representation do we use for big numbers? Surely the symbolic one—for nobody’s mental number line could be long enough to contain , 5 pentated to the 5, or BB(1000). And here, I suspect, is the problem. When thinking about 3, 4, or 7, we’re guided by our spatial intuition, honed over millions of years of perceiving 3 gazelles, 4 mates, 7 members of a hostile clan. But when thinking about BB(1000), we have only language, that evolutionary neophyte, to rely upon. The usual neural pathways for representing numbers lead to dead ends. And this, perhaps, is why people are afraid of big numbers. Could early intervention mitigate our big number phobia? What if second-grade math teachers took an hour-long hiatus from stultifying busywork to ask their students, "How do you name really, really big numbers?" And then told them about exponentials and stacked exponentials, tetration and the Ackermann sequence, maybe even Busy Beavers: a cornucopia of numbers vaster than any they’d ever conceived, and ideas stretching the bounds of their imaginations. Who can name the bigger number? Whoever has the deeper paradigm. Are you ready? Get set. Go. References Petr Beckmann, A History of Pi, Golem Press, 1971. Allan H. Brady, "The Determination of the Value of Rado’s Noncomputable Function Sigma(k) for Four-State Turing Machines," Mathematics of Computation, vol. 40, no. 162, April 1983, pp 647- 665. Gregory J. Chaitin, "The Berry Paradox," Complexity, vol. 1, no. 1, 1995, pp. 26- 30. At http://www.umcs.maine.edu/~chaitin/unm2.html. A.K. Dewdney, The New Turing Omnibus: 66 Excursions in Computer Science, W.H. Freeman, 1993. S. Dehaene and E. Spelke and P. Pinel and R. Stanescu and S. Tsivkin, "Sources of Mathematical Thinking: Behavioral and Brain-Imaging Evidence," Science, vol. 284, no. 5416, May 7, 1999, pp. 970- 974. Douglas Hofstadter, Metamagical Themas: Questing for the Essence of Mind and Pattern, Basic Books, 1985. Chapter 6, "On Number Numbness," pp. 115- 135. Robert Kanigel, The Man Who Knew Infinity: A Life of the Genius Ramanujan, Washington Square Press, 1991. Stephen C. Kleene, "Recursive predicates and quantifiers," Transactions of the American Mathematical Society, vol. 53, 1943, pp. 41- 74. Donald E. Knuth, Selected Papers on Computer Science, CSLI Publications, 1996. Chapter 2, "Mathematics and Computer Science: Coping with Finiteness," pp. 31- 57. Dexter C. Kozen, Automata and Computability, Springer-Verlag, 1997. ———, The Design and Analysis of Algorithms, Springer-Verlag, 1991. Shen Lin and Tibor Rado, "Computer studies of Turing machine problems," Journal of the Association for Computing Machinery, vol. 12, no. 2, April 1965, pp. 196- 212. Heiner Marxen, Busy Beaver, at http://www.drb.insel.de/~heiner/BB/. ——— and Jürgen Buntrock, "Attacking the Busy Beaver 5," Bulletin of the European Association for Theoretical Computer Science, no. 40, February 1990, pp. 247- 251. Tibor Rado, "On Non-Computable Functions," Bell System Technical Journal, vol. XLI, no. 2, May 1962, pp. 877- 884. Rudy Rucker, Infinity and the Mind, Princeton University Press, 1995. Carl Sagan, Billions & Billions, Random House, 1997. Michael Somos, "Busy Beaver Turing Machine." At http://grail.cba.csuohio.edu/~somos/bb.html. Alan Turing, "On computable numbers, with an application to the Entscheidungsproblem," Proceedings of the London Mathematical Society, Series 2, vol. 42, pp. 230- 265, 1936. Reprinted in Martin Davis (ed.), The Undecidable, Raven, 1965. Ilan Vardi, "Archimedes, the Sand Reckoner," at http://www.ihes.fr/~ilan/sand_reckoner.ps. Eric W. Weisstein, CRC Concise Encyclopedia of Mathematics, CRC Press, 1999. Entry on "Large Number" at http://www.treasure-troves.com/math/LargeNumber.html. Back to Writings page Back to Scott's homepage Back to Scott's blog

      Why do we even care about big numbers is there any use?

    1. Applied Ecology textbook.

      I really appreciate this project overall as this will really mean a lot to some scientists that may not ever be in a textbook when they should be. Even if they don't see it I think it's awesome we can do something about giving more people appreciation for their work they may not get.

    1. Author Response

      Reviewer #1 (Public Review):

      “The synthesis and metabolism of sphingolipid (SL) are involved in wide range of biological processes. In the present study, the authors investigate the role of SPTLC1, one of the essential subunits of serine palmitoyl transferase complex, in both physiological and pathophysiological angiogenesis, via using inducible endothelial-specific SPTLC1 knockout mice. They found SPTLC1 deficiency in ECs inhibited retinal angiogenesis along with reducing several SL metabolites in plasma, red blood cells, and peripheral organs. In addition, the authors found SPTLC1 EC-KO mice are resistant to APAP-induced liver injury. Overall, the in vivo findings in the present study are of potential interest and the authors have given clear evidence that endothelial SPTLC1 is critical to retinal angiogenesis. However, the underlying mechanisms are completely lacking in the present study. Most of the evidence provided is circumstantial, associative, and indirect.”

      We appreciate the positive comments of the reviewer. We have addressed the reviewer’s concern regarding underlying mechanisms as detailed below.

      “To be specific,

      1. The authors found endothelial SPTLC1 is important to both angiogenesis and the plasma lipid profile. However, the authors did not present the data to demonstrate the relationship between them. The in vivo findings about the phenotype and the plasma lipid profile might be true and unrelated. It would be important to know whether supplementing the reduced lipid induced by SPTLC1 KO could rescue the angiogenesis related phenotype in mice, or, whether the alternative way to inhibit the SL synthesis could mimic the phenotype of KO mice.”

      In the manuscript, we discussed the possibility whether S1P is involved, since it is one of the most down-regulated SL in the plasma and a major regulator of angiogenesis. We think it is unlikely that reduced plasma S1P is responsible for the phenotype. First, the retinal angiogenesis defect in Sptlc1 ECKO mice is the opposite of S1pr1 ECKO as we have published previously (PMID: 22975328, PMID: 32059774). Moreover, deletion of sphingosine kinase, the enzyme produces S1P, in the endothelium does not influence retinal angiogenesis at P6 (Figure 3 Supplement 2 A and B). Loss of S1P chaperone ApoM- i.e., Apom KO, which exhibits 50% reduction of plasma S1P, does not show change in retinal vascular development (Figure 3 Supplement 2 C and D). Taken together, our results strongly suggest that reduction in plasma S1P is not the cause of vascular defect in Sptlc1 ECKO retinas.

      Based on our results in the manuscript, loss of SPT enzyme activity in endothelial cells reduced SL species in the endothelial cells and the plasma. Our in vitro and VEGF intraocular injection experiments (new data) suggests that the angiogenic defects seen in Sptlc1 ECKO mice is due to cell intrinsic defects in VEGF signaling and not due to changes in plasma SL levels. We have edited the discussion section to address this issue.

      “2. A major issue is that the present study did not reveal is a real downstream target. It is possible that VEGF signaling might be impaired by SPTLC1 knockout as discussed by the authors. However, the authors did not demonstrate this point with data. Including both in vivo and in vitro data to evaluate the effects of SPTLC1 deficiency on VEGF signaling might further strengthen the hypothesis. Besides, with in vitro experiments, the authors might further find the critical metabolite(s) involved in VEGF signaling and angiogenesis.”

      As discussed above, we agree with the review’s critique and have addressed this essential point with new experiments (both in vitro and in vivo) in Figure 5. Our new data shows that SPT pathway supplies the glycosphingolipid GM1, which is needed for efficient VEGF-induced ERK phosphorylation and tip cell formation.

      Reviewer #2 (Public Review):

      “Andrew Kuo et al. investigated the role of endothelial de novo sphingolipids (SL) synthesis using endothelial cell specific SPTLC1 knockout (ECKO) mice. They showed that these mice exhibited low concentration of various SL species in not only ECs but also RBC, circulation, and other non-EC tissues. They also showed that ECKO mice exhibited impaired angiogenesis in normal and oxygen-induced retinopathy models, consistent with the decrease of endothelial proliferation and tip cell formation. They finally revealed that these mice were resistant to acetaminophen-induced acute liver injury in early phase. The experiments were well-designed, and the results were clear and convincing. The authors concluded that endothelial cells were the major source of SL in circulation and various organs (liver and lung) other than retina (and probably brain). The weakness of the current version of the manuscript is that the authors did not elucidate the mechanisms underlying the observed phenomena.

      1) The authors showed impaired angiogenesis in ECKO mice using neonatal retina model. Based on the fact that this phenotype was similar to that in endothelial VEGFR2 deficient mice, they suggested that VEGF responsiveness is altered in ECKO mice. Although this hypothesis is plausible, the authors would need to prove it by evaluating VEGFR signaling (VEGFR phosphorylation, Akt activation etc.) in ECKO mice.”

      We thank the reviewer for positive comments. As for the weakness identified, we have addressed this point by conducting new in vitro and in vivo experiments (detailed above). The new Figure 5 addresses this issue directly.

      “2) The acetaminophen-induced liver injury was reduced in ECKO mice in early phase. However, it is still unclear whether SL production itself affects liver injury. The authors discussed the possibility that gene deficiency increases unconsumed serine resulting in GSH increase, but it is essentially independent to SL. If possible, it would be good if the authors could investigate the effect of SL administration on the liver injury progression.”

      We appreciate the reviewer’s concern about liver injury model in the Sptlc1 ECKO mice. Our data suggests that SL species supplied from EC impacts hepatocyte response to stress. Since the acetaminophen induced liver injury is highly dependent on reactive oxygen species, our finding that increased glutathione levels in the Sptlc1 ECKO mice may be involved in the phenotype. However, we are simply considering them as biochemical markers of liver injury. This has been addressed in the discussion.

      “3) This paper showed the impaired cell proliferation in Sptlc1 KO EC mice, and discussed it. Authors described that this phenotype was similar to that of Nos3 KO mice, but its inconsistency with Sptlc2 ECKO adult mice was only justified by a word "isoform-selective function". Authors could quantify eNOS expressions in Sptlc1 KO mice, compared results and then discuss this matter. “

      In figure 1C, we used eNOS as an EC marker to show purity during our EC isolation process. In fact, we did not observe change of eNOS expression in Sptlc1 ECKO. We also did not detect elevated phospho-eNOS in Sptl1c ECKO in contrast to Sptlc2 ECKO adult mice (Figure1 supplement 4). Additionally, our work in the retina was performed in postnatal-genedeletion pups from P6-P17 which is different from the published Sptlc2 ECKO study. The differences in gene deletion strategy (early postnatal vs. adult) could result in differences in eNOS expression . We have added discussion about this issue.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary:

      Ciliates extensively rearrange their somatic genome every time a new somatic nucleus develops from the zygotic germline nucleus. In this manuscript, Feng et al report the sequencing, assembly and annotation of the germline and somatic genomes of Euplotes woodruffi and the germline genome of Tetmemena sp. (whose somatic genome was sequenced and assembled by the same lab in 2015). They present a comparative analysis of developmentally programmed genome rearrangements in these two species and in the model ciliate Oxytricha trifallax. Their major findings are that:

      (i) E. woodruffi and Tetmemena sp. eliminate a smaller fraction of their germline genome (~54%) from their somatic macronucleus (MAC) than O. trifallax (>80%)

      (ii) Transposable elements (TE) represent a smaller fraction of the germline genome (~2%) in the first two ciliates than in O. trifallax (~15%). TEs are mainly located at the boundaries of germline chromosomes and in intergenic regions, but can also be found inside IESs

      (iii) Several thousands of genes are scrambled in the germline genome of all three species

      The authors have also addressed the possible origin of gene scrambling. They report an interesting association with local paralogy and propose a model for the emergence of the odd-even pattern of gene unscrambling between two paralogous copies.

      Major comments:

      1. Based on the statistics presented in Table 1, genome assemblies are of good quality, with a reasonable N50 size of germline (MIC) contigs. It seems, however, that no entire MIC chromosome could be assembled, since no two-telomere contig is mentioned in the list. As proposed by the authors (p.7) the presence of numerous TEs at the boundaries of MIC contigs (Fig S1) may have hindered the assembly of MIC chromosome ends. I would have appreciated to have more information on the "other repeats" (which seem to differ from tandem repeats according to Fig 2) and their location along MIC contigs.

        Subcategories of “other repeats” were included in Table S2 based on Repeatmasker annotations. We now analyzed the locations of other repeats in MIC contigs and include those as well in new Figure S1B. About 30% of “other” transposable elements are present at the boundaries of MIC contigs, which may also hinder the assembly. Notably, 35-45% of “other TEs” are in assembled, intergenic regions.

      The definition of "Internal Eliminated Sequences" (IES) is not clear. The authors make a distinction between IESs and TEs. I understand that IESs are DNA segments that separate two macronuclear-destined sequences (MDS) in the germline genome. Thus they appear to be restricted to those regions that eventually yield gene-sized MAC chromosomes. IESs are eliminated between two pointers that may not be identical on both sides in case of scrambled genes. Some clarification is needed here.

      To illustrate my point: I found the statement "with many TE insertions within IESs, suggesting that TE insertions may have generated IESs" particularly confusing (p. 9 lines 5-6). Does this mean that IESs extend beyond the ends of inserted TEs? The legend of Fig S1 should also be clarified.

      We clarified the text and legend. IESs can extend beyond the ends of inserted TEs, even if the original IES is a decayed TE, due to subsequent sequence evolution at the boundaries or if the original insertion was into an existing IES. David Prescott referred to sequence evolution at the edges of IESs as “pointer sliding” (ref.36).

      p. 10 lines 2-4 and Fig S2: Could the authors explain the difference they make between MDS (in the text) and CDS (in Fig S2)? My understanding is that a CDS is the entire gene coding sequence and may be made of multiple MDSs. If this is correct, the sentence should read "We compared the number of MDSs between single-copy orthologs for single-gene MAC chromosomes across the three species and found that the orthologs have similar CDS lengths".

      Yes, we made the correction.

      p. 12 lines 10-15: the discovery that paralogous MDSs can be found in scrambled genomic loci is interesting. If the two paralogs can be distinguished based on the number of substitutions, it would be informative to go back to individual reads and check whether each of the two copies can be incorporated in the unscrambled CDS (and at which frequency). Would the pointers be compatible with this?

      The paralogous MDSs in the MIC are often not identical. The copy with the highest similarity is assigned as “preliminary match” by SDRAP (ref. 52), and others are assigned as “additional matches”. To validate SDRAP assignments, we did pairwise BLASTN alignments (“-task megablast”) of paralogous MIC MDSs and their corresponding MAC MDSs. We confirmed that in the three species, the preliminary match has the best or equally best pid (percentage of identity) in most cases. Therefore, the MDS assigned as preliminary match is more likely the paralog incorporated into the MAC chromosome.

      We used genome assemblies of Euplotes woodruffi, which had the highest Nanopore coverage, to further investigate the frequency of MDS incorporation. We followed the reviewer’s suggestion and called SNP variants on both MAC and MIC genomes. For MAC SNP calling, we used Illumina reads as input for freebayes (ref a). For MIC SNP calling, we used Nanopore reads, instead of Illumina reads, to avoid non-specific short-read mapping on paralogous MDSs and to avoid the presence of any contaminating MAC reads. Variants were called and phased by PEPPER-Margin-DeepVariant (ref b), a new tool published in 2021 in Nature Methods, which has been reported to have similar accuracy to Illumina read variant calling, especially at high read coverage. We used the parameter “--pepper_min_coverage_threshold 20” to call confident variants when at least 20 reads cover the position. Only 92 MIC SNPs in the paralogous MDSs passed all filters of the program. Using this small set of MIC SNPs, we were unfortunately unable to distinguish which paralogous MIC MDS was incorporated into the MAC. Therefore, we cannot infer with what frequency one paralogous MDS is incorporated over another, until they become sufficiently diverged, which is compatible with the model.

      a. Garrison E, Marth G. Haplotype-based variant detection from short-read sequencing. arXiv preprint arXiv:1207.3907. 2012 Jul 17.

      b. Shafin K, Pesout T, Chang PC, Nattestad M, Kolesnikov A, Goel S, Baid G, Kolmogorov M, Eizenga JM, Miga KH, Carnevali P. Haplotype-aware variant calling with PEPPER-Margin-DeepVariant enables high accuracy in nanopore long-reads. Nature methods. 2021 Nov;18(11):1322-32.

      The hypothesis that odd-even scrambled loci have evolved from paralogous genes in E. woodruffi is supported by the existence of paralogous MDSs, length conservation of MDS/IES pairs and sequence similarity between corresponding MDS and IES in a pair. The correlations presented for Oxytricha and Tetmemena are much less convincing (Fig S5D and E). I recommend that the authors are even more cautious in their statement on p.13 ("For Oxytricha and Tememena, the MDS and IES lengths for such MDS/IES pairs also correlate positively, but more moderately").

      Thank you, we rephrased the text.

      p. 15 last paragraph: Why did the authors focus only on TBEs inserted in non-scrambled IESs to look for orthologous TBE insertions? Is there a reason to believe that no recent TBE insertion occurred at other genomic loci? Or was it only for practical reasons? It is also not clear to me whether the authors have considered full-length TBEs or the presence of at least one TBE ORF.

      This analysis was limited for practical reasons, because we identify position conservation of TBEs by aligning protein sequences of MAC genes. We only consider TBEs inserted in non-scrambled IESs in exons. It would be difficult and less meaningful to align completely non-coding MIC-limited regions.

      Partial TBEs are also included if they contain at least one TBE ORF (detected by BLAST).

      Furthermore, TE insertion cannot explain the origin of scrambled IESs, and TEs rarely map to scrambled IESs (Figure S1A), but there is a clear evolutionary model for the origin of nonscrambled IESs from decay of TBEs (ref. 49). Initial purifying selection would act on the TE to maintain its ability to self-excise, whereas we advocate for a different model for the origin of scrambled IESs by decay of paralogous MDSs.

      p. 16: the authors report that some introns of E. woodruffi map "near" Oxytricha/Tetmemena pointers. How near? Based on the information provided by the authors, I don't think this observation necessarily implies that IESs were converted to introns (or reciprocally) during evolution. If this were true, shouldn't at least one intron boundary coincide exactly with a pointer? The authors should clarify this (also in the discussion, on p. 20, top paragraph).

      We used a 20bp window (~7 amino acids), as described in the Methods, and added that to the Results. Full detail is provided in the Methods section, “Ortholog comparison pipeline and Monte Carlo simulations”. 103 E. woodruffi introns are within 20bp from the midpoint of Oxytricha/Tetmemena pointers. Among these, 43 intron boundaries overlap an Oxytricha or Tetmemena pointer. We observed 306 cases of precisely matching boundaries between any two species, where the exon junction of one species maps inside the MDS/IES pointer of another species, although we would only expect the boundaries of introns and IESs to coincide so precisely if they were recent conversions. Hence we feel that a window analysis is informative.

      p. 19 2nd paragraph: the suggested mechanism explaining the 5' bias of IESs in E. woodruffi genes is unclear. How could germline recombination take place between a MIC chromosome and a MAC reverse transcript or nanochromosome? This would imply that DNA could be imported in the MIC. Is there evidence that this might occur?

      The ability of TEs to invade the MIC demonstrates that even foreign DNA can be incorporated into the MIC. Since MAC DNA is present at high copy number, it offers a potential source for a recombination template that could erase IESs, as could an errant reverse transcript of one of the long noncoding template RNAs. Any of these would be infrequent events that would matter on an evolutionary time scale even if developmentally rare.

      According to Figure 1, no scrambled genes have been reported in Paramecium tetraurelia. Within the frame of the proposed model, this is somewhat unexpected because this ciliate went through several whole genome duplications during evolution and harbors many paralogous gene pairs. Is there a reason why no gene scrambling took place in Paramecium?

      Paramecium uses only TA dinucleotide pointers for IES elimination, unlike the rich diversity of pointers in spirotrichous ciliates. This limitation in its machinery may explain why no scrambled loci have been observed in Paramecium, despite the abundance of paralogs. Our model suggests that local MIC paralogy is associated with the origin of scrambling. But most of the paralogy reported in Paramecium is at the level of whole chromosomes in the MAC (ref. 104) rather than local MIC paralogy.

      Minor comments:

      p. 4 (4th bottom line): To my knowledge, ref #28 presents a draft (incomplete) MIC assembly of the Paramecium genome.

      Thank you, we added reference 29 and adjusted the wording describing the quality of MIC genome draft assemblies.

      p. 7 (last paragraph): "encoding" should be replaced by "carrying"

      Thank you, we made the change.

      p. 10 (2nd paragraph): insert a missing "o" into "nanochromosomes"

      Thank you, corrected.

      p. 10 (same paragraph): the weak 5' bias of IES distribution in Tetmemena should be shown (either as an additional panel in Fig 3 or in a Sup Figure.

      Thank you, we added it as Figure S2C.

      p. 24 2nd paragraph: "a" is missing in "Trinity, which is a software..."

      Thank you, we made the correction.

      CROSS-CONSULTATION COMMENTS

      I agree with most comments of reviewer 3.

      The authors have actually defined "TE" in the introduction (p. 6). Depending on the journal's rules for abbreviation use, it may not be necessary to define it again in the results section

      Reviewer #1 (Significance (Required)):

      Ciliates are unicellular models to study developmentally programmed genome rearrangements at the mechanistic, genome-wide and evolutionary levels. These aspects have so far mostly been addressed in three species: P. tetraurelia and Tetrahymena thermophila on the one hand, the spirotrichous ciliate O. trifallax on the other.

      One new piece of information that can be found in the present manuscript is the assembly and annotation of the germline genome of two novel species: Tetmemena sp, closely related to Oxytricha, and the more distant E. woodruffi. Feng et al establish that, similar to other ciliates, Tetmemena and Euplotes eliminate TEs and other germline-specific sequences during programmed genome rearrangements. They also undergo extensive gene unscrambling, which results in IES removal and MDS reordering to assemble coding sequences.

      A TE origin was discussed previously for Paramecium (Arnaiz et al PLoS Genet; Sellis et al 2021 PLoS Biol) and Tetrahymena IESs (Hamilton et al 2016 eLife). While this may also hold true in spirotrichous ciliatesThe present manuscript proposes a completely new evolutionary scenario for IESs from scrambled genes. Here, Feng et al establish that scrambled genes of spirotrichous ciliates tend to be associated with local paralogy. They provide evidence supporting that IESs from scrambled genes may have evolved from paralogous MDSs.

      Although I am more an expert in the molecular mechanisms involved in genome rearrangements, I feel that the work reported here should draw the attention of a broader audience interested in genome dynamics and evolution, beyond the specific field of spirotrichous ciliate biology.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Feng et al. provide a solid analysis of the evolution of genome rearrangement in spirotrich ciliates. The authors applied a variety of state-of-the-art sequencing and bioinformatic methods to investigate the intriguing and extremely complex patterns of genome architecture in this protist lineage. Methods (including statistical analyses) are adequate and explained in detail. Results and discussions reflect careful, clever analysis of the data and excellent linkage with the literature. Figures and tables complement the text in a compelling way. I have only minor suggestions:

      Summary: more gradually introduce Spirotrichea and the phylogenetic relationship among the three species analyzed. This would better position the reader to understand the evolutionary context you are working in. Also, it would be helpful to more clearly differentiate novel vs. existing data. A suggestion: "This study focuses on three spirotrich species: two in the family Oxytrichidae (Oxytricha trifallax and Tetmemena sp) and Euplotes woodruffi as an outgroup. To complement existing data, we sequenced, assembled and annotated the germiline and somatic genomes of E. woodruffi and the germline genome of Tetmemena sp."

      Thank you, we clarified the summary (abstract).

      Introduction, first paragraph: Replace "The species in this study..." for a more precise statement, such as "The three spirotrich species studied here..."

      Thank you, we have made this statement more precise.

      p. 4: This sentence is unclear: "These useful tools provide partial insight to guide selection of species for full genome sequencing, which allows construction of complete rearrangement maps of a MIC genome onto a MAC genome for a reference species."

      Thank you, we have clarified this sentence.

      p. 8: define TE on first mention.

      Defined on page 6.

      Table 1. Indicate which MIC and MAC data are from this study.

      References are included for published data and a note has been added to indicate data from this study.

      Reviewer #3 (Significance (Required)):

      The present work represents a significant advance in the field of evolutionary genomics. The focus of the paper is on ciliates, an ancient (2 billion-year old) and highly diverse eukaryotic phylum that presents many peculiarities, including sex, nuclear dimorphism, genome rearrangement, high numbers of paralogs and transposons, etc. While some data exist on a few model ciliates of disparate phylogenetic position, this work focuses on two species taxonomically placed in the same family, plus a more distant outgroup within the same class. This gives a novel dimension to this study, that goes beyond exploring genome architecture in a single clade. Instead, it allows to explore evolutionary trends in genome rearrangement among relatively closely related species. This paper should be of high interest not only for ciliate biologists (like me), but also in relation to comparative genomics of protists/eukaryotes and germ-soma biology. I highly recommend publication.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Ciliates extensively rearrange their somatic genome every time a new somatic nucleus develops from the zygotic germline nucleus. In this manuscript, Feng et al report the sequencing, assembly and annotation of the germline and somatic genomes of Euplotes woodruffi and the germline genome of Tetmemena sp. (whose somatic genome was sequenced and assembled by the same lab in 2015). They present a comparative analysis of developmentally programmed genome rearrangements in these two species and in the model ciliate Oxytricha trifallax. Their major findings are that:

      (1) E. woodruffi and Tetmemena sp. eliminate a smaller fraction of their germline genome (~54%) from their somatic macronucleus (MAC) than O. trifallax (>80%)

      (2) Transposable elements (TE) represent a smaller fraction of the germline genome (~2%) in the first two ciliates than in O. trifallax (~15%). TEs are mainly located at the boundaries of germline chromosomes and in intergenic regions, but can also be found inside IESs

      (3) Several thousands of genes are scrambled in the germline genome of all three species

      The authors have also addressed the possible origin of gene scrambling. They report an interesting association with local paralogy and propose a model for the emergence of the odd-even pattern of gene unscrambling between two paralogous copies.

      Major comments:

      (1) Based on the statistics presented in Table 1, genome assemblies are of good quality, with a reasonable N50 size of germline (MIC) contigs. It seems, however, that no entire MIC chromosome could be assembled, since no two-telomere contig is mentioned in the list. As proposed by the authors (p.7) the presence of numerous TEs at the boundaries of MIC contigs (Fig S1) may have hindered the assembly of MIC chromosome ends. I would have appreciated to have more information on the "other repeats" (which seem to differ from tandem repeats according to Fig 2) and their location along MIC contigs.

      (2) The definition of "Internal Eliminated Sequences" (IES) is not clear. The authors make a distinction between IESs and TEs. I understand that IESs are DNA segments that separate two macronuclear-destined sequences (MDS) in the germline genome. Thus they appear to be restricted to those regions that eventually yield gene-sized MAC chromosomes. IESs are eliminated between two pointers that may not be identical on both sides in case of scrambled genes. Some clarification is needed here.

      To illustrate my point: I found the statement "with many TE insertions within IESs, suggesting that TE insertions may have generated IESs" particularly confusing (p. 9 lines 5-6). Does this mean that IESs extend beyond the ends of inserted TEs? The legend of Fig S1 should also be clarified.

      (3) p. 10 lines 2-4 and Fig S2: Could the authors explain the difference they make between MDS (in the text) and CDS (in Fig S2)? My understanding is that a CDS is the entire gene coding sequence and may be made of multiple MDSs. If this is correct, the sentence should read "We compared the number of MDSs between single-copy orthologs for single-gene MAC chromosomes across the three species and found that the orthologs have similar CDS lengths".

      (4) p. 12 lines 10-15: the discovery that paralogous MDSs can be found in scrambled genomic loci is interesting. If the two paralogs can be distinguished based on the number of substitutions, it would be informative to go back to individual reads and check whether each of the two copies can be incorporated in the unscrambled CDS (and at which frequency). Would the pointers be compatible with this?

      (5) The hypothesis that odd-even scrambled loci have evolved from paralogous genes in E. woodruffi is supported by the existence of paralogous MDSs, length conservation of MDS/IES pairs and sequence similarity between corresponding MDS and IES in a pair. The correlations presented for Oxytricha and Tetmemena are much less convincing (Fig S5D and E). I recommend that the authors are even more cautious in their statement on p.13 ("For Oxytricha and Tememena, the MDS and IES lengths for such MDS/IES pairs also correlate positively, but more moderately").

      (6) p. 15 last paragraph: Why did the authors focus only on TBEs inserted in non-scrambled IESs to look for orthologous TBE insertions? Is there a reason to believe that no recent TBE insertion occurred at other genomic loci? Or was it only for practical reasons? It is also not clear to me whether the authors have considered full-length TBEs or the presence of at least one TBE ORF.

      (7) p. 16: the authors report that some introns of E. woodruffi map "near" Oxytricha/Tetmemena pointers. How near? Based on the information provided by the authors, I don't think this observation necessarily implies that IESs were converted to introns (or reciprocally) during evolution. If this were true, shouldn't at least one intron boundary coincide exactly with a pointer? The authors should clarify this (also in the discussion, on p. 20, top paragraph).

      (8) p. 19 2nd paragraph: the suggested mechanism explaining the 5' bias of IESs in E. woodruffi genes is unclear. How could germline recombination take place between a MIC chromosome and a MAC reverse transcript or nanochromosome? This would imply that DNA could be imported in the MIC. Is there evidence that this might occur?

      (9) According to Figure 1, no scrambled genes have been reported in Paramecium tetraurelia. Within the frame of the proposed model, this is somewhat unexpected because this ciliate went through several whole genome duplications during evolution and harbors many paralogous gene pairs. Is there a reason why no gene scrambling took place in Paramecium?

      Minor comments:

      • p. 4 (4th bottom line): To my knowledge, ref #28 presents a draft (incomplete) MIC assembly of the Paramecium genome.

      • p. 7 (last paragraph): "encoding" should be replaced by "carrying"

      • p. 10 (2nd paragraph): insert a missing "o" into "nanochromosomes"

      • p. 10 (same paragraph): the weak 5' bias of IES distribution in Tetmemena should be shown (either as an additional panel in Fig 3 or in a Sup Figure.

      • p. 24 2nd paragraph: "a" is missing in "Trinity, which is a software..."

      CROSS-CONSULTATION COMMENTS

      I agree with most comments of reviewer 3.

      The authors have actually defined "TE" in the introduction (p. 6). Depending on the journal's rules for abbreviation use, it may not be necessary to define it again in the results section

      Significance

      • Ciliates are unicellular models to study developmentally programmed genome rearrangements at the mechanistic, genome-wide and evolutionary levels. These aspects have so far mostly been addressed in three species: P. tetraurelia and Tetrahymena thermophila on the one hand, the spirotrichous ciliate O. trifallax on the other.

      • One new piece of information that can be found in the present manuscript is the assembly and annotation of the germline genome of two novel species: Tetmemena sp, closely related to Oxytricha, and the more distant E. woodruffi. Feng et al establish that, similar to other ciliates, Tetmemena and Euplotes eliminate TEs and other germline-specific sequences during programmed genome rearrangements. They also undergo extensive gene unscrambling, which results in IES removal and MDS reordering to assemble coding sequences.

      • A TE origin was discussed previously for Paramecium (Arnaiz et al PLoS Genet; Sellis et al 2021 PLoS Biol) and Tetrahymena IESs (Hamilton et al 2016 eLife). While this may also hold true in spirotrichous ciliatesThe present manuscript proposes a completely new evolutionary scenario for IESs from scrambled genes. Here, Feng et al establish that scrambled genes of spirotrichous ciliates tend to be associated with local paralogy. They provide evidence supporting that IESs from scrambled genes may have evolved from paralogous MDSs.

      • Although I am more an expert in the molecular mechanisms involved in genome rearrangements, I feel that the work reported here should draw the attention of a broader audience interested in genome dynamics and evolution, beyond the specific field of spirotrichous ciliate biology.

    1. Author Response

      Reviewer #1 (Public Review):

      1) While the authors identify the suppressors in known genetic interactors (GIs) of the yeast SEC53, it is worth testing if the compensatory mutations are rewiring the GIs, thereby explaining the lack of comparable compensations observed in reconstituted strains. If altered GIs explain the suppression, then while yeast serves as an excellent tool to perform these assays, the human context of the disease may require a different set of genetic suppressors and, therefore, a different target than the yeast PGM1 ortholog.

      Our data show that pgm1 mutations alone greatly improve growth of sec53-V238M strains. Our data also indicate other pathways of compensation. Whether each of these compensatory mechanisms translate to humans is unknown. However, the observed enrichment of compensatory mutations in genes whose human homologs are associated with Type 1 CDG, suggests that many of these genetic interactions are likely to be conserved.

      Also, are Sec53 and Pgm1 proteins directly interacting in yeast and whether these mutations are on the interaction interface?

      As we mention above, there is no support for a direct physical interaction between Sec53 and Pgm1.

      2) Based on the data obtained between pACT1 and pSEC53-driven expression of the SEC53 mutant alleles, the pattern of suppressors appears to be different. Authors report that the variants expressed from strong pACT1 promoters show more suppressors than those driven by native promoters. Is this a general trend in experimental evolution that slower-growing strains tend to show lesser suppressors? For example, on Page 6, line 154, "compensating for Sec53-F126L dimerization defects are rare or not easily accessible". The statement suggests that the authors did obtain suppressors that compensate for the dimerization defect. At the same time, while rare (also, are authors suggesting suppression of dimerization defect as in better dimerization?), the rate of obtaining suppressors seems to be linked to the severity of the fitness defects of the strains. The lack of suppressors may be a limitation of the evolution experiments. Indeed later in the manuscript, the authors noticed that while PGM1 suppressors obtained in V238M can also suppress F126L alleles, the suppression was not as efficient. Could it be that evolution experiments in slower-growing strains predominantly enrich suppressors in other pathways (i.e., not in the CDG orthologs) that restore the growth better and compete out the relatively weaker suppressors in PGM1? In fact, the authors report similar effects on Page 7, lines 204-210. These two paragraphs are contradictory and should be explained further.

      All of our sequencing was performed on strains with sec53 under the control of the pACT1 promoter. While we did not identify unique sec53-F126L suppressors, we cannot exclude that sec53-F126L suppressors exist, so we describe them as “rare or not easily accessible”. While it is possible that the slower growth rate of the sec53-F126L allele could impact the likelihood of observing suppressors, we think it is more likely due to the nature of the variant (dimerization defect versus stability defect) rather than growth rate. In other laboratory evolution experiments the same beneficial mutation typically has a greater effect in slower-growing backgrounds (for example: doi.org/10.1126/science.1250939).

      3) Authors report that the LOF of PGM1 compensates for the SEC53 mutations. However, the evolution experiments did not capture any LOFs in PGM1. The fitness comparisons in evolution experiments are different as many different genotypes compete in a mix. Therefore, the fitness assays in a clonal population may not represent these differences well. To test this argument, authors can try to mimic the evolution experiments by mixing two genotypes to check competitive fitness, like the co-culture of pgm1 suppressor obtained via evolution experiments with pgm1Δ.

      Though we did not perform a direct head-to-head competition between a pgm1 suppressor and a pgm1Δ, our data suggest that the pgm1 delete would outcompete some of the lower-fitness suppressors. In the Discussion we speculate as to why we do not see deletion mutations: “Given that most of the evolved clones containing pgm1 mutations are more fit than the reconstructed strains, it is possible that other evolved mutations interact epistatically only with non-loss-of-function pgm1 mutations.”. Though it is beyond the scope of the present manuscript, it would be possible to rerun the evolution experiment in sec53-V238M strains carrying either a pgm1 missense suppressor or a pgm1Δ. Under the hypothesis of additional interacting loci, only the pgm1 missense suppressors would be more likely to acquire additional compensatory mutations.

      Reviewer #3 (Public Review):

      Vignogna et al. used yeast genetics, experimental evolution and biochemistry to tackle human congenital disorders of glycosylation (CDG), a disease mostly caused by mutations in PMM2. They took advantage of the observation that the budding yeast gene SEC53 is almost identical to human PMM2, and used experimental evolution to find interactors of SEC53/PMM2. They found an overrepresentation of mutations in genes corresponding to other human CDG genes, including PGM1. Genetic and biochemical characterizations of the pgm1 mutations were carried out. This work is solid, although authors did not reveal why reduction of pgm1 activity could compensate for defects of a particular mutant allele of sec53.

      Out of curiosity, if the authors were to simply focus on the preexisting mutations, would they have gotten the materials for most of the experiments in this article? In other words, how important is the experimental evolution?

      The evolution experiment was crucial as the specific pgm1 mutations we identified here have not been reported elsewhere, nor have the orthologous mutations been identified in human PGM1.

      A strain table with full genotypes is needed.

      We added a strain genotype table (Supplemental Dataset 2).

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01541

      Corresponding author(s): Hubert Hilbi

      1. General Statements

      Upon infection of eukaryotic host cells, Legionella pneumophila forms a unique compartment, the Legionella-containing vacuole (LCV). While the role of vesicle trafficking pathways for LCV formation has been quite extensively studied, the role of putative membrane contact sites (MCS) between the LCV and the ER has been barely addressed. In our study, we provide a comprehensive analysis of the localization and function of protein and lipid components of LCV-ER MCS in the genetically tractable amoeba Dictyostelium discoideum.

      We would like to thank the 3 reviewers for their thorough and constructive reviews. Overall, the reviewers state that the study is of interest to researchers in the field of Legionella and other intracellular pathogens (Reviewer 2), as well as to cell biologists (Reviewer 3). Reviewer 1 does not ask for additional experiments but is critical about the overall structure of the manuscript and the proteomics approach. As requested by the reviewer, we have substantially restructured the revised manuscript, now clearly outline the hypotheses put forward in the study and streamlined the proteomics data. Reviewer 2 asks for additional experiments to support our model of LCV-ER MCS. In the revised manuscript, we have included additional experiments addressing lipid exchange at the MCS, and we plan to perform further co-localization experiments. Reviewer 3 appreciates the comprehensive LCV proteomics and asks for only minor revisions, which we have incorporated in the revised version of the manuscript. We include below a point-by-point response to all the comments made by the reviewers.

      2. Description of the planned revisions

      Reviewer #2

      Major comment

      1) MCS contain protein complexes or a group of proteins, but the proteins here are studied in isolation and do not support the model shown in Figure 7. Co-localization studies of the putative LCV-ER MCS proteins are critical, especially given that the authors hypothesize the proteins are working together to modulate PI(4)P levels.

      Response: As suggested by the reviewer, we will perform additional co-localization experiments with MCS components. To this end, we will construct mCherry-Vap, and we will co-transfect the parental D. discoideum strain Ax3 with plasmids producing mCherry-Vap and OSBP8-GFP or GFP-OSBP11. Using these dually fluorescence labelled D. discoideum strains, the co-localization of Vap with the OSBPs will be assessed at 1, 2, and 8 h post infection. The data will be presented as fluorescence micrographs, and co-localization of Vap with the OSBPs will be quantified using Pearson’s correlation coefficient and fluorescence intensity profiles. The data will be outlined in the text (l. 258 ff.) and shown in the new Fig. 2 and__ Fig. S4__.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Reviewer #1 (Evidence, reproducibility and clarity):

      In the manuscript by Vormittag, et al., the authors perform proteomics identification of proteins associated with the Legionella-containing vacuole (LCV) in the model amoeba Dictyostelium discoideum comparing WT to atlastin knockout mutants. The authors find approximately half the D. discoideum proteome associated with the LCV, but there was enrichment of some proteins on the WT relative to the mutant. They focus on proteins involved in forming membrane contact sites (MCS) that previously were shown to be important for expansion of the Chlamydia-containing vacuole. Most significant are the oxysterol binding proteins (OSBP) and VapA (similar to that seen in Chlamydia). The authors show differential association of these proteins with either the LCV or presumably the ER associated with the LCV. Using a linear scale over 8 days, they show that mutations in some of the MCS reduce yields in two of the OSPB knockout mutants and the growth rate of the vap mutant is slowed but ultimate yield is increased. Using some nice microscopy techniques, they measure LCV size, and the osbK mutant appears particular small relative to other strains, whereas the osbH mutant generates large vacuoles. This doesn't necessarily correlate with the PI4P quantities on the vacuoles (which is higher in all of them), but I am not totally sure how this is measured, and whether is it PI4P/pixel or PI4P/LCV. In all cases, this was reduced by Sac1 mutation. Surprisingly, even though there was uniform increase in PI4P in each of the mutants, loss of PI4P only affects localization of some of the proteins. Finally, in what seems to be a peripherally related experiment, the authors show that a pair of Legionella translocated effectors are required to maintain PI4P levels, although it is not clear how this is related to the other data in the manuscript.

      It is not clear from the manuscript if the authors are just cataloging things or trying to test a hypothesis. This is an extremely difficult manuscript to read and reconstruct what the authors showed. I really think that the only people who will understand what is written are people who are familiar with the work in Chlamydia starting in 2011 in Engel's and Derre's laboratories, which clearly showed that MCS and most specifically Vap/OSBPs are involved in vacuole expansion. If the authors could rewrite the manuscript along these lines, perhaps comparing their data to the Chlamydia data it would help a lot. Otherwise, I don't think anyone else will understand why they are focusing on these things. I don't recommend new experiments (although re-analyzing data is necessary), but the manuscript has to be taken apart and claims removed, and data be interpreted properly. Otherwise, the manuscript seems like just a clearing house for data.

      Response: Thank you for the concise summary of our data and pointing out the need to restructure the manuscript and to clearly outline the hypotheses underlying the study. According to the reviewer’s suggestions, we have now re-structured the manuscript. In the revised manuscript the story unfolds from the observation that the ER tightly associates with (isolated) LCVs, and the proteomics approach is used as a validation of the presence of MCS proteins at the LCV-ER MCS.

      As suggested by the reviewer, we now highlight the seminal work on Chlamydia by the Engel and Derré laboratories not in the Discussion section (as in the original version of the manuscript) but already in the Introduction section (l. 142-148). We believe that it makes a stronger case to start out an analysis of LCV-ER MCS with a Legionella-specific cell biological finding (LCV-ER association) and an unbiased proteomics approach, as compared to a more derivative and defensive approach starting out with what is known about Chlamydia.

      The reviewer’s comment “This is an extremely difficult manuscript to read” appears overly harsh and conflicts with the positive evaluation of Reviewer #2 and Reviewer #3. Finally, we respectfully disagree with the reviewer’s statement that experiments characterizing L. pneumophila effectors implicated in the formation and function of LCV-ER MCS are peripheral. These experiments significantly contribute to a mechanistic understanding of how L. pneumophila forms and exploits LCV-ER MCS, and they are central for studies on pathogen-host interactions. The studies are analogous to the work on Chlamydia effectors by the Engel and Derré laboratories, but the mode of action of Legionella and Chlamydia effectors is obviously different. Another important distinction of our work to the studies on Chlamydia is the use of the genetically tractable amoeba, D. discoideum, which allows an analysis of LCV-ER MCS by fluorescence microscopy at high spatial resolution.

      Specific comments

      1. The problems start with the first figure, in which the authors state that almost half the D. discoideum proteome is LCV-associated. I doubt that this is correct, and they should base this on some selective criterion. Furthermore in Fig. 1A, they show Venn diagrams for how they whittled this down, but the Supplemental Dataset gives us no clue on how this was done. I can only sit down myself with the dataset and try to figure that out, but that is an unreasonable expectation for the reader. The dataset provided should have a series of sheets, describing how the large protein set was whittled down and how they were sorted, so the reader can evaluate how robust the final results were. To me (at least), if they said: "look we got this surprising result that suggests MCS are involved in promoting LCV formation, and although this is well recognized in Chlamydia but poorly recognized in Legionella", that would be satisfactory to me.

      Response: According to the reviewer’s suggestions, we have now thoroughly re-structured the manuscript. In the revised manuscript the story unfolds from the observation that the ER tightly associates with LCVs in infected cells and with isolated LCVs. The proteomics approach is now used as a validation of the presence of MCS proteins at the LCV-ER MCS and relegated to the Supplementary Information section (former Fig. 1, now Fig. S3).

      For the proteomics analysis, all protein identifications have been filtered for robustness applying a constant FDR (false discovery rates) of protein and PSM (peptide spectrum match) of 0.01, which is a commonly accepted threshold in the field. Moreover, two identified unique peptides were required for protein identification. The parallel application of both filter criteria results in very robust and reliable data sets. This is outlined in the Material and Methods section (l. 683-693).

      In the data set of LCV-associated proteins, 2,434 D. discoideum proteins have been identified (Table S1). This is 18.5% of the total of 13,126 predicted D. discoideum proteins (UniprotKB) and considerably less than “almost half the D. discoideum proteome”, as stated by the reviewer. Moreover, 1,224 L. pneumophila proteins have been identified (among 3,024 predicted L. pneumophila proteins in the database). This is a reasonable number of proteins identified from an intracellular vacuolar pathogen, given the LCV isolation and proteomics methods applied. We now outline these findings more extensively in the Results section (l. 207-213). Moreover, to render Table S1 more reader-friendly, we added to the datasheet “All data” the datasheets “Dictyostelium”, “Legionella” and “Info”.

      The Venn diagram in Fig. S3A (previously Fig. 1A) does not show a subset of proteins “whittled down” from the entire proteomes, but simply summarizes LCV-associated proteins, which were either identified exclusively in the parental strain Ax3 but not in the Δsey1 mutant strain, or only in Δsey1 but not in Ax3, thus identifying possible candidates relevant for the LCV-ER MCS. This information is now outlined more clearly in the text (l. 238-241). Moreover, we now explicitly define in the Material and Methods section (l. 697-704) the “on” and “off” proteins shown in Fig. S3A.

      The overall rational for the comparative proteomics approach was our previous finding that compared to the D. discoideum parental strain Ax3, the Δsey1 mutant strain accumulates less ER around LCVs (PMID: 28835546, 33583106). This finding suggests that formation of the LCV-ER MCS might be compromised in the Δsey1 mutant strain. This hypothesis is now outlined at the beginning of the Results paragraph (l. 204-207).

      I am clueless regarding how Fig. 6 fits with the rest of the manuscript. If this is about MCS, there is no demonstration these effectors are directly involved in MCS other than the somewhat diffuse argument that there is some correlative connection to PI4P levels, that I am not particularly convinced by.

      Response: The PtdIns(4)P gradient between two different cellular membranes is an intrinsic feature of MCS. To date, a quantification of PtdIns(4)P levels on LCVs in response to the presence or absence of specific L. pneumophila effectors is lacking. Accordingly, we opted for quantifying the PtdIns(4)P levels on LCVs in presence and absence of an L. pneumophila effector putatively generating PtdIns(4)P on LCVs, the phosphoinositide 4-kinase LepB, or titrating PtdIns(4)P on LCVs, the PtdIns(4)P-binding ubiquitin ligase SidC. To address the concerns of Reviewer 1 and Reviewer 3 (see below), we now outline in detail the rational to assess the role of LepB and SidC for MCS function (l. 385-387). Importantly, we now also provide data that at LCV-ER MCS PtdIns(4)P/cholesterol lipid exchange is functionally important (new Fig. 6 and Fig. S10). In the revised version of the manuscript, this new data is preceding the experiments with the L. pneumophila effectors, which should render our choice of effectors more comprehensible to the reader and increase the flow of the manuscript.

      Line 146 and associated paragraph. We don't need a catalog of proteins in narrative. There is more detail in the narrative than there is in the tables and figures, which would be a more appropriate way to present the data.

      Response: As suggested by the reviewer, we summarized the LCV-associated D. discoideum proteins and considerably reduced the list in the text (l. 214-230).

      Line 186. There is nothing wrong with pursuing MCS based on the idea that this was seen before with Chlamydia and you wanted to test if this was a previously unappreciated aspect of Legionella biology. I don't see the rationale based on the proteomics, partly because I don't understand how the proteomics dataset was parsed.

      Response: As suggested by the reviewer, we thoroughly re-structured the manuscript and now highlight the seminal work on Chlamydia by the Engel and Derré laboratories already in the Introduction section (not in the Discussion section as in the original version of the manuscript). We believe that it makes a stronger case to start out an analysis of LCV-ER MCS with a Legionella-specific cell biological finding (LCV-ER association) and an unbiased proteomics approach, as compared to a more derivative and defensive approach starting out with what is known about Chlamydia.

      Figure 3: These growth curves are super-weird. I am not used to looking at 8 days of logarithmic growth in a linear scale and seeing no (apparent) growth for 4 days. Considering all the microscopy data are performed in the first 18 hrs of infection, it’s hard to see how this is related to data at 8 days post infection. If this were plotted in logarithmic scale, as microbiologists are used to doing, then perhaps we could see a connection. Also, in some cases, it might be helpful to calculate a growth rate, because it’s possible the author may now see some effects by comparing logarithmic growth rates.

      Response: We have been characterizing growth of L. pneumophila in D. discoideum in several studies using growth curves with RFU vs. time plotted in linear scale (e.g., Finsel et al., 2013, Cell Host Microbe 14:38; Rothmeier et al., 2013, PloS Pathog 9: e1003598; Swart et al., 2020, mBio 11: e00405-20). The D. discoideum-L. pneumophila infection model is peculiar, since the amoebae do not survive temperatures beyond 26 degC. This is substantially below the optimal growth temperature of L. pneumophila (35-40 degC). This means that - due to the many genetic tools available - D. discoideum is an excellent model to investigate cell biological aspects of the infection at early time points (ca. 1-18 h p.i.), but the amoebae are not an optimal system to quantify (several rounds) of intracellular growth.

      Figure 2: The images don't necessarily show what the bar graphs show. In particular, look at Osp8. That image doesn't make sense to me.

      Response: The individual channels of the merged images in Fig. 1 (formerly Fig. 2) are shown in Fig. S2. By looking at the individual channels, it becomes clear that OSBP8-GFP co-localizes with calnexin-mCherry (overlapping signals), but not with P4C-mCherry or AmtA-mCherry (adjacent signals). Co-localization was quantified in a non-biased manner by Pearson’s correlation coefficient. To further visualize co-localization, we now also provide fluorescence intensity profiles for all confocal micrographs (amended Fig. 1).

      In summary, I think the authors hit on something that is probably important for Legionella biology, but it’s not clear what they want to show. They are very invested in connecting everything to PI4P levels, which may or may not be correct, but it seems to me that perhaps taking more care in showing the importance of the Vap/OSPB nexus in supporting Legionella growth should be the first priority.

      Response: Given the importance of the PtdIns(4)P gradient for lipid exchange at MCS, we believe it is justified to put considerable emphasis on this lipid. To further substantiate a functional role of PtdIns(4)P at LCV-ER MCS, we now also show that an increase in PtdIns(4)P at the LCV correlates with a decrease of cholesterol (new Fig. 6 and Fig. S10). The inverse correlation of these two lipids is in agreement with the notion that cholesterol is a counter lipid of PtdIns(4)P at LCV-ER MCS.

      It is not clear from the manuscript if the authors are just cataloging things or trying to test a hypothesis.

      Response: In the revised version of the manuscript, we put forward several specific hypotheses, which we then tested in our study (l. 152-155).

      If I understand Fig. 1, only one of the candidates (VapA) was verified as being more enriched in WT relative to atlastin mutants. This argues even more strongly that the authors have to describe their criteria for choosing these candidates.

      Response: As outlined above (specific point 1), we have now re-structured the manuscript according to the reviewer’s suggestions. In the revised manuscript the story unfolds from the observation that the ER tightly associates with LCVs in infected cells and with isolated LCVs. The proteomics approach is now used as a validation of the presence of MCS proteins at the LCV-ER MCS and relegated to the Supplementary Information section (formerly Fig. 1, now Fig. S3). We consider the proteomics approach a powerful hypothesis generator, and the experimental identification of several MCS proteins by proteomics validated the cell biological and bioinformatics insights.

      Reviewer #1 (Significance (Required)):

      As stated above, the manuscript can't decide if it’s about MCS or PI4P, and I would argue strongly that the emphasis on PI4P detracts from the manuscript, as well as its inability to draw connection to previous work that is likely to be important.

      Response: We respectfully disagree with the reviewer on this important point and hold that proteins as well as lipids are crucial functional determinants of MCS. The PtdIns(4)P gradient is a pivotal process for lipid exchange at MCS. Therefore, we believe it is justified to put considerable emphasis on this lipid. In the Introduction section, we now specify several hypotheses on the localization and function of lipids and proteins at LCV-ER MCS (l. 152-155). Moreover, we now also refer to the previous work on Chlamydia MCS in the Introduction section (l. 142-148).

      Reviewer #2 (Evidence, reproducibility and clarity):

      Summary of paper and major findings

      Membrane contact sites (MCS) are locations where two membranes are in close proximity (10-80nm). MCS have a defined protein composition which tether the membranes together and function in small molecule and lipid exchange. Typically, MCS proteins contain structural (e.g., tethers) and functional (e.g., exchange lipids) proteins, in addition to proteins which regulate the structure and function of the MCS. In this manuscript, Vormittag et al describe protein components of MCS between the Legionella-containing vacuole (LCV) and the host endoplasmic reticulum (ER) in the amoeba Dictyostelium. Proteomics of isolated LCVs followed by microscopy analysis identified several proteins which localize to either the LCV-associated ER (OSBP8), the LCV (OSBP11), or both (VAP and Sac1). The mammalian homologs of these proteins have been shown to play important roles in ER MCS, with VAP serving a structural role, Sac1 a PI(4P) phosphatase regulating PI(4)P levels, and OSBP8 and OSBP11 lipid transferring proteins. Given the importance of PI(4)P in formation and maintenance of the Legionella-containing vacuole, the authors used dicty mutants to determine the importance of these proteins in bacterial growth, LCV size, and PI(4)P levels on the LCV. While VAP and OSBP11 appear to promote Legionella infection, OSBP8 appears to restriction infection, although all identified MCS components appear to play a role in decreasing PI(4P) shortly after infection. Finally, VAP and OSBP8 localization to the LCV is PI(4)P-dependent. Overall, the authors conclude that these MCS components play a role in modulating PI(4)P levels on the LCV.

      Overall, this is an interesting study further exploring the role of PI(4)P in LCV-ER interactions, and how PI(4)P levels are regulated. The figures are clearly presented, there is an impressive amount of data, and rigor appears to be strong with appropriate replicates and statistical analysis. The phenotypes are often mild, but the authors are careful to not overinterpret the data. While this is an interesting study, additional experiments are necessary to support the overall model and the text needs to put the findings into the larger context.

      Response: We would like to thank the reviewer for this positive and constructive assessment. We performed and planned additional experiments to further strengthen the study and support our model.

      Major comments

      1) MCS contain protein complexes or a group of proteins, but the proteins here are studied in isolation and do not support the model shown in Figure 7. Co-localization studies of the putative LCV-ER MCS proteins are critical, especially given that the authors hypothesize the proteins are working together to modulate PI(4)P levels.

      Response: To further explore the possible interactions between Vap and OSBP proteins, we plan co-localization experiments using D. discoideum strains producing mCherry-Vap and either OSBP8-GFP or GFP-OSBP11, as outlined above (Section 2, new__ Fig. 2__ and Fig. S4).

      Moreover, we included additional data on PtdIns(4)P/cholesterol lipid exchange (Fig. 6 __and Fig. S10__), which have been incorporated into the model (amended Fig. 8). Based on the available data, we do not postulate direct interactions between Vap and OSBP proteins. The previous model, which now has been amended, might have been misleading in that respect.

      2) The phenotypes are relatively mild, suggesting functional redundancy. Double knockouts, particularly in VAP and OSBP11, may generate a stronger phenotype that better supports the hypothesis and demonstrate the importance during infection.

      Response: Thank you for this interesting suggestion. Please see Section 4 below for our arguments, why we believe that this intriguing approach is beyond the scope of the current study.

      3) The timing of PI(4)P and MCS protein localization during infection is critical to understanding how MCS might be functioning. Based on Figure 6C, PI(4)P levels decrease on the LCV during infection, but this is not fully explained in the context of what's known in the literature and what is observed the previous figures. How does localization of different MCS components change during infection, and does this correlate with the changes in growth or LCV size? A better description in the Introduction on LCV-associated PI(4)P levels would be beneficial in orienting the reader to why PI(4)P levels are modulated.

      Response: As suggested by the reviewer, we added to the Introduction section more detail about the kinetics of PtdIns(4)P accumulation on LCVs (l. 65-71), and we discuss the limited spatial resolution of the IFC approach (formerly Fig. 6C, now Fig. 7C; l. 407-408). Importantly, we also provide new data showing that within 2 h p.i. an increase in PtdIns(4)P at the LCV coincides with a decrease of cholesterol (new Fig. 6 and Fig. S10). The new data is put into this context in the Discussion section (l. 449-454).

      4) OSW-1 has other targets besides OSBPs, and depleting Sac1 and Arf1 in A549 cells is not specifically targeting the MCS, as these proteins have other functions. The data in mammalian cells is not convincing and should be removed.

      Response: As suggested by the reviewer, we removed the data on depleting Sac1 in A549 cells (Fig. 3D, and Fig. S6BC). We propose to leave the pharmacological data on inhibition of L. pneumophila replication by OSW-1 in the manuscript, but to clearly point out that OSW-1 has other targets besides OSBPs (l. 297-299).

      Minor comments

      1) Figure 2 is missing details on number of experiments/replicates and statistical analysis.

      Response: Thank you for having noted this oversight. The number of independent experiments and statistical analysis have now been added to Fig. 1 (formerly Fig. 2) (l. 1009-1010).

      2) Can the authors hypothesize why VAP promotes growth early during infection, but appears to restrict growth at later timepoints (Figure 3A)?

      Response: Thank you for raising this intriguing point. The opposite effects on growth of Vap at early and later timepoints during infection might be explained by interactions with antagonistic OSBPs. Vap likely co-localizes with OSBP8 as well as with OSBP11 on the limiting LCV membrane or the ER, respectively (experiment to be performed; Fig. 2 and__ Fig. S4__). The absence of OSBP8 (ΔosbH) or OSBP11 (ΔosbK) causes larger or smaller LCVs, and increased or reduced intracellular replication of L. pneumophila, respectively. Thus, OSBP8 seems to restrict and OSBP 11 seems to promote intracellular replication. Accordingly, if Vap affects or interacts with OSBP11 early and with OSBP8 later during infection, opposite effects on growth of Vap might be explained. These reflections are now outlined in the Discussion section (l. 431-441).

      3) There is a large amount of data, which makes it difficult at times to follow. I suggest adding additional information to table 1, including LCV size and whether or not the protein's localization is PI(4)P-dependent.

      Response: Thank you for this suggestion. As proposed by the reviewer, we added the additional information to Table 1 (PtdIns(4)P-dependency of protein localization, LCV size).

      Reviewer #2 (Significance (Required)):

      Membrane contact sites during bacterial infection are a growing area of research. In Legionella, several papers point to the presence of MCS. Further, PI(4)P is known to be an important component on the LCV. This paper shows that MCS protein members are important in modulating LCV PI(4)P levels. The model as presented is not completely supported by the data as co-localization experiments are needed, along with more detailed analysis of how PI(4)P levels change over infection and the role of these MCS proteins in that process. This study will be of interest to those studying Legionella and other vacuolar pathogens. Area of expertise is on membrane contact sites and lipid biology.

      Response: Thank you very much for the overall positive and constructive evaluation.

      Reviewer #3 (Evidence, reproducibility and clarity):

      The authors perform proteomic analysis of Legionella-containing vacuoles. The observe association of membrane contact site (MCS) proteins including VAP, OSBPs, and Sac1. Functional data indicates that these proteins contribute to PI4P levels on LCVs and their ability to acquire lipid from the ER to enable LCV expansion/stability. Overall, the paper is an important contribution to the field and builds upon a growing appreciation for MCS in establishment of intracellular niches by microbial pathogens. I have only minor comments for the authors consideration.

      Response: We would like to thank the reviewer for this enthusiastic assessment.

      Minor comments:

      -line 145, "This approach revealed 3658 host or bacterial proteins identified on LCVs...". This number seems high... how does it compare to prior proteomic studies of pathogen-containing vacuoles?

      Response: As outlined above (reviewer 1, point 1), we have now changed the text (l. 207-213): “This approach revealed 2,434 LCV-associated D. discoideum proteins (Table S1), of a total of 13,126 predicted D. discoideum proteins (UniprotKB). Moreover, 1,224 L. pneumophila proteins were identified (among 3,024 predicted L. pneumophila proteins), which is a reasonable number of proteins identified from an intracellular bacterial pathogen within its vacuole with the proteomics methods applied (Herweg et al, 2015; Schmölders et al., 2017).”

      • line 160. Can the authors comment on why mitochondrial proteins are observed in their proteomic analysis? Are these non-specific background signals or reflecting relevant organelle contact?

      Response: The dynamics of mitochondrial interactions with LCVs and the effects of L. pneumophila infection on mitochondrial functions have been thoroughly analyzed (PMID: 28867389). This seminal work is now cited in the text (l. 227-230).

      • line 268. It is reported that LCVs are smaller with MCS disruption at 2 and 8 h p,i.. Does this also lead to instability or rupture of LCVs? And related to this why would LCVs be bigger at 16h with MCS disruption?

      Response: MCS components affect LCV size positively or negatively. E.g., the absence of OSBP8 (ΔosbH) or OSBP11 (ΔosbK) causes larger or smaller LCVs, and increased or reduced intracellular replication of L. pneumophila, respectively. However, as outlined in the Discussion section (l. 442-454), we believe that the relatively small size likely reflects a structural remodeling of the pathogen vacuole rather than a substantial LCV expansion. LCV rupture takes place only very late in the infection cycle (beyond 48 h) and is followed by lysis of the host amoeba (PMID: 34314090).

      • lines 288 and 299 "data not shown" this data should be included in a supplemental figure.

      Response: The data on the localization of GFP-Sac1 and GFP-Sac1_ΔTMD are included in the Figs. 1A, 4A, 5AD, S2A, S7A, and__ S9__ (l. 328, l. 339).

      • line 327. The authors choose to focus on the role of LepB and SidC in MCS modulation. The rationale for choosing these two amongst the ca 330 effectors was not given. Were other effectors also examined?

      Response: LepB and SidC were chosen due to their activities producing or titrating PtdIns(4)P, respectively, and their LCV localization. This rational is now given in the text (l. 385-387). No other effectors were examined up to this point.

      Reviewer #3 (Significance (Required)):

      Comprehensive LCV proteomics of interest to field of cellular microbiology. Studies of MCS broadly relevant to cell biologists.

      Response: Thank you very much for the overall very positive evaluation.

      4. Description of analyses that authors prefer not to carry out

      Reviewer #2

      Major comment

      2) The phenotypes are relatively mild, suggesting functional redundancy. Double knockouts, particularly in VAP and OSBP11, may generate a stronger phenotype that better supports the hypothesis and demonstrate the importance during infection.

      Response: Thank you for raising the important question of functional redundancy. We now outline this concept in the Discussion section (l. 427-429). A further analysis of the genetic and biochemical relationship between Vap and OSBP11 or OSBP8 are without doubt some of the most interesting aspects of further studies on the topic of LCV-ER MCS.

      The construction of a D. discoideum double mutant strain is time consuming and usually takes 1-2 months. Provided that a Vap/OSBP11 double deletion mutant strain is viable and can be generated, it takes another 1-2 months to thoroughly characterize the strain regarding intracellular replication of L. pneumophila (Fig. 3), LCV size (Fig. 4), and PtdIns(4)P score (Fig. 5). Moreover, there is already a large amount of data in the paper (to quote Reviewer #2), and therefore, adding new data might makes it even harder to follow the story and focus on the key points. Finally, we believe that the planned colocalization experiments (Reviewer #2, point 1) and the new data on lipid exchange kinetics (new Fig. 6 and Fig. S10) fit the current story more coherently, and thus, are more straightforward and informative than the generation and characterization of double mutant strains. For these reasons, we believe that the generation and characterization of D. discoideum double mutant strains is beyond the scope of the current study.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In the manuscript by Vormittag, et al., the authors perform proteomics identification of proteins associated with the Legionella-containing vacuole (LCV) in the model amoeba Dictyostelium discoideum comparing WT to atlastin knockout mutants. The authors find approximately half the D. discoideum proteome associated with the LCV, but there was enrichment of some proteins on the WT relative to the mutant. They focus on proteins involved in forming membrane contact sites (MCS) that previously were shown to be important for expansion of the Chlamydia-containing vacuole. Most significant are the oxysterol binding proteins (OSBP) and VapA (similar to that seen in Chlamydia). The authors show differential association of these proteins with either the LCV or presumably the ER associated with the LCV. Using a linear scale over 8days, they show that mutations in some of the MCS reduce yields in two of the OSPB knockout mutants and the growth rate of the vap mutant is slowed but ultimate yield is increased. Using some nice microscopy techniques, they measure LCV size, and the osbK mutant appears particular small relative to other strains, whereas the osbH mutant generates large vacuoles. This doesn't necessarily correlate with the PI4P quantities on the vacuoles (which is higher in all of them), but I am not totally sure how this is measured, and whether is it PI4P/pixel or PI4P/LCV. In all cases, this was reduces by Sac1 mutation. Surprisingly, even though there was uniform increase in PI4P in each of the mutants, loss of PI4P only affects localization of some of the proteins. Finally, in what seems to be a peripherally related experiment, the authors show that a pair of Legionella translocated effectors are required to maintain PIF4P levels, although it is not clear how this is related to the other data in the manuscript.

      It is not clear from the manuscript if the authors are just cataloging things or trying to test a hypothesis. This is an extremely difficult manuscript to read and reconstruct what the authors showed. I really think that the only people who will understand what is written are people who are familiar with the work in Chlamydia starting in 2011 in Engel's and Derre's laboratories, which clearly showed that MCS and most specifically Vap/OSBPs are involved in vacuole expansion. If the authors could rewrite the manuscript along these lines, perhaps comparing their data to the Chlamydia data it would help a log. Otherwise, I don't think anyone else will understand why they are focusing on these things. I don't recommend new experiments (although re-analyzing data is necessary), but the manuscript has to be taken apart and claims removed, and data be interpreted properly. Otherwise, the manuscript seems like just a clearing house for data.

      1. The problems start with the first figure, in which the authors state that almost half the D. discoideum proteome is LCV-associated. I doubt that this is correct, and they should base this on some selective criterion. Furthermore in Fig. 1A, they show Venn diagrams for how they whittled this down, but the Supplemental Dataset gives us no clue on how this was done. I can only sit down myself with the dataset and try to figure that out, but that is an unreasonable expectation for the reader. The dataset provided should have a series of sheets, describing how the large protein set was whittled down and how they were sorted, so the reader can evaluate how robust the final results were. To me (at least), if they said: "look we got this surprising result that suggests MCS are involved in promoting LCV formation, and although this is well recognized in Chlamydia but poorly recognized in Legionella", that would be satisfactory to me.
      2. I am clueless regarding how Fig. 6 fits with the rest of the manuscript. If this is about MCS, there is no demonstration these effectors are directly involved in MCS other than the somewhat diffuse argument that there is some correlative connection to PI4P levels, that I am not particularly convinced by.
      3. Lin 146 and associated paragraph. We don't need a catalog of proteins in narrative. There is more detail in the narrative than there is in the tables and figures, which would be a more appropriate way to present the data.
      4. Line 186. There is nothing wrong with pursuing MCS based on the idea that this was seen before with Chlamydia and you wanted to test if this was a previously unappreciated aspect of Legionella biology. I don't see the rationale based on the proteomics, partly because I don't understand how the proteomics dataset was parsed.
      5. Figure 3: These growth curves are super-weird. I am not used to looking at 8 days of logarithmic growth in a linear scale, and seeing no (apparent) growth for 4 days. Considering all the microscopy data are performed in the first 18 hrs of infection, its hard to see how this is related to data at 8 days post infection. If this were plotted in logarithmic scale, as microbiologists are used to doing, then perhaps we could see a connection. Also, in some cases, it might be helpful to calculate a growth rate, because its possible the author may now see some effects by comparing logarithmic growth rates.
      6. Figure 2: The images don't necessarily show what the bar graphs show. In particular, look at Osp8. That image doesn't make sense to me.

      In summary, I think the authors hit on something that is probably important for Legionella biology, but its not clear what they want to show. They are very invested in connecting everything to PI4P levels, which may or may not be correct, but it seems to me that perhaps taking more care in showing the importance of the Vap/OSPB nexus in supporting Legionella growth should be the first priority.

      It is not clear from the manuscript if the authors are just cataloging things or trying to test a hypothesis.

      If I understand Fig. 1, only one of the candidates (VapA) was verified as being more enriched in WT relative to atlastin mutants. This argues even more strongly that the authors have to describe their criteria for choosing these candidates

      Significance

      As stated above, the mansucript can't decide if its about MCS or PI4P, and I would argue strongly that the emphasis on PI4P detracts from the manuscript, as well as its inability to draw connection to previous work that is likely to be important.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This paper examines the formation and repair of micronuclei in non-cancerous cells, specifically in mouse embryonic fibroblasts. This work was performed completely in culture and used a combination of western blot, confocal and superresolution microscopy to assess the contents of micronuclei over a repair period of 5 hours after 2 hours of induction of double strand breaks by treatment with etoposide. The authors found that the bodies colocalised with LC3, Beclin 1 and lysosomes suggestive of autophagy. However no evidence of autophagic flux has been demonstrated.

      Major issues are as follows:

      Figure 2

      A - Any sense of the autophagic flux? LC3B - I and LC3B - II seem to be in equal quantities most of the time. Maybe using the tandem LC3 in this system could provide further insight. Also remove the violin plots from this graph and from G and H, as there are too few data points.

      Thank you for your comment. We have evidence of a functional autophagic flux, since we observed an increasing number of acidic vesicles stained with Lysotracker in response to DNA damage, which were reduced after DNA repair. Some of the micronuclei were also co-stained with Lysotracker, suggesting their lysosomal degradation. We reorganized the data in the revised figure 2A to communicate better these observations. We reproduce here the dynamic of Lysotracker stain, please notice an increase in the abundance of acidic vesicles after 2h of DNA damage. A further evidence of activation of functional autophagy is the dynamic intracellular distribution of both LC3 and BECN1, indicative of autophagy induction. Please notice in revised Figure 2A that LC3 surrounding vesicles increases after 2h of DNA damage and diminish when DNA is repaired. BECN1 in control MEFs is highly concentrated inside the nucleus, predominantly at the nucleolus, and after DNA damage it redistributes towards the cytoplasm. Finally after DNA repair, BECN1 appears highly concentrated at the nucleus again. These dynamic changes correlate with autophagosomes formation and successful fusion with lysosomes. In the revised manuscript we removed the violin plot as suggested. Since the elimination of nuclear components occurs in a subset of cells, the role of the autophagic machinery needs to be analyzed cell by cell. We considered better to eliminate also the Western blot, as an analysis of the whole population does not provide information relevant for this study.

      • Can you reduce the brightness in the merge image, as I cannot see DAPI nor a convincing Beclin-1/LC3 co-localisation.

      Thank you for the observation. We improved the quality of the images and reorganized Figure 2 to convincingly show BECN1 and LC3 co-localization, together with Lysotracker, in nuclear alterations (buds and micronuclei). We modified the results text accordingly.

      • Although the data is convincing, It would be clearer if the brightness of the merge image was reduced.

      Thank you for your comment. We improved the images shown, these data is now integrated in new Figure 2A.

      • Is the significant result the difference between 5h R Control si and 5h R Atg7? if so, there is no significant change in micronuclei as the same time point, can you explain this disconnect? are the buds being degraded prior to becoming micronuclei?

      That is correct, we found no statistical significant difference in the number of micronuclei formed silencing Atg7, although there was a trend to reduce them. To consolidate the role of autophagy in nuclear buds and micronuclei formation, we studied Atg4-/- MEFs. We confirmed a statistical significant reduction of buds formation when autophagy is impaired (new Figure 2G). However, we observed that the number of micronuclei increased after 2h of DNA damage in Atg4-/- MEFs, suggesting that autophagy does not contribute to micronuclei formation but elimination. Together, our results suggest that the origin of buds and micronuclei are mechanistically different. A difference in the biogenesis of buds and micronuclei has been previously suggested studying cells cultured under strong stress conditions that induce DNA amplification, as well as in cells under folic acid deficiency. While interstitial DNA without telomere was more prevalent in buds than in micronuclei, telomeric DNA was more frequently observed in micronuclei (Fennech et al. 2011, Mutagenesis 26:125-132). We agree with the reviewer, it seems that not all the buds become micronuclei.

      Figure 3 A - nice microscopy showing the co-localisation of TOP2A and LC3-GFP. I'm interested in DAPI being on some bodies and not others. Do you have any sense of the dynamics of this?

      Thank you for the interesting question. Since removal of nuclear alterations as nuclear buds and micronuclei is a very dynamic process, we detect nuclear damaged material in the cytoplasm are at different degradation stages. Nucleases could be degrading DNA in micronuclei. Another possibility to the lack of DAPI signal in some micronuclei containing TOP2A and GFP-LC·is that TOP2A could be expelled from the nucleus with undetectable fragments of DNA or even without DNA, as a renewal process. We believe that nuclear buds can form without extruding DNA in some cases, perhaps to modulate proteostasis in addition to protect genome stability. In the revised manuscript we discuss this possibility further.

      G - c shows a strand of mostly TOP2B coming from the nucleus. Is there any evidence that this occurs using either confocal microscopy or super resolution approaches. Could you try Z-stack to find these?

      Thank you for the suggestion, we analyzed Z-stack images and tried to observe it also by immunofluorescence. We could detect some tubular signal connecting the nucleolus with a micronucleus containing TOP2B and BECN1 (arrow head in Fig 3B reproduced below), although we cannot be certain we are detecting the same nuclear extrusion mechanism by Electron Microscopy than by immunofluorescence.

      Figure 4 C - is there a significant increase in FBL negative bodies, this would make sense if FBN is being degraded in the micronuclei during the repair process

      We found that the number of micronuclei without FBL increased with statistical significant difference by Two-way-ANOVA followed by Dunnett´s multiple comparison test (P=0.463 comparing cells with 2h of DNA damage with control cells; P=0.0017 comparing cells after 5h of DNA repair with untreated cells; n=5). We agree with the reviewer, a possible explanation is that FBL is being degraded in micronuclei during the repair process. Although it could also be possible that nucleolar is less sensitive to Etoposide poisoning, or that nucleolar DDR is mechanistically different.

      • Would it be possible to increase the n of these experiments to confirm either no change in FBL/LC3 co-loc, or evidence of increase?

      Thank you for the suggestion. We repeated the experiment two more times to increase the n to 5. We found no statistical difference in the number of nuclear buds or micronuclei containing both FBL and LC3 during DNA damage and repair. Therefore it seems that the release of nucleolar components is not enhanced by Etoposide-induced DSB, suggesting that nucleolar DDR is a unique response, independent of DDR elsewhere in the genome (reviewed in Nucleic Acids Research, 2020, Vol. 48, No. 17 9449–9461 doi: 10.1093/nar/gkaa713).

      Minor issues:

      Figure 4 and 5 legends are in a different font.

      Thank you. We correct the font in the current manuscript.

      Reviewer #1 (Significance (Required)):

      There is little specific data on the role of autophagy in clearing micronuclei in cancer cells, so this may be suggestive of a new mechanism that occur during normal cellular homeostasis. There are known links between lamin A defects and the formation of micronuclei, but not explicitly that the micronuclei are also Lamin A positive. it is likely that analogous processes occur in both cancer and non-cancer, so the impact of these data is not clear to me. This paper may be of interest to researchers interested in nuclear structure and DNA damage, but based on the data presented the significance is limited.

      The significance of the present work is to discover that autophagy is relevant both during physiological DNA damage and in response to an exogenous DNA damaging agent, to extrude damaged DNA, TOP2cc and Fibrillarin from the nucleus. This knowledge is relevant since insufficiencies on autophagy imply a risk of genomic instability, which in turn could drive the cell into a senescent or malignant state. We present data showing that autophagy regulates the dynamic formation and elimination of nuclear buds and micronuclei in a mechanistically differentiated way. While autophagy contributes to nuclear buds formation, it is necessary for micronuclei elimination. Our data suggest that nucleophagy could be also a mechanism to alleviate basal nucleolar stress. As the reviewer noticed, some micronuclei did not have DNA. It is conceivable then that nuclear buds and micronuclei form also for a proteostatic function, not necessarily involving DNA damage elimination. We believe the significance of our work contributes to our understanding of the cell, as well as to cancer research. Whether common mechanisms between cancerous and normal cells occur is relevant to know, to consider the specificity of potential therapeutic approaches.

      I don't have sufficient expertise to evaluate the super resolution microscopy beyond assessing the images.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Peer review of the manuscript with the number RC-2021-01181 by Muciño-Hernandez G et. al. at Review Commons and with the tittle "Nucleophagy contributes to genome stability 1 though TOP2cc and nucleolar components degradation"

      1. Summary Muciño-Hernandez G et. al. show in this manuscript that mouse embryonic fibroblasts (MEFs) have basal levels of nuclear buds and micronuclei, which are indicators of genomic DNA damage. These basal levels of nuclear buds and micronuclei in MEFs increased after Etoposide treatment, which is known to induce DNA Double stranded Breaks (DSD). Interestingly, the nuclear buds and micronuclei co-localize with makers for nucleophagy (BECN1 and LC3) and acidic vesicles, suggesting that they are cleared by nucleophagy. The authors propose that basal levels of nucleophagy clear basal levels of genomic DNA damage that occurs as result from DNA-dependent biological processes in the cell nucleus, thereby contributing to nuclear stability of MEFs under physiological conditions. These basal levels of nucleophagy increase after the action of factors that induce DNA damage and nuclear stress. The concepts proposed by Muciño-Hernandez G et. al. are novel, since most of the current published data on nucleophagy related to DNA damage have been obtained under pathological conditions, e.g. implementing cancer cells.

      The authors use in their manuscript various molecular biology techniques to obtain data that support their claims, including Western Blot analysis of protein extracts from MEFs, immunostaining on MEFs and neutral comet assays, complemented with state of the art imaging techniques, such as confocal microscopy, immunoelectron microscopy and super resolution microscopy. The quality of the data is sound. The structure of the manuscript support the understanding of the reader. However, I would like to suggest several improvements that will help to increase the quality of the manuscript, in order that fits to the standards of articles recently published in journals affiliated to Review Commons, such as the Journal of Cell Biology, the EMBO Journal or eLife.

      1. Major comments

      2.1 The authors have to improve the description of the results. Especially the description of those Figure panels containing plots that were generated using data from several experiments has to be improved.

      One example is the description of the Figure 1D, which is in the lanes 137-151 of the current version of the manuscript. Whereas the authors describe in lanes 137-147 observations related to representative pictures of confocal microscopy after immunostaining presented in Figure 1D (left), the description of the quantification from 9 independent experiments presented in the plots in Figure 1D (right) comes relatively short in lanes 147-150 without mentioning any of the values implemented for creating the plots.

      "Interestingly, while the frequency of nuclear buds gradually increased after DNA damage and during DNA repair, the frequency of micronuclei also increased after DNA damage, but diminished upon DNA repair."

      The other plots presented in the different figure panels across the manuscript are described in a similar manner. I would like to suggest to the authors to improve their manuscript by including during the description of their results the values that were implemented for the degeneration of the plots presented in the manuscript. For example, in the specific case of Figure 1D above:

      "Interestingly, the percentage of MEFs with nuclear buds gradually increased from XY% ({plus minus} XY SD) in control non-treated (Ctrl) MEFs to XY% ({plus minus} XY SD; P=XY) after 2 h Etoposide-induced DSB in MEFs and XY% ({plus minus} XY SD; P=XY) after DNA repair take place in MEFSs 5 h upon stop of Etoposide treatment (Figure 1D, right). In contrast, the percentage of MEFs with micronuclei significantly increased from XY% ({plus minus} XY SD) in Ctrl MEFs to XY% ({plus minus} XY SD; P=XY) after 2 h Etoposide-induced DSB, whereas it was reduced to XY% ({plus minus} XY SD; P=XY) 5 h after stop of Etoposide treatment (Figure 1D, right)."

      Descriptions of the plots as mentioned above will make the text more intuitive for the reader, and they will make possible to read the Results Section without switching to the Figure Legends or the Material and Methods Section or to Supplementary Files. Even though the representative pictures from different microscopy techniques presented in the manuscript are of good quality and support the claims of the authors, it is important to mention that the quantifications presented in the plots demonstrate the statistical significance of these representative pictures. Thus, the authors should consistently include in the manuscript during the description of theirs results all the information (mean values, standard error of the means, P values, n values, etc.) that support their interpretation of the results and demonstrate the statistical significance of their claims.

      Thank you for your clear and valuable advice. We followed it and in the revised manuscript we included the data in the results section.

      2.2 Following a similar line of argumentation as in the previous point, the authors should provide as Supplementary Material an Excel file containing a statistical summary, including all statistical relevant information from each one of the plots presented in each Figure panel, such as n values, P values, Test implemented, values used for the plots, numbers of experiments, etc. The information could be organized in the Excel file in different data sheets according to the Figure panels, in order that the reader can easily navigate through the data. In the current version of the manuscript, one cannot find the values used for the generation of the plots presented in the manuscript in any of the submitted files.

      Thank your for this suggestion. We have included in Table S1 an Excel file with a data sheet for each Figure panel, containing all the data collected and the statistical analysis performed.

      Minor comments

      3.1 In general, prior studies were appropriately referenced. Only few references has to be added.

      Line 48: Add to the already included reference "Dobersch et al., 2021" also the reference Singh et al., 2015 PMID 26045162.

      Thank you, we added this reference.

      Line 53: Add the corresponding reference after the word "respectively".

      We added the corresponding reference.

      Line 82: Add the corresponding reference after the word "them".

      We added the corresponding reference.

      Line 125: Add the corresponding reference after the word "cells".

      We added the corresponding reference.

      Line 130: The expression "...by analyzing the recruitment of the phosphorylated histone γH2AX..." is the first time that the authors mention in the manuscript the DNA damage maker γH2AX. I suggest that is better introduced as " ... by analyzing the recruitment of the DNA damage marker γH2AX (histone variant H2A.X phosphorylated a serine 139, Rogakou EP, et al., 1998, PMID 9488723) to DSB sites."

      Thank you very much for your suggestion. In the revised manuscript we corrected the text as suggested.

      Line 199: Add the corresponding reference after the word "formation".

      We added the corresponding reference.

      Line 205: Add the corresponding reference after the word "cells".

      We added the corresponding reference.

      3.2 The use of the English language is appropriate throughout the manuscript. However, there are minor errors in the use of punctuation marks, in the use of prepositions and typos. I will list some of them below. However, I would like to recommend that manuscript is corrected by an English native speaker.

      Thank you for your careful review of our manuscript. We corrected all the errors listed. A college proficient in English has reviewed the revised manuscript.

      Line 41: "...and reproductive systems; genome instability also..." the semicolon can be replaced by a period.

      Line 43: "Since early in development DNA is under constant endogenous..." between "development" and "DNA" there should a comma.

      The sentence in lanes 53-55 has to be rephrased.

      Lines 62-63: the expression "...throughout life." should be substituted.

      Line 70: The abbreviation "rDNA" has to be explained the first time that is used.

      Lines 81-82: It has to be explained for the scientist that is not specialized in the field of nucleophagy, how the integrity of the genome is threatened by micronuclei and nuclei-derived material.

      √ Lines 106-110: The sentence is long. It would be easier to understand for the reader if this sentence is divided into two sentences.

      Lines 121-122: The subtitle should be rephrased.

      Lines 132-138: The sentence is long. It would be easier to understand for the reader if this sentence is divided into two sentences, e.g. with a period before the word "hence".

      Lines 143-144: "... in a subpopulation of healthy, untreated cells...". The interpretation of "healthy" might be subjective. I would like to suggest substituting in the complete manuscript the word "healthy" by "control".

      Line 163: The abbreviation for γH2AX was already introduced in line 130.

      Line 182: A comma after "cell lines" is missed.

      Line 183: delete "either". √ Lines 190-194: The sentence is long. It would be easier to understand for the reader if this sentence is divided into two sentences, e.g. with a period after the word "decreased" in line 191.

      Line 218: I assume that instead of "bus", it should be "buds".

      Line 220: I assume that instead of "iRNA", it should be "siRNA". In addition, it is the first time that the abbreviation is used. Thus, I suggest introducing it as "...was silenced by specific small interfering RNA (siRNA) previous to ..."

      Line 327: delete the word "chronic".

      Line 344: I assume that instead of "(figures 4C)", it should be "(Figure 4D)".

      3.3 The structure of the Figures is ok for the peer review process and it might be optimized during editing of the manuscript. Nevertheless, I would like to suggest to the authors to increase the lettering size throughout all the figures. It will make the figures more intuitive.

      Thank you for the suggestion. We increase the font size of the figures.

      Reviewer #2 (Significance (Required)):

      Significance

      The work presented by Muciño-Hernandez G et. al. will be clearly a significant contribution to the scientific community working on autophagy, DNA damage repair and cancer, among others. It will be of interest to a broad spectrum of scientists, as I will elaborate in the following lines. The authors propose that MEFs have basal levels of genomic DNA damage under physiological conditions, which are cleared by basal levels of nucleophagy. On one hand, these findings are in line with various publications demonstrating that DNA-dependent biological processes in the cell nucleus, such as transcription, replication, recombination, and repair, involve intermediates with DNA breaks that may compromise the integrity of DNA. Thus, there must be mechanisms that ensure the integrity of the genome during these processes under physiological conditions, one of them seems to be nucleophagy. This perspective might explain the fact that proteins and histone modifications that were initially characterized during DNA repair also play a role during transcription, recombination, and replication. For example, phosphorylated H2AX at S139 (γH2AX) is often used as a marker for DNA-DSB [PMID 9488723]. However, accumulating evidences suggest additional functions of this histone modification [PMIDs 19377486; 22628289; 23382544]. In addition, McManus et al. [PMID 16030261] analyzed the dynamics of γH2AX in normal growing mammalian cells and found γH2AX in all phases of cell cycle with a maximum during M phase, suggesting that γH2AX may contribute to the fidelity of the mitotic process, even in the absence of ectopic- induced DNA damage. Further, Singh et al [PMID 26045162] and Dobersch et al [PMID 33594057] report that γH2AX plays a role in transcriptional activation in response to TGFB-signaling. Moreover, classical DNA-repair complexes have been linked to DNA demethylation and transcriptional activation [PMIDs 17268471; 28512237; 25901318], and DNA-DSB is known to induce ectopic transcription that is essential for repair, supporting a tight mechanistic correlation between transcription, DNA damage, and repair [PMID 24207023]. Perhaps, the authors might consider introducing several of the aspects and the citations written above into the Discussion section of the revised version of their manuscript. On the other hand, most of the published data related to nucleophagy have been obtained from cancer cells. Muciño-Hernandez G et. al. obtained their data implementing MEFs to demonstrate that the proposed mechanisms take also place under non-pathological conditions, what is one of the novel aspects of the present work.

      I hope that my suggestions help the authors to improve their manuscript, thereby reaching the standards of manuscripts recently published in journals affiliated to Review Commons AND increasing the impact of their contribution to the scientific community.

      Thank you very much for your suggestions. They helped us to present now a much-improved manuscript. We hope the revised work is now suitable for publication in the Journal of Cell Science.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Muciño-Hernández and colleagues suggest that basal formation of nuclear buds and micronuclei increases in primary mouse embryonic fibroblasts following etoposide-induced double strand breaks (DSBs). The study combines the use of biochemical methodologies with confocal and super resolution microscopy in an effort to explore the contribution of nucleophagy to genome stability. The authors provide evidence that autophagy is induced upon etoposide treatment. They detected GFP-LC3 and BECN1 signals in nuclear buds and micronuclei even in untreated control and to a higher extent in etoposide-treated cells. Then, the authors examined whether nucleophagy is required for the removal of nuclear buds and micronuclei, by treating fibroblasts with control and Atg7 siRNA. The authors claim that the percentage of cells with micronuclei or nuclear buds decrease upon Atg7 knockdown, suggesting that components of the autophagy machinery induce the formation of these nuclear abnormalities. Moreover, Type II DNA Topoisomerases (TOP2A and TOP2B) and the ribosomal protein fibrillarin were detected in nuclear buds and micronuclei in fibroblasts treated or not with etoposide. Again in this case, GFP-LC3 was detected in fibrillarin-containing nuclear alterations. Based on these observations, the authors suggest that nucleophagy contributes to the elimination of chromosomal fragments or nucleolar bodies exiting the nucleus under DNA damage -inducing conditions. Specifically, they propose a key role for nucleophagy in maintaining genome stability by eliminating Type II DNA Topoisomerase cleavage complex (TOP2cc) and nucleolar components such as fibrillarin.

      While it seems that there is a relationship between nuclear-extruded TOP2 with endogenous BECN1 and GFP-LC3 suggesting autophagic engagement, inconsistencies of fluorescent images between different figures indicate possible technical problems/limitations (please see specific comments, below), compromising authors' claims. LC3 immunoblotting and GFP-LC3 localization results appear over-interpreted (comments below). Neither TOP2 nor Fibrillarin have been shown to be actual autophagic substrates. Also, the link between genomic stability, micronuclei formation and autophagy has been previously reported (Zhao et al., PMID: 33752561).

      An additional major concern is relates to nucleophagy being a selective type of autophagy. As such it requires efficient recognition and sequestration of the nuclear material destined to be degraded. Cargo specificity is mediated by receptor proteins, but no evidence for such receptors is provided in this study. Moreover, there is no real mechanistic insight on how nucleophagy mediates genome stability and how this can be interpreted in terms of cell survival under physiological and stress conditions. In other words, the biological significance of the findings presented has not been addressed.

      Specific comments are summarized below:

      The authors suggest that autophagy is induced after etoposide treatment and during the DNA repair process. However, the Western blot presented in Fig. 2A is not convincing and quantification does not support a significant autophagy induction in any of these cases. Autophagy appears to be induced 1h after etoposide removal, as evidenced LC3II/LC3 I increase (Fig. 2A and S2A). Nevertheless, all these changes should be more rigorously assessed.

      Thank you for the observation. We removed the analysis of LC3II/LC3I by Western blot in the revised manuscript because a basal and induced elimination of nuclear components by the autophagic machinery occurs only in a subset of cells. It needs to be analyzed cell by cell. Pooling together all the cells dilutes the observation. Nevertheless, the dynamic intracellular distribution of both LC3 and BECN1 indicate autophagy induction. Please notice in revised Figure 2A that LC3 surrounding vesicles increases after 2h of DNA damage and diminish when DNA is repaired. BECN1 in control MEFs is highly concentrated inside the nucleus, at the nucleolus as it co-localized with Fibrillarin (new Figure 4E), and after DNA damage it redistributes towards the cytoplasm. Finally after DNA repair, BECN1 appears highly concentrated at the nucleus again. A further evidence of a functional autophagic flux, is the observation of an increasing number of acidic vesicles stained with Lysotracker in response to DNA damage, which were reduced after DNA repair. Some of the micronuclei were also co-stained with Lysotracker, suggesting their lysosomal degradation.

      Line 190 and Fig. 2A: It is totally unclear whether "autophagy activation" takes place during the two waves described. There is no LC3B-I to LC3B-II conversion to initially suggest "autophagy activation". It rather suggests that autophagy is stalled. Fig. 2F shows that GFP-LC3 is strongly fluorescent into the lysotracker-stained lysosomes, further pointing to possible functional or technical problems.

      As pointed out by reviewer 1, the images presented in original Figure 2F were over-exposed. In the current version we replaced those images with new images of better quality. We also reorganized the presentation of the data, and in revised Figure 2A we present photos where more convincingly can be observed a co-localization of BECN1 with LC3, with o without Lysotracker signal in nuclear buds and micronuclei. We also performed immunolocalization of endogenous LC3 (new Figure 2D) to rule out a possible misinterpretation of GFP-LC3 aggregates. As explained before, we removed original Figure2A.

      Fig. 2B and Sup. Fig. 2B: BECN1 staining looks problematic. There is extreme BECN1 accumulation in the nucleus. Are those nuclear patterns of endogenous BECN1 and GFP-LC3 normal (see also minor comment 6 and 7)? Is there literature supporting such a distribution?

      Yes, it has been documented BECN1 localization in the nucleus during development and in response to DNA damage stimuli such as ionizing radiation, and with a function related to DNA repair alternative to autophagosome formation (Fei Xu, et al. 2017, Scientific Reports | 7:45385 | DOI: 10.1038/srep45385). In the current manuscript we also detected endogenous LC3, to avoid a possible artifact with GFP-LC3 expression. We observed endogenous LC3 also localized in the nucleus (new Figure 2D).

      It is hard to imagine how BECL1 is implicated in a (here hypothetical) nuclear lamina degradation event driven by LC3-lamin B1 direct interaction (Dou et al., 2015). BECL1 is an upstream to LC3 component and is a subunit of the PI3K complex catalyzing the local PI3P generation. The above should cause recruitment of the downstream autophagic machinery. Other subunits of the same complex or downstream effectors should be identified at the same spots to support authors' claims.

      Our proposal that BECN1 is contributing to nucleophagy is supported by its co-localization with LC3 and Lysotracker stained vesicles (new figure 2A), as well as with TOP2 (Figure 3A-C). We appreciate the interesting idea of the reviewer; we certainly did not analyze the presence of BECN1 interacting partners. We agree, further studies analyzing their localization could complement our current findings. Supporting our work, others have observed UVRAG in the nucleus, specifically in centromeric regions, and it also has a role in DNA repair through its interaction with DNA-PK (Dev Cell. 2012 May 15; 22(5): 1001–1016. doi: 10.1016/j.devcel.2011.12.027). Given the anti-tumorigenic role of several autophagic molecules, it is tempting to speculate that several of them could have triple roles in the nucleus: directly interacting with DNA repair machinery, eliminating unrepairable DNA damaged and preventing excessive protein accumulation in the nucleus. Further experiments are necessary to probe this hypothesis, but are beyond the scope of the present manuscript.

      U, 2h D and 5h R images of whole cells are necessary. The authors should also provide representative images of cells under different conditions i.e. control, etoposide-treatment and during DNA repair. Along similar lines, untreated control cells are not included in Fig. 2E and F. These images are needed for a better comparison between normal and DNA damage-inducing conditions.

      The reviewer is right. In the revised Figure 2 we included representative images of control cell, Etoposide-treatment and during DNA repair cells. Images of whole cells are now shown in supplementary Figure 2S.

      The authors state that autophagy is required for nuclear buds and micronuclei formation. However, the data shown in Fig. 2G and H are hardly convincing given that the statistical difference between cells treated with control and Atg7 siRNA is not strong (for example, *p˂0.5, 5h after etoposide removal). To provide further support to this notion, they should use cells from autophagy defective mutants and examine the appearance of nuclear abnormalities across different conditions compared to control cells.

      We agree with the reviewer and followed his/her suggestion. We established collaboration with Dr. Sandra Cabrera, who kindly shared with us Atg4b-/- mice from which we isolated MEFs to compare side by side with WT MEFs the appearance of nuclear abnormalities. We confirmed a statistical significant reduction in the formation of nuclear buds in both conditions: silencing the expression of Atg7 by siRNA and in Atg4b-/- MEFs, suggesting that the autophagic machinery contributes to buds formation (new Figure 2F-G). Interestingly, we observed a different result analyzing micronuclei. While we found no statistical significant difference in the percentage of cells with micronuclei silencing the expression of Atg7 by siRNA, we found a statistical significant increment of cells with micronuclei in Atg4b-/- MEFs (new Figure 2F-G). This apparently discrepant result suggests that nuclear buds and micronuclei have a different mechanistic origin. A difference in the biogenesis of buds and micronuclei has been previously suggested studying cells cultured under strong stress conditions that induce DNA amplification, as well as in cells under folic acid deficiency. While interstitial DNA without telomere was more prevalent in buds than in micronuclei, telomeric DNA was more frequently observed in micronuclei (Fennech et al. 2011, Mutagenesis 26:125-132).

      Lines 223-228: The role of autophagic machinery in the formation of nuclear buds is not supported and furthermore hard to conceptualize. How the components of autophagy are implicated during the nuclear buds and micronuclei formation? Colocalization of autophagic proteins might mean that autophagy is engaged at some point after or during the above formation. The causal, mechanistic and temporal aspects of the above budding and nucleophagic events need experimental support and/or more accurate interpretation.

      We agree with the reviewer, and now we expressed our interpretation with more caution. The role of autophagic machinery in the formation of nuclear buds is supported by the following findings: a) the localization of LC3 and BECN1 in nuclear buds; b) the inhibition of Atg7 expression by specific siRNAs reduced the number of cells with buds and c) Atg4b-/- MEFs had reduced number of cells with buds (new Figure 2G). How the components of autophagic machinery are implicated in nuclear buds formation is an interesting question and deserves further investigation, beyond the scope of the present manuscript.

      The authors claim that nucleophagy eliminates topoisomerase cleavage complex because TOP2A and TOP2B appear to more extensively co-localize with GFP-LC3 and BECN1 after etoposide-induced DSBs. However, the quantification presented in Fig. 3D-F to support this statement does not, in general, show a statistically significant difference in fibroblasts across different conditions (normal, etoposide treatment, etoposide removal).

      Autophagic elimination of TOP2 protein is supported by the following findings: 1) both BECN1 and LC3 were detected in micronuclei in acidic vesicles (labeled with Lysotracker), which is indicative of the autolysosomal nature of the cytoplasmic compartment containing TOP2 (Figure 2A); 2) TOP2B was found by electron microscopy in some cells exiting the nucleus surrounded by LC3 (Figure 3G); 3) TOP2B accumulated in cells lacking ATG4, as expected if it is degraded by autophagy (Figure 3H).

      Why would BECLIN colocalise with TOP2B in Figure 3g, given that beclin is involved in the initiation process?

      We think that BECN1 is involved in additional functions to the initiation process of bud formation. For example, it has been shown by others that BECN interacts with TOP2 (Dev Cell. 2012 May 15; 22(5): 1001–1016. doi: 10.1016/j.devcel.2011.12.027). It could be working as an autophagic receptor targeting TOP2cc to buds and micronuclei. We are aware that further studies are necessary to test this hypothesis, but they are beyond the scope of this manuscript.

      Fig. 4A and B: There is no enrichment of GFP-LC3 in "the nuclear alterations containing Fibrillarin" as stated in lines 341-343 comparing to the rest of the cellular GFP fluorescence.

      It is true that there is not a local enrichment of GFP-LC3 as those normally reported as LC3 puncta in response to autophagy induction by starvation, for example. Nevertheless we are confident of the specificity of the observation, as not every nuclear alteration was found having GFP-LC3. We detected GFP-LC3 in 72% (mean ± 3.61 SD) of the nuclear alterations containing Fibrillarin in untreated cells, in 65.7% (mean ± 1.97 SD) of cells with 2h of DNA damage and in 90.33% (mean ±6.36 SD) after 5 h of DNA repair (in 5 independent experiments).

      Moreover, there is no statistical significance in Fig. 4C and D measurements limiting the safety of authors' conclusions in lines 341-346.

      We agree with reviewer´s observation. We repeated these experiments two more times and did not find a statistical significant difference in the percentage of cells with nuclear lesions containing Fibrillarin and GFP-LC3 after DNA damage nor after DNA repair. These results suggest that nucleolar DDR is a particular response, independent of DDR elsewhere in the genome, as has been suggested (reviewed in Nucleic Acids Research, 2020, Vol. 48, No. 17 9449–9461; doi: 10.1093/nar/gkaa713). An alternative is that the release of nucleolar components is not enhanced by Etoposide at the dose and time used in this work.

      Lines 368-370: As discussed by the authors and reported in previous publication (Xu et al., 2017), "BECN1 interacts directly with TOP2B, which leads to the activation of DNA repair proteins, and the formation of NR and DNA-PK repair complexes", independent of its role in autophagy. Currently, there are no rigorous findings supporting the contribution of BECN1 (as a functional constituent of the core autophagic machinery) to nuclear damaged material extrusion (lines 382-384).

      We agree with the reviewer in that we did not perform an assay to demonstrate that BECN1 is contributing to TOP2 nuclear extrusion as a functional constituent of the core autophagic machinery. Nevertheless, the following data support the proposal of an autophagic elimination of TOP2cc: 1) TOP2B was detected in micronuclei containing BECN1 (Figure 3B); 2) BECN1 was found in micronuclei containing LC3 and in an acidic vesicle (labeled with Lysotracker), indicative of the autolysosomal nature of the compartment (Figure 2A); 3) TOP2 was found in some cells exiting the nucleus surrounded by LC3 (Figure 3G); d) TOP2 accumulated in cells lacking ATG4, suggesting its autophagic degradation (Figure 3H).

      Lines 435-441 and Fig. 5: The current findings do not support the proposed model. It is hard to support and conceptualize the statement "proteasome and nucelophagy function in a dynamic way inside the nucleus".

      The reviewer is right. We made a mistake integrating an interpretation within the summary of the actual findings of this work. We correct the text in the current version.

      In Fig. 5, LC3 appears to decorate inner nuclear membrane and probably to interact with some of the other proteins depicted, which is misleading.

      We agree with the reviewer. We removed the scheme in the current manuscript.

      Beclin-1 appears to interact with Fibrillarin (Nucleolus).

      This is correct. We observed by immunofluorescence a co-localization of BECN1 with Fibrillarin (new Figure E), and demonstrated by co-immunoprecipitation that they are constituents of a complex (new Figure F).

      Most of the differences in Sup. Fig. 3 lack statistical significance compromising the authors' claims.

      We agree with the reviewer. To perform a separated statistical analysis of the percentage of cells with nuclear buds or micrnonuclei did not provide further information. We eliminated this analysis in the current version.

      Many conclusions are drawn by colocalisation-immunofluorescence analysis. Co-immunoprecipitation experiments should also be performed to show that TOP2B and fibrillarin interact with LC3/autophagic machinery.

      Thank you for your suggestion. We performed immunoprecipitation analysis and confirmed an interaction of Fibrillarin with BECN1, this result is now presented in Figure 4F. We found no co-immunoprecipitation of LC3 with either Fibrillarin or TOP2A, nor of TOP2B with BECN1.

      Additionally, colocalisation analysis should be performed using tools such as Pearson's correlation and is an initial indication of nucleophagy. In the case of fibrillarin, immunofluorescence images do not indicate colocalisation, they need to be repeated.

      The transport of Fibrillarin out of the nucleus by micronuclei formation and its autophagic degradation implies that both proteins are contained in the same vesicular compartment, it does not necessarily requires a direct interaction of Fibrillarin with LC3. Therefore, a co-localization detected by Pearson´s analysis is not a necessary confirmation of the nucelophagic degradation of Fibrillarin. Actually, Fibrillarin does not seem to interact with LC3, since we could not detect both proteins by co-immunoprecipitation. Nevertheless, we observed a nucleolar localization of BECN1 overlapping with Fibrillarin (new Figure 4E), and we confirm by co-immunoprecipitation the presence of both BECN1 and Fibrillarin in a complex (new Figure 4F). Following reviewer´s advice, we repeated two more times the analysis of Fibrillarin immunolocalization. We corroborated its localization in micronuclei and nuclear buds in 5.86% (mean ± 5.03 SD) of untreated cells, indicating a basal level of nucleolar material exclusion from the nucleus. Interestingly, the percentage of cells with Fibrillarin in nuclear alterations did not increased with statistical significance with Etoposide treatment. At 2 h of DNA damage we observed only a slight increase to 6.8% (mean ± 4.03 SD) of cells having nuclear buds and micronuclei with Fibrillarin, while the number of cells with nuclear lesions increased to 30.6% (mean ± 4.2 SD). Similarly, the proportion of cells having Fibrillarin in nuclear lesions after 5 h of DNA repair increased only to 7.66 % (mean ±6.08 SD), while the total number of cells having nuclear buds and micronuclei increased to 38.42% (mean ± 9.3SD). These results suggest that nucleolar components are constantly sent out of the nucleus as a homeostatic process, and not significantly in response to Etoposide-induced DSB.

      Measurement of LC3/fibrillarin positive puncta should be performed, under basal conditions, genotoxic, and nucleolar stress under control and Atg7 knockdown conditions.

      Since we observed no statistical significant change in the number of micronuclei with Fibrillarin under Etoposide-induced DSB nor DNA repair, we did not perform the suggested experiment.

      Moreover, if nuclear proteins described are substrates of autophagy, then their levels would decrease upon autophagic induction i.e. starvation or in this case DNA damage and nucleolar stress. Thus, western blot analysis of relative protein levels can be performed.

      Thank you for the suggestion. Since only 5% of the cells have micronuclei with Fibrillarin, and this proportion did not increased significantly in response to DNA damage, it is unlikely to detect a difference in the amount of Fibrillarin in response to autophagy manipulation performing a population analysis (as it is in a Western blot). Nevertheless, we compared Fibrillarin abundance by Western blot in WT MEFs vs. Atg4-/- MEFs untreated (U), treated for 2 h with Etoposide (D) and after 5 h of DNA repair (5) shown in the top panel of the follow figure. As expected, we found no statistical significant difference determined by 2way-ANOVA followed by Sidak´s multiple comparisons test (n=3). Ajusted P values are shown for each comparison (left graph).

      On the other hand, since the percentage of cells with TOP2B in micronuclei and nuclear buds increased in response to DNA damage and during DNA repair, it was possible to detect a statistical significant accumulation of TOP2B in cells lacking ATG4 after 5h of DNA repair (bottom panel and right graph in the figure above). This observation is now included in new Figure 3H. Supporting our finding, TOP2A is reduced in cancerous cells grown under glucose deprivation (Alchanati, I., et al. 2009. PLoS One. 4:e8104).

      Endogenous LC3 nuclear buds should also be detected to verify nucleophagy as GFP-LC3 has been shown to aggregate, causing artifacts under certain conditions.

      We agree with the reviewer. We detected endogenous LC3 by immunofluorescence. This result is now included in Figure 2D.

      Minor comments

      In the Discussion section, the paragraph focused on the role of the ubiquitin-proteasome system is not substantiated by the data presented in the manuscript. Along similar lines, formation of aggresomes following etoposide treatment and their subsequent removal has not been monitored.

      We apologized for the confusion, we corrected the text to now clearly distinguish which are our findings and which are published data that we just attempt to relate.

      Western blots of better quality should be provided with assigned markers of protein size.

      The Western blots shown have markers of protein size.

      There are several language errors in the text that need to be corrected. Several sentences are too long and confusing or must be re-phrased. For example, see the lines: 123-125, 209-210,212, 218,221-222.

      We apologize for our language errors. We corrected all errors indicated and asked colleges proficient in English to review our text.

      Fig. 1B. Place "μm" into parenthesis.

      Sup. Fig. 1B: Replace "gH2AX" with "γH2AX".

      Fig. 1D: Separate DAPI and γH2AX channel images would be informative.

      We now show also separated channels.

      Fig. 2E: Enlarged separate DAPI, GFP-LC3 and lamin A/C channel images would be informative.

      We now show also separated channels.

      Line 218: Replace "bus" with "buds".

      Fig. 2B, 2E, 2F, 3A and probably Sup. Fig. 2B represent MEFs treated for 2h with etoposide. The pattern of GFP-LC3 in 2B looks extensively nuclear and almost absent from cytoplasm.

      We confirmed our finding detecting endogenous LC3.

      In addition, Fig. 2B and 3B represent MEFs treated for 2h with Etoposide. The pattern of endogenous BECN1 in Fig. 2B looks extensively nuclear and almost absent from cytoplasm. In Fig. 3B the pattern is notably different.

      BECN1 pattern of distribution is rather similar, predominantly in the nucleolus. We demonstrate it further by detecting BECN1 overlapping localization with Fibrillarin (new Figure 4E) and co-immunoprecipitation (new Figure 4F).

      Sup. Fig. 2C: Index box is not properly aligned.

      Thank you. We reviewed the alignment of each index box and reorganized the figure in the revised manuscript to add the whole blots of the new experiments we performed to analyze MEFs Atg4-/-.

      Lines 154, 343 and 837: Replace "DBS" with "DSB".

      Thank you, we corrected these typos.

      Fig. 4 panels are not clearly cited at the text.

      We apologize, we reviewed that they are clearly cited now.

      Line 220: siRNA

      Thank you, we corrected the text.

      Lines 373-374: References "Lenain et al., 2015" and "Li et al., 2019" are missing.

      Thank you for noticing it, we added the missing references. We use EndNote X9, we did not expect it to fail.

      Lines 400-401 and 407: Probably the second "Latonen, 2011" reference needs "et al".

      It is correct. We now cite this paper properly.

      Line 427: Do authors refer to Fig. 1E rather than Fig. 2B?

      Yes, we are sorry for this mistake. Thank you for pointing it out.

      Line 434: Correct "clearance" spelling.

      Thank you, we corrected it.

      Reviewer #3 (Significance (Required)):

      The authors suggest that nucleophagy contributes to the elimination of chromosomal fragments or nucleolar bodies exiting the nucleus under DNA damage -inducing conditions. Specifically, they propose a key role for nucleophagy in maintaining genome stability by eliminating Type II DNA Topoisomerase cleavage complex (TOP2cc) and nucleolar components such as fibrillarin.

      However, neither TOP2 nor Fibrillarin have been shown to be actual autophagic substrates. Also, the link between genomic stability, micronuclei formation and autophagy has been previously reported (Zhao et al., PMID: 33752561).

      We found nuclear buds and micronuclei with markers of different stages of the autophagic pathway, suggesting an active role of autophagy proteins in buds formation, and micronuclei removal. We detected TOP2 and Fibrillarin in micronuclei and propose their elimination by nucleophagy by the following findings: 1) both BECN1 and LC3 were detected in micronuclei in acidic vesicles (labeled with Lysotracker), which is indicative of autolysosomes (Figure 2A); 2) TOP2B was found by electron microscopy in some cells exiting the nucleus surrounded by LC3 (Figure 3G); 3) TOP2B accumulated in cells lacking ATG4, as expected if it is degraded by autophagy (Figure 3H); 4) BECN1 has a dynamic cytoplasmic-nucelar traffic in response to DNA damage; 5) BECN1co-localized with Fibrillaron in nucleolus and both proteins were co-immunoprecupitated.

      The link between genomic stability, micronuclei formation and autophagy has been previously reported only in cancerous cells. Considering that physiological DNA damage occurs constantly in the cell, basal nucleophagy is potentially fundamental to maintain cells healthy.

    1. Author Response

      Reviewer #1 (Public Review):

      This paper addresses an important question: whether the conduction velocity in white matter tracts is related to individual differences in memory performance. The authors use novel MRI techniques to estimate the "g-ratio" in vivo in humans - the ratio of the inner axon relative to the inner axon plus its outer myelin sheath. They find that autobiographical recall is positively related to the g-ratio in a specific white matter tract (the parahippocampal cingulum bundle) in a population of 217 healthy adults. This main finding is extended by showing that better memory is associated with larger inner axon diameters and lower neurite dispersion, which suggests more coherently organised neurites. The authors also argue that their results show that the magnetic resonance (MR) g-ratio can reveal novel insights into individual differences in cognition and how the human brain processes information.

      The study is exploratory in nature and the analyses were not pre-registered. The technique has not been used before to associate cognitive performance with MR estimates of conduction velocity in candidate white matter tracts. It is therefore unknown how strong any associations are likely to be and what sort of sample size might be needed to observe them. Nevertheless, if the technique proves to be reliable, then it certainly offers a valuable new tool to understand individual differences in cognitive abilities. However, brain structure to behavior associations are notoriously variable across studies and have been argued to require very large sample sizes to obtain reproducible results.

      We respectfully disagree that the study was exploratory. We had distinct aims and hypotheses from the outset. Our prime interest is in autobiographical memory, the hippocampus and its connectivity. This motivated our focus on three specific white matter tracts. We also planned from the time of study design to examine the MR g-ratio, and even contributed to refining the pre-processing pipeline for this approach, as reported in a previous paper (Clark et al., 2021, Frontiers in Neuroscience). Moreover, in the current manuscript we outlined well thought through possible outcomes and declared specific predictions.

      Regarding pre-registration, due to the scope of this work, the experiment was planned eight years ago, and data collection commenced seven years ago. At that time, formal pre-registration was not common practice. However, it has been a long-standing feature of our Centre that proposed studies and their analysis plans undergo rigorous internal peer review, including presentation to the whole Centre, before data acquisition can commence. The proposal for the research under consideration here was presented on 26th September, 2014.

      As noted in our response to the Editors’ Public Evaluation Summary above, someone has to be the first to report a novel result, and we believe that the depth and transparency of our approach permits confidence in the findings. Not least, and to reprise, because we employed the most widely-used and best-validated method of testing autobiographical memory recall that is currently available – Levine’s Autobiographical Interview. Our primary analyses were performed using the behavioural outcome measure from this test, the results of which were directly compared to those from a closely-matched control measure to test whether significantly larger effects were observed for our variable of interest. The potential for false positives was further reduced by extracting microstructure data from hypothesised tracts of interest (instead of performing whole brain voxel-wise analyses), with statistical correction performed on all structure-behaviour analyses. Moreover, we performed partial correlations with age, gender, scanner and number of voxels in a region of interest (ROI) as covariates. Complementary investigations were also conducted using other commonly-reported measures, providing supporting evidence. We report all analyses (and provide all the source data), including those finding no relationships. The consistent results throughout were associations between autobiographical memory recall ability and the microstructure of the parahippocampal cingulum bundle only. Moreover, thanks to the excellent suggestions of the Reviewers, the revised version reports additional analyses that allow us to further corroborate and interpret our findings.

      Our sample of 217 participants allowed for sufficient power to identify medium effect sizes when conducting correlation analyses at alpha levels of 0.01 and when comparing correlations at alpha levels of 0.05 (Cohen, 1992, Psychological Bulletin). While it has recently been suggested that thousands of participants are required in order to investigate brain structure-behaviour associations (Marek et al., 2022, Nature), other, more sophisticated, analyses suggest that samples of ~200 participants can be sufficient, in line with our estimates (Cecchetti and Handjaras, https://psyarxiv.com/c8xwe; DeYoung et al., https://psyarxiv.com/sfnmk). Given that our study was principled, well-controlled, analysed appropriately and produced very specific and consistent findings, we are confident that the findings are robust.

      The authors decided to analyse performance on a single task - the Autobiographical Memory Interview - and identified three candidate white matter tracts that connect the hippocampal region with other brain regions. While it is clear why these three tracts were chosen, it is less obvious why the authors chose to investigate associations with the Autobiographical Memory Interview and not other memory tests that were part of the battery of tests administered to the participants. It is reasonable to assume that something as general as the conduction velocity of a white matter tract would have an effect on memory ability across a range of tasks, so to single out one seems an unnecessarily narrow focus.

      Our main interest over many years, and hence the focus of this study, is autobiographical memory recall because it directly relates to how people function in real life. As noted above, autobiographical experiences occur in dynamic, multisensory, multidimensional, non-linear, ever-changing contexts; they involve actively engaging with the environment and other people; they are embodied; they span milliseconds to decades. Many of these features cannot be captured by laboratory-based episodic memory tests. This issue is increasingly being discussed (for example, see recent reviews by Nastase et al., 2020, NeuroImage; Mobbs et al., 2021, Neuron; Miller et al., 2022, Current Biology). It is further laid bare in McDermott et al.’s (2009, Neuropsychologia) meta-analysis of functional MRI studies which showed that laboratory-based and autobiographical memory retrieval tasks differ substantially in terms of their neural substrates. Consequently, we were not surprised to find that when we analysed laboratory-based memory test performance, there were no correlations with the MR g-ratio. Recall of vivid, detailed, multimodal, autobiographical memories may rely on inter-regional connectivity to a greater degree than simpler, more constrained laboratory-based memory tests. Therefore, as well as speaking to conduction velocity, these findings also contribute to wider discussions about real-world compared to laboratory-based memory tests. We thank the Reviewer for making the excellent suggestion to include these additional data, analyses and discussion points.

      The results of the study are interesting and highlight a key role of the parahippocampal cingulum bundle in autobiographical memory recall. The results are corrected for multiple comparisons across the three fiber tracts of interest and the recall of "external details" provides a nice control compared to the "internal details" which are the measure of interest. The main findings are extended to show that it is likely to be an increase in axon diameter and an increase in neurite coherency that characterize those individuals with better autobiographical recall. Despite these positives, it remains unclear whether memory recall, in general, is better in people with higher g-ratios in this tract (as implied in the Abstract), or if this effect is specific to scores on the Autobiographical Memory Interview.

      Our interest is in autobiographical memory, and so we employed the most widely-used and best-validated method of testing autobiographical memory recall that is currently available – Levine’s Autobiographical Interview. Not only does this test include a control measure, external details (as noted by the Reviewer), but we had independent raters score the autobiographical memory descriptions, and found that the inter-class correlation coefficients were very high (see Materials and Methods). Despite using this current, gold standard approach, at the request of the Reviewer we have now analysed data from eight additional laboratory-based memory tests. These are standard memory tests that are often used in neuropsychological studies: testing recall - the immediate and delayed recall of the Logical Memory subtest of the Wechsler Memory Scale IV, the immediate and delayed recall of the Rey Auditory Verbal Learning Test, the delayed recall of the Rey–Osterrieth Complex Figure; testing recognition memory - the Warrington Recognition Memory Tests for Words and Faces; testing semantic memory - the “Dead or Alive Test”. While these tests can assess some aspects of memory recall, they cannot be regarded simply as proxies for autobiographical memory recall, for the reasons we outlined in our response to the previous point. They do not capture key aspects of autobiographical memories. It is therefore all the more interesting that we found no associations between these laboratory-based memory tasks and the MR g-ratio of the parahippocampal cingulum bundle, in contrast to the relationship identified with autobiographical memory recall ability. Recall of vivid, detailed, multimodal, autobiographical memories may rely on inter-regional connectivity to a greater degree than simpler, more constrained laboratory-based memory tests. Therefore, as well as speaking to conduction velocity, these findings also contribute to wider discussions about real-world compared to laboratory-based memory tests. We thank the Reviewer once again for making the excellent suggestion to include these additional data, analyses and discussion points.

      Reviewer #2 (Public Review):

      In this study, Clark and colleagues tackle a very intriguing question: how differences in autobiographical recall abilities reflect in the human brain structure and function? To answer this question, they interviewed a large cohort of subjects and proceeded to acquire MRI data, specifically diffusion-weighted imaging and magnetization transfer data, to estimate the g-ratio, a measure of myelination deeply linked to conduction velocity. Looking at three specific white matter pathways of interest, all interconnecting the hippocampus with other brain structures, they studied the relationship between the g-ratio and the autobiographical recall abilities, together with many more measures from MRI. They found a significant positive association between the g-ratio of the parahippocampal cingulum bundle and the number of inner details from the interviews. These results can provide new potential directions to further study the underlying neural features beyond memory.

      I think that this is a very interesting article, it is well written, the methods are extensively explained, and the appendix provides further details for more expert readers. The authors put an effort into providing a comprehensive context in the introduction and in the discussion, and as a result, the paper seems overall quite suitable for both general and specialistic readerships.

      Thank you.

      The main issue I can currently see in the paper is that the mentioned relationship between g-ratio and recall abilities is then used to infer that better recall abilities are associated with higher conduction velocity and larger axons. The authors' line of reasoning is that given the hypothesized association, the increase in the g-ratio implies increases in myelin and axonal diameter. Despite this scenario being indeed possible given the current result, an increased g-ratio may also not indicate higher conduction velocity. In fact, the first potential inference would be that, without having any information on the axon size, the quantify of myelin can indeed be lower and as result, the conduction velocity would decrease. I understand that the authors expected higher conduction velocity associated with better autobiographical memory recall, but it is hard to see any experimental outcome that could have disproved this hypothesis: from the possible scenarios depicted in the introduction, any change in the g-ratio (and even not any change at all) could indicate higher conduction velocity. What would be then needed to corroborate one of these scenarios is some independent or complementary measure, which unfortunately is missing.

      The mentioned issue does not mean that the paper loses relevance - I think that it should focus on the very practical result, a change in myelination and microstructure, and discuss what are the potential implications, including the one that currently dominates the discussion section.

      Thank you for these comments and the opportunity to provide further clarification.

      First, we have now provided additional background information regarding the relationship between the MR g-ratio and conduction velocity. We explicitly note that while finding a significant relationship between the MR g-ratio and autobiographical memory recall suggests the existence of an association between autobiographical memory recall and parahippocampal cingulum bundle conduction velocity, it cannot speak to the direction of this association.

      Second, we have further noted that interpretation of the parahippocampal cingulum bundle MR g-ratio in relation to the underlying microstructure requires knowledge, or an assumption, about whether the associated change in conduction velocity is faster or slower. Given that faster conduction velocity is thought to promote better cognition (e.g. Brancucci, 2012; Dicke and Roth, 2016; Miller, 1994; Reed and Jensen, 1992), we interpreted our MR g-ratio findings under the assumption of faster conduction velocity, and now explicitly note in several places in the revised manuscript that this is an assumption.

      Third, we thank the Reviewer for the excellent suggestion that a complementary measure could help to further inform the findings. Consequently, we now also include additional analyses examining the relationship between the extent of myelination and autobiographical memory recall ability. This is possible using the magnetisation transfer saturation maps, which are optimised to assess myelination. Given our assumption of faster conduction velocity when interpreting our positive MR g-ratio correlations, then no relationship between parahippocampal cingulum bundle magnetisation transfer saturation and autobiographical memory recall would be expected. On the other hand, if conduction velocity is actually decreasing, then a negative correlation between magnetisation transfer saturation values and autobiographical memory recall ability would be observed. In fact, we found no relationship between parahippocampal cingulum bundle magnetisation transfer saturation and autobiographical memory recall. This suggests that myelin was not associated with autobiographical memory recall ability, supporting our assumption that relationships with the MR g-ratio were indicative of faster rather than slower, conduction velocity.

      We have now added these new data, analyses and discussion points to the revised manuscript.

      It would also be helpful to include some paragraphs on both interpretation and methodological issues when it comes to MRI-based microstructural imaging, which at the moment is lacking. This would provide a better picture of the results for a more general readership.

      We agree, and additional consideration of interpretational and methodological limitations have now been included in the manuscript.

      As one of the first works using an MRI-based microstructural measure of myelin, the g-ratio, to study cognition in a large cohort of subjects, I think this work will be a needed and significant step towards merging the neuroscience and MRI physics community - the methodology presented here is robust and could be used in many other applications.

      Thank you.

      Reviewer #3 (Public Review):

      The manuscript adds useful information about how structural properties of the brain are related to individual differences in autobiographical memory. A novel metric is used to assess features of white matter in tracts that are important for information exchange between the hippocampus and other brain regions. In one of these, the parahippocampal bundle, a relationship between the MR g-ratio and autobiographical memory recall is identified. This represents new and interesting information. The authors interpret the results in line with the theory that speed of signal transmission is important for cognitive function.

      Thank you for this positive summary.

    1. Author Response

      Reviewer #1 (Public Review):

      Rasicci et al. have developed a FRET biosensor that is designed to light up when cardiac myosin folds. This structure is extremely important to understand, and its link to the super-relaxed (SRX) state has not been fully shown. Their study provides a comprehensive review of the literature and provides compelling data that the 15 heptad+leucine zipper+GFP construct does function well and that the DCM mutant E525K has a similar IVM velocity despite a reduced ATPase compared with HMM. They rely on the ionic strength-dependent changes in the rate of MantATP release to argue that the E525K mutation stabilizes the 'interacting heads motif' (IHM) state, which makes logical sense.

      Strengths:

      Well written and comprehensive.

      Utilizes the appropriate fluorescence-based sensor for measuring the folding of the myosin structure. Provides a detailed range of techniques to support the premise of the study

      Weaknesses:

      Over-interpretation of the outcomes from this study means that the IHM and SRX are the same. Similar studies, e.g. Anderson 2018 and Chu 2021 support the opposite view that IHM and SRX are not necessarily the same, Anderson (and Rohde 2018) point out that S1 has some element of a reduced ATPase, this clearly cannot be due to folding of the molecule. Also, mavacamten was used in these studies to show that even S1 is inhibited suggesting that SRX and IHM are not connected. This is not to say that with enough supporting evidence that these observations cannot be over-ridden, it is just not clear that there is enough in this study to support this conclusion.

      We have revised our discussion to emphasize that our results support a model in which the SRX state is enhanced by formation of the IHM, but given the S1 and 2HP data the IHM may not be required for populating the SRX biochemical state (see page 8).

      I felt that the authors passed over the recent Chu 2021 paper too quickly, the Thomas group used a FRET sensor as well and provides a direct comparison as a technique, but with opposite conclusions. They also have supporting data in Rohde 2018 that their constructs were less ionic strength sensitive. It would be useful to understand what the authors think about this.

      We have discussed the Rohde and Chu papers in more detail in the discussion (see page 8). In the Rhode paper they used proteolytically prepared HMM and S1. Rohde found 20% SRX at all KCl concentrations in S1, while HMM shifted from 50% to 20% SRX in low and high salt conditions, respectively. Our results are different in terms of the absolute fraction of the SRX state but the trend is similar in terms of S1 being salt-insensitive and HMM being salt-sensitive. The difference could be proteolytic HMM, which is a longer construct, and proteolytic S1, which is prone to internal cleavage that can impact ATPase activity. Another difference could be the mixed isoform of mantATP used in previous studies and the single isoform of mantATP used on our study (see page 5)

      Reviewer #2 (Public Review):

      The paper by Rasicci et al. examines the impact of the DCM mutation E525K in beta-cardiac myosin on its function and regulation by autoinhibition. The role of the auto-inhibited state of beta-cardiac myosin in fine-tuning cardiac contractility is an active and exciting area of current research related to muscle biology and cardiomyopathies. Several studies in the past have linked the destabilization of the autoinhibited, super-relaxed (SRX) state of myosin to the pathogenesis of hypertrophic cardiomyopathy. This timely study provides one of the first few examples where the hypocontractile phenotype of a DCM mutation has been linked to the stabilization of the SRX state.

      One of the strengths here is the utilization of a wide variety of both pre-existing and novel biochemical and biophysical assays for the study. The authors have characterized a new two-headed long-tailed myosin construct containing 15-heptad repeats of the proximal S2 (15HPZ), which they show allows myosin to form the SRX state in vitro using single ATP turnover assays. The authors go on to compare the E525K and WT proteins using the 15HPZ myosin construct in terms of their steady-state actin-activated ATPase activity, in-vitro actin-sliding velocity and single ATP turnover measurements. These assays reveal that the predominant effect of this mutation is the stabilization of the SRX state which is maintained even at 150 mM salt concentration where the WT SRX is largely disrupted. This is an important observation because DCM mutations so far have been believed to only affect the force-generating capacity of myosin.

      One of the biggest strengths of this study is the attempt to develop a FRET-based approach to directly ask if the biochemical SRX state here correlates well with the structural IHM state, which is an important unresolved question in the field. The authors have designed a FRET pair (C-terminal GFP and Cy3ATP bound to the active site) that is sensitive to the relative position of the heads and the tail, allowing them to distinguish between the low-FRET closed IHM conformation and the no-FRET open conformation. Remarkably, the authors show that the salt dependence of the FRET efficiency values closely follows their results from the salt dependence of the percent SRX for both WT and E525K proteins. The authors then attempt to substantiate their FRET results by a direct visual analysis of the conformational states populated by both WT and E525K proteins at low salt using negative staining EM analysis. The authors have optimized conditions to allow the deposition of the IHM state on grids without adding the small molecule mavacamten, which was found to be necessary in an earlier study to visualize the closed state using EM. The authors conclude that the SRX state correlates well with the IHM state and that the E525K mutation indeed stabilizes the folded-back conformation of myosin.

      This study significantly strengthens the previously illustrated correlation between the SRX and IHM states and provides methodological advances (especially visualization of the IHM state by negative EM in the absence of cross-linking agents) that will be very useful to the field going forward. The observation that a DCM mutation can lead to stabilization of the folded back state is a novel insight that should spark interest in the field to test how broadly this applies to other DCM mutations. The conclusions of the paper are mostly supported by the data; however, some clarifications and qualifications are needed.

      Weaknesses:

      The extremely low enzymatic activity of the M2β 15HPZ myosins as compared to the WT S1 control (which is a historical control not assayed in parallel with the 15HPZ proteins), is concerning for the low protein quality of the 15HPZ myosins. The authors attribute the low kcat to the high proportion of SRX population in their ensembles. However, the DRX rates reported for the WT and E525K 15HPZ proteins in the single ATP turnover assay are ~3-4 fold lower than those of their S1 counterparts. These rates reflect basal turnover of ATP in the open state and thus should not be affected by the presence of the S2 tail, which leads to concerns about the 15HPZ protein activity. In addition, the very high percentage of stuck filaments in the in vitro motility assay for the 15HPZ constructs (despite the use of dark actin) is also concerning for significant amounts of enzymatically inactive protein.

      We thank the reviewer for pointing out the differences in the S1 and HMM DRX rates. We performed additional single turnover measurements with S1, adding two sets of measurements from one additional preparation (N=3), and we demonstrate that there is a significant increase in the DRX rates of WT S1 compared to WT HMM (see pages 4-5, Table 3, Figure 3- figure supplement 3). A faster rate in S1 was also reported in Rohde et al. 2018. Indeed, the DRX rates of E525K S1 are significantly higher than WT in S1, which we also now report in the results (see page 5, Figure 3 – figure supplement 3). We addressed the concerns about 15HPZ activity by performing NH4+ ATPase assays to demonstrate that the number of active heads was similar in S1 and 15HPZ HMM (see page 4). It is possible that the higher percentage of stuck filaments in the HMM motility is due to myosin heads in the IHM state on the motility surface, which generate a drag force by non-specifically interacting with actin, but further study is necessary to examine this question.

      The authors assert that the E525K mutation represents a new mechanism by which DCM-causing mutations lead to decreased contractility - by stabilizing the sequestered state rather than affecting motor function. However, there is no evaluation of the motor function (actin-activated ATPase activity or in vitro motility) of the E525K S1, which would reveal the effects of the mutation without confounding effects due to the sequestering of heads. Interestingly, in the single ATP turnover assay, the DRX rate of the E525K S1 is >2-fold higher than the WT control, suggesting that the mutation may have effects beyond stabilization of the SRX state. The conclusion that the E525K mutation's effect on myosin function is mediated via stabilization of the SRX state would be strengthened if the effects of the mutation on the motor domain alone were also known.

      We thank the reviewer for this suggestion. We performed actin-activated ATPase assays with WT and E525K S1 and found that E525K increases kcat and lowers KATPase, demonstrating enhanced intrinsic motor activity in the mutant S1 construct (see page 4, Figure 2B). This adds an interesting dimension to the manuscript because we report a mutant that enhances the intrinsic motor activity but stabilizes the SRX/IHM (see Discussion page 10). We did not perform in vitro motility, because this assay depends on the surface attachment strategy, and we would like to compare all constructs with the same attachment strategy using a C-terminal GFP tag (mutant and WT S1 and 15HPZ HMM). Therefore, we are making the S1 construct with a C-terminal GFP tag for this purpose, to be examined in a future study.

      While the authors show strong qualitative correlations between the SRX and IHM states using single ATP turnover, FRET, and EM experiments, attempts to quantitatively compare the fraction of heads in the IHM state using the various experimental approaches is problematic. For example, the R0 value of the FRET pair used here doesn't allow precise measurement of the distances being probed here to be made, but the distances are reported and compared to predicted distances. The authors report that the R0 for their FRET pair is 63 Å. Surprisingly the authors go on to use the steady-state FRET efficiency values to determine the average D-A distance (Fig 5B) which is 100 Å when all heads are in the IHM configuration and becomes larger than that when heads open. R0 of 63 Å allows a precise distance measurement to be made in the 31.5-94.5 Å range which corresponds to 0.5-1.5 R0. It is therefore technically incorrect to use the steady-state FRET efficiency values to determine the D-A distance here. Besides, there are several unknown factors here like orientation factor (κ2) which further complicate these calculations. Similarly, the quantification of IHM state molecules from the negative stain EM experiments is significantly hampered by the disruptive effect of the grid surface on the structure of the IHM state. The authors find that limiting the contact time with the grid to ~ 5s is necessary to preserve the IHM state.

      Despite that, only ~15% WT molecules were seen in the IHM state at low salt (Fig. 6B). In contrast, ~56% E525K molecules were in the IHM state. Both these proteins have similar SRX proportions (Fig. 3C) and similar FRET efficiency values (Fig. 5A) at this salt concentration. This mismatch highlights the problem arising due to not having a measure of the populations from the FRET data. It is not clear if the hugely different proportions of the IHM state in EM experiments are indicative of the relative stability of this state in the two proteins or a random difference in the electrostatic interactions of WT vs mutant with the grid. These experiments do not provide a correct idea of the %IHM in the two proteins. In the absence of any IHM population measurement, it is important to proceed with caution when quantitatively correlating the SRX and IHM.

      We thank the reviewer for pointing out that measuring precise distances by FRET can be difficult. We agree that the low FRET efficiency makes precise distance determination even more challenging. However, FRET is quite good at measuring a change in distance given a specific donor-acceptor pair. We feel our FRET biosensor clearly demonstrates FRET efficiencies that are salt-insensitive in E525K but a clear decrease in FRET at higher salt concentrations in WT. In order to compare the trend in the predicted FRET, based on the single turnover measurements, and the actual FRET we thought it was important to plot the two together on the same graph. We understand that this could have been misleading that we were reporting actual distances. We have now plotted the FRET efficiency instead of distance as a function of KCl concentration (Figure 5B), to prevent any confusion with reporting distances. In addition, we have emphasized that the data are plotted to allow for a comparison of the trend in the single turnover and FRET data (see page 6, 10, Figure 5B).

      We agree that it is important to proceed with caution when comparing the EM to the FRET and single turnover data. The EM does not give a quantitative estimate of the fraction of IHM molecules, due to the disruptive effect of the grid surface on protein conformation. However, it does provide direct (though qualitative) evidence that the conformation underlying SRX and enhanced FRET is the IHM, and it is consistent with our interpretation that the E525K mutation enhances FRET and SRX by stabilizing the IHM. To consolidate this result, we have performed EM experiments now with a total of 3 preparations of WT and mutant (see page 6-7 and Figure 6D). We find that while there is variability from experiment to experiment, likely because the grid surface is slightly different each time the experiment is performed, in all cases there was a ~4-fold higher fraction of folded molecules in the mutant. Since each WT/mutant experimental pair was studied in parallel, using identically prepared grids, the results provide further evidence that the mutant stabilizes the IHM. However, we agree that a quantitative, direct visual correlation of the SRX and IHM is not possible based on the current EM data.

      Finally, the utility of the methods described in the paper to the field would be greatly enhanced if they were described in more detail. As currently written, it would be difficult for others to replicate these experiments.

      Thank you for the comment. We have made significant changes in the methods to clarify the details of the experiments (see pages 11-14). In addition, we have added details to the results and figure legends.

    1. Author Response

      Reviewer #1 (Public Review):

      “This study investigates the dynamics of brain network connectivity during sustained experimental pain in healthy human participants. To this end, capsaicin was applied to the tongues of two cohorts of participants (discovery cohort, N=48; replication cohort, N=74). This procedure resulted in pain for several minutes. During sustained pain, pain avoidance/intensity ratings and fMRI scans were obtained. The analyses (i) compare the pain state with a resting state, (ii) assess the dynamics of brain networks during sustained pain, and (iii) aim to predict pain based on the dynamics of brain networks. To this end, the analyses focus on community structures of time-evolving networks. The results show that sustained pain is associated with the emergence of a brain network including somatomotor, frontoparietal, basal ganglia and thalamic brain areas. The somatomotor area of the tongue is particularly involved in that network while this area is decoupled from other parts of the somatomotor cortex. Moreover, the network configuration changes over time with the frontoparietal network decoupling from the somatomotor network. Frontoparietal-cerebellar connections were predictive of decreases of pain. Together, the findings provide novel and convincing insights into the dynamics of brain network during sustained pain.

      Strengths

      • The brain mechanisms of sustained pain is a timely and relevant topic with potential clinical implications.

      • Assessing the dynamics of sustained pain and relating it to the dynamics of brain networks is a timely and promising approach to further the understanding of the brain mechanisms of pain.

      • The study includes discovery and replication cohorts and pursues a cutting-edge analysis strategy.

      • The manuscript is very well-written and the results are visualized in an exemplary manner including a graphical outline and summary of the findings.”

      We thank the reviewer for the thoughtful summarization and evaluation of our study.

      “Weaknesses

      • It remains unclear whether the changes of brain networks over time simply reflect the duration of sustained pain or whether they essentially reflect different levels of pain intensity/avoidance.”

      We appreciate the editor and reviewer’s comment on this issue. With the current experimental paradigm, it is difficult to dissociate the pain duration from the level of pain because the delivery of oral capsaicin commonly induces initial bursting and then a gradual decrease of pain over time. That is, the pain duration is correlated with the pain intensity in our task.

      However, when we examined the time-course of the ratings at each individual level (as shown in Figure S2), the time duration explained 53.7% of the rating variance, R2 = 0.537 ± 0.315 (mean ± standard deviation). In addition, if we constrain the beta coefficient of the time duration to be negative (i.e., ratings should decrease over time), the explained variance decreases to 48.2%, R2 = 0.482 ± 0.457, leaving us enough variance (i.e., greater than 50%) for examining the distinct effects of time duration and ratings on the patterns of functional brain reorganization.

      Indeed, the two main analyses included in the manuscript—consensus community detection and predictive modeling—were designed to examine those two aspects of the task, i.e., time duration and pain avoidance ratings, respectively. First, through the consensus community detection analysis, we examined the community structure that changes over time, i.e., across the early, middle, and late periods (as shown in Figure 3). We then developed predictive models of pain avoidance ratings in the second main analysis (as shown in Figure 5).

      Though it is still a caveat that we cannot fully dissociate the effects of time duration versus pain ratings, we could interpret the first set of results to be more about time duration, while the second set of results is more about pain ratings.

      We now added a description of the implication of predictive modeling for isolating the effects of pain ratings. In addition, a discussion on the caveat of the current experimental design and relevant future direction.

      Revisions to the main manuscript:

      p. 25: Moreover, developing models to directly predict the pain ratings is helpful to complement the group-level analysis, because the changes in consensus community structure over the early, middle, and late periods only indirectly reflect the different levels of pain.

      p. 27: This study also had some limitations. First, with the current experimental paradigm, it is difficult to dissociate the pain duration from the level of pain because the delivery of oral capsaicin commonly induces initial bursting and then a gradual decrease of pain over time. Though we aimed to model the effects of pain duration and pain avoidance ratings with our two primary analyses, i.e., consensus community detection and predictive modeling, we cannot fully dissociate the impact of time duration versus pain ratings.

      “• Although the manuscript is very well-written it might benefit from an even clearer and simpler explanation of what the consensus community structure and the underlying module allegiance measure assesses.”

      We thank you for the suggestion. Now we added additional (but simple) descriptions of module allegiance and consensus community detection methods.

      Revisions to the main manuscript:

      pp. 8-9: Here, the consensus community means the group-level representative structures of the distinct community partitions of individuals. To determine the consensus community across different individuals and times, we first obtained the module allegiance (Bassett et al., 2011) from the community assignment of each individual. Module allegiance assesses how much a pair of nodes is likely to be affiliated with the same community label, and is defined as a matrix T whose element Tij is 1 when nodes i and j are assigned to the same community and 0 when assigned to different communities. This conversion of the categorical community assignments to the continuous module allegiance values allows group-level summarization of different community structures of individuals.

      p. 14: Here, high module allegiance indicates the voxels of two regions are likely to be in the same community affiliation, and vice versa.

      “• The added value of the assessment of the dynamics of brain networks remains unclear. Specifically, it is unclear whether the current analysis of brain networks dynamics allows for a clearer distinction between and prediction of pain and no-pain states than other measures of static or dynamic brain activity or static measures of brain connectivity.”

      The main goal (and thus, the added value) of the current study was to provide a “mechanistic” understanding of the brain processes of sustained pain, rather than the “prediction.” Even though we included the results from the predictive modeling, as in Figures 4-6, our focus was more on the interpretation of the model to quantitatively examine the functional changes in the brain, not on the maximization of the prediction performance.

      Indeed, maximizing the prediction performance was the main goal of our previous study (Lee et al., 2021), in which we developed a predictive model of sustained pain based on the patterns of dynamic functional connectivity. The model showed better prediction performances compared to the current study, but it was challenging to interpret the model because of the high dimensionality of the model and its features. In addition, functional connectivity itself provides only limited insight into how functional brain networks are structured and reconfigured over time.

      In this sense, the multi-layer community detection method has several advantages to achieving our goal. First, the community detection analysis allows us to summarize the complex, high-dimensional whole-brain connectivity patterns into neurobiologically interpretable subsystems. Second, the multi-layer community detection method allows us to study the temporal changes in community structure by connecting the same nodes across different time points.

      Now we added a description of the rationale behind the choice of the multi-layer community detection analysis over the conventional functional connectivity methods, and the added value of our study.

      Revisions to the main manuscript:

      p. 3: In this study, we examined the reconfiguration of whole-brain functional networks underlying the natural fluctuation in sustained pain to provide a mechanistic understanding of the brain responses to sustained pain.

      p. 7: In this study, we used this approach to examine the temporal changes of brain network structures during sustained pain, which cannot be done with conventional functional connectivity-based analyses (Lee et al., 2021).

      p. 27: However, the previous model provides a limited level of mechanistic understanding because of the high dimensionality of the model and its features. In addition, functional connectivity itself provides only limited insight into how functional brain networks are structured and reconfigured over time.

      Reviewer #2 (public Review):

      “The Authors J-J Lee et al., investigated cortical and subcortical brain networks and their organization in communities over time during evoked tonic pain. The paper is well-written, and the findings are interesting and relevant for the field. Interestingly, other than confirming well known phenomena (e.g., segregation within the primary somatomotor cortex) the Authors identified an emerging "pain supersystem" during the initial increase of pain, in which subcortical and frontoparietal regions, usually more segregated, showed more interactions with the primary somatomotor cortex. Decrease of pain was instead associated to a reconfiguration of the networks that sees subcortical and frontoparietal regions connected with areas of the cerebellum. The main novelty of the proposed analysis, lies in the resulting high performances of the classifier, that shows how this interesting link between frontoparietal network and subcortical regions with the cerebellum, is predictive of pain decrease. In summary, the main strengths of the present manuscript are: • Inclusion of subcortical regions: most of the recent papers using the Shaefer parcellation in ~200 brain areas1, do not consider subcortical areas, ignoring possible relevant responses and behaviors of those regions. Not only the Authors smartly addressed this issue, but most of their results showed how subcortical regions played a key role in the networks reconfiguration over time during evoked sustained pain.

      • Robust classification results: high accuracy obtained on training dataset (internal validation), using a leave-one-out approach, and on the available independent test dataset (external validation) of relatively large sample size (N=74).

      • Clarity in the description of aim and sub-aims and exhaustive presentation of the obtained results helped by appropriate illustrations and figures (I suggest less wording in some of them).

      • Availability of continuous behavioral outcome (track ball).”

      We appreciate the reviewer’s summary and positive evaluations.

      “Even though the results are mostly cohesive with previous literature, some of the results need to be discussed in relationship to recently published papers on the same topic as well as justifying some of the non-standard methodological procedures adding appropriate citations (or more detailed comments). The Authors do not touch upon the concept of temporal summation of pain, historically associated with tonic pain, especially when the study is finalized to better understanding brain mechanisms in chronic pain populations (chronic pain patients often exhibit increased temporal summation of pain2). I would suggest starting from the paper recently published by Cheng et al. that also shares most of the methodological pipeline3 to highlight similarities and novelties and deepen the comparison with the associated literature.”

      We thank the reviewer and editor for the comment on this important topic. Temporal summation of pain indicates progressively increased sensation of pain during prolonged noxious stimulation (Price, Hu, Dubner, & Gracely, 1977), and has been suggested as a hallmark of chronic pain disorders including fibromyalgia (Cheng et al., 2022; Price et al., 2002). In a recent study by Cheng et al. (2022), the authors induced tonic pain using constantly high cuff pressure and examined whether the participants experienced increased pain in the late period compared to the early period of pain. On the contrary, in our experimental paradigm, the capsaicin liquid initially delivered into the oral cavity is being cleaned out by saliva, and thus overall pain intensity was decreasing over time, not increasing (Figure 1B). Therefore, the temporal summation of pain may occur in a limited period (e.g., the early period of the run), but it is difficult to examine its effect systematically in our study.

      However, it is notable that Cheng et al.’s results overlap with our findings. For example, Cheng et al. reported the intra-network segregation within the somatomotor network and the inter-network integration between the somatomotor and other networks during the temporal summation of pressure pain in patients with fibromyalgia, which were similar to the findings we reported in Figure S9 and Figure 4. Although it is unclear whether these results reflect the temporal summation of pain, these network-level features shared across the two studies are likely to be an essential component of the sustained pain processes in the brain.

      Now we added a comment on the temporal summation of pain in the main manuscript.

      Revisions to the main manuscript (p. 26):

      Interestingly, a recent fMRI study on the temporal summation of pain in fibromyalgia patients reported results similar to ours (Cheng et al., 2022), including the intra-network dissociation within the somatomotor network and the inter-network integration between the somatomotor and other networks during pain. Although we cannot directly examine whether the temporal summation of pain gave rise to these network-level changes due to the limitation of our experimental paradigm, these consistent findings between the two studies may suggest that our findings could be generalized to clinical conditions.

      We thank the reviewer and editor for the information about this recent publication. Cheng et al. (2022) was not published at the time we wrote the manuscript, and we were surprised that Cheng et al. shares many aspects with our study, e.g., both used multilayer community detection and also reported similar findings, as described above.

      However, there were some differences between the two studies as well.

      First, the focus of our study was on the brain dynamics during the natural time-course of sustained pain from its initiation to remission in healthy participants, whereas the focus of Cheng et al. was on the temporal summation phenomenon of pain (TSP) and the enhanced TSP in patients with fibromyalgia patients. Because of this difference in the research focuses, our study and Cheng et al. are providing many nonoverlapping results and insights. For example, our study paid particular attention to the coping mechanisms of the brain (e.g., the network-level changes in the subcortical and frontoparietal network regions) and the brain systems that are correlated with the natural decrease of pain (e.g., the cerebellum in Figure 5). In contrast, Cheng et al. (2022) identified the brain connectivity and network features important for the increased TSP in fibromyalgia patients.

      Second, our great interest was in identifying and visualizing the fine-grained spatiotemporal patterns of functional brain network changes over the period of sustained pain. To utilize fine-grained brain activity information, we conducted our main analyses at a voxel-level resolution and on the native brain space, such as in Figures 2-3 and Figures S5, S7, and S8. With this fine-grained spatiotemporal mapping, we were able to identify small, but important voxel-level dynamics.

      We now cited Cheng et al. (2022) in multiple places and revised the manuscript accordingly.

      Revisions to the main manuscript (p. 26):

      Interestingly, a recent fMRI study on the temporal summation of pain in fibromyalgia patients reported results similar to ours (Cheng et al., 2022), including the intra-network dissociation within the somatomotor network and the inter-network integration between the somatomotor and other networks during pain. Although we cannot directly examine whether the temporal summation of pain gave rise to these network-level changes due to the limitation of our experimental paradigm, these consistent findings between the two studies may suggest that our findings could be generalized to clinical conditions.

      “Here the main significant weaknesses of the study:

      • The data analysis is entirely conducted on young healthy subjects. This is not a limitation per se, but the conclusion about offering new insights into understanding mechanisms at the basis of chronic pain is too far from the results. Centralization of pain is very different from summation and habituation, especially if all the subjects in the study consistently rated increased and decreased pain in the same way (it never happens in chronic pain patients). A similar pipeline has been actually applied to chronic pain patients (fibromyalgia and chronic back pain)3,4. Discussing the results of the present paper in relationship to those, could offer a more robust way to connect the Authors' results to networks behavior in pathological brains.”

      We are grateful for the opportunity to discuss the clinical implication of our study. First of all, we agree with the reviewer and editor that we cannot make a definitive claim about chronic pain with the current study, and thus, we revised the last sentence of the abstract to tone down our claim.

      Revisions to the main manuscript (p. 2, in the abstract):

      This study provides new insights into how multiple brain systems dynamically interact to construct and modulate pain experience, advancing our mechanistic understanding of sustained pain.

      However, as we noted above in E-4, some of our findings were consistent with the findings from a previous clinical study (Cheng et al., 2022), suggesting the potential to generalize our study to clinical pain conditions. In addition, we previously reported that a predictive model of sustained pain derived from healthy participants performed better at predicting the pain severity of chronic pain patients than the model derived directly from chronic pain patients (Lee et al., 2021), highlighting the advantage of the “component process approach.”

      The component process approach aims to develop brain-based biomarkers for basic component processes first, which can then serve as intermediate features for the modeling of multiple clinical conditions (Woo, Chang, Lindquist, & Wager, 2017). This has been one of the core ideas of the Research Domain Criteria (RDoC) (Insel et al., 2010) and the Hierarchical Taxonomy of Psychopathology (HiTOP) (Kotov et al., 2017). If the clinical pain of a patient group is modeled as a whole, it becomes unclear what is being modeled because of the multidimensional and heterogeneous nature of clinical pain (Melzack, 1999) as well as other co-occurring health conditions (e.g., mental health issues, medication use, etc.). The component process approach, in contrast, can specify which components are being modeled and are relatively free from heterogeneity and comorbidity issues by experimentally manipulating the specific component of interest in healthy participants.

      The current study was conducted on healthy young adults based on the component process approach. We used oral capsaicin to experimentally induce sustained pain, which unfolds over protracted time periods and has been suggested to reflect some of the essential features of clinical pain (Rainville, Feine, Bushnell, & Duncan, 1992; Stohler & Kowalski, 1999). Therefore, the detailed characterization of the brain processes of sustained pain will be able to serve as an intermediate feature of multiple clinical conditions in future studies.

      Now we added the discussion on the clinical generalizability issue in the discussion section.

      Revisions to the main manuscript:

      pp. 25-26: An interesting future direction would be to examine whether the current results can be generalized to clinical pain. Experimental tonic pain has been known to share similar characteristics with clinical pain (Rainville et al., 1992; Stohler & Kowalski, 1999). In addition, in a recent study, we showed that an fMRI connectivity-based signature for capsaicin-induced orofacial tonic pain can be generalized to chronic back pain (Lee et al., 2021). Therefore, a detailed characterization of the brain responses to sustained pain has the potential to provide useful information about clinical pain.

      p. 26: Interestingly, a recent fMRI study on the temporal summation of pain in fibromyalgia patients reported results similar to ours (Cheng et al., 2022), including the intra-network dissociation within the somatomotor network and the inter-network integration between the somatomotor and other networks during pain. Although we cannot directly examine whether the temporal summation of pain gave rise to these network-level changes due to the limitation of our experimental paradigm, these consistent findings between the two studies may suggest that our findings could be generalized to clinical conditions.

      “Vice versa, the behavioral measure used to assess evoked pain perception (avoidance ratings), has been developed for chronic pain patients and never validated on healthy controls5. It might not be an appropriate measure considering the total absence of pain variability in the reported responses over forty-eight subjects6,7.”

      We acknowledge that pain avoidance measures are not fully validated in the healthy population. Nevertheless, we used this measure in this study for the following two main reasons that outweigh the limitations.

      First, a pain avoidance rating provides an integrative measure that can reflect the multi-dimensional aspects of sustained pain. One of the essential functions of pain is to avoid harmful situations and promote survival, and the avoidance motivation induced by pain is composed of not only sensory-discriminative, but also cognitive components including learning, valuation, and contexts (Melzack, 1999). According to the fear-avoidance model (Vlaeyen & Linton, 2012), if the pain-induced avoidance motivation is not resolved for a long time and is maladaptively associated with innocuous environments, chronic pain is likely to develop, suggesting the importance and clinical relevance of pain avoidance measures. In addition, our experimental design is particularly suitable for the use of avoidance rating because the oral capsaicin stimulation is accompanied by the urge to avoid the painful sensation, but it cannot immediately be resolved similar to chronic pain. Moreover, capsaicin is sometimes experienced as intense but less aversive (or even appetitive) in some cases, e.g., spicy food craver (Stevenson & Yeomans, 1993). In this case, avoidance ratings can provide a more reasonable measure of pain compared to the intensity rating.

      Second, the avoidance measure provides a common scale on which we can compare different types of aversive experiences, allowing us to conduct specificity tests for a predictive model of pain. For example, a recent study successfully compared the brain representations of two types of pain and two types of aversive, but non-painful experiences (e.g., aversive auditory and visual experiences) using the same avoidance measure (Ceko, Kragel, Woo, Lopez-Sola, & Wager, 2022). These comparisons were possible because the avoidance measure provided one common scale for all the aversive experiences regardless of their types of stimuli.

      To provide a better justification for the use of the avoidance measure, we now included the specificity test results of our pain predictive models. More specifically, we tested our module allegiance-based SVM and PCR models of pain on the aversive taste and aversive odor conditions (Figure S13).

      Despite these advantages, the use of avoidance rating without thorough validation is a limitation of the current study, and thus future studies need to examine the psychometric properties of the avoidance rating, e.g., examining the relationship among pain intensity, unpleasantness, and avoidance measures. However, the current study showed that the predictive models derived with pain avoidance rating (Study 1) could be used to predict the pain intensity rating (Study 2). In addition, the overall time-course of pain avoidance ratings in Study 1 was similar to the time-course of pain intensity ratings in Study 2, providing some supporting evidence for the convergent validity of the pain avoidance measure.

      As to the following comment, “It might not be an appropriate measure considering the total absence of pain variability in the reported responses over forty-eight subjects,” there are pieces of evidence supporting that the low between-individual variability of ratings is due to the characteristics of our experimental design, not to the fact that we used the avoidance measure. As we discussed in more detail in our response to E-1, our experimental procedure based on capsaicin liquid commonly induces the initial burst of painful sensation and the subsequent gradual relief for most of the participants (Figure 1B, left). A similar time-course pattern of ratings was observed in Study 2 (Figure 1B, right), which used the pain “intensity” rating, not the pain avoidance rating. In addition, previous studies with a similar experimental design (i.e., intra-oral capsaicin application) (Berry & Simons, 2020; Lu, Baad-Hansen, List, Zhang, & Svensson, 2013; Ngom, Dubray, Woda, & Dallel, 2001) also showed a similar time-course of pain ratings with low between-individual variability regardless of the rating types (e.g., VAS or irritation intensity), confirming that this observation is not unique to the pain avoidance rating.

      Now we added descriptions on the small between-individual variability of pain ratings and the use of avoidance ratings.

      Revisions to the main manuscript:

      pp. 5-7: Note that the overall trend of pain ratings over time was similar across participants because of the characteristics of our experimental design, which has also been observed in the previous studies that used oral capsaicin (Berry & Simons, 2020; Lu et al., 2013; Ngom et al., 2001). However, also note that each individual’s time-course of pain ratings were not entirely the same (Figures S2 and S3).

      p. 26: However, there are also differences between the characteristics of capsaicin-induced tonic pain versus clinical pain. For example, clinical pain continuously fluctuates over time in an idiosyncratic pattern (Apkarian, Krauss, Fredrickson, & Szeverenyi, 2001), whereas capsaicin-induced tonic pain showed a similar time-course pattern across the participants—i.e., increasing rapidly and then decreasing gradually (Figure 1B). This typical time-course of pain ratings has been reported in previous studies that used oral capsaicin (Berry & Simons, 2020; Lu et al., 2013; Ngom et al., 2001).

      pp. 26-27: Note that Study 1 used a pain avoidance measure that is not yet fully validated in healthy participants. However, we chose to use the pain avoidance measure, which can provide integrative information on the multi-dimensional aspects of pain (Melzack, 1999; Waddell, Newton, Henderson, Somerville, & Main, 1993). It also has a clinical implication considering that the maladaptive associations of pain avoidance to innocuous environments have been suggested as a putative mechanism of transition to chronic pain (Vlaeyen & Linton, 2012). Lastly, the avoidance measure can provide a common scale across different modalities of aversive experience, allowing us to compare their distinct brain representations (Ceko et al., 2022) or test the specificity of their predictive models (Lee et al., 2021) (Figure S13). Although the psychometric properties of the pain avoidance measure should be a topic of future investigation, we expect that the pain avoidance measure would have a high level of convergent validity with pain intensity given the observed similarity between pain avoidance (Study 1) and pain intensity (Study 2) in their temporal profiles. The generalizability of our PCR model across Studies 1 and 2 also supports this speculation. However, there would also be situations in which pain avoidance is dissociated from pain intensity. For example, capsaicin can be experienced to be intense but less aversive or even appetitive in some contexts, such as cravings for spicy food (Stevenson & Yeomans, 1993). In addition, the gradual rise of avoidance ratings during the late period of the control condition in Study 1 would not be observed if the intensity measure was used. Future studies need to examine the relationship between pain avoidance and the other pain assessments and the advantage of using the pain avoidance measure.

      “• The dynamic measure employed by the Authors is better described from the term "windowed functional connectivity". It is often considered a measure of dynamic functional connectivity and it gives information about fluctuations of the connectivity patterns over time. Nevertheless, the entire focus of the paper, including the title, is on dynamic networks, which inaccurately leads one to think of time-varying measures with higher temporal resolution (either updating for every acquired time point, as the Authors did in their previous publication on the same dataset4, or sliding windows involving weighting or tapering8,9). This allows one to follow network reorganization over time without averaging 2-min intervals in which several different brain mechanisms might play an important role3,10,11. In summary, the assumption of constant response throughout 2-min periods of tonic pain and the use of Pearson correlations do not mirror the idea of dynamic analysis expressed by the Authors in title and introduction. I would suggest removing "dynamic" from the title, reduce the emphasis on this concept, address possible confounds introduced by the choice of long windows and rephrase the aim of the study in terms of brain network reconfiguration over the main phases of tonic pain experience.”

      Now we removed the word ‘dynamic’ from many places in the manuscript, including the title. In addition, we added a brief discussion on the reason we chose to use the long and non-overlapping windows for connectivity calculation.

      Revisions to the main manuscript (p. 8):

      Although the long duration of the time window without overlaps may obscure the fine-grained temporal dynamics in functional connectivity patterns, we chose to use this long time window based on previous literature (Bassett et al., 2011; Robinson, Atlas, & Wager, 2015), which also used long time windows to obtain more reliable estimates of network structures and their transitions.

      “• Procedure chosen for evoking sustained pain. To the best of my knowledge, capsaicin sauce on the tongue is not a validated tonic pain procedure. In favor of this argument is the absence of inter-subject variability in the behavioral results showed in the paper, very unusual for response to painful stimulations. The procedure is well described by the Authors, and some precautions like letting the liquid drying before the start of the scan, have helped reducing confounds. Despite this, the measures in figure 1B suggest that the intensity of the painful stimulation is not constant as expected for sustained pain (probably the effect washes out with the saliva). In this case, the first six-minute interval requires particular attention because it encapsulates the real tonic pain phase, and the following ones require more appropriate labels. Ideally the Author should cite previous studies showing that tongue evoked pain elicits a very specific behavioral response (summation, habituation/decrease of pain, absence of pain perception). If those works are missing, this response need to be treated as a funding rather than an obvious point.”

      We addressed this comment. Moreover, we could find previous studies that experimentally induced tonic pain through the application of capsaicin on the tongue (Berry & Simons, 2020; Boudreau, Wang, Svensson, Sessle, & Arendt-Nielsen, 2009; Green, 1991; Ngom et al., 2001), suggesting that our experimental procedure is in line with previous literature.

      Reviewer #3 (Public Review ):

      “In their manuscript, Lee and colleagues explore the dynamics of the functional community structure of the brain (as measured with fMRI) during sustained experimental pain and provide several potentially highly valuable insights into, and evaluate the predictive capacity of, the underlying dynamic processes. The applied methodology is novel but, at the same time, straightforward and has solid foundations. The findings are very interesting and, potentially, of high scientific impact as they may significantly push the boundaries of our understanding of the dynamic neural processes during sustained pain, with a (somewhat limited) potential for clinical translation.

      However (Major Issue 1), after reading the current manuscript version, not all of my doubts have been dissolved regrading the specificity of the results to pain. Moreover (Major Issue 2), some of the results (specifically, those related to the group level analysis of community differences) do not seem to be underpinned with a proper statistical inference in the current version of the manuscript and, therefore, their presentation and discussion may not be proportional to the degree of evidence. Next to these Major Issues (detailed below), some other, minor clarifications might also be needed before publications. These are detailed below or in the private part of the review ("Recommendations for the authors").

      Despite these issues, this is, in general, a high quality work with a high level of novelty and - after addressing the issues - it has a very high potential for becoming an important contribution (and a very interesting read) to the pain-research community and beyond.”

      We appreciate the reviewer’s thoughtful comments. We have revised the manuscript to address the Reviewer’s major concerns, as described below.

      “Major Issue 1:

      The main issue with the manuscript is that it remains somewhat unclear, how specific the results are to pain.

      Differences between the control resting state and the capsaicin trials might be - at least partially - driven by other factors, like:

      • motion artifacts

      • saliency, attention, axiety, etc.

      Differences between stages over the time-course might, additionally, be driven by scanner drifts (to which the applied approach might be less sensitive, but the possibility is still there ) or other gradual processes, e.g. shifts in arousal, attention shifts, alertness, etc.

      All the above factors might emerge as confounding bias in both of the predictive models.

      This problem should be thoroughly discussed, and at least the following extra analyses are recommended, in order to attenuate concerns related to the overall specificity and neurobiological validity of the results:

      • reporting of, and testing for motion estimates (mean, max, median framewise displacement or anything similar)

      • examining whether these factors might, at least partially, drive the predictive models.

      • e.g. applying the PCR model on the resting state data and verifying of the predicted timecourse is flat (no inverse U-shape, that is characteristic to all capsaicin trials).

      Not using the additional sessions (bitter taste, aversive odor, phasic heat) feels like a missed opportunity, as they could also be very helpful in addressing this issue.”

      We thank the reviewer for this comment on the important issue regarding the specificity of our results and the potential influences of noise. The effects of head motion and physiological confounds are particularly relevant to pain studies because pain involves substantial physiological changes and often causes head motion. To address the related concerns of specificity, we conducted additional analyses assessing the independence of our predictive models (i.e., SVM and PCR models) from head movement and physiology variables and the specificity of our models to pain versus non-painful aversive conditions (i.e., bitter taste and aversive odor) in Study 1.

      First, we examined the overall changes of framewise displacement (FD) (Power, Barnes, Snyder, Schlaggar, & Petersen, 2012), heart rate (HR), and respiratory rate (RR) in the capsaicin condition (Figure S11). For the univariate comparison between the capsaicin vs. control conditions (Figure S11A), the results showed that, as expected, the capsaicin condition caused significant changes in head motion and autonomic responses. The mean FD and HR were significantly higher, and the RR was lower in the capsaicin condition compared to the control condition (FD: t47 = 5.30, P = 2.98 × 10-6; HR: t43 = 4.98, P = 1.10 × 10-5; RR: t43 = -1.91, P = 0.063, paired t-test). In addition, the increased motion and autonomic responses were more prominent in the early period of pain (Figure S11B). The 10-binned (2 mins per time-bin) FD and HR showed a decreasing trend while the RR showed an increasing trend over time in the capsaicin condition. The comparisons between the early (1-3 bins, 0-6 min) vs. late (8-10 bins, 14-20 min) periods of the capsaicin condition showed significant differences both for FD and HR (FD: t47 = 6.45, P = 8.12 × 10-8; HR: t43 = 6.52, P = 6.41 × 10-8; RR: t43 = -1.61, P = 0.11, paired t-test). These results suggest that while participants were experiencing capsaicin tonic pain, particularly during the early period, head motion and heart rate were increased, while breathing was slowed down. Note that we needed to exclude 4 participants’ data in this analysis due to technical issues with the physiological data acquisition.

      Next, we examined whether the changes in head motion and physiological responses influenced our predictive model performance (Figure S12). We first regressed out the mean FD, HR, and RR (concatenated across conditions and participants as we trained the SVM model) from the predicted values of the SVM model with leave-one-subject-out cross-validation (2 conditions × 44 participants = 88) and then calculated the classification accuracy again (Figure S12A). The results showed that the SVM model showed a reduced, but still significant classification accuracy for the capsaicin versus control conditions in a forced-choice test (n = 44, accuracy = 89%, P = 1.41 × 10-7, binomial test, two-tailed). We also did the same analysis for the PCR model (10 time-bins × 44 participants = 440) and the PCR model also showed a significant prediction performance (n = 44, mean prediction-outcome correlation r = 0.20, P = 0.003, bootstrap test, two-tailed, mean squared error = 0.159 ± 0.022 [mean ± s.e.m.]) (Figure S12B). These results suggest that our SVM and PCR models capture unique variance in tonic pain above and beyond the head movement and physiological changes.

      Lastly, we examined the specificity of our predictive models to pain, by testing the models on the non-painful but aversive conditions including the bitter taste (induced by quinine) and aversive odor (induced by fermented skate) conditions (Figure S13). All the model responses were obtained using leave-one-participant-out cross-validation. The results showed that the overall model responses of the SVM model for the bitter taste and aversive odor conditions were higher than those for the control condition but lower than the capsaicin condition (Figure S13A). Classification accuracies for comparing capsaicin vs. bitter taste and capsaicin vs. aversive odor were all significant (for capsaicin vs. bitter taste, accuracy = 79%, P = 6.17 × 10-5, binomial test, two-tailed, Figure S13C; for capsaicin vs. aversive odor, accuracy = 83%, P = 3.31 × 10-6, binomial test, two-tailed, Figure S13E), supporting the specificity of our SVM model of pain. Similarly, the model responses of the PCR model for the bitter taste and aversive odor conditions were lower than the capsaicin condition, and their temporal trajectories were less steep and fluctuating compared to the capsaicin condition (Figure S13B). The time-course of the model responses for the control condition was flatter than all other conditions and did not show the inverted U-shape. Furthermore, the model responses of the bitter taste and aversive odor conditions did not show the significant correlations with the actual avoidance ratings (bitter taste: mean prediction-outcome correlation r = 0.05, P = 0.41, bootstrap test, two-tailed, mean squared error = 0.036 ± 0.006 [mean ± s.e.m.], Figure S13D; aversive odor: mean prediction-outcome correlation r = 0.12, P = 0.06, bootstrap test, two-tailed, mean squared error = 0.044 ± 0.004 [mean ± s.e.m.], Figure S13F), suggesting the specificity of PCR model to pain.

      Overall, we have provided evidence that our models can predict pain ratings above and beyond the head motion and physiological changes and that the models are more responsive to pain compared to non-painful aversive conditions.

      Now we added descriptions on the specificity tests to the main manuscript and also to the Supplementary Information.

      Revisions to the main manuscript (p. 20):

      Specificity of the module allegiance-based predictive models To examine whether the predictive models were specific to pain and the prediction performances were not influenced by confounding variables such as head motion and physiological changes, we conducted additional analyses as shown in Figures S11-13. The SVM and PCR models showed significant prediction performances even after controlling for head motion (i.e., framewise displacement) and physiological responses (i.e., heart rate and respiratory rate) (Figures S11 and S12) and did not respond to the non-painful but aversive conditions including the bitter taste and aversive odor conditions (Figure S13), supporting the specificity of our predictive to pain. For details, please see Supplementary Results.

      Revisions to the Supplementary Information (pp. 2-4):

      Specificity analysis (Figures S11-13) To examine whether the predictive models (i.e., SVM and PCR models) were specific to pain and not influenced by confounding noises, we conducted additional specificity analysis assessing the independence of the models from head movement and physiology variables and specificity of our models to pain versus non-painful aversive conditions (i.e., bitter taste and aversive odor) in Study 1. First, we examined the overall changes of framewise displacement (FD) (Power et al., 2012), heart rate (HR), and respiratory rate (RR) in sustained pain (Figure S11). For the univariate comparison between capsaicin vs. control conditions (Figure S11A), the results showed that, as expected, capsaicin condition caused significant changes in motion and autonomic responses. The mean FD and HR were significantly higher, and the RR was lower in the capsaicin condition compared to the control condition (FD: t47 = 5.30, P = 2.98 × 10-6; HR: t43 = 4.98, P = 1.10 × 10-5; RR: t43 = -1.91, P = 0.063, paired t-test). For the temporal changes of movement and physiology variables (Figure S11B), the results showed that the increased motion and autonomic responses are more prominent in the early period of pain. The 10-binned (2 mins per time-chunk) FD and HR showed decreasing trend while the RR showed increasing trend over time in capsaicin condition. Additional univariate comparisons between early (1-3 bins, 0-6 min) vs. late (8-10 bins, 14-20 min) period of capsaicin condition showed that differences were significant for FD and HR (FD: t47 = 6.45, P = 8.12 × 10-8; HR: t43 = 6.52, P = 6.41 × 10-8; RR: t43 = -1.61, P = 0.11, paired t-test). This suggests that while participants were experiencing tonic pain, particularly in the early period, motion and heart rate was increased but breathing was slowed. Note that we needed to exclude 4 participants’ data due to technical issues with physiological data acquisition. Next, we examined whether the head movement and physiological responses are the main driver of our predictive models (Figure S12). For all the original signature responses from SVM model (2 conditions × 44 participants = 88), we regressed out the mean FD, HR, and RR (concatenated across conditions and participants as the SVM model was trained) and calculated the classification accuracy (Figure S12A). Although the signature responses were controlled for movement and physiology variables, the SVM model still showed a high classification accuracy for the capsaicin versus control conditions in a forced-choice test (n = 44, accuracy = 89%, P = 1.41 × 10-7, binomial test, two-tailed). Similarly, for all the original signature responses from PCR model (10 time-bins × 44 participants = 440), we regressed out the 10-binned FD, HR, and RR (concatenated across time-bins and participants as the PCR model was trained) and calculated the within-individual prediction-outcome correlation (Figure S12B). Again, the PCR model showed a significantly high predictive performance (n = 44, mean prediction-outcome correlation r = 0.20, P = 0.003, bootstrap test, two-tailed, mean squared error = 0.159 ± 0.022 [mean ± s.e.m.]) while controlling for movement and physiology variables. These results suggest that our SVM and PCR models captures unique variance in tonic pain above and beyond the head movement and physiological changes. Lastly, we examined the specificity of our predictive models to pain, by testing the models onto the non-painful but tonic aversive conditions including bitter taste (induced by quinine) and aversive odor (induced by fermented skate) (Figure S13). All the signature responses were obtained using leave-one-participant-out cross-validation. The results showed that the overall signature responses of SVM model for bitter taste and aversive odor conditions were higher than those for control conditions, but lower than capsaicin condition (Figure S13A). Classification accuracy between capsaicin vs. bitter taste and vs. aversive odor were all significantly high (capsaicin vs. bitter taste: accuracy = 79%, P = 6.17 × 10-5, binomial test, two-tailed, Figure S13C; capsaicin vs. aversive odor: accuracy = 83%, P = 3.31 × 10-6, binomial test, two-tailed, Figure S13E), suggesting the specificity of SVM model to pain. Similarly, the temporal trajectories of the signature responses of PCR model for bitter taste and aversive odor conditions were not overlapping with that of the capsaicin condition (Figure S13B). Furthermore, the signature responses of bitter taste and aversive odor conditions do not have significant relationship with the actual avoidance ratings (bitter taste: mean prediction-outcome correlation r = 0.05, P = 0.41, bootstrap test, two-tailed, mean squared error = 0.036 ± 0.006 [mean ± s.e.m.], Figure S13D; aversive odor: mean prediction-outcome correlation r = 0.12, P = 0.06, bootstrap test, two-tailed, mean squared error = 0.044 ± 0.004 [mean ± s.e.m.], Figure S13F), suggesting the specificity of PCR model to pain. Overall, we have provided evidence that the module allegiance-based models can predict pain ratings above and beyond the movement and physiological changes, and are more responsive to pain compared to non-painful aversive conditions, which suggest the specificity of our results to pain.

      “Major Issue 2:

      Another important issue with the manuscript is the (apparent) lack of statistical inference when analyzing the differences in the group-level consensus community structures (both when comparing capsaicin to control and when analysing changes over the time-course of the capsaicin-challenge).

      Although I agree that the observed changes seem biologically plausible and fit very well to previous results, without proper statistical inference we can't determine, how likely such differences are to emerge just by chance.

      This makes all results on Figs. 2 and 3, and points 1, 4 and 5 in the discussion partially or fully speculative or weakly underpinned, comprising a large proportion of the current version of the manuscript.

      Let me note, that this issue only affects part of the results and the remaining - more solid - results may already provide a substantial scientific contribution (which might already be sufficient to be eligible for publication in eLife, in my opinion).

      Therefore I see two main ways of handling Major Issue 2:

      • enhancing (or clarifying potential misunderstandings regarding) the methodology (see my concrete, and hopefully feasible, suggestions in the "private part" of the review),

      • de-weighting the presentation and the discussion of the related results.

      I believe there are many ways to test the significance of these differences. I highlight two possible, permutation testing-based ideas.

      Idea 1: permuting the labels ctr-capsaicin, or early-mid-late, repeating the analysis, constructing the proper null distribution of e.g. the community size changes and obtain the p-values. Idea 2: "trace back" communities to the individual level and do (nonparametric) statistical inference there.”

      We appreciate this important comment. We did not conduct statistical inference when comparing the group-level consensus community affiliations of the different conditions (Figure 2) or different phases (Figure 3) because of the difficulty in matching the community affiliation values of the networks to be compared.

      For example, let us assume that the 800 out of 1,000 voxels of community #1 and 1,000 out of 4,000 voxels of community #2 in the control condition are commonly affiliated with the same community #3 in the capsaicin condition. To compare the community affiliation between two conditions, we should first match the community label of the capsaicin condition (i.e., #3) to that of the control condition (i.e., #1 or #2), and here a dilemma occurs; if we prioritize the proportion of the overlapping voxels for the matching, the common community should be labeled as #1, whereas if we prioritize the number of the overlapping voxels for the matching, the label of the common community should be #2. Although both choices look reasonable, none of them can be a perfect solution.

      As the example above, it is impossible to exactly match the community affiliation of the different networks. We must choose an imperfect criterion for the matching procedure, which essentially affects the comparison of network structure. This was the main reason that we limited our results of Figures 2-3 to a qualitative description based on visual inspection. Moreover, the group-level consensus community structures in Figures 2-3 are not a simple group statistic like sample mean; they were obtained from multiple steps of analyses including permutation-based thresholding and unsupervised clustering, which could further complicate the interpretation of statistical tests.

      Alternatively, there is a slightly different but more rigorous approach to the comparisons of the community structures, which is the Phi-test (Alexander-Bloch et al., 2012; Lerman-Sinkoff & Barch, 2016). Instead of direct use of the community labels, this method converts the community label of each voxel into a list of module allegiance values between the seed voxel and all the voxels of the brain (i.e., 1 if the seed and target voxels have the same community label and 0 otherwise). This allows quantitative comparisons of voxel-level community profiles between different conditions without an arbitrarily matching of the community labels. We adopted this Phi-test for our analyses to examine whether the regional community affiliation pattern is significantly different between (i) the capsaicin vs. control conditions and (ii) the early vs. late periods of pain (Figure S6), which correspond to the main findings of the Figures 2 and 3 in our manuscript, respectively.

      More specifically, to compare the group-level consensus community structures between the capsaicin vs. control conditions and the early vs. late periods, we first obtained a seed-based module allegiance map for each voxel (i.e., using each voxel as a seed). Then, we calculated a correlation coefficient of the module allegiance values between two different conditions for each voxel. This correlation coefficient can serve as an estimate of the voxel-level similarity of the consensus community profile. Because module allegiance is a binary variable, these correlation values are Phi coefficients. A small Phi coefficient means that the spatial pattern of brain regions that have the same community affiliation with the given voxel are different between the two conditions. For example, if a voxel is connected to the somatomotor-dominant community during the capsaicin condition and the default-mode-dominant community during the control condition, the brain regions that have the same community label with the voxel will be very different, and thus the Phi coefficient will become small. Moreover, the Phi coefficient can be small even if a voxel is affiliated as the same (matched) community label for both conditions, when the spatial patterns of the same community is different between conditions.

      To calculate the statistical significance of the Phi coefficient, we conducted permutation tests, in which we randomly shuffled the condition labels in each participant and obtained the group-level consensus community structure for each shuffled condition. Then, we calculated the voxel-level correlations of the module allegiance values between the two shuffled conditions. We repeated this procedure 1,000 times to generate the null distribution of the Phi coefficients, and calculated the proportion of null samples that have a smaller Phi coefficient (i.e., a more dis-similar regional community structure) than the non-shuffled original data.

      Results showed that there are multiple voxels with statistical significance (permutation tests with 1,000 iterations, one-tailed) in the area where the community affiliations of the two contrasting conditions were different (Figure S6). For example, the frontoparietal and subcortical regions for the capsaicin vs. control (c.f., Figure 2), and the frontoparietal, subcortical, brainstem, and cerebellar regions for the early vs. late period of pain (c.f., Figure 3) contain voxels that survived after thresholding with FDR-corrected q < 0.05, suggesting the robustness of our main results.

      Particularly, the somatomotor and insular cortices showed statistical significance in the permutation test, and this may reflect the large changes in other areas that are connecting to the somatomotor and insular cortices across different conditions. The statistical significance was also observed in the visual cortex, which was unexpected. We interpret that the spatial distribution of the visual network community is too stable across conditions, and thus the null distribution from permutation formed a very narrow distribution of Phi coefficients. Therefore, a small change in the community structure could achieve statistical significance.

      Now we added descriptions on the permutation tests.

      Revisions to the main manuscript:

      p. 9: Permutation tests confirmed that the community assignment in the frontoparietal and subcortical regions showed significant changes between the capsaicin versus control conditions (Figure S6A).

      p. 13: Permutation tests further confirmed that the community assignment in the frontoparietal, subcortical, and brainstem regions showed significant changes between the early versus late period of pain (Figure S6B).

      pp. 36-37: Permutation tests for regional differences in community structures. To test the statistical significance of the voxel-level difference of consensus community structures (Figures 2 and 3), we performed the following Phi-test (Alexander-Bloch et al., 2012; Lerman-Sinkoff & Barch, 2016). First, for each given voxel, we compared the community label of the voxel to the community label of all the voxels, generating a list of voxel-seed module allegiance values that allow quantitative comparison of voxel-level community profile (e.g., [1, 0, 1, 1, 0, 0, ...], whose element is equal to 1 if the seed and target voxels were assigned to the same community and 0 otherwise). Next, a correlation coefficient was calculated between the module allegiance values of the two different brain community structures (i.e., capsaicin versus control, and early versus late). This correlation coefficient is an estimate of the regional similarity of community profiles (here, the correlation coefficient is Phi coefficient because module allegiance is a binary variable). To estimate the statistical significance of the Phi coefficient, we performed permutation tests, in which we randomly shuffled the labels and then obtained the group-level consensus community structures from the shuffled data. Then, the Phi coefficient between the module allegiance values of the two shuffled consensus community structures was calculated. We repeated this procedure 1,000 times to generate the null distribution of the Phi coefficient for each voxel. Lastly, we examined the probability to observe a smaller Phi coefficient (i.e., a more dissimilar community profile) than the one from the non-shuffled original data, which corresponds to the P-value of the permutation test. All the P-values were one-tailed as the hypothesis of this permutation test is unidirectional.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Avar et al report on the development of a high-throughput method to screen modifiers of prion replication in cell lines using a genome-wide siRNA library. They identified a number of hits and further studied one candidate, the ribonucleoprotein Hnrnpk. The authors convincingly show the interest of their method. However, the claims that the ribonucleoprotein Hnrnpk impact prion propagation need to be more quantitatively and statistically substantiated.

      1. * A large part of the manuscript is dedicated to the validation of the high-throughput assay (called QUIPPER). QUIPPER is made in 384-plates and provides great technological improvement. It works with different prion-permissive cell lines and different prion strains. QUIPPER is an antibody-FRET-based assay that detects a specific population of PrPSc that resists phospholipase C (PIPLC) treatment. Historically, PIPLC has been shown to cleave cell surface PrPC while preserving PrPSc (which is endocytic or inaccessible). I would recommend that the authors quantify the proportion of PIPLC-resistant PrPSc (PrPPIPLC) versus total PrPSc in their different models. First, PrPPIPLC proportion may be cell and strain dependent. Second and most importantly, as siRNA effects are studied using PrPPIPLC as readout, it is crucial to know if this form is a bona fide surrogate of PrPSc and infectivity or only a specific, subcellular, potentially minor form of PrPSc. This is particularly important as the effects of Hnrnpk knock-down in QUIPPER and western blot sounds discordant; in QUIPPER, the effects are strong (> 5-fold) while by western blot, the effects are much more modest (We addressed this issue in several ways; firstly, we quantified the proportion of PIPLC-resistant PrP (PrPPLC) versus PrPSc in two different models (Fig. 1B and D). Secondly, we directly compared residual infectivity of cells treated with PK or PIPLC (Figure 1C), using the standard scrapie cell assay. The results show that infectivity is retained upon PIPLC treatment. In addition, we assessed the 161 hits obtained via QUIPPER using PrPSc as a readout (Fig. 3B).

      To provide further data on the robustness of our PIPLC-based readout, we have performed western blotting of infected and uninfected cells upon PIPLC treatment and assessed the band patterns following PIPLC administration. This Figure is now incorporated in the manuscript as Supp. Fig. 1C and demonstrates that upon PIPLC digestion of NBH and RML infected CAD5 and GT-1/7 cells, PrP is barely detectable in the non-infected cells, while it is in the prion infected ones. The blots also show that the PIPLC-resistant PrP (PrPPLC) is resistant to PK digestion. These new data, together with those provided in Fig. 1B and Figure 1C, show that PrPPLC is equivalent to PrPSc in terms of PK resistance and infectivity.

      The reviewer pointed out a discordance between Western Blotting and QUIPPER. Although it is not clearly stated, we think the reviewer may be suggesting a discordance based on Fig. 3D. We would like to point out that Fig. 3D does not report fold changes as the reviewer is suggesting, but Z-scores, measured by standard deviations from the mean, not allowing to infer fold-changes. We quantified the effect of NT and HNRNPK targeting siRNAs on prion levels (Fig. 4A) and saw a three-fold change. We believe that the quantifications provided in the new version of the manuscript alleviate the concerns regarding any discordance.

      Technically, this is quite easy as it necessitates, after PIPLC treatment, the quantification of PrPSc in the supernatant versus PrPSc in the cell pellet. In Fig. 1C, the authors show that PrPPIPLC is infectious in a cell-scrapie assay. Using this approach, they could also quantify the infectivity of these species relative to the total infectivity content.

      We addressed this in Supplementary Fig. 1C as depicted above. Supplementary Fig. 1C shows the alikeness of the PrP species measured via the QUIPPER vs. the canonical PK digestion: upon digestion with PIPLC following a PK treatment, we detect PrPSc. Therefore, the experiment demonstrates that PrPPLC is alike in nature to PrPSc. The difference between the PK digested (lanes 3&4) vs PIPLC treated then PK digested lanes (lanes 7&8) is the PrPSc that is released into the media following PIPLC digestion.

      • *

      • The authors identified a list of prion modifiers candidate. Surprisingly, the authors did not perform a pathways analysis to identify potential pathways that could impact prion propagation.*

      Despite extensive efforts, there were no pathways that were enriched in our 40 hits, which is mentioned in the discussion part of the manuscript. Two analyses (for the 161 candidates and 40 hits) are now added to Supplementary Fig. 3C and pasted below.

      • *

      • The authors then studied in more details one hit, the ribonucleoprotein Hnrnpk. They studied the impact of Hnrnpk knock-down on PrPC and PrPres levels in different cell lines. These data (Fig 4 and Fig S4) lack quantitative (on a higher number of wells) and statistical analyses. The western blot that are shown suggest that PrPC levels are slightly increased by the siRNA and that the increase in PrPres levels is modest, barely significant given the western blot method. Same comment after PSA treatment, at least in PG127-infected hovS cells.*

      We performed a quantification on the western blots for all figures mentioned by the reviewers throughout the manuscript. These are incorporated to the manuscript for the figures: Fig. 4A, Fig. 4B, Supplementary Fig. 4A, Supplementary Fig. 4C, Supplementary Fig. 4D, Supplementary Fig. 4F, Supplementary Fig. 4G.

      Additionally, statistical analyses have been incorporated into the manuscript in these figures: Fig. 4C, Fig. 4D, Fig. 4E, Fig, 4F, Fig, 4G, Fig, 4H, Supplementary Fig. 4F. The analyses and the quantitative data demonstrate the effect of Hnrnpk downregulation and PSA treatment on prion levels to be significant. Moreover, we also addressed the regulation of prions via HNRNPK using vacuoles as a read-out as well as with a different mode of regulating HNRNPK expression using shRNAs. All these results, point to HNRNPK as a true modulator of PrPSc.

      In Figure 4A and B, the use of POM1 and/or POM2 to detect PrPC / PrPres is confusing. POM2 is supposed to detect mostly full-length PrPC (Fig 4A top panel), but more than 3 glycoforms are detected. In Fig 4B, POM1 is used for PrPC but because it has a central epitope, it detects both PrPC and PrPSc.

      Both antibodies are able to recognize both PrPC and PrPSc as it has been shown in many publications from the Aguzzi lab as well as other labs in the field. https://pubmed.ncbi.nlm.nih.gov/19060956/

      Note also in Fig 4B, that DMSO alone seems to impact PrPC levels in PG127-infected hovS cells. This advocates again for a more quantitative analysis.

      We have quantified the western blots using the DMSO control as standard value. As DMSO was used to dilute PSA, this should take into account potential effects coming from DMSO (Fig. 4D, Fig. 4F, Fig. 4H and Supplementary Fig. 4F).

      • Psammaplysene A (PSA) is a pharmacological Hnrnpk binder. The authors used this molecule to further demonstrate that Hnrnpk is involved in prion propagation. I disagree with the author's conclusion that "PSA effect does seem to be limited when HNRNPK shRNAs are applied". In Fig S4D, 1µM PSA seems do decrease PrPres levels at similar levels whether the shRNA is applied or not. Again quantification and statistical analyses from several independent experiments would help supporting the authors conclusions.*

      We assessed this point carefully by quantification of the western blots (Fig. 4H) and providing statistical data (Student’s t-test) from three experiments. As we see a threefold lower decrease of prions with and without Hnrnpk regulation when PSA is present, we concluded that the effect we see from PSA should be arising through Hnrnpk. However, we cannot conclusively delineate the effect of PSA, because Hnrnpk ablation is not possible due to essentiality of Hnnrpk. This has now been added to the discussion portion of our manuscript.

      • The authors finally tested PSA on organotypic brain slices (in that case, they provide statistical results) and on flies infected with ovine PG137 prions. PSA administration significantly reduced the locomotor deficits prion-infected flies. The authors quantified the effects of PSA on prion accumulation in flies. Because the overall levels were not detectable by immunoblot, they used a cell-free assay termed RT-QuIC to address prion seeding activity in fly heads. I have specific comments about these experiments:
      • Maybe I missed it, but I could not find which recombinant PrP is used in RT-QuIC assay.*

      This information is provided in the M&M section of the manuscript at hand. The relevant section on P25 reads, where HaPrP23-231 refers to hamster PrP:

      The reaction buffer of the RT-QuIC consisted of 1 mM EDTA (Life Technologies), 10 μM thioflavin T, 170 mM NaCl, and 1× PBS (incl. 130 mM NaCl) and HaPrP23-231 filtered using 100-kD centrifugal filters (Pall Nanosep OD100C34) at a concentration of 0.1 mg/ml.

      In addition, we added this information to the main text as well.

      - This is important as recombinant PrP self-polymerize after a period of time and here the authors have left the RT-QuIC assay running for unusually long period of times (RT-QuIC are stopped after 24h-48h).

      For prions, long RT-QuIC experiments are often performed (also see: https://pubmed.ncbi.nlm.nih.gov/32598380/, https://journals.asm.org/doi/10.1128/mBio.02451-14, https://www.nature.com/articles/s41598-021-84527-9, https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3458796/ and others).

      In addition, this is controlled for in all experiments performed in the lab, as the prion-negative sample containing the same RT-QuIC substrate does not become positive after the entire duration of the assay (Fig. 5D).

      - Instead of titrating prion seeding activity by endpoint titration, the authors quantified PSA activity by measuring the effect on another parameter of the RT-QuIC, the length of the lag phase before the conversion reaction is visible. While this is an interesting criterion, reduction of seeding activity must be shown to unequivocally demonstrate that PSA has delayed prion pathogenesis in flies.

      Based on the data presented in the manuscript, we assessed prion pathogenesis in flies using a well-established climbing assay, demonstrating that treatment with PSA significantly improves locomotor behavior, which has been shown to be directly linked to prion levels and is known to have even greater sensitivity then the traditional mouse bioassay (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5998032/, https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6113635/, https://link.springer.com/article/10.1007/s00441-022-03586-0).. The RT-QuIC represented here represents itself as a secondary read-out to the climbing assay, for which Lag-time quantification is used routinely (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3893511/, https://www.nature.com/articles/s41598-017-10922-w, https://journals.asm.org/doi/10.1128/mBio.02451-14, https://www.nature.com/articles/s41598-021-87295-8). Our results effectively highlight the overlap between the complementary read-outs.

      - Can the authors exclude any interfering effect of PSA on the RT-QuIC reaction, given the amount of material used to seed the reaction (1:20 diluted head homogenates)?

      We do not know how much PSA has reached the Drosophila brain, therefore, the experiment suggested by the reviewer cannot be tied to a 1:20 dilution. However, the concern of the reviewer is valid, and we therefore performed a spiking experiment of a prion positive sample using 1uM PSA (the highest amount used to treat cells, for which we saw a strong prion-reducing effect). We did not see an interference in the RT-QuIC signal due to PSA in the reaction. This has been incorporated into Figure 5D.

      • could the authors comment on the fact that HNRNPK knock-out is not possible and that their siRNA and shRNA are not affecting the cell viability?*

      To select hits during the screen process, we apply a viability filter, excluding siRNAs that reduce viability by more than 50% when compared to the non-targeting control siRNA (Supplementary Fig. 1F). For GT-1/7 cells we do not see any effect on viability of siRNA treatment after 96h. However, as downregulation of HNRNPK worsens the cytopathological vacuolation in the hovS model, as shown in Supp. Fig 4A, we do see an effect on cell fitness using both siRNA as well as shRNA. In addition, as knocking down HNRNPK will not lead to its complete loss, the remaining levels might be enough to sustain viability. Moreover, the longest knockdown experiment we performed is 7 days, we cannot exclude that longer exposure would have an impact on viability, but this question is not in the scope of the paper.

      • In the discussion the authors do not discuss how Hnrnpk could impact prion propagation. This may deserve a comment as this protein is present in the nucleus. As PrPC has been also identified in this compartment, can this specific form be involved in prion pathogenesis?*

      We additionally elaborated on potential ways of how Hnrnpk might impact prion propagation in the discussion, which includes potential nuclear PrPSc as well as with regards to our data obtained from the sequencing efforts shown in Fig. 4I. In addition, we investigated some functional targets of Hnrnpk how they are affected by PSA, which is now added to Supp. Fig 4G.

      Reviewer #1 (Significance (Required)):

      The QUIPPER method is a great conceptual and technological approach that could be applied to genome-wide analyses and screening for therapeutic molecules.

      * The study will interest a general audience interested in neurodegenerative diseases linked to protein misfolding. There are commonalities in pathways and modifiers of the conversion. Further PrP has emerged as a receptor for alpha-synuclein (Parkinson disease) and A-beta peptides (Alzheimer's disease).

      Expertise key words: prion diseases - prion pathogenesis in cell models*

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Prions are protein-based infectious agents that underlie neurodegenerative disease. For prion diseases (e.g., mad cow disease), the infectious agent is the cellular prion protein (PrPc). It exists in a normal conformation and carries out its normal cellular function. However, when it becomes misfolded and aggregates it can adopt an altered conformation, referred to as the prion conformation, or PrPSc. PrPSc aggregates can template the conversion of other PrPc molecules into the PrPSc form. In this way the prions can propagate from one cell to the next and throughout an organism. Prion diseases are truly devastating and identifying ways of stopping prion propagation is of great interest. In this manuscript by Aguzzi and colleagues, the authors designed a way to screen for prion propagation modifiers in mammalian cells. They built a highly sensitive readout of PrPSc propagation and adapted it to a 384-well plate format in adherent cells. They then used this to perform a genomewide siRNA screen, looking for genes that increased or decreased PrPSc propagation when knocked down.

      * They identified nearly 1,200 modulators of prion propagation and then subjected them to various validations and filtering to focus on only those hits that affected PrPSc but not PrPc (though hits that affect levels of PrPc could certainly be interesting). All this led to 40 genes (20 that increased and 20 that decreased prion propagation.*

      * Among these 40, the authors focused on one hit, hnRNPK, an essential RNA-binding protein with diverse cellular functions. They provide evidence that reducing levels of hnRNPK leads to increase prion levels.*

      * They next move to a marine compound called Psammaplysene A (PSA), which had previously been shown to have some neuroprotective properties and to be able to bind to hnRNPK. Because of the latter observation, the authors test if PSA can affect prion levels. They show that indeed treatment of their cell line prion infection model, or an organotypic slice model, or a fly model with PSA is sufficient to decrease prion levels.*

      * The authors propose that PSA works to reduce prion levels by increasing the activity of hnRNPK and that this also implies a role of RNA (because hnRNPK is an RNA-binding protein) in prion propagation. * In a nutshell, in my opinion the design and execution of this genomewide screen is ingenious and has yielded a treasure trove of potential prion modifiers. The ability to distinguish between modifiers of Prpc and PrpSc is super powerful. However, the follow-up and focus on hnRNPK and its connections (which seem tenuous) to the marine compound PSA are incomplete and raise more questions than answers. In its present form, it is hard to assess the potential significance of hnRNPK in prion propagation. I have some comments and suggestions for the authors to consider.

      * 1.To my eye, Fig. 4A looks like Hnrnpk siRNA leads to slightly increased levels of PrPc (detected with POM2 antibody) and this could explain the increase in PrPSc levels. Can the authors assess Prnp RNA levels and the effects of their siRNAs on Prnp expression? It would also be useful to provide quantification of immunoblots if possible.*

      We quantified the western blots as mentioned in our response to reviewer 1. The quantifications are now provided for figures: Fig. 4A and Supplementary Fig. 4A, showing that the increase in prion levels is much stronger than that of PrPC. These confirm the results from the screen as seen in Fig. 3D. In addition, we would again like to point out that the use of shRNAs to knockdown HNRNPK did not yield the increase in PrPC levels aforementioned, as evident by Supplementary Fig. 4D which demonstrates a decrease of PrPC, despite increasing PrPSc levels. Moreover, we show quantification of RNA levels upon downregulation of Hnrnpk and with PSA, which show that downregulation of Hnrnpk via siRNAs indeed increases Prnp mRNA levels and that PSA does not change RNA levels of neither Hnrnpk nor Prnp (Fig. 4C).

      • In Supplemental Fig. 4B it also looks like knocking down Hnrnpk results in decreased PrPc levels in this experiment and its not clear how robust the increase in PrPSc levels are. Quantification of these experiments, if possible, would be helpful.*

      Please see response above. We now provide quantification to all western blots.

      • The authors treat with PSA, which is supposed to bind to Hnrnpk. They state that this treatment does not affect PrPc levels but to my eye Supplemental Fig. 4C looks like highest doses of PSA cause a decrease in PrPc levels. Quantification of the immunoblots would also be useful here.*

      Please see response above. We now provide quantification to all western blots and added a sentence to the manuscript.

      • The authors use Hnrnpk knockdown along with PSA to test if the effects of PSA depend on Hnrnpk. They see PSA decreases PrPSc levels and that this is, to my eye, only slightly attenuated by Hnrnpk reduction. I interpret these results slightly different than the authors. To me, it seems that this result indicates that PSA's effects are (mostly) independent of Hnrnpk.*

      Addressed in point 4 from reviewer one.

      • In the original paper identifying PSA and hnRNPK physical interaction, RNA-binding was important. In the authors' assays, does Hnrnpk's effect on prions depend on RNA-binding? Specific mutations to the RNA-binding domains can be made to assess this.*

      This is a very interesting point. We did try to obtain data to support this claim, however, due to the essentiality as well as tight control of Hnrnpk expression, we were not able to express different forms of Hnrnpk and acquire conclusive data. Therefore, it is currently being pursued how Hnrnpk might affect prion propagation in the scope of another publication.

      • The genetic interaction in the vacuolation phenotype between Prnp and Hnrnpk that the authors report is very interesting (Supplemental Fig. 4A). It seems like this system and phenotype could be useful for the authors in exploring mechanisms by which HnrnpK is functioning.*

      • *

      We absolutely agree to the reviewer’s comment. As mentioned above a second publication is under way to investigate the mechanisms of Hnrnpk’s antiprion function, which is not in the scope of this study.

      • The authors propose that PSA increases activity of Hnrnpk but does it change any Hnrnpk RNA targets from their RNA sequencing? Some functional readout of Hnrnpk function would be useful here to test this hypothesis.*

      Although we do suspect RNA binding has an important role in the anti-prion function of Hnrnpk, we cannot exclude other modalities which Hnrnpk might be function through, such as DNA binding and protein-protein interactions. Therefore, to answer this question, a considerable effort that explores each of the potential of these modalities with regards to the anti-prion function of Hnrnpk would be needed. This extensive effort, however, is out of the scope of the manuscript at hand. However, we investigated the effect of PSA on some known functional targets of Hnrnpk (as suggested by the reviewer) from our sequencing efforts and added this analysis as Supplementary Fig. 4H to the manuscript. These results suggest that PSA leads to an increase of the expression of DNA targets of Hnrnpk, potentially suggesting a modality of action. Moreover, we amended the discussion with regards to potential pathways that might be yielding the effect seen as evidenced by the RNAseq data.

      • In the Introduction, the authors mention two yeast papers in introducing the concept of using unicellular model organisms to perform modifier screens. The first paper (Outeiro and Lindquist, 2003) is a classic but does not contain a yeast screen. The other one does include a loss of function screen in yeast (for polyQ toxicity modifiers) but those results seems to be due to loss of the [RNQ+] prion from certain deletion strains instead of from specific roles of modifier genes, so that paper might not be the best exemplar of yeast modifier screens.*

      We sincerely thank the reviewer for their careful readthrough of the manuscript, the portion that refers to the manuscripts as screens was amended and two new citations for appropriate yeast screens were added to the manuscript.

      • The authors asked if any of their hits from their screen had human genetics connections to neurodegeneration. They mention one of their hits Dock3 right after saying that no hit reached statistical significance after multiple testing corrections. This seems a bit misleading since any time one makes a list of anything there will always be, by definition, one at the top of the list.*

      We amended the wording to improve clarity of the manuscript.

      • The authors perform RNA sequencing on prion infected cells that either had Hnrnpk siRNA or PSA and since these two treatments had opposite effects they looked for genes that went in the corresponding directions. They didn't find anything significant when looking for genes downregulated by Hnrnpk siRNA and upregulated by PSA. They did find glucose metabolism genes when looking in the opposite direction. The significance of this finding is unclear and the authors do not expand on it.*

      Addressed in point 7 of reviewers 1 and 2, we expanded the discussion portion of the manuscript with regards to these results.

      • To me, the data with PSA seem more robust than the Hnrnpk data and it seems that the authors are trying to perhaps over-fit them together. It is possible that PSA affects prion levels independent of Hnrnpk function. This would not dampen my enthusiasm at all for this finding and could be of interest to those in the prion field, in which the search for anti-prion compounds is of great interest.*

      Upon statistical analysis of the result in Fig 4H, we see a three-fold decrease of PSA activity upon HNRNPK downregulation, suggesting PSA activity might be linked to HNRNPK. However, the reviewers point is well taken and we emphasized the value of understanding the function of PSA or mimicry of its effect as potential therapy in the future.

      ***Cross-commenting:**

      All three reviewers seem to appreciate the novelty and impact of the new QUIPPER method the authors have developed to discover modifiers of prion propagation. All three reviewers also seem to be somewhat less convinced by the connection to hnRNPK, including how the compound PSA's anti-prion effects involve hnRNPK (or not).*

      * In my opinion, this manuscript presents important and novel work and a really ingenious new method to study prion propagation, which will be broadly useful to the prion field. I feel that the hnRNPK data could be strengthened, especially with more quantitative analyses. The PSA treatment data are compelling but it seems that the effects might be independent of hnRNPK and that the authors are trying to force a connection which might not be there.*

      * Reviewer #2 (Significance (Required)):*

      * *** Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate. ****

      I have expertise in neurodegenerative disease, protein misfolding, yeast modifier screens, CRISPR modifier screens in human cells, and RNA-binding proteins. I have general knowledge about prions, including PrP, but I am not a prion expert.*

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The authors conducted an arrayed RNAi-based genome-wide high-throughput screening of all protein-coding modifier genes that affect prion propagation in cultured cells (murine and human cell lines) using a novel quantitative high throughput QUIPPER assay that they developed. They identified 1191 genes, of which 40 selectively affect PrPSc. Half of the 40 genes seem to inhibit PrPSc (limiter) whereas the other half do the opposite (stabilizers). One of the strong limiters is Hnrnpk, is an essential small heterogeneous nuclear ribonucleoprotein that has been implicated in a few protein misfolding diseases. The biological relevance of the findings is demonstrated by the detection of previously reported modifier genes as well as thorough verification of Hnrnpk as an effective prion limiter that seems to be independent of the two prion strains or host species (mouse and human cell lines as well as Drosophila).

      * The manuscript is very well written, the approach is novel, very well verified, and effective, the data are solid, and the main conclusions convincing.*

      * Two issues need to be discussed.*

      * Major comments:*

      * First, some genes encoding proteins involved in PrP processing, such as ADAM10 and ADAM8, are known to affect PrPC levels, but they are not among the modifier genes identified. Based on Table 2, ADAM8 expression is very low in the GT-1/7 cells. This points to one of the caveats of the RNAi screening approach in that potential roles of low expressing genes in the cell lines used could be missed. Although it is beyond the scope of this manuscript, it would be helpful to add discussions on complimentary screening enhancing gene expression and the use of more cell lines that will allow identification of more modifiers.*

      We thank the reviewer for their concern. The point regarding the screen being less sensitive for genes that are low-expressed in the cell line in question is valid. Upon advancing of the CRISPR-based technologies and the improvement of these technologies to be used in combination with prions, we see their value. We added a sentence to the discussion, talking about gene activation as a future alternative to perform a complimentary screen.

      Second, the statement that PSA's anti-prion effect potentially arises through enhancing the activity of HNRNPK makes sense, but it is also possible that PSA can directly inhibit prion replication as well. It would be helpful to calculate the percentage of reduction in PrPSc by PSA treatment and the percentages compared between shNT and shHNRNK cells.

      We thank the reviewer for the careful read through of the manuscript. The point was addressed for reviewer 1 point 4. In addition, if PSA is added to the RT-QuIC, it does not prevent aggregate formation, indicating that PSA is unlikely to directly inhibit prion replication, but rather depends on a cellular host-intrinsic molecule for its activity. However, we also elaborate more on the possibility of potential other mechanisms for Hnrnpk and PSA’s function on regulating prion levels in the discussion section of our manuscript.

      Minor comments:

      * First, Figure 1C shows that the relative intensity for RML CAD5 cell lysate infected cells is less than with PIPLC treated or PK treated, which seems to be the opposite of what is expected, because PIPLC or PK treatment should not increase infectivity. Please explain.*

      We agree with the reviewer that the results were surprising. For the practicality of the screen, we wanted to show that the treatment does not eliminate the infectious species, which we were able to demonstrate. However, the increase of infectivity could stem from many different factors, e.g. the amount of duration of PK treatment might not harm but instead rather expose the infectious species, or PIPLC might remove cell surface molecules that could prevent infection of cells. However, as there are a plethora of possible scenarios and it was not relevant for the study at hand, we did not go into further detail.

      Second, in Fig S1 e, the labels are too small to read. In Fig 3D, it would be easier to match the stabilizer or limiter genes with the corresponding Z score dots if the genes with a negative Z scores are labelled on the left side while genes with positive Z scores be labelled on the right side.

      We amended the figures as per the reviewer’s suggestion.

      Third, The following sentence on page 11 is confusing: "20 out of these 40 candidates reduce prion propagation upon silencing, and 20 candidates enhanced prion propagation, and henceforward are called stabilizers or limiters, respectively (Fig. 3D-E, Supplementary Table 1)." Did the author mean to say "....and 20 candidates enhanced prion propagation upon silencing, and hence..."?

      We reworded the sentence according to the reviewer’s comment.

      * Fourth, In the subheading "Hnrnpk expression limits of prion propagation in mouse and human cells", "of" should be deleted.*

      We addressed this in the main manuscript file.

      ***Cross-commenting:**

      I agree with Reviewer #2's assessment that more quantification will be helpful and the link between the effect of PSA treatment and hnRNPK can be strengthened. I want to stress that the knockdown data clearly shows the involvement of hnRNPK as a prion limiter in cultured cells. The question on PSA does affect the interpretation of the ex vivo and in vivo data.*

      * The blot in Fig. S4c seems to show some decrease in PrPC levels in NBH-treated GT-1/7 cells. This blot needs to be quantified to confirm whether the PrPC level is changed by PSA treatments. Whether PSA directly inhibits prion replication can be relatively easily assessed in RT-QuIC reactions. Alternative to the use of PSA, RNAi-mediated hnRNPK knockdown can also be done on cultured tissue slices or in brain, but this will require a lot more time and efforts and may be too much to ask for in this manuscript.*

      Quantifications for blots were added throughout the manuscript and the text was amended accordingly, and all the points mentioned have been addressed throughout this response letter.

      Reviewer #3 (Significance (Required)):

      * The findings are novel and very significant. They identified a large number of modifier genes, and established a solid foundation for future studies on prion modifier genes to study prion replication and pathogenesis and for novel therapies against prions and potentially some other protein misfolding diseases. HNRNPK seems to be good target for therapeutic intervention and PSA may be a good candidate for prion treatment. The novel QUIPPER assay can be used to screen for anti-prion compounds and potentially adapted to study other misfolding proteins associated with cells.*

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      All the Reviewer’s comments are reproduced below, with our responses interspersed in [[brackets]]. Citations from the revised manuscript are included in “quotation marks”. The website accepts input only as plain text. Consequently, we had to transform the mathematical expressions into plain text. We apologize for the reduced readability.

      Reviewer #1

      1) The authors state that: "the conductance density mediated by the expression of the mutant was 2.5 times smaller than the wild type, although we transfected the same amount of plasmid DNA (Fig. 2E). Assuming that protein expression is independent of the mutation, the observation suggested that the unitary proton flux ratio RC of wild type to mutant channel was equal to 2.5" (lines 82‐85).

      Macroscopic conductance (G) depends on channel number (N), microscopic or unitary conductance (γ), and open probability (PO) by G=N γ PO. The authors assume that the level of WT and D174A mutant protein expression on plasma membrane, which determines N, are equal; however, this critical assumption does not appear to have been tested.

      The fact that conductance density (nS/pF) is plotted in Fig. 2E does not alter this caveat because this procedure normalizes the data only for cell surface area (i.e., size). The authors' conclude that "The conductance density relationship (Fig. 2E) compares the maximal conduction of both constructs; this is the fully open channel (open probability ≈ 1)"(lines 87‐88). However, neither raw currents nor G‐V data are shown. Typically, currents measured at large, near‐saturating PO are used to compare the relative conductances of WT and mutant ion channels. The currents shown in Fig. 2A and 2B exhibit prominent 'droop' at even modest depolarizing potentials (+10 mV for D174A and +30 mV for WT), indicating that the proton gradient has been substantially perturbed by the flow of ge depolarizing voltages needed to drive channels to near‐maximal PO. Furthermore, there is no evidence that maximal PO itself is also not different in WT and D174A channels. Indeed, maximal PO for native Hv1 channels measured using variance analysis is reported by significantly smaller than 1.0, and assuming that PO = 1.0 for either WT or D174A is therefore not well supported. Maximal could be altered by the D174A mutation, which has a clear and strong effect on channel gating evidenced by the large (‐70 mV) negative shift in threshold potential reported both here and previously in the literature. Effects of mutations on maximal PO due to altered gating behavior could be separate and distinct from any change in plasma membrane channel number (N). 3 Lastly, because D174A channels have a much higher PO than WT at 0 mV, the mutant will necessarily conduct inward proton currents at the physiological resting membrane potential (RMP) in tsa‐201 cells (perhaps ‐30 mV?). Inwardly directed proton currents will therefore cause intracellular acidification under resting conditions.

      The constitutive acid load in cells expressing D174A, but not WT, is likely to have a variety of physiological consequences, including decreased protein expression or plasma membrane targeting of D174A. There is evidence that another constitutively open Hv1 mutant (R205H) also generates smaller currents macroscopic conductance than WT, and this phenomenon is likely to result from decreased cell surface expression. To conclude that the microscopic conductances of WT and D174A are unequal, the authors must demonstrate that N is not different. The authors' conclusion that D174A "conducts protons at a lower rate" (line 89) is therefore not well supported by the experimental data.

      [[

      We toned down our conclusions from the experiments to accommodate the reviewer's criticism: (page 4): " Consequently, the mutant channel is nearly fully open (Fig. 2D), readily seen when the membrane potential is 0 mV and external voltage is absent. The high open probability of the D174 mutant under symmetrical pH conditions is readily seen in the tail current amplitude reaching a quasi-saturation (Fig. 2A). The resulting outward currents have a higher amplitude in the wild-type (Fig. 2A+B). Interestingly, the conductance density mediated by the expression of the mutant was 2.5 times smaller than the wild type, although we transfected the same amount of plasmid DNA (Fig. 2E). Our observation suggests a reduced flux through the mutant if we assume that protein abundance in the plasma membrane is independent of the mutation."]]

      2) The authors indirectly measure apparent proton flux rates (λD) in LUVs containing WT and D174A mutant Hv1 channels using a fluorescence‐based approach, and conclude that λD is 2.4 times smaller for D174A than WT. However, the method for estimating λD is not performed under voltage clamp, and the driving force for proton current is neither known nor measured.

      [[

      The reviewer is mistaken. The method for estimating λD is performed under voltage clamp, and the driving force for proton current is known.

      Page 6: “To obtain λD, we encapsulated c_k^i=150 mM KCl in the HV1 containing large unilamellar vesicles (LUVs) and exposed these vesicles to a buffer with a K+ concentration c_k^o= 3 mM. The addition of valinomycin facilitated K+ efflux, thereby inducing a membrane potential, ψ. ψ constituted the driving force for H+ uptake. It can be calculated according to the Goldman equation:

      ψ = -RT/F ln ((c_k^i+(P_H/P_K ) c_H^i)/(c_k^o+(P_H/P_K ) c_H^o ))

      (1)

      The ratio of the HV1 mediated proton permeability P_H to the valinomycin-mediated potassium permeability P_K is always smaller than 0.04. We base our conclusion on the observation that the CCCP-mediated proton permeability represents an upper limit for P_H since CCCP always induces a faster vesicular proton uptake than HV1 (Fig. 3). Accordingly, the maximum value of P_H/P_K can be estimated as the ratio of valinomycin to CCCP conductivities. The respective values are equal to 1.6 10-3 Ω-1 cm2 [1] and 4 10-6 Ω-1 cm-2 [2]. At pH 7.5, we find c_H^o=10^(-7.5) M, i.e., c_k^o ≫ (P_H/P_K )c_H^o. Similarily, c_k^I ≫ (P_H/P_K ) c_H^i for a broad range of intravesicular pH. With these simplifications, Eq. 1 transforms into the Nernst equation yielding:

      ψ = -RT/F ln (c_k^i)/(c_k^o )=-100 mV

      (2)

      ψ of such size may decrease intravesicular pH by nearly two units. Such acidification does not violate c_k^i ≫ (P_H/P_K ) c_H^i so that ψ remains constant throughout the experiment. That is, the vesicle experiments proceed under voltage clamp conditions. The simple explanation is that, due to the small proton concentration and the limited buffer capacity, the K+ conductance exceeds H+ conductance under all conditions. The conclusion is in line with simulations (32), confirming that the membrane potential is driven very near the Nernst potential for K+.”]]

      The authors state that "Transmembrane voltage constituted the driving force for proton uptake into LUVs (Figure M). It resulted from facilitated K+ efflux out of the vesicles (30)", (lines 261‐262), but this voltage is unknown and not likely to equal the Nernst equilibrium potential for K+ once Hv1 channels begin to open.

      [[

      The reviewer is mistaken. The voltage is known (see the equations above). The opening of the HV1 channels does not alter the potential because c_k^o ≫ (P_H/P_K ) c_H^o and c_k^i ≫ (P_H/P_K ) c_H^i for a broad range of intravesicular pH (see above).]]

      Once Hv1 channels begin to open, intra‐lumenal pH (pHi) will necessarily occur during the experiment. Such changes are likely exacerbated by a) the low proton buffering capacity of the system (5 mM HEPES) and b) the absence of any counter‐charge pathway to balance the effect of proton charge movement on the membrane potential.

      [[

      Vesicle acidification occurs. It signifies the presence of functional proton channels. Nevertheless, the membrane potential does not change (see Equation 1 above). The statement b) is not correct because the outward K+ movement counters the inward-directed proton charge movement.]]

      Given the small volume of LUVs, even a relatively modest difference in either membrane potential or pHi could substantially alter the driving force for proton movement. Together, these factors are highly likely to result in a rapid and potentially large change in the driving force for proton flux.

      [[

      As outlined above, membrane potential stays invariant. Vesicle acidification changes the driving force for proton flux. The steady state is reached when the electrochemical potentials for protons on the two sides of the membrane are equal to each other.]]

      Driving force changes may also be different for WT and D174A because their relative PO may be different under the experimental conditions used here. Because D174A activates at much more negative voltages, it is likely to open more quickly and to a higher PO than WT at early times after depolarization is initiated by addition of valinomycin (Fig. 3A). This fact will likely result in a larger initial inward current being carried by D174A than WT channels. The result would be a more rapid acidification of LUVs by D174A.

      [[

      The reviewer is mistaken. Assuming a transport rate of 20,000 potassium ions per second (G. Stark, B. Ketterer, R. Benz and P. Läuger; Biophys. J. 1971 Vol. 11 Pages 981-981) and a membrane capacity of 1 μF cm-2, it takes valinomycin about 10 ms to drive the vesicular potential to near Nernst values. Activation of the proton channel is at least 10 times slower. Thus, both mutant channel and wild type channel may open at roughly the same instant. The driving force is sufficient to open both channels to the same probability.]]

      The experimental data in Fig. 3A are consistent with the expectation that the proton gradient and driving force more rapidly approach equilibrium for D174A than WT channels: the apparent rate of AMCA fluorescence change is slower in D174A. Although the authors correctly interpret the experimental data to mean that the apparent λD is slower for D174A, they do not rule out the artifactual explanation for the measured differences. Indeed, the observation in Fig. 3A that AMCA fluorescence change eventually reaches a plateau and is not affected by CCCP means that the proton gradient has become exhausted during the experiment, and directly demonstrates that the proton driving force is uncontrolled under the current experimental conditions.

      [[

      The reviewer's interpretation of our results is flawed. Instead of becoming exhausted, the proton gradient builds up during the experiment. Initially, extravesicular and intravesicular pH values are equal to each other. Valinomycin-mediated K+ efflux results in a membrane potential that drives Hv1-mediated H+ influx.

      Page 8: “The number NC of reconstituted HV1 dimers per vesicle determines the acidification rate λ, i.e., the time that elapses before reaching the steady state. The final intraluminal pH is independent of NC. Similarly, CCCP addition in the steady state does not change the intraluminal pH of HV1-containing vesicles. But CCCP will affect the intraluminal pH of vesicles deprived of HV1 since H+ background permeability is too small to allow vesicle acidification within the time allotted for the experiment. Consequently, only HV1-free vesicles will acidify upon CCCP addition. That is, CCCP addition allows estimating the fraction of vesicles deprived of HV1.”]]

      In contrast to the authors' statement that "Our experiments with the purified and reconstituted channels corroborated the conclusion (Fig. 3A)", (lines 92‐93) it is not clear that unitary proton flux rates/unitary conductances are actually different in WT and D174A.

      [[

      The reviewer is mistaken. Since we measured under voltage clamp conditions, ensured rapid installment of the membrane potential, and selected a potential large enough to allow for the same open probability of wild-type and mutant channels, the measured transport rates, λ, are valid. Moreover, we determined the number of HV1 channels per vesicle and thus calculated the transport rate of an individual channel, λD. Since λD is different for WT and D174A, the unitary proton flux rates/unitary conductances are actually different in the wild type and mutant.]]

      3) The presumed differences in unitary conductances (i.e., 'transport rate') between WT and D174A are used to estimate Arrhenius activation energies (Ea): ("The difference in measures transport rates allows a rough estimation of the Arrhenius 128 activation energy Ea for HV1‐mediated proton flow. It amounts to 40 kJ/mol for the wild type and 23 kJ for the mutant. Thus, Ea exceeds the corresponding 15 kJ/mol barrier measured for gramicidin A (32, 33)", (lines 128‐130). The method for determining Ea in the current work is not well‐described. In Ref. 32, the authors estimate Arrhenius activation energy (Ea = 20 kJ/mol) for gramicidin D (not gramicidin A) from the slope of a line fit to measurements of currents at various temperatures. Here, the authors measure AMCA fluorescence decay rates at 4 °C and 23 °C and observe a similar temperature‐dependent difference in WT and D174A (Fig. S2). Given that the data indicate that WT and D174A are similarly temperature‐dependent, it is unclear how the authors arrive at different Ea values. The authors' conclusion that "The increment in Ea suggests that the transport mechanism may be different from a pure Grotthuss type, where the proton uses an uninterrupted water wire to cross the membrane", (lines 131‐133) therefore does not appear to be well‐supported.

      [[

      We removed both the calculation and discussion of activation energies. Knowledge and discussion of activation energies distract from the scope of the manuscript. We show the experiments at different temperatures solely to demonstrate that Hv1 and D174A facilitate proton transport at a decreased temperature where the background conductivity of the lipid bilayer to water is small.]]

      4) The authors report no difference in water permeability in WT vs. D174A (Fig. 5 and S1) and interpret the results to mean that proton currents are not associated with measurable bulk water flow. A similar conclusion was reached for native Hv1 channels using deuterium substitution (DeCoursey & Cherny, 1997).

      [[

      The comment of the reviewer is misleading:

      • Equal water permeabilities of WT and D174A would not exclude an association between proton currents and water flow. Accordingly, our manuscript does not contain the stipulated interpretation.
      • DeCoursey & Cherny (1997) did not evaluate bulk water flow through proton channels. They compared D+ and H+ currents across the plasma membrane of rat alveolar epithelial cells. Page 2: “Comparing deuterium ion and proton currents through the plasma membrane of rat alveolar epithelial cells, DeCoursey & Cherny (22) found an isotope effect exceeding that for hydrogen bond cleavage in bulk water. It suggested the involvement of an amino acid side chain in proton conduction (22). Alternatively, altered properties of confined water could have been responsible for the higher isotope effect.”]]

      However, the absence of bulk water flow does not itself rule out the possibility that 'trapped' waters within the Hv1 pore do not themselves carry the measured proton current. If intra‐pore water molecules are tethered by hydrogen bonds with protein atoms, they may not move when Hv1 channels open.

      [[

      The reviewer’s comment contains one misinterpretation and one unfounded statement:

      1. We never stated that 'trapped' waters within the Hv1 pore do not themselves carry the measured proton current. On the contrary, we envisioned the trapped waters delivering the protons to one or more titratable amino acid side chains and accepting the protons from them.
      2. The reviewer’s view that intra‐pore water molecules tethered by hydrogen bonds with protein atoms may not move when Hv1 channels open is a misconception. Page 12 bottom: “The contrasting opinion that instead of a channel obstruction hydrogen bonds may immobilize the pore water (19) is not convincing. First, the lifetime of a hydrogen bond is in the ps range while HV1’s mean open time exceeds 100 ms (41). Thus, hydrogen bonds may break more than 1011 times during the open state, rendering them unfit for tethering intraluminal water molecules. Second, the effect of hydrogen bonds between water molecules and pore residues is limited to decreased water mobility in narrow channels (23). Their number, NH, allows for predicting pf (26). Specifically, every H-bond donating or receiving pore-lining residue contributes an average increment ΔΔG╪ of 0.1 kcal/mol to the Gibbs free energy of activation ΔG╪ (24). Equation (1) allows the calculation of ΔG╪:

      ΔG╪= N_H ΔΔG╪ + ΔΔG╪_i (13)

      where ΔΔG╪_i = 2 kcal/mol (24). Since N_H = 6 (Fig. S1) in the open HV1 conformation, Eq. 1 predicts ΔG╪ = 2.6 kcal/mol. Eq. (2) allows calculating HV1’s pf from this value (42):

      p_f = v_0 v_w exp(-ΔG╪/RT) (14)

      where vw = 3 × 10−23 cm3 is the volume of one water molecule and ν0 is the universal attempt frequency, ν0 = kB∙T/h ≈ 6.2 × 1012 s−1 at room temperature (kB is Boltzmann’s and h is Planck’s constant).”]]

      Proton transfer through a hydrogen‐bonded network of waters requires only that the electronic structure of the network be rearranged during proton transfer; water is not required. As in the previous study (DeCoursey & Cherny, 1997), the lack of water flux reported here demonstrates seems to reinforce the notion that H+ moves separately from its waters of hydration (i.e., hydronium, H3O+, is not the permeant species) and does not necessarily imply information about the mechanism of proton transfer (i.e., side chain ionization vs. Grotthuss‐type transfer in a water‐wire).

      [[

      The reviewer is mixing two unrelated issues. Of course, proton transport may be separated from mass transfer. Yet, charge transfer may or may not include one or several titratable amino acid side chains. If proton side chain ionization is not involved in proton transfer, a water wire must exist that connects the aqueous solutions on both sides of the membrane. In this case, an osmotic gradient will drive water molecules through the open channel. Since we did not observe such water flux, we conclude that the water wire is interrupted by at least one side chain. Thus, our experiments imply information about the mechanism of proton transfer.]]

      The authors state that: 1) "every H‐bond donating or receiving pore‐lining residue would have contributed an increment ΔΔ𝐺‡ of 0.1 kcal/mol to the Gibbs free energy of activation Δ𝐺‡ (25)" (lines 145‐147), and 2) calculating NH from this Δ𝐺‡ allows estimation of the channel's unitary water permeability (Eqn. 2). Although hydrogen bonding patterns will undoubtedly alter the free energy for channel activation, this is not the same free energy change as that for proton transfer.

      [[

      The reviewer's remark is in line with the previous and the current versions of our manuscript.]]

      Hv1 gating involves conformational changes that are both voltage and Δ pH-dependent, and the D174A mutation is known to alter the voltage dependence of gating (Fig. 2 and previous studies). The effect of D174A on Hv1 unitary conductance, however, is speculated but not unambiguous (see above).

      [[

      Our experiments unambiguously demonstrate the effect of D174A on Hv1 unitary conductance. The interpretation of the experiments is straightforward – there is no speculation involved. The contrasting opinion of the reviewer rests on his misinterpretations of (i) our measurements of proton transport rate λD for wild-type and mutant (see above) and the CCCP-effect (see above).]]

      In the absence of definitive experimental data showing differences in the unitary conductance of WT vs. D174A, the authors' assumption that water permeability would be strongly temperature‐dependent (lines 154‐160) seems premature and their ensuing conclusion tenuous: "pore residues interrupt the HV1 spanning water wire, trapping the water molecules inside the HV1 channel. In contrast to water, protons cross the pore by hopping from one acidic residue to another through one or more bridging water molecules (Fig. 6)" (lines 161‐164).

      [[

      The reviewer chooses to misinterpret our lines. We did not assert that water permeability through the Hv1 channel would be strongly temperature‐dependent. We referred to the well-known fact that there is a strong temperature dependence of lipid bilayer water permeability - in contrast to the tiny effect of temperature on the water permeation across aqueous channels.

      Page 11, bottom: “Considering the stark dependence of the activation energy for background water flow across lipid bilayers (24), we repeated the experiments at a decreased temperature of 4°C. Thanks to the low background water permeability at 4°C, even tiny contributions of HV1 to Pf should be detectable. Yet, the channels did not contribute to the water flow through the vesicular membrane even though channel water permeability but weakly depends on temperature (24).”]]

      Furthermore, the authors calculate the number of hydrogen bonds (NH) that pore waters could form with pore lining residues based on an X‐ray structure of a chimeric proton channel protein (pdb: 3WKV) that is: a) manifests discontinuous transmembrane water density and is known to represent a non‐conductive conformation, b) contains residues from Ci‐VSP in the critical S2‐S3 linker that form part of the proton transfer pathway, and c) exhibits structural features (i.e., highly conserved ionizable residues such as D185 and R205, which like D174 are reported to dramatically alter Hv1 gating, are packed into a solvent‐free crevice) that are inconsistent with physiological function. Given that all Hv1 ionizable mutant combinations tested so far (the sole exception of D112V ‐ other nonionizable substitutions at D112 are tolerated) remain functional (Musset, Smith et al., 2011, Ramsey, Mokrab et al., 2010), the identities of water‐interacting residues speculative.

      [[

      We substituted the X‐ray structure of the chimeric proton channel protein for the AlphaFold structure. We now provide views of the open and closed conformations in the Supplement based on the homology structure (13). Microsecond-long molecular dynamics simulations have optimized the latter.

      The experimental observation of mutants’ functionality (with the sole exception of D112V) supports our view that proton transfer occurs through a hydrogen‐bonded network of waters that is only once (at D112) interrupted by an amino acid side chain. The nature of the amino acids interacting with the proton transferring water molecules is of little importance.]]

      Interpreting differences in the calculated NH based on pdb: 3WKV therefore seems unlikely to reveal fundamentally important insights into Hv1 function. The author's conclusion that "The observation rules out the formation of an uninterrupted water chain spanning the open channel from the aqueous solution at one side of the membrane to the other. NH would have governed water mobility if such a water wire had formed (24)", (lines 143‐145) therefore does not appear to be strongly supported.

      [[

      We did not base our conclusion of an obstructed water pathway on the analysis of structural models. In contrast, the conclusion is the result of our experiments. The structural models permitted the prediction of the expected water permeability. Depending on the model and the channel conformation, we find NH values between six and 16. All of these NH values translate into water permeabilities exceeding gramicidin’s water permeability. Thus, we would have been able to detect the water flux through an unobstructed proton channel.]]

      Reviewer #2:

      Summary: Voltage‐gated proton channels are peculiar members of the voltage‐gated ion channel family due to their absence of canonical pore. Instead, protons permeate through their voltage‐sensing domain. The mechanisms of proton permeation in Hv1 channels are still unclear, with currently two competing hypotheses: (i) hopping through titrable residues within the protein; or (ii) via Grotthuss mechanism involving proton jumping through a continuous water wire. So far, these hypotheses were only tackled by computation. The authors therefore aimed to experimentally test the two hypotheses. To do so, the authors measured the transport rates of protons and water through wild‐type and mutant D174A Hv1 reconstituted in lipid vesicles. Overall, the presented data are convincing and support their conclusion that proton conduction through the channel is not solely mediated by water transport. However, there are several aspects of the paper that I did not understand and would require clarification.

      [[

      We thank the reviewer for the positive evaluation.]]

      Major comments: My major concern is about the relevance of using the D174A mutant. The authors explain at the beginning of the paper that Hv1‐D174A is open at 0 mV, which allows measuring proton flux in systems in which voltage cannot be controlled. However, it seems from the proton flux experiments that wild‐type Hv1 can conduct protons perfectly well in the used experimental paradigm. So why test a mutant? It is actually not clear why wild‐type Hv1 can conduct protons in the proton conduction assay.

      [[

      We introduced the D174A mutation to measure water flux in a setting where the membrane potential is zero. We only performed the proton flux measurements to show that our reconstituted HV1 channels are functional. HV1 can conduct protons because we establish a transmembrane potential in the proton conduction assay. That is, only initially, extravesicular and intravesicular pH values are equal. Valinomycin addition results in a K+ efflux that, in turn, generates a membrane potential. This potential drives the HV1-mediated H+ influx.]]

      The authors should clearly state the trans‐membrane potential created by the K+ gradient across the vesicle, as well as the pH inside and outside the vesicle, and related these conditions to their electrophysiology data to give us an idea of the open probability of wild‐type Hv1 in the conditions used in the proton conduction assays. This is critical to be able to compare the relative rates of proton transport between the wild‐type and the mutant.

      [[Page 6, bottom:

      " ...we encapsulated c_k^i=150 mM KCl in the HV1 containing large unilamellar vesicles (LUVs) and exposed these vesicles to a buffer with a K+ concentration c_k^o= 3 mM. The addition of valinomycin facilitated K+ efflux, thereby inducing a membrane potential, ψ. ψ constituted the driving force for H+ uptake. It can be calculated according to the Goldman equation:

      ψ = -RT/F ln ((c_k^i+(P_H/P_K ) c_H^i)/(c_k^o+(P_H/P_K ) c_H^o ))

      (1)

      The ratio of the HV1 mediated proton permeability P_H to the valinomycin-mediated potassium permeability P_K is always smaller than 0.04. We base our conclusion on the observation that the CCCP-mediated proton permeability represents an upper limit for P_H since CCCP always induces a faster vesicular proton uptake than HV1 (Fig. 3). Accordingly, the maximum value of P_H/P_K can be estimated as the ratio of valinomycin to CCCP conductivities. The respective values are equal to 1.6 10-3 Ω-1 cm2 [1] and 4 10-6 Ω-1 cm-2 [2]. At pH 7.5, we find c_H^o=10^(-7.5) M, i.e., c_k^o ≫ (P_H/P_K )c_H^o. Similarily, c_k^I ≫ (P_H/P_K ) c_H^i for a broad range of intravesicular pH. With these simplifications, Eq. 1 transforms into the Nernst equation yielding:

      ψ = -RT/F ln (c_k^i)/(c_k^o )=-100 mV

      (2)

      ψ of such size may decrease intravesicular pH by nearly two units.

      Such acidification does not violate so that remains constant throughout the experiment. That is, the vesicle experiments proceed under voltage clamp conditions. The simple explanation is that, due to the small proton concentration and the limited buffer capacity, the K+ conductance exceeds H+ conductance under all conditions. The conclusion is in line with simulations (32), confirming that the membrane potential is driven very near the Nernst potential for K+.”]]

      Similarly, the buffers and pH used for the water transport assay are not explicitly mentioned. Are they the same as for the proton transport assay or are the buffers inside and outside the vesicle symmetrical?

      [[

      We added the information about buffers and pH used to the legend. Except for 150 mM sucrose, the internal and external solutions were identical: 150 mM KCl, 5 mM HEPES (pH 7.5), and 0.5 mM EGTA.]]

      Finally, in the introduction the authors base their assumptions about water transport on an X‐ray structure of Hv1 in a closed conformation (3WKV). I do not think it is relevant to study permeation, which in theory should only happen in an open state. If the authors want to make assumptions about the number of hydrogen bonds in the pore and how many water molecules are in the pore (and I don't think they need to do it), they should rather base their assumptions on the computational models of Hv1 open state.

      [[

      We thank the reviewer for the advice. We added a figure to the Supplement. It shows Hv1 models from long-timescale molecular dynamics simulations (Geragotelis et al, Proc Natl Acad Sci U S A 2020 Vol. 117 Issue 24 Pages 13490-13498). The open structure reveals NH=6. We used this value for our calculations.]]

      Minor comments:

      1) Figure 6: the authors should precise that the model of proton conduction through Hv1 is just an assumption. The structural features of Hv1 open state are indeed unknown.

      [[We modified the figure based on the simulation results of Geragotelis et al. We indicated in the legend that the scheme is based on HV1 homology models.]]

      2) Page 9, lines 170‐171 "Drastically prolonged tail current kinetics might reflect a decreased voltage‐dependence of the deactivation in the D174 mutant". Or rather the prolonged kinetics reflect the stabilization of the open state by the mutation (as stated by the authors just after).

      [[Page 14:

      “Drastically prolonged tail current kinetics might reflect (i) a decreased voltage dependence of the deactivation in the D174A mutant or (ii) a stabilized open state (14).”]]

      3) Supplementary figures are displayed in an odd fashion. Figure S3 should be placed before Figures S1 and S2.

      [[We added two more Supplementary Figures and displayed them in the order of text mentionings.]]

      4) In Figure 2, displaying the current trace corresponding to the 0 mV voltage step would improve readability of the figure, by showing that Hv1‐D174A mutants conduct protons at 0 mV and not wt Hv1.

      [[

      We show the current trace corresponding to the 0 mV voltage step for the D174A mutant in panel A and the trace for the wild-type in panel B of Fig. 2.]]

      5) Figure 2 legend "Pronounced inward H+ currents activate negatively to the reversal potential (here ‐70 mV)". I think the authors mean "Here 0 mV", ‐70 mV is the threshold potential. Panel (c), I guess the EH vs Vrev plot is for D174A mutants but it is not mentioned in the legend

      [[

      We corrected the legend. “Pronounced inward H+ currents activate negatively (here – 70 mV) to reversal potential (here – 8 mV), indicating a high open probability of the D174A mutant at 0 mV.” And “Comparison of calculated Nernst potential for protons (EH) and measured reversal potential (Vrev) for the D174A mutant.”]]

      6) Page 4, line 89: the fact that D174A conducts protons at a lower rate is, at this point, based on a lot on assumption. I would just correct the last sentence by saying "Thus, D174A, while opening with less depolarization, seems to conduct protons at a lower rate"

      [[We toned down our statement and inserted a phrase very close to the one suggested.

      Page 5: “Our observation suggests a reduced flux through the mutant if we assume that the protein expression level is independent of the mutation.”]]

      7) Page 6, line 107. The word "therefore" is not necessary

      [[ok]]

      8) Page 7, line 128: "of" in "measures of transport" is missing

      [[We deleted the paragraph.]]

      9) Page 12, lines 261‐262: "Figure M" ??

      [[“Inset of Figure 3A”]]

      CROSS‐CONSULTATION COMMENTS I agree with the two other reviewer's comments. I think our reviews more or less raise the same weaknesses in the study.

      Significance

      This paper addresses a single question with a clearly defined experimental paradigm. Once the issues addressed, the paper should bring important significance to the field of voltage‐gated ion channels since the nature of proton conduction in Hv1 was not known. It could help explain ion conduction in some channelopathies involving ion conduction through the voltage‐sensing domain. The audience is mainly the voltage‐gated ion channel community, as well as the community of membrane permeation mechanisms My field of expertise is in ion channel structure‐function and pharmacology. I have little expertise in the described proton and water flow assays. Therefore I do not have sufficient expertise to evaluate the detailed experimental protocol that led to the measurements.

      Reviewer #3:

      Summary: This study addresses a fundamental question about the mechanism of proton conduction in the voltage gated proton channel Hv1 i.e., whether protons hop through an uninterrupted water wire, or move by other means involving titratable channel residues. The authors argue that an uninterrupted water wire entails a certain rate of water movement through the open channel, which they estimate to be around 10‐12 cm3s‐1 based on a structural model of Hv1 and previous work on other channels. They then measure water permeability of LUVs containing a purified Hv1 mutant expected to be open at 0 mV via light scattering, and proton flux using a pH sensitive fluorescent dye. They calculate a water permeability much lower than predicted and conclude that the water in the conduction pathway does not form an uninterrupted water wire. The manuscript is written clearly, and the experimental measurements are convincing.

      [[We thank the reviewer for the positive evaluation.]]

      There are nonetheless some ambiguities in the way the formation of water wires is discussed.

      Major comments: A protein like Hv1 is larger and more complex than small peptides like gramicidin. In this context, transient water wires, frequently interrupted by titratable residues, or by steric hindrance from hydrophobic sidechains etc. are likely. Can the authors provide an estimate for the maximum frequency and lifetime of uninterrupted proton wires compatible with their measurements? This would be helpful to evaluate whether short‐lived uninterrupted water wires could contribute significantly to proton conduction or not. Trapping usually implies restricted movement. So, for how long do water molecules need to stay inside the channel in order to be considered trapped? Are the water molecules really trapped or simply forming broken wires?

      [[Page 13, bottom:

      “The question arises whether the obstacle in the water pathway is permanent. HV1’s titratable residues or steric hindrance from fluctuating sidechains may frequently interrupt otherwise intact water wires. Yet, our calculations (Eqs. 7 – 11) show that proton diffusion from the bulk solution to the pore mouth is the transport limiting step. Undoubtedly, transient closure would have caused a detectable pore resistance because part of the protons arriving at the pore mouth could not enter the pore. If the pore was closed longer than one ps, an arriving H+ may diffuse out of the capture zone and vanish into the bulk:

      t_c=(r_0^2)/6D = 10^(-16)/(6 × 8.65 × 10^(-5) ) s = 2 × 10^(-13) s

      (16)

      where tc denotes the time a proton requires to diffuse a distance equal to the capture radius r0. Since transient closures would give rise to experimentally undetected pore resistance, they must be ruled out. The observation agrees well with noise experiments, where Lorentzian time constants, albeit smaller than the time constants for H+ current activation but larger than 0.1 s were observed (41).

      We provided the calculations showing the diffusion limitations on page 9:

      “…we show that the transport limiting step is H+ diffusion to the pore (access resistance) and not transport through the pore. Therefore, we first calculate the maximum current Imax permitted by diffusion for a constantly open pore (35):

      I_max=2π F r_o D_H c_H

      (7)

      where F, r0, DH, and cH are Faraday's constant, the capture radius, the H+ diffusion constant, and the H+ concentration, respectively. The only unknown parameter is r0. Taking the gA estimate r0 = 0.87 Å (36), disregarding buffer effects and assuming DH = 8.65×105 cm2s-1, we find:

      I_max=2π (9.6 ×10^4 As)/mol × 0.87 × 10^(-8) cm × 8.65 x 10^(-5) (cm^2 s^(-1) × 4 × 10^(-7.5) mol)/(1000 cm^3 )

      (8)

      I_max=5.6 × 10^(-17) A

      (9)

      Eq. 8 considers that the approximately 25 % charged lipids in the bilayer induce an increase in surface proton concentration, i.e. it accounts for a surface potential of roughly -40 mV in 150 mM salt. The maximal unitary rate would then be equal to:

      q_max = 5.6 × 10^(-17) C/s/1.6 × 10^(-19) C =348 s^(-1)

      (10)

      Here we used the r0 value determined for gA (36). Acidic moieties at the entrance of HV1 and proton surface migration along the lipid bilayer could serve to increase that value (37, 38). The observation suggests transport limitations by poor proton availability. Calculation of the channel resistance, Rch (35), confirms the hypothesis:

      R_ch = R_pore+R_access =[l_ch+(π a_ch)/2] ρ/(π a_ch^2 )

      (11)

      where R_pore is the resistance of the pore proper and R_access is the access resistance. Assuming a channel radius, a_ch, of 0.15 nm, a length, l_ch of 4 nm and solution resistivity (H+ as the sole conducted ion at bulk pH of 7.5 and a surface potential of -40 mV), ρ, of 2×105 Ω cm, we find R_ch = 4×1013 Ω. Thus, the resulting current, Iρ, that we may expect for the vesicular membrane potential of 100 mV is equal to 3×10-15 A. Accordingly, Iρ exceeds Imax by more than one order of magnitude. Consequently, we may safely conclude that HV1 conductance is limited by proton availability under our conditions. ”]]

      The main conclusion of the paper rests on the negative results from the water permeability assay of Fig. 5. It is recommended to include a positive control (e.g., with gramicidin A), run under the same conditions and similar number of channels per LUV, to show how the results should look like in case of significant water permeability.

      [[We included the gramicidin measurements (Fig. 6) as requested.]]

      Figure 6 show a simplified scheme of proton transport with trapped water molecules in Hv1. Panel A represents a resting state (nonconductive); panel B represents an open state (conductive), favored by the D174A mutation. So, what makes B conductive and A nonconductive? Is it the presence of two salt bridges in B vs. three salt bridges in A? This should be clarified.

      [[

      We modified the figure based on the simulation results of Geragotelis et al. We indicate with arrows the parts of the channel where the proton is free to move and crosses the sites with insurmountable energy barriers.

      Legend to the figure (now Fig. 8): “In the region of the selectivity filter adjacent to D112, the channel is too narrow to let water molecules pass (see also Fig. S1). Yet, the proton may bypass the electrostatic barrier of the open channel at D112 (18), i.e., jump between the two neighboring water molecules. Removal of D174 shifts the voltage sensitivity so that most channels are already open at a transmembrane potential of 0 mV. B) The closed channel. It neither allows water nor proton transport. In its new location, D112 provides an insurmountable electrostatic barrier to proton passage.”]]

      Minor comments: The interpretation of Fig. 2E strongly depends on the assumption that the D174A mutation does not alter membrane trafficking. It is recommended to check the validity of this assumption, e.g., by colocalization with a plasma membrane marker. Images of SDS‐PAGE results for the studied Hv1 proteins should be provided to show preparation purity.

      [[

      We toned down the interpretation of Fig. 2E. As it stands now, Fig. 2 shows that the mutant (i) is functional and (ii) has a high open probability at 0 mV. These conclusions are independent on membrane trafficking. We included images of SDS page results for the studied HV1 proteins in the Supplement.]]

      CROSS‐CONSULTATION COMMENTS I agree with the comments from the other two reviewers. My major point is that refuting major water permeability in Hv1 is not the same thing as refuting that protons can be conducted by transient water wires, unless it is proved that the transient water wires cannot sustain enough proton movement to account for the single channel conductance. Reviewer #3 (Significance (Required)): The Hv1 channel plays important roles in the human body, including the immune, respiratory, and reproductive systems. Despite recent advances in understanding the mechanism of proton conduction by Hv1, whether or not protons hop within a continuous water wire in the open channel is a subject of debate (DeCoursey J. Physiol. 2017, Bennett & Ramsey J. Physiol. 2017). This work provides important insights on the debate by refuting the existence of a water wire that can sustain large water permeability. The findings reported here will be of interest to ion channel biophysicist like this reviewer, but also to biologists studying cellular pH homeostasis and the pathophysiology of Hv1.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      August 17, 2022

      RE: Review Commons Refereed Preprint #RC-2022-01442

      Dear Editor of the EMBO Journal,

      Please find our updated manuscript and response to the reviewers’ comments. We appreciate the effort that the reviewers have put into the evaluation of our manuscript.

      We are happy with the potential importance the reviewers realise in the study:

      Reviewer 1: The finding that ubiquitination occurs inside mitochondria would be an important conceptual advance, which would open new perspectives both for ubiquitination and mitochondrial biology

      Reviewer 2: This work would represent a significant/exceptional discovery if supported by compelling data.

      Reviewer 3: the results are interesting and very important, as mentioned in the major comments section…

      With regard to the major comments raised by the reviewers, you will find below our specific response point by point with explanations and suggested novel experiments (highlighted in yellow). In summary we suggest the following actions to fully support our model:

      • We will perform a-complementation with ubiquitin (lacking the GG motif) fused at its C-terminus to the short fragment of b-galactosidase (a). Blue colonies with ωm will indicate import.
      • As shown in Figure S2, now added to the manuscript, we show detection of ubiquitinated proteins and mono ubiquitin in extracts of mitochondria pre-treated with trypsin.
      • A bio-archives address of our other manuscript will be provided.
      • The use of a-complementation for protein localization was developed by us 15 years ago and since then has been used by us and other groups verifying its use as a screening tool. One point is clear, ωm or ωc do not leak into other subcellular compartments. Nevertheless, in the research of specific genes validation is important. Yes!!! ωm and ωc are exclusively located in mitochondria or the cytosol respectively.
      • We will highly purify mitochondria on gradients and treat them with protease.
      • We cannot be sure that we will be able to detect a protein with ubiquitin modifying activity which functions solely on certain proteins in mitochondria, so publication cannot rely on this.
      • Repeat mass spectrometry with careful editing will be undertaken as suggested by the reviewer.
      • We will attempt to perform protease protection assays in the presence of specific detergents.

      Before tackling the very tough revision, we would like to know if EMBO Journal would positively consider acceptance of our manuscript based on the review and planned revision.

      Prof. Ophry Pines Microbiology & Molecular Genetics Hebrew University of Jerusalem Jerusalem 91220 Israel


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this manuscript, Zhang et al. investigate whether ubiquitination occurs inside mitochondria of the budding yeast S. cerevisiae. They first observe thanks to a sensitive complementation assay that several components of the yeast ubiquitination (and deubiquitination) machinery can localize inside mitochondria. To be able to specifically probe ubiquitin conjugates assembled inside mitochondria they fused HA-tagged ubiquitin to a mitochondrial targeting sequence. Using this construct, they demonstrate that ubiquitin conjugates can be assembled in mitochondria. A series of elegant experiments demonstrates that the pattern of ubiquitin conjugates depends on the mitochondrial localization and the activity of the ubiquitin conjugating enzyme Rad6. Altogether, these results convincingly demonstrate that ubiquitination can occur inside yeast mitochondria when ubiquitin is intentionally targeted inside this organelle. It however remains unclear whether mitochondrial ubiquitination occurs in endogenous conditions (without targeting ubiquitin into this compartment) and whether it affects mitochondrial functions.

      Response: Regarding the question whether mitochondrial ubiquitination occurs in endogenous conditions, we feel that this is obvious based on our results. We detect numerous ubiquitination related enzymes (E1, E2, E3, DUB) eclipsed in mitochondria but none of the proteasome subunits. As pointed out by the reviewer “these results convincingly demonstrate that ubiquitination can occur inside yeast mitochondria”. With that said, additional data will be incorporated into the manuscript as suggested by the reviewer and can be seen below.

      Major comments:

      1) The materials and methods section is lacking important information (western blot protocol, details of antibodies, strains, plasmids...). It is thus difficult to evaluate how several experiments were performed and how their design (e.g. the promoters chosen to express tagged proteins) could impact the interpretation of the results. This is a major issue that needs to be corrected. The main text should also explicitly indicate whether tagged proteins used in the alpha-complementation assay are overexpressed or not.

      Response: The materials and methods section will be updated accordingly.

      2) Despite the previous comment, the data presented in the manuscript convincingly demonstrate that multiple components of the ubiquitination machinery can localize within mitochondria and that ubiquitin conjugates can be assembled in mitochondria when ubiquitin is modified to be intentionally targeted into this compartment. However, little data is shown to support the hypothesis that ubiquitin conjugates can be assembled in mitochondria when ubiquitin is not fused to a mitochondrial targeting sequence. Thus, in my opinion, the evidences presented in the current manuscript are not sufficient to conclude that ubiquitin conjugates are assembled in mitochondria in endogenous conditions (as this is done implicitly). Additional evidences are needed to draw this conclusion (see some experimental suggestions hereafter). Without further evidences, the speculative aspects of the claim that "ubiquitination occurs in the mitochondrial matrix" should be discussed explicitly.

      Response: See the discussion above why we are confident that ubiquitination occurs in mitochondria. Our major problem with ubiquitin and the ubiquitination enzymes is that they are eclipsed in mitochondria. We propose as suggested by the reviewer (item 4 of his review) to perform a-complementation with ubiquitin fused at its C-terminus to the short fragment of b-galactosidase (a). Blue colonies with ωm will indicate import.

      3) The authors used a mass spectrometry approach to identify mitochondrial ubiquitination substrates. However, they have not yet succeeded in identifying a substrate whose modification is specifically regulated by a given component of the mitochondrial ubiquitination machinery. They have also not identified a phenotype or process impacted by mitochondrial ubiquitination. Thus, at this stage, the biological consequences of mitochondrial ubiquitination remain elusive.

      __Response: __We have not identified a substrate whose modification is dependent on a given component of the mitochondrial ubiquitination machinery, even though we have tried. Again, the problem is low levels of these proteins eclipsed in mitochondria. Even when we do find a protein that is ubiquitinated (e.g. Aco1) its ubiquitination is not exclusively dependent on Rad6. Thus, different ubiquitin enzymes may have the same substrates.

      4) The authors have not directly investigated whether ubiquitin itself (without a mitochondrial targeting sequence) localizes in mitochondria. I encourage them to address this question since it would provide an important piece of evidence suggesting that mitochondrial ubiquitination can occur in endogenous conditions. This could be done using the alpha-complementation assay and the results could be presented within Figure 1. Ideally this experiment should be performed without overexpressing ubiquitin. Note that if the authors decide to use a C-terminally tagged form of ubiquitin for this experiment, the GG motif of ubiquitin should be mutated to avoid cleavage of the alpha tag by cellular DUBs. This form of ubiquitin will not be conjugatable, but this is not an issue for this experiment since its aim is to determine whether ubiquitin can be targeted to mitochondria, not to probe conjugates.

      Response: We will perform experiments as suggested by the reviewer including ubiquitin fused at its C-terminus to the short fragment of b-galactosidase (a), see item 2. We have previously made a PreSu9-Ubi lacking a GG motif but now will look at a different combination of this and other constructs.

      5) In the top panels of Figure 2 and S1, free ubiquitin is well detectable in the total and cytosolic fractions. It is however not clear to me whether it is also detectable in the concentrated mitochondrial fraction. If yes and if it would be resistant to trypsin digestion, it would provide additional evidence that endogenous ubiquitin can be targeted to the mitochondrial matrix (see previous comment).

      Response: See Item 6.

      6) The data shown in the top panel of Figure 2 and S1 also suggest that free ubiquitin is less concentrated in mitochondria than in the cytosol (since it is more difficult to detect in the concentrated mitochondrial fraction than in the cytosolic fraction, see previous comment). It is thus possible that the use of preSu9-HA-Ubi (or preFum1-HA-Ubi) lead to an artificially high intra-mitochondrial concentration of free ubiquitin. As the concentration of free ubiquitin is known to impact ubiquitination processes, I encourage the authors to compare the relative levels of free ubiquitin present in the mitochondrial fraction prepared from WT and preSu9-HA-Ubi (or preFum1-HA-Ubi) expressing cells. If free ubiquitin is detectable in mitochondrial fractions and resistant to trypsin (see previous comment), this could be done by repeating the experiment shown in Figure 3B and probing the blot with an antibody that recognizes free ubiquitin.

      Response to 5 and 6: Detection of ubiquitin in mitochondria is extremely difficult even when mitochondria are 15-fold concentrated versus the cytosol and when HA-Ubi is overexpressed. Thus, ubiquitin is eclipsed in mitochondria. Nevertheless, as shown in the Figure below which was not part of the submitted manuscript yet was performed in parallel to experiments done early on, shows detection of very weak bands of free ubiquitin in extracts of mitochondria pre-treated with trypsin.

      Endogenous ubiquitination pattern in mitochondria of _Δrad6 _cells is restored to normal by Rad6-α. __WT or Δrad6 cells containing a Rad6-α construct or an empty plasmid were subjected to subcellular fractionation. Mitochondrial fractions with or without trypsin treatment, were probed for ubiquitin by WB. Aco1 is a matrix mitochondrial protein, and Tom70 is a mitochondrial outer membrane protein (MOM) facing the cytosol.

      7) I strongly encourage the authors to provide more data indicating that "ubiquitination occurs in mitochondria" by performing experiments that do not rely on the use of the preSu9-HA-Ubi or other forms of ubiquitin that are intentionally targeted to mitochondria. For instance, they could analyse the pattern of HA-Ubi conjugates of trypsin digested mitochondrial fractions prepared from wt, rad6-delta, and rad6-delta complemented with preSu9-Rad6-alpha-SL17. Note that if trypsin digested mitochondrial fractions are too contaminated by ubiquitinated proteins present outside mitochondria to perform this experiment, the authors may use the unspecific DUB Usp2 as an alternative protease to strip ubiquitinated proteins from the mitochondria periphery.

      Response: Concentrated mitochondrial extracts from WT and Δrad6 cells untreated or treated with trypsin were probed with anti-ubiquitin antibodies (Figure above). A very weak band corresponding to free ubiquitin can be detected in extracts of mitochondria treated with trypsin but these are very weak and are on the limit of detection.

      Minor comments:

      1) Overall, the manuscript is well organized and easy to follow. The text is clearly written; the figures are well annotated.

      2) The authors should provide full images of all the blots with anti-ubiquitin and anti-HA antibodies so that one can see the bands corresponding to free ubiquitin (or free HA-Ubi). For instance, in Figure 3B, it is not possible to see the presence (or absence) of the band corresponding to free HA-Ubi because the very bottom of the image is cut.

      3) The authors should indicate whether the MTS of Su9 (and Fum1) are expected to be cleaved after import of preSu9-HA-Ubi (and preFum1-HA-Ubi) in mitochondria. They should also label on the corresponding immunoblots the presence (or absence) of the band corresponding to the free preSu9-HA-Ubi (and preFum1-HA-Ubi) (or HA-Ubi if the MTS is expected to be cleaved from these constructs).

      4) In Figure 3B, the ubiquitin conjugates produced with preSu9-HA-Ubi and preFum1-HA-Ubi have different migration patterns. I think this should be explicitly mentioned and discussed. Could it be due to the presence of lysine residues in the Su9 or Fum1 MTS that could lead to the assembly of artificial ubiquitin chains?

      5) The authors indicate that "endogenous Rad6 [...] is expressed at very low levels and can hardly be detected in the mitochondrial fraction by WB (Figure S5)". I did not manage to observe the band corresponding to endogenous Rad6 in the mitochondrial fraction in the pdf. The authors should provide a more contrasted or better quality image.

      CROSS-CONSULTATION COMMENTS I agree with reviewer 2 that proper validation of the complementation assay is crucial for this manuscript. I was myself wondering whether it uses endogenously tagged proteins or whether it is based on an overexpression system. I imagine this information will be detailed in the manuscript in preparation mentioned by the authors. I am therefore wondering whether it would be possible to ask the authors to provide the draft of this manuscript (or at least the validation part).

      Response: A bio-archives address of our other manuscript will be provided upon resubmission. See other issues referred to the response Reviewer 2.

      I agree with most comments of reviewer 3. Regarding the hypothesis that preSu9-HA-Ubi could form aggregates on the cytosolic surface of the mitochondria, I think that the results presented on Figure 7B rather argue against it (since they indicate that Rad6 localized inside mitochondria can restore the pattern of ubiquitin conjugates). That's why (in my opinion) the major question the author now need to adress is whether intra-mitochondrial ubiquitination occurs in endogenous conditions (ie without forcing ubiquitin into this compartment and without E2 or E3 overexpression).

      Response: See response to the other reviewers

      Reviewer #1 (Significance (Required)):

      The finding that ubiquitination occurs inside mitochondria would be an important conceptual advance, which would open new perspectives both for ubiquitination and mitochondrial biology research. However, the significance of the current manuscript is limited because the presented evidences heavily rely on the use of artificial conditions (ubiquitin tagged with a mitochondrial-targeting sequence) that may trigger irrelevant ubiquitination events. The significance would be much higher if the authors would provide further evidences indicating that intra-mitochondrial ubiquitination occurs in endogenous conditions and/or if they had identified a mitochondrial process specifically impacted by mitochondrial ubiquitination.

      Expertise of the reviewer: Ubiquitination, Yeast biology, protein-protein interactions. No specific expertise in mitochondrial biology

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In the manuscript by Yu et al., the authors test the concept that certain proteins are unevenly distributed within distinct cell compartments. Due to this localization discrepancy, protein detection in some subcellular compartments can be "eclipsed" by a predominant subset of specific protein localizing in another cell compartment their actual distribution. Therefore, tiny amounts of physiologically relevant proteins could be biologically relevant. Still, their function in some locations can be overlooked (or eclipsed) because of the high expression level of the same protein in another subcellular compartment(s). Although, this concept is not particularly novel. For example, it is already known that many different proteins can localize to distinct cellular locations (e.g., permanent mitochondrial and peroxisomal localization of many proteins or transient localization of particular proteins to separate cell compartments). The authors apply a yeast system and an α-complementation assay to test further the role of such eclipsed proteins in mitochondrial biology. Specifically, they focus on the ubiquitin (Ub, or as abbreviated incorrectly in this manuscript; Ubi) conjugation pathway, components of which have never been convincingly shown to localize inside the mitochondria. This work proposes that certain ubiquitination events can occur inside yeast mitochondria. This work would represent a significant/exceptional discovery if supported by compelling data. However, the major problem with this work is that the conclusions are based on the ectopic expression of distinct proteins. This approach is not failproof in precise protein expression/delivery to the specific subcellular locations and is likely to result in a non-specific localization. Thus, the problem of eclipsed proteins is addressed by the methodology that may lead to the artificial generation of eclipsed overexpressed proteins. A more effective approach would be if the authors found a way to study this issue with endogenous proteins. The need for overexpression of mitochondria-targeted ubiquitin makes it challenging to reconcile the physiological role of these fundings. In addition, some critical technical issues and omissions further reduce the potential impact of this work (see Specific comments below). For example, strong evidence of mitochondria fraction purity and additional evidence that all the essential constructs used in this work are not misdirected to a different compartment are needed.

      Response: “Although, this concept is not particularly novel” is a very disappointing remark by the reviewer!! While dual targeting of proteins has been known for many, many years, how widespread the phenomenon was unknown and thought to be negligible. We are leaders for the last 30 years in the field of dual targeting and distribution and in particular distribution of single translation products. We coined the terms “echoforms” and “eclipsed distribution” and developed methods to detect and screen for dual targeting. The concept of eclipsed distribution and in particular eclipsed targeting to mitochondria is very new, and is leading to a novel perception of the mitochondrial proteome (see MS submission). While the reviewer appears to be an expert on ubiquitination, we are experts on dual targeting.

      • Ub was abbreviated incorrectly in this manuscript, Ubi. __Response: __This will be corrected.

      Other comments will be referred to in the response to Specific comments.

      Specific comments 1. The authors should demonstrate beyond doubt that the ω components of their assay (ω-C, which supposedly stays in the cytosol-ONLY and the ω-M component, which seemingly remains in the mitochondria-ONLY) are in the compartment that the authors claim. These two proteins are transfected into yeast cells and overexpressed. Therefore it is possible that they leak to other, not intended, subcellular compartments. The authors assume that ω-M and ω-C are exclusively located either in the mitochondria or the cytosol. However, this should be shown as validation of the assay. The indicated reference from 2005 (Ref.13) and others are irrelevant since assays have variations and are often researcher/lab dependent. This validation is very important since a misallocation of the overexpressed ω-M or ω-C, leaking into other subcellular compartments, may cause misdetection of the α-constructs.

      Response: The use of a-complementation for protein localization was developed by us 15 years ago and since then has been used by us and other groups verifying its use as a screening tool. One point is clear, ωm or ωc do not leak into other subcellular compartments. Nevertheless, in the research of specific genes validation is important. Yes!!! ωm and ωc are exclusively located in mitochondria or the cytosol respectively.

      It is not surprising that Ub conjugates are detected in mitochondrial fractions. It could be due to ubiquitination of the OMM (coming from the cytosol) or perhaps since the subcellular fractions were not pure mitochondria free from contamination (the likely culprit could be the ER). The mitochondrial fractions in this work were obtained by 10,000 g separation between cytosolic and mitochondrial crude fractions. Indeed, these 10,000 g crude fractions are highly impure with membranes from other compartments (i.e., microsomes, lysosomes, and so on). Therefore, more sophisticated purification methods should be used. In addition, the authors should also test these fractions for non-mitochondrial proteins from other membrane organelles.

      Response: We agree with the reviewer and therefore will take the following approaches:

      1. i) We will treat isolated mitochondria with protease in order to remove adhering proteins and digest OMM proteins…… see attached figure.
      2. ii) We will highly purify mitochondria on gradients and this will be straight forward since we are now employing such methods in other projects in the lab. iii) Matrix protein enrichment (by mass spec) is associated with IP for preSu9-HA-Ub conjugates which is three-fold higher than for HA-Ub. In any case the fact that we identify conjugates of proteins not known to be mitochondrial, strongly supports our thesis.

      Figure 2. Coomassie blue staining does not show any signal in the "M" fraction. It can be interpreted that the authors do not get any mitochondria there, and therefore the lack of Ub signal is due to the absence of the protein in the samples. Using the same amount of protein from each fraction would probably reduce the necessity of 15x enrichment.

      Response: The Coomassie blue staining does show a signal in the "M" fraction which is weak yet when a 15x enrichment is run, the protein level by Coomassie blue staining is similar to the cytosolic fraction.

      Figure 3. It is puzzling why the HA-UBQ presence is so strong in the crude mitochondrial fraction, but the preSu9-HA-Ub signal (mito-matrix) is comparatively weak. These data suggest that the crude mito-fraction could be highly contaminated with OTHER membranes. On the other hand, the preSu9-HA-UBQ signal is no more than 1-5% of the total mitochondrial signal. The high enrichment of the HA-Ubi in both cytosols and the mitochondria could indicate the OMM ubiquitination or (again) contamination by other compartments. The constructs with MTS are detected in the mitochondria. However, the localization of tagged MTS-Ubi in a non-targeted compartment (e.g., cytosol) should be excluded by additional exposure times. Because the manuscript talks about eclipsed proteins, this is important.

      Response: The HA-Ub is strong in the mitochondrial fraction, in the absence of trypsin, but is very weak in the presence of the protease indicating that most of the ubiquitinated proteins are externally attached to mitochondria. In contrast, PreSu9-HA-Ub is imported into the mitochondrial matrix and is protected from trypsin. This manuscript refers to “eclipsed in mitochondria” (not the cytosol) and this is true for ubiquitination enzymes as well as for ubiquitin.

      Figure 3C-E. These data indeed suggest that the Ub-conjugates could be formed inside the mitochondria. However, the above-discussed possibility that other than mitochondria compartments co-sediment in the 10,000g fractions makes the data interpretation highly challenging.

      __Response: __We will highly purify mitochondria on gradients and this will be straight forward since we are now employing such methods in other projects in the lab.

      Figure 4. Unsurprisingly, mitochondrial targeting of Ub leads to detecting some co-immunoprecipitating mitochondrial proteins. However, these data do not support the notion that Ub conjugation machinery acts inside the mitochondria and that the target proteins are indeed conjugated with Ub (the interaction with Ub is not equal to being conjugated). At the minimum, the authors should provide a validation that some of the detected mitochondrial matrix proteins are indeed ubiquitinated. To this end, purified mitochondria could be used for the candidate protein IP under denaturing conditions and then blotted for the candidate protein and Ub.

      __Response: __As shown in Table S2 and figure S7, forms of Ilv5, a mitochondrial protein, are ubiquitinated in WT and Drad6 cells. These modified forms of Ilv5 can be eluted from mitochondrial extracts of WT and Drad6 cells. However, the ubiquitination of ilv5 is not dependent or effected by the Drad6 mutation. We cannot be sure that we will be able to detect a protein with ubiquitin modifying activity which functions solely on certain proteins in mitochondria.

      Figure 5. The knock-out of the E2 Rad6 causes a change in the mitochondria ubiquitination pattern. This is an interesting observation, but again it does not prove that the change in the mitochondrial ubiquitination is due to the activity of Rad6 inside of the mitochondria, as opposed to ubiquitination of the OMM proteins or contaminating fractions. One also wonders why overexpression of mitochondria-targeted Ub would be necessary to detect the ubiquitination if this process was physiologically relevant, especially given that detecting endogenous Ub is not challenging. Furthermore, the apparent increase in ubiquitination in E2 mutant cells (Fig. 5) should also be addressed in more detail. Finally, data from one WB is shown, and quantification of several independent experiments should also be provided.

      __Response: __We show in the MS that RAD6 is exclusively targeted to mitochondria (Su9MTS) while unimported molecules are degraded (SL17; degron). This hybrid Rad6 can restore the WT ubiquitin pattern, while a rad6 active site mutant cannot.

      Figure 6. Can the authors provide Western blot data showing the expression of Rad6? Furthermore, quantifying these rescue experiments is necessary to make this conclusion more solid.

      Response: Even though we did not succeed in making good Rad6 antisera, we can clearly detect Rad6-a fusion proteins (Figure 7B).

      Figure 7. The authors found that preSu9-Rad6-α have problems being imported into the mitochondria matrix; therefore, they rebuild it as a preSu9-Rad6-α-SL17 protein. SL17 is a degron that targets the cytosolic protein (not imported into the mitochondria) to the proteasome and degraded (Figs. 7A-B-C). These issues could be a red flag for the rest of the manuscript, suggesting that other constructs (that were not critically evaluated for their localization in this work) could leak to different cellular compartments.

      Response: The wording used by the reviewer is particularly disturbing since current understanding in cell biology of eukaryotic cells does not accept “leaking” of proteins to different cellular compartments. One wouldn’t want DNAses, RNAses, Proteases etc leaking from one compartment to another. The localization of proteins to different cellular compartments involves very precise signals on the proteins, and specific cellular components, such as translocases, are required to target proteins to their exact destination. This is true for Rad6; it contains an MTS like sequence which when removed blocks import of the protein into mitochondria. Rad6 according to our analysis is an eclipsed dual targeted protein, so it no surprise that it is in two compartments and the trick with the SL17 degron solves the problem.

      The manuscript needs to be carefully edited, some references are in the not correct format, and there are issues with figure labels.

      Response: Careful editing will be undertaken as suggested by the reviewer.

      CROSS-CONSULTATION COMMENTS I agree with a great summary by reviewer 1. This discovery should be validated by top-quality data.

      Reviewer #2 (Significance (Required)):

      In the manuscript by Yu et al., the authors test the concept that certain proteins are unevenly distributed within distinct cell compartments. Due to this localization discrepancy, protein detection in some subcellular compartments can be "eclipsed" by a predominant subset of specific protein localizing in another cell compartment their actual distribution. Therefore, tiny amounts of physiologically relevant proteins could be biologically relevant. Still, their function in some locations can be overlooked (or eclipsed) because of the high expression level of the same protein in another subcellular compartment(s). Although, this concept is not particularly novel. For example, it is already known that many different proteins can localize to distinct cellular locations (e.g., permanent mitochondrial and peroxisomal localization of many proteins or transient localization of particular proteins to separate cell compartments). The authors apply a yeast system and an α-complementation assay to test further the role of such eclipsed proteins in mitochondrial biology. Specifically, they focus on the ubiquitin (Ub, or as abbreviated incorrectly in this manuscript; Ubi) conjugation pathway, components of which have never been convincingly shown to localize inside the mitochondria. This work proposes that certain ubiquitination events can occur inside yeast mitochondria. This work would represent a significant/exceptional discovery if supported by compelling data. However, the major problem with this work is that the conclusions are based on the ectopic expression of distinct proteins. This approach is not failproof in precise protein expression/delivery to the specific subcellular locations and is likely to result in a non-specific localization. Thus, the problem of eclipsed proteins is addressed by the methodology that may lead to the artificial generation of eclipsed overexpressed proteins. A more effective approach would be if the authors found a way to study this issue with endogenous proteins. The need for overexpression of mitochondria-targeted ubiquitin makes it challenging to reconcile the physiological role of these fundings. In addition, some critical technical issues and omissions further reduce the potential impact of this work (see Specific comments above). For example, strong evidence of mitochondria fraction purity and additional evidence that all the essential constructs used in this work are not misdirected to a different compartment are needed.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: In this study, the authors detected a set of components of a ubiquitination system in the mitochondrial matrix in budding yeast using the subcellular compartment-dependent α-complementation assay. The authors detected the conjugates of mitochondrial targeting signal sequence-directed HA-Ub (preSu9-HA-Ub) in the mitochondrial matrix. The immunoprecipitates of the preSu9-HA-Ubi conjugates were highly enriched for the mitochondrial matrix proteins. Subsequently, the authors focused on the Rad6 E2 ubiquitin conjugating enzyme in the mitochondrial matrix and evaluated its inactivation-altered ubiquitination pattern in the organelle. The authors conclude that ubiquitination occurs in the mitochondrial matrix because of the eclipsed targeted components of the ubiquitination machinery.

      Major comments: The authors argued that the proteins that were modified with preSu9-HA-Ubi, which was forced to be imported into the mitochondria, are present in the mitochondrial matrix, because these species are resistant to trypsin digestion. However, it was possible that they formed severe aggregates on the cytosolic surface of the mitochondria, and hence, were resistant to the proteinase. In other words, a small amount of proteins that were not imported into the mitochondria could be deposited on the cytosolic surface of the mitochondria, where they were modified with preSu9-HA-Ubi by cytosolic Rad6. To confirm if the preSu9-HA-Ubi-modified proteins were really present in the mitochondrial matrix, they should perform the protease protection assay in the presence of an appropriate detergent (Figure 3D). In addition, subcellular fractionation of the organelle by density gradient centrifugation, indirect immunofluorescence microscopic analysis of the preSu9-HA-Ubi conjugates, and/or experiments on the in vitro import of preSu9-HA-Ubi and Rad6 into the mitochondria would strongly support the authors conclusion. Other experiments that might support the authors conclusion would be to test whether the band pattern for the preSu9-HA-Ubi conjugates changes when the mitochondrial import is impaired.

      Response: We will attempt to perform 1) Protease protection assay in the presence of a detergent (Figure 3D). 2) Subcellular fractionation of the organelle by density gradient centrifugation. 3) In vitro import of Rad6 into the mitochondria.

      Minor comments: In Figure 3B, the molecular weight distributions of the preSu9-HA-Ubi conjugates and those of the preFum-HA-Ubi conjugates are different. Is there any reason for this difference?

      In Figure 3E, the position of "-" (MG132) for lane 1 is not correct.

      In Figure 6A: The band pattern for preSu9-HA-Ubi (lane 13) in the rad6-delta cells expressing Ubc8-alpha is different from that of the wild-type cells expressing Ubc8-alpha (lane 12) as well as that obtained from the rad6-delta cells harboring empty plasmids (lane 9). Is there any explanation for this observation?

      In Figure 7B and S6: The level of preSu9-Rad6-alpha-SL17 in the rad6-delta cells is always lower than that in the wild-type cells (compare lanes 13 and 10 in Figure 7B, and lanes 13 and 12 in Figure S6). Is there any explanation for this observation? The protease protection assay (with detergent control) is needed to fully confirm that preSu9-Rad6-alpha-SL17 is present in the mitochondria.

      In Figure S7, the authors presented the matrix proteins, Ilv5 and Aco1, detected in the preSu9-HA-Ubi IPed samples and described this observation in the main text. However, the authors also showed the blots for Idh1 and Fum1, which were also pulled down with preSu9-HA-Ubi from the WT cells more than from the rad6-delta cells. Is this correct? If so, please elucidate this observation in the main text.

      Figure 8D and 8E are not cited in the main text. Although there are no explanations for these figures in the main text, it looks like Rad6-deltaN11-alpha resides in the mitochondrial fraction. However, the alpha-complementation assay suggests that it resides in the cytosol. Please explain this discrepancy.

      First page of the discussion section, item 6): E2 Rad6, but not E3 Rad6?

      Figure S7: HA-Ub (cytosolic form) control is needed in addition to the empty vector control.

      Figure S7, left panel: There is an unnecessary line break in "Hsp60" and "Ilv5."

      Figure S7, right panel: There is an unnecessary line break in "Hsp60."

      CROSS-CONSULTATION COMMENTS I agree with comments of reviewer 1 and 2. -Validation of the complementation assay. -I also think that it is important to address whether intra-mitochondrial ubiquitination can be observed with endogenous level of ubiquitin. If even a small amount of preSu9-HA-Ub is mistargeted to the cytosol, proteins at the cytosolic side of mitochondrial outer membrane could be ubiquitinated and detected in the mitochondrial fraction. -Preparation of mitochondria with more sophisticated purification methods (i.e. high resolution density gradient) would be needed to separate mitochondria from ER and other organelles. -More information is needed in the materials and methods section.

      Reviewer #3 (Significance (Required)): Significance Although the results are interesting and very important, as mentioned in the major comments section, additional experiments are needed to support their model. However, researchers working on the mitochondrial biology and ubiquitin systems might be interested in and influenced by the reported findings.

    1. Author Response

      Reviewer #1 (Public Review):

      The authors report a public browser in which users can easily investigate associations between PGSs for a wide range of traits, and a large set of metabolites measured by the Nightingale platform in UKBB. This browser can potentially be used for identifying novel biomarkers for disease traits or, alternatively, for identifying novel causal pathways for traits of interest.

      Overall I have no major technical concerns about the study, but I would encourage the authors to revisit whether they can find a more compelling example that can better showcase the work that they have done. I understand that this is partly a resource paper but I think the resource itself can have more impact if the paper provides a clearer use-case for how it can drive novel biological insight.

      Many thanks for your comments. We have undertaken a new application of bi-directional Mendelian randomization to demonstrate how users may use this approach to disentangle whether associations in our atlas likely reflect either causes or consequences of PGS traits/diseases. This example is described on page 9:

      ‘For example, we applied Mendelian randomization (MR) to further evaluate associations highlighted in our atlas with triglyceride-rich very low density lipoprotein (VLDL) particles. For instance, both VLDL particle average diameter size and concentration were associated with the PGS for body mass index (BMI) (Beta=0.04, 95% CI=0.033 to 0.046, P<1x10-300 & Beta=0.012, 95% CI=0.006 to 0.019, P=2.7x104 respectively) and coronary heart disease (CHD) (Beta=0.026, 95% CI=0.019 to 0.032, P<1x10-300 & Beta=0.035, 95% CI=0.028 to 0.042, P<1x10-300 respectively). Conducting bi-directional MR suggested that the associations with average diameter of VLDL particles are likely attributed to a consequence of BMI and CHD liability as opposed to the size of VLDL particles having a causal influence on these outcomes (Supplementary Table 6). In contrast, MR analyses suggested that the concentration of VLDL particles increases risk of CHD (Beta=1.28 per 1-SD change in VLDL particle concentration, 95% CI=1.25 to 1.65, P=2.8x10-7) which may explain associations between the CHD PGS and this metabolic trait within our atlas.’

      and discussed in the discussion on page 21:

      ‘We likewise conducted bi-directional MR to demonstrate that associations between the CHD PGS and VLDL particle size likely reflect an effect of CHD liability on this metabolic trait. In contrast, the association between the CHD PGS and VLDL concentrations are likely attributed to the causal influence of this metabolic trait on CHD risk, suggesting that it is the concentration of these triglyceride-rich particles that are important in terms of the aetiology of CHD risk as opposed to their actual size. We envisage that findings from our atlas, as well as other ongoing efforts which leverage the large-scale NMR data within UKB, should facilitate further granular insight into lipoprotein lipid biology.’

      PGS construction: It's unclear how well the PGS work. Should the reader prefer the stringent or lenient PGS? Perhaps there could be some validation with traits that have decent sample sizes in UKBB. Was there any filtering to remove traits with few GWS hits, low sample sizes, or low SNP heritability as these are unlikely to produce useful PGSs?

      An example of validation was previously included for the chronic kidney disease PGS and its association with circulating creatinine, although this has now been removed due to the feedback you provided in your comments below. However, we have now provided the weights for all of the PGS included in our web atlas should users want to use these scores for prediction purposes (page 7):

      ‘The specific weights for clumped variants used in all PGS can be found at https://tinyurl.com/PGSweights.’

      On page 8 we have mentioned that in this work we have used a more lenient threshold to facilitate endeavours in a ‘reverse gear Mendelian randomization’ framework. However, the option to use the more stringent threshold remains an option for users interested in this as an alternative:

      ‘In this paper, we have discussed findings using PGS that were derived using the more lenient criteria (i.e., P<0.05 & r2<0.1), although all findings based on both thresholds can be found in the web atlas.’

      ‘Specifically, we believe our findings can facilitate a ‘reverse gear Mendelian randomization’ approach to disentangle whether associations likely reflect metabolic traits acting as a cause or consequence of disease risk (Holmes and Davey Smith, 2019) as illustrated using triglyceride-rich very low density lipoprotein (VLDL) particles in the next section.’

      We have not filtering based on other criteria such as the number as SNPs given that certain scores, despite only been constructed using few SNPs, may still provide useful to users. For example, our score for ‘Drinks per day’ based on the more stringent threshold (i.e. P<5x10-8) consists of only 6 SNPs. However, one of these is rs1229984, a missense variant located at the alcohol dehydrogenase ADH1B gene region and known to be a strong predictor of alcohol use (e.g. https://pubmed.ncbi.nlm.nih.gov/31745073/).

      Reviewer #2 (Public Review):

      The authors set out to create an atlas of associations between phenome-wide polygenic scores and circulating lipids, fatty acids, and metabolites. To do so, they utilize GWAS from 129 traits available in the OpenGWAS database to derive polygenic (risk) scores (PGS) along with the recently released NMR metabolomics data containing 249 biomarkers (and ratios) in ~120,000 UK Biobank participants. The authors create a publicly available web portal containing PGS to NMR biomarker associations:

      http://mrcieu.mrsoftware.org/metabolites_PGS_atlas/.

      The strength of this study is in the comprehensive nature of the atlas, containing associations for 129 traits phenome-wide, the large sample size of the UK Biobank NMR data, and the use of PGS for prioritising molecular traits for follow-up experiments, which is an emerging area of interest (International Common Disease Alliance, 2020; Ritchie et al., 2021a). To our knowledge this study is the first to explore this for circulating metabolites.

      In its current form the atlas has several limitations, which should be straightforward to address. Notably, results in the current atlas may be confounded by (1) technical variation in the NMR data (Ritchie et al., 2021b), and (2) major biological determinants of biomarker concentrations, including body mass index, fasting time, and statin usage.

      Firstly, thank you for the suggestion to use your ‘ukbnmr’ R package to help remove technical variations from the UK Biobank NMR metabolites data. We have applied it to remove outliers and variation in the individual data due to (1) the duration between sample preparation and sample measurement, (2) position of samples on shipment plates, (3) different equipment (spectrometers) used. This meant that we needed to re-run our entire analysis pipeline for this project from scratch to the updated dataset. Results do not appear to have drastically changed, although nonetheless we have updated results from all downstream analyses in our online web atlas using this updated dataset provided by ‘ukbnmr’.

      Secondly, the reviewer is correct that biological factors, such as body mass index (BMI) and statin usage, are indeed strongly correlated with metabolites levels. However, we are not able to adjust for such biological factors directly in our analyses, given that they are potential colliders in the causal relationship between diseases/traits and metabolites. Statin usage may be caused by both the high genetic liability to coronary artery disease as well as abnormal lipoprotein lipid levels. Likewise, obesity (and changes in BMI) may result from a high genetic predisposition to cardiometabolic disorders and disrupted metabolism. Thus, adjusting for statin usage and BMI will induce collider bias (https://jamanetwork.com/journals/jama/fullarticle/2790247), which creates spurious associations between the disease/trait PGS and metabolites.

      To better illustrate this issue, we have added additional text on page 14 to justify this study design decision as well as added a new figure (Figure 3) to help demonstrate this clearly to the readers. Fasting time on the other hand we believe is unlikely to act as a collider and was adjusted as a covariate in all linear regression models in this work. This is mentioned on page 25.

      …Further, association results for two (of the 129) PGSs, systolic blood pressure (SBP) and diastolic blood pressure (DBP), are invalid (vastly inflated) as the GWASs used to construct these PGSs included UK Biobank samples.

      Many thanks for your suggestion. We have now removed the SBP and DBP PGS from our atlas due to overlapping samples in UKB. Furthermore, our colleagues at the University of Bristol have notified us that the Glioma GWAS data obtained from the OpenGWAS platform was uploaded with incorrect effect alleles. This PGS has also been subsequently removed from the atlas. Additionally, we removed the Alzheimer’s disease (without APOE) PGS because the pleiotropic effect of lipid associated genes is now systematically examined using lipid gene excluded PGS.

      To demonstrate how one might use these PGS to NMR biomarker associations to prioritise (or deprioritise) findings for follow-up, the authors select a biomarker of interest, glycoprotein acetyls (GlycA), to perform bi-directional Mendelian randomization to orient the direction of causal effects between GlycA and traits of associated PGS. However, the conclusions of this analysis are hampered by the heterogeneous nature of the GlycA biomarker, which captures the levels of five proteins in circulation (Otvos et al., 2015; Ritchie et al., 2019), making it a difficult target to appropriately instrument for Mendelian randomization analysis. This, however, does not detract from the broader point the authors make: that PGS can help prioritize molecular traits for experimental follow-up.

      We have now conducted further sensitivity analyses to evaluate the genetically predicted effects of each of the five proteins in the reference you have provided. This is discussed on page 11:

      ‘We also conducted further sensitivity analyses given that the NMR signal of GlycA is a composite signal contributed by the glycan N-acetylglucosamine residues on five acute-phase proteins, including alpha1-acid glycoprotein, haptoglobin, alpha1-antitrypsin, alpha1-antichymotrypsin, and transferrin (Otvos et al., 2015). Using cis-acting plasma protein (where possible) and expression quantitative trait loci (pQTLs and eQTLs) as instrumental variables for these proteins (Supplementary Table 12) did not provide convincing evidence that they play a role in disease risk for associations between PGS and GlycA (Supplementary Table 13). The only effect estimate robust to multiple testing was found for higher genetically predicted alpha1-antitrypsin levels on gamma glutamyl transferase (GGT) levels (Beta=0.05 SD change in GGT per 1 SD increase in protein levels, 95% CI=0.03 to 0.07, FDR=3.6x10-3), although this was not replicated when using estimates of genetic associations with GGT levels from a larger GWAS conducted in the UK Biobank data (Beta=1.6x10-3, 95% CI=-6.9 x10-3 to 0.01, P=0.71). For details of pleiotropy robust analysis and replication results see Supplementary Table 14.’

      There are also several important limitations to the study which cannot be addressed, which the authors discuss appropriately in the paper. First, the NMR data does not provide a comprehensive view of the metabolome - it is heavily focused on lipids and fatty acids. Many small metabolites in circulation cannot be measured by NMR spectroscopy, and further insights must wait for data from molecular profiling efforts planned or underway in UK Biobank (e.g. mass spectrometry). Second, the authors restricted analysis to participants of European ancestries. This a pragmatic analysis choice given (1) the PGSs were derived from GWAS performed in European ancestries, (2) PGS associations are particularly susceptible to confounding from genetic stratification and differences in environment, and (3) the very small sample sizes for which NMR data is currently available in UK Biobank participants. Finally, although a large sample size, UK Biobank is not a random sample of the population: healthy adults are over-represented, meaning PGS to metabolite associations may be different in disease cases or less healthy individuals.

      Overall this study has strong potential, with straightforward to address limitations, and the resulting atlas will provide a useful characterisation of the relationships between NMR biomarkers and polygenic predisposition to various traits and diseases, which can be used by domain experts to prioritise biomarkers or traits for experimental follow-up.

      Reviewer #3 (Public Review):

      Fang et al. created an atlas for associations between the genetic liability of common risk factors or complex disorders and the abundance of small molecules as well as the characteristics of major apolipoproteins in blood. The whole study is well executed, and the statistical framework is sound. A clear strength of the study is the large array of common risk factors and disease analyzed by means of polygenic risk scores (PGS). Further, the development of an open access platform with appealing graphical display of study results is another strength of the work. Such a reference catalog can help to identify novel biomarkers for diseases and possible causative mechanisms. The authors further show, how such a systematic investigation can also help to distinguish cause from causation. For example, an inflammatory molecule readily measured by the NMR platform and strongly associated in observational studies, is likely to be a consequence rather than a cause for common complex diseases.

      However, in its current form, the study suffers from some weakness that would need to be addressed to improve the applicability of the 'atlas'. This includes a distinction of locus-specific versus real polygenic effects, that is, to what extent are findings for a PGS driven by strong single genetic variants that have been shown to have dramatic impact on small molecule concentrations in blood.

      Thank you for your suggestions to help refine our work. In line with this comment, we have repeated all analyses 1) after applying the ‘ukbnmr’ R package as recommending by reviewer #2 to remove technical variations and outliers and 2) conducted sensitivity analyses to remove an established list of lipid gene loci from PGS construction. Full results can be interrogated in the web atlas to evaluate whether PGS association may be driven by locus-specific effects at these regions, which may be particularly informative given the representation of lipoprotein lipid metabolites on the NMR panel. Findings are reported on page 19:

      ‘The polygenic nature of complex traits means that the inclusion of highly weighted pleiotropic genetic variants in PGS may introduce bias into genetic associations within our atlas. To provide insight into this issue, we constructed PGS excluding variants within the regions of the genome which encode the genes for 14 major regulators of NMR lipoprotein lipids signals which captured 75% of the gene-metabolite associations in the Finnish Metabolic Syndrome In Men (METSIM) cohort (Gallois et al., 2019). For details of these genes see Supplementary Table 5).

      For PGS with these lipid loci excluded, anthropometric traits such as waist-to-hip ratio (N=209), waist circumference (N=206) and body mass index (N=205) still provided strong evidence of association with the majority of metabolic measurements on the NMR panel based on multiple testing corrections. Elsewhere however, the Alzheimer’s disease PGS, which was associated with 60 metabolic traits robust to P<0.05/19 in the initial analysis including these lipid loci (Supplementary Table 17), provided no convincing evidence of association with the 249 circulating metabolites after excluding the lipid loci based on the same multiple testing threshold (Supplementary Table 18). Further inspection suggested that the likely explanation for this attenuation of evidence were due to variants located within the APOE locus which are recognised to exert their influence on phenotypic traits via horizontally pleiotropic pathways (Ferguson et al., 2020).’

      …Further, it is unclear how much NMR spectroscopy adds over and above established clinical biomarkers, such as LDL-cholesterol or total triglycerides. This is in particular important, since the authors do not adequately distinguish between small molecules, such as amino acids, and characteristics of lipoprotein particles, e.g., the cholesterol content of VLDL, LDL or HDL particles, the latter presenting the vast majority of measures provided by the NMR platform. Finally, the study would benefit from more intriguing or novel examples, how such an atlas could help to identify novel biomarkers or potential causal metabolites, or lipoprotein measures other than the long-established markers named in the manuscript, such as creatinine or lipoproteins.

      To address these comments, we have added a new example focusing on the granular measures of VLDL particles provided by the NMR data (on top of the examples listed at the start of the response to reviewer document), which as the review points out is one of its strengths of the measures generated by this platform over long-established biomarkers (page 21):

      ‘We likewise conducted bi-directional MR to demonstrate that associations between the CHD PGS and VLDL particle size likely reflect an effect of CHD liability on this metabolic trait. In contrast, the association between the CHD PGS and VLDL concentrations are likely attributed to the causal influence of this metabolic trait on CHD risk, suggesting that it is the concentration of these triglyceride-rich particles that are important in terms of the aetiology of CHD risk as opposed to their actual size. We envisage that findings from our atlas, as well as other ongoing efforts which leverage the large-scale NMR data within UKB, should facilitate further granular insight into lipoprotein lipid biology.’

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1:


      1) The authors could consider qualifying the observations as preliminary as no

      mechanistic data or longer-term pathophysiology is investigated. Indeed, the latter is well

      beyond the current scope and may require generation of cell-type specific STING ki mice.

      • *

      Thank you for the comment. We have qualified our observations as preliminary (line 662).

      Indeed, generating cell-type specific STING ki mice is part of our future plans.

      2) The authors consistently write "NF-kB/inflammasomes" - these two pathways (although

      related) are quite distinct and should not be lumped together in such a way.

      • *

      Thank you for this important note, we now corrected the text (for example see section headings in the Results section, lines 338 and 415).

      3) Line 79: "NRLP3" should be corrected to NLRP3.

      • *

      Line 210: age of "adult mice" in weeks should be state in the text and figure legend.

      Thank you, corrected.

      4) Line 262: In Figure 3B and D the images look very different and there is no indication of

      what a positive inclusion is? This should be indicated on the image.

      • *

      Thank you for the suggestion. We replaced the corresponding panels with new images, where we show the nuclei with blue, and the Thioflavin S staining with magenta pseudo-color (current Figure 3E). We marked the outline of Thioflavin S positive cells with yellow. An inset showing the magnification of some neurons with inclusions is also presented.

      5) Line 280: The data of Ifi44 should also be mentioned in the text.

      • *

      Thank you. We performed new experiments to show the gene expression changes in the

      striatum and in the substantia nigra, therefore majority of the gene expression data from the cortex has been moved to the supplementary material (supplemental Figures 3 and 5), and is not discussed in detailed in the current manuscript.

      6) Line 290: Figure 4, Examining IL-1B and Caspase-1 transcripts is not a readout of

      inflammasome activation. pro-IL1B is upregulated in response to NFkB activity. Inflammasome

      activation is commonly examined in other methods e.g. via ASC puncta formation (imaging

      based), active IL-1B secretion (ELISA), Caspase-1 and IL-1B cleavage via western blot

      Thank you for this suggestion, we performed new experiments and added the data as Figure 6.

      • We performed Western blot analysis to detect IL-1β cleavage and NLRP3 proteins from the striatum (Figure 6C-E). 2) We quantified the number of ASC puncta within microglia and astroglia from striatal sections (Figure 6F-I). 3). 3) We also measured the protein levels of several additional immune mediators in the striatum of STING ki and KO animals (supplemental Figure 6, summary heatmap is on Figure 7A). 7) Line 310: The NF-kB subunit examined should be stated (p65?). Furthermore, IRF3

      translocation might be a better readout for STING activation.

      • *

      We indeed detected the p65 subunit of NF-kB (antibody is listed in the supplemental Table), and now it is also indicated in the text (line 366). We also performed subcellular fractionation and quantified IRF3 in the nuclear and cytoplasmic fractions. The data is now added on Figure 6A, B.

      8) Discussion: Given the findings here suggest a strong role for NF-kB, a short discussion

      of IFN vs non-IFN responses from STING should be included. There have been a number of

      seminal papers demonstrating the importance of non-IFN STING responses of late as well as

      much evidence from SAVI mice to suggest some non-IFN driven pathologies.

      • *

      Thank you for the suggestion. The data on inflammasomes were given a separate section in the results (from line 395). In the discussion, from line 535 we discuss the IFN dependent response and from line 548 we discuss the non-IFN driven pathways.

      9) Discussion: Is there any evidence from the human SAVI patients of neuroinflammation

      etc. This should be mentioned either way in the discussion

      • *

      Thank you for this comment. The manifestation of neurological symptoms is not a core feature of the human SAVI disease. Some patients suffer from various neurological symptoms e.g. calcification of basal ganglia, spastic diplegia and episodes of seizure (Fremond et al., 2021). We inserted a short text in discussion (lines 532-534).

      10) Discussion: There is a large body of work demonstrating STING-induced cell death in

      numerous cell types. Despite this it is not mentioned nor discussed but should be. It could

      represent how dopaminergic neurons are lost in the STING ki mice.

      • *

      Thank you for pointing out the gap in our discussion. We added additional text in lines 604-618.

      11) The resolution/quality of some of the imaging is not great but this may be due to PDF

      Compression

      Thank you, we upload the figures with higher resolution.

      Reviewer #2:


      1) The authors base their conclusions (line 215-216) on the neuroinflammatory status of

      their mice strongly on an assessment of the Iba1 and GFAP-positive area fraction. Increase of

      Iba1 and GFAP areas does not necessarily correlate with an increased cytokine production and

      release by the cells. Therefore, in addition the measurement of cytokine mRNAs it would be

      necessary to measure cytokines also on protein level (see also #4 and #5).

      • *

      Thank you for this suggestion, we measured the protein levels of several immune mediators with LEGENDplex™ assay from the striatum, and the new data are included as Figure 7A and supplemental Figure 6.

      2) In the same context: Is the increase of Iba1 and GFAP- covered area due to increased

      proliferation of microglia and astrocytes or due to increased expression of these markers in

      activated glia? How is the number of Iba1/GFAP-positive cells affected?

      • *

      We quantified the number of glia cells in the striatum and in the substantia nigra of adult STING WT and STING ki mice, and, parallel with higher immunoreactivity for the corresponding markers, we detected increased number of cells as well. The quantifications are now included in supplemental Figure 1.

      3) Nowadays we know that microglia and astrocytes can exist in a variety of activated

      states which can be either beneficial or detrimental. An analysis of disease-associated

      microglial markers (Keren-Shaul et al. 2017) would give a good picture of the state microglia are

      in.

      • *

      Thank you for the suggestion. In addition to the panel of immune modulators at the protein level (supplemental Figure 6), we performed qPCR analysis of additional “M1” marker (Nos2) and additional “M2” markers (Il4, Fizz2, Ym1) (Gong et al., 2019). The data is included in Figure 7A and shown in supplemental figure 6. The findings are described from line 431.

      4) It also would be of interest to determine which cell type is responsible for the observed

      neurodegeneration. Which cytokines are released by microglia or astrocytes upon STING

      activation? Even in vitro experiments would help here to get a more profound understanding.

      • *

      We agree with the suggestion, however, the further in vitro experiments are beyond the scopes of this study and will be the basis of a future project.

      5) In line 273 the authors describe that STING is known to activate NFkB and the

      inflammasome. As proof that this is also occurring in their mouse, they perform qPCR analysis

      of whole brain IL-1b, TNF-a and Casp1 expression. While this analysis indicates that there is

      indeed an increased mRNA production of proinflammatory cytokines in the brains of STING ki

      mice, it does not give any indication whether the inflammasome is active or not. The inflammasome is a protein complex largely regulated on protein level. Meaning an assessment

      of the cleavage of Caspase 1 on protein level or the presence of cleaved IL-1b in comparison to

      uncleaved Pro-IL-1b by Western Blot as well as a staining for the number of inflammasomes

      would be required to draw these conclusions.

      • *

      Thank you for the suggestion. We performed additional experiments: 1) Western blot to detect pro-IL1b and IL1b and NLRP3 proteins from the striatum (Figure 6C-E), and 2) we quantified the number of ASC puncta within microglia and astroglia from striatal sections (Figure 6F-I).

      6) To conclude that NFkb/inflammasome pathway is the most active/crucial in astrocytes

      (line 354) a staining for ASC inflammasomes would be of importance, especially as astrocytes

      normally do not express NLRP3.

      • *

      Thank you for this comment. We stained brain sections for ASC specks and for microglia (Iba1) and astroglia (GFAP) markers (Figure 6F-I). Although amount of ASC specks in astroglia was lower than in microglia, we found still a substantial amount of ASC specks in astroglia in the brains of STING ki animals.

      7) As already shown for ALS (Yu et al., 2020) and Parkin KO (Sliter et al. 2018), the authors want to

      further assess the relevance of the STING pathway to PD (line 27-28). Therefore, an in-depth analysis of

      key PD hallmarks beyond phosphorylated a-synuclein, loss the other was parkin/PINK related (so TDP

      deleted) of TH-stained neurons and dopamine reduction is needed. In the discussion the authors

      hypothesize that autophagy (line 467) may be linked to the observed phenotype. Therefore,

      assessment of autophagy/mitophagy as well as mitochondrial dysfunction and mtDNA should

      be analysed. In the same line of thought it would be important to know if and how the observed

      dopamine reduction effects mouse behaviour, thus mice should be subjected to the Rotarod or

      pole or beam walk test.

      • *

      Thank you for these suggestions. In the work by Yu et al. and Sliter et al., the STING pathway was shown to mediate neurodegeneration resulting from TDP-43 pathology and mitochondrial damage. Our work is complementary by investigating the effects of constitutive activation of STING. We have therefore focused on the signaling pathways downstream of STING. As mentioned above, the most important next step will be to separate the contributions of neuronal and glial cells by generating cell type specific STING activation. Of course, it will be interesting to see at a later time point whether STING activation feeds back. We also speculate that STING activation may also cause TDP-43 pathology. Yet, this will be part of a future study. To acknowledge that the pathology is not specific to alpha-synuclein, we added a short statement from line 634.

      With respect to the comprehensive analysis of the PD phenotype, our work includes the

      classical parameters of TH neuron number, TH fiber density, dopamine concentration and

      synuclein pathology. With respect to mouse behavior, we note that the STING ki mice have severe inflammation in the lung, kidney and other (peripheral) organs, reduced body weight and reduced lifespan (Luksch et al., 2019; Motwani et al., 2019; Siedel et al., 2020). Motor deficits cannot be attributed to dopamine neuron degeneration and for this reason were not included (stated in the Discussion, lines 624-625). In order to expand the description of the PD phenotype we now included measurements of cytosolic reactive oxygen species, mitochondrial oxygen species and nitric oxide, which result from inflammation and are known to affect dopaminergic neurons (new Figure 8).

      Reviewer #3:


      1) The method for quantification of TH-positive cells is not sufficient. They just described

      how they stained every fifth sections but did not mention how they count. This is a critical point

      and they should carefully provide information more than just referring their previous paper.

      Counting of dopaminergic neurons and quantification of fibers was described in a dedicated section of the methods. This section has now been expanded (from line 154).

      2) It is not persuasive that they did not investigate local inflammation in SN. They

      presented increased microglia and astrocytes in the striatum but not analyzed these cells in SN

      • *

      Indeed, we measured neuroinflammation in the substantia nigra as well, however, although increased in STING ki mice, it was less pronounced than neuroinflammation in the striatum. We now include the quantification of area fraction as well as cell number counting of microglia and astroglia in the substantia nigra of STING WT and STING ki animals (supplemental Figure 1), and also the expression of inflammatory mediators in Figure 4.

      3) In Figure 3, they analyzed alpha-synuclein phosphorylation and beta-sheet structure in

      the striatum. This is funny from the aspect of Parkinson's disease, which dominantly affects SN.

      They should perform similar experiments with SN samples. In a different aspect, the aggregates

      detected by Thio S may not be alpha-synuclein and could be tau, TDP43 or other substances.

      Phospho-synuclein of course does not mean aggregation, so they can consider electron

      microscopy.

      • *

      We agree with the reviewer. To complement our data, we therefore performed solubility assay both from the striatum and from the substantia nigra to quantify the ratio of alpha-synuclein in the Triton X-100 soluble and insoluble fractions (Figure 3C, D) as previously (Szego et al., 2022; Szegő et al., 2019). Additionally, we quantified phosphorylated alpha-synuclein from the substantia nigra as well Figure 3A,B).

      We also agree with the reviewer that the presence of Thioflavin S-positive inclusions may also contain other, beta-sheet forming proteins and noted this from line 634.

      4) Figure 5, pSTAT3 increased in Iba1-negative cells, which seem neurons from the size of

      nuclei. First, the authors should investigate the identity of pSTAT3-positive cells with GFAP and

      MAP2. If pSTAT3 is actually increased in neurons, what does it mean in the pathology? For

      instance, in viral infection, STAT3 activation triggers suicide of neurons to prevent further

      proliferation of viral particles in neurons. Is it homologous or other function?

      • *

      Thank you for this suggestion. The brain sections were stained for Iba1 and GFAP. pSTAT3 nuclear staining indeed increased in non-glia cells, based on the morphology, we think in neurons. However, detailed characterization of the signal is out of the scopes of this (preliminary) study.

      5) In Figure 6 and overall, cell types in which the activation of three signaling pathways,

      were mixed up and hard to understand the actual situation in the brain.

      • *

      In our model, STING is activated in all cells. Consequently, we cannot determine the origin of immune mediators found elevated in the STING ki mice. This will require cell type specific STING activation. In order to react to the reviewer’s comment and be clearer, we have added more details about the brain region and age of mice used for each analysis also in the figures.

      6) In the method section, the original paper for generation of heterozygous STING N153S

      KI mice should be Warner et al, JEM 2017.

      • *

      We used a STING N153S ki mouse strain that was independently generated in the Technical University Dresden (Luksch et al., 2019).

      7) NF-κB stains seem located in cytoplasm in Figure 5B.

      • *

      We agree: especially in the young STING ki mice, cytoplasmic NF-kB staining is increased

      compared to STING WT mice. To quantify nuclear translocation, however, we counted the

      number of those cells where NF-kB signal was overlapping with the nuclear Hoechst staining.

      8) In Figure 4 and 6, why the authors evaluate gene expressions in frontal cortex instead of

      SN or striatum.

      • *

      As noted in several comments, we show here that the STING-induced pathology involves

      dopaminergic neurons, but believe that it is not specific for the dopaminergic system given that STING-ki is ubiquitously expressed. For practical reasons, we have used cortical samples for the expression analysis. For consistency, we now performed additional qPCR measurement from the striatum and from the substantia nigra and included them as new Figure 4 and supplemental Figure 6N-Q. The previous data from the cortex was moved to the supplemental Figures 3 and 5. Additionally, we measured the levels of several inflammatory modulators from the striatum of STING ki and KO animals (Figure 7A and supplemental figure 6A-M).

      9) In some groups (Sting-ki;ifnar1-/- in Fig 6C, 6E), the values were separated to two

      groups, which makes readers to doubt on soundness of their genotyping.

      • *

      Our genotyping protocol is highly standardized, and the genotype of the animals were correctly assigned. Here we provide an example of gel images showing the products after PCR reactions for the STING N153S allele (Figure 1a), STING WT allele (Figure 1b), Ifnara WT allele (Figure 1c) and lack of Ifnara allele (Figure 1d) of the same animals. We note that a bimodal distribution of phenotypes is often observed in Ifnar-/- mice.

    1. Author Response

      Reviewer #2 (Public Review):

      This work will be of potential interest to biologists studying aging. While transposable elements have been reported to have higher expression as organisms age, it was previously unclear if their expression can exacerbate aging phenotypes or if they are a byproduct of aging. The authors present evidence in this manuscript that artificially increasing transposable element expression during the whole Drosophila life cycle can worsen aging phenotypes.

      Strengths

      The authors provide direct evidence that expression of their gypsy construct across the whole life of animals decreases fly lifespan (Figure 4), and that this outcome is dependent on reverse transcriptase (Figure 6).

      Monitoring TE mobilization can be difficult in general and is often expensive when using a sequencing approach. The authors accurately monitor gypsy mobilization from their ectopic copy through qPCR and sequencing.

      Weaknesses

      Experiment design, data interpretation, and story structure:

      The current model proposes that TE increases activity in aged animals and potentially contributes to the aging process. However, this paper artificially drives gypsy activation throughout the whole fly life cycle. Under this design, TE may already bring deleterious effects from early developmental stages or young age, thus ultimately shortening their life cycle. To truly test the function of TE during the aging process, the authors need to temporally control gypsy expression and only express their construct in aged animals.

      Figure 1: I am not sure I got any convincing messages from this figure. First, flies at 30 days of age should not be considered as old. Second, the authors try to claim that TE expression increased with aged FOXO mutants. However, there is no data to show the comparison between aged wild-type and FOXO mutants (panel e is young wt vs young FOXO null). Meanwhile, Figure 1 has nothing to do with Gypsy. How could this figure fit into the story?

      It is clear that we did not do a good job explaining this section. First, we did not mean to imply that the 30-day flies are old. They are simply older than the 5-day flies. The 30-day timepoint was chosen to match previous experiments and data sets in the literature. It was also chosen to minimize any survivor bias that could occur by doing the assay in very old flies. We have clarified this in the text and figures.

      Second, it is the number of transposons that show an increase in expression in the dFOXO null animals that we mean to highlight (18 for dFOXO vs only 2 for wDAH). Panel e is meant to illustrate that the transposon landscapes, even in young flies differ by genotype making a direct transposon to transposon comparison impossible. We have added text to clarify these points.

      Third, we also do not mean to imply that anything here is specific for gypsy. The work going forward in the paper uses gypsy as a tool because it is one of the better understood retrotransposons, there existed a validated active clone of the transposon and it has already been implicated in aging in the fly. We took gypsy as a model retrotransposon. We have added text to clarify here.

      Figure 3: While the data presented in this Figure is sound, it is unclear how this data fits into the overall narrative that transposon activity drives aging.

      Figure 3 is a continuation of the characterization of our ectopic gypsy. We wanted to rule out that there is a “hotspot” of insertion that would account for any phenotypes we observe. We find no hotspot in the males we use for analysis suggesting it is the act of transposition, not a specific target gene that is important. We have added to the text to clarify the motivation for these experiments.

      Figure 5: It is interesting to see the copies of gypsy are not increased after 5 days. Does gypsy still mobilize after this young age? If yes, the authors should observe increased gypsy gDNA in later time points, unless the cells having gypsy new insertions keep dying. The authors should specifically check tissues with low cell turnover (such as brain) or high cell turnover (such as gut).

      Reviewer 2 makes a great observation. In fact, using primer pairs that specifically detect the ectopic gypsy, we consistently see insertion numbers go down in very old animals (figure 5a&b). With our current understanding of retrotransposition, we should not be able to see loss of insertions unless the host cells are being lost from the analysis. We favor the idea that the reviewer suggests; that the cells that have high levels of insertion are dying and disappearing from the analysis. We think this is also reflected in the bias for intergenic or intronic sequences in our insertion mapping of figure 3. In an attempt to address this question we did measure insertions in heads versus bodies. In male flies aged 14 days there was no difference in the average number of insertions (although the variability was greater in heads). This data is reported in Supplemental Figure 6a.

      Figure 8: Using Ubiquitin GAL4 to drive both gypsy and FOXO expression could dilute the expression of each individual gene. Thus, it is possible the rescue effect seen by expressing FOXO in addition to gypsy may just be due to lower gypsy expression. Including qPCR data showing gypsy expression levels in Ubi>gypsy, UAS FOXO flies compared to Ubi>gypsy flies would be helpful.

      We included this data in Figure 2b and 8c. Unfortunately, we did not clearly direct the reader to compare the values. Comparing Figure 2b with Figure 8c shows the RNA level of the ectopic gypsy is comparable in both cases. Perhaps even slightly higher in the UAS-FOXO case. We have added a sentence to make this clear.

      It is unclear if FOXO can rescue TE-specific aging phenotypes. While it appears that FOXO overexpression rescues the decrease in lifespan caused by gypsy expression, the authors did not test if FOXO overexpression could rescue the effects of gypsy in the paraquat resistance assays or rhythmicity experiments.

      We include in this revision data showing dFOXO overexpression rescues the paraquat resistance and lowers the levels of overall insertions in the animals.

    1. Author Response

      Reviewer #1 (Public Review):

      The authors of the paper provide new evidence of how prefrontal cortex of mutant mice used as a disease model of schizophrenia differs from wild type littermates. By analyzing local network dynamics at the level of specific cell type, authors shed new light on the circuit mechanisms that underlie changes in network dynamics in these mice.

      The claims in the submitted manuscript are supported by the data. I have a few comments and questions that need to be clarified.

      We thank the reviewer for highlighting the novelty of our work and its relevance (…shed new light on the circuit mechanisms that underlie changes in network dynamics in these mice…) for the field and the validity of our data (….claims in the submitted manuscript are supported by the data).

      1) Average firing rates

      Authors claim that they saw a significant reduction in interneuron firing rates in Disc1 mutant mice compared to control mice Fig.1c. However, the difference could be general and not interneuron specific. Due to the high firing rates of interneurons, the statistical test will work better on interneurons than on pyramidal cells as pyramidal cells average firing rates are lower. What I suggest to do is to take interneuron cells that fire at a lower rate (lower 33% for example ) and compare the control and Disc1 groups. Also I would suggest to take pyramidal cells that have higher firing rates (upper 33% for example) and compare firing rates across the same groups. One would like to see if these differences are not due to changes in firing rates per se.

      We thank the reviewer for pointing out this important aspect. In our original analysis, we did not take into account that additional differences in the PYR population might be present but ‘masked’ by the overall lower firing rate of that neuronal population. As suggested by the reviewer, we separately considered the firing rate of the ‘top 33%” of the PYR population, which did not significantly differ between genotypes (p=0.958, n=209 control and 245 Disc1 PYRs, Welch’s test). As suggested, we moreover considered the ‘bottom 33%’ of INT firing rates, for which the significantly lower rates of Disc1-mutant INTs remained (control: 4.2 ± 0.6 Hz vs. Disc1: 1.8 ± 0.2, n=26 and 34 neurons, p=0.013, Mann-Whitney U-test). Since only few INTs were recorded per session in some cases (ranges: Disc1: 2-12/session; control: 2-19/session), we performed this analysis on the basis of individual cells (see also our reassessment of the main statistical comparisons in response to #1 by reviewer 2 and #4 by reviewer 3). These data are now reported in the new Fig. 1 – figure supplement 3 and referred to in line 76 ff. (line 72 ff. without tracked changes) of the revised manuscript.

      2) Optogenetic tagging

      Authors indicate that light triggered and spontaneous spike waveform are similar Fig.1d. This is nice, but would be better to see all the tagged neurons. I would suggest showing all optically tagged neurons spike features. Authors can impose with a different color spike features of tagged neurons in Fig.1a. I suspect that since all PVI are narrow spiking and they must fall into the area of blue colored cells in Fig.1a.

      Following the reviewers suggestions, we included the average waveforms with and without light for all opto-tagged PVIs in the revised Fig. 1f. Moreover, we included the kinetic features of opto-tagged PVIs in Fig. 1a (red dots), and separately for control and Disc1-mutant mice in the new Figure 1-figure supplement 2. As predicted by the reviewer, the PVIs indeed cluster with the other putative INTs. We would moreover like to point to our new analysis in response to #2 of reviewer 2 addressing the spike kinetics of optotagged PVIs versus untagged putative INTs, which are similar in their trough-to-peak duration and asymmetry index. These data are shown in the novel Fig. 1 – figure supplement 2.

      3) It was not clear why authors assessed only firing rates in last 25ms (line 348-349). If they have a clear justification for this they should provide it. But why not use the latency of the first spike also as an additional metric. A well tagged cell will respond to light pulse with short latency (within 5 ms). My concern is that non PVI cells may increase firing rate after 25ms of stimulation of PVI cells due to disinhibition.

      Despite the latency to the first spike being frequently used as a method to detect ChR2-positive neurons, the laser stimulation produced significant photoartefacts in our hands. We were therefore concerned that spikes that happen shortly after the onset of the light pulse might be missed, and hence the latency to the first spike might be misinterpreted. Selecting a later time point in the stimulation interval allowed us to assess the firing rate during light application without the interference by artefacts. Nevertheless, we fully agree with the reviewer’s concern that ChR2-negative non-PVIs might increase their rate due to disinhibition, and that these neurons might thus be falsely classified as PVIs. However, we are confident that that is not the case. First, optotagged PVIs cluster well within the population of electrophysiologically identified INTs (see our response to your first remark on ‘optogenetic tagging’) and were indistinguishable from this population in terms of spike kinetics (see our response to #2 of reviewer 2 and the new Fig. 1 – figure supplement 2), suggesting that no disinhibited PYRs were included in the optotagged sample of cells. Second, we performed an additional analysis to address the time course of firing rate changes in optotagged PVIs. We computed smoothed spike trains (convolved with a 5 ms SD Gaussian kernel), and extracted the average firing rate of each optogenetically identified PVI centered on the onset of the light pulses. This analysis revealed a rapid increase in firing rate upon light delivery, arguing against disinhibitory network effects. These new data are now shown in the new Fig. 1 – figure supplement 5 and reported in line 89 (85 without tracked changes) of the revised manuscript.

      4) Spike cross-correlations

      The authors show that spike transmission probability from PYR to PVI is reduced in Disc1 mice compared to the controls Fig.2d and Fig.2e, but what happens to PVI to PYR spike transmission probability? Is it different in those groups? Answering this question is important since the authors discuss this topic in line 185-193.

      Inhibitory synaptic interactions are indeed detectable by spike-train cross-correlation. However, we find these to be harder to quantitatively interpret than excitatory connections. Those interactions are not visible as spike transmission but rather as a reduction in spike transmission. Reliable estimates of the reduction in spike rate of postsynaptic PYRs require very large spike numbers of postsynaptic neurons that need to be sampled. For instance, Senzai et al., 2019 (Neuron 101: 500-513.e5) identified inhibitory interactions in continuous recordings lasting up to 68 h. Since we did not explicitly design our experiments to investigate inhibitory interactions, our recordings were substantially shorter than the required length. Using the method of Senzai et al., 2019 to identify inhibitory interactions, we detected only 5 INT-INT interactions (in the pooled Disc1-mutant and control data set). This low number does not allow the quantification of potentially reduced spike transmission. Thus, attempts to quantify inhibitory interactions properly would require a substantial amount of additional long-duration recordings. While the point raised by the reviewer is highly relevant and should be investigated in future, we think that given the extensive amount of experimentation needed to address this question, it is beyond the scope of the current manuscript.

      5) Authors could try to link oscillations with spike transmission probabilities. On line 180 authors discuss that lower synchrony between PVI might be responsible for observed reduction in gamma power in Disc1 mutant mice. With the available data authors could test this hypothesis. They can look at spike cross correlations in their pool of INT and PVI (if they have pairs of PVI recorded in the same session) population.

      We thank the reviewer for this excellent suggestion! We computed the cross-correlations for all simultaneously recorded putative INTs and quantified the baseline-subtracted mean cross-correlation within 10 ms around zero time lag. This analysis revealed weaker cross-correlation in Disc1-mutant mice (p=0.026, Mann-Whitney U test, tested on averages from n=7 control and Disc1 mice with at least 2 INTs recorded simultaneously), suggestive of reduced synchronization of putative INTs at short time lags. These new data are now included in the new Fig. 4 and reported in line 201 ff. (185 ff. without tracked changes) of the revised manuscript.

      6) An alternative way to link oscillations with lower spike transmission probabilities in PYR-PVI pairs is to use synchrony triggered LFP analysis. One could take all time points when PVI and PYR cells fired acausal spikes within 2ms window and look at the LFP around this time point. Than take the average of the synchrony-triggered LFP and look at the power spectrum.

      The proposal to link spike transmission with LFP power is indeed intriguing. As suggested by the reviewer, we extracted the 60-90 Hz-filtered LFPs triggered by INT spikes that followed a spike in a presynaptic PYR by <2 ms and measured the average gamma amplitude in a window of 20 ms around the INT spike. This analysis revealed comparable gamma amplitudes in Disc1 compared to control pairs. This finding suggests that local PYR-INT loops are still capable to produce gamma oscillations, and that the gamma oscillation defect of Disc1 mice is likely not caused by such a local defect. To investigate the relationship between INT spike timing and gamma oscillations more generally, we further extracted gamma amplitudes of spike-triggered LFPs using all available spikes of the INTs. Moreover, we compared the data to gamma amplitudes measured at randomly selected time points. ANOVA analysis followed by Tukey tests performed on the level of mouse averages indicated that while INT spiking-associated gamma amplitudes were significantly larger than those depicted from random time points in wild type mice (p=0.001). However, the same was not true for Disc1-mutant mice (p=0.591). Furthermore, this analysis revealed significantly reduced spike-triggered high gamma amplitudes in Disc1-mutant compared to control mice (p=0.011). While these results argue against a driving role of local connection alterations in gamma defects, they generally confirm the impaired synchrony of INT spiking relative to gamma oscillation that we observed in our analysis of phase coupling. These data are now shown in the new Fig. 4, which summarizes all new analyses regarding gamma oscillations and phase-coupling, and in figure 4 – figure supplement 2. The new results are described in the main text of the revised manuscript in line 188 ff. (172 ff. without tracked changes).

      Considering the reduced short time scale synchronization of INTs (see our new results towards the reviewer’s #5) and reduced gamma amplitude of INT spike-triggered LFPs, it is possible that impaired synchronization among prefrontal INTs might contribute to the observed reduction in gamma power of Disc1-mutant mice (thereby, essentially, reflecting impaired INT gamma (ING)). Additionally, reduced long-range excitatory drive maintaining local gamma oscillations might be a contributing factor. For example, recent work showed that high gamma oscillations in the mPFC occur synchronized with gamma oscillations in the olfactory bulb (Karalis & Sirota, 2022, Nat Commun 13:467). It remains to be investigated whether local INTs are rhythmically driven by input from the olfactory bulb (in a multi-synaptic pathway including olfactory cortex) and to what extent that drive maintaining afferent gamma might be altered in Disc1-mutant mice. While the current data set does not allow a systematic evaluation of these possibilities, they should be further explored in future experiments.

      7) Cell assembly analysis

      The authors used 10ms for testing synchronization among pairs of PYR neurons in Fig.4a but 25ms for analysis of assembly dynamics. I think the authors justified why they used 25ms bin size, but it was not clear why they used 10ms? Could the authors clarify the reasons behind this decision?

      The synchronization analysis was originally applied to PYRs converging on a common postsynaptic INT. English et al. (Neuron 95:505-520, 2017) systematically tested the effect of presynaptic cooperativity on spike transmission in the hippocampus (their Fig. 5). Their analysis revealed a maximum in cooperativity at ~10 ms. To maximize the sensitivity of our approach, we thus focused on 10 ms for this analysis. However, we agree that using the same time window as for assembly extraction is a reasonable proposal, in particular since we find no difference in the synchronization of identified presynaptic PYRs (Fig. 3e of the revised manuscript). Thus, we have recomputed cross-correlations using a 25 ms bin size. To further improve the analysis, we restricted it to neurons with at least 1000 spikes and simplified the quantification of excess spiking by using the ‘coinicident_spikes’ function of the Python package neuronpy.utils.spiketrain. Excess synchrony is now estimated by quantifying the number of coincident spikes between a reference and a comparison spike train detected in a 25 ms time window normalized by the firing rate expected by chance (2*frequency of comparison train * synchrony window * number of the reference train).

      By using this improved analysis with a 25 ms time window, we could replicate our original finding of enhanced synchronization of PYR spiking. However, when we averaged the data on the basis of individual mice as suggested in #1 of reviewer 2 and #4 of reviewer 3, we could not observe this effect (irrespective of whether we used the new, coincident spikes-based analysis or the original excess synchrony analysis at either 10 or 25 ms synchrony window). This result is now stated in line 215 ff. (199 ff. without tracked changes) of the revised manuscript.

      Reviewer #2 (Public Review):

      This is an interesting paper, in which the authors assessed spiking and network deficits in a well-established mouse model of schizophrenia. This mouse model carries a genetic deletion of the Disrupted-in-schizophrenia-1 (Disc1) gene, which is highly penetrant in the human condition. The authors combined behavioral analyses with state-of-the-art electrophysiological recordings in vivo, coupled to optogenetic tagging, to study a subnetwork formed by a major inhibitory neuron subclass (the parvalbumin (PV)-expressing interneuron) and principal excitatory pyramidal neurons in the medial prefrontal cortex. This work indicates reduced firing rates of PV cells in Disc1-KO mice, likely due to reduced coupling with pyramidal neurons, leading to alterations in local network activity. Indeed, the authors found that Disc-KO mice exhibited reduced levels of gamma oscillations and somewhat hypersynchronous networks.

      Taking advantage of novel techniques and analytical strategies, the manuscript provides rich, novel insight into the neurobiology of a mouse model of this severe psychiatric condition. The data is of high quality, the findings interesting and the manuscript is well written.

      Overall, the results support the authors' conclusions, although some additional analyses are necessary to corroborate their interpretations.

      Although the paper does not give information on how PV cell dysfunctions are engaged during cognitive tasks, this study can be considered as an important first step in advancing our knowledge on the basic dysfunctions of cortical networks in this model of schizophrenia

      We thank the reviewer for praising the ‘high quality’ of our work, and the ‘rich, novel insights’ on the neurobiology of a mouse model of a psychiatric disorder.

      1) The major findings stem from the analysis of the spiking activity of individual neurons recorded using either silicon probes or arrays of tetrodes. Both techniques allow simultaneous recording of many neurons from a single animal; therefore, from a statistical point of view neurons recorded from one animal are pseudo replicas and cannot be considered as independent measurements. Throughout the manuscript, the authors perform two-sample tests on the pooled data from all recorded neurons to compare differences between genotypes; therefore, artifactually increasing the power of statistical tests. Comparisons between genotypes should be performed using each mouse as an independent measurement.

      To be able to compare the data on the basis of mouse averages, we performed additional recordings, which resulted in a final data set of 9 Disc1 and 7 control mice. We recomputed the main results of this study based on mouse averages. First, consistent with our original cell-by-cell analysis, we found significantly reduced firing rates of putative INTs but not of PYRs (line 72 (69 without tracked changes)). Moreover, we confirmed our results on decreased spike transmission probability at PYR-INT connections (line 121 (107 without tracked changes)), decreased spike transmission in the resonance window (line 163 (147 without tracked changes)), reduced high gamma power (line 173 ff. (157 ff. without tracked changes)), lower phase-coupling of INT spikes to high gamma oscillations (line 178 (162 without tracked changes)), and reduced strength of assembly activations in Disc1 compared to control mice (line 229 ff. (211 ff. without tracked changes)). Similarly, we performed new analysis on INT-INT synchronization and INT spike-triggered gamma amplitudes (as requested by reviewer 1 #5 & 6), which showed significant effects on the level of mouse averages (line 188 ff. (line 172 without tracked changes)). Second, our original finding on significant differences in the synchronization of individual PYR-PYR pairs could not be reproduced on the level of individual mice. This is reported in line 215 (199 without tracked changes) of the revised manuscript. Finally, the analyses based on optogentically identified PVIs did not allow comparison by mouse averages due to the low number of experiments (n=3 mice each). Given that the vast majority of our conclusions is based on electrophysiologically identified INTs, with optogenetic identification experiments being only confirmatory in nature, and that performing additional experiments for optogentic identification of PVIs would be very laborious, we report the results of these analyses as comparisons between neurons or connected pairs. This is clearly stated at the respective sections throughout the revised manuscript. We hope that the reviewer can agree with our decision.

      2) The superficial layers of the mPFC are difficult to reach with a vertical approach of the probes due to the presence of a large blood vessel located medially in the frontal dura. Therefore, the authors are most likely reaching mPFC deep layers where PYR neurons produce fast spikes at high rates. If this is the case, this would make it difficult to sort the spiking of PYR from that of INs based on the spike kinetics and rate. The authors used opto-tagging of PVIs in a set of experiments. It would be reassuring to confirm that the spike waveform and kinetics that they extracted from PVIs are similar to those they assigned as INTs in their experiments with no opto-tagging. Identified PVIs should be statistically different from putative PYRs (not responding to light). Although opto-tagging of PVIs can solve this issue, the amount of cells isolated remains low and the number of animals is not stated. Opto-tagged cells are subsequently used for further analyses but the statistical value of those remain unclear. Since the entire interpretation of the rest of the results depend on this result, this must be clarified.

      As correctly pointed out by the reviewer, we indeed targeted deep layers of the mPFC (~0.4 mm lateral of the midline; see also the histological information about the recordings sites that is now included in Figure 1 – figure supplement 1), where higher spike rates are expected compared to superficial layers. To assess whether this might have influenced the identification of putative INTs, we separately plotted the duration and asymmetry index used to classify the neurons in PYRs and putative INTs for Disc1 and control mice. This analysis yielded well separated clusters in both cases. In addition, as suggested by the reviewer, we compared the kinetic properties (spike duration and asymmetry index) and rates of PYRs, putative INTs, and optotagged PVIs. In both genotypes, ANOVA analysis followed by Tukey post-hoc testing revealed significant differences between the PYRs and both groups of INTs, both for rate (smaller in PYRs) and kinetic properties (longer spikes of PYRs) while we found no difference between putative INTs and PVIs. These results thus suggest that the method used to identify INTs works reliably. These new data are now shown in the revised Fig. 1a and the new Figure 1 – figure supplement 2 and mentioned in line 89 ff. (85 without tracked changes) of the revised manuscript.

      We agree that the number of experiments using PVI opto-tagging is low (n=3 mice per genotype, this information is now included in the main text in line 93 ff. (88 ff. without tracked changes)). However, our analysis of spike transmission probability using the population of untagged putative fast-spiking INTs revealed similar results as for the sample of optogenetically identified PVIs. We view the PVI optotagging experiment as an additional confirmation that the difference in firing rate and spike transmission did likely not arise from sampling from different INT types in Disc1 and control mice, as pointed out in line 80 (76 without tracked changes) of the revised manuscript. The limitation of the low number of PVIs in our study is critically reflected in the revised discussion in line 249 ff. (229 without tracked changes).

      3) Proportion of gamma coupled neurons. The authors mention the use of pairwise phase consistency (PPC). PPC is a good method to measure phase coupling independent of differences in firing rates. However, it is not entirely clear how PPC is used to measure the extent of phase locking. In the methods, the authors mention that they ran the PPC analysis after determining significant phase locking with Rayleigh's test. Moreover, they provide PPC values for high gamma oscillations but not for other frequency ranges. Perhaps, it would be better to test significant coupling of all units by nonrandom spike-phase distributions crossing a confidence interval, estimated by Monte Carlo methods from independent surrogate data set. These can be obtained upon randomly jittering each spike times. Indeed, PPC values estimated by the authors for high gamma are higher for PYR than INT (Fig. 1- Fig. Suppl 4 b). This is at odds with previously published observations in V1 (e.g. Perrenoud et al., PLoS Biol. 2016 PMID: 26890123). Given the existing reports of reduced excitatory transmission in DISC-1 mice, phase locking of PYR to other frequency bands might be affected.

      Following the reviewer’s suggestion we have revised our phase-coupling analysis. First, Perrenoud et al (2016) show that gamma oscillations occur in short bursts of high power. To better reflect the coupling of putative INTs to those transient gamma events, we restricted the phase-coupling analysis to epochs within the largest quintile of gamma amplitude (assessed by the envelope of the gamma-filtered signal obtained by Hilbert transformation). Second, instead of the Rayleigh test, we obtained for each unit randomized spike trains by shuffling the inter-spike intervals (500 iterations). Significant phase locking was then obtained by testing whether two consecutive bins of the phase histogram exceeded the 95th percentile of the random distribution. This analysis was performed separately for the low (20-40 Hz) and high gamma bands (60-90 Hz) for both putative INTs and PYRs. Third, the depth of phase coupling was assessed by PPC for all significantly phase-coupled neurons. While this metric is more robust against changes in spike rates than traditional measures, it is still not completely independent of it. Perrenoud et al, for instance, showed using spike sub-sampling that the reliability in estimating PPC depends on spike rate (with >1000 spikes being optimal). However, our data set of PYRs contained fewer than 1000 spikes during high gamma events (mean Disc1: 657 ± 32, mean control: 840 ± 43). To better account for the effect of rate dependence, we restricted the analysis to neurons with >250 spikes. To further limit the potential impact of different spike counts across neurons, we used random subsampling with a fixed spike number of 250 (100 iterations per cell), computed PPC in each iteration, and averaged over the PPC estimates per cell. Finally, in response to the reviewers point 1, the results of all neurons (PYR and INT separately) were then averaged for each mouse.

      Consistent with our original analysis, we found a significantly reduced proportion of phase-coupled INTs but unaltered PPC of significantly coupled INTs to the high gamma band. Moreover, we observed no significant effects for low gamma oscillations or for the phase-coupling of PYRs to either low or high gamma bands. These results are now shown in the new Fig. 4 and the new Figure 4 – figure supplement 1, and are described in line 170 ff. (154 without tracked changes) of the revised manuscript. In addition, we provide a detailed explanation of the revised phase coupling analysis, including a formal description how PPC is computed, in the Methods section of the revised manuscript in line 524 ff. (486 without tracked changes).

      Using the revised phase-coupling analysis, we observed comparable PPC values of significantly coupled PYRs (0.013) and INTs (0.014) to high gamma in control mice. While the improved analysis thus resolved the paradoxical finding of lower PPC in INTs, we did not observe weaker phase-coupling of PYRs as reported in Perrenoud et al. (2016). A possible explanation for this discrepancy might be genuine differences in gamma coupling of the PYR population between visual cortex (Perrenoud et al., 2016) and the prefrontal cortex (our study), which will require further investigation in future.

      Reviewer #3 (Public Review):

      In the present study, the authors aim to assess network activity alterations in the prefrontal cortex of mice with a deletion variant in the schizophrenia susceptibility gene DISC1 ("DISC1 mutants"). Using silicon probe in vivo recordings from the prefrontal cortex, they find that mutant mice show reduced firing rates of fast-spiking interneurons, reduced spike transmission efficacy from pyramidal cells to interneurons, and enhanced synchronization and activation of cell assemblies. The authors conclude that "interneuron pathology is linked with the abnormal coordination of pyramidal cells, which might relate to impaired cognition in schizophrenia."

      The cellular and circuit basis of psychiatric disorders has received strong interest in the recent past. In particular, alterations of the "excitation-inhibition balance" in cortical circuits has been the focus of extensive scrutiny (reviewed in pmid 22251963). Specifically, in both human samples as well as in mouse models, disruption of interneuron development and function have been implicated in the pathogenesis of schizophrenia. In the DISC1 mouse model, studies have reported disrupted interneuron development (e.g. pmid 23631734, 27244370), reduced numbers of GABAergic neurons (e.g. pmid 18945897), reduced inhibition from GABAergic neurons ex vivo (e.g. pmid 32029441), and reduced firing rates of fast-spiking neurons in vivo in the basal forebrain (pmid 34143365).

      The present manuscript makes a potentially important contribution to this question by probing the microcircuitry of the prefrontal cortex in vivo in the DISC1 mouse model of schizophrenia. It goes beyond previous work in assessing circuit dynamics in vivo in more detail, albeit with indirect methods. The experiments and analysis have generally carefully been performed, though the statistical analysis raises some questions. The advances made by the present work compared to previous studies could be delineated more clearly.

      We thank the reviewer for praising the analysis of our data ‘…have generally carefully been performed..’ and the ‘important contribution’ of our work to the field.

  5. www.janeausten.pludhlab.org www.janeausten.pludhlab.org
    1. we women never mean to have anybody. It is a thing of course among us, that every man is refused, till he offers

      See also Emma "A woman may not marry a man merely because she is asked, or because he is attached to her" (chapter 7) and Mansfield Park "I think it ought not to be set down as certain that a man must be acceptable to every woman he may happen to like himself" (Chapter 35)

    1. Author Response

      Reviewer #1 (Public Review):

      Neural stem cells express cascades of transcription factors that are important for generating the diversity of neurons in the brain of flies and mammals. In flies, nothing is known about whether the transcription factor cascades are build from direct gene regulation, e.g. factor A binding to enhancers in gene B to activate its expression. Here, Xin and Ray show that one temporal factor, Slp1/2, is regulated transcriptionally via two molecularly defined enhancers that directly bind two other transcription factors in the cascade as well as integrating Notch signaling. This is a major step forward for the field, and provides a model for subsequent studies on other temporal transcription factor cascades.

      Thanks for the positive comments!

      Reviewer #2 (Public Review):

      The manuscript addresses an important question concerning the mechanisms regulating temporal transitions in Drosophila neural progenitors called neuroblasts. Here, they concentrate on a specific transition between the transcription factors Ey and Slp1/2 that are sequentially expressed within a cascade involving at least 6 temporal transcription factors. Using a combination of new transgenes, bioinformatics and genome-wide profiling of transcription factor biding sites (Dam-ID), they functionally characterize two enhancers of the Slp1/2 genes that are active during this transition. This led to the identification of the Notch pathway as an important facilitator of the transition. They also show that Notch signaling requires cell cycle progression and that Slp1/2 is a direct target of Ey, validating the importance of transcriptional cross-regulatory interactions among the temporal transcription factors to trigger progression.

      In my opinion, the study is very interesting, representing the first careful analysis of enhancers involved in temporal transitions in neural progenitors, and leading to new insights into the mechanisms promoting temporal progression.

      Thanks for the positive comments!

      Reviewer #3 (Public Review):

      In this manuscript, the authors present data to suggest that transcriptional activation of the Slp1/2 temporal factors in the medulla neuroblasts of the developing Drosophila optic lobe is dependent on two enhancer elements. The authors concluded that these two enhancers were able to be activated by Ey and Scro, two other factors identified to be involved in the temporal cascade of the medulla NB. The authors show that cell cycle progression is necessary for Notch signaling, and that Notch signaling activates and sustains the temporal transcription factor cascade. The authors use GFP reporter assays to correlate the enhancer activity to Slp1/2 expression and used DamID to show in-vivo binding of Su(H) and Ey to the enhancer fragments.

      I agree with the authors that it is important to define the mechanisms by which Notch, cell cycle control and these temporal transcription factors function through their cis-regulatory elements to establish this self-propagating cascade to generate diverse cell types during neurogenesis. However, the findings in this study offer limited new insights toward reaching this goal for a myriad of reasons. First, studies in invertebrate and vertebrate neurogenesis have agreed on the conceptual framework that transcriptional control plays a key role in regulating the generation of diverse cell types. The data showing the patterns of slp1/2 transcript simply reaffirm the proposed model as well as recently published single-cell transcriptomic analyses of fly optic lobe neuroblasts. Second, it remains unclear how physiologically relevant the enhancer analyses presented in this study are to the regulation of Slp1/2 expression, as the data can only suggest that they act redundantly to each other. It is also troubling to see that mutating binding sites of a single transcription factor appears to completely abolish enhancer activity while Slp1/2 protein expression is delayed in mutant clonal analyses. Third, the authors do not offer any explanation for how Notch signaling contributing to the timing of Slp1/2 expression, considering that Notch signaling should be active during the entire life of the neuroblast based on canonical Notch target gene expression. What action do Ey and Scro play in this timely enhancer activation as both appear to be necessary to activate the enhancers along with Notch. Fourth, many studies including the Okamoto et al., 2016 study cited in this study have contributed to our appreciation of the role of proper cell cycle control in promoting generation of diverse neurons in vertebrate neurogenesis. It is unclear to me if findings from the current study contribute to significant advancement on this regulatory link.

      Thanks for raising these concerns. Here are our responses:

      First, we agree that there have been great advances in this field including classical studies in the ventral nerve cord, recent studies on type II lineages and medulla including our own scRNA-seq study of medulla neuroblasts. These studies have revealed the sequential expression of transcription factors in neuroblasts of different ages, and proposed that these transcription factors form a transcriptional cascade based on the cross-regulations among them. However, these cross-regulations were based on mutant phenotypes, and in most cases, the cis-regulatory elements of these TTFs have not been characterized, and it hasn’t been studied whether these cross-regulations are direct or not. Little is known about exactly how the timing of the transition is regulated and coordinated with cell-cycle control. We have addressed these questions and identified two enhancer elements for slp1/2, and demonstrated that the previous TTF Ey, another TTF Scro, and Notch signaling directly regulate slp expression. Further we demonstrated that Notch signaling is dependent on cell cycle progression in neuroblasts, and supplying Notch signaling rescues the delay in Slp expression in cell cycle mutants. We believe this study has provided important insights in this field and is another step forward.

      Second, now we provide evidence that deletion of both enhancers specifically abolished Slp1 and Slp2 expression in medulla neuroblasts.

      Regarding the concerns about binding site mutation:

      1) Ey: With loss of Ey, Slp is completely lost. The Ey binding site mutation phenotype is consistent with loss of Ey phenotype.

      2) Su(H): For the u8772 250bp enhancer, mutating all four predicted Su(H) binding sites did abolish the reporter expression. During the revision, we generated another construct, in which we mutated the two predicted Su(H) binding sites which are perfect matches to the consensus, and found that this dramatically reduced the reporter expression. For the d5778 850bp enhancer, mutation of Su(H) binding sites caused strong glial expression which prevented us to precisely assess the neuroblast expression. Thanks to the excellent advice from review #3, we used repo-Gal4 and GFP-RNAi to remove the glial expression. This approach turned out very informative. We found that mutation of four or six out of six predicted Su(H) binding sites actually did not decrease the reporter expression in neuroblasts, suggesting that Notch signaling does not active the d5778 850bp enhancer through these binding sites. However, we think this is the explanation why this enhancer drives a delayed expression comparing to the 220bp enhancer and the endogenous Slp. In addition, this also explains why with loss of Notch signaling, endogenous Slp expression is only delayed but not completely lost. This is because although the 220bp enhancer driven expression is completely lost, the d5778 850 bp enhancer still directs a delayed expression of Slp and this expression is not dependent on Notch signaling.

      3) Scro: Mutation of Scro binding sites caused a decreased expression level of the reporter, consistent with the scro RNAi phenotype on Slp, which is a decreased expression level.

      Third, regarding how Notch signaling which is active in the entire neuroblast life, can act to activate Slp expression in a specific time We tested genetic interactions between Ey, Scro, and Notch in the regulation of Slp expression. We found that with loss of Ey, supplying constitutive active Notch or Scro is not sufficient to rescue Slp expression. Thus Ey as the previous TTF, may be required to prime the slp locus, so that Notch signaling and Scro can act to further activate Slp expression. Therefore, Notch signaling requires Ey to specifically further activate Slp at the correct time. We have added these experimental results and discussion.

      Fourth, the Okamoto et al., 2016 study actually concluded that cell cycle progression is not required for the temporal progression. In their experimental setup, they supply Notch to maintain the un-differentiated status of cortical neural progenitors when they block cell cycle progression. The observed that temporal transition still happened, and they concluded that cell cycle progression is not required for temporal transitions. However, they didn’t consider the possibility that Notch signaling, which is itself dependent on cell cycle progression, actually rescued the possible phenotype caused by arrest of cell cycle progression. Our result demonstrated that in Drosophila medulla, supplying Notch signaling can rescue the delay in the transition to the Slp stage in cell-cycle arrested neuroblasts, and further showed that the mechanism is by direct transcriptional regulation. We believe that publication of our results will be informative to the vertebrate study, promoting vertebrate researchers to re-consider the role of cell cycle progression and Notch signaling in temporal progression.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01528

      Corresponding author(s): Elena Taverna and Tanja Vogel

      1. General Statements [optional]

      We thank the reviewers for the comments and points they raised. We think what we have been asked is a doable task for us and we are confident we will manage to address all points in a satisfactory manner.

      2. Description of the planned revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Reviewer’s comment: The manuscript investigated the role of DOT1L during neurogenesis especially focusing on the earlier commitment from APs. Using tissue culture method with single-cell tracing, they found that the inhibition of DOT1L results in delamination of APs, and promotes neuronal differentiation. Furthermore, using single cell RNA-seq, they seek possible mechanisms and changes in cellular state, and found a new cellular state as a transient state. Among differentially expressed genes, they focused on microcephaly-related genes, and found possible links between epigenetic changes led by DOT1L inhibition and epigenetic inhibition by PRC2. Based on these findings, they suggested that DOT1L could regulate neural fate commitment through epigenetic regulation. Overall, it is well written and possible links from epigenetic to metabolic regulation are interesting. However, there are several issues across the manuscript.

      Response to Reviewer and planned revision:

      We thank the reviewer’s 1 for her/his comments and constructive criticism.

      We hope the revision plan will address the points raised by the reviewer in a satisfactory manner.

      Major issues:

      * *Reviewer’s comment: 1) It is not clear whether the degree of H3K79 methylation (or other histones) changes during development, and whether DOT1L is responsible for those changes. It is necessary to show the changes in histone modifications as well as the levels of DOT1L from APs to BPs and neurons, and to what extent the treatment of EPZ change the degree of histone methylation.

      Response to Reviewer and planned revision:

      • As for the level of DOT1L protein We tried several commercially available antibodies, but they do not work in the mouse, even after multiple attempts and optimization. So, unfortunately we will not be able to provide this piece of information.

      • As for the level of DOT1L mRNA We will provide info regarding the DOT1L mRNA level in APs, BPs and neurons by using scRNAseq data from E12, E14, E16 WT cerebral cortex.

      • As for the levels of H3K79methylation, we did not intend to claim that the histone methylation is responsible for the reported fate transition. We will edit the text to avoid any possible confusion. If it is deemed to be necessary to address the point raised by the reviewer, we do have 3 options, that we here in order of priority and ease of execution from our side.

      • immunofluorescence with an Ab against H3K79me2 using CON and EPZ-treated hemispheres.

      • FACS sort APs, BPs and neurons from CON and EPZ-treated hemispheres, followed by immunoblot for H3K79me2 to assess the H3K79me2 levels. As for the FACS sorting, we will use a combinatorial sorting in the lab on either a TUBB3-GFP or a GFP-reporter line using EOMES-driven mouse lines. This strategy has already been employed in the lab by Florio et al., 2015 and we will use it with minor modifications.
      • scCut&Tag for H3K79me2 from CON and EPZ-treated hemispheres. This option entails a collaboration with the Gonzalo Castelo-Branco lab in Sweden and might therefore require additional time to be established and carried out. Reviewer’s comment:

      Furthermore, the study mainly used pharmacological bath application. DOT1L has anti-mitotic effect, thus it is not clear whether the effect is coming from the inhibition of transmethylation activity.

      Response to Reviewer and planned revision:

      In a previous work we used a genetic model (DOT1L KO mouse) that showed microcephaly (Franz et al. 2019). For this study, we wanted to fill a gap in knowledge by understating if the DOT1L effect was mediated by its enzymatic activity. For this reason, we choose to use the pharmacological inhibition with EPZ, whose effect on DOT1L activity has been extensively reported and documented in literature (EPZ is a drug currently in phase clinical 3 studies).

      The stringent focus of this study on the pharmacological inhibition is thus a step toward understanding what specific roles DOT1L can play, both as scaffold or as enzyme.

      Here, we concentrate on the enzymatic function and the scaffolding function is beyond the scope of this specific study. We can further discuss and elaborate on the rationale behind this in the revised manuscript.

      Reviewer’s comment:

      In addition, the study assumed that the effect of EPZ is cell autonomous. However, if EPZ treatment can change the metabolic state in a cell, it would be possible that observed effects was non-cell autonomous. It would be important to address if this effect is coming in a cell-autonomous manner by other means using focal shRNA-KD by IUE.

      Response to Reviewer and planned revision:

      We did not claim that the effect of EPZ is cell autonomous, we are actually open on this point, as we consider both explanations to be potentially valid. We will edit the text to avoid any possible confusion on what we assume and what not.

      As a general consideration, it is entirely possible that the effects are non-cell autonomous. We will comment and elaborate on that in the revised manuscript.

      If the reviewer/journal considers this a point that must be addressed experimentally, then we will proceed as follows:

      • DOT1L shRNA-KD via in utero electroporation, followed by either
      • in situ hybridization for ASNS to check if ASNS transcript is increased upon DOT1L shRNA-KD compared to CON
      • FACS sorting of the positive electroporated cells (CON and DOT1L shRNA-KD), followed by qPCR to assess the levels of ASNS
      • If the reviewer wants us to check for a more downstream effect on fate, then we will immuno-stain the DOT1L shRNA-KD and CON with TUBB3 AB and/or TBR1 AB (as already done in the present version of the manuscript). Reviewer’s comment: 2) The possible changes in cell division and differentiation were found by very nice single-cell tracing system. However, changes in division modes occurring in targeted APs such as angles of mitotic division and the expression of mitotic markers were not addressed. These information is critical information to understand mechanisms underlying observed phenotype, delamination, differentiation and fate commitment.

      Response to Reviewer and planned revision:

      Previous effects of DOT1L manipulation on the mitotic spindle were observed in a previous paper, using DOT1L KO mouse (Franz et al. 2019). Considering that in our experiments we do use a pharmacological inhibition, we will address this point by quantifying the spindle angle in CON and EPZ-treated cortical hemispheres.

      We will co-stain for DAPI to visualize the DNA/chromosomes, and for phalloidin (filamentous actin counterstain) that allows for a precise visualization of the apical surface and of the cell contour, as it stains the cell cortex.

      Of note, the protocols we are referring to are already established in the lab, based on published work from the Huttner lab (Taverna et al, 2012; Kosodo et al, 2005).

      Reviewer’s comment: 3) The scRNA-seq analysis indicated interesting results, but was not fully clear to explain the observed results in histology. In fact, in single cell RNA-seq, the author claimed that cells in TTS are increased after EPZ treatment, which are more similar to APs. However, in histological data, they found that EPZ treatment increased neuronal differentiation. These data conflicts, thus I wonder whether "neurons" from histology data are actually neurons? Using several other markers simultaneously, it would be important to check the cellular state in histology upon the inhibition/KD of DOT1L.

      Response to Reviewer and planned revision:

      The reviewer’s comment is valid, and we indeed found that TTS cells are an intermediate state between APs and neurons in term of transcriptional profile. This is the reason why we called this cell cluster transient transcriptional state.

      We plan to address this point by staining for TBR1 and/or CTIP2 in CON and EPZ-treated hemispheres and to expand with this EOMES and SOX2 co-staining.

      Minor issues:

      Reviewer’s comment: Figure 1 - It is not clear delaminated cells are APs, BPs or some transient cells (Sox2+ Tubb3+??). It is important to use several cell type-specific and cell cycle markers simulnaneously to characterize cell-type specific identity of the analysed cells by staining. These applied to Fig1B,D,E,F,G,as well as Fig2,3.

      Response to Reviewer and planned revision:

      We will address this point by using a combinatorial staining scheme for several fate markers such as TUBB3, EOMES and SOX2, as suggested by the reviewer.

      Reviewer’s comment: - Please provide higher magnification images of labelled cells (Fig 1H)

      Response to Reviewer and planned revision:

      In the revised manuscript, we will provide higher magnification for the staining.

      Reviewer’s comment: - Please provide clarification on the criteria of Tis21-GFP+ signal thresholding.

      Response to Reviewer and planned revision:

      In the revised manuscript, we will provide a clarification on the criteria of Tis21-GFP+ signal thresholding.

      Reviewer’s comment: - Splitting the GFP signal between ventricular and abventricular does not convincingly support the "more basal and/or differentiated" states after EPZ treatment.

      Response to Reviewer and planned revision:

      We will provide a clarification regarding this point.

      Reviewer’s comment: - Please explain the presence of Tis21-GFP+ cells at the apical VZ.

      Response to Reviewer and planned revision:

      Tis21-GFP+ cells at the apical VZ has been extensively reported in the literature, since the first paper by Haubensak et al. regarding the generation of the Tis21-GFP+ line. In a nutshell, T Tis21-GFP+ cells are present throughout the VZ (therefore also in the apical portion) as neurogenic, Tis21-GFP positive cells are undergoing mitosis at the apical surface. Indeed, the presence of Tis-21 GFP signal have been extensively used by the Huttner lab and collaborators to score apical neurogenic mitosis. In addition, since AP undergo interkinetic nuclear migration, it follows that Tis21-GFP+ nuclei are going to be present throughout the entire VZ.

      In the revised manuscript, we will explain this point and cite additional literature.

      Reviewer’s comment: - Order the legends in same order as the bars.

      Response to Reviewer and planned revision:

      We will follow reviewers’ recommendation and order the legends accordingly.

      Reviewer’s comment: Figure 2 -Fig 2B) The difference between CON and EPZ apical contacts is not clear and does not match with the graph in Fig 2E.

      Response to Reviewer and planned revision:

      We will explain Fig. 2B in more detail and provide additional images in the revised manuscript.

      Reviewer’s comment: -Supp Fig 2 - are these injected slices cultured in control conditions? Please include this in the text and figure/figure legend

      Response to Reviewer and planned revision:

      In the revised manuscript, the text will be changed to address this point and provide clearer info.

      Reviewer’s comment: Fig 2C) The EPZ-treated DxA555+ cells exhibit morphological change of cell shape. Is this phenotype? please comment on the image shown for EPZ treatment panel.

      Response to Reviewer and planned revision:

      We thank the reviewer for having raised this point.

      The change in morphology might be a consequence of delamination and or of cell fate. In the revised manuscript, we will certainly better comment on this very relevant point and expand the discussion accordingly.

      Reviewer’s comment: Fig 2F - 2G) Data presented on EOMES+ and TUBB3+ % are counterintuitive. The authors claimed that TUBB3+ cells are increased and neuronal differentiation is promoted. However, no changes in EOMES+ are observed. What is the explanation? Did the author check the double positive cells? These could be TSS cells?

      Response to Reviewer and planned revision:

      We thank the reviewer to have raised this point.

      As envisioned by the reviewer, we suspect that the counterintuitive data might be due to TSS cell, which based on our scRNAseq data are expressing at the same time several cell type specific markers. It is possible that, since the treatment with EPZ is 24h long, cells (like the TTS cluster) have no time to completely eliminate the EOMES protein. If that were to be the case, then we would expect to still detect (as we indeed do) EOMES immunoreactivity.

      To address this point, we will:

      • analyze scRNA-seq data and check which is the extent of co-expression of Eomes and Tubb3 mRNAs in the TTS population.
      • Check for EOMES and TUBB3 double positive cells in the microinjection experiment. Reviewer’s comment: Figure 2 and Figure 3) the number of pairs analyzed for EPZ is twice as that of Con for comparison of the parameters taken into account. Please include n of each graph in the figure legend of the specific panel if not the same for all panels in that figure (i.e. for figure 3)

      Response to Reviewer and planned revision:

      We will revise the text accordingly.

      Reviewer’s comment:

      Figure 3) The data indicated that the number of daughter cell pairs in EPZ samples is almost double than Control. Is this the phenotype? More numbers of daughter cells in EPZ treated samples were observed from the same number of injections? or the number of injected cells were different?

      Response to Reviewer and planned revision:

      Due to technical reasons, we indeed performed a higher number of injections in EPZ-treated slices. We think this is the main reason behind the difference in number.

      If the reason were to be biological, one would expect to see the same trend in IUE experiments, but this is actually not the case. This does suggest/corroborate the idea that the reason behind the difference is mainly technical.

      Reviewer’s comment: Figure 4)

      • Please clarify if the single cell transcriptomic analysis has been performed only once, and if yes, how statistical testing to compare the cell proportion is carried out with only one batch. Fig 4G)

      Response to Reviewer and planned revision:

      As for the scRNAseq on microinjected cells:

      the scRNA-seq analysis was done once using cells pooled from 3 different microinjection experiments performed in 3 different days.

      As for the scRNAseq on IUE cells:

      The scRNA-seq analysis was done once using cells pooled from 2-3 different IUE experiments performed in 3 different days.

      For all scRNAseq experiments the statistical testing is achieved by intrasample comparisons according to established bioinformatics pipelines. We will better explain this point in the revised manuscript.

      Reviewer’s comment: Figure 4 and 5) - Figures are not supportive of the statement regarding APs' neurogenic potential upon DOT1L inhibition. TSS transcriptomic profile resembles more progenitors than neurons. Please comment on TSS neurogenic capacity taking into account the provided GO and RNAseq.

      Response to Reviewer and planned revision:

      We thank Reviewer 1 for raising this point, It is indeed true that TTS resemble more AP than neurons (as indicated in the Fig. S5B, C). We took that to indicate the fact that these cells are transient and therefore still maintain some AP features. Interestingly, TTS downregulate cell division markers, suggesting a restriction of proliferative potential, as one would expect for cells with an increased neurogenic potential. We will discuss this point in the revised manuscript.

      Reviewer’s comment: - Please provide GO analysis for APs and BPs.

      Response to Reviewer and planned revision:

      Following the reviewer’s suggestion, we will incorporate a more careful and in-depth analysis in the revised version of the manuscript.

      Reviewer’s comment: - Reconstruct figure 5A by listing genes in the same order in both Con and EPZ and prioritize EPZ-Con differences instead of cell-cell differences.

      Response to Reviewer and planned revision:

      We will revise Figure 5A based on the reviewer’s comment.

      Reviewer’s comment:

      Moreover, the presented genes in the heatmap is not the same in two conditions (i.e. NEUROG1 is present in EPZ but absent in Con). Please justify.

      Response to Reviewer and planned revision:

      This observation is based on different activities of transcription factor networks in the control and EPZ condition. They are not supposed to be the same as the cell states are altered and different TF are expressed and active upon the treatment in the diverse cell types. In a revised manuscript we will justify this point.

      Reviewer’s comment: Fig 5D)

      • Please explain why binding of EZH2 on the promoter of Asns is strongly reduced in comparison to a mild significant reduction of H3K79me/H3K27me3 in EPZ compared to Control.

      Response to Reviewer and planned revision:

      Several explanations are possible

      First, the variation can be due to batch effects.

      Second, the acute reduction of EZH2 might not be directly accompanied by a reduced histone mark, which is reduced either by cell division or by demethylases. The two processes of getting rid of the mark might be slower than the reduction of EZH2 presence at the respective site.

      Based on the reviewer’s comment, we will explain this point in the revised manuscript.

      • *

      Reviewer’s comment:

      Also is the changed directly medicated by DOT1L?

      Please test whether DOT1L can bind the promoter of Asns.

      Response to Reviewer and planned revision:

      To address this relevant issue we will proceed with the following protocol:

      • electroporate a tagged version of DOT1L into ESCs
      • select ESCs and differentiate them into NPC_48h.
      • treat NPC with DMSO (Con) or EPZ
      • harvest CON and EPZ-treated NPC
      • perform ChIP-qPCR DOT1L at the Asns promoter Reviewer’s comment: Please provide the expression patterns of DOT1L and Asns during neuronal differentiation.

      Response to Reviewer and planned revision:

      As for Dot1l

      Dot1l expression was shown in Franz et al 2019, by ISH from E12.5 to E18.5.

      As for Asns

      We will provide E14.5 in situ staining of Asns in the developing mouse brain using the Gene Paint database (see Figure below).

      We will also show immunostainings for ASNS at mid-neurogenesis, provided that Ab against ASNS works in the mouse.

      Other General comments:

      Reviewer’s comment: Please Indicate VZ, SVZ and CP on the side of the pictures/ with dot lines in the pictures both for primary figures and supplementary.

      Response to Reviewer and planned revision:

      We will revise the figures accordingly.

      Reviewer’s comment: - The Results and figures sometimes do not support the statement made by the authors

      Response to Reviewer and planned revision:

      We will carefully check on this and eliminate any overinterpretation or non-supported statements from the text.

      • Schemes are not informative/explanatory enough, i.e. time windows of treatment and sample collection, culture conditions details.

      Response to Reviewer and planned revision:

      We will revise the schemes to include more details. In particular, we plan to add a supplementary figure with a detailed visual description of the protocol, to match the detailed description presented in the materials and methods.

      Reviewer’s comment: - A more extensive characterization of TTS cells in terms of differentiation progression and integration would be enlightening

      Response to Reviewer and planned revision:

      In general, we are facing two main challenges while studying the TTS population: one is the lack of a specific marker gene for TTS, the other is the relatively small size of the TTS subpopulation.

      For these reasons, our ability to carry on an in-depth analysis of this cell state is limited.

      Considering the reviewer’s comment, in the revised manuscript we will expand the analysis ad characterization of the differentiation potential of TTS using RNA velocity trajectory.

      We can also expand the discussion on this point.

      Reviewer’s comment: - Picture quality can be improved, provide high magnification images.

      Response to Reviewer and planned revision:

      We will revise the figures to include higher magnification images.

      Reviewer #1 (Significance (Required)):

      Reviewer’s comment: The study could be important for the specific field in neural development. It aims to understand mutations in respective genes and brain malformation. If the link between epigenetic and metabolic changes is clearly shown, it will be interesting. However, the current manuscript is still rather descriptive, and clear mechanistic insights were not provided. The study have potentials and additional data will strength the value of study.

      Response to Reviewer and planned revision:

      We will address the direct impact of DOT1L and H3K79me2 on the Asns gene locus during the revision (see the rationale of the experimental strategy also in the revision plan above). We hope we will thus provide a mechanistic link between epigenetics and altered metabolome.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Reviewer’s comment: Appiah et al. present a concise manuscript that provides details and possible mechanisms of their previous work (Franz et al., 2019; Ferrari et al., 2020). The study uses diverse lines of investigation to arrive at most conclusions. However, as interesting as the data is, we find that at the present state, it is not sufficient to prove that, indeed, the asparagine metabolism is regulated by DOTL1/PRC2 crosstalk. The neurogenic shift presented in the first part of the paper is not comprehensive and, therefore, not very convincing. The quality of images provided in the main and supplementary data is less than ideal. Additional data analysis and interpretation of the scRNA seq data may be needed. The authors finally conclude with rescue experiments done in culture and in-vivo, which we believe is the stand-out part of this study. Overall the manuscript has some interesting observations that are often over-interpreted with less supporting data. The manuscript reads well but requires additional data and changes in the claims/interpretation to be suited for publication.

      Response to Reviewer and planned revision:

      In the revised manuscript, we hope we will address the comments and concerns raised by the reviewer in a satisfactory manner. Comments

      Reviewer’s comment: 1) Abstract: Is this statement correct: "DOT1L inhibition led to increased neurogenesis driven by a shift from asymmetric self-renewing to symmetric neurogenic divisions of APs. AP undergoes symmetric division for self-renewal and asymmetric neurogenic divisions.

      Response to Reviewer and planned revision:

      Based on the current literature (cit. Huttner and Kriegstein), AP undergo:

      • symmetric division for proliferative division at early stages of neurogenesis
      • asymmetric self-renewing division, generating an AP and a BP at mid neurogenesis. This division is also described as neurogenic, as it produces a BP, that is a step further than AP in term of neurogenic potential.
      • symmetric consumptive division at late neurogenesis To avoid any possible confusion, we will re-phrase the sentence to include the adjective “consumptive” and specify the composition of the progeny.

      In the revised manuscript, the sentence will read as follow:

      "DOT1L inhibition led to increased neurogenesis driven by a shift of APs from asymmetric self-renewing (generating one AP and one BP) to symmetric consumptive divisions (generating two neurons)"

      Reviewer’s comment: All the data is based on treatments with EPZ (DOTL1 inhibitor), yet no information is shown to support its targeted activity in this system. A proof of principle in the chosen experimental system is missing; for instance, examining the activity or protein level of DOTL1 and decreased methylation of the target(s) is essential.

      Response to Reviewer and planned revision:

      EPZ is a well characterized drug, that has been used previously in our lab and by others as well.

      As for our lab, the information regarding the inhibitor, its activity and efficiency in inhibiting DOT1L towards H3K79me2 was shown in Franz et al. Supplementary Fig. S6 D, E.

      In the present manuscript, an additional confirmation that EPZ targets DOT1L in regard to its H3K79me2 activity is shown in Fig. 5D.

      We would refer to this information more explicitly in a revised manuscript.

      Reviewer’s comment: 2) Figure 1: The scoring of centrosomes and cilia is insufficient to conclude delamination and increase in basal fates. The effect could be on ciliogenesis or centrosome tethering to the apical end-feet of the AP, and other possible explanations for this observation also exist. The images are too small; larger images or graphic representations could be helpful in addition to the data.

      Response to Reviewer and planned revision:

      We did not intend to claim that the change in centrosome location demonstrate delamination, but only that it suggests delamination. This criterion has been extensively used as a proxy for delamination by several labs working on the cell biology of neurogenesis, such Huttner and Gotz labs. If the issue persists, we can re-phrase in a more cautious way the text referring to Figure 1 to highlight that the data only suggest delamination.

      Response to Reviewer and planned revision:

      To make a statement regarding delamination, I would like to see either the dynamics of delamination (organotypic slices images), staining with BP markers, or morphological changes of AP (staining that will reveal loss of adherence) or comparable data to support the observation. In my opinion Supp. Figure 1 is insufficient; the single image is not convincing; I would like to see 3D reconstruction and better-quality images.

      Response to Reviewer and planned revision:

      We can certainly provide better images and co-stain with relevant markers.

      We think it is beyond the scope of the manuscript embarking in live imaging as we are not studying the dynamics of delamination per se.

      Reviewer’s comment: Tis21 data (1H), again of low quality, is only a single piece of evidence and the conclusion "suggesting that the acquisition of a basal fate was paralleled by a switch to neurogenesis" is premature. I think other cell cycle exit reporters, Fucci markers, pHis, BrdU, NeuroD, or Tbr2 reporters (Li et al., 2020, (Haydar and Sestan labs)) to name a few, are necessary to establish the conclusions. The authors should show other markers such as PAX6, EOMES, or other upper-layer markers upon cell cycle exit in the SVZ/CP. These additional experiments will assist in cell fate analysis.

      Response to Reviewer and planned revision:

      We completely understand the points raised by the reviewer, and we plan to address them by co-staining with PAX6/SOX2, PH3 and/or EOMES.

      We think establishing the Fucci or EOMES mouse system is beyond the scope of the manuscript. In addition, given the present setting of all labs involved, it would be logistically unattainable (see also comments in the section below).

      We think the co-staining scheme and plan will be informative enough to satisfactory address the concerns raised by the reviewer.

      Reviewer’s comment: 2) Figure 2: The microinjection experiments are elegant; the images, however, do not complement the experiment. The images of the microinjected cells seem not to be reconstructed from z-stacked optical slices, so often, processes are not continuous (panel B, for example); therefore, it is not clear if an apical process is indeed missing or just not seen.

      Response to Reviewer and planned revision:

      The mentioned images are reconstructed from continuous Z-stacks, as we always do given the type of data. We can provide better reconstructions and/or additional images.

      Reviewer’s comment:

      The data analysis should include other parameters; BrdU staining could have given information on cell cycle exit, PAX6, SOX2, and EOMES on the location of the cells in the VZ/sVZ. The quality of images showing EOMES and TUBB3 staining is so low that it makes the reader doubt the validity of the quantifications. "Taken together, these data suggest that the inhibition of DOT1L might favor the acquisition of a neuronal over BP cell fate" This interpretation should be subjected to more investigations. It is possible that this treatment just accelerates the AP-> BP -> Neuronal fate. The author's claim needs to be backed by additional experiments or be changed.

      Response to Reviewer and planned revision:

      To address this point, we will include in the revised manuscript staining and co-staining with PAX6, SOX2 (see also response above) and provide a BrdU labeling experiment.

      Reviewer’s comment: 3) Figure 3: The experiment concept and its performance are impressive, yet the data is insufficient. The images in A that are supposed to be representative show two cells; their location is not clear, and the expression of GFP is not clear; in fact, both pairs seem to be GFP negative (not clear what is the threshold for background). Staining with anti-GFP and a second method to follow neurogenesis is necessary.

      Response to Reviewer and planned revision:

      We did use different staining methods and schemes to follow neurogenesis. As specified above, we will deepen our analysis by using additional markers, such as TBR1.

      Reviewer’s comment: 4) On page 9, lines 8-10, the authors claim that their number of cells was "sufficient" for single-cell analysis; the numbers are Response to Reviewer and planned revision:

      In the revised manuscript, we will include the analysis of how many cells are needed to identify cluster of 6 cell types in this paradigm, based for example on the algorithms developed in Treppner et al. 2021.

      Reviewer’s comment: 5) The authors use Seurat and RaceID without their appropriate citations in the first mention during the results. The authors also stop immediately after DEG analysis along with clustering. The authors could analyze their RNA-seq data with a trajectory; to say the least, the identification/characterization of TTS and neurons as Neurons I, II, and III are insufficient. There could be multiple ways to show the "fate" of cells in the isolated FACS, which the authors have missed.

      Response to Reviewer and planned revision:

      We will include the respective citations in a revised manuscript. We provide already differentiation trajectories but will include other methods, including scVelo of FateID to extend the trajectory analyses. We kindly ask the reviewer to also refer to the comments above regarding the TTs cluster characterization as part of our effort to provide a better picture of the different clusters.

      Reviewer’s comment: 6) The authors detected candidates like Fgfr3, Nr2f1, Ofd1, and Mme as part of their treated (different approaches) datasets (from their DEG analysis). They correctly cite Huang et al., 2020 but fail to give us a sense of the consequences of these gene dysregulations. The authors can also validate if these proteins are expressed in their treated cells.

      Response to Reviewer and planned revision:

      In the revised manuscript we will comment on the function of the four genes mentioned.

      In addition, we will validate the expression of these genes on protein and transcriptional level through immunostainings -provided that antibodies are working in our system- or smFISH, respectively.

      Reviewer’s comment: 7) The authors list a few GO terms (page 10, lines 1-10) and associate them with reduced proliferation; they must cite relevant studies. The authors can also add supplementary data showing which genes in their data correspond to these GO terms.

      Response to Reviewer and planned revision:

      We thank the reviewer for pointing out the missing citations.

      We of course agree on the need to add them, and we will do so in the revised manuscript.

      Reviewer’s comment: 8) On Page 11, lines 3-7, the authors describe their method to arrive at the 17 targets with TF activity from the previous analysis. Can the authors describe the method used to correlate the two? The reviewer understands this could be MEME analysis or analysis of earlier datasets of Ferrari et al. 2020. But it must be explicitly stated, and a few examples in supplementary need to be exemplified as this analysis is key to discovering the three metabolic genes.

      Response to Reviewer and planned revision:

      In the revised manuscript, we will clarify the exact analysis that resulted in the identification of the 17 target genes, using the specific tool for gene network analysis, that is based on our scRNA-seq data alone, but not on the Ferrari et al 2020 data set.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      n/a

      4. Description of analyses that authors prefer not to carry out

      Reviewer’s comment: Tis21 data (1H), again of low quality, is only a single piece of evidence and the conclusion "suggesting that the acquisition of a basal fate was paralleled by a switch to neurogenesis" is premature. I think other cell cycle exit reporters, Fucci markers, pHis, BrdU, NeuroD, or Tbr2 reporters (Li et al., 2020, (Haydar and Sestan labs)) to name a few, are necessary to establish the conclusions. The authors should show other markers such as PAX6, EOMES, or other upper-layer markers upon cell cycle exit in the SVZ/CP. These additional experiments will assist in cell fate analysis.

      Response to Reviewer and planned revision:

      As pointed out above, we think establishing the Fucci or EOMES mice system is beyond the scope of the manuscript as it will not provide more information than the ones we will obtain from systematic and extensive co-staining experiments. In addition, all labs involved are facing a logistic issue (animal house not ready yet, construction works etc) that made the importing and setting up of the colony unattainable for the next 6-10months. If the reviewer and/or the editorial board think this is a major point compromising the entire revision, we kindly ask to contact us again so that we can discuss the issue and arrive to a shared conclusion.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Appiah et al. present a concise manuscript that provides details and possible mechanisms of their previous work (Franz et al., 2019; Ferrari et al., 2020). The study uses diverse lines of investigation to arrive at most conclusions. However, as interesting as the data is, we find that at the present state, it is not sufficient to prove that, indeed, the asparagine metabolism is regulated by DOTL1/PRC2 crosstalk. The neurogenic shift presented in the first part of the paper is not comprehensive and, therefore, not very convincing. The quality of images provided in the main and supplementary data is less than ideal. Additional data analysis and interpretation of the scRNA seq data may be needed. The authors finally conclude with rescue experiments done in culture and in-vivo, which we believe is the stand-out part of this study. Overall the manuscript has some interesting observations that are often over-interpreted with less supporting data. The manuscript reads well but requires additional data and changes in the claims/interpretation to be suited for publication.

      Comments

      1. Abstract: Is this statement correct: "DOT1L inhibition led to increased neurogenesis driven by a shift from asymmetric self-renewing to symmetric neurogenic divisions of APs". AP undergoes symmetric division for self-renewal and asymmetric neurogenic divisions.

      All the data is based on treatments with EPZ (DOTL1 inhibitor), yet no information is shown to support its targeted activity in this system. A proof of principle in the chosen experimental system is missing; for instance, examining the activity or protein level of DOTL1 and decreased methylation of the target(s) is essential. <br /> 2. Figure 1: The scoring of centrosomes and cilia is insufficient to conclude delamination and increase in basal fates. The effect could be on ciliogenesis or centrosome tethering to the apical end-feet of the AP, and other possible explanations for this observation also exist. The images are too small; larger images or graphic representations could be helpful in addition to the data.

      To make a statement regarding delamination, I would like to see either the dynamics of delamination (organotypic slices images), staining with BP markers, or morphological changes of AP (staining that will reveal loss of adherence) or comparable data to support the observation. In my opinion Supp. Figure 1 is insufficient; the single image is not convincing; I would like to see 3D reconstruction and better quality images.

      Tis21 data (1H), again of low quality, is only a single piece of evidence and the conclusion "suggesting that the acquisition of a basal fate was paralleled by a switch to neurogenesis" is premature. I think other cell cycle exit reporters, Fucci markers, pHis, BrdU, NeuroD, or Tbr2 reporters (Li et al., 2020, (Haydar and Sestan labs)) to name a few, are necessary to establish the conclusions. The authors should show other markers such as PAX6, EOMES, or other upper-layer markers upon cell cycle exit in the SVZ/CP. These additional experiments will assist in cell fate analysis. 2. Figure 2: The microinjection experiments are elegant; the images, however, do not complement the experiment. The images of the microinjected cells seem not to be reconstructed from z-stacked optical slices, so often, processes are not continuous (panel B, for example); therefore, it is not clear if an apical process is indeed missing or just not seen. The data analysis should include other parameters; BrdU staining could have given information on cell cycle exit, PAX6, SOX2, and EOMES on the location of the cells in the VZ/sVZ. The quality of images showing EOMES and TUBB3 staining is so low that it makes the reader doubt the validity of the quantifications. <br /> "Taken together, these data suggest that the inhibition of DOT1L might favor the acquisition of a neuronal over BP cell fate" This interpretation should be subjected to more investigations. It is possible that this treatment just accelerates the AP-> BP -> Neuronal fate. The author's claim needs to be backed by additional experiments or be changed. 3. Figure 3: The experiment concept and its performance are impressive, yet the data is insufficient. The images in A that are supposed to be representative show two cells; their location is not clear, and the expression of GFP is not clear; in fact, both pairs seem to be GFP negative (not clear what is the threshold for background). Staining with anti-GFP and a second method to follow neurogenesis is necessary. 4. On page 9, lines 8-10, the authors claim that their number of cells was "sufficient" for single-cell analysis; the numbers are <500 for all samples. The authors need to justify this statement or articles that carefully analyze the number required for such a conclusion as references. 5. The authors use Seurat and RaceID without their appropriate citations in the first mention during the results. The authors also stop immediately after DEG analysis along with clustering. The authors could analyze their RNA-seq data with a trajectory; to say the least, the identification/characterization of TTS and neurons as Neurons I, II, and III are insufficient. There could be multiple ways to show the "fate" of cells in the isolated FACS, which the authors have missed. 6. The authors detected candidates like Fgfr3, Nr2f1, Ofd1, and Mme as part of their treated (different approaches) datasets (from their DEG analysis). They correctly cite Huang et al., 2020 but fail to give us a sense of the consequences of these gene dysregulations. The authors can also validate if these proteins are expressed in their treated cells. 7. The authors list a few GO terms (page 10, lines 1-10) and associate them with reduced proliferation; they must cite relevant studies. The authors can also add supplementary data showing which genes in their data correspond to these GO terms. 8. On Page 11, lines 3-7, the authors describe their method to arrive at the 17 targets with TF activity from the previous analysis. Can the authors describe the method used to correlate the two? The reviewer understands this could be MEME analysis or analysis of earlier datasets of Ferrari et al. 2020. But it must be explicitly stated, and a few examples in supplementary need to be exemplified as this analysis is key to discovering the three metabolic genes.

      Significance

      Appiah et al. present a concise manuscript that provides details and possible mechanisms of their previous work (Franz et al., 2019; Ferrari et al., 2020). The study uses diverse lines of investigation to arrive at most conclusions. However, as interesting as the data is, we find that at the present state, it is not sufficient to prove that, indeed, the asparagine metabolism is regulated by DOTL1/PRC2 crosstalk. The neurogenic shift presented in the first part of the paper is not comprehensive and, therefore, not very convincing. The quality of images provided in the main and supplementary data is less than ideal. Additional data analysis and interpretation of the scRNA seq data may be needed. The authors finally conclude with rescue experiments done in culture and in-vivo, which we believe is the stand-out part of this study.

      Overall the manuscript has some interesting observations that are often over-interpreted with less supporting data. The manuscript reads well but requires additional data and changes in the claims/interpretation to be suited for publication.

    1. I like to think of thoughts as streaming information, so I don’t need to tag and categorize them as we do with batched data. Instead, using time as an index and sticky notes to mark slices of info solves most of my use cases. Graph notebooks like Obsidian think of information as batched data. So you have a set of notes (samples) that you try to aggregate, categorize, and connect. Sure there’s a use case for that: I can’t imagine a company wiki presented as streaming info! But I don’t think it aids me in how I usually think. When thinking with pen and paper, I prefer managing streamed information first, then converting it into batched information later— a blog post, documentation, etc.

      There's an interesting dichotomy between streaming information and batched data here, but it isn't well delineated and doesn't add much to the discussion as a result. Perhaps distilling it down may help? There's a kernel of something useful here, but it isn't immediately apparent.

      Relation to stock and flow or the idea of the garden and the stream?

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this study, the authors consider the problem of inferring transcription dynamics from smFISH data. They distinguish between two important experimental situations. The first one considers measurements of mature mRNAs, while the second one considers measurements of nascent mRNA through fluorescent probes targeting PP7 stem loops. The former problem has been previously dealt with extensively, but less work has been done on the context of the latter. The inference approaches are based on maximum likelihood estimation, from which point estimates for promoter-switching and transcription rates are obtained. The study focuses on steady state measurements only. The authors perform several analyses using synthetic data to understand the limitations of both approaches. They find that inference from nascent mRNA is more reliable than inference from mature mRNA distributions. Moreover, they show that accounting for different cell-cycle stages (G1 vs G2) is important and that pooling measurements across the cell-cycle can lead to quantitatively and even qualitatively different inferences. Both approaches are then used to analyze transcription in an experimental system in yeast, for which they find evidence of gene dosage compensation. I consider this an interesting and relevant study, which will appeal to the systems- and computational biology community. The paper is well written and the (computational) methods are described in detail. The experimental description is quite minimal and could profit from further details / explanations. I have several technical criticisms and questions, which I believe should be addressed before publication. Since I am a theorist, I will comment predominantly on the statistical / computational aspects.

      Major comments/questions:

      -A key reference that is missing is Fritzsch et al. Mol Syst Biol (2018). In this work, the authors have used nascent mRNA distributions and autocorrelations (obtained from live-imaging) to infer promoter- and transcription dynamics. I believe this work should be appropriately cited and discussed.

      Synthetic case study:

      -Inference and point estimates. The authors use a maximum-likelihood framework to extract point estimates of the parameters. Subsequently, relative absolute differences are used to assess the accuracy of the inference. However, as far as I have understood, this is performed for only a single simulated dataset, for each considered parameter configuration. The resulting metric, however, does not really capture the inference accuracy, since it is based on a single (random) realization of the MLE. I would recommend to at least repeat the inference multiple times for different realizations of the simulated dataset (per parameter configuration) to get a better feeling of the distribution of the MLE (e.g., its bias / variance). Alternatively, identifiability analyses based on the Fisher information could be performed for (some of) the different parameter configurations although this may be computationally more demanding.

      -It would be useful to include confidence intervals based on profile likelihoods also for the synthetic case study, in particular for the 6 reported datasets. I would also find it helpful to see comprehensive profile likelihood plots for the key results / parameter inferences in the supplement. This would also provide useful insights into the identifiability of the parameters.

      Experimental case study:

      -Validation against live-cell data. In the simulation of the autocorrelation function, what was the ratio of cells initialized in G1 / G2, respectively? I'd expect this to have direct influence on the simulated ACF. Moreover, a linear fit is used to correct for "non-stationary effects" in the ACF that supposedly stem from cell-cycle dynamics. First, I don't think this terminology is really accurate, since non-stationarity would lead to an ACF that depends on two parameters (tau_1 and tau_2). I suppose the goal of the linear correction is to remove slow / static population heterogeneity? If yes, wouldn't it be easier / more direct to also change the simulations to non-synchronized cell-cycles? In this case, they should also display the very slow / static components as displayed in the data, which would eliminate the need for the post-hoc correction. I was also wondering whether other statistics (e.g., mean, variance, distributions) match between the simulations and the live-cell experiment? This could provide further validation of the inferred parameters.

      -If I understood correctly, the signal intensity of the measured transcription spot is normalized by the median cytoplasmic spot brightness. Since the normalized intensity of a single complete transcript is 1, the cumulative intensity should give a lower bound on the nascent mRNAs. The histograms in Fig. 4b show intensity values in the range of 30, which would mean that at least 30 transcripts contribute to the transcription spot. The total number of nucleoplasmic and cytoplasmic mRNA, however, is in the range of 10 (Fig. 3a). I am probably missing something but how can we reconcile these numbers? The authors mention that the brightest spot just counts for one transcript, but argue that this has negligible influence on mature RNA counts. Could this be a possible explanation for the mismatch?

      Minor comments:

      -In the experimental case study, the authors argue that the "correct" inference result is the one that accounts for cell-cycle stage, while the other one termed "incorrect". I find this terminology too strong, since every estimate is subject to uncertainty.

      -Page 2: "... in a asynchronous population" -> "... in an asynchronous population"

      -Page 7: "...parameters sets 3 and 4" -> "...parameter sets 3 and 4"

      -Figures 5a and 6a: parameter names and units should go on the y-axis.

      Significance

      Quantifying kinetic parameters from incomplete and noisy experimental data is a core problem in systems biology. I therefore consider this manuscript to be very relevant to this field. The contribution of this manuscript is largely methodological, although its potential usefulness is demonstrated using experimental data in yeast.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In recent years, the field has investigated crosstalk between cGMP and cAMP signaling (PMID: 29030485), lipid and cGMP signaling (PMID: 30742070), and calcium and cGMP signaling (PMID: 26933036, 26933037). In contrast to the Plasmodium field, which has benefited from proteomic experiments (ex: PMID 24594931, 26149123, 31075098, 30794532), second messenger crosstalk in T. gondii has been probed predominantly through genetic and pharmacological perturbations. The present manuscript compares the features of A23187- and BIPPO-stimulated phosphoproteomes at a snapshot in time. This is similar to a dataset generated by two of the authors in 2014 (PMID: 24945436), except that it now includes one BIPPO timepoint. The sub-min​​ute phosphoproteomic timecourse following A23187 treatment in WT and ∆cdpk3 parasites is novel and would seem like a useful resource.

      CDPK3-dependent sites were detected on adenylate cyclase, PI-PLC, guanylate cyclase, PDE1, and DGK1. This motivated study of lipid and cNMP levels following A23187 treatment. The four PDEs determined to have A23187-dependent phosphosites were characterized, including the two PDEs with CDPK3-dependent phosphorylation, which were found to be cGMP-specific. However, cGMP levels do not seem to differ in a CDPK3- or A23187-dependent manner. Instead, cAMP levels are elevated in ∆cdpk3 parasites. This would seem to implicate a feedback loop between CDPK3, the adenylyl cyclase, and PKA/PKG: CDPK3 activity reduces adenylyl cyclase activity, which reduces PKA activity, which increases PKG activity. The authors don't pursue this direction, and instead characterize PDE2, which does not have CDPK3-dependent phosphosites, and seems out of place in the study

      Response:

      We agree with reviewer 1 that a feedback loop between CDPK3, the adenylyl cyclase and PKA/PKG is certainly one of several possibilities (and we acknowledge this in the manuscript).

      We felt, however, that given the observation that A23187 and BIPPO treatment leads to phosphorylation of numerous PDEs (hinting at the presence of an Ca2+-regulated feedback loop), it was entirely relevant to study these in greater detail. Coupled with the A23187 egress assay on ΔPDE2 parasites - our findings suggest that PDE2 plays an important role in this signalling loop (an entirely novel finding). While PDE2 appears to exert its effects in a CDPK3-independent manner (indeed suggesting that CDPK3 might exert its effects on cAMP levels in a different fashion), this does not detract from the important finding that PDE2 is one of the (likely numerous) components that is regulated in a Ca2+-dependent feedback loop to regulate egress.

      We have modified our writing to better reflect the fact that our decision to pursue study of the PDEs was not solely CDPK3-centric.

      While we feel that our reasoning for studying the PDEs is solid, we appreciate that further clarification on the putative CDPK3-Adenylate cyclase link would make it easier for the reader to follow the rationale.

      We have not studied the direct link between CDPK3 and the Adenylate Cyclase β in more detail, as ACβ alone was shown to not play a major role in regulating lytic growth (Jia et al., 2017).

      **MAJOR COMMENTS**

      1.Some of the key conclusions are not convincing.

      The data presented in Figure 6E, F, and G and discussed in lines 647-679 are incongruent. In Figure 6E, the plaques in the PDE2+RAP image are hardly visible; how can it be that the plaques were accurately counted and determined not to differ from vehicle-treated parasites?

      Are the images in 6E truly representative? Was the order of PDE1 and PDE2 switched? The cited publication by Moss et al. 2021 (preprint) is not in agreement with this study, as stated. That preprint determined that parasites depleted of PDE2 had significantly reduced plaque number and plaque size (>95% reduction); and parasites depleted of PDE1 had a substantially reduced plaque size but a less substantial reduction in plaque number.

      Response:

      The plaques for PDE2+RAP were counted using a microscope since they are difficult to see by eye. We thank the reviewer for detecting our incorrect reference to Moss et al. (2021). This has been corrected in the text. We confirm, however, that the images in 6E are representative of what we observed and do indeed differ from what was seen by Moss et al.. We have acknowledged this clearly in the text.

      The differences cannot easily be explained other than by the different genetic systems used. Further studies of the individual PDEs will likely illuminate their role in invasion/ growth, but we feel this would be beyond the scope of this study.

      Unfortunately, the length of time required for PDE depletion (72h) is incompatible with most T. gondii cellular assays (typically performed within one lytic cycle, 40-48h). Although the authors performed the assays 3 days after initial RAP treatment, is there evidence that non-excised parasites don't grow out of the population. This should be straightforward to test: treat, wait 3 days, infect onto monolayers, wait 24-48h fix, and stain with anti-YFP and an anti-Toxoplasma counterstain. The proportion of the parasite population that had excised the PDE at the time of the cellular assays will then be known, and the reader will have a sense of how complete the observed phenotypes are. As a reader, I will regard the phenotypes with some level of skepticism due to the long depletion time, especially since a panel of PDE rapid knockdown strains (depletion in __Response:

      1. Cellular assays using KO parasites are commonly performed at the point at which protein depletion is detected. Both our western blots and plaque assay results demonstrate that, at the point of assay, there is no substantial outgrowth of non-excised parasites. The original manuscript also includes PCRs performed at the 72 hr time point (See Fig. 6B) to support this.
      2. We appreciate the reviewer’s comment re the panel of PDE KD strains. The reviewer notes that there are substantial limitations to conditional KO systems, which similarly applies to KD systems - there are notable pros and cons to each approach. When designing our strategy (pre-publication of the Moss et al., 2022), we made a deliberate decision to use conditional KO strains in light of the fact that residual protein levels in KD systems can cause significant problems, particularly for membrane proteins (all of the investigated PDEs have a transmembrane domain). Tagging of proteins with the degradation domain can have further issues, leading to protein mis-localisation, which we have experienced with several unrelated proteins in the lab.

        The authors should qualify some of their claims as preliminary or speculative, or remove them altogether.

      The claims in lines 240-260 are confusing. It seems likely that the two drug treatments have at least topological distinctions in the signaling modules, given that cGMP-triggered calcium release is thought to occur at internal stores, whereas A23187-mediated calcium influx likely occurs first at the parasite plasma membrane.The authors' proposed alternative, that treatment-specific phosphosite behavior arises from experimental limitations and "mis-alignment", is unsatisfying for the following reasons: (1) From the outset, the authors chose different time frames to compare the two treatments (15s for BIPPO vs. 50s for A23187); (2) the experiment comprises a single time point, so it does not seem appropriate to compare the kinetics of phosphoregulation. There is still value in pointing out which phosphosites appear treatment-specific under the chosen thresholds, but further claims on the basis of this single-timepoint experiment are too speculative. Lines 264-267 and 281-284 should also be tempered.

      Relatedly, graphing of the data in Figure 1G (accompanying the main text mentioned above) was confusing. Why is one axis a ratio, and the other log10 intensity? What does log10 intensity tell you without reference to the DMSO intensity? Wouldn't you want the L2FC(A23187) vs. L2FC(BIPPO) comparisons? Could you use different point colors to highlight these cases on plot 1E? Additionally, could you use a pseudocount to include peptides only identified in one treatment condition on the plot in 1E? (Especially since these sites are mentioned in lines 272-278 but are not on the plot)

      Response:

      1. The kinetics of the responses to A23187 and BIPPO are very different. This is why treatment timings are purposely different as they were selected to align pathways to a point where calcium levels peak just prior to calcium re-uptake. We make no mention of kinetic comparisons, and merely demonstrate that at the chosen timepoints, overall signalling correlation is very high. The observation that most of the sites that behave differently between conditions sit remarkably close to the threshold for differential regulation (in the treatment condition where they are not DR - see Fig. 1G) led us to speculate that many of these sites are likely on the cusp of differential regulation. While it is entirely possible that some of these differences are, in fact, treatment specific (and we clearly acknowledge this in the text), we simply state that we cannot confidently discern clear signalling features that allow us to distinguish between the two treatments. We feel that this is an entirely relevant observation given the observed preponderance of both A23187 and BIPPO-dependent DR phosphosites on proteins in the PKG signalling pathway (as current models place this upstream of Ca2+release).
      2. Log10 intensity only serves to spread the data for easier visualisation. The only comparison being made relates to the LFCs. Fig. 1Gi shows the LFC scores (x axis) for all sites regulated following A23187 treatment (for which peptides were also identified in BIPPO treatment). On this plot we have highlighted the sites that are differentially regulated following BIPPO but not A23187 treatment (with red showing the DRup and blue showing the DRdown sites). This demonstrates that many of the sites that are regulated following BIPPO but not A23187 treatment cluster close to the threshold for differential regulation in the A23187 dataset - suggesting that many of these sites are likely on the cusp of differential regulation. Fig. 1Gii shows the reverse. While we could highlight the above-mentioned sites on the plot in Fig. 1E, we do not feel that it would demonstrate our point as clearly.

      We feel that including a pseudocount on Fig. 1E for peptides lacking quantification in one treatment condition would be visually misleading as the direct correlation being made in Fig. 1E is BIPPO vs A23187 treatment. The sites mentioned in lines 272-278 in the original manuscript (now lines 268-276) are available in the supplement tables.

      3.Additional experiments would be essential to support the main claims of the paper.

      Genetic validation is necessary for the experiments performed with the PKA inhibitor H89. H89 is nonspecific even in mammalian systems (PMID: 18523239) and in this manuscript it was used at a high concentration (50 µM) The heterodimeric architecture of PKA in apicomplexans dramatically differs from the heterotetrameric enzymes characterized in metazoans (PMID: 29263246), so we don't know what the IC50 of the inhibitor is, or whether it inhibits competitively. Two inducible knockdown strains exist for PKA C1 (PMID: 29030485, 30208022). The authors could request one of these strains and construct a ∆cdpk3 in that genetic background, as was done for the PDE2 cKO strain. Estimated time: 3-4 weeks to generate strain, 2 weeks to repeat assays.

      Response:

      1. While we appreciate that H89 is not 100% specific for PKA, this is not our only line of evidence that cAMP levels are altered. We demonstrate that cAMP levels are elevated in CDPK3 KO parasites – further substantiating our finding.

      The H89 concentration used in our experiment is in keeping with/lower than the concentrations used in other Toxoplasma publications (Jia et al., 2017), and both the Toxoplasma and Plasmodium fields have shown convincingly that H89 treatment phenocopies cKD/cKO of PKA (see Jia et al., 2017; Flueck et al., 2019).

      While we agree that the genetic validation suggested by reviewer 1 would serve to further support our findings (though it would not provide further novel insights), the suggested time frame for experimental execution was not realistic. Line shipment, strain generation, subcloning and genetic validation would take substantially longer than 3-4 weeks.

      cGMP levels are found to not increase with A23187 treatment, which is at odds with a previous study (lines 524-560). The text proposes that the differences could arise from the choice of buffer: this study used an intracellular-like Endo buffer (no added calcium, high potassium), whereas Stewart et al. 2017 used an extracellular-like buffer (DMEM, which also contains mM calcium and low potassium). An alternative explanation is that 60 s of A23187 treatment does not achieve a comparable amount of calcium flux as 15 s of BIPPO treatment, and a calcium-dependent effect on cGMP levels, were it to exist, could not be observed at the final timepoint in the assay. The experiments used to determine the kinetics of calcium flux following BIPPO and A23187 treatments (Fig. 1B, C) were calibrated using Ringer's buffer, which is more similar to an extracellular buffer (mM calcium, low potassium). In this buffer, A23187 treatment would likely stimulate calcium entry from across the parasite plasma membrane, as well as across the membranes of parasite intracellular calcium stores. By contrast, A23187 treatment in Endo buffer (low calcium) would likely only stimulate calcium release from intracellular stores, not calcium entry, since the calcium concentration outside of the parasite is low. Because calcium entry no longer contributes to calcium flux arising from A23187 treatment, it is possible that the calcium fluxes of A23187-treated parasites at 60 s are "behind" BIPPO-treated parasites at 15 s. The researchers could control these experiments by *either* (i) performing the cNMP measurements on parasites resuspended in the same buffer used in Figure 1B, C (Ringer's) or (ii) measuring calcium flux of extracellular parasites in Endo buffer with BIPPO and A23187 to determine the "alignment" of calcium levels, as was done with intracellular parasites in Figure 1C. No new strains would have to be generated and the assays have already been established in the manuscript. Estimated time to perform control experiments with replicates: 2 weeks. This seems like an important control, because the interpretation of this experiment shifts the focus of the paper from feedback between calcium and cGMP signaling, which had motivated the initial phosphoproteomics comparisons, to calcium and cAMP signaling. Further, the lipidomics experiments were performed in an extracellular-like buffer, DMEM, so it's unclear why dramatically different buffers were used for the lipidomics and cNMP measurements.

      Response:

      While the initial calibration experiments to measure calcium flux were indeed performed in Ringer’s buffer, the parasites were intracellular. We therefore chose to measure cNMP concentrations of extracellular parasites syringe lysed in Endo buffer, which is better at mimicking intracellular conditions than any other described buffer.

      As the reviewer suggested, we measured the calcium flux of extracellular parasites in Endo buffer upon stimulation with either A23187 or BIPPO.

      We found that peak calcium response to BIPPO in Endo buffer was similar to that of intracellular parasites (~15 seconds post treatment) (See Supp Fig. 6A). Upon treatment with A23187, extracellular parasites in Endo buffer had a much faster response compared to their intracellular counterparts, with peak flux measured at ~25 seconds post treatment (see Supp Fig. 6B). This indeed does suggest that extracellular parasites in Endo buffer behave differently to A23187 compared to their intracellular counterparts. However, peak calcium response is still occuring within the experimental time course and is not being missed, as the reviewer worries. Moreover, since we are able to detect increased cAMP levels in A23187 treated parasites, Ca2+ flux appears sufficient to alter cNMP signalling.

      We did notice however that the intensity of the calcium flux was much weaker in Endo buffer compared to intracellular parasites (see Supp Fig. 6B). We found that this was due to the lack of host-derived Ca2+, since supplementation of Endo buffer with 1 uM CaCl2 restored the intensity of the calcium response to match that of intracellular parasites (see Supp Fig. 6C). We therefore decided to repeat our cGMP measurements, this time using extracellular parasites in Endo buffer supplemented with 1 uM CaCl2. However, we found no differences in cGMP levels in the response to ionophore under these conditions (now Supp Fig. 6D) compared to the previous experiments, so the conclusions from the previous data do not change.

      As for the lipidomics experiments, we chose to use DMEM so that our dataset could be compared with other published lipidomic datasets (Katris et al., 2020; Dass et al., 2021) where DMEM was also used as a buffer when measuring global lipid profiles of parasites.

      We now acknowledge in the paper that Endo buffer has its shortcomings, and that this could be the reason why we do not detect changes in cGMP concentrations. We do, however, believe that Endo buffer is the best alternative to intracellular parasites and is supported by its consistent use in numerous publications studying Toxoplasma signalling (McCoy et al., 2012; Stewart et al., 2017).

      Additional information is required to support the claim that PDE2 has a moderate egress defect (lines 681-687). T. gondii egress is MOI-dependent (PMID: 29030485). Although the parasite strains were used at the same MOI, there is no guarantee that the parasites successfully invaded and replicated. If parasites lacking PDE2 are defective in invasion or replication, the MOI is effectively decreased, which could explain the egress delay. Could the authors compare the MOIs (number of vacuoles per host cell nuclei) of the vehicle and RAP-treated parasites at t = 0 treatment duration to give the reader a sense of whether the MOIs are comparable?

      Response:

      Since PDE2 KO parasites have a substantial growth defect, we did notice that starting MOIs were consistently lower for the RAP-treated samples compared to the DMSO-treated samples. However, this was also the case for PDE1 KO parasites where we did not see an egress delay. We also found that the egress delay was still evident for ∆CDPK3 parasites, despite having higher starting MOIs than WT parasites in our experiments. Therefore there does not appear to be a link between starting MOIs and the egress delay.

      To be sure of our results, we also performed egress assays where we co-infected HFFs with mCherry-expressing WT parasites (WT ∆UPRT) and GFP-expressing PDE2 cKO parasites that were treated with either DMSO or RAP or ∆CDPK3 parasites. This recapitulated our previous findings, confirming the deletion of PDE2 leads to delay in A23187-mediated egress.

      4.A few references are missing to ensure reproducibility.

      The manuscript states that the kinetic lipidomics experiments were performed with established methods, but the cited publication (line 497) is a preprint. These are therefore not peer reviewed and should be described in greater detail in this manuscript, including any relevant validation.

      Response:

      We thank the reviewer for pointing this out. We have included a greater description of the methods used in the materials and methods section such that the experiment is reproducible, as per the reviewer’s suggestion. We decided to still make mention of the BioRxiv preprint since we thought it was appropriate for the reader to be informed of ongoing developments in the field.

      Please cite the release of the T. gondii proteomes used for spectrum matching (lines 972-973).

      Response:

      We have included this as per the reviewer’s suggestion.

      Please include the TMT labeling scheme so the analysis may be reproduced from the raw files.

      Response:

      We have included this as per the reviewer’s suggestion in Supp Fig. 3A.

      5.Statistical analyses should be reviewed as follows:

      Have the authors examined the possibility that some changes in phosphopeptide abundance reflect changes in protein abundance? This may be particularly relevant for comparisons involving the ∆cdpk3 strain. Did the authors collect paired unenriched proteomes from the experiments performed? Alternatively, there may be enriched peptides that did not change in abundance for many of the proteins that appear dynamically phosphorylated.

      Response:

      We did not collect unenriched proteomes from the experiments performed (although we did perform unenriched mixing checks to ensure equal loading between samples), and believe that this wasn’t a necessity for the following reasons:

      1. For within-line treatment analyses, treatment timings are so short (a maximum of 15-50s in the single timepoint experiment) that it would be unlikely to detect substantial changes in protein abundance. Moreover, these unlikely events would affect all phosphosites across a protein, and therefore be detectable.

      In our CDPK3 dependency timecourse experiments, we normalise both the WT and ∆CDPK3 strain to 0s, and measure signalling progression over time. Therefore, any difference at timepoints that are not “0” are not originating from basal differences. We also see a consistent increase/decrease in phosphosite detection across the sub-minute timecourse, further confirming that the observed changes are truly down to dynamic changes in phosphorylation and not protein levels.

      In the single timepoint CDPK3 dependency analyses (44 regulated sites identified, Data S2), we acknowledge that there could be some risk of altered starting protein abundance between lines. However, if protein abundance were responsible for the changes in phosphosite detection, we would expect all phosphosites across the protein to shift, and we do not observe this. Moreover, when we look at these CDPK3 dependent proteins and compare their phosphosite abundance in untreated WT and ∆CDPK3 lines, we find that for each protein, either all or the majority of phosphosites detected are unchanged (highlighting that there is no substantial difference in this protein’s abundance between lines). Where there are phosphosite differences between lines, these are only ever on single sites on a protein while most other sites are unchanged - implying that these are changes to basal phosphorylation states and not protein levels.

      It seems like for Figs. 3B and S5 the maximum number of clusters modeled was selected. Could the authors provide a rationale for the number of clusters selected, since it appears many of the clusters have similar profiles.

      The number of clusters is chosen automatically by the Mclust algorithm as the value that maximizes the Bayes Information Criterion (BIC). BIC in effect balances gains in model fit (increasing log-likelihood) against increasing the number of parameters (i.e. number of clusters).

      Please include figure panel(s) relating to gene ontology. Relevant information for readers to make conclusions includes p-value, fold-enrichment or gene ratio, and some sort of metric of the frequency of the GO term in the surveyed data set. See PMID: 33053376 Fig. 7 and PMID: 29724925 Fig. 6 for examples or enrichment summaries. Additionally, in the methods, specify (i) the background set, (ii) the method used for multiple test correction, (iii) the criteria constituting "enrichment", (iv) how the T. gondii genome was integrated into the analysis, (v) the class of GO terms (molecular function, biological process, or cellular component), (vi) any additional information required to reproduce the results (for example, settings modified from default).

      Response:

      We have included the additional information requested in the materials and methods.

      We purposely did not include GO figure panels as our analyses are being done across many clusters, making it very difficult to display this information cohesively. We have included all data in Tables S2-S5. These tables included all the relevant information on p-value, enrichment status, ratio in study/ratio in population, class of GO terms etc.

      The presentation of the lipidomics experiments in Figure 4A-C is confusing. First, the ∆cdpk3/WT ratio removes information about the process in WT parasites, and it's unclear why the scale centers on 100 and not 1. Second, the data in Figure S6 suggests a more modest effect than that represented in Fig. 4; is this due to day to day variability? How do the authors justify pairing WT and mutant samples as they did to generate the ratios?

      Response:

      This is a common strategy used by many metabolomics experts (Bailey et al., 2015; Dass et al., 2021; Lunghi et al., 2022). We had originally chosen to represent the data as a ratio since this form of representation helps get rid of the variability that arises between experiments and allows us to see very clear patterns which would otherwise go unnoticed. This variability arises from the amount of lipids in each sample which varies between parasites in a dish, the batch of FBS and DMEM used, and the solutions and even room temperature used to extract lipids on a given day.

      However, we agree with the reviewer that depicting the data in Figure 4A-C as a ratio of ∆CDPK3/WT parasites can be confusing, so we have now changed the graphs, plotting WT and ∆CDPK3 levels instead, and have moved the ratio of ∆CDPK3/WT to the Supplementary Figure 5.

      The significance test seems to be performed on the difference between the WT and ∆cdpk3 strains, but not relative to the DMSO treatment? Wouldn't you want to perform a repeated measures ANOVA to determine (i) if lipid levels change over time and (ii) if this trend differs in WT vs. mutant strain?

      Response:

      The reviewer correctly points out that ANOVA is often used for time courses, but we must point out that it is not always strictly appropriate since it can overlook the purpose of the individual experiment design, which in this case is, 1) to investigate the role of CDPK3 compared to the WT parental strain, and 2) specifically to find the exact point at which the DAG begins to change after stimulus to match the proteomics time course.

      Our data is clearly biassed towards earlier time points where we have 0, 5, 10, 30, 45 seconds where DAG levels are mostly unchanged compared to the single timepoint 60 seconds which shows a significant difference in DAG using our method of statistical comparison by paired two tailed t-test. Therefore, it would be unwise to use ANOVA when we really want to see when the A23187 stimulus takes effect, which appears to be after the 45 second mark. Therefore, analysing the data by ANOVA would likely provide a false negative result, where the result is non-significant but there is clearly more DAG in WT than CDPK3 after 60 seconds. T-tests are commonly used when comparing the same cell lines grown in the same conditions with a test/treatment, and in this case the test/treatment is CPDK3 present or absent (Lentini et al., 2020).

      In the main text, it would be preferable to see the data presented as the proteomics experiments were in Figure 4B and 4C, with fold changes relative to the DMSO (t = 0) treatment, separately for WT and ∆cdpk3 parasites.

      Response:

      We have now changed the way that we represent the data, plotting %mol instead of the ratio.

      Signaling lipids constitute small percentages of the overall pool (e.g. PMID: 26962945), so one might not necessarily expect to observe large changes in lipid abundance when signaling pathways are modulated. Is there any positive control that the authors could include to give readers a sense of the dynamic range? Maybe the DGK1 mutant (PMID: 26962945)?

      Response:

      DGK1 is maybe not a good example because the DGK1 KO parasites effectively “melt” from a lack of plasma membrane integrity ((Bullen et al., 2016), so this would likely be technically challenging. We don’t see the added value in including an additional mutant control since we can already see the dynamic change over time from no difference (0 seconds) to significant difference (60 seconds) between WT and CDPK3 for DAG and most other lipids. We already see a significant difference between WT and CDPK3 after 60 seconds for DAG, and we can clearly see in sub-minute timecourses the changes or not at the specific points where the A23187 is added (0-5 seconds), the parasites acclimatise, for the A23187 to take effect (10-30 seconds) and for the parasite lipid response to be visible by lipidomics (45-60 +seconds).

      Figure 4E: are the differences in [cAMP] with DMSO treatment and A23187 treatment different at any of the timepoints in the WT strain? The comparison seems to be WT/∆cdpk3 at each timepoint. Does the text (lines 562-568) need to be modified accordingly?

      Response:

      In WT (and ∆CDPK3) parasites, [cAMP] is significantly changed at 5s of A23187 treatment (relative to DMSO). We have modified our figures to include this analysis. The existing text accurately reflects this.

      Figure 6I: is the difference between PDE2 cKO/∆cdpk3 + DMSO or RAP significant?

      Response

      In our original manuscript, there was no statistical difference in [cAMP] between PDE2cKO/∆CDPK3+DMSO and PDE2cKO/∆CDPK3+DMSO+RAP, likely due to the variation between biological replicates. To overcome the issues in variability between replicates, we have now included more biological replicates (n=7). This has led to a significant difference in [cAMP] between PDE2cKO/∆CDPK3 DMSO- and RAP-treated parasites and between PDE2cKO DMSO- and RAP-treated parasites (now Fig. 6I).

      **MINOR COMMENTS**

      1.The following references should be added or amended:

      Lines 83-85: in the cited publication, relative phosphopeptide abundances of an overexpressed dominant-negative, constitutively inactive PKA mutant were compared to an overexpressed wild-type mutant. In this experimental setup, one would hypothesize that targets of PKA should be down-regulated (inactive/WT ratios). However, the mentioned phosphopeptide of PDE2 was found to be up-regulated, suggesting that it is not a direct target of PKA.

      Response:

      We thank the reviewer for spotting this error, we have now modified our wording.

      Cite TGGT1_305050, referenced as calmodulin in line 458, as TgELC2 (PMID: 26374117).

      Response:

      We have included this as per the reviewer’s suggestion.

      Cite TGGT1_295850 as apical annuli protein 2 (AAP2, PMID: 31470470).

      Response:

      We have included this as per the reviewer’s suggestion.

      Cite TGGT1_270865 (adenylyl cyclase beta, Acβ) as PMID: 29030485, 30449726.

      Response:

      We have included this as per the reviewer’s suggestion.

      Cite TGGT1_254370 (guanylyl cyclase, GC) as PMID: 30449726, 30742070.

      Response:

      We have included this as per the reviewer’s suggestion.

      Note that Lourido, Tang and David Sibley, 2012 observed that treatment with zaprinast (a PDE inhibitor) could overcome CDPK3 inhibition. The target(s) of zaprinast have not been determined and may differ from those of BIPPO (in identity and IC50). The cited study also used modified CDPK3 and CDPK1 alleles, rather than ∆cdpk3 and intact cdpk1 as used in this manuscript. That is to say, the signaling backgrounds of the parasite strains deviate in ways that are not controlled.

      Response:

      While it is true that zaprinast targets have not been unequivocally identified, zaprinast-induced egress is widely thought to be the result of PKG activation, a conclusion that is further supported by the finding that Compound 1 completely blocks zaprinast-induced egress (Lourido, Tang and David Sibley, 2012). Similarly, BIPPO-induced egress is inhibited by chemical inhibition of PKG by Compound 1 and Compound 2 (Jia et al., 2017). Moreover, like zaprinast, BIPPO has been clearly shown to partially overcome the ∆CDPK3 egress delay (Stewart et al., 2017).

      2.The following comments refer to the figures and legends:

      Part of the legend text for 1G is included under 1H.

      Response:

      This has been corrected

      Figure 1H: The legend mentions that some dots are blue, but they appear green. Please ensure that color choices conform to journal accessibility guidelines. See the following article about visualization for colorblind readers: https://www.ascb.org/science-news/how-to-make-scientific-figures-accessible-to-readers-with-color-blindness____/ . Avoid using red and green false-colored images; replace red with a magenta lookup table. Multi-colored images are only helpful for the merged image; otherwise, we discern grayscale better. Applies to Figures 1B, 5C, 6D. (Aside: anti-CAP seems an odd choice of counterstain; the variation in the staining, esp. at the apical cap, is distracting.)

      Response:

      We thank reviewer #1 for bringing this to our attention, and have modified our colour usage for all IFAs and Figures 1H and 3E.

      We chose CAP staining as the antibody is available in the laboratory and stains both the apical end (which has been shown to contain several proteins important for signalling as well as PDE9) and the parasite periphery, the location of CDPK3.

      Figure 1B: When showing a single fluorophore, please use grayscale and include an intensity scale bar, since relative values are being compared.

      Response:

      We have modified this as per the reviewer’s suggestion

      Figure 1C: it is difficult to compare the kinetics of the calcium response when the curves are plotted separately. Since the scales are the same, could the two treatments be plotted on the same axes, with different colors? Additionally, according to the legend, a red line seems to be missing in this panel.

      Response:

      Fig1C is not intended to compare kinetics, merely to show peak calcium release in each separate treatment condition. We have removed mention of a red line in the figure legend.

      Figure 2A: Either Figure S4 can be moved to accompany Figure 2A, or Figure 2A could be moved to the supplemental.

      Figure S4 has now been incorporated into Figure 2.

      Reviewer #1 (Significance (Required)):

      This manuscript would interest researchers studying signaling pathways in protozoan parasites, especially apicomplexans, as CDPK3 and PKG orthologs exist across the phylum. To my knowledge, it is the first study that has proposed a mechanism by which a calcium effector regulates cAMP levels in T. gondii. Unfortunately, the experiments fall short of testing this mechanism.

      Response:

      We thank reviewer #1 for their comments, but disagree with their assessment that the key points of the manuscript “fall short of experimental testing”.

      1. We demonstrate that, following both BIPPO and A23187 treatment, there is differential phosphorylation of numerous components traditionally believed to sit upstream of PKG activation (as well as several components within the PKG signalling pathway itself).
      2. We show that some of these sites are CDPK3 dependent, and that deletion of CDPK3 leads to changes in lipid signalling and an elevation in levels of cAMP (dysregulation of which is known to alter PKG signalling).
      3. We show that pre-treatment with a PKA inhibitor is able to largely rescue this phenotype.
      4. We demonstrate that a cAMP-specific PDE is phosphorylated following A23187 treatment (i.e. Ca2+ flux)
      5. We show that this cAMP specific PDE plays a role in A23187-mediated egress.
      6. While the latter PDE may not be directly regulated by CDPK3, these findings suggest that there are likely several Ca2+-dependent kinases that contribute to this feedback loop.

        Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).

      In this manuscript, Dominicus et al investigate the elusive role of calcium-dependent kinase 3 during the egress of Toxoplasma gondii. Multiple functions have already been proposed for this kinase by this group including the regulation of basal calcium levels (24945436) or of a tyrosine transporter (30402958). However, one of the most puzzling phenotypes of CDPK3 deficient tachyzoites is a marked delay in egress when parasites are stimulated with a calcium ionophore that is rescued with phosphodiesterase (PDE) inhibitors. Crosstalk between, cAMP, cGMP, lipid and calcium signalling has been previously described to be important in regulating egress (26933036, 23149386, 29030485) but the role of CDPK3 in Toxoplasma is still poorly understood.

      Here the authors first take an elegant phosphoproteomic approach to identify pathways differentially regulated upon treatment with either a PDE inhibitor (BIPPO) and a calcium ionophore (A23187) in WT and CDPK3-KO parasites. Not much difference is observed between BIPPO or A23187 stimulation which is interpreted by the authors as a regulation through a feed-back loop.

      The authors then investigate the effect of CDPK3 deletion on lipid, cGMP and cAMP levels. The identify major changes in DAG, phospholipid, FFAs, and TAG levels as well as differences in cAMP levels but not for cGMP. Chemical inhibition of PKA leads to a similar egress timing in CDPK3-KO and WT parasites upon A23187 stimulation.

      As four PDEs appeared differentially regulated in the CDPK3-KO line upon A23187, the authors investigate the requirement of the 4 PDEs in cAMP levels. They show diverse localisation of the PDEs with specificities of PDE1, 7 and 9 for cGMP and of PDE2 for cAMP. They further show that PDE1, 7 and 9 are sensitive to BIPPO. Finally, using a conditional deletion system, they show that PDE1 and 2 are important for the lytic cycle of Toxoplasma and that PDE2 shows a slightly delayed egress following A23187 stimulation.

      **Major comments:**

      -Are the key conclusions convincing?

      The title is supported by the findings presented in this study. However I am not sure to understand why the authors imply a positive feed back loop. This should be clarified in the discussion of the results.

      Response:

      We believe in a positive feedback loop as, upon A23187 treatment (resulting in a calcium flux), ΔCDPK3 parasites are able to egress, albeit in a delayed manner. This egress delay is substantially, but not completely, alleviated upon treatment with BIPPO (a PDE inhibitor known to activate the PKG signalling pathway). In conjunction with our phosphoproteomic data (where we see phosphorylation of numerous pathway components upstream of PKG upon BIPPO and A23187 treatment - both in a CDPK3 dependent and independent manner), these observations suggest that calcium-regulated proteins (CDPK3 among them) feed into the PKG pathway. As deletion of CDPK3 delays egress, it is reasonable to postulate that this feedback is one that amplifies egress signalling (i.e. is positive).

      The phosphoproteome analysis seems very strong and will be of interest for many groups working on egress. However, the key conclusion, i.e. that a substrate overlaps between PKG and CDPK3 is unlikely to explain the CDPK3 phenotype, seems premature to me in the absence of robustly identified substrates for both kinases.

      Response:

      We certainly do not fully exclude the possibility of a substrate overlap but do lean more heavily towards a feedback loop given (a) the inability to clearly detect treatment-specific signalling profiles and (b) the phospho targets observed in the A23187 and BIPPO phosphoproteomes. We have further clarified our reasoning, and overall tempered our language in the manuscript as per the reviewer’s suggestion.

      I am not sure there is a clear key conclusion from the lipidomic analysis and how it is used by the authors to build their model up. Major changes are observed but how could this be linked with CDPK3, particularly if cGMP levels are not affected?

      Response:

      Our phosphoproteomic analyses identify several CDPK3-dependent phospho sites on phospholipid signalling components (DGK1 & PI-PLC), suggesting that there is indeed altered signalling downstream of PKG. To test whether these lead to a measurable phenotype, we performed the lipidomics analysis. We did not pursue this arm of the signalling pathway any further as we postulated that the changes in the lipid signalling pathway were less likely to play a role in the feedback loop. Nevertheless, we felt that it was worthwhile to include these findings in our manuscript as they support the conclusions drawn from the phosphoproteomics - namely that lipid signalling is perturbed in CDPK3 mutants. We, or others, may follow up on this in future.

      We agree with the reviewer that it is surprising that cGMP levels remain unchanged in our experiments when we treat with A23187. Given the measurable difference in cAMP levels between WT and ΔCDPK3 parasites, we postulate that CDPK3 directly or indirectly downregulates levels of cAMP. This would, in turn, alter activity of the cAMP-dependent protein kinase PKAc. Jia et al. (2017) have shown a clear dependency on PKG for parasites to egress upon PKAc depletion, but were also unable to reliably demonstrate cGMP accumulation in intracellular parasites. Similarly, their hypothesis that dysregulated cGMP-specific PDE activity results in altered cGMP levels has not been proven (the PDE hypothesised to be involved has since been shown to be cAMP-specific).

      While it is possible that our collective inability to observe elevated cGMP levels is explained by the sensitivity limits of the assay, it is similarly possible that cAMP-mediated signalling is exerting its effects on the PKG signalling pathway in a cGMP-independent manner.

      The evidence that CDPK3 is involved in cAMP homeostasis seems strong. However, the analysis of PKA inhibition is a bit less clear. The way the data is presented makes it difficult to see whether the treatment is accelerating egress of CDPK3-KO parasites or affecting both WT and CDPK3-KO lines, including both the speed and extent of egress. This is important for the interpretation of the experiment.

      Response:

      Fig. 4F shows that there is a significant amount of premature egress in both WT and ∆CDPK3 parasites following 2 hrs of H89 pre-treatment (consistent with previous reports that downregulation of cAMP signalling stimulates premature egress). When we subsequently investigated A23187-induced egress rates of the remaining intracellular H89 pre-treated parasites (Fig. 4Gi-ii) we found that the ∆CDPK3 egress delay was largely rescued. We have moved Fig. 4F to the supplement (now Supp Fig. 5E) in order to avoid confusion between the distinct analyses shown in 4F (pre-treatment analyses) and 4G (egress experiment). These experiments provided a hint that cAMP signalling is affected, which we then validate by measuring elevated cAMP levels in CDPK3 mutant parasites.

      The biochemical characterisation of the four PDE is interesting and seems well performed. However, PDE1 was previously shown to hydrolyse both cAMP and cGMP (____https://doi.org/10.1101/2021.09.21.461320____) which raises some questions about the experimental set up. Could the authors possibly discuss why they do not observe similar selectivity? Could other PDEs in the immunoprecipitate mask PDE activity? In line with this question, it is not clear what % of "hydrolytic activity (%)" means and how it was calculated.

      The experiments describing the selectivity of BIPPO for PDE1, 7 and 9 as well as the biological requirement of the four tested PDEs are convincing.

      Response:

      We believe that the disagreement between our findings and those published by Moss and colleagues are due to the differences in experimental conditions. We performed our assays at room temperature for 1 hour with higher starting cAMP concentrations (1 uM) compared to them. They performed their assays at 37ºC for 2 hours with 10-fold lower starting cAMP concentrations (0.1 uM). We have now repeated this set of experiments using the Moss et al. conditions, and find that PDEs 1, 7 and 9 can be dual specific, while PDE2 is cAMP-specific, thereby recapitulating their findings (Now included in the revised manuscript under Supp Fig. 7B). However, we also now performed a timecourse PDE assay using our original conditions and show that the cAMP hydrolytic activity for PDE1 can only be detected following 4 hours of incubation, compared to cGMP activity that can be detected as early as 30 minutes, suggesting that it possesses predominantly cGMP activity (See Supp Fig. 7C). We therefore believe that our experimental setup is more stringent, because if one starts with a lower level of substrate and incubates for longer and at a higher temperature, even minor dual activity could make a substantial difference in cAMP levels. Our data suggests that the cAMP hydrolytic activity of PDEs 1, 7 and 9 is substantially lower than the cGMP hydrolytic activity that they display.

      We have also included a clear description of how % hydrolytic activity was calculated in the methods section.

      -Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      The claim that CDPK3 affects cAMP levels seems strong however the exact links between CDPK3 activity, lipid, cGMP and cAMP signalling remain unclear and it may be important to clearly state this.

      Response:

      We have modified our wording in the text to more clearly describe our current hypothesis and reasoning.

      -Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      I think that the manuscript contains a significant amount of experiments that are of interest to scientists working on Toxoplasma egress. Requesting experiments to identify the functional link between above-mentioned pathways would be out of the scope for this work although it would considerably increase the impact of this manuscript. For example, would it be possible to test whether the CDPK3-KO line is more or less sensitive to PKG specific inhibition upon A23187 induced?

      -Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      The above-mentioned experiment is not trivial as no specific inhibitors of PKG are available. Ensuring for specificity of the investigated phenotype would require the generation of a resistant line which would require significant work.

      __Response: __We agree that this would be an interesting experiment to further substantiate our findings. As indicated by the reviewer, however, the lack of specific inhibitors of PKG means a resistant line would likely be required to ensure specificity.

      -Are the data and the methods presented in such a way that they can be reproduced?

      It is not clear how the % of hydrolytic activity of the PDE has been calculated.

      Response: We have included a clearer description of how % hydrolytic activity was calculated in the methods section.

      -Are the experiments adequately replicated and statistical analysis adequate?

      This seems to be performed to high standards.

      **Minor comments:**

      -Specific experimental issues that are easily addressable.

      I do not have any comments related to minor experimental issues.

      -Are prior studies referenced appropriately?

      Most of the studies relevant for this work are cited. It is however not clear to me why some important players of the "PKG pathway" are not indicated in Fig 1H and Fig 3E, including for example UGO or SPARK.

      Response:

      We have modified Fig 1H and 3E to include all key players involved in the PKG pathway.

      -Are the text and figures clear and accurate?

      While all the data shown here is impressive and well analysed, I find it difficult to read the manuscript and establish links between sections of the papers. The phosphoproteome analysis is interesting and is used to orientate the reader towards a feedback mechanism rather than a substrate overlap. But why do the authors later focus on PDEs and not on AC or CNBD, as in the end, if I understand well, there is no evidence showing a link between CDPK3-dependent phosphorylation and PDE activity upon A23187 stimulation?

      Response:

      We thank reviewer#2 and appreciate their constructive feedback re the flow of the manuscript.

      Our key findings from the phosphoproteomics study were that 1) BIPPO and A23187 treatment trigger near identical signalling pathways, 2) that both A23187 and BIPPO treatment leads to phosphorylation of numerous components both upstream and downstream of PKG signalling (hinting at the presence of an Ca2+-regulated feedback loop) and 3) several of the abovementioned components are phosphorylated in a CDPK3 dependent manner.

      While several avenues of study could have been pursued from this point onwards, we chose to focus on the feedback loop in a broader sense as its existence has important implications for our general understanding of the signalling pathways that govern egress.

      We reasoned that, given the differential phosphorylation of 4 PDEs following A23187 and BIPPO treatment (none of which had been studied in detail previously), it was relevant to study these in greater detail.

      Coupled with the A23187 egress assay on PDE2 knockout parasites - our findings suggest that PDE2 plays a role in the abovementioned Ca2+ signalling loop. While PDE2 may not exert its effects in a CDPK3-dependent manner (and CDPK3 may, therefore, alter cAMP levels in a different fashion), this does not detract from the important finding that PDE2 is one of the (likely numerous) components that is regulated in a Ca2+-dependent feedback loop to facilitate rapid egress.

      We have modified our wording to better reflect our rationale for studying the PDEs irrespective of their CDPK3 phosphorylation status.

      While we feel that our reasoning for studying the PDEs is solid, we do appreciate that further clarification on the putative CDPK3-Adenylate cyclase link would elevate the manuscript substantially. However, given the data that the ACb is not playing a sole role in the control of egress, this is likely a non-trivial task and requires substantial work.

      It is also unclear how the authors link CDPK3-dependent elevated cAMP levels with the elevated basal calcium levels they previously described. This is particularly difficult to reconcile particularly in a PKG independent manner.

      Response:

      We previously postulated that elevated Ca2+ levels allowed ΔCDPK3 mutants to overcome a complete egress defect, potentially by activating other CDPKs (e.g. CDPK1). It is similarly plausible that elevated Ca2+ levels in ΔCDPK3 parasites may lead to elevated cAMP levels in order to prevent premature egress.

      As noted in our previous responses, we acknowledge that our inability to detect cGMP is surprising. However, given the clarity of our cAMP findings, and the phosphoproteomic evidence to suggest that various components in the PKG signalling pathway are affected, we postulate that we are either unable to reliably detect cGMP due to sensitivity issues, or that cAMP is exerting its regulation on the PKG pathway in a cGMP-independent manner. As noted previously, while the link between cAMP and PKG signalling has been demonstrated by Jia et al., it is not entirely clear how this is mediated.

      The presentation of the lipidomic analysis is also not really clear to me. Why do the authors show the global changes in phospholipids and not a more detailed analysis?

      Response:

      We performed a detailed phospholipid profile of WT and ∆CDPK3 parasites under normal culture conditions. However, due to the sheer quantity of parasites required for this detailed analysis, we were unable to measure individual phospholipid species in our A23187 timecourse. We therefore opted to measure global changes following A23187 stimulation.

      As the authors focus on the PI-PLC pathway, could they detail the dynamics of phosphoinositides? I understand that lipid levels are affected in the mutant but I am not sure to understand how the authors interpret these massive changes in relationship with the function of CDPK3 and the observed phenotypes.

      Response:

      Our phosphoproteomic analyses identified several CDPK3-dependent phospho sites on phospholipid signalling components (DGK1 & PI-PLC), suggesting that (in keeping with all of our other data), there is altered signalling downstream of PKG. To test whether these changes lead to a measurable phenotype, we performed the lipidomics analysis. Following stimulation with A23187, we found a delayed production of DAG in ∆CDPK3 parasites compared to WT parasites. Since DAG is required for the production of PA, which in turn is required for microneme secretion, our finding can explain why microneme secretion is delayed in ∆CDPK3 parasites, as previously reported (Lourido, Tang and David Sibley, 2012; McCoy et al., 2012).

      We did not follow this arm of the signalling pathway any further as we postulated that the changes in the lipid signalling pathway were less likely to play a role in the feedback loop. Nevertheless, we felt that it was worthwhile to include these findings in our manuscript as they support the conclusions drawn from the phosphoproteomics - namely that lipid signalling is perturbed in CDPK3 mutants. We, or others, may follow up on this in future.

      Finally, the characterisation of the PDEs is an impressive piece of work but the functional link with CDPK3 is relatively unclear. It would also be important to clearly discuss the differences with previous results presented in this this preprint: https://doi.org/10.1101/2021.09.21.461320____.

      My understanding is while the authors aim at investigating the role of CDPK3 in A23187 induced egress, the main finding related to CDPK3 is a defect in cAMP homeostasis that is not linked to A23187. Similarly, the requirements of PDE2 in cAMP homeostasis and egress is indirectly linked to CDPK3. Altogether I think that important results are presented here but divided into three main and distinct sections: the phosphoproteomic survey, the lipidomic and cAMP level investigation, and the characterisation of the four PDEs. However, the link between each section is relatively weak and the way the results are presented is somehow misleading or confusing.

      Response:

      As mentioned in a previous response, we chose to study PDEs in greater detail because of our observation that both A23187 and BIPPO treatments lead to their phosphorylation (hinting at the presence of a Ca2+regulated feedback loop). We were particularly intrigued to study the cAMP specific PDE, as CDPK3 KO parasites suggested that cAMP may play a role in the Ca2+ feedback mechanism. As PDE2 may not be directly regulated by CDPK3, Ca2+ appears to exert its feedback effects in numerous ways. We have modified our wording to better reflect our rationale for studying the PDEs irrespective of their CDPK3 phosphorylation status.

      -Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      This is a very long manuscript written for specialists of this signalling pathway and I would suggest the authors to emphasise more the important results and also clearly state where links are still missing. This is obviously a complex pathway and one cannot elucidate it easily in a single manuscript.

      Response:

      We have included an additional summary in our conclusions to better illustrate our findings and clarify any missing links.

      Reviewer #2 (Significance (Required)):

      -Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      This is a technically remarkable paper using a broad range of analyses performed to a high standard.

      -Place the work in the context of the existing literature (provide references, where appropriate).

      The cross-talk between cAMP, cGMP and calcium signalling is well described in Toxoplasma and related parasites. Here the authors show that, in Toxoplasma, CDPK3 is part of this complex signalling network. One of the most important finding within this context is the role of CDPK3 in cAMP homeostasis. With this in mind, I would change the last sentence of the abstract to "In summary we uncover a feedback loop that enhances signalling during egress and links CDPK3 with several signalling pathways together."

      Response:

      In light of feedback received from several reviewers, we have made our wording less CDPK3 centric - as our findings relate in part to CDPK3 and, in a broader sense, to a Ca2+ driven feedback loop.

      The genetic and biochemical analyses of the four PDEs are remarkable and highlight consistencies and inconsistencies with recently published work that would be important to discuss and will be of interest for the field.

      __Response: __We thank reviewer#2 and agree that the PDE findings are of significant importance to the field.

      While I understand the studied signalling pathway is complex, I think it would be important to better describe the current model of the authors. In the discussion, the authors indicate that "the published data is not currently supported by a model that fits most experimental results." I would suggest to clarify this statement and discuss whether their work helps to reunite, correct or improve previous models.

      __Response: __We have expanded on the abovementioned statement to clarify that the presence of a feedback loop is a major pillar of knowledge required for the complete interpretation of existing signalling data.

      Could the authors also speculate about a potential role of PDE/CDPK3 in host cell invasion as cAMP signalling has be shown to be important for this process (30208022 and 29030485)?

      __Response: __Existing literature (Jia et al., 2017) suggests that perturbations to cAMP signalling play a very minor role in invasion since parasites where either ACα or ACβ are deleted show no impairment in invasion levels. We currently do not have substantial data on invasion, and are not sure that pursuing this is valuable given the minor phenotypes observed in other studies.

      -State what audience might be interested in and influenced by the reported findings.

      This paper is of great interest to groups working on the regulation of egress in Toxoplasma gondii and other related apicomplexan pathogens.

      -Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      I am working on the cell biology of apicomplexan parasites.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      Dominicus et al aimed to identify the intersecting components of calcium, cyclic nucleotides (cAMP, cGMP) and lipid signaling through phosphoproteomic, knockout and biochemical assays in an intracellular parasite, Toxoplasma gondii, particularly when its acutely-infectious tachyzoite stage exits the host cells. A series of experimental strategies were applied to identify potential substrates of calcium-dependent protein kinase 3 (CDPK3), which has previously been reported to control the tachyzoite egress. According to earlier studies (PMID: 23226109, 24945436, 5418062, 26544049, 30402958), CDPK3 regulated the parasite exit through multiple phosphorylation events. Here, authors identified differentially-regulated (DR) phosphorylation sites by comparing the parasite samples after treatment with a calcium ionophore (A23178) and a PDE inhibitor (BIPPO), both of which are known to induce artificial egress (induced egress as opposed to natural egress). When the DCDPK3 mutant was treated with A23187, its delayed egress phenotype did not change, whereas BIPPO restored the egress to the level of the parental (termed as WT) strain, probably by activating PKG.

      The gene ontology enrichment of the up-regulated clusters revealed many probable CDPK3-dependent DR sites involved in cyclic nucleotide signaling (PDE1, PDE2, PDE7, PDE9, guanylate and adenylate cyclases, cyclic nucleotide-binding protein or CNBP) as well as lipid signaling (PI-PLC, DGK1). Authors suggest lipid signaling as one of the factors altered in the CDPK3 mutant, albeit lipidomics (PC, PI, PS, PT, PA, PE, SM) showed no significant change in phospholipids. To reveal how the four PDEs indicated above contribute to the cAMP and cGMP-mediated egress, they examined their biological significance by knockout/knockdown and enzyme activity assays. Authors claim that PDE1,7,9 proteins are cGMP-specific while PDE2 is cAMP-specific, and BIPPO treatment can inhibit PDE1-cGMP and PDE7-cGMP, but not PDE9-cGMP. Given the complexity, the manuscript is well structured, and most experiments were carefully designed. Undoubtedly, there is a significant amount of work that underlies this manuscript; however, from a conceptual viewpoint, the manuscript does not offer significant advancement over the current knowledge without functional validation of phosphoproteomics data (see below). A large body of work preceding this manuscript has indicated the crosstalk of cAMP, cGMP, calcium and lipid signaling cascades. This work provides a further refinement of the existing model In a methodical sense, the work uses established assays, some of which require revisiting to reach robust conclusions and avoid misinterpretation. The article is quite interesting from a throughput screening point of view, but it clearly lacks the appropriate endorsement of the hits.The authors accept that identifying the phosphorylation of a protein does not imply a functional role, which is a major drawback as there is no experimental support for any phosphorylation site of the protein identified through phosphoproteomics. In terms of the mechanism, it is not clear whether and how lipid turnover and cAMP-PKA signaling control the egress phenotype (lack of a validated model at the end of this study).

      Response:

      We thank reviewer #3 for their comments, but respectfully disagree with their assessment that the work presented does not advance current knowledge.

      1. We demonstrate that, following both BIPPO and A23187 treatment, there is differential phosphorylation of numerous components traditionally believed to sit upstream of PKG activation (as well as numerous components within the PKG signalling pathway itself). While it may have been inferred from previous studies that A23187 and BIPPO signalling intersect, this has never been unequivocally demonstrated - nor has a feedback loop ever been shown.

      We provide a novel A23187-driven phosphoproteome timecourse that further bolsters the model of a Ca2+-driven feedback loop.

      We show that deletion of CDPK3 leads to a delay in DAG production upon stimulation with A23187.

      We show that some of the abovementioned sites are CDPK3 dependent, and that deletion of CDPK3 leads to elevated levels of cAMP (dysregulation of which is known to alter PKG signalling).

      We show that pre-treatment with a PKA inhibitor is able to largely rescue this phenotype.

      We demonstrate that a cAMP-specific PDE is phosphorylated following A23187 treatment (i.e. Ca2+ flux)

      We show that this cAMP specific PDE plays a role in egress.

      While the latter PDE may not be directly regulated by CDPK3, these findings suggest that there are likely several Ca2+-dependent kinases that contribute to this feedback loop.

      We also firmly disagree with the reviewer’s assertion that without phosphosite characterisation, we have no support for our model. Following treatment with A23187 (and BIPPO), we clearly show broad, systemic changes (both CDPK3 dependent and independent) across signalling pathways previously deemed to sit upstream of calcium flux. Given the vast number of proteins involved in these signalling pathways, and the multitude of differentially regulated phosphosites identified on each of them, it is highly likely that the signalling effects we observe are combinatorial. Accordingly, we believe that mutating individual sites on individual proteins would be a very costly endeavour which is unlikely to substantially advance our understanding of signalling during egress. Moreover, introducing multiple point mutations in a given protein to ablate phosphorylation may lead to protein misfolding and would therefore not be informative. One of the key aims of this study was to assess how egress signalling pathways are interconnected, and we believe we have been able to show strong support for a Ca2+-driven feedback mechanism in which both CDPK3 and PDE2 play a role through the regulation of cAMP.

      While we agree with the reviewer’s statement that a large body of work preceding this manuscript has indicated the crosstalk of cAMP, cGMP, calcium and lipid signalling cascades, a feedback loop has not previously been shown. We believe that this finding is absolutely central to facilitate the complete interpretation of existing signalling data. Furthermore, no previous studies have gone to this level of detail in either proteomics or lipidomics to analyse the calcium signal pathway in any apicomplexan parasite. We argue that the novelty in our manuscript is that it is a carefully orchestrated study that advances our understanding of the signalling network over time with subcellular precision. The kinetics of signalling is not well understood and we believe that our study is likely the first to include both proteomic and lipidomic analyses over a timecourse during the acute lytic cycle stage of the disease. In doing so, we found evidence for a feedback loop that controls the signalling network spatiotemporally, and we characterise elements of this feedback in the same study.

      **Major Comments:**

      Based on the findings reported here there is little doubt that BIPPO and A23187-induced signaling intersect with each other, as very much expected from previous studies. The authors selected the 50s and 15s post-treatment timing of A23187 and BIPPO, respectively for collecting phosphoproteomics samples. At these time points, which were shown to peak cytosolic Ca2+, parasites were still intracellular (Line #171). How did authors make sure to stimulate the entire signaling cascade adequately, particularly when parasites do not egress within the selected time window? There is significant variability between phosphosite intensities of replicates (Line #186), which may also be attributed to insufficient triggers for the egress across independent experiments. This work must be supported by in vitro egress assays with the chosen incubation periods of BIPPO and ionophore treatment (show the induced % egress of tachyzoites in the 50s and 15s).

      Response:

      1. We appreciate that the reviewer acknowledges that our data clearly shows that BIPPO and A23187-induced signalling intersect. While this may have been expected from previous studies, this has not previously been shown - and is therefore valuable to the field. Specifically, the fact that A23187-treatment leads to phosphorylation of targets normally deemed to sit upstream of calcium release is entirely novel and adds a substantial layer of information to our understanding of how these signalling pathways work together.

      Treatments were purposely selected to align pathways to a point where calcium levels peak just prior to calcium reuptake. At these chosen timepoints, we clearly show that overall signalling correlation is very high. We know from our egress assays using identical treatment concentrations (Fig. 2C), that the stimulations used are sufficient to result in complete egress. We are simply comparing signalling pathways at points prior to egress.

      As mentioned in point 2, we show convincingly that the treatments used are sufficient to trigger complete egress. As detailed clearly in the text, we believe that these variations in intensities between replicates are due to slight differences in timing between experiments (this is inevitable given the very rapid progression of signalling, and the difficulty of replicating exact sub-minute treatment timings). We demonstrate that the reporter intensities associated with DR sites correlate well across replicates (Supp Fig. 3C), suggesting that despite some replicate variability, the overall trends across replicates is very much consistent. This allows us to confidently average scores to provide values that are representative of a site’s phosphorylation state at the timepoint of interest.

      The reviewer’s suggestion that we should demonstrate % egress at the 50s and 15s treatment timepoints is obsolete - we state clearly in the text that parasites have not egressed at these timepoints. Our egress assays (Fig. 2C) further support this.

      The authors discuss that CDPK3 controls the cAMP level and PKA through activation of one or more yet-to-be-identified PDEs(s). cAMP could probably also be regulated by an adenylate cyclase, ACbeta that was found to have CDPK3-dependent phosphorylation sites. If CDPK3 is indeed a regulator of cAMP through the activation of PDEs or ACbeta, it would be expected that the deletion of CDPK3 would perturb the cAMP level, resulting in dysregulation of PKAc1 subunit, which in turn would dysregulate cGMP-specific PDEs (PMID: 29030485) and thereby PKG. All these connections need to explain in a more clear manner with experimental support (what is positive and what is negatively regulated by C____DPK3).

      Response:

      1. We do not firmly state that CDPK3 regulates cAMP by phosphorylation of a PDE - this is one of the possibilities addressed. We acknowledge the possibility that this could also be via the adenylate cyclase (see line 792).

      PMID: 29030485 demonstrates clearly a link between cAMP signalling and PKG signalling, but does not demonstrate how this is mediated. The authors postulate that a cGMP-specific PDE is dysregulated given their observation that PDE2 is differentially phosphorylated in a constitutively inactive PKA mutant, however this was not validated experimentally. We and others (Moss et al., 2022), however, demonstrate that PDE2 is cAMP-specific. This suggests that the model built by PMID: 29030485 requires revisiting. We acknowledge clearly in the text that Jia et al. have shown a link between cAMP and PKG signalling, and hypothesise that CDPK3’s modulation of cAMP levels may affect this (this is in keeping with our phosphoproteomic data).

      Moreover, the egress defect is not due to a low influx of calcium in the cytosol because when the ionophore A23187 was added to the CDPK3 mutant, its phenotype was not recovered. Rather, the defect may be due to the low or null activity of PKG that would activate PI4K to generate IP3 and DAG. The latter would be used as a substrate by DGK to generate PA that is involved in the secretion of micronemes and Toxoplasma egress. In this context, authors should evaluate the role of CDPK3 in the secretion of micronemes that is directly related to the egress of the parasite.

      1. We agree with the reviewer on their point about calcium influx, and have already acknowledged in the text that the feedback loop does not control release of Ca2+ from internal stores as disruption of CDPK3 does not lead to a delay in Ca2+

      We agree, and clearly address in the text, that the egress defect could be due to altered PKG/phospholipid pathway signalling.

      (Lourido, Tang and David Sibley, 2012; McCoy et al., 2012) have both previously shown that microneme secretion is regulated by CDPK3. We therefore do not deem it necessary to repeat this experiment, but have made clearer mention of their findings in our writing.

      When the Dcdpk3 mutant with BIPPO treatment was evaluated, it was observed that the parasite recovered the egress phenotype. It is concluded that CDPK3 could probably regulate the activity of cGMP-specific PDEs. CDPK3 could (in)activate them, or it could act on other proteins indirectly regulating the activity of these PDEs. Upon inactivation of PDEs, an increase in the cGMP level would activate PKG, which will, in turn, promote egress. From the data, it is not clear whether any phosphorylation by CDPK3 would activate or inactivate PDEs, and if so, then how (directly or indirectly). To reach unambiguous interpretation, authors should perform additional assays.

      Response:

      As mentioned previously, given the abundance of differentially regulated phosphosites, we do not believe that mutating individual sites on individual proteins is a worthwhile or realistic pursuit.

      We clearly show systematic A23187-mediated phosphorylation of key signalling components in the PKA/PKG/PI-PLC/phospholipid signalling cascade, and demonstrate that several of these are CDPK3-dependent. We demonstrate that CDPK3 alters cAMP levels (and that the ∆CDPK3 egress delay in A23187 treated parasites is largely rescued following pre-treatment with a PKA inhibitor). We similarly demonstrate that A23187 treatment leads to phosphorylation of numerous PDEs, including the cAMP specific PDE2, and show that PDE2 knockout parasites show an egress delay following A23187 treatment. While PDE2 may not be directly regulated by CDPK3 (suggesting other Ca2+ kinases are also involved), these findings collectively demonstrate the existence of a calcium-regulated feedback loop, in which CDPK3 and PDE2 play a role (by regulating cAMP).

      We acknowledge that we have not untangled every element of this feedback loop, and do not believe that it would be realistic to do so in a single study given the number of sites phosphorylated and pathways involved. We do believe, however, that we have shown clearly that the feedback loop exists - this in itself is entirely novel, and of significant importance to the field.

      On a similar note, a possible experiment that can be done to improve the work would be to treat the CDPK3 mutant with BIPPO in conjunction with a calcium chelator (BAPTA-AM) to reveal, which proteins are phosphorylated prior to activation of the calcium-mediated cascades?

      Response:

      We agree that this would be an interesting experiment to carry out but would involve significant work. This could be pursued in another paper or project but is beyond the scope of this work.

      The manuscript claims that PDE1, PDE7, PDE9 are cGMP specific, and BIPPO inhibits only cGMP-specific PDEs. All assays are performed with 1-10 micromolar cAMP and cGMP for 1h. There is no data showing the time, protein and substrate dependence. Given the suboptimal enzyme assays, authors should re-do them as suggested here. (1) Repeat the pulldown assay with a higher number of parasites (50-100 million) and measure the protein concentration. (2) Set up the PDE assay with saturating amount of cAMP and cGMP, which is critical if the PDE1,7,9 have a higher Km Value for cAMP (means lower affinity) compared to cGMP. An adequate amount of substrate and protein allows the reaction to reach the Vmax. Once you have re-determined the substrate specificity (revise Fig 5D), you should retest BIPPO (Fig 5E) in the presence of cAMP and cGMP. It is very likely that you would find the same result as PDE9 and PfPDEβ (BIPPO can inhibit both cAMP and cGMP-specific PDE), as described previously

      We have repeated our assay using the exact same conditions outlined by Moss et al. This involved using a similar number of parasites, a longer incubation time of 2 hours at a higher temperature (37ºC) and with a lower starting concentration of cAMP (0.1 uM). We demonstrate that we are able to recapitulate both the Moss et al. and Vo et al. (see Supp Fig. 7B). However, we noticed that these reactions were not carried out with saturating cAMP/cGMP concentrations, since all reactions had reached 100% completion at the end of the assay whereby all substrate was hydrolysed. We therefore believe that based on our original assay, as well as the new PDE1 timecourse that we have performed (Supp Fig. 7C), that PDEs 1, 7 and 9 display predominantly cGMP hydrolysing activity, with moderate cAMP hydrolysing activity.

      We also repeated the BIPPO inhibition assay using the Moss et al. conditions, and still observe that the cGMP activity of PDE1 is the most potently inhibited of all 4 PDEs. We also see moderate inhibition of the cAMP activities of PDE1 and PDE9, suggesting that cAMP hydrolytic activity can also be inhibited. Interestingly, the cGMP hydrolytic activities of PDEs 7 & 9, which were previously inhibited using our original assay conditions, no longer appear to be inhibited. This is likely due to the longer incubation time, which masks the reduced activities of these two PDEs following treatment with BIPPO.

      The authors did not identify any PKG substrate, which is quite surprising as cAMP signaling itself could impact cGMP. Authors should show if they were able to observe enhanced cGMP levels in BIPPO-treated sample (which is expected to stimulate cGMP-specific PDEs). The author mention their inability to measure cGMP level but have they analyzed cGMP in the positive control (BIPPO-treated parasite line)? Why have they focused only on CDPK3 mutant, whereas in their phosphoproteomic data they could see other CDPKs too? It could be that other CDPK-mediated signaling differs and need PKA/PKG for activation.

      In the title, the authors have mentioned that there is a positive feedback loop between calcium release, cyclic nucleotide and lipid signaling, which is quite an extrapolation as there is no clear experimental data supporting such a positive feedback loop so the author should change the title of the paper.

      Response:

      1. As addressed in our previous response to the reviewer, PMID: 29030485 demonstrates clearly a link between cAMP signalling and PKG signalling, but does not confirm how this is mediated. The authors surmise that a cGMP-specific PDE is dysregulated (although the PDE hypothesised to be involved has since been shown to be cAMP-specific), but are similarly unable to detect changes in cGMP levels. This suggests that their model may be incomplete.

      The BIPPO treatment experiment suggested by the reviewer was already included in the original manuscript (see Fig. 4D in original manuscript, now Fig. 4E). With BIPPO treatment we are able to detect changes in cGMP levels.

      We did not deem it to be within the scope of this study to study every single other CDPK. We chose to study CDPK3, as its egress phenotype was of particular interest given its partial rescue following BIPPO treatment. We reasoned that its study may lead us to identify the signalling pathway that links BIPPO and A23187 induced signalling.

      As addressed in greater detail in our response to reviewer #2, the fact that the feedback loop appears to stimulate egress implies that it is positive.

      **Minor Comments:**

      Materials & Methods

      Explanation of parameters is not clear (Line #360-367). Phosphoproteomics with A23187 (8 micromolar) treatment in CDPK3-KO and WT, for 15, 30 and 60s at 37{degree sign}C incubation with DMSO control. Simultaneously passing the DR and CDPK3 dependency thresholds: CDPK3-dependent phosphorylation

      __Response: __We have modified the wording to make this clearer as per the reviewer’s suggestion.

      Line #368: At which WT-A23187 timepoint did the authors identify 2408 DR-up phosphosites (15s, 30s or 60s)? Or consistently in all? It should be clarified?

      __Response: __As already stated in the manuscript (see line 366 in original manuscript, now line 1047), phosphorylation sites were considered differentially regulated if at any given timepoint their log2FC surpassed the DR threshold.

      A23187 treatment of the CDPK3-KO mutant significantly increased the cAMP levels at 5 sec post-treatment, but BIPPO did not show any change. The authors concluded that BIPPO presumably does not inhibit cAMP-specific PDEs. However, the dual-specific PDEs are known to be inhibited by BIPPO, as shown recently (____https://www.biorxiv.org/content/10.1101/2021.09.21.461320v1____). Authors do confirm that BIPPO-treatment can inhibit hydrolytic activity of PfPDEbeta for cAMP as well as cGMP (Line #612). Besides, it was shown in Fig 5E that BIPPO can partially though not significantly block cAMP-specific PDE2. The statements and data conflict each other under different subtitles and need to be reconciled. Elevation of basal cAMP level in the CDPK3 mutant indicates the perturbation of cAMP signaling, however BIPPO data requires additional supportive experiments to conclude its relation with cAMP or dual-specific PDE.

      Response:

      1. The manuscript to which the reviewer refers does not use BIPPO in any of their experiments. They show that continuous treatment with zaprinast blocks parasite growth in a plaque assay, but do not test whether zaprinast specifically blocks the activity of any of the PDEs.

      Having repeated the PDE assay using the Moss et al. conditions (as outlined above), we are now able to recapitulate their findings, showing that PDEs 1, 7 and 9 can display dual hydrolytic activity while PDE2 is cAMP specific. As explained further above, we believe that our original set of experiments are more stringent than the Moss *et al. * To confirm this, we also performed an additional experiment, incubating PDE1 for varying amounts of time using our original conditions (1 uM cAMP or 10 uM cGMP, at room temperature). This revealed that PDE1 is much more efficient at hydrolysing cGMP, and only begins to display cAMP hydrolysing capacity after 4 hours of incubation.

      We also measured the inhibitory capacity of BIPPO on the PDEs using the Moss *et al. * During the longer incubation time, it seems that BIPPO is unable to inhibit PDEs 7 and 9, while with the more stringent conditions it was able to inhibit both PDEs. We reasoned that since BIPPO is unable to inhibit these PDEs fully, the residual activity over the longer incubation period would compensate for the inhibition, eventually leading to 100% hydrolysis of the cNMPs. We also see that while the cGMP hydrolysing capacity of PDE1 is completely inhibited, its cAMP hydrolysing capacity is only partially inhibited. These findings and the fact that PDE2 is not inhibited by BIPPO are in line with our experiments where we measured [cAMP] and showed that treatment with BIPPO did not lead to alterations in [cAMP].

      The method used to determine the substrate specificity of PDE 1,2,7 and 9 resulted in the hydrolytic activity of PDE2 towards cAMP, while the remaining 3 were determined as cGMP-specific. However, PDE1 and PDE9 have been reported as being dual-specific (Moss et al, 2021; Vo et al, 2020), which questions the reliability of the preferred method to characterize substrate specificity by the authors. It is also suggested to use another ELISA-based kit to double check the results.

      Response:

      As outlined above, we have repeated the assay using the conditions described by Moss et al. (lower starting concentrations of cAMP, 2 hour incubation period at 37ºC) and find that we are able to recapitulate the results of both Moss et al. and Vo et al.. However, using the Moss et al. conditions, the PDEs have hydrolysed 100% of the cyclic nucleotide, suggesting that these conditions are less stringent than the ones we used originally using higher starting concentrations of cAMP and incubating for 1 hour only at room temperature. With enzymatic assays it is always important to perform them at saturating conditions (as already suggested by the reviewer) and therefore we believe that our original conditions are more stringent than the results using the Moss et al. conditions.

      Line #607-608: Authors found PDE9 less sensitive to BIPPO-treatment and concluded PDE2 as refractory to BIPPO inhibition; however, the reduction level of activity seems similar as seen in PDE9-BIPPO treated sample? This strong statement should be replaced with a mild explanation.

      __Response: __We have tempered our wording as per the reviewer’s suggestion

      Figures and legends:

      The introductory model in Fig S1 is difficult to understand and ambiguous despite having it discussed in the text. For example, CDPK1 is placed, but only mentioned at the beginning, and the role of other CDPKs is not clear. In addition, the arrows in IP3 and PKG are confusing. The location of guanylate and adenylate cyclase is wrong, and so on... The figure should include only the egress-related signaling components to curate it. The illustration of host cell in orange color must be at the right side of the figure in connection with the apical pole of the parasite (not on the top). Figure legend should also be rearranged accordingly and citations of the underlying components should be included (see below).

      __Response: __We have modified Supp Fig. 1 as per the suggestions of reviewer#2 and #3. We have now modified the localisations of the proteins and have also removed the lines showing the cross talk between pathways. We have also highlighted to the reader that this is only a model and may not represent the true localisations of the proteins, despite our best efforts.

      In Figure 5D, would you please provide the western blot analysis of samples before and after pulling down to demonstrate the success of your immunoprecipitation assay. Mention the protein concentration in your PDE enzyme assay. Please refer to the M&M comments above to re-do the enzyme assays.

      Response:

      We have now included western blots for the pull downs of PDEs 1, 2, 7 and 9 (Supp Fig. 7A). We chose not to measure protein concentrations of samples since all experiments were performed using the same starting parasite numbers, and we do not see large differences in activities between biological replicates of the PDEs.

      Figure legend 1C: Line #194: There is no red-dotted line shown in graph! Correct it!

      __Response: __We have modified this.

      Figure 4Gi-ii: Shouldn't it be labelled i: H89-treatment and ii: A23178, respectively instead of DMSO and H89? (based on the text Line #579).

      __Response: __Our labelling of Fig. 4Gi-ii is correct as panel i parasites were pre-treated with DMSO, while panel ii parasites were pre-treated with H89. Subsequent egress assays on both parasites were then performed using A23187.

      We have modified the figures to include mention of A23187 on the X axis, and modified the figure legend to clarify pre-treatment was performed with DMSO and H89 respectively.

      Bibliography:

      Line #57 and 58: Citations must be selected properly! Carruthers and Sibley 1999 revealed the impact of Ca2+ on the microneme secretion within the context of host cell attachment and invasion, not egress as indicated in the manuscript! Similar case is also valid for the reference Wiersma et al 2004; since the roles of cyclic nucleotides were suggested for motility and invasion. Also notable in the fact that several citations describing the localization, regulation and physiological importance of cAMP and cGMP signaling mediators (PMID: 30449726 , 31235476 , 30992368 , 32191852 , 25555060 , 29030485 ) are either completely omitted or not appropriately cited in the introduction and discussion sections.

      Response:

      We have modified the citations as per the reviewer’s suggestions. We now cite Endo et al., 1987 for the first use of A23187 as an egress trigger, and Lourido, Tang and David Sibley, 2012 for the role of cGMP signalling in egress. We also cite all the GC papers when we make first mention of the GC. We have also removed the Howard et al., 2015 citation (PMID: 25555060) when referring to the fact that BIPPO/zaprinast can rescue the egress delay of ∆CDPK3 parasites.

      Grammar/Language

      Line #31: After "cAMP levels" use comma

      Response:

      We have modified this.

      36: Sentence is not clear. Does conditional deletion of all four PDEs support their important roles? If so, the role in egress of the parasite?

      Response:

      We have clarified our wording as per the reviewer’s suggestion. We state that PDEs 1 and 2 display an important role in growth since deletion of either these PDEs leads to reduced plaque growth. We have not investigated exactly what stage of the lytic cycle this is.

      40: "is a group involving" instead of "are"

      Response:

      We found no mention of “a group involving” in our original manuscript at line 40 or anywhere else in the manuscript, so we are unsure what the reviewer is referring to.

      108: isn't it "discharge of Ca++ from organelle stores to cytosol"?

      __Response: __We thank the reviewer for spotting this error. We have now modified this sentence.

      120: "was" instead of "were"

      __Response: __Since the situation we are referencing is hypothetical, then ‘were’ is the correct tense.

      Reviewer #3 (Significance (Required)):

      There is a significant amount of work that underlies this manuscript; however, from a conceptual viewpoint, the manuscript does not offer significant advancement over the current knowledge without functional validation of phosphoproteomics data. In terms of the mechanism, it is not clear whether and how lipid turnover and cAMP-PKA signaling control the egress phenotype (lack of a validated model at the end of this study).In a methodical sense, the work uses established assays, some of which require revisiting to reach robust conclusions and avoid misinterpretation.

      Compare to existing published knowledge

      A large body of work preceding this manuscript has indicated the crosstalk of cAMP, cGMP, calcium and lipid signaling cascades. This work provides a further refinement of the existing model. The article is quite interesting from a throughput screening point of view, but it clearly lacks the appropriate endorsement of the hits.

      Response:

      Please refer to our first response to reviewer #3 for our full rebuttal to these points. We respectfully disagree with the assessment that the work presented does not advance current knowledge.

      Audience

      Field specific (Apicomplexan Parasitology)

      Expertise

      Molecular Parasitology

      References

      Bailey, A. P. et al. (2015) ‘Antioxidant Role for Lipid Droplets in a Stem Cell Niche of Drosophila’, Cell. The Authors, 163(2), pp. 340–353. doi: 10.1016/j.cell.2015.09.020.

      Bullen, H. E. et al. (2016) ‘Phosphatidic Acid-Mediated Signaling Regulates Microneme Secretion in Toxoplasma Article Phosphatidic Acid-Mediated Signaling Regulates Microneme Secretion in Toxoplasma’, Cell Host & Microbe, pp. 349–360. doi: 10.1016/j.chom.2016.02.006.

      Dass, S. et al. (2021) ‘Toxoplasma LIPIN is essential in channeling host lipid fluxes through membrane biogenesis and lipid storage’, Nature Communications. Springer US, 12(1). doi: 10.1038/s41467-021-22956-w.

      Endo, T. et al. (1987) ‘Effects of Extracellular Potassium on Acid Release and Motility Initiation in Toxoplasma gondii’, The Journal of Protozoology, 34(3), pp. 291–295. doi: 10.1111/j.1550-7408.1987.tb03177.x.

      Flueck, C. et al. (2019) Phosphodiesterase beta is the master regulator of camp signalling during malaria parasite invasion, PLoS Biology. doi: 10.1371/journal.pbio.3000154.

      Howard, B. L. et al. (2015) ‘Identification of potent phosphodiesterase inhibitors that demonstrate cyclic nucleotide-dependent functions in apicomplexan parasites’, ACS Chemical Biology, 10(4), pp. 1145–1154. doi: 10.1021/cb501004q.

      Jia, Y. et al. (2017) ‘ Crosstalk between PKA and PKG controls pH ‐dependent host cell egress of Toxoplasma gondii ’, The EMBO Journal, 36(21), pp. 3250–3267. doi: 10.15252/embj.201796794.

      Katris, N. J. et al. (2020) ‘Rapid kinetics of lipid second messengers controlled by a cGMP signalling network coordinates apical complex functions in Toxoplasma tachyzoites’, bioRxiv. doi: 10.1101/2020.06.19.160341.

      Lentini, J. M. et al. (2020) ‘DALRD3 encodes a protein mutated in epileptic encephalopathy that targets arginine tRNAs for 3-methylcytosine modification’, Nature Communications. Springer US, 11(1). doi: 10.1038/s41467-020-16321-6.

      Lourido, S., Tang, K. and David Sibley, L. (2012) ‘Distinct signalling pathways control Toxoplasma egress and host-cell invasion’, EMBO Journal. Nature Publishing Group, 31(24), pp. 4524–4534. doi: 10.1038/emboj.2012.299.

      Lunghi, M. et al. (2022) ‘Pantothenate biosynthesis is critical for chronic infection by the neurotropic parasite Toxoplasma gondii’, Nature Communications. Springer US, 13(1). doi: 10.1038/s41467-022-27996-4.

      McCoy, J. M. et al. (2012) ‘TgCDPK3 Regulates Calcium-Dependent Egress of Toxoplasma gondii from Host Cells’, PLoS Pathogens, 8(12). doi: 10.1371/journal.ppat.1003066.

      Moss, W. J. et al. (2022) ‘Functional Analysis of the Expanded Phosphodiesterase Gene Family in Toxoplasma gondii Tachyzoites’, mSphere. American Society for Microbiology, 7(1). doi: 10.1128/msphere.00793-21.

      Stewart, R. J. et al. (2017) ‘Analysis of Ca2+ mediated signaling regulating Toxoplasma infectivity reveals complex relationships between key molecules’, Cellular Microbiology, 19(4). doi: 10.1111/cmi.12685.

      Vo, K. C. et al. (2020) ‘The protozoan parasite Toxoplasma gondii encodes a gamut of phosphodiesterases during its lytic cycle in human cells’, Computational and Structural Biotechnology Journal. The Author(s), 18, pp. 3861–3876. doi: 10.1016/j.csbj.2020.11.024.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).

      In this manuscript, Dominicus et al investigate the elusive role of calcium-dependent kinase 3 during the egress of Toxoplasma gondii. Multiple functions have already been proposed for this kinase by this group including the regulation of basal calcium levels (24945436) or of a tyrosine transporter (30402958). However, one of the most puzzling phenotypes of CDPK3 deficient tachyzoites is a marked delay in egress when parasites are stimulated with a calcium ionophore that is rescued with phosphodiesterase (PDE) inhibitors. Crosstalk between, cAMP, cGMP, lipid and calcium signalling has been previously described to be important in regulating egress (26933036, 23149386, 29030485) but the role of CDPK3 in Toxoplasma is still poorly understood.

      Here the authors first take an elegant phosphoproteomic approach to identify pathways differentially regulated upon treatment with either a PDE inhibitor (BIPPO) and a calcium ionophore (A23187) in WT and CDPK3-KO parasites. Not much difference is observed between BIPPO or A23187 stimulation which is interpreted by the authors as a regulation through a feed-back loop. The authors then investigate the effect of CDPK3 deletion on lipid, cGMP and cAMP levels. The identify major changes in DAG, phospholipid, FFAs, and TAG levels as well as differences in cAMP levels but not for cGMP. Chemical inhibition of PKA leads to a similar egress timing in CDPK3-KO and WT parasites upon A23187 stimulation.

      As four PDEs appeared differentially regulated in the CDPK3-KO line upon A23187, the authors investigate the requirement of the 4 PDEs in cAMP levels. They show diverse localisation of the PDEs with specificities of PDE1, 7 and 9 for cGMP and of PDE2 for cAMP. They further show that PDE1, 7 and 9 are sensitive to BIPPO. Finally, using a conditional deletion system, they show that PDE1 and 2 are important for the lytic cycle of Toxoplasma and that PDE2 shows a slightly delayed egress following A23187 stimulation.

      Major comments:

      -Are the key conclusions convincing?

      The title is supported by the findings presented in this study. However I am not sure to understand why the authors imply a positive feed back loop. This should be clarified in the discussion of the results. The phosphoproteome analysis seems very strong and will be of interest for many groups working on egress. However, the key conclusion, i.e. that a substrate overlaps between PKG and CDPK3 is unlikely to explain the CDPK3 phenotype, seems premature to me in the absence of robustly identified substrates for both kinases.

      I am not sure there is a clear key conclusion from the lipidomic analysis and how it is used by the authors to build their model up. Major changes are observed but how could this be linked with CDPK3, particularly if cGMP levels are not affected?

      The evidence that CDPK3 is involved in cAMP homeostasis seems strong. However, the analysis of PKA inhibition is a bit less clear. The way the data is presented makes it difficult to see whether the treatment is accelerating egress of CDPK3-KO parasites or affecting both WT and CDPK3-KO lines, including both the speed and extent of egress. This is important for the interpretation of the experiment.

      The biochemical characterisation of the four PDE is interesting and seems well performed. However, PDE1 was previously shown to hydrolyse both cAMP and cGMP (https://doi.org/10.1101/2021.09.21.461320) which raises some questions about the experimental set up. Could the authors possibly discuss why they do not observe similar selectivity? Could other PDEs in the immunoprecipitate mask PDE activity? In line with this question, it is not clear what % of "hydrolytic activity (%)" means and how it was calculated. The experiments describing the selectivity of BIPPO for PDE1, 7 and 9 as well as the biological requirement of the four tested PDEs are convincing.

      -Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      The claim that CDPK3 affects cAMP levels seems strong however the exact links between CDPK3 activity, lipid, cGMP and cAMP signalling remain unclear and it may be important to clearly state this.

      -Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      I think that the manuscript contains a significant amount of experiments that are of interest to scientists working on Toxoplasma egress. Requesting experiments to identify the functional link between above-mentioned pathways would be out of the scope for this work although it would considerably increase the impact of this manuscript. For example, would it be possible to test whether the CDPK3-KO line is more or less sensitive to PKG specific inhibition upon A23187 induced?

      -Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      The above-mentioned experiment is not trivial as no specific inhibitors of PKG are available. Ensuring for specificity of the investigated phenotype would require the generation of a resistant line which would require significant work.

      -Are the data and the methods presented in such a way that they can be reproduced?

      It is not clear how the % of hydrolytic activity of the PDE has been calculated.

      -Are the experiments adequately replicated and statistical analysis adequate?

      This seems to be performed to high standards.

      Minor comments:

      -Specific experimental issues that are easily addressable.

      I do not have any comments related to minor experimental issues.

      -Are prior studies referenced appropriately?

      Most of the studies relevant for this work are cited. It is however not clear to me why some important players of the "PKG pathway" are not indicated in Fig 1H and Fig 3E, including for example UGO or SPARK.

      -Are the text and figures clear and accurate?

      While all the data shown here is impressive and well analysed, I find it difficult to read the manuscript and establish links between sections of the papers. The phosphoproteome analysis is interesting and is used to orientate the reader towards a feedback mechanism rather than a substrate overlap. But why do the authors later focus on PDEs and not on AC or CNBD, as in the end, if I understand well, there is no evidence showing a link between CDPK3-dependent phosphorylation and PDE activity upon A23187 stimulation? It is also unclear how the authors link CDPK3-dependent elevated cAMP levels with the elevated basal calcium levels they previously described. This is particularly difficult to reconcile particularly in a PKG independent manner.

      The presentation of the lipidomic analysis is also not really clear to me. Why do the authors show the global changes in phospholipids and not a more detailed analysis? As the authors focus on the PI-PLC pathway, could they detail the dynamics of phosphoinositides? I understand that lipid levels are affected in the mutant but I am not sure to understand how the authors interpret these massive changes in relationship with the function of CDPK3 and the observed phenotypes.

      Finally, the characterisation of the PDEs is an impressive piece of work but the functional link with CDPK3 is relatively unclear. It would also be important to clearly discuss the differences with previous results presented in this this preprint: https://doi.org/10.1101/2021.09.21.461320. My understanding is while the authors aim at investigating the role of CDPK3 in A23187 induced egress, the main finding related to CDPK3 is a defect in cAMP homeostasis that is not linked to A23187. Similarly, the requirements of PDE2 in cAMP homeostasis and egress is indirectly linked to CDPK3. Altogether I think that important results are presented here but divided into three main and distinct sections: the phosphoproteomic survey, the lipidomic and cAMP level investigation, and the characterisation of the four PDEs. However, the link between each section is relatively weak and the way the results are presented is somehow misleading or confusing.

      -Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      This is a very long manuscript written for specialists of this signalling pathway and I would suggest the authors to emphasise more the important results and also clearly state where links are still missing. This is obviously a complex pathway and one cannot elucidate it easily in a single manuscript.

      Significance

      -Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      This is a technically remarkable paper using a broad range of analyses performed to a high standard.

      -Place the work in the context of the existing literature (provide references, where appropriate).

      The cross-talk between cAMP, cGMP and calcium signalling is well described in Toxoplasma and related parasites. Here the authors show that, in Toxoplasma, CDPK3 is part of this complex signalling network. One of the most important finding within this context is the role of CDPK3 in cAMP homeostasis. With this in mind, I would change the last sentence of the abstract to "In summary we uncover a feedback loop that enhances signalling during egress and links CDPK3 with several signalling pathways together."

      The genetic and biochemical analyses of the four PDEs are remarkable and highlight consistencies and inconsistencies with recently published work that would be important to discuss and will be of interest for the field.

      While I understand the studied signalling pathway is complex, I think it would be important to better describe the current model of the authors. In the discussion, the authors indicate that "the published data is not currently supported by a model that fits most experimental results." I would suggest to clarify this statement and discuss whether their work helps to reunite, correct or improve previous models.

      Could the authors also speculate about a potential role of PDE/CDPK3 in host cell invasion as cAMP signalling has be shown to be important for this process (30208022 and 29030485)?

      -State what audience might be interested in and influenced by the reported findings.

      This paper is of great interest to groups working on the regulation of egress in Toxoplasma gondii and other related apicomplexan pathogens.

      -Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      I am working on the cell biology of apicomplexan parasites.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01384

      Corresponding author(s): Mary O’Riordan and Basel Abuaita

      1. General Statements [optional]

      We appreciated the positive feedback and helpful suggestions from the reviewers that pointed to a need for more clarity regarding the central focus of the study. Our goal was to take an unbiased approach to evaluating the role of neutrophils during S. Typhimurium (STM) infection of human intestinal epithelial cells (IEC), using human intestinal organoids as a model. An abundance of data point to important inflammatory roles for neutrophils during STM infection of human intestine but the critical mechanisms involved have not been fully elucidated. New data now included in the revised manuscript provide strong support for human PMN-derived IL1-beta as a driver of epithelial cell shedding in STM-infected HIOs, consistent with known differences in local inflammation between human and mouse infection, and this is the focus of the current study. Our data did not support a significant role for human neutrophils in controlling luminal bacterial numbers, but instead the primary human PMNs robustly stimulated epithelial cell responses that led to decreased intraepithelial bacteria. Several recent studies have suggested that caspase-1 is not a critical inflammasome component during STM infection of IEC, which instead use non-canonical inflammasomes, including caspases-4 and -11. Our data point to a human neutrophil-intrinsic function for caspase-1 and IL1-beta that contributes to the inflammatory tone of the intestinal milieu early in STM infection.

      2. Point-by-point description of the revisions

      Reviewer #1

      Major comments:

      Some important links are missing to fully support the mechanistic model proposed:* *

      1- PMN activity

      The authors may strengthen their evidence of PMN activities presented in lines 135 to 143 and in Fig.S2 and S3. In particular, the authors claim that PMNs form NETs in PMN-HIOs but the evidence displayed are limited. In fact, Fig S2 shows the same condition and same staining as Fig 1B but the MPO-positive structures are different. Clarification in the text or the figure would be welcome. Besides, as the authors insist on the relevance of NETs in the discussion, it seems that a clear demonstration and characterization of these structures in the PMN-HIO model would highly benefit the manuscript.

      While we commented on NETs in our original manuscript, our conclusions do not rely on the presence or absence of NETs. We have therefore removed the NET data and the reference to NETs. While NETs are potentially interesting in the context of intestinal infection, we understand the reviewer's concern about NETs and anticipate that a more quantitative characterization of NETs may be challenging given the structure and variability of the PMN-HIOs.

      Regarding the analyses of the culture supernatants (Fig.S3), only 3 out of the 5 displayed datasets are commented on in the text. The data obtained for BD2 and N-Gal should be either commented or removed from the figure. The author further suggests that Elafin expression in presence of PMN may restrict PMNs' ability to kill Salmonella. Repeating the experiment displayed in Fig S1 in the presence of Elafin as well as in the presence of the supernatant extracted from HIOs and PMN-HIOs would clarify the potential inhibition of PMN killing capacity in the PMN-HIO model.

      We now include a sentence on the antimicrobials BD2 and N-GAL to the text (line 135-136). Elafin is one of many molecules that could potentially affect the ability of PMNs to kill Salmonella. We repeated the experiments in S3 Fig with recombinant Elafin. There was a very weak effect on killing in the presence of Elafin, however Elafin can also kill Salmonella directly, complicating interpretation of these experiments. We have now added a sentence in the Discussion to speculate that Elafin is one example of how the epithelium may inhibit the ability of PMNs to kill (line 366-372). These data are not central to our main conclusions and are only intended to provide context to the reader about possible explanations for why PMNs can kill Salmonella directly, but do not significantly alter total bacterial numbers in the HIO model.

      The author proposed that infected and uninfected cells are extracted from the epithelium due to PMN activation, suggesting that Salmonella infection of epithelial cells is only indirectly involved in cell shedding. This is an interesting hypothesis that could be tested by measuring cell shedding in a non-infected but PMN-activated (for instance with PMA) PMN-HIO model. This would clarify further the role of PMN in controlling epithelial response to the infection.

      We tested this possibility by microinjecting LPS into the lumen of PMN-HIOs (S6 Fig). There was significantly less TUNEL+ signal in LPS-injected PMN-HIOs compared to STM-infected PMN-HIOs, suggesting that active Salmonella infection is required for shedding of both infected and uninfected cells in the presence of PMNs__. __

      2- Specificity of RNA-seq profiling:

      The authors analyzed the transcriptomic profiling of PMN-HIOs and HIOs infected or not. While these experiments bring to light an interesting difference in inflammasome/cell death transcriptomic programs at the scale of the co-culture model, it is not possible to conclude from which cell type these transcriptomic shifts emerge. To clarify this, the authors stain the co-culture for ASC and observe that ASC-positive cells are PMNs. They conclude that PMNs are most likely the primary site of caspase-1 dependent production of IL1. While their model is theoretically consistent, more direct proofs are necessary to conclude on the cell-type specific transcriptomic program during infection of PMN-HIO and could be obtained by FACS sorting of the cells prior to RNA-seq, for instance using MPO to detect PMNs and E-cadherin to detect epithelial cells.

      We now provide evidence that pretreating PMNs with an irreversible Caspase-1 inhibitor before co-culturing with STM-infected HIOs prevented accumulation of luminal TUNEL+ cells (Fig 6B,C). Additionally, IL-1β treatment in the absence of PMNs recapitulated the cell death phenotype of the infected PMN-HIOs (Fig 6D,E) suggesting Caspase-1 activity in PMNs and IL-1β production are necessary for epithelial cell death in the PMN-HIOs.

      3- Roles of cytokine

      After showing an increased expression/release of IL1 and IL1RA in infected PMN-HIOs, the authors move on to testing the role of caspases on cell shedding. Yet, they do not test the impact of IL1 and IL1RA on cell shedding. As, according to their proposed model, IL1 is acting upstream of caspase-1 to promote cell shedding, testing cell shedding in infected PMN-HIOs in the presence of an IL1 inhibitor would clarify that link. The author also proposed that the decrease of IL33 in PMN-HIOs compared to HIOs could be due to PMN processing, which would give an additional role to PMNs in controlling the epithelial response to infection. In the context of this manuscript, it would be highly relevant to test this hypothesis by measuring the rate of cleaved IL-33.

      We now provide data to address these questions about IL-1 signaling. HIOs were microinjected with recombinant IL-1β during STM infection and PMN-HIOs were also treated with IL1RA during STM infection. Cell shedding was measured under these conditions in Fig. 6D-F. Cell shedding was dependent on IL-1 signaling and the model has been updated to reflect this.

      We also concentrated supernatants from STM-infected HIOs and PMN-HIOs, probed for cleaved IL33 via western blot and did see some cleavage. However, without being able to block this process it is not possible to conclude what role cleaved IL33 has during infection in the PMN-HIO and IL-1β seems to be sufficient to drive the cell shedding phenotype. Since the status of IL33 is not central to our conclusions, we have removed these data from the manuscript.

      4- Roles of caspase

      The interpretations of the role of Caspases to restrict bacteria burden are unclear and should be revised (see also minor comment). It appears that both Caspase-1 and Caspase-3 are necessary for efficient cell shedding (Fig4B), Caspase-1 (but not Caspase-3) decreases intraluminal bacteria burden (Fig4C) and Caspase-3 (but not Caspase-1) decreases epithelium-associated bacteria (Fig4D). To reconcile these observations with the hypothesis that cell shedding is responsible for the decrease of intraluminal and epithelium-associated bacterial burden, one may propose that caspase-3 (but not caspase-1) induces cell shedding of mainly non-infected cells (possibly bacteria-associated) and caspase-1 (but not caspase-3) induced cell shedding of infected cells. This could be tested by measuring the % of infected extruded cells upon caspase inhibitor treatments. In addition, these data don't allow to propose that Caspase-3 activation happens downstream of Caspase-1 as suggested by the authors in their abstract figure.

      It is difficult to accurately quantify the percent infected cells that are extruded since both infected and uninfected cells are extruded into a luminal space full of bacteria, which may associate with uninfected cells post-extrusion. However, we did observe cells positive for cleaved Caspase-3 when HIOs were treated with IL-1β leading us to infer that Caspase-1 mediated cytokine signaling through IL1R can trigger downstream Caspase-3 activation (Fig. 6G). We have expanded the Discussion to talk about differing roles of Caspases on bacterial burden and association with the epithelium (lines 374-397).

      Minor comments:

      The majority of the points listed below can be addressed with further analyses of pre-acquired data sets:

      Fig1E/1F/4D: each green dot is not likely to be individual bacteria but rather a cluster of bacterium (based on their size). So the y-axis in Fig 1E and Fig4D should not be #STM.

      Y-axis labels have been changed to #STM objects

      Fig2A: Variations in organoid size and epithelial thickness can be observed between figures. In particular, in Fig 2A, the HIO seems much younger than the other ones displayed in the manuscript.

      There is considerable natural variability between HIOs and between batches, a phenomenon observed by many HIO researchers (Hofer et al. Nature Reviews Materials 2021). HIOs were all treated the same way prior to infection, and based on our extensive observations, epithelial thickness does not correlate significantly with a particular experimental condition, as we now show in S10 Fig.

      Line 176 to 178, the authors mentioned the TUNEL+ cells in the mesenchyme but rule out the possibility that this phenotype could be infection or PMN-dependent because it is observed in the different conditions. As the picture displayed in Fig2A suggests high differences in the number of TUNEL+ cells in the mesenchyme under the 4 tested conditions, the authors should still quantify this phenomenon (possibly in the supplementary).

      This is likely an artifact of culturing and not due to the infection or PMNs. There is variability between HIO batches in the amount of TUNEL signal in mesenchymal cells (for example HIOs in Fig 4A and 5A have very low or no TUNEL positivity in the mesenchyme).

      "DAPI" should be written in blue.

      This has been corrected.

      Fig2C: Could the authors comment on the % of E-cadherin cells that are also TUNEL+? Is it 100%?

      On average about 75% of TUNEL+ cells are E-cadherin+. We think that this may be an underestimate because E-cadherin staining intensity decreases in many cells after shedding. This is commented on in the text (lines 178-179).

      Fig 2D: The point made on lines 182 to 186 that HIOs contain TUNEL + cells retained in the epithelial lining in the absence of PMNs is not very strongly supported by Fig 2D. Quantification of the number of intraepithelial TUNEL+ cells in the 4 compared conditions would make a more solid case.

      We quantified TUNEL intensity in epithelial cells retained in the monolayer (S7 Fig). We do note that there is some variability in this phenotype that correlates with different batches of HIOs__.__

      Fig2E: This experiment should be completed with a quantification of the percentage of TUNEL+ cells that are also cleaved caspase3-positive. The data, as currently displayed, do not prove that the cells negative for cleaved caspase 3 are apoptotic cells and thus do not support the sentence "suggesting that multiple forms of cell death were occurring in the PMN-HIO" (line 194).

      Cells negative for cleaved Caspase-3 that are TUNEL+ may be undergoing some other form of cell death that is not Caspase-3 dependent, such as necrosis. This possibility is consistent with the decreased TUNEL signal observed upon inhibition of Caspase-4 (Fig 5A,B)__. __However, we have reworded our conclusion to identify more clearly what the data indicate, and where we are drawing inferences.

      Fig3A: "IL1RN" should be changed for "IL1RA (IL1RN)" for consistency with Fig 3B.

      The heatmap shows gene expression data so IL1RN is more consistent with the gene nomenclature. However, we have added an asterisk to the label on the heatmap, along with a sentence in the figure legend to elucidate.

      Fig 4C: The authors should provide the percentage of infected cells rather than the number of bacteria per cell (this information can be included in supplementary).

      Percent infected cells has been moved to Fig 4C and the number of bacteria per cell has been moved to Fig 4D__.__

      FigS2: The different thicknesses of the epithelial layer observed between PBS and STM panels suggest a difference in scale. This may be double-checked by the authors.

      The images are scaled similarly – as noted earlier (S10 Fig), there is considerable natural variability between HIOs that is not correlated with any experimental condition in this study.

      Line 197-199, the authors claimed that uninfected cells may be observed in the cell lumen. This seems hard to observe/conclude at this resolution. The authors may show a non-infected cell at higher magnification. __

      We have added higher magnification images, uninfected cells are indicated with white arrows in S8 Fig.

      Discussion: Some important points should be added to the discussion. In particular, what is the fate of intracellular salmonellae after cell shedding? Can the bacterium survive cell apoptosis and burst out of the cell to re-infect the epithelium or be transferred to phagocytic cells during the clearance of intraluminal apoptotic cells? Previous studies showed that cytosolic hyper-replication could fuel cell shedding. The importance of bacterial load in PMN-induced cell shedding could be discussed.

      We have expanded the discussion to elaborate on what may happen to shed cells. One useful feature of the HIOs is that the enclosed lumen allows us to capture the cells to fully measure the extent of cell shedding, however in the intestine where there is flow these cells would be washed away and could help to reduce bacterial load in the intestine. This point is now made in lines 386-388 in the discussion.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Major concerns

      1) The authors show that only ~5% of the neutrophils have migrated to the lumen, which is a barely noticeable increase compared to PBS treated organoids. Does this reflect that the mucosal layer of the organoids might not produce neutrophil chemoattractants and that neutrophil recruitment during Salmonella is a bystander effect from a different cell type?

      This number indicates that PMNs are ~5% of total cells in the PMN-HIO (including epithelial and mesenchymal cells) during Salmonella infection (not that only 5% of PMNs added migrated). Moreover, PMNs were added to a well containing multiple HIOs. We also show that HIOs do produce neutrophil chemoattractants during infection (S1 Fig).

      2) How quickly are neutrophils recruited to the HIOs? The authors show one time point of 8 hours. Related to the relatively low number of neutrophils seen in their HIOs, is this perhaps a result of the time point they chose? Will they see more neutrophils recruited if they go longer?

      It is likely that 5% of total cells in the PMN-HIO represents a significant recruitment of PMNs, and our data clearly indicate a marked effect on the infected epithelium. PMNs can cause substantial tissue damage, and their recruitment and activation is known to be tightly regulated. Due to the short-lived nature of human PMNs it would be difficult to extend this experiment to later timepoints. We have experimentally characterized PMN migration at 24h and by that time, most of the PMNs that we observe are non-viable, thus we focused our studies earlier.

      3) The authors show that PMNs did not kill STM in their organoids, but they do in pure culture. Is this simply because of the low levels of neutrophils present in their HIOs, which would result in lower concentrations of antimicrobials being produced in the HIO lumen? If the authors are able to get higher levels of neutrophils in their HIOs, would they see increased bacterial killing?

      Neutrophils have both inflammatory signaling and microbicidal functions. For example, Cho, et al (PLoS Pathogens 2012) find that neutrophil-derived IL-1 beta is sufficient to support abscess formation in the innate immune response to Staphylococcus aureus soft tissue infection. Similarly, a recent study showed that activation of neutrophils by keratinocyte defensins in a S. aureus skin infection led to neutrophil IL1 beta and CXCL2 release that amplified antibacterial defenses (Dong, et al Immunity 2022). Moreover, in the native environment of the gut with extensive microbiome colonization, direct neutrophil microbicidal activity might be less effective against infection than signaling. Recruitment of higher levels of neutrophils in vivo or in the HIO might exacerbate damage of the epithelial barrier. In the discussion, we speculate there may be proteins, like Elafin, that are upregulated during infection and inhibit some neutrophil functions as a trade-off to control host tissue damage. We reason that our data strongly support an inflammatory signaling role for neutrophils to promote innate immune responses of the intestinal epithelium.

      4) Related to the above point, if the authors treat their HIOs with known neutrophil chemoattractants, can they increase the number of neutrophils that migrate into their organoids?

      High levels of chemoattractants are already being produced in the HIO in response to infection (S1 Fig). The most effective number of neutrophils in the context of intestinal infection may not be the highest number, given that neutrophils can cause tissue damage. Since we see a marked phenotype with the neutrophils that are recruited, we propose that this PMN-HIO model reveals important inflammatory signaling roles for PMNs to promote intestinal epithelial immune function.

      5) The authors speculate that Salmonella may "employ specific mechanisms to overcome PMN effector functions in the HIO luminal environment". Are any such mechanisms known? If so, the authors could test this hypothesis by repeating these experiments with Salmonella mutants in which these mechanisms are ablated. In this case, they should see increased killing of Salmonella by PMNs in the HIO lumen.

      The focus of this study was to test how PMNs contribute to the host response against wildtype Salmonella. In the PMN-HIO model, we find that neutrophils direct a robust epithelial cell extrusion response, impacted intracellular bacterial numbers, and that Salmonella luminal colonization is not affected by PMNs. Thus, our data are pointing to an important inflammatory role for neutrophils in the infected intestine. Indeed, reliance on direct bactericidal mechanisms in the intestinal lumen which in vivo would be colonized with the microbiota might be a losing strategy for neutrophils, which would be hugely outnumbered.

      6) Furthermore there is no information of the activation status of the neutrophils. How does the surface expression of CD16 CD62L, CD66 and CD11b look between the migrated and non-migrated and between infected and uninfected controls? Did the neutrophils de-granulate? Are they CD63+ or is the high levels of NGAL and S100 proteins an effect of lysis? The authors should also be careful in claiming that there is NETosis as the image in the supplement look more like an artifact than actual NETs.

      Our new findings suggest that IL-1 production by PMNs is the biggest factor in driving the cell death phenotype. We have also added a figure with CD63 staining. We were able to visualize some localization of CD63 to the cell surface of PMNs, consistent with degranulation (S4 Fig).

      7) Why does ASC translocate to the nucleus? Is the IL-1b cleavage mediated through Caspase-1 or Caspase-11? The neutrophils stained positive in the lumen appear to be intact, does this mean that pyroptosis does not occur, or does the IL-1b come from cells that did not migrate through the mucosal membrane? Staining for IL-1 and the different caspases might help resolve this question.

      ASC does not appear to be translocating to the nucleus. In Fig 3D the green signal (ASC) is primarily excluded from the DAPI-stained area. In this human model, Caspase-11 is not present, and inhibition of Caspase-1 is sufficient to block the cell shedding phenotype (Fig. 5A,B and Fig. 6B,C). We are unable to distinguish whether IL-1 is being produced by intact PMNs or PMNs that are undergoing pyroptosis. Unfortunately, there are not suitable antibodies for fixed immunofluorescence staining for cleaved Caspase-1, and as a secreted protein, IL-1 beta likely will not remain localized with the producer cell.

      8) The authors comment that there is substantial TUNEL staining in the mesenchyme independent of STM or PMNs, however, there is no explanation for why this happens. Does this have any downstream effects on the neutrophils that doesn't migrate towards the lumen?

      TUNEL positivity in the mesenchyme is likely an artifact of culturing and we have noted this in the text (line 169-172). The extent of TUNEL+ mesenchymal cells appears to be dependent on the batch of HIOs as not all HIOs exhibit this phenotype (for example Figs 4A and 6B). In contrast, the extent of TUNEL+ luminal cells is significantly dependent on the presence of PMNs and Salmonella.

      Minor comments

      1) The authors should remove that MPO is neutrophil-specific, monocytes are known to have higher MPO expression than neutrophils.

      In this controlled co-culture system there are no monocytes, therefore we have modified our text to indicate that MPO is used as a neutrophil marker in the PMN-HIOs (line 161).

      2) If the authors performed flow cytometry as they say, they should provide the flow plots and the gating strategy they used in the supplement.

      Representative flow plots for the data presented in Fig 1A are now included in S2 Fig. The data shown in Figs. 1A and S2 Fig are not gated.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Major Comments

      1.Overall the study is convincing and it is well-conducted. This reviewer found it surprising that the PMNs did not alter the total levels of STM in the HIOs as neutrophils are expected to control the infection. Can the authors elaborate on if the intraepithelial numbers are reduced, what happens to STM in the lumen? It would be convincing if the authors can extend the infection timeline to see if the neutrophils are capable of killing luminal STM. *

      One of the limitations of the HIO model is the lack of flow in the lumen. It is likely that shed cells would be removed from the body following extrusion in vivo. In the HIOs, since the cells are trapped in the lumen, Salmonella could then reinvade and so this phenotype might be even stronger in a model that incorporates flow. We have added this point to the discussion (lines 387-390). Due to the short-lived nature of PMNs, it is difficult to extend the infection beyond 8h. While in vitro experiments with just neutrophils and STM as we and others have performed might set the expectation that neutrophils would alter luminal bacterial levels, there is little to no direct evidence that neutrophil bactericidal activity is critical in the context of the intestinal environment (vs. releasing ROS or inflammatory signals that may have complex indirect effects). Indeed, an advantage of the HIO model is that we are able to test the function of neutrophils in a multi-component system, but one that is still sufficiently simplified that we can do some mechanistic analysis.

      2-It would be powerful to conduct the caspase inhibition on neutrophils prior to HIO co-culturing to convincingly show that the effects of caspase inhibition effect neutrophils which in turn effect the epithelium disrupting the epithelial load of STM.

      We appreciated this suggestion. We pretreated the PMNs with a Caspase-1 inhibitor for 1h prior to co-culture with infected HIOs. We found that this was sufficient to block TUNEL cell accumulation in the lumen of infected PMN-HIOs. These results are now presented in Fig 6B,C.

      3- While other caspases are well-established to be involved in Salmonella-related cell death and epithelial shedding, why did the authors picked caspase 3 but not caspase 4/5 to show activation in Fig 2?

      We have now also tested the role of Caspase-4 on cell shedding using z-LEVD-fmk inhibitor. Consistent with prior published studies, we found that Caspase-4 inhibition reduced the accumulation of TUNEL-positive cells in the PMN-HIO lumen. These results are presented in Fig 5. There are no detectable levels of Caspase-5 in the HIOs (S9 Fig).

      Minor comments

      Fig 1C It is not clear how the total bacterial burden was determined. Please include details such as the timepoint and sufficient details of the technique both in the results section and the legend.

      These details have been added in the figure legend (line 605-607). Briefly, HIOs were washed with PBS and homogenized in PBS at 8hpi. CFU/HIO were enumerated by serial dilution and plating on LB agar.

      • Fig S2. Authors claim that the PMNs form NETs in the lumen, however, the marker used in the immunostaining is MPO. Although a NETting is seen in the images, MPO staining is not sufficient to claim these are NETs. Additional staining is required to show if the neutrophils in the lumen are intact or formed NETs*.

      As noted in response to Reviewer #1, although we commented on NETs in our original manuscript, our conclusions do not rely on the presence or absence of NETs and our new data implicates PMN IL-1 as necessary and sufficient for the cell shedding phenotype. We have therefore removed the NET data and the reference to NETs. While NETs are potentially interesting in the context of intestinal infection, we understand the reviewer's concern about NETs and anticipate that a more quantitative characterization of NETs may be challenging given the structure and variability of the PMN-HIOs.

    1. ABSTRACT

      This work has been published in GigaByte Journal under a CC-BY 4.0 license (https://doi.org/10.46471/gigabyte.66), and has published the reviews under the same license. These are as follows.

      Reviewer 1. Linzhou Li

      Are the data and metadata consistent with relevant minimum information or reporting standards? No. Geographic location (country and/or sea, region, latitude and longitude) is missing, as well as environmental context.

      Is there sufficient data validation and statistical analyses of data quality? No. The genome size and gene number of Dendrobium hybrid cultivar ‘Emma White’ differ greatly from the published Dendrobium genomes (e.g. Zhang et. al Scientific Reports 2016, Zhang et. al Horticulture Research 2021, Han et. al Genome Biology and Evolution 2020...). Specifically, the authors assembled a smaller genome and predicted a larger number of genes compared with the previous study. Therefore, I strongly suspect that the assembled genome is incomplete and fragmented, resulting in more fragmental genes.

      Is the validation suitable for this type of data? No. There's not enough raw data (~24Gb) to assemble a 600Mb (or ~1.2Gb from the previous study) genome. I highly recommend the authors get more raw data and do a genome survey.

      Additional Comments: The Complete BUSCOs only account for 16.6% which is quite low. The authors explain that the large loss of BUSCOs is due to the fact that the mutant genome has a lot of specific sequences, but these genes are very conserved in plants and should not be easily mutated.

      Reviewer 2. Stephanie Chen

      Is the language of sufficient quality? No. Most of the manuscript is written in a sufficient quality, but there are certain parts that require revision to improve readability. Please see detailed comments on the Word document.

      Are all data available and do they match the descriptions in the paper? No. The SRA link is coming up as a permission error, but I assume it will be released once the paper is available. There is no information on where to access the annotation file.

      Is the data acquisition clear, complete and methodologically sound? No. The contiguity (635,396 contigs, N50 of 1,620 bp) and completeness (16.60 %) of the genome is quite low and this may limit its downstream uses. It would be good to incorporate some long-reads or increased sequencing coverage to improve your genome. There are a number of chromosome-level Dendrobium genomes that are available (e.g. D. chrysotoxum and D. huoshanense) and scaffolding off these may be attempted to improve the assembly. Scaffolding from existing assemblies may be a good option if generating more sequencing reads is not feasible.

      Is there sufficient detail in the methods and data-processing steps to allow reproduction? No. Some details on the DNA extraction and library preparation steps are missing. In the methods section, there are also missing details for multiple programs in terms of the version and parameters (e.g. BUSCO version and database used, QUAST version, AUGUSTUS version, details on adapter removal and trimming). It is mentioned 'similarity score and description of each gene was filtered out using in-house pipeline'. The script and details of the pipeline are not provided; please add a reference or details in the manuscript e.g. link to GitHub repository.

      Is there sufficient data validation and statistical analyses of data quality? No. The reporting and interpretation of BUSCO results ('BUSCO version 5.2.2 analysis reveals 913 (56.57%) single-copy orthologs doesn’t match with any data bases indicates the unique and possible uncharacterized sequences in mutant genome of Dendrobium hybrid cultivar') needs to be revisited. There needs to be additional validation of the gene annotation (e.g. BUSCO, comparison with existing Dendrobium annotations) and also some validation of the genome size (e.g. GenomeScope and comparison with reported flow cytometry measures).

      Is the validation suitable for this type of data? Yes. The type of validation in the manuscript (BUSCO) is suitable to assess genome completeness, but reporting and discussion of the results needs to be revised. Some additional validation is also needed (see box above).

      Additional Comments: In this manuscript, the authors provide a draft genome of a gamma-ray induced mutant of a Dendrobium hybrid cultivar using Illumina sequencing that will assist with future breeding efforts and studies. However, I am not convinced of the genome's usefulness in its current form. There are some methods that need to be described in more detail to be reproducible. Revisions will also help improve the readability of the manuscript. As page and line numbers are not provided on the manuscript, please find additional comments directly added to manuscript file attached.

      https://gigabyte-review.rivervalleytechnologies.com/download-api-file?ZmlsZV9wYXRoPXVwbG9hZHMvZ3gvRFIvMzA2L1Jldmlld19TQ185Njc2XzA1MDIyMl9HaWdhYnl0ZV9HYW1tYSBXR1MgZGF0YW5vdGUgKDEpLmRvY3g~

      Re-review: Thank you to the authors for addressing the previous comments on the manuscript. I generally find the revisions satisfactory, although have some follow up comments. The addition of details on the genetic origin of the Dendrobium ‘Emma White’ hybrid cultivar and requested details on bioinformatic tool versions/parameters have strengthened the manuscript. The authors have not followed up on the suggestion to improve the genome via scaffolding, but provide an explanation that existing chromosome-level assemblies/sequencing data of Dendrobium species are not suitable as they are not related to the hybrid cultivar the authors studied, implying that they are highly diverged and scaffolding would not meaningfully improve the genome. Given this information, I think the Dendrobium ‘Emma White’ hybrid cultivar genome can still be useful for orchid breeding efforts despite low contiguity and completeness. However, I do not agree with the author’s point of, “Third, we used low coverage genome analysis with short reads of gamma mutant Dendrobium hybrid cultivar, as it was the first case study and obtained SRA, genome assembly and TSA accessions from NCBI. The genome assemblies of Dendrobium species from earlier studies used both long reads and short reads in their study. Construction of scaffolding from such database species using our contigs may be skewed and shall give unreliable data based on above points mentioned. Hence, I opinioned that suggestion given by Reviwer 2 on scaffolding suggestion may not be correct point.” Even if different types of sequencing technologies were used in comparison to Emma White genome, the availability of a contiguous closely related reference genome would still be useful for reference-guided scaffolding of the draft genome and well as comparative analyses. Lines 107-109: Reorder sentence to make the order of the steps clear i.e. adapter removal and quality filtering before assembly with MaSuRCA. Also, on the MaSuRCA GitHub (https://github.com/alekseyzimin/masurca), it says “Avoid using third party tools to pre-process the Illumina data before providing it to MaSuRCA, unless you are absolutely sure you know exactly what the preprocessing tool does. Do not do any trimming, cleaning or error correction. This will likely deteriorate the assembly.” Did the authors find that the pre-processing meaningfully improved the quality of the assembly, compared to if the raw reads were input straight into the assembler? Please justify the preprocessing of reads. Suggest to reword lines 137-139 “BUSCO version 5.2.2 analysis reveals 913 (56.57%) single-copy orthologs doesn’t match with any data bases indicates the impact from evolutionary development of hybrid cultivars and influence of gamma radiation. It is because, the genome of ‘Emma White’ hybrid cultivar of Dendrobium derived from five unique different species is complex genome and continuously hybridized repeatedly 11 times over a period of 68 years with selection process for economic trait improvement” to make the explanation clearer and also to include the number and/or percentage of complete BUSCOs. This was flagged in the previous comments, but not fully resolved and would benefit from revisiting the interpretation of BUSCO results. There are a large number of missing BUSCOs in your assembly, likely related to low contiguity (as well as radiation which is mentioned). Can you discuss if/how this may be a limitation for using this genome in further studies? You suggest that the BUSCOs are not found in the assembly due to many rounds of trait selection and radiation. It is possible that some of the BUSCOs are indeed missing from the particular plant sequenced, but how can you be certain that this is due to the breeding history and radiation applied as implied in the text, and not low genome contiguity? Some papers which characterised gamma irradiation-induced mutations in plants (DOIs: 10.1093/jrr/rraa059, 10.1186/s12864-019-6182-3, 10.1534/g3.119.400555) indicate that it is unlikely as many as 913 BUSCO genes have been affected. Even with stronger doses of radiation than used on the orchid, the number of mutations/genes affected is much lower. The genus name needs to be consistently italicised throughout the manuscript.

      Re-re-review: Thank you to the authors for addressing the previous comments on the manuscript. The authors have followed up on the suggestion scaffold the genome by using the published Dendrobium huoshanense genome to scaffold their draft genome using RagTag. This is an appropriate tool to use and has improved the contiguity of the draft assembly which is good to see. In the methods, the version of RagTag is missing, as are the parameters used to run the program. Please also provide specification on the specific RagTag utilities used (correct, scaffold, patch and/or merge). The authors have added genome statistics for two other orchids and the scaffolded assembly in Table 1, however, have not added BUSCO results for their scaffolded assembly in Table 2. Also, can the authors provide a comment on if the low BUSCO values may be related to the fragmented assembly as brought up in the previous round of review? It will be interesting to see if BUSCO has improved with the scaffolding. BUSCO results for the other two species, D. catenatum and D. huoshanense, would also be a good point of comparison and this is relatively simple and quick to add. The authors could consider concatenating Table 1 and 2 in this case. The draft assembly has improved, and the authors should report numbers on the final version of the assembly presented in the paper (i.e. the scaffolded assembly) in terms of the analysis they have run. In the results and discussion section, it appears some of the statistics (e.g. 96,529 genes, 216,232 SSRs) still refer to the first draft assembly. The authors have clarified that raw reads were used as input into MaSuRCA (line 111) and have now included the necessary detail for the input and parameterisation of the program. Line 157-159: “Taxonomical analysis of mutant Dendrobium at raw sequence data also revealed limited synteny with its closest Dendrobium catenatum species at below 9% and genetically heterogeneous with outcrossing nature”. Details of how this analysis was done is missing from the methods. It may be more appropriate to perform synteny analysis at the genome level and compare the published D. catenatum genome with the scaffolded Dendrobium hybrid genome.

      Editors comment: Additional Editorial Board assessment and feedback was received during the review process.

    1. Reviewer #1 (Public Review):

      The present study aims to define the main immune cell subsets found in the hemolymph of the white shrimp, P. vannamei. This is significant because this species is heavily farmed around the world to meet the demand of the human consumption market. Yet, farmed shrimp suffer from infectious diseases and therefore we need to understand how their immune system works to design strategies that decrease infection losses.

      Classification of crustacean (and other invertebrates) hemocytes is difficult due to the lack of antibodies to use traditional flow cytometry approaches. Furthermore, hemocyte purification is not easy, cells die and clump, again precluding flow cytometry studies. Thus, the majority of what we know about shrimp hemocytes is based on morphological classification. This study contributes significantly to advancing our knowledge of shrimp Immunobiology by defining hemocyte subsets based on their transcriptional profiles.

      Another strength of the paper is that some function in vivo assays (phagocytosis) are presented in an attempt to validate the single-cell data. The authors frame their question or try to frame their question with a more evolutionary angle, such as whether the macrophage-like cell is the evolutionary precursor of human macrophages. I think that this question is not really achievable because the evolution of innate immune systems may have diverged in many branches of the metazoan tree of life. The authors, however, identify gene markers that are conserved in macrophages from shrimp and humans and that is a fair conclusion. There are some methodological caveats to the study and the manuscript needs to be heavily edited to improve language as well as to increase the depth of the interpretation.

      In summary, there are interesting findings in this manuscript but the manuscript needs to be significantly improved so that its quality and impact are elevated.

    1. Author Response

      Reviewer #1 (Public Review):

      The manuscript by Dr Riley and colleagues reports a novel link between molecular clock operative in skeletal muscle and titin mRNA, encoding for essential regulator of sarcomere length and muscular strength. Surprisingly, this clock-mediated regulation of titin occurs at the level of splicing, as demonstrated by SDS-VAGE analyses of skeletal muscle from muscle-specific Bmal1KO mice compared to Bmal1wt counterpart. Concomitant with switch of predominant isoform of titin, skeletal muscle of muscle specific Bmal1KO mice exhibited irregular sarcomere length. Moreover, the authors show that this shift of titin splice is causal for such sarcomere length irregularity and for altered sarcomere length in muscle from the mice with compromised clock function. Importantly, the authors provide compelling evidence that Rbm20, encoding for RNA-binding protein that mediates splicing of titin, is cooperatively regulated by Bmal1-Clock heterodimer and MyoD, via enhancer element in intron 1 of Rbm20, thus identifying Rbm20 as a novel direct clock-regulated gene in the skeletal muscle. Strikingly, rescue of Rbm20 in muscle specific Bmal1KO animals' results in rescue of titin splicing pattern and protein size, suggesting that Rbm20 mediates the regulatory effect of Bmal1 on titin splicing and represents a mechanistic link between the clock and regulator of sarcomere length and regularity.

      We thank reviewer 1 for the very kind comments. We agree that the circadian regulation of titin in any capacity is surprising. We are excited about the implications of our work for cardiac muscle and its therapeutic potential in human skeletal muscle.

      Reviewer #2 (Public Review):

      In this work the authors investigated whether deleting the BMAL1 gene, an integral component of the cellular clock that drives the circadian rhythms of cells, affects the giant protein titin. They report that deleting BMAL1 in skeletal muscle alters the splicing of titin and that this might underlie an increase in sarcomere length dispersion. They show that the effect is through the titin splicing factor RBM20. This work has high novelty and has the potential to add to our understanding of muscle physiology. It is unclear whether splicing of skeletal muscle titin indeed undergoes a circadian rhythm. This could be easily checked using protein gels or RNA seq in muscle samples collected at different times of the day.

      We appreciate the question and recognize that our original manuscript did not clearly outline that the circadian clock regulates both rhythmic and non-rhythmic gene expression. In this study, the target of the muscle clock is expression of Rbm20 mRNA which is not a rhythmically expressed gene in muscle. This has now been addressed in the manuscript.

      Based on the estimated titin turnover and incorporation rates of titin (Cadar et al., 2014), we do not believe that skeletal muscle titin splicing undergoes a circadian rhythm. However, we believe our data highlights the growing recognition of the molecular clock in regulating non-rhythmic processes. We have added data from a chronic phase advance model of circadian disruption with wildtype mice and identify that disrupted circadian rhythms are sufficient to change Rbm20 expression in skeletal muscle (Figure 5).

      This work would be more convincing if the sarcomere length dispersion was investigated in greater detail. Showing this in one muscle type only (TA), in muscles fixed at one length only, and not showing sarcomere length dispersion in the rescue experiment of Figure 6, is rather limited.

      We agree that our analysis of sarcomere length dispersion across joint angles would be interesting but we think it is beyond the scope of this study. As noted above, the premise of this study emerged from our early work in which we found that skeletal muscle from 2 different genetic mouse models of circadian disruption, Bmal1 KO mice as well as the Clock mutant mice, exhibit decreased maximum specific force with significant disruptions to sarcomere structure (Andrews et al., PNAS, 107 (44) 19090-19095 2010). The primary focus of this study was to address the mechanistic link between the muscle circadian clock, its transcriptional targets with a focus on sarcomere structure and our first clue was with the expression of titin isoforms. We included analysis of sarcomere length as an outcome measure because it is a fundamental feature of skeletal muscle, it has links to mechanical function and it is a structure that can be modified by titin spliceforms.

      A small increase in sarcomere length variation as suggested in Figure 2 is unlikely to have a great functional consequence. If it were, how can muscles that express naturally long titin isoforms (soleus, EDL, diaphragm, etc), function well?

      We did not intend to suggest that we see an increase in sarcomere length in Figure 2 and have clarified the figure and text accordingly. The change we see is related to the variability of sarcomere length; we do not see any change in the average sarcomere length. The topic of titin spliceform specialization and the contribution to sarcomere structure and function across different muscle groups (soleus vs. EDL vs. Diaphragm) is a really interesting question but beyond the scope of this study.

      Reviewer #3 (Public Review):

      This manuscript is using an inducible and skeletal muscle specific Bmal1 knockout mouse model (iMSBmal1-/-) that was published previously by the same group. In this study, they utilized the same mouse model and further investigated the effect of a core molecular clock gene Bmal1 on isoform switching of a giant sarcomeric protein titin and sarcomere length change resulted from titin isoform switching. Lance A. Riley et al found that iMSBmal1-/- mouse TA muscle expressed more longer titin due to additional exon inclusion of Ttn mRNA compared to iMSBmal+/+ mice. They observed that sarcomere length did not significantly change but more variable in iMSBmal1-/- muscle compared to iMSBmal+/+ muscle. In addition, they identified significant exon inclusion in the proximal Ig region, so they measured the proximal Ig length domain and confirmed that proximal Ig domain was significantly longer in iMSBmal1-/- muscle. Subsequently, they experimentally generated a shorter titin in C2C12 myotubes and observed that the shorter titin led to the shorter sarcomere length. Since RBM20 is a major regulator of Ttn splicing, they determined RBM20 expression level, and found that RBM20 expression was significantly lower in iMSBmal1-/- muscle. The reduced RBM20 expression was regulated by the molecular clock controlled transcriptional factor MyoD1. By performing a rescue experiment in vivo, the authors found that rescue of RBM20 in iMSBmal1-/- TA muscle restored titin isoform expression, however, they did not measure whether sarcomere length was restored. These data provide new information that the molecular cascades in the circadian clock mechanism regulate RBM20 expression and downstream titin isoform switching and sarcomere length change. Although the conclusion of this manuscript is mostly supported by the data, some aspects of experimental design and data analysis need be clarified and extended.

      Strengths:

      This paper links the circadian rhythms to skeletal muscle structure and function through a new molecular cascade: the core clock component Bmal1-transcription factor MyoD1-RBM20 expression-titin isoform switching-sarcomere length change.

      Utilization of muscle specific bmal1 knockout mice could rule out the confounding factors from the molecular clock in other cell types

      The authors performed the RNA sequencing and label free LC-MS analyses to determine the exon inclusion and exclusion through a side-by-side comparison which is a new approach to identify individual alternative spliced exons via both mRNA level and protein level.

      We agree that the side-by-side analysis from RNAseq and LC-MS data are novel and provides a foundation for others wanting to study both titin mRNA and protein. In this version, we have expanded this work to include samples from our Rbm20 rescue model (Figure 6). Similarly, to our approach in the muscle specific Bmal1 knockout model, these results confirm our RNA-seq results and indicate that LC-MS is a suitable method to measure titin protein isoform. We note that while more work is needed to confirm the broad utility of the LC-MC approach, it may be a suitable alternative to RNA-seq for measuring region-specific, and possibly exon-specific, changes in titin isoform expression.

      Weaknesses:

      Both RBM20 expression and titin isoform expression varies in different skeletal muscles. The authors only detected their expression in TA muscle. It is not clear why the authors only chose TA muscle.

      The reviewer, like Reviewer 2, raises a good point about muscle specificity as this is a significant challenge for research in the field of skeletal muscle. As we noted above, our primary focus was on the TA because our goal was to study the molecular links between the muscle circadian clock and titin expression with inclusion of analysis of a structural outcome, sarcomere length variability. This muscle is well suited for the combination of approaches employed. We recognize the limits of using a single muscle, but we note that the we used ChIPseq data that provided the initial clues that CLOCK and BMAL1 bind to a site within intron 1 of the Rbm20 gene came from gastrocnemius and not TA muscle samples . Our targeted ChIP-PCR confirms that CLOCK and BMAL1 bind to the same intron 1 location from TA muscle samples. In addition, we have included data from quadriceps and TA muscles in our chronic jet lag model in which we use an environmental manipulation to disrupt the muscle clocks. We believe that the edits to the text and inclusion of this data strengthen and extends our findings to other muscles through circadian disruption and not only a genetic knockout model.

      The sarcomere length data are self-contradictory. The authors stated that sarcomere length was not significantly changed in muscle specific KO mice in Line 149, however, in Line 163, the measurements showed significantly longer in muscle specific KO muscle. The significance is also indicated in Figures 2C and 3B.

      We apologize for the miscommunication. The significance indicated in Figure 2C refers to the significant difference in variability of sarcomere length and not a significant difference in sarcomere length. The difference in Figure 3B is to indicate a slightly longer but significantly different from control sarcomere length, but also a significant difference in sarcomere length variability. To make this difference clear, we have changed the symbol for significantly different variability from * to # in both Figures 2C and 3B. We hope this clarifies our findings.

      Manipulating titin size using U7 snRNPs linking to the changes in sarcomere length and overexpressing RBM20 to switch titin size are the concepts that have been proved. These data do not directly support the impact of muscle specific Bmal1 KO on ttn splicing and RBM20 expression

      We agree that the use of U7 snRNPs does not directly support the impact of muscle specific Bmal1 KO on titin splicing and RBM20 expression; however, that was not the goal of this set of experiments. Several papers have recently indicated titin’s role as a sarcomeric ruler (Tonino 2017, Brynnel 2018), but none of them have investigated the proximal Ig domain that we identified as regulated by the circadian clock disruption. Because of this, we thought it necessary to show this region specifically contributes to sarcomere length using our cell culture model. Further, we think this point strengthens our study as it suggests that in the absence of a clock effect, altering the proximal Ig domain of titin directly alters sarcomere length adding to the growing evidence base that titin acts as a sarcomeric ruler. We have edited the text of the results and the discussion to clarify this point.

      There is no evidence to show if interrupted circadian rhythms in mice change RBM20 expression and ttn splicing, which is critical to validate the concept that circadian rhythms are linked to Ttn splicing through RBM20.

      We recognize this concern and have performed a new study in which we used a model of chronic jet lag in normal adult C57BL6 mice as a model to disrupt the muscle clock (Wolff, Duncan and Esser, JAP 2013). This new data has been added in Figure 5 and shows that by altering the lights on: lights off schedule every 4 days for 8 weeks, mimicking repeated jet lag, we disrupt Rbm20 expression in TA and gastrocnemius muscle (note, this is new data for both the muscle and clock fields). Concomitant with changes in clock gene expression we reported in 2013, we found that mRNA expression of Rbm20 is altered as well. These findings confirm that normal muscle clock disruption is sufficient to alter expression of Rbm20.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      This is already a full revision, not a revision plan. All points were carefully addressed. TMF

      July 28, 2022

      RE: Review Commons Refereed Preprint #RC-2022-01555

      Dear Dr. Fuchs,

      Thank you for sending your manuscript entitled "Dissecting the invasion of Galleria mellonella by Yersinia enterocolitica reveals metabolic adaptations and a role of a phage lysis cassette in insect killing" to Review Commons. We have now completed the peer review of the manuscript. Please find the full set of reports below.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this manuscript Saenger et al. concentrate on the pathophysiological details of insect larvae infection by Yersinia enterocolitica. The authors studied the colonisation, proliferation, tissue invasion, and killing activity of the bacteria in Galleria mellonella larvae. Their study provides valuable evidence for the biological relevance of Tc toxins and a neighboring holin-endolysin cassette during establishment of Y. enterocolitica infection in Galleria mellonella larvae through the oral route. The findings of the authors provide important novel insights, that can be used for the development of Tc toxins as biopesticides.

      In general, this is a nice study. The data and the methods are presented well so that they can be reproduced and the key conclusions convincing.

      Unfortunately, the manuscript is sloppily written in some places, including grammatical and formatting errors. Citations regarding the structure and mechanism of action of Tc toxins are arbitrarily chosen, often taking the wrong ones and important aspects are left out. I highly recommend that the authors read the review of Roderer and Raunser 2019 that nicely describes and summarizes the molecular mechanism of Tc toxins.

      Answer: We have now improved the writing of the manuscript and corrected several errors and typos. In particular, the review by Roderer and Raunser, as well as other literature in the field, is now considered and cited in the text.

      The abstract ends with a speculation: "Suggesting that this dual lysis cassette is an example for a phage-related function that has been adapted for the release of a bacterial toxin" - this is likely true, but not proven in this work. What if it is used for the release of something else like extracellular DNA needed for biofilm formation (see https://doi.org/10.1038/ncomms11220)?

      Answer: This sentence was carefully written as a hypothesis strengthened by the data obtained in our study. Experimental evidence for this assumption is the strong correlation of toxin and HE cassette phenotypes of mutants (see abstract), the highly conserved localisation of the cassette within Tc loci of distinct bacterial genera (see discussion for literature), and the synchronic regulation of both the toxin and the lysis genes (manuscript in preparation). Moreover, strain W22703 is unable to form biofilms in contact with invertebrates (Spanier et al., AEM 2010). There, also in accordance with other reviewers, we would like to keep this statement in the text. However, to address this interesting point, we now mention the finding of Turnbull et al. in the discussion (see last paragraph).

      In addition to that, several outstanding issues must be addressed:

      1. Line 45 3-D structural analysis of the tripartite Tc suggests a 4:1:1 stoichiometry of the A, B and C subunits, with the A subunit forming a cage-like pentamer that associates with a tightly bound 1:1 sub-complex of B and C. This is wrong. The stoichiometry is 5:1:1 and the structure is not a cage. The statement was taken from citation 3. However, citation 3 should not be used, since the stoichiometry as well as the structure that was determined there is wrong. Use Landsberg et al. 2012 PNAS, Gatsogiannis et al. 2013 Nature instead.

      Answer: We apologize for misunderstanding the literature. Reference Lee et al. was removed here, and the two papers plus Meusch et al. (Nature, 2014) are now cited. The stoichiometry was corrected, “cage” was removed.

      "Few bacteria are known to successfully colonize and infect invertebrates" - needs a reference.

      Answer: This was modified to “Several bacteria…”, and we cite the recent paper by Weber and Fuchs (in press) that in Table 7g lists more than 40 bacterial species pathogenic towards insects.

      "Their oral insecticidal activity is comparable to that of the Bacillus thuringiensis- (Bt)- toxin" - reference missing.

      Answer: The reference is now cited (Bowen et al., Science 1998). Please see the last paragraph of the paper.

      "Type a, type b and type c" subunits is not usual for the literature. Please use TcA, TcB, TcC. A-, B-, and C-components should be abbreviated as TcA, TcB and TcC respectively in order to be in line with recent literature on the topic.

      Answer: This was corrected accordingly.

      Is TccC an ADP-ribosyltransferase or does it have a different biochemical activity?

      Answer: This is unknown with respect to the Tc of Y. enterocolitica. In the introduction, we now refer on P. luminescens and do not further attribute such a function to the TcC of Y. enterocolitica. In the abstract, we replaced “ADP-ribosylating” with “toxic”.

      "The toxic and highly variable carboxyl-terminus of TccC that has recently been demonstrated to ADP-ribosylate actin and Rho-GTPases" - this is only certain for TccC3 and TccC5 from P. luminescens. There are many such C-termini, called HVRs which have not had their activities determined yet, see here: https://doi.org/10.1371/journal.ppat.1009102

      Answer: We agree and cite this article. See also the response to comment 5 above.

      "is probably followed by receptor-mediated endocytosis" - more recent references exist for the receptor binding of Tc toxins.

      Answer: We added two references pointing to glycans as receptors of the Tc (line 52).

      "A pH decrease then triggers the injection of a translocation channel formed by the pentameric TcaA subunits into the endosomal vacuole, followed by the subsequent release of the BC subcomplex into the cytosol of the target cell" - this again is incorrect. Please read the above mentioned review and correct this passage accordingly.

      Answer: We agree. This phrase was rewritten to “The attachment of the Tc to the host cell membrane is either followed by receptor-mediated endocytosis or release of the ADP-ribosyltransferase into the target cell {Landsberg, 2011 #738;Sheets, 2011 #742}{Meusch, 2014 #788}. In a pH-dependent manner, the TcA translocation channel injected into the membrane of the host cell. Conformational changes then allow the toxic component to be released into the translocation channel of TcA and from there into the cytosol {Meusch, 2014 #788}{Roderer, 2019 #871}.” (Lines 51-56)

      What is meant by "environmental Yersinia species"?

      Answer: This was corrected to “…and in Y. mollaretii.”

      In the relevant W22703 pathogenicity island sequence (https://www.ncbi.nlm.nih.gov/nuccore/AJ920332) previously submitted by the same group, something odd is going on with the TcA component: it appears to be split into three polypeptides (tcaA, tcaB1, tcaB2). In the manuscript you state TcA is made up from only tcaA and tcaB. Could you please address this?

      Answer: Shotgun sequencing was performed 15 years ago, and mapping revealed a frameshift within tcaB that resulted in the split annotation of tcaB. Even if this frameshift is not the result of a sequencing error, it obviously does not result in Tc inactivation. As this frameshift was not identified in most other Tc-PAI of yersiniae, we assume our statement to be correct.

      "And their products were recently shown to act as a holin and an endolysin, respectively" - missing reference.

      Answer: The reference is now cited (Springer et al., JB 2018).

      "Its Tc proteins are produced at environmental temperatures, but silenced at 37{degree sign}C." versus "Remarkably, HolY and ElyY lyse Y. enterocolitica at body temperature, but not at 15{degree sign}C". Please address the issue that HolY/ElyY lyse the bacteria at temperatures where Tc proteins are not produced.

      Answer: In the absence of in vitro conditions activating the HE gene cassette, we used the pBAD system to artificially overexpress the two genes and showed cell lysis at 37°C, but not at 15°C (Springer et al., JB, 2018). This finding points to a lack of cell lysis as prerequisite for TC release and strengthens the hypothesis of a new secretion system as now corroborated in the last paragraph of the discussion. To avoid confusion of readers, the sentence was removed from the manuscript.

      "Nematodes, which are easily maintained in the laboratory without raising ethical issues, have successfully been used to identify virulence-related genes in a broad set of bacterial pathogens" - what is the relevance of this for the current manuscript?

      Answer: Invertebrates are introduced here as infection models. Nematodes are mentioned here for two reasons: yersiniae are nematocidal due to the Tc, and their immune system is less elaborated than that of G. mellonella, thus explaining its preferred use as insect model. We shortened the sentence by deleting the phrase in commas.

      Fig. 1C - no description is given for the labels 1-8.

      Answer: This is given below figures 1E-H. The labels are valid for all figure panels to ease reading.

      "The hemolymph of these cadavers was found full of Y. enterocolitica cells" - injected CFUs are provided here, but not final CFUs in the cadavers (although referred to in a later section). Please address this.

      Answer: These were preliminary experiments to identify the optimal infection dose. Hemolymph content was plated, but cell numbers in the hemolymph were not enumerated. This sentence therefore now reads: “…and the hemolymph of these cadavers contained Y. enterocolitica cells.” (lines 113-114).

      What is the inducing agent used for pACYC-tcaA and pACYC-HE? Why would "slight leakiness of the pBAD-promoter" make pBAD-tccC non-inducible? Were colonies taken from the cadavers to verify that the bacteria still contained these plasmids?

      Answer: Within pACYC, the genes tcaA and hlyY/elyY (HE) are under control of their own promoters as indicated in Table S2. In general, pACYC vectors are often and successfully used for complementation due to middle copy number.

      This now reads “Due to the slight leakiness of the pBAD-promoter, arabinose was not added to further induce tccC transcription.” (lines 133-134).

      The presence of the plasmids in vivo was confirmed by periodic plating on selective and non-selective plates, not revealing differences in cell numbers.

      Can the authors please address the TD50 of 1.83 days for W22703 ΔHE/pACYC-HE versus 3.67 days for WT bacteria? This would mean that the former kill larvae twice as fast as usual. I would not call this "did not significantly differ in their insecticidal activity".

      Answer: This statement is indeed not very intuitive given the variations of the TD50-values. However, the significance here (and elsewhere in the text) is based on a statistical calculation. For the Kaplan-Meier-plot, we used an application (K.T.Bogen, Advances in Molecular Toxicology, 2016; Exponent Health Sciences, Oakland, CA, United States; Johann Kummermehr, Klaus-Rüdiger Trott, Stem Cells, 1997; Academic Press, London, San Diego) based on all data of a graph. However, to consider this point and to not confuse the readers, the phrase was modified to “…did not significantly differ in their insecticidal activity from that of the parental strain W22703 after one week, demonstrating…” (lines 135-138).

      Fig. 2 is missing survival data for larvae infected with tcaA, HE, and tccC KO bacteria.

      Answer: These data are shown and are equal to the LB-control, e. g. the survival rate of larvae infected with strains W22703 lacking HE, tcaA, or tccC were 100%.

      "And a slight colouring of some of the larvae from one h p.i. on (data not shown)" - best show the data or remove this statement.

      Answer: Although we observed this phenomenon regularly, monitoring and documentation cannot be provided and would not substantially strengthen the manuscript. We therefore deleted this phrase.

      The infection of larvae by W22703 ΔtccC/pBAD-tccC is missing, the other bacterial variants are present. Please address this.

      Answer: Infections with W22703 DtccC are not shown to not overload the figure, please see the panel below. W22703 DtccC/pBAD-tccC infections have not been documented by photos. Figure legend 3 now reads “Infections with W22703 DtccC and DtccC/pBAD-tccC are not shown.”

      "initially proliferated from an application dose of 4.0 × 105 CFU and 4.0 × 105 CFU, respectively, to 2.2 × 106 CFU and 2.8 × 106 CFU, but could not be detected from day three on. This finding strongly suggests that TcaA is involved in adherence to epithelial cells and thus in midgut colonization". Please address the "initially proliferated" (which day post-infection?), their elimination from the larvae (how, why?), why the tccC KO bacteria were more virulent than tcaA KO bacteria, and where the suggestion about TcaA involvement specifically in adherence comes from.

      Answer: “initially proliferated” was rewritten to “proliferated within the first day p.i.”. (line 163)

      Elimination: This now reads “…was completely absent six days p.i., probably due to passage through the gut followed by excretion”. (lines 161-162)

      In our view, the tccC knockout mutant is not more virulent than W22703 DtcaA (se Fig. 2), but replicates during the first day post infection, whereas the cell numbers of the tcaA KO mutant strongly decrease already within the first 24 h p.i.. This prompted us to speculate that Tc is involved in two infection steps, e.g. adherence and hemocyte inactivation. For clarity, this sentence was modified to: “This discrepancy suggests that TcaA is involved in adherence to epithelial cells and thus in midgut colonization, without requiring TccC.” (lines 165-166)

      In Fig. 4, the CFUs for W22703 ΔtccC/pBAD-tccC are essentially the same as for the other rescued KOs and WT, while in the text a point about weaker growth is made. Is this justified? Also, even though the CFU data is present here, data on infection of larvae by W22703 ΔtccC/pBAD-tccC is missing unlike the other bacterial variants. Please explain.

      Answer: We agree that this part of the results is misleading. We want to stress that the complementation very well restores the phenotype of the wildtype. The weaker growth of DtccC may be due to the distinct vector system used here. This part was there shortened and rephrased to: “When larvae were infected with 4.0 × 105 CFU of the DtcaA and DHE mutants, and with 1.4 × 106 CFU of strain W22703 DtccC/pBAD-tccC, all of which carrying the deleted genes on recombinant plasmids, the bacterial burden at days one to six p.i. increased approximately to that of the parental strain W22703 applied with 9.0 × 105 CFU, indicating a successful complementation of the gene deletions.”

      ” (lines 166-170).

      Missing data on W22703 ΔtccC/pBAD-tccC infection in Fig. 3, please the answer to point 20 above.

      Fig. 6b - The presence of an anti-RFP signal is not obvious in any of the bottom row images. The top row images are missing the same kind of annotation provided for Fig. 6a, without which non-histologists will find understanding the figure difficult.

      Answer: The anti-RFP signal is visible only on the left photo of the bottom panel, and not in the other three photos as explained in the text. We understand that the signals are not very strong, but they are visible on the screen.

      "In the absence of the lysis cassette, however, TcaA::Rfp was not detected despite the presence of W22703 ΔHE tcaA::rfp cells." + "To test whether or not the promoter of the lysis cassette is active in vivo, we infected G. mellonella larvae with strain W22703 PHE::rfp. Although Y. enterocolitica cells densely proliferated within the hemolymph (FIG. 6B), no staining signal that would point to the presence of TcaA was obtained, possibly due to no or weak PHE activity." Does this mean that without HE, tcaA does not express?

      Answer: No, we performed Western Blots showing that TcaA is detected in cells lacking HE. Therefore, a negative feedback regulation (e. g. increasing intracellular amounts of TcaA repress its own transcription) can be excluded. This is also in line with the low transcriptional activity of the lysis cassette in vivo (new Fig. S1B).

      "These data suggest that the HE cassette is responsible for the extracellular activity of the insecticidal Tc." Please explain how the preceding paragraph leads to this conclusion.

      Answer: This was poorly written and now reads “…for the transport…” (line 224).

      "As expected, bacterial cells, e.g. Y. enterocolitica, are visible in the hemolymph obtained from W22703-infected animals, but not in all other preparations." - which figure are the authors referring to?

      Answer: We have indeed identified, but not immunostained, bacterial cells in those preparations, but they are not visible in Fig. 7. This sentence was removed. However, the presence of W22703, but not its tc-PAIYe-mutants, in the hemolymph is demonstrated in Fig. 6A.

      "To delineate the transcriptional profile of Y. enterocolitica during infection of G. mellonella, we applied immunomagnetic separation to isolate Y. enterocolitica from the larvae 12 h and 24 h after infection" - do the authors store the bacteria for up to 24 h at 4 {degree sign}C, as indicated in the methods section?

      Answer: Yes, the probes were stabilized with RNAlater and then stored up to 24 h to synchronize all samples of one experiment.

      "The endolysin located within Tc-PAIYe was significantly up-regulated after 24 h, but not after 12 h, pointing to its possible role in the release of the Tc" - I could not find the endolysin in Table S1. Could the authors mark it clearly? Also, why is the holin also not upregulated?

      Answer: The endolysin gene is lacking in Table S1 due to its FC=1.02. We now added a table to Fig. S1 that shows the FC values of all genes from Tc-PAIYe. The FC-value of holin gene is 0.87, thus pointing to a very slight transcription of this lysis gene as discussed, thus preventing cell death.

      "This is in line with the fact that a T3SS is lacking in strain W22703" - Is a complete genomic sequence available for this strain, so readers could validate this statement?

      Answer: The genome sequence is available, and the reference is now cited (line 358). The common virulence plasmid of yersiniae, pYV that encodes the T3SS, is missing in this strain. We do not mention here the presence of a second, but probably incomplete, chromosomally encoded T3SS in strain W22703 do not overload the manuscript.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      This is a very, very nice study as it actually describes the role of different Tc toxin components in a model infection system using an important bacterium- really for the first time in a properly controlled manner. The mutants lacking either the syringe (AB) or the bullet (C) make 'sense' for a loss of function perspective. The description of the phage cassette in loss of function is also interesting and could do with some more speculation? For example, some groups of Photorhabdus bacteria release their oral toxicity (Tc's) into their bacterial supernatants- whereas in others it remains cell associated. The likely role of this phage cassette in this process should be discussed (is cell suicide required for release?).

      Answer: We now discuss the possibly role of the lysis cassette in more detail, including the possibility that a subpopulation commits cell suicide (see lines 375-396).

      Reviewer #2 (Significance (Required)):

      This is highly significant finding as despite all of the very elegant structural studies done on these important toxins there is still very little work in vivo. These studies clearly show the role of the different components of these ABC toxins in vivo. It should be published with priority.

      Congratulations to the authors.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: The authors analyze the phases of infection of Galleria mellonella by Yersinia enterocolitica following forced oral feeding. They study different phases of infection, including survival within the gut and invasion of the hemolymph. By analyzing differences in the genes up- and down regulated, they show that for example transporters for food sources from the hemocoel are regulated for making those sources available for the bacteria.

      Major comments: This is an interesting paper demonstrating genes of Y. enterocolitica dependent for colonization, growth and crossing of the epithelial gut barrier in G. mellonella.

      Major points which have to be addressed:

      Introduction: line 54: the BC subcomplex is not released into the cytosol! It is only the hypervariable region (enzymatic part) which enters the cytosol. This has to be corrected.

      Answer: This has been corrected accordingly.

      Fig.2/3: Why have different CFU been used for the distinct bacterial strains? This does not allow a direct comparison of their toxicity. For me the dead larvae shown in Fig. 3 are not represented in Fig 2 (data are not concordant), because of the loss before day one depicted in Fig. 2: The curves should be normalized to the same starting point (should be 100 %)?

      Answer: We would like to stress here that infection doses are hard to reproduce if frozen and diluted stocks are used. We decided for overnight culture to better mimic natural conditions and controlled each culture for its viable cell numbers by plating. Moreover, we choose the infection doses in a conservative manner, e.g. the number of mutants was higher than that of the parental strain.

      The data of Fig. 3 are concordant with Fig. 2 for two reasons: First, this experiments was performed in replicates with a total of 36 larvae per strain (see Fig. 2 legend), so that representative photos are shown. Second, larvae were considered dead if they failed to respond to touch, and many larvae without strong sign of melanisation were already killed.

      We analysed the algorithmus of the Kaplan-Meier-plot. All graphs start at 100%, this is now mentioned in the legend. There are no data between day 0 and day 1, and a stepwise graph is essential for this plot.

      Fig. 3: Why is the strain W22703 delta tccC/pBAD - tccC missing in the data set?

      Infections with W22703 DtccC are not shown to not overload the figure, please see the panel below. Answer: W22703 DtccC/pBAD-tccC infections have not been documented by photos. Figure legend 4 now reads “Infections with W22703 DtccC and DtccC/pBAD-tccC are not shown.”

      Minor: line 221: "the" is doubled

      Answer: This has been corrected accordingly.

      Reviewer #3 (Significance (Required)):

      The manuscript shows the use of G. mellonella as a straight foreward method to study gene functions of pathogenic bacteria, a significant knowledge for scientists of the field.

      Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      Summary: Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).

      Answer: There are already three sections that summarize the results and the methods applied, namely the abstract, the last paragraph of the introduction, and the conclusion following the discussion. In our view, a further summary would overload the manuscript. Nevertheless, depending on the journal the manuscript will be published in, an additional authors´ summary would be provided.

      Outlines proposed role of lysis cassette in oral infection of Galleria as a model insect for host pathogen interaction, data which is fortified through use of histology and RNAseq.

      Introduction could extend to additional background eg Aleniz et al and other entomopathogen transcriptome data, more so other studies using Yersinia and Galleria as a model (refer references provided in the below comments)

      Answer: We again carefully screened PubMed for studies in the field and added few papers. However, in vivo transcriptome analyses are still rare, as indicated by a lack of a respective investigations with the highly relevant entomopathogen Photorhabdus luminescens. The literature suggested by the reviewer is now cited in the introduction and the discussion (see below for details).

      The strength of the paper lies in understanding the progression of the disease in the insect host as mentioned L316-317 and clearance of the bacteria via in TcaA mutant

      Major comments: - Are the key conclusions convincing? Yes for mode of action Fig 5 could have additional panels -this is a strength of the paper

      Answer: We agree that this time course is a strength of the paper, and we carefully selected representative photos. There are several to be shown, but to our view, they are rather illustrative than providing a substantial additional value.

      Fig 6 legend could better describe the observed insect components

      Answer: The insect components are now indicated in Fig. 6B and in Fig. 5.

      Figure 7 may be lost in PDF conversion -the figure appears un resolved? are there more high resolution photos

      Answer: Fig. 7 was present in the merged PDF provided by the publisher. We used the photos with the best resolution.

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? the data provided is in places rudimentary (i.e. validation of the role of the lysis cassette in virulence) and could be bolstered with the construction and use of a lysis translational reporter etc I was left unsure how the HE::rfp and TcA::rfp constructs were made. I had assumed red florescent protein however it appears an antibody is used. This needs to be clarified as I then found it hard to interpret the results.

      Answer: The transcriptional PHE::rfp fusion is mentioned in the results section, but immunostaining failed probably due to a very low promoter activity (line 223). This is well in line with the transcriptome data. Please see a detailed answer how the HE::rfp and tcaA::rfp were constructed below. We applied the RFP-antibody for two reasons: first, fluorescence microscopy did not reveal clear red fluorescence in the tissue sections, and second, a TcaA antibody failed to match quality criteria for this purpose.

      It appear l114-125 that their may be enough data to derive a LD50 values and or LT value at a fixed dose - if so reporting this data of interest. It may also allude as to why a 10e5 dose was selected for subsequent expts

      Answer: This is an interesting point. The LD50 (dose of cells that kills 50% of all larvae) is usually not calculated in publications in this field of research, because its calculation requires a very huge separate data set that cannot be used to answer the questions addressed here. Such a dat set is not available. We published the dose-dependent toxicity of Y.enterocolitica W22703 upon subcutaneous injection, and from these data, we determined a LD50 for this strain of approximately 2 x 104 cells. The paper is cited in our manuscript. The 10E05 dose was selected due to our preliminary work and the reproducibility of the experimental phenotypes.

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation. Use of lysis the reporter - discuss commonalties of the in host transcriptome with other Yersinia Galleria systems eg Paulson etc al (refer below). Are there any thoughts on the host range of this Yersinia and can this be placed in a pathogen host evolutionary context?

      Answer: Paulson et al. are now cited twice in the text. The host range of Yersinia enterocolitica has not been investigated to our knowledge. However, its nematocidal activity has been described by Spanier et al., and Manduca sexta larvae, the tobacco hornworm, is also killed by W22703 (see references). Moreover, there are two copies of tccC in the genome of strain W22703 encoding the cytotoxic Tc subunit with its hypervariable C-terminus that is assumed to contribute to host specificity. This is discussed in very detail by Song et al. (see references).

      Evolution: Yes, this has been addressed by Waterfield et al. 2004 (see references) where insects are hypothesized as a source of emerging pathogens. We placed our findings in the context of this article in lines 91-94 and 305-310.

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments. Yes

      • Are the data and the methods presented in such a way that they can be reproduced? yes but I think some vector construction methodology is missing e.g. ::rfp (refer above)

      Answer: The plasmids used to construct the two strains W22703 tcaA::rfp and W22703 PHE::rfp are listed in Table S2. References for details are given (Starke et. al., 2013, Starke and Fuchs, 2014). Briefly, we used a suicide vector (pUTs) carrying the gene encoding the red fluorescent protein (RFP). This vector replicates in E. coli helper strains such as SM10, but not in Y. enterocolitica. Strain SM10 is now listed in Table 2. Following conjugation, the construct is chromosomally inserted upon recombination via the fragments cloned into the plasmid. In case of tcaA, we cloned the 3´-end of the gene to generate a translational fusion, and in case of HE its promoter, resulting in a transcriptional fusion with the reporter RFP.

      Fig 2 I am a little lost mortality seems quick on day 0 is this a result of aberrant injection damage mortality or are the authors observing a different effect across mutants through the initial 24 hours? If data available could this time plot be extended out 0-24 hours. The dash used for W222703 tcaA /TccC look similar can a different symbol be used.

      Answer: The reviewer is right that the mortality is high on the first day. However, larvae monitoring for up to nine days is a standard in the literature. No data are available for a better resolution of the first 24 h that, however, were investigated in more detail in the time course of Fig. 5. Moreover, we observed changes in motility and colouring of some of the larvae from one h p.i. on (data not shown). Aberrant injection damage was avoided, and damaged larvae or larvae that not completely took up the infection solution were not further considered in the experiment. This is mentioned in lines 107-109.

      A different symbol is now used for W222703 DtccC /pBAD-tccC.

      • Are the experiments adequately replicated and statistical analysis adequate? Yes

      Minor comments: - Specific experimental issues that are easily addressable. - Are prior studies referenced appropriately? Other entomopathogenic transcriptome studies could be compared to and or cross referenced (I have provided references in the response

      Answer: Repetition of our answer above: We again carefully screened PubMed for studies in the field and added few papers. However, in vivo transcriptome analyses are still rare, as indicated by a lack of a respective investigations with the highly relevant entomopathogen Photorhabdus luminescens. The literature suggested by the reviewer is now cited in the introduction and the discussion (see below for details).

      I am unsure on the use of immuno pulldown and efficiency of recovering the Yersinia using this method as opposed to direct sequencing total RNA has this method been used in other systems,

      Answer: Isolating RNA from in vivo probes of infected insects encounters two challenges: first, a possible contamination with commensal bacteria, and a too high amount of host RNA that reduces the number of sequence reads. This might be the reason for the relatively low sequence depth found in related papers in the field of in vivo transcriptomics. We overcame these problems by immunomagnetic separation that is easily applicable and enriches the samples with respect to Yersinia cells, this is now mentioned in the results. We also cite a study (Prax et al., in which we established the protocol of IMS.

      • Are the text and figures clear and accurate? Yes though in places better naming of insect components could be listed

      Answer: This was done, see above.

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      As listed above potential use of reporters and or comparison and transcriptome analysis to other systems and an evolutionary pathogen host context (refer comments above) would strengthen the manuscript

      Answer: Please see answer to comments above. We explained the use of the reporter fusions, and put the transcriptome analysis into the context of related studies.

      Minor comments as per below When first mentioned good to state the larval instar used

      Answer: We used larvae of instar 5-6 according to Jorjao et al. (2018), this is now mentioned and cited in the M&M section, line 434.

      l 78 lon protease? what type? this is an important SOS protease affecting many regulatory systems please clarify

      Answer: This is a Lon A endopeptidase, and its function for the temperature-dependent activity of the lysis cassette has ben described (Springer et al. 2021, see references). Its relevance for the thermodependent regulation of Yersinia virulence has been documented by Herbst et al. (PMID: 19468295) and Jackson et al. (https://doi.org/10.1111/j.1365-2958.2004.04353.x).

      l103-113 an description of the elemental tract which is depicted, perhaps this could be placed in the Fig. 1 figure legend

      Answer: We agree and substantially shortened the first paragraph of the results. Relevant aspects are now mentioned in Figure legend 2, redundancies with the figure legend were removed.

      l 133 use of the word larvae in place of the word animals might be more appropriate

      Answer: This was corrected accordingly.

      l 133 clarify delta HE mutant description when first mentioned

      Answer: The abbreviation HE is now introduced in the introduction in line 74.

      Lines 220-234 hard to follow mainly as I am unsure how then strains are constructed, perhaps clarify what rfp is how was it made :: demotes and insertion but yet then they seek to detect TcaA? I could not find the methodology on its or HE::rfp construction

      Answer: The plasmids used to construct the two strains W22703 tcaA::rfp and W22703 PHE::rfp are listed in Table S2. References for details is given (Starke et. Al., 2013, Starke et al. 2014). Briefly, we used a suicide vector (pUTs) carrying the gene encoding the red fluorescent protein (RFP). Following conjugation, the construct is chromosomally inserted upon recombination via the fragments cloned into the plasmid. In case of tcaA, we cloned the 3´-end of the gene to generate a translational fusion, and in case of HE its promoter, resulting in a transcriptional fusion with the reporter RFP.

      Please see above why we used RFP-antibodies to detect TcaA.

      l247 immuno-magnetic separation to isolate Yersinia - is there an efficiency behind this method, might be good to mention (I am unfamiliar with this technique)

      Answer: We here repeat our answer to the point above: Isolating RNA from in vivo probes of infected insects encounters two challenges: first, a possible contamination with commensal bacteria, and a too high amount of host RNA that reduces the number of sequence reads. This might be the reason for the relatively low sequence depth found in related papers in the field of in vivo transcriptomics. We overcame these problems by immunomagnetic separation that is easily applicable and enriches the samples with respect to Yersinia cells, this is now mentioned in the results. We also cite a study (Prax et al., in which we established the protocol of IMS.

      l313 alludes to role of Tca in hemoceol which contradicts an earlier statements in l 130 please clarify

      Answer: The reviewer is right. The sentence in former line 130 (now lines 123-124) was corrected to “…suggesting that the Tc plays a main role in the initial phases of infection”. This statement does not exclude its activity towards hemocytes. Moreover, subcutaneous infection is very artificial and was therefore replaced by oral application in our study to mimic natural routes of infection. This is now elaborated in more detail in the discussion (Lines 305-310).

      For clarity table 1 could colour highlight (different colours) tc and lysis genes

      Answer: We now added a table to Fig. S1 that shows the FC values of all genes from Tc-PAIYe.

      CROSS-CONSULTATION COMMENTS I am in agreement with all points of reviewer 1 who has a clear understanding on Tc toxin composition TcA pentamer etc. Being familiar to the field I regret I did not pick up on these errors

      Answer: This has been corrected according to R1.

      Point 13 agree and should possibly bring in other researchers who have used Galleria as a model. It also needs to be kept in mind that the target host for many Tcs has yet to be determined hence the importance of oral activity of this isolate

      Answer: This has been corrected according to R1.

      I am similarly in agreement with comments of reviewer 3

      Reviewer 4 I over looked the LT50 data -- apologies but agree with reviewer 1 where WT should be the more potent strain --I still think if possible LD50 for WT would be of value more so to define its oral activity

      Answer: We repeat our answer from above. This is an interesting point. The LD50 (dose of cells that kills 50% of all larvae) is usually not calculated in publications in this field of research, because its calculation requires a very huge separate data set that cannot be used to answer the questions addressed here. Such a dat set is not available. We published the dose-dependent toxicity of Y.enterocolitica W22703 upon subcutaneous injection, and from these data, we determined a LD50 for this strain of approximately 2 x 104 cells. The paper is cited in our manuscript. The 10E05 dose was selected due to our preliminary work and the reproducibility of the experimental phenotypes.

      Reviewer #4 (Significance (Required)):

      SECTION B - Significance ========================

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      Extends from work of Fuchs - research group Extends from work of Palmer et al on lysis cassettes as potential T10SS Extends from work off Vesga Pseudomonas and Paulson Yersinia(refs provided below) on insect transcriptomics

      Of interest and possibly understated is the oral activity of enterocolitica in the insect host as mentioned L316-317 and how this might relate to the lifestyle/evolution of this microbe further elaboration here would be of interest

      Answer: We agree that this is an important aspect. Therefore, we added the following sentences here: “In contrast to subcutaneous injection in the use of insect larvae as model for bacterial virulence properties towards mammals, oral application mimics natural routes of infection that in particular take place during the bioconversion of animal cadavers by bacteria, fungi, and larvae {Carter, 2007 #879}. Together with the broad cytocidal host spectrum of bacterial toxins {Mendoza-Almanza, 2020 #880}, investigation of yet neglected natural infections of invertebrates will contribute to a better understanding of microbial pathogenicity {Waterfield, 2004 #480}.” (lines 305-310)

      • Place the work in the context of the existing literature (provide references, where appropriate).

      Relevant Transcriptome papers which could be referred to in the discussion i.e. are similar genes in play or is their a point of difference? https://doi.org/10.1093/g3journal/jkaa024;https://doi.org/10.1038/s41396-020-0729-9; https://doi.org/10.1099/mic.0.000311

      Answer: Paulson et al. mainly address virulence factors, whereas metabolism is not uncovered. We now cite similarities with respect to hemolysis and iron scavenging. The focus of Vesga et al. is on the interaction of a plant pathogen with wheat and two insect hosts, including their transcriptome. Although metabolic details are missing, there is an interesting overlap with the paper by Vesga et al. (hemocoel as permissive environment for proliferation) and a difference (upregulation of chitinases was not observed) that are now cited in the discussion. The Alenzi paper mainly investigated the general virulence of Y. enterocolitica strain. We cite its finding on the importance of motility, thus confirming our transcriptome analysis.

      • State what audience might be interested in and influenced by the reported findings. The oral activity of enterocolitica towards Galleria of interest and an evolutionary context insect vs mammalian activity in the discussion could be provided. Potential role of TcaA in gut association For the targeted journal I feel additional technical data is required and a broader context to other global systems (bacterial species) provided

      Answer: All points were addressed carefully and in detail. We refer to our answers to points detailed above.

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate. Reviewers expertise entomopathogens, their toxins and pathogen ecology
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the both the reviewers for their constructive comments. Please see our point-by-point response to all the comments.

      *Reviewer #1 (Evidence, reproducibility and clarity (Required)): *

      • Summary *
      • The authors of this manuscript confirm data found by others by determining replication kinetics of the ancestral B.6 SARS-CoV-2 virus, Delta and Omicron BA.1 and BA.2 in Calu-3 cells. The authors quantify barrier integrity between variants and interferon induction to conclude that Delta is more cytopathic and induced less interferon than Omicron, possibly leading to its increased pathogenesis. In addition the authors identify CuCl2 and FeSO4 as potential antivirals. *

      *Major comments *

      1. *__Reviewer comment: __The author's argue that Omicron's slower replication on Calu-3 cells correlates with mild disease, however many publications show that Omicron replicates more efficiently/ rapidly in primary human airway cultures: *
      2. Hui et al., (Nature, 2022) doi: https://doi.org/10.1038/s41586-022-04479-6*
      3. Peacock et al., (bioRxiv) doi: https://doi.org/10.1101/2021.12.31.474653*
      4. Lamers et al., (bioRxiv) doi: https://doi.org/10.1101/2022.01.19.476898 * Response: Previous reports including the citations indicated by the reviewer have shown that the Omicron variant replicates at a lower levels in lung tissue as compared to cells of bronchial origin or upper respiratory tract. In fact, Omicron variant was shown not to productively infect at all in alveolar type II cells. Omicron replication was severely compromised in Calu-3 cells grown in 96-well plates (https://doi.org/10.1080/22221751.2021.2023329) which is consistent with our observations.

      *__Reviewer comment: __Can the authors explain why air-liquid grown Calu-3 cells appear to display similar viral titers for Omicron and Delta at 24 and 36 h.p.i (Figure 5B), however lower viral replication in Figure 3B? If the cells in Figure 3B are submerged, then the authors should identify why ALI grown Calu-3 cells are more susceptible to Omicron. *

      Response: Cells were grown in plastic multi-well plates for growth curve experiments shown in Figure 3. The cells in this condition are not polarized and the virus titers are the total amount of virus released into the culture supernatant. The infection conditions in Figure 5 is under air-liquid culture conditions, from polarized cells. Therefore, the virus titers are only from the basolateral chamber. The outcomes of figure 3 and figure 5 are not comparable due to these technical differences. We will add this explanation in the results section.

      *__Reviewer comment: __The authors suggest that Delta disrupts epithelial barrier integrity to a larger extent compared to B.6 and Omicron, however this may be due to fewer infected cells (despite equal viral titers, the nucleocapsid staining in Figure 2 and 5C suggests fewer infected cells). Have the authors imaged B.6 or Omicron at a later timepoint (or normalized virus input for equal infected cells) to determine barrier integrity when the amount of infected cells is equal? Alternatively, the authors should discuss this as a possible limitation of their study, especially since they argue this is a major reason why Delta has a growth advantage (lines 345 to 349). *

      Response: We performed confocal imaging of transwells from air-liquid interface model using a 20X objective and have obtained data to show that the percent of infected cells is similar between Omicron and Delta variant. We will include this data in the revised manuscript. In an in vitro system, once the infection is set in, the infected cells eventually die and the TEER reaches background levels. We are proposing a delay in disruption of barrier integrity most probably due to lower cytopathogenicity of the Omicron variant. As per the reviewer’s suggestion, we will discuss the possible limitation of the models and provide additional interpretations.

      Minor comments *A) __Reviewer comment: __Line 118: Implications of this sentence are too strong. The authors have not shown the causality of Ct values and transmission, therefore they should reword the sentence: "indicating a high viral burden in patients during this period resulting in increased transmission of the virus among the contacts" to "likely attributing to increased transmission..." *

      Response: We will correct this.

      *__B) Reviewer comment: __Line 289: The authors suggest that infection with the Omicron variant generated higher levels of antibodies to the Delta variant, however these individuals are already vaccinated and elicit cross-neutralizing antibodies against Delta even before their Omicron infection. Therefore the Delta response is boosted and the Omicron response is essentially a primary response since vaccination elicits almost no cross-protection in itself. Therefore the authors should compare primary Delta infected individuals to primary Omicron infected individuals to determine cross-protection levels. *

      Response: We agree with the reviewer’s argument. Please note that the two vaccines used in India are against the ancestral virus (inactivated) or the spike protein expressed by the adenovirus vector backbone. As over 90% of the population in India have been fully vaccinated with these two vaccines and a majority of them may also have been infected with delta variant and now with omicron, it is practically impossible to compare primary delta cases vs primary omicron cases at this stage. As part of another study in mid 2021, after the second wave of COVID-19 infections due to the Delta variant in India, we randomly selected 55 samples which had a detectable FRNT50 value for the delta variant, to test for their ability to neutralize the Omicron variant. Only twenty of the 55 samples had detectable levels of neutralizing antibodies against the Omicron variant. By assigning a FRNT50 value of 10 for the samples which had no detectable levels of antibodies in the starting dilution (1:20) of the assay, we obtained a GMT of 22.5 (95% CI: 16, 31) for these 55 samples. This value was 20-fold lower than the GMT of Delta variant which was 404 (95% CI:248, 658). This clearly indicates that even during the peak of delta wave, there were barely any cross-reactive antibodies to the Omicron variant. This study was recently published [NATURE COMMUNICATIONS | https://doi.org/10.1038/s41467-022-31170-1]. It would be interesting to eventually compare the antibody responses in reinfections with other sub-lineages of Omicron variant which is beyond the scope of our manuscript. We will add this description in the results and discussion section of the revised manuscript.

      *C) __Reviewer comment: __There appears to be no reference to Figure 6G, however this reference is most likely missing from line 306. *

      Response: Thank you for bringing this to our notice. We will insert the reference to Figure 6G.

      *D) __Reviewer comment: __Line 359-362: The authors suggest that waning antibody titers increase susceptibility to new variants of concern, however their cohort already possessed very low antibody titers against Omicron a month after vaccination (Figure 7F) suggesting they could be equally susceptible to Omicron 1 and 6 months after vaccination. *

      Response: Please note that nine out of 15 samples had FRNT50 value above the level of detection after vaccination in June 2021. The number of samples positive for Omicron antibodies reduced to six out of 15 by Dec 2021 suggesting that relatively more people were without protective antibodies for Omicron variant by Dec 2021. Around 70% of the population was seropositive by Aug 2021 (https://doi.org/10.1016/j.ijid.2021.12.353) and most adults in India received both doses of their vaccine after June 2021 which would have boosted the humoral and cellular response to SARS-CoV-2. This is corroborated in a recently published report, where we showed that 36 out of 55 previously infected subjects had neutralizing antibodies for the Omicron variant after receiving a single dose of inactivated vaccine. Therefore, in the context of hybrid immunity in India, we speculate that waning antibody titers could have played a significant role in the emergence and spread of Omicron variant in addition to the ability of the Omicron variant to escape neutralization, replicate more efficiently in the upper respiratory tract etc., The fact that booster doses of vaccines developed against the ancestral virus/viral protein was capable of increasing the level of neutralizing antibodies to omicron variant suggests that the level of antibodies above a certain threshold may play a significant role in protecting against the omicron variant.

      Reviewer #1 (Significance (Required)):

      • __Reviewer comment: __Many of the conclusions based on replication and barrier integrity may not represent the situation in primary human tissues and does not explain the rapid spread of Omicron. In addition, interferon induction has already been described for these variants and this finding is not novel. The manuscripts most interesting and novel finding is the role of CuCl2 and FeSO4 as antivirals. It would be interesting to test these salts in primary human airway cultures. *

      Response: The study was conducted in the months of Jan-March 2022 and the first version of the results were uploaded on a preprint server in March 2022. The process of journals handling the manuscript and obtaining reviews is not under our control. We cannot argue to defend the comments on novelty when the Omicron variant is barely six months old and new variants continue to emerge. The deluge of publications should not result in reviewers branding most of the efforts as not novel or insignificant. We have been trying since three months to obtain primary cells but the distributors are unable to supply the same. We will continue to try to obtain cells from one or the other source. Transwells are back-ordered with expected delivery dates in three months. Meanwhile, we now have HBEC3-KT cells which are normal human bronchial epithelial cells immortalized with CDK4 and hTERT. We will perform the inhibition experiments in these cell lines to convince the reviewers.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      *In the manuscript entitled "BA.1 and BA.2 sub-lineages of Omicron variant have comparable replication kinetics and susceptibility to neutralization by antibodies" the authors assess the kinetics of growth of SARS-CoV-2 variants in Calu-3 cells and their effects on epithelial junction, and the interferon response. The authors also analyze the capacity of metal salts to block SARS CoV-2 replication in Calu-3 cells. Finally, the authors characterize the ability of vaccinated and/or COVID-19 patients to develop neutralizing antibodies to different variants using FRNT and specific binding assays (ELISA). *

      • The paper largely confirms several previous reports on the replication capacity and interferon responses of the different variants. Although the title and abstract focus on the Omicron sub-lineages, the paper is mostly focused on comparing original CoV2, with Kappa, Delta and Omicron. *
      • Figures 1-5 compare the replication kinetics, interferon responses, and epithelial barrier disruption of Kappa, Delta and the original Omicron (B.1.1.529) to the original B6 variant. On a separate note, Figure 7 shows the ability of metal salts (especially iron, copper, and zinc) to block viral RNA-dependent RNA polymerase activity (RdRp) in vitro. The authors also show the effect on virus replication in Calu-3 cells (Delta and Omicron B.1.1.529 only). The data mainly focus on the variants, the Delta and the Omicron (BA.1.1.529 and not the BA.1 and BA.2 sub-lineages) except in Fig 6A, B, G. *

      • __Reviewer comment: __Most importantly, a major limitation of the paper is that when human samples are analyzed, the authors assume that the patients have been infected with a specific variant according to the "peak" of infection, but sequencing is never performed. When neutralization and binding of antibodies are analyzed, the information on the patients is unclear - for example, were the patients exposed to Delta or Omicron or any of their sub-lineages? What was the vaccination status of SARS CoV-2 positive patients? And why non-tested individuals showing symptoms were included in the study (lines 302-304)? *

      Response: We thank the reviewer for the comments. Over 90% of the population in India is vaccinated. All the participants of the study have been vaccinated in 2021. The participants were enrolled into the study almost 4 weeks after recovery from illness. We have enrolled participants who have reported to have had fever or COVID-19-like symptoms in the preceding weeks with or without confirmed RT-PCR test results. Testing is an individual and voluntary choice now. Therefore, it would be difficult to find RT-PCR confirmed cases. Our assumption about exposure is based on a nationwide sequencing effort of thousands of samples every week and this approach is reliable and credible. As indicated in the text and in the supplementary figure, Omicron lineages BA.1 followed by BA.2 were the circulating virus lineages since Jan 2021 in India.

      *__Reviewer comment: __The authors show that BA.1 and BA.2 have similar replication kinetics in Calu-3 cells and induce similar neutralizing antibodies in the patients tested. However, there is a large disconnection with the rest of the paper that is mostly focused on Kappa, Delta, and Omicron B.1.1.529. Also, no comparisons between these variants and BA.1 or BA.2 have been shown. Similarly, a large assumption in the paper is that the patients who tested positive for COVID-19 have had "natural Omicron infection" (lines 36-37; lines 307-311) when it could be any other variants or Omicron sub-lineages as well. *

      Response: Please note that the B.1.1.529 which was used at the beginning of the study is the BA.1 sub-lineage which has been compared with Kappa and Delta variants. BA.2 emerged at later stages and therefore we have compared the kinetics and neutralization titer between BA.1 and BA.2. It is unreasonable to expect to repeat all the comparisons with BA.2 considering the cost and challenges of working in a BSL-3 environment. The initial version of this data was uploaded on preprint server in March 2022 when only two sub-lineages of Omicron namely BA.1 and BA.2 existed. Our data from the national SARS-CoV-2 sequencing consortium clearly shows that there were no other sub-lineages circulating at that time.

      Reviewer #2 (Significance (Required)):

      *__Reviewer comment: __In light of the fact that most of the paper does not look at the subvariants BA.1 and BA.2 of Omicron- either the authors compare BA.1 and BA.2 more comprehensively with Omicron B.1.1.529 or rewrite the conclusions and claims of the current paper. Similar to the experiments comparing B6 with Kappa, Delta and Omicron, Omicron B.1.1.529 should be compared similarly to BA.1 and BA.2 in a separate figure. In any case, the novelty compared to other papers -also cited by the authors- remains limited. *

      Response: We will revise the conclusions and claims of the paper as per the suggestions. Please see our response to reviewer 1 with regards to the novelty of our observations. The B.1.1.529 variant was later classified as the BA.1 variant. Our study was uploaded on the preprint server in March 2022 and the entire review process has taken four months. It is unfair to now demand comparison of BA.2 with Kappa or Delta variant which does not add any additional value to our observations.

      *__Reviewer comment: __In addition to the concerns mentioned above, there are more pressing variants circulating right now, such as BA.4 and BA.5. These variants are not referred in the paper. It might be beyond the scope of the paper, but including more analyses with BA.1, BA.2 (as the ones done with B.1.1.529) and adding some key data with BA.3, BA.4, BA.5 might substantially increase the relevance and importance of the paper. *

      Response: Please see our comments above. Our efforts are continuing in this direction to further look at antibody responses and replication kinetics of newer variants which have emerged recently. However, the scarcity of positive clinical samples and lower probability of getting samples that would be suitable for virus isolation are the challenges we are dealing with. We think testing newer variants which have emerged during the review process is certainly valuable but is extremely difficult under the current circumstances. We will have to apply to seek import permits to obtain these sub-lineages or enrol patients with symptoms and keep testing them to isolate, culture the virus and obtain whole genome sequence. We will have to establish neutralization assays with newer sub-variants to test in parallel with other Omicron lineages. All this is beyond the scope of our manuscript and will take few months of paper work and experimentation.

    1. Reviewer #3 (Public Review):

      The present study aims to elucidate posterior cingulate cortex (PCC) function with both single-unit and population-level depth electrodes. The results clearly show that the dorsal PCC (dPCC) is involved in executive functions (search and add), but that it also contains neurons that are selective for episodic memory (past and future) and rest conditions. With this impressive study design, the authors are able to reconcile discrepancies between human and primate studies. Furthermore, the derived conclusion that PCC function is more diverse than merely its participation in the DMN is of great importance for the field. Thus, I believe that this work will have a great impact on how we think about the PCC, by (1) emphasizing its participation in executive processes and (2) providing evidence of distinct single-unit response profiles that do not manifest on a population level.

      The main strength of this work is the combination of population-level measurements that clearly show the participation of dPCC in executive processes with microelectrode single-unit measurements and an unsupervised hierarchical clustering approach that allows for the identification of 4 distinct SU response profiles within the dPCC. In addition, the population-level electrodes mostly engaged in executive function cluster around an fMRI meta-analysis peak related to executive processing derived from neurosynth, providing a bridge to human fMRI research.

      Nevertheless, there is one concern regarding the data collected within the ventral PCC (vPCC) in this study and the way the authors integrated it into their conclusions.

      Specifically, the conclusion that "Together, they [the findings] inform a view of PCC as a heterogeneous region composed of dorsal and ventral subregions specializing in executive and episodic processing respectively" may not be completely supported by the data. The dPCC macroelectrode data does clearly show a functional specialization in executive processing, but does the data from vPCC presented in this manuscript also support the claim? While taking a closer look at the vPCC data, several inconsistencies stood out: First, the total number of vPCC electrodes was much smaller (6 vs 29 microelectrodes and one microwire probe that was not analyzed). Second, it is not clear which of the presented electrodes in figure 3 were considered to be ventral. From comparing figure 3 with the dorsal/ventral split displayed in figure 1B, it seems as if only one electrode was unambiguously placed in vPCC. Third, BBG statistics of these 6 electrodes are not presented, thus the claim that they show vPCC functional specialization is not statistically supported.

    1. Author Response

      Reviewer #1 (Public Review):

      Jones et al. investigated the relationship between scale free neural dynamics and scale free behavioral dynamics in mice. An extensive prior literature has documented scale free events in both cortical activity and animal behavior, but the possibility of a direct correspondence between the two has not been established. To test this link, the authors took advantage of previously published recordings of calcium events in thousands of neurons in mouse visual cortex and simultaneous behavioral data. They find that scale free-ness in spontaneous behavior co occurs with scale free neuronal dynamics. The authors show that scale free neural activity emerges from subsets of the larger population - the larger population contains anticorrelated subsets that cancel out one another's contribution to population-level events. The authors propose an updated model of the critical brain hypothesis that accounts for the obscuring impact of large populations on nested subsets that generate scale free activity. The possibility that scale free activity, and specifically criticality, may serve as a unifying theory of brain organization has suffered from a lack of high-resolution connection between observations of neuronal statistics and brain function. By bridging theory, neural data, and behavioral dynamics, these data add a valuable contribution to fields interested in cortical dynamics and spontaneous behavior, and specifically to the intersection of statistical physics and neuroscience.

      Strengths:

      This paper is notably well written and thorough.

      The authors have taken a cutting-edge, high-density dataset and propose a data-driven revision to the status-quo theory of criticality. More specifically, due to the observed anticorrelated dynamics of large populations of neurons (which doesn't fit with traditional theories of criticality), the authors present a clever new model that reveals critical dynamics nested within the summary population behavior.

      The conclusions are supported by the data.

      Avalanching in subsets of neurons makes a lot of sense - this observation supports the idea that multiple, independent, ongoing processes coexist in intertwined subsets of larger networks. Even if this is wrong, it's supported well by the current data and offers a plausible framework on which scale free dynamics might emerge when considered at the levels of millions or billions of neurons.

      The authors present a new algorithm for power law fitting that circumvents issues in the KS test that is the basis of most work in the field.

      Weaknesses:

      This paper is technically sound and does not have major flaws, in my opinion. However, I would like to see a detailed and thoughtful reflection on the role that 3 Hz Ca imaging might play in the conclusions that the authors derive. While the dataset in question offers many neurons, this approach is, from other perspectives, impoverished - calcium intrinsically misses spikes, a 3 Hz sampling rate is two orders of magnitude slower than an action potential, and the recordings are relatively short for amassing substantial observations of low probability (large) avalanches. The authors carefully point out that other studies fail to account for some of the novel observations that are central to their conclusions. My speculative concern is that some of this disconnect may reflect optophysiological constraints. One argument against this is that a truly scale free system should be observable at any temporal or spatial scale and still give rise to the same sets of power laws. This quickly falls apart when applied to biological systems which are neither infinite in time nor space. As a result, the severe mismatch between the spatial resolution (single cell) and the temporal resolution (3 Hz) of the dataset, combined with filtering intrinsic to calcium imaging, raises the possibility that the conclusions are influenced by the methods. Ultimately, I'm pointing to an observer effect, and I do not think this disqualifies or undermines the novelty or potential value of this work. I would simply encourage the authors to consider this carefully in the discussion.

      R1a: We quite agree with the reviewer that reconciling different scales of measurement is an important and interesting question. One clue comes from Stringer et al’s original paper (2019 Science). They analyzed time-resolved spike data (from Neuropixel recordings) alongside the Ca imaging data we analyzed here. They showed that if the ephys spike data was analyzed with coarse time resolution (300 ms time bins, analogous to the Ca imaging data), then the anticorrelated activity became apparent (50/50 positive/negative loadings of PC1). When analyzed at faster time scales, anticorrelations were not apparent (mostly positive loadings of PC1). This interesting point was shown in their Supplementary Fig 12.

      This finding suggests that our findings about anticorrelated neural groups may be relevant only at coarse time scales. Moreover, this point suggests that avalanche statistics may differ when analyzed at very different time scales, because the cancelation of anticorrelated groups may not be an important factor at faster timescales.

      In our revised manuscript, we explored this point further by analyzing spike data from Stringer et al 2019. We focused on the spikes recorded from one local population (one Neuropixel probe). We first took the spike times of ~300 neurons and convolved them with a fast rise/slow fall, like typical Ca transient. Then we downsampled to 3 Hz sample rate. Next, we deconvolved using the same methods as those used by Stringer et al (OASIS nonnegative deconvolution). And finally, we z-scored the resulting activity, as we did with the Ca imaging data. With this Ca-like signal in hand, we analyzed avalanches in four ways and compared the results. The four ways were: 1) the original time-resolved spikes (5 ms resolution), 2) the original spikes binned at 330 ms time res, 3) the full population of slow Ca-like signal, and 4) a correlated subset of neurons from the slow Ca-like signal. Based on the results of this new analysis (now in Figs S3 and S4), we found several interesting points that help reconcile potential differences between fast ephys and slow Ca signals:

      1. In agreement with Sup Fig 12 from Stringer et al, anticorrelations are minimal in the fast, time-resolved spike data, but can be dominant in the slow, Ca-like signal.

      2. Avalanche size distributions of spikes at fast timescales can exhibit a nice power law, consistent with previous results with exponents near -2 (e.g. Ma et al Neuron 2019, Fontenele et al PRL 2019). But, the same data at slow time scales exhibited poor power-laws when the entire population was considered together.

      3. The slow time scale data could exhibit a better power law if subsets of neurons were considered, just like our main findings based on Ca imaging. This point was the same using coarse time-binned spike data and the slow Ca-like signals, which gives us some confidence that deconvolution does not miss too many spikes.

      In our opinion, a more thorough understanding of how scale-free dynamics differs across timescales will require a whole other paper, but we think these new results in our Figs S3 and S4 provide some reassurance that our results can be reconciled with previous work on scale free neural activity at faster timescales.

      Reviewer #2 (Public Review):

      The overall goal of the paper is to link spontaneous neural activity and certain aspects of spontaneous behavior using a publicly available dataset in which 10,000 neurons in mouse visual cortex were imaged at 3 Hz with single-cell resolution. Through careful analysis of the degree to which bouts of behavior and bouts of neural activity are described (or not) by power-law distributions, the authors largely achieve these goals. More specifically, the key findings are that (a) the size of bouts of whisking, running, eye movements, and pupil dilation are often well-fit by a power-law distribution over several decades, (b) subsets of neurons that are highly correlated with one of these behavioral metrics will also exhibit power-law distributed event sizes, (c) neuron clusters that are uncorrelated with behavior tend to not be scale-free, (d) crackling relationships are generally not found (i.e. size with duration exponent (if there is scaling) was not predicted by size power-law and duration power-law), (e) bouts of behavior could be linked to bouts of neural activity. In the second portion of the paper, the authors develop a computational model with sets of correlated and anti-correlated neurons, which can be accomplished under a relatively small subset of connection architectures: out of the hundreds of thousands of networks simulated, only 31 generated scale-free subsets/non-scale-free population/anti correlated e-cells/anti-correlated i-cells in agreement with the experimental recordings.

      The data analysis is careful and rigorous, especially in the attention to fitting power laws, determining how many decades of scaling are observed, and acknowledging when a power-law fit is not justified. In my view, there are two weaknesses of the paper, related to how the results connect to past work and to the set-up and conclusions drawn from the computational modeling, and I discuss those in detail below. While my comments are extensive, this is due to high interest. I do think that the authors make an important connection between scale-free distributions of neural activity and behavior, and that their use of computational modeling generates some interesting mechanistic hypotheses to explore in future work.

      My first general reservation is in the relationship to past work and the overall novelty. The authors state in the introduction, "according to the prevailing view, scale-free ongoing neural activity is interpreted as 'background' activity, not directly linked to behavior." It would be helpful to have some specific references here, as several recent papers (including the Stringer et al. 2019 paper from which these data were taken, but also papers from McCormick lab and (Anne) Churchland lab) showed a correlation between spontaneous activity and spontaneous facial behaviors. To my knowledge, the sorts of fidgety behavior analyzed in this paper have not been shown to be scale-free, and so (a) is a new result, but once we know this, it seems that (e) follows because we fully expect some neurons to correlate with some behavior.

      R2a: We agree with the reviewer that our original introductory, motivating arguments needed improvement. We have now rewritten the last 2 paragraphs of the introduction. We hope we have now laid out our argument more clearly, with more appropriate supporting citations. In brief, the logic is this:

      1. Previous theory, modeling, and experiments on the topic of scale-free neural activity suggest that this phenomenon is an autonomous, internally generated thing, independent of anything the body is doing.

      2. Relatively new experiments (including those by Churchland’s lab and McCormmick’s lab: Stringer 2019; Salkoff 2020; Clancy 2019; Musall 2019) suggest a different picture with a link between spontaneous behaviors and ongoing cortical activity, but these studies did not address any questions about scale-free-ness.

      3. Moreover, these new experiments show that behavioral variables only manage to explain about 10-30% of ongoing activity.

      4. Is this behaviorally-explainable 10-30% scale-free or perhaps the scale-free aspects of cortical dynamics fall withing the other 70-90%. Our goal is to find out.

      Digging a bit more on this issue, I would argue that results (b) and (c) also follow. By selecting subsets of neurons with very high cross-correlation, an effective latent variable has emerged. For example, the activity rasters of these subsets are similar to a population in which each neuron fires with the same time-varying rate (i.e., a heterogeneous Poisson process). Such models have been previously shown to be able to generate power-law distributed event sizes (see, eg., Touboul and Destexhe, 2017; also work by Priesemann). With this in mind, if you select from the entire population a set of neurons whose activity is effectively determined by a latent variable, do you not expect power laws in size distributions?

      Our understanding is that not all Poisson processes with a time-varying rate will result in a power law. It is quite essential that the fluctuations in rate must themselves be power-law distributed. As a clear example of how this breaks down, consider a Poisson rate that varies according to a sine wave with fixed period and amplitude. In this case, the avalanche size distribution is definitely not scale-free, it would have a clear typical scale. Another point of view on this comes from some of the simplest models used to study criticality – e.g. all-to-all connected probabilistic binary neurons (like in Shew et al 2009 J Neurosi). These models do generate spiking with a time-varying Poisson rate when they are at criticality or away from criticality. But, only when the synaptic strength is tuned to criticality is the time-varying rate going to generate power-law distributed avalanches. I think the Priesmann & Shriki paper made this point as well.

      My second reservation has to do with the generality of the conclusions drawn from the mechanistic model. One of the connectivity motifs identified appears to be i+ to e- and i- to e+, where potentially i+/i- are SOM and VIP (or really any specific inhibitory type) cells. The specific connections to subsets of excitatory cells appear to be important (based on the solid lines in Figure 8). This seems surprising: is there any experimental support for excitatory cells to preferentially receive inhibition from either SOM or VIP, but not both?

      R2b: There is indeed direct experimental support for the competitive relationship between SOM, VIP, and functionally distinct groups of excitatory neurons. This was shown in the paper by Josh Trachtenberg’s group: Garcia-Junco-Clemente et al 2017. An inhibitory pull-push circuit in frontal cortex. Nat Neurosci 20:389–392. However, we emphasize that we also showed (lower left motif in Fig 8G) that a simpler model with only one inhibitory group is sufficient to explain the anticorrelations and scale-free dynamics we observe. We opted to highlight the model with two inhibitory groups since it can also account for the Garcia-Junco-Clemente et al results.

      In the section where we describe the model, we state, “We considered two inhibitory groups, instead of just one, to account for previous reports of anticorrelations between VIP and SOM inhibitory neurons in addition to anticorrelations between groups of excitatory neurons (Garcia-Junco-Clemente et al., 2017).”

      More broadly, I wonder if the neat diagrams drawn here are misleading. The sample raster, showing what appears to be the full simulation, certainly captures the correlated/anti-correlated pattern of the 100 cells most correlated with a seed cell and 100 cells most anti-correlated with it, but it does not contain the 11,000 cells in between with zero to moderate levels of correlation.

      R2c: We agree that our original model has several limitations and that one of the most obvious features lacking in our model is asynchronous neurons (The limitations are now discussed more openly in the last paragraph of the model subsection). In the data from the Garcia-Junco-Clemente et al paper above there are many asynchronous neurons as well. To ameliorate this limitation, we have now created a modified model that now accounts for asynchronous neurons together with the competing anticorrelated neurons (now shown and described in Fig S9). We put this modified model in supplementary material and kept the simpler, original model in the main findings of our work, because the original model provides a simpler account of the features of the data we focused on in our work – i.e. anticorrelated scale-free fluctuations. The addition of the asynchronous population does not substantially change the behavior of the two anticorrelated groups in the original model.

      We probably expect that the full covariance matrix has similar structure from any seed (see Meshulam et al. 2019, PRL, for an analysis of scaling of coarse-grained activity covariance), and this suggests multiple cross-over inhibition constraints, which seem like they could be hard to satisfy.

      R2d: We agree that it remains an outstanding challenge to create a model that reproduces the full complexity of the covariance matrix. We feel that this challenge is beyond the scope of this paper, which is already arguably squeezing quite a lot into one manuscript (one reviewer already suggested removing figures!).

      We added a paragraph at the end of the subsection about the model to emphasize this limitation of the model as well as other limitations. This new paragraph says:

      While our model offers a simple explanation of anticorrelated scale-free dynamics, its simplicity comes with limitations. Perhaps the most obvious limitation of our model is that it does not include neurons with weak correlations to both e+ and e- (those neurons in the middle of the correlation spectrum shown in Fig 7B). In Fig S9, we show that our model can be modified in a simple way to include asynchronous neurons. Another limitation is that we assumed that all non-zero synaptic connections were equal in weight. We loosen this assumption allowing for variable weights in Fig S9, without changing the basic features of anticorrelated scale-free fluctuations. Future work might improve our model further by accounting for neurons with intermediate correlations.

      The motifs identified in Fig. 8 likely exist, but I am left with many questions of what we learned about connectivity rules that would account for the full distribution of correlations. Would starting with an Erdos-Renyi network with slight over-representation of these motifs be sufficient? How important is the homogeneous connection weights from each pool assumption - would allowing connection weights with some dispersion change the results?

      R2e: First, we emphasize that our specific goal with our model was to identify a possible mechanism for the anticorrelated scale-free fluctuations that played the key role in our analyses. We agree that this is not a complete account of all correlations, but this was not the goal of our work. Nonetheless, our new modified model in Fig S9 now accounts for additional neurons with weak correlations. However, we think that future theoretical/modeling work will be required to better account for the intermediate correlations that are also present in the experimental data.

      We confirmed that an Erdo-Renyi network of E and I neurons can produce scale-free dynamics, but cannot produce substantial anticorrelated dynamics (Fig 8G, top right motif). Additionally, the parameter space study we performed with our model in Fig 8 showed that if the interactions between the two excitatory groups exceed a certain tipping point density, then the model behavior switches to behavior expected from an Erdos-Renyi network (Fig 8F). Finally, we have now confirmed that some non-uniformity of synaptic weights does not change the main results (Fig S9). In the model presented in Fig S9, the value of each non-zero connection weight was drawn from a uniform distribution [0,0.01] or [-0.01,0] for excitatory and inhibitory connections, respectively. All of these facts are described in the model subsection of the paper results.

      As a whole, this paper has the potential to make an impact on how large-scale neural and behavioral recordings are analyzed and interpreted, which is of high interest to a large contingent of the field.

      Reviewer #3 (Public Review):

      The primary goal of this work is to link scale free dynamics, as measured by the distributions of event sizes and durations, of behavioral events and neuronal populations. The work uses recordings from Stringer et al. and focus on identifying scale-free models by fitting the log-log distribution of event sizes. Specifically, the authors take averages of correlated neural sub-populations and compute the scale-free characterization. Importantly, neither the full population average nor random uncorrelated subsets exhibited scaling free dynamics, only correlated subsets. The authors then work to relate the characterization of the neuronal activity to specific behavioral variables by testing the scale-free characteristics as a function of correlation with behavior. To explain their experimental observation, the authors turn to classic e-i network constructions as models of activity that could produce the observed data. The authors hypothesize that a winner-take-all e-i network can reproduce the activity profiles and therefore might be a viable candidate for further study. While well written, I find that there are a significant number of potential issues that should be clarified. Primarily I have main concerns: 1) The data processing seems to have the potential to distort features that may be important for this analysis (including missed detections and dynamic range), 2) The analysis jumps right to e-i network interactions, while there seems to be a much simpler, and more general explanation that seems like it could describe their observations (which has to do with the way they are averaging neurons), and 3) that the relationship between the neural and behavioral data could be further clarified by accounting for the lop-sidedness of the data statistics. I have included more details below about my concerns below.

      Main points:

      1) Limits of calcium imaging: There is a large uncertainty that is not accounted for in dealing with smaller events. In particular there are a number of studies now, both using paired electro-physiology and imaging [R1] and biophysical simulations [R2] that show that for small neural events are often not visible in the calcium signal. Moreover, this problem may be exacerbated by the fact that the imaging is at 3Hz, much lower than the more typical 10-30Hz imaging speeds. The effects of this missing data should be accounted for as could be a potential source of large errors in estimating the neural activity distributions.

      R3a: We appreciate the concern here and agree that event size statistics could in principle be biased in some systematic way due to missed spikes due to deconvolution of Ca signals. To directly test this possibility, we performed a new analysis of spike data recorded with high time resolution electrophysiology. We began with forward-modeling process to create a low-time-resolution, Ca-like signal, using the same deconvolution algorithm (OASIS) that was used to generate the data we analyzed in our work here. In agreement with the reviewer’s concern, we found that spikes were sometimes missed, but the loss was not extreme and did not impact the neural event size statistics in a significant way compared to the ground truth we obtained directly from the original spike data (with no loss of spikes). This new work is now described in a new paragraph at the end of the subsection of results related to Fig 3 and in a new Fig S3. The new paragraph says…

      Two concerns with the data analyzed here are that it was sampled at a slow time scale (3 Hz frame rate) and that the deconvolution methods used to obtain the data here from the raw GCAMP6s Ca imaging signals are likely to miss some activity (Huang et al., 2021). Since our analysis of neural events hinges on summing up activity across neurons, could it be that the missed activity creates systematic biases in our observed event size statistics? To address this question, we analyzed some time-resolved spike data (Neuropixel recording from Stringer et al 2019). Starting from the spike data, we created a slow signal, similar to that we analyzed here by convolving with a Ca-transient, down sampling, deconvolving, and z-scoring (Fig S3). We compared neural event size distributions to “ground truth” based on the original spike data (with no loss of spikes) and found that the neural event size distributions were very similar, with the same exponent and same power-law range (Fig S3). Thus, we conclude that our reported neural event size distributions are reliable.

      However, although loss of spikes did not impact the event size distributions much, the time-scale of measurement did matter. As discussed above and shown in Fig S4, changing from 5 ms time resolution to 330 ms time resolution does change the exponent and the range of the power law. However, in the test data set we worked with, the existence of a power law was robust across time scales.

      2) Correlations and power-laws in subsets. I have a number of concerns with how neurons are selected and partitioned to achieve scale-free dynamics. 2a) First, it's unclear why the averaging is required in the first place. This operation projects the entire population down in an incredibly lossy way and removes much of the complexity of the population activity.

      R3b: Our population averaging approach is motivated by theoretical predictions and previous work. According to established theoretical accounts of scale-free population events (i.e. non-equilibrium critical phenomena in neural systems) such population-summed event sizes should have power law statistics if the system is near a critical point. This approach has been used in many previous studies of scale-free neural activity (e.g. all of those cited in the introduction in relation to scale-free neuronal avalanches). One of the main results of our study is that the existing theories and models of critical dynamics in neural systems fail to account for small subsets of neurons with scale-free activity amid a larger population that does not conform to these statistics. We could not make this conclusion if we did not test the predictions of those existing theories and models.

      2b) Second, the authors state that it is highly curious that subsets of the population exhibit power laws while the entire population does not. While the discussion and hypothesizing about different e-i interactions is interesting I believe that there's a discussion to be had on a much more basic level of whether there are topology independent explanations, such as basic distributions of correlations between neurons that can explain the subnetwork averaging. Specifically, if the correlation to any given neuron falls off, e.g., with an exponential falloff (i.e., a Gaussian Process type covariance between neurons), it seems that similar effects should hold. This type of effect can be easily tested by generating null distributions using code bases such as [R3]. I believe that this is an important point, since local (broadly defined) correlations of neurons implying the observed subnetwork behavior means that many mechanisms that have local correlations but don't cluster in any meaningful way could also be responsible for the local averaging effect.

      R3c: We appreciate the reviewer’s effort, trying out some code to generate a statistical model. We agree that we could create such a statistical model that describes the observed distribution of pairwise correlations among neurons. For instance, it would be trivial to directly measure the covariance matrix, mean activities, and autocorrelations of the experimental data, which would, of course, provide a very good statistical description of the data. It would also be simple to generate more approximate statistical descriptions of the data, using multivariate gaussians, similar to the code suggested by the reviewer. However, we emphasize, this would not meet the goal of our modeling effort, which is mechanistic, not statistical. The aim of our model was to identify a possible biophysical mechanism from which emerge certain observed statistical features of the data. We feel that a statistical model is not a suitable strategy to meet this aim. Nonetheless, we agree with the reviewer that clusters with sharp boundaries (like the distinction between e+ an e- in our model) are not necessary to reproduce the cancelation of anticorrelated neurons. In other words, we agree that sharp boundaries of the e+ and e- groups of our model are not crucial ingredients to match our observations.

      2c) In general, the discussion of "two networks" seems like it relies on the correlation plot of Figure~7B. The decay away from the peak correlation is sharp, but there does not seem to be significant clustering in the anti-correlation population, instead a very slow decay away from zero. The authors do not show evidence of clustering in the neurons, nor any biophysical reason why e and i neurons are present in the imaging data.

      R3d: First a small reminder: As stated in the paper, the data here is only showing activity of excitatory neurons. Inhibitory neurons are certainly present in V1, but they are not recorded in this data set. Thus we interpret our e+ and e- groups as two subsets of anticorrelated excitatory neurons, like those we observed in the experimental data. We agree that our simplified model treats the anticorrelated subsets as if they are clustered, but this clustering is certainly not required for any of the data analyses of experimental data. We expect that our model could be improved to allow for a less sharp boundary between e+ and e- groups, but we leave that for future work, because it is not essential to most of the results in the paper. This limitation of the model is now stated clearly in the last paragraph of the model subsection.

      The alternative explanation (as mentioned in (b)) is that the there is a more continuous set of correlations among the neurons with the same result. In fact I tested this myself using [R3] to generate some data with the desired statistics, and the distribution of events seems to also describe this same observation. Obviously, the full test would need to use the same event identification code, and so I believe that it is quite important that the authors consider the much more generic explanation for the sub-network averaging effect.

      R3e: As discussed above, we respectfully disagree that a statistical model is an acceptable replacement for a mechanistic model, since we are seeking to understand possible biophysical mechanisms. A statistical model is agnostic about mechanisms. We have nothing against statistical models, but in this case, they would not serve our goals.

      To emphasize our point about the inadequacy of a statistical model for our goals, consider the following argument. Imagine we directly computed the mean activities, covariance matrix, and autocorrelations of all 10000 neurons from the real data. Then, we would have in hand an excellent statistical model of the data. We could then create a surrogate data set by drawing random numbers from a multivariate gaussian with same statistical description (e.g. using code like that offered by reviewer 3). This would, by construction, result in the same numbers of correlated and anticorrelated surrogate neurons. But what would this tell us about the biophysical mechanisms that might underlie these observations? Nothing, in our opinion.

      2d) Another important aspect here is how single neurons behave. I didn't catch if single neurons were stated to exhibit a power law. If they do, then that would help in that there are different limiting behaviors to the averaging that pass through the observed stated numbers. If not, then there is an additional oddity that one must average neurons at all to obtain a power law.

      R3f: We understand that our approach may seem odd from the point of view of central-limit-theorem-type argument. However, as mentioned above (reply R3b) and in our paper, there is a well-established history of theory and corresponding experimental tests for power-law distributed population events in neural systems near criticality. The prediction from theory is that the population summed activity will have power-law distributed events or fluctuations. That is the prediction that motivates our approach. In these theories, it is certainly not necessary that individual neurons have power-law fluctuations on their own. In most previous theories, it is necessary to consider the collective activity of many neurons before the power-law statistics become apparent, because each individual neurons contributes only a small part to the emergent, collective fluctuations. This phenomenon does not require that each individual neuron have power-law fluctuations.

      At the risk of being pedantic, we feel obliged to point out that one cannot understand the peculiar scale-free statistics that occur at criticality by considering the behavior of individual elements of the system; hence the notion that critical phenomena are “emergent”. This important fact is not trivial and is, for example, why there was a Nobel prize awarded in physics for developing theoretical understanding of critical phenomena.

      3) There is something that seems off about the range of \beta values inferred with the ranges of \tau and $\alpha$. With \tau in [0.9,1.1], then the denominator 1-\tau is in [-0.1, 0.1], which the authors state means that \beta (found to be in [2,2.4]) is not near \beta_{crackling} = (\alpha-1)/(1-\tau). It seems as this is the opposite, as the possible values of the \beta_{crackling} is huge due to the denominator, and so \beta is in the range of possible \beta_{crackling} almost vacuously. Was this statement just poorly worded?

      R3g: The point here is that theory of crackling noise predicts that the fit value of beta should be equal to (1-alpha)/(1-tau). In other words, a confirmation of the theory would have all the points on the unity line in the rightmost panels of Fig9D and 9E, not scattered by more than an order of magnitude around the unity line. (We now state this explicitly in the text where Fig 9 is discussed.) Broad scatter around the unity line means the theory prediction did not hold. This is well established in previous studies of scale-free brain dynamics and crackling noise theory (see for example Ma et al Neuron 2019, Shew et al Nature Physics 2015, Friedman et al PRL 2012). A clearer single example of the failure of the theory to predict beta is shown in Fig 5A,B, and C.

      4) Connection between brain and behavior:

      4a) It is not clear if there is more to what the authors are trying to say with the specifics of the scale free fits for behavior. From what I can see those results are used to motivate the neural studies, but aside from that the details of those ranges don't seem to come up again.

      R3h: The reviewer is correct, the primary point in Fig 2 is that scale-free behavioral statistics often exist. Beyond this point about existence, reporting of the specific exponents and ranges is just standard practice for this kind of analysis; a natural question to ask after claiming that we find scale behavior is “what are the exponents and ranges”. We would be remiss not to report those numbers.

      4b) Given that the primary connection between neuronal and behavioral activity seems to be Figure~4. The distribution of points in these plots seem to be very lopsided, in that some plots have large ranges of few-to-no data points. It would be very helpful to get a sense of the distribution of points which are a bit hard to see given the overlapping points and super-imposed lines.

      R3i: We agree that this whitespace in the figure panels is a somewhat awkward, but we chose to keep the horizontal axis the same for all panels of Fig 4B, because this shows that not all behaviors, and not all animals had the same range of behavioral correlations. We felt that hiding this was a bit misleading, so we kept the white space.

      4c) Neural activity correlated with some behavior variables can sometimes be the most active subset of neurons. This could potentially skew the maximum sizes of events and give behaviorally correlated subsets an unfair advantage in terms of the scale-free range.

    1. Author Response

      Reviewer #1 (Public Review):

      In this study, Scalabrino et al. show persistent cone-mediated RGC signaling despite changes in cone morphology and density with rod degeneration in CNGB1 mouse model of retinitis pigmentosa. The authors use a linear-nonlinear receptive field model to measure functional changes (spatial and temporal filters and gain) across the RGC populations with space-time separable receptive fields. At mesopic and photopic conditions, receptive field changes were minor until rod death exceeded 50%; while response gain decreased with photoreceptor degeneration. Using information theory, the authors evaluated the fidelity of RGC signaling demonstrated that mutual information decreased with rod loss, but cone-mediated RGC signaling was relatively stable and was more robust for natural movies than artificial stimulus. This work reveals the preservation of cone function and a robustness in encoding natural movies across degeneration. This manuscript is the first demonstration of using information theory to evaluate the effects of neural degeneration on sensory coding. The study uses a systematic evaluation of rod and cone function in this model of rod degeneration to make the following findings: (1) cone function persists for 5-7 months, (2) spatial and temporal changes to the ganglion cell receptive fields were not monotonic with time, (3) mutual information between spikes and photopic stimuli remained relatively constant up to 3-5 months, and (4) information rates were higher for natural movies than for checkerboard noise stimuli.

      The strengths of this paper include the following:

      A systemic evaluation of potentially confusing data. The authors do an excellent job of organizing the results in terms of light levels and time points. The results themselves are confusing and difficult to draw across metrics, but the data are presented as clearly as possible. The work is especially well executed and presented.

      The insight that cone responses remain relatively stable despite rod loss. The study clearly demonstrates that despite cone loss and morphological changes, cone-mediated responses remain robust and functional.

      The application of information theory to degeneration is the first of its kind and the study clearly shows the utility of the metric.

      The results are thoughtfully interpreted.

      We thank the reviewer for these comments.

      The weaknesses of this study include the following:

      The inability to follow the same ganglion cell types over time is a major weakness that could confound the interpretation in terms of whether the changes are happening from artifacts of the recording method or from dynamic changes in the pooled population of ganglion cells. Is there even a single cell class, for example the ON-OFF direction-selective ganglion cells, that this group has so well quantified on the MEA, that the study could track over time, in addition to examining the pooled population changes over time? Tracking a single cell type for each of the metrics would make the population data more convincing or could clearly show that not all ganglion cells follow the population trend.

      As suggested by the reviewer, we have added a cell type that is tracked through all the analyses: ON brisk sustained RGCs. Example receptive field mosaics, temporal receptive fields, and spike train autocorrelation functions for WT and 4M Cngb1neo/neo animals are shown in Figure 2-figure supplement 1E-F. These RGCs follow the trends displayed by the larger populations of RGCs in each analysis. We chose this cell type because they are readily identified by their spike train autocorrelation functions compared to other RGC types and they have approximately space-time separable receptive fields (RFs). There are many text changes associated with adding an analysis of the ON Brisk sustained RGCs (see lines 202-207; 227-229; 264-267, etc).

      We chose not to focus on direction selective RGCs because we are analyzing the spatial and temporal RFs of RGCs in Figures 3-5 and direction-selective RGCs do not have space-time separable RFs (see example in Figure 2C-D). Thus, those cells could not be used to track those receptive field properties across degeneration. Also, we did not collect responses to drifting gratings or bar responses across a range of speeds or contrasts, so we are unable to reliably distinguish the different types of direction-selective RGCs (e.g., ON vs ON-OFF) from these data.

      While the non-monotonic changes are interesting, they are also difficult to make sense of. Can the authors speculate in the Discussion what could be underlying mechanisms that give rise to non-monotonic changes. In the absence of potential mechanisms, the concern of recording artifacts arises.

      Thank you for raising this point. We have added some speculation for the cause of these non-monotonic changes in the Discussion (lines 455-462). “While we do not know why non-monotonic changes are occurring for some RF properties, they largely occurred in the 3-5M range. During this time, there is a transient decrease in the rate of rod death (4-5M) and cone death begins (Figure 1). Consequently, there may be complex changes to retinal circuitry as the retina reacts to a temporary stabilization in rod numbers and an acceleration in cone death. Intracellular studies of the light-driven synaptic currents impinging onto bipolar cells and RGCs during this time will be important for understanding the origin of these non-monotonic changes in RF properties.”

      The mutual information calculation seems to be correlated with the spike rate despite the argument made in Fig 10E-F. Can the authors show this directly by calculating the bits per spike in Figures 8 and 9? Of all the metrics, the gain function and the mutual information seem to be more consistent with each other. Can the authors demonstrate or refute a connection between the spike rate and information rates?

      We added a supplementary figure to each of the information figures (see for Figures 8-10 figure supplement 1) showing the trends hold after dividing the information rate by the spike rate. Certainly, changing spike rates are contributing, but there are also clear changes in the bits/spike plots (Figure 8-figure supplement 1D; Figure 9-figure supplement 1D, Figure 10-figure supplement 1D).

      Can the authors provide an explanation for why the mutual information calculation remains stable despite lower SNR and lower gain, especially after the contributions of oscillations have been ruled out?

      The mutual information depends more strongly on the precision of spiking (both in terms of time and spike number within a small time bin) than the mean spike rate (averaged over the stimulus). Diminishing the total number of spikes (because of reduced gain) will have a relatively small effect on the information rate if the spike trains continue to exhibit low variability (high precision). Indeed, spike generation by RGCs is distinctly sub-Poisson (Berry, Warland, and Meister 1997), indicating it can exhibit relatively high information rates even when spike rates are relatively low. We clarified this in Results at lines 493-496.

      Lack of age-matched WT controls to accompany the different time points. It is known that photoreceptor degeneration can occur naturally in WT mice. Though the authors have used controls pooled from across the ages used in the CNGB1 mutants, it would be informative to know if there are age-dependent changes in any of the metrics for WT mice.

      WT recordings were pooled from retinas from littermate control mice between 2 and 7 months of age (n=3 2M, n=1 each 4M, 6M, 7M). We have added data points from individual retinal recordings to the figure supplements for Figure 2-6 and 8-10 to illustrate the consistency between these recordings, which allowed us to confidently pool the results.

      Can the authors elaborate on why cone function persists despite the rod loss and morphological changes? This is unique for other models of rod loss and is worth extra discussion.

      This is something we are also very interested in, but outside the scope of this study. The Sampath Lab (co-author and collaborator) has data from single cell recordings in late stage rd10 retinas that show abnormal cone signaling (and structure similar to the 7M Cngb1neo/neo cones), yet relatively normal cone bipolar cell and horizontal cell responses. Thus, somehow there is either compensation or a high level of redundancy in the transmission of signals from cones to 2nd-order neurons that makes the responses of the 2nd-order neurons robust to deteriorating cone function. These results suggest our observations in Cngb1neo/neo mice are not unique to this model of RP. Future experiments are needed to understand how this compensation is occurring.

      Reviewer #2 (Public Review):

      In this study, the authors assess the decline of retinal function in a mouse model of slow photoreceptor degeneration - the Cngb1neo/neo. Rod loss occurs between 1-7 months and complete cone loss occurs by 8-9 months. The authors characterize cone loss in the first 7 months and find that 70% of cones are still there at 7 months, though their outer segments are highly degraded. They then use MEA recordings to characterize retinal function using a variety of measures. First, they use spike-triggered averaging to determine the spatial and temporal receptive fields, restricting this analysis to RGCs that have separable spatial and temporal receptive fields. They find that both rod and cone receptive fields are surprisingly intact over the first 5 months, identifying primarily a reduction in contrast response functions (and a reduction in the number of rods that are light responsive-though this is not quantified). Second, they show that oscillatory activity does not appear until after photoreceptors are completely deteriorated-in sharp contrast to other PR degeneration models (e.g. rd10) in which oscillatory activity appears while there are still light-evoked responses. Third, they use information theory to assess the reliability of signaling. When examining the 10% of RGCs with the highest information rates they see a significant decrease at mesoscopic light levels, while information rates were mostly stable at photopic light levels. Finally, they showed that at photopic light levels, the mutant retinas conveyed more information about natural movies than a repeating checkerboard, and this was maintained across light levels.

      My primary question is whether this represents a significant advance. There have been many studies regarding the changing retinal circuits in various rodent models of photoreceptor degeneration. The authors make a few arguments regarding the uniqueness of this study.

      One is that this is a novel analysis that is not limited to particular cell types but rather characterized the retinal as a "whole". But in this point is also its weakness. First, one cannot speak to the retinal as a "whole" since they state that there is a reduction in the number of light-responsive cells across degeneration - yet they do not quantify it. This seems incredibly important to know because even presuming the remaining cells have perfect receptive field structure if only 10% of cells are left, assessing the receptive fields of only the remaining cells is clearly not a characterization of the retention of visual function.

      We never claim that we have assessed the “retina as a whole”. We do state that we are measuring certain features of RGC signaling that reflect the “net changes” induced by photoreceptor degeneration (e.g., changes in photoreceptor function, retinal rewiring, homeostatic mechanisms, etc.) on those features. In fact, we are explicit that we are only measuring certain RF properties in certain RGC types, such as the linear spatial and temporal RFs in cells with space-time separable RFs: Figure 2 makes this point explicitly. We do not measure changes in direction-selectivity, object motion sensitivity, orientation selectivity, edge detection, looming detection, luminance encoding, chromatic opponency, contrast adaptation, motion reversal signaling, etc., because doing so would produce a manuscript with at least one figure for every RGC type (e.g., 45 figures). This would clearly be an unreasonable amount for a single study.

      We agree with the Reviewer that explicitly quantifying the number of light responsive RGCs is important, and we now include this information as a function of degeneration time point in Figure 2-figure supplement 1. Under photopic conditions, this fraction is quite stable until 5M and then begins to deteriorate. We also observe a decrease in the number of RGCs with space-time separable RFs at 5M (Figure 2F), suggesting (but not proving) that these RGCs are representative of changes across all RGCs. We also described these results in the Results (lines 167-174).

      Second, it is hard to assess whether this mouse model is better than existing models for human disease. Their phenotype is different than the rat model of this same disease. It also shows a lack of oscillatory activity that is apparent in rd models.

      We are not making the claim that this model is better than other models. Each model has value. However, because the degeneration in this model is relatively slow, it may be more representative of changes that occur in slower forms of human retinal degeneration (emphasis on “may be”). This is a discussion point, not something that we are aiming to prove. We also believe the utility of a model depends on the questions being asked. In this case, we aimed to track changes over time during photoreceptor loss to better understand the extent to which retinal output is impaired.

      Also, retinitis pigmentosa is a heterogenous disease with a spectrum of phenotypes that may or may not be genotype specific. A patient with a PDE6B mutation presents with differing phenotypes than a patient with CNGB1 mutation, despite both having an RP diagnosis. It is fallacy to assume a mouse is the exact same as a human, just as it is incorrect to assume clinical presentations are identical for all patients for one broad disease that is known to have a diverse set of underlying causes. Studying a range of models is thus essential to understanding the disease. Given that mutations causing RP have different impacts on retinal signaling, we believe it is important to contextualize findings to their mutation. We make this point in Discussion: Comparison to previous studies of RGC signaling in retinitis pigmentosa (beginning on line 436).

      Finally, the model we study does not lack oscillatory activity, it simply arises later than in rd1 or rd10 mice and does so only after all the photoreceptors have died (Figure 7). To our knowledge, it is not clear when or even if RGCs exhibit oscillations in human patients with RP. We discuss why oscillation might arise at different time points in different genetic models of RP in lines 555-570.

      Reviewer #3 (Public Review):

      In the manuscript by Scalabrino et al. a rigorous characterization of the functionality of retinal ganglion cells in a mouse model of rod photoreceptor degeneration is presented. The authors analyzed the degeneration of cone photoreceptors, which is known to be linked to rod degeneration. Based on the time course of cone degeneration they investigated the functional properties of retinal ganglion cells aged between 1 month and seven months.

      The most interesting finding is robust preservation of functional properties, as reflected in little changes of the receptive fields (spatial and temporal characteristics) or signaling fidelity/information rate. In contrast to other mouse models, the present one shows no oscillatory activity until a complete loss of cone photoreceptors occurred at an age of nine months.

      Although the receptive fields of retinal ganglion cells remain nearly intact, the number of ganglion cells with identifiable receptive fields decreases significantly with age (Fig.2F). Could the authors comment, if this might imply a "patchy" vision?

      Visual field loss is a predominant clinical observation in patients with retinitis pigmentosa, including those with Cngb1 mutations. We connect to this observation in the Discussion at lines 521-529: “At the latest stages of photoreceptor degeneration in the Cngb1neo/neo mice (5-7M), we did observe a decrease in the fraction of RGCs with spike rates that were strongly modulated by checkerboard noise (Supplemental Figure 2). It is possible these RGCs were losing their light response completely, or that changes in their light response properties made them relatively unresponsive to checkerboard noise. If the former, it is possible that light responsive RGCs are becoming sparser at the later stages of degeneration which may result in inhomogeneous, or “patchy”, visual sensitivity described by RP patients (see reviews by Hull et al., 2017; Nassisi et al., 2021).”

      Reviewer #4 (Public Review):

      Scalabrino et al. report the remarkable persistence of cone-driven retinal ganglion cell responses in a mouse model of retinitis pigmentosa (i.e., Cngb1 KO mice). The authors first map the time course of primary rod and secondary cone degeneration in Cngb1 KO mice. Approximately 30% of rods are gone at one month (1M), and all rods are lost by 7M in Cngb1 KO retinas. The cone morphology changes progressively as rods degenerate, cone outer segments shrink and are largely absent by 5M. Cones die between 8-9M. Scalabrino et al. next perform multielectrode array recordings from wild-type and Cngb1 KO retinas from 1M to 5M in mesopic and photopic stimulus conditions. They find that spatiotemporal receptive fields remain relatively stable in the face of photoreceptor degeneration, whereas contrast gain gradually decreases. Oscillatory spontaneous ganglion cell activity emerges late (~9M) in Cngb1 KO mice compared to other retinal degeneration models. Finally, the authors analyze mutual information between stimuli (white noise and naturalistic movies) and ganglion cell spikes trains and find that the encoding of the most informative ganglion cells is preserved relatively late into photoreceptor degeneration and that information rates decline less in photopic vs. mesopic conditions and for naturalistic movies vs. white noise stimuli.

      Overall, this is an exciting study that shows remarkable preservation of cone-driven ganglion cell light responses in advanced stages of a retinitis pigmentosa model when most rods have died, and cone morphologies are dramatically altered. The results are presented clearly in the text and figures and are scholarly discussed. Nonetheless, the authors should address a few specific comments to clarify and better support some of the conclusions they draw.

      Specific comments:

      1) In describing the results on information encoding, the authors write and show data (panels A of Figures 8-10) that suggest that most ganglion cells, even in recordings from wild-type retinas, respond unreliably to white noise stimuli and naturalistic movies. Why does such a large fraction of cells have such low repeat reliability? Does this reflect unreliable spike detection and sorting, poor cell or tissue health, or true variability in the responses of healthy retinal ganglion cells. The latter does not seem to align with results from patch-clamp recordings targeted to specific ganglion cell types. The limited repeat reliability also raises questions about how well the linear-nonlinear model, which the authors use to compare responses between wild-type and Cngb1 KO mice of different ages, predicts the responses of these cells. Comparing model parameters (receptive field size, temporal filtering, and contrast sensitivity) between genotypes and ages only makes sense if the model is a good description in the acquired datasets.

      We agree with the reviewer that this is an important point to be clear about. In Figures 8-10 some RGCs exhibit high repeatability, some exhibit low repeatability as quantified by their information rates. The reviewer is concerned about those cells with low repeatability and the ability of capturing their responses with an LN model. This is a valid concern, but to be clear, we are not fitting an LN model to cells with low information rates. In Figures 3-6, where an LN model is being used to estimate the spatial and temporal components of the RFs, we are fitting a subset of all the RGCs: those with space-time separable RFs (see Figure 2). Those particular cells exhibit high information rates and highly reproducible responses, and an LN model captures ~60% of the explainable variance in the spike rate (see Figure 2-figure supplement 1A-B; also see lines 157-151). This is typical for LN models that approximately predict the responses of RGCs to checkerboard noise. Thus, we think the LN model reasonably captures the responses of cells for which we use the LN model. The information rate estimates include these cells as well as other cells that are not well described by an LN model. Note, the LN model is not used to calculate the mutual information rates. We have added text in the Results (lines 324-327) to clarify this.

      In addition, the information rates we estimated in mouse are consistent with past studies from guinea pig (Koch et al, 2004 and Koch et al, 2006). We think cells with very low repeatability are not well driven by checkerboard noise or the particular 10s natural movies we showed. We have updated the example neurons to better reflect the reliability of the cells near the median of the MI distributions in Figures 8-10.

      2) The authors should, maybe in figure supplements and parts of the main figures, break results down by recordings. Inter-experimental variability has been well documented (e.g., Shah et al. Neuron 2022, Zhao et al Sci Rep 2020), and it would be reassuring to see that the conclusions drawn by the authors are supported by statistics in which n = number of recordings (e.g., there is a somewhat difficult to explain broadening of temporal filters in 4M Cngb1 KO retinas that recover by 5M).

      We agree that inter-experiment variability can be large and is important to control for. We now show all the analyses broken down by experiment in Supplemental Figures (2, 3, 4, 5, 6, 8, 9, and 10) for each analysis. None of the trends we describe or highlight in the manuscript were driven by inter-experiment variability.

      3) At different points in their manuscript, the authors conclude that their results "suggest that homeostatic mechanisms in the retina serve to compensate for deteriorating photoreceptors" (or similar). I think that this may well be the case. However, in its present form, the study provides no evidence that retinal circuits in Cngb1 KO mice change to preserve function compared to the alternative that the observed stability is evidence for functional redundancy or resilience in retinal circuits (as they are) without the need for adjustments. Distinguishing between these alternatives would be conceptually important. For example, Care et al. Cell Rep 2019 and Care et al. Cell Rep 2020 used partial stimulation to activate fewer photoreceptors and compare light responses in downstream neurons to those in retinas with fewer photoreceptors. Other studies have directly observed changes in circuit wiring in models of retinal degeneration. If the authors cannot provide experimental evidence for homeostatic changes, it would be good to reflect this in the interpretation and discussion.

      The reviewer raises a terrific point and potential alternative interpretation. We agree. We have not been able to identify an equivalent analysis to that in Care et al. 2019 that we can run that will cleanly distinguish between these two possibilities, without doing many more experiments across timepoints of degeneration. We have thus rewritten portions of the Introduction and the Discussion to recognize the potential of this alternative interpretation.

      Introduction (lines 39-44): Alternatively, homeostatic plasticity or redundancy in retinal circuitry may compensate for photoreceptor loss (Care et al., 2020; Lee et al., 2021; Shen et al., 2020). Such mechanisms could facilitate reliable signaling at the level of retinal output, despite deterioration in photoreceptor function. Identifying the extent to which changes in photoreceptor morphology impact retinal output will inform treatment timepoints for gene therapies aimed at halting rod loss to preserve cone-mediated vision.

      Discussion (lines 514-520): There are two potential classes of mechanisms for this compensation. First, homeostatic plasticity has been documented in models of photoreceptor loss in which the retina remodels to preserve signal transmission (Care et al., 2019; Keck et al., 2013, 2011, 2008; Leinonen et al., 2020; Shen et al., 2020). Alternatively, functional redundancy within the circuit could explain how robust retinal signaling is retained longer than the changes in cone morphology would suggest (Care et al., 2020). This study did not distinguish between the two compensation models.

      4) The authors do not attempt to classify retinal ganglion cells into functional types as functional changes from degeneration may confound such classifications. However, it would be beneficial to separate some categorical response types (direction-selective ON-OFF and ON ganglion cells, maybe orientation-selective [horizontal, vertical, ON, OFF] ganglion cells) and compare how their responsiveness, reliability, and information encoding change with degeneration. This would provide additional insights and address concerns that changes caused by degeneration may be obscured by the differences between ganglion cell types in the present analysis.

      We agree. We now track ON brisk sustained RGCs across degeneration time points for the RF analyses and mutual information analyses. These RGCs are likely the ON sustained alpha cells because they generate very large spikes on the MEA as would be expected for cells with large somata. Example receptive field mosaics, temporal receptive fields, and spike train autocorrelation functions for WT and 4M Cngb1neo/neo animals are shown in Figure 2-figure supplement 1E-F. These RGCs follow the trends displayed by the larger populations of RGCs in each analysis. We chose this cell type because they are readily identified by their spike train autocorrelation functions compared to other RGC types and they have approximately space-time separable receptive fields (RFs). There are many text changes associated with adding an analysis of the ON Brisk sustained RGCs (see lines 202-207; 227-229; 264-267, etc).

      We chose not to focus on direction selective RGCs because we are analyzing the spatial and temporal RFs of RGCs in Figures 3-5 and direction-selective RGCs do not have space-time separable RFs (see example in Figure 2C-D). Thus, those cells could not be used to track those receptive field properties across degeneration. Also, we did not collect responses to drifting gratings or bar responses across a range of speeds or contrasts, so we are unable to reliably distinguish the different types of direction-selective RGCs (e.g., ON vs ON-OFF) from these data.

    1. Historical Hypermedia: An Alternative History of the Semantic Web and Web 2.0 and Implications for e-Research. .mp3. Berkeley School of Information Regents’ Lecture. UC Berkeley School of Information, 2010. https://archive.org/details/podcast_uc-berkeley-school-informat_historical-hypermedia-an-alte_1000088371512. archive.org.

      https://www.ischool.berkeley.edu/events/2010/historical-hypermedia-alternative-history-semantic-web-and-web-20-and-implications-e.

      https://www.ischool.berkeley.edu/sites/default/files/audio/2010-10-20-vandenheuvel_0.mp3

      headshot of Charles van den Heuvel

      Interface as Thing - book on Paul Otlet (not released, though he said he was working on it)

      • W. Boyd Rayward 1994 expert on Otlet
      • Otlet on annotation, visualization, of text
      • TBL married internet and hypertext (ideas have sex)
      • V. Bush As We May Think - crosslinks between microfilms, not in a computer context
      • Ted Nelson 1965, hypermedia

      t=540

      • Michael Buckland book about machine developed by Emanuel Goldberg antecedent to memex
      • Emanuel Goldberg and His Knowledge Machine: Information, Invention, and Political Forces (New Directions in Information Management) by Michael Buckland (Libraries Unlimited, (March 31, 2006)
      • Otlet and Goldsmith were precursors as well

      four figures in his research: - Patrick Gattis - biologist, architect, diagrams of knowledge, metaphorical use of architecture; classification - Paul Otlet, Brussels born - Wilhelm Ostwalt - nobel prize in chemistry - Otto Neurath, philosophher, designer of isotype

      Paul Otlet

      Otlet was interested in both the physical as well as the intangible aspects of the Mundaneum including as an idea, an institution, method, body of work, building, and as a network.<br /> (#t=1020)

      Early iPhone diagram?!?

      (roughly) armchair to do the things in the web of life (Nelson quote) (get full quote and source for use) (circa 19:30)

      compares Otlet to TBL


      Michael Buckland 1991 <s>internet of things</s> coinage - did I hear this correctly? https://en.wikipedia.org/wiki/Internet_of_things lists different coinages

      Turns out it was "information as thing"<br /> See: https://hypothes.is/a/kXIjaBaOEe2MEi8Fav6QsA


      sugane brierre and otlet<br /> "everything can be in a document"<br /> importance of evidence


      The idea of evidence implies a passiveness. For evidence to be useful then, one has to actively do something with it, use it for comparison or analysis with other facts, knowledge, or evidence for it to become useful.


      transformation of sound into writing<br /> movement of pieces at will to create a new combination of facts - combinatorial creativity idea here. (circa 27:30 and again at 29:00)<br /> not just efficiency but improvement and purification of humanity

      put things on system cards and put them into new orders<br /> breaking things down into smaller pieces, whether books or index cards....

      Otlet doesn't use the word interfaces, but makes these with language and annotations that existed at the time. (32:00)

      Otlet created diagrams and images to expand his ideas

      Otlet used octagonal index cards to create extra edges to connect them together by topic. This created more complex trees of knowledge beyond the four sides of standard index cards. (diagram referenced, but not contained in the lecture)

      Otlet is interested in the "materialization of knowledge": how to transfer idea into an object. (How does this related to mnemonic devices for daily use? How does it relate to broader material culture?)

      Otlet inspired by work of Herbert Spencer

      space an time are forms of thought, I hold myself that they are forms of things. (get full quote and source) from spencer influence of Plato's forms here?

      Otlet visualization of information (38:20)

      S. R. Ranganathan may have had these ideas about visualization too

      atomization of knowledge; atomist approach 19th century examples:S. R. Ranganathan, Wilson, Otlet, Richardson, (atomic notes are NOT new either...) (39:40)

      Otlet creates interfaces to the world - time with cyclic representation - space - moving cube along time and space axes as well as levels of detail - comparison to Ted Nelson and zoomable screens even though Ted Nelson didn't have screens, but simulated them in paper - globes

      Katie Berner - semantic web; claims that reporting a scholarly result won't be a paper, but a nugget of information that links to other portions of the network of knowledge.<br /> (so not just one's own system, but the global commons system)

      Mention of Open Annotation (Consortium) Collaboration:<br /> - Jane Hunter, University of Australia Brisbane & Queensland<br /> - Tim Cole, University of Urbana Champaign<br /> - Herbert Van de Sompel, Los Alamos National Laboratory annotations of various media<br /> see:<br /> - https://www.researchgate.net/publication/311366469_The_Open_Annotation_Collaboration_A_Data_Model_to_Support_Sharing_and_Interoperability_of_Scholarly_Annotations - http://www.openannotation.org/spec/core/20130205/index.html - http://www.openannotation.org/PhaseIII_Team.html

      trust must be put into the system for it to work

      coloration of the provenance of links goes back to Otlet (~52:00)

      Creativity is the friction of the attention space at the moments when the structural blocks are grinding against one another the hardest. —Randall Collins (1998) The sociology of philosophers. Cambridge, MA: Harvard University Press (p.76)

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to the reviewers

      1. General Statements

      It is the common view of all three reviewers that we have not utilized adequate in vitro/biochemical evidence to support the idea that SATB1 protein undergoes liquid-liquid phase separation. We do agree with the reviewers that our manuscript lacks biochemical evidence to support such notion. Though we find it quite interesting and we would like to suggest for the first time in the field of chromatin organization and function, based upon the action of SATB1, that this protein does exist in at least two polypeptide isoforms (764 and 795 amino acids long) which display different phase separation propensity and therefore confer different actions in regulating the (patho)physiological properties of a murine T cell.

      Every single research group that works on SATB1, considered so far only a single protein isoform, that is, the shorter isoform of 764 amino acids and no tools, such as isoform-specific antibodies have been developed to discriminate the two isoforms and thus being able to assign unique functions to each isoform. We do understand that such a report, suggesting the presence of two protein isoforms, with potentially quite diverse functions, would question (not necessarily by the authors of this manuscript, since no such comment is included in our manuscript) the conclusions drawn in the literature assigning all biochemical properties to a single, short isoform of SATB1. Moreover, all the genetically modified mice that have been analyzed so far (including our group), deleted both Satb1 isoforms. Our future research approaches should, from now on, consider unraveling the isoform-specific functions of SATB1 and their involvement in physiology and disease. This could also deem useful to explain the quite diverse, both positive and negative effects of SATB1 in transcription regulation. Another major objection of the reviewers was that we should provide cumulative supporting evidence for the existence of the long SATB1 isoform, or at least evaluate the specificity of our custom-made antibody.

      Taking under consideration the aforementioned constructive criticism of the three reviewers we would like to perform (most of the suggested experiments have already been performed) additional experiments to support our claims in the manuscript. These experiments are described below as a point-by-point reply to each point raised by the reviewers.

      In line with the aforementioned rationale, we propose the title of our manuscript to change into “Two SATB1 isoforms display different phase separation propensity”, if our manuscript is considered for publication.

      1. Description of the planned revisions

      **Reviewer #1**:

      4) Lack of in vitro reconstitution experiments with purified long and short SATB1

      **PLANNED EXPERIMENT #1**

      We do realize this shortcoming of our work. We have to note that purifying recombinant SATB1 protein is quite a challenging task, yet we 1. cloned both Satb1 cDNAs for the long and short isoforms, 2. we successfully expressed both proteins in great quantity and quality and we are willing to perform these experiments if our work is considered for publication.

      This proposed experiment has also been requested by Reviewers #2 and #3.

      **Reviewer #2**:

      1. Moreover, an important and direct experiment would be to clone the long isoform in a suitable vector and overexpress in the cell line (as done for the canonical isoform in Supp Fig 1a). This would unequivocally show the efficacy of the antibody and thus the following usage of the same for various assays.

      **PLANNED EXPERIMENT #2**

      This is a great suggestion. We have cloned the long and short Satb1 cDNAs in pEGFP-C1 vector. We will transfect these plasmids in NIH 3T3 fibroblasts and we will perform Western blot analysis, utilizing the antibody raised against the extra 31 amino acids long peptide present only in the long SATB1 isoform, for the following samples: 1. NIH-3T3 whole cell protein extracts, 2. protein extracts from NIH 3T3 fibroblasts transiently transfected with the pEGFP-C1 plasmid, 3. protein extracts from NIH 3T3 fibroblasts transiently transfected with the pEGFP-long_Satb1_ plasmid and 4. protein extracts from NIH 3T3 fibroblasts transiently transfected with the pEGFP-short_Satb1_ plasmid.

      This experiment will consist another proof regarding the specificity of the antibody raised against the extra 31 amino acids long peptide present only in the long SATB1 isoform.

      **Minor comments:**

      1. On pg 6, related to Figure 1, the authors mention 'It should also be noted that when investigating the SATB1 protein levels, we have to bear in mind that the antibodies targeting the N-terminus of SATB1 protein cannot discriminate between the short and long isoforms'. The authors reason that their sizes are too close. It is indeed possible, and widely studied in biochemistry to assess various factors on protein migration (such as PTMs). The authors should validate this aspect (as it is important as per their premise) and perform separation based on charge as well and also use a commercial antibody to validate the same.

      (Experiments already performed)

      We have adapted the text so that it does not imply that the two isoforms cannot be separated by size. This part in lines 102-107 then reads: “It should also be noted that when investigating the SATB1 protein levels, we have to bear in mind that the antibodies targeting the N-terminus of SATB1 protein cannot discriminate between the short and long isoforms, thus we can only compare the amount of the long SATB1 isoform to the total SATB1 protein levels in vivo conditions. To overcome this limitation and to specifically validate the presence of the long SATB1 protein isoform in primary murine T cells, we designed a serial immunodepletion-based experiment (Fig. 1e, Supplementary Fig. 1a).”

      Moreover, in the revised version of the manuscript we now provide a number of additional proofs supporting the presence of the long isoform and also the specificity of the long isoform-specific antibody. As evident in the text cited above, in the revised Fig. 1e,f and revised Supplementary Fig. 1a,b; we present two immunodepletion experiments which should alone address the Reviewer’s concerns. Moreover, we added Supplementary Fig. 1c; demonstrating that the long isoform-specific antibody does not detect any protein in cells with conditionally depleted SATB1 (Satb1_fl/fl_Cd4-Cre+), supporting its specificity. The custom-made and publicly available antibodies targeting all SATB1 isoforms were also verified in Supplementary Fig. 1d. Moreover, the long isoform and all isoform antibodies display similar localization in the nucleus (Supplementary Fig. 1e; their co-localization based on super-resolution microscopy is also quantified in Supplementary Fig. 5a).

      In our accompanying revised manuscript Zelenka et al., 2022 (https://doi.org/10.1101/2021.07.09.451769), we will provide yet another piece of evidence, consisting of bacterially expressed short and long SATB1 protein isoforms detected by western blot using either the long isoform-specific or the non-selective all SATB1 isoform antibodies.

      **PLANNED EXPERIMENT #3**

      Although we think that in the revised version of the manuscript, we have provided enough proof about the existence of the long isoform in primary murine thymocytes we would like to try the following approach as suggested by this Reviewer.

      The pI of the two SATB1 isoform is quite similar. The pI of the short SATB1 isoform is 6.09 and for the long SATB1 isoform is 6.18. We will perform 2D PAGE coupled to Western blotting utilizing the antibodies detecting the long and all SATB1 isoforms. Given the fact that both isoforms are post-translationally modified to a various degree, it will be extremely difficult to discriminate between the long and short unmodified versus the long and short post-translationally modified proteins especially in the absence of a specific antibody only for the short isoform.

      **Reviewer #3**

      1. Hexanediol is another assay frequently used in phase-separation studies. However, hexanediol has many deleterious effects on the cell, even at a fraction of the concentration normally used in phase-separation studies. Authors should show controls of cell viability, control proteins that do not phase-separate, etc. See https://www.jbc.org/article/S0021-9258(21)00027-2/fulltext.

      Secondly, hexanediol treatment should cause phase-separated protein aggregates to disperse. It is difficult to determine from the images whether or not the aggregates actually disperse or there is just less protein. In any case, small aggregates remain even after treatment, and this appears different from most other hexanediol experiments reported in the literature where the signals become more dispersed and uniform. This is likely because the samples are fixed.

      One of the main features of using hexanediol in phase-separation is to show that upon washout, LLPS aggregates can reform. Because the cells are fixed, the critical aspect of this assay is not performed. A washout and LLPS recovery would control for cell viability issues described above and would provide the opportunity to show that total SATB1 protein levels did not change, but its distribution did, which is the essence of this assay in the context of LLPS. This review from the Tjian group is very informative and may be a good resource:

      http://genesdev.cshlp.org/content/33/23-24/1619

      In line with our reply to point #1 of this Reviewer (page 26 of this document), we should again emphasize that we utilized the hexanediol treatment in primary murine developing T cells as this is the only way to investigate the properties of SATB1 speckles under physiological conditions. This also explains why some small insoluble structure remains after the hexanediol treatment. Note that under physiological conditions, there is a contribution of several protein variants (such as differential PTMs) out of which some will tend to form more stable structures while others could undergo LLPS. It is not clear how the washout experiment could be applied in the primary cell conditions that include cell fixation as the heterogeneity and big variation among cells would make such data analysis highly unreliable.

      **PLANNED EXPERIMENT #1**

      As we answered to point #4 of Reviewer 1 (page 2), we propose the following experiment. Although the purification of recombinant SATB1 protein is quite a challenging task, yet we 1. cloned both Satb1 cDNAs for the long and short isoforms, 2. we successfully expressed both proteins in great quantity and quality and we are willing to perform in vitro reconstitution experiments if our work is considered for publication.

      1. The major difference between the long and short isoform of SATB1 is the 31aa segment within the IDR. However the authors find that neither the long or short isoform SATB1 forms LLPS aggregates, and the IDR alone forms aggregates in the cytoplasm (Fig5) but they do not respond to Cry2 light activation. When forced to localize to the nucleus, it does not form aggregates as well (Fig6). The short isoform also did not form any aggregates. These results seem to argue against any isoform specific phase-separation. This experiment seems critical for the story, yet it does not support their overall conclusions. The authors might consider using a different cell line or perhaps do an in vitro assay using purified protein.

      I am not certain what to make of the cytoplasmic aggregation, which appears to not form upon localization to the nucleus. Because of this, it is difficult to place weight on the significance of the S635A mutation and the role that a phosphorylation of SATB1 contributes to phase-separation, let alone function There are many additional points of concern, but the ones listed above are perhaps the most significant in terms of the overall conclusions of the paper.

      In Fig. 5c we show that the full length long SATB1 isoform often aggregates unlike the short isoform. These data are accompanied with the results for the IDR region, where the situation is even more obvious (Fig. 5f,g). However, in the latter, we have to bear in mind the absence of the multivalent N-terminal part of the protein which seems to be essential for the overall phase behavior of the protein as indicated in Fig. 4b,c.

      **PLANNED EXPERIMENT #1**

      To further support LLPS of SATB1, we are considering performing the following in vitro experiment, as we answered to point #4 of Reviewer 1 (page 2). Although the purification of recombinant SATB1 protein is quite a challenging task, yet we 1. cloned both Satb1 cDNAs for the long and short isoforms, 2. we successfully expressed both proteins in great quantity and quality and we are willing to perform in vitro reconstitution experiments if our work is considered for publication.

      1. Description of the revisions that have already been incorporated in the transferred manuscript

      **Reviewer #1 (Evidence, reproducibility and clarity)**:

      This paper looks at an important nuclear matrix protein SATB1, which is a well known global chromatin organizer and help chromatin loop attach to the nuclear matrix. The paper starts with identification of novel short and long form of SATB1. Both the isoform consist of a prion like low complexity domains, but the long isoform additionally contain an extra EPF domain next the Prion like low complexity domain. The paper reports that in murine cells the long isoform is 3-4 fold more abundant than the short isoform. By using STED microscopy they show SATB1 foci lie next to transcription sites in the nucleus. They conclude by looking at the spherical shape of the SATB1 foci and the susceptibility of SATB1 staining after 1,6 hexanediol treatment that SATB1 forms the small foci in the nucleus due to LLPS. The authors also use RAMAN spectroscopy to conclude a change in nuclear chemical space in absence of SATB1 but without much explanation about which chemical bond or nuclear sub structure change correspond to the change in principal component analysis from Raman spectroscopy. The authors use the light inducible aggregation cry2 tag with the PrD domain of SATB1 and compare it with the Cry2-FUS-LC domain to conclude that the SATB1 LC domain can undergo LLPS. The authors hint at involvement of RNA and also DNA in the LLPS of the SATB1 but without going into any detail. Reviewer: The paper reports that in murine cells the long isoform is 3-4 fold more abundant than the short isoform.

      Actually, in page 5 (lines 94-96) of the manuscript we write: “We confirmed that in murine thymocytes the steady state mRNA levels of the short Satb1 transcripts were about 3-5 fold more abundant compared to the steady state mRNA levels of the long Satb1 transcripts (Fig. 1d).” Although the steady state mRNA levels of the long isoform are less abundant compared to the shorter isoforms, the long isoform protein levels are almost comparable to the short isoform as deduced based on immunofluorescence experiments. Moreover, Using our two immunodepletion experiments we quantified the difference, estimating the long isoform being 1.5× to 2.62× less abundant than the short isoform (Fig. 1f and Supplementary Fig. 1b; compare lanes 2 & 3 at the lower panel). • Regarding the RAMAN spectroscopy experiments please see Minor Comment #1 of this Reviewer (page 10).

      The key conclusions of the paper are- A) SATB1 undergoes LLPS. But this conclusion is drawn after correlative experiments as detailed below-

      This conclusion is indeed made based on correlative experiments only for the primary murine T cells, which do not allow for any targeted experiments. However, the use of in vitro cell lines allowed us to validate these findings using the optogenetic approaches, utilizing additional experimentation.

      1) observation of spherical punctae by STED-which could also seem spherical due to their small size. The resolution limit achieved by the STED microscopy used in this paper is not determined or mentioned clearly.

      In the revised version of the manuscript, we have specified the resolution of our systems, for STED in Lines 745-746: ”This system enables super-resolution imaging with 35 nm lateral and 130 nm axial resolution.” and for SIM in Lines 759-761: “Images were acquired over the majority of the cell volume in z-dimension with 15 raw images per plane (five phases, three angles), providing ~120-135 nm lateral and ~340-350 nm axial resolution for 488/568 nm lasers, respectively.” The size of the observed speckles is thus above the resolution limit with sizes ranging between 40-80 nm.

      The resolution of our systems is routinely verified by the following methods: The resolution of our OMX (SIM-3D) system was tested using ARGO-SIM slide containing a pattern of 36 µm long lines with gradually increasing spacing ranging from (left to right) 0 to 390 nm, with a step of 30 nm (Fig. 1 below). Our SIM system was able to clearly resolve two lines separated by 120 nm.

      2) No live cell FRAP experiment with fluorescent SATB1 long or short isoform to show that these foci are liquid like

      We did perform FRAP experiments for the SATB1 N-terminus optogenetic construct as demonstrated in Fig. 4f. We did not perform FRAP in the primary murine T cells as this is not technically feasible without creating a new mouse line with fluorescently labeled protein. In the revised version of the manuscript, we additionally performed FRAP experiments for the full length short and long isoform of SATB1 labeled with EGFP and transfected into the NIH-3T3 cell line (Supplementary Figure 6f).

      5) LLPS is strongly coupled to the cellular concentration of the proteins. Authors should quantify the cellular concentration of the long and short isoform in the cells.

      We did consider protein concentration in our analyses of optogenetic constructs in Fig. 4b,d,e and Supplementary Fig. 6a,b,c. Quantifying the physiological cellular concentration of short and long SATB1 protein isoforms in primary T cells is impossible due to the inherent inability to discriminate between the isoforms by two antibodies, in the absence of Satb1 isoform-specific knockout mice.

      However, an approximation of the cellular concentration can be obtained from our immunodepletion experiments. On top of the original immunodepletion experiment that we now present in Supplementary Fig. 1a,b; in the revised version of the manuscript we have repeated the experiment in Fig. 1e,f. Comparison of the two bands for the long and short SATB1 isoforms in the lower panel of the western blot figures suggest that the long SATB1 isoform protein levels are 1.5× to 2.62× less abundant than the short isoform, according to the original and new immunodepletion experiment, respectively. This is now also included in the main text in Lines 110-116: “This experiment can also be used for approximation of the cellular protein levels of SATB1 isoforms in primary murine thymocytes. Comparison of the two bands for long (lane 2) and short SATB1 (lane 3) isoform in the lower panel of Fig. 1f and Supplementary Fig. 1b, suggests that the long SATB1 isoform protein levels may be about 1.5× to 2.62× less abundant than the short isoform, according to the two replicates of our immunodepletion experiment, respectively.”

      Major conclusion B)- SATB1 regulates transcription and splicing.

      This was also shown previously and in this paper they show the close proximity of the transcription site and SATB1 foci by microscopy. Hexanediol treatment which lead to loss of colocalization between FU foci and SATB1 is also taken as an evidence in regulation of transcription is not right as the transcription foci itself can be dissolved using 1,6 Hexanediol. Although the rate of transcription is not measured quantitatively.

      As mentioned in comment #3 (page 29) of this Reviewer, unfortunately there is no better tool to investigate these questions in primary cells than using microscopy approaches in conjunction with hexanediol treatment. However, we should also note that there is an accompanying manuscript from our group that is currently being under revision in another journal (preprint available: Zelenka et al., 2021; https://doi.org/10.1101/2021.07.09.451769). In the preprint manuscript, we showed that: 1. the long SATB1 isoform binding sites have increased chromatin accessibility than what expected by chance (Fig. 3b), 2. there is a drop in chromatin accessibility at SATB1 binding sites in Satb1 cKO mouse (Fig. 3c) and 3. this drop in chromatin accessibility is especially evident at the transcription start sites of genes (Supplementary Fig. 1i)

      We believe that, together these data suggest a direct involvement of SATB1 in transcription regulation. Also note the vast transcriptional deregulation that occurs in Satb1 cKO T cells, affecting the expression of nearly 2000 genes (Fig. 2f, this revised manuscript). That is why we believe that the co-localization analysis, using super-resolution microscopy, presented in Fig. 2c and quantified in Fig. 3g, represents a nice additional support to our claims. Moreover, in the revised version of the manuscript we now present a positive correlation between SATB1 binding and deregulation of splicing (Supplementary Fig. 4d) which also supports its direct involvement in the regulation of transcriptional and co-transcriptional processes.

      In the revised version of the manuscript we have made this clear in Lines 182-194: “Satb1 cKO animals display severely impaired T cell development associated with largely deregulated transcriptional programs as previously documented19,37,38. In our accompanying manuscript19, we have demonstrated that long SATB1 isoform-specific binding sites (GSE17344619) were associated with increased chromatin accessibility compared to randomly shuffled binding sites (i.e. what expected by chance), with a visible drop in chromatin accessibility in Satb1 cKO. Moreover, the drop in chromatin accessibility was especially evident at the transcription start site of genes, suggesting that the long SATB1 isoform is directly involved in transcriptional regulation. Consistent with these findings and with SATB1’s nuclear localization at sites of active transcription, we identified a vast transcriptional deregulation in Satb1 cKO with 1,641 (922 down-regulated, 719 up-regulated) differentially expressed genes (Fig. 2f). Specific examples of transcriptionally deregulated genes underlying SATB1-dependent regulation are provided in our accompanying manuscript19. Additionally, there were 2,014 genes with altered splicing efficiency (Supplementary Fig. 4d-e; Supplementary File 3-4). We should also note that the extent of splicing deregulation was directly correlated with long SATB1 isoform binding (Supplementary Fig. 4d).”

      Major conclusion C)-Post transcriptional modification is important for SATB1 function.

      This point is just barely touched upon in the last figure of the paper

      We would not call the identification of the novel phosphorylation site as a main conclusion of our manuscript. Though, it is already known that posttranslational modifications of SATB1 are important for its function as they can function as a molecular switch rendering SATB1 into either an activator or a repressor (Kumar et al., 2006; https://doi.org/10.1016/j.molcel.2006.03.010).

      In the revised manuscript, we support the effect of serine phosphorylation on the DNA binding capacity of SATB1 by another experiment. We have performed DNA affinity purification experiments utilizing primary thymocyte nuclear extracts treated with phosphatase (Supplementary Fig. 7b). We found that SATB1’s capacity to bind DNA (RHS6 hypersensitive site of the TH2 LCR) is lost upon treatment with phosphatase (Supplementary Fig. 7c). These results are in line with the data presented in Supplementary Fig. 7d, indicating the lost ability of SATB1 to bind DNA upon mutating the discovered phosphorylation site S635. Given the importance of posttranslational modifications of proteins on LLPS, we found it relevant to include it in our manuscript. Even more so, when we identified SATB1 aggregation, upon mutation of this phospho site (Fig. 6d).

      Overall I find that the major conclusion-point A and B, is based on very indirect experiments and needs much more convincing data and the role of SATB1 LLPS in cells should be demonstrated more rigorously. And conclusion C is barely described and needs a lot more cell biological and genetic evidence.

      One of the major assets of our work is that most of our data are based on the analysis of primary murine T cells and thus investigating the biological roles of the endogenous SATB1 protein, under physiological conditions. We apologize that we did not make it clear to this Reviewer, that our system has certain inherent limitations due to the utilization of primary cells.

      I do not recommend publishing the paper in current state. The story needs much more experiment to convincingly prove the major conclusions. Further, the MS needs more careful thinking and presentation to make it streamlined.

      We hope that in the revised version we have significantly improved the quality of our manuscript by implementing the suggested changes.

      Minor comments: One of the major flaw of the paper is the use too many techniques without proper explanation. E.g. use of STED and RAMAN microscopy need controls and explanation on what is being quantified. The use of Raman microscopy to quantify the nuclear environment of nucleus is not related to the chromatin organization or LLPS of SATB1 at all. And no information is provided at all which aspect of nuclear organization is being measured in Raman and what it means for the LLPS of SATB1.

      We do provide quite a thorough explanation of Raman spectroscopy and the underlying quantification in Lines 224-231: “we employed Raman spectroscopy, a non-invasive label-free approach, which is able to detect changes in chemical bonding. Raman spectroscopy was already used in many biological studies, such as to predict global transcriptomic profiles from living cells42, and also in research of protein LLPS and aggregation43–47. Thus we reasoned that it may also be used to study phase separation in primary T cells. We measured Raman spectra in primary thymocytes derived from both WT and Satb1 cKO animals and compared them with spectra from cells upon 1,6-hexanediol treatment. Principal component analysis of the resulting Raman spectra clustered the treated and non-treated Satb1 cKO cells together, while the WT cells clustered separately (Fig. 3h).” We also do provide controls as the method was performed on both treated and untreated WT and Satb1 cKO cells.

      Regarding the RAMAN spectroscopy experiments we now provide more information on the changes of chemical bonds altered between wild type and Satb1 cKO thymocytes. Following principal component analysis, we have extracted the two main principal components that were used for the clustering of our data. The differences are presented in Supplementary Fig. 5d.

      We do realize that RAMAN spectroscopy, although a quite novel approach utilized to study LLPS, has not been used to study LLPS in live cells. If deemed proper we are willing to avoid presenting these results in this manuscript.

      Similarly for Hexanediol treatment, duration of treatment is missing. Hexanediol can also dissolve the liquid like transcription foci. And hence a decrease in correlation between SATB1 foci and FU foci cannot be taken as a measure of SATB1 foci connection to transcription alone

      The duration of hexanediol treatment was 5 minutes as presented in Line 724 and in the revised version of the manuscript also in Lines 1206-1207. We should also note that additionally, we performed experiments with different hexanediol concentrations and timing varying from 1 minute to 10 minutes with results consistent with the data presented.

      It is not very clear how many times the STED or Raman microscopy is done on how many samples and biological replicates. Similarly for RNA sequencing number of samples and description of controls are missing. Also if the sequencing data is made publicly available is not clear.

      Data availability is clearly stated in Lines 506-509: “RNA-seq experiments and SATB1 binding sites are deposited in Gene Expression Omnibus database under accession number GSE173470 and GSE173446, respectively. The other datasets generated and/or analyzed during the current study are available upon request.”

      The Reviewer’s token is “wjwtmeeeppovzqx”.

      RNA sequencing was performed in a biological triplicate for each genotype as stated in the GEO repository and now also in Line 566 of the revised manuscript.

      In Lines 180-181, we also state that it was performed on Satb1 cKO animals and WT mice as a control: “we performed stranded-total-RNA-seq experiments in wild type (WT) and Satb1fl/flCd4-Cre+ (Satb1 cKO) murine thymocytes”.

      In Lines 739-740, we now also state that all imaging approaches were performed on at least two biological replicates (different mice) and please also note the fact that all findings were based on data from both STED and 3D-SIM methods, allowing to minimize detection of artifacts. In the Raman spectroscopy figure, each point represents measurements from an individual cell and for each condition we used 2-5 biological replicates (Lines 831-832 & Line 1169).

      Similarly, in Lines 129-132 we provided a quite detailed description of differences between STED and 3D-SIM, even though these techniques are not that rare as Raman spectroscopy in biology research.

      Additional control is needed to report the resolution limit of Superresolution techniques-STED and 3D-SIM systems used by them.

      We have already provided this information in our reply to comment #1 of this Reviewer (pages 6-7): In the revised version of the manuscript, we have specified the resolution of our systems, for STED in Lines 745-746: ”This system enables super-resolution imaging with 35 nm lateral and 130 nm axial resolution.” and for SIM in Lines 759-761: “Images were acquired over the majority of the cell volume in z-dimension with 15 raw images per plane (five phases, three angles), providing ~120-135 nm lateral and ~340-350 nm axial resolution for 488/568 nm lasers, respectively.” The resolution of our systems is routinely verified by the following methods: The resolution of our OMX (SIM-3D) system was tested using ARGO-SIM slide containing a pattern of 36 µm long lines with gradually increasing spacing ranging from (left to right) 0 to 390 nm, with a step of 30 nm (Fig. 1 below). Our SIM system was able to clearly resolve two lines separated by 120 nm.

      Would be very helpful if the zonation was plotted for the FluoroUridine (FU) also to show that Zone1 (heterochromatin) is completely depleted of FU, and is present in other regions.

      In the revised version of the manuscript, we performed the suggested analysis and in Supplementary Fig. 3a we now show that indeed FU is significantly less localized to Zone 1 (heterochromatin) and has the most abundant localization in Zones 3 and 4, similar to the localization of SATB1 protein, as demonstrated in Fig. 2b.

      Scale bar needed figure 3d

      In the revised version of the manuscript, we included scale bars which are both 0.5 µm (line 1213).

      Perfectly rounded SATB1 foci- this does not mean LLPS. For LLPs measurement, protein condensate dynamics measurement by FRAP or fusion experiments is required. What is the size of condensates? and cellular concentration of SATB1? Will SATB1 undergo LLPS in vitro at similar concentrations? does SATB1 interact with DNA or RNA to undergo LLPS ?

      We toned down this sentence which now reads: “Here we demonstrated its connection to transcription and found that it forms spherical speckles (Fig. 1g), markedly resembling phase separated transcriptional condensates. (Lines 200-202)”.

      Moreover, as explained in earlier replies to comments of this Reviewer, we cannot perform FRAP on primary murine T cells without generating a new mouse line. We did, however, use FRAP and other in vitro approaches including visualization of droplet fusion in ex vivo experiments utilizing cell lines. Moreover, we are willing to demonstrate the LLPS properties of SATB1 on in vitro purified SATB1 protein as indicated in the suggested experiment of Point#4 (page 2).

      After careful reading of the MS I conclude that the main conclusions of the paper are very preliminary and need much more detailed experiments. So does not qualify to get published at all at this stage.

      **Reviewer #1 (Significance)**:

      The present manuscript tries to connect the phase separation of SATB1 to understanding the mechanism of SATB1 function in cells. One of the major hallmarks of phase separation is dynamic, liquid-like behaviour and in absence of these measurements, it is very difficult to say that the current manuscript has made any contribution to showing that SATB1 can phase separate.

      The presence of 2 isoforms of SATB1 is a novel finding and the paper could have focused more on this. E.g. elucidate expression of the isoform during thymocyte development and maturation.

      As a reviewer my expertise are cell biology experiments, microscopy, in vitro reconstitution assays, RNA binding proteins, RNA and RBP condensate formation. And I feel that the reconstitution experiments are an important tool for understanding phase behaviour of proteins and also to gauge if this behaviour can occur or not in cellular concentration and conditions.

      I do not have sufficient expertise in Raman microscopy and hence the information provided in the MS on this part was not enough to understand the experiment and conclusions drawn from it.

      **Reviewer #2 (Evidence, reproducibility and clarity)**:

      The authors have reported the existence of a 'long' SATB1 isoform which also undergoes LLPS. The authors tried to draw multiple comparisons and pointed out distinction between phase properties of SATB1 isoforms. The authors also touch upon two functional roles of SATB1. Although a wide array of assays are used, the data presented and hence the manuscript makes multiple transitions into disparate hypotheses without diving deep into a single hypothesis. As a result, the connections drawn are unclear, and do not converge at best. The authors have used number of techniques, however, the results do not support their conclusions and they appear hastily drawn. It is not clear why the authors jump from one context to the other, discussing LLPS first, then transcription, splicing, post-translational modification and finally cancer. The link between all of these isn't clear and not fully supported by data. It appears that the authors wish to focus on Satb1's physiological role in development, hence the data on breast cancer is confusing. Thus, this work suffers from multiple pitfalls. Specific comments are given below:

      Major comments 1. Importantly, in Fig 1d, there is no statistics shown. There is no mention of number of replicates as well in the legends. Proper statistical evaluation is critical for interpreting this result.

      Please note that Fig. 1d only serves as a control to the sequencing experiment in Fig. 1b. In Line 566, we now state that for the RNA-seq: “A biological triplicate was used for each genotype.” To validate these data, we further designed a RT-qPCR experiment which was performed on three technical replicates from a male and female mouse. We now state this in Line 636. For the low number of samples, statistical tests are not accurate but we still added t test into the figure Fig. 1d and specified it also in the figure legend in Line 1169-1170.

      1. Figure 1f presents one of the weakest evidences in the manuscript. There are a number of corrections needed. Firstly, being their major and only validation figure for their custom antibody, the immunoblot is not clean, bands are fuzzy. Importantly, as the authors claim that the antibody is highly specific to 'long' SATB1, after the IP there should be only a single band (like input) of Satb1 long. But that does not seem to be the case, rather an array of bands are visible below (lane 2 top panel). This could easily mean that the shorter isoforms or non-specific protein bands are also pulled down with the 'long' form specific antibody. Therefore, raising a critical concern regarding the specificity of the antibody.

      • The long antibody was raised in mice inoculated with the extra peptide present in the long isoform only. Therefore, the capacity of this antibody precipitating the shorter isoforms, which do not express the sequence of the extra peptide (EP, Figure 1a) in not possible. • We have repeated the immunodepletion experiment and we now provide the results in Fig. 1f and Supplementary Fig. 1b. The western blot in Fig. 1f is now cleaner and supports quite convincingly the presence of a long SATB1 isoform. Given the lack of isoform-specific knockouts which we could utilize to immunoprecipitate or detect the different isoforms in a single cell (or cell population), the utilized approach of immunodepletion and subsequent western blotting is the approach we thought of implementing. • As shown in Fig. 1f and Supplementary Figure 1b, the long isoform SATB1 antibody has the capacity to recognize the long isoform in murine thymocyte protein extracts but not the short SATB1 isoform (please compare lane 3 in the two western blots utilizing either the antibody for the long isoform -top panel - or the antibody that detects both isoforms (lower panel). • We have performed Immunofluorescence experiments utilizing the antibody detecting the long SATB1 isoform in thymocytes isolated from either C57BL/6 or Satb1 cKO mice. The antibody is specific to the SATB1 protein since there is no signal in immunofluorescence experiments utilizing the knockout cells (Supplementary Figure 1c). • We have performed Immunofluorescence experiments utilizing thymocytes and the antibody detecting the long SATB1 or a commercially available antibody detecting all SATB1 isoforms. The pattern of SATB1 subnuclear localization is similar for both antibodies (Supplementary Figure 1e). • In our accompanying revised manuscript Zelenka et al., 2022 (https://doi.org/10.1101/2021.07.09.451769), we provide yet another piece of evidence, consisting of bacterially expressed short and long SATB1 protein isoforms detected by western blot using either the long isoform-specific or the non-selective all SATB1 isoforms antibodies. • Regarding the additional bands detected in the immunoprecipitation experiment presented in the original Supplementary Figure 1b (lane 2), it is not surprising that additional bands appear in a sample of protein extracts that is used for several hours for the immunoprecipitation experiments, while the “input” sample simply denotes protein extract that is frozen at -80oC right after the preparation of protein extracts until use. It is well-established that SATB1 is the target of proteases which might as well be active during the immunoprecipitation steps (2 consecutive immunoprecipitation steps take place). Therefore, the immunoprecipitated material cannot necessarily be a copy of the input material displaying a single protein band even if protease inhibitors are included in the buffers.

      Taken together the experiments described here we showed that the antibody raised against the extra 31 aa long peptide, present only in the long SATB1 isoform, is specific for this isoform.

      1. Related to Fig. 2 a, the authors state on Pg 5, '....the euchromatin and interchromatin regions (zones 3 & 4, Fig. 2a, b).' Although the DAPI correlation seems clear, there is no mention on how they reached the above said correlation. They should at least show a parallel speckle staining for HP1 or signature modification such as H3K4me9 STEDs for making supporting such a claim. DAPI alone is not sufficient. The authors should rectify the text thoroughly for many such interpretations without validation/reference or provide relevant data.

      This is a great suggestion we have again taken under consideration and we added the following experiments and the appropriate changes in the revised version of our manuscript. • We modified the text and added a reference to Miron et al., 2020 (https://doi.org/10.1126/sciadv.aba8811) supporting our claims regarding SATB1 localization in relation to DAPI staining. • We have also added new microscopy images for HP1, H3K4me3 and fibrillarin staining and quantified the localization of FU-stained sites of active transcription in nuclear zones, to further support our claims. • This whole modified part in Lines 139-167 then reads: “ “The quantification of SATB1 speckles in four nuclear zones, derived based on the relative intensity of DAPI staining, highlighted the localization of SATB1 mainly to the regions with medium to low DAPI staining (zones 3 & 4, Fig. 2a, b). A similar distribution of the SATB1 signal could also be seen from the fluorocytogram of the pixel-based colocalization analysis between the SATB1 and DAPI signals (Supplementary Fig. 2a). SATB1’s preference to localize outside heterochromatin regions was supported by its negative correlation with HP1β staining (Supplementary Fig. 2b). Localization of SATB1 speckles detected by antibodies targeting all SATB1 isoforms and/or only the long SATB1 isoform, revealed a significant difference in the heterochromatin areas (zone 1, Fig. 2b), where the long isoform was less frequently present (see also Fig. 2a and Fig. 3c). Although, this could indicate a potential difference in localization between the two isoforms, due to the inherent difficulty to distinguish the two based on antibody staining, we refrain to draw any conclusions. The prevailing localization of SATB1 corresponded with the localization of RNA-associated and nuclear scaffold factors, architectural proteins such as CTCF and cohesin, and generally features associated with euchromatin and active transcription32. This was also supported by colocalization of SATB1 with H3K4me3 histone mark (Supplementary Fig. 2c), which is known to be associated with transcriptionally active/poised chromatin. Given the localization of SATB1 to the nuclear zones with estimated transcriptional activity32 (Fig. 2b, zone 3), we investigated the potential association between SATB1 and transcription. We unraveled the localization of SATB1 isoforms and the sites of active transcription labeled with 5-fluorouridine. Sites of active transcription displayed a significant enrichment in the nuclear zones 3 & 4 (Supplementary Fig. 3a), similar to SATB1. As detected by fibrillarin staining, SATB1 also colocalized with nucleoli which are associated with active transcription and RNA presence (Supplementary Fig. 3b). Moreover, we found that the SATB1 signal was found in close proximity to nascent transcripts as detected by the STED microscopy (Fig. 2c). Similarly, the 3D-SIM approach indicated that even SATB1 speckles that appeared not to be in proximity with FU-labeled sites in one z-stack, were found in proximity in another z-stack (Supplementary Fig. 3c). Additionally, a pixel-based colocalization of SATB1 and sites of active transcription is quantified later in the text in Fig. 3g, supporting their colocalization.”

      1. The authors mention, '...of the different SATB1 isoforms, uncovered by the use of the two different antibodies, relied in the heterochromatin areas (zone 1), where the long isoform was less frequently...' There is no supporting figure number mentioned. The authors need to show a zone-by-zone comparison images for 'all iso' vs 'long' iso of SATB1. Just to reiterate, there is a need for a heterochromatin mark to unambiguously call out the distinction.

      We should remind that there is an inherent difficulty to accurately compare localization of short and long SATB1 isoforms in primary cells, especially due to the lack of Satb1 isoform-specific knockout mice. There is no way to detect only the short isoform in these primary cells as there are only antibodies targeting the long or all SATB1 isoforms. Therefore, we cannot set up additional experiments probing these questions.

      In line with this, in the revised version of the manuscript, we toned down our statements regarding the differential localization of the two isoforms in primary cells. We only refer to it as an indication and we support it by adding references to the relevant figures. This part now reads: “Localization of SATB1 speckles detected by antibodies targeting all SATB1 isoforms and/or only the long SATB1 isoform, revealed a significant difference in the heterochromatin areas (zone 1, Fig. 2b), where the long isoform was less frequently present (see also Fig. 2a and Fig. 3c). Although, this could indicate a potential difference in localization between the two isoforms, due to the inherent difficulty to distinguish the two based on antibody staining, we refrain to draw any conclusions. (Lines 145-150)”

      1. On the same lines, '....Given the localization of SATB1 to the nuclear zones with estimated transcriptional activity (Fig. 2b, zone 3)....' How was the region labelled as transcriptionally active? For the statistical analysis of speckle count for the two antibodies' staining, the claim posited is a bit bigger. This could simply be true for that cell. The authors thus need to statistically analyse the speckle counts for multiple cells. This needs to be done for all imaging statistics done in multiple figures throughout the manuscript.

      As mentioned in our reply to the two previous comments of this Reviewer, transcriptional activity in relation to the nuclear zonation is well established in the literature. To make this clear, we have now added the reference to Miron et al., 2020 (https://doi.org/10.1126/sciadv.aba8811) supporting our claims and additionally we have also included HP1, H3K4me3 and fibrillarin staining and quantification of FU signal in the nuclear zones. Moreover, it is not clear to which particular cell the comment refers to. The presented dots in Fig. 2b represent individual cells and the relative proportions of speckles in each nuclear zone are plotted on the y axis. In the revised version of the manuscript, we added into the figure the number of cells scored and we adapted the figure legend so that it is absolutely clear that we have analyzed multiple cells:

      “Nuclei of primary murine thymocytes were categorized into four zones based on the intensity of DAPI staining and SATB1 speckles in each zone were counted. Images used represented a middle z-stack from the 3D-SIM experiments. The graph depicts the differences between the long and all SATB1 isoforms’ zonal localization in nuclei of primary murine thymocytes. (Lines 1189-1193)”

      1. For figure 2c. the authors have used 5 Fluorouridine for nascent RNA speckles. 5FU is known to have a spread signal type (with strong association to nucleolus as well). This is not the case for the image presented 2c. The authors should resolve this by showing different sets of images.

      Developing and naive T cells are very unique in terms of their metabolic features and thus they should not be directly compared with other cell types. Therefore, we would not expect to see such a spread FU pattern as previously shown for other cell types. Having said that, we could not find any reference publication that utilized super-resolution microscopy to detect localization of FU-stained sites of active transcription in developing primary T cells. However, we performed additional immunofluorescence experiments to demonstrate the colocalization or its lack between SATB1 and HP1 (Supplementary Fig. 2b), H3K4me3 (Supplementary Fig. 2c) and fibrillarin (Supplementary Fig. 3b). Moreover, we provide additional regions of SATB1 and FU staining in Supplementary Fig. 3c. The modified text reads:

      “We unraveled the localization of SATB1 isoforms and the sites of active transcription labeled with 5-fluorouridine. Sites of active transcription displayed a significant enrichment in the nuclear zones 3 & 4 (Supplementary Fig. 3a), similar to SATB1. As detected by fibrillarin staining, SATB1 also colocalized with nucleoli which are associated with active transcription and RNA presence (Supplementary Fig. 3b). Moreover, we found that the SATB1 signal was found in close proximity to nascent transcripts as detected by the STED microscopy (Fig. 2c). Similarly, the 3D-SIM approach indicated that even SATB1 speckles that appeared not to be in proximity with FU-labeled sites in one z-stack, were found in proximity in another z-stack (Supplementary Fig. 3c). Additionally, a pixel-based colocalization of SATB1 and sites of active transcription is quantified later in the text in Fig. 3g, supporting their colocalization. (Lines 157-167)”

      1. Fig 2 d., the authors have suddenly jumped solely to 'all iso' Satb1 here for IP MS. Is there a reason for that? The authors either need to do this with 'long iso' antibody or remove the analysis from the manuscript as it does not add to their primary aim of the manuscript. Also, the authors have only selectively talked about two clusters? What about chromatin related proteins? It is quite intuitive to have highest enrichment of these given previous literature and even IP MS data by other groups. Thus, it is necessary to revise this thoroughly or remove it.

      We appreciate the acknowledgment by the Reviewer that our IP-MS data identified anticipated factors. In the revised version of the manuscript we modified the underlying text to accommodate references to these former findings revealing interactions between SATB1 and chromatin modifying complexes: “Apart from subunits of chromatin modifying complexes that were also detected in previous reports25,33–36, unbiased k-means clustering of the significantly enriched SATB1 interactors revealed two major clusters consisting mostly of proteins involved in transcription (blue cluster 1; Fig. 2d and Supplementary Fig. 4c) and splicing (yellow cluster 2; Fig. 2d and Supplementary Fig. 4c). (Lines 170-174)”

      Please note that many subunits of chromatin modifying and chromatin-related complexes are in fact characterized as transcription-related factors, therefore our statements are not in disagreement with the former findings. Note also that we provide Supplementary File 1 & 2 with comprehensive description of our IP-MS data for the readers’ convenience. Please also note that we are the first group to report on the existence of the long isoform. Therefore, we find it absolutely reasonable to perform IP-MS experiment for all SATB1 isoforms which can then be used for a comparison with other publicly available datasets. We believe that there is no contradiction in this experimental setup in relation to the rest of the manuscript. We discuss the two major clusters simply because they are the two major clusters identified as indicated in Fig. 2d. Additionally, in Supplementary Fig. 4c, we provide a comprehensive description of all significantly enriched interactors including their cluster annotation and thus anyone can investigate the data if needed.

      1. In relation to Fig. 2f, the authors have not mentioned any of the previously published work on Satb1 CD4 specific KO, not even the RNA seq studies the other groups have reported under the same condition. Only an unpublished reference of their own (preprint) is cited. It is imperative to show how much their data corroborates with other published studies. Additionally, what is the binding site status of dysregulated genes?

      In the revised version of the manuscript, we have included the references to other studies using the same Satb1 conditional knockout. Moreover, we have clarified the relationship between SATB1 binding and gene transcription. The modified part in Lines 182-194 now reads: “Satb1 cKO animals display severely impaired T cell development associated with largely deregulated transcriptional programs as previously documented19,37,38. In our accompanying manuscript19, we have demonstrated that long SATB1 isoform specific binding sites (GSE17344619) were associated with increased chromatin accessibility compared to randomly shuffled binding sites (i.e. what expected by chance), with a visible drop in chromatin accessibility in Satb1 cKO. Moreover, the drop in chromatin accessibility was especially evident at the transcription start site of genes, suggesting that the long SATB1 isoform is directly involved in transcriptional regulation. Consistent with these findings and with SATB1’s nuclear localization at sites of active transcription, we identified a vast transcriptional deregulation in Satb1 cKO with 1,641 (922 down-regulated, 719 up-regulated) differentially expressed genes (Fig. 2f). Specific examples of transcriptionally deregulated genes underlying SATB1-dependent regulation are provided in our accompanying manuscript19. Additionally, there were 2,014 genes with altered splicing efficiency (Supplementary Fig. 4d-e; Supplementary File 3-4). We should also note that the extent of splicing deregulation was directly correlated with long SATB1 isoform binding (Supplementary Fig. 4d).”

      1. In context of Figure 3a and b, the authors write .'...The long SATB1 isoform speckles evinced such sensitivity as demonstrated by a titration series with increasing concentrations of 1,6-hexanediol treatment followed...' Whereas it is apparent from the image at least that overall numbers of individual speckles are instead increased at both 2 and 5%. There is although a clear spreading of restricted speckles compared to the controls. The authors should revise their figures to substantiate the associated text. Furthermore, there needs to be 'all iso' SATB1 3D SIM imaging and not just quantitation for comparison. This is also true for panel c in order to demonstrate the effect.

      In the revised Fig. 3a we provide new images which better reflect the underlying data analysis. Moreover, in Fig. 3c and Fig. 3d we provide an additional comparison between SATB1 all isoforms and long isoform staining and their changes upon hexanediol treatment, detected by both the 3D-SIM and STED approaches. It is true that upon treatment, there tend to be more speckles, however these are much smaller as they are gradually being dissolved. Depending on the treatment duration, the cells are swollen which is reflected in increased spreading of speckles. Nevertheless, the nuclear size was considered in all the quantification analyses. We believe that the new images provide better evidence of SATB1’s sensitivity to hexanediol treatment.

      1. Fig. 3 d also does not clearly demonstrate what the authors have claimed '...hexanediol treatment highly decreased colocalization between...' The figure shows at best decreased signal intensity for both SATB1 and FU. We suggest that the authors should give a statistical analysis as well for the colocalization points between the two using multiple source images. Lastly, the two images shown (control and treated), there seems to be a clearly visible magnification difference. The authors should clarify this.

      • In the revised version of the manuscript in Figure 3d, we have provided scale bars, which are both 0.5 µm (line 1213). The difference observed by this Reviewer is actually the main reason why we provided this image. Figure 3d demonstrates that upon hexanediol treatment, the speckles are mostly missing or significantly reduced in size, for both FU and SATB1 staining. • Moreover, the suggested statistical analysis is also provided – in Figure 3e. In Figure 3e, we performed pixel-based colocalization analysis which is a method that allows both quantification and statistical comparison of colocalization between two factors and between different conditions. Please note especially the decreased colocalization between long SATB1 isoform and FU-stained sites of active transcription in the left graph, which is in agreement with our claims in the manuscript. • Moreover, our data are compared to a negative control, i.e. 90 degrees rotated samples, which is a common method in colocalization experiments as described for example in Dunn et al., 2011 (https://doi.org/10.1152/ajpcell.00462.2010). • Additionally, we provide Costes’ P values which are based on randomly scrambling the blocks of pixels (instead of individual pixels, because each pixel’s intensity is correlated with its neighboring pixels) in one image, and then measuring the correlation of this image with the other (unscrambled) image. Please see Costes et al., 2004 (https://doi.org/10.1529%2Fbiophysj.103.038422) for more details.

      1. Figure 3f. The authors show the PC plot for Raman spectroscopy for phase behaviour due to Satb1. The experiment and its related text seems misinterpreted; the authors write...' ese bonds were probably enriched for weak interactions responsible for LLPS that are susceptible to hexanediol treatment. This shifted the cluster of WT treated cells towards the Satb1 cKO cells. However, the remaining covalent bonds differentiated the WT samples from Satb1 cKO cells......' whereas the clusters are clearly far away in 3D for both WT and KO while being closer to their respective treatments. Which is also intuitive given the sensitivity of Raman spectroscopy. Thus, it is more likely to be treatment effect and KO effect as separate. Treatment of WT leads to KO like spectra is far-fetched. Thus, the authors need to show separate PCs and modify their text thoroughly.

      We do not present any 3D graph hence it is not clear what the Reviewer refers to. Please also note that as stated in Lines 817-818, we used a customized Raman Spectrometer. Therefore, this approach allowed us to measure Raman spectra at cellular and even sub-cellular levels. For example, solely by utilizing Raman spectroscopy, we can now distinguish euchromatin and heterochromatin, methylated and unmethylated DNA and RNA, etc. This, together with other reports, such as Kobayashi-Kirschvink et al., 2018 (https://doi.org/10.1016/j.cels.2018.05.015) and Kobayashi-Kirschvink et al., 2022 (https://doi.org/10.1101/2021.11.30.470655), indicate a potential use of Raman in biological research. In our manuscript, we used this method as a supplementary approach, however we do find it noteworthy. We should also emphasize that in the revised Raman spectroscopy Fig. 3h, each point represents measurements from an individual cell and for each condition we used 2-5 biological replicates (Lines 831-832 & Lines 1225-1226). We specifically refer to the principal component 1 (PC1) that differentiates the samples. Therefore, there are certain spectra (representing certain chemical bonding) that allowed us to differentiate between WT and Satb1 cKO. The same type of bonding was then affected when WT samples were treated with hexanediol and we also had controls to rule out the impact of hexanediol on the resulting spectra.

      1. In Fig 4. b, The authors have shown the propensity of SATB1 N terminus to phase separate using different optodroplet constructs. Although the imaging is clear, why are the regions selected not uniform when comparing various constructs?

      We have selected images that would best represent each category. Please note that this was live cell imaging of photo-responsive constructs, thus there are many limitations regarding the area selection. Very often, even the brief time of bright light exposure to localize cells may trigger protein clustering. Upon disassembly, every new light exposure of the same cell then triggers much faster assembly which skews the overall results. It is therefore desired to work fast, while neglecting selection of equally sized cells. Moreover, it is not clear how would the proposed change improve the quality of our manuscript.

      1. Figure 5a, the disassembly should be shown for 'long' SATB1 as well. On pg 13, the authors write '....cytoplasmic protein aggregation has been previously described for proteins containing poly-Q domains and PrLDs..' no reference given.

      • In the revised version of the manuscript, we present the assembly and disassembly for both short and long full length SATB1 optogenetic constructs. To increase clarity, we present the behavior of the short and long isoforms as two separate images in Figure 5a and Figure 5b, respectively. • Moreover, we provided references to the statement regarding aggregation of PrLD and poly-Q-containing proteins in Lines 305-309, which now reads: ”Since protein aggregation has been previously described for proteins containing poly-Q domains and PrLDs8,11,38,39, we next generated truncated SATB1 constructs encoding two of its IDR regions, the PrLD and poly-Q domain and in the case of the long SATB1 isoform also the extra peptide neighboring the poly-Q domain (Fig. 1a and 4a).”

      1. Fig. 5d, Is there an amino-acid specific reasoning to support the authors claim of the phase behaviour due to extra peptide? They need to show a proper control with equal extra (unrelated) peptide to show the specificity. Are the shorter isoform aggregates responsive to light?

      • We have referred to the amino acid composition bias in Fig. 5c. In the revised version of the manuscript, we made this clear by showing the composition bias in the new revised Fig. 5e. The related part of the main text then reads: “Computational analysis, using the algorithm catGRANULE37, of the protein sequence for both murine SATB1 isoforms indicated a higher propensity of the long SATB1 isoform to undergo LLPS with a propensity score of 0.390, compared to 0.379 for the short isoform (Fig. 5d). This difference was dependent on the extra peptide of the long isoform. Out of the 31 amino acids comprising the murine extra peptide, there are six prolines, five serines and three glycines – all of which contribute to the low complexity of the peptide region3 (Fig. 5e).” (Lines 298-304) • Moreover, we should note that the low complexity extra peptide of the long SATB1 isoform directly extends the PrLD and IDR regions as indicated in Fig. 4a and which we now directly state in Lines 304-305: “Moreover, the extra peptide of the long SATB1 isoform directly extends the PrLD and IDR regions as indicated in the Fig. 4a.” • We show in Fig. 4, that the N terminus of SATB1 undergoes LLPS. Since this part of SATB1 is shared by both isoforms, it is reasonable to assume that both isoforms would undergo LLPS. This is also in line with the observed photo-responsiveness of both short and long full length SATB1 isoforms in CRY2 optogenetic constructs in revised Fig. 5a,b, and similar FRAP results for both short and long full length SATB1 isoform constructs transiently transfected in NIH-3T3 cells in the revised Supplementary Fig. 6f. However, the main reason why we think that the difference in LLPS propensity between the isoforms is important is because the long isoform is more prone to aggregate compared to the short isoform, as documented in Fig 5c,f,g and Supplementary Fig. 5f.

      1. Fig 6c., It is important that authors show the data for NLS+short iso data as well to prove their hypothesis.

      As shown in original Figure 5d, the long SATB1 isoform undergoes cytoplasmic aggregation, unlike the short SATB1 isoform (as shown in the same Figure). Therefore, an image of the NLS + short isoform would not be related to our hypothesis. Actually, we wanted to reverse the long SATB1 isoform’s relocation, from the aggregated form in the cytoplasm into the nucleus. Nevertheless, to show the complete picture, in the revised version of the manuscript in Figure 6c, we now provide data for both short and long SATB1 isoforms.

      1. Fig 6d., The authors claim that mutating a specific P site changes the phase behaviour of the 'short iso'. Does it also increase for the long isoform? The authors need to confirm this in order to verify the effect of a single P site outside of oligomerization domain. ...' phosphorylation status; when phosphorylated it remains diffused, whereas unphosphorylated SATB1 is localized to PML bodies....' This being an important premise, thus should be moved to the results text.

      In the revised version of the manuscript, we moved the part regarding PML in the results section, as suggested by the Reviewer. Moreover, we included additional experiments probing the impact of association between PML and two SATB1 full length isoforms on their dynamics. The modified section in Lines 357-368 now reads: “In relation to this, a functional association between SATB1 and PML bodies was already described in Jurkat cells64. We should note that PML bodies represent an example of phase separated nuclear bodies65 associated with SATB1. Targeting of SATB1 into PML bodies depends on its phosphorylation status; when phosphorylated it remains diffused, whereas unphosphorylated SATB1 is localized to PML bodies66. This is in line with the phase separation model as well as with our results from S635A mutated SATB1, which has a phosphorylation blockade promoting its phase transitions and inducing aggregation. To further test whether SATB1 dynamics are affected by its association with PML, we co-transfected short and long full length SATB1 isoforms with PML isoform IV. The dynamics of long SATB1 isoform was affected more dramatically by the association with PML than the short isoform (Supplementary Fig. 7e), which again supports a differential behavior of the two SATB1 isoforms.”

      Moreover, given the localization of the discussed phosphorylation site in the DNA binding region of SATB1 we did test its impact on DNA binding as documented in the revised Supplementary Fig. 7d. Additionally, as we have noted in our answer in Major Comment C of this reviewer, to further support the effect of serine phosphorylation on the DNA binding capacity of SATB1 we have performed DNA affinity purification experiments utilizing primary thymocyte nuclear extracts treated with phosphatase (Supplementary Fig. 7b) We found that SATB1’s capacity to bind DNA (RHS6 hypersensitive site of the TH2 LCR) is lost upon treatment with phosphatase (Supplementary Fig. 7c).

      1. Pg 16,. The authors have tried to explain multiple things (concepts of self-regulation, accessibility) which is quite tangential. There is no inference to Fig 6f., which is showing the opposite to what the authors had postulated. This portion should either be removed or explained with a rationale. The writing also needs to be revised thoroughly in this section. Similarly, the discussion should also be modified.

      The rationale for the original Fig. 6f (revised Fig. 6g) was described in great detail in Lines 330-343 of the original manuscript. It is not clear why the Reviewer assumes that it shows the opposite to our hypothesis. As we explained, the increased accessibility allows faster read-through by RNA polymerase, and thus the exon with higher accessibility is more likely to be skipped. The exact relationship is shown in the revised Fig. 6g where the increased accessibility is associated with the expression of the short isoform, whereas the long isoform expression needs lower chromatin accessibility which allows the splicing machinery to act on the specific exon to be included. We reason that these findings are important and relevant because: 1) we suggest a potential regulatory mechanism for the SATB1 isoforms production. This is highly relevant to this manuscript given the fact that this is the first report on the existence of the long SATB1 isoform, and 2) the differential production of the long/short SATB1 isoforms has a potential relevance to breast cancer prognosis. In the revised version of the manuscript we added Fig. 6f, which now indicates the differential chromatin accessibility in human breast cancer patients and accordingly the expression of the long SATB1 isoform are associated with worse patient prognosis as indicated in Fig. 6h and Supplementary Fig. 8a,b. In the revised version of the manuscript, we substantially modified the text in Lines 374-408, to make the relevance of all these conclusions clear. The modified text now reads: “Therefore, we reasoned that a more plausible hypothesis would be based on the regulation of alternative splicing. In our accompanying manuscript19, we have reported that the long SATB1 isoform DNA binding sites display increased chromatin accessibility than what expected by chance (Fig. 3b in 19), and chromatin accessibility at long SATB1 isoform binding sites is reduced in Satb1 cKO (Fig. 3c in 19), collectively indicating that long SATB1 isoform binding promotes increased chromatin accessibility. We identified a binding site specific to the long SATB1 isoform19 right at the extra exon of the long isoform (Fig. 6e). Moreover, the study of alternative splicing based on our RNA-seq analysis revealed a deregulation in the usage of the extra exon of the long Satb1 isoform (the only Satb1 exon affected) in Satb1 cKO cells (deltaPsi = 0.12, probability = 0.974; Supplementary File 4). These data suggest that SATB1 itself is able to control the levels of the short and long Satb1 isoforms. A possible mechanism controlling the alternative splicing of Satb1 gene is based on its kinetic coupling with transcription. Several studies indicated how histone acetylation and generally increased chromatin accessibility may lead to exon skipping, due to enhanced RNA polymerase II elongation48,49. Thus the increased chromatin accessibility promoted by long SATB1 isoform binding at the extra exon of the long isoform, would increase RNA polymerase II read-through leading to decreased time available to splice-in the extra exon and thus favoring the production of the short SATB1 isoform in a negative feedback loop manner. This potential regulatory mechanism of SATB1 isoform production is supported by the increased usage of the extra exon in the absence of SATB1 in Satb1 cKO (Supplementary File 4). To further address this, we utilized the TCGA breast cancer dataset (BRCA) as a cell type expressing SATB150. ATAC-seq experiments for a series of human patients with aggressive breast cancer51 revealed differences in chromatin accessibility at the extra exon of the SATB1 gene (Fig. 6f). In line with the “kinetic coupling” model of alternative splicing, the increased chromatin accessibility at the extra exon (allowing faster read-through by RNA polymerase) was positively correlated with the expression of the short SATB1 isoform and slightly negatively correlated with the expression of the long SATB1 isoform (Fig. 6f). Moreover, we investigated whether the differential expression of SATB1 isoforms was associated with poor disease prognosis. Worse pathological stages of breast cancer and expression of SATB1 isoforms displayed a positive correlation for the long isoform but not for the short isoform (Fig. 6g and Supplementary Fig. 6c). This was further supported by worse survival of patients with increased levels of long SATB1 isoform and low levels of estrogen receptor (Supplementary Fig. 6d). Overall, these observations not only supported the existence of the long SATB1 isoform in humans, but they also shed light at the potential link between the regulation of SATB1 isoforms production and their involvement in pathological conditions.”

      1. The authors should not draw conclusions based on any data which is not shown '....ed differences in chromatin accessibility at the extra exon of the SATB1 gene (data not shown), suggesting its potential involvement in alternative splicing regulation according to the "kinetic coupling" model...'. This has led to overspeculation and needs correction.

      In the revised version of the manuscript, we included the ATAC-seq data from human breast cancer patients in the revised Fig. 6f. The legend of this figure now reads: “Human TCGA breast cancer (BRCA) patient-specific ATAC-seq peaks51 span the extra exon (EE: extra exon; labeled in green) of the long SATB1 isoform. Note the differential chromatin accessibility in seven selected patients, emphasizing the heterogeneity of SATB1 chromatin accessibility in cancer. Chromatin accessibility at the promoter of the housekeeping gene DNMT1 is shown as a control. (Lines 1281-1285)” Accordingly, we have also modified the main text: “ATAC-seq experiments for a series of human patients with aggressive breast cancer68 revealed differences in chromatin accessibility at the extra exon of the SATB1 gene (Fig. 6f). In line with the “kinetic coupling” model of alternative splicing, the increased chromatin accessibility at the extra exon (allowing faster read-through by RNA polymerase) was positively correlated with the expression of the short SATB1 isoform and slightly negatively correlated with expression of the long SATB1 isoform (Fig. 6g).” (Lines 395-339)”

      Minor comments: 1. On pg 4, the authors state 'Here, we utilized primary murine T cells, in which we have identified two full-length SATB1 protein isoforms.' Whereas only one 'long' isoform is identified and the other is the canonical version. The authors should correct the statement.

      In the revised version of the manuscript, we modified this statement as follows: ”In this work, we utilized primary developing murine T cells, in which we have identified a novel full-length long SATB1 isoform and compared it to the canonical “short” SATB1 isoform.” (Lines 64-66)”

      1. Fig. 1 a , Is there a specific reason to generate a custom-made antibody for 'all' SATB1, using similar regions that are already commercially available. This becomes redundant otherwise, because there is no apparent difference in detection compared to the commercial one (Suppl. Fig 1a). Antibody generation strategy (1a) should be moved to supplementary. Additionally, authors have obtained the custom antibodies from a commercial source, therefore, the text should reflect the same alongside relevant details.

      The custom-made SATB1 antibody targeting the amino-terminal region of the protein has been developed in order to be utilized for detecting the native form of the protein. Unlike commercially available antibodies raised against either short peptides or denatured forms of the protein we have utilized the native form of the amino-terminal part of the protein for raising this antibody. To be honest, this antibody has been raised in order to be utilized in ChIP-seq experiments since no commercially available antibody is of high quality for this approach. Moreover, the original Figure 1a was utilized in order to provide an overview of the SATB1 protein structure which is highly relevant to understand its biophysical properties and not for presenting the strategy for raising a custom-made antibody for SATB1.

      1. Fig 3e: what is the control used here? In their Pearson correlation analysis, there seem to be significant reduction in control sets as well upon treatment. This needs to be clarified.

      We used scans rotated by 90° which served as a negative control, as stated in Line 769: “SATB1 scans rotated by 90° served as a negative control for the colocalization with FU.” Note that this is a commonly used control in colocalization experiments as described for example in Dunn et al., 2011 (https://doi.org/10.1152/ajpcell.00462.2010).

      Additionally, we provide Costes’ P values which are based on randomly scrambling the blocks of pixels (instead of individual pixels, because each pixel’s intensity is correlated with its neighboring pixels) in one image, and then measuring the correlation of this image with the other (unscrambled) image. Please see Costes et al., 2004 (https://doi.org/10.1529%2Fbiophysj.103.038422) for more details. Moreover, it was actually anticipated to see a decrease in colocalization upon hexanediol treatment even in the negative control, as hexanediol significantly reduces both SATB1 and FU speckles as established in Fig. 3a-d.

      1. Pg 10, the authors claim that '..., thus we reasoned that it may also be used to study phase separation...' But there have been numerous reports starting from 2018, which have utilized this technique in corelation to phase behaviour (albeit individual proteins). The authors should include proper citations as they are extending an idea from the same field to their specific need.

      In the revised version of the manuscript, we included relevant citations to support the use of Raman spectroscopy in LLPS research: “Raman spectroscopy was already used in many biological studies, such as to predict global transcriptomic profiles from living cells42, and also in research of protein LLPS and aggregation43–47. Thus we reasoned that it may also be used to study phase separation in primary T cells.” (Lines 225-228)”

      1. For Fig 5b, there should be a comparative image for 'short' isoform.

      In the revised Figure 5c we have included a comparative image for the short SATB1 isoform.

      1. In the context of Figure 5c, the authors claim ...' Note also the higher LLPS propensity of the human long SATB1 isoform compared to the murine SATB1...' Why suddenly human and mouse comparisons are drawn? This figure should be moved to supplementary.

      The comparison between the human and mouse SATB1 isoforms has been implemented because it is relevant for our claims regarding the increased SATB1 aggregation in human cells in relation to the revised Fig. 6f,g,h and Supplementary Fig. 6c,d. This is also discussed in Lines 479-482, which read: “This is particularly important given the higher LLPS propensity of the human long SATB1 isoform compared to the murine SATB1 (Fig. 5d). Therefore, human cells could be more susceptible to the formation of aggregated SATB1 structures which could be associated with physiological defects.”

      **Reviewer #3 (Evidence, reproducibility and clarity):**

      Zelenka et al., focus on a T cell genome-organizing protein, SATB1, to show that SATB1 undergoes liquid liquid phase-separation (LLPS), and distinct isoforms confer different LLPS-related biophysical properties. They generate a long-isoform specific antibody and conduct several experiments to test for LLPS and compare LLPS properties between the long-isoform relative to the whole SATB1 protein population. Given that SATB1 plays important roles in T cell development and in cancer, interrogating SATB1 biophysical properties is an important question. However, there are multiple problems with the experimental setup and data that weaken their support of the conclusions. I will detail some of the major issues below:

      Regarding phase-separation There are several assays to determine whether a protein undergoes LLPS. 1. One of the first the authors address is the spherocity or roundness. Indeed, formation of spherical droplets is one evidence of the liquid nature of a protein. However, the authors use fixed preparations (which can introduce artifacts), not free-floating protein, and determine roundness by showing a 2D image. Roundness should take into account the diffraction-limits of fluorescent imaging, as many structures can be imaged to appear round by the detector. There are quantifiable measurements that can be taken on 3D images to show roundness. This would best be shown using non-fixed protein.

      • We thank this Reviewer for several insightful comments. Although, we agree with most of them, we should highlight the main goal of our manuscript, i.e. to investigate the SATB1 protein with an emphasis on its physiological roles in primary developing murine T cells. We highlight this already in the introduction in Line 64 “In this work, we utilized primary developing murine T cells,...” and mainly also in the respective part of the result section: “To probe differences in phase separation in mouse primary cells, without any intervention to SATB1 structure and expression, we first utilized 1,6-hexanediol treatment, which was previously shown to dissolve the liquid-like droplets34.(Lines 203-205)”

      • We believe that this is a very important aspect of our study that should not be overlooked. The majority of proteins perhaps behave differently under physiological and in vitro conditions. However, due to the extensive post-translational modifications affecting the properties of SATB1, its completely different localization patterns between primary developing T cells and other cell types but especially cell lines and many other aspects, it was of utmost importance to focus our research on primary T cells. Unfortunately, this was accompanied with multiple difficulties, such as that we have to use fixed cells as this is the only way to visualize SATB1 in these cells. Alternatively, one could create a new mouse line expressing a fluorescently tagged SATB1 protein, but this is beyond the scope of our work.

      • However, we should also note that many LLPS-related studies do not pay any focus on primary physiological functions of proteins and they simply focus on the investigation of protein’s artificial behavior in in vitro conditions. Having said that, we too extended our experiments in primary cells to the ex vivo studies in cell lines to further support our claims. In these experiments, we utilized live cell imaging in Fig. 4-6, quantified the spherocity in Supplementary Fig. 6, showed the ability of speckles to coalesce in Fig. 4c and also used FRAP in Fig. 4f and also in the revised version of the manuscript in Supplementary Figure 6f. Moreover, we should note that most of these experiments were designed and performed during 2017 and 2018 conforming with the standards. We are well aware of the progress in the field and impact of fixation on LLPS, as described in Irgen-Gioro et al., 2022 (https://doi.org/10.1101/2022.05.06.490956), but after over seven months of review process in another journal we also believe that these aspects should be considered not to delay further progress of the SATB1 field.

      Regarding the isoform specificity of SATB1 biophysical properties 1. The authors generate a long isoform-specific antibody. However, the western blot is not convincing that this is indeed specific to the long isoform as there is a rather large smear. Can this be improved with antibody preabsorption? Since this is a key reagent for the manuscript, improvement in antibody quality is essential.

      The custom-made antibody for the long isoform has been raised against the unique 31 amino acids long peptide present in the long SATB1 isoform. The polyclonal serum has undergone affinity chromatography utilizing the immobilized peptide (antigen) to purify the antibody. In the revised version of the manuscript we have included another immunodepletion experiment with cleaner bands (Fig. 1f). Moreover, please read our answer to Major comment #2 of Reviewer 1 that follows: • The long antibody was raised in mice inoculated with the extra peptide present in the long isoform only. Therefore, the capacity of this antibody precipitating the shorter isoforms, which do not express the sequence of the extra peptide (EP, Figure 1a) in not possible.

      • We have repeated the immunodepletion experiment and we now provide the results in Fig. 1f and Supplementary Fig. 1b. The western blot in Fig. 1f is now cleaner and supports quite convincingly the presence of a long SATB1 isoform. Given the lack of isoform-specific knockouts which we could utilize to immunoprecipitate or detect the different isoforms in a single cell (or cell population), the utilized approach of immunodepletion and subsequent western blotting is the approach we thought of implementing.

      • As shown in Fig. 1f and Supplementary Figure 1b, the long isoform SATB1 antibody has the capacity to recognize the long isoform in murine thymocyte protein extracts but not the short SATB1 isoform (please compare lane 3 in the two western blots utilizing either the antibody for the long isoform -top panel - or the antibody that detects both isoforms (lower panel).

      • We have performed Immunofluorescence experiments utilizing the antibody detecting the long SATB1 isoform in thymocytes isolated from either C57BL/6 or Satb1 cKO mice. The antibody is specific to the SATB1 protein since there is no signal in immunofluorescence experiments utilizing the knockout cells (Supplementary Figure 1c).

      • We have performed Immunofluorescence experiments utilizing thymocytes and the antibody detecting the long SATB1 or a commercially available antibody detecting all SATB1 isoforms. The pattern of SATB1 subnuclear localization is similar for both antibodies (Supplementary Figure 1e).

      • In our accompanying revised manuscript Zelenka et al., 2022 (https://doi.org/10.1101/2021.07.09.451769), we provide yet another piece of evidence, consisting of bacterially expressed short and long SATB1 protein isoforms detected by western blot using either the long isoform-specific or the non-selective all SATB1 isoforms antibodies.

      • Regarding the additional bands detected in the immunoprecipitation experiment presented in the original Supplementary Figure 1b (lane 2), it is not surprising that additional bands appear in a sample of protein extracts that is used for several hours for the immunoprecipitation experiments, while the “input” sample simply denotes protein extract that is frozen at -80oC right after the preparation of protein extracts until use. It is well-established that SATB1 is the target of proteases which might as well be active during the immunoprecipitation steps (2 consecutive immunoprecipitation steps take place). Therefore, the immunoprecipitated material cannot necessarily be a copy of the input material displaying a single protein band even if protease inhibitors are included in the buffers.

      Taken together the experiments described here we showed that the antibody raised against the extra 31 aa long peptide, present only in the long SATB1 isoform, is specific for this isoform.

      1. Fig 4 Optodroplet experiment appears to show that the N-terminus of SATB1 can undergo LLPS. The results of this assay show that SATB1 has a domain that can undergo phase-separation in isolation, but it does not show that the protein itself is a phase-separating protein. The FRAP assay methods are not provided by the authors, but this is important, as continued light activation means proteins are continuously forming aggregates, and the bleaching for FRAP should be balanced with the levels of Cry2 activation. A very good description of the methods is described in the original Optodroplet paper: https://www.sciencedirect.com/science/article/pii/S009286741631666X?via%3Dihub#sec4

      We should note that we did follow the FRAP protocol provided by the recommended study Shin et al., 2017 (https://doi.org/10.1016/j.cell.2016.11.054). Indeed, these experiments are very tricky to perform and interpret, as every cell expresses slightly different amounts of protein which is directly associated with the different speed of optoDroplet formation, and thus its propensity to aggregate upon overactivation. On the other hand, there need to be continuous activation during the FRAP experiment as the lack of activation laser would result in fast disassembly of the optoDroplets, counteracting the FRAP results. Moreover, the optoDroplets actively move around the cell in all dimensions which makes the accurate measurement of signal intensity really challenging, even with an adjusted pinhole. Therefore, we do not think that FRAP is the best approach to examine the behavior of optoDroplets.

      Either way, we have now described the detailed FRAP protocol in Lines 889-898, which read: “For the FRAP experiments, cells were first globally activated by 488 nm Argon laser illumination (alongside with DPSS 561 nm laser illumination for mCherry detection) every 2 s for 180 s to reach a desirable supersaturation depth. Immediately after termination of the activation phase, light-induced clusters were bleached with a spot of ∼1.5 μm in diameter. The scanning speed was set to 1,000 Hz, bidirectionally (0.54 s / scan) and every time a selected point was photobleached for 300 ms. Fluorescence recovery was monitored in a series of 180 images while maintaining identical activation conditions used to induce clustering. Bleach point mean values were background subtracted and corrected for fluorescence loss using the intensity values from the entire cell. The data were then normalized to mean pre-bleach intensity and fitted with exponential recovery curve in Fiji or in frapplot package in R.”

      1. Description of analyses that authors prefer not to carry out

      **Reviewer #1**:

      Can they use the all and long isoform antibodies together, then subtract the signal from long isoform to conclude about the localization of the shorth isoform ?

      We thank the Reviewer for the suggestion, though given the differential efficiency of antibodies and other limitations of imaging experiments, we do not find the suggested experiment to have a potential to improve the quality of our manuscript. However, we should note that we have performed a pixel-based colocalization experiment between the signal detected by all isoform and long isoform SATB1 antibodies. Fluorocytogram of the pixel-based colocalization, based on 3D-SIM data is provided on the left, with quantified colocalization on the right of the revised Supplementary Fig. 5a.

      3) Lack of better staining with antibody against the long and short SATB1 isoforms after treatment with 1,6 Hexanediol. 1,6 Hexanediol treatment can change many other chromatin associated proteins to which SATB1 can be bound to indirectly. This experiment can

      We do understand the controversy and difficulties of experiments using 1,6-hexanediol treatment. However, we have to note that there is no better approach available for the investigation of LLPS in our primary murine T cells. We did use alternative approaches in ex vivo experiments, utilizing cell lines to validate our hypothesis without the involvement of 1,6-hexanediol.

      **Reviewer #2**:

      1. The authors mention, '...of the different SATB1 isoforms, uncovered by the use of the two different antibodies, relied in the heterochromatin areas (zone 1), where the long isoform was less frequently...' There is no supporting figure number mentioned. The authors need to show a zone-by-zone comparison images for 'all iso' vs 'long' iso of SATB1. Just to reiterate, there is a need for a heterochromatin mark to unambiguously call out the distinction.

      We should remind that there is an inherent difficulty to accurately compare localization of short and long SATB1 isoforms in primary cells, especially due to the lack of Satb1 isoform-specific knockout mice. There is no way to detect only the short isoform in these primary cells as there are only antibodies targeting the long or all SATB1 isoforms. Therefore, we cannot set up additional experiments probing these questions.

      In line with this, in the revised version of the manuscript, we toned down our statements regarding the differential localization of the two isoforms in primary cells. We only refer to it as an indication and we support it by adding references to the relevant figures. This part now reads: “Localization of SATB1 speckles detected by antibodies targeting all SATB1 isoforms and/or only the long SATB1 isoform, revealed a significant difference in the heterochromatin areas (zone 1, Fig. 2b), where the long isoform was less frequently present (see also Fig. 2a and Fig. 3c). Although, this could indicate a potential difference in localization between the two isoforms, due to the inherent difficulty to distinguish the two based on antibody staining, we refrain to draw any conclusions. (Lines 145-150)”

      1. Fig. 6a, The authors wished to see the effect of RNA on Satb1 nuclear localization. This is not related to the main theme of the paper, thus should be moved to supplementary (true for b as well). Importantly, the experiments should be performed with total cells to show the divergence of localization (like the paper the authors referred to) instead of matrix for clarity.

      • We did not wish to see the effect of RNA on SATB1 localization. In fact, there is a long history of SATB1 research that is inherently linked with the concept of nuclear matrix, a putative nuclear structure which is highly associated with nuclear RNAs. SATB1 was described many times as a nuclear matrix protein (https://doi.org/10.1016/0092-8674(92)90432-c; https://doi.org/10.1128/mcb.14.3.1852-1860.1994; https://doi.org/10.1074/jbc.272.17.11463; https://doi.org/10.1128/mcb.17.9.5275; https://doi.org/10.1021/bi971444j; https://doi.org/10.1083/jcb.141.2.335; https://doi.org/10.1101/gad.14.5.521; https://doi.org/10.1038/ng1146).

      • Moreover, our data discussed in comments 4-7 of this Reviewer, such as i. the localization of SATB1 to the nuclear zones associated with RNA and nuclear scaffold factors (Fig. 2b, Supplementary Fig. 1c), ii. colocalization of SATB1 with actively transcribed RNAs (Fig. 2c, Fig. 3g, Supplementary Fig. 2a, Supplementary Fig. 2c), iii. including its association with nucleoli (Supplementary Fig. 3b), and also iv. its computationally predicted interaction with Xist lncRNA (Agostini et al., 2013; https://doi.org/10.1093/nar/gks968) as a notable factor of nuclear matrix, all suggest that the interaction between RNA and SATB1 is plausible and potentially relevant for its function and/or at least its subnuclear localization. It is relevant even more so, when considering numerous reports on the ability of RNA-binding, poly-Q and PrLD-containing proteins to undergo LLPS https://doi.org/10.1016/j.molcel.2015.08.018; https://doi.org/10.1042/bcj20160499; https://doi.org/10.1016/j.cell.2018.03.002; https://doi.org/10.1016/j.cell.2018.06.006; https://doi.org/10.1093/nar/gkaa681), including RNAs specifically regulating LLPS behavior, especially for poly-Q and PrLD-containing proteins, such as SATB1 (https://doi.org/10.1126/science.aar7366; https://doi.org/10.1126/science.aar7432; https://doi.org/10.1016/j.ceb.2019.03.007; https://doi.org/10.1038/s41598-020-57994-9; https://doi.org/10.1016/j.molcel.2015.09.017; https://doi.org/10.1038/s41598-019-48883-x; https://doi.org/10.1038/s41467-019-11241-6).

      • It should also be noted that SAF and various hnRNPs, as the most prominent proteins of nuclear matrix were many times reported to phase separate (https://doi.org/10.1016/j.molcel.2019.10.001; https://doi.org/10.1074/jbc.ra118.005120; https://doi.org/10.1016/j.celrep.2019.12.080; https://doi.org/10.1038/s41467-019-09902-7; https://doi.org/10.1016/j.molcel.2017.12.022; https://doi.org/10.1074/jbc.tm118.001189). All these aspects show that the relation between nuclear matrix, SATB1 and RNA are quite relevant to our manuscript.

      • Moreover, in light of the aforementioned information, we believe that it is much clearer to follow the protocol we did – i.e. to remove soluble proteins by CSK treatment and then, upon RNase treatment, extract the released proteins using ammonium sulfate. In an experiment utilizing whole cells, one would need to microinject RNase A into the nucleus, which 1. is very challenging for primary T cells having a radius of 3-5 micrometers, 2. is of low throughput, 3. would not allow for released protein removal which would thus make the results hard to interpret. Please note that in the reference paper, the authors used cell lines overexpressing heterologous GFP-tagged proteins, which is not related to our setup.

      Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Reply to the Reviewers

      I thank the Referees for their...

      Referee #1

      1. The authors should provide more information when...

      Responses + The typical domed appearance of a hydrocephalus-harboring skull is apparent as early as P4, as shown in a new side-by-side comparison of pups at that age (Fig. 1A). + Though this is not stated in the MS 2. Figure 6: Why has only...

      Response: We expanded the comparison

      Minor comments:

      1. The text contains several...

      Response: We added...

      Referee #2

    1. If he had takencare to specify that when he said “we”, “us”, and “our” he meant each one of us, actingand responding as a user of quantum mechanics, he would have had it exactly right. Butit seems to me likely that he was using the first person plural collectively, to mean all ofus together, thereby promulgating the Copenhagen confusion
      • PEIERLS
      • (I THINK) "OUR"==="OBJETIVO", "COLECTIVO"

      • (I DON'T UNDERSTAND) "promulgating the Copenhagen CONFUSION"

      • (google) lo he encontrado en OTRO libro de Mermin, en referencia a una "CRITICA" de BELL al uso de la palabra "KNOWLEGDE" en las "explicaciones" de Heisenberg, y PEIERLS (!!!)

      • BELL: Whose Knowlegde? Knowlegde about WHAT?

      • (google) tambien aqui [MASTERPIECE-SEE 3 LAST PARAGRAPHS)

      • [Peierls responde a Bell]: "That leaves the question: whose knowledge should be represented in the density matrix? In general there will be many who may have some information about the state of a physical system. Each of them has to use his or her density matrix. These may differ, as the nature and amount of knowledge may differ. People may have observed the system by different methods, with more or less accuracy; they may have seen part of the results of another physicist. However, there are limitations to the extent to which their knowledge may differ. This is imposed by the uncertainty principle. For example if one observer has knowledge of s, of our Stern-Gerlach atom, another may not know sx, since the measurement of sx would have destroyed the other person's knowledge of J2, and vice versa. This limitation can be compactly and conveniently expressed by the condition that the density matrices used by the two observers must commute with each other."

      • (I THINK) Ha respondido a la pregunta??? I DONT THINK SO!

      • Me recuerda la "respuesta" de Bohr a EPR
      • "AMBOS" vuleven a "repetir" como "aplican" ellos la QM, pero sin "responder" a la pregunta!
    1. As We May Think

      Da a entender que los nuevos inventos han ampliado el poder físicos del hombre, pero no el de su mente. Por lo cual, Bush, dice que están a la mano instrumentos que, si se desarrollan adecuadamente darán al hombre acceso sobre el conocimiento.

    1. Author Response

      Reviewer 1

      Bailon-Zambrano and colleagues were trying to answer the general question: what contributes to phenotypic variation when a gene of strong effect is mutated?

      The work has several major strengths for answering this interesting question. First, they decided to study mef2ca in zebrafish for which they had previously shown that mutants displayed highly variable facial phenotypes. To learn how phenotypic variation depends on phenotypic severity, they realized they had studied more alleles, and so induced two more alleles to have three different types of molecular lesions (start codon mutation, premature stop codon, and full coding gene deletion). Investigating these alleles showed that increasingly severe alleles had more variation among individuals in the population but not necessarily more variation between the left and right sides of the face within individuals.

      Over several years, these investigators had spent considerable effort to select lines of fish that segregate the start-codon mutation and have either severe or weak effects on facial phenotypes. wondered: what factors were selected out of the original genetic background that would increase or decrease phenotypic severity? They hypothesized that one or more of the five mef2 paralogs in zebrafish might help to ameliorate the phenotype in the low line or reciprocally intensify the phenotype in the high line. They studied expression of the mef2 paralogs in neural crest cells by single-cell transcriptomics. They found that paralogs were downregulated in the high-penetrance line with respect to an unselected line, a result expected if expression of the paralogs contributed to buffering phenotypic severity. This experiment has two weaknesses, first that the method only examined neural crest cells but we know that signals from the ectodermal and endodermal epithelia contribute to craniofacial morphologies by diffusible signals. If genes regulating craniofacial morphologies that act in epithelia had genetic variation that contributes to severity, those genes would not be investigated in these crest-only experiments. A minor problem (which is associated with the expense of the experiment) is that the scRNA-seq experiments compared only the high and unselected lines, not the low line. To address both problems, the investigators performed qPCR on RNAs extracted from whole heads of genetically mef2ca-wild types from the high and low line. In these qPCR experiments, however, they did not investigate the unselected line. Leaving out the low line in one approach and leaving out the unselected line in the other approach somewhat weakens the strength with which one can draw conclusions (e.g., the qPCR conclusion assumes that the unselected line would be intermediate between the two selected lines) but is unlikely to change the basic conclusions the authors drew. In addition, using whole heads in the qPCR experiments, while it has the advantage that it includes epithelia, does not distinguish between genes expressed only in the crest and genes expressed in other cell types, and these experiments did not test for any genes known to affect craniofacial development that are epithelium-specific.

      In response to this comment, and those below, we removed the scRNA-seq comparing neural crest cells from unselected and high-penetrance strains. We replaced those data with new important results which considerably advance our model. We found significant paralog expression variation among unselected zebrafish families (Fig. 4D). These results strongly suggest that our breeding selected upon standing paralog variation the unselected parental strains. See more below.

      Finally, in key experiments that are a major strength of the work and require significant effort, the researchers systematically made mutations in four of the five zebrafish mef2 paralogs (mef2aa, mef2b, mef2cb, and mef2d, all except mef2ab, which didn't become mutated despite significant effort) in the genetic background of the lowpenetrance strain and studied them in single homozygotes, in double mutants, and in various heterozygous combinations. These important experiments showed that some paralogs provided significant buffering in the low-penetrance strain, the strain that up-regulated expression of these paralogs. It would be helpful in the discussion to mention that mef2ab couldn't be mutated and a phrase added about what that means for the general conclusions - in the opinion of this reviewer, the impact of this is not great but it should be acknowledged.

      We acknowledge that mef2ab couldn’t be mutated and consider what that means for the general conclusions in the text.

      A strength of the experiments is that the workers quantified effects of various genotypes by focusing on the length of the symplectic, a convenient element for quantification both within single individuals and among fish in a population. It would be helpful to have a statement on the evidence that this measure is a good representative for other aspects of the phenotype.

      We provide new data indicating that the symplectic cartilage length is significantly correlated with another mef2ca-associated phenotype (Fig. 1-figure supplement 2). See more below.

      Finally, the paper presents a model for understanding the results presented that does a good job of summarizing the data and, importantly, suggests ways to move the analysis deeper. Missing from the description of the model is a discussion about whether the genetic variation that was selected and ultimately upregulated mef2 paralogs is in regulatory elements of the mef2 paralogs themselves or whether it might be in trans-acting transcriptional regulators that simultaneously regulate all mef2 paralogs due to the authors' hypothesized 'cryptic vestigial' functions.

      We considerably revised the discussion, thoroughly considering both these possibilities.

      This work is likely to have a significant impact on the fields of developmental biology, the interpretation of human mutational variation (in for example the concept of phenotypic expansion), and the way people think about the evolution of new morphologies over time. A brief comparison of the authors' results and interpretations to those of C.H. Waddington's concept of genetic assimilation would provide improved historical context and broaden the potential impact of the work.

      We now include a discussion of our study in the context of Waddington’s genetic assimilation.

      Reviewer 2

      Bailon-Zambrano et al study the possible mechanisms that contribute to the oft-observed phenomenon that an individual mutation may be associated with variable expression of a phenotype. They focus on loss-of-function of the mef2ca gene of zebrafish, which is needed for the normal development of several craniofacial structures. They demonstrate that recessive putative loss-of-function mutant alleles of the mef2ca gene of zebrafish are associated with a range of expressivity. By focusing on one aspect of the mutant phenotype, the length of the symplectic cartilages that support the jaw, they find a correlation between the average strength of the phenotype of an allele (measured as reduction in length) and the extent of variability between mutant individuals that carry the allele. I am concerned about this conclusion and generalizations that may be drawn from focus on a single quantifiable character, the symplectic cartilage. Perhaps there is always a fixed variation in the length of this cartilage. As stronger alleles produce shorter cartilage pieces, variations in size may appear to be of greater significance when affecting shorter average length.

      We now show that the symplectic cartilage length is a good proxy for other craniofacial phenotypes (Fig. 1figure supplement 2). Further, we clarify in the text that we use the coefficient of variation (standard deviation/mean) which is the accepted best practice for determining and comparing variation. We also use the F-test statistic which is the standard statistical method to test for equality of two variances. This test tells us if the standard deviations from two datasets are significantly different.

      The authors hypothesize that one factor that contributes to the varied phenotypic expression of an allele (expressivity) is the co-expression of paralogs that may provide wildtype function and thus partially or wholly rescue the mutant phenotype. They test this hypothesis by "fixing" conditions where a single mutation may be expressed with low or high penetrance. By selective breeding based on phenotype, they create two sets of strains that carry an identical mef2ca mutation: one strain has high penetrance of the mutant phenotype and the other low penetrance. They then investigate the factors that are likely responsible for the high vs low penetrance. Historically we would call these factors "genetic modifiers". There is extensive literature on the nature of genetic modifiers and there are many current screens in both mice and Drosophila to identify genetic modifiers and uncover their nature, but there is little reference to these studies in the current manuscript. Further, there is previously published work that hypothesizes that one important function of paralogs in multicellular organisms is to provide a buffer to stabilize levels of gene expression needed for developmental decisions.

      Following this reviewer’s suggestion, we now include many new references (increased from ~50 to >80) incorporating much of the important work leading up to our study. These include referencing both genetic modifier mutagenesis screens, paralogous buffering in other systems, and “natural” modifier studies that set the stage for our work.

      The authors find that paralogs of the mef2ca gene are expressed in cells that normally express mef2ca, and that these paralogs are expressed at higher levels in the mutant strain with low penetrance than in the mutant strain with high penetrance. They say that selection for high penetrance of the mef2ca mutant phenotype "leads to down-regulation" of paralog expression. As the authors only show that paralog expression is at lower levels in high penetrance vs low penetrance strains, it is not clear what they mean by "down-regulation". Perhaps their breeding scheme has only "captured" what is natural variation and there is no active mechanism of "down-regulation". The authors need to clarify what they mean.

      Thank you for this suggestion. We clarified that we do not mean active down or up regulation but rather selection on preexisting genetic variation. This conclusion is supported by new data (Fig. 4D).

      The authors also find that individuals from the high penetrance strains that don't carry the mef2ca mutation (they are wildtype for this gene) sometimes exhibit mef2ca mutant characters. They suggest the reduced paralog expression is responsible for the occasional emergence of the mef2ca mutant characters. In contrast with this suggestion, the authors later claim the paralogs "have no function" in craniofacial development. The authors need to clarify their thoughts about what is paralog function in craniofacial development and why reduced paralog function might contribute to the expression of mef2ca mutant characters. This topic is worthy of discussion.

      We considerably revised our discussion of this topic including our interpretation that the decreased expression of mef2ca in high penetrance strain led to the phenotypes we observe in mef2ca wild types from this strain. We also are more careful with our language, stating that the paralog mutants are indistinguishable from wild types, rather than stating that paralogs do not function in craniofacial development. In fact, they do function in craniofacial development, as buffers. Thank you for this suggestion that strengthened our manuscript.

      The authors claim is there is both up-regulation of paralogs in low penetrance strains and down-regulation of paralogs in high penetrance strains. As they only compare steady state levels of expression in each strain, they can only reasonably conclude that there are differences - they seem to imply a mechanism and they need to be clear about what they are thinking.

      Excellent point. In the revised manuscript, we are clear that there is not active up or down regulation but rather selection upon preexisting variation.

      They hypothesize that paralog expression in the low penetrance strain masks the effects of loss of mef2ca. They test this by creating CRISPR-engineered mutations of two paralogs and examining the effects of the paralog mutations in wildtype fish or in fish carrying the mef2ca mutation. They find the putative loss-offunction mutations in the paralogs have no effect in wildtype backgrounds and conclude these paralog genes have no function in craniofacial development. However, the paralog mutations enhance the mutant phenotype in fish that carry the mef2ca mutation. This provides strong evidence consistent with the model that the elevated expression of the paralogs functions to reduce the severity of the phenotype associated with the mef2ca mutation.

      Reviewer 3

      In this elegant genetic study, Bailon-Zambrano et al. draw on classical genetic concepts to address the clinically pertinent question of how genetic variants in the same gene can yield wildly different phenotypes in different individuals. They focus on the Mef2c gene, which is required for craniofacial and cardiac development in humans and model organisms yet shows highly variable phenotypes across and within individuals. Previous work by this lab had established that zebrafish mef2ca craniofacial phenotypes are highly variable and, importantly, that this variability is heritable and can be selectively bred for low vs. high penetrance. The authors hypothesize that vestigial expression of paralogous genes variably compensates for loss of mef2ca, such that individuals with higher levels of paralogous genes will show lessened severity and vice versa. To test their hypothesis, they methodically quantify the penetrance, expressivity, and variability of all known mef2caassociated craniofacial phenotypes in fish carrying 1) different mef2ca mutations, 2) the same mutation but after selecting for high vs. low penetrance for many generations, and 3) mef2ca mutations combined with mutations in paralogous genes. They find that not only does allele severity directly correlate with variation, but also that different paralogs buffer the severity and variability of different craniofacial phenotypes. Another particularly interesting finding is that some of the craniofacial phenotypes are apparent even in mef2ca wildtypes from the high penetrance strain, which they explain by the very low expression of paralogs on this background. A weakness of the study is that the authors do not directly show whether paralog expression is increased in the low-penetrance strain relative to the initial, unselected genetic background. It is therefore not clear whether the selection for low penetrance worked in this manner, as the authors imply. Overall, the authors have achieved an important step forward in understanding the genetic basis for the high variability of human faces among both healthy individuals and those with craniofacial anomalies.

      We can’t go back (over ten generations) to survey the original parental strain. However, we can use the unselected AB strain as a proxy for the initial unselected genetic background. In an important addition to the manuscript, we found significant paralog expression variation between unselected AB families (Fig. 4D). These results strongly suggesting there is cryptic, standing paralog expression variation that we selected upon. We would like to thank the reviewer for this excellent critique which motivated these important new experiments considerably advancing our model.

    1. Author Response

      Reviewer 1

      Ting Tang et al. present the results of a species x genotype diversity experiment within BEF China. The authors assess the relative impacts of species and genotype diversity on community-level primary productivity of the trees and the potential mediation of this effect via interactions of plants with soil fungi and herbivores. The results show that both species and genotype diversity influence productivity via changes in herbivory, soil fungal diversity, and other unknown mechanisms. Most of the species diversity effects could be directly related to functional diversity, while genotype diversity effects were not well represented by the way functional diversity was measured in this study.

      Thanks for the positive comments on the paper.

      The study is based on an impressive experiment that will certainly allow achieving major insights into the role of genotype and species diversity on ecosystem functioning. However, there are some significant shortcomings in the methods that limit this study. In particular, the incomplete assessment of functional traits, herbivory, and fungal diversity across the subplots used for this study reduces statistical power. Specific measurements of traits, herbivory and fungal diversity in each plot would substantially simplify the design and the analyses and likely also reduce the unexplained variance observed in the study. However, this is nothing that can be changed now and has the likely explanation of feasibility constraints.

      Thank you for the positive comments on the paper and the understanding of the feasibility constraints. In our study, functional traits of all the seed families of the four species across all the species × genetic diversity combinations were sampled, but to reduce circularity, we used the seed-family means across all tree diversity combinations to calculate functional diversity for every subplot instead of only using the functional trait measures obtained in that particular subplot. We have taken up the suggestion to also calculate functional diversity based on trait measurements of individual trees, but also here used data across all plots to reduce circularity. Additionally, we now acknowledge the incomplete assessment of herbivory in the Methods and state that fungal diversity in plant species mixtures was sampled on plot level because of feasibility constraints.

      Lines 334–337: “To reduce circularity, we used the seed-family means across all species × genetic diversity combinations to calculate FDis values per subplot that did not only depend on the functional trait measures obtained in that particular subplot. Using traits measured in a particular subplot to calculate FDis for that subplot bears the risk that the measured traits reflect a response to the local environment, yet we want to use FDis as a predictor variable for the performance of that subplot.

      Lines 380–382: “The mean value of herbivore damage per species × genetic diversity level was used to fill in missing values in a few subplots with tree individuals lacking herbivory data (Table S3).

      Lines 385–388: “Soil fungal diversity was used as a proxy of unspecified trophic interactions. To be consistent with the species and genetic diversity treatment design, soil samples were taken on subplot level for the 1.1 and 1.4 diversity treatments, but, due to feasibility constraints, on plot level for the 4.1 and 4.4 diversity treatments in 2017.”

      The writing of the manuscript is generally good. However, given the somewhat diffuse results obtained for genetic diversity effects, they receive a lot of attention in the discussion, while species diversity effects are little mentioned. This could be better balanced and also referred back to the hypotheses. For example, I miss the discussion of the very clear hypothesis that genotype diversity effects are positive in species monocultures but neutral in species mixtures. How do your results fit with this hypothesis? My general impression is that the study is very well framed, but lacks to stick to this frame in the discussion. I am aware that this might be a challenge with the results obtained, but worth trying.

      Thank you for the positive comments on the writing and pointing out the unclear part of the genetic diversity effects. To better connect the discussion to our hypothesis that genotype diversity effects are “more important in species monocultures than in species mixtures” (lines 114–115), we have rewritten the corresponding Discussion section.

      Lines 248–164: “In contrast of our second hypothesis, we found that the effects of genetic diversity via functional diversity and multi-trophic feedbacks were negative in species monocultures but positive in the species mixture (Fig. 5 and Fig. S3). We found genetic diversity had positive effects on tree functional diversity and soil fungal diversity, which supports the trade-offs between genetic and species diversity discussed in the previous section. However, the hypothesized positive effects of tree functional diversity on productivity turned negative in species monoculture. This result indicates that functional diversity may not have positive effects on the ecosystem functioning under low environmental heterogeneity, i.e. species monocultures in our study (Hillebrand and Matthiessen 2009). Therefore, our findings show that the different effects of genetic diversity on tree productivity between species monocultures and mixtures, not only depend on the different effects of genetic diversity on functional diversity and trophic interaction but also on the varied tree productivity consequences from functional diversity and trophic interaction on tree productivity between species monocultures and mixtures. Moreover, other aspects of tree genetic diversity seem to play an important role not only for productivity in tree species mixtures (see previous section) but also for productivity in tree species monocultures. These may include unmeasured functional traits such as root traits (Bardgett et al., 2014) or unknown mechanisms underpinning effects of tree genetic diversity.

      Given the complex results obtained, I also feel that the title and main message received in the abstract do not fully reflect the results. Genetic diversity effects on productivity, but also on herbivory and fungal diversity, are not general (e.g. Fig. 2) nor are all genetic diversity effects on productivity mediated by functional diversity and trophic feedback. I think the title and main message of the study should be articulated more precisely.

      In this study we did not find direct effects of genetic diversity on tree productivity in the binary analyses (Fig. 2), but we did find indirect effects of genetic diversity on tree productivity via functional diversity and trophic feedbacks in the path analysis (Fig. 4). Now we have pointed this out in the Discussion.

      Lines 201–204: “Although only species diversity but not genetic diversity was found to affect tree productivity in binary analyses, both kinds of diversity positively affected tree community productivity and trophic interactions via functional diversity according to our structural equation models (SEMs) depicted in the corresponding path-analysis diagrams (see Fig. 4).

      We agree that not all genetic diversity effects on productivity were mediated by functional diversity and trophic feedbacks. This may have been because we did not include all relevant functional traits and trophic interactions in this study. Nevertheless, our findings support the hypothesis that genetic diversity can affect productivity via functional diversity and trophic feedbacks and suggest more possibilities for further research. We have explained this in the Discussion.

      Lines 230–238: “Even after accounting for tree functional diversity and trophic feedbacks, we still detected a direct negative effect of tree genetic diversity on tree productivity, while the direct effect of tree species diversity was fully explained by functional diversity and trophic feedbacks. This suggests that aspects of genetic diversity that do not contribute to functional diversity or trophic interactions as measured in this study may reduce ecosystem functioning, e.g. due to trade-offs between genetic diversity and species diversity. For example, it has been shown that in species-diverse grassland ecosystems, niche-complementarity between species can increase at the expense of reduced variation within species (van Moorsel et al., 2018; van Moorsel et al., 2019; Zuppinger-Dingley et al., 2014; Zvereva et al., 2012).

      Lines 260–264: “Moreover, other aspects of tree genetic diversity seem to play an important role not only for productivity in tree species mixtures (see previous section) but also for productivity in tree species monocultures. These may include unmeasured functional traits such as root traits (Bardgett et al., 2014) or unknown mechanisms underpinning effects of tree genetic diversity.”

      Reviewer 2

      This study aims to disentangle the contributions of genetic and species diversity to tree community fitness. It confirms the role of genetic diversity in functional and ecological traits but shows how these effects change when plant species diversity is increased, which can potentially add to our understanding of the interplay between plant diversity at various levels and community and ecosystem functions. It would be desirable to make emphasis whether differences between the effects of genetic and species diversity are comparable since they can act at complementary but different levels. It is hard to establish whether the effects of species diversity override the effects of genetic diversity by shared mechanisms; or whether a high species diversity reduces plant intraspecific interactions and the consequent effects of genetic diversity by density-dependent effects. However, this point has to be emphasized in the discussion.

      Thank you for your positive comments on this paper. In the binary analyses in this paper, we used general linear mixed-model analysis to detect the effects of genetic diversity within species. Now we have clarified this in the Methods and the Results section. However, in Fig. 2 we also indicate the significance of the main effect of genetic diversity. We do not focus on this because of the interaction between species and genetic diversity. In statistical terms, fitting genetic diversity effects separately for species monocultures and mixture (2 degrees of freedom) is equivalent (i.e. has the same sum of squares) as fitting the main effect of genetic diversity (1 degree of freedom) and the interactions species x genetic diversity (1 degree of freedom).

      Lines 415–424: “To determine how species and genetic diversity and their interaction affected tree functional diversity and trophic interactions, linear mixed-effects models (LMMs) were fitted with two types of contrast coding. In the first, we used the ordinary 2-way analysis of variance with interaction and in the second we replaced the genetic diversity main effect and the interaction with separate genetic diversity effects for species monocultures and the species mixture (Table S6). Note that as our design was orthogonal, fitting sequence did not matter in either of the codings. However, we focused our major analysis on the second type of coding to make it consistent with our hypotheses. Main effects of genetic diversity are presented in inset panels in Fig. 2. Our second contrast coding ensured that we tested the effects of genetic diversity separately in species monocultures and species mixture, but within the same analysis.

      Lines 120–121: “Using linear mixed-model analyses, we tested the effects of species diversity and genetic diversity within species on trophic interactions and community productivity.

      Meanwhile, to emphasize that species diversity and genetic diversity could affect each other, we discussed that the trade-offs between species and genetic diversity could contribute to the effects of tree diversity on tree community productivity. We also discussed that the different effects of genetic diversity between species monocultures and mixtures may occur because different biotic environments resulted from different species diversity.

      Lines 232–238: “This suggests that aspects of genetic diversity that do not contribute to functional diversity or trophic interactions as measured in this study may reduce ecosystem functioning, e.g. due to trade-offs between genetic diversity and species diversity. For example, it has been shown that in species-diverse grassland ecosystems niche-complementarity between species can increase at the expense of reduced variation within species (van Moorsel et al., 2018; van Moorsel et al., 2019; Zuppinger-Dingley et al., 2014; Zvereva et al., 2012).

      Lines 250–260: “We found genetic diversity had positive effects on tree functional diversity and soil fungal diversity, which supports the trade-offs between genetic and species diversity discussed in the previous section. However, the hypothesized positive effects of tree functional diversity on productivity turned negative in species monoculture. This result indicates that functional diversity may not have positive effects on the ecosystem functioning under low environmental heterogeneity, i.e. species monocultures in our study (Hillebrand and Matthiessen 2009). Therefore, our findings show that the different effects of genetic diversity on tree productivity between species monocultures and mixtures, not only depend on the different effects of genetic diversity on functional diversity and trophic interaction but also on the varied tree productivity consequences from functional diversity and trophic interaction on tree productivity between species monocultures and mixtures.

      The experimental design has to be explained in more detail, in particular how plants were planted in the species monocultures. It is not stated whether the same or different species were used in the plots or in subplots. The design lacks proper replication for the treatment with high genetic diversity in species monocultures (n=2) which could lead to a biased result, especially if those plots were located in the same area.

      Thank you for the valuable comments on the experiment design. In total, we used four species and eight seed families per species for the whole experiment, and now we have added a diagram of the experimental design to the supplementary material (Fig. S5) to show the species and seed-family information for every subplot. Furthermore, we have added a table to the supplementary material to indicate the occurrence time of each species and each seed family across all the tree diversity-treatment combinations (Table S2). The high genetic diversity in species monoculture (1.4 treatment) was replicated 2 times per species and thus had 8 replications (Fig. S5). However, because we did not have enough seedlings, we could only establish these treatments at subplot level and thus put the different species for the 1.4 treatment into only two plots. Now we have added more explanation of the plot design in the Methods part. The plot distribution was completely randomized across the experimental site and plots of the same treatments were mostly located at least 50 m from each other (see Fig. 1 from Bongers et al., 2020, pasted here further below). The reason that there are more plots for the 1.1 treatment is that typically in biodiversity experiments more plots are needed at the lowest diversity treatment because of the desire to have all seed families occurring in any mixture also present as monoculture. Regarding the point that the four diversity treatments varied between rather than within plots, we ensured that diversity effects were tested at the plot level by including plot as random-effects term in the mixed models.

      Lines 305–323: “For each of the four species, we collected seeds from eight mother trees to allow for two replications of four-family mixtures per species. Furthermore, to avoid the effects of unequal representation of particular seed families and correlations between seed family presence and diversity treatments, we made sure that every seed family occurred the same number of times at each diversity level (see Table S2, small deviations from the rule were required where not enough seeds from a seed family could be obtained). Due to budget limitations and the number of replicates required per single seed family, the 1.1 and 1.4 diversity treatments were applied at subplot level (0.25 mu) and replicated 32 and 8 times, respectively. The 4.1 and 4.4 diversity treatments were applied at plot level (1 mu) and were replicated 8 and 6 times, respectively (Fig. S5; see also Fig. 1 in Bongers et al., 2020). To allow for simpler analysis, we obtained most community measures at subplot level also for the 4.1 and 4.4 diversity treatments and thereafter used the subplots for all tests of diversity effects on these community measures, including plots as error (i.e. random-effects) term for testing the diversity effects in the corresponding mixed models. In total, because one 1-mu plot could not be established due to logistic constraints, the number of subplots used was 92 (32 subplots of 1.1, 8 subplots of 1.4, 28 subplots of 4.1 and 24 subplots of 4.4 diversity treatment). Note that in biodiversity experiments lower richness levels represent more different communities and thus require more plots. For the highest richness level, where there is typically only one species composition, this same community is typically replicated multiple times, as we did here for the 4.4. diversity treatment.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      The manuscript describes the formation of supernumerary centriole protein assemblies ("cenpas") upon silencing of the E3 ubiquitin ligase TRIM37. These "cenpas" resemble centrioles, centriole precursors, or electron-dense striped structures, termed "tigers". Similar observations are made in cells from patients lacking functional alleles of TRIM37. The "cenpas" usually lack the full complement of centriolar proteins, but contain increased amounts of the pro-centriole marker centrobin. It is further shown that the formation of "cenpas" depends on centrobin, or on a parallel pathway involving Plk1 and SAS-6. Overall, the experiments in this study are of high technical quality and most of them are carefully controlled. The discovery of centrobin-containing striped protein assemblies ("tigers") is very interesting and provokes the question of their molecular composition and their mechanistic role in centriole assembly. Since striated fibres containing the protein rootletin have a similar periodicity of stripes (75nm) as the "tigers" in this study (Vlijm et al., PNAS 2018, 115:E2246-53), I was wondering whether the authors couldn't simply test for colocalization of their "tiger"-stripes with rootletin. A potential identity of "tigers" with striated fibres would help understanding the mechanisms of "cenpas" and centriole assembly upon depletion of TRIM37: striated fibres or "tigers" might be controlling the balance of centriole cohesion vs. disengagement and thereby centriole duplication, or they might play a role in the recruitment of additional proteins involved in pro-centriole assembly.

      We are grateful to the reviewer for this interesting suggestion. Accordingly, we will test the distribution of Rootletin and potentially CEP68 by immunofluorescence analysis of cells depleted of TRIM37.

      In the same context, did the authors correct for the experimentally induced sample expansion in Figure 5B, when comparing inter-stripe distances between U-ExM and EM samples?

      Yes, we did. We will clarify the text of the revised manuscript to make this more explicit.

      Other major points: The amount of TRIM37-depletion upon siRNA-treatment should be indicated prominently. I see in the "Materials and Methods" and in Fig. S4 that quantitative RT-PCR has been performed. Could Western blotting be performed to have direct information on the protein levels? Fig. 2C demonstrates that this is possible in cells from human patients, so why are there no data on the majority of other experiments in this manuscript?

      We previously reported Western blot analysis to estimate the extent of TRIM37 depletion upon siRNA treatment (Balestra et aI., 2013). However, following the suggestion of the reviewer, we will repeat this analysis for select experiments of this study.

      Moreover, what is the transfection efficiency in the siRNA experiments? Is there variability between cells that might explain variability in the "cenpas" phenotypes?

      The reviewer brings up an interesting point. However, in the absence of an antibody to detect endogenous TRIM37 by immunofluorescence analysis, we cannot provide an accurate figure in this case. We will mention this limitation explicitly in the text of the revised manuscript.

      Minor point: In line 353 (page 12), it is stated that centrobin in si-TRIM37 cells migrates slower (Fig. 4D), suggesting that TRIM37 regulates the post-translational state of centrobin. It looks to me as if the corresponding gel in Fig. 4D was "smiling" (see curvature of centrobin in the neighboring lane). I think that the authors should tone down their statement, or replace Fig. 4D with a more convincing image.

      We thank the reviewer for having noticed this. We will provide another gel that is not “smiling” -the difference in migration has been observed in a reproducible manner.

      Reviewer #1 (Significance):

      The findings of this manuscript are highly significant for our understanding of centriole biogenesis. They should be of interest to a large community of cell biologists working on mitosis and on the centrosome, and they are of further importance for biomedical research related to developmental growth abnormalities (Mulibrey nanism). The manuscript shows for the first time a mechanistic link between TRIM37-dependent control of centrobin protein levels, and their impact on the formation of centriole precursors during the cell cycle. The manuscript is well presented, and the relevant scientific literature is cited correctly. However, I would prefer that a potential relationship between "cenpas", "tigers", and the welldescribed rootletin-containing striated fibres be discussed, if not controlled by additional experiments.

      We thank the reviewer for her/his appreciation of our work and support for publication.

      Field of expertise of this reviewer: centrosome, microtubules, mitosis, cell culture, light and electron microscopy, biochemistry.

      Reviewer #2 (Evidence, reproducibility and clarity):

      In this work, the authors investigated roles of TRIM37 in regulation of centriole numbers. It was previously observed that depletion of TRIM37 results in supernumerary centrioles and centriole-like structures (Balestra et al., Dev. Cell, 2013; Meitinger et. al., 2016). Here, the authors characterized these centriolar protein assemblies (Cenpas). Cenpas were formed, following an atypical de novo pathway and eventually trigger centriole assembly. They observed that Centrobin is frequently present in Cenpas from the early stage and other centriolar components are sequentially recruited. Furthermore, they established that Cenpas formation upon TRIM37 depletion requires PLK4 activity. TRIM37 depletion also activates PLK1-dependent centriole multiplication. 1.They propose that the tiger structure acts as platform for PLK4-dependent Cenpas assembly. Cenpas may evolve into centriole-like structures after a stepwise incorporation of other centriolar proteins. Fig. 6E suggests that a series of events seem to occur within G2 phase. Therefore, this reviewer suggests to perform a detailed time-course experiments at G2 phase. According to the model, the Centrobin-positive tiger structures may appear first, and then a Centrobin- and centrin-2-double positive structure starts to appear.

      We fully agree with the reviewer that this is an important experiment, which we will perform by analyzing TRIM37 depleted cells at successive time points after release from a double thymidine block, using antibodies against Centrobin and Centrin.

      2.They claim that Mulibrey patient cells exhibited evidence of chromosome mis-segregation, as would be expected from multipolar spindle assembly, and conclude that Cenpas are present and active also in Mulibrey patient cells. Chromosome mis-segregation may be observed in the normal cells, too. Therefore, they have to perform statistical analysis on Fig. 2D.

      In response to this suggestion and to the related comment of reviewer 3 (see below), we will conduct additional immunofluorescence analysis and quantification of patient and normal cells, assessing the distribution of Centrin, Centrobin, microtubules and γ-tubulin, as well as scoring the extent of chromosome mis-segregation.

      3.In Fig. 2A, They claimed that mitotic microtubules were disrupted with the cold treatment for 30 min. In our experience, cold treatment for 30 min is not sufficient to disrupt mitotic microtubules. They may show control panel before microtubule regrowth.

      We will show the control panel as requested.

      Reviewer #2 (Significance):

      Significance of this work resides in identification and description of Cenpas as a novel centriole assembly pathway. The authors used cutting-edge microscopy techniques to visualize Cenpas. The manuscript raised more questions than answers. Nonetheless, it is worth to publish the manuscript after revision.

      We thank the reviewer for supporting publication after revision.

      Reviewer #3 (Evidence, reproducibility and clarity):

      Balestra and colleagues investigate the function of Trim 37 in centrosome biogenesis. Trim 37 is a ubiquitin ligase that has previously been identified by the authors as a regulator of centriole duplication. Mutations in Trim37 cause a rare syndrome named Mulibrey that is responsible for a severe form of dwarphism Here they show that depletion of Trim37 in human cells results in the assembly of structures that they name Cenpas. They follow the possibility that Trim37 localises to the centrosome, which might inhibit the assembly of these structures. Further they show that Trim37 depleted cells (or in patient fibroblasts ) assemble multipolar mitosis. Further analysis shows that what the authors defined as abnormal centriole structures are formed in Trim37 depleted cells. These structures recruit centrobin, a daughter centriole component and this process requires the activity of PLK4 and PLK1. Major comments: This study characterizes Trim37 and its possible role in centriole biogenesis. Most conclusions are convincing, although some of the claims taken by the authors might require more data to be corroborated.

      1)The major point to be taken into consideration in my opinion relates with the Cenpas structure. According to the beautiful cryo-EM data shown on Fig 3, I wonder why the authors describe these structures as centriole like- or centriole related. I think these appear very different from centrioles and this might be even quite interesting if these structures nucleate microtubules and can participate in mitotic spindle assembly.

      We have a different opinion on this point. Most of the “centriole-like” or “centriole-related” structures do resemble the organelle, in that they contain microtubule bundles and are of a related size (in addition to bearing centriolar markers). However, recognizing that the distinction between these two categories of structures is somewhat arbitrary, we will combine them into the most prudent term “centriole-related”, and further explain in the revised manuscript that they comprise a range of structures.

      The authors correlate these non-canonical centriole structures as possible microtubule nucleators that might be responsible for multipolar configurations like in Fig 2D. This correlation has to be established. In Figure 2D, the authors analyze configurations of mitotic cells in terms of centrosome number and characterized frequency of extra foci. To me the foci they show are quite different in nature. Poles 1 and 3 have both centrin and g-tubulin (presumably centrioles), pole 2 has only a tiny amount of centrin and no g-tubulin, while pole 4 appears to contain both but less of each protein. So the question is are they all nucleating microtubules and participating in spindle assembly? This is particularly important in light of what the authors then mention, which is the occurrence of chromosome mis-segreation in patient cells (this is not shown). Also they describe these extra poles, and then say that Cenpas are active in patient cells. But, active in which manner? By nucleating microtubules? First, in either siRNA cells or in patient cells the authors should analyze microtubules and show that all the extra poles (made of non-canonical centriole) nucleate microtubules and participate in spindle assembly.

      In response to this suggestion and to the related comment of reviewer 2 (see above), we will conduct additional immunofluorescence analysis and quantification of patient and normal cells, assessing the distribution of Centrin, Centrobin, microtubules and γ-tubulin, as well as scoring the extent of chromosome mis-segregation.

      If they want to propose that this might be the cause of genome integrity loss in patients (as stated in the abstract and suggested a few times throughout the paper) they have to show that cells divide abnormally and generate aneuploidy progeny.

      See response just above.

      2) Another important point that is only partially addresses is the function of Trim37 in stabilizing centrobin. Does Trim37 ubiquitinates centrobin? While the western blot on Figure 4 shows an increase at 8hrs in Trim37 RNAi, this is also the case for tubulin (Fig 4E). But the overall levels appear only slightly increased when compared to its levels at time point zero (Fig. 4F). I can see that in siRNA Ctrl Trim 37 levels go down, but it is still present so how do they explain the lack of Cenpas in this case? Is there a threshold that supports centriole duplication without any major defect but accumulation of a certain level of centrobin then generates Cenpas? Can the authors generate Cenpas just by over-expressing centrobin directly?

      It appears from the comment of the reviewer that we were not sufficiently clear here. The experiment reported in Figure 4E and 4F is done in the presence of cycloheximide to analyze the half-life of Centrobin in control conditions and upon TRIM37 depletion. We will clarify the text in the revised manuscript to facilitate understanding.

      In Figure 2, they analyze configurations of mitotic cells in terms of centrosome number and characterized frequency of extra foci. To me the foci they show are quite different in nature. Poles 1 and 3 have both centrin and g-tubulin (presumably centrioles), pole 2 has only a tiny mount of centrin and no g-tubulin, while pole 4 appears to contain both but less of each protein. So the question is are they all nucleating microtubules and participating in spindle assembly? This is particularly important in light of what the authors then mention, which is the occurrence of chromosome mis-segreation in patient cells without showing it. Also they describe these extra poles, and then say that Cenpas are active in patient cells. But, active in which manner? By nucleating microtubules? This has to be shown. Also analysis of mitosis should be included to back up a defect in chromosome segregation and also to identify which type of defect.

      The above section is a copy/paste mistake (as indicated also in a correspondence between Review Commons and the reviewer).

      So in conclusion, the link between Cenpas and multipolarity has to be better investigated in my opinion. This should not be time consuming and also not extremely costly. Authors should label spindle MTs in patient fibroblasts to show that indeed Cenpas are nucleating microtubules. Ideally Cenpas would be distinguished by centrobin labeling. In siRNA depleted cells maybe time lapse microscopy can be used to image mitosis and show a correlation between Cenpas and multipolarity?

      As mentioned above, we will conduct additional immunofluorescence analysis and quantification of patient and normal cells, assessing the distribution of Centrin, Centrobin, microtubules and γ-tubulin, as well as scoring the extent of chromosome mis-segregation.

      The data is presented without statistical analysis on the figures only on Fig legends, This is really difficult for the reader. The number of experiments and cells analyzed maybe should be also included in each Figure.

      We had kept this information to the legends merely to have lean figures, but will consider moving it to the figure panels in the revised manuscript.

      Minor comments: Some picture lack scale bars

      Apologies. This will be fixed.

      the localization of GFP-Trim37. On Figure 1 the authors describe a different localization when fused to a NES localization. It is true that a dotty signal is seen on the panel of NES (Figure 1D), but a nuclear signal is not seen on Trim-GFP in any of the images provided. Shouldn't this be the case?

      There is some GFP-TRIM37 nuclear signal in the left panel of Figure 1D, although it is very weak. We will explore the possibility of providing an inset with adjusted brightness/contrast to emphasize this point.

      Fig 1C is missing a siCtrl.

      The control quantification will be included (no extra centrioles are present in this case).

      Why Trim37GFP does not rescue completely the assembly of the extra foci?

      In general, there can be many reasons why rescue in such an experimental setting is not complete, including slightly different protein levels, distribution, or interaction with partner proteins. Such possibilities will be discussed explicitly in the revised manuscript.

      In Fig 6E, are the authors sure that in the condition of siTRim3 plus si Centrobin and Plk1 inhibition, cells are not stuck in S-phase? This might explain the lack of being in a permissive G2 phase to generate Cenpas?

      Although Plk1 inhibition is not expected to block cells in S phase, we cannot rule out this possibility from the data currently available. Therefore, we plan to conduct FACS analysis in a repeat of this experiment to assess cell cycle status.

      The data is presented without statistical analysis on the figures. This can be found on figure legends, but it is better to include on the figures to facilitate the reader's job. The number of experiments and cells analyzed maybe should be also included in each Figure?

      As mentioned above also, we had kept this information to the legends merely to have lean figures, but will consider moving it to the figure panels in the revised manuscript.

      Reviewer #3 (Significance):

      Interesting findings and quite novel since a role for Trim 37 in centriole biogenesis has never been reported. Also quite interesting the possible link between multipolarity (needs better characterization) and Mulibrey syndrome.

      We thank the reviewer for recognizing the interest and novelty of our work

    1. I was thinking about everything, and that includes general relativity theory, since actually this theory is rather complicated. It has many branches and there was a lot of material which had been worked out for many years. People have studied it, and quantum gravity is extremely complicated. I was just lucky that such beautiful things were at the surface so I could see them. You see, my mind is not very technical. I work best of all in those places where I can use my intuition. Lightman: That's very interesting. I'd like to start asking you questions about that. I've noticed from your technical papers and in your paper in Physics Today[6] and your lectures that you describe things intuitively, with pictures and so forth. I know there have been certain physicists in the past who have used images and visualization and pictures more than other physicists. I think Einstein used a lot of visual images. All of his Gedanken experiments were based on mental images rather than on writing out equations. Even here [at Harvard] we make a joke in the physics department that Weinberg is very technical and [Sheldon] Glashow is very intuitive. So there do seem to be different styles of doing physics. One question that I've been very interested in, and some psychologists are interested in too, is how physicists use mental pictures. Maybe not exactly pictures but, for example, the way we say in quantum mechanics that sometimes things act as particles and sometimes as waves. I guess we're attempting to make a connection to our daily experience with the world. How do you use images in your work? Do you find images useful or harmful? Linde: Typically, I just use them. Of course, I use mathematics, certainly. Lightman: Of course. Linde: But first we usually have a rough idea of how it could work and why, and what is the purpose. Without understanding the purpose of what we are doing, you may try many different ways and you just solve equations without understanding why it is necessary.
      • "METHOD"
    1. Author Response

      Reviewer 1

      In this article Farrell et al. leverage existing datasets which measure frailty longitudinally in mice and humans to model 'robustness' (the ability to resist damage) and 'resilience' (the ability to recover from damage), their dynamics across age, and their relative contributions to overall frailty and mortality. The concept of separating damage/robustness from recovery/resilience is valid and has many important applications including better assessment and prediction of effective intervention strategies. I also appreciate the authors' sophisticated attempts to effectively model longitudinal data, which is a challenge in the field. The use of human and mouse data is another strength of the study, and it is quite interesting to see overlapping trends between the two species.

      While I find the rationale sound and appreciate the approach taken at a high level, there are a few key considerations of the specific data used which are lacking. The authors conceptualize resilience based on studies which primarily use short time scales and dynamic objective measures (ex. complete blood cell counts in Pyrkov et al.) often in conjunction with an acute stress stimulus. For example, they heavily cite Ukraintseva et al. who define resilience as "the ability to quickly and completely recover after deviation from normal physiological state or damage caused by a stressor or an adverse health event."

      Resilience and robustness are typically studied at short time-scales, with small numbers of continuous health attributes. We study transitions of binary health attributes, which we call damage and repair, and which we suggest should be thought of as resilience and robustness. Our approach is well suited for studying large numbers of binary health attributes over long time-scales without acute external stimuli. How resilience and robustness in these limits (binary, large numbers, long times, intrinsic dynamics) compare with resilience and robustness as has been typically measured (continuous, short times, acute stimuli) is an interesting and important question that arises from our work.

      Given these definitions, the human data used seem to fit within this framework, but we should carefully consider the mouse data. The mouse frailty index is a very useful tool for efficiently measuring the organismal state in large cohorts. A tradeoff for quickly measuring a broad range of health domains is that the individual measurements are low resolution (categorical) and involve inherent subjectivity (which may be considered part of the measurement error). Some transitions in individual components are due to random measurement error and I believe this is especially likely with decreases (or 'resilience' transitions).

      The reason I think the resilience transitions are subject to high measurement error is that I am skeptical as to whether many of the deficits in the mouse index are reversible under normal physiologic conditions. For example, it is exceptionally unlikely for a palpable/visible tumor to resolve in an aged mouse over the time scales studied here, thus any reversal that was observed is very likely due to random measurement error. Other components which I have doubts about reversibility are alopecia, loss of fur color, loss of whiskers, tumors, kyphosis, hearing loss, cataracts, corneal capacity, vision loss, rectal prolapse, genital prolapse.

      In summary, I applaud the authors' efforts in generating complex models to better understand longitudinal aging data. This is an important area that needs further development. I appreciate their conceptualization of resilience and robustness and think this framework has an important place in aging research. I also appreciate their cross-species approach. However, the authors may have over-conceptualized and made some assumptions about the mouse data which may not be valid. It will be important to assess the results with careful consideration of the time scales of the underlying biology and the resolution and measurement error inherent to these tools.

      For each of our mouse attributes, there are published studies demonstrating reversibility (see our new Supplementary Table 1). Nevertheless, we cannot distinguish what causes the observed discrete transitions (measurement error, stochastic fluctuations in underlying organismal features, or logisticlike continuous transitions in underlying continuous variables). We analyze the discrete data as given.

      The question of time-scale is interesting. From survival curves of individual binarized attributes, we obtain reasonable fits to exponential models (i.e. a single timescale) see Fig 5 supplement 1 and 2. For the human data there are a broad range of timescales for both robustness and resilience. For the mouse data there appears to be a similarly broad range (note the logarithmic scale) though with considerable uncertainty. We work with the data we have, so we are unable to probe shorter timescales than the measurement interval (months for mice, and years for humans). We have reinforced this caveat in the discussion.

      Reviewer 2

      This study uses repeated measurements of the frailty index (FI), composed of multiple binary parameters. It is posited that newly detected changes in the number of these parameters represent damage and that the parameters that have previously been detected but are not detected currently represent damage repair. Statistical treatment then follows, deriving resilience and robustness and their changes over time. This is an interesting idea. Strengths of the study include analyses across species (mice and humans), including multiple datasets in mice.

      To be clear, our data analysis is on the binary health attributes that are used in the FI. By considering the damage/repair (binary transitions) of individual attributes, we can obtain the aggregate damage/repair rates.

      What are the elements of FI that increase at each period of life, and what are those that decrease? For example, humped phenotype or alopecia are more likely to appear in old mice and are essentially irreversible, whereas weight loss due to infection may be more common in young mice and is reversible. Therefore, the choice of health deficits would affect the model and, for example, may artificially lead to a decreased value of what the authors call damage repair.

      More generally, information on the frailty index lacks sufficient details. I doubt that this method has sufficient accuracy to draw conclusions from as little as 32 female mice (21 + 11 animals in datasets 1 and 2) and 63 males (13 + 6 + 44 animals in datasets 1, 2 and 3). Also, only 25 enalapril-treated mice of each sec were analyzed, and only 17 exercised mice (11 females and 6 males). The number of human participants is large, but the total follow-up period is not shown, and the subjects were assessed based on 23 parameters only.

      We have not examined other choices of health attributes. While we picked standard sets from available data, we do not know whether other attributes would behave differently. It would be difficult to do our detailed modelling on single attributes in the mouse data, since the data is so sparse. Our approach was developed specifically to be able to draw conclusions from limited mouse data. Where possible we aggregate the individual mice, sex, health attributes, studies, and measurement times. The analysis of human data shows that the approach generalizes.

      While we have mostly not studied individual attributes (we have considered survival times, but without age or time effects), we would expect that some of them may have behavior that qualitatively differs from our aggregate results. If attribute selection was biased towards (or away from) qualitatively distinct behaviors that would, of course, be reflected in aggregate results. We suspect that this would be unlikely, but that any such distinctive behavior would be interesting and important to identify and understand. We have added some discussion on this point, since we cannot exclude this possibility.

      A key assumption in this work is that increased FI is equivalent to the rise in damage. However, the relationship between changes in FI and damage is unknown. One can imagine a situation when damage increases, but protection also increases. In this case, fitness may increase, decrease or remain unchanged. What is the basis for calling an increased number of health deficits damage? Is there a more reliable method to measure damage that could support the authors' claims?

      See also discussion point #1 in essential revisions. We call binary states 0 “healthy” and 1 “damaged”, but we could instead say “more healthy for most individuals” and “less healthy for most individuals” – where “healthy” means associated with desirable (low FI and low mortality) health outcomes. We have not explored other measures of organismal damage. We have not explored how interactions between variables could affect resilience or robustness for individuals. We do not think that alternative approaches would be easy to study without much more data (for mice) that is more finely resolved in time (for mice and humans). We are quite happy to have found an approach to use with binarized data, but would welcome viable alternative approaches to compare with.

      Reviewer 3

      In this work, the authors aimed at investigating two related components of aging-related processes of health deficits accumulation in mice and humans: the processes of damage (representing the robustness of an organism) and repair (corresponding to resilience), and at determining how different interventions (the angiotensin-converting enzyme inhibitor enalapril and voluntary exercise) in mice and a representative measure of socio-economic status (household wealth) in humans affect the rates of damage and repair. Two key elements in this study allowed the authors to achieve their goals: 1) the use of relevant data containing repeated measurements of health deficits from which they were able to compute the cumulative indices of health deficits in mice and humans and which are also necessary to evaluate the processes of damage and repair; 2) the methodological approach that allowed them to formulate the concepts of damage and repair, model them and estimate from the available data. This methodological framework coupled with the data resulted in important findings about the contribution of the age-related decline in robustness and resilience in health deficits accumulation with age and the differential impact of interventions on the processes of damage and repair. This provides important insights into these key components of the process of aging and this research should be of interest to both lab researchers who plan experimental studies with laboratory animals to study potential mechanisms and interventions affecting health deficits accumulation as well as researchers working with human longitudinal studies who can apply this approach to further investigate the impact of different factors on robustness and resilience and their contribution to the overall health deterioration, onset of diseases and, eventually, death.

      The key strength of this work is a rigorous analytic approach that includes joint modeling of longitudinal measurements of health deficits and mortality (in mice). This approach avoids biased inference which would be observed if longitudinal data were analyzed alone, ignoring attrition due to mortality. Another strength is a comprehensive analysis of both laboratory animal data that allows exploring the impact of different interventions on the processes of damage and repair and human data that allows investigating disparities in these processes in individuals with different socioeconomic backgrounds (represented by household wealth).

      One weakness (which is commonplace for human studies) is self-reported data on health deficits in humans which makes it difficult to compare with lab data where deficits are assessed objectively by lab researchers. The subjective nature of health deficits measurements complicates the interpretation of findings, especially about repairs of deficits. In addition, it is not clear whether the availability/absence of caregivers at different exams/interviews factors into the answers on difficulty/not difficulty with specific activities constituting health deficits and, respectively, into their change over time reflected in damage/repair estimates.

      Variability of the evaluator is expected in any longitudinal study, and amounts to a variety of measurement error. The question of whether there are age-effects in the measurement error, such as bias or age-dependent variability is interesting. For the mouse data, evaluator training is designed to minimize such errors and inter-evaluator differences are not large (Feridooni et al, 2015; Kane et al, 2017). For the human self-report data any such age-effects are unavoidable.

    1. Author Response

      Reviewer 1

      Sadeh and Clopath analyze two mouse datasets from the Allen Brain Atlas and show that sensory representations can have apparent representational drift that is entirely due to behavioral modulation. The analysis serves as a caution against over-interpreting shifts in the neural code. The analysis of data is coupled with careful modeling work that shows that the behavioral state reliably shifts sensory representations independently of stimulus modulation (rather than acting as a gain factor), and further show that it is reproducibly shifted when the behavioral state is adequately controlled for. The methods presented point towards a more careful consideration and measurement of behavioral states during sensory recordings, and a re-analysis of previous findings. The findings held up for both standard drifting grating stimuli as well as natural movies.

      The fact that neurons may have different tuning depending on the behavioral state of the animal raises obvious questions about readout. The authors show that neurons with strong behavioral shifts should simply be ignored and that this can be achieved if the downstream decoder weights inputs with more stimulus information. While questions remain about why behavior shifts representations and how that could be more effectively utilized by downstream circuits, the results presented clearly show that sensory representations might not always be simply drifting over time, and will spark some careful analysis of past and future experimental results.

      Many thanks for a clear summary of the work and emphasizing the significance of the results.

      Reviewer 2

      Studies from recent years have shown that neuronal responses to the same stimuli or behavior can gradually change with time - a phenomenon known as representational drift. Other recent studies have shown that changes in behavior can also modulate neuronal responses to a given sensory stimulus. In this manuscript, Sadeh and Clopath analyzed publicly available data from the Allen Institute to examine the relationship between animal behavioral variability and changes in neuronal representations. The paper is timely and certainly has the potential to be of interest to neuroscientists working in different fields. However, there are currently several important issues with the analysis of the data and their interpretations that the authors should address. We believe that after these concerns are addressed, this study will be an important contribution to the field.

      We really appreciate the time and the effort the reviewer(s) have taken to evaluate our results and analysis in detail. Their comments are very relevant and critical to the improvement of the manuscript. We explain below how we addressed their various comments and concerns

      1. The manuscript raises a potential problem: while previous work suggested that the passage of time leads to gradual changes in neuronal responses, the causality structure is different: i.e., the passage of time leads to gradual changes in behavior, which in turn lead to gradual changes in neuronal responses. The authors conclude that "variable behavioral signal might be misinterpreted as representational drift". While this may be true, in its current form, the paper lacks critical analyses that would support such a claim. It is possible that both factors - time and behavior - have a unique contribution to changes in neuronal responses, or that only time elicits changes in neuronal responses (and behavior is just correlated with time). Thus, the authors should demonstrate that these changes cannot be explained solely by the passage of time and elucidate the unique contributions of behavior (and elapsed time) to changes in representations.

      This is a very important point and we addressed it with new analyses, by dedicating a new figure (Figure 1–figure supplement 5) and a new part of the Results section to it. The results of our new analyses show that strong representational drift mainly exists in those animals/sessions with large behavioral changes between the two blocks, and that in animals/sessions with small behavioral changes, such drift is minimal, despite the passage of time (see our responses below to Major comments for further details).

      1. There are also several issues with the analysis of the data and the presentation of the results. The most concerning of which is that the data shows a non-linear (and non-monotonic) relationship between behavioral changes and representational similarity. In many of the presented cases, the data points fall into two or more discrete clusters. This can lead to the false impression that there is a monotonic relationship between the two variables, even though there is no (or even opposite) relationship within each cluster. This is a crucial point since the clusters of data points most likely represent different blocks that were separated in time (or separation between within-block and acrossblock comparisons).

      This is an important concern. To address this, we analyzed the source of the non-monotonic relationship / opposite trend in the data and demonstrated the results in a new figure (Figure 4–figure supplement 2). Our results show that the non-monotonic relationship does not compromise the result of our previous analysis. Furthermore, it suggests that the non-monotonic / opposite trend is emerging as a result of more complex interactions between different aspects of behavior. We have also shown, in separate analyses, that the passage of time is not the main contributing factor to representational drift, rather large behavioral changes are correlated with strong drifts between the two blocks of presentation (Figure 1—figure supplement 5, and Figure 3—figure supplement 2).

      More generally, we did not intend to claim that the relationship with behavioral changes is linear or/and monotonic. We used linear analysis just to show the main trend of decrease in representational similarity with large behavioral changes. Any other analysis should assume some form of nonlinearity, but because the nonlinear relationships between behavior and activity were complex, it was not easy to assume such nonlinearity.

      We in fact tried to use two other ways of analysis, nonlinear correlations and generalized linear models (GLM), but there were issues hindering a proper use of each analysis. Nonlinear correlations assume a specific type of nonlinearity, but the nature of nonlinearity underlying the data is not clear (in fact, it looks to be different in different example non-monotonic trends in the data). We could not, therefore, assume a nonlinearity that best fitted all the data; we believe the nature of this nonlinearity, or how behavior modulates neuronal activity in a nonlinear manner, is in itself an interesting and open question for future investigation, but beyond the scope of this study. GLM did not provide useful results either, as the relationship between behavioral changes and neural activity/representational similarity was state-dependent and transitioning between nonlinear states, therefore hindering the usage of linear methods.

      We therefore opted for the simplest analysis which can show and quantify this dependence - emphasizing that further analyses are in fact needed to get to the bottom of the exact nonlinear relationship (for further details, see the responses below to Major comments).

      1. The authors also suggest that using measures of coding stability such as 'population-vector correlations' may be problematic for quantifying representational drift because it could be influenced by changes in the neuronal activity rates, which may be unrelated to the stimulus. We agree that it is important to carefully dissociate between the effects of behavior on changes in neuronal activity that are stimulus-dependent or independent, but we feel that the criticism raised by the authors ignores the findings of multiple previous papers, which (1) did not purely attribute the observed changes to the sensory component, and (2) did dissociate between stimulus-dependent changes (in the cells' tuning) and off-context/stimulus-independent changes (in the cells' activity rates).

      That’s a very valid point. As population vector correlations are used quite often in (experimental and theoretical) works on representational drift, we wanted to highlight the pitfalls of such a metric in dissociating between sensory-evoked and sensory-independent components. However, as the reviewers have mentioned, these two aspects have been separated and addressed independently in some of the past literature in the field. For instance, as we discussed in the Discussion, Deitch et al. (Current Biology, 2021) have calculated this for different metrics, including tuning curve correlations, which can potentially alleviate this problem:

      A recent analysis of similar datasets from the Allen Brain Observatory reported similar levels of representational drift within a day and over several days5. The study showed that tuning curve correlations between different repeats of the natural movies were much lower than population vector and ensemble rate correlations5; it would be interesting to see if, and to which extent, similarity of population vectors due to behavioural signal that we observed here may contribute to this difference.

      We tried to highlight these contributions better in the revised manuscript (see further on this below in our responses to Major comments).

      1. Another important issue relates to the interchangeable use of the terms 'representational drift' and 'representational similarity'. Representational similarity is a measure to identify changes in representations, and drift is one such change. This may confuse the reader and lead to the misconception that all changes in neuronal responses are representational drift.

      We thank the reviewer(s) for raising this point. We have clarified our use of the terms representational similarity and representational drift in the revised manuscript. Specifically, we have quantified representational drift index between the two blocks according to a previously used metric (RDI; Marks & Goard, 2021) in our new analysis (Figure 1–figure supplement 5).

      For the main part of the paper, however, we have decided to base our analysis on representational similarity (RS), and to evaluate the drop of RS with changes in behavior. Our reasoning for this is twofold. First, any measure of representational drift should ultimately be a function of the representational similarity. The measure we used above, for instance, is calculated as RD = (RS_ws - RS_bs)/(RS_ws + RS_bs) (Marks and Goddard, 2021), with RS_ws and RS_bs referring to the average representational similarity within a session or between different sessions. However, RS contains more information, especially with regard to fine-tuned changes - the above metric, for instance, averages all the changes within each block of presentation. By focusing on the basic function of representational similarity, we could capture both the gross changes between the blocks as well as more nuanced changes that can arise within them, especially with regard to behavioral changes. Another aspect that would have been lost by only using the usual metric of representational drift is the direction of change. In our analysis, we in fact found that the average RS increased within the second block of presentation, which might be contrary to the usual direction of drift. We found this unconventional change of RS interesting and informative too. We could highlight that, presenting the raw RS provided a better analysis strategy. Based on these reasons, we think representational similarity would be a better metric to base our analyses upon, although we have now calculated a conventional representational drift index for comparison too.

      Reviewer 3

      Although it is increasingly realized that cortical neural representations are inherently unstable, the meaning of such "drift" can be difficult or impossible to interpret without knowing how the representations are being read out and used by the nervous system (i.e. how it contributes to what the experimental animal is actually doing now or in the future). Previous studies of representational drift have either ignored or explicitly rejected the contribution of what the animal is doing, mostly due to a lack of high-dimensional behavioural data. Here the authors use perhaps the most extensive opensource and rigorous neural data available to take a more detailed look at how behaviour affects cortical neural representations as they change over repeated presentations of the same visual stimuli.

      The authors apply a variety of analyses to the same two datasets, all of which convincingly point to behavioural measures having a large impact on changing neural representations. They also pit models against each other to address how behavioural and stimulus signals combine to influence representations, whether independently or through behaviour influencing the gain of stimuli. One analysis uses subsets of neurons to decode the stimulus, and the independent model correctly predicts the subset to use for better decoding. However, one caveat may be that the nervous system does not need to decode the stimulus from the cortex independently of behaviour; if necessary, this could be done elsewhere in the nervous system with a parallel stream of visual information.

      Overall the authors' claims are well-supported and this study should lead to a re-assessment of the concept of "representational drift". Nonetheless, a weakness of all analyses presented here is that they are all based on data in head-fixed mice that were passively viewing visual stimuli, such that it is unclear what relevance the behaviour has. Furthermore, the behavioural measurements available in the opensource dataset (pupil movements and running speed) are still a very low dimensional representation of what the mice were actually doing (e.g. detailed kinematics of all body movements and autonomic outputs). Thus, although the authors here as well as other large-scale neural recording studies in the past decade or so make it clear that relatively basic measures of behaviour can dramatically affect cortical representations of the outside world, the extent to which any cortical coding might be considered purely sensory remains an important question. Moreover, it is possible that lowerdimensional signals are overly represented in visual areas, and that in other areas of the cortex (e.g. somatosensory for proprioception), the line between behaviour parameters and sensory processing is blurred.

      Many thanks for the clear and insightful summary of the results, significance and caveats of our analysis. We totally agree with this critical evaluation - and suggestions for future work.

    1. Author Response

      Reviewer 2

      In the manuscript, the cellular deformation that is due to the shear stress generated in a classical microfluidic channel is used to deform detached cells that are moving in the flow. A very elegant point of the paper is that the same cells are used in the provided software to determine the fluid flow, which is a key parameter of the method. This is particularly important, as an independent way to crosscheck the fluid flow with the expected values is important for the reliability of the method. Instead of complicated shape analysis that are required in other microfluidic methods, here the authors simply use the elongation of the cell and the orientation angle with respect to the fluid flow direction. The nice thing here is that a well-known theory from R. Roscoe can be successfully used to relate these quantities to the viscoelastic shear modulus. Thanks to the knowledge of the fluid flow profile, the mechanical properties can be related to the tank treading frequency of the cells, which in turn depends on the position in the channel, and the flow speed. Hence, after knowing the flow profile, which can be determined with a sufficiently fast camera, and the actual static cell shape, it is possible to obtain frequency dependent information. Assuming then that cells do have a statistically accessible mean viscoelastic property, the massive and quick data acquisition can be used to get the shear modulus over a large span of frequencies.

      The very impressive strength of the paper is that it opens the door for basically any, non-specialized cell biology lab to perform measurements of the viscoelastic properties of typically used cell types in solution. This allows to include global mechanical properties in any future analysis and I am convinced that this method can become a main tool for a rapid viscoelastic characterization of cell types and cell treatment.

      Although it is both elegant and versatile, there remain a couple of important questions open to be further studied before the method is as reliable as it is suggested by the authors. A main problem is that the model and the data simply don't really work together. This is most prominent in Figure 3a. This is explained by the authors as a result of non-linear stress stiffening. Surely this is a possible explanation, but the fact that the question is not fully answered in the paper makes the whole method seems not sufficiently backed. I agree that the test with the elastic beads are beautiful, but also here the results obtained with the microfluidic method and the AFM seem not to match sufficiently to simply use the proposed model in conjecture with a single power law approach to fully translate the single frequency data into a frequency dependent plot. There are more and more hints that two power law models are more reasonable to describe cell mechanics. If true this would abolish the approach to exploit only a single image to get the mechanical power law exponent and the prefactor in a single image. Despite all the excitement about the method, I have the feeling that the used models are stretched to their extreme, and the fact that the only real crosscheck (figure 3a) does not work for the power law exponent undermines this impression.

      We had assumed that the probing frequency equals the tank treading frequency. This is incorrect. As the cell undergoes a full rotation, any given volume element inside the cell is compressed twice and elongated twice. Hence, the frequency with which the cell is probed is twice the tank-treading frequency. This correction shifts the G’ and G” versus frequency curves to the right (by a factor of two), and in addition, the G” data points are shifted (increased) by a factor of two (Eq. 17). This also increases the fluidity alpha (and hence the slope of the power-law relationship) roughly by a factor of two (Eq. 22), and since the actual slope of the G’ and G” versus frequency data “cloud” is unchanged by the correction, the single power-law description now describes the data much better (see new Fig. 3a).

      Regarding the critique that models are stretched to their extreme: The Roscoe model assumes that cells behave as the visco-elastic continuum-mechanics equivalent of a Kelvin-Voigt body consisting of an elastic spring in parallel with a resistive (or viscous) dash-pot element . This then gives rise to a complex shear modulus with storage modulus G’ and loss modulus G”, measured at twice the tank treading frequency 𝜔. Roscoe makes no assumptions whatsoever about how G’ and G” might change as a function of frequency. Hence, our “raw” G’ and G” data, e.g. in Fig. 3a, are obtained without any power law assumption.

      One could leave it at that, as the reviewer suggests below, and only present the raw G’ and G” vs. frequency plots. However, this would also make it nearly impossible to compare our measurements to those obtained with other techniques that operate at different, non-overlapping time- or frequency-scales. For such a comparison to work, one needs a model to predict how G’ and G” scale with frequency.

      A commonly used and very simple model to predict how G’ and G” scale with frequency, which is also the model used by Fregin et al. and many others, is that of a Kelvin-Voigt body consisting of an elastic spring in parallel with a resistive element (dash-pot), both with a frequency-independent stiffness and resistance (viscosity), respectively. However, our data show that G’ and G” of different cells, all measured at different tank-treading frequencies, exhibit a behavior that is very unlike that of a simple Kelvin-Voigt body with a constant, frequency-independent stiffness and resistance. In this case, G’ would be flat (power law exponent zero), and G” would increase proportional with frequency (power law exponent of unity). This is clearly not what our data show.

      Rather, we find that G’ and G” increase with increasing frequency according to a power law, with the same exponent 𝛼 for G’ and G”. At high frequencies (beyond the range of our microfluidic method, but in the range of our AFM measurements), G” increases more strongly with frequency, akin to a Newtonian viscosity (power law exponent of unity), which we take into account in the case of the AFM measurements. A large number of publications have shown that many types of cells, including cells in suspension, follow power law rheology, regardless of the measurement method. Also the AFM measurements that we include in this study support the validity of power-law rheology.

      Power law rheology predicts a peculiar behavior: The ratio of G”/G’ in the low-frequency regime (where the high-frequency viscous term is not yet dominating) must be equal to tan(𝛼𝜋/2), for mathematical reasons (Eq. 22). With our correction (that the probing frequency is twice the tank-treading frequency), we find that Eq. 22 correctly predicts the power-law exponent of the G’ and G” vs. frequency data.

      Note that we actually do not fit a power law model (Eq. 1) to the population data of G’ and G” vs. frequency in Fig. 3a. The G’ and G” data are obtained by applying Roscoe-theory, without any further assumptions such as power-law rheology. Only the lines shown in Fig. 3a that go nicely through the data are a prediction of how a typical cell (selected from the mode of the joint probability density of alpha and k, see Fig. 3b) would behave if we had measured it at different frequencies, under the assumption that this cell follows power law rheology, based on Eq. 22. With this assumption, we can directly convert the measured G’ and G” of any cell into a stiffness k and power law exponent 𝛼 using Eqs. 21 and 22 - no fit is needed here.

      Since we only measure two parameters for any given cell at twice its tank-treading frequency, namely strain and alignment angle, we can only extract two parameters for each cell (i.e., G’ and G”, or k and alpha) but not a third parameter. In essence, the reviewer expresses concerns that the G' and G" behavior of a typical cell, when extrapolated to higher or lower frequencies, may not necessarily match the frequency behavior of the entire cell population (Fig. 3a). However, our data show that a single (typical) cell that was measured at a single mid-range frequency comes remarkably close to describing the G’ and G” versus frequency behavior of all other cells.

      The reviewer suggests that a power law model with two exponents may be able to even more accurately describe the mechanics of the cell population. This is certainly correct, and in particular when cell mechanics is measured over a larger range of frequencies or strain rates, as we have done here using AFM, we find that at higher frequencies, G” deviates from a weak power law and merges into a different power law with a larger slope (i.e., power law exponent) that approaches unity or a value close to unity, akin to a Newtonian viscous term. Therefore, the single power law expression (Eq. 1) is not sufficient for the AFM data, and we use Eq. 2 instead. However, in the case of our shear stress cytometry measurements, the tank-treading frequency remains below the range where this second power law behavior becomes prominent. Therefore, the Newtonian viscosity term of Eq. 2 cannot be fitted with reasonable fidelity to the data from a single measurement.

      In the case of polyacrylamide beads, we start to see a hint of an upward trend in G” versus frequency at tank-treading frequencies of around 10 Hz, and therefore have performed a global fit with Eq. 2 to the shear flow data where we keep the Newtonian viscous term constant for all conditions (different shear stresses and bead stiffnesses).

      The reviewer furthermore cautioned that mechanical non-linearities such as strain stiffening may distort or otherwise bias the results. As the reviewer brings up this issue in more detail below, we have addressed it there.

      Regarding the concern that “results obtained with the microfluidic method and the AFM seem not to match sufficiently to simply use the proposed model in conjecture with a single power law approach to fully translate the single frequency data into a frequency dependent plot.”:

      First, we tend to agree more with the opinion of Reviewer #1 who found it remarkable that results obtained with the microfluidic method and the AFM method are actually fairly similar. Now that we have introduced the correction that the probing frequency is twice the tank-treading frequency, the cells in suspension turn out to be softer and more fluid-like compared to the cells measured with AFM. But there are many more commonalities between the AFM data and the shear flow data, which we list above in our reply to reviewer #1, the most relevant here is that cells show power-law behavior both when measured with AFM and with our new method.

      Second, we did not use a single power law to fit the AFM data. Rather, we used Eq. 2, which contains two power law relationships (the second power law exponent of unity for the Newtonian viscosity therm is usually not explicitly written). However, the origin of the Newtonian viscosity therm arises mainly from the hydrodynamic drag of the cantilever with the surrounding liquid, and less so from the cells. This hydrodynamic drag is absent in our shear flow deformation cytometry method, and moreover the tank treading frequency of most cells remains far below 10 Hz where an additional Newtonian viscosity therm does not yet come into play.

      Third, we disagree that Fig. 3a is “the only real crosscheck for the power law exponent”. The inverse relation that we see between the power law exponent and the stiffness of individual cells (Fig. 3b) has been previously reported for different cell types and methods. Moreover, we find a power law exponent close to zero for PAA beads at small strain values, which is to be expected for a predominantly elastic material such as PAA. We think that this last result is a particularly convincing experimental cross-check.

  6. Jul 2022
    1. Author Response

      Reviewer #1 (Public Review):

      My primary criticism of this paper is that it misses the opportunity to give some key details about the statistics of neural activity during 'ripples' rather than studying identified replay events. A secondary criticism is that they limit their analyses to neurons that have place fields in both environments. I think the activity of the other 3 categories of neurons (active in Track 1 only, active in Track 2 only, and not active in either track) are also of critical interest.

      We agree with the reviewer that it is important to demonstrate that the main observations are not due to a small subset of neurons or replay events. We have described above the inclusion of Figure 1- figure supplement 6, where the threshold for replay detection is made less stringent and the ratio of significant replay events/candidate replay events are now reported in the manuscript. To address the concern that the analysis is limited to neurons only with place fields on both tracks, we have added four more subpanels to Figure 1-figure supplement 6, where we perform our regression analysis on all spatially tuned (pyramidal) neurons (Figure 1-figure supplement 6E), neurons with only place fields on one track (track 1 and track 2 neurons will be in the upper right and lower left quadrant of plot respectively, Figure 1-figure supplement 6F), neurons with peak amplitude <1Hz on each tracks (Figure 1-figure supplement 6G) and finally, interneurons (Figure 1-figure supplement 6H). Consistent with our previous findings, we observe significant regressions for POST replay events for all spatially tuned neurons and neurons with place fields only one track. Conversely, neurons that were not active on either track and interneurons are not rate modulated by experience during replay.

      It is important to note that replay detection uses all spatially tuned cells, but the regression analysis is limited to cells active on both tracks in the main analysis. The reason for this is now explained in more detail in the revised manuscript (page 5):

      “It is important to note that a significant regression would be expected when analyzing neurons with a place field only on one track, as they are expected to participate in replay events of this track, while being silent during the replay of the other track. As such, our regression analysis only analyzed place cells active on both tracks and stable across the whole run (Figure 1-figure supplement 1B and see Methods).”

      Reviewer #2 (Public Review):

      This study by Tirole et al. addresses to what extent differences in firing rate that occurs during the awake experience of two different tracks are replayed during SWRs.

      In principle, this is a topic broadly relevant to our understanding of the circuit-level mechanisms and neural coding of memory, because it can provide insight into the ways in which experience is transformed into memory traces, and in particular, whether an entire coding modality (firing rate patterns) is available for replay. However, I didn't have an easy time situating this study in the context of the existing literature. When I first read the title, I expected this work was going to address the question of if there is replay of rate-remapped experiences, which is still an understudied topic (but see Takahashi, 2015) and would be important to examine. But once I realized that the two experiences here are actually more like global remapping, it was less clear to me what is novel here.

      My best guess about what's novel is that even though on the one hand, many studies have shown a distinguishable replay of two (or more) distinct experiences, e.g. different mazes like in Karlsson et al. 2009, different arms of a T-maze in Gupta et al. 2010, the overlapping central stem element of different trajectories in various mazes (Takahashi, 2015 and work from the Jadhav lab). On the other hand, there have been extremely detailed examinations of the contributions of firing rate changes (as distinct from temporal order or synchrony) as in Farooq et al. 2019. But perhaps the authors think that the intersection of those two kinds of work has not been studied, that is, how much do firing rate changes specifically contribute to the replay of two distinct experiences? In any case, regardless of whether I understood that correctly or not, the authors need to be more explicit in the introduction and discussion in contextualizing their work. I also suspect that the current findings are a direct logical consequence of putting together these well-established previous results; this would not mean the current work isn't a useful advance, but it would moderate the novelty and general interest.

      Beyond this overall question of how the work relates to the extant literature, I have a suggested modification to the data analysis. I think that the quality of the data and the care taken in the analyses were very high in general, so I do not have any major concerns, and the conclusions are very thoroughly supported. However, I wonder if there is a way to simplify some of the analyses and make them a bit more straightforward to interpret. As the authors have realized, there is potential for a circularity in the analysis, in the sense that to compare firing rate differences for two tracks between Track and Replay, Replay events first need to be assigned to one or the other (decoded) Track. But then any firing rate differences may be contributing to the output of the decoder, rendering the analysis circular. I understand the authors use various methods like the firing-rate-insensitive method in Figure 2 to deal with this crucial issue. But wouldn't a simpler way be to leave out the cell whose firing rates are being analyzed out of the decoding step so that the labeling of Replay events is independent of that cell? This seems an intuitive and rigorous way to address the central question the authors have. Is there some reason why that isn't done?

      We thank the reviewer for this feedback, and agree it is important to emphasize the novel contributions of the manuscript (as we see it), and clarify this further if needed. The reviewer is correct that there are several studies that have looked at rate remapping during reactivation. We have cited some of these, but have now updated our citations in the intro and discussion based on the comments here. While we have avoided directly criticizing a particular study in our earlier draft of the manuscript, these previous studies are affected generally by several issues: 1) replay detection methods were sensitive to rate modulation, creating a circular argument for the existence of rate modulation in replay. [Our study thoroughly addresses this with several controls]. 2) the analysis of reactivations rather than replay, which lacks the statistical rigor of sequence detection [we have focused on replay using a strict threshold for significance] 3) Replay/reactivations are analyzed for a single environment, making it difficult to distinguish between rate modulation and changes in the overall excitability levels of neurons maintained over behavior and sleep. [our studies uses two tracks to avoid this potential issue]. 4) When multiple contexts were decoded, neurons that only fired in one context were not removed from the analysis, artificially “inflating” any observed rate modulation. [we have circumvented this issue by only analyzing neurons with place fields in both environments]

      The suggestion to repeat the analysis and leave one neuron out for replay detection is excellent, however this was avoided due to the required processing time- to run our complete analysis takes more than a week, and repeating this for each possible “leave-one-out” combination would take significantly longer (this has to be done independently for each neuron). We used multiple controls (track rate shuffle, replay rate shuffle, rank order correlation- figure 2, figure 2—figure supplement 2) to eliminate any possibility that a neuron’s firing rate could influence replay detection. Specifically, for rank-order correlation based replay detection, each burst of spikes is only treated as a single event (median of spike times in the burst), which directly circumvents the problem of firing rate biasing replay event selection.

    1. Author Response

      Reviewer 1

      In general, I consider that the manuscript reflects a huge effort in terms work done and data collection, the manuscript is very well written, and it brings new knowledge in terms of cooperative breeding and its connection with groups size in ostrich. My major concerns are about the title and introduction that are in my opinion too broad and not enough detailed.

      In the introduction the scientific background that led to this research is lacking, and the manuscript would benefit from a more supported introduction, which makes it difficult to understand how far this study went comparatively to previous studies. The research work was well conducted, and adjusted to the study aims. However, it would benefit from including more details on the observational data collected by the authors.

      I think the research topic is interesting, and the study was well performed, but the manuscript would benefit from a more clear approach to the working hypothesis, expected results and background theories/hypotheses.

      We are very grateful for the positive and constructive feedback. The title and introduction have been revised according to the reviewer’s suggestions. We provide a more extensive introduction to the hypotheses being tested, which are now explicitly stated. The observational data we collected have been described in more detail and we integrate our observational and experimental data more thoroughly.

      In the evaluation summary, the reviewer highlights that we did not address some aspects of groups, such as relatedness and parentage. We have now added additional analyses to show these do not change the conclusions of our study (for details please see responses to reviewer 2 who raises similar concerns more extensively). These were not originally included in the manuscript as the aim of our study was to examine how group size and composition influence the average reproductive success for any given individual, irrespective of variation in relatedness and parentage within groups.

      Reviewer 2

      This work sets out to investigate experimentally the effect of differences in group size and group composition on reproductive behavior and success in ostrich groups. Direct field observations of the relationship between group composition/group size and reproductive success, do not allow for causal inference, as there may be several reasons why patterns may arise. For example, observing individuals having a higher reproductive success in larger groups than in smaller groups may not be a direct result of a larger group size per se, but it may be that higher quality individuals manage to establish themselves more often in larger groups. Hence, experimental manipulation of group size and group condition in natural contexts is important. 96 experimental groups of ostriches were established in fenced off areas in the Karoo in South Africa, varying the number of males (1 / 3) and the number of females (1 / 3 / 4 / 6) across groups. Groups were followed for almost a year, studying a period without parental care (eggs were removed and incubated in an incubator to measure reproductive success) and a period with parental care (eggs were left in the enclosures).

      In the latter case, behavioral observations were done to study nest incubation, and sexual conflict (interruptions of incubation). The study was done for seven years, and having such data on experimental manipulations in semi-wild conditions is very valuable. The combination of behavioral analysis, with careful tracking of the fate of eggs (by daily nest checks), the experimental nature, and measuring reproductive success make for a very complete analysis of the breeding ecology of this system and can serve as a blueprint for more of such work in the fields of cooperation, group living and breeding ecology.

      Some aspects, however, deserve more attention. First, at present, the origin and familiarity and possible relatedness among the group members of the experimentally composed groups is not discussed, and it may be that these factors play a role in shaping the results. Second, the reproductive measure used was the average number of chicks per sex, but it was not calculated at the individual level. There were no genetic analysis done to establish which individuals were actually successful in terms of reproduction. Since individual level selection is likely very important in this system, the results of average reproductive success need to be interpreted with great care. Third, the study was done under semi-natural conditions, meaning that the effects of other factors possibly shaping the success of group size and group composition in the wild (e.g., possible nest predation) were weakened. Finally, a closer connection between the experimental results on optimal group size, and whether this can actually be found in the dataset on natural variation in group size and group composition can be explored.

      We are very grateful for the careful review of our work and positive feedback. The suggestions and comments have been extremely helpful in revising the manuscript, which have led to the following changes:

      1) We have added details about the origin and familiarity of group members, together with extra analyses verifying that our results are not confounded by variation in within-group relatedness. The study population has a nine-generation pedigree allowing us to accurately estimate relatedness between individuals. In the design phase of the experiment, relatedness amongst individuals was kept low in accordance with data from natural populations, but there were related individuals of the same sex in some groups. We tested if the average relatedness within groups influenced the average number of chicks individuals produced and found no significant relationship (Supplementary file 1 – Tables S16 and S17).

      2) We have included genotyping analyses of 3227 offspring to verify that our non-genetic estimates of average reproductive success per sex (total chicks produced by groups / number of same sex individuals) accurately reflect measures obtained using genetic estimates of individual reproductive success. Genetic and non-genetic measures were highly correlated (R >0.95). We have added these verification analyses to the manuscript. The text has also been edited to further clarify that our aim is to estimate the average reproductive benefits for any given individual of being in group of a particular size, rather than examining differences in reproductive success between individuals within groups, for which genetic methods are required.

      3) We have clarified the advantages and limitations of experimental studies. As reviewer 2 highlights, observational studies alone do not provide causal insight into the factors influencing group size, but as reviewer 1 indicates, experimental studies can lack ecological context. Consequently, both have their merits. Experimental manipulations of entire social groups are currently lacking on large vertebrate cooperative breeders, but can be used to estimate the costs and benefits of living in different group sizes that arise independently of ecological conditions. The results of such experimental studies can be used as a benchmark against which other data can be compared, such as observational data on wild groups subject to ecological pressures, including nest predation. The discrepancies between experimental and observational data can then be used to infer the relative importance of social versus ecological factors in shaping social groups.

      4) We have added a figure (Figure 1 - figure supplement 1) and extended the discussion to better connect our experimental data with our observations of natural variation in group size.

    1. Author Response

      Reviewer 1

      This manuscript attempts to explain the well-known difference in DNA mutation rates between father vs. mother (paternal mutation is 4 times higher than maternal mutation in humans). Although the mutation rate difference was believed to arrive from the number of cell divisions (male germ cells undergo many more divisions compared to female germ cells), recent studies suggested that most mutations arise from DNA damage (which will be proportional to the absolute time) rather than DNA replication-induced mutations (which will be proportional to the number of cell divisions). The authors thus revisited the question as to why the paternal mutation rate is higher (if absolute time is more important than the number of cell divisions in causing mutations). They used 'taxonomic approaches' comparing paternal/maternal mutation rates of mammals, birds, and reptiles, correlating them to specifics of reproductive mode in these species. To measure paternal vs. maternal mutation rate, they compared the mutation rates of neutrally evolving DNA sequences between the X chromosome vs. autosomes, as well as the Z chromosome (utilizing the fact that the X chromosome will spend twice more generations in females than males, while autosomes spend equal time. Likewise, the Z chromosome will spend twice more time in males than in females, while autosomes spend equal time).

      They first confirm the paternal bias across a broad range of species (amniotes), eliminating many species-specific parameters (longevity, sex chromosome karyotype (XY vs. ZW), etc) as a contributor to the paternal bias. This implies that something common in males in these broad species causes paternal bias. They show that in mammals, the paternal bias correlates with a generation time. They propose that the total mutation is determined by the combination of the mutation rate during early embryogenesis (when both male and female have the same mutation rate) and the later mutation rate when two sexes exhibit different mutation rates. This model seems to explain why generation time correlates well with the extent of paternal bias in mammals. However, this does not explain at all why birds do not exhibit any correlation with a generation time. The speculation on this feels rather weak (although there is nothing they can do about this. Fact is fact).

      The logic behind their analysis is well laid out and seems mostly sound. Their finding is of broad interest in the field.

      • I am confused by this statement (the last sentence in the result section): 'If indeed the developmental window when both sexes have a similar mutation rate is short in birds then, under our model, generation times are expected to have little to no influence on α." Based on their model, if the early period is gone, when the mutation rates are similar between sexes are similar, intuitively it feels that generation time influences α even more. Am I missing something? (if the period with the same mutation rate is gone, then females and males are mutating at different rates the whole time).

      We apologize for the lack of clarity, as we should have made clear that here we are assuming a fixed ratio of paternal to maternal generation times. Under that assumption, if female and male germ cells are accumulating mutations as a fixed rate over time, then for each sex, the number of mutations accumulated with time is a line that goes through the origin, and the ratio of the paternal-to-maternal slopes (α) will be constant regardless of the age of reproduction. In other words, if Me=0 in equation 1, then α would be constant for any fixed ratio Gm/Gf. We have revised this sentence to be clearer; lines 334-338 now read:

      If indeed the mutation rate in the two bird sexes differs from very early on in development (i.e., if term Me ≈ 0 in equation 1), then assuming a fixed ratio of paternal-to-maternal generation times, our model predicts the sex-averaged age of reproduction will have little to no influence on α.

      • The authors state that this paper provides a simple explanation as to why paternal biases arise without relying on the number of cell divisions. However, it seems to me that the entire paper relies on the recent findings that mutation arises based on absolute time (instead of cell division number), and the novelty in this paper is the idea of 'two-phase mutation rates' to explain the observed numbers of paternal bias in various species. Yet it fails to explain the mutation rate difference in birds. There is not enough speculation or explanation as to what determines different mutation rates in males of various species. Although the modeling seems to be sound and there is nothing that can be done experimentally, I felt somewhat unsatisfied at the end of the manuscript.

      We agree with the reviewer that our paper does not address why the ratio of paternal-tomaternal mutation rates is lower in birds than mammals, and had stated so explicitly (lines 358360): “Another question raised by our findings is why, after sexual differentiation of the germline, mutation appears to be more paternally-biased in mammals (∼4:1) than in birds and snakes (∼2:1).

      To try to gain more insight into this question, we are now analyzing mutations in a set of three generation pedigrees from birds and reptiles, which should allow us to obtain a direct estimate of α and characterize sex differences in the mutation spectra, which we can then compare to what is seen in mammals. While this analysis is beyond the scope of this manuscript, we now note how this question might be pursued (lines 360-362):

      In that regard, it will be of interest to collect pedigree data from these taxa, with which to compare mutation signatures to those typically seen in mammals.

      Reviewer 2 The primary goal of this paper is to re-assess the cause for the excess of male over female germline mutations seen in many animals. By re-analyzing X (Z) and autosomal substitution rates across 42 species of mammals, birds, and snakes, and fitting a model that allows for a constant and equal-sex embryonic mutation rate, along with a mutation rate that increases with age, the authors show that there is no need to invoke the model that assumes mutation rate depends strictly on numbers of cell divisions.

      Strengths 1. The paper challenges a dogma in evolutionary genomics, which states that males have a higher germline mutation rate than females. It establishes convincingly that the count of premeiotic mitotic divisions is NOT the primary driver of the excess male mutations, but instead, it is the intrinsic mutation rate in males (balance of DNA damage vs DNA repair) that accumulates over time.

      1. The authors establish a simple model where the number of mutations that accumulate each generation depends on the embryonic mutation rate (which is shown empirically to not differ between the sexes) and a post-maturity mutation rate, which has elevated male mutation (driven presumably by a shift in the balance between DNA damage and DNA repair). The model is very clear and intuitive described.

      2. The paper is extremely carefully thought-out, planned, and executed. Criteria for inclusion and exclusion of species in the phylogenetic work are clearly laid out. Similarly, decisions about filtering genomic regions (avoiding repeats, etc.) are well done and exhaustively documented. The standard of scholarship is very high - for example, the analysis of de novo mutation rates in mammals pulled in data from no fewer than 15 published studies.

      Weaknesses 1. The method of estimating alpha relies on the assumption that the mutation process (and rates) are the same in autosomes and sex chromosomes. There is an attempt to control for GC content and replication timing, but it is easy to imagine other factors at play, including the inactivation of one X in females, the extensive differences in chromatin modifications, especially of the X, that differ in males vs. females. The case of the cat X chromosome, with its 50 Mb of recombination cold spot and corresponding oddly slow substitution rate, might be just one example of features in other species that cause other perturbations in the substitution rate of the X. This does not seriously erode confidence in the results, but there is more potential for intrinsic mutation rates of sex chromosomes and autosomes to differ than is suggested by the authors.

      We agree with the reviewer that despite our attempts, we do not control for all factors that distinguish X and autosomes beyond exposure to sex. We had written that “while our pipeline may not account for all the differences between autosomes and X (Z) chromosomes unrelated to sex differences in mutation, the qualitative patterns are reliable.” and have now included a sentence to make this limitation clearer (lines 165-167):

      Nonetheless, it is unlikely that our regression model perfectly accounts for all the genomic features that differ between sex chromosomes and autosomes other than exposure to sex.”

      In turn, the assumption that mutation rates in X (Z) and autosomes differ only with regard to their exposure to sex (after accounting for base composition and other genomic features) is unproven; we now state this assumption explicitly in the Methods (lines 678-681). Nonetheless, it seems warranted by the high concordance of evolutionary- and pedigree-based estimates of alpha in humans, mice and cattle. With regard to the specific factors mentioned by the reviewer, excluding CpG sites has little effect on our qualitative conclusions for mammals (see Fig S1E), suggesting that DNA methylation differences between X and autosomes are not having a major influence on our findings. Moreover, X-inactivation in the germline of mammals (as distinct from the soma) is likely quite short-lived, given that it lasts around three days in early development of mice (Chuva de Sousa Lopes et al. 2008) and at most four weeks in humans (Guo et al. 2015). Thus, it is unlikely to be an important mutation rate modifier. We have now reworked three paragraphs in the main text to make the limitations above clearer (lines 127-175).

      1. The authors point out that the human mutations in spermatogonia are due to mutation signatures SBS5/40 ( which are known not to be correlated with cell division rates). The work on the nonhuman species could be greatly extended with this mutation spectrum approach. For each species, one could ask: Are the mutation spectra of the embryonic mutations consistent between males and females? What about the mutation spectra for the post-puberty individuals? Is alpha consistent across mutation signatures? Does the GC bias correction impact these inferences?

      Unfortunately, there is not enough de novo data to address this question outside of humans. In turn, the analysis of substitution data is unreliable, because of the differential impact of repeated substitutions at a site and the effects of GC-biased gene conversion.

      1. While the data do not suggest reasons WHY males display a higher mutation rate, it is fair to ask whether the evolutionary drive for a higher mutation rate might shape the mechanism whereby it happens. There is a certain amount of speculation in the paper as it is, and it is done in a way that is often well supported by data after the fact. Speculation about why males have an elevated mutation rate would not erode the overall quality of the paper, and I would expect that many readers would be eager to see what the authors have to say on the subject.

      As we envisage it, along the lines of Lynch’s models for the evolution of germline mutation (Lynch 2010), there is likely selection to keep the mutation rate as low as possible, subject to the constraints of the need to replicate DNA, repair damage, etc. efficiently. Why the attainable lower limit would be higher in males than in females is unclear to us, both mechanistically and in terms of evolutionary selection pressures. As we now note lines 353-355, a potential proximal cause is a greater effect of reactive oxygen species, a major source of DNA damage, in male germ cells than in oocytes (Smith et al. 2013; Rodríguez-Nuevo et al. 2022). Potential evolutionary causes are even less clear to us, but could be related to the greater competition among sperm vs. oocytes (added in lines 354-357).

      Another way to think about these results is as shifting the question somewhat, broadening it from the long-standing puzzle of the selection pressures shaping sex differences to asking what determines the relative mutation rates of different cell types, including oocytes and spermatagonia but also somatic cell types/tissues. We had previously written that “our results recast long standing questions about the source of sex bias in germline mutations as part of a larger puzzle about why certain cell types (here, spermatogonia versus oocytes) accrue more mutations than others.” We have revised the final paragraph of the Discussion to try to emphasize this point.

      Overall the paper achieves its intended goal of toppling the dogma that the excess male mutation rate is driven by number of rounds of cell division in spermatogenesis (compared to oogenesis).

    1. Author Response

      Reviewer 3

      The number of identified anti-phage defense systems is increasing. However, the general understanding of how phages can overcome such bacterial defense mechanisms is a black box. Srikant et al. apply an experimental evolution approach to identify mechanisms of how phages can overcome anti-phage defense systems. As a model system, the bacteriophage T4 and its host Escherichia coli are applied to understand genome dynamics resulting in the deactivation of phage-defensive toxin-antitoxin systems.

      Strengths: The application of a coevolutionary experimental design resulted in the discovery of a geneoperon: dmd-tifA. Using immunoprecipitation experiments, the interaction of TifA with ToxN was demonstrated. This interaction results in the inactivation of ToxN, which enables the phage to overcome the anti-phage defense system ToxIN. The characterization of the genomes of T4 phages that overcome the phage-defensive ToxIN revealed that the T4 genome can undergo large genomic changes. As a driving force to manipulate the T4 phage genome, the authors identified recombination events between short homologous sequences that flank the dmd-tifA operon. The discovery of TifA is well supported by data. The authors prepared several mutant strains to start the functional characterization of TifA and can show that TifA is present in several T4-like phages.

      In addition, they describe T4 head protein IPIII as another antagonist of a so far unknown defense system.

      In summary, the application of a coevolutionary approach to discover anti-phage defense systems is a promising technique that might be helpful to study a variety of virus-host interactions and to predict phage evolution techniques.

      Weaknesses: The authors apply Illumina sequencing to characterize genome dynamics. This NGS method has the advantage of identifying point mutations in the genome. However, the identification of repetitive elements, especially their absolute quantification in the T4 genome, cannot be achieved using this method. Thus, the authors should combine Illumina Sequencing with a longread sequencing technology to characterize the genome of T4 in more detail.

      We think the combination of Illumina-based sequencing and PCR analyses presented are more than sufficient to arrive at the conclusions drawn about the repeats that emerge in our evolved T4 clones.

      To characterize the influence of TifA during infection, T4 phage mutants are generated using a CRISPR-Cas-based technique. The preparation of these phages is unclearly described in the methods section. The authors should describe in detail whether a b-gt deficient strain was applied to prepare the mutants. Information about the used primers and cloning schemes of the Cas9 plasmid would allow the community to repeat such experiments successfully.

      We have added details to the Methods section to clarify and expand on our mutagenesis approach.

      The discovery of TifA would benefit from additional data, e.g. structure-based predictions, that describe the protein-protein interaction TifA/ToxN in more detail.

      We were unable to predict the ToxN-TifA interaction interface using AlphaFold, and we are currently conducting follow-up work to characterize how TifA neutralizes ToxN.

      Several publications have described that antitoxins can arise rapidly during a phage attack. The authors should address that this concept has been described before as well by citing appropriate publications.

      We believe that we have already addressed this point sufficiently in the Introduction of the manuscript, in which we discuss (1) the emergence of phage-encoded pseudo-toxI repeats to overcome P. atrosepticum toxIN and (2) the presence of the naturally-occurring antitoxins Dmd and AdfA in T4 and T-even phages, respectively. We also discuss the similarities between TifA, Dmd, and AdfA in the discussion of the manuscript. To our knowledge, these are the only known examples of antitoxins arising during phage attack outside of TifA, but we are happy to include additional citations of which the reviewers are aware.

      The authors propose that accessory genomes of viruses reflect the integrated evolutionary history of the hosts they infected. However, the experimental data do not support such a claim.

      We disagree with the reviewer’s comment, as our evolution experiment demonstrates the plasticity of the T4 genome during adaptation to different hosts, as well as showing that the T4 accessory genome includes genes necessary for infection of some, but not all hosts. The proposal also comes as the last sentence of the Abstract and is framed not as a conclusion, but as a proposal based on the work done here, with the clear intention of providing a sense of how future work may build off our work.

    1. Author Response

      Reviewer 1

      They adopted a comprehensive experimental and analytic approach to understand molecular and cellular mechanisms underlying tissue-specific responses against 3-CePs. They used two cell lines - BxPC-3 and HCT-15 - as example models for responsive and non-responsive cell lines, respectively. Although mutation rates didn’t differ by the drug treatment, they observed changes in cell cycle and expression of genes involved in DNA damage, repair and so on. Furthermore, they combined RNA-seq and ATAC-seq data and applied two approaches, pairwise and crosswise, to identify a number of gene groups that are altered in each cell line upon the drug treatment. Finally, they calculated enrichment of up/down genes in different cell lines, tumor types and samples to estimate potential responsitivity against the drug. This study is unique in in-depth analysis of RNA-seq and ATAC-seq data in identifying genetic signature underlying drug treatment. This study has the potential to be applied to different drugs and cell lines.

      We thank the reviewer for the precise and kind summary of our work.

      However, several major concerns need to be resolved. First of all, the biological and clinical performance of 3-CePs is not clearly described. They referenced several papers but they seem to have focused on the chemical properties of the drug. Without proven activity of 3-CePs against cancers in vitro and in vivo, the rationale of the study would be compromised.

      We apologize for not being clear enough when introducing previous findings on the differential sensitivity of HCT-15 and BxPC-3 cancer cell lines to 3-CePs. In the revised manuscript, we now cite references on the preferential activity of these agents against the pancreatic cancer cell line in 2D and 3D in vitro cancer models (see lines 71-74, 128-129). These compounds have been selected to exemplify the use of the pipeline in drug discovery and early-stage of drug development: indeed, only cellular data are available for these molecules, which have not yet been characterized in vivo. The pipeline itself offered a final perspective on directions to take for their further development, i.e. most sensitive tumor types to target (PAAD, KIRC).

      Their RNA-seq analysis was focused on discovering differentially expressed genes between cell lines, time points, etc. Interestingly, they found that DNA damage and repair signal was specifically increased in HCT-15. But is this approach capable of finding signals that are constitutively expressed in different cell lines? In other words, what if the differential responsiveness to 3-CePs was already there even before the drug was introduced?

      We thank the reviewer for pointing out such key concept. The premise for the developed approach is that factors determining the overall cellular sensitivity to a treatment must be determined by intrinsic characteristics of the cell line. For this reason, we built the sensitivity signature on basal transcriptome profiles, where we prioritized a subset of genes based on perturbational evidence (perturbation-informed basal signature).

      Beyond signature genes, we show in figure R1 (see above) the results of a GSEA analysis on the whole overlap (300 genes) between DE genes from the baseline comparison (BxPC-3 ctrl vs HCT-15 ctrl) and those from the 6 h M treatment comparison, in the sensitive cell line (BxPC-3 M 6 h vs BxPC-3 ctrl). Pathways like ribosome biogenesis, ROS metabolism, UPR also arise, attesting that genes activated in response to the treatment also have a constitutively different expression in unperturbed cells.

      Are there any overlapping signals between pairwise vs crosswise approaches?

      We thank the reviewer for this question. To make it easier for the reader to compare the output from the two types of integration and to intuitively grasp their functional overlap, we changed the visualization of the results from the pairwise approach (Figure 4 D).<br /> Indeed, some functional pathways both new or already emerging from previous analysis, arise from both integrations. This overlap has now been directly discussed from the functional point of view in the main text (from line 348 and in the following crosswise integration paragraph).

      Genes used as input in both types of integration are DE or DAR-associated, so this means that many of the hits that we find having the same double regulation (pairwise) also appear in CoCena modules. Among them, only few hits show both 1) the same double regulation in a specified comparison (as suggested by crosswise) and also 2) end up having the similar pattern of regulation across all conditions (contributing to the same CoCena module, one of the strengths of the crosswise integration). Indeed, while the pairwise integration checks one single comparison per time, CoCena checks the pattern throughout conditions providing a more holistic view of the gene regulation (e.g one gene can have a different pattern across conditions at the transcriptional and chromatin level). This is due to the biological fact that RNA and chromatin regulation is not 1:1 (also, for instance, from a timing perspective).

      The major added value of the two approaches consists in their intrinsically different output information. Within a specific comparison, the pairwise integration detects genes consistently activated at the transcriptome and chromatin level. At this information level gene set enrichment can simplify the coherent functional role of this set of genes; we now report this extra information in figure 4 to provide a more granular description of the pairwise integration. Instead, CoCena analyzes the pattern throughout conditions, and clusters together genes and peaks that behave similarly. Functional annotation of genes behaving similarly can put together promoters and/or transcripts that together may orchestrate a specific process (as highlighted by GSEA on each module).

      Probably a similar question with the above: is this methodology applicable to other drugs in addition to 3-CePs?

      To address this extremely important point, that we agree with the reviewer would be key to prove the versatility of our approach, we further applied the pipeline to the prediction of cancer cell lines’ sensitivity to cisplatin, a thoroughly reported broad-acting chemotherapeutic also acting as a DNA damaging agent. Results strongly supported the broad applicability of our approach, which was able to predict sensitivity to this reference drug with extremely high accuracy.

      Reviewer 2

      Carraro et al. describe a framework to understand MoA and susceptibility of drug candidates by integrating RNA-seq and ATAC-seq information. More specifically, by collecting drug responses from high-sensitive and low-sensitive cell lines, the authors identified a key set of pathways with co-expression analysis, and further predicted sensitivity of different cancer cell lines.

      The authors provided a new bioinformatics pipeline to integrate multi-omics data (RNA-seq and ATAC-seq) in a drug response study. This approach increased detection power and identified additional key pathways that are associated with drug 3-CePs. This framework has the potential to be applied to the general drug discovery process.

      We thank the reviewer for the precise summary of our study.

      However, the current manuscript failed to describe the integration methodology in a clear and concise way. Without a full understanding of the methodology, it’s tough to evaluate the downstream results in an unbiased manner.

      We apologize for not having included sufficient details in describing the difference between CoCena and the other two horizontal and vertical approaches. As already discussed in the response to Reviewer 1, we now included a more detailed description not only in the Methods section (from line 894) but also in the main text (lines 393-400).

      In addition, the authors didn’t mention how much additional value this multi-omics approach provided compared to the single-omic data set, as multi-omics approaches are more expensive and labor-intensive.

      We thank the reviewer for this valuable point. To better support the claim for multi-omics approaches, we have extended the Introduction (lines 96-98), as successful integration of information derived from multiple omic layers usually strengthens the determination of the major observed cellular responses. Here, this information helps dissecting and predicting how perturbations (here by drugs) can affect the overall cellular dynamics and mechanisms underlying a certain niveau of sensitivity. We agree with the reviewer that current costs are still prohibitive for large scale use of multi-layer omics in many settings, mainly when it comes to clinical use or drug development. However, significantly less expensive technologies (90% cost-reductions, lines 53-55) have recently been announced, which assures us that approaches as outlined here, will be applicable to many more clinical questions in the near future. Further, we show evidence that some cellular responses to the drug-induced perturbation was only revealed by applying multilayer analysis, but not by a single omics layer, e.g. TGF beta and EMT signaling (see lines 456-459).

      Reviewer 3

      Carraro et al utilize systems biology approaches to decode the mechanism of action of 3chloropiperidines (a novel class of cancer therapeutics) in cancer cell lines and build a drugsensitivity model from the data that they evaluate using samples from The Cancer Genome Atlas and cancer cell lines. The approach provides a framework for integrating transcriptomic and open-chromatin data to better understand the mechanism of action of drugs on cancer cell types. The author’s approach is of sound design, is clearly explained, and is bolstered by validation via holdout sets and analysis in new cell lines which lends the findings and approach credibility.

      The major strength of this approach is the depth of information provided by performing RNA-seq and ATAC-seq on cells treated with 3-CePs at various time points, and the author’s utilization of this data to perform pairwise and crosswise analyses. Their approach identified gene modules that were indicative of why one cell type was more sensitive to a particular drug compared to another. The data was then used to build a sensitivity model which could be applied to samples from The Cancer Genome Atlas, and the authors evaluated their sensitivity predictions on a set of cancer cell lines which validated the predictions.

      We thank the reviewer for the accurate recapitulation of our work.

      The major drawback to this type of approach is that it relies on next-generation sequencing (somewhat costly) and requires intricate bioinformatics analyses. While I agree with the author’s perspective that this approach can be applied to additional classes of drugs and cancer samples, I disagree with their view that it is efficient and versatile. However, for research teams with the means to perform both transcriptomic and open-chromatin studies, I think this integrated approach has promise for evaluating novel classes of drugs, particularly in cancer cell lines that are easy to manipulate in vitro.

      We thank the reviewer for this insightful comment. As with almost every technology, the early years are more difficult and at times adventurous. However, we have seen enormous improvements in robustness of the technology and significant cost reduction with more to come. Only recently sequencing technologies have been introduced into the market with a further 90% cost reduction (as stated in line 53-55). We are convinced that due to their increasing affordability and robustness, RNA-seq and ATAC-seq will be implemented routinely into clinical contexts. As a group working at the cross-section between drug discovery and bioinformatics, we hope that our current work, accompanied by a fair and detailed sharing of our scripts, will become a head start to run this type of analysis also by others in the field who are not (yet) so close to bioinformatics and computational biology.

      While there are examples of similar frameworks being applied to drug development, this work will add to the body of literature utilizing an integrated systems biology approach for pairing drugs with specific tumor or cancer types and understanding their mechanism of action on an epigenetic level.

      We thank the reviewer for this very positive statement and the support for our approach and her/his interest in the described pipeline.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01501

      Corresponding author(s): Prachee Avasthi

      [The “revision plan” should delineate the revisions that authors intend to carry out in response to the points raised by the referees. It also provides the authors with the opportunity to explain their view of the paper and of the referee reports.

      • *

      The document is important for the editors of affiliate journals when they make a first decision on the transferred manuscript. It will also be useful to readers of the reprint and help them to obtain a balanced view of the paper.

      • *

      If you wish to submit a full revision, please use our "Full Revision" template. It is important to use the appropriate template to clearly inform the editors of your intentions.]

      1. General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      • *

      We thank the reviewers for their careful reading and evaluation of our manuscript. The reviewers have emphasized the need for several important changes which we plan to address.

      First, they request better evidence and specificity of the BCI target in Chlamydomonas. We have created double mutants between the dusp6 ortholog mutants and found severe defects in ciliogenesis similar to what we see with BCI treatment. We plan to include this data in the paper as well as the subsequent analyses we performed with the single dusp6 ortholog mutants. This data will provide stronger evidence that this pathway regulates ciliary length in Chlamydomonas aside from the other potential off target effects that could be impacting this pathway that we may be seeing through the use of BCI.

      Second, the reviewers have requested more consistency and clarity both in statistics and descriptions of the data and to expand upon our findings in the discussion. We will create a clear guideline for our use of statistics and adjust the descriptions of the data to fit this guideline more strictly and prevent overstating/oversimplifying results. We will also add more discussion and information related to off target effects of BCI, the importance of the subtle defects in NPHP4 protein expression in the transition zone, and the relevancy of the membrane trafficking data in light of this study.

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):____


      SUMMARY:____


      The authors investigated the effects of an allosteric inhibitor of DUSP (BCI) on cilia length regulation in Chlamydomonas. Among seven conclusions summarized in Fig. 7, BCI is found to severely disrupt cilia regeneration and microtubule reorganization. Additionally, changes in kinesin-II dynamic, ciliary protein synthesis, transition zone composition and membrane trafficking are also explored. All these aspects have been shown to affect cilia length regulation. Findings from this body of work may give insights on how MAPK, a major player in cilia length regulation, functions in various avenues. Additionally, the study of BCI and other specific phosphatase inhibitors may provide a unique addition to the toolset available to uncover this important and complicated mechanism.

      MAJOR COMMENTS

      Major comment 1____

      The addition of BCI increases phosphorylated MAPK in Chlamydomonas based on Fig 1B. However, the claim that BCI inhibits Chlamydomonas MKPs is not supported at all. SF1A shows CrMKP2, 3 and 5 are related to each other but distant from HsDUSP6 and DrDUSP6. At the same time, 2 out 3 predicted BCI interacting residues are different from the Hs and Dr DUSP6 in SF1B, contradicting "well conserved" in line 172. Consistently, mutants of these orthologs have little to no ciliary length and regeneration defects compared to BCI treatment (see major comment 6 about statistical significance). I am not convinced that BCI inhibits the identified orthologs or any MKPs in Chlamydomonas. It's possible that BCI inhibits a broad range of phosphatases including the ones listed and/or those for upstream kinases. But such a point is not demonstrated by the presented data.

      While BCI is predicted to interact with these residues, it is also predicted to interact with the “general acid loop backbone” by fitting in between the a7 helix and the acid loop backbone (Molina et al., 2009).

      MKP2 has ciliary length defects compared to wild type, though it regenerates normally. In addition, we have crossed these mutants together and have found that cells (2x3 12.2 and 3x5 29.4) cannot generate cilia. We will include this data in the supplement and perform follow up analyses on these double mutants. Because these structures are not 100% conserved, and we have changed the text to “partially conserved” to reflect this, it is possible that BCI is hitting all of these DUSPs rather than just one, or the DUSPs may serve compensatory functions that rescue ciliary length.

      Major comment 3____

      The claims that "BCI inhibits KAP-GFP protein expression" (line 271) and "BCI inhibits ciliary protein synthesis" (line 286) are not convincingly demonstrated. Overlooking that only KAP is investigated instead of kinesin-II, none of the relative intensity from the WB in 30 or 50 µM BCI and the basal body fluorescence intensity indicates a statistically significant difference. The washout made no difference in any of the assay and it's not explained how phosphatase inhibition by BCI might affect overall ciliary protein synthesis. The claims about protein expression may need a fair amount of effort and time investment to demonstrate, therefore I suggest leaving these out for this manuscript.

      Though it's very interesting to see that in SF 2C cilia in 20 µM BCI treatment can regeneration slowly. Line 162, the author claimed "In the presence of (30 µM) BCI, cilia could not regenerate at all (Fig 1E)". Since Fig 1E only extends to 2 hours, I think it's important to clarify if in 30 µM BCI cilia indeed can not generate even after 6 or 8 hours.

      We have altered the text to be more specific with our wording that KAP-GFP is investigated rather than kinesin-2, and we have added text to indicate that downstream phosphorylation events could impact transcription and translation of proteins necessary for ciliary maintenance. This interpretation of the data mentioned above is correct; KAP-GFP is not significantly altered at the basal bodies or in accordance with the steady state western blots. What we see here and demonstrated in Figure 2F-I is the depleted KAP-GFP protein which is not restored following a 2 hour regeneration in BCI. We likely do not see a difference in steady state conditions because the protein is not degraded, just being moved around in the cell. We can only see the difference when the majority of KAP-GFP, which the data suggests is mostly present in cilia, is physically removed through ciliary shedding. This protein is not replaced during a 2 hour regeneration which allows us to conclude that this protein is inhibited due to BCI.

      The washout made a small difference in the double regeneration whereby we begin to see cilia begin to form in washed out conditions, though this was not statistically significant. It is possible that BCI has a potent effect on the cell similar to how other drugs, such as colchicine, cannot be easily washed out. The purpose here is to show that regardless of the statistical significance, cells can begin to regenerate their cilia after BCI washout, though this occurs 4 hours after washout in doubly regenerated cells, and we do not see this potent effect on the singly regenerated cells in SF 2C. Though in SF2C, as mentioned, we do see slowly growing cilia, and this could, once again, be due to the potent inhibition BCI has on ciliary protein synthesis. We will confirm and clarify if 30 µM BCI cannot regenerate even after 6 or 8 hours.

      Major comment 5____

      It is very interesting that BCI disrupts microtubule reorganization induced by deciliation and colchicine. Data in Fig 6B and C are presented differently than those in SF 4C. For example, in SF 4C, BCI treatment for 60 min has close to 50 % cells with microtubule partially reorganized while in Fig 6C about 20% cells with microtubule fully (or combined?) reorganized. The nature of the difference is unclear to me without an assay comparing the two directly. Hence the implied claim that BCI affects colchicine induced microtubule reorganization differently than deciliation induced one is hard to interpret (line 398, line 388 vs line 403).


      The fact that taxol doesn't rescue cilia regeneration defect by BCI is very interesting. Here taxol treatment results in fully regenerated cilia while Junmin Pan's group (Wang et. al., 2013) reported much shorter regenerated cilia. It might be worthwhile to compare the experimental variance as this is a key data point in both instances. The relationship between cilia regeneration and microtubule dynamic is not in one direction. On one side, there's a significant upregulation of tubulin after deciliation. While many microtubule depolymerization factors such as katanin, kinesin-13 positively regulate cilia assembly (though not without exceptions). It is hard to determine that the BCI induced cilia regeneration defect can't be rescued by other forms of microtubule stabilization. Microtubule reorganization is one of the most striking defects related to BCI treatment. I suggest changing the oversimplified claim to a more limited one (such as "PTX stabilized microtubule ...") and an expansion on the discussion about microtubule dynamics and cilia length regulation beyond the use of taxol. Meanwhile, I strongly encourage authors to continue to investigate this aspect and its connection to the cilia regeneration.

      We will remove data regarding “partially” formed cytoplasmic microtubules and only include fully formed for each of these experiments for clarity.

      It is important to note the different taxol concentration used here. While Wang et al., 2013 used 40 µM taxol to study ciliary affects, we use 15 µM where stabilization still occurs. There have been reports of varied cell responses to higher vs. lower doses of taxol (see Ikui et al., 2005, Pushkarev 2009, Yeung 1999) mostly with regards to the cell’s mitotic/apoptotic response. We could be seeing altered responses at this lower concentration because Chlamydomonas cells also behave differently in higher vs. lower taxol concentrations. Thank you for your suggestions. We have adjusted the text to be more specific to PTX treatment as opposed to general stabilization.

      Major comment 7:____

      There are several places where the technical detail or presentation of the data are missing or clearly erroneous.

      Fig 1B: pMAPK and MAPK antibodies used in the WB are not described in the Material and methods. It's not clear if the same #9101, CST antibody used for RPE1 cell in Fig 1J is used.

      We have updated the materials and methods to include that this antibody was used for both RPE1 and Chlamydomonas cells.


      line 260 and Fig 3A state 20 µM BCI was used while Fig 3 legend repeatedly states 30 µM until (J). Also 30 µM in SF 2A.

      We have corrected the text to 20 µM BCI in the mentioned places.

      Fig 6C, the two lines under p value on top mostly likely start from the second column (B) instead of the first (D). Fig 6G, the line is perhaps intended for the second and fourth columns?

      We will make these comparisons more clear. We had performed a chi-square analysis and were comparing the difference between DMSO and BCI before PTX stabilization or MG132 treatment to after. We will add brackets to more clearly show these comparisons.

      Fig 6C, legends indicate bars representing each category. But only one bar is shown for each column. Same for 6G?

      This is the same as the previous comment for the way we represented the statistics. We will make this clearer with brackets to show the comparisons.

      Minor comments:____

      1. A number of small errors in text were noted above. Done.

      "orthologs" is misused in place of "ortholog mutants": line 176, 352, 421 (first), 879, 882, 898, 902, 938 , 939.

      Done.

      Capital names is misused as mutant names (e.g. "MKP2"should be "mkp2"): line 178, SF 1C, 1D and 1E, SF 3C, SF 6A

      Done.

      At several places such statistical analysis lines indicated are chosen confusingly. A simplest example is in Fig 1D, the comparison between 0 to 45 is less important than 0 to 30. Same as in Fig 1H, 1I. The line ends are inconsistent as well. They either end in the middle or the edge of the columns/data points (such as in SF 4B) and some with vertical lines (SF 2B, SF 4A, SF 6B). I suggest adding vertical lines pointing to the middle to indicate the compared datasets clearly.

      Thank you for this suggestion. We agree and will update the figures to reflect this and provide clarity for statistical comparisons.

      line 101 remove "the"

      Done.

      line 120 "modulate" to "alter"

      Done.

      line 198 "N=30" should be "N=3"

      Done.

      line 212. The legend for p value is likely for (G)

      Done.

      line 284, "singly" should be "single"

      Done.

      The dataset for "Pre" and "0m" in Fig 6D and 6E are clearly the same. Consider combining the two as in Fig 6C.

      This is correct. We will combine the data sets.

      Fig 6E, "BCI" on the X-axis should be "DMSO".

      This is correct. We will correct this.

      line 685, remove "?".

      Done.

      line 894: "Fig 3J" instead of "Fig 3H"

      Done.

      SF 1 legend, (C) and (D) are inverted.

      Done.

      SF 4A "Recovered" should be "Full"

      Done.

      SF 5, row 5, under second arrow perhaps missing +PTX

      Done. We greatly appreciate this close reading of the text and the list of changes making these errors easy to find. We will make these changes in the manuscript.

      Reviewer #1 (Significance (Required)):____


      Increasing evidence indicates that several MAPKs activated by phosphorylation negatively control cilia length while few studies focus on how MAPK dephosphorylation affects cilia length regulation, largely due to the unknown identity of the phosphatase(s) specifically involved in cilia length regulation. The authors set out to investigate the effect of BCI on cilia length control. BCI specifically inhibits DUSP1 and DUSP6, both of which are known MAPK phosphatase, and therefore may provide a unique opportunity to understand how MAPK pathway is controlled by specific phosphatase(s) activity in cilia length regulation.


      Overlooking some inconclusive results and oversimplified interpretations, I find the most striking findings are the BCI's effects including ciliogenesis, kinesin-2 ciliary dynamics and microtubule reorganization. I believe these findings have significant relevance to the stated goal (line 131) and conclusions (line 57) and readers may find them a good starting point for further investigation of the role phosphatases play in cilia length regulation.

      Cilia length regulation is a complicated mechanism that is affected by many aspects of the cell and functions differently in various systems. My field of expertise may be summarized by cilia biology, cilia length regulation, IFT, kinesin, kinases (MAPKs), microtubules. The membrane trafficking's role in cilia length regulation is somewhat unfamiliar to me. Additionally, the authors used a number of statistical tests and corrections in various assays. The nuance of these choices is not clear to me and neither explained to general readers.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In their manuscript, "ERK pathway activation inhibits ciliogenesis and causes defects in motor behavior, ciliary gating, and cytoskeletal rearrangement," Dougherty et al investigate how BCI, an activator of MAPK signaling, regulates ciliary length. Despite advances in our understanding of the structure and function of cilia, a fundamental question remains as to what are the mechanisms that control ciliary length. This is a critical question because cilia undergo dynamic changes in structure during the cell cycle where they must disassemble as they enter the cell cycle and must rebuild after cell division. This work contributes to a growing body of work to determine mechanisms that regulate cilia length.

      The authors use a well-established model system, Chlamydomonas, to study cilia dynamics. This work expands on previous findings from these authors that inhibition of MAPK signaling using U0126 lengthens cilia as well as other publications that implicate MAPK signaling in controlling ciliary length. However, the authors only observe a few significant phenotypes with other subtle trends, leaving the conclusion regarding the role of MAPK signaling murky. Furthermore, it is unclear through what mechanism BCI impacts ciliary length. Several issues must be addressed:

      MAJOR ISSUES

      1. The basis for this study is the use of the ERK activator BCI, which the authors show activates MAPK signaling. While the authors do use putative DUSP6 ortholog mutants to corroborate some of the phenotypes, the majority of the data (and conclusions) uses BCI. However, there may be off target effects and the authors do not address this limitation of the study. The authors only use 1 pharmacological tool to manipulate MAPK signaling, so it is unclear whether these ciliary disruptions are specifically due to increased MAPK. It is necessary to clarify the following questions about BCI action to interpret the results:
      2. ____a.____ What are off target effects of BCI? Does BCI impact proliferation? Why is the BCI phenotype of cilia shortening transient and dose dependent? Why does the phenotype of cilia length and regeneration capacity in Chlamydomonas differ from both ortholog mutants and hTERT-RPE1 cells? While we do mention following supplemental figure 1 that other MKPs could be the target for BCI, we also cite Molina et al., 2009 who showed specificity for BCI hydrochloride in zebrafish. BCI targets primarily DUSP6, but also exhibited some activity towards DUSP1. In this study, the authors had also used zebrafish embryos to check expression of 2 other FGF inhibitors, spry 4 and XFD, in the presence of BCI but found that their effects were not reversed. In addition, they checked the ability for BCI to suppress activity of other phosphatases including Cdc25B, PTP1B, or DUSP3/VHR and found that BCI could not suppress these phosphatases. BCI inhibition has previously been found to be more specific to MAPK phosphatases. In addition, we have previously confirmed that U0126 has a slight lengthening effect on Chlamydomonas which further implicates this pathway in cilium length tuning (Avasthi et al. 2012).

      While cell proliferation assays maybe provide more support for MAPK signaling, it does not clarify lack of off target effects that could also contribute to this same phenotype. We do provide a cell proliferation assay for RPE1 cells where we show that higher concentrations of BCI result in cellular senescence as well (Fig 1I).

      The BCI phenotype of cilia shortening is likely transient and dose dependent due to its effect on ciliary protein synthesis demonstrated in Figure 3J. The increase in drug likely increases its substrate binding to exert its effects on the cell faster, even if this includes off target proteins.

      In RPE1 cells, we are likely seeing differences in regeneration capacity potentially due to their different mechanisms of ciliogenesis (RPE1 cells partake in intracellular ciliogenesis where axonemal assembly begins in the cytosol whereas Chlamydomonas cells partake in extracellular ciliogenesis where axonemal assembly begins after basal bodies dock to the apical membrane), or it could be that we’re missing a delay in regeneration in RPE1 cells after waiting 48 hours for ciliogenesis. We do not check this process sooner. There may be a defect that cells overcome. Additionally, among ortholog mutants and RPE1 compared to BCI-treated wild-type Chlamydomonas, there indeed could be off target effects or the drug could be targeting all of these MKPs rather than just one. We will add this to the discussion for clarity.

      Reviewer #2 (Significance (Required)):


      see above

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      SUMMARY:

      In this study, the authors used a pharmacological approach to explore the function of ERK pathway in ciliogenesis. It has been reported that the alteration of FGF signaling causes abnormal ciliogenesis in several animal models including Xenopus, zebrafish, and mice. However, it remains elusive the molecular detail of how ERK pathway is associated with cilia assembling process. The authors found that the ERK1/2 activator/DUSP6 inhibitor, BCI inhibits ciliogenesis, highlighting the importance of ERK during ciliogenesis. Overall, this paper is well written, data are solid and convincing. This paper will be of great interest to many researchers who are interested in understanding ciliogenesis. The following comment is not mandatory requests but suggestions to improve the paper's significance and impact.

      MAJOR COMMENTS:

      - Combination of chemical blocker experiments were well controlled and data are solid. The authors are aware of the side effects of BCI, thus they carefully characterized the phenotypes of Mkp2/3/5 in Chlamydomonas. This reviewer wonders if the levels of ERK1/2 phosphorylation are activated in these mutants. Did the authors examine the levels of ERK1/2 phosphorylation in these mutants?

      While we do not include the data showing ERK activation in these mutants, we have checked pMAPK activation and found that it is not significantly upregulated in these mutants. This could likely be due to compensatory pathways preventing persistent pMAPK activation. For example, constant ERK activation can lead to negative feedback to regulate this signal for cell cycle progression (Fritsche-Guenther et al., 2011). The ERK pathway has not been fully elucidated in Chlamydomonas, but it is possible that these similar mechanisms are in place for MAPKs. We will include this data in the supplement.

      Reviewer #3 (Significance (Required)):


      Accumulated studies suggest that the FGF signaling pathway plays a pivotal role in ciliogenesis. Disruption of either FGF ligands or its FGF receptor results in defective ciliogenesis in Xenopus and zebrafish. On the other hand, FGF signaling negatively controls the length of cilia in chondrocytes that would cause skeletal dysplasias seen in achondroplasia. Therefore, there is strong evidence suggesting that FGF signaling participates in ciliogenesis in cell-type and tissue-context dependent manners. However, the detailed mechanism of the downstream of FGF signaling in ciliogenesis is still unclear. In this regard, this paper is beneficial for the cilia community to expand the knowledge of how ERK1/2 kinase contributes to the regulation of ciliogenesis.


      This reviewer therefore suggests that the authors may want to add more discussion to explain how their finding possibly moves the field forward to understand the pathogenesis of multiple ciliopathies.

      We will add a description of this to the discussion.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. If no revisions have been carried out yet, please leave this section empty.

      Reviewer 1:


      Major comment 4____

      A single panel in Fig 4A also can't support the shift in protein density in the TZ in line 317. As line 324 implies protein synthesis defect by BCI, the very minor (in amount and significance) reduction of the NPHP4 fluorescence should not be interpreted as any disruption at all to the transition zone. I suggest checking other TZ proteins such as CEP290 etc or leave this section out.

      Also, The additive effect from BFA and BCI treatment in Fig 5A suggests BCI affects cilia length independent of Golgi. The "actin puncta" and arpc4 mutant are not sufficiently introduced. And more importantly, how increase in the actin puncta explains the shorter cilia length caused by BCI while actin puncta are absent in arpc4 mutant with shorter cilia? Also, the Arl6 fluorescence signal "increase" is not significant in either time point. I suggest leaving this section out as well.

      We agree that one EM image cannot support a protein shift and have removed our observation in the text. However, we do see a statistically significant decrease in NPHP4 fluorescence in BCI treated cells which we consider a disruption in the sense that the structural composition is altered. We will change the word “disruption” to “alteration” for clarity. Though this is a minor defect, we believe it is still worth noting. We believe this data still adds to the model that though the EM-visible structure is unaltered, finer details within the transition zone are indeed altered and we cannot rule out that these smaller changes are not impacting protein entry into cilia. Awata et al. 2014 shows that NPHP4 is important for controlling trafficking of ciliary proteins at the transition zone, and its loss from the transition zone has been found to have effects in ciliary protein composition. Because we see decreased NPHP4 expression, we believe this is a notable finding as we see effects on the abundance of a protein which is known to affect ciliary protein composition and have therefore chosen to leave the data in the manuscript. We will adjust the language to most accurately describe our findings.

      We also agree with the interpretation that the additive effect seen from BFA and BCI treatment could suggest independent pathway collapse separate from the Golgi which we have mentioned in the manuscript.

      We have provided more information to introduce actin puncta and ARPC4 with regards to membrane trafficking. Bigge et al. 2020 shows that ARPC4, a subunit of the ARP2/3 complex which is an actin binding protein important for nucleating actin branches, has a role in ciliary assembly. ARPC4 mutants have repressed ability to regenerate their cilia. One feature they noticed in regenerating cells is the immediate formation of actin puncta which are reminiscent of yeast endocytic pits. This observation in addition to altered membrane uptake pathways in Chlamydomonas suggests that ciliogenesis involves reclaiming plasma membrane for use in ciliogenesis (because of the diffusion barrier preventing a contiguous membrane). Here, we incorporate this assay to assess the ability for the cell to reclaim membrane during BCI treatment and find that there is increased actin puncta. This could indicate that there is increased number of endocytic pits or alternatively that the lifetime of these pits is increased (perhaps due to incomplete endocytosis) such that we are able to detect more of them at a fixed point in time. While we cannot say which is happening here, we have previously found that these actin puncta are likely endocytic and needed to reclaim membrane for early ciliogenesis. An increase in these puncta may suggest dysregulated endocytosis in one way or another. ARPC4 cells cannot form the actin puncta in the first place, whereas we are seeing defects following puncta formation. We have taken out the Arl6 data.

      Major comment 6____

      Throughout this manuscript, the standard the authors used to interpret statistical significance is erratic. In a few instances, the threshold for p value is clearly indicated such as in Fig 1 legend. Though other times, much higher p values are considered differences. Here are some examples:

      SF 1C, p=0.1167 is considered "(mkp5) shorter than wildtype ciliary lengths" (also line 177 "SF 1C" instead of "SF 1D")

      Fig 3C, p=0.083 interpreted as "slightly less" in line 262 and possibly as "(KAP-GFP) not being able to enter (cilia)" in line 268

      Fig 3G, p=0.1087 is considered "not decrease after two hours" line 267

      SF 3C, p=0.2929 for mkp2 mutant (misuse of "orthologs" in line 352) is considered "fewer actin puncta compared to wild type cells" (line 352).

      SF 6B, p=0.1565for mkp3 mutant (line 421: misuse of "orthologs" and correct use of "ortholog mutants") is considered not be able to "fully reorganize their microtubules" (line 421).

      These instances sometimes serve as basis for major conclusions and should be clarified or more carefully characterized.

      We agree the interpretations are very erratic in places and greatly appreciate this detailed list making it easy to find and correct these interpretations. We have adjusted the text in the mentioned places to reflect these changes, and we have made a statement in the text and under statistical methods that say we consider p Reviewer 2:

      In multiple instances the conclusions are overstated, and the author must clarify the interpretation of the results to reflect the data presented. Here are some examples:

      • ____a.____ The conclusion that protein synthesis is disrupted is incorrect in two instances (line 258 and 275) as the experiments in figure 3 do not directly examine changes in synthesis (they look at cilia regeneration as a proxy). We show that KAP-GFP expression is not normal during regeneration at 120 minutes which suggests, in addition to the inability for cilia to grow in BCI, that synthesis is inhibited because this protein is not replaced. In addition, blocking the proteosome did not rescue this decrease in KAP-GFP expression indicating that this is not a matter of KAP-GFP protein being degraded rapidly. We use regeneration and KAP-GFP readout as a proxy for protein synthesis. We have clarified this in the text.

      • ____b.____ The conclusion that BCI disrupts membrane trafficking is too broad when the authors only examined trafficking of one membrane protein, Arl6. While we only looked at one membrane protein specifically, we assess other membrane trafficking paths. We looked at BCI vs. BFA to assess Golgi trafficking (Dentler 2010) in addition to formation of actin puncta which is used in Bigge et al. 2020 as an assay for membrane uptake from the plasma membrane for incorporation into cilia.

      • ____c.____ The conclusion that the transition zone is disrupted is too broad based on a decrease in the expression of one transition zone protein, NPHP4. We have changed the text to be more specific to NPHP4.

      Highlighting the overstatement, the conclusion of the header and figure caption on page 10 contradict one another. The manuscript states that "BCI partially disrupts the transition zone" (line 313) and that "The TZ structure is structurally unaltered with BCI treatment" (line 329).

      In the manuscript, we show that the EM-visible structure is indeed unaltered. Because we see a decrease in NPHP4 fluorescence, we concluded that while the EM-visible structure is unaltered, protein composition within the transition zone is altered which suggests that BCI partially disrupts the transition zone.

      Why is kinesin-2 the only target studied for ciliogenesis? Ciliogenesis is a complex process that involves many other critical proteins and investigating kinesin-2 alone is not sufficient to conclude why BCI prevents cilia assembly.

      We use kinesin-2 because it is the only ciliary anterograde motor in Chlamydomonas which is required for proper ciliogenesis. By assessing kinesin-2, we were able to address whether this protein alone was the cause for inhibited ciliary assembly (and we find that it’s not), whether its ability to enter was impacted (likely owing to defects in other protein entry), and we were able to use this protein to understand how its protein expression was affected. Because KAP-GFP is a cargo adaptor protein and interacts with IFT complexes and other cargoes, defects in this protein can have a wide range of implications. We agree and the data agree that kinesin-2 alone is not sufficient to conclude why BCI prevents cilia assembly. Because of this, we assessed other pathways including membrane trafficking and microtubule stabilization to better understand why we see defects in ciliary assembly. Certainly many other proteins are important in ciliogenesis and we hope that this study sparks further work in this area to identify additional causative explanations for impaired ciliogenesis upon MAPK activation..

      Tagged ciliary proteins are sensitive to disruptions in function and expression within cilia. It is important to include proper controls in the study using KAP-GFP Chlamydomonas cells to ensure that KAP-GFP maintains endogenous expression levels and normal function as untagged KAP. Furthermore, if this information is available through the resource where the cells were purchased, then this needs to be discussed.

      KAP-GFP expressing Chlamydomonas has previously been validated as described in Mueller et al., 2005. We will provide details in the text about validation of this strain.

      The authors need to provide clear explanations to a general audience of why this technique is used and how the authors reached the interpretations. There are several instances where the authors use techniques that are cited as fundamental papers in Chlamydomonas. Here are two examples:

      • ____a.____ It is unclear how the authors concluded that decreased frequency and velocity of train size shows that kinesin entry, specifically, is disrupted. We have expanded on this in the text. Please see response to reviewer 1, Major comment 2 above.

      • ____b.____ It was impossible to follow how the experiment where cells treated with cycloheximide could not regenerate their cilia following BCI treatment shows that BCI inhibits protein synthesis. We have adapted the text to be more clear regarding this experiment. In this experiment, we deplete the ciliary protein pool by forcing ciliary shedding two times. Following the first shedding, there is enough protein to assemble cilia to half length (Rosenbaum, 1969). We ensure that the protein pool is completely used up by inhibiting further ciliary protein synthesis with cycloheximide. For the second shedding event, completely new ciliary protein must be synthesized for ciliogenesis to occur which is why ciliogenesis takes much longer compared to a single regeneration where half of the ciliary protein pool still remains and can be immediately incorporated into cilia (SF 2C). In the presence of BCI, cilia cannot grow at all as expected; but 4 hours after BCI is washed out, we see ciliogenesis just beginning to occur which indicates that there is protein present for ciliogenesis to begin whereas in cells where BCI is not washed out, we do not see any ciliogenesis.

      The impact of BCI treatment on membrane trafficking as presented is confusing. BCI exacerbated the effects of BFA treatment on Golgi, yet the authors do not address that this could be an indirect effect of BCI or an off-target effect of BCI.

      This is addressed in the discussion (paragraph 4).

      The discussion section includes many interpretations of the results, but leaves the reader confused as to what the authors think might be happening. The manuscript would be far clearer if the authors would provide a working model for why BCI impacts cilia length. It is fine for this to be left for future work but, as the experts, the authors must have relevant thoughts to share with the field.

      Figure 7 provides a model with as much as we can conclude given the data; what we show is that BCI inhibits many different processes in the cell, but we do not necessarily show links between these processes to provide a complete working model of how these are all interconnected; we have provided a summary model that depicts the various, still disconnected processes that are inhibited by BCI. MAP kinases such as ERK have dozens of downstream targets both within and outside the nucleus. Ciliogenesis also is a complex process coordinating many cellular mechanisms. The intersection of these two seem to have a multi-fold effect that results in a dramatic ciliary phenotype through a combination of factors, however not one that fully explains the severity upon initial deciliation in BCI/MAPK activation. Further work is needed to identify the precise cause of completely inhibited cilium growth from zero length.

      MINOR ISSUES

      1. The title of the manuscript is inaccurate and overstates the pathway involvement in cilia. The authors do not directly show that ERK pathway activation causes the ciliary phenotypes due to the use of BCI, a drug that modulates ERK. We have adjusted the title to “The ERK activator, BCI, causes…”

      When discussing results of data that are not statistically significant it creates confusion to state that the results "increased/decreased slightly".

      We agree that references to statistics are inconsistent or confusing throughout the text and have adjusted these references accordingly.

      Reviewer 3:

      Major comment:

      - If the authors want to emphasize their finding is associated with MAP kinases, it would be also beneficial to examine other major MAP kinase pathways such as P38/JNK. If not, then this reviewer suggests revising the text as ERK through this manuscript to avoid confusions.

      Because the ERK pathway has not been fully elucidated in Chlamydomonas, we have refrained from using “ERK” as a descriptor because this particular MAPK shares equal identity with multiple MAPKs in Chlamydomonas. Further, BCI may be targeting more than one MAPK phosphatase resulting in the myriad phenotypes we have discovered. At this time, we lack a level of gene-level resolution to map to known MAPK pathways.

      • *

      4. Description of analyses that authors prefer not to carry out

      Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.


      Reviewer 1:

      Major comment 2____

      The claim that "BCI treatment decreases kinesin-2 entry into cilia" (line 236) is a misinterpretation of the data presented. The data indicates KAP-GFP have reduced accumulation in cilia, decreased IFT (anterograde) frequency, velocity and injection size associated with BCI treatment. Though as shown in Fig 1D and Fig 2C, cilia length is also shorter due to BCI treatment. Ludington et. al, 2013 showed a negative correlation of cilia length and KAP injection rate in various treatments that affect cilia length. It's essential to rule out that the KAP dynamics reported in the current manuscript is not an outcome of shortened cilia in order to claim as line 236 seems to suggest. One way to demonstrate specific effect by BCI would be to compare KAP dynamic in cilia with equal or similar length, either by only selecting the shorter cilia from wt or use other treatments that are known to decrease cilia length (chemicals, cell cycle, mutants etc.). Given the capability and resource represented in this manuscript, I don't expect a significant cost and time investment for these experiments.

      Ludington et al., 2013 shows that injection size decreases with increasing length. Our data show that the shorter length cilia have decreased injection size and rate inconsistent with the cause being due to shortened length alone. In other words, in figure 2C and 2G, we see decreased KAP-GFP fluorescence in shorter cilia as opposed to greater fluorescent signal in shorter cilia seen in Ludington et al., 2013. This data, in combination with the decreasing frequency of KAP-GFP entry overtime in figure 2E and decreased velocity in figure 2F support decreased kinesin-2 entry into cilia. If entry was unaltered, we would expect increased KAP-GFP fluorescence in the cilia over time in BCI-treated cells.


      Reviewer 2:

      The authors state that the decreased length of cilia following BCI treatment could be a result of reduced assembly or increased assembly. Disruptions to cilia assembly and disassembly are not mutually exclusive and both must be evaluated. The authors do not test whether cilia disassembly is disrupted in BCI treatment and therefore, cannot conclude that BCI solely disrupts cilia assembly.

      While effects on disassembly remains a possibility, the striking inability to increase from zero length upon deciliation and the effects on anterograde IFT through the TIRFM assays suggest an affect on assembly. There may be effects on disassembly and likely many other cilia related processes not investigated but we feel it remains accurate to conclude that assembly is affected by BCI treatment.

      Reviewer 3:

      - If time allows, in addition to examining NPHP4, it would be beneficial to examine other TZ/TF markers such as CEP164 to confirm if BCI partially disrupts the TZ.

      Given the known outcomes of NPHP4 loss in Chlamydomonas (Awata et al., …) in affecting ciliary protein composition, we suspect the changes in NPHP4 abundance at the transition zone will have a significant impact and agree it would be interesting in a follow up study to see how other transition zone proteins (particularly ones known to interact with NPHP4 or others critical for TZ function) are impacted following BCI treatment.


      MINOR COMMENTS:

      - I suggest moving supplemental figure 1 to the main figure (Fig. 1?) so that the readers appreciate the author's careful examination of BCI through this manuscript.

      Thank you for your suggestion and kind critique. We have included this data in the supplement for consistency with mutant data in all of the other supplemental figures.


    1. Overview Q&A Notebook Transcript INSTRUCTOR Jeff Toister Author, Consultant, Trainer Follow on LinkedIn RELATED TO THIS COURSE Learning Groups Show all Exercise Files (2) Show all Certificates Show all Continuing Education Units Show more Exam Start Exam Course details 1h 22m Beginner Updated: 11/18/2020 4.7 (12,712) View Jeff's LinkedIn NewsletterDo your customers feel valued? When they do, they keep coming back. When they don't, your business suffers. In this course, writer and customer service consultant Jeff Toister teaches you the three crucial skill sets needed to deliver outstanding customer service and increase customer loyalty. Learn how to build winning relationships, provide the right assistance at the right times, and effectively handle angry customers. He also shares ways to find out what your customers really think about your service, and use their feedback to improve. Learning objectives Explore how you can use customer surveys to build rapport. Name three ways you can use active listening to serve your customers more effectively. Identify the different types of needs that must be addressed in order to solve problems. Explain the benefits of taking ownership of a problem. Define “preemptive acknowledgment” and recognize its impact on customer service. List three types of attitude anchors and explain their differences. Skills covered Customer Loyalty Customer Service Learners 24,449 members like this content 537,649 people started learning CEU - Continuing Education Units (2 certifications available) National Association of State Boards of Accountancy (NASBA) Continuing Professional Education Credit (CPE): 3 Recommended NASBA Field of Study: Communications and Marketing Sponsor Identification number: 140940 To earn CPE credits the learner is expected to: Complete all videos and chapter quizzes Complete the final exam within one year from completing the course Score 70% or higher on final exam Glossary: see PDF file in the Exercise Files area Program Level: Basic Prerequisite Education: There are no prerequisites for this course. Advanced Preparation: There is no advance preparation required for this course. If you undertake this course for CPE credits, you can leave final comments in the Self Study Course Evaluation. LinkedIn Learning is registered with the National Association of State Boards of Accountancy (NASBA) as a sponsor of continuing professional education on the National Registry of CPE Sponsors. State boards of accountancy have final authority on the acceptance of individual courses for CPE credit. Complaints regarding registered sponsors may be submitted to the National Registry of CPE Sponsors through its web site: www.nasbaregistry.org Register here with LinkedIn Learning. For course refund policy, issue resolution, and additional info please see the LinkedIn User Agreement. For more information regarding administrative policies such as complaint and refund, please contact our offices at +1 650-687-3600. Project Management Institute (PMI)® PDUs/ContactHours: 1.75 LinkedIn Learning has been reviewed and approved by the PMI® Authorized Training Partner Program. This course qualifies for professional development units (PDUs). The PMI Authorized Training Partner logo is a registered mark of the Project Management Institute, Inc. To view the activity and PDU details for this course, click here. Related courses POPULAR 32m COURSE Course Customer Service: Problem Solving and Troubleshooting 293,029 learners Save POPULAR 27m COURSE Course Building Rapport with Customers 238,646 learners Save POPULAR 49m COURSE Course De-Escalating Conversations for Customer Service 278,035 learners Save POPULAR 23m COURSE Course Customer Service: Call Control Strategies 188,760 learners Save POPULAR 33m COURSE Course Creating Positive Conversations with Challenging Customers (2019) 275,662 learners Save Learner reviews 4.7 out of 5 12,712 ratings How are ratings calculated? 5 star Current value: 9,973 78% 4 star Current value: 2,159 17% 3 star Current value: 444 3% 2 star Current value: 44 <1% 1 star Current value: 92 <1% Olatunji Awesu 3rd Sales Team Lead July 25, 2022 Great course Helpful Report Ayanda Hlatshwayo Call Center Representative July 25, 2022 ... Helpful Report thobani mkhize agent July 25, 2022 very helpful Helpful Report Show more reviews Live office hours with experts Show all Show all upcoming events Jun 16, 10:00 AM EVENT Event Motivating customer service employees By: Jeff Toister Ask here to share with learners, experts and others Ask Looking for technical assistance (e.g. downloading certificates)? Visit Learning Help Question asked by Tye Locke Tye Locke Willing to help but are you? 5d More options for this question Copy link to question Report this post Where can I download the worksheet? From the video: Define outstanding customer service (00:38) 4 Answers Like Answer Add your answer here Add your answer here Answered by sadam arab sadam arab Student at alpha university 10h More options for this answer Report this post also I want download so how I can download Like Reply Answered by Sydney Sabelo Sydney Sabelo Risk Controller at Robor 1d More options for this answer Report this post PDF  is the best or recommended to download your worksheet with Like Reply Load more answers Question asked by Kufre Edet Kufre Edet Information Technology Specialist at Akwa Ibom State Agency for the Control of AIDS 1w More options for this question Copy link to question Report this post I cant find where to download the PDF files recommended in the course From the video: Create a plan (02:02) 2 Likes 1 Answer Like Answer Add your answer here Add your answer here Answered by Jeff Toister Jeff Toister Instructor Your service culture guide. 1w More options for this answer Report this post Hi Kufre. The exercise files are available to LinkedIn Learning subscribers. To download the files, navigate to the "Overview" tab and look for a link marked "exercise files" near the top. I'd recommend contacting LinkedIn Learning directly for technical assistance if you run into any more difficulty: www.linkedin.com/help/learning Like Reply Question asked by Sandip Kaur Badhesha Sandip Kaur Badhesha Passionate IT Analyst Looking for a Challenging Opportunity 1w More options for this question Copy link to question Report this post I can't find all the documents he suggests to Download in each Video. From the video: Implement techniques to build rapport (00:22) 1 Like 1 Answer Like Answer Add your answer here Add your answer here Answered by Jeff Toister Jeff Toister Instructor Your service culture guide. 1w More options for this answer Report this post Hi Sandip, The exercise files are available to LinkedIn Learning subscribers. They can be accessed by navigating to the course's Overview tab. Look for a link labeled "exercise files" near the top. I'd recommend contacting LinkedIn Learning directly for technical support if you run into any difficulties: www.linkedin.com/help/learning  -Jeff Like Reply Question asked by Lucas M. Ladeveze Lucas M. Ladeveze Surgeon Specialized Knee-Foot and Ankle -Specialized Sports Medicine - Profesional Football Coach - Professional Padel Coach - 2w More options for this question Copy link to question Report this post LEarning a lot! But I cannot find all the documents he suggests to Download in each Video.   From the video: Implement techniques to build rapport (00:23) 3 Answers Like Answer Add your answer here Add your answer here Answered by Jeff Toister Jeff Toister Instructor Your service culture guide. 1w More options for this answer Report this post Hi Lucas, I'm glad you're learning a lot so far! The exercise files are available to LinkedIn Learning subscribers. They can be accessed by navigating to the course's Overview tab. Look for a link labeled "exercise files" near the top. I'd recommend contacting LinkedIn Learning directly for technical support if you run into any difficulties: www.linkedin.com/help/learning  -Jeff Like Reply 1 Like Answered by Maha M. Maha M. Entrepreneurial professional with growth mindset, excellent interpersonal skills, problem-solving abilities. Successful at team-leading & building ,showcasing strong emotional intelligence & full filling business needs. 1w More options for this answer Report this post good content Like Reply 1 Like Load more answers Question asked by Marlene Ranallo Seelig Marlene Ranallo Seelig Recruiter 2w More options for this question Copy link to question Report this post Where are these downloads?  From the video: Implement techniques to build rapport (00:20) 1 Like 1 Answer Like Answer Add your answer here Add your answer here Answered by Jeff Toister Jeff Toister Instructor Your service culture guide. 2w More options for this answer Report this post Hi Marlene. The exercise files are available to LinkedIn Learning subscribers. They can be accessed by navigating to the course's Overview tab. Look for a link labeled "exercise files" near the top. I'd recommend contacting LinkedIn Learning directly for technical support if you run into any difficulties: www.linkedin.com/help/learning -Jeff Like Reply 1 Like Question asked by Charisa Chinyere Ndinojuo Charisa Chinyere Ndinojuo I am a professional freelancer in customer support, social media marketing, virtual assistant and data entry 1mo More options for this question Copy link to question Report this post I am done with watching all the video in this course and I still can't download the certificate, why? From the video: Keep your customers happy (00:28) 6 Likes 4 Answers Like Answer Add your answer here Add your answer here Answered by Ekemini Eyoh Ekemini Eyoh -- 3w More options for this answer Report this post I am not able to download the questions or try out the quizzes. Please how do I go about it,? Like Reply Answered by Quach T Dung Quach T Dung -- 4w More options for this answer Report this post Me too, I'm trying a lot but I can not get certificate Like Reply Load more answers Question asked by Patience Chekwube Patience Chekwube General virtual Assistant/ Data entry specialist/ lead generator 1mo More options for this question Copy link to question Report this post Please how do I download the learning plan worksheet.  Thank you From the video: What to know before watching this course (01:22) 1 Like 1 Answer Like Answer Add your answer here Add your answer here Answered by Patience Chekwube Patience Chekwube General virtual Assistant/ Data entry specialist/ lead generator 1mo More options for this answer Report this post Ok, I saw similar questions here and the answer to it. Have downloaded it but can't seem to open the downloaded file. What should I do Like Reply 1 Reply Commented by Jeff Toister Jeff Toister Instructor Your service culture guide. 1mo More options for this comment Report this post Hi Patience Chekwube . I'd recommend contacting LinkedIn Learning for technical support. www.linkedin.com/help/learning Like Reply Question asked by Eze Joy Eze Joy Student at Nnamdi Azikiwe University 1mo More options for this question Copy link to question Report this post Hello I have completed my course with a total of 73%in my exam but was not issued any certificate what will I do? From the video: Identify emotional needs (00:54) 1 Answer Like Answer Add your answer here Add your answer here Answered by Jeff Toister Jeff Toister Instructor Your service culture guide. 1mo More options for this answer Report this post Thanks for completing the course, Eze Joy . I hope it was very valuable to you! Here's a guide I found on the LinkedIn site for getting your certificate. It includes some troubleshooting steps. https://www.linkedin.com/help/learning/answer/a700836 Like Reply Question asked by Manar Fakhri Manar Fakhri MSc Master degree in Business Administration with Specialisation in International Marketing ( SMART CITY ) 1mo More options for this question Copy link to question Report this post I complete course and did the assessment and got 75% but no certification got !!!!!!!!!! From the video: Create a plan (00:01) 3 Likes 5 Answers Like Answer Add your answer here Add your answer here Answered by Esther Mutisya Esther Mutisya Operations Manager at Greenvale Hotel 1mo More options for this answer Report this post how do i download the pdfs? Like Reply 1 Reply Commented by Jeff Toister Jeff Toister Instructor Your service culture guide. 1mo More options for this comment Report this post Hi Esther. LinkedIn Learning subscribers can access the course worksheets by navigating to the Overview tab. There's a link near the top marked Exercise Files. Like Reply 1 Like Answered by Jeff Toister Jeff Toister Instructor Your service culture guide. 1mo More options for this answer Report this post Hi Manar. Thanks for completing the course! I found this guide on the LinkedIn Learning site with some troubleshooting steps for downloading certificates of completion: https://www.linkedin.com/help/learning/answer/a700836 If those steps don't help, I recommend contacting LinkedIn Learning directly for technical support: https://www.linkedin.com/help/learning While I don't work for LinkedIn Learning, and my technical skills are limited, I'd be happy to answer any questions you have about the course itself. -Jeff Like Reply 1 Like 3 Replies Load previous replies Commented by Jeff Toister Jeff Toister Instructor Your service culture guide. 32m More options for this comment Report this post Janh Delantar Here's what I shared with Manar. Hopefully, this will help you: I found this guide on the LinkedIn Learning site with some troubleshooting steps for downloading certificates of completion: https://www.linkedin.com/help/learning/answer/a700836 If those steps don't help, I recommend contacting LinkedIn Learning directly for technical support: https://www.linkedin.com/help/learning While I don't work for LinkedIn Learning, and my technical skills are limited, I'd be happy to answer any questions you have about the course itself. -Jeff Like Reply Commented by Janh Delantar Janh Delantar -- 1d More options for this comment Report this post How i can get my certificate i finish the course Like Reply Load more answers Question asked by Kingsley Chinemerem Kingsley Chinemerem Customer Relationship Officer at Sendme.ng 2mo More options for this question Copy link to question Report this post I'm not able to take the first lesson in the path. what could be the problem? From the video: Keep your customers happy 2 Likes 3 Answers Like Answer Add your answer here Add your answer here Answered by Dishita Peketi Dishita Peketi Customer Success Account Manager ( Sales Service Operations) CRM! 1mo More options for this answer Report this post Hello sir I am dishita I  couldn't able to open the exercise file which I  downloaded. Like Reply 2 Replies Commented by Bulelani lunathi Bulelani lunathi Student at Afedilem 1mo More options for this comment Report this post In other to be able to open your exercise file,i think you should go back to google out about how to open that type of file so that they will show you steps of opening the file you about to open. Like Reply Commented by Jeff Toister Jeff Toister Instructor Your service culture guide. 1mo More options for this comment Report this post Hi Dishita. I'd suggest contacting LinkedIn Learning's support team directly for technical assistance. These forums are focused on content-related questions, so your question might not get as fast and thorough a response as if you contacted support: www.linkedin.com/help/learning Like Reply 1 Like Answered by Sphamandla Hopewell Mchunu Sphamandla Hopewell Mchunu Cisco Network Academy IT. Computer Literacy. NACCW (Child and Youth Care).Department of Education (Learn Support Agent). Department of Health (TB screener and Lay counseling) Department 1mo More options for this answer Report this post Hi I have managed to finish all the quiz and exam but I cant access the certificate please help Like Reply 1 Reply Commented by Jeff Toister Jeff Toister Instructor Your service culture guide. 1mo More options for this comment Report this post Hi Sphamandla. I'd suggest contacting LinkedIn Learning's support team directly for technical assistance. These forums are focused on content-related questions, so your question might not get as fast and thorough a response as if you contacted support: www.linkedin.com/help/learning Like Reply Load more answers Show more Join the community of learners Project Management Institute (PMI) Prep - LI Learning Group 117,984 Members This group is for learners who are interested in Project Management Institute certification prep and want to connect, share, collaborate, learn, and teach in an open, safe environment. Learning is fun when done together. Let’s make it great and enjoy the conversation. *Note: By joining this group, your profile will be visible to other group members but your network will NOT be notified. Join National Association of State Boards of Accountancy (NASBA) - LinkedIn Learning Group 98,159 Members This group is for learners who are interested in NASBA and want to connect, share, collaborate, learn, and teach in an open, safe environment. Learning is fun when done together. Let’s make it great and enjoy the conversation. *Note: By joining this group, your profile will be visible to other group members but your network will NOT be notified. Join Graphic Design Tips & Tricks - LinkedIn Learning 30,908 Members This group is for learners who are interested in <topic> and want to connect, share, collaborate, learn, and teach in an open, safe environment. Learning is fun when done together. Let’s make it great and enjoy the conversation. *Note: By joining this group, your profile will be visible to other group members but your network will NOT be notified. Join Customer Service Skills & Management - LinkedIn Learning 17,488 Members This group is for learners who are interested in Customer Service Skills & Management and want to connect, share, collaborate, learn, and teach in an open, safe environment. Learning is fun when done together. Let’s make it great and enjoy the conversation. *Note: By joining this group, your profile will be visible to other group members but your network will NOT be notified. Join Show all Learning Groups 0 Notes taken Press Enter to save No notes saved yet Take notes to remember what you learned! Export your notes Get your notes for this course which includes description, chapters, and timestamps Download Filter results by video selected In this video Determine the value of outstanding customer service Selecting transcript lines in this section will navigate to timestamp in the video - When people think about outstanding customer service, there's often an employee who goes above and beyond to be the hero. Think about an experience where you received outstanding customer service. There's a good chance that an individual employee went above and beyond to make it happen. Have you ever wondered why they gave that extra effort? People go above and beyond, because they get something out of it. Even if it's just the satisfaction of knowing they made a difference. Let's explore some of the ways you, your coworkers and even your organization might benefit when you make the effort to provide outstanding customer service. You can download the value of outstanding service worksheet to help you, or just jot down some notes on a blank piece of paper. A good place to start is to look at how you personally benefit from providing your customers with service that exceeds their expectations. Make a list of what you gain from putting in that extra effort. It may help to think about a specific situation where you went out of your way to delight a customer. Here's some examples that might be on your list. Happy customers are easier to serve. You enjoy helping people, and you feel a sense of accomplishment when you are able to help someone else solve a problem. We can also have a positive impact on our coworkers when we personally provide outstanding service. Try making a list of ways your extra effort might benefit the people you work with. This time, it might be helpful to think about how you felt when one of your coworkers delivered outstanding service. Here's some examples that might be on that list. Your coworkers will have to fix fewer problems. Great service brings positive energy to the entire team, and you can be a positive role model to your colleagues. Customers often look at the people who serve them as representatives of the entire organization. As a third step in this exercise, make a list of benefits your organization receives when you personally provide outstanding customer service. Here are a few examples that might be on that list. Increased profits, retained customers, and positive word of mouth from customers who refer your organization to others. Hopefully this exercise helped you identify some reasons that providing outstanding service is important to you. Whenever you have a tough day, reread the list you've just created and reflect on why you worked so hard to help your customers. Customer service isn't always easy. But the important thing to remember is that you can choose to give that extra effort to be outstanding.

      customer service

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      • The authors claim that bin2 has a "confused" phenotype, which they define as high variability in shoot versus root lengths along with a low degree of response to water limitation. bin2-1 is a semi-dominant gain-of-function mutant, which can only be propagated as a heterozygote (homozygous individuals are viable, but don't produce seeds). There is no mention in the manuscript about genotyping or selection of homozygous bin2-1 individuals for the phenotyping assays. Could the high variability observed in fact be caused by the authors looking at a segregating population of bin2-1? * By propagating plants under optimal growth conditions over > 4 months at the TUMmesa ecotron, we were in fact able to obtain over 24 individual homozygous bin2-1 plants. We distinguish homo- and heterozygous seed by (i) adult phenotype (ii) segregation in the next generation (iii) root:shoot ratios from dark-grown seedlings on plate and (iv) sequencing of the TREE domain (as shown in Fig. 2e). Therefore, we are sure to have used only homozygous mutants in our analysis. This is now specified in the supplementary method S5.

      *The authors state that bin2 mutants had considerably more severe phenotypes than other BR biosynthesis, perception, or transcription factor mutants. This is like comparing apples to oranges, as the set of mutants they've examined consists of gain-of-function and partial loss-of-function alleles. Null alleles for BR biosynthesis (e.g. cpd, dwf4), perception (bri1brl1brl3 triple mutants) and transcription factors (bzr1bes1beh1-4 sextuple mutants) are described in the literature and would need to be tested before arriving at such a conclusion. *

      This is an important point and the nature of all alleles was and still is clearly outlined in Table S1 “Lines used in this study”. We have obtained and propagated bri1brl1brl3 triple mutant seed from Christian Hardtke (Kang et al., 2017), as well as null cpd alleles from NASC and these now complement or replace det2-1 and bri1-6 in our analysis. We compare null alleles, semi-dominant or dominant or higher order null alleles with each other. To make these comparisons clear we have highlighted these different allele types in the manuscript as depicted in the table, with null in regular font, semi-dominant or dominant in bold and higher order mutants underlined. This is described in Table S1 and in the figure legends, where applicable. We have not been able to obtain and propagate enough seed in the period of review to extend the analysis to sextuple transcription factor mutants. Therefore, we have removed the comparison between brassinosteroid mutants and now refer to the importance and role of the brassinosteroid pathway in general and, more specifically, to BR signaling rather than to BIN2.

      *For most of the phenotyping experiments a "RQ ratio" is presented. This is the ratio adjustment of the mutant/ratio adjustment of WT. While this derived quantity is useful for interpretation, we're missing plots of the raw data, and particularly those that show the underlying distribution of data points. *

      We understand that the RQratio (Fig. 4e) value is a step removed from the raw data. Please note that we also show the RQshoot (Fig. S8a) and the RQroot (Fig. S8b) in the supplement. We now depict violin plots in Fig. 4a-c and Fig. S7 as a best representation of the raw data, as follows

      Results page 10: “The violin plots compare organ length distributions in mutants versus the corresponding wild-type ecotype, which depicts dwarfism in some brassinosteroid mutants. It is also apparent that wild-type (Col-0) root length varies under water-deficit in the dark (Fig. S7). Although we have optimized protocols for PEG plates to the best of our ability, there is still a lot-to-lot and plate-to-plate variation. This emphasizes the need for normalizing each mutant line to its corresponding wild-type ecotype on the same (PEG) plate in the same experiment. To this end, the response to water stress in the dark was represented as a normalized response quotient (RQ), which is an indication of how much the mutant deviates from the corresponding wild type (Fig. 4e; see methods).”

      The RQhypocotyl, RQroot and RQratio are a necessary consequence of the variance in the data, and we consider them to be the most relevant metrics. Representative experiments were chosen from at least three replicates on the bases of RQ and P values (as specified in the legends of Fig. 3 and Fig. S10).

      Root growth involves both cell division in meristematic cells at the tip of the root and subsequent elongation as cells exit the meristem and begin to differentiate. The authors claim a nine-fold difference in CycB1,1:GUS in the root meristem in dark vs darkW, however their images show similar CycB1,1:GUS expression patterns. Furthermore, the meristems of darkW are actually smaller than dark, which would be unexpected if cell division *was increased. *

      We have reviewed the raw data again, applying blinding to avoid bias, and chosen a more representative image for the dark; the mitotic indexes are represented in a violin plot (Fig. 6c) to better show the distribution of datapoints. The conclusions are unchanged. We reimaged the wild-type under light, dark and darkW, specifically focusing on meristem properties and on final cell length. The results are presented in Fig. 6, Fig. S14, Fig. S15 and described as follows:

      Results page 14:

      “It is generally accepted that root growth correlates with the size of the root apical meristem (RAM; Beemster and Baskin 1998). Meristem size was assessed by computing the number of isodiametric and transition cells (González-García et al., 2011; Verbelen et al., 2006; Method S8). In addition, we applied a Gaussian mixed model of cell length to distinguish between short meristematic cells and longer cells in the elongation zone (Fig. S14; Fridman et al., 2021). Meristem size was shortest under water deficit in the dark (Fig. 6a; Fig. S15a,b) and, surprisingly, did not correlate well with final organ length (Fig. 1c; Fig. 6g). “

      Discussion page 16:

      “it appears counterintuitive that meristem size and organ length do not correlate in our conflict-of-interest scenario. Questions arise as to why the meristem is smaller under water deficit in the dark even though the mitotic index is higher than in the dark, and how growth is promoted under our additive stress scenarios. An important difference between our conditions and those described by others is that we germinated seed under limiting conditions in the dark in the absence of a carbon source… When water stress was applied in the dark, the mitotic index increased, but the newly produced meristematic cells immediately elongated, thereby exiting the meristem. As a consequence, meristem size remained small despite the increased number of mitotic cells. It appears that what our study shows is a novel paradigm for root growth under limiting conditions, which depends not only on shoot-versus-root trade-offs in the allocation of limited resources, but also on an ability to deploy different strategies for growth in response to abiotic stress cues.”

      We are not aware of any other study that has addressed root growth under water deficit in the dark and in the absence of a carbon source.

      • In addition, the authors claim that the longer root length in dark water stress was at least in part due to increased elongation (Fig. 7c). Elongation was only assessed by looking at the first elongating cell (~10-14um) and the differences found are on the order of magnitude of ~2um, but final cell size in Arabidopsis roots often reaches several hundred um. Therefore, a comparison of final cell size would be more appropriate. *

      Results page 14:

      “mature cell length… was highest in the dark, the condition with the shortest roots (Fig. 6b). Thus, neither meristem size nor mature cell length account for the fold-change in final organ length (Fig. 6g).”

      *Finally, the authors phenotype plt1/2 double mutants and show that they fail to elongate in response to water limitation. Their interpretation is that this supports a centralized control model for the root apical meristem. PLT1/2 are important determinants of meristem function and are necessary to maintain stem cell identity. Given the strong phenotype of plt1/2 double mutants it is not surprising that they are unable to elongate in response to this stimulus. This does not necessarily indicate that only the RAM controls root growth, but rather that functional stem cells are required for root growth, which also involves subsequent steps such as cell elongation. *

      This is an important point and we thank the reviewer for pointing it out. We now write:

      Results page 15:

      “Taken together, the cell length and anisotropy curves (Fig. 6) and genetic analyses (Fig. 6; Fig. S15f; Fig. S16) suggest that root length under our different environmental conditions is regulated by (i) the mitotic index, (ii) the timing of cell elongation or of exit from the meristem and (iii) cell geometry. We also conclude that these are differentially modulated to account for increased root length under different environmental conditions (Fig. 6c-e).”

      We also modulate the conclusion and model (Fig 7c) to state that RAM function accounts “in part” for root growth. However, it is to be noted that mature cell length in our study did not correlate with root length (Fig. 6b, 6g). Our conclusion is now reached not solely based on plt1plt2 but also on a careful and quantitative cellular analysis of the root apical meristem in the wild-type and in bin2-3bil1bil3 mutants. The major contribution of our study, however, is the difference between the different conditions, and the ability to respond to stimulus.

      *Reviewer #1 (Significance (Required)): *

      * While the study system and some of the findings in this manuscript are interesting, there are major flaws in the authors' primary claims. *

      Contested claims have been (i) deleted where unessential to the storyline or (ii) substantiated by independent methods.

      *Reviewer # 2 *

      1. I recommend to exchange shoot for hypocotyl when hypocotyls were examined to avoid to confuse the readers. We thank the reviewer for pointing this out and have exchanged shoot for hypocotyl throughout.

      2. The authors have chosen SnRK2 (and should also indicate it in all Figures as SnRK2, to not confuse the readers with SnRK1), and implement ABA signaling in parallel to BR action, but this must be proven in higher order mutants of both pathways, at the moment the results are to preliminary to allow conclusions. *

      We concur with the reviewer that higher order mutants between the BR and ABA pathways would be required to make this claim. We also concur that this would require numerous generations and therefore that it does not lie within the scope of this manuscript.

      • When the authors are interested in shoot dominance/photosynthetic activity, why didn't they look on snrk1 mutants, which are known to regulate those processes. *

      The issue of energy signaling is a key one, and we mention this in the final “perspective” paragraph of the discussion (p. 18) as follows:

      “As a limited budget is an essential component of our screen conditions, the role of energy sensing and signaling (Baena-González and Hanson, 2017) in growth tradeoffs will need to be elucidated.”

      • In Fig6d the authors propose a sketch of the mechanism, but the data of this study don't show direct interaction of the pathways and as indicated in the figure text parts of the information are taken from other papers, I recommend to remove this sketch or shift it to the supplements. * We concur with the reviewer and have deleted former panels 6d, 6e and 6f as well as reference to the mutants these included. We now focus on the BR pathway, as discussed below.

      *To discriminate the role of downstream BR signaling events from other roles of BIN2, I suggest to complement the data with pharmacological experiments (eBL or bikini where appropriate), and if possible to implement phenotyping of OE lines. *

      In response to this comment, we attempted bikinin experiments. Unfortunately, it is difficult to germinate seed on bikinin and seedlings grow poorly on this shaggy-like kinase inhibitor. As the assay relies on seed germination rather than on seedling transfer, applying bikinin was suboptimal. Because of the requirement for germination in the dark, and in lieu of eBL or PPZ or a combination thereof, we now include a null allele of a BR biosynthesis mutant, cpd, in Fig. 3b, to replace the leaky det2-1 mutant we had previously used.

      How many independent ko lines were tested, can the authors exclude that the BR independent phenotype indeed corresponds to BIN2 activity and not to a off target effect.

      Four independent bin2 mutants (B1, bin2-1, ucu1, dwarf12) were analyzed in our study. In total, 83000 M2 seed were used in our forward genetic screen; of these and for BIN2 the B1 line is the one we rescreened, mapped and characterized. We complemented B1 with bin2-1 and ucu1 alleles and compared it to bin2-1, ucu1 and dwarf12 alleles at the BIN2 locus; these three published mutant lines exhibited the same behavior as B1, including semi-dominance and phenotypes under single versus multiple stress conditions (Fig. 2c cf Fig. 3d; Fig. S6). Fine mapping (Fig. 2d), segregation analysis (Table S2), allele sequencing (Fig. 2e), backcrossing, outcrossing and complementation analysis provide independent lines of evidence that B1 is a BIN2 allele. Please note that the conclusions regarding BIN2 in this manuscript are based not on B1 but on the published bin2-1 and bin2-3bil1bil2 lines.

      We write results page 10:

      “We complemented B1 with bin2-1 and ucu1 alleles and compared it to bin2-1, ucu1 and dwarf12 (Perez-Perez et al., 2002; Choe et al., 2002) alleles at the BIN2 locus; these three published mutant lines exhibited the same behavior as B1, including semi-dominance and partial etiolation.”

      *I further recommend to exchange the pictures in Fig7a showing BRI1-GFP to pictures showing fewer cells, but with higher resolution. *

      We now show higher resolution images in Fig. 7b.

      • Regarding the implementation of photoreceptor mutants and the claim that photoreceptors are more abundant in shoot, I want to point out that the situation is more complex, as the root also reacts differently to light of different quality and quantity, with different responses in the meristem, by inhibiting cell proliferation, or in the elongation zone by triggering negative phototropism. this should be corrected in the text. *

      We are aware that light, especially when Arabidopsis is grown on media, is perceived by photoreceptors within the root system. Phototropic growth would not have affected measurements of root length as measurements were performed in ImageJ with the freehand tool. This is described in the methods on page 6, and in the supplementary method S5. For the model, we have now modulated our discussion as follows:

      Discussion p. 16-17:

      “ we postulate that a hypocotyl to root (basipetal) signal coordinates trade-offs in organ growth in response to light (Fig. 7c green arrow). However, and even though photoreceptors are considerably more abundant in the hypocotyl than in the root (van Gelderen et al., 2018), it needs to be borne in mind that photoreceptors in the root could be playing a role in root responses to light or to darkness (Mo et al., 2015).”

      *The data and methods are presented in a clear and sufficient way, as well as the statistical analysis. *

      We thank the reviewer for this positive assessment.

      *Altogether, the hypothesis and work amount are worth to be recognized, but the manuscript also resembles partially more a review and I would suggest to shorten those parts in the manuscript, reduce the amount of described lines and focus strictly on the BR pathway, in response to the environmental changes. Before implementing photoreceptors and ABA/SnRK2 pathway into the story to either test higher order mutants between the signaling pathways of interest or come up with a pharmacological screen connecting the data. Therefore I suggest to reduce the amount of mutants investigated and focus on BIN2 action, implementing also a pharmacological screen to track a fluorescent tagged BIN2 upon the mentioned treatments. And if possible to add proteomics and phosphoproteomics to understand better what changes are undergoing in the bin2 mutant vs WT upon stress. *

      We thank the reviewer for suggesting that we “focus strictly on the BR pathway, in response to the environmental changes”, as this has truly supported us in tightening the story line.

      We have removed the sections of the manuscript that resembled a review and focus entirely on the BR pathway, with additional or tighter mutants. We also look at BIN2 more closely and at a cellular level, with SEM micrographs for the hypocotyl and CSLM for the root tip. The BIN2 interactome on BIOGRID comprises 36 well annotated interactions (https://thebiogrid.org/12898/summary/arabidopsis-thaliana/bin2.html), of which 2 are documented by multiple lines of evidence and 27 are from low throughput studies. Adding adequately validated interactions to this exceeds the scope of this manuscript. Furthermore, as we no longer make the claim that BIN2 mutants are the most severely impacted (see response to reviewer #1), BIN2 is no longer the primary focus of this study; we now refer more loosely to the BR pathway, or to facets thereof referred to as BR biosynthesis, perception, signaling or BR-responsive gene expression. We have also updated and extended the reference list to include references on light perception and energy sensing or signaling. Phosphoproteomics is an important suggestion that we have also taken into the perspective.

      In brief, the manuscript has a new focus on what we consider is its true contribution: a cellular analysis of cell division, elongation and anisotropy in the wild type and in BR mutants under resting or additive stress conditions.

      *Reviewer #3 *

      1. *My major concern is that in the search of a decision mutant the authors performed the first screening not under 'a conflict of interest' scenario but under dark conditions. Can the authors explain the reasons behind this more clearly? * The reason we did not use the dark water stress condition as an initial but as a secondary screen is the variability of the response. In the new violin plots (Fig. 4a-c; Fig. S7), the variance especially in root length can be seen to be considerably greater in darkW than in dark even for the wild-type. This is why we initially screened individual M2 seed in the dark and then rescreened M3 populations under darkW conditions. Due to the relatively high variance, all conclusions in the manuscript are drawn on populations of seedlings rather than on individuals.

      We write in the results section on page 9:

      “We initially screened in the dark because the high variance in root growth under water deficit in the dark in the wild-type (see below) would obscure the distinction between putative mutants versus stochastically occurring wild-type seedlings with short roots under darkW.”

      • Related to above, the role of the BR pathway in etiolation has been well established with the prominent constitutive photomorphogenesis phenotypes of BR related mutants; since both bin2 alleles are impaired in light responses this mutant may behave in dark vs darkW, like a wildtype plant in light vs. lightW (maybe also partially as shown in SFig. 5a). However, the authors show that the growth tradeoff was not evident under light conditions (Fig 2). I think to conclude that bin2 is a decision mutant it requires more evidence to excluded that a defect in efficient sensing and signaling of dark conditions are not the primary source of the 'confused' phenotype. In addition to the phenotype in SFig. 5a where light responses are attenuated in B1 when compared to Wt, a comparison of gene expression analysis of some established light regulated genes could help to show that bin2 is able to efficiently sense the absence of light. *

      This is an important point. We have looked at the expression levels of the light responsive gene LHCB1.2 via qPCR in wild-type Ws-2 versus bin2-3bil1bil2. The data show that the gene expression is light-regulated in bin2-3bil1bil2 seedlings (Fig. S12) and are described in the Results on page 13.

      In addition, Fig. S10 and Fig. S11 are dedicated to a careful analysis of light responses in all the BR pathway mutants we analyze. In Fig. S10d, bin2-1 can be seen to have a significant (P-value We write, in the Results on page 13.

      “Interestingly, the BR mutant lines with the strongest etiolation phenotypes (cpd and bri1-116brl1brl3, Fig. S11a,b) in the dark were not the ones with the strongest deviation from the wild-type under water deficit in the dark (Fig. S8).”

      3. Cells that fail to elongate in the dark may cannot - or only to a limited extent - reduce further their cell length in the darkW conditions. Since BR-mutants fail to expand hypocotyl cells in the dark, an analysis of the hypocotyl epidermis cell length in bin2 mutants compared to wt in light vs dark vs darkW (as in Fig. 8c) could be a feasible experiment to exclude that the general BR-related cell elongation defects led to the confused phenotypes of this mutant.

      This is an excellent suggestion and we thank the reviewer for pointing it out. Accordingly, bin2-1 mutants were imaged via scanning electron microscopy (SEM) and cellular parameters assessed. We also investigated root meristem properties in bin2-3bil1bil2, which had the most aberrant root response to water stress in the dark (Fig. 3e; Fig. S8b). Our new observations are described in Fig. 5, Fig. 6h-j, Fig. S16 and in the results on pages 13-15 as follows:

      “To explore whether general BR-related cell elongation defects led to the confused phenotypes of some BR pathway mutants, we analysed bin2-1 mutants, which were among the most severely impaired hypocotyl response to water stress in the dark (Fig. S8a). The data show a most striking impact of bin2-1 on growth anisotropy, assessed in 2D as length/width (Fig. 5f). Indeed, in a comparison between dark and dark with water stress (darkW), the anisotropy of hypocotyl cells decreased considerably in the wild type (Fig. 5c), but showed no adjustment in bin2-1 (Fig. 5f). Cell length alone showed the elongation defect typical of bin2-1 mutants, with a much greater deviation from the wild type under darkW than under dark or light conditions; nonetheless, there was a significant length adjustment to water stress in the dark, even in bin2-1 (Fig. 5e). These observations suggest that the impaired bin2-1 hypocotyl response can be attributed to an inability to differentially regulate cell anisotropy in response to the simultaneous withdrawal of light and water. ….

      Meristem size and mature cell length followed the same trends in a comparison between bin2-3bil1bil2 (Fig. S16a, S16b) and the wild type (Fig. 6a, 6b), but the extent of elongation in cells proximal to the QC differed (Fig. S16c). Indeed, bin2-3bil1bil2 length and anisotropy curves lacked the steep slopes characteristic for darkW in the wild type (compare the green arrows in Fig. 6d, 6f & 6j to the purple arrows in Fig. 6j & Fig. S16c). We conclude that bin2-3bil1bil2 mutants fail to adjust their root length due to an inability to differentially regulate the elongation of meristematic cells in the root in response to water stress in the dark.”

      • The experiments with the BR-deficient and signaling mutant and the bypass mutant may suggest that BR hormone is playing a relative minor role in the 'decision activity' of BIN2. bri1-6 was described to respond like wildtype (page10 line 6-8). Since this seems because of normal root responses in dark vs. darkW (Fig. 5) it could also be caused by the role of BRL1 and BRL3 in root drought responses (Fabregas et al., 2018). To verify if functional BRL1 and BRL3 in bri1-6 could cause the root response to water stress an additional experiment with bri1,brl1,brl3 triple mutant is required; In my opinion this is very important to state if the BR input is at all required for BIN2 signal integration or not. *

      We have extended our analysis to include bri1brl1brl3 lines (Kang et al., 2017). These are dwarf mutants, yet able to respond to water stress in the dark with reduced hypocotyl and increased root growth (Figure panel former 5c replaced new Fig. 3c, shown left). Note that the lines have a null bri1-116 allele and segregate (bri1-/+ brl1-/- brl3 -/-)quite clearly, as was verified by propagating seedlings on plate after the scan on day 10 (Supplementary Method S5).

      ***Minor comments:** *

      *5. The authors separate conceptually growth tradeoffs in sensing, signaling, decision making and execution processes. A clearer explanation of the expected phenotypes from mutants in only decision making with and without stress would be interesting to add (page 8)? *

      We have now moved up phya phyb cry1 cry2 quadruple photoreceptor mutant and write:

      Results on page 9

      “Perception mutants would fail to perceive light or water stress; a good example of this is the phya phyb cry1 cry2quadruple photoreceptor mutant, which had a severely impaired light response (Fig. S4d), but a “normal” response to water stress in the dark (Fig. S4e). In contrast, execution mutants may have aberrantly short hypocotyls or roots that are nonetheless capable of differentially (and significantly) increasing in length depending on the stress conditions. Decision mutants would differ from perception or execution mutants as they would clearly perceive the single stress factors yet fail to adequately adjust their hypocotyl/root ratios in response to a gradient of single or multiple stress conditions. Failure to adjust organ lengths would be seen as a non-significant response, or as a significant response but in the wrong direction as compared to the wild-type. We thus used organ lengths, the hypocotyl/root ratio and the significance of the responses as decision read outs. We specifically looked for mutants in which at least one organ exceeded wild-type length under darkW.“

      Later in the results on page 11 and in the legend to Fig. 4 we pick up on this as follows:

      “For bin2-1, the response to water stress in the dark was severely impaired: the hypocotyl and root responses were non-significant …bin2-3bil1bil2 mutants fit the above definition of decision mutants as they have a significant root response but in the wrong direction as compared to the wild-type, as denoted by red asterisks (Fig. 3e)…

      Figure 4. … bin2-3bil1bil2 mutants qualified as decision mutants on 3 counts: (i) failure to adjust the hypocotyl/root ratio to darkW (the ratio for darkW is the same as for dark in panel c), (ii) low or non-significant P-value (see panel f below) and (iii) one organ (here the hypocotyl in panel a) exceeded wild-type length under darkW.”

      Line 26 page 17: BR responses in the epidermis of the hypocotyl have been shown to be already sufficient to control hypocotyl growth (Savaldi-Goldstein et al 2007), showing that not all cells of the hypocotyl need to receive the signal (at least in the case of brassinosteroids) We have deleted the sentence because it is too speculative. However, the issue of different tissue layers is now mentioned in the perspective on page 18, as follows:

      “3D imaging will be required to assess the impact of abiotic stress and/or of BR signalling on different cell files or tissue layers in the root (see Hacham et al., 2011; Fridman et al., 2014; Fridman et al., 2021; Graeff et al., 2021). .”

      Because of the importance of distinguishing between different cell files and cell layers, we have now removed the confocal images of BRI1-GFP under the different environmental conditions (formerly Fig. 7a); this needs to be extended to a 3D analysis, which is not within the scope of this manuscript.

      1. *Page 6 Line 11: In the volcano blots the mean RQ ratio is shown in Fig. 6c and 6f. *

      We thank the reviewer for pointing this out, we had accidentally written median RQratio, this has been rectified in the results text.

      *Some parts of the ms could be shortened and the amount of Fig. could be reduced. Fig. 1-3 could be merged as one figure showing the optimal conditions to analyze tradeoffs in shoot vs. root growth and all the conditions not suitable could be supplementary figures. *

      We concur with the reviewer and have merged the first three figures as suggested. Reviewer #2 has also requested that we slim the manuscript and all reviewers request that we strengthen our conclusions on the brassinosteroid pathway mutants. To reduce the number of figure panels, we have removed the analysis of all mutants that are not in the BR pathway, with the exception of the quadruple photoreceptor mutant in Fig. S4d,e and plethora mutants in Fig. S15. Nonetheless, incorporating the new data generated in response to reviewer comments leaves us with 7 main and 16 supplementary figures.

      *In the ms several experiments are described as 'screen' this is confusing with the forward genetic screen that was performed. *

      This is indeed ambiguous. We now use the terms “single versus multiple stress conditions/additive stress/conflict-of-interest scenario ” versus “forward genetic screen”.

      *Reviewer #3 (Significance (Required)): *

      * Mechanisms how growth trade-offs between multiple stresses are controlled are highly interesting. Growth vs. biotic stress tradeoffs have already been investigated and were found to be interdependent with light (Leone et al. 2014; Campos et al 2016; Fernandez-Milmanda et al. 2020) and hormone signaling (Lozano-Duran and Zifpel et al., 2016 and Ortiz-Morea et al 2020; van Butselaar and van den Ackerveken, 2020). Less is known about growth tradeoffs between two abiotic stress responses (Bechtold and Field, 2018; Hayes et al., 2019). The separation of root meristem growth and cell expansion in the hypocotyl is interesting. Whether the two directional root-to-shoot and shoot-to-root signals are independent or whether they may employ the same mechanism with a different output remains open. Different sensitivities of organs and cell types to BRs have for example been reported (Müssing et al. 2003 and Fridman et al. 2014). The findings that BIN2 most likely act to integrate multiple signals is in line with the reported roles of BIN2 to crosstalk with several pathways (reviewed by Nolan et al. 2020). In my point of view, it remains to be strengthened if this is through 'decision making' and not through signaling and execution. I think if the authors carefully separate the defects in bin2 this work will be interesting to many plant biologists. * We thank the reviewer for highlighting references we had not referred to in the former draft. The references pertaining to the growth versus defense trade-off are now included in the introduction (page 3) and the ones on abiotic stress factors in the Discussion on page 18:

      “In addition to its role in light and drought responses… BIN2 has been implicated in regulating hypocotyl elongation in response to far-red light and salt stress (Hayes et al., 2019). Studies on responses to abiotic stress factors have typically addressed growth arrest or tradeoffs between growth and acclimation (Bechtold and Field, 2018). Indeed, root growth is inhibited by, for example, phosphate deprivation or salt stress (Balzergue et al., 2017; West et al., 2004). Recent efforts have addressed strategies for engineering drought resistant or tolerant plants that do not negatively impact growth (Fàbregas et al., 2018; Yang et al., 2019). In contrast to other studies, here we look at two abiotic stress factors that promote organ growth. Indeed, hypocotyl growth is promoted by darkness or low light and primary root growth by water deficit in this study.”

      We emphasize the above point about decision making in the discussion. In the in the introduction and early on in the results we introduce conceptual frameworks for decision making. Yet after a forward genetic screen and mutant characterization, we revise this in the Discussion on page 18 as follows:

      “In the judgement and decision-making model for plant behaviour put forth by Karban and Orrock (2018), signal integration might be considered integral to judgement. ….Whether judgement and decision making can be distinguished from each other empirically remains unclear. As BR signalling regulates cell anisotropy and growth rates in the hypocotyl and root apical meristem, it may play a role not only in signal integration but also in the execution of decisions (or in an implementation of the action; González-García et al., 2011; Vilarrasa-Blasi et al., 2014). Thus, this study does not enable us to empirically distinguish between decision making on the one hand and signalling and execution on the other.”

    1. https://niklas-luhmann-archiv.de/bestand/zettelkasten/zettel/ZK_2_SW1_001_V

      One may notice that Niklas Luhmann's index within his zettelkasten is fantastically sparce. By this we might look at the index entry for "system" which links to only one card. For someone who spent a large portion of his life researching systems theory, this may seem fantastically bizarre.

      However, it's not as as odd as one may think given the structure of his particular zettelkasten. The single reference gives an initial foothold into his slip box where shuffling through cards beyond that idea will reveal a number of cards closely related to the topic which subsequently follow it. Regular use and work with the system would have allowed Luhmann better memory with respect to its contents and the searching through threads of thought would have potentially sparked new ideas and threads. Thus he didn't need to spend the time and effort to highly index each individual card, he just needed a starting place and could follow the links from there. This tends to minimize the indexing work he needed to do regularly, but simultaneously makes it harder for the modern person who may wish to read or consult those notes.

      Some of the difference here is the idea of top-down versus bottom-up construction. While thousands of his cards may have been tagged as "systems" or "systems theory", over time and with increased scale they would have become nearly useless as a construct. Instead, one may consider increasing levels of sub-topics, but these too may be generally useless with respect to (manual) search, so the better option is to only look at the smallest level of link (and/or their titles) which is only likely to link to 3-4 other locations outside of the card just before it. This greater specificity scales better over time on the part of the individual user who is broadly familiar with the system.


      Alternatively, for those in shared digital spaces who may maintain public facing (potentially shared) notes (zettelkasten), such sparse indices may not be as functional for the readers of such notes. New readers entering such material generally without context, will feel lost or befuddled that they may need to read hundreds of cards to find and explore the sorts of ideas they're actively looking for. In these cases, more extensive indices, digital search, and improved user interfaces may be required to help new readers find their way into the corpus of another's notes.


      Another related idea to that of digital, public, shared notes, is shared taxonomies. What sorts of word or words would one want to search for broadly to find the appropriate places? Certainly widely used systems like the Dewey Decimal System or the Universal Decimal Classification may be helpful for broadly crosslinking across systems, but this will take an additional level of work on the individual publishers.

      Is or isn't it worthwhile to do this in practice? Is this make-work? Perhaps not in analog spaces, but what about the affordances in digital spaces which are generally more easily searched as a corpus.


      As an experiment, attempt to explore Luhmann's Zettelkasten via an entryway into the index. Compare and contrast this with Andy Matuschak's notes which have some clever cross linking UI at the bottoms of the notes, but which are missing simple search functionality and have no tagging/indexing at all. Similarly look at W. Ross Ashby's system (both analog and digitized) and explore the different affordances of these two which are separately designed structures---the analog by Ashby himself, but the digital one by an institution after his death.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      __Manuscript number: __RC-2022-01357

      __Corresponding author(s): __Peter Novick and Gang Dong

      1. General Statements [optional]

      We would like to thank both reviewers for their thorough and constructive evaluation and comments on our manuscript. Following their suggestions, we have reworked our manuscript and added several pieces of new data to address questions from them, including (1) evaluation of how M7 mutant of Sso2 affects its interaction with Sec3 using three independent methods (in vitro); (2) investigation of how the M7 mutant affects the interaction of Sso2 with Sec3 by co-immunoprecipitation (in vivo). We hope that, with all these further introduced changes, this manuscript will be suitable for publication in your journal. Detailed point-to-point responses are shown below.

      2. Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): *

      Using the entire cytoplasmic domain of Sso2 and protein crystallization, Peer and colleagues show that two N-terminal peptides (NPY) of Sso2 synergistically interact with the Sec3 PH domain. This interaction provides an additional low affinity binding site to the previously published interface between the Sso2 four-helix bundle and the PH domain. Mutagenesis, in particular of both NPY motifs, results in reduced cell growth, in the accumulation of transport vesicles at the budding site, and in decreased secretion of invertase and Bgl2. The paper is well written, the data are convincing and the characterization of these novel peptide interaction sites clearly advances the field. Although, the exact role of the Sec3 NPY - Sec3 interaction still needs to be established, the overall functional relevance is apparent and thus the paper could be published with minor changes. *

      __Response: __We really appreciate the reviewer for his/her positive comments and clear/constructive feedbacks.

      *Nevertheless, the authors may consider to address the following issues to improve the manuscript. - To strictly exclude the possibility that the Sso2 NPY motif also interacts with other components of the exocytosis machinery (e.g. Sec1), thereby causing the observed phenotypes, Sec3 mutagenesis of the NPY motif-binding site would be required. *

      __Response: __It would be a good idea to generate reverse mutants on Sec3. However, the pocket on Sec3 bound by the NPY motifs of Sso2 is mostly hydrophobic and contains many semi-buried residues that are in close contact with other residues in the hydrophobic core of structure (including L78, Y82, I109, V112, V208, etc.; see Fig. S3D, E) and thus essential in maintaining the folding of Sec3. Making mutations on these residues would destabilize the folding of Sec3. This was why we have not done this as suggested by the reviewer.

      *- The authors suggest that the NPY-peptide binding contributes to the initial interaction/recruitment of Sso2 to the exocytosis site, defined by the localization of Sec3 (exocyst). Further data sustaining this concept/hypothesis could improve the impact of the manuscript. Thus, an experiment analyzing the co-distribution of the Sec3 with Sso2 would directly support the authors' conclusion. (In Figure 7, the authors already show the highly polarized distribution of Sec3-3xGFP.) The M7 mutant could impact the distribution of Sso2. In addition, it would be helpful to determine to which degree the Sso2 NPY - Sec3 PH domain interaction increases the overall affinity of Sso2 for the Sec3 PH domain; e.g. comparison of the binding of Sso2 (1-270) wt and M7 to Sec3 PH domain using ITC. *

      Responses:

      • We greatly value the reviewer’s suggestion. For the suggestion to investigate how the M7 mutant affects the co-distribution of Sso2 with Sec3 in yeast, we have tried a variety of conditions with both the original serum and affinity purified Sso antibodies. In neither case did we see a clear concentration at sites where we would expect to see Sec3, such as the tips of small buds. We were able to see some detectable concentration of HA-tagged Sso2 in small buds using anti-HA Ab, but it would be difficult to tag the M7 mutant at the same site since it is so close to the M7 mutation. We are also worried that the tag might interfere with Sec3 binding due to the proximity. Given the lack of detectable concentration of WT Sso2, it would not be possible to see a loss of localization in M7.
      • For the suggestion to check the binding of Sec3 with either the WT or M7 mutant of Sso2 (aa1-270), we have generated M7 mutant within the same fragment of Sso2 as the WT (i.e. aa1-270) and carefully checked how this M7 mutant affects the interaction of Sso2 with the Sec3 PH domain using three independent methods. Our ITC data show that WT Sso2 bound Sec3 very robustly, with a Kd of approximately 2 µM (Fig. 8C). Surprisingly, however, the M7 mutant of Sso2 (aa1-270) completely abolished its interaction with Sec3 (Fig. 8D). To further verify this observation, we carried out electrophoresis mobility shift assays (EMSA) and size-exclusion chromatography (SEC). Our EMSA data on a native PAGE gel shows that WT Sso2 (aa1-270) bound Sec3, whereas the M7 mutant did not (Fig. S5A, B). Similarly, our SEC data demonstrate that Sec3 was co-eluted with WT Sso2 in the higher molecular weight peak; in contrast, Sec3 and the M7 mutant of Sso2 (aa1-270) were eluted in separate peaks and no stable complex of the two was formed (Fig. S5C, D). All these new data confirm that the NPY motifs play an essential role in maintaining the stable interaction between Sso2 and Sec3, which would explain why the M7 mutant gave such dramatic phenotype in vivo (Fig. 4B-E; Fig. 5D-F; Fig. 6D, E). *Minor point: In the discussion, the authors should mention to which degree the NPY binding site within Sec3 is accessible for / occupied by other known exocyst components, or PI(4,5)P2, etc. *

      Response: __Thank you for the suggestion. A new diagram has been added to __Fig. 9E to compare the structures of the previously reported Sec3/Rho1 complex and the Sso2/Sec3 complex determined by us. It shows that the NPY binding site on Sec3 is on the opposite side of the membrane-binding surface patch. The NPY binding site is also far away from the Rho1 interacting site on Sec3 and thus does not interfere with Rho1 binding either.

      *Reviewer #1 (Significance (Required)):

      The manuscript significantly contributes to our understanding of how the vesicle tethering machinery interacts and coordinates the assembly of the membrane fusion machinery and will be of broad interest in the field of membrane trafficking. I am not an expert in X-ray crystallography. *

      __Response: __We sincerely appreciate this reviewer’s positive feedbacks.

      ***Referees cross-commenting**

      I agree with the comments of the other reviewer. It would be nice to show the effect of the M7 mutant in a reconstituted liposome fusion assay, but as already mentioned this may require an additional collaboration. Whether the relatively weak Sec3 - NPY interaction can be resolved in the liposome fusion assay needs to be shown.*

      __Response: __Please check our detailed answer to the other reviewer’s question about this.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): * The manuscript of Peer et al. Describe the structural characterization of the interaction of the syntaxin-like Sso2 protein with the exocyst subunit Sec3. The authors identify here a dual NPY motif at the N-terminal part of Sso2 that binds to Sec3 and thus confers functionality. Using x-ray crystallography, they show a nearly full-length Sso2 in complex with Sec3, which reveals how Sso2 binds to Sec3. Subsequent mutagenesis shows that both NPY motifs act together in binding, and are both required for functionality in vivo, using established assays in localization of exocyst subunits, secretion assays and growth tests. Their data suggest an overall model how Sso2 is efficiently recruited by exocyst to promote vesicle secretion.

      This is__ an overall very complete and clear manuscript__, where the authors nicely demonstrate, how the two NPY motifs both contribute to efficient Sso2 interaction with Sec3. Their data further show that each motif alone can contribute to function, whereas loss of both motifs (the M7 mutant) result in deficient binding. Likewise, their established assays to determine cellular importance of the NPY motifs in Sso2 show that trafficking and localization in the secretory pathway is strongly impaired in the mutant. I only have a few questions and suggestions. *

      __Response: __Thank you for the positive feedback.

      *1. The authors present in Figure 4 the mutants. I recommend to show the alignment of the mutants (M5,M6,M7) similar to panel A in Figure S4 here to orient the reader. They could also be listed in Figure 3, since the authors have here the sequences. *

      Response: __Alignment of M5-M7 has been added in __Fig. 4A as suggested. Thank you.

      2. The authors previously showed that Sso2 mutants affect the Sec3 driven assembly and also the fusion. I am wondering if they have the tools ready to also conduct this assay with their M7 mutant, which has the strongest defect. I am aware that this may be challenging if the tools are not established here as in the previous collaboration (Yue et al., 2017). It may provide additional information on the functional crosstalk.

      Responses:

      • Thank you for the suggestion. However, we do not think it is necessary to perform such assay based on our new results. As shown in 8C&D and Fig. S5, we found that the M7 mutant of Sso2 (aa1-270) completely abolished its interaction with Sec3, which is in contrast to the robust interaction between the WT Sso2 (aa1-270) and Sec3. Therefore, we expect that the M7 mutant would fail to accelerate liposome fusion in the same way as we had previously seen for the WT Sso2.
      • On the other hand, we have to admit that to perform such assay would indeed be challenging for us as the PhD student who had carried out the in vitro liposome fusion assay has left our previous collaborator’s lab and it would take quite a while to re-establish the assay in our own group and to optimize various parameters in that assay. *3. Along the same line, it would be good if the authors show that the mutation also impairs the interaction of Sec3 and Sso2 in vivo. *

      Response: __We appreciate the reviewer’s suggestion and have carried out co-immunoprecipitation of Sec3-3×Flag and Sso2 from yeast extract to find out how the M7 mutant affects Sso2’s interaction with Sec3 (__Fig. S6). Our results show that in contrast to the clear signal of WT Sso2 pulled down by Sec3-3×Flag, the pull-down band for the M7 mutant was much weaker and at a similar level to the negative control. This is consistent with what we saw in our in vitro binding assays (Fig. 8D; Fig. S5).

      *4. I really like the similarity of the different Munc18-Syntaxin interactions and the Sec3-Sso2 interaction. Do the authors think that Sec3 is an ancestral fragment of a Sec1 like protein, which just maintained this interaction? *

      __Response: __This is a very interesting idea. However, it seems too speculative to us to draw such conclusion. It could also be due to co-evolution in function for Sec3 to use a simpler structure (i.e. PH domain) to mimic syntaxin binding of SM proteins and to employ the extra “add-on” NPY motifs as a handle to facilitate and regulate their interaction.

      1. *Small mistake in the discussionResponses: "plasmas membrane" *

      __Response: __This has been corrected. Thank you.

      *Reviewer #2 (Significance (Required)): Important advance in our understanding of Exocyst function, which deserves publication. I only had minor issues that can be addressed quickly. *

      __Response: __We sincerely appreciate the reviewer’s positive feedbacks and constructive suggestions.

    1. Author Response

      Reviewer 3

      This is work by an internationally recognized group with strong expertise in sophisticated single-molecule microscopy assays in cells. They present here a single-molecule fluorescence-based assay for proximity in the nanometer range.

      It has long been reported that cyanine dyes such as Cy3, Cy5 and derivatives such as AF555, AF647 can undergo a photoswitching mechanism by which the shorter wavelength dye when being excited can switch the longer wavelength dye which is in a dark state back into the bright state. And it has furthermore been reported that this switching mechanism is not based on FRET, as the distance requirement is more stringent (up to ~ 2 nm). However, this mechanism has not been fully explored for the investigation of molecular interactions yet.

      The authors in the present work present a similar mechanism for a different class of rhodamine-based fluorophores, specifically JF549 and JFX650. They describe the discovery of this mechanism in dual-color labeling of a pentameric protein and initial characterization to distinguish it from UV-light-mediated recovery from a pumped dark state as reported for (d)STORM-like measurements. They extend their observation to TMR, JF529 as lower wavelength "senders" and JF646 and JFX646 as longer wavelength "receivers" that can become reactivated into the ground state upon illumination of a nearby "sender". The authors then test activation pulse length and distance dependence and find that longer pulses lead to more recovery and that PAPA of JF549/JFX650 has unlike previously observed for the Cy3/Cy5 pair a smaller distance dependence than FRET of the same fluorophore pair. The authors then move on to use both the UV-light mediated direct reactivation "DR" and proximityassisted photoactivation "PAPA" to activate different molecules that are either double-labeled for PAPA or singly labeled with JFX650 for DR. They succeeded in four different cases to identify clear population shifts to distinguish molecules of different mobility.

      Overall, I think the authors made an interesting discovery and characterizing this previously poorly characterised interaction for cellular single-molecule experiments is certainly an important effort. The authors make an honest and quite complete effort to work out the practical details of this interaction and designed experiments that convincingly highlight the basic capabilities this technique offers to the detection of verified interactions and the mobility of interacting molecules in cells.

      The weakness is that these capabilities do not seem to be as clear-cut as the reviewer hoped for when starting to read this manuscript. It remains unclear to this reviewer, to what extant PAPA molecules can be separated from DR molecules. In all but the last diffusion experiment(s) in Figure 4, PAPA molecules seem to be significantly perturbed by DR molecules, casting doubt on the usefulness in real experiments. Similarly, in Figure 5, a difference is seen but does not allow for quantification. This certainly is not the case for other methods of sensing as well, but maybe the authors could more specifically compare their efforts and the dynamic range to other sensors for example in Figure 5? This would make it easier for the reader to make up their mind if the assay is worthwhile adopting for their system.

      We agree that a problem with PAPA at present is that although PAPA trajectories are significantly enriched for double-labeled complexes, they are still “contaminated” with singlelabeled molecules. As we described in the Discussion (and as pointed out by Reviewer 1), we think that one major contribution to this background arises from chance proximity of sender and receiver molecules independent of direct physical interaction. Additionally, some background is expected from continual spontaneous (a.k.a. “thermal”) reactivation of molecules from the dark state.

      In response to the reviewers’ comments, we have tried to quantify more precisely how much PAPA enriches for one population over another by fitting the diffusion spectra of 2-component mixtures to linear combinations of the corresponding individual components (Figure 4–figure supplement 4). We estimate that the fold enrichment of double-labeled molecules ranged from 3.7 to 37-fold between different 2-component mixtures.

      We fully agree that it is critical that researchers who use PAPA be aware of its limitations, so that they do not fallaciously assume that all green-reactivated localizations are protein complexes. To avoid committing a bait-and-switch against our readers, we now state explicitly in the Introduction that PAPA in its current form enriches for complexes but does not provide perfect selectivity. In Appendix 2, we now discuss the problem of background reactivation in more detail and outline what we think will be required to correct quantitatively for this background. Though we believe that such corrections will ultimately be possible, at least in some cases, figuring out how to do this rigorously will require substantial additional development of experimental and computational methods, which we hope the editor and reviewers agree is beyond the scope of the current paper.

      At the end of Appendix 2, we briefly mention another technical problem that we have noticed with SNAP ligand background staining. While this background was negligible for the experiments described in this paper, which involved highly expressed SNAPf transgenes, it may pose a more significant problem for SNAPf-tagged proteins with lower expression levels. We think it is worth mentioning this problem to make readers aware of it and hopefully to motivate the development of better orthogonal pairs of self-labeling tags.

      While there are obviously limitations to PAPA, we think this should not overshadow the fact we have identified a novel photophysical property of commonly used fluorophores and harnessed it to detect molecular interactions in live cells. Our initial proof-of-concept study provides a foot in the door of this new biophysical approach, which we and others will continue to refine. Immediate applications of PAPA could include disambiguation of peak assignments in complex diffusion spectra, confirmation of proposed interactions between proteins (and subsequent investigations into the molecular mechanisms supporting such interactions), or integration into SPT-based high-throughput screening (https://www.eikontx.com/technology) to provide a useful additional readout for each experimental condition.

    1. Author Response:

      Reviewer #1 (Public Review):

      The authors show that the unmitigated generation interval of the original variant of SARS-CoV-2 is longer than originally thought. They argue that in the absence of interventions that limit transmission late in the course of infection, the fraction of transmission events that occur before symptom onset would be considerably lower, and the fraction of transmission events occurring 10 days or more after infection of the index case would be substantially higher.

      These findings improve our ability to accurately estimate the basic reproductive number (R0), to evaluate quarantine and isolation policies, and to model counterfactual intervention-free scenarios. Many applied analyses rely on accurate generation interval estimates. To head off confusion, it would be helpful if the authors could provide more comprehensive guidance about which applied analyses should be informed by the unmitigated generation interval, or the observed generation interval. (E.g. the unmitigated interval is useful for quarantine and isolation policies, but would we ever want to use the unmitigated interval to estimate R?).

      The unmitigated generation-interval should be used for estimation of the R0 of the initial epidemic phase, but not for the R(t) of the current epidemics. Estimation of R(t) must account for changes in generation-interval distributions caused by the invasion of new variants and changes in behavior. When analyzing policies of quarantine, isolation or contact tracing, the unmitigated interval should also be adopted to account for late transmissions.

      We added few sentences at the end of our introduction to clarify this point:

      “The estimated unmitigated generation-interval distribution could be adopted for answering questions about quarantine and isolation policy, as well as for estimating the original R0 at the initial spread in China. However, estimation of instantaneous R(t) should account for changes in generation-interval distributions, reflecting mitigation effects and the current variant.”

      The analysis estimates a longer generation interval after accounting for three main sources of bias or error that are common in other analyses: 1. Recently infected individuals are intrinsically overrepresented in data on a growing epidemic. Thus, shorter incubation periods and forward serial intervals are more likely to be observed, even in the absence of interventions. This analysis adjusts for these dynamical biases. 2. Interventions or behavioral changes can prevent transmission late in the course of infection. This can shorten the generation and serial intervals over the course of an epidemic. This analysis focuses specifically on transmission pairs observed very early, before the adoption of interventions. 3. The incubation period and generation interval should be correlated - infectors that progress relatively quickly to symptoms should also become infectious sooner (symptom onset occurs near the peak of viral titers). Most existing analyses assume these intervals are uncorrelated, but this analysis accounts for their correlation.

      Overall, the conclusions seem reasonable and well-supported. The observation that the generation interval decreases over the course of an epidemic is also consistent with existing studies that show the serial interval has similarly decreased over time. But given the implications of the findings, I hope the authors can address a few questions about potential additional sources of bias:

      1. It is somewhat reassuring that the generation interval decreases relatively smoothly as the cutoff date increases (Fig. S6), but it would be helpful if the authors address the potential impact of ascertainment biases. One of the main reasons that the authors estimate a shorter generation interval is that they define January 17th, early in the outbreak before interventions and behavioral changes had taken place, as the cutoff point for the infector's date of symptom onset. This cutoff eliminates biases from interventions, but it also severely limits the size of the transmission-pair dataset (Fig. S3), and focusing on this very early subset of cases may increase the influence of ascertainment bias. Prior to January 17th, should we expect observed transmission pairs to involve more severe cases on average? And is the unmitigated generation interval correlated with case severity?

      We thank the reviewer for identifying a source of possible bias that we overlooked. Following the comment we performed a new sensitivity analysis for the inclusion of the severe cases, summarized in Appendix 1—figure 11.

      Severity of the cases was reported only in Ali et al.’s data, for some of the individuals. In these cases, individuals are divided into one of three conditions: mild, severe (non-fatal) and death. As non-mild cases represent a small fraction of the dataset, we combine them into one category, which we denote as severe.

      Severe cases (including deaths) were overrepresented in the period prior to January 17, with 8 out of 77 cases, compared to 18 out of 745 in the period of January 18-31. The effect of inclusion of severe cases was analyzed by comparing the means of the estimated generation-interval distribution, separately for the two periods in question, using the inference framework with 30 bootstrapping runs. For the earlier period, the estimated means were compared between the dataset with or without the severe cases. For the later period, we also consider the “enriched” dataset, in which severe cases are oversampled for each bootstrap such that the proportion of severe cases matches that during the earlier period (10%). In both cases we see that the effect on the estimated mean generation interval is small.

      1. The analysis assumes the incubation period follows a fixed distribution, whose parameterization comes from a meta-analysis of previously estimated incubation periods. But p.5 discusses the idea that observed incubation periods are affected by the same dynamical biases as forward serial intervals, "For example, when the incidence of infection is increasing exponentially, individuals are more likely to have been infected recently. Therefore, a cohort of infectors that developed symptoms at the same time will have shorter incubation periods than their infectees on average, which will, in turn, affect the shape of the forward serial-interval distribution." Has the incubation period been adjusted for these dynamical biases, or should it be?

      In our analysis we use the incubation period distribution from Xin et al. 2021 which already considers the backward bias caused by the expanding epidemic with the corrected growth rate of 0.1/d. Xin et al. showed in their meta-analysis that the mean incubation period reported by the various sources changed according to the dates used by the source. Incubation periods prior to the peak of the epidemic in China were lower than ones from after the peak, in a manner that coincided with the backward correction they performed (using a similar derivation to that suggested by Park et al. 2021). Accordingly, the distribution of incubation period they report is the intrinsic incubation period, after correction for the growth rate of the initial spread in China. We added two sentences in our methods section to clarify this point:

      “In their meta-analysis, Xin et al. found an increase of the incubation period following the introduction of interventions in China, matching the theoretical framework shown above. Their inferred incubation period distribution includes a correction for the growth rate of the early spread, accordingly.”

      Furthermore, we perform a sensitivity analysis for the shape of the incubation period distribution, and show that it has a minor effect on our conclusions (Appendix 1—figure 10).

      1. It appears that correlation parameter estimates co-vary with estimates of the mean generation interval (Fig. S6; S13b). Are the authors confident that the correlation parameter is identifiable? How much would the median generation interval estimate in the main analysis change if the correlation parameter had been fixed to 0 (which isn't realistic) or to 0.5 (which might be plausible)?

      As the reviewer pointed out, the correlation parameter estimates co-vary with estimates of the mean generation interval. We further analyzed this relation following the comment. The analysis is summarized in supplemental figures S19-20.

      We first examine the relation between the mean generation interval and the correlation parameter based on the uncertainty estimates, consisting of 1000 bootstrap runs. Appendix 1—figure 12 shows a joint bivariate scatter plot of the parameters, together with contours of equal probability. As can be seen there is a connection between the parameters. The estimates centered around the maximum likelihood estimate with correlation parameter of 0.75 and mean generation interval of 9.7 days. The confidence interval for the correlation parameter of 0.45-0.95 corresponds to mean generation intervals in the range of 8-11 days, supporting the conclusion of this study.

      Next, we reanalyzed the dataset while fixing the correlation parameter, as suggested by the reviewer. Appendix 1—figure 13 shows the estimated mean generation interval for fixed correlation parameters with values of 0, 0.25, 0.5, 0.75, 0.9. For each fixed correlation parameter 100 bootstrapping runs. As can be seen, the results reflect the same connection that can be seen in Appendix 1—figure 12, with probable values in the range of 8-11 days, for correlation parameters in the range of 0.5-0.9. Assuming no correlation would cause underestimation of the mean generation interval match previous literature (Hart, Maini, and Thompson 2021; Park et al. 2022).

      Reviewer #2 (Public Review):

      There have been several estimates of the generation time and serial interval published for SARS-CoV-2, but as the authors note, estimates can be subject to biases including shifted event timing depending on the phase of the epidemic, correlation in characteristics between infector and infectee, and impact of control measures on truncating potential infectiousness. This study, therefore, has several strengths. It first collates data on transmission events from the earliest phase of the COVID-19 pandemic, then makes adjustments for these potential biases to estimate the generation time in absence of control measures, and finally discusses implications for transmission.

      Given many subsequent aspects of the COVID-19 pandemic have been defined relative to earlier phases (e.g. relative transmissibility of variants, relative duration of infectiousness), understanding the baseline characteristics of the infection is crucial. I thought this paper makes a useful contribution to this understanding, generating adjusted estimates for infectiousness (which is longer than previous estimates) and corresponding values for the reproduction number (which is remarkably similar to earlier estimates, presumably because of the different sources of bias in the growth rate and generation time distribution somehow end up canceling each other out).

      However, there are some weaknesses at present. The study correctly flags several potential sources of bias in estimates, but in making adjustments uses estimates from the literature that themselves could suffer from these biases, e.g. the distribution of incubation period from a 2021 meta-analysis. Although the authors conduct some sensitivity analysis it would be worth including some more explicit consideration of whether they would expect any underlying bias to propagate through their calculations. The authors also conduct some sensitivity analysis around the underlying data (e.g. ordering of transmission pairs), but again it would be useful to know whether there could be systematic biases in these early data. Specifically, the paper references Tsang et al (2020), which highlighted variability in early case definitions - is it possible that early generation times are estimated to be longer because intermediate cases in the transmission chain were more likely to go undetected than later in the epidemic?

      We recognize the potential biases in the transmission pairs data. We therefore developed an extensive framework of sensitivity analyses for identifying biases that could substantially affect the results. In the results section and figure 5, we show that the main study result, that the unmitigated generation-interval distribution is longer than previously estimated, is robust to reasonable amounts of ascertainment bias. We discuss this point at length and have added several supplemental figures to support this claim.

      As reviewer #3 mentioned, we conducted a sensitivity analysis for the inclusion of the longest serial intervals, to investigate possible effects of missing links in the longest transmission pairs. We also discuss why we think it’s not necessary to explicitly model the short intervals that may be unobserved due to missing links.

      “Second, we considered the possibility that long serial intervals may be caused by omission of intermediate infections in multiple chains of transmission, which in turn would lead to overestimation of the mean serial and generation intervals. Thus, we refit our model after removing long serial intervals from the data (by varying the maximum serial interval between 14 and 24 days). We also considered “splitting” these intervals into smaller intervals, but decided this was unnecessarily complex, since several choices would need to be made, and the effects would likely be small compared to the effect of the choice of maximum, since the distribution of the resulting split intervals would not differ sharply from that of the remaining observed intervals in most cases.”

      We added to the discussion text regarding the effect of possible bias in the dataset, explicitly specifying the ascertainment bias.

      “Our analysis relies on datasets of transmission pairs gathered from previously published studies and thus has several limitations that are difficult to correct for. Transmission pairs data can be prone to incorrect identification of transmission pairs, including the direction of transmission. In particular, presymptomatic transmission can cause infectors to report symptoms after their infectees, making it difficult to identify who infected whom. Data from the early outbreak might also be sensitive to ascertainment and reporting biases which could lead to missing links in transmission pairs, causing serial intervals to appear longer (For example, people who transmit asymptomatically might not be identified). Moreover, when multiple potential infectors are present, an individual who developed symptoms close to when the infectee became infected is more likely to be identified as the infector. These biases might increase the estimated correlation of the incubation period and the period of infectiousness. We have tried to account for these biases by using a bootstrapping approach, in which some data points are omitted in each bootstrap sample. The relatively narrow ranges of uncertainty suggest that the results are not very sensitive to specific transmission pairs data points being included in the analysis. We also performed a sensitivity analysis to address several potential biases such as the duration of the unmitigated transmission period, the inclusion of long serial intervals in the dataset, and the incorrect ordering of transmission pairs (see Methods). The sensitivity analysis shows that although these biases could decrease the inferred mean generation interval, our main conclusions about the long unmitigated generation intervals (high median length and substantial residual transmission after 14 days) remained robust (Figure 5).”

      It would also be helpful to have some clarifications about methodology, particularly in how the main results about generation time are subsequently analyzed. For example, estimates such as the conversion of generation time to R0 and VOC scalings are described very briefly, so it is currently unclear exactly how these calculations are being performed.

      Following the reviewer comments we made edits to the Methods section in order to make it more readable and clearer. We added subheadings for the various sections. Moreover, we added a section explaining the derivation of the basic reproduction number and clarified the section regarding the VOCs extrapolations.

      We made some edits to the methods section in order to make it more accessible and clear, for example, we added subheadings for the various sections, added a section explaining the derivation of the basic reproduction number, and clarified the section regarding the VOCs extrapolations.

      Reviewer #3 (Public Review):

      Sender & Bar-On et al. perform robust analyses of early SARS-CoV-2 line list data from China to estimate the intrinsic generation interval in the absence of interventions. This is an important topic, as most SARS-CoV-2 data are from periods when transmission-reducing interventions are in place, which will lead to underestimation of the potential infectious period.

      The authors highlight two shortcomings in previous approaches. First, the distribution of 'observed' serial intervals (the time between symptom onset in the infector and symptom onset in the infectee) depends not only on the timeline of each infector's infection, but also the epidemic growth rate, which weights the proportion of observed short vs. long serial intervals. The authors argue that by accounting for this weighting, more accurate estimates of the intrinsic generation interval - the metric on which isolation policies are based - can be obtained. Second, the authors find that the original SARS-CoV-2 generation interval distribution has both a higher mean and longer tail than previous estimates when using only data prior to the introduction of interventions. Finally, the authors use publicly available data on viral load trajectories to extrapolate their estimates to other SARS-CoV-2 variants, finding that alpha, delta, and omicron may have shorter generation intervals than original SARS-CoV-2. These findings are important, as case isolation policies are based on assumptions for how long individuals remain infectious. More broadly, these methods will be important for future work to correctly estimate generation intervals in other outbreaks.

      The conclusions are well supported by the data, and a suite of sensitivity analyses give confidence that the findings are robust to deviations from many of the key assumptions. The code is well documented and publicly available, and thus the findings are easily reproducible. Key strengths of the paper include the clarity and rigor of the modeling methods, and the exhaustive consideration of potential biases and corresponding sensitivity analyses - it is very difficult to think of potential biases that the authors have not already considered! I think this is a well-written and well-executed study. The work is likely to be impactful for reconsidering SARS-CoV-2 isolation policies and revisiting generation interval estimates from other data sources. I also expect this to be a key reference and method for future studies estimating the generation interval.

      I have some minor comments on potential weaknesses and interpretation:

      1. Uncertainty in early generation interval estimates. One of the conclusions is that the estimated mean generation interval is longer than the observed mean serial interval. However, this conclusion does not seem justified given that the observed mean serial interval (9.1 days) is well within the 95% CI of 8.3-11.2 days for the mean generation interval. The confidence intervals for the serial interval in figure 2 are also wide for pre-Jan 17th (though presumably these would be reduced if all pre-Jan 17th serial intervals were combined). Further, only 77 of the ~1000 transmission pairs are actually from pre-January 17th. The actual sample size used for these estimates is much smaller than suggested by Figure S1 and thus this should be made clear. Therefore, although the intuition for why observed serial intervals may differ from the generation interval is correct, I do not think that the data alone demonstrate this. A related issue is on ascertainment bias - could the early serial interval data be biased longer because ascertainment is initially poor and thus more intermediate infectors are missed? The authors consider removing particularly long serial intervals to try and account for this, but that does not deal with e.g. chains of multiple short serial intervals being incorrectly recorded as a single long serial interval (but still within 16 days).

      We agree with the reviewer that due the large uncertainty we cannot deduce that the mean generation interval is longer than the mean serial interval. We changed the phrasing to emphasize this statement is supported by mathematical theory.

      “We note that our estimated mean generation-interval is longer than the observed mean serial-interval (9.1 days) of the period in question. This is supported by the theory (Park et al. 2021) of the dynamical effects of the epidemic -- in contrast to the common assumption that the mean generation and serial intervals are identical. During the exponential growth phase, the mean incubation period of the infectors is expected to be shorter than the mean incubation period of the infectee - this effect causes the mean forward serial interval to become longer than the mean forward generation interval of the cohorts that developed symptoms during the study period. However, these cohorts of infectors with short incubation periods will also have short forward generation (and therefore serial) intervals due to their correlations. When the latter effect is stronger, the mean forward serial interval becomes shorter than the mean intrinsic generation interval, as these findings suggest.“

      Following the comment, we added to Figure S1 the filtering according to date, to reflect the true sample size we use for the main analysis (We renamed it: Appendix 1—figure 1).

      We recognize the potential biases in the transmission pairs data. We therefore developed an extensive framework of sensitivity analyses for identifying biases that could substantially affect the results. In the results section and figure 5, we show that the main study result, that the unmitigated generation-interval distribution is longer than previously estimated, is robust to reasonable amounts of ascertainment bias. We discuss this point at length and have added several supplemental figures to support this claim.

      As reviewer #3 mentioned, we conducted a sensitivity analysis for the inclusion of the longest serial intervals, to investigate possible effects of missing links in the longest transmission pairs. We also discuss why we think it’s not necessary to explicitly model the short intervals that may be unobserved due to missing links.

      “Second, we considered the possibility that long serial intervals may be caused by omission of intermediate infections in multiple chains of transmission, which in turn would lead to overestimation of the mean serial and generation intervals. Thus, we refit our model after removing long serial intervals from the data (by varying the maximum serial interval between 14 and 24 days). We also considered “splitting” these intervals into smaller intervals, but decided this was unnecessarily complex, since several choices would need to be made, and the effects would likely be small compared to the effect of the choice of maximum, since the distribution of the resulting split intervals would not differ sharply from that of the remaining observed intervals in most cases.”

      We added to the discussion text regarding the effect of possible bias in the dataset, explicitly specifying the ascertainment bias.

      “Our analysis relies on datasets of transmission pairs gathered from previously published studies and thus has several limitations that are difficult to correct for. Transmission pairs data can be prone to incorrect identification of transmission pairs, including the direction of transmission. In particular, presymptomatic transmission can cause infectors to report symptoms after their infectees, making it difficult to identify who infected whom. Data from the early outbreak might also be sensitive to ascertainment and reporting biases which could lead to missing links in transmission pairs, causing serial intervals to appear longer (For example, people who transmit asymptomatically might not be identified). Moreover, when multiple potential infectors are present, an individual who developed symptoms close to when the infectee became infected is more likely to be identified as the infector. These biases might increase the estimated correlation of the incubation period and the period of infectiousness. We have tried to account for these biases by using a bootstrapping approach, in which some data points are omitted in each bootstrap sample. The relatively narrow ranges of uncertainty suggest that the results are not very sensitive to specific transmission pairs data points being included in the analysis. We also performed a sensitivity analysis to address several potential biases such as the duration of the unmitigated transmission period, the inclusion of long serial intervals in the dataset, and the incorrect ordering of transmission pairs (see Methods). The sensitivity analysis shows that although these biases could decrease the inferred mean generation interval, our main conclusions about the long unmitigated generation intervals (high median length and substantial residual transmission after 14 days) remained robust (Figure 5).”

      1. Frailty of using viral loads to extrapolate generation intervals. The authors take the observation that variants of concern demonstrate faster viral clearance on average to estimate shorter generation intervals for alpha, delta, and omicron. The authors rightly point out in the discussion that using viral load as a proxy for infectiousness has many limitations. I would emphasize even further that it is very difficult to extrapolate from viral load data in this way, as infectiousness appears to vary far more between variants than can be explained by duration positive or peak viral load. Other factors are potentially at play, such as compartmentalization in the respiratory tract, aerosolization, receptor binding, immunity, etc. Further, there is considerable individual-level variation in viral trajectories and thus the use of a population-mean model overlooks a key component of SARS-CoV-2 infection dynamics. An important reference, which came out recently and thus makes sense to have been missed from the initial submission, is Puhach et al. Nature Medicine 2022 https://doi.org/10.1038/s41591-022-01816-0.

      We agree with the reviewer about the frailty of using viral loads to extrapolate generation intervals. We therefore expanded our discussion of the limitation of using viral load data for inferring infectiousness including many of the points mentioned by the reviewer. We use viral load data in the most minimal way to try to enable some discussion of new VOC, and try to emphasize the needed caution.

      Viral load trajectories data have potential for informing estimates of the infectiousness profile. However the relationship between viral load, culture positivity, symptom onset, and infectivity is complex and not well characterized. Due to this limitation we tried to use viral loads in a more limited way, extrapolating our results to variants of concerns (which lack unmitigated transmission data). Following the comment, we added a detailed discussion of the limitations of using viral loads as a proxy for infectiousness, including the variation of viral loads across individuals. We also added supplementary figures (Figure 6—figure supplements 1-2) to show the possible effect of an individual's viral loads in relation to the infectiousness and for comparison with new viral load and culture results (Chu et al. 2022; Killingley et al. 2022). As the viral load trajectories data for the different VOC is given only as a function of time from the onset of symptoms, it is not possible to directly link it to the fraction of transmission post 14 days from infection. We made changes to Figure 6 to clarify the possible connection of viral load with the TOST (time from symptoms onset to transmission) distribution and the resulting extrapolation to the unmitigated generation-interval distributions.

      “SARS-CoV-2 viral load trajectories serve an important role in understanding the dynamics of the disease and modeling its infectiousness (Quilty et al. 2021; Cleary et al. 2021). Indeed, the general shapes of the mean viral load trajectories and culture positivity, based on longitudinal studies, are comparable with our estimated unmitigated infectiousness profile (Figure 6—figure supplements 1-2, comparison with (Chu et al. 2022; Killingley et al. 2022; Kissler et al. 2021)). However, the nature of the relationship between viral load, culture positivity, symptom onset, and real-world infectivity is complex and not well characterized. Therefore, the ability to infer infectiousness from viral load data is very limited, especially near the tail of infectiousness, several days following symptom onset and peak viral loads. Viral load models are usually made to fit the measurements during an initial exponential clearance phase and in many cases miss a later slow decay (Kissler et al. 2021). Furthermore, there is considerable individual-level variation in viral trajectories that isn’t accounted for in population-mean models (Kissler et al. 2021; Singanayagam et al. 2021). Other factors limiting the ability to compare generation-interval estimates with viral loads models are the variability of the incubation periods and its relation to the timing of the peak of the viral loads, and the great uncertainty and apparent non-linearity of the relation between viral loads and culture positivity (Jaafar et al. 2021; Jones et al. 2021). Due to these caveats and in order to avoid over interpretation of viral load data, we restrict our extrapolation of new VOCs’ infectiousness to a single parameter characterizing the viral duration of clearance.”

      We also edited another paragraph in the discussion:

      “Our extrapolations are necessarily crude given the complex relationship between viral load, symptomaticity, and infectiousness discussed above. Moreover, compartmentalization in the respiratory tract, aerosolization, receptor binding affinity, and immune history can also play important roles in determining the infectiousness profiles of SARS-CoV-2 variants (Puhach et al. ). ”

      1. Lack of validation with other datasets This study hinges on data from a single setting in a short window of time. Although the data are from multiple publications, the fact that so many reported the same transmission pair data demonstrates that these are overlapping datasets. As the authors note, there are potential biases e.g., ascertainment rates and behavioral changes which will impact the generation interval estimates. Thus, generalizability to other settings is limited.

      We agree with the reviewer that the dataset used in our study is limited, and consists of overlapping transmission pairs. We perform some analysis of the possible bias caused by inclusion of each dataset, as can be seen in Appendix 1—figure 4.

      The best validation would have been a comparison with another independent dataset from the early spread of the epidemic, but no such dataset exists. We added a sentence to the discussion to emphasize this point.

      “Due to the nature of early spread of a new unknown disease it is nearly impossible to find two completely unrelated datasets from the period prior to mitigation, limiting the ability of further validation of the current results.”

      1. The impact of epidemic dynamics on infector vs. infectee serial intervals. It took me a long time to get my head around the assertion that the forward serial interval distribution will be longer during epidemic growth due to the overrepresentation of short incubation periods among infectors relative to infectees. A supplementary figure, similar to the way Figure 1 is laid out, to illustrate this concept may go a long way to aid the reader's understanding.

      We added an explanation to the paragraph in order to make it clearer:

      “A cohort of individuals that develop symptoms on a given day is a sample of all individuals who have been previously infected. When the incidence of infection is increasing, recently infected individuals represent a bigger fraction of this population and thus are over-represented in this cohort. Therefore, we are more likely to encounter infected individuals with a short incubation period in this cohort compared to an unbiased sample. The forward serial-interval is calculated for a cohort of infectors who developed symptoms at the same time and therefore is sensitive to this bias. These dynamical biases are demonstrated using epidemic simulations by Park et al."

      1. Simulations to illustrate concepts and power Given the assertion that observed serial intervals will depend on epidemic growth rates, reporting, and timing of interventions, I think a simple simulation to illustrate some of these ideas would be very helpful. For example, a simple agent-based model with simulated infectivity profiles and incubation periods using the estimated bivariate distribution would be extremely helpful in illustrating how serial intervals and estimates of the generation interval can differ from the true intrinsic generation interval (I coded such a simulation to help me understand this paper in a couple of hours with <100 lines of R code, so I do not think this would be much work). This would also be very helpful for illustrating statistical power re. comment 1.

      The current paper is based on a strong theoretical foundation provided by previous works, specifically Park et al. 2021, which used simulations similar to the reviewer’s suggestions to demonstrate the dynamical biases. We now mention these simulations somewhere in the introduction section:

      “These dynamical biases are demonstrated using epidemic simulations by Park et al."

    1. Any available documents regarding student-led activism on cam

      I don't know if this would count as one of the sources we should be using, but perhaps you could also look into what schools offer on-campus gender affirming health care. Overall, I think the project pitch is well thought out and organized, with a good plan of action put in place. The research question is specific enough to produce intriguing result (that are not general) as well as may make it easier to know what to search for when it comes to sources.

    1. Author Response

      Reviewer #1 (Public Review):

      Bohère, Eldridge-Thomas and Kolahgar have studied the effect of mechanical signalling in tissue homeostasis in vivo, genetically manipulating the well known mechano-transductor vinculin in the adult Drosophila intestine. They find that loss of vinculin leads to accelerated, impaired differentiation of the enteroblast, the committed precursor of mature enterocytes, and stimulates the proliferation of the intestinal stem cell. This leads to an enlarged intestinal epithelium. They discriminate that this effect is mediated through its interaction with alpha-catenin and the reinforcement of the adherens junctions, rather than with talin and integrin-mediated interaction with the basal membrane. This results aligns well, as the authors note, with previous observations from Choi, Lucchetta and Ohlstein (2011) doi:10.1073/pnas.1109348108. Bohère et al then explore the impact that disrupting mechano-transduction has on the overall fitness of the adult fly, and find that vinculin mutant adult flies recover faster after starvation than wild types.

      The main conclusions of the paper are convincing and informative. Some important results would benefit from a more detailed description of the phenotypes, and others could have alternative explanations that would warrant some additional clarification.

      1) - Interpretation of phenotypes in vinc[102.1] mutants

      The paper presents several adult phenotypes of the homozygous viable, zygotic null mutant vinculin[102.1], where the fly gut is enlarged (at least in the R4/5 region). In many cases, they correlate this phenotype with that of RNAi knockdown of vinculin in the gut induced in adult stages. This is a perfectly valid approach, but it presents the difficulty of interpretation that the zygotic mutant has lacked vinculin throughout development and in every fly tissue, including the visceral mesoderm that wraps the intestinal epithelium and that also seems enlarged in the vinc[102.1] mutant. So this phenotype, and others reported, could arise from tissue interactions. To me, the quickest way to eliminate this problem would be to express vinculin in ISCs and/or EBs the vinc[102.1] background, either throughout development or after pupariation or emergence, and observe a rescue.

      We agree with the reviewer that we cannot exclude additional vinculin role(s) in other tissues during or after development that might have an impact on the intestinal epithelium. Our attempts to express a full-length Vinculin construct (Maartens et al, 2016) in the vinc102.1 flies, either in adulthood or throughout development, were not very conclusive: although we observed some degree of rescue, it was not fully penetrant. This was in contrast to the complete rescue observed with the genomic rescue of vinculin. Thus, it is possible that some form of tissue interaction contributes to the phenotype observed, for example if vinculin loss affects muscle structure. Alternatively, just like it was shown that too much active vinculin is detrimental to the fly (Maartens et al, 2016), our experiment suggests that too much vinculin may be deleterious to the intestine.

      In any case, because of cell-specific knockdowns in the adult gut, we are confident that EB reduction of vinculin levels or activity is sufficient to accelerate tissue turnover, at least in a specific portion of the posterior midgut. We have amended the text to acknowledge a role for tissue interactions (see page 6 (end of first paragraph), page 7 (start of last paragraph), page 12 (starvation experiments).

      An experiment where this is particularly difficult is with the starvation/refeeding experiment. The authors explored whether the disruption of tissue homeostasis, as a result of vinculin loss, matters to the fly. So they tested whether flies would be sensitive to starvation/re-feeding, where cellular density changes and vinculin mechano-sensing properties may be necessary. They correctly conclude that mutant flies are more resistant to starvation, and suggest that this may be due to the fact that intestines are larger and therefore more resilient. However, in these animals vinculin is absent in all tissues. It is equally likely that the resistance to starvation was due to the effect of Vinculin in the fat body, ovary, brain, or other adult tissues singly or in combination. The fact that the intestine recovers transiently to a size slightly larger than that of the fed flies seems anecdotal, considering the noise within the timeline of fed controls. I am not sure this experiment is needed in the paper at all, but to me, the healthy conclusion from this effort is that more work is needed to determine the impact of vinculin-mediated intestinal homeostasis in stress resistance, and that this is out of the scope of this paper.

      Please the new data presented in Figure 8A-B (text page 12).

      2) - Cell autonomy of the requirement of Vinculin and alpha-Catenin

      Authors interpret that Vinculin is needed in the EB to maintain mechanical contact with the ISC, restrict ISC proliferation through contact inhibition, and maintain the EB quiescent. This interpretation explains seemingly well the lack of an obvious phenotype when knocking down vinculin in ISCs only, while knockdown in ISCs and EBs, or EBs only, does lead to differentiation problems. It also sits well with the additional observation that vinculin knockdown in mature ECs does not have an obvious phenotype. However, a close examination makes the results difficult to explain with this interpretation only. If the authors were correct, one would expect that in mutant clones, eventually, vinculin-deficient EBs will be produced, which should mis-differentiate and induce additional ISC proliferation. However, the clones only show a reduction in ISC proportions; the most straight forward interpretation of this is that vinculin is cell-autonomously necessary for ISC maintenance (which is at odds with the phenotype of vinculin knockdown using the ISC and ISC/EB drivers).

      We apologise that we were unclear in the text. With hindsight, the confusion may have been caused by our describing the phenotype of MARCM clones before reporting the accumulation of EBs in the vinc102.1 guts. Therefore, we swapped these two sections and improved the description of these experiments in the manuscript (see section: “The pool of enterocyte progenitors expands upon vinculin depletion” pages 6-8).

      In brief, we do not think that our results are at odds with the phenotype of vinculin knockdown using the ISC and ISC/EB drivers - we realise the text was misleading and hope to have clarified our observations in the revised manuscript (pages 7 and 8). From cell conditional RNAi experiments, like the reviewer, we would predict that vinculin knockdown or loss of function in mitotic clones (MARCM experiments, Figure 4E-G) will induce accelerated differentiation of vinculin deficient enteroblasts, which in turn will increase proliferation. We observed that vinc102.1 or vinc RNAi mitotic clones contained similar number of cells compared to control clones, but reduced proportion of stem cells (Figure 4G). We interpret this as indicating that to maintain an equivalent clone size, stem cells must have divided more frequently, with some divisions producing two differentiated daughter cells. This type of symmetric division would increase the EB pool (as seen in Figure 4-figure supplement 2B), at the expense of the ISC population, in turn decreasing long term clonal growth potential. Altogether, the results obtained with MARCM clones highlight changes in tissue dynamics compatible with those observed with cell-specific vinculin knockdowns.

      Also, from the authors interpretation, it would follow induce that the phenotype of vinculin knockdown in ISCs+EBs and in EBs only should be the same. However, in ISCs+EBs vinculin knockdown, differentiation accelerates, which is likely accompanied by increased proliferation (judging by the increase in GFP area, PH3 staining would be more definitive).

      Indeed, the accelerated differentiation observed with esgGal4>UAS VincRNAi is accompanied by increased proliferation with the two independent RNAi lines used. We have added this result in Figure 1-figure supplement 1G (and in text, page 5).

      This contrasts with the knockdown only in EBs, which leads to accumulation of EBs due to misdifferentiation, and increased proliferation, mostly of ISCs, as measured directly with PH3 staining, but not additional late EBs or mature ECs. The authors call this "incomplete maturation due to accelerated differentiation". I think that one should expect to find incomplete differentiation/maturation when the rate of the process is very slow, not the other way around. To me, these are different phenotypes, which could perhaps be explained if vinculin was also needed in the ISC to transmit tension to the EB and prevent its differentiation, and removing it only in the EB may be revealing an additional, cell-autonomous requirement in maturation.

      When vinculin is knocked down in EBs, cells appear bigger than controls (as judged by the RFP+ nuclei in Figure 5E). This, compared to yw and vinc102.1 guts shown in Figure 4D suggests that these cells are more advanced in their differentiation. We have removed the sentence, to not confuse the reader, and clarified the text (see page 8). The discrepancy in the differentiation index between the esgGal4 and KluGal4 experiments might result from differences in the drivers, or an additional role of vinculin in EC differentiation, which we now mention in the text (page 8).

      So far, we have no evidence to support the idea that vinculin is also needed in the ISC to transmit tension to the EB and prevent its differentiation; for example, the lack of any phenotype when we knocked down vinculin specifically in ISCs (Figure 3) – notably, no increase in ISC ratio and no increase in cell density (unlike the reduction seen in Figure 1F with ISC+EB Knockdown).

      Another unexpected result, considering the authors interpretation, is that the over expression of activated Vinculin (vinc[CO]) does not seem to have much of an effect. It does not change the phenotype of the wild type (where there is very little basal turnover to begin with) and it only partially rescues the phenotype of the vinc[102.1] mutants, when the rescue transgene vinc:RFP does. This again suggests that there may be tissue interactions, in development or adulthood, that may explain the vinc[102.1] phenotypes. It could also be that this incomplete rescue is due to the deleterious effect of Vinc[CO]; this is another reason for doing the vinc[102.1]; esg-Gal4; UAS-vincFL experiments suggested above). An alternative experiment to perform this rescue would be to knock down vinculin gene while overexpressing the Vinc[CO] transgene - this may be possible with the RNAi HSM02356, which targets the vinculin 3'UTR and is unlikely to affect UAS-vinc[CO].

      Please refer to essential point 2c; as VincCO is not a simple overactive protein, like a constitutively active kinase, additional effects in the tissue can be expected.

      The claims of the authors would be more solid if the reporting of the phenotypes was more homogeneous, so one could establish comparisons. Sometimes conditions are analysed by differentiation index, others by extension of the GFP domains, others with phospho-histone-3 (PH3), others through nuclear size or density, and combinations. I do not think the authors should evaluate all these phenotypes in all conditions, but evaluating mitotic index and abundance of EBs and "activated EBs/early ECs" to monitor proliferation and differentiation rates should be done across the board (ISC, ISC+EB, EB drivers).

      To improve consistency, in all conditions we have compared cell types ratios and cellular density upon vinculin knockdown: see Figure 1E-F for ISC+EB, Figure 3B-C for ISC, and Figure 5 – figure supplement 1C-E for EB (with panel E newly added). As we did not observe any effect on ratio or density, we did not monitor cell proliferation for ISC knockdown.

      Nonetheless, we added the mitotic index for the ISC+EB driver (new Figure 1- figure supplement 1G) to be consistent with the results from the EB driver (Figure 5- figure supplement 1C).

      If the primary role of Vinculin is to induce contact inhibition in the ISC from the EB and prevent the EB differentiation and proliferation, one would expect that over expression of Vinc[CO] (or perhaps VincFL or sqhDD) in EBs should prevent or delay the differentiation and proliferation induced by a presumably orthogonal factor, like infection with Pseudomonas entomophila or Erwinia carotovora.

      This is indeed an exciting prediction, but outside the scope of this manuscript.

      3) - Relationship between Vinculin and alpha-Catenin

      The authors establish a very clear difference in the phenotypes between focal adhesion components and Vinculin, whereas the similarity of alpha-catenin and vinculin knockdowns is very compelling. Therefore I am sure the authors are in the right path with their interpretation of this part of the paper. However, some of the alpha-Catenin experiments are not very clear. The result from the rescue experiment of alpha-Cat knockdown with alpha-Cat-deltaM1b does not seem to show what the authors claim, and differentiation does not seem affected, only the amount of extant older ECs (which may be due to other reasons as this is a non-autonomous effect).

      Like the reviewer, we were surprised about the milder rescue with M1b compared to M1a and are unsure of the reasons for this. Nevertheless, quantifications of the differentiation and retention indices show significant differences for M1a and M1b compared to the FL control (Figure 6F-G), with phenotypes resembling the vinc knockdown. In Figure 6E, we have added a row of zoomed views to better highlight the similarity of phenotype between M1a and M1b and have acknowledged the mild differences in the text (bottom of page 9). For the sake of rigour, we think it is important to include results from both M1 deletions, even if there is not yet a logical reason to explain why they have different effects.

      Ulrich Tepass produced a UAS-alpha-catenin construct with the full deletion of the M1 region, perhaps that would show a clearer phenotype.

      This is a good suggestion, however for technical reasons this is not possible. The strategy devised by Ken Irvine and his group relies on rescuing the RNAi with an RNAi resistant construct, which is not the case for the constructs generated in the Tepass lab. Furthermore, we cannot adopt a MARCM strategy as -cat is too close to the centromere (80F).

      Also, the autonomy of the phenotype is difficult to address with these experiments alone. It would be expected that the phenotype of alpha-catenin knockdown should be similar to that of vinculin knockdown in the ISCs only or EBs only.

      This is not what our understanding of cadherin-mediated adhesion would predict. Forming cadherin adhesions requires cadherins and catenins in both cells, so we would expect similar phenotypes in ISCs only and EBs only. What is exciting about our findings is that the mechanosensitive machinery is not equally important in the two adherent cells, i.e. the EB is using vinculin to measure force on the contact and regulate differentiation, whereas the ISC needs to resist that force, but does not use vinculin to sense that force and regulate its behaviour.

      We have added new data showing the role of the vinculin/α-catenin interaction in ISCs or EBs by co-expressing α-Cat RNAi and α-Cat ΔM1a. We observed that absence of VBS in α-catenin has no effect in ISCs but promotes EB differentiation and increase in numbers (new Figure 6 – figure supplement 2), similar to our observations with vincRNAi (see text page 10).

      Reviewer #2 (Public Review):

      Vinculin functions as an important structural bridge that connects cadherin and integrin-mediated adhesions to the F-actin cytoskeleton. This manuscript carefully examined the mutant phenotype of vinc in the Drosophila intestine and found that vinc mutant in EBs causes significant increases of EB to EC differentiation, stem proliferation, and tissue growth. By analyzing the mutant phenotype of the cadherin adaptor alpha-catenin, the authors suggest that the vinc functions through the cell-cell junctions instead of cell-CEM adhesions in EBs. Finally, manipulation of myosin activity in EBs phenocopies the vinc mutant, suggesting that vinculin is regulated by the mechanical tension transduced through the cytoskeleton.

      The authors claim that the vinculin mutant phenotype is opposite compared to the loss of the major integrin components, suggesting a function independent of the cell-ECM adhesions. However, the phenotype of vinc and integrin may not be completely opposite. Besides loss of ISCs, both mys and talin knockdown in ISCs clearly causes ISCs differentiation into EC cells (Fig.3A), suggesting a possible involvement of integrin in EB to EC differentiation. Therefore, it will be important to test the phenotype of integrin KD in EBs using EB-specific Gal4.

      The reviewer raised an important point. To test this we had to overcome the ISC defect of mys or talin RNAi, and specifically tested their function in enteroblasts using the KluGal4 driver. This revealed a similar phenotype of accelerated differentiation, assayed with the ReDDM system (see new Figure 6 -figure supplement 4). Thus, as the reviewer suggested both integrins and cadherins function in this process, we have amended the text to indicate this (see page 10, and sentence in the discussion page 12). It appears however that, unlike vinculin, they also have a key role in ISCs.

      The authors proposed a model that the cell-cell adhesion between ISC and EBs is required for vinculin mediated differentiation suppression. However, this model is not directly supported by the data as the EB-ISC adhesion and EB-EC adhesion have not been tested separately.

      This is an important point and we have amended the text to address this.

      We have focussed our model on EB-ISC adhesion as the adherens junctions are stronger between progenitor cells than EBs-ECs, and because of previous data from the Ohlstein lab (Choi et al, 2011) demonstrating the relationship between adherens junction stability and EB differentiation/ISC proliferation. Nonetheless we agree it is possible that EB-EC adhesion might contribute to this mechanism and have modified the last sentence of the result section (page 12) and the legend associated to the model (Figure 8) to take this into account.

      In addition, previous short-term manipulation of E-cadherin in ISCs and EBs shows no change in cell proliferation (Liang J. et al. 2017), which seems to contradict the authors' model. To support the authors' conclusion, long-term manipulation of E-cadherin in ISCs and EBs must be tested.

      A main feature of the vinculin phenotype is the regional accelerated differentiation observed in R4/5, potentially reflecting areas more subject to mechanical forces. Strikingly, this accelerated differentiation is rarely observed more anteriorly (such as region R4a/b studied in Liang et al, 2017). In fact, these regional differences were previously reported with E-cadherin knockdown by the Adachi-Yamada group (see Figure S1, Maeda et al, 2008). This highlights the importance of considering regional control of cell fate for the field.

      To test our hypothesis further, we have knocked down E-cadherin and α-catenin in EBs only (with Klu-Gal4). As shown in new Figure 6-figure supplement 3, we observed an accumulation of EBs as early as 3 days after induction, reminiscent of vinculin loss of function phenotype. Longer E-cadherin EB knock-down with KluGal4 appears particularly detrimental for survival as all flies died after 4 days of continuous RNAi expression preventing any further observations (see new text page 10). These observations support our model that junctional stability slows down EB differentiation. Our results are also in agreement with the work described in Choi et al (2011), whereby after 6 days of E-Cadherin RNAi expression in progenitors or EBs (using a different driver from us, Su(H)Gal4), the mitotic index increases, showing a feedback regulation on ISC proliferation. Therefore, our work and the Liang et al 2017 study are not in fact contradictory: the differences in the contribution of junctions to tissue dynamics might reflect the variety of molecular mechanisms involved along the small intestine.

      The result of MARCM analysis seems inconsistent with the rest of the data. In MARCM, no significant change of clone sizes is observed between WT and vinc mutant (Fig. 3E). However, vinc mutant in EBs clearly promotes ISC proliferation in other experiments such as esg>vinc-RNAi and the EB>vinc-RNAi (Fig. 1A, Fig. 4).

      Please refer to point 2a, essential revisions. We do not think that our results are at odds with the phenotype of vinculin knockdown using the ISC and ISC/EB drivers - we realise the text was misleading and hope to have clarified our observations in the revised manuscript (pages 7-8).

      In Fig. 4H, the authors suggest that vinculin mutant prevent terminal EC formation. However, this may be simply caused by longer retention of Klu expression in the newborn ECs. To test if EB differentiation is indeed affected, the EC marker pdm1 staining will provide more convincing evidence. Another experiment to strengthen the conclusion will be the tracking of clone sizes generated from a single EB cell using the UAS-Flp system (such as G-trace).

      These are good suggestions to strengthen our findings. Unfortunately, we have not managed to obtain a working Pdm1 antibody (or other commercially available EC marker), which is why we assayed nuclear size and the tracking of KluReDDM cells. Therefore, we have not been able to test if Klu is retained in newborn ECs.

      As we agree this section of the text was misleading, we have rephrased and highlighted that the phenotype seen with KluGal4ReDDM resembles the accumulation of activated EBs and newborn ECs observed in vinc102.1 guts. (page 8).

      In Fig. 6D, the survival rate of WT and vinc mutant flies were compared. However, as there is no additional assay about the feeding behavior or metabolic rate, the systematic mutant of vinc does not provide a direct link between animal survival and intestinal EBs. Therefore, an experiment with vinc level specifically manipulated in fly intestine using esg>vinc-RNAi or BE>vinc-RNAi will be more relevant.

      This experiment has now been added in Figure 8B and the text modified to acknowledge the limitations of the survival experiments with whole mutant flies (see point 3, essential revisions above).

      Reviewer #3 (Public Review):

      Prior work had identified essential roles for Integrin signaling in regulating intestinal stem cell (ISC) proliferation, and the authors studies were motivated by trying to understand whether Vinculin (Vinc) might participate in this. However, Vinc is involved in mechanotransduction at both focal adhesions (FA) and adherens junctions (AJ), and their results revealed that Vinc phenotypes do not match those of FA proteins like Integrin. Conversely, they do match a-catenin (a-cat) RNAi phenotypes, and together with the localization of Vinc and the phenotypes associated with a-cat mutants that can't bind Vinc, this led to the conclusion that Vinc is acting at AJ rather than FA in this tissue. The results here are convincing, with clear presentation, nice images, and appropriate quantitation. It's also worth emphasizing that initial characterization of Vinc mutant flies failed to reveal any essential roles for this protein in Drosophila, so finding a mutant phenotype of any sort is significant.

      While the manuscript is strong as a descriptive report on the requirement for Vinc in the Drosophila intestine, it doesn't provide us with much understanding of the mechanism by which Vinc exerts its effects, nor how its requirement is linked to intestinal physiology.

      There is always more to learn, and the importance of our work so far is that it demonstrates a very specific role for vinculin as a mechanoeffector in regulating cell fate decisions in specific regions of the midgut, and provide the foundation for future work addressing the detailed mechanism of this function and physiological role.

      Prior work has shown that mechanical stretching of intestines stimulates ISC proliferation (presumably through Integrin signaling), which is opposite to what Vinc does here.

      We would like to stress that very little mechanistic knowledge is available regarding how mechanical stretching stimulates ISC proliferation, in Drosophila or mammalian systems. To our knowledge, the only work linking gut mechanical stretching to cell fate decisions in Drosophila identified Msn/Hippo pathway (Li et al., 2018) and the ion channel Piezo requirement (He et al., 2018). We agree with the reviewer that integrin signaling would most likely contribute, especially given the composition of gels for organoid cultures (Gjorevski et al, 2016), yet the actual molecular mechanisms remain to be elucidated.

      There is a suggestion that Vinc is involved in maintaining homeostasis, but how its regulated remains a bit murky. The authors report that reductions in myosin activity result in phenotypes reminscent of Vinc phenotypes, which they interpret as supporting a model where Vinc's role is to help maintain tension at AJ. Of course it could also be reversed - maybe they are similar because tension is needed to maintain Vinc recruitment to AJ? They lack of epistasis tests and lack of analysis of whether Vinc localization to AJ in EBs is affected by tension or the M2 deletion of a-cat leaves us uncertain as to the actual basis for the relationship between Vinc and myosin phenotypes.

      Thank you for all these suggestions. New experiments have been done to test the relationship between cellular tension and vinculin at junctions (see essential point 1).