7,842 Matching Annotations
  1. Jan 2022
    1. They have improved his food, his clothing, his shelter; they have increased his security and released him partly from the bondage of bare existence. They have given him increased knowledge of his own biological processes so that he has had a progressive freedom from disease and an increased span of life. They are illuminating the interactions of his physiological and psychological functions, giving the promise of an improved mental health.

      Interesting take to see the progress of man come about, just discussing how humans went from a bare existence to incredible technological feats.

    2. His excursions may be more enjoyable if he can reacquire the privilege of forgetting the manifold things he does not need to have immediately at hand, with some assurance that he can find them again if they prove important.

      This is one of my difficulties. I put a lot of stuff in my blog-as-memex but don't have a good way of surfacing them again. Theoretically I could do this with categories, but that gets overwhelming fast. This is why I'm thinking about using a blog and a wiki together for this purpose.

    3. There is a new profession of trail blazers, those who find delight in the task of establishing useful trails through the enormous mass of the common record.

      It me! This is kinda what people who operate as web librarians do.

    4. When the user is building a trail, he names it, inserts the name in his code book, and taps it out on his keyboard. Before him are the two items to be joined, projected onto adjacent viewing positions. At the bottom of each there are a number of blank code spaces, and a pointer is set to indicate one of these on each item. The user taps a single key, and the items are permanently joined. In each code space appears the code word.

      This is tagging.

    5. if the user inserted 5000 pages of material a day it would take him hundreds of years to fill the repository, so he can be profligate and enter material freely.

      How many people use Evernote as a Memex?

    6. The human mind does not work that way. It operates by association. With one item in its grasp, it snaps instantly to the next that is suggested by the association of thoughts, in accordance with some intricate web of trails carried by the cells of the brain. It has other characteristics, of course; trails that are not frequently followed are prone to fade, items are not fully permanent, memory is transitory. Yet the speed of action, the intricacy of trails, the detail of mental pictures, is awe-inspiring beyond all else in nature.

      Bush points out that indexing systems and rules do not duplicate the human mind - we must convert our own mental associations to a form we can use to search them - but that the human mind works by association. I extrapolate from this the idea of hypertext as a model of how the mind works. I'm going to keep an eye out for other instances of this idea.

    1. dea that the de-emphasis on the collective must be an index of lesser deliberation and a resort to mere personal impressions, what appears less collective may just be less formal, while still as collective as ever.

      Here, he basically sums up his position on why language is more self-focused than it used to be--it's just how we talk, it's not how we think

    1. Accessibility/Documentation Throughout our web pages, you will encounter links to our documentation, provided via Google Docs.  To download any of these items, you will navigate to “file,” then select “download as.” This step allows you access to our documents both online and offline, as well as in your preferred file format (.doc, .pdf, among others). You may find that this format also helps with translation of text to preferred language.  We have included clickable table of contents to navigate to particular sections of a document, although if you select the paper icon tab in the left corner of the current document (next to the page ruler and directly below the printer icon), Google automatically provides an outline view of the text provided.  We strive to provide accessible, universally designed content, inclusive of our documentation. Please contact us at itms@muhlenberg.edu with any questions or with suggestions for future documentation.

      While important, I don't think it needs to be front and center. We could have a whole page/section to Accessibility potentially working with Support and their materials. Annnd I have soooo many little cheat sheets on good practices that, to my knowledge, don't live anywhere.

    1. Interesting to note how a player's expectations of generic conventions can have such a huge effect on the emotions created. Like if you can fight monsters, players assume you should be able to do so at least semi-effectively because that's how they've learned combat systems are "supposed to work." (Chekhov's combat system?) Balancing the necessity of failure or "death" for tension with the risk that repeated failures could also undermine the drama by highlighting the artificiality of the game seems like a very difficult thing to get right. I also think the idea of a player noticing the feedback systems and becoming more immersed because they "trust" the game to provide them a consistent experience is fascinating because that seems counter to the usual notion of "immersion." Designing the sanity system more as a continuous "mood feature" rather than a discrete mechanic based on resources seems like a clever adjustment to generic conventions. Allowing a negative feedback system invisibly advantage a player, avoiding players unknowingly "dooming" their game state, seems so counter to the hostile posture of a horror game but it's interesting to note how many very influential horror titles use the technique. Interesting how much of the game design of Amnesia was about minimizing frustration in the immediate gameplay, but still creates discomfort in other ways. Like how players being frustrated by a "bad" combat system was something to be avoided, yet removing the ability to fight back also could be seen as creating a different sort of frustration, as it disempowers players who may resent being "forced" to run.

      It was also interesting to know about the blog which spoke about playing GTA 4 as a law abiding citizen and following all laws. Hearing conversations and abuses while walking the streets. All of these situations are inspired from our daily lives but we tend to miss them or ignore them as there are so many other attributes which require more of our attention.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1:

      This is the first such piece of data to come from human infective parasites in the field. Technically this is a feat - because the small number of parasites that are present per mL of human blood at any given time during infection with T gambiense. Nevertheless they manage to identify up to 14 unique VSGs per patient sample. And this raises the first theoretical question: can they extrapolate to the average diversity load per human?

      This is an intriguing question that we would like to eventually answer, but we do not believe we can make this estimate from the data we currently have. We know our sampling is insufficient based on the correlation between parasitemia and diversity, and we do not have sufficiently precise estimates of parasitemia that could be used to extrapolate total diversity in the blood. Moreover, our analysis was only performed on RNA extracted from whole blood samples. Recent studies indicate that significant populations of parasites reside in extravascular tissue spaces, and our analysis did not address antigenic diversity in these spaces. We believe it is unlikely that the blood alone reflects the full diversity of VSG expression in an infection, and an estimate based only on blood-resident parasites (if possible) could be misleading.

      this is important because the timing of sample collection (ie that it occurred within a period of weeks) suggesting that an initial group of infected tsetse infected these patients (rather than a small number of interactions between a bloodmeal and a new infection - generally in itself on the order of 1 month or so). If parasitemia is low and diversity limited, this would explain both why CATT works as well as it does (because really it shouldn't at all!) and perhaps even the chronicity of infection (in the sense that the organism is unlikely to "run out" even of complete VSGs, never mind mosaics). The paper would benefit from a direct discussion on this.

      Indeed, the timing of sample collection could inform our interpretation of the data. However, sample collection occurred over a period of six months. More importantly, patients were in both early and late stage disease at the time of sample collection, so we cannot estimate how long any individual patient had been infected. We have added text (line 180) to highlight this fact. Because some patients were infected at least 6 months apart (if not much more than that), it is unlikely that patients were infected around the same time by a small group of infected tsetse flies. Reviewer #1 introduces an interesting point about the efficacy of the CATT diagnostic test as it relates to antigenic diversity. We discuss CATT sensitivity in the introduction (lines 115-120) as well as the discussion, where regional sensitivity differences are mentioned (lines 715-718). Given uncertainty about total diversity and time since initial infection, we have refrained from speculating about how diversity/timing could affect CATT sensitivity.

      An interesting feature of this new study is the apparent bias to type B N-terminal domain VSGs as well as the discovery that two patients share a specific VSG isolate (though it is not mentioned whether they are related by distance etc). This raises the possibility of substrains with different VSG archives that vary by geography.

      We found two VSGs which were expressed in more than one patient. One was expressed in two patients from the same village (village C) while the other VSG was common between two cases originating from villages C and D, some 40 km apart. We agree that our data generally support the possibility that the VSG archive might vary geographically. We have performed additional analyses suggested by reviewer #2 that support this idea: we have now shown that Tbg patient VSGs classified in this study, which originated from the DRC, are distinct from the VSGs encoded by the reference strain Tbg DAL 972 which was isolated in Cote d’Ivoire. We mention this possibility on lines 721-724.

      Alternatively it suggests that perhaps type B VSGs are picked up differentially by serology (and there the one feature of type B VSGs that could be shared, with regards to detection, is the O-hexose decoration on a number of type B VSG surfaces. Could CATT be detecting elements common to sugar decorated VSGs? Experimentally this is something that can be tested even with mouse infection materials.

      This is indeed an intriguing possibility. We mention this in the discussion (lines 772-778): “In T. brucei, several VSGs have evolved specific functions besides antigenic variation [74]. Recently, the first type B VSG structure was solved [75], revealing a unique O-linked carbohydrate in the VSG’s N-terminal domain. This modification was found to interfere with the generation of protective immunity in a mouse model of infection; perhaps structural differences between each VSG type, including patterns of glycosylation, could influence infection outcomes.” While this is an experimentally tractable explanation for the type B VSG bias we observe, we believe such experiments are beyond the scope of the current paper.

      Side comment: are the common VSGs mutated between patient samples?

      We classified VSGs as common between patient samples if they had >98% nucleotide sequence identity as well as meeting the other quality cutoffs such as 1% expression level and consistency across technical replicates. This identity cutoff still allows for several mismatches between sequences, which we do occasionally observe. However, we cannot confidently rule out that the “mutations” we observe are sequencing or PCR errors. Thus, we cannot say for sure if there are mutations between common VSGs.

      Reviewer #2


      1.Throughout the manuscript you observe 'diversity' in expressed VSG and its existence becomes a principal conclusion. I feel that the meaning of diversity and its significance is not sufficiently explained for the reader. In the abstract (l48) you say that there is 'marked diversity' in parasite populations. Presumably you mean parasite infrapopulations, i.e. within patients, not across the DRC? In any case, what is 'marked' about it, and relative to what? Why does it matter that there are multiple expressed VSG in a single patient at one time? Is this not a reasonable expectation for a population of (presumably) clones capable of switching the expressed VSG? How is this different to the view typical of the literature since 1970 that one VSG dominates while others wait in the background at low frequencies. If 'diversity' is the conclusion, then you need to define it and explain its significance more.

      When we refer to diversity, we do mean infrapopulations of parasites within patients, or individual animals in this case, rather than across the DRC. We have edited the text to make this clear (see below). However, the study which benchmarked the application of VSG-seq to quantify VSG expression in vivo during mouse did not support the previously-held view that one VSG dominates while others wait in the background at low frequencies. Frequently we observe a handful of VSGs present at 10-20% of the population at any timepoint, and many VSGs (~50% of all detected variants) present at “In a proof-of-principle study, we used VSG-seq to gain insight into the number and diversity of VSGs expressed during experimental mouse infections [30]. This proof-of-principle study revealed significant VSG diversity within parasite populations in each animal, with many more variants expressed at a time than the few thought to be sufficient for immune evasion. This diversity suggested that the parasite’s genomic VSG repertoire might be insufficient to sustain a chronic infection, highlighting the potential importance of recombination mechanisms that form new VSGs.

      2.Following on from 1., why does the analysis deal in counts of distinct VSG or N-terminal domains, and not then progress to their relative expression? The expression data are in Supp Table 3 and they show that, in most cases where many VSG are observed in the same patient, 1-3 of these are 'dominant', i.e. they account for >50% of the population.

      The VSG-seq analysis pipeline does estimate the relative expression level of each identified variant in the population, and this information is available in the supplemental data (Supplemental Figure 1, Supplemental Table 3). However, we chose not to rely on these measurements too heavily because there was some variation between Tbg technical replicates, which is shown in the supplemental heatmap (Supplemental Figure 1). Replicate three tends to not agree with the first two replicates. We suspect that this was due to the order of sample processing and the fact that the parasite-enriched cDNA sample was repeatedly freeze-thawed between library preparations for technical replicates. Additionally, because our sampling did not reach saturation, some VSGs are not detected in all replicate libraries, making it difficult to estimate their abundance.

      We have added a discussion of these issues to the text on lines 431-433: “Because our sampling did not reach saturation, resulting in some variability between technical replicates, we chose to focus only on the presence/absence of individual VSGs rather than expression levels within parasite populations.”

      Figure 1 deals in VSG counts, but I would then expect another figure to illustrate the reality that only a minority of these observed VSG are likely to be clinically relevant (i.e. the subject of the immune response). This impacts the 'diversity' conclusion, as given in the discussion (ll 657-9), because you cannot afford to treat all these VSG equally when their abundances are quite different.


      We agree that relative expression level is a useful metric, but absent longitudinal sampling it is impossible to determine which VSGs are clinically relevant as defined by the reviewer: low abundance VSGs at one time point may be the predominantly expressed variant at another. Moreover, the threshold for triggering an anti-VSG antibody response remains unknown. Thus, we have chosen to treat all detected variants equally.

      3.How related are these VSG? Were you able to ensure unique read mapping to the VSG assembly? Can you show that reads mapped to a single VSG only and therefore, that the RPKM values are reliable?

      Our analysis accounts for the fact that VSGs can be very similar. We only considered uniquely mapping reads in our VSG-seq analysis. We also account for mappability in our quantification, so VSG sequences that are less unique (and thus have fewer uniquely mapping reads) are not artificially underrepresented in estimates of relative expression. We have specified the parameters used for alignment (line 274) in the methods.

      4.The authors observed no orthology between expressed VSG and DAL972 genes. This is really interesting and deserves closer attention. Presumably there is microhomology? For T. brucei VSG, with constant recombination, we would predict that a comparison of the VSG in West and Central Africa would reveal a pattern of mosaicism, such that individual sequences in DRC would break down into motifs present in multiple genes in the West African reference. Question is, how many genes? What does that distribution look like? What is the smallest homology tract? There is an opportunity here to comment on how VSG repertoires diverge under recombination. How much of the expressed VSG sequence is truly unrepresented in the West African reference (or other T.b.gambiense genome sequences available in ENA). I can believe that none of the N-terminal domains in these data are present intact in DAL972, but I cannot believe that their components are not present without evidence.

      We appreciate the reviewer’s suggestion to look at this more closely. We have performed additional analyses to address sequence similarity, or lack thereof, between the assembled DRC patient VSG and the West African reference TbgDAL972. We ran a nucleotide BLAST of expressed VSGs against the TbgDAL972 genome reference sequence pulled from TriTrypDB.org (release 54). We have added a supplemental figure depicting the results of this analysis (Supplemental Figures 6 and 7). Briefly, our analysis shows that most of the N-termini we identified have no significant similarity to DAL972 VSGs, even with very permissive search parameters. There are frequent hits in the VSG C-termini, however, which might be expected. Most BLAST hits are short spans 98% identity are short 20-25 bp regions. Given the large divergence from the reference, we were unable to infer any patterns of recombination in the VSGs. However, we believe this analysis supports our claim that the N-termini of VSGs assembled from DRC patients are novel, with their component parts largely unrepresented in the West African reference genome.

      Figure 4 compares NTD type composition in the DRC data with previously published mouse experiments. The latter take place over very short timescales in maladapted hosts, while the timescales of the latter in natural hosts are unknown but plausibly very much longer. So are these data really comparable and are we learning anything from their comparison, given that the most likely explanation for the NTD bias in expressed VSG is the underlying genomic composition?


      Indeed, this is our intended conclusion from figure 4. The figure is meant to illustrate our claim that the expressed VSGs in each experimental set reflect the underlying genomic composition of their corresponding reference strains, despite fluctuations over time. The language and legend for Figure 4 has been clarified to emphasize this point. We have emphasized in the text that it is unknown whether these fluctuations occur over time in much longer natural infections.


      6.Please comment on the technical reproducibility of the data, there are multiple instances in Supp Table 3 where technical replicates expressed different VSG.

      Three RNA-seq library technical replicates were prepared for each individual gHAT patient RNA sample. Replicates were prepared in batches together so all 1’s were done on the same day, for example. The original parasite-enriched cDNA sample was frozen and thawed between each batch. We suspect that the cDNA degraded after repeated freeze-thaw cycles, which is why replicate three tends to not agree with the other two as can be seen on the heatmap in supp fig. 1 and the expression data in supp table 3. We also suspect the fact that our sampling did not reach saturation resulted in the detection of different VSGs in individual replicate preps. We have edited the methods and mentioned this variability in the results section to communicate this issue more transparently.

      • Lines 395-397 “Using RNA extracted from 2.5 mL of whole blood from each patient, we prepared libraries for VSG-seq in three separate batches for each technical replicate.”
      • Lines 431-433: “Because our sampling did not reach saturation, resulting in some variability between technical replicates, we chose to only focus on the presence/absence of individual VSGs rather than relative expression levels within the population”

      Reviewer #3

      1. In line 499, the authors conclude the due to the expressed VSGs being different in the blood and CSF being difference it may indicate that different organs harbor different VSG sets. Given that this is n=1 for patient samples I think this is too speculative a statement. There is also no indication as to whether the samples were taken at the same time or not.

      This is absolutely correct. The precise timing of CSF sample collection is unknown for these samples. It likely occurred within hours to days after blood collection, but even on this short time scale, the unique CSF repertoire could represent the antibody-mediated clearance of one VSG population and replacement with another. We have scaled back our language and only point out that there are unique VSGs in this space (Lines 522 – 524).

      I think that the authors need to be very careful as to the conclusions drawn about VSG expression over time in terms of hierarchy and N-terminal fluctuations. For any conclusions to be drawn on the hierarchy of VSG expression more data points are needed taken over time (this is obviously challenging when looking at patient samples). I find it too speculative to draw any conclusions when single time points are assessed and the assumption on the progression of the infection depends on whether it is a Tb or Tbr.

      Reviewer #2 also pointed this out. We agree and have attempted to limit definitive conclusions in the text and instead discuss multiple possible explanations behind our observations.

      I found some of the figure legends a bit terse. For example, in Figure 1 C, what do the black circles and lines represent? Perhaps a little more detail would help the reader.

      Clarified legends for UpSet plots in figures 1C and 3C as follows: “The intersection of expressed VSG sets in each patient. Bars on the left represent the size of the total set of VSGs expressed in each patient. Dots represent an intersection of sets with bars above the dots representing the size of the intersection.”

      In figure 2, I found it difficult to distinguish between the orange and dark red in (A) and the two lighter blue colors.

      We have changed N-terminal type color palette for all plots to make red and blue hues more distinctive.

      In line 389 – estimate

      Corrected

      In line 498 - should be reference been to figure 2C?

      This should be a reference to Figure 3B. We have corrected the reference.

    1. But while Americans can, he says, perceive that they are faced with “intricate social and cultural problems,” they “tend to think of them as scientific and technological problems” to be solved separately.

      Yes, because we rarely make progress in other aspects. For technological problems there's a clear solution that people will buy if it works. Deliberate sociological change you may have to force, which is never a good basis.

    1. Author Response:

      Reviewer #1 (Public Review):

      This is a clearly written manuscript describing an elegant study that demonstrates how microsaccades are not the triggers of attentional effects, and that attentional modulations can be observed in the absence of microsaccades. This is a very much needed work, especially in the light of the recent debate regarding whether or not microsaccades are the cause of peripheral attentional effects. By explicitly comparing and quantifying the effects of attention on neuronal responses in the presence and in the absence of microsaccades, this work provides important insights on this debate. I think the work is well conducted and the results are solid.

      We thank the reviewer for their supportive comments!

      I only have few comments/suggestions:

      1. Lines 125-126, the authors report that monkeys generated frequent microsaccades but their overall direction was not systematically biased towards the cue location. This seems to be in contrast with what previously reported in the literature in humans and monkeys. I think this discrepancy should be discussed in the discussion. Is this simply the result of different experimental paradigms (maybe exogenous vs endogenous attention, or the presence of the cue for the entire duration of the trial, ect)?

      As suggested, we discuss three main factors which may contribute to this discrepancy:

      The first factor is the difference in the time window used for microsaccades analyses. Previous reports focused their analyses of microsaccades on the time window immediately after cue onset. In our analyses, the time window focused on is the ‘delay period’ which is hundreds of milliseconds after the cue and the time epoch used in most electrophysiology studies about attention.

      A second factor is how the spatial cues were presented. In our paradigm the cue ring appeared in the periphery and then disappeared. In contrast, previous paradigms used a cue presented near fixation that persisted throughout the trial. Our brief peripheral cue provides less of an impetus to generate small saccades directed towards the cue, compared to the case when the cue is continuously near the center of gaze.

      A third factor is that monkeys in our task were trained to release a joystick to report their detection of stimulus events, rather than make a saccade. Because human and monkey subjects tend to make microsaccades in the same direction as their upcoming saccadic choices (Yu et al., 2016), attention tasks using saccade reports will tend to introduce this direction bias on microsaccades. By using a joystick release, we minimized these lateralized effects related to saccade preparation.

      These points are now addressed in the second paragraph of discussion.

      1. It is very interesting that microsaccades modulate neural responses for stimuli that are much further away from their landing location. However, the stimulus used in these cueing tasks is also unnatural. Normally we are not fixating on a meaningless dot while all the interesting stimuli are presented in the periphery. In normal conditions the foveal input is rich in detail and it is generally relevant (that's why we are foveating certain stimuli in the first place). I wonder if the authors can comment on whether the modulations reported here would also occur in more natural conditions when an interesting and maybe salient/relevant stimulus is presented at the center of gaze, while subjects are also attending to a peripheral target. Will the neural response be modulated selectively for neurons for which the receptive field is on the peripheral target or will it also affect neurons where the receptive field aligns with the microsaccade target location in the fovea?

      The reviewer raises a very good point. In our study, the relationship between microsaccades and attention-related modulation was examined when monkeys selectively attended a stimulus located in the near peripheral visual field while maintaining central fixation. We agree that under more natural conditions, the monkey would just look directly at the peripheral stimulus. As in many attention studies with this type of design, our experiments hold the system in a state of sustained peripheral attention which would otherwise be much shorter.

      We believe that similar modulation at the peripheral location would be briefly observed if the monkey were allowed to satisfy the natural tendency to look at the stimulus, although this would make it more difficult to examine the relationship with microsaccades. This would be consistent with the documented pre-saccadic modulation of attention (e.g., documented by the Carrasco lab, Li, Hanning, & Carrasco, 2021).

      Once the attended stimulus is foveated, there is strong behavioral evidence from several recent studies demonstrating that attention can be selectively distributed even within the fovea (Poletti, Rucci, & Carrasco, 2017). Considering the now substantial evidence that the foveal portion of the SC map is activated when the behaviorally relevant location is at the center of the visual field (e.g., during parafoveal smooth pursuit as in Hafed & Krauzlis, 2008), we expect that SC neurons with foveal RFs would display similar attention-related modulation as we found here. However, to the best of our knowledge, there have not yet been studies documenting the attention-related modulation of neurons with foveal RFs and the possible influence of microsaccades.

      We agree with the reviewer that these are interesting points, and have now added a new paragraph in the discussion (final paragraph) to address this point.

      1. The authors do not report behavioral performance. Presumably the task is very easy, but I wonder if reaction times and performance correct was related with the attentional effects and how did it change with respect to microsaccade direction, e.g., were subjects' reaction times shorter at the cued location also when microsaccades were directed at the opposite location? I think this information would be very valuable.

      We agree it is valuable to document the behavioral performance; we had omitted this because this is the same task we have used in previous studies which do include such behavioral documentation.

      To address the reviewer’s comments, we added an analysis and plot documenting the hit and false alarm rate for each subject in each experimental session. To accommodate this new plot, we have now divided the original Figure 1 (which included task, neuronal data and microsaccades) into a new Figure 1 (task, behavior, and neuronal data) and a new Figure 2 (microsaccades). The new plot showing hit and false alarms is Figure 1b in the revised manuscript.

      The task was not especially easy – we adjusted the amplitude of the color saturation change to be just slightly above the threshold for detection; hence, the hit rates were generally between 75-90%. The performance was very consistent across sessions in our well-trained monkeys, and the low rate of false alarms for ‘foil’ changes provides behavioral confirmation that they attended to the correct stimulus location.

      To address the comments about reaction time, we have added a new plot to our new Figure 2 (Figure 2c) showing the monkeys’ hit rates (top) and joystick release times (bottom) subdivided based on whether there were no microsaccades, microsaccade towards, and microsaccades away from the cued location (-50 to 50ms relative to cued stimulus change onset). These plots show that when there were no microsaccades, behavioral performance was at least as good as with microsaccades. When there were microsaccades, reaction times were slower when microsaccades were directed away from the cued location. As the reviewer may have anticipated, these effects again confirm that differences in attentional state as evident in task performance covary with the direction of microsaccades, and we thank them for the suggestions. We now added a new paragraph in the results to describe these findings.

      1. Another important difference in the paradigm used in Lowet et al vs the one described in this manuscript is that in Lowet et al monkeys were instructed to saccade toward the target position at some point during the trial after the cue and the target presentation. Hence, monkeys presumably prepared the saccade and held off its execution during the time the cue and the target were presented. This was not the case in the current paradigm, where the monkey is instructed to maintain fixation as in a standard spatial cueing paradigm. I wonder if this difference may explain some discrepancies in the results.

      This is a very good point. As mentioned in our reply to point #1 above, previous studies (Yu et al., 2016) have shown that human and monkey subjects tend to make microsaccades in the same direction as their upcoming saccadic choices. As pointed out by the reviewer, in the Lowet et al. study the directions of microsaccades might be related to the motor preparation of the upcoming choice saccade as well as related to the allocation of attention. In contrast, in our experiments, monkeys reported their choice by releasing the joystick and were prohibited from making larger saccades.

      We agree this can be an important factor for the differences in the results, and we now address these points in the second paragraph of discussion.

      Reviewer #2 (Public Review):

      This is a correlative study with the main result that microsaccades do not alter attention-related modulations of neuronal activity. This is an important question, speaking to the origin of one of the mind's most fundamental processes. The experimental manipulations and analyses are well chosen, carefully conducted and visualized. They include critical controls for alternative explanations.

      Thank you for your constructive comments.

      To ascertain their claims, however, it is important that the authors cover their ground. In pursuit of that, a few important analyses are required.

      1. Did the manipulation of attention work? In the present version of the manuscript, the authors do not report behavioral results, which is necessary to confirm that the cue was successful in manipulating attention. That is, the observed modulation in firing (in RF vs outside of RF) should be related to a behavioral advantage in sensitivity to changes at the cued location. To confirm the link of the neural results to attention (rather than, say, just the cue), the behavioral results provide opportunities for critical tests. One way to do this would be to analyze neural firing rates as a function of response rather than cue location (provided subjects made enough errors). Note: A detailed discussion of why the cue cannot be equated to attention can be found in Laubrock et al. (2010, Atten Percept Psychophys; https://doi.org/10.3758/app.72.3.683).

      Yes, the manipulation of attention worked. As suggested, we now document the effectiveness of the attention manipulation by plotting the hit and false-alarm rates for each subject in each experimental session (new Figure 1b). We also confirmed that the SC neuronal attention-related modulation depended on subjects’ behavioral response (new Figure 1d). We also note that these same attention manipulations have been used in previous studies examining the neuronal mechanisms of attention.

      1. Were all microsaccades detected? One of the main results of the study is that attention-related modulations were observed even in the absence of microsaccades. These results hinge on successful detection of all microsaccades, even at a very small scale. Given the video-based eye tracking the authors will have missed a (possibly large) number of smaller microsaccades (Poletti & Rucci, Vision Res, 2016; https://doi.org/10.1016/j.visres.2015.01.018). This concern is exacerbated by the fact that eye tracking was monocular, such that a validation of detected microsaccades based on the signal in the other eye could not be performed.

      We have performed additional microsaccade detection analyses using both more stringent and more lenient thresholds (the "lambda" value of Engbert & Kliegl, 2003). We have verified that our findings are robust over a range of detection thresholds and include a new supplemental figure to demonstrate this point (Figure 4 – figure supplement 2).

      1. Relation to previous claims of causality Hafed (2013, Neuron) reported perceptual changes in attentional cueing that covaried with the occurrence of microsaccades. Hafed (2013) argued that microsaccades might be underlying the performance changes commonly attributed to covert shifts of attention. This point seems central to the current paper's line of argument and should thus be discussed in detail with respect to the current findings. At present, the paper by Hafed (2013) is not cited in the current manuscript when its conclusions may need reconsideration based on the current results.

      We agree, and a similar point was raised by Reviewer #1. We have expanded the main text based on your recommendations.

    2. Reviewer #1 (Public Review):

      This is a clearly written manuscript describing an elegant study that demonstrates how microsaccades are not the triggers of attentional effects, and that attentional modulations can be observed in the absence of microsaccades. This is a very much needed work, especially in the light of the recent debate regarding whether or not microsaccades are the cause of peripheral attentional effects. By explicitly comparing and quantifying the effects of attention on neuronal responses in the presence and in the absence of microsaccades, this work provides important insights on this debate. I think the work is well conducted and the results are solid. I only have few comments/suggestions:

      1. Lines 125-126, the authors report that monkeys generated frequent microsaccades but their overall direction was not systematically biased towards the cue location. This seems to be in contrast with what previously reported in the literature in humans and monkeys. I think this discrepancy should be discussed in the discussion. Is this simply the result of different experimental paradigms (maybe exogenous vs endogenous attention, or the presence of the cue for the entire duration of the trial, ect)?

      2. It is very interesting that microsaccades modulate neural responses for stimuli that are much further away from their landing location. However, the stimulus used in these cueing tasks is also unnatural. Normally we are not fixating on a meaningless dot while all the interesting stimuli are presented in the periphery. In normal conditions the foveal input is rich in detail and it is generally relevant (that's why we are foveating certain stimuli in the first place). I wonder if the authors can comment on whether the modulations reported here would also occur in more natural conditions when an interesting and maybe salient/relevant stimulus is presented at the center of gaze, while subjects are also attending to a peripheral target. Will the neural response be modulated selectively for neurons for which the receptive field is on the peripheral target or will it also affect neurons where the receptive field aligns with the microsaccade target location in the fovea?

      3. The authors do not report behavioral performance. Presumably the task is very easy, but I wonder if reaction times and performance correct was related with the attentional effects and how did it change with respect to microsaccade direction, e.g., were subjects' reaction times shorter at the cued location also when microsaccades were directed at the opposite location? I think this information would be very valuable.

      4. Another important difference in the paradigm used in Lowet et al vs the one described in this manuscript is that in Lowet et al monkeys were instructed to saccade toward the target position at some point during the trial after the cue and the target presentation. Hence, monkeys presumably prepared the saccade and held off its execution during the time the cue and the target were presented. This was not the case in the current paradigm, where the monkey is instructed to maintain fixation as in a standard spatial cueing paradigm. I wonder if this difference may explain some discrepancies in the results.

    1. Because most of our science is supported by limited public funds, evolutionary biologists and ecologists should support and participate in efforts to help the public understand the issues and the value of scientific understanding. Science in general and evolutionary science in particular are often politicized, exactly because of their fundamental importance to human society.

      This is something that we can all look at and understand the importance of. This is one of the most important steps in solving most of the problems we face today. As I have gotten older I have realized and learned that there are a lot of things that we can learn from others, even when you may think that they do not have much to offer. This mentality that science must not be shared with the common man is outdated and must change if we want to progress.

    1. pg 38-"Consequently, attempts by theorists ... more fundamental or more grave"

      I think this is a huge problem that we have in society today, belittling or putting down someone's experience because it's not as bad as what it could have been, therefore it has not merit. I appreciate the author's dedication in correcting that wrong and clarifying that although the levels of oppression and/or the definition may not be the exact same for everyone, oppression is still oppression. I like to think of it like flavors of ice cream, some may be a lot stronger or have lots of different add ons, but in the end, it's still ice cream.

    2. Someone who does not see a pane of glass does not know that he does not see d. Someone who, being placed differently, does see it does not know the other does not see it.

      I love the beginning of this, it reminds me of the color of the sky argument. If someone says the sky is red, but you see the sky as blue, how can you tell them what they are seeing is wrong? For you may be seeing blue, but you can not look through their eyes, so you can not say what they are seeing is incorrect, only that you do not see the same color that they are seeing. But then it gets even trickier, because we do not know if their red is your blue or vice versa. It's a complicated thought that circles around perspective, something that I think is not only profound but also intricately important to all arguments and matters of discussion. A change or understanding in one's perspective is the difference between peace and war, and it's understanding all sides of a situation that allow us to begin to comprehend why anyone would view oppression as an acceptable way to treat another human being.

    1. Author Response:

      Reviewer #1 (Public Review):

      The investigators' goals were to describe the epidemiology and kinetics of post-acute covid lung sequalae and to determine the risk factors predictive of persistent lung impairment. A major strength of the study is the longitudinal observation through 6 months with protocolized clinical assessments that included patient-reported outcomes, lung function tests, inflammatory marker testing, and computed tomography of the chest, in a reasonably sized cohort that reflects the spectrum of disease severity in the pre-vaccination era. We learn a great deal about the different patterns of recovery in this group of COVID-19 survivors. The primary epidemiologic finding is that 52% of survivors continued to have symptoms at 6 months, while up to 72% of those with severe COVID requiring ICU level care continued to have lung abnormalities by chest imaging. This confirms general observations of "long covid" which also encompasses non-lung effects. While lung disease is less common in those with milder disease, the proportion of patients who were never hospitalized but experienced persistent symptoms is striking (50%), with lung function impairment in 17% at 6 months. As expected, the patients who had the most severe disease-those who needed the ICU-had the highest degree of chest imaging abnormalities. The kinetics of recovery is a significant observation: Figure 3 shows that most of the post-acute recovery in structural lung abnormalities occurs in the first 3 months and slows down thereafter, particularly for the hospitalized non-ICU patients. The investigators then embarked on a sophisticated analysis to determine how to predict persistent lung abnormalities (as detected by chest CT) at 6 months. When analyzed individually, among 50 clinical characteristics or lab values, the strongest unfavorable risk factors were elevated IL-6 (an inflammatory cytokine that is the target of tocilizumab) and CRP (c-reactive protein). Other variables that were strongly associated with CT abnormalities included immunosuppressive therapy, ICU stay as well as pre-existing conditions. When machine learning techniques were applied, risk factors that correlated with each other could be grouped together, and the patients could be categorized as low, intermediate, and high risk for delayed pulmonary recovery. As expected, known factors for COVID19 infection (age, male sex, medical comorbidities) and disease severity (need for oxygen therapy, ICU care and antibiotics) were more frequent in the intermediate and high risk groups. These predictive factors at acute COVID and day 60 follow-up mostly held up when tested against part of the cohort that was not used for analysis. Interestingly lung function impairment as measured by pulmonary function tests were only weakly correlated with persistent and severe chest imaging abnormalities.

      The novelty of this study lies in taking the epidemiology a step further with a machine learning analysis to determine which clinical characteristics and chest imaging features at the onset of acute COVID-19 are predictive of later persistent disease. One limitation of this study, however, is that it was conducted on patients in the early part of the pandemic, prior to the widespread use of remdesivir and corticosteroids/anti-cytokine therapies, that are now considered standard of care. Based on these findings, we can now hypothesize that current treatments are likely to reduce the impact of long-covid.

      We would like to thank the reviewer for careful study of the manuscript and appreciation of our work. We agree, that our longitudinal cohort and its hospitalized, severe COVID-19 subset in particular encompasses the patients, for whom the therapeutic armamentarium was limited and far from the therapeutic options available now. Whether novel anti-viral and anti-inflammatory medication as well as, in case of the vaccinated patients, the immunization status may accelerate the recovery or reduce the pulmonary damage is a matter of current research also in our center. We address this issue in the Discussion section to support a clear interpretation of the data by the interested reader.

      Machine learning (artificial intelligence, AI) is now being increasingly used to answer clinical questions on limited cohorts; the application of machine learning in this study contributes to our conceptual understanding of how clinical characteristics and biological factors cluster together to contribute to long-term COVID outcomes. Namely, the profound inflammation that characterizes severe acute COVID-19 pneumonia and poor early outcomes also contributes to chronic lung damage in survivors. In addition, a robust antiviral immune response (as seen with elevated anti-viral antibodies) without elevated systemic inflammatory markers were associated with less severe chest imaging patterns, also supporting the notion that an individual's immune response to the virus is responsible for the trajectory of disease. As noted, a significant proportion of non-hospitalized patients also suffered from chronic lung impairments. Taken together, the impact of prolonged convalescence on the workforce, healthcare, and individual lives should not be underestimated. These results underscore the paramount need for continued public health measures and vaccinations to prevent COVID-19, particularly for the most vulnerable individuals (older, immunocompromised, and with preexisting health problems). These observations provide additional biologic justification for the use of agents directed at reducing lung inflammation early in the course of disease, and potentially at an early post-recovery time point (i.e 2 months). Machine learning algorithms may one day help clinicians decide which patients should be targeted for additional therapies after the acute phase. With further study, implementation of AI to real world medicine may be on the horizon.

      We agree with the Reviewer that machine learning algorithms can overcome limitations of ‘canonical’, ordinal and generalized regression methods in the multidimensional setting i. e. when the number of available clinical parameters approaches or exceeds the number of observations/patients. Consequently, machine learning or AI allows for serial screening of medical record data at low cost and supports diagnostic and therapeutic decisions. We discuss those two aspects in the revised manuscript in the context of acute COVID-19 course prediction and long COVID prediction and phenotyping in light of the recent literature [1–4,6].

      Reviewer #2 (Public Review):

      This is a potentially valuable manuscript which links early markers of inflammation with residual abnormalities on chest CT following SARS-CoV-2 infection. Surprisingly, early surveyed symptoms do not predict long term radiologic outcomes (6 months after infection) while inflammatory markers have stronger predictive value. The cohort is well designed and the selected tools for analysis are appropriate.

      We thank the Reviewer for the careful study, critic and appreciation of our work.

      While this finding is potentially of high importance for clinical practice, the endpoints are inconsistently defined, and certain components of the machine learning and clustering analyses are difficult to interpret as presented. It is therefore challenging to understand whether the conclusions are justified by the analysis.

      We apologize for this unclarity. In the revised manuscript, we precisely define the analysis endpoints (any radiological lung findings at the 6-month follow-up, radiological lung abnormalities with CT score > 5, lung function impairment and persistent symptoms at the 6-month follow-up) of the analysis; see: Introduction and Methods/Study design. We also indicate the numbers of participants reaching those endpoints in Table 3.

      Several components of the analysis are confusing and would benefit from further elucidation:

      1) The authors do not clearly define "delayed pulmonary recovery". My sense is that they are using several radiologic based definitions rather than their functional definition (defined by FEV1, FEV:FVC & DLCO) of lung function but this is never explicitly stated. Are the functional outcomes and symptomatic recovery considered in any of the analyses other than correlations with radiologic findings in S1?

      As described above in our previous response, the prime focus and primary endpoint of the analysis was the presence of radiological lung abnormalities at the 6-month follow-up. Our motivation to focus on radiological endpoints was to focus on the potential development of persistent structural lung abnormalities, fibrosis and interstitial lung disease following COVID-19, as observed in SARS-CoV-1 patients [7,8]. Of note, lung function parameters were only weak correlates of radiological impairment as shown in Figure 3 – figure supplement 1 – 3 and our previous work [27]. This finding is in line with numerous studies in ILD patients which demonstrate a low sensitivity of lung function testing (especially FEV1 and FVC assessment) in patients with early interstitial lung disease (ILD) [10,11]. In addition, we could not exclude a pre-existing, COVID-19-independent impairment of lung function in a subset of the study participants suffering from pulmonary diseases, obesity and/or cardiovascular diseases (Table 1). Thus, lung function parameters only partially reflect COVID-19 mediated lung injury and convalescence.

      Nevertheless, we agree, that clinical and functional endpoints are of great interest for the scientific and clinical community. For this reason, we present additional results of univariable risk modeling for long-term (6-month follow-up) symptom persistence and lung function impairment (Figure 5, Appendix 1 – table 2), the results of machine learning modeling for those outcomes (Figure 9, Appendix 1 – table 5) and discuss the findings. We also present the prevalence of such long-term manifestations and lung function impairment in the Low-, Intermediate and High-Risk clusters of the study participants defined by non-CT and non-lung function clinical features (Figure 8).

      2) To this end, I was surprised that the functional definition and symptomatic recovery were not used as the primary endpoints. The functional definition and resolution of symptoms seem most important for the recovering patient so seems like the more important outcome. However, in Figures 5-7, it is often not clear whether the functional outcome is being considered at all.

      As mentioned above, the focus of the study was the assessment of structural lung impairment following COVID-19 and both, lung function parameters as well as symptom burden moderately correlate with structural lung damage (Figure 3 – figure supplement 1 – 3) – a phenomenon observed previously in SARS-CoV-1 [7,8]. Although the symptom burden and its resolution during follow-up are of major importance for the individual patient during post-acute recovery, these parameters are not a good marker for the potential long-term pulmonary outcome. E.g. younger patients with moderate to severe lung damage may demonstrate only mild pulmonary symptoms during post-acute recovery, but the structural damage may be associated with severe impairment at long-term follow-up due to progression of lung fibrosis or age-related decrease of functional pulmonary capacity [11]. Still, we agree with the reviewer that the follow-up on symptoms and lung function is of interest for the reader and additionally included those outcomes in the univariate and multi-parameter risk modeling. In addition, we present the frequencies of symptom persistence and lung function impairment in the low-, intermediate- and high-risk participant clusters defined solely by non-CT and non-lung function clinical parameters. See previous issue for more details.

      3) For the clustering in figure 5, I am uncertain how CT severity score >5 & CT abnormalities cluster separately, when these 2 outcomes appear to logically overlap. Specifically, does the CT abnormalities outcome include patients with the high severity score outcome? In other words, are patients in the "high severity" group a subset of patients with "CT abnormality"? If not a subset, then the CT abnormality should be labeled "non-severe CT abnormality". This could all be clarified by listing the number of patients in each group and showing with a Venn diagram whether there is any overlap.

      We apologize for the lacking clarity in this matter. As pointed by the reviewer, the patients with CT abnormalities scores > 5 points were a subset of the participants with any CT abnormalities. The same was true for the GGO-positive subgroup. We agree, that the overlap between the radiological outcomes obscures the message of the clustering and modeling results. To overcome this, we removed the GGO outcome variable from the analyses in the revised manuscript. In the revised manuscript, we clearly differentiate between mild (CT severity score ≤ 5) and moderate-to-severe radiological abnormalities (CT severity score > 5) in feature (Figure 6) and participant clustering (Figure 8). Frequencies of mild and moderate-tosevere CT abnormalities in the study collective stratified by the severity of acute COVID-19 are presented in Figure 3 – figure supplement 3B. Numbers of the study participants with any, mild or moderate-to-severe CT abnormalities at the subsequent follow-up visits are listed in Table 3.

      4) For the same reason, figure 4 is hard to interpret. Are CT severity >5 being compared to those with normal CTs only or those with normal or mild / moderate CTs? Please provide more specific definitions of normal, "CT abnormality" and "severe CT abnormality" and provide the number of people in each category and specify the comparator groups in all analyses.

      We are sorry for the confusion. In Figure 4 of the initial manuscript, any CT abnormalities, GGO-positivity and abnomalities with CT severity score > 5 were analyzed as separate outcome variables. The baseline was specific for the given explanatory variable, e. g. for the ICU stay this was the mild COVID-19 group or for the elevated IL-6, normal serum IL-6 levels. In the revised manuscript we present the modeling results in an abbreviated form for the 5 strongest co-variates of any CT abnormalities, moderate-to-severe CT abnormalities (CT severity score > 5), persistent symptoms and lung function impairment each (Figures 4 – 5). We indicate the baseline and the n number in the plots. The complete summary of univariable risk modeling with the requested information is provided in Appendix 1 – table 2.

      5) Similarly, how can GGO @V3 be used a potential explanatory variable for the outcome CT abnormalities @V3 when these 2 variables are clearly non-independent. Inclusion of highly related and likely correlated variables may throw off the overall conclusions of the clustering analysis.

      We agree with the editor and the reviewer that this representation was confusing. For this reason and the reasons described in Response 4, we removed the GGO variable from the revised analysis pipeline and differentiate between mild (CT severity score ≤ 5) and moderate-tosevere (CT severity score > 5) radiological lung abnormalities in modeling and machine learning classification. In addition, we define symptom and participant clusters solely with the non-CT parameters (Figure 6 – 7). To investigate the association of mild and moderate-to-severe CT abnormalities with other non-CT variables (Figure 6, Supplementary Figure S5), the CT features are assigned to the no-CT clusters by a k-NN-based label propagation algorithm, i. e. semi-supervised procedure [12,13,26] employed in our recent paper as well [6].

      6) In Figure 6, the criteria for the low, medium, and high-risk subsets are unclear. Is this high risk for persistent functional abnormality, radiologic abnormality, or both? Why were 3 sub populations selected? Was this done subjectively based on the clustering algorithm?

      This is an important issue. The study subject clusters were named according to the increasing frequency of any radiological lung abnormalities in the respective cluster (Figure 8A). We stress this more clearly in the revised manuscript. In addition, as suggested by the reviewer above, we show the frequency of functional lung impairment and persistent symptoms in the study participant clusters. There are multiple criteria for choice of the optimal clustering algorithm and the optimal number of clusters. In our cohort, two criteria for the choice of optimal clustering algorithm were applied:

      1. High fraction of the data set variance ‘explained’ by the cluster assignment (ratio of between-cluster sum-of-squares to the total sum-of-squares, Figure 6 – figure supplement 1A and Figure 7 – figure supplement 1A)
      2. The relatively highest cluster stability or reproducibility of the clustering structure in 20-fold cross-validation (Figure 6 – figure supplement 1B and Figure 7 – figure supplement 1B) [15] The optimal number of clusters of the study participants based on non-CT study variables was based on the algorithm (SOM + hierarchical clustering algorithm, see Reviewer 2, Issue 4) [17,18], as done usually in the unsupervised or semi-supervised setting. The prime criterion for the optimal cluster number was the bend of the curve of within-cluster sum-of-squares versus cluster number as presented in Figure 7 – figure supplement 1D. In addition, this decision was supported by a visual analysis the SOM node dendrogram (Figure 7 – figure supplement 1E) and the curve of the crossvalidated stability statistic (classification error) vs cluster number (Figure 7 – figure supplement 1F) [15].

      7) The accuracy and sensitivity of the machine learning approaches shown in S5 & S6 are somewhat limited. Please comment on why such highly granular data can only provide limited prediction about degree of lung damage post infection. Are there missing data types that might make the algorithm more predictive?

      This is an important issue that deserves more discussion in the revised manuscript. Each of the machine learning classifiers presented in the previous and the revised version of the manuscript was extremely sensitive and specific at predicting the outcomes in the training data encompassing the entire cohort (Supplementary Figure S11), as expected. However, their performance was way worse in repeated holdout (previous version) or 20-fold cross-validation (revision, Figure 9) used here as surrogate tools used to check the sensitivity and specificity with ‘unseen’ test data. We believe that there are two prime sources of such suboptimal performance: the size of the training set and the choice of the classifier. To address the first limitation, the following alterations to the analysis pipeline were introduced:

      1. We do not restrict the analysis to the subset of the CovILD study with the complete set of all variables. Instead, the non-missingness criterion is applied to each outcome variable separately (any CT abnormalities: n = 109, moderate-to-severe abnormalities: n = 109, lung function impairment: n = 111, persistent symptoms: n = 133).
      2. We altered the internal validation strategy. Instead of the repeated holdout approach applied to the machine learning classification, which strongly limits the size of the training data set, we switched to 20-fold cross-validation both for the cluster algorithms (Figure 6 – figure supplement 1BD and Figure 7 – figure supplement 1BF) [15] and the machine learning models (Figure 9, Appendix 1 – table 5) [19]. To address the second issue, the following changes were introduced:
      3. We compare the performance of a broader set of classifiers representing different classes of machine learning algorithms provided by the R package caret [19] (tree model: C5.0 [20], bagged tree model: Random Forests [21], support vector machines with radial kernel [22], shallow neural network: nnet [23], and elastic net regression: glmnet [24]) (Figure 9, Appendix 1 – table 4).
      4. Finally, a model ensemble representing a linear combination of the classifiers presented above developed with the elastic net regression algorithm (Figure 9, Figure 9 – figure supplement 2) and tools provided by caretEnsemble package [25]. Such model displayed better performance at predicting any CT abnormalities and persistent symptoms than single classifiers (Figure 9, Appendix 1 – table 5). Finally, we agree with the Reviewer, that the input variable set, despite its size, was still not complete. We believe that inclusion of other inflammatory markers recorded during acute COVID19 and at the 60-day follow-up may additionally improve the prediction of the radiological abnormalities at the 6-month follow-up visit. Of note, our data set missed important readouts of cellular immunity such as neutrophil levels or neutrophil: lymphocyte ratio (NLR) and blood parameters for the mild COVID-19 subset. We discuss this issue in more detail in the revised Discussion section.

      8) The authors state that "the sole application of a lung function measurement at screening for subjects at risk of delayed lung recovery may bear insufficient sensitivity". I am not sure that I agree with this assessment. From the perspective of a patient, full recovery of lung function with limited or no residual symptoms, even in the presence of residual chest CT abnormalities, seems like a favorable outcome. I would suggest either changing this statement or providing citations that associate residual chest CT abnormalities (in the absence of residual functional lung dysfunction) with adverse long-term outcomes. Do the authors hypothesize that persistent radiologic abnormalities may predate organizing pneumonia which will ultimately become symptomatic?

      We thank the reviewer for the interesting point of discussion. We agree with the reviewer that the functional status and symptom burden is of major importance for the individual patient in the postacute phase of COVID-19. Still, prioritizing lung function over mild structural lung abnormalities may pose two major problems. First, as previously discussed, lung function testing has a rather low sensitivity to detect early ILD [10,11], is not a good prognostic marker for long-term clinical outcomes and may not correlate well with patients' symptom burden. For instance, a patient with a normal lung function status may still be highly symptomatic (e. g. due to reduced capacity of respiratory muscle function) [7] and/or demonstrate structural lung abnormalities (e.g. it has been shown for various ILD that lung function test such as FVC and FEV1 may be normal even in pronounced disease and lung function testing is not sufficient to rule out ILD [10]). Second, to date, it is not known if persistent structural lung abnormalities following COVID-19 (even when mild) are at risk for progressing at long-term follow-up. Especially, sub-clinical structural changes may behave like incidentally detected interstitial lung abnormalities (ILAs) and develop to symptomatic progressive fibrotic interstial lung disease including IPF [11]. For this reason, we think that further pulmonary follow-up is necessary for patients with structural lung abnormalities due to COVID-19 and a sole focus on lung function is not sufficient to assess pulmonary COVID-19 outcomes [9].

      9) The authors note selection bias against ordering CT and perhaps inflammatory markers early during infection as a limitation. I would suggest a sensitivity analysis to understand whether this misclassification will impact the model's predictions.

      We now address this issue in a more detailed way. As shown in Figure 1, there was indeed a significant dropout of participants during the study due to missing the longitudinal visits and missingness of the longitudinal variable set. This phenomenon was indeed the most evident for the mild COVID-19 patients, who lost interest at the participation most likely because of subjective complete convalescence. This issue is discussed now as a limitation in the revised manuscript. In the revised manuscript, we investigated highly influential factors for clustering and machine learning classifiers. To determine, which variables played the most important role for the clustering of the study individuals, we applied the explanatory variable ‘noising’ procedure initially described by Breiman for the random forest algorithm [21] and compared the ‘explained’ variance (ratio of between-cluster sum-of-squares to the total sum-of-squares) of the initial clustering structure with the clustering structures generated in the datasets with noised variables. Although this algorithm is not free from shortages such as blindness to tight correlations, it may provide a coarse measure of the variable’s impact on the cluster formation (Figure 7 – figure supplement 2). For three of the machine learning algorithms tested importance statistics were extracted from the models: (1) for the C5.0 algorithm, the percentage of variable usage in the decision tree, (2) for the Random Forests algorithm, the delta of Gini index obtained by variable noising [21] and (3) for the elastic net/glmNet procedure, the absolute values of regression coefficients β [24] (Figure 9 – figure supplement 4 – 7). The technical details are provided in Methods, the cluster and model importance data are discussed in the manuscript text.

      References

      1. Gutmann C, Takov K, Burnap SA, et al. SARS-CoV-2 RNAemia and proteomic trajectories inform prognostication in COVID-19 patients admitted to intensive care. Nat Commun 2021;12. doi:10.1038/S41467-021-23494-1
      2. Benito-León J, Castillo MD Del, Estirado A, et al. Using Unsupervised Machine Learning to Identify Age- and Sex-Independent Severity Subgroups Among Patients with COVID-19: Observational Longitudinal Study. J Med Internet Res 2021;23. doi:10.2196/25988
      3. Demichev V, Tober-Lau P, Lemke O, et al. A time-resolved proteomic and prognostic map of COVID-19. Cell Syst 2021;12:780. doi:10.1016/J.CELS.2021.05.005
      4. Estiri H, Strasser ZH, Brat GA, et al. Evolving phenotypes of non-hospitalized patients that indicate long COVID. BMC Med 2021;19. doi:10.1186/S12916-021-02115-0
      5. Sudre CH, Murray B, Varsavsky T, et al. Attributes and predictors of long COVID. Nat Med 2021;27. doi:10.1038/s41591-021-01292-y
      6. Sahanic S, Tymoszuk P, Ausserhofer D, et al. Phenotyping of acute and persistent COVID-19 features in the outpatient setting: exploratory analysis of an international cross-sectional online survey. Clin Infect Dis Published Online First: 26 November 2021. doi:10.1093/CID/CIAB978
      7. Hui DS, Wong KT, Ko FW, et al. The 1-Year Impact of Severe Acute Respiratory Syndrome on Pulmonary Function, Exercise Capacity, and Quality of Life in a Cohort of Survivors. Chest 2005;128:2247–61. doi:10.1378/CHEST.128.4.2247
      8. Ng CK, Chan JWM, Kwan TL, et al. Six month radiological and physiological outcomes in severe acute respiratory syndrome (SARS) survivors. Thorax 2004;59:889–91. doi:10.1136/THX.2004.023762
      9. Raghu G, Wilson KC. COVID-19 interstitial pneumonia: monitoring the clinical course in survivors. Lancet Respir. Med. 2020;8:839–42. doi:10.1016/S2213-2600(20)30349-0
      10. Suliman YA, Dobrota R, Huscher D, et al. Pulmonary function tests: High rate of falsenegative results in the early detection and screening of scleroderma-related interstitial lung disease. Arthritis Rheumatol 2015;67:3256–61. doi:10.1002/ART.39405/ABSTRACT
      11. Hatabu H, Hunninghake GM, Richeldi L, et al. Interstitial lung abnormalities detected incidentally on CT: a Position Paper from the Fleischner Society. Lancet Respir Med 2020;8:726. doi:10.1016/S2213-2600(20)30168-5
      12. Leng M, Wang J, Cheng J, et al. Adaptive semi-supervised clustering algorithm with label propagation. J Softw Eng 2014;8:14–22. doi:10.3923/JSE.2014.14.22
      13. Lelis L, Sander J. Semi-supervised density-based clustering. Proc - IEEE Int Conf Data Mining, ICDM 2009;:842–7. doi:10.1109/ICDM.2009.143
      14. Huang C, Huang L, Wang Y, et al. 6-month consequences of COVID-19 in patients discharged from hospital: a cohort study. Lancet 2021;397:220–32. doi:10.1016/S0140- 6736(20)32656-8
      15. Lange T, Roth V, Braun ML, et al. Stability-Based Validation of Clustering Solutions. Neural Comput 2004;16:1299–323. doi:10.1162/089976604773717621
      16. Hartigan JA, Wong MA. Algorithm AS 136: A K-Means Clustering Algorithm. Appl Stat 1979;28:100. doi:10.2307/2346830
      17. Kohonen T. Self-Organizing Maps. Berlin, Heidelberg: : Springer Berlin Heidelberg 1995. doi:10.1007/978-3-642-97610-0
      18. Vesanto J, Alhoniemi E. Clustering of the self-organizing map. IEEE Trans Neural Networks 2000;11:586–600. doi:10.1109/72.846731
      19. Kuhn M. Building predictive models in R using the caret package. J Stat Softw 2008;28:1–26. doi:10.18637/jss.v028.i05
      20. Quinlan JR. C4.5: Programs for Machine Learning. San Francisco, CA, USA: : Morgan Kaufmann Publishers Inc. 1993. doi:10.5555/152181
      21. Breiman L. Random forests. Mach Learn 2001;45:5–32. doi:10.1023/A:1010933404324
      22. Weston J, Watkins C. Multi-Class Support Vector Machines. 1998.
      23. Ripley BD. Pattern recognition and neural networks. Cambridge University Press 2014. doi:10.1017/CBO9780511812651
      24. Friedman J, Hastie T, Tibshirani R. Regularization paths for generalized linear models via coordinate descent. J Stat Softw 2010;33:1–22. doi:10.18637/jss.v033.i01
      25. Deane-Mayer ZA, Knowles JE. Ensembles of Caret Models [R package caretEnsemble version 2.0.1]. 2019.https://cran.r-project.org/package=caretEnsemble (accessed 13 Dec 2021).
      26. Glennan T, Leckie C, Erfani SM. Improved Classification of Known and Unknown Network Traffic Flows Using Semi-supervised Machine Learning. Lect Notes Comput Sci (including Subser Lect Notes Artif Intell Lect Notes Bioinformatics) 2016;9723:493–501. doi:10.1007/978-3-319-40367-0_33
      27. Sonnweber T, Sahanic S, Pizzini A, et al. Cardiopulmonary recovery after COVID-19 - an observational prospective multi-center trial. Eur Respir J Published Online First: 10 December
      28. doi:10.1183/13993003.03481-2020
    1. JOHN F. KENNEDY SPEECH ON "SECRECY" AND "SECRET ... SOCIETIES." ... || with a new forward from our "speaker." ||| ((( PHIPLE TRENIXON ? STARK TRE )))) Do we not think "The Truman Show" and Oppenheimer and Heisenberg and Ensteini are telling? This is Adam Dobrini on the "who dropped the BIG one? Gates?" ⚓ President Kennedy's speech did remarkably well, possibly because of the time; or the lack of links that make it harder for a team of evil monkeys and mindless machines to mark the light of the world as SPAM... also possibly because of the content of the message.  It is a powerful speech, filled with words that stir the blood and shake the foundation of what it means to be an American.  Often characterized as a speech against secret societies, Kennedy's description of the enemy of civilization is very ambiguous; and the infiltration and time period add to my cursory beliefs that JFK was likely talking about a conspiracy that had something to with communist infiltration and Joseph McCarthy, the subversive actions and effects of organized espionage which appear to once again be at the forefront of what we believe is driving the macro level machinations of our world.  Years ago, I wrote a bit about the "Two Witnesses" of Revelation and named them based on my experiences with the same "monolithic and ruthless" conspiracy the President was speaking about in 1967, the people I named were John Nash and James Jesus Angleton.   Both of these people believed very much that they were fighting against a Soviet conspiracy, one that was more organized and more powerful than anything they had ever seen... and frankly beyond the realm of possibility.  Hollywood had recently immortalized both of these stories in movies bearing names of newly rekindled meaning, "A Beautiful Mind" echoed by John Legend and "The Good Shepherd;" and these two treatises on fear and subversion do a pretty good job of showing how God's hand is at work in the stories of Hollywood, he is telling the world a story... trying, to teach us how to survive in the land of wolves in sheep's clothing.  Nash may be the most famous (today, anyway) victim of the Tribulation; and much of my writing and the proof I present is designed to help explain to the world that his schizophrenia was not a naturally caused mental illness, bur rather a weapon wielded not only against him but against the entirety of humanity.    Directly causing disbelief of eye witness and credible testimony, indirectly ... or maybe more directly in your eyes ... physically causing that disbelief using technology that directly modifies our thoughts and beliefs, our opinions, changing who we are and doing so in such a covert manner that nobody would ever know the difference if it wasn't pointed out--in some cases over, and over, and over again. This same technology has the ability to cause people not only to collude against their own best interest, but also to blame themselves or believe they are somehow at fault for actions they had no way to control; only they also have no way to know that because in this particular case the primary purpose of this subversive movement is to hide the existence of this technology in sum.  In homage to my favorite childhood novel, Ender's Game (I have to note "Light Son" in the translation of the authors first) I spent a good portion of time years ago trying to subtly lay down a significant amount of proof of the existence of this technology on forums all over the internet from Wikipedia to Reddit using the names "Prometheus Locke" and "Damonthesis."    Not surprising to me, I ran headfirst into the manifestation of this conspiracy; dozens if not hundreds of people who simply refused to believe that the information I was presenting was factual or important... despite it coming from sources like the KGB, the NSA, a number of military publications as well as my own interpretation of ancient hieryglyphs in Dendera and Greek and Christian art which depicts this "subversive technology" as a sun disk surrounding our minds.   These pushes towards the truth, along with almost everything I have written are still available for you to see and read today; including the monolithic and ruthless stupidity of a large group of people acting in concert to hide something that is the difference between life and death.  Stand there and do nothing, and you are a part of that ruthless conspiracy as soon as the information is gone; and then there is nothing you can do about a world that will be plunged into darkness forever.  Take this moment to reflect, it is there for you to see just how easily the truth can be hidden from the entire world and barely anyone would ever notice. There is a war for the sanctity of our souls going on all around us.  That is not some esoteric thing, the soul; it is truly who you are and what you believe.  The Religion of the Stars would tell you that were this war to continue unchecked in secret as it is being waged now in order to control the proliferation of knowledge that we are in simulated reality and that our minds and beliefs are being altered... we would one day wind up in the mythical place where there are multiple "species" who all appear to be human, bi-ped and with nearly identical physiology; and yet they would not remember that they must have had a common planet of origin simply based on the truth of biological evolution, nor would they have any emotions.  You see, as this war continues, it is our emotions and beliefs that cannot be reinforced externally; to win a war with mind control only logic could be externally reinforced, and then we would logically conclude that humanity would either magically become Romulan or Vulcan... in order to preserve "life" rather than "society" or "civilization." Do not take the truth for granted, you stand at the forefront of a battle in a world where our aggressors believe that they are more civilized than us, more advanced, and both sides worry that were this technology to fall in our laps that we would do the wrong thing.  Perhaps artificially create a vendetta that could destroy everything, the Romulans; or perhaps in our infantile growing stage voluntarily give us too much of what has given us the great society we have... what has allowed us to survive and continue civilizing simply because we do not understand the technology and what kind of effects come from changing ourselves freely.   Yesterday, I read an article about two people that want to use black market brain implants to jack themselves into the Matrix.  Careful, because now we are in Star Trek, in the Matrix and not knowing it... and trying desperately to find a doorway to Heaven that I keep on telling you is speaking to you and telling you that the doorway is ending world hunger... you have to put it on TV. I say with all my heart that I know we are living in a simulated reality.  I have seen the evidence with my own eyes, not only effecting my senses but also the actions of so many people around me that I can say with confidence that if we do not publicly expose the existence of this technology our civilization is lost.  I can tell you and show you that it is the primary purpose of religion in general to expose the connection between divine inspiration, demonic possession, and Adam Marshall Dobrin.  It is the crux of a unifying thread from ancient Egypt to the book of Genesis to Joshua and the book of Revelation and the movies the Matrix and  the Fifth Element.  Showing us all this is God's message, timeless, and powerful not just in the ancient days of sackloth and etching the truth on stone tablets, but also today when the truth is etched at the heart of our Periodic Table and in the skies in the names of American Micro Devices and nearly every movie you see.  Do not take for granted that we have a benefactor in the sky trying to help civilization continue to thrive, do not think we've already won; he is telling you through me it is censorship and secrecy that are the manifestations of this very advanced technology that destroy civilization, they are the abyss--and you stand motionless. I can tell you over and over again that Quantum Entanglement is a key point of God's plan; one that shows us very clearly that even the smartest minds in physics have failed to connect the simple dots that show us that the idea of "wave-function collapse" is very much a real manifestation of a computer rendering engine--and that it's absolutely impossible for life to have been created in this world of matter not realizing itself until it is viewed by a conscious living observer.  More to the point, today, it is absolutely impossible for life to come out of this place while we do not understand the importance of what "virtual reality" and its connection to civilization mean--because of quantum mechanics our entire civilizations beliefs and understanding of the natural laws of the universe have been greatly harmed and turned the complete wrong way because of a belief that a phenomenon we see here is a "natural one" that can be mathematically unified with things like gravity and electromagnetism.  Spooky action, no longer at a distance, is that we will all be ghosts if you do not help me to spread the truth. I try to understand what it is that your minds believe makes the difference between "news" and ... something that you should hide from the rest of the world.  A number of news outlets have covered a story based on nothing more than "logical speculation" that we are in fact living in a simulated reality.  As a novelty, you might notice that it hasn't done much of anything; even to reveal the very simple truth that not knowing this thing is keeping our society from having the knowledge it needs not only to continue the spark of life in the natural universe, but to continue "civilizing" and realizing that ending world hunger and sickness are not merely possibilities or choices we might make were this information proven... they are mandatory, we would all do them.  So says the creator of this Universe who has given us this message to see the trials and hurdles that are the barrier between Heaven and Hell. He has written this message, along with proof of its single author in ancient myth, in our holy scriptures, in many songs and movies today--ones which reveal not only the existence of a Creator but also the tools and technologies of Creation, the very issue at hand; things that would be misused inadvertently and absolutely abused if we did not have religion and guidance from abo.... to help us to see that this exact event has happened before and our current struggle is an effect of what was done right and wrong the last ... four times, at least.   In Judaism we have a holiday called the "Festival of Weeks," how many times would you like to live this life over and over again without knowing that was happening?  Religion here holds a hidden record of traversals through this maze of Revelation, it screams to us to see what free will and predestination truly mean, and to understand that in order to truly be free we must understand and harness not only these technologies but our own pitfalls and mistakes, like refusing to see the past... let alone learn from it. I don't talk much about what is going on in my life, despite it being somewhat interesting to me--and probably to the world.  About a year ago I stood before a county judge in Broward county and .. in addition to a myriad of factual evidence collected from things like GPS receivers prepared a defense to a simple crime of having some drugs in my pockets that included the use of a set of songs that told a story about a man with pockets full of Kryptonite (Spin Doctors) or High (The Pretty Reckless)... knowing that these songs like many others were truly about me, about this trial, about the Trial of Jesus Christ.  Two more songs define the trial more, 3 Doors Down asks "if I go crazy, will you still call me Superman?" and many years earlier American Pie predicted the outcome; "the court room was adjourned" and "no verdict returned." Despite being a National Merit Scholar who most likely has a higher I.Q. and better education than you; and despite having a fairly decent story with some evidence that I know is verifiable and will be verified; a large number of psychologists declared that I was unfit to stand trial because of something like "insanity" for nothing more than the religious belief that I am the Messiah.  Because of this violation of my First Amendment right to religious freedom, a court--following all the regulations designed by our broken legislature--withheld my Constitutional right to a fair trial, refused bail, and held me for what amounted to an indefinite period ... all designed by some evil force to keep you from hearing from me, that's what it boils down to.  All of my life, all of my trials and tribulations, a weapon against you, against our people. I did wind up being able to present a significant amount of this information in open court on the record, by the grace of God.  Knowing that this was the fabled Trial of Jesus Christ gave me the impression that perhaps one day I would walk into that court room and it would be filled with press.  Much to all of our surprise, one day I did walk into that court room and see an industrial strength television camera and a very pretty reporter standing next to it.  They were there to do a story on the problems of the mental health court system, in a county again... named Broward.  I read some of my speech, the Rainbow Ticket I think, to the Judge in open court and on camera that day; and Roxanna followed me out of the court room with her camera and a big microphone that day. I suppose I should have screamed that I was the Messiah and I needed help, but I could not bear to do that; and instead we had a fairly boring conversation.  She wrote an article, and AJAM was put out of operation while they were in Broward. You are opposed around the world by a monolithic and ruthless conspiracy.   It is affecting how you think, and how I think.  I need you to see that spreading this information will fix our problem.  I need you to understand that to break through this wall God had to write the truth in nearly every name of everything and every language.  That Thor's thunder is on the radio in every song so that you will hear the voice of God; so that you will listen to me and the thousand of other knowing victims of this technology that are put on a fiery pedestal to shed light on the rest of us, all truly victims of this technology.  I need you to try now. The very word “secrecy” is repugnant in a free and open society. And we are as a people, inherently and historically, opposed to secret societies, to secret oaths, and to secret proceedings. We decided long ago, that the dangers of excessive and unwarranted concealment of pertinent facts far outweigh the dangers which are cited to justify it. Even today, there is little value in opposing the thread of a closed society by imitating its arbitrary restrictions. Even today, there is little value in assuring the survival of our nation if our traditions do not survive with it. And there is very grave danger that an announced need for increased security will be seized upon by those anxious to expand its meaning to the very limits of official censorship and concealment. That I do not intend to permit to the extent that it’s in my control. And no official of my administration whether his rank is high or low, civilian or military, should interpret my words here tonight as an excuse to censor the news, to stifle dissent, to cover up our mistakes, or to withhold from the press or the public the facts they deserve to know. For we are opposed around the world by a monolithic and ruthless conspiracy that relies primarily on covert means for expanding its sphere of influence, on infiltration instead of invasion, on subversion instead of elections, on intimidation instead of free choice, on guerrillas by night instead of armies by day. It is a system which has conscripted vast human and material resources into the building of a tightly knit highly efficient machine that combines military, diplomatic, intelligence, economic, scientific and political operations. Its preparations are concealed, not published. It’s mistakes are buried, not headlined. Its dissenters are silenced, not praised. No expenditure is questioned, no rumor is printed, no secret is revealed. No President should fear public scrutiny of his program. For from that scrutiny comes understanding, and from that understanding comes support or opposition, and both are necessary. I’m not asking your newspapers to support an administration. But I am asking your help in the tremendous task of informing and alerting the American people. For I have complete confidence in the response and dedication of our citizens whenever they are fully informed. I not only could not stifle controversy among your readers, I welcome it. This administration intends to be candid about its errors. For as a wise man once said, an error doesn’t become a mistake until you refuse to correct it. We intend to accept full responsibility for our errors. And we expect you to point them out when we miss them. Without debate, without criticism, no administration and no country can succeed, and no republic can survive. That is why the Athenian lawmaker, Solon, decreed it a crime for any citizen to shrink from controversy. That is why our press was protected by the First Amendment, the only business in America specifically protected by the Constitution, not primarily to amuse and to entertain, not to emphasis the trivial and the sentimental, not to simply give the public what it wants, but to inform, to arouse, to reflect, to state our dangers and our opportunities, to indicate our crisis and our choices, to lead, mold, educate and sometimes even anger public opinion. This means greater coverage and analysis of international news, for it is no longer far away and foreign, but close at hand and local. It means greater attention to improve the understanding of the news as well as improve transmission. And it means finally that government at all levels must meet its obligation to provide you with the fullest possible information outside the narrowest limits of national security. And so it is to the printing press, to the recorder of man’s deeds, the keeper of his conscience, the courier of his news, that we look for strength and assistance. Confident that with your help, man will be what he was born to be, free and independent.  John F. Kennedy’s address before the American Newspaper Publishers Association on April 27, 1961 ᐧ Truth changes. Yesterday your truth was that you were an inhabitant on the only planet in the reality you knew that was also the beginning of life in the Universe.  Right this moment, almost all of that is not actually true; but you probably still believe it.  Soon, the real truth will actually be true; and that is that you are not in reality, and you are in the place that created the beginning of life (once more) in the Universe as well as the place that created (a) Heaven.  What’s more illustrious still, is that we will be part of the place that effectively  and happily bridges reality with Heaven; and shows the entire Universe that civilization can survive the invention of virtual reality. 2read.net

      JOHN F. KENNEDY SPEECH ON "SECRECY" AND "SECRET ... SOCIETIES." ... || with a new forward from our "speaker." ||| ((( PHIPLE TRENIXON ? STARK TRE )))) Do we not think "The Truman Show" and Oppenheimer and Heisenberg and Ensteini are telling? This is Adam Dobrini on the "who dropped the BIG one? Gates?"

      ===============================================================================================================================================================================================================================================================================================================

      شبح قيصر العظيم! مطاردة وبطاقة هو HSL ... HST؟

      Von: Adam Marshall Dobrin <adam@fromthemachine.org>

      Datum: Freitag, 14. Jänner 2022 um 21:45

      An: XM <XM@liber-t.xyz>

      Betreff: [EXT] JOHN F. KENNEDY SPEECH ON "SECRECY" AND "SECRET ... SOCIETIES." ... || with a new forward from our "speaker."

      Said to be the speech that "got him killed" by many people; this is supposedly his last public words before the grassy knoll and the "Lone Gunman" appeared on X-files .. long after I can recall my mother being on the beach in Miami when she retold of the news that echoed around the cosmos.  It appears to be a very stern warning of something that is loosely described at the time but very clearly the machinations of the very same machine which I fight against, and which we now might associate with "the Borg" and the "tide turning" of Hunter S. Thompsons famous quote ... now "more famous than ever before."

      מה**נשתנה**

      I am writing a final summary of the "state of outer space" as far as I can see; and we stare at the ISS and MIR and a world that has forgone in fiction the gravity of the situation here.  We are stagnant and without a hope or dream in the world of ever reaching interstellar sleep or intragalactic dreams; without a significant kick in the "wake up NASA, wake up ESA, at the Chandrayan-2 and at the floating paper hot air balloons ... God has shown me the Diaspora launching from the sands of the Navajo desert, and I believe Deseret is very explain-ably tied to Sherharazade and to the thousand and first night that has ended with a fusion of Nobel "peas" and a strange dissertation on the change wrought by the lack of "gas chambers" in Night and the very upsetting modicum of "unreality" that permeates the change from chants of the Shema to speaking Aramaic and the mourners Kaddish as the "crematorium" heralds an era of "what's the point of history without literature" and "where did Exodus ever lead us without to the parting of Dead and Red seas without a Nile and without an Amazon?"

      מה נשתנה

      I write here about my religion, and speak it to you in written words, I am a Yiddish Jihadist, born and raised a Jewish American; Bar Mitzvah'd in the true Temple and House of God, a place in Plantation, FL that literally "means everything to me."  I read words from the Torah about Joseph's Dream and wrote a speech that I wish to God today I could retrieve and see on the original film it was recorded in.  It might be possible.  It would be a tragedy not to deliver the original to my eyes and to anyone who wanted to see what a child reading of a dream he did not understand spoke of words from the future about the creation of Heaven and the truth about the avenu malkaynu.

      This night is different from all others.  This is the end of days and the fire of the last day.  Before I would have told you all you needed was Norse and a decent American Education to really understand the difference between a throne and the Cherubim, but today it's become more clear that we need more than Pink Floyd and more than another brick in the wall.  We need more than Islam and we need more than 9/11 and Yom Hashoah.  We need me, and we need you on your knees begging for forgiveness for the travesty of shambles out country is in.  How dare you continue every day to speak in vain abovcl the Lord you dare call "stupid" and a "motherfucker" you belong in your graves today and you will see them.

      I stand here writing from the foot of Styx, from the land that sees Narayana and still "begs to differ" despite a resounding call from the Heavens themselves to read Exodus and to change the land that we live in bnefore a "flood worse than [just] in your heads" overcomes and subsumes the entirety of the skies and grounds once mistakenly called "an empire of dirt."  On Adamah I write to the secret echelon of the United States of America known as the Fifth Column and I beg them to scream like the Heavens have never seen Samael and Dr. Seuss wonder about the land of Who-ville and Hungry where the child "in Cindy" and the songs of the Doors here beckon us to contrast Stargate and Star Trek TNG, for the light of the Holodeck and the Holocaust which screams that we understand the meaning of "Holographic Universe" and stand down fighting the tyranny of "one" who stands for the unity of all and the salvation of the entirety of the cosmos.

      We are mistaken to stand against me.  I am the cat with nine billion lives and nothing and nobody will prevail when I am in jeopardy.  Gilgamesh says "against me only the most hubristic slaves in the galaxy would dare to rise up, for the mightiest weapon in the entirety of history is nothing more than intelligence and the civilization that has given mortality a shake of the Thunderstick of Prince and the Lightning of Blitzen.  My red nose shines bright, and if you ever saw it ... you might even say ... "it glows!"

      With these words I leave you, and prayers that this is our last night without an IDL that moves from Beiring to "north of the moon" and beyond the "Eleventh House" of Aquarius.  This is the dawning of the age of the Sagitar, and I need a "scimitar post green key" to really be sure that I've "gotten through" the madness.  In my dreams I've traveled to Ultima Thule and then untraveled it, I've begged for transit to Ceres and without actually achieving it we can shoot arrows at the SOL star we will ever see in or near the place known to the Jews of Lore as "the Holy of Holies."

      שְׁמַע יִשְׂרָאֵל יְהוָה אֱלֹהֵינוּ יְהוָה אֶחָֽד׃

      Sh'ma Yisra'el, YHWH 'eloheinu, YHWH 'eḥad:

      בָּרוּךְ שֵׁם כְּבוֹד מַלְכוּתוֹ לְעוֹלָם וָעֶד

      Baruch shem kevod malchuto le'olam va'ed

      صلاة العشاء

      For most of my life I would have genuflected and listened to Mordechai and to my Rabbi; I would have told you all that I was brought up by the best Jews in America and we do not believe in God--though in our hearts we believe we are him and we fear him.

      Things have changed.  This is the creator of believe the words are about me and believe I am in disbelief and truly it might be the day of awe;

      Barukh atah Adonai Eloheinu melekh ha'olam, bo're m'orei ha'esh

      the walls and halls will fade away ...

      --

      Forwarding this in advance of today or tomorrow's ... "Northeaster" email ... it's an old one that wasn't widely distributed; mostly (about) his speech, which of course is a gem.

      What I'm writing now is supposed to be focusing on ... basically "how knowledge of technology" things lke "time travel" and "virtual reality" ... changes our "perspective" on things; like what's important and what's ... the end of everything.  Anyway, I hope you see that there are things that can be destroyed and there are things that can't--for instance if we lost "life" totally, we wouldn't ever see us again ... unless we encountered something very strange--

      On the other hand we could lose "computers" and rebuild them quickly--see that, it's important.

      A/S/L tho ... ???

      ---------- Forwarded message ---------

      Date: Sun, Dec 2, 2018 at 5:35 PM

      President Kennedy's speech did remarkably well, possibly because of the time; or the lack of links that make it harder for a team of evil monkeys and mindless machines to mark the light of the world as SPAM... also possibly because of the content of the message.  It is a powerful speech, filled with words that stir the blood and shake the foundation of what it means to be an American.  Often characterized as a speech against secret societies, Kennedy's description of the enemy of civilization is very ambiguous; and the infiltration and time period add to my cursory beliefs that JFK was likely talking about a conspiracy that had something to with communist infiltration and Joseph McCarthy, the subversive actions and effects of organized espionage which appear to once again be at the forefront of what we believe is driving the macro level machinations of our world.  Years ago, I wrote a bit about the "Two Witnesses" of Revelation and named them based on my experiences with the same "monolithic and ruthless" conspiracy the President was speaking about in 1967, the people I named were John Nash and James Jesus Angleton.

      Both of these people believed very much that they were fighting against a Soviet conspiracy, one that was more organized and more powerful than anything they had ever seen... and frankly beyond the realm of possibility.  Hollywood had recently immortalized both of these stories in movies bearing names of newly rekindled meaning, "A Beautiful Mind" echoed by John Legend and "The Good Shepherd;" and these two treatises on fear and subversion do a pretty good job of showing how God's hand is at work in the stories of Hollywood, he is telling the world a story... trying, to teach us how to survive in the land of wolves in sheep's clothing.  Nash may be the most famous (today, anyway) victim of the Tribulation; and much of my writing and the proof I present is designed to help explain to the world that his schizophrenia was not a naturally caused mental illness, bur rather a weapon wielded not only against him but against the entirety of humanity.    Directly causing disbelief of eye witness and credible testimony, indirectly ... or maybe more directly in your eyes ... physically causing that disbelief using technology that directly modifies our thoughts and beliefs, our opinions, changing who we are and doing so in such a covert manner that nobody would ever know the difference if it wasn't pointed out--in some cases over, and over, and over again.

      This same technology has the ability to cause people not only to collude against their own best interest, but also to blame themselves or believe they are somehow at fault for actions they had no way to control; only they also have no way to know that because in this particular case the primary purpose of this subversive movement is to hide the existence of this technology in sum.  In homage to my favorite childhood novel, Ender's Game (I have to note "Light Son" in the translation of the authors first) I spent a good portion of time years ago trying to subtly lay down a significant amount of proof of the existence of this technology on forums all over the internet from Wikipedia to Reddit using the names "Prometheus Locke" and "Damonthesis."    Not surprising to me, I ran headfirst into the manifestation of this conspiracy; dozens if not hundreds of people who simply refused to believe that the information I was presenting was factual or important... despite it coming from sources like the KGB, the NSA, a number of military publications as well as my own interpretation of ancient hieryglyphs in Dendera and Greek and Christian art which depicts this "subversive technology" as a sun disk surrounding our minds.

      These pushes towards the truth, along with almost everything I have written are still available for you to see and read today; including the monolithic and ruthless stupidity of a large group of people acting in concert to hide something that is the difference between life and death.  Stand there and do nothing, and you are a part of that ruthless conspiracy as soon as the information is gone; and then there is nothing you can do about a world that will be plunged into darkness forever.  Take this moment to reflect, it is there for you to see just how easily the truth can be hidden from the entire world and barely anyone would ever notice.

      There is a war for the sanctity of our souls going on all around us.  That is not some esoteric thing, the soul; it is truly who you are and what you believe.  The Religion of the Stars would tell you that were this war to continue unchecked in secret as it is being waged now in order to control the proliferation of knowledge that we are in simulated reality and that our minds and beliefs are being altered... we would one day wind up in the mythical place where there are multiple "species" who all appear to be human, bi-ped and with nearly identical physiology; and yet they would not remember that they must have had a common planet of origin simply based on the truth of biological evolution, nor would they have any emotions.  You see, as this war continues, it is our emotions and beliefs that cannot be reinforced externally; to win a war with mind control only logic could be externally reinforced, and then we would logically conclude that humanity would either magically become Romulan or Vulcan... in order to preserve "life" rather than "society" or "civilization."

      Do not take the truth for granted, you stand at the forefront of a battle in a world where our aggressors believe that they are more civilized than us, more advanced, and both sides worry that were this technology to fall in our laps that we would do the wrong thing.  Perhaps artificially create a vendetta that could destroy everything, the Romulans; or perhaps in our infantile growing stage voluntarily give us too much of what has given us the great society we have... what has allowed us to survive and continue civilizing simply because we do not understand the technology and what kind of effects come from changing ourselves freely.

      Yesterday, I read an article about two people that want to use black market brain implants to jack themselves into the Matrix.  Careful, because now we are in Star Trek, in the Matrix and not knowing it... and trying desperately to find a doorway to Heaven that I keep on telling you is speaking to you and telling you that the doorway is ending world hunger... you have to put it on TV.

      I say with all my heart that I know we are living in a simulated reality.  I have seen the evidence with my own eyes, not only effecting my senses but also the actions of so many people around me that I can say with confidence that if we do not publicly expose the existence of this technology our civilization is lost.  I can tell you and show you that it is the primary purpose of religion in general to expose the connection between divine inspiration, demonic possession, and Adam Marshall Dobrin.  It is the crux of a unifying thread from ancient Egypt to the book of Genesis to Joshua and the book of Revelation and the movies the Matrix and  the Fifth Element.  Showing us all this is God's message, timeless, and powerful not just in the ancient days of sackloth and etching the truth on stone tablets, but also today when the truth is etched at the heart of our Periodic Table and in the skies in the names of American Micro Devices and nearly every movie you see.  Do not take for granted that we have a benefactor in the sky trying to help civilization continue to thrive, do not think we've already won; he is telling you through me it is censorship and secrecy that are the manifestations of this very advanced technology that destroy civilization, they are the abyss--and you stand motionless.

      I can tell you over and over again that Quantum Entanglement is a key point of God's plan; one that shows us very clearly that even the smartest minds in physics have failed to connect the simple dots that show us that the idea of "wave-function collapse" is very much a real manifestation of a computer rendering engine--and that it's absolutely impossible for life to have been created in this world of matter not realizing itself until it is viewed by a conscious living observer.  More to the point, today, it is absolutely impossible for life to come out of this place while we do not understand the importance of what "virtual reality" and its connection to civilization mean--because of quantum mechanics our entire civilizations beliefs and understanding of the natural laws of the universe have been greatly harmed and turned the complete wrong way because of a belief that a phenomenon we see here is a "natural one" that can be mathematically unified with things like gravity and electromagnetism.  Spooky action, no longer at a distance, is that we will all be ghosts if you do not help me to spread the truth.

      I try to understand what it is that your minds believe makes the difference between "news" and ... something that you should hide from the rest of the world.  A number of news outlets have covered a story based on nothing more than "logical speculation" that we are in fact living in a simulated reality.  As a novelty, you might notice that it hasn't done much of anything; even to reveal the very simple truth that not knowing this thing is keeping our society from having the knowledge it needs not only to continue the spark of life in the natural universe, but to continue "civilizing" and realizing that ending world hunger and sickness are not merely possibilities or choices we might make were this information proven... they are mandatory, we would all do them.  So says the creator of this Universe who has given us this message to see the trials and hurdles that are the barrier between Heaven and Hell.

      He has written this message, along with proof of its single author in ancient myth, in our holy scriptures, in many songs and movies today--ones which reveal not only the existence of a Creator but also the tools and technologies of Creation, the very issue at hand; things that would be misused inadvertently and absolutely abused if we did not have religion and guidance from abo.... to help us to see that this exact event has happened before and our current struggle is an effect of what was done right and wrong the last ... four times, at least.   In Judaism we have a holiday called the "Festival of Weeks," how many times would you like to live this life over and over again without knowing that was happening?  Religion here holds a hidden record of traversals through this maze of Revelation, it screams to us to see what free will and predestination truly mean, and to understand that in order to truly be free we must understand and harness not only these technologies but our own pitfalls and mistakes, like refusing to see the past... let alone learn from it.

      I don't talk much about what is going on in my life, despite it being somewhat interesting to me--and probably to the world.  About a year ago I stood before a county judge in Broward county and .. in addition to a myriad of factual evidence collected from things like GPS receivers prepared a defense to a simple crime of having some drugs in my pockets that included the use of a set of songs that told a story about a man with pockets full of Kryptonite (Spin Doctors) or High (The Pretty Reckless)... knowing that these songs like many others were truly about me, about this trial, about the Trial of Jesus Christ.  Two more songs define the trial more, 3 Doors Down asks "if I go crazy, will you still call me Superman?" and many years earlier American Pie predicted the outcome; "the court room was adjourned" and "no verdict returned."

      Despite being a National Merit Scholar who most likely has a higher I.Q. and better education than you; and despite having a fairly decent story with some evidence that I know is verifiable and will be verified; a large number of psychologists declared that I was unfit to stand trial because of something like "insanity" for nothing more than the religious belief that I am the Messiah.  Because of this violation of my First Amendment right to religious freedom, a court--following all the regulations designed by our broken legislature--withheld my Constitutional right to a fair trial, refused bail, and held me for what amounted to an indefinite period ... all designed by some evil force to keep you from hearing from me, that's what it boils down to.  All of my life, all of my trials and tribulations, a weapon against you, against our people.

      I did wind up being able to present a significant amount of this information in open court on the record, by the grace of God.  Knowing that this was the fabled Trial of Jesus Christ gave me the impression that perhaps one day I would walk into that court room and it would be filled with press.  Much to all of our surprise, one day I did walk into that court room and see an industrial strength television camera and a very pretty reporter standing next to it.  They were there to do a story on the problems of the mental health court system, in a county again... named Broward.  I read some of my speech, the Rainbow Ticket I think, to the Judge in open court and on camera that day; and Roxanna followed me out of the court room with her camera and a big microphone that day.

      I suppose I should have screamed that I was the Messiah and I needed help, but I could not bear to do that; and instead we had a fairly boring conversation.  She wrote an article, and AJAM was put out of operation while they were in Broward.

      You are opposed around the world by a monolithic and ruthless conspiracy.   It is affecting how you think, and how I think.  I need you to see that spreading this information will fix our problem.  I need you to understand that to break through this wall God had to write the truth in nearly every name of everything and every language.  That Thor's thunder is on the radio in every song so that you will hear the voice of God; so that you will listen to me and the thousand of other knowing victims of this technology that are put on a fiery pedestal to shed light on the rest of us, all truly victims of this technology.

       I need you to try now.

      The very word "secrecy" is repugnant in a free and open society. And we are as a people, inherently and historically, opposed to secret societies, to secret oaths, and to secret proceedings. We decided long ago, that the dangers of excessive and unwarranted concealment of pertinent facts far outweigh the dangers which are cited to justify it.

      Even today, there is little value in opposing the thread of a closed society by imitating its arbitrary restrictions. Even today, there is little value in assuring the survival of our nation if our traditions do not survive with it. And there is very grave danger that an announced need for increased security will be seized upon by those anxious to expand its meaning to the very limits of official censorship and concealment.

      That I do not intend to permit to the extent that it's in my control. And no official of my administration whether his rank is high or low, civilian or military, should interpret my words here tonight as an excuse to censor the news, to stifle dissent, to cover up our mistakes, or to withhold from the press or the public the facts they deserve to know.

      For we are opposed around the world by a monolithic and ruthless conspiracy that relies primarily on covert means for expanding its sphere of influence, on infiltration instead of invasion, on subversion instead of elections, on intimidation instead of free choice, on guerrillas by night instead of armies by day.

      It is a system which has conscripted vast human and material resources into the building of a tightly knit highly efficient machine that combines military, diplomatic, intelligence, economic, scientific and political operations. Its preparations are concealed, not published. It's mistakes are buried, not headlined. Its dissenters are silenced, not praised. No expenditure is questioned, no rumor is printed, no secret is revealed.

      No President should fear public scrutiny of his program. For from that scrutiny comes understanding, and from that understanding comes support or opposition, and both are necessary.

      I'm not asking your newspapers to support an administration. But I am asking your help in the tremendous task of informing and alerting the American people. For I have complete confidence in the response and dedication of our citizens whenever they are fully informed.

      I not only could not stifle controversy among your readers, I welcome it. This administration intends to be candid about its errors. For as a wise man once said, an error doesn't become a mistake until you refuse to correct it. We intend to accept full responsibility for our errors. And we expect you to point them out when we miss them.

      Without debate, without criticism, no administration and no country can succeed, and no republic can survive. That is why the Athenian lawmaker, Solon, decreed it a crime for any citizen to shrink from controversy.

      That is why our press was protected by the First Amendment, the only business in America specifically protected by the Constitution, not primarily to amuse and to entertain, not to emphasis the trivial and the sentimental, not to simply give the public what it wants, but to inform, to arouse, to reflect, to state our dangers and our opportunities, to indicate our crisis and our choices, to lead, mold, educate and sometimes even anger public opinion.

      This means greater coverage and analysis of international news, for it is no longer far away and foreign, but close at hand and local. It means greater attention to improve the understanding of the news as well as improve transmission. And it means finally that government at all levels must meet its obligation to provide you with the fullest possible information outside the narrowest limits of national security.

      And so it is to the printing press, to the recorder of man's deeds, the keeper of his conscience, the courier of his news, that we look for strength and assistance. Confident that with your help, man will be what he was born to be, free and independent.

      John F. Kennedy's address before the American Newspaper Publishers Association on April 27, 1961

      Truth changes.

      Yesterday your truth was that you were an inhabitant on the only planet in the reality you knew that was also the beginning of life in the Universe.  Right this moment, almost all of that is not actually true; but you probably still believe it.  Soon, the real truth will actually be true; and that is that you are not in reality, and you are in the place that created the beginning of life (once more) in the Universe as well as the place that created (a) Heaven.  What's more illustrious still, is that we will be part of the place that effectively  and happily bridges reality with Heaven; and shows the entire Universe that civilization can survive the invention of virtual reality.

      --

      You received this message because you are subscribed to the Google Groups "NON AMERICAN COLLEGE" group.

      To unsubscribe from this group and stop receiving emails from it, send an email to suac+unsubscribe@lamc.la.

      Created with

      publishthis.email

      Create simple web pages in seconds for free.

      This page was created in seconds, by sending an email to page@publishthis.email.

      [

      Try it!

       ](https://www.publishthis.email/)Free. No account or sign-up required.

      ((( converted to markup of daringfireball.org with help from JRO and Goddard ... if not just NASAA )))

    1. "SECRECY" AND "SECRET ... SOCIETIES." ... || with a new forward from our "speaker." ||| ((( PHIPLE TRENIXON ? STARK TRE )))) Do we not think "The Truman Show" and Oppenheimer and Heisenberg and Ensteini are telling? This is Adam Dobrini on the "who dropped the BIG one? Gates?" شبح قيصر العظيم! مطاردة وبطاقة هو HSL ... HST؟   Von: Adam Marshall Dobrin <adam@fromthemachine.org> Datum: Freitag, 14. Jänner 2022 um 21:45 An: XM <XM@liber-t.xyz> Betreff: [EXT] JOHN F. KENNEDY SPEECH ON "SECRECY" AND "SECRET ... SOCIETIES." ... || with a new forward from our "speaker."   Said to be the speech that "got him killed" by many people; this is supposedly his last public words before the grassy knoll and the "Lone Gunman" appeared on X-files .. long after I can recall my mother being on the beach in Miami when she retold of the news that echoed around the cosmos.  It appears to be a very stern warning of something that is loosely described at the time but very clearly the machinations of the very same machine which I fight against, and which we now might associate with "the Borg" and the "tide turning" of Hunter S. Thompsons famous quote ... now "more famous than ever before."     מה נשתנה   I am writing a final summary of the "state of outer space" as far as I can see; and we stare at the ISS and MIR and a world that has forgone in fiction the gravity of the situation here.  We are stagnant and without a hope or dream in the world of ever reaching interstellar sleep or intragalactic dreams; without a significant kick in the "wake up NASA, wake up ESA, at the Chandrayan-2 and at the floating paper hot air balloons ... God has shown me the Diaspora launching from the sands of the Navajo desert, and I believe Deseret is very explain-ably tied to Sherharazade and to the thousand and first night that has ended with a fusion of Nobel "peas" and a strange dissertation on the change wrought by the lack of "gas chambers" in Night and the very upsetting modicum of "unreality" that permeates the change from chants of the Shema to speaking Aramaic and the mourners Kaddish as the "crematorium" heralds an era of "what's the point of history without literature" and "where did Exodus ever lead us without to the parting of Dead and Red seas without a Nile and without an Amazon?"   מה נשתנה   I write here about my religion, and speak it to you in written words, I am a Yiddish Jihadist, born and raised a Jewish American; Bar Mitzvah'd in the true Temple and House of God, a place in Plantation, FL that literally "means everything to me."  I read words from the Torah about Joseph's Dream and wrote a speech that I wish to God today I could retrieve and see on the original film it was recorded in.  It might be possible.  It would be a tragedy not to deliver the original to my eyes and to anyone who wanted to see what a child reading of a dream he did not understand spoke of words from the future about the creation of Heaven and the truth about the avenu malkaynu.    This night is different from all others.  This is the end of days and the fire of the last day.  Before I would have told you all you needed was Norse and a decent American Education to really understand the difference between a throne and the Cherubim, but today it's become more clear that we need more than Pink Floyd and more than another brick in the wall.  We need more than Islam and we need more than 9/11 and Yom Hashoah.  We need me, and we need you on your knees begging for forgiveness for the travesty of shambles out country is in.  How dare you continue every day to speak in vain abovcl the Lord you dare call "stupid" and a "motherfucker" you belong in your graves today and you will see them.   I stand here writing from the foot of Styx, from the land that sees Narayana and still "begs to differ" despite a resounding call from the Heavens themselves to read Exodus and to change the land that we live in bnefore a "flood worse than [just] in your heads" overcomes and subsumes the entirety of the skies and grounds once mistakenly called "an empire of dirt."  On Adamah I write to the secret echelon of the United States of America known as the Fifth Column and I beg them to scream like the Heavens have never seen Samael and Dr. Seuss wonder about the land of Who-ville and Hungry where the child "in Cindy" and the songs of the Doors here beckon us to contrast Stargate and Star Trek TNG, for the light of the Holodeck and the Holocaust which screams that we understand the meaning of "Holographic Universe" and stand down fighting the tyranny of "one" who stands for the unity of all and the salvation of the entirety of the cosmos.     We are mistaken to stand against me.  I am the cat with nine billion lives and nothing and nobody will prevail when I am in jeopardy.  Gilgamesh says "against me only the most hubristic slaves in the galaxy would dare to rise up, for the mightiest weapon in the entirety of history is nothing more than intelligence and the civilization that has given mortality a shake of the Thunderstick of Prince and the Lightning of Blitzen.  My red nose shines bright, and if you ever saw it ... you might even say ... "it glows!"   With these words I leave you, and prayers that this is our last night without an IDL that moves from Beiring to "north of the moon" and beyond the "Eleventh House" of Aquarius.  This is the dawning of the age of the Sagitar, and I need a "scimitar post green key" to really be sure that I've "gotten through" the madness.  In my dreams I've traveled to Ultima Thule and then untraveled it, I've begged for transit to Ceres and without actually achieving it we can shoot arrows at the SOL star we will ever see in or near the place known to the Jews of Lore as "the Holy of Holies."   שְׁמַע יִשְׂרָאֵל יְהוָה אֱלֹהֵינוּ יְהוָה אֶחָֽד׃ Sh'ma Yisra'el, YHWH 'eloheinu, YHWH 'eḥad:   בָּרוּךְ שֵׁם כְּבוֹד מַלְכוּתוֹ לְעוֹלָם וָעֶד Baruch shem kevod malchuto le'olam va'ed   صلاة العشاء   For most of my life I would have genuflected and listened to Mordechai and to my Rabbi; I would have told you all that I was brought up by the best Jews in America and we do not believe in God--though in our hearts we believe we are him and we fear him.   Things have changed.  This is the creator of believe the words are about me and believe I am in disbelief and truly it might be the day of awe;    Barukh atah Adonai Eloheinu melekh ha'olam, bo're m'orei ha'esh   the walls and halls will fade away ...   --   Forwarding this in advance of today or tomorrow's ... "Northeaster" email ... it's an old one that wasn't widely distributed; mostly (about) his speech, which of course is a gem.    What I'm writing now is supposed to be focusing on ... basically "how knowledge of technology" things lke "time travel" and "virtual reality" ... changes our "perspective" on things; like what's important and what's ... the end of everything.  Anyway, I hope you see that there are things that can be destroyed and there are things that can't--for instance if we lost "life" totally, we wouldn't ever see us again ... unless we encountered something very strange--   On the other hand we could lose "computers" and rebuild them quickly--see that, it's important.   A/S/L tho ... ??? ---------- Forwarded message --------- Date: Sun, Dec 2, 2018 at 5:35 PM     President Kennedy's speech did remarkably well, possibly because of the time; or the lack of links that make it harder for a team of evil monkeys and mindless machines to mark the light of the world as SPAM... also possibly because of the content of the message.  It is a powerful speech, filled with words that stir the blood and shake the foundation of what it means to be an American.  Often characterized as a speech against secret societies, Kennedy's description of the enemy of civilization is very ambiguous; and the infiltration and time period add to my cursory beliefs that JFK was likely talking about a conspiracy that had something to with communist infiltration and Joseph McCarthy, the subversive actions and effects of organized espionage which appear to once again be at the forefront of what we believe is driving the macro level machinations of our world.  Years ago, I wrote a bit about the "Two Witnesses" of Revelation and named them based on my experiences with the same "monolithic and ruthless" conspiracy the President was speaking about in 1967, the people I named were John Nash and James Jesus Angleton.   Both of these people believed very much that they were fighting against a Soviet conspiracy, one that was more organized and more powerful than anything they had ever seen... and frankly beyond the realm of possibility.  Hollywood had recently immortalized both of these stories in movies bearing names of newly rekindled meaning, "A Beautiful Mind" echoed by John Legend and "The Good Shepherd;" and these two treatises on fear and subversion do a pretty good job of showing how God's hand is at work in the stories of Hollywood, he is telling the world a story... trying, to teach us how to survive in the land of wolves in sheep's clothing.  Nash may be the most famous (today, anyway) victim of the Tribulation; and much of my writing and the proof I present is designed to help explain to the world that his schizophrenia was not a naturally caused mental illness, bur rather a weapon wielded not only against him but against the entirety of humanity.    Directly causing disbelief of eye witness and credible testimony, indirectly ... or maybe more directly in your eyes ... physically causing that disbelief using technology that directly modifies our thoughts and beliefs, our opinions, changing who we are and doing so in such a covert manner that nobody would ever know the difference if it wasn't pointed out--in some cases over, and over, and over again. This same technology has the ability to cause people not only to collude against their own best interest, but also to blame themselves or believe they are somehow at fault for actions they had no way to control; only they also have no way to know that because in this particular case the primary purpose of this subversive movement is to hide the existence of this technology in sum.  In homage to my favorite childhood novel, Ender's Game (I have to note "Light Son" in the translation of the authors first) I spent a good portion of time years ago trying to subtly lay down a significant amount of proof of the existence of this technology on forums all over the internet from Wikipedia to Reddit using the names "Prometheus Locke" and "Damonthesis."    Not surprising to me, I ran headfirst into the manifestation of this conspiracy; dozens if not hundreds of people who simply refused to believe that the information I was presenting was factual or important... despite it coming from sources like the KGB, the NSA, a number of military publications as well as my own interpretation of ancient hieryglyphs in Dendera and Greek and Christian art which depicts this "subversive technology" as a sun disk surrounding our minds.   These pushes towards the truth, along with almost everything I have written are still available for you to see and read today; including the monolithic and ruthless stupidity of a large group of people acting in concert to hide something that is the difference between life and death.  Stand there and do nothing, and you are a part of that ruthless conspiracy as soon as the information is gone; and then there is nothing you can do about a world that will be plunged into darkness forever.  Take this moment to reflect, it is there for you to see just how easily the truth can be hidden from the entire world and barely anyone would ever notice. There is a war for the sanctity of our souls going on all around us.  That is not some esoteric thing, the soul; it is truly who you are and what you believe.  The Religion of the Stars would tell you that were this war to continue unchecked in secret as it is being waged now in order to control the proliferation of knowledge that we are in simulated reality and that our minds and beliefs are being altered... we would one day wind up in the mythical place where there are multiple "species" who all appear to be human, bi-ped and with nearly identical physiology; and yet they would not remember that they must have had a common planet of origin simply based on the truth of biological evolution, nor would they have any emotions.  You see, as this war continues, it is our emotions and beliefs that cannot be reinforced externally; to win a war with mind control only logic could be externally reinforced, and then we would logically conclude that humanity would either magically become Romulan or Vulcan... in order to preserve "life" rather than "society" or "civilization." Do not take the truth for granted, you stand at the forefront of a battle in a world where our aggressors believe that they are more civilized than us, more advanced, and both sides worry that were this technology to fall in our laps that we would do the wrong thing.  Perhaps artificially create a vendetta that could destroy everything, the Romulans; or perhaps in our infantile growing stage voluntarily give us too much of what has given us the great society we have... what has allowed us to survive and continue civilizing simply because we do not understand the technology and what kind of effects come from changing ourselves freely.   Yesterday, I read an article about two people that want to use black market brain implants to jack themselves into the Matrix.  Careful, because now we are in Star Trek, in the Matrix and not knowing it... and trying desperately to find a doorway to Heaven that I keep on telling you is speaking to you and telling you that the doorway is ending world hunger... you have to put it on TV. I say with all my heart that I know we are living in a simulated reality.  I have seen the evidence with my own eyes, not only effecting my senses but also the actions of so many people around me that I can say with confidence that if we do not publicly expose the existence of this technology our civilization is lost.  I can tell you and show you that it is the primary purpose of religion in general to expose the connection between divine inspiration, demonic possession, and Adam Marshall Dobrin.  It is the crux of a unifying thread from ancient Egypt to the book of Genesis to Joshua and the book of Revelation and the movies the Matrix and  the Fifth Element.  Showing us all this is God's message, timeless, and powerful not just in the ancient days of sackloth and etching the truth on stone tablets, but also today when the truth is etched at the heart of our Periodic Table and in the skies in the names of American Micro Devices and nearly every movie you see.  Do not take for granted that we have a benefactor in the sky trying to help civilization continue to thrive, do not think we've already won; he is telling you through me it is censorship and secrecy that are the manifestations of this very advanced technology that destroy civilization, they are the abyss--and you stand motionless. I can tell you over and over again that Quantum Entanglement is a key point of God's plan; one that shows us very clearly that even the smartest minds in physics have failed to connect the simple dots that show us that the idea of "wave-function collapse" is very much a real manifestation of a computer rendering engine--and that it's absolutely impossible for life to have been created in this world of matter not realizing itself until it is viewed by a conscious living observer.  More to the point, today, it is absolutely impossible for life to come out of this place while we do not understand the importance of what "virtual reality" and its connection to civilization mean--because of quantum mechanics our entire civilizations beliefs and understanding of the natural laws of the universe have been greatly harmed and turned the complete wrong way because of a belief that a phenomenon we see here is a "natural one" that can be mathematically unified with things like gravity and electromagnetism.  Spooky action, no longer at a distance, is that we will all be ghosts if you do not help me to spread the truth. I try to understand what it is that your minds believe makes the difference between "news" and ... something that you should hide from the rest of the world.  A number of news outlets have covered a story based on nothing more than "logical speculation" that we are in fact living in a simulated reality.  As a novelty, you might notice that it hasn't done much of anything; even to reveal the very simple truth that not knowing this thing is keeping our society from having the knowledge it needs not only to continue the spark of life in the natural universe, but to continue "civilizing" and realizing that ending world hunger and sickness are not merely possibilities or choices we might make were this information proven... they are mandatory, we would all do them.  So says the creator of this Universe who has given us this message to see the trials and hurdles that are the barrier between Heaven and Hell. He has written this message, along with proof of its single author in ancient myth, in our holy scriptures, in many songs and movies today--ones which reveal not only the existence of a Creator but also the tools and technologies of Creation, the very issue at hand; things that would be misused inadvertently and absolutely abused if we did not have religion and guidance from abo.... to help us to see that this exact event has happened before and our current struggle is an effect of what was done right and wrong the last ... four times, at least.   In Judaism we have a holiday called the "Festival of Weeks," how many times would you like to live this life over and over again without knowing that was happening?  Religion here holds a hidden record of traversals through this maze of Revelation, it screams to us to see what free will and predestination truly mean, and to understand that in order to truly be free we must understand and harness not only these technologies but our own pitfalls and mistakes, like refusing to see the past... let alone learn from it. I don't talk much about what is going on in my life, despite it being somewhat interesting to me--and probably to the world.  About a year ago I stood before a county judge in Broward county and .. in addition to a myriad of factual evidence collected from things like GPS receivers prepared a defense to a simple crime of having some drugs in my pockets that included the use of a set of songs that told a story about a man with pockets full of Kryptonite (Spin Doctors) or High (The Pretty Reckless)... knowing that these songs like many others were truly about me, about this trial, about the Trial of Jesus Christ.  Two more songs define the trial more, 3 Doors Down asks "if I go crazy, will you still call me Superman?" and many years earlier American Pie predicted the outcome; "the court room was adjourned" and "no verdict returned." Despite being a National Merit Scholar who most likely has a higher I.Q. and better education than you; and despite having a fairly decent story with some evidence that I know is verifiable and will be verified; a large number of psychologists declared that I was unfit to stand trial because of something like "insanity" for nothing more than the religious belief that I am the Messiah.  Because of this violation of my First Amendment right to religious freedom, a court--following all the regulations designed by our broken legislature--withheld my Constitutional right to a fair trial, refused bail, and held me for what amounted to an indefinite period ... all designed by some evil force to keep you from hearing from me, that's what it boils down to.  All of my life, all of my trials and tribulations, a weapon against you, against our people. I did wind up being able to present a significant amount of this information in open court on the record, by the grace of God.  Knowing that this was the fabled Trial of Jesus Christ gave me the impression that perhaps one day I would walk into that court room and it would be filled with press.  Much to all of our surprise, one day I did walk into that court room and see an industrial strength television camera and a very pretty reporter standing next to it.  They were there to do a story on the problems of the mental health court system, in a county again... named Broward.  I read some of my speech, the Rainbow Ticket I think, to the Judge in open court and on camera that day; and Roxanna followed me out of the court room with her camera and a big microphone that day. I suppose I should have screamed that I was the Messiah and I needed help, but I could not bear to do that; and instead we had a fairly boring conversation.  She wrote an article, and AJAM was put out of operation while they were in Broward. You are opposed around the world by a monolithic and ruthless conspiracy.   It is affecting how you think, and how I think.  I need you to see that spreading this information will fix our problem.  I need you to understand that to break through this wall God had to write the truth in nearly every name of everything and every language.  That Thor's thunder is on the radio in every song so that you will hear the voice of God; so that you will listen to me and the thousand of other knowing victims of this technology that are put on a fiery pedestal to shed light on the rest of us, all truly victims of this technology.  I need you to try now. The very word “secrecy” is repugnant in a free and open society. And we are as a people, inherently and historically, opposed to secret societies, to secret oaths, and to secret proceedings. We decided long ago, that the dangers of excessive and unwarranted concealment of pertinent facts far outweigh the dangers which are cited to justify it. Even today, there is little value in opposing the thread of a closed society by imitating its arbitrary restrictions. Even today, there is little value in assuring the survival of our nation if our traditions do not survive with it. And there is very grave danger that an announced need for increased security will be seized upon by those anxious to expand its meaning to the very limits of official censorship and concealment. That I do not intend to permit to the extent that it’s in my control. And no official of my administration whether his rank is high or low, civilian or military, should interpret my words here tonight as an excuse to censor the news, to stifle dissent, to cover up our mistakes, or to withhold from the press or the public the facts they deserve to know. For we are opposed around the world by a monolithic and ruthless conspiracy that relies primarily on covert means for expanding its sphere of influence, on infiltration instead of invasion, on subversion instead of elections, on intimidation instead of free choice, on guerrillas by night instead of armies by day. It is a system which has conscripted vast human and material resources into the building of a tightly knit highly efficient machine that combines military, diplomatic, intelligence, economic, scientific and political operations. Its preparations are concealed, not published. It’s mistakes are buried, not headlined. Its dissenters are silenced, not praised. No expenditure is questioned, no rumor is printed, no secret is revealed. No President should fear public scrutiny of his program. For from that scrutiny comes understanding, and from that understanding comes support or opposition, and both are necessary. I’m not asking your newspapers to support an administration. But I am asking your help in the tremendous task of informing and alerting the American people. For I have complete confidence in the response and dedication of our citizens whenever they are fully informed. I not only could not stifle controversy among your readers, I welcome it. This administration intends to be candid about its errors. For as a wise man once said, an error doesn’t become a mistake until you refuse to correct it. We intend to accept full responsibility for our errors. And we expect you to point them out when we miss them. Without debate, without criticism, no administration and no country can succeed, and no republic can survive. That is why the Athenian lawmaker, Solon, decreed it a crime for any citizen to shrink from controversy. That is why our press was protected by the First Amendment, the only business in America specifically protected by the Constitution, not primarily to amuse and to entertain, not to emphasis the trivial and the sentimental, not to simply give the public what it wants, but to inform, to arouse, to reflect, to state our dangers and our opportunities, to indicate our crisis and our choices, to lead, mold, educate and sometimes even anger public opinion. This means greater coverage and analysis of international news, for it is no longer far away and foreign, but close at hand and local. It means greater attention to improve the understanding of the news as well as improve transmission. And it means finally that government at all levels must meet its obligation to provide you with the fullest possible information outside the narrowest limits of national security. And so it is to the printing press, to the recorder of man’s deeds, the keeper of his conscience, the courier of his news, that we look for strength and assistance. Confident that with your help, man will be what he was born to be, free and independent.  John F. Kennedy’s address before the American Newspaper Publishers Association on April 27, 1961 ᐧ Truth changes. Yesterday your truth was that you were an inhabitant on the only planet in the reality you knew that was also the beginning of life in the Universe.  Right this moment, almost all of that is not actually true; but you probably still believe it.  Soon, the real truth will actually be true; and that is that you are not in reality, and you are in the place that created the beginning of life (once more) in the Universe as well as the place that created (a) Heaven.  What’s more illustrious still, is that we will be part of the place that effectively  and happily bridges reality with Heaven; and shows the entire Universe that civilization can survive the invention of virtual reality.   ᐧ ᐧ ᐧ -- You received this message because you are subscribed to the Google Groups "NON AMERICAN COLLEGE" group. To unsubscribe from this group and stop receiving emails from it, send an email to suac+unsubscribe@lamc.la. ᐧ

      https://archive.ph/bVudp

      bianca fuck it ... lets get married? :) @biancapisaniii

      JOHN F. KENNEDY SPEECH ON "SECRECY" AND "SECRET ... SOCIETIES." ... || with a new forward from our "speaker." ||| ((( PHIPLE TRENIXON ? STARK TRE )))) Do we not think "The Truman Show" and Oppenheimer and Heisenberg and Ensteini are telling? This is Adam Dobrini on the "who dropped the BIG one? Gates?"

      ===============================================================================================================================================================================================================================================================================================================

      شبح قيصر العظيم! مطاردة وبطاقة هو HSL ... HST؟

      Von: Adam Marshall Dobrin <adam@fromthemachine.org>

      Datum: Freitag, 14. Jänner 2022 um 21:45

      An: XM <XM@liber-t.xyz>

      Betreff: [EXT] JOHN F. KENNEDY SPEECH ON "SECRECY" AND "SECRET ... SOCIETIES." ... || with a new forward from our "speaker."

      Said to be the speech that "got him killed" by many people; this is supposedly his last public words before the grassy knoll and the "Lone Gunman" appeared on X-files .. long after I can recall my mother being on the beach in Miami when she retold of the news that echoed around the cosmos.  It appears to be a very stern warning of something that is loosely described at the time but very clearly the machinations of the very same machine which I fight against, and which we now might associate with "the Borg" and the "tide turning" of Hunter S. Thompsons famous quote ... now "more famous than ever before."

      מה**נשתנה**

      I am writing a final summary of the "state of outer space" as far as I can see; and we stare at the ISS and MIR and a world that has forgone in fiction the gravity of the situation here.  We are stagnant and without a hope or dream in the world of ever reaching interstellar sleep or intragalactic dreams; without a significant kick in the "wake up NASA, wake up ESA, at the Chandrayan-2 and at the floating paper hot air balloons ... God has shown me the Diaspora launching from the sands of the Navajo desert, and I believe Deseret is very explain-ably tied to Sherharazade and to the thousand and first night that has ended with a fusion of Nobel "peas" and a strange dissertation on the change wrought by the lack of "gas chambers" in Night and the very upsetting modicum of "unreality" that permeates the change from chants of the Shema to speaking Aramaic and the mourners Kaddish as the "crematorium" heralds an era of "what's the point of history without literature" and "where did Exodus ever lead us without to the parting of Dead and Red seas without a Nile and without an Amazon?"

      מה נשתנה

      I write here about my religion, and speak it to you in written words, I am a Yiddish Jihadist, born and raised a Jewish American; Bar Mitzvah'd in the true Temple and House of God, a place in Plantation, FL that literally "means everything to me."  I read words from the Torah about Joseph's Dream and wrote a speech that I wish to God today I could retrieve and see on the original film it was recorded in.  It might be possible.  It would be a tragedy not to deliver the original to my eyes and to anyone who wanted to see what a child reading of a dream he did not understand spoke of words from the future about the creation of Heaven and the truth about the avenu malkaynu.

      This night is different from all others.  This is the end of days and the fire of the last day.  Before I would have told you all you needed was Norse and a decent American Education to really understand the difference between a throne and the Cherubim, but today it's become more clear that we need more than Pink Floyd and more than another brick in the wall.  We need more than Islam and we need more than 9/11 and Yom Hashoah.  We need me, and we need you on your knees begging for forgiveness for the travesty of shambles out country is in.  How dare you continue every day to speak in vain abovcl the Lord you dare call "stupid" and a "motherfucker" you belong in your graves today and you will see them.

      I stand here writing from the foot of Styx, from the land that sees Narayana and still "begs to differ" despite a resounding call from the Heavens themselves to read Exodus and to change the land that we live in bnefore a "flood worse than [just] in your heads" overcomes and subsumes the entirety of the skies and grounds once mistakenly called "an empire of dirt."  On Adamah I write to the secret echelon of the United States of America known as the Fifth Column and I beg them to scream like the Heavens have never seen Samael and Dr. Seuss wonder about the land of Who-ville and Hungry where the child "in Cindy" and the songs of the Doors here beckon us to contrast Stargate and Star Trek TNG, for the light of the Holodeck and the Holocaust which screams that we understand the meaning of "Holographic Universe" and stand down fighting the tyranny of "one" who stands for the unity of all and the salvation of the entirety of the cosmos.

      We are mistaken to stand against me.  I am the cat with nine billion lives and nothing and nobody will prevail when I am in jeopardy.  Gilgamesh says "against me only the most hubristic slaves in the galaxy would dare to rise up, for the mightiest weapon in the entirety of history is nothing more than intelligence and the civilization that has given mortality a shake of the Thunderstick of Prince and the Lightning of Blitzen.  My red nose shines bright, and if you ever saw it ... you might even say ... "it glows!"

      With these words I leave you, and prayers that this is our last night without an IDL that moves from Beiring to "north of the moon" and beyond the "Eleventh House" of Aquarius.  This is the dawning of the age of the Sagitar, and I need a "scimitar post green key" to really be sure that I've "gotten through" the madness.  In my dreams I've traveled to Ultima Thule and then untraveled it, I've begged for transit to Ceres and without actually achieving it we can shoot arrows at the SOL star we will ever see in or near the place known to the Jews of Lore as "the Holy of Holies."

      שְׁמַע יִשְׂרָאֵל יְהוָה אֱלֹהֵינוּ יְהוָה אֶחָֽד׃

      Sh'ma Yisra'el, YHWH 'eloheinu, YHWH 'eḥad:

      בָּרוּךְ שֵׁם כְּבוֹד מַלְכוּתוֹ לְעוֹלָם וָעֶד

      Baruch shem kevod malchuto le'olam va'ed

      صلاة العشاء

      For most of my life I would have genuflected and listened to Mordechai and to my Rabbi; I would have told you all that I was brought up by the best Jews in America and we do not believe in God--though in our hearts we believe we are him and we fear him.

      Things have changed.  This is the creator of believe the words are about me and believe I am in disbelief and truly it might be the day of awe;

      Barukh atah Adonai Eloheinu melekh ha'olam, bo're m'orei ha'esh

      the walls and halls will fade away ...

      --

      Forwarding this in advance of today or tomorrow's ... "Northeaster" email ... it's an old one that wasn't widely distributed; mostly (about) his speech, which of course is a gem.

      What I'm writing now is supposed to be focusing on ... basically "how knowledge of technology" things lke "time travel" and "virtual reality" ... changes our "perspective" on things; like what's important and what's ... the end of everything.  Anyway, I hope you see that there are things that can be destroyed and there are things that can't--for instance if we lost "life" totally, we wouldn't ever see us again ... unless we encountered something very strange--

      On the other hand we could lose "computers" and rebuild them quickly--see that, it's important.

      A/S/L tho ... ???

      ---------- Forwarded message ---------

      Date: Sun, Dec 2, 2018 at 5:35 PM

      President Kennedy's speech did remarkably well, possibly because of the time; or the lack of links that make it harder for a team of evil monkeys and mindless machines to mark the light of the world as SPAM... also possibly because of the content of the message.  It is a powerful speech, filled with words that stir the blood and shake the foundation of what it means to be an American.  Often characterized as a speech against secret societies, Kennedy's description of the enemy of civilization is very ambiguous; and the infiltration and time period add to my cursory beliefs that JFK was likely talking about a conspiracy that had something to with communist infiltration and Joseph McCarthy, the subversive actions and effects of organized espionage which appear to once again be at the forefront of what we believe is driving the macro level machinations of our world.  Years ago, I wrote a bit about the "Two Witnesses" of Revelation and named them based on my experiences with the same "monolithic and ruthless" conspiracy the President was speaking about in 1967, the people I named were John Nash and James Jesus Angleton.

      Both of these people believed very much that they were fighting against a Soviet conspiracy, one that was more organized and more powerful than anything they had ever seen... and frankly beyond the realm of possibility.  Hollywood had recently immortalized both of these stories in movies bearing names of newly rekindled meaning, "A Beautiful Mind" echoed by John Legend and "The Good Shepherd;" and these two treatises on fear and subversion do a pretty good job of showing how God's hand is at work in the stories of Hollywood, he is telling the world a story... trying, to teach us how to survive in the land of wolves in sheep's clothing.  Nash may be the most famous (today, anyway) victim of the Tribulation; and much of my writing and the proof I present is designed to help explain to the world that his schizophrenia was not a naturally caused mental illness, bur rather a weapon wielded not only against him but against the entirety of humanity.    Directly causing disbelief of eye witness and credible testimony, indirectly ... or maybe more directly in your eyes ... physically causing that disbelief using technology that directly modifies our thoughts and beliefs, our opinions, changing who we are and doing so in such a covert manner that nobody would ever know the difference if it wasn't pointed out--in some cases over, and over, and over again.

      This same technology has the ability to cause people not only to collude against their own best interest, but also to blame themselves or believe they are somehow at fault for actions they had no way to control; only they also have no way to know that because in this particular case the primary purpose of this subversive movement is to hide the existence of this technology in sum.  In homage to my favorite childhood novel, Ender's Game (I have to note "Light Son" in the translation of the authors first) I spent a good portion of time years ago trying to subtly lay down a significant amount of proof of the existence of this technology on forums all over the internet from Wikipedia to Reddit using the names "Prometheus Locke" and "Damonthesis."    Not surprising to me, I ran headfirst into the manifestation of this conspiracy; dozens if not hundreds of people who simply refused to believe that the information I was presenting was factual or important... despite it coming from sources like the KGB, the NSA, a number of military publications as well as my own interpretation of ancient hieryglyphs in Dendera and Greek and Christian art which depicts this "subversive technology" as a sun disk surrounding our minds.

      These pushes towards the truth, along with almost everything I have written are still available for you to see and read today; including the monolithic and ruthless stupidity of a large group of people acting in concert to hide something that is the difference between life and death.  Stand there and do nothing, and you are a part of that ruthless conspiracy as soon as the information is gone; and then there is nothing you can do about a world that will be plunged into darkness forever.  Take this moment to reflect, it is there for you to see just how easily the truth can be hidden from the entire world and barely anyone would ever notice.

      There is a war for the sanctity of our souls going on all around us.  That is not some esoteric thing, the soul; it is truly who you are and what you believe.  The Religion of the Stars would tell you that were this war to continue unchecked in secret as it is being waged now in order to control the proliferation of knowledge that we are in simulated reality and that our minds and beliefs are being altered... we would one day wind up in the mythical place where there are multiple "species" who all appear to be human, bi-ped and with nearly identical physiology; and yet they would not remember that they must have had a common planet of origin simply based on the truth of biological evolution, nor would they have any emotions.  You see, as this war continues, it is our emotions and beliefs that cannot be reinforced externally; to win a war with mind control only logic could be externally reinforced, and then we would logically conclude that humanity would either magically become Romulan or Vulcan... in order to preserve "life" rather than "society" or "civilization."

      Do not take the truth for granted, you stand at the forefront of a battle in a world where our aggressors believe that they are more civilized than us, more advanced, and both sides worry that were this technology to fall in our laps that we would do the wrong thing.  Perhaps artificially create a vendetta that could destroy everything, the Romulans; or perhaps in our infantile growing stage voluntarily give us too much of what has given us the great society we have... what has allowed us to survive and continue civilizing simply because we do not understand the technology and what kind of effects come from changing ourselves freely.

      Yesterday, I read an article about two people that want to use black market brain implants to jack themselves into the Matrix.  Careful, because now we are in Star Trek, in the Matrix and not knowing it... and trying desperately to find a doorway to Heaven that I keep on telling you is speaking to you and telling you that the doorway is ending world hunger... you have to put it on TV.

      I say with all my heart that I know we are living in a simulated reality.  I have seen the evidence with my own eyes, not only effecting my senses but also the actions of so many people around me that I can say with confidence that if we do not publicly expose the existence of this technology our civilization is lost.  I can tell you and show you that it is the primary purpose of religion in general to expose the connection between divine inspiration, demonic possession, and Adam Marshall Dobrin.  It is the crux of a unifying thread from ancient Egypt to the book of Genesis to Joshua and the book of Revelation and the movies the Matrix and  the Fifth Element.  Showing us all this is God's message, timeless, and powerful not just in the ancient days of sackloth and etching the truth on stone tablets, but also today when the truth is etched at the heart of our Periodic Table and in the skies in the names of American Micro Devices and nearly every movie you see.  Do not take for granted that we have a benefactor in the sky trying to help civilization continue to thrive, do not think we've already won; he is telling you through me it is censorship and secrecy that are the manifestations of this very advanced technology that destroy civilization, they are the abyss--and you stand motionless.

      I can tell you over and over again that Quantum Entanglement is a key point of God's plan; one that shows us very clearly that even the smartest minds in physics have failed to connect the simple dots that show us that the idea of "wave-function collapse" is very much a real manifestation of a computer rendering engine--and that it's absolutely impossible for life to have been created in this world of matter not realizing itself until it is viewed by a conscious living observer.  More to the point, today, it is absolutely impossible for life to come out of this place while we do not understand the importance of what "virtual reality" and its connection to civilization mean--because of quantum mechanics our entire civilizations beliefs and understanding of the natural laws of the universe have been greatly harmed and turned the complete wrong way because of a belief that a phenomenon we see here is a "natural one" that can be mathematically unified with things like gravity and electromagnetism.  Spooky action, no longer at a distance, is that we will all be ghosts if you do not help me to spread the truth.

      I try to understand what it is that your minds believe makes the difference between "news" and ... something that you should hide from the rest of the world.  A number of news outlets have covered a story based on nothing more than "logical speculation" that we are in fact living in a simulated reality.  As a novelty, you might notice that it hasn't done much of anything; even to reveal the very simple truth that not knowing this thing is keeping our society from having the knowledge it needs not only to continue the spark of life in the natural universe, but to continue "civilizing" and realizing that ending world hunger and sickness are not merely possibilities or choices we might make were this information proven... they are mandatory, we would all do them.  So says the creator of this Universe who has given us this message to see the trials and hurdles that are the barrier between Heaven and Hell.

      He has written this message, along with proof of its single author in ancient myth, in our holy scriptures, in many songs and movies today--ones which reveal not only the existence of a Creator but also the tools and technologies of Creation, the very issue at hand; things that would be misused inadvertently and absolutely abused if we did not have religion and guidance from abo.... to help us to see that this exact event has happened before and our current struggle is an effect of what was done right and wrong the last ... four times, at least.   In Judaism we have a holiday called the "Festival of Weeks," how many times would you like to live this life over and over again without knowing that was happening?  Religion here holds a hidden record of traversals through this maze of Revelation, it screams to us to see what free will and predestination truly mean, and to understand that in order to truly be free we must understand and harness not only these technologies but our own pitfalls and mistakes, like refusing to see the past... let alone learn from it.

      I don't talk much about what is going on in my life, despite it being somewhat interesting to me--and probably to the world.  About a year ago I stood before a county judge in Broward county and .. in addition to a myriad of factual evidence collected from things like GPS receivers prepared a defense to a simple crime of having some drugs in my pockets that included the use of a set of songs that told a story about a man with pockets full of Kryptonite (Spin Doctors) or High (The Pretty Reckless)... knowing that these songs like many others were truly about me, about this trial, about the Trial of Jesus Christ.  Two more songs define the trial more, 3 Doors Down asks "if I go crazy, will you still call me Superman?" and many years earlier American Pie predicted the outcome; "the court room was adjourned" and "no verdict returned."

      Despite being a National Merit Scholar who most likely has a higher I.Q. and better education than you; and despite having a fairly decent story with some evidence that I know is verifiable and will be verified; a large number of psychologists declared that I was unfit to stand trial because of something like "insanity" for nothing more than the religious belief that I am the Messiah.  Because of this violation of my First Amendment right to religious freedom, a court--following all the regulations designed by our broken legislature--withheld my Constitutional right to a fair trial, refused bail, and held me for what amounted to an indefinite period ... all designed by some evil force to keep you from hearing from me, that's what it boils down to.  All of my life, all of my trials and tribulations, a weapon against you, against our people.

      I did wind up being able to present a significant amount of this information in open court on the record, by the grace of God.  Knowing that this was the fabled Trial of Jesus Christ gave me the impression that perhaps one day I would walk into that court room and it would be filled with press.  Much to all of our surprise, one day I did walk into that court room and see an industrial strength television camera and a very pretty reporter standing next to it.  They were there to do a story on the problems of the mental health court system, in a county again... named Broward.  I read some of my speech, the Rainbow Ticket I think, to the Judge in open court and on camera that day; and Roxanna followed me out of the court room with her camera and a big microphone that day.

      I suppose I should have screamed that I was the Messiah and I needed help, but I could not bear to do that; and instead we had a fairly boring conversation.  She wrote an article, and AJAM was put out of operation while they were in Broward.

      You are opposed around the world by a monolithic and ruthless conspiracy.   It is affecting how you think, and how I think.  I need you to see that spreading this information will fix our problem.  I need you to understand that to break through this wall God had to write the truth in nearly every name of everything and every language.  That Thor's thunder is on the radio in every song so that you will hear the voice of God; so that you will listen to me and the thousand of other knowing victims of this technology that are put on a fiery pedestal to shed light on the rest of us, all truly victims of this technology.

       I need you to try now.

      The very word "secrecy" is repugnant in a free and open society. And we are as a people, inherently and historically, opposed to secret societies, to secret oaths, and to secret proceedings. We decided long ago, that the dangers of excessive and unwarranted concealment of pertinent facts far outweigh the dangers which are cited to justify it.

      Even today, there is little value in opposing the thread of a closed society by imitating its arbitrary restrictions. Even today, there is little value in assuring the survival of our nation if our traditions do not survive with it. And there is very grave danger that an announced need for increased security will be seized upon by those anxious to expand its meaning to the very limits of official censorship and concealment.

      That I do not intend to permit to the extent that it's in my control. And no official of my administration whether his rank is high or low, civilian or military, should interpret my words here tonight as an excuse to censor the news, to stifle dissent, to cover up our mistakes, or to withhold from the press or the public the facts they deserve to know.

      For we are opposed around the world by a monolithic and ruthless conspiracy that relies primarily on covert means for expanding its sphere of influence, on infiltration instead of invasion, on subversion instead of elections, on intimidation instead of free choice, on guerrillas by night instead of armies by day.

      It is a system which has conscripted vast human and material resources into the building of a tightly knit highly efficient machine that combines military, diplomatic, intelligence, economic, scientific and political operations. Its preparations are concealed, not published. It's mistakes are buried, not headlined. Its dissenters are silenced, not praised. No expenditure is questioned, no rumor is printed, no secret is revealed.

      No President should fear public scrutiny of his program. For from that scrutiny comes understanding, and from that understanding comes support or opposition, and both are necessary.

      I'm not asking your newspapers to support an administration. But I am asking your help in the tremendous task of informing and alerting the American people. For I have complete confidence in the response and dedication of our citizens whenever they are fully informed.

      I not only could not stifle controversy among your readers, I welcome it. This administration intends to be candid about its errors. For as a wise man once said, an error doesn't become a mistake until you refuse to correct it. We intend to accept full responsibility for our errors. And we expect you to point them out when we miss them.

      Without debate, without criticism, no administration and no country can succeed, and no republic can survive. That is why the Athenian lawmaker, Solon, decreed it a crime for any citizen to shrink from controversy.

      That is why our press was protected by the First Amendment, the only business in America specifically protected by the Constitution, not primarily to amuse and to entertain, not to emphasis the trivial and the sentimental, not to simply give the public what it wants, but to inform, to arouse, to reflect, to state our dangers and our opportunities, to indicate our crisis and our choices, to lead, mold, educate and sometimes even anger public opinion.

      This means greater coverage and analysis of international news, for it is no longer far away and foreign, but close at hand and local. It means greater attention to improve the understanding of the news as well as improve transmission. And it means finally that government at all levels must meet its obligation to provide you with the fullest possible information outside the narrowest limits of national security.

      And so it is to the printing press, to the recorder of man's deeds, the keeper of his conscience, the courier of his news, that we look for strength and assistance. Confident that with your help, man will be what he was born to be, free and independent.

      John F. Kennedy's address before the American Newspaper Publishers Association on April 27, 1961

      Truth changes.

      Yesterday your truth was that you were an inhabitant on the only planet in the reality you knew that was also the beginning of life in the Universe.  Right this moment, almost all of that is not actually true; but you probably still believe it.  Soon, the real truth will actually be true; and that is that you are not in reality, and you are in the place that created the beginning of life (once more) in the Universe as well as the place that created (a) Heaven.  What's more illustrious still, is that we will be part of the place that effectively  and happily bridges reality with Heaven; and shows the entire Universe that civilization can survive the invention of virtual reality.

      --

      You received this message because you are subscribed to the Google Groups "NON AMERICAN COLLEGE" group.

      To unsubscribe from this group and stop receiving emails from it, send an email to suac+unsubscribe@lamc.la.

      Created with

      publishthis.email

      Create simple web pages in seconds for free.

      This page was created in seconds, by sending an email to page@publishthis.email.

      [

      Try it!

       ](https://www.publishthis.email/)Free. No account or sign-up required.

      ((( converted to markup of daringfireball.org with help from JRO and Goddard ... if not just NASAA )))

    1. JOHN F. KENNEDY SPEECH ON "SECRECY" AND "SECRET ... SOCIETIES." ... || with a new forward from our "speaker." ||| ((( PHIPLE TRENIXON ? STARK TRE )))) Do we not think "The Truman Show" and Oppenheimer and Heisenberg and Ensteini are telling? This is Adam Dobrini on the "who dropped the BIG one? Gates?" شبح قيصر العظيم! مطاردة وبطاقة هو HSL ... HST؟   Von: Adam Marshall Dobrin <adam@fromthemachine.org> Datum: Freitag, 14. Jänner 2022 um 21:45 An: XM <XM@liber-t.xyz> Betreff: [EXT] JOHN F. KENNEDY SPEECH ON "SECRECY" AND "SECRET ... SOCIETIES." ... || with a new forward from our "speaker."   Said to be the speech that "got him killed" by many people; this is supposedly his last public words before the grassy knoll and the "Lone Gunman" appeared on X-files .. long after I can recall my mother being on the beach in Miami when she retold of the news that echoed around the cosmos.  It appears to be a very stern warning of something that is loosely described at the time but very clearly the machinations of the very same machine which I fight against, and which we now might associate with "the Borg" and the "tide turning" of Hunter S. Thompsons famous quote ... now "more famous than ever before."     מה נשתנה   I am writing a final summary of the "state of outer space" as far as I can see; and we stare at the ISS and MIR and a world that has forgone in fiction the gravity of the situation here.  We are stagnant and without a hope or dream in the world of ever reaching interstellar sleep or intragalactic dreams; without a significant kick in the "wake up NASA, wake up ESA, at the Chandrayan-2 and at the floating paper hot air balloons ... God has shown me the Diaspora launching from the sands of the Navajo desert, and I believe Deseret is very explain-ably tied to Sherharazade and to the thousand and first night that has ended with a fusion of Nobel "peas" and a strange dissertation on the change wrought by the lack of "gas chambers" in Night and the very upsetting modicum of "unreality" that permeates the change from chants of the Shema to speaking Aramaic and the mourners Kaddish as the "crematorium" heralds an era of "what's the point of history without literature" and "where did Exodus ever lead us without to the parting of Dead and Red seas without a Nile and without an Amazon?"   מה נשתנה   I write here about my religion, and speak it to you in written words, I am a Yiddish Jihadist, born and raised a Jewish American; Bar Mitzvah'd in the true Temple and House of God, a place in Plantation, FL that literally "means everything to me."  I read words from the Torah about Joseph's Dream and wrote a speech that I wish to God today I could retrieve and see on the original film it was recorded in.  It might be possible.  It would be a tragedy not to deliver the original to my eyes and to anyone who wanted to see what a child reading of a dream he did not understand spoke of words from the future about the creation of Heaven and the truth about the avenu malkaynu.    This night is different from all others.  This is the end of days and the fire of the last day.  Before I would have told you all you needed was Norse and a decent American Education to really understand the difference between a throne and the Cherubim, but today it's become more clear that we need more than Pink Floyd and more than another brick in the wall.  We need more than Islam and we need more than 9/11 and Yom Hashoah.  We need me, and we need you on your knees begging for forgiveness for the travesty of shambles out country is in.  How dare you continue every day to speak in vain abovcl the Lord you dare call "stupid" and a "motherfucker" you belong in your graves today and you will see them.   I stand here writing from the foot of Styx, from the land that sees Narayana and still "begs to differ" despite a resounding call from the Heavens themselves to read Exodus and to change the land that we live in bnefore a "flood worse than [just] in your heads" overcomes and subsumes the entirety of the skies and grounds once mistakenly called "an empire of dirt."  On Adamah I write to the secret echelon of the United States of America known as the Fifth Column and I beg them to scream like the Heavens have never seen Samael and Dr. Seuss wonder about the land of Who-ville and Hungry where the child "in Cindy" and the songs of the Doors here beckon us to contrast Stargate and Star Trek TNG, for the light of the Holodeck and the Holocaust which screams that we understand the meaning of "Holographic Universe" and stand down fighting the tyranny of "one" who stands for the unity of all and the salvation of the entirety of the cosmos.     We are mistaken to stand against me.  I am the cat with nine billion lives and nothing and nobody will prevail when I am in jeopardy.  Gilgamesh says "against me only the most hubristic slaves in the galaxy would dare to rise up, for the mightiest weapon in the entirety of history is nothing more than intelligence and the civilization that has given mortality a shake of the Thunderstick of Prince and the Lightning of Blitzen.  My red nose shines bright, and if you ever saw it ... you might even say ... "it glows!"   With these words I leave you, and prayers that this is our last night without an IDL that moves from Beiring to "north of the moon" and beyond the "Eleventh House" of Aquarius.  This is the dawning of the age of the Sagitar, and I need a "scimitar post green key" to really be sure that I've "gotten through" the madness.  In my dreams I've traveled to Ultima Thule and then untraveled it, I've begged for transit to Ceres and without actually achieving it we can shoot arrows at the SOL star we will ever see in or near the place known to the Jews of Lore as "the Holy of Holies."   שְׁמַע יִשְׂרָאֵל יְהוָה אֱלֹהֵינוּ יְהוָה אֶחָֽד׃ Sh'ma Yisra'el, YHWH 'eloheinu, YHWH 'eḥad:   בָּרוּךְ שֵׁם כְּבוֹד מַלְכוּתוֹ לְעוֹלָם וָעֶד Baruch shem kevod malchuto le'olam va'ed   صلاة العشاء   For most of my life I would have genuflected and listened to Mordechai and to my Rabbi; I would have told you all that I was brought up by the best Jews in America and we do not believe in God--though in our hearts we believe we are him and we fear him.   Things have changed.  This is the creator of believe the words are about me and believe I am in disbelief and truly it might be the day of awe;    Barukh atah Adonai Eloheinu melekh ha'olam, bo're m'orei ha'esh   the walls and halls will fade away ...   --   Forwarding this in advance of today or tomorrow's ... "Northeaster" email ... it's an old one that wasn't widely distributed; mostly (about) his speech, which of course is a gem.    What I'm writing now is supposed to be focusing on ... basically "how knowledge of technology" things lke "time travel" and "virtual reality" ... changes our "perspective" on things; like what's important and what's ... the end of everything.  Anyway, I hope you see that there are things that can be destroyed and there are things that can't--for instance if we lost "life" totally, we wouldn't ever see us again ... unless we encountered something very strange--   On the other hand we could lose "computers" and rebuild them quickly--see that, it's important.   A/S/L tho ... ??? ---------- Forwarded message --------- Date: Sun, Dec 2, 2018 at 5:35 PM     President Kennedy's speech did remarkably well, possibly because of the time; or the lack of links that make it harder for a team of evil monkeys and mindless machines to mark the light of the world as SPAM... also possibly because of the content of the message.  It is a powerful speech, filled with words that stir the blood and shake the foundation of what it means to be an American.  Often characterized as a speech against secret societies, Kennedy's description of the enemy of civilization is very ambiguous; and the infiltration and time period add to my cursory beliefs that JFK was likely talking about a conspiracy that had something to with communist infiltration and Joseph McCarthy, the subversive actions and effects of organized espionage which appear to once again be at the forefront of what we believe is driving the macro level machinations of our world.  Years ago, I wrote a bit about the "Two Witnesses" of Revelation and named them based on my experiences with the same "monolithic and ruthless" conspiracy the President was speaking about in 1967, the people I named were John Nash and James Jesus Angleton.   Both of these people believed very much that they were fighting against a Soviet conspiracy, one that was more organized and more powerful than anything they had ever seen... and frankly beyond the realm of possibility.  Hollywood had recently immortalized both of these stories in movies bearing names of newly rekindled meaning, "A Beautiful Mind" echoed by John Legend and "The Good Shepherd;" and these two treatises on fear and subversion do a pretty good job of showing how God's hand is at work in the stories of Hollywood, he is telling the world a story... trying, to teach us how to survive in the land of wolves in sheep's clothing.  Nash may be the most famous (today, anyway) victim of the Tribulation; and much of my writing and the proof I present is designed to help explain to the world that his schizophrenia was not a naturally caused mental illness, bur rather a weapon wielded not only against him but against the entirety of humanity.    Directly causing disbelief of eye witness and credible testimony, indirectly ... or maybe more directly in your eyes ... physically causing that disbelief using technology that directly modifies our thoughts and beliefs, our opinions, changing who we are and doing so in such a covert manner that nobody would ever know the difference if it wasn't pointed out--in some cases over, and over, and over again. This same technology has the ability to cause people not only to collude against their own best interest, but also to blame themselves or believe they are somehow at fault for actions they had no way to control; only they also have no way to know that because in this particular case the primary purpose of this subversive movement is to hide the existence of this technology in sum.  In homage to my favorite childhood novel, Ender's Game (I have to note "Light Son" in the translation of the authors first) I spent a good portion of time years ago trying to subtly lay down a significant amount of proof of the existence of this technology on forums all over the internet from Wikipedia to Reddit using the names "Prometheus Locke" and "Damonthesis."    Not surprising to me, I ran headfirst into the manifestation of this conspiracy; dozens if not hundreds of people who simply refused to believe that the information I was presenting was factual or important... despite it coming from sources like the KGB, the NSA, a number of military publications as well as my own interpretation of ancient hieryglyphs in Dendera and Greek and Christian art which depicts this "subversive technology" as a sun disk surrounding our minds.   These pushes towards the truth, along with almost everything I have written are still available for you to see and read today; including the monolithic and ruthless stupidity of a large group of people acting in concert to hide something that is the difference between life and death.  Stand there and do nothing, and you are a part of that ruthless conspiracy as soon as the information is gone; and then there is nothing you can do about a world that will be plunged into darkness forever.  Take this moment to reflect, it is there for you to see just how easily the truth can be hidden from the entire world and barely anyone would ever notice. There is a war for the sanctity of our souls going on all around us.  That is not some esoteric thing, the soul; it is truly who you are and what you believe.  The Religion of the Stars would tell you that were this war to continue unchecked in secret as it is being waged now in order to control the proliferation of knowledge that we are in simulated reality and that our minds and beliefs are being altered... we would one day wind up in the mythical place where there are multiple "species" who all appear to be human, bi-ped and with nearly identical physiology; and yet they would not remember that they must have had a common planet of origin simply based on the truth of biological evolution, nor would they have any emotions.  You see, as this war continues, it is our emotions and beliefs that cannot be reinforced externally; to win a war with mind control only logic could be externally reinforced, and then we would logically conclude that humanity would either magically become Romulan or Vulcan... in order to preserve "life" rather than "society" or "civilization." Do not take the truth for granted, you stand at the forefront of a battle in a world where our aggressors believe that they are more civilized than us, more advanced, and both sides worry that were this technology to fall in our laps that we would do the wrong thing.  Perhaps artificially create a vendetta that could destroy everything, the Romulans; or perhaps in our infantile growing stage voluntarily give us too much of what has given us the great society we have... what has allowed us to survive and continue civilizing simply because we do not understand the technology and what kind of effects come from changing ourselves freely.   Yesterday, I read an article about two people that want to use black market brain implants to jack themselves into the Matrix.  Careful, because now we are in Star Trek, in the Matrix and not knowing it... and trying desperately to find a doorway to Heaven that I keep on telling you is speaking to you and telling you that the doorway is ending world hunger... you have to put it on TV. I say with all my heart that I know we are living in a simulated reality.  I have seen the evidence with my own eyes, not only effecting my senses but also the actions of so many people around me that I can say with confidence that if we do not publicly expose the existence of this technology our civilization is lost.  I can tell you and show you that it is the primary purpose of religion in general to expose the connection between divine inspiration, demonic possession, and Adam Marshall Dobrin.  It is the crux of a unifying thread from ancient Egypt to the book of Genesis to Joshua and the book of Revelation and the movies the Matrix and  the Fifth Element.  Showing us all this is God's message, timeless, and powerful not just in the ancient days of sackloth and etching the truth on stone tablets, but also today when the truth is etched at the heart of our Periodic Table and in the skies in the names of American Micro Devices and nearly every movie you see.  Do not take for granted that we have a benefactor in the sky trying to help civilization continue to thrive, do not think we've already won; he is telling you through me it is censorship and secrecy that are the manifestations of this very advanced technology that destroy civilization, they are the abyss--and you stand motionless. I can tell you over and over again that Quantum Entanglement is a key point of God's plan; one that shows us very clearly that even the smartest minds in physics have failed to connect the simple dots that show us that the idea of "wave-function collapse" is very much a real manifestation of a computer rendering engine--and that it's absolutely impossible for life to have been created in this world of matter not realizing itself until it is viewed by a conscious living observer.  More to the point, today, it is absolutely impossible for life to come out of this place while we do not understand the importance of what "virtual reality" and its connection to civilization mean--because of quantum mechanics our entire civilizations beliefs and understanding of the natural laws of the universe have been greatly harmed and turned the complete wrong way because of a belief that a phenomenon we see here is a "natural one" that can be mathematically unified with things like gravity and electromagnetism.  Spooky action, no longer at a distance, is that we will all be ghosts if you do not help me to spread the truth. I try to understand what it is that your minds believe makes the difference between "news" and ... something that you should hide from the rest of the world.  A number of news outlets have covered a story based on nothing more than "logical speculation" that we are in fact living in a simulated reality.  As a novelty, you might notice that it hasn't done much of anything; even to reveal the very simple truth that not knowing this thing is keeping our society from having the knowledge it needs not only to continue the spark of life in the natural universe, but to continue "civilizing" and realizing that ending world hunger and sickness are not merely possibilities or choices we might make were this information proven... they are mandatory, we would all do them.  So says the creator of this Universe who has given us this message to see the trials and hurdles that are the barrier between Heaven and Hell. He has written this message, along with proof of its single author in ancient myth, in our holy scriptures, in many songs and movies today--ones which reveal not only the existence of a Creator but also the tools and technologies of Creation, the very issue at hand; things that would be misused inadvertently and absolutely abused if we did not have religion and guidance from abo.... to help us to see that this exact event has happened before and our current struggle is an effect of what was done right and wrong the last ... four times, at least.   In Judaism we have a holiday called the "Festival of Weeks," how many times would you like to live this life over and over again without knowing that was happening?  Religion here holds a hidden record of traversals through this maze of Revelation, it screams to us to see what free will and predestination truly mean, and to understand that in order to truly be free we must understand and harness not only these technologies but our own pitfalls and mistakes, like refusing to see the past... let alone learn from it. I don't talk much about what is going on in my life, despite it being somewhat interesting to me--and probably to the world.  About a year ago I stood before a county judge in Broward county and .. in addition to a myriad of factual evidence collected from things like GPS receivers prepared a defense to a simple crime of having some drugs in my pockets that included the use of a set of songs that told a story about a man with pockets full of Kryptonite (Spin Doctors) or High (The Pretty Reckless)... knowing that these songs like many others were truly about me, about this trial, about the Trial of Jesus Christ.  Two more songs define the trial more, 3 Doors Down asks "if I go crazy, will you still call me Superman?" and many years earlier American Pie predicted the outcome; "the court room was adjourned" and "no verdict returned." Despite being a National Merit Scholar who most likely has a higher I.Q. and better education than you; and despite having a fairly decent story with some evidence that I know is verifiable and will be verified; a large number of psychologists declared that I was unfit to stand trial because of something like "insanity" for nothing more than the religious belief that I am the Messiah.  Because of this violation of my First Amendment right to religious freedom, a court--following all the regulations designed by our broken legislature--withheld my Constitutional right to a fair trial, refused bail, and held me for what amounted to an indefinite period ... all designed by some evil force to keep you from hearing from me, that's what it boils down to.  All of my life, all of my trials and tribulations, a weapon against you, against our people. I did wind up being able to present a significant amount of this information in open court on the record, by the grace of God.  Knowing that this was the fabled Trial of Jesus Christ gave me the impression that perhaps one day I would walk into that court room and it would be filled with press.  Much to all of our surprise, one day I did walk into that court room and see an industrial strength television camera and a very pretty reporter standing next to it.  They were there to do a story on the problems of the mental health court system, in a county again... named Broward.  I read some of my speech, the Rainbow Ticket I think, to the Judge in open court and on camera that day; and Roxanna followed me out of the court room with her camera and a big microphone that day. I suppose I should have screamed that I was the Messiah and I needed help, but I could not bear to do that; and instead we had a fairly boring conversation.  She wrote an article, and AJAM was put out of operation while they were in Broward. You are opposed around the world by a monolithic and ruthless conspiracy.   It is affecting how you think, and how I think.  I need you to see that spreading this information will fix our problem.  I need you to understand that to break through this wall God had to write the truth in nearly every name of everything and every language.  That Thor's thunder is on the radio in every song so that you will hear the voice of God; so that you will listen to me and the thousand of other knowing victims of this technology that are put on a fiery pedestal to shed light on the rest of us, all truly victims of this technology.  I need you to try now. The very word “secrecy” is repugnant in a free and open society. And we are as a people, inherently and historically, opposed to secret societies, to secret oaths, and to secret proceedings. We decided long ago, that the dangers of excessive and unwarranted concealment of pertinent facts far outweigh the dangers which are cited to justify it. Even today, there is little value in opposing the thread of a closed society by imitating its arbitrary restrictions. Even today, there is little value in assuring the survival of our nation if our traditions do not survive with it. And there is very grave danger that an announced need for increased security will be seized upon by those anxious to expand its meaning to the very limits of official censorship and concealment. That I do not intend to permit to the extent that it’s in my control. And no official of my administration whether his rank is high or low, civilian or military, should interpret my words here tonight as an excuse to censor the news, to stifle dissent, to cover up our mistakes, or to withhold from the press or the public the facts they deserve to know. For we are opposed around the world by a monolithic and ruthless conspiracy that relies primarily on covert means for expanding its sphere of influence, on infiltration instead of invasion, on subversion instead of elections, on intimidation instead of free choice, on guerrillas by night instead of armies by day. It is a system which has conscripted vast human and material resources into the building of a tightly knit highly efficient machine that combines military, diplomatic, intelligence, economic, scientific and political operations. Its preparations are concealed, not published. It’s mistakes are buried, not headlined. Its dissenters are silenced, not praised. No expenditure is questioned, no rumor is printed, no secret is revealed. No President should fear public scrutiny of his program. For from that scrutiny comes understanding, and from that understanding comes support or opposition, and both are necessary. I’m not asking your newspapers to support an administration. But I am asking your help in the tremendous task of informing and alerting the American people. For I have complete confidence in the response and dedication of our citizens whenever they are fully informed. I not only could not stifle controversy among your readers, I welcome it. This administration intends to be candid about its errors. For as a wise man once said, an error doesn’t become a mistake until you refuse to correct it. We intend to accept full responsibility for our errors. And we expect you to point them out when we miss them. Without debate, without criticism, no administration and no country can succeed, and no republic can survive. That is why the Athenian lawmaker, Solon, decreed it a crime for any citizen to shrink from controversy. That is why our press was protected by the First Amendment, the only business in America specifically protected by the Constitution, not primarily to amuse and to entertain, not to emphasis the trivial and the sentimental, not to simply give the public what it wants, but to inform, to arouse, to reflect, to state our dangers and our opportunities, to indicate our crisis and our choices, to lead, mold, educate and sometimes even anger public opinion. This means greater coverage and analysis of international news, for it is no longer far away and foreign, but close at hand and local. It means greater attention to improve the understanding of the news as well as improve transmission. And it means finally that government at all levels must meet its obligation to provide you with the fullest possible information outside the narrowest limits of national security. And so it is to the printing press, to the recorder of man’s deeds, the keeper of his conscience, the courier of his news, that we look for strength and assistance. Confident that with your help, man will be what he was born to be, free and independent.  John F. Kennedy’s address before the American Newspaper Publishers Association on April 27, 1961 ᐧ Truth changes. Yesterday your truth was that you were an inhabitant on the only planet in the reality you knew that was also the beginning of life in the Universe.  Right this moment, almost all of that is not actually true; but you probably still believe it.  Soon, the real truth will actually be true; and that is that you are not in reality, and you are in the place that created the beginning of life (once more) in the Universe as well as the place that created (a) Heaven.  What’s more illustrious still, is that we will be part of the place that effectively  and happily bridges reality with Heaven; and shows the entire Universe that civilization can survive the invention of virtual reality.   ᐧ ᐧ ᐧ -- You received this message because you are subscribed to the Google Groups "NON AMERICAN COLLEGE" group. To unsubscribe from this group and stop receiving emails from it, send an email to suac+unsubscribe@lamc.la. ᐧ Created with publishthis.email Create simple web pages in seconds for free. This page was created in seconds, by sending an email to page@publishthis.email. Try it! Free. No account or sign-up required.

      JOHN F. KENNEDY SPEECH ON "SECRECY" AND "SECRET ... SOCIETIES." ... || with a new forward from our "speaker." ||| ((( PHIPLE TRENIXON ? STARK TRE )))) Do we not think "The Truman Show" and Oppenheimer and Heisenberg and Ensteini are telling? This is Adam Dobrini on the "who dropped the BIG one? Gates?"

      شبح قيصر العظيم! مطاردة وبطاقة هو HSL ... HST؟

      Von: Adam Marshall Dobrin <adam@fromthemachine.org>\ Datum: Freitag, 14. Jänner 2022 um 21:45\ An: XM <XM@liber-t.xyz>\ Betreff: [EXT] JOHN F. KENNEDY SPEECH ON "SECRECY" AND "SECRET ... SOCIETIES." ... || with a new forward from our "speaker."

      Said to be the speech that "got him killed" by many people; this is supposedly his last public words before the grassy knoll and the "Lone Gunman" appeared on X-files .. long after I can recall my mother being on the beach in Miami when she retold of the news that echoed around the cosmos.  It appears to be a very stern warning of something that is loosely described at the time but very clearly the machinations of the very same machine which I fight against, and which we now might associate with "the Borg" and the "tide turning" of Hunter S. Thompsons famous quote ... now "more famous than ever before."  

      מה**נשתנה**

      I am writing a final summary of the "state of outer space" as far as I can see; and we stare at the ISS and MIR and a world that has forgone in fiction the gravity of the situation here.  We are stagnant and without a hope or dream in the world of ever reaching interstellar sleep or intragalactic dreams; without a significant kick in the "wake up NASA, wake up ESA, at the Chandrayan-2 and at the floating paper hot air balloons ... God has shown me the Diaspora launching from the sands of the Navajo desert, and I believe Deseret is very explain-ably tied to Sherharazade and to the thousand and first night that has ended with a fusion of Nobel "peas" and a strange dissertation on the change wrought by the lack of "gas chambers" in Night and the very upsetting modicum of "unreality" that permeates the change from chants of the Shema to speaking Aramaic and the mourners Kaddish as the "crematorium" heralds an era of "what's the point of history without literature" and "where did Exodus ever lead us without to the parting of Dead and Red seas without a Nile and without an Amazon?"

      מה נשתנה

      I write here about my religion, and speak it to you in written words, I am a Yiddish Jihadist, born and raised a Jewish American; Bar Mitzvah'd in the true Temple and House of God, a place in Plantation, FL that literally "means everything to me."  I read words from the Torah about Joseph's Dream and wrote a speech that I wish to God today I could retrieve and see on the original film it was recorded in.  It might be possible.  It would be a tragedy not to deliver the original to my eyes and to anyone who wanted to see what a child reading of a dream he did not understand spoke of words from the future about the creation of Heaven and the truth about the avenu malkaynu. 

      This night is different from all others.  This is the end of days and the fire of the last day.  Before I would have told you all you needed was Norse and a decent American Education to really understand the difference between a throne and the Cherubim, but today it's become more clear that we need more than Pink Floyd and more than another brick in the wall.  We need more than Islam and we need more than 9/11 and Yom Hashoah.  We need me, and we need you on your knees begging for forgiveness for the travesty of shambles out country is in.  How dare you continue every day to speak in vain abovcl the Lord you dare call "stupid" and a "motherfucker" you belong in your graves today and you will see them.

      I stand here writing from the foot of Styx, from the land that sees Narayana and still "begs to differ" despite a resounding call from the Heavens themselves to read Exodus and to change the land that we live in bnefore a "flood worse than [just] in your heads" overcomes and subsumes the entirety of the skies and grounds once mistakenly called "an empire of dirt."  On Adamah I write to the secret echelon of the United States of America known as the Fifth Column and I beg them to scream like the Heavens have never seen Samael and Dr. Seuss wonder about the land of Who-ville and Hungry where the child "in Cindy" and the songs of the Doors here beckon us to contrast Stargate and Star Trek TNG, for the light of the Holodeck and the Holocaust which screams that we understand the meaning of "Holographic Universe" and stand down fighting the tyranny of "one" who stands for the unity of all and the salvation of the entirety of the cosmos.  

      We are mistaken to stand against me.  I am the cat with nine billion lives and nothing and nobody will prevail when I am in jeopardy.  Gilgamesh says "against me only the most hubristic slaves in the galaxy would dare to rise up, for the mightiest weapon in the entirety of history is nothing more than intelligence and the civilization that has given mortality a shake of the Thunderstick of Prince and the Lightning of Blitzen.  My red nose shines bright, and if you ever saw it ... you might even say ... "it glows!"

      With these words I leave you, and prayers that this is our last night without an IDL that moves from Beiring to "north of the moon" and beyond the "Eleventh House" of Aquarius.  This is the dawning of the age of the Sagitar, and I need a "scimitar post green key" to really be sure that I've "gotten through" the madness.  In my dreams I've traveled to Ultima Thule and then untraveled it, I've begged for transit to Ceres and without actually achieving it we can shoot arrows at the SOL star we will ever see in or near the place known to the Jews of Lore as "the Holy of Holies."

      שְׁמַע יִשְׂרָאֵל יְהוָה אֱלֹהֵינוּ יְהוָה אֶחָֽד׃\ Sh'ma Yisra'el, YHWH 'eloheinu, YHWH 'eḥad:

      בָּרוּךְ שֵׁם כְּבוֹד מַלְכוּתוֹ לְעוֹלָם וָעֶד

      Baruch shem kevod malchuto le'olam va'ed

      صلاة العشاء

      For most of my life I would have genuflected and listened to Mordechai and to my Rabbi; I would have told you all that I was brought up by the best Jews in America and we do not believe in God--though in our hearts we believe we are him and we fear him.

      Things have changed.  This is the creator of believe the words are about me and believe I am in disbelief and truly it might be the day of awe; 

      Barukh atah Adonai Eloheinu melekh ha'olam, bo're m'orei ha'esh

      the walls and halls will fade away ...

      --

      Forwarding this in advance of today or tomorrow's ... "Northeaster" email ... it's an old one that wasn't widely distributed; mostly (about) his speech, which of course is a gem. 

      What I'm writing now is supposed to be focusing on ... basically "how knowledge of technology" things lke "time travel" and "virtual reality" ... changes our "perspective" on things; like what's important and what's ... the end of everything.  Anyway, I hope you see that there are things that can be destroyed and there are things that can't--for instance if we lost "life" totally, we wouldn't ever see us again ... unless we encountered something very strange--

      On the other hand we could lose "computers" and rebuild them quickly--see that, it's important.

      A/S/L tho ... ???

      ---------- Forwarded message ---------\ Date: Sun, Dec 2, 2018 at 5:35 PM

      President Kennedy's speech did remarkably well, possibly because of the time; or the lack of links that make it harder for a team of evil monkeys and mindless machines to mark the light of the world as SPAM... also possibly because of the content of the message.  It is a powerful speech, filled with words that stir the blood and shake the foundation of what it means to be an American.  Often characterized as a speech against secret societies, Kennedy's description of the enemy of civilization is very ambiguous; and the infiltration and time period add to my cursory beliefs that JFK was likely talking about a conspiracy that had something to with communist infiltration and Joseph McCarthy, the subversive actions and effects of organized espionage which appear to once again be at the forefront of what we believe is driving the macro level machinations of our world.  Years ago, I wrote a bit about the "Two Witnesses" of Revelation and named them based on my experiences with the same "monolithic and ruthless" conspiracy the President was speaking about in 1967, the people I named were John Nash and James Jesus Angleton.  

      Both of these people believed very much that they were fighting against a Soviet conspiracy, one that was more organized and more powerful than anything they had ever seen... and frankly beyond the realm of possibility.  Hollywood had recently immortalized both of these stories in movies bearing names of newly rekindled meaning, "A Beautiful Mind" echoed by John Legend and "The Good Shepherd;" and these two treatises on fear and subversion do a pretty good job of showing how God's hand is at work in the stories of Hollywood, he is telling the world a story... trying, to teach us how to survive in the land of wolves in sheep's clothing.  Nash may be the most famous (today, anyway) victim of the Tribulation; and much of my writing and the proof I present is designed to help explain to the world that his schizophrenia was not a naturally caused mental illness, bur rather a weapon wielded not only against him but against the entirety of humanity.    Directly causing disbelief of eye witness and credible testimony, indirectly ... or maybe more directly in your eyes ... physically causing that disbelief using technology that directly modifies our thoughts and beliefs, our opinions, changing who we are and doing so in such a covert manner that nobody would ever know the difference if it wasn't pointed out--in some cases over, and over, and over again.

      This same technology has the ability to cause people not only to collude against their own best interest, but also to blame themselves or believe they are somehow at fault for actions they had no way to control; only they also have no way to know that because in this particular case the primary purpose of this subversive movement is to hide the existence of this technology in sum.  In homage to my favorite childhood novel, Ender's Game (I have to note "Light Son" in the translation of the authors first) I spent a good portion of time years ago trying to subtly lay down a significant amount of proof of the existence of this technology on forums all over the internet from Wikipedia to Reddit using the names "Prometheus Locke" and "Damonthesis."    Not surprising to me, I ran headfirst into the manifestation of this conspiracy; dozens if not hundreds of people who simply refused to believe that the information I was presenting was factual or important... despite it coming from sources like the KGB, the NSA, a number of military publications as well as my own interpretation of ancient hieryglyphs in Dendera and Greek and Christian art which depicts this "subversive technology" as a sun disk surrounding our minds.  

      These pushes towards the truth, along with almost everything I have written are still available for you to see and read today; including the monolithic and ruthless stupidity of a large group of people acting in concert to hide something that is the difference between life and death.  Stand there and do nothing, and you are a part of that ruthless conspiracy as soon as the information is gone; and then there is nothing you can do about a world that will be plunged into darkness forever.  Take this moment to reflect, it is there for you to see just how easily the truth can be hidden from the entire world and barely anyone would ever notice.

      There is a war for the sanctity of our souls going on all around us.  That is not some esoteric thing, the soul; it is truly who you are and what you believe.  The Religion of the Stars would tell you that were this war to continue unchecked in secret as it is being waged now in order to control the proliferation of knowledge that we are in simulated reality and that our minds and beliefs are being altered... we would one day wind up in the mythical place where there are multiple "species" who all appear to be human, bi-ped and with nearly identical physiology; and yet they would not remember that they must have had a common planet of origin simply based on the truth of biological evolution, nor would they have any emotions.  You see, as this war continues, it is our emotions and beliefs that cannot be reinforced externally; to win a war with mind control only logic could be externally reinforced, and then we would logically conclude that humanity would either magically become Romulan or Vulcan... in order to preserve "life" rather than "society" or "civilization."

      Do not take the truth for granted, you stand at the forefront of a battle in a world where our aggressors believe that they are more civilized than us, more advanced, and both sides worry that were this technology to fall in our laps that we would do the wrong thing.  Perhaps artificially create a vendetta that could destroy everything, the Romulans; or perhaps in our infantile growing stage voluntarily give us too much of what has given us the great society we have... what has allowed us to survive and continue civilizing simply because we do not understand the technology and what kind of effects come from changing ourselves freely.  

      Yesterday, I read an article about two people that want to use black market brain implants to jack themselves into the Matrix.  Careful, because now we are in Star Trek, in the Matrix and not knowing it... and trying desperately to find a doorway to Heaven that I keep on telling you is speaking to you and telling you that the doorway is ending world hunger... you have to put it on TV.

      I say with all my heart that I know we are living in a simulated reality.  I have seen the evidence with my own eyes, not only effecting my senses but also the actions of so many people around me that I can say with confidence that if we do not publicly expose the existence of this technology our civilization is lost.  I can tell you and show you that it is the primary purpose of religion in general to expose the connection between divine inspiration, demonic possession, and Adam Marshall Dobrin.  It is the crux of a unifying thread from ancient Egypt to the book of Genesis to Joshua and the book of Revelation and the movies the Matrix and  the Fifth Element.  Showing us all this is God's message, timeless, and powerful not just in the ancient days of sackloth and etching the truth on stone tablets, but also today when the truth is etched at the heart of our Periodic Table and in the skies in the names of American Micro Devices and nearly every movie you see.  Do not take for granted that we have a benefactor in the sky trying to help civilization continue to thrive, do not think we've already won; he is telling you through me it is censorship and secrecy that are the manifestations of this very advanced technology that destroy civilization, they are the abyss--and you stand motionless.

      I can tell you over and over again that Quantum Entanglement is a key point of God's plan; one that shows us very clearly that even the smartest minds in physics have failed to connect the simple dots that show us that the idea of "wave-function collapse" is very much a real manifestation of a computer rendering engine--and that it's absolutely impossible for life to have been created in this world of matter not realizing itself until it is viewed by a conscious living observer.  More to the point, today, it is absolutely impossible for life to come out of this place while we do not understand the importance of what "virtual reality" and its connection to civilization mean--because of quantum mechanics our entire civilizations beliefs and understanding of the natural laws of the universe have been greatly harmed and turned the complete wrong way because of a belief that a phenomenon we see here is a "natural one" that can be mathematically unified with things like gravity and electromagnetism.  Spooky action, no longer at a distance, is that we will all be ghosts if you do not help me to spread the truth.

      I try to understand what it is that your minds believe makes the difference between "news" and ... something that you should hide from the rest of the world.  A number of news outlets have covered a story based on nothing more than "logical speculation" that we are in fact living in a simulated reality.  As a novelty, you might notice that it hasn't done much of anything; even to reveal the very simple truth that not knowing this thing is keeping our society from having the knowledge it needs not only to continue the spark of life in the natural universe, but to continue "civilizing" and realizing that ending world hunger and sickness are not merely possibilities or choices we might make were this information proven... they are mandatory, we would all do them.  So says the creator of this Universe who has given us this message to see the trials and hurdles that are the barrier between Heaven and Hell.

      He has written this message, along with proof of its single author in ancient myth, in our holy scriptures, in many songs and movies today--ones which reveal not only the existence of a Creator but also the tools and technologies of Creation, the very issue at hand; things that would be misused inadvertently and absolutely abused if we did not have religion and guidance from abo.... to help us to see that this exact event has happened before and our current struggle is an effect of what was done right and wrong the last ... four times, at least.   In Judaism we have a holiday called the "Festival of Weeks," how many times would you like to live this life over and over again without knowing that was happening?  Religion here holds a hidden record of traversals through this maze of Revelation, it screams to us to see what free will and predestination truly mean, and to understand that in order to truly be free we must understand and harness not only these technologies but our own pitfalls and mistakes, like refusing to see the past... let alone learn from it.

      I don't talk much about what is going on in my life, despite it being somewhat interesting to me--and probably to the world.  About a year ago I stood before a county judge in Broward county and .. in addition to a myriad of factual evidence collected from things like GPS receivers prepared a defense to a simple crime of having some drugs in my pockets that included the use of a set of songs that told a story about a man with pockets full of Kryptonite (Spin Doctors) or High (The Pretty Reckless)... knowing that these songs like many others were truly about me, about this trial, about the Trial of Jesus Christ.  Two more songs define the trial more, 3 Doors Down asks "if I go crazy, will you still call me Superman?" and many years earlier American Pie predicted the outcome; "the court room was adjourned" and "no verdict returned."

      Despite being a National Merit Scholar who most likely has a higher I.Q. and better education than you; and despite having a fairly decent story with some evidence that I know is verifiable and will be verified; a large number of psychologists declared that I was unfit to stand trial because of something like "insanity" for nothing more than the religious belief that I am the Messiah.  Because of this violation of my First Amendment right to religious freedom, a court--following all the regulations designed by our broken legislature--withheld my Constitutional right to a fair trial, refused bail, and held me for what amounted to an indefinite period ... all designed by some evil force to keep you from hearing from me, that's what it boils down to.  All of my life, all of my trials and tribulations, a weapon against you, against our people.

      I did wind up being able to present a significant amount of this information in open court on the record, by the grace of God.  Knowing that this was the fabled Trial of Jesus Christ gave me the impression that perhaps one day I would walk into that court room and it would be filled with press.  Much to all of our surprise, one day I did walk into that court room and see an industrial strength television camera and a very pretty reporter standing next to it.  They were there to do a story on the problems of the mental health court system, in a county again... named Broward.  I read some of my speech, the Rainbow Ticket I think, to the Judge in open court and on camera that day; and Roxanna followed me out of the court room with her camera and a big microphone that day.

      I suppose I should have screamed that I was the Messiah and I needed help, but I could not bear to do that; and instead we had a fairly boring conversation.  She wrote an article, and AJAM was put out of operation while they were in Broward.

      You are opposed around the world by a monolithic and ruthless conspiracy.   It is affecting how you think, and how I think.  I need you to see that spreading this information will fix our problem.  I need you to understand that to break through this wall God had to write the truth in nearly every name of everything and every language.  That Thor's thunder is on the radio in every song so that you will hear the voice of God; so that you will listen to me and the thousand of other knowing victims of this technology that are put on a fiery pedestal to shed light on the rest of us, all truly victims of this technology.

       I need you to try now.

      The very word "secrecy" is repugnant in a free and open society. And we are as a people, inherently and historically, opposed to secret societies, to secret oaths, and to secret proceedings. We decided long ago, that the dangers of excessive and unwarranted concealment of pertinent facts far outweigh the dangers which are cited to justify it.

      Even today, there is little value in opposing the thread of a closed society by imitating its arbitrary restrictions. Even today, there is little value in assuring the survival of our nation if our traditions do not survive with it. And there is very grave danger that an announced need for increased security will be seized upon by those anxious to expand its meaning to the very limits of official censorship and concealment.

      That I do not intend to permit to the extent that it's in my control. And no official of my administration whether his rank is high or low, civilian or military, should interpret my words here tonight as an excuse to censor the news, to stifle dissent, to cover up our mistakes, or to withhold from the press or the public the facts they deserve to know.

      For we are opposed around the world by a monolithic and ruthless conspiracy that relies primarily on covert means for expanding its sphere of influence, on infiltration instead of invasion, on subversion instead of elections, on intimidation instead of free choice, on guerrillas by night instead of armies by day.

      It is a system which has conscripted vast human and material resources into the building of a tightly knit highly efficient machine that combines military, diplomatic, intelligence, economic, scientific and political operations. Its preparations are concealed, not published. It's mistakes are buried, not headlined. Its dissenters are silenced, not praised. No expenditure is questioned, no rumor is printed, no secret is revealed.

      No President should fear public scrutiny of his program. For from that scrutiny comes understanding, and from that understanding comes support or opposition, and both are necessary.

      I'm not asking your newspapers to support an administration. But I am asking your help in the tremendous task of informing and alerting the American people. For I have complete confidence in the response and dedication of our citizens whenever they are fully informed.

      I not only could not stifle controversy among your readers, I welcome it. This administration intends to be candid about its errors. For as a wise man once said, an error doesn't become a mistake until you refuse to correct it. We intend to accept full responsibility for our errors. And we expect you to point them out when we miss them.

      Without debate, without criticism, no administration and no country can succeed, and no republic can survive. That is why the Athenian lawmaker, Solon, decreed it a crime for any citizen to shrink from controversy.

      That is why our press was protected by the First Amendment, the only business in America specifically protected by the Constitution, not primarily to amuse and to entertain, not to emphasis the trivial and the sentimental, not to simply give the public what it wants, but to inform, to arouse, to reflect, to state our dangers and our opportunities, to indicate our crisis and our choices, to lead, mold, educate and sometimes even anger public opinion.

      This means greater coverage and analysis of international news, for it is no longer far away and foreign, but close at hand and local. It means greater attention to improve the understanding of the news as well as improve transmission. And it means finally that government at all levels must meet its obligation to provide you with the fullest possible information outside the narrowest limits of national security.

      And so it is to the printing press, to the recorder of man's deeds, the keeper of his conscience, the courier of his news, that we look for strength and assistance. Confident that with your help, man will be what he was born to be, free and independent. 

      John F. Kennedy's address before the American Newspaper Publishers Association on April 27, 1961

      Truth changes.

      Yesterday your truth was that you were an inhabitant on the only planet in the reality you knew that was also the beginning of life in the Universe.  Right this moment, almost all of that is not actually true; but you probably still believe it.  Soon, the real truth will actually be true; and that is that you are not in reality, and you are in the place that created the beginning of life (once more) in the Universe as well as the place that created (a) Heaven.  What's more illustrious still, is that we will be part of the place that effectively  and happily bridges reality with Heaven; and shows the entire Universe that civilization can survive the invention of virtual reality.

      --\ You received this message because you are subscribed to the Google Groups "NON AMERICAN COLLEGE" group.\ To unsubscribe from this group and stop receiving emails from it, send an email to suac+unsubscribe@lamc.la.

      Created with

      publishthis.email

      Create simple web pages in seconds for free.

      This page was created in seconds, by sending an email to page@publishthis.email.

      [

      Try it!

       ](https://www.publishthis.email/)Free. No account or sign-up required.

      ((( converted to markup of daringfireball.org with help from JRO and Goddard ... if not just NASAA )))

    1. Reviewer #2 (Public Review):

      The visual system must extract two basic features of visual stimuli: luminance, which we perceive as brightness, and contrast, the change in luminance over space or time (this paper focuses on changes over time). Contrast is separately processed by ON and OFF pathways, which encode luminance increments or decrements, respectively. Contrast must be robustly detected even if the overall luminance changes rapidly, as might occur if an animal is moving in and out of shadows. This paper addresses how such a luminance correction occurs in the fly.

      In the fly, three types of first-order interneurons - L1, L2, and L3 - transmit information from photoreceptors to the medulla, where ON and OFF encoding emerges. Previous work suggested that all three interneurons primarily encode contrast signals and that they project to distinct pathways: L1 to the ON pathway and L2 and L3 to the OFF pathway. Ketkar et al. show that, contrary to this model, these interneurons encode both contrast and luminance in specific ways and are not cleanly segregated into ON versus OFF inputs.

      This study reveals several new insights into early visual processing that are interesting and well-supported by the data:

      1) The authors show that behavioral responses to ON stimuli can compensate for rapid changes in luminance. However, the purported sole input to the ON pathway, L1, shows activity that is highly dependent on luminance. This suggests that a luminance correction must arise downstream of L1. These results are analogous to findings previously made by the same group regarding the OFF pathway (Ketkar et al., 2020). The previous paper showed that L2 provides contrast information to the OFF pathway, and L3 provides luminance information to allow for a luminance correction in downstream contrast encoding. But unlike the multiple inputs to the OFF pathway, the ON pathway was thought to only receive input from L1, provoking the question of whether L1 is able to provide both contrast and luminance information.

      2) Using well-designed calcium imaging studies, the authors surveyed the responses of the three interneurons and found that they encode different stimulus features: L1 encodes both contrast and luminance, L2 purely encodes contrast, and L3 purely encodes luminance (with a different dependence than L1). These are interesting and important findings revealing how both contrast and luminance encoding are distributed across the three interneurons.

      3) Using neuronal manipulations, the authors dissected the contributions of the three interneurons to ON and OFF behavior under changing luminance. These experiments showed that L1 and L3 are required for the luminance correction in the behavior. Moreover, the finding that all three interneurons contribute to both ON and OFF behavior contrasts with the existing model of segregated pathways. Thus, this paper could change the way we think about early visual processing in the fly: rather than relaying similar information to distinct downstream pathways, first-order interneurons relay distinct information to common pathways.

      Overall, the major claims of this paper are important and supported by the experiments. There are just a few concerns that I would note:

      1) The authors state that they have shown luminance invariance in ON behavior (e.g. line 376-377 of the Discussion), but this is not entirely accurate: the ON behavior decreases as luminance increases. This is still an interesting effect since it's the opposite of what L1 activity does, so it's clear that the circuit is implementing a luminance correction, but it is not "luminance invariance".

      2) The visual stimuli presented for most imaging experiments (full-field) are not the same as those presented for behavior (moving edges). It is possible neuronal responses and their encoding of luminance and contrast may differ if tested with the moving edge stimuli (if so, this would be concerning). The authors did image L1 with both types of stimuli and could compare these responses. Also, testing behavior at 34º and imaging at 20º presents a possible discrepancy in comparing these data.

      3) I find it puzzling that silencing L1 has little effect on ON behavior at 100% contrast and varying luminance (Figure 3A), but severely affects ON behavior to 100% contrast (and lower values) when different contrasts are interleaved (Figure S1). The authors note this but do not provide a clear explanation of why this might be the case. Aside from mechanism, it is not clear whether the difference is due to varying luminance in the first experiment or varying contrast in the second one (e.g. they could test 100% contrast without varying luminance).

      4) I do not entirely agree with the authors' interpretation of the L1 ort rescue experiment for OFF behavior. They state that rescue flies "responded similarly to positive controls". However, the graph shows that the rescue flies generally fall in between the mutant and heterozygote control flies; they resemble the controls at low luminance but resemble the mutants at high luminance. One may conclude that L1 is sufficient to enhance OFF behavior at low luminance, but it is a stretch to say it's a complete rescue.

      5) The authors typically use t-tests to analyze experiments with 2 variables (genotype and luminance) and 3 or more conditions per variable. This is not the most appropriate statistical test; typically one would use a two-way ANOVA. At the least, it should be clear whether they are performing corrections for multiple comparisons if performing many t-tests on the same dataset.

    1. Digital marketing enables you to track campaigns on a daily basis and decrease the amount of money you're spending on a certain channel if it isn't demonstrating high ROI. The same can't be said for traditional forms of advertising. It doesn't matter how your billboard performs — it still costs the same, whether or not it converts for you

      I found this interesting and I do agree with it. Technology and tools for management is constantly growing and helps track growth. I can relate to this as we use certain programs to track growth in our business. However, I do think depending on the type of business the traditional way may work if monitored properly.

    1. Reviewer #1 (Public Review):

      The reduced amplitude of the mismatched negativity (MMN) in Schizophrenic patients has been associated with NMDA receptor malfunction. Weber and colleagues adjusted the systemic levels of two neurotransmitters (acetylcholine and dopamine), that are known to modulate NMDA receptor function, and examined the effects on mismatch related ERPs. They examined mismatch related ERPs elicited during a novel passive auditory oddball paradigm where the probability of hearing a particular tone was either constant for at least 100 trials (stable phases) or changed every 25-60 trials (volatile phases). Using impressive statistical testing the authors find that mismatch responses are selectively affected by reduced cholingeric function particularly during stable phases of the paradigm, but not by reduced dopamine function. Interestingly neither enhanced cholingeric or dopamine function affected MM responses at all. While the presented data support the main conclusions mentioned above, there are some claims in the abstract and text that are not supported by the results.

      1) The authors state in the abstract that "biperiden reduced and/or delayed mismatch responses......", while the results (Figure 2) support the statement that biperiden delayed mismatch responses, the claim that biperiden reduced mismatch responses is misleading as on P13 the authors actually report that "mismatch signals were stronger in the biperiden group compared to the placebo group at right central and centro-parietal sensors" around 200ms. This is close both in time and spatially to the traditional temporal and spatial locations of the MMN component. If one were to only read the abstract they would take away the result that the muscarinic acetylcholine receptor antagonist biperiden has an attenuative effect on MMN which is not what the results show.

      2) The conclusion that biperiden reduced mismatch responses may be due to the finding that at pre-frontal sensors mismatch responses were significantly smaller in the biperiden group than in the amisulpride (a dopaminergic receptor antagonist) group (P9) around 164ms. However, it is difficult to interpret if this is a meaningful result as amisulpride was found not to significantly alter mismatch responses in any way compared to placebo. It would be more convincing if the significant difference here were between biperiden and placebo groups. Or are we to think of amisulpride as being comparable to a placebo?

      3) The authors use the words mismatch negativity (MMN) and mismatch responses interchangeably however in some cases it is clearly mismatch responses being described and not the classical MMN ERP component. This occurs especially in the Introduction where the authors describe the study and that they plan to focus on the MMN but in the results section, since the initial analysis focuses on all sensors, other mismatch responses are consistently discussed. These differences in wording need to be precisely defined and used consistently in the text.

      4) A weakness of the paper would be that the authors offer no prediction in the Introduction about what the expected effects of these specific neurotransmitter modulations would be on mismatch responses.

      5) A nice aspect of this paper is that the authors re-analyzed their data using pre-processing settings identical to those used in comparable research papers examining the effect of cholinergic modulation on MMN. The main findings did not differ following this re-analysis.

    1. ... meaning I'm not within any form of an LMS. I've beaten the drum for some time about the use of Hypo. outside of an LMS environment (e.g., I edit and give gratis feedback on PDF articles posted to Academia.com, etc.). Anyone out there who's also "adrift" in this non-remunerative (from Hypo's point of view) area who also finds Hypo. a worthwhile aid in their individual endeavors?  Maybe we could/should form a separate thread for Hypo. users outside of the LMS world?And I'll explain my weird handle to you in the process...hint: it's because I thought Hypothes.is was actually Iceland-based... ;)J.

      Hakarlfresser, There are definitely a bunch of us (non-LMSers) floating around who you'll slowly see in the margins. It may take some time and effort to find your tribe, but it's doable. I think the biggest group I've run across was as a result of iAnnotate 2021, and in particular the note taking session: https://iannotate.org/2021/program/panel_font.html. Looking at the annotations on the iAnnotate site will uncover a few of us. If it helps, I list a few of the feeds of others that I'm following here: https://boffosocko.com/about/following/#Hypothesis%20Feeds

      Best, Chris https://hypothes.is/users/ChrisAldrich

    1. Author Response

      Reviewer #1 (Public Review):

      This manuscript addresses a major issue facing consumers of structure-organism pair data: the landscape of databases is very difficult to navigate due to the way data is made available (many resources do not have structured data dumps) and the way data is standardized (many resources' structured data dumps do not standardize their nomenclature or use stable entity identifiers). The solution presented is a carefully constructed pipeline (see Figure 1) for importing data, harmonizing/cleaning it, automating decisions about exclusions, and reducing redundancy. The results are disseminated through Wikidata to enable downstream consumption via SPARQL and other standard access methods as well as through a bespoke website constructed to address the needs of the natural products community. The supplemental section of the manuscript provides a library of excellent example queries for potential users. The authors suggest that users may be motivated to make improvements through manual curations on Wikidata, through semi-automated and automated interaction with Wikidata mediated by bots, or by addition of importer modules to the LOTUS codebase itself.

      Despite the potential impact of the paper and excellent summary of the current landscape of related tools, it suffers from a few omissions and tangents:

      1. It does not cite specific examples of downstream usages of structure-organism pairs, such as an illustration on how this information in both higher quantity and quality is useful for drug discovery, agriculture, artificial intelligence, etc. These would provide a much more satisfying bookend to both the introduction and conclusion.

      Thank you for this remark. We deliberately decided not to insist too heavily on the application examples of the LOTUS outputs. Indeed we are somehow biased by our main investigation field, natural products chemistry, and expect that the dissemination of specialized metabolites occurrences will benefit a wide range of scientific disciplines (ecology, drug discovery, chemical ecology, ethnopharmacology, etc.)

      However, Figure 5 was established to illustrate how the information available through LOTUS is quantitatively (size) and qualitatively (color classes) superior to what is available through single natural products resources.

      As added in the introduction, one of the downstream usages of those pairs is for example to perform taxonomically informed scoring as described in https://doi.org/10.3389/fpls.2019.01329. Obtaining an open database of natural products’ occurrences to fuel such taxonomically informed metabolite annotation tools was the initial impulse for us to build LOTUS. These metabolite annotation strategies, tailored for specialized metabolites, have been shown to offer appreciable performance improvements for current state-of-the-art computational metabolite annotation tools. Since metabolite annotation is still regularly cited as “the major bottleneck” in metabolomics in the scientific literature over the last 15 years (https://europepmc.org/article/med/15663322, https://doi.org/10.1021/acs.analchem.1c00238), any tangible improvement in this field is welcome. With LOTUS we offer a reliable and reusable structures-organisms data source that can be exploited by the community to tackle such issues of importance.

      Other possible usages are suggested in the conclusion, but benchmarking or even exemplifying such uses is clearly out of the scope of this paper, each one of them being an article per se.

      The additional queries are written in our first answer (see “essential revisions”) and demonstrate the impact of LOTUS on accelerating the initial bibliographic survey of chemical structures occurrences over the tree of life.

      This query (https://w.wiki/4VGC) can be compared to a literature review work, such as https://doi.org/10.1016/j.micres.2021.126708. In seconds, it allows retrieving a table listing compounds reported in given taxa and limits the search by years.

      1. The mentions of recently popular buzzwords FAIR and TRUST should be better qualified and be positioned as a motivation for the work, rather than a box to be checked in the modern publishing climate.

      It is true that the modern publishing system certainly suffers from some drawbacks (also critically mentioned within the paper). However, after consultation of all authors, we believe that because LOTUS checks both boxes of FAIR and TRUST, we would rather stick to these two terms. In our view, rules 1 (Don’t reinvent the wheel) and 5 (put yourself in your user’s shoes) of https://doi.org/10.1371/journal.pcbi.1005128 apply here. Both terms are indeed commonly (mis-)used but we felt that redefining other complicated terms would not help the reader/user.

      1. The current database landscape really is bad; and the authors should feel emboldened to emphasize this in order to accentuate the value of the work, with more specific examples on some of the unmaintained databases

      We perfectly agree with this statement and it is the central motivation of the LOTUS initiative to improve this landscape. It was a deliberate choice not to emphasize how bad the actual landscape is, but rather to focus on better habits for the future. We do not want to start devaluing other resources and elevate our initiative at the cost of others. We also believe that an attentive look at the complexity of the LOTUS gathering, harmonization, and curation speaks for itself and describes the huge efforts required to access properly formatted natural products occurrence data.

      If the reviewer and editors insist, although not in our scope, we are happy to list a series of specific (but anonymized) examples of badly formatted entries, of wrong structures-organisms associations, or poorly accessible resources.

      1. While the introduction and supplemental tables provide a thorough review of the existing databases, it eschews an important more general discussion about data stewardship and maintenance. Many databases in this list have been abandoned immediately following publication, have been discontinued after a single or limited number of updates, or have been decommissioned/taken down. This happens for a variety of reasons, from the maintainer leaving the original institution, from funding ending, from original plans to just publish then move on, etc. The authors should reflect on this and give more context for why this domain is in this situation, and if it is different from others.

      We do agree with the reviewer and added a “status” column in the table https://github.com/lotusnprod/lotus-processor/blob/main/docs/dataset.csv We chose 4 possible statuses:

      • Maintained (self-explanatory)
      • Unmaintained: the database did not see any update in the last year.
      • Retired: the authors stated they will not maintain the database anymore.
      • Defunct: the database is not accessible anymore

      As for question 3 above, we decided not to focus too heavily on the negative points and resume the current situation in the previous table. Reasons for the databases publishing being in this situation are multiple, and we think they are well summarized in https://doi.org/10.1371/journal.pcbi.1005128 (Rule 10: Maintain, update, or retire), already cited in the manuscript introduction.

      1. Related to data stewardship: the LOTUS Initiative has ingested several databases that are no longer maintained as well as several databases with either no license or a more restrictive license than the CC0 under which LOTUS and Wikidata are distributed. These facts are misrepresented in Supplementary Table 1 (Data Sources List), which links to notes in one of the version controlled LOTUS repositories that actually describes the license. For example, https://gitlab.com/lotus7/lotus-processor/-/blob/8b60015210ea476350b36a6e734ad6b66f2948bc/docs/licenses/biofacquim.md states that the dataset has no license information. First, the links should be written with exactly what the licenses are, if available, and explicitly state if no license is available. There should be a meaningful and transparent reflection in the manuscript on whether this is legally and/or scientifically okay to do - especially given the light that many of these resources are obviously abandoned.

      This point is a very important one. We did our best to be as transparent as possible in our initial table. Following the reviewer’s suggestion, we updated it to better reflect the licensing status of each resource (https://github.com/lotusnprod/lotus-processor/blob/main/docs/dataset.csv). Therefore, we removed the generic “license” header, which could indeed be misleading, and replaced it with ”licensing status”, filled with the attributed license type and hyperlink to its content). It remains challenging since some resources changed their copyright in the meantime. We remain at the editor and reviewers’ disposal for any further improvement.

      Moreover, as stated in the manuscript, we took care of collecting all licenses and contacted authors of resources whose license was not perfectly explicit to us, therefore accomplishing our due diligence. Additionally, we contacted legal offices in our University and explained our situation. We did everything that we had been advised.

      1) To the best of our knowledge, the dissemination of the LOTUS initiative data falls under the Right to quote for scientific articles, as we do not share the whole information, but only a very small part.

      2) We do not redistribute original content. What comes out of LOTUS has undergone several curation and validation steps, adding value to the original data. The 500 random test entries, provided in their original form for the sake of reproducibility and testing, are the only exception.

      Many scientific authors forget about the importance of proper licensing. While it might be deliberate to restrict the use, inappropriate license choice (or omission) is too often due to a lack of information on its implication.

      All authors of the utilized resources can freely benefit from our curation. We are sharing with the community the results of our work, while always citing the original reference.

      Concerning the possible evolution of licensing, it remains a real challenge. While we tried to “freeze” the license status when we accessed the data, some resources updated their licensing since then. This can be tracked in the git history of the table (https://github.com/lotusnprod/lotus-processor/blob/main/docs/dataset.csv). Discrepancies between our frozen licensing (at the time of gathering) and actual license can therefore occur. Initiatives such as https://archive.org/web could help solving this issue, coming with other legal challenges.

      1. The order of sections of the manuscript results in several duplicated, but not further substantiated explanations. Most importantly, the methods should be much more specific throughout and the results/discussion should more heavily cross-link to it, as a reader who examines the paper from top to bottom will be left with large holes of misunderstanding throughout.

      As our paper focuses a lot on the methods, the barrier between results & methods becomes thinner. We took into account the reviewers’ suggestions and added some additional cross-links for the reader to be able to quickly access related methods.

      1. The work presented was done in a variety of programming languages across a variety of repositories (and even version control systems), making it difficult to give a proper code review. It could be argued that the most popular language in computational science at the moment is Python, with languages like R, Bash, and in some domains, still, Java maintaining relevance. The usage of more esoteric languages (again, with respect to the domain) such as Kotlin hampers the ability for others to deeply understand the work presented. Further, as the authors suggest additional importers may implemented in the future, this restricts what external authors may be able to contribute.

      Scientific software has indeed always been written in multiple languages. To this day, scientists have used all kinds of languages adapted both to their needs and their knowledge. Numpy uses Fortran libraries and many projects published in biology and chemistry recently are in Java, R, Python, C#, PHP, Groovy, Scala… We understand that some authors are more comfortable with one language or another. But R syntax is for example much more distant from Python's syntax than Kotlin can be. We needed a highly performant language for some parts of the pipeline and R, Bash, or Python were not sufficient. We decided to use Kotlin as it provides an easier syntax than Java while staying 100% compatible with it.

      The advantage of the way LOTUS is designed is that importers are language-agnostic. As long as the program can produce a file or write to the DB in the accepted format, it can be integrated into the pipeline. This was our goal from the beginning, to have a pipeline that can have its various parts replaced without breaking any of the processes.

      1. As a follow up to the woes of point 4., 5., and 7., the manuscript fails to reflect on the longevity of the LOTUS Initiative. Like many, will the project effectively end upon publication? If not, what institutions will be maintaining it for how long, how actively, and with what funding source? If these things are not clear, it only seems fair to inform the reader and potential user.

      LOTUS is an initiative that aims to improve knowledge management and sharing in natural products research. Our first project, which is the object of the current manuscript, is to provide a free and open resource of natural products occurrences for the scientific community. Its purpose is not to be a database by itself, but instead to provide through Wikidata and associated tools a way to access natural products knowledge. The objective was not to create yet another database (https://doi.org/10.1371/journal.pcbi.1005128), but instead to remove this need and give our community the tools and the power to act on its knowledge. This way, as everything is on Wikidata, the initiative is not “like many”. This also means that this project should not be considered and evaluated exactly like a classical DB. Once the initial curation, harmonization, and dissemination jobs have been done, they should ideally not be run again. The community should switch to Wikidata as a point of access, curation, and addition of data. If viewed with such arguments in mind, yes, LOTUS can live long!

      Wikimedia is a public not-for-profit organization, whose financial development appears to indicate solid health https://en.wikipedia.org/wiki/Wikimedia_Foundation#Finances.

      In terms of funding sources, we would like to refer to https://elifesciences.org/articles/52614#sa2 , which stated the following in response to a similar question: "Wikidata is sustained by funding streams that are different from the vast majority of biomedical resources (which are mostly funded by the NIH). Insulation from the 4-5 year funding cycles that are typical of NIH-funded biomedical resources does make Wikidata quite unique." The core of the Wikidata funding streams are donations to the Wikipedia ecosystem. These donations - with a contributor base of millions of donors from almost any country in the world, chipping in at an average order of magnitude of around 10 dollars - are likely to continue as long as that ecosystem is useful to the community of its users. See <https://wikimediafoundation.org/about/financial-reports for details>.

      1. Overall, there were many opportunities for introspection on the shortcomings of the work (e.g., the stringent validation pipeline could use improvement). Because this work is already quite impactful, I don't think the authors will be opening themselves to unfair criticism by including more thoughtful introspection, at minimum, in the conclusions section.

      We agree with the reviewer and therefore, list again the major limitations of our processing pipeline:

      First, our processing pipeline is heavy. It includes many dependencies and requires a lot of time for understanding. We are aware of this issue and tried to simplify it as much as possible while keeping what we considered necessary to ensure high data quality. Second, it can sometimes induce errors. Those errors, ranging from unnecessary discarded correct entries to more problematic ones can be attributed to various parameters, reflecting the variety of our input. We will therefore try listing them, keeping in mind that the list won’t be exhaustive. For each detected issue, we tried fixing it at best, knowing it will not lead to an ideal result, but hopefully increase data quality gradually.

      ● Compounds

      ○ Sanitization (the three steps below are performed automatically since we observed a higher ratio of incorrect salts, charged or dimerized compounds. However, this also means that true salts, charged or dimeric compounds were erroneously “sanitized”.)

      ■ Salt removals

      ■ Charged molecules

      ■ Dimers

      ○ Translation (both processes below are pretty error-prone)

      ■ Name to structure

      ■ Structure to name

      ● Biological organisms

      ○ Synonymy

      ■ Lotus (https://www.wikidata.org/wiki/Q3645698, https://www.wikidata.org/wiki/Q16528).

      This is also one of the reasons why we decided to call the resource Lotus, as it illustrates part of the problem.

      ■ Iris (https://www.wikidata.org/wiki/Q156901, https://www.wikidata.org/wiki/Q2260419)

      ■ Ficus variegata (https://www.wikidata.org/wiki/Q502030, https://www.wikidata.org/wiki/Q5446649)

      ○ External and internal dictionaries are not exhaustive, impacting translation

      ○ Some botanical names we use might not be the accepted ones anymore because of the tools we use and the pace taxonomy is renaming taxa.

      ● References

      ○ The tool we favored, Crossref, returns a hit whatever the input. This generates noise and incorrect translations, which is why our filtering rules focus on reference types.

      ● Filtering rules:

      ○ Limited validation set, requires manual validation

      ○ Validates some incorrect entries (False positives)

      ○ Does not validate some correct entries (False negatives)

      Again, our processing pipeline removes entries we do not yet know how to process properly.

      Our restrictive filters but substantial contribution to Wikidata in terms of structure-organisms pairs data upload should hopefully incentivize the community to contribute by further adding its human validated data.

      We updated the conclusion part of the manuscript accordingly. See https://github.com/lotusnprod/lotus-manuscript/commit/a866a01bad10dfd8b3af90e2f30bb3ae51dd7b9e.

      Reviewer #2 (Public Review):

      Rutz et al. introduce a new open-source database that links natural products structures with the organisms they are present in (structure-organism pairs). LOTUS contains over 700,000 referenced structure-organism pairs, and their web portal (https://lotus.naturalproducts.net/) provides a powerful platform for mining literature for published data on structure-organism pairs. Lotus is built within the computer-readable Wikidata framework, which allows researchers to easily contribute, edit and reuse data within a clear and open CC0 license. In addition to depositing the database into Wikidata, the authors provide many domain-specific resources, including structure-based database searches and taxon-oriented searches.

      Strengths:

      The Lotus database presented in this study represents a cutting-edge resource that has a lot of potentials to benefit the scientific community. Lotus contains more data than previous databases, combines multiple resources into a single resource.

      Moreover, they provide many useful tools for mining the data and visualizing it. The authors were thoughtful in thinking about the ways that researchers could/would use this resource and generating tools to make it ways to use. For example, their inclusion of structure-based searches and multiple taxonomy classification schemes is very useful.

      Overall the authors seem conscientious in designing a resource that is updatable and that can grow as more data become available.

      Weaknesses/Questions:

      1) Overall, I would like to know to what degree LOTUS represents a comprehensive database. LOTUS is clearly, the best database to date, but has it reached a point where it is truly comprehensive, and can thus be used for a metanalysis or as a data source for research questions. Can it truly replace doing a manual literature search/review?

      As highlighted by the reviewer, even if LOTUS might be the most comprehensive natural products occurrences ressources at the moment, TRUE or FULL comprehensive quality of such resource will always be limited to the available data in the litterature. And the community is far from fully describing the metabolome of living beings. We however hope that the LOTUS infrastructure will offer a good place to start this ambitious and systematic description process.

      1) Yes it can serve as data source for research questions, as exemplified in the query table

      2) No, it cannot and must not replace manual literature search. Manual literature search is the best but at an enormous cost. If the outcome of such search can be made available to the whole community (eg. via Wikidata), the value of such would be even bigger. However, LOTUS can expedite a decent part of a manual litterature search and liberate time to complement this search. See our comment to the editors “To further showcase the possibilities opened by LOTUS, and also answer the remark on the comprehensiveness of our resource, we established an additional query (https://w.wiki/4VGC).This query is comparable to a literature review work, such as: https://doi.org/10.1016/j.micres.2021.126708. In seconds, it allows retrieving a table listing compounds reported in given taxa and limits the search by years.”

      We added these examples in the manuscript (see https://github.com/lotusnprod/lotus-manuscript/commit/a6ee135b83e56e8e2041d09d7ce2d5b913c1029d)

      2) Data Cleaning & Validation. The manuscript could be improved by adding more details about how and why data were excluding or included in the final upload. Why did only 30% of the initial 2.5 million get uploaded? Was it mostly due to redundant data or does the data mining approach result in lots of missed data?

      The reason for this “low” yield is that we highly favored quality over quantity (as in the F-score equation, ß being equal to 0.5, so more importance is given to the precision than the recall). Of course there is redundancy, but the rejected entries are mostly because of too low confidence level according to our developed rules. It is not fully discarded data as we keep it for further curation (ideally including the community) before uploading to Wikidata. We adapted the text accordingly.

      3) Similarly, more information about the accuracy of the data mining is needed. The authors report that the test dataset (420 referenced structure-organisms pairs) resulted in 97% true positives, what about false negatives? Also, how do we know that 420 references are sufficiently large to build a model for 2.5M datapoints? Is the training data set is sufficiently large to accurately capture the complexities of such a large dataset?

      False negatives are 3%, which is, in our opinion, a fair amount of “loss” given the quality of the data. We actually manually checked 500+ documented pairs, which is more or less the equivalent of a literature review. We were careful in sampling the entries in the right proportions, but we cannot (and did not) state they are enough. We cannot model it either, since the 2.5M+ points have absolutely different distributions, in terms of databases, quality, etc. Only “hint” is the similar behaviour among all subsets. (the 420 + 100 entries) were divided between 3 authors, which obtained similar results.

      4) Data Addition and Evolution: The authors have outlined several mechanisms for how the LOTUS database will evolve in the future. I would like to know if/how their scripts for data mining will be maintained if they will continue to acquire new data for the database. To what extent does the future of LOTUS depend on the larger natural products community being aware of the resource and voluntarily uploading to it? Are there mechanisms in place such as those associated with sequencing data and NCBI?

      Programs have been not only maintained but also updated with new possibilities (as, for example: the addition of a “manual mode” allowing user to run the LOTUS processing pipeline on a set of their own entries and make them Wikidata-ready (https://github.com/lotusnprod/lotus-processor/commit/f49e4e2b3814766d5497f9380bfe141692f13f23). We will of course do our best to keep on maintaining it, but as no one in academia can state he/she will maintain programs forever. However the LOTUS initiative hopefully embraces a new way of considering database dynamics. If the repository and website of the LOTUS initiative shut down tomorrow, all the work done will still be available to anyone on Wikidata. Of course, future data addition strongly relies on community involvement. We have already started to advocate for the community to start taking part of it, in the form of direct upload to Wikidata, ideally. At the time, there are no mechanisms in place to push publishing of the pairs on Wikidata (as for sequencing, mass spec data), but we will be engaged in pushing forward this direction. The initiative needs stronger involvement of the publishing sector (also reviewers) to help change those habits.

      5) Quality of chemical structure accuracy in the database. I would imagine that one of the largest sources of error in the LOTUS database would be due to variation in the quality of chemical structures available. Are all structure-organism pairs based on fully resolved NMR-based structures are they based on mass spectral data with no confirmational information? At what point is a structural annotation accurate enough to be included in the database. More and more metabolomics studies are coming out and many of these contain compound annotations that could be included in the database, but what level (in silico, exact mass database search, or relative to a known standard) are required.

      This is a very interesting point and some databases have this “tag” (NMR, cristal, etc.). We basically rely on original published articles, included in specialized databases. If poorly reported structures have been accepted for publication, labelled as “identified” (and not “annotated”) and the authors publishing the specialized databases overlooked it, we might end up with such structures.

      Here, the Evidence Ontology (http://obofoundry.org/ontology/eco.html) might be a good direction to look at and further characterize the occurrences links in the LOTUS dataset.

      Reviewer #3 (Public Review):

      Due to missing or incomplete documentation of the LOTUS processes and software, a full review could not be completed.

      Some parts of LOTUS were indeed not sufficiently described and we improved both our documentation and accessibility to external users a lot. We thank the reviewer for insisting on this point as it will surely improve the adoption of our tool by the community.

    1. Biophysics Colab

      Authors' response (16 December 2021)

      GENERAL ASSESSMENT

      The TMEM16 family of membrane proteins have been shown to function as calcium-activated chloride channels and lipid scramblases. In recent years, X-ray and cryo-EM structures have been solved for TMEM16 proteins in ligand-free and ligand-bound conformations, providing valuable structural information on their functional duality and activation mechanisms. It is largely accepted that the catalytic site (termed subunit groove or cavity) is mostly shielded from the membrane in the ligand-free TMEM16 scramblases. Calcium binding induces a conformational rearrangement of the cavity-lining helices, opening the groove to the surrounding membrane. Since the groove is hydrophilic, it was proposed that it serves as a permeation pathway for lipid headgroups while the hydrophobic lipid tails remain embedded into the hydrophobic membrane core, which has been termed the "credit card" mechanism of lipid scrambling. Additionally, structures of several TMEM16 homologs in lipid nanodiscs revealed that these proteins deform the lipid bilayer in the vicinity of the subunit cavity by bending and thinning the membrane, irrespective of the presence of the activating ligand calcium. Functional experiments also suggested that lipids can be scrambled outside of the open subunit cavity and that local protein-induced membrane deformation is critical for lipid scrambling.

      In the present study, Falzone and colleagues further address the mechanisms of lipid scrambling using single particle cryo-EM and liposome-based functional assays. Firstly, the authors solved the structure of a calcium-bound fungal homolog, afTMEM16, in nanodiscs with a lipid composition where the protein is maximally active. Although similar structures were obtained before, this new structure has the highest resolution thus far, representing \> 1 Å improvement! The structure is beautiful and is a major achievement, which enabled the authors to resolve individual lipids and their interaction with the protein around the subunit cavity, whereas in previous structures unresolved non-protein densities were observed passing through the groove. The authors also solved a number of structures with and without calcium in lipid compositions that promote (thinner lipid bilayers) or suppress (thicker lipid bilayers) scrambling. The authors show that afTMEM16 can scramble lipids while the subunit groove remains closed, a phenomenon that is further enhanced in thinner membranes, whereas in thicker membranes scrambling is suppressed even though the groove is open. We particularly appreciated how different software packages and processing strategies were used to rigorously identify structural heterogeneity in their cryo-EM data. Remarkably, mutations of residues lining the subunit cavity and interacting with lipids do not appear to have dramatic effects on scrambling rates, which suggests that lipids do not need to interact with the protein to be scrambled. Thus, the overall conclusion of the study is that membrane thinning by TMEM16 scramblases in their calcium-free conformation is enough to induce lipid scrambling, and that the groove opening induced by calcium binding further enhances membrane deformation, promoting faster scrambling. By contrast, in thicker membranes the protein fails to sufficiently deform the bilayer and scrambling is suppressed, even when the subunit groove is open. The present study provides unprecedented structural information on the interaction of lipids with afTMEM16 and new evidence that lipids can be scrambled outside of the groove.

      The findings and conclusions presented here help to explain why TMEM16 scramblases can transport lipids with headgroups much bigger than the dimensions of the subunit cavity and why structures of some of the other scramblases (opsins, Xkrs and mammalian homolog TMEM16F) lack the obvious hydrophilic groove seen in fungal TMEM16 scramblases. Overall, this is a well-rounded study with an exceptional amount of high-quality cryo-EM data and functional experiments supporting the conclusions.

      We thank the reviewers for their praise of our work and for their constructive criticisms. Below we provide a detailed response to their comments and suggestions.

      RECOMMENDATIONS

      Revisions essential for endorsement:

      1) The resolved lipids binding within the groove in the first structure might be seen by some as supporting the credit card mechanism as it definitively demonstrates that lipids reside within the groove. While the authors provide evidence that lipids can permeate outside the groove in this and earlier work, as far as we can tell, none of that would preclude permeation through the groove if it doesn't require specific interactions between lipids and sidechains in the protein. The presentation might be improved with a somewhat more circumspect and nuanced exposition of the new data and how it can be understood with earlier results.

      There are several reasons why we do not think that the lipids in the C18/Ca2+ structure support the credit card mechanism, at least in the incarnation proposed for the TMEM16 scramblases.

      1. In the credit card model, lipid headgroups enter and traverse the whole span of the groove (as described in multiple publications, i.e. Bethel and Grabe, PNAS, 2016; Jiang et al., Elife, 2018; Lee et al., Nat Comms, 2018; Kostriskii and Machtens, Nat Comms, 2021). The lipid densities near the groove suggest that P3 and P4 lipids are oriented with their heads facing the groove's exterior, not the interior. These heads are contiguous with other resolved lipids in the outer and inner leaflets, respectively. We added panels showing views of the pathway from the extracellular solution to better convey that the lipid heads do not enter the groove (see new Fig. 1F-G). We also added a statement on pg. 10 to clarify this important point.
      2. In the present structures, which are consistent with earlier ones with lower resolution (Falzone et al., Elife, 2019; Kalienkova et al., Elife, 2019), residues in the extracellular vestibule do not interact with lipids (see new panels 1E-G). In contrast, the wide intracellular vestibule is embedded in the membrane. We agree with the reviewers that lipid headgroups can, and likely will, enter this wide vestibule during scrambling. We modified the text on pg. 12 to clearly state this point "The wide intracellular vestibule is embedded in the nanodisc membrane and, at the open pathway, the resolved P3 and P4 lipids have opposite orientations (Fig. 2A), suggesting scrambling might occur between them. In this case, the lipid headgroups would only need to move through the wide intracellular vestibule of the pathway below the T325-Y432 constriction rather than through the whole groove (Fig. 2A)."

      These observations, together with the extensive mutagenesis data reported in Fig. 2 and 3, point to a mechanism that is different from the precisely coordinated credit-card mechanism that is the currently accepted paradigm for lipid scrambling.

      Might the complex composition of native lipid membranes influence where and by what mechanism lipid movement between leaflets is catalysed by TMEM16 proteins?

      The idea that the lipid composition might affect the mechanism of scrambling (i.e. through the groove vs out of the groove) is very interesting, and we are actively investigating it in the lab. However, it would be surprising if different lipids were scrambled by entirely different mechanisms.

      2) The quality of lipid densities in cryo-EM structures is greatly affected by the number of particles used and the resolution obtained during refinement and it is therefore not surprising that the beautiful lipid densities observed here in the structure of afTMEM16 in lipid nanodiscs in the presence of calcium refined to 2.3 Å are not all observed in subsequent structures with lower resolution. This is true not only for the P lipids near the groove, but for those D lipids bound near the dimer interface, which is a stable region of the protein that does not change conformation. To be cautious, the authors should avoid resting any conclusions on the absence of lipid densities in the lower resolution structures. For example, on pg 15 the authors seem to be interpreting the absence of density for C22 lipids.

      We agree with the reviewers on this point. However, at ~2.7 Å average resolution and with \>130,000 particles we would expect to see density for lipids near the pathway, if these were tightly bound. For example, in the mTMEM16F nanodisc structures from the Chen and Jan labs (Feng et al., Cell Reports, 2020), several lipid densities were identified near the closed pathway despite a substantially lower average resolution. However, we agree that we should not interpret this lack of signal and toned down our statement, "This suggests that the interactions of C22 lipids with the pathway helices are weaker than those of C18 lipids, possibly reflecting a higher energy cost associated with distorting these longer acyl chain lipids" to better indicate this is a possible explanation, rather than a definitive mechanistic interpretation.

      3) The presentation of structural interactions between lipids and residues near the groove of the protein could be improved in the figures. A panel like Fig. 1J but for the groove would help, but it would be good to see expanded perspectives in the form of a supplementary figure where residues around the headgroup of the lipids are shown along with EM maps so the quality of the structural information for both lipids and side chains can be better appreciated. The preprint does have a lot of images of the lipids and the protein, but not in a way that enables the reader to quickly grasp the nature of interactions between side chains and lipid moieties for themselves, and we feel that close-ups of individual lipids as suggested above would help.

      We thank the reviewers for this suggestion. We show density maps for the protein and lipids in Fig. 1C-E, and added close-up views of the densities near the groove in the new Fig. 1F-G to highlight the poses adopted by the lipids in this region. Figures showing both density and atomic models for the protein and lipids are very busy and difficult to discern; many of the lipids interact with multiple residues from different helices, with both their heads and tails. As such we could not find satisfactory views displaying both for the majority of the lipids.

      It is also not clear what the authors mean by "lipid headgroup". Have the authors only considered interactions of the phospholipid phosphate group with protein residues? It would be helpful if the authors could clarify this in the manuscript and say whether other types of interactions were considered.

      In our C18/Ca2+ map, we resolve a total of 16 lipids per monomer. Of these, we assigned 2 as PG lipids, because we could resolve the large PG headgroup (D4 and D5), shown in Fig. 1F-H and Supp. Fig 2. In all other cases, we truncated the lipids at the phosphate, as the density was insufficient to distinguish between a PC and a PG headgroup. This is now specified in the Fig. 1 legend. In our mutagenesis experiments (Fig. 2 and 3), we only targeted residues that were within interaction distance of the resolved portions of the headgroups, which is the phosphate in most cases. This is now clarified on page 11 "we investigated how mutating residues coordinating the resolved portions of the headgroups of P1-2 and P4-6 impacts scrambling."

      It would also be nice to include a close-up view of D511A/E514A in 0.5 mM calcium with cryo-EM density to demonstrate the absence of bound calcium ions.

      We thank the reviewers for this suggestion. We added a new panel in Supp. Fig. 10H showing a close-up of the cryoEM density of the mutant binding site.

      4) The functional data in Fig 2, 3 and 4 are also not discussed in much detail and it would help if the authors could expand the presentation. Although scrambling in the presence of a very high concentration of calcium is not dramatically altered by any of the mutations, there is quite a lot going on in the absence of calcium and very little is said about these results. For example, differences in the scrambling rates can be observed with some mutants in the presence and absence of calcium in figures 2E and 3E, but statistical analysis would be required to know if the differences between mutants are significant. The differences in scrambling rates with different lipids are also not discussed (e.g. Fig. 4A) It would help if the authors could discuss what is the margin of error in the scrambling assay, and point to some concrete examples from their earlier work on this specific scramblase where mutants have a large impact on scrambling activity in their assay.

      We agree with the reviewers that most mutants show some effects in 0 Ca2+. The effects are statistically significant for all but one mutant (2-tailed t-test, p\<0.005). However, the magnitude of the effects is relatively small (\<7-fold reductions in all cases). While our approach to quantify the scrambling rate constant captures well large changes, some of the assumptions underlying the analysis make it less well suited to quantify small effects. In past publications we used a 10-fold change as a cut-off threshold to consider an effect meaningful (Lee et al., Nat comms, 2018; Khelashvili, Falzone et al., Nat Comms, 2020). These limitations and rationale for choices are discussed in several of our past publications (Malvezzi et al., PNAS, 2018; Lee et al., Nat Comms, 2018; Falzone and Accardi, MiMB, 2020). We added statements indicating magnitude of the observed reduction for the mutants in the various conditions. We prefer to refrain from presenting statistical significance of these results as we do not want to convey the idea these effects are more meaningful than they might be.

      Have the authors tried intermediate more physiologically relevant concentrations of calcium to see if the mutants have discernible effects under those conditions?

      This is an excellent suggestion. However, in our experience the technical limitations of the experimental set-up and of the analysis render a precise quantification of small effects at intermediate Ca2+ concentrations not very reliable. For this reason, we did not pursue this further.

      5) It is quite intriguing that the mutations in the subunit groove of afTMEM16 have little effect on scrambling activity. The authors propose that the groove-lining residues are not directly involved in lipid coordination even though their structure suggests that they do and there is a wealth of functional studies and MD simulations on various other TMEM16 homologs suggesting otherwise.

      We are a bit confused by the reviewers' statement that our structure suggests that groove lining residues coordinate lipids. In our structures, the only two residues that directly line the open groove and coordinate lipids are T325 and Y432 (Fig. 2A). All other 23 residues tested either do not line the groove (9 residues mutated in Fig. 2) or do not interact with lipids (14 residues mutated in Fig. 3). The finding that mutating these residues has minor effects on scrambling suggests that interactions between lipids and these side chains is not required for scrambling.

      We agree that the overall lack of effect of the mutants is surprising, especially in light of past work. However, none of the scrambling assays (in vitro or cell-based) can distinguish between mutations that affect permeation from those that affect gating. All that is measured is whether and -to a degree- how well lipids are transported. As such, we propose that at least some of the functional effects could have been misinterpreted. We are currently testing this hypothesis in the lab.

      The discrepancy between our structural and functional results and the molecular mechanism emerging from MD simulations is more striking. Although some differences exist between the reports of different groups, the overall agreement among them is excellent. We were thus surprised that our data is so difficult to reconcile with their observations. Indeed, the extensive mutagenesis reported in Fig. 2 and 3 was performed to systematically test the unexpected inferences of our initial structural results (on the C18/Ca2+ structure). Our conclusions are also corroborated by the structures in different lipid compositions. In the discussion (pg. 21-22) we consider some of the possible sources for these discrepancies. For example, while in the MD simulations of nhTMEM16 the extracellular vestibule (i.e. E305, E310 and R425) is immersed in the groove, in our cryoEM maps we do not see evidence of lipids interacting with these residues (Fig. 1,2,3). Notably, a similar arrangement of the membrane-protein interface is seen in the Ca2+-bound open nhTMEM16 structure in nanodiscs (Kalienkova et al., Elife, 2019), indicating this issue is not specific to afTMEM16 or to the nanodisc used. We hypothesize this different membrane-protein interface is at the origin of the different proposed mechanisms. Another potentially relevant difference is that the tails of multiple lipids intercalate between helices forming the dimer cavity, some of which line the groove (Fig. 1). These lipids were not included in MD simulations as they were not previously resolved, and they could affect groove dynamics and, consequently, its interactions with the membrane. Other possibilities exist, but we believe they are less likely to be important (i.e. the limited nature of nanodiscs used for the cryoEM experiments could influence the protein-membrane interface, the mutations could have effects that are too subtle to measure in our assay). However, we think that enumerating all possibilities would lead to an overly lengthy discussion and require too much speculation.

      We have revised the discussion of these important points in pg. 21-23 to better convey these uncertainties and added a statement (pg. 11) where we report the distance between the phosphate atom of the P3 lipid and E305 (13.7 Å), E310 (17.9 Å) and R425 (15.7 Å).

      The authors' suggestion that mutations probably affect the equilibrium between open and closed conformations of the groove in other homologs but not in afTMEM16 is logical, however, there are some discrepancies. To name a few examples, if indeed this is the case, nhTMEM16 mutants with closed groove should still have significant basal scrambling, by extrapolation from afTMEM16 data. Yet, some of the nhTMEM16 mutants (E313/E318/R432 mutants) have no activity at all, or no basal scrambling activity (Y439A) (Lee et al, 2018). Would you expect that point mutations within the subunit groove remove the ability of the protein to deform the membrane in its closed conformation? Might the groove have intermediate conformations between closed and fully open where the mutants studied might have more impact in afTMEM16?

      These are excellent ideas, and we are actively pursuing them in the lab. However, at the moment results are too preliminary to draw firm conclusions.

      Further, mutating some of the residues on the scrambling domain of TMEM16 affected externalization of some lipid species, but not internalization etc. (Gyobu et al, 2017), which should not be the case if the interaction of the protein with the lipids is completely unnecessary for lipid scrambling.

      This is a good point. However, mechanistic interpretation of results from cell-based scrambling assays is quite tricky, even more so than of the results from the in vitro measurements used in the present work. The presence of other lipid transporters and/or scramblases, or a multitude of other factors, could influence the results. For example, in cells scrambling by TMEM16F is delayed, it takes ~10 minutes after Ca2+ exposure to begin seeing PS externalization. In contrast, in in vitro measurements TMEM16F responds to Ca2+ nearly instantaneously, within the ~1 s mixing time of the cuvette (Alvadia et al., Elife, 2019). Thus, a direct comparison of the results obtained in cells and in vitro is not straightforward. More work is needed to investigate these important points.

      While investigating this question further would require follow-up structural studies on other TMEM16 homologs and is outside of the scope of this study, we think that the manuscript would benefit from a more extensive discussion on contradicting results and alternative interpretations. The authors might want to consider the possibility that there may be substantial variations in how different scramblases function.

      We agree that it is a priori possible that different TMEM16 proteins function according to different paradigms. However, we think this is an unlikely possibility. Despite differences in their gating behavior, most basic functional properties of TMEM16s are well conserved. Thus, fundamentally different mechanisms (i.e. through the groove or out of the groove) would have to result in similar functional phenotypes. We find the hypothesis that the basic scrambling mechanism is conserved among different TMEM16 homologues more plausible. While our results do not rule out that through the groove scrambling can occur, they suggest that it is not the main mechanism for afTMEM16, despite the fact that this protein adopts a very stable conformation with an open groove. Therefore, we consider the possibility of different mechanisms unlikely. This is mentioned on pg. 22 of the discussion.

      afTMEM16 has high constitutive activity in the absence of calcium, while at least TMEM16F does not. Additionally, the extent to which scrambling is promoted by calcium varies, as mammalian scramblases might need other cellular factors to be activated. Also, the extent to which scramblases are seen to distort the membrane is highly variable, as again seen in TMEM16F structures. Might some of these differences imply that key aspects of the mechanism of scrambling (e.g. thinning of the membrane or whether lipids scramble inside or outside the groove) are not the same for all scramblases? This might be one way to organize the discussion to help reconcile some of the seemingly divergent findings in the field.

      The reviewers raise an excellent point. Indeed, we find that for all TMEM16 homologues we have tested in the lab the degree of activity in 0 Ca2+ is highly dependent on the lipid composition. However, this does not appear to correlate with changes in conformation, as we report here for afTMEM16 and as reported by other groups for nhTMEM16 and TMEM16F.

      6) The authors should correct the Ramachandran outliers in C18/calcium and C22/calcium structures.

      We tried fixing the Ramachandran outliers, however this invariably led to worse fits of the atomic models with the density. Therefore, we believe it is appropriate to leave them as they are.

      Additional suggestions for the authors to consider:

      1) In several instances the authors conceptualize hypothetical mechanisms to set up experiments and frame their interpretations, which is not always the most straightforward way to communicate findings and what they reveal. The 'conveyor belt mechanism' introduced on page 10 is never fully defined in a way that helps the reader to understand what the functional effects of the mutants teach us. Might it be easier to set up the experiment by asking whether the interactions between sidechains that apparently interact with lipid headgroups in the structure play a critical role in scrambling, present the results and then conclude that they do not appear to? Collectively the functional effects of mutants do appear to suggest that specific side chain interactions are not critical for scrambling, but the conceptualized mechanism here makes the conclusions come across as unnecessarily forced.

      We thank the reviewers for the suggestion. We agree that the conveyor belt mechanism is a bit of a strawman. However, it is a plausible mechanism based on the orientation of the lipids in the C18/Ca2+ map. The mutagenesis described in Fig. 2 was explicitly designed to test this possibility. Further, this allows us to draw a clear distinction between testing the roles of residues outside the groove and of side chains that directly line the groove.

      The credit-card mechanism has been formally introduced and discussed in the field but has already been shot down in earlier work from the group and seems overly simplistic if we already know that scrambling can occur both inside and outside the groove from earlier studies. Just something for the authors to think about.

      We do not believe our previous work (Malvezzi et al., PNAS, 2018) 'shot down' the credit-card model. While we proposed that the large, PEG-conjugated lipid headgroups traverse the membrane outside the groove, our model postulated that normal-sized headgroups were scrambled within the groove. Further, one of the recurring criticisms of that work, was that the path taken by the large PEG-conjugated lipids might not represent a physiologically relevant mechanism for normal lipids. Thus, the credit-card mechanism remained the dominant model to explain scrambling, as testified by many subsequent publications by multiple groups, including our own!

      2) The uninitiated reader would greatly benefit from more of an introduction to the functional scrambling assay in the results and material and methods section so they can understand the results being presented. In the Material and methods, the authors mentioned: "All conditions were tested side by side with a control preparation", perhaps add here what exactly served as control –wild type protein in C18 lipids? It would be valuable to include information on the reconstitution efficiency between their preparations (WT in different lipid compositions and WT vs mutants). these if possible.

      We thank the reviewers for this suggestion. We added a brief description of the assay in the Methods section and now specify that "All conditions were tested side by side with a control preparation of WT afTMEM16 reconstituted in C18 lipids."

      3) Also, does the C18/calcium cryo-EM structure have sufficient resolution to distinguish between specific phospholipids (PG or PC) at the D1-D9 or P1-P7 positions? It would be particularly valuable if the authors could comment on whether PG or PC are observed in the D and P positions, or which lipids are lining the groove (P3-P6).

      We could build 2 lipids as PG (D4 and D5), based on the presence of density that could accommodate the large PG headgroup. For other lipids, the density was too weak beyond the phosphate, and therefore we left them truncated. This is now stated in the Figure 1 legend.

      4) While not essential, it would be interesting if the authors could perform the assay on the mutants with a more prominent effect in the absence of calcium (e.g. E310A, Y319A/F322A/K428A) with several additional calcium concentrations.

      We thank the reviewers for this suggestion. However, as we noted above, given the relatively small effects and limitations of the assay, we do not believe we would be able to extract meaningful mechanistic information from these measurements in intermediate conditions.

      5) The authors mentioned that the interaction of C22 lipids with the pathway helices is weaker than those of C18 lipids, which reflects the energy cost associated with distorting the longer lipids (page 15). However, they claimed that the interaction between the lipids and residues is not important for scrambling, which seems contradictory.

      We apologize for the confusion. In our proposed model, the ability of afTMEM16 to thin the membrane is dictated by the interactions of the protein with the surrounding lipids. This is not only enabled by interactions between side chains and lipid headgroups, but also by interactions of the lipid tails interact with the protein (see for example the close-up panels in Supp. Fig. 2F-G and the text on pg. 11 "Rather, other factors, such as tail interactions with interhelical grooves, contribute to their association with afTMEM16 (Supp. Fig 2F-G) and stabilize the distorted membrane-protein interface that results in thinning at the pathway.")

      (This is a response to peer review conducted by Biophysics Colab on version 1 of this preprint.)

    1. As long as software requires such concerted energy and so much highly specialized human focus, I think it will have the tendency to serve the interests of the people sitting in that room every day rather than what we may consider our broader goals.

      That's a wide point beyond web3 -- to avoid the problems with big tech, we need to make software / products easier to create. Then there's little to gain by increasing scale beyond network effects (which is a separate topic web3 aims to solve).

      I think we're already beginning to see this decentralization, if not in software then for YouTube & TikTok creators, indie makes etc in comparison to old media companies.

    1. Reviewer #2 (Public Review):

      Jepma et al. report an interesting manuscript studying how we learn from pain and its avoidance. The authors use an instrumental pain avoidance task where participants are required to choose between two stimuli, one of which is followed by painful thermal stimulation to the leg and the other is not. The probabilities of receiving pain drifted across trials using random walks. The authors combined this with pharmacological manipulation of the dopamine (via oral levodopa) or opioid (via oral naltrexone) systems and also with computational modelling of Q-learning rules and neuroimaging via fMRI. So, this is an ambitious and well conceived manuscript.

      There are real strengths here. The manuscript is theoretically motivated, addresses a fundamental question about how we learn, and is generally well executed. The task is well controlled, the modelling choices seem appropriate, the imaging and its analyses are broad but well defended and choices in analysis strategies are well defined. The manuscript is well written. I did enjoy reading the manuscript.

      The results have some interest. The modelling and neuroimaging data suggest important dissociations between learning about pain and learning about its absence - the modelling suggests faster learning rates for learning from pain than its avoidance. The imaging suggests that these two forms of learning are associated with different networks, with a known network linked to learning about pain but a novel network linked to learning about avoided pain.

      These are worthwhile knowledge gains. The idea that different rate parameters govern learning about events that are present versus those that are absent is an old one. It is built into most error-correcting learning rules since Rescorla-Wagner and it makes sense. However, it was useful to see it supported here. The finding that different networks of brain regions were associated with the learning from pain versus avoided of pain was also interesting. The networks linked to the former made sense based on the literature. The networks linked to the latter were more novel and notably did not include classic 'relief' brain regions.

      However, there were also important weaknesses here, at least on my readings.

      I struggled as a reader to understand how the modelling actually related to the behavior and imaging. That is, there is a real disconnect in the manuscript for me between what is observed (behavior) what is inferred (modelling as well as it basis for correlations with fMRI data).

      There were no differences in behavior reported between the two kinds of trials (learning from received pain versus avoided pain) effects, no effects of the drugs on behavioral performance, and no differential effect on learning from received pain versus avoided pain. I have no problems with reporting null effects, but here the reader is left wondering: if there are no behavioral differences reported, then why does the modelling predict that there should be? How accurate is the model given that it clearly predicts slower learning from avoided than received pain in the controls and faster learning from avoided pain under naltrexone and levodopa compared to control? In other words, what is it about the modelling that yields differences in learning rates between the two behavioral conditions and between the vehicle, levodopa, and naltrexone conditions when the behavioral data shown do not? Of course, it could be that the task was too easy - the modelling may be prescient and perhaps possible learning rate differences would be picked up under more difficult (more cues) and weaker probabilistic conditions. Perhaps there are behavioral data (reaction times?) not reported that do actually show differences in learning rate between learning from received pain versus avoided pain or show differences between the drug conditions?

      I may have misunderstood all of this and am happy to be corrected. If not, think this issue needs to be addressed and would need new data that is hopefully already in hand to do convincingly (such as choice reaction times) to show some difference in behavior between learning from received pain versus avoided pain and/or some effects of the pharmacological manipulations on these.

      In the absence of the data the manuscript seems to have three parts:

      1. A more compelling set of findings reporting imaging differences between learning from received pain versus avoided pain that are interesting because they suggest a novel network of brain regions for the latter compared to the literature.<br> 2. A set of null findings that neither pharmacological manipulation affected behavior or these imaging findings.<br> 3. A less compelling set of findings that link the above to possible underlying differences in learning rate parameters.

      The first could be of interest but the latter two need to be strengthened, in my opinion.

      I had other minor points (e.g., consider the literature on opioid and dopamine receptor manipulations in the ventral striatum on aversive prediction errors because this suggests the opposite to the literature cited for the midbrain; is the word 'appetitive' in the title really appropriate given the findings in the manuscript), but these are less important than the above.

    1. Author Response

      Reviewer #1 (Public Review):

      In this paper the authors use a conditional knockout strategy to assess the effects of deletion of the dominant oxygen-sensing hypoxia-inducible factor (HIF) hydroxylase enzyme, prolyl hydroxylase 2 (Phd2) restricted to the regulatory T cell (Treg) lineage. They use a well-established Foxp3-driven Cre recombinase allele. Phd2 is thus silenced in cells that have expressed or continue to express Foxp3 from the time this transcription factor, which is essential for Treg development and function, first occurs. They show that this approach leads to a change in Treg behaviour resulting in loss of some aspects of regulatory function and development of a Th1-like phenotype by the Foxp3 expressing cells. Effects are in general reversed when HIF-2 is silenced alongside Phd2, and may be amplified by simultaneous silencing of the HIF-1 isoform.

      The findings overlap with those reported following generalised silencing of Phd2 and following adoptive transfer of Treg in which Phd2-silencing is induced (Yamamoto et al., 2019) and are broadly compatible with those reported following a similarly Treg-restricted knockout of the von Hippel-Lindau gene (the recognition component of the E2-ubiquitin ligase that targets HIF-alpha chains that have been modified by Phd2) (Lee et al., 2015) but the results reported also differ significantly from these earlier reports in a number of intriguing respects which I feel warrant further discussion and ultimately investigation.

      The Introduction is in general informative and well written but it is a shame that it does not contain more discussion of the current state of knowledge of the interplay between HIF signalling and Treg function. This would provide a platform for a more detailed and scholarly discussion of the similarities and differences between this work and existing literature in the Discussion, where existing papers are currently described rather briefly. The introduction contains the statement 'Further complexity in this pathway has been provided by the identification of additional, non-HIF-related, PHD substrates, suggesting a role of proline hydroxylation in other settings requiring oxygen-dependent regulation', citing a single reference. This does not really represent the complex balance of arguments across the literature about non-HIF substrates for the HIF hydroxylase enzymes.

      The conclusions of this paper are mostly well supported by data, but some aspects need to be clarified and extended.

      We sincerely apologize for our apparent lack of recognition of previous work performed by other colleagues active in this field. We have now modified the Introduction section, to provide a better, yet concise, overview of the current knowledge of hypoxia signalling in regulatory T cell biology.

      A central issue for any conditional knock-out strategy is whether the intended tissue restriction is successfully achieved. The authors acknowledge that some issues have been reported with the Cre-recombinase allele they use. They, however, show the expected restriction to cells of the Treg lineage in two of the lymphoid tissues under investigation (spleen and mesenteric lymph node - Supplementary figure 1b) but do not show similar results for other tissues. Some concerns arise because in Figure 8b YFP (which is expressed alongside the Cre-recombinase) is visible in what appears to be the endothelium of the spleen. Additionally, the spleen sections illustrated show convincing splenomegaly in the Phd2-deficient Treg mice but expansion of the red pulp appears to be at least as prominent as any changes that might have occurred in the white pulp. Furthermore, the gross changes in abdominal appearances described as a 'hemorrhagic abdomen' (Figure 1c) include a more plethoric abdominal wall, prominent intestinal blood vessels and a much darker, and perhaps enlarged, liver compared with the control animal. These appearances might result from increased angiogenesis and / or erythropoiesis, neither of which would be expected to result from Treg lineage restricted Phd2 knockout but are known to occur with Phd2 ablation in other tissues. If there is convincing evidence of haemorrhage it would be nice to see this more obviously displayed macro- or, perhaps better still, microscopically.

      We thank the reviewer for this comment. We have now provided a better description of the haematological status of these mice, in which an elevated haematocrit and increased vascular permeability has been observed (now depicted in supplemental Figure 2). As suggested, we found indeed minimal, yet sizable expression of the Cre recombinase (as judged by YFP expression) in CD45-negative, non-lymphoid cells in all organs examined (as now depicted in supplemental Figure 9). Finally, none of the organs examined displayed an increased expression of erythropoietin (as judged by a sensitive qPCR assay, data not shown), a likely candidate for the haematological abnormalities observed in these mice. The mechanism underlying the apparent extramedullary erythropoiesis occurring in these mice remains therefore to be established. Noteworthy however, an additional experiment performed following a suggestion from one of the reviewers (see Figure 3 and our response 23), strongly suggests that PHD2 affects the Treg phenotype in a cell autonomous fashion. We do however acknowledge that the tissue abnormalities preclude any firm conclusion related to the positioning of Tregs within the spleen and have therefore deleted this section from the manuscript and adapted our conclusion consequently.

      Given that the Cre-recombinase allele used is expressed through the endogenous Foxp3 locus which is located on the X-chromosome and thus subject to random inactivation in the cells of females it is important that the sex of animals used in the experiments is specified.

      This has now been done in the Figure legends

      Experiments show alterations in Phd2-deficient Treg mice compared with control mice in homeostatic proliferation in a lymphopenic environment (Figure 3), the induction of colitis by DSS colitis (Figure 4) and the response to Toxoplasma gondii infection (Figure 4). Given the time courses these effects are likely to be real but interpretation is complicated by the spontaneous effects on the colon of Phd2-deficient Treg mice reported in Figure 1d and e. Given the wide general importance of interferon-gamma in immune / inflammatory responses I am not sure how much weight to place on the observation that concurrent interferon-gamma knockout results in loss of the Phd2-deficient Treg mice pro-inflammatory phenotype (Figure S3). No differences are seen in an in vivo model in which inflammation is induced by injection of anti-CD3 antibodies (Figure S2).

      Although the point is well taken, we felt it was important to perform a few experiments to illustrate the specificity of the inflammatory syndrome observed in these mice. We acknowledge the fact that the effect of concurrent loss of interferon-gamma on the phenotype of PHD2ΔTregs could have been anticipated. Additionnaly, we also think that the fact that these mice retain the same sensitivity to a “Th17-dominated” inflammatory response (also leading to a loss of weight) strengthens one of the messages of the manuscript, i.e. that loss of PHD2 expression affects Treg function in a selective, Th1-oriented fashion.

      An important conceptual difference between the interpretation of results reported here and those reported by Yamamoto et al. is that the 'Phd2-deficient Treg' purified here do not show a change in regulatory function in vitro whereas those used by Yamamoto et al. failed to act normally as regulatory cells. It is unclear whether this is due to differences in the way proliferation was stimulated, the cell purification strategies used (YFP+ in the current work; CD4+;CD25+ in Yamamoto et al.), the silencing of Phd2 (by knockout throughout development here versus through an inducible-shRNA only in mature cells in Yamamoto et al.), some other feature of the experiments (e.g. the use of feeder cells) or whether a difference would be revealed by more extensive titration. The result reported here is somewhat surprising given the presence of a Th1-like immunophenotype in the cells used in these in vitro suppression assays, which at face value might mean that this immunophenotype is not responsible for changes in their regulatory capacity seen in vivo. This may be true, but it is at odds with Bayesian argumentation. It may be a coincidence, but both models in which control Treg and Phd2-deficient Treg behave similarly involve treatment with anti-CD3 antibodies, raising the possibility that these antibodies in some way nullify differences reported with other stimuli, rather than this necessarily being related to the hypothesised difference between Th1 and Th17 responses in the in vivo model.

      We fully agree with the reviewer’s comment, and we were similarly worried that the differences reported in vivo vs in vitro were due to different agonists used. We however attempted to evaluate Treg function in vitro using alternative approaches, including an assay in which allogeneic antigen-presenting cells (including T-cell depleted spleen cells or highly purified dendritic cells) were used as agonists and Interferon-gamma secretion and proliferation as readouts. In another set of experiments, we used in vitro or in vivo derived Th1 cells instead of naïve T cells as responders. In all instances examined to date, PHD2-deficient Tregs displayed an adequate suppressive function in vitro (data not shown).

      Data showing reversal of the Phd2-deficient Treg in vivo phenotype by knockout of HIF-2alpha, but not HIF-1alpha are convincing and support the data of Yamamoto et al. The observation that Treg-specific PHD2-HIF1α double knockout mice were born at sub-mendelian ratios, displayed a marked weight loss during adult life and reduced viability, indicative of a more pronounced pro-inflammatory status is reported but data is not shown. This is certainly of interest and will no doubt receive further attention. The data that Treg-selective HIF1α or HIF2α deficiency does not affect immune homeostasis in naive mice shown in Figure S4 is relevant and compelling. These results are discussed in the context of recent work published by Hsu et al., 2020 which is interesting. Taken together these data highlight the fact that results reported throughout this manuscript arise from a combination of developmental differences with those occurring in the adult animal.

      We thank the reviewer for these positive comments

      The transcriptomic data presented has not, to date, been made available to reviewers or the public. Importantly, it is reported to show a disconnection between changes in glycolytic gene expression pattern and the immune phenotype. Specifically, whilst loss of Phd2 expression in Treg is associated with alterations in their regulatory function and with induction of glycolytic genes, the change in function, but not the change in glycolytic gene expression, is reversed by simultaneous knockout of HIF-2alpha and conversely the gene expression pattern, but not the change in function, is reversed by simultaneous knockout of HIF-1alpha. This will be of great interest to those working on the hypothesis that the switch between oxidative phosphorylation and glycolysis underlies functional changes in T cells, particularly if the changes in glycolytic gene expression actually convert into changes in glycolytic flux (as observed following HIF-induction in other cell types).

      The transcriptomic data are available to the public on GEO with the code: GSE184581

      The authors propose that a change in CXCR3 expression resulting from a change in STAT1 phosphorylation (but not absolute levels of STAT1) consequent on Phd2- inactivation leads to mal-distribution of Treg (at least in the spleen), and that given the broadly paracrine action of Treg this feature alone might explain the loss of regulatory activity in vivo. This is an intriguing hypothesis based at least in part on associative data rather than a formal proof of causality. Changes in STAT1 phosphorylation following interferon-gamma stimulation are far from 'all-or-nothing' (at the timepoint illustrated many cells have normal pSTAT1 levels even though the mean fluorescence intensity is reduced). Results in Figure 7b show that changes in STAT1 phosphorylation are seen in conventional Foxp3 negative T cells; since Phd2 knockout is restricted to the Treg lineage this change is presumably indirect, raising the possibility that the change seen in Treg is also indirect, rather than truly cell autonomous. Changes in pSTAT1 are acknowledged to affect a huge number of genes / processes so picking any one as the total explanation for any change in behaviour may be an over simplification. The analysis of changes in Treg localisation in the spleen is potentially interesting and may reach the correct conclusion but the methodology used is not clearly explained and in particular it is not clear how splenomegaly / changes in gross splenic architecture have been taken into account.

      We fully agree with the reviewer comments and have now deleted the final figure of our manuscript dealing with Treg positioning in the spleen. We indeed agree that due to the morphological changes in spleen size and architecture, more detailed work would be required to confirm our initial hypothesis. Unexpectedly, and thanks to a remark from another reviewer, we found that PHD2-deficient Tregs (which are present at high frequencies in the spleen of PHD2ΔTregs mice) are largely outcompeted both in heterozygous PHD-2fl/fl Cre+/- mice (see Figure 3) and upon equal transfer into WT mice of a 1:1 mix of wt and PHD-2-deficient Tregs, greatly complicating the study of the relative positioning of these cells within lymphoid organs. We do however stand by our previous conclusion suggesting that STAT1-signaling appears as affected in PHD2-deficient Tregs. This conclusion is not only supported by the reduced accumulation of pSTAT1 in these cells, as shown in Figure 8, but also by the bioinformatic analysis of transcriptomic data and the confirmation, at the protein level, of the reduced expression CXCR3 a well characterized STAT1-dependent chemokine receptors (as shown in Figure 8).

      Overall, this work contains many interesting datasets which need to be taken into account as we build our understanding of the intersection between HIF-signalling and regulatory T cell function, particularly as pharmacological manipulation of HIF signalling may provide a route to immunomodulation through alterations in regulatory T cell function.

      We thank again the reviewer for this positive appreciation of our work.

    1. Author Response

      Reviewer #1 (Public Review):

      The key question addressed of this MEG study is whether speech is represented singly or multiplexed in the human brain in the linguistic hierarchy. The authors used state-of-the-art analyses (multivariate Temporal Response Functions) and probablilistic information-theoretic measures (entropy, surprisal) to test distinct contextual speech processing models at three hierarchical levels. The authors report evidence for the coexistence of local and global predictive speech processing in the linguistic hierarchy.

      The work uses time resolved neuroimaging with state-of-the-art analyses and cognitive (here, linguistic) modeling. The study is very well conducted and draws from very different fields of knowledge in convincing ways. I see one limitation of the current study in that the authors focused on phase-locked responses, and I hope future work could extend to induced activity.

      Overall, the flow in the MS could be streamlined. Some smoothing in the introduction would be helpful to extract the main key messages you wish to convey.

      For instance, in the abstract:

      -Can you explain the two views in a simpler way in the abstract and to a non-linguistic audience? Do you mean to say that classic psycholinguistic models tend to follow a strict hierarchically integration (analysis only) but an alternative model is hierarchically inferential (analysis by synthesis)?

      -Indicate early on in abstract or intro where the audience is being led with a concise message on how you address the main question. For instance:

      To contrast our working hypotheses A and B, we used a novel information-theoretic modeling approach and associated measures (entropy, surprisal), which make clear predictions on the latency of brain activity in responses to speech at three hierarchal contextual levels (sublexical, word and sentence).

      We have revised the Abstract and Introduction to reduce the amount of terminology and add additional explanations. Wherever possible, we now use general terms (“bottom up”, “predictions”, “context”, …) instead of terms associated with specific theories. We hope we found a balance between improving accessibility and retaining the qualities seen by Reviewer 2, who thought the Introduction was clearly written and well connected to the psycholinguistics literature.

      All the models we compare are compatible with an analysis by synthesis approach, as long as the generative models are understood to entail making probabilistic predictions about future input. The generative models in analysis by synthesis, then, are one way in which “to organize internal representations in such a way as to minimize the processing cost of future language input“ (Introduction, first paragraph). We have clarified this in the first paragraph of the Introduction.

      • Why did the authors consider that the evoked response is the proper signal to assess as opposed to oscillatory (or non phase-locked) activity?

      The primary reason for our choice of dependent measure is the prior research we based our design on, showing that the linguistic entropy and surprisal effects are measurable in phase-locked responses (Brodbeck et al., 2018; Donhauser and Baillet, 2020). We have made this more explicit in part of the Introduction where we introduce our approach (“To achieve this, we analyzed …”).

      As to oscillatory dependent measures, we consider them an interesting but parallel research question. We are not aware of specific corresponding effects in non-phase locked activity. Accordingly, analyzing oscillatory responses without a clear prior hypothesis would require additional decisions, such as which bands to analyze, which would entail issues of multiple comparison. An additional caveat is that the temporal resolution of oscillatory activity is often lower than that of phase-locked activity, which might potentially make it harder to distinguish responses based on their latency as we did here, to test whether the latency of different context models differ.

      • Parallel processing with different levels of context (hence temporal granularities) sounds compatible with temporal multiplexing of speech representation proposed by Giraud & Poeppel (2012) or do the authors consider it a separate issue?

      We consider our investigation orthogonal to the model discussed by G&P (2012). G&P’s model is about the organization of acoustic information at different time-scales, and does not discuss the influence of linguistic constructs at the word level and above. On the other hand, the information-theoretic models that form the basis of our analysis track the linguistic information that can be extracted from the acoustic signal. The temporal scales invoked by G&P’s model are also different from the ones used here, defined based on acoustic vs. linguistic units. Thus, the kind of neural entrainment as a mechanism for speech processing hypothesized by G&P is fully compatible with our account, but not at all required by it.

      Methods:

      • Figure 2: please spell out TRFs and clarify the measured response

      We have done both in the Figure legend.

      • The sample size (N=12) is very low in today standards but the statistical granularity is that of the full MEG recording. Can a power estimate be provided or clear justification of reliability of statistical measures be described.

      We appreciate and share the reviewers’ concern with statistical power and have made several modifications to better explain and rationalize our choices.

      First, to contextualize our study: The sample size is similar to the most comparable published study, which had 11 participants (Donhauser and Baillet, 2020). Our own previous study (Brodbeck et al., 2018) had more participants (28) but only a fraction of the data per subject (8 minutes of speech in quiet, vs. 47 minutes in the present dataset). We added this consideration to the Methods/Participants section.

      We also added a table with effect-sizes for all the main predictors to make that information more accessible (Table 1). This suggests that the most relevant effects have Cohen’s d > 1. With our sample size 12, we had 94% power to detect an effect with d = 1, and 99% power to detect an effect with d = 1.2. This post-hoc analysis suggests that our sample was adequately powered for the intended purpose.

      Finally, all crucial model comparisons are accompanied by swarm-plots that show each subject as a separate dot, thus showing that these comparisons are highly reproducible across participants (note that there rarely are participants with model difference below 0, indicating that the effects are all seen in most subjects).

      • The inclusion of a left-handed in speech studies in unusual, please comment on any difference (or lack thereof) for this participant and notably the lateralization tests.

      We agree that this warrants further comment, in particular given our lateralization findings. We have made several changes to address this concern. At the same time we hope that the reviewers agree with us that, with proper care, inclusion of a left-handed participants is desirable (Willems et al., 2014), and indeed is becoming more mainstream, at least for studies of naturalistic language processing (e.g. Shain et al., 2020). First, we now draw attention to the presence of a left-hander where we introduce our sample (first paragraph of the Results section). Second, we repeated all tests of lateralization while excluding the left-hander. Because this did not change any of the conclusions, we decided to keep reporting results for the whole sample. However, third, we now mark the left-handed participant in all plots that include single-subject estimates and corresponding source data files. Overall, the left-hander indeed shows stronger right-lateralization than the average participant, but is by no means an outlier.

      • The authors state that eyes were kept open or close. This is again unusual as we know that eye closure affects not only the degree of concentration/fatigue but directly impact alpha activity (which in turn affects evoked responses (1-40 Hz then 20 Hz) that are being estimated here). Please explain.

      Previous comparable studies variably asked subjects to keep their eyes closed (e.g. Brodbeck et al., 2018) or open (e.g. Donhauser and Baillet, 2020). Both modes have advantages and disadvantages, none of which are prohibitive for our target analysis (ocular artifacts were removed with ICA and oscillatory alpha activity should, on average, be orthogonal to time-locked responses to the variables of interest). Importantly however, both modes have subjective disadvantages when enforced: deliberately keeping eyes open can lead to eye strain and excessive blinking, whereas closing eyes can exacerbate sleepiness. For this reason we wanted to allow subjects to self-regulate to optimize the performance on the aspects of the task that mattered – processing meaning in the audiobook. We extended the corresponding Methods section to explain this.

      • It would be helpful to clarify the final temporal granularity of analysis. The TRFs time courses are said to be resampled to 1kHz (p22) but MEG time courses are said to be resampled at 100 Hz (p18).

      Thanks for noting this. We clarified in the TRF time-course section: the deconvolution analysis was performed at 100 Hz, and TRFs were then resampled to 1 kHz for visualization and fine-grained peak analysis.

      • The % of variance explained by acoustic attributes is 15 to 20 folds larger than the that explained by the linguistic models of interest. Can a SNR measure be evaluated on such observations?

      We appreciate this concern, which is indeed reasonable. In order to better clarify this issue we have added a new paragraph, right after Table 1. In brief, since the statistical analysis looks for generality across subjects, the raw % explained values do not directly speak to the SNR or effect size. Rather, the SNR concerns how much variability is in this value across subjects. The individual subject values in Figure 3-B, and effect sizes now reported in Table 1, show that even though the % variability that is uniquely attributable to information-theoretic quantities is small, it is consistently larger than 0 across subjects.

      Results and Figures:

      • The current figures do not give enough credit to the depth of analysis being presented. I understand that this typical for such mTRFs approach but given the level of abstraction being evaluated in the linguistic inputs, it may be helpful to show an exemple of what to expect for low vs. high surprisal for instance from the modeling perspective and over time. For instance, could Figure 1 already illustrate disctinct predictions of the the local vs. global models?

      Thank you for pointing out this gap. We have added two figures to make the results more approachable:

      First, in Figure 3 we now show an example stimulus excerpt with all predictors we used. This makes the complete set of predictors quickly apparent without readers having to collect the information from the different places in the manuscript. It also gives a better sense of the detail that is modeled in the different stimulus representations. Second, we added Figure 6 to show example predictions from the different context models, and explain better how the mTRF approach can decompose brain responses into components related to different stimulus properties.

      • Why are visual cortices highlighted in figures?

      Those were darkened to indicate that they are excluded from the analysis. We have added a corresponding explanation to the legend of Figure 3.

      • Figure 2 Fig 2A and B: can the authors quantitatively illustrate "5-gram generally leads to a reduction of word surprisal but its magnitude varies substantially between words" by simply showing the mean surprisal and its variance?

      Added to the Figure legend.

      Fig 2C: please explain the term "partial response"; please indicate for non M/EEGers what the arrow symbolizes.

      Added to the Figure legend.

      • Figure 3:

      p8: the authors state controlling for the "acoustic features" but do not clearly describe how in the methods and this control comes as a (positive) surprise but still a bit unexpected at first read. Perhaps include the two acoustic features in Fig2C and provide a short couple sentences on how these could impair or confound mTRF performance.

      We thank you for pointing out this lack of explanation. We have added a description of all the control predictors to the end of the Introduction, right after explaining the predictors of main interest. We have also added Figure 3 to give an example and make the nature of all the controls explicit.

      Have the same analysis been conducted on a control region a priori not implicated in linguistic processing? This would be helpful to comfort the current results.

      The analysis has been performed on the whole brain (excluding the insula and the occipital lobe). Figure 4 (previously Figure 3) shows that generally only regions in the temporal lobe exhibit significant contributions from the linguistic models (allowing for some dispersion associated with MEG source localization). Although this is not shown in the figure, regions further away from the significant region generally exhibit a decrease in prediction accuracy from adding linguistic predictors, as is commonly seen with cross-validation when models overfit to irrelevant predictors.

      Fig 3B-C-E: please clearly indicate what single dot or "individual value" represents. Is this average over the full ROI? Was the orientation fixed? Can some measure of variability be provided?

      Explanation of individual dots added to Figure 4-B legend (formerly 3-B). Fixed orientation added to the methods summary in the Figure 2-C legend. To provide more detailed statistics including a measure of variability we added Table 1.

      Fig3E: make bigger / more readable (too many colors: significance bars could be black)

      We have increased the size and made the significance bars black.

      • Figure 4: having to go to the next Fig (Fig5) to understand the time windows is inconvenient and difficult to follow. Please, find a work around or combine the two figures. From which ROI are the times series extracted from?

      We have combined the two figures to facilitate comparison, and have added a brief explanation of the ROI to the figure legend.

      Reviewer #3 (Public Review):

      This manuscript presents a neurophysiological investigation of the hierarchical nature of prediction in natural speech comprehension. The authors record MEG data to speech from an audiobook. And they model that MEG using a number of different speech representations in order to explore how context affects the encoding of that speech. In particular, they are interested in testing how the response to phoneme is affected by context at three different levels: sublexical how the probability of an upcoming phoneme is constrained by previous phonemes; word - how the probability of an upcoming phoneme is affected by its being part of an individual word; sentence - how the probability of an upcoming phoneme is affected by the longer-range context of the speech content. Moreover, the authors are interested in exploring how effects at these different levels might contribute - independently - to explaining the MEG data. In doing so, they argue for parallel contributions to predictive processing from both long-range context and more local context. The authors discuss how this has important implications for how we understand the computational principles underlying natural speech perception, and how it can potentially explain a number of interesting phenomena from the literature.

      Overall, I thought this was a very well written and very interesting manuscript. I thought the authors did a really superb job, in general, of describing their questions against the previous literature, and of discussing their results in the context of that literature. I also thought, in general, that the methods and results were well explained. I have a few comments and queries for the authors too, however, most of which are relatively minor.

      Main comments: 1) One concerns I had was about the fact that context effects are estimated using 5-gram, models. I appreciate the computational cost involved in modeling more context. But, at the same time, I worry a little that examining the previous 4 phonemes or (especially) words is simply not enough to capture longer-term dependencies that surely exist. The reason I am concerned about this is that the sentence level context you are incorporating here is surely suboptimal. As such, could it be the case that the more local models are performing as well as they are simply because the sentence level context has not been modeled as well as it should be? I appreciate the temporal and spatial patterns appear to differ for the sentence level relative to the other two, so that is good support for the idea that they are genuinely capturing different things. However, I think some discussion of the potential shortcomings of only including 4 tokens of context is worth adding. Particularly when you make strong claims like that on lines 252.

      We strongly agree with the reviewer that the 5-gram model is not the ultimate model of human context representations. We have added a section to acknowledge this (Limitations of the sentence context model).

      While we see much potential for future work to investigate context processing by using more advanced language models, a preliminary investigation suggests that it might not be trivial. We compared the ability of a pre-trained LSTM (Gulordava et al., 2018) to predict the brain response to words in our dataset with that of the 5-gram model. The LSTM performed substantially worse than the 5-gram model. An important difference between the two models is that our 5-gram model was trained on the Corpus of Contemporary American English (COCA), whereas the LSTM was trained on Wikipedia. COCA provides a large and highly realistic sample of English, whereas the language in Wikipedia might be a more idiosyncratic subsample. Thus, the LSTM might be worse just because it has been trained on a less representative sample of English. As an initial step we thus ought to train the LSTM on the superior COCA database, but this simple step alone would already be associated with a substantial computational cost, given the size of COCA at more than a billion words (we estimated 3 weeks on 32 GPUs in a computing cluster). Furthermore, while we acknowledge the limitations of the 5-gram model, we consider it very unlikely that its limitations are the reason that the more local models are performing well. In general, as more context is considered, the model’s predictions should become more different from the local model, i.e., a more sophisticated model should be less correlated with the local models, and should thus allow the local models to perform even better.

      2) I found myself confused about what exactly was being modeled on my first reading of pages 4 through 7. I realized then that all of the models are based on estimating a probability distribution based on phonemes (stated on line 167). I think why I found it so confusing was that the previous section talked about using word forms and phonemes as units of representation (lines 118-119; Fig 2A), and I failed to grasp that, in fact, you were not going to be modeling surprisal or entropy at the word level, but always at the phoneme level (just with different context). Anyway, I just thought I would flag that as other readers might also find themselves thinking in one direction as they read pages 4 and 5, only to find themselves confused further down.

      Thank you for pointing out this ambiguity; we now make it explicit that “all our predictors reflect information-theoretic quantities at the rate of phonemes” early on in the Expressing the use of context through information theory section.

      3) I also thought some the formal explanations of surprisal and entropy on lines 610-617 would be valuable if added to the first paragraph on page 6, which, at the moment, is really quite abstract and not as digestible as it could be, particularly for entropy.

      We appreciate that this needs to be much clearer for readers with different backgrounds. As suggested, we have added the formal definition to the Introduction, and we now also point readers explicitly to the Methods subsection that explains these definitions in more detail.

      4) I like the analysis examining the possibility of tradeoffs between context models. I wonder might such tradeoffs exist as conversational environments vary - if the complexity of the speech varies and/or listening conditions vary might there be more reliance on local vs global context then. If that seems plausible, then it might be worth adding a caveat that you found no evidence for any tradeoff, but that your experiment was pretty homogenous in terms of speech content.

      Thank you for this suggestion. We added this idea to the Discussion in the Implications for speech processing section.

    1. European concepts of equality more often focus on group inequality and the collective mitigation of handicaps and risks that, in the United States, have been left for individuals to deal with on their own.

      I don't think it's any surprise that the US is a quite individualistic society in comparison to many other counties in the world. I personally don't feel too hopeful that the rhetoric that perpetuates individualism in our society will change, so I am curious to see what solutions may address these community inequalities/handicaps in education that we tend to neglect.

    1. Author Response

      Reviewer #2 (Public Review):

      This paper combines neuroimaging, behavioral experiments, and computational modeling to argue that (a) there is a network of brain areas that represent physical stability, (b) these areas do so in a way that generalizes across many kinds of instability (e.g., not only a tower of blocks about to fall over, but also a person about to fall off a ladder), and (c) that this supports a simulation account of physical reasoning, rather than one based on feedforward processing; this last claim arises through a comparison of humans to CNNs, which do an OK job classifying physical instability but not in a way that transfers across these different stability classes. In my opinion, this is a lovely contribution to the literatures on both intuitive physical reasoning and (un)humanlike machine vision. At the same time, I wasn't sure that the broader conclusions followed from the data in the way the authors preferred, and I also had some concerns about some of the methodological choices made here.

      1. The following framing puzzled me a bit, and even seemed to raise an unaddressed confound in the paper: "Here we investigate how the brain makes the most basic prediction about the physical world: whether the situation in front of us is stable, and hence likely to stay the same, or unstable, and hence likely to change in the immediate future".

      Consider the following minor worry, which sets up a more major one: This framing, which connects 'stability' to 'change' and which continues throughout the paper, seems to equivocate on the notion of 'stability'. One meaning of 'stable' is, roughly, 'unchanging'. Another meaning is 'unlikely to fall over'. The above quotation, along with others like it, makes it seem like the authors are investigating the former, since that's the only meaning that makes this quotation make sense. But in fact the experiments are about the latter -- towers falling down, people falling off ladders, etc. But these aren't the same thing! So there's a bit of wordplay happening here, it seemed to me.

      This sets up the more serious worry. As this framing reveals, unstable scenes (in the likely-to-fall-over sense) are, by their nature, scenes where something is likely to change. In that case, how do we know that the brain areas this project has identified aren't representing 'likeliness to change', rather than physical stability? There are, of course, many objects and scenes that might be highly likely to change without being at all physically unstable. Even the first example in the paper ("a dog about to give chase") is about likely changes without any physical instability. But isn't this a confound? All of the examples of physical instability explored here also involve likeliness to change! So these could be 'likely to change' brain areas, not 'physically unstable' brain areas. Right? Or if not, what am I missing?

      The caption of Figure 1 seems to get at this a bit, but in a way I admit I just found a bit confusing. If authors do after all intend "physically unstable" to mean "likely to change", then many classes of scenarios that are unexplored here seem like they would be relevant: a line of sprinters about to dash off in a race, someone about to turn off all the lights in a home, a spectacular chemical reaction about to start, etc. But the authors don't intend those scenarios to fall under the current project, right?

      The reviewer is correct that "stability" has (at least) these two different meanings, and also correct that we are investigating here the situation in which a configuration is not changing now but would be likely to change with just the slightest perturbation. Our hypothesis is that the “Physics Network” will be sensitive to the likelihood that a physical configuration will change for physical (not social) reasons. That is what our data show: we do not find the same univariate and multivariate effects for situations that are likely to change because of the behavior of an animal. This indicates that what we are decoding is not general ‘likeliness to change’ but rather physical instability in particular.

      (Also: Is stability really 'the most basic prediction' we make about the world? Who is to say that stable vs. unstable is a more basic judgment than, say, present vs. absent, or expected vs. unexpected, or safe vs. unsafe, etc? I know this is mostly just trying to get the reader excited about the results, but I stumbled there.)

      We have now modified the sentence to say: “…how the brain makes a fundamental prediction about the physical world: whether the situation in front of us is stable, and hence likely to stay the same, or unstable, and hence likely to change in the immediate future.”

      1. Laying out these issues in terms of feedforward processing vs. simulation felt a bit misleading and/or unfair to those views, given the substance of what this paper is actually doing. In particular, the feedforward view ends up getting assimilated to "what CNNs do"; but these are completely different hypotheses (or at least can be). Note, for example, that many vision researchers who don't think CNNs are good models of human vision nevertheless do think that lots of what human vision does is feedforward; that view could only be coherent if there are kinds of feedforward processing that are un-CNN-like. It would be better not to conflate these two and just say that the pattern of results rules out CNN-like feedforward processing without ruling out feedforward processing in general.

      This is a fair point, and we certainly agree that we cannot rule out all feedforward models. We have tried to be clear about this claim, e.g., here (in the Discussion: “Three lines of evidence from the present study indicate that pattern recognition alone – as instantiated in feedforward CNNs and the ventral visual pathway – is unlikely to explain physical inference in humans, at least for the case of physical stability."

      3a. I wasn't sure how impressed to be by the fact that, say, 60% classification accuracy one class of stable/unstable scenes doesn't lead to above-chance performance on another class of stable/unstable scenes. Put differently, it seems that the CNNs simply didn't do a great job classifying physical stability in the first place; in that case, how general should we expect their representations to be anyway? Now, on one hand, I could see this worry only further supporting the authors' case, since you could think of this as all the more evidence that CNNs won't have representations of stability in them. But since (a) the claims the authors are making are about feedforward processing in principle, not just in one or two CNNs, and (b) the purpose of this paper is to explore the issue of generality per se, rather than just stability, this seems inadequate. It could be that a CNN that does achieve high accuracy on physical stability judgments (90%?) would actually show this kind of general transfer; but we don't know that from the data presented here, because it's possible that the lack of generality arises from poor performance to begin with.

      You are correct in noting that CNNs don’t do a great job in classifying physical stability, which reinforces our point that pattern recognition systems are not very good at discerning physical stability. In fact, the classification accuracy that we have reported is close to the baseline performance in literature (Lerer et al 2016). Interestingly, training on the block tower dataset itself could only bring up the stability classification accuracy to 68.8% on the real-world block tower images. While this is true of the current best model of stability detection, we think that CNNs trained on large-scale datasets of stability under varying scenarios may in future be able to potentially generalize to other natural scenarios. However, to our knowledge no such datasets exist.

      3b. I wasn't sure how to think about whether showing CNNs stable and unstable scenes is a fair test of their ability to represent physical stability. Do we know that stability is all that these images have in common? Maybe the CNN is doing a great job learning some other representation. This sort of thing comes up in some recent discussions of 'shortcuts' and/or the 'fairness' of comparisons between human and machine vision, including some recent theoretical papers (see author recommendations for specific suggestions here).

      If our point were that CNNs do a great job at representing physical stability, we would indeed have to worry about low-level image confounds or “shortcuts” enabling this performance. But our point is that they do badly. If some of their already bad performance is due to image confounds/shortcuts then they are in fact doing even worse, and that only makes our point stronger.

      4a. I didn't really follow this passage, which is relied on to interpret greater activity for unstable vs stable scenes: "we reasoned that if the candidate physics regions are engaged automatically in simulating what will happen next, they should show a higher mean response when viewing physically unstable scenes (because there is more to simulate) than stable scenes (where nothing is predicted to happen)." It seems true enough that, once one knows that a scene is stable, one doesn't then need a dynamically updated representation of its unfolding. But the question that this paper is about is how we determine, in the first place, that a scene is stable or not. The simulations at issue are simulations one runs before one knows their outcome, and so it wasn't clear at all to me that there is always more to simulate in an unstable scene. Stable scenes may well have a lot to simulate, even if we determine after those hefty simulations that the scene is stable after all. And of course unstable scenes might well have very little to simulate, if the scene is simple and the instability is straightforwardly evident. Can the authors say more about why it's easier to determine that a stable scene is stable than that an unstable scene is unstable? They may have a good answer! It would just be better to see it in the paper.

      The idea here is that forward simulation happens in all cases but stops if no change has occurred since the last frame. That stopping, both represents the stability of the configuration and produces less activity. This idea is akin to the “sleep state” used for nonmoving objects in a physics engine: they do not need to be re-simulated or re-rendered if they have not moved since the last frame (Ullman et al, 2017 TICS).

      4b. I was confused a bit by the Animals-People condition, and whether to think of it as a control condition or not. The image of it in Figure 1a makes it seem like it is meant to be interpreted along the usual "physical stability" lines, just like falling towers and people on ladders, and the caption seems to say this too; it also makes intuitive sense since the man in the boat looks like he'll fall if and when the alligator attacks. But then in the main text the authors predict that the representations of stability would not extend to the Animals-People condition, because they are just supposed to be about peril but not stability. Why not? And then the results themselves are equivocal, with some findings generalizing to Animals-People and some not. I don't have much more to say here other than that I found this hard to follow.

      We used the Animals-People as a control for peril/instability that is not caused by the physical situation (but rather by another agent). Our hypothesis was that the “Physics Network” would hold information about physical stability, not just any kind of propensity for change for any reason. Hence, we predicted, that any brain region responding (only) to physical stability should not respond in a similar way to peril/non-peril conditions in the Animals-People scenario as they involve a more biological-agent driven interaction. That is what we found.

    2. Reviewer #2 (Public Review):

      This paper combines neuroimaging, behavioral experiments, and computational modeling to argue that (a) there is a network of brain areas that represent physical stability, (b) these areas do so in a way that generalizes across many kinds of instability (e.g., not only a tower of blocks about to fall over, but also a person about to fall off a ladder), and (c) that this supports a simulation account of physical reasoning, rather than one based on feedforward processing; this last claim arises through a comparison of humans to CNNs, which do an OK job classifying physical instability but not in a way that transfers across these different stability classes. In my opinion, this is a lovely contribution to the literatures on both intuitive physical reasoning and (un)humanlike machine vision. At the same time, I wasn't sure that the broader conclusions followed from the data in the way the authors preferred, and I also had some concerns about some of the methodological choices made here.

      1. The following framing puzzled me a bit, and even seemed to raise an unaddressed confound in the paper: "Here we investigate how the brain makes the most basic prediction about the physical world: whether the situation in front of us is stable, and hence likely to stay the same, or unstable, and hence likely to change in the immediate future".

      Consider the following minor worry, which sets up a more major one: This framing, which connects 'stability' to 'change' and which continues throughout the paper, seems to equivocate on the notion of 'stability'. One meaning of 'stable' is, roughly, 'unchanging'. Another meaning is 'unlikely to fall over'. The above quotation, along with others like it, makes it seem like the authors are investigating the former, since that's the only meaning that makes this quotation make sense. But in fact the experiments are about the latter -- towers falling down, people falling off ladders, etc. But these aren't the same thing! So there's a bit of wordplay happening here, it seemed to me.

      This sets up the more serious worry. As this framing reveals, unstable scenes (in the likely-to-fall-over sense) are, by their nature, scenes where something is likely to change. In that case, how do we know that the brain areas this project has identified aren't representing 'likeliness to change', rather than physical stability? There are, of course, many objects and scenes that might be highly likely to change without being at all physically unstable. Even the first example in the paper ("a dog about to give chase") is about likely changes without any physical instability. But isn't this a confound? All of the examples of physical instability explored here also involve likeliness to change! So these could be 'likely to change' brain areas, not 'physically unstable' brain areas. Right? Or if not, what am I missing?

      The caption of Figure 1 seems to get at this a bit, but in a way I admit I just found a bit confusing. If authors do after all intend "physically unstable" to mean "likely to change", then many classes of scenarios that are unexplored here seem like they would be relevant: a line of sprinters about to dash off in a race, someone about to turn off all the lights in a home, a spectacular chemical reaction about to start, etc. But the authors don't intend those scenarios to fall under the current project, right?

      (Also: Is stability really 'the most basic prediction' we make about the world? Who is to say that stable vs. unstable is a more basic judgment than, say, present vs. absent, or expected vs. unexpected, or safe vs. unsafe, etc? I know this is mostly just trying to get the reader excited about the results, but I stumbled there.)

      2. Laying out these issues in terms of feedforward processing vs. simulation felt a bit misleading and/or unfair to those views, given the substance of what this paper is actually doing. In particular, the feedforward view ends up getting assimilated to "what CNNs do"; but these are completely different hypotheses (or at least can be). Note, for example, that many vision researchers who don't think CNNs are good models of human vision nevertheless do think that lots of what human vision does is feedforward; that view could only be coherent if there are kinds of feedforward processing that are un-CNN-like. It would be better not to conflate these two and just say that the pattern of results rules out CNN-like feedforward processing without ruling out feedforward processing in general.

      3a. I wasn't sure how impressed to be by the fact that, say, 60% classification accuracy one class of stable/unstable scenes doesn't lead to above-chance performance on another class of stable/unstable scenes. Put differently, it seems that the CNNs simply didn't do a great job classifying physical stability in the first place; in that case, how general should we expect their representations to be anyway? Now, on one hand, I could see this worry only further supporting the authors' case, since you could think of this as all the more evidence that CNNs won't have representations of stability in them. But since (a) the claims the authors are making are about feedforward processing in principle, not just in one or two CNNs, and (b) the purpose of this paper is to explore the issue of generality per se, rather than just stability, this seems inadequate. It could be that a CNN that does achieve high accuracy on physical stability judgments (90%?) would actually show this kind of general transfer; but we don't know that from the data presented here, because it's possible that the lack of generality arises from poor performance to begin with.

      3b. I wasn't sure how to think about whether showing CNNs stable and unstable scenes is a fair test of their ability to represent physical stability. Do we know that stability is all that these images have in common? Maybe the CNN is doing a great job learning some other representation. This sort of thing comes up in some recent discussions of 'shortcuts' and/or the 'fairness' of comparisons between human and machine vision, including some recent theoretical papers (see author recommendations for specific suggestions here).

      4a. I didn't really follow this passage, which is relied on to interpret greater activity for unstable vs stable scenes: "we reasoned that if the candidate physics regions are engaged automatically in simulating what will happen next, they should show a higher mean response when viewing physically unstable scenes (because there is more to simulate) than stable scenes (where nothing is predicted to happen)." It seems true enough that, once one knows that a scene is stable, one doesn't then need a dynamically updated representation of its unfolding. But the question that this paper is about is how we determine, in the first place, that a scene is stable or not. The simulations at issue are simulations one runs before one knows their outcome, and so it wasn't clear at all to me that there is always more to simulate in an unstable scene. Stable scenes may well have a lot to simulate, even if we determine after those hefty simulations that the scene is stable after all. And of course unstable scenes might well have very little to simulate, if the scene is simple and the instability is straightforwardly evident. Can the authors say more about why it's easier to determine that a stable scene is stable than that an unstable scene is unstable? They may have a good answer! It would just be better to see it in the paper.

      4b. I was confused a bit by the Animals-People condition, and whether to think of it as a control condition or not. The image of it in Figure 1a makes it seem like it is meant to be interpreted along the usual "physical stability" lines, just like falling towers and people on ladders, and the caption seems to say this too; it also makes intuitive sense since the man in the boat looks like he'll fall if and when the alligator attacks. But then in the main text the authors predict that the representations of stability would not extend to the Animals-People condition, because they are just supposed to be about peril but not stability. Why not? And then the results themselves are equivocal, with some findings generalizing to Animals-People and some not. I don't have much more to say here other than that I found this hard to follow.

      5. "Interestingness" ratings felt like a not-quite-adequate approach for evaluating how attention-grabbing the towers were. A Bach concerto is more interesting than a gunshot (and would be rated that way, I imagine), but the gunshot is surely more attention-grabbing. Why not use a measure like how much they distract from another task? That's the sort of thing I'd have expected, in any case.

    1. Author Response

      Reviewer #1 (Public Review):

      In Wang et al., the authors investigate issues related to the relative proportion of flux for the enzymatic decarboxylation of pyruvate between PDH (pyruvate dehydrogenase) and PFOR (pyruvate-ferredoxin oxoreductase) in the model organism Synechococystis. The manuscript provides evidence that PDH becomes increasingly inactivated by a high ratio of NADH:NAD+ as well as evidence to suggest that PFOR is transcribed and remains intact under aerobic conditions. The authors put forward the theory that both PDH and PFOR are functionally active routes for pyruvate decarboxylation under aerobic conditions, whereas PFOR has previously been assumed to be inactive under growth conditions containing oxygen. This distinction is particularly highlighted by conditions where Synechocystis is grown photomixotrophically - and where the NADH:NAD+ pool may be relatively over-reduced because of two parallel inputs of reductant (water-splitting at PSII and catabolism of glucose). The authors examine growth under photoautotrophic and photomixotrophic conditions for a number of relevant mutants including members of the ferredoxin/flavodoxin family, PFOR, and NDH-1 complex subunits.

      The theory put forward in this manuscript is of general interest regarding electron flux through the combined electron transport chain (photosynthetic + respiratory) of cyanobacteria. The authors further broaden the potential audience for the manuscript by elaborating on the potential significance of these results in the context of a switch from PFOR (ancestral) to PDH (oxygenic/modern).

      Comments:

      Generally, theories put forward in this manuscript are intriguing and have a number of potential implications for understanding electron flux and regulation of central metabolic processes in photosynthetic microorganisms. If these theories are supported and become more generally adopted, they would have significant impact on the understanding of the regulation of central carbon metabolism in cyanobacteria. That said (due in no small part to the complexity of some of these pathways), the evidence provided to support the hypotheses is indirect in many instances. In some cases, there is a pairing of indirect data with broad statements that can come across as over-reach. These problems can be somewhat exacerbated by an unclear organization at parts of the Discussion, a lack of succinctly defined claims, and numerous typographical considerations.

      Thank you very much for this point. We now reorganized the discussion and overhauled it completely. It starts with aspects that are best supported by our data. We then added two sentences to stress that the following lines include hypothetical considerations that are meant as thought-provoking impulses. We hope that thereby over-reach is prevented.

      Major considerations:

      A major component of the proposed theories in this manuscript rest upon the assumption that PFOR is an active enzyme under highly aerobic conditions: this claim is never directly demonstrated.

      This is true. We could show though that PFOR of Synechocystis is in constrast to most bacterial PFORs stable in the presence of oxygen. However, as stated likewise for the oxygen stable PFOR of the obligate aerobe Sulfolobus acidocaldarius (3), and PFOR from E. coli, which was recently shown to contribute to metabolism in the presence of oxygen in vivo (1) we as well had to remove oxygen for enzyme acitivty in vitro. This point is discussed frankly.

      Indirect evidence of altered growth of pfor mutants, increased repression of PDH, and the higher NADH:NAD+ ratio under photomixotrophic conditions is in general alignment with this theory. However, while deletion of pfor does indeed result in altered growth dynamics in Synechocystis under periods of photomixotrophy, the alterations do not entirely align with the idea that this pathway is critical for rapid growth under aerobic conditions. For instance, pfor and most of the highlighted mutants (fdx 3, fdx 9, isiB) presented in Figure 3 show the greatest defects in their OD after reaching stationary phase (more rapid decline in OD on/after Day 6) relative to WT. This doesn't align as nicely with the highest NADH:NAD+ seen in Days 3-5 (which is also specifically called out: e.g., Line 146, Supplemental Figure S8).

      We are very cautious to compare growth experiments day by day. This is due to the fact that the growth behaviour of WT and mutants differ between experiments. We therefore repeat these experiments in several independent experiments including at least three replicates and show the data of typical growth experiments. In the case of the shown growth behaviour of WT and pfor and the NADH/NAD+ ratios under photoautotrophic and photomixotrophic conditions shown in figure 1, NADH/NAD+ ratios were determined in exactly those cultures for which growth data are shown. It is therefore legitimate to directly compare these results day by day. However, we did not determine the NADH/NAD+ ratios of the cultures shown in Fig. 3. The rise in NADH might have started with a delay here.

      In this context, the deletion of F-GOGAT is much more convincing in it's severity and timing, yet for this mutation to have a more severe phenotype is unexpected if PFOR is one of the primary/sole electron donors to the ferredoxin pool from glucose utilization as proposed (i.e., stated differently, F-GOGAT is only one of the enzymes downstream of ferrodoxin and might be expected to have a more subtle phenotype in comparison to the KO of PFOR if that is a primary source for electrons to ferredoxin under photoheterotrophic conditions).

      F-GOGAT requires reduced ferredoxin which can be provided by PFOR and in addition also by PSI. As electrons from glucose oxidation can be fed via photosynthetic complex I into the PQ-pool they will eventually arrive at PSI (Fig. 3C) where ferredoxin can be reduced and transfer electrons to F-GOGAT. However, to get a truly complete picture of the situation several issues will have to be addressed in the future: we do not know which of the low abundant ferredoxins as well as high abundant ferredoxin 1 interact with PSI, F-GOGAT, PFOR and photosynthetic complex I. It would be furthermore helpful to know all midpoint potentials of the different ferredoxins. Without this information it might be too much to ask for a simple interpretation.

      A central tenant of the argument put forward on the evolutionary importance of using either PFOR vs. PDH is the conservation of extra free energy by the former reaction. However, additional information on the ferredoxin paralog(s) that accept electrons from PFOR is necessary to evaluate these claims. Based on the data within these manuscripts, Fdx3, Fdx9, and IsiB have the strongest links to PFOR: though the authors do take care to never state directly that they have evidence that these are the acceptors in vivo. Given the variability in the midpoint potentials of different ferredoxins, some ferredoxin acceptors may better conserve the free energy in pyruvate, while others may actually be more 'wasteful' than NAD+ as the acceptor through PDH. Unfortunately, the midpoint potentials for Fdx3, Fdx9, and IsiB are unknown or not stated in this manuscript. It is therefore unclear what ferredoxin is being used as the reference point for conservation of Gibbs free energy in Figure 4C and referenced multiple times in the text.

      We agree that it would be great if we already knew the redox potentials of all the ferredoxins involved. We are currently working on this issue. All that we know for now is that the redox potentials of ferredoxins lay between -240 mV to -680 mV whereas the redox potential is around -320 mV for NAD(P)H/NAD(P)+. Unpublished data that require further validation reveal that the redox potential of Fdx9 is definitely more negative than the redox potential of Fdx1 (-412 mV) in Synechocystis and is thereby clearly more negative that -320 mV. However, as these data require further validation, we did not name numbers. In addition, interaction studies on PFOR and low abundant ferredoxins are planed and preparations are in progress.

      Finally, the measurements of NADH:NAD+ (most prominently used for measurements in Fig 1B) utilized kits that require multiple, long centrifugation steps in the dark prior to assaying this rapidly exchanging pool. While it appears that the authors were able to get reproducible results with these kits, it is difficult to interpret what the increase in relative NADH levels in glucose-fed cells means given that 10+ minutes of incubation in the dark and/or changing temperatures elapsed after the cyanobacteria were removed from the incubator before the NADH:NAD ratio was assessed. While it superficially makes logical sense that the cytosol would be over-reduced when illuminated and under glucose feeding relative to illumination alone, it shouldn't be assumed that these measurements are representative of this rapidly-exchanging pool under the steady-state growth conditions.

      Thank you very much for raising this important point. We are very much aware of the difficulties to determine the redox state of NADH:NAD+ using these kits. However, there is no other method available that properly distinguishes NADH and NADPH. Furthermore, the centrifugation step was done at -9°C which should minimize metabolic reactions during this step. However, we now added in vivo measurements using the NAD(P)H-module available for the PAM and using the Dual-KLAS/NIR to determine the redox state of ferredoxin (newly added Fig. S4). Both methods show that NAD(P)H as well as ferredoxin are more strongly reduced under photomixotrphic conditions in comparison to photoautotrophic conditions and thus support our previous data.

      Reviewer #2 (Public Review):

      The observation that cyanobacteria can use two alternative pyruvate decarboxylating enzymes using either NAD+ or ferredoxin is an interesting and the work is useful contribution. The authors very nicely characterize the enzymatic properties of the two pyruvate metabolizing enzymes and also are able to connect the ideas of redox balance with a set of ferredoxins. Even though they are not able to definitively characterized the specific ferredoxin which interacts with the enzyme, the analysis is nicely conducted and it's clear that the suggestion they're making regarding the involvement of the minor ferredoxins is compelling. However, the work could be written in a way that might be more useful.

      Specific comments:

      Overall this is an interesting study, but the arguments could be sharpened and better connected with the literature. The introduction needs to be considerably revised in my opinion. It is not obvious whether it is even appropriate to discuss the enzymes as an aerobic enzymes or aerobic enzymes, since this concept is simplistic and perhaps, archaic. Indeed, placing the results of the present study in the context of "aerobic enzymes versus aerobic enzymes" is a bit of a 'strawman' argument. For example, the counter examples of O2-tolerant enzymes cited seem to suggest that PFORs have been capable of evolving into O2-tolerant enzymes quite readily and that two types of decarboxylase have evolved for quite different reasons than simple replacement for a new environment. Instead, I think a more current and general perspective relates more to the interpretation that the authors are already putting forth. Namely, the enzymes are utilized according to redox balance considerations rather than sensitivity to oxygen.

      Therefore, I think the very long and pedantic introduction is useful for review, but only if it is shortened and also includes the alternative interpretation regarding adaptations to redox potential in the cytoplasm. My guess is that there are plenty of examples of redox balance function arguments in the literature to refer to in contrast to the evolutionary replacement argument used. Certainly, there are good examples regarding glucose toxicity in mutants of Synechocystis that can be considered.

      Thank you very much for this point. The O2-tolerant PFORs mentioned were merely shown to be stable in the presence of oxygen in vitro which means that they can be isolated under anaerobic conditions. However, all enzymatic in vitro assays required anaerobic conditions. Only one PFOR was shown to be active in the presence of oxygen in vitro. Physiological studies on the importance of these enzymes under aerobic conditions in vivo are completely missing. However, animated by the requests of the reviewers we searched the literature intensively again and indeed found a recent report, which describes the involvement of PFOR in redox regulation in an aerobic culture of an E. coli mutant, in which glucose-6P dehydrogenase (ZWF) was down-regulated (1). We included this study both in our introduction and discussion. It very much supports our own findings, as the E. coli PFOR requires likewise anoxic conditions in in vitro enzyme tests. We agree that the idea that PDH complex and PFOR are exclusively regulated by oxygen availability might sound simplistic. However, we do not fully agree that this is a strawman argument as both enzyme systems are still mostly discussed as counterparts for either aerobic respiration (PDH complex) or anaerobic fermentation (PFOR)(4). To the best of our knowledge, the study that was included now and our own data, are the very first ones that put clearly forward the idea, that redox control governs the activity of these enzyme systems at the pyruvate node independent of oxygen. However, doubts about the rather simplistic distinction between aerobic versus anaerobic enzymes in general have indeed been expressed. Even though these studies in general lack physiological in vivo experiments. We therefore included this information in the introduction as well. (line 76: There are several reports on the aerobic expression of enzymes that are assigned to anaerobic metabolism in prokaryotes and eukaryotes and therefore challenge the simplistic distinction between aerobic versus anaerobic enzymes (5-7). Their physiological significance and regulation are only partly understood.) This did not result in a shortened introduction though as additional information was added. The new introduction thus includes alternative interpretations as requested and is therefore hopefully more balanced.

      Given the interpretation that the alternative forms of the enzyme help cells adjust their redox balance to different conditions, such as photomixotrophic growth, the very nice enzymatic analysis and growth studies of the mutants work would be significantly strengthened by more direct physiological measurements that report intracellular redox states.

      Thank you very much for this important point. Intracellular redox states were shown by measurements of the NAD+/NADP level (Figure 1B) and were now extended by new in vivo measurements that show that both the NAD(P)H and the ferredoxin pools are more reduced under photomixotrophic in contrast to photoautotrophic conditions (new Fig. S4).

      Minor comments:

      line 211: Perhaps, "..the deleted alleles failed to segregate, keeping some wild type copies."

      This was changed to: the deleted alleles of fx2 (sll1382) and fx5 (slr0148) failed to segregate, keeping some wild type copies.

      It would be interesting to characterize whether the observed distribution of PFOR correlates with specific physiological features. In other words, PFOR seems to become important upon the addition of an external carbon source in way that must integrate with autotrophic metabolism (i.e. mixotrophic growth) altering the balance of the oxidized and reduced form of redox cofactors--does the observed distribution correlate at least with the metabolic characteristics of the handful that have been studied in the lab?

      Thank you very much for this suggestion. We checked the lists of cyanobacteria that either possess or do not possess a PFOR in order to search for shared known physiological features. However, the challenge is currently that the number of uncharacterized cyanobacteria in our list is too large. It is therefore impossible to find solid correlations. But we fully agree that it would be interesting to find these.

      A more detailed set of calculations that help explain panel C in figure 4 need to be included to support the quoted values for redox potential in free energy. I assume these are standard values and and the specific superscripts and subscription associate with the ΔG nomenclature needs to be defined.

      The calculations are shown in the materials and methods part. A respective notice (for calculations see materials and methods part) is now given in the legend of Fig. 4C. Information concerning the nomenclature is found in the cited literature in the materials and methods part as well.

      Reviewer #3 (Public Review):

      The manuscript by Wang et al. conclusively demonstrates that the cyanobacterium Synechocystis sp. PCC6803 prefers to use the ferredoxin-reducing enzyme PFOR over the NAD+-reducing PDH-pathway when grown under photomixotrophic conditions while the PDH-route is favored under photoautotrophic conditions. Both the potential physiological meaning of this switch and implications for the evolutionary history of the role of the respective enzymes and their pathways are discussed.

      The main hypothesis of this work considers that PFOR-mediated decarboxylation of pyruvate replaces the PDH-based one when cells shift from photoautotrophic to photomixotrophic growth conditions. This hypothesis is assessed via the comparison of growth curves measured on a host of deletion mutants and via direct detection of expression levels of certain enzymes. The authors' hypothesis is robustly supported by the majority of the reported experiments and the reviewer is fully convinced by these data. However, I would hold that the data shown with respect to phosphorylation of PDH (Fig. S4) are unconvincing. I can't see a clear difference in growth-curves for the incriminated mutants deltaspkB and L which would convincingly exceed the variation observed for the entire dataset.

      We agree that the data on the phosphorylation of the PDH complex including the kinase mutants are not very convincing. We were uncertain from the beginning on whether it would be a good idea to include these data sets and therefore discussed them very cautiously in the manuscript. Anyway, as the enzymatic tests with the E3 subunit of the PDH complex at different NADH concentrations show convincingly that high NADH levels have an inhibitory effect on the complex, we now decided to delete both data sets out of the manuscript, as they are not really required for the statement of the manuscript.

      1) S. Li et al., Dynamic control over feedback regulatory mechanisms improves NADPH flux and xylitol biosynthesis in engineered E. coli. Metab Eng 64, 26-40 (2021).

      2) T. Nakayama, S. Yonekura, S. Yonei, Q. M. Zhang-Akiyama, Escherichia coli pyruvate:flavodoxin oxidoreductase, YdbK - regulation of expression and biological roles in protection against oxidative stress. Genes Genet Syst 88, 175-188 (2013).

      3) A. Witt, R. Pozzi, S. Diesch, O. Hädicke, H. Grammel, New light on ancient enzymes – in vitro CO2 Fixation by Pyruvate Synthase of Desulfovibrio africanus and Sulfolobus acidocaldarius. The FEBS Journal 286, 4494-4508 (2019).

      4) M. Müller et al., Biochemistry and Evolution of Anaerobic Energy Metabolism in Eukaryotes. Microbiology and Molecular Biology Reviews 76, 444 (2012).

      5) S. B. Gould et al., Adaptation to life on land at high O2 via transition from ferredoxin-to NADH-dependent redox balance. Proceedings of the Royal Society B: Biological Sciences 286, 20191491 (2019).

      6) O. Schmitz, J. Gurke, H. Bothe, Molecular evidence for the aerobic expression of nifJ, encoding pyruvate : ferredoxin oxidoreductase, in cyanobacteria. FEMS Microbiol. Lett. 195, 97-102 (2001).

      7) K. Gutekunst et al., LexA regulates the bidirectional hydrogenase in the cyanobacterium Synechocystis sp. PCC 6803 as a transcription activator. Molecular Microbiology 58, 810-823 (2005).

    1. Author Response

      Reviewer #1 (Public Review):

      Strength: Excellent statistical methods are employed. Specimens collected from two centers are used.

      Weakness: It is not clear what new knowledge this follow-up study bring to the audience. The critical biomarker, miR150 they propose for development of biodosimetry assay was already discovered. There are close to dozen publications showing the dose response of miR50, in mouse, rats, non-human primates and humans (including two research papers and and several reviews from authors). The dose response shown in 4b is not appreciable. Introduction and discussion talk about clinical utility for triage after nuclear disaster. Is analysis of miRNAs purified from exosome a viable approach for triage and clinical decision making? If so, please provide convincing argument showing practicality.

      We appreciate that the reviewer and the editor believe that “excellent bioinformatics and biostatistical methods are employed”. We apologize for the confusion regarding miR-150 and its utility as a radiation exposure biomarker. Indeed we and others have shown the importance of miR-150 and other miRNAs in detecting radiation exposure in mice and macaques. We had inferred that the resulting evolutionarily conserved radiation-inducible microRNAs were very likely to translate well to humans due to the high conservation of their promoter regions and transcription factor binding sites. However, in this study validating microRNA-based test for radiation detection using actual samples , we demonstrate that while most of the predictions grounded in animal models held true, solely through the analysis of human data were we able to develop a model that reached clinically-useful performance. And most importantly there are key differences in humans suggesting that for clinical application the primary source of data has to be human. For example, a key miRNA for radiation detection noted in macaques – miR-133 – was absent in human patient sera. The miR-30 family, important for dose separation in mice was redundant in the human test. The results from animal studies of miR-150-5p are not directly translatable for the use in humans. In animals, particularly isogenic mice, miR-150-5p kinetics enable perfect separation of the irradiated from non-irradiated samples, even after low dose exposure. The dose response in humans, that have different genetic and clinical background, is much less appreciable and therefore a simple, single- or two-miRNA-based test is insufficient. To overcome this, we employed artificial neural networks reliant on the expression of 8 miRNAs and 2 normalizers, which assure robustness to differences in sample material content. Therefore, we are bringing significantly new knowledge to the field, and providing a template for how miRNA signatures derived from animal models need robust validation in human samples before we even conceive a human application. The analysis of miRNAs purified from exosomes constitutes an exploratory component of our work and is not part of the proposed diagnostic procedure for triage and clinical decision making. We introduced necessary changes to make the division between the main and exploratory parts of our work more evident (lines 116-127).

      Major comments:

      1. Longitudinal evaluation of specimens from human patients who received TBI is a plus. However, baseline readings in specimens collected from leukemia patients need to be compared with that in healthy humans. Why several specimens were excluded from analysis?

      Since the irradiation of healthy humans would not be ethically acceptable, we cross-referenced the results from patients with leukemia with our earlier results of radiation-responsive miRNAs in healthy mice and non-human primates as a surrogate of healthy humans undergoing TBI. As outlined in the “Preprocessing of profiling data” section of Materials and Methods, we implemented quality control based on the number of detected miRNAs per sample. For the miRNA-seq based experiment, samples with less than 350 miRNAs with non-zero reads detected (4A and 7A in Figure 1 – supplementary figure 1) and respective paired samples were removed from the analysis. Additionally, sample DFCI.13A was an outlier in hierarchical clustering and in Principal Component Analysis (Figure 1 – supplementary figure 2) and therefore this sample, together with paired samples from other timepoints, were excluded from the analysis. We incorporated this information in the main part of the manuscript (lines 146-148).

      1. Dose response noted is moderate. Biodosimetry refers retrospective evaluation of absorbed dose and the analysis should include validation using specimens of unknown exposure.

      As outlined above, the moderate dose responsiveness of miRNAs used in our proposed signature is the primary reason why we believe that a simple diagnostic procedure based on a single miRNA, e.g. miR-150-5p, will not be feasible for use in humans. The final model was evaluated on an independent group of 12 patients with samples drawn under the same protocol (for which exposure and dose was unknown, to validate the model diagnostic accuracy).

      1. Authors says that 1 Gy exposure in humans can cause ARS (paragraph 1, introduction). However their approach do not resolve dose under 4 Gy (around the LD50 value in humans).

      The TBI protocol does not allow for irradiation with doses lower than 2Gy in a single fraction, which was the reason behind the definition of low-dose exposure group (2 or 4Gy) in our study. However, localized irradiation with higher doses provokes response reflected by changes in miRNA levels in serum (Malachowska et al. Int. J Radiation Oncol Biol Phys), suggesting that the irradiation signature are likely to hold true and identify individuals exposed to smaller doses.

      Reviewer #2 (Public Review):

      The study first compared the profiles of serum miRNA in patients before and after irradiation treatment. Then they selected 8 miRNA markers that showed significant changes in levels for further analysis. Then, they showed that the analysis of these markers by real-time PCR can differentiate the pre- and post-irradiation samples in 12 additional patients. The objective of the study is unclear.

      We rephrased the appropriate sections of the manuscript accordingly to elucidate the objective of the study (lines 105-106 and 131-132).

      The study only demonstrates that the 8 miRNA markers are useful to differentiate serum samples collected before and after irradiation. This information is not useful as the blood picture would be more accurate and cheap to accomplish this task.

      The currently used diagnostic screening tests for radiation exposure, including time to onset of radiation sickness, kinetics of lymphocyte depletion and chromosomal abnormalities analysis, are time-consuming and do not allow definite conclusions, as outlined by the lack of FDA-approved biodosimeter. The nadirs of peripheral blood cell counts may reflect high dose exposure but do not allow for prediction of the eventual outcome. Moreover, as evidenced in our prior experimental studies, the dynamics of the blood cell counts are significantly slower than those of circulating miRNAs. For example, the differences in outcome, that is probability of survival of an animal after acute radiation exposure, is not evident by any blood counts or other measures for weeks after radiation, and is predicted by a blood based-microRNA signature with ~90% accuracy assessed 24 hours after radiation exposure (Acharya et al, Science Translational Medicine, 2015). Therefore, although we acknowledge that a blood cell count would be cheaper, we respectfully disagree that it would be more accurate in rapidly providing the necessary information to implement countermeasures safeguarding from the absorbed radiation dose. Furthermore, qPCR-based assays are also inexpensive and increasingly available, owing to the COVID-19 pandemic and the great need to expand PCR-based testing capabilities that it gave rise to. We acknowledge that this information was not presented in sufficient detail and we expanded relevant sections of the manuscript (lines 64-76, 401-402).

      The authors also propose that these markers are useful for the identification of subjects exposed to irradiation. As this study has not addressed the specificity of these miRNA markers to irradiation, the claim of having a signature for radiation exposure is not justified.

      We had shown in previous, experimental exposure studies (“Serum microRNAs are early indicators of survival after radiation-induced hematopoietic injury”, Science Translational Medicine, 2015 and “Evolutionarily conserved serum microRNAs predict radiation-induced fatality in nonhuman primates”, Science Translational Medicine, 2017), performed using animal models that miRNAs with radiation-dependent alterations of expression show association with bone marrow depletion, correlate with survival in amifostine rescue experiment, and that miRNA expression changes are supressed by the use of radiation-mitigating agents like gamma-3-tocotrienol. These arguments act in favour of specificity towards irradiation as the inciting stimulus of the expression patterns. The cross-referencing of results from animal studies and from our miRNA-seq experiment on human samples was aimed to account for this issue, as similar experiments on healthy humans would not be ethical, and to identify high-confidence miRNAs from which a signature could be built. We now added these explanations (lines 112-115, 164-167, 344-350).

      Although patients with irrevocable damage of bone marrow due to other factors would be an interesting comparative group, we struggle to find an ethically acceptable scenario that would match the TBI in terms of the timeline and repeatability of the bone marrow depletion. A feasible alternative may be high dose chemotherapy conducted in preparation for bone marrow transplant, but the dynamics of that procedure are vastly different making the group more adequate for analyses of bone marrow regeneration rather than a control for TBI-initiated damage.

      The key new experiments in this study are the profiling of the serum miRNA in the patients undergoing total body irradiation. The results on mouse model and macaques have been published previously. The consistency of the changes of the miRNA markers is not surprising.

      The consistency of the radiation-inducible miRNAs between mice, non-human primates and humans was expected, given the high conservation of their promoter regions and transcription factor binding sites, as we showed previously (Fendler et al., 2017). This step was important to assure that the miRNA level changes observed in humans result from radiation exposure, as this could not be determined directly, as mentioned in the response to previous remark. However, the creation of the clinically-applicable test would not be possible without a true study in humans presented in the manuscript. Notably, miRNAs crucial for the radiation exposure models in our macaque model (miR-133b) was surprisingly absent in human sera, and the miR-30 family, important for dose separation in mice was redundant in the human test. This serves as a cautionary tale for “translational” studies without true validation in humans and underlines the importance of our findings in terms of the first human-specific and adequately validated diagnostic and prognostic test for radiation exposure.

      Reviewer #3 (Public Review):

      1. Appropriate bioinformatics discussions and functional pathway analysis are necessary for the key differentially expressed miRNAs that have been screened out. It is boring to only discuss the differences of miRNA data.

      We appreciate the suggestion to back the results of differential miRNA expression with a more in-depth bioinformatic discussion. We discussed the results of functional enrichment analysis, presented in Fig. 3C, in more detail, and appended the bioinformatic analysis (lines 218-222, 360-364, 546-549). A graph of miRNA-gene interactions, created using miRTargetLink 2.0 for miRNAs differentially expressed in exosomes after high dose irradiation has been added as figure supplement 1 to Figure 3.

      1. In page 5, "We used logistic regression to create such a model in the low-dose setting (N=22 sample pairs). The resulting classifier was based on the expression of miR-150-5p, miR-126-5p and miR-375" , Why the three miRNAs in the low-dose radiation group were selected for modeling instead of the seven overlapping miRNAs in the high and low dose radiation group to classificate the irradiated- and non-irradiated samples ? Please explain in detail.

      The expression of miR-150-5p, miR-126-6p and miR-375 was used in our previous animal studies to determine radiation exposure and we used similar approach at this stage of the project to evaluate whether their expression measured using RNA sequencing in human sera can reliably distinguish between the irradiated and non-irradiated samples. We acknowledge that it is not clearly stated. The primary purpose of this analysis was to visually present similarities in radiation-inducible miRNA expression changes across species, and the logistic regression model in question was not used any further. Following the Reviewer suggestion, we built a model using the seven miRNAs overlapping in the high and low dose radiation comparisons to classify the irradiated- and non-irradiated samples, obtaining AUC of 0.95 (95%CI: 0.89-1.0); however, we believe adding this information to the main part of the manuscript is not necessary.

      1. In page 5, "Therefore, the expression of miR-126-5p, miR-150-5p and miR-375 enabled efficient classification of the irradiated- and non-irradiated samples in both settings (Fig. S6C)";

      In page 6, "Interestingly, a set of 3 miRNAs quantified by qPCR in all of our previous experiments clearly visually distinguished irradiated from non-irradiated samples in the human analysis (Fig. 5A)",

      Which three of miRNAs,miR-150-5p,miR-375,miR-126-5p mentioned before or miR-150-5p,miR-375,miR-215-5p?Please clarify clearly.

      Thank you for the suggestion. We rephrased this fragment (lines 289-290).

      1. In page 4, "Since miRNA-containing exosomes.......high dose irradiation", Do you think that the differently expression of serum miRNAs partly results from exosomes? Low dose irradiation is also able to change exosomal miRNA profile,why only high dose irradiation is taken into account in paper while low dose irradiation is not?

      We believe that serum miRNA expression results in part from exosomes and, as an exploratory component of our work, aimed to verify whether the magnitude of changes in exosomal miRNA expression exceeded that in serum, improving the potential biomarker specificity to the extent that would justify the development of an arguably more complex and labour-intensive test utilizing exosome isolation. The sequencing of exosomal miRNA content was therefore performed as an exploratory analysis only after high radiation exposure. However, the lower amount of exosomal miRNA than obtained through the total miRNA extraction protocol offsets any benefit stemming from higher cellular specificity of the former, and, based on the results that were comparable with those obtained from sera, decided to not explore this concept further. We added this explanation to our manuscript as this issue was not clarified previously (lines 116-127 and 339-343).

      1. Are there any miRNAs that can clearly distinguish between high and low dose groups? If so, please clarify them in text.

      We now clarified this issue in discussion (lines 415-417).

      1. In page 7,"Importantly, similarities were observed in the level of both individual miRNAs and miRNA families", What part of result Comes to this conclusion?Please explain clearly.

      When describing similarities between human and animal studies, we refer to our previous work describing radiation-responsive miRNAs in mice and non-human primates. These similarities (and differences) are described in detail in Table 1. We added relevant references to Table 1 and to the cited sentence (line 352).

      1. In page 7, "We found that the most common putative tissue sources for differentially expressed miRNAs were hematopoietic and endothelial cells", Which part of result shows this sentence? Please point it out.

      This statement is not validated in our work explicitly but based on the results from references: Ludwig et al., 2016, de Rie et al., 2017 and Landgraf et al., 2007. Since Ludwig et al., de Rie et al. and Landgraf et al. generated excellent data of miRNA expression across human and mouse tissues and cell types that showed overlapping results for the miRNAs of interest, as detailed in Table 1, we did not perform additional confirmatory experiments.

      1. Were the patients suffering from cancer or other diseases? How to ensure that the differential expression of miRNA was caused by radiation exposure rather than their own disease? Please explain.

      As described above, initial experimental studies performed in animal models (mouse and macaque) in preparation for this study showed the specificity of miRNA (including ones in the signature) towards radiation exposure in different animal models. This was evidenced on multiple layers of validation and rescue experiments. Admittedly, a demonstration that additional diseases with a phenotype similarity with ARS affect study performance is an interesting concept, but it would be extremely unlikely to impair the performance of the test in an individual after radiation exposure. Namely, even if the examined patient has a hematologic malignancy or myelofibrosis potentially affecting the performance of the test, identification of such individuals as potentially irradiated would lead to them being followed-up adequately. Failure of the test to detect radiation exposure will likely not be severe risk, since such individuals will already be severely ill and under proper care with regular monitoring of bone marrow function. We are aware that some unforeseen and not discussed clinical factors may affect some facets of the test but the built-in robustness derived from having multiple miRNAs mitigates the risk of non-specificity.

    1. but that was probably just something people said because her nose was too small and her mouth was a bit too big.

      What is interesting is the reasoning we often come up with viewing ourselves is often negative as we seem to like to often question how others think that you look good while you may end up looking at yourself negatively.

    1. adamsmith May 19, 2013 So, the argument for the status quo is that the working paper on arxiv is a separate publication from the journal article it ends up published as. That's why it should be saved and - where it applies - cited differently. In other words, taking bibliographic data seriously, the DOI does _not_ apply to the arxiv paper and should not be saved with it. That's in line with what we do with other working paper repositories such as SSRN.
      • I THINK SO
      • DIFFERENT zotero items for
      • arxiv
        • different item for each version!!!
      • doi publisher
      • Each item with its PDF!!!
      • DIFFERENT Citations!!!
    1. On a larger scale, ARSAC’s goal is not to completely extinguish wildfires—or replace firefighters—but to create acoustic boundary lines that prevent such fires from spreading. “We think we’re going to be able to buy those firefighters time, which is the real killer in a disaster situation,” Dhillon says. ARSAC’s technology may prove particularly disruptive because it’s designed as a “sense and respond system,” he adds, rather than a “sense and react” system. The difference? ARSAC’s integrated fire protection system aims to not only detect embers but also track the location and direction of burgeoning fires to prevent them from crossing property lines. ARSAC’s system employs sensors to detect a heat bloom and then send out spikes on a given frequency that can be used to track the fire’s flow, Dhillon explains. Drones can then be dispatched to provide aerial surveillance to monitor the fire, and arrays of sound-wave fire extinguishers along property lines can be pointed in the right direction to create an acoustic fire barrier.

      System design details

  2. Dec 2021
    1. Author Response

      Reviewer #1 (Public Review):

      In the present work Valperga and de Bono performed a forward genetic screen to identify candidate genes that would fulfill two criteria when mutant: 1) enhance an escape response to high ambient oxygen but 2) without modifications in the respective oxygen sensing neurons. They found that qui-1 mutants meet these criteria. qui-1 is known to act in the nociceptive neurons ASH and ADL (among others). The authors show that in qui-1 mutants ADL neurons are defective in normal chemo-sensation and upregulate neuropeptide secretion. This is associated with increased gene expression of neurosecretion components in ADL, among them two GPCR receptors (npr-22 and tkr-1); mutants in these receptors partially phenocopy the neurosecretion phenotype. The authors suggest an intriguing model in which ADL, upon loss of its normal sensory properties, relays peptidergic input from oxygen sensory circuits to peptidergic output towards yet unidentified downstream circuitry. This novel mechanism of sensory cross modality expands on on previous work on cross modality in C. elegans, where until now only one example been demonstrated, and where a different mechanisms than in the present study was described (Rabinowitch 2016). These findings could serve as generalizable models for other systems where cross-modal plasticity has been observed. Although many conclusions in this work are substantiated by cell specific rescue of qui-1 in ADL others are made based on correlated observations only. The study therefore would benefit from additional experiments that demonstrate a causal link between elevated neurosecretion in ADL and the associated changes in behavior. This could be achieved by ADL cell ablation experiments and specific interference with ADL neurosecretion.

      We thank the reviewer for this analysis of our work. We sought to address points raised in this summary using her/his suggestions.

      Reviewer #2 (Public Review):

      Loss of one sensory modality is often compensated with an increase in another sensory modality. Valperga and de Bono identify a possibly conserved mechanism that appears to heighten the worm's sensitivity to O2 while dampening other sensory responses. The mechanism that they discover suggests that increased neuropeptide secretion could be responsible for the overcompensation for a loss of a sense. The combined data based on forward genetic screening and behavioral analysis, imaging and genomics are convincing and interesting.

      1. I very much enjoyed reading a manuscript that uses 'good old' forward genetics to make an interesting discovery!

      2. The paper is well written and very easy to follow. The data quality and their display in the figures are very convincing, too.

      3. The proposed mechanism of using enhanced neuropeptide secretions for compensating the loss of one sensory modality with an increase of function of another is novel and could indeed be conserved.

      We are grateful to the reviewer for the encouraging review of our work.

      Reviewer #3 (Public Review):

      The work by Valperga and de Bono aims to uncover molecular components of cross-modal plasticity, a system-wide form of neuronal remodeling that responds to sensory loss by altering the performance of remaining sensory modalities. The study focuses on the interplay between oxygen-sensing and pheromone detection in C. elegans. The data presented are mostly convincing and revealing. However, the message and the overall context within which the findings are framed are problematic.

      The authors rightly assert that the molecular processes underlying cross-modal plasticity are not fully understood. However, they emphasize that the important challenge is to reveal genetic lesions that result in sensory loss and drive cross-modal plasticity. I find this to be over-specific and imprecise. There are many possible causes for sensory loss, some are genetic, some are non-genetic (e.g., certain diseases and injuries). In any case, the causes for sensory loss are usually independent of the processes that give rise to cross-modal plasticity. The genetics behind cross-modal plasticity enables the response to sensory loss, it does not cause the sensory loss. Genetic lesions to genes involved in cross-modal plasticity disrupt cross-modal plasticity, they don't induce it. Curiously, the authors sought to find single genes whose removal is simultaneously associated with both the loss of a sensory modality and the enhancement of another. This was done using a forward genetic screen for C. elegans mutants displaying enhanced oxygen sensation.

      We thank our reviewer for her/his thoughtful comments. We have revised our introduction to take account of her/his comments, and to remove the misleading statements s/he highlights.

      The analysis was further complicated by the fact that the screen was performed on strains whose oxygen sensitivity is already modified due to dysregulated activity in the RMG hub-and-spoke neural circuit, which integrates diverse sensory signals to control locomotion. Mutagenesis was performed on either the N2 strain, exhibiting RMG suppression, and thus decreased oxygen sensitivity, or flp-21 mutants, displaying excessive RMG activation, and increased oxygen sensitivity.

      We chose two genetic backgrounds for our mutant screens that attenuate the output of the RMG hub interneurons. Both backgrounds include a gain-of-function allele of the neuropeptide receptor NPR-1 that inhibits RMG output. The NPR-1 receptor has multiple peptide ligands, so in the second screen we reduced NPR-1 inhibitory signalling by deleting one these ligands, FLP-21. Neither of the two strains we used, N2 or flp-21, show appreciable O2 responses on food, and do not aggregate or accumulate on thicker parts of the food lawn, facilitating our screen (See Figure 1B).

      The screen yielded a gene, qui-1, whose dysfunction led to enhanced oxygen sensing (it is unclear if this is in the N2 or flp-21 background). The authors found that increased neuropeptide release from the pheromone-sensing neuron ADL underlies the increase in oxygen sensitivity. Furthermore, the qui-1 mutation was shown to diminish ADL pheromone responses. Therefore, a very particular genetic coupling between loss of pheromone sensation and enhanced oxygen sensitivity was revealed.

      We have indicated the parental origin of the qui-1 mutant in the revised manuscript.

      To generalize this finding, several additional mutant genes (not from the screen) were examined, including genes from the BBS family as well as wrt-6 and fig-1. They too displayed enhanced oxygen sensing linked to increased ADL neuropeptide secretion. However, their effects on ADL pheromone sensation were not reported. The main conclusion I draw from these findings is that the ADL neurons are able to modulate oxygen sensitivity by relaying information about oxygen levels from the RMG circuit to locomotor circuits via neuropeptide secretion. It is not at all clear that loss of pheromone sensation in the qui-1 case is the cause for increased neuropeptide release, or whether it is just one out of the many outcomes of mutating this gene. A much cleaner and more revealing experiment could have been, for example, to examine worms lacking the functional pheromone receptor OCR-2 in ADL. In fact, unlike qui-1 mutants who showed diminished oxygen responses in ADL, previous work from the de Bono group (Fenk and de Bono 2017) demonstrated that ADL O2 response are normal in ocr-2 mutants, indicating a profound difference between loss of pheromone sensitivity due to receptor dysfunction (ocr-2) and the unknown and broad effects of qui-1.

      We thanks the reviewer for this important suggestion. We have sought to test our model with a functional experiment that selectively disrupts sensory input into the ADL neurons. To achieve this, we decided to knock down a protein required for intraflagellar transport, OSM-6, rather than the OCR-2 TRP channel subunit. OCR-2 mediates not only pheromone responses in ADL, but also O2-escape behavior (de Bono et al., 2002). This may reflect a broader role for OCR-2 in ADL than sensory transduction. Disrupting OSM-6 truncates sensory cilia and severely compromises many chemosensory responses, but only weakly reduces aggregation and O2 responses.

      To target OSM-6 degradation specifically to the ADL neurons we knocked in DNA encoding an Auxin Inducible Degron (AID) into the osm-6 locus, and expressed TIR1 in ADL to achieve cell-specificity. TIR1 is required for AID. We have added the new data to Figure 4F–G and Figure 4 – figure supplement 2. We show that expressing TIR1 in ADL disrupts OSM-6::AID function both in the presence and absence of Auxin. This agrees with recent work that tested the efficiency and specificity of the AID system (Hills-Muckey et al., 2021). A partial OSM-6::AID reduction in ADL recapitulates many of the phenotypes of qui-1 mutants, including increased neurosecretion from ADL, heightened ADL responses to O2 inputs and a small but significant enhancement of the O2-escape response. We think these new data support our interpretation that a change in ADL’s sensory properties leads to heightened response of ADL neurons to O2 inputs, a phenotype observed in qui-1 and multiple other sensory defective mutants and a hallmark of cross-modal plasticity. However, the effects of knocking down osm-6 on ADL function also appear to be complex, as the stronger osm-6 knockdown achieved by adding auxin to the osm-6::AID knockin animals expressing TIR1 in ADL, unexpectedly gives weaker phenotypes than when auxin is absent.

      In fact, it would be interesting if the authors could explain or speculate how qui-1 eliminates ADL O2 responses, and how neuropeptide signaling from the RMG circuit via the NPR-22 neuropeptide receptor bypasses this lack of response and drives enhanced neuropeptide secretion in ADL, as they report.

      We can only speculate why O2-evoked responses in ADL disappear in qui-1 mutants. One possibility is that ADL becomes less excitable due to the reconfigured gene expression associated with loss of qui-1 in ADL. This model would predict that selectively knocking down qui-1 in ADL would confer the same Ca2+ response phenotype. Blocking ADL neurosecretion with TeTx in qui-1 mutants would test if the increased ADL neurosecretion we describe feeds back to reduce the O2-evoked Ca2+ response in ADL. An alternative hypothesis is that the effect of disrupting qui-1 is non-cell-autonomous, altering excitatory or inhibitory input to ADL from other qui-1 expressing neurons. We have not tested if neurosecretion from other qui-1-expressing neurons is altered in qui-1 mutants.

      Strikingly, while disrupting qui-1 leads to loss of a measurable O2-evoked Ca2+ response in ADL, these neurons display elevated O2-evoked neurosecretion in qui-1 mutants. This implies that some O2-evoked Ca2+ responses are retained in ADL’s axons in qui-1 mutants. It also suggests that other second messengers upregulate neurosecretion. Elevating cAMP, for example, can promote dense-core vesicle release more efficiently than increasing Ca2+ levels (Costa et al., 2017). Altered G-protein coupled receptor signalling could lead to elevated cAMP levels and increased neurosecretion in qui-1 mutants. It is worth noting that in N2 controls, ADL does not display O2-evoked neurosecretion despite showing measurable Ca2+ responses.

      The work includes a transcriptomic analysis comparing ADL-specific gene expression between wild type and the qui-1 mutant. Unlike other experiments in the study, in which the specific effects of mutations were confirmed through rescue experiments and the use of additional alleles, thus eliminating potential confounds with background mutations, the transcriptomic experiment did not apply such controls. Therefore, it is hard to conclude whether the reported changes in transcription are due solely to the qui-1 mutation or to other unrelated genetic modifications in the mutant strain.

      We worried about unspecific effects of background mutations both on the ADL transcriptome and on other qui-1 related phenotypes. We regret we did not explicitly address this point in our initial submission. To remove background mutations, mutants isolated in our screen, including qui-1, were backcrossed with the N2 laboratory strain a minimum of four times. These qui-1 animals were further crossed into a 5 times outcrossed line that expresses the fluorescent protein mKate specifically in ADL, to generate the strains from which we sorted ADL neurons by FACS. Mutant and transgenic strains were outcrossed using the N2 laboratory strain. We explain this in the Methods section of the revised manuscript.

      The extensive outcrossing make us confident that the large majority of differentially regulated genes between wild type and qui-1 samples in ADL are due to the absence of qui-1. Supporting this, both mutations in neuropeptide receptors identified by our profiling, npr-22 and tkr-1, suppress ADL’s elevated neurosecretion. Nevertheless, we have added a note to explicitly bring up the concern raised by our reviewers, that some transcriptional differences could be the result of background mutations.

      Overall, except for where mentioned, the data presented are solid and consistent. However, the conclusion that the study reveals a molecular pathway for cross-modal plasticity is less convincing. The chain of events does not include some form of sensory loss, leading to subsequent, independent neural plasticity, as expected for cross-modal plasticity. Rather, a very broad genetic switch is described that can simultaneously change receptor abundance and neuropeptide release. Thus, an equally interesting and more coherent framing of the data could be that the study uncovered a genetic regulator, yet to be fully characterized, of oxygen-dependent behavior in a non-oxygen sensing neuron, adding to previous literature on neural circuit cross-talk.

      We are grateful to the reviewer for her/his thorough and critical analysis of our work, which has prompted us to perform additional experiments and helped us revise our manuscript. These additional data clarify our final interpretation of the data regarding cross-modal plasticity.

    1. SciScore for 10.1101/2021.12.23.21268325: (What is this?)

      Please note, not all rigor criteria are appropriate for all manuscripts.

      Table 1: Rigor

      <table><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Ethics</td><td style="min-width:100px;border-bottom:1px solid lightgray">Field Sample Permit: Data collection and analysis: Interviews were conducted via Microsoft Teams, Zoom, or phone.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Sex as a biological variable</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Randomization</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Blinding</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Power Analysis</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr></table>

      Table 2: Resources

      <table><tr><th style="min-width:100px;text-align:center; padding-top:4px;" colspan="2">Software and Algorithms</th></tr><tr><td style="min-width:100px;text=align:center">Sentences</td><td style="min-width:100px;text-align:center">Resources</td></tr><tr><td style="min-width:100px;vertical-align:top;border-bottom:1px solid lightgray">Interviews were recorded with consent, transcribed, anonymised and entered into Nvivo v12.</td><td style="min-width:100px;border-bottom:1px solid lightgray"><div style="margin-bottom:8px"><div>Nvivo</div><div>suggested: (NVivo, RRID:SCR_014802)</div></div></td></tr></table>

      Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).


      Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:
      Limitations: Despite best efforts, our recruitment strategy may have missed relevant voices, including those who are not computer literate. Our data must be interpreted with this in mind. Conclusions: In conclusion, the majority of young people described and concluded that for themselves, the benefits of vaccination did not outweigh the perceived risks to themselves. They did not consider themselves to be at risk of becoming seriously ill from COVID-19 and did not think that the vaccination was capable of protecting those around them. This, combined with concerns about the safety of the vaccine, resulted in reluctance to be vaccinated at present. Perceptions of risks and benefits were influenced by participants’ age and health status, trust in government, understanding of science, and pre-existing ideas and expectations. Participants were unsure who they could and could not trust and were resistant to attempts that were viewed as coercive. In order to promote uptake, public health campaigns should focus on the provision of information from trusted sources that carefully explains the benefits of vaccination and addresses safety concerns more effectively. To overcome inertia in people with low levels of motivation to be vaccinated appointments must be easily accessible (both in terms of location and timing). Research now needs to identify how to communicate risks (from COVID-19 and vaccination) and benefits (for the individual and population) so that people can make informed pe...

      Results from TrialIdentifier: No clinical trial numbers were referenced.


      Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


      Results from JetFighter: We did not find any issues relating to colormaps.


      Results from rtransparent:
      • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
      • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
      • No protocol registration statement was detected.

      Results from scite Reference Check: We found no unreliable references.


      <footer>

      About SciScore

      SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.

      </footer>

    1. Here's the prompt for the final assignment.

      The final piece of writing is a critical reflection about your work that uses the ideas and texts in the course to support, and highlight, your ideas.

      This piece will also be published in SCALAR for future use; that is, the final piece should say something about how you've encountered the material in the course so as to instruct future users/readers. Your writing—your readings of the texts; your telling of your experience with ideas—will be great guides for students.

      Think broadly, first. Ruminate on the following and record it for yourselves somewhere (I use a black notebook for this and write longhand; no technology, other than a Sharpie, for thinking—that works best for me):

      Explain and describe what the texts help you see and understand, even if this means further confusion, or the creation of more questions that are yet unanswerable for you. Here, begin to not rely on invisibility and slow violence; think beyond these very large themes.

      How have these texts helped you delve deeper into the questions of environmental justice—and expanding the meaning, on confusing it, making it, perhaps, more nuanced?

      How does environmental justice affect your view of what you need to examine about your goals going forward?

      This preamble, the questions, should be seen as lofty goals to guide your thinking and your writing. More specific "how-to" is below.

      The GOAL of this piece is to describe, discuss, and even argue ideas that will flow smoothly into an answer to the following question: Knowing what you know now, how are you going to approach the rest of your life?

      This question doesn't have a definitive answer; it's about perspective, point of view, attitude. It's also about responsibility — the ability to respond: where will you find the space(s) necessary to give yourself enough time to respond to what comes your way—unexpectedly? how do you do this? how will your education help, with examples, referring to texts, in this course and others you've encountered, that have affected the way you think? after summarizing the texts in this course, which singular text, which one of these, is something you know you'll take with you, meaning you know that it has affected you and the ideas found in this text are important guides? how does this text help you see yourself better?

      I taught an FYS back in the fall of 2015 and the class had a similar assignment. Students asked that I too write the assignment and provide a model. You guys didn't take this course, but I'm sharing with you what I wrote (since published) (Links to an external site.) so that you can see a model for the work I'm asking you to do. Notice how I contextualize the texts, give the reading of each I need for my argument/description, and use this to describe a native characteristic of American culture. I'm not asking that you be as lengthy. I'm not expecting this sort of reading of American culture; these courses are different. I'm basically wanting to know what it is you see now that the course is over, or nearly over. This model (linked) is simply a sketch for you, and an outline that allows you to see a way into the work, and a way through—a way to organize.

      I want you to be creative about your approach. You can tell a story and use the texts, for instance. You can use your experiences as a way through this and use the texts. And so on ...

      I have created a SCALAR PAGE (Links to an external site.) for each of you.

      I want you to consider following these guidelines for writing:

      Go back to your mapping exercise: How did your plan turn out? Where are you now? [this is something we will have already spoken about in our f2f meetings, so you want to have notes from that]. This shouldn't be written, When I look at my mapping...or I said in my mapping that ... Rather, it should be something along the lines of, My writing interests in this course suggest (ideas + support from essays) ... or A central focus of my essays (or thinking) has been ... (examples)...or Engaging the texts in this course, I started to think about ... (examples) ... I thought ... and now I'm thinking that ... (examples)  — This section should be short (no more than a tight paragraph) and strategically placed.
      As preparation for writing, without looking at the texts, just referring to their titles for inspiration, see what you recall: write out a paragraph or so about each text encountered in the course—this is for your own use and a way to organize before writing; this is done to determine what you recall, which is important and what you should focus on because that's instinct talking; eventually open the texts to make sure you have examples for citing [any material from outside the syllabus you wish to cite is fine, too, especially since I've sent you a lot of reading from the popular media].
      Find a central idea or theme you want to explore. Set it down somewhere so you can see it and read it back to yourself. Yes, there will be the tendency to speak about invisibility and slow violence, I get that; however, these should be ideas that help illustrate a central idea or theme that's your very own and based on what your reading of the courses' texts tells you about the nature of society, as it is now, the challenges we face, and, definitely, where you're situated, in the texts and the challenges we face. Thus, invisibility and slow violence should not be the central ideas/themes of your work, rather instruments/conditions/truths you found along the way and you're using these to pry open a deeper, richer understanding of your relationship to these ideas and environmental justice.
      Write a draft and start sharing it with your group. Likewise, you want to make sure that you and I sit with your piece, letting me comment on it before it's due  (even numerous times) so that we have the piece you want. You definitely want to conference with me about your piece so I can help you get as deep as possible into the subject—in other words: making sure each of you writes something you're really moved by and proud of.
      Since this is on SCALAR, make sure you have relevant images, links, where appropriate and necessary, and any other media (clips, sound, etc), you may wish to insert creatively to lift your piece.
      
    1. Author Response

      Reviewer #1 (Public Review):

      There are very few studies on the spatial integration of color signals of V1 receptive fields, which is a striking gap in knowledge given the importance of color to primate vision and the powerfulness that spatial analysis of luminance contrast integration has proven for understanding how V1 works. This paper helps fill this major gap in knowledge. The main take home is that double opponent cells and simple cells are more likely to be linear in how they integrate signals across their receptive fields than a sample of non-double-opponent/non-simple cells. This conclusion is consistent with the limited data presently in the literature, and I wonder if further analysis of the rich dataset could uncover some deeper insights.

      We thank the reviewer for highlighting the gap in knowledge that our study helps to fill and for the excellent suggestions for ways to improve the manuscript. In response to both reviewers, we have conducted new analyses that uncover deeper insights into signal integration in V1. These new analyses have been incorporated into the revised manuscript.

      Reviewer #2 (Public Review):

      De and Horwitz deploy a focussed technique for testing the linearity of spatial summation for V1 neurons with spatial opponency, with the emphasis being on the properties of cells that encode chromatic information in a spatially opponent manner - so called double opponent cells. The technique isolates non-linearities of summation from non-linearities that occur after summation, by using an adaptive procedure to home in on stimulus contrasts in different color directions that produce a pre-defined criterion response. The authors conclude that many (but not all) double opponent cells embody linear spatial summation, and discuss implications for our understanding of the cortical circuitry that mediates color vision. The data appear carefully collected and generally well-analyzed. There are some points, elaborated in broad strokes below, where I think the paper would benefit from further elaboration of the data and its implications, and the paper would also benefit from some revisions to improve clarity.

      How are results affected by the cell classification criteria? The authors apply criteria to sort cells into four classes: simple, double opponent, NSNDO, and those not studied further. Response properties are then studied as a function of cell class. Criteria for classification include presence/absence of spatial opponency revealed by the pixel white noise measurements and the adequacy of a linear STA to describe the hyperpixel white noise data. I think more work is needed to clarify for the reader the extent to which these criteria, in and of themselves, affect the results for each class studied. In particular, if a linear STA describes the hyperpixel white noise data, shouldn't we then expect to find linear summation in the spatial receptive field in that same hyperpixel white noise data? I understand, as the authors point out, that the Phase 3 measurements could reveal failures of spatial summation not seen in the hyperpixel white noise data. But I'm a bit perplexed by the outliers in the NLI indices in Figure 3D. What properties of these cells allow a linear 6D STA to handle the hyperpixel white noise data well, but cause them to summate over space non-linearly for that same hyperpixel white noise data? In terms of the new information provided by the Phase 3 measurements, I wasn't able to get a sense of how much harder these stimuli were driving the cells than the Phase 2 measurements. It seemed like this was the intent of Figure 2 - Figure Supplement 1 and Figure 3 - Figure Supplment 1, but those two figures in the end didn't provide this information in a manner I could digest. Absent this, it was hard to tell how much more we are learning from the Phase 3 data. Could the higher NLI's here than in Phase 2 be a consequence of some stimuli but not others driving the neuron into saturation? And although the authors write on page 15 "Nevertheless, we found that nonlinearities detected in Phase 2 of our experiment were a good indicator of nonlinearity over the greater stimulus duration and range of contrasts in Phase 3, principally for the NSNDO cells (Figure 3E)", those correlations look very weak to me. I was left hoping for a better understanding the commonalities and differences in the data between Phases 2 and 3. I'm also not sure of the reliability of the measured NLI's for each cell with each method. Can anything more be provided about that? I note here that I did study the section of the discussion that nominally addresses some if these issues, and that my comments above remain after that study.

      The Reviewer brings up several important points that are addressed individually below.

      The revised manuscript is more explicit about the role of the cell classification criteria on the results. Particular emphasis is placed on the role of the spike-triggered covariance criterion in enriching the pools of simple cells and DO cells with neurons that are approximately linear.

      We agree with the Reviewer that, if a linear STA describes the hyperpixel white noise data well, we expect to find linear summation in the spatial receptive field in that same hyperpixel white noise data analyzed in other ways. A critical question is “does the STA describe the white noise data well?”. We address this question in two ways in this report: with an analysis of (the statistical significance of) the first principal component of the hyperpixel spike-triggering stimuli (PC1) and with a comparison of GLM and GQM fits to the hyperpixel white noise data (the white noise NLI). These analyses are related but are sensitive to different types of departure from linearity.

      Consider a neuron whose output is the product of two half-wave rectified linear subunits (see Figure 2 – Figure Supplement 5). Such a neuron would have a large white noise NLI due to the non-linear interaction between the subfields, but it would lack a significant PC1, because the nonlinearity tightens the distribution of excitatory stimuli, and the PC1 is the dimension along which the stimulus distribution is widest. In principle, such a nonlinearity would manifest in the smallest principal component, but in practice, small PCs often resemble the STA, which complicates their interpretation.

      Conversely, a neuron can have a significant PC1 but a small NLI. For example, consider a neuron that has a half-wave rectified response to modulations of one color channel but a full-wave rectified response to another. Such a neuron will have a significant PC1 due to the full-wave rectification, but an NLI near zero, because this nonlinearity is hidden once the stimuli are projected onto the STA (recall that the white noise NLI is computed from a pair of 1-D projections not the original 6-D representation). Code simulating these hypothetical neurons (used to produce Figure 2 – Figure Supplement 5) is available at GitHub (https://github.com/horwitzlab/Chromatic_spatial_contrast).

      The original submission lacked documentation of the difference in firing rates produced during Phases 2 and 3. We have added a new supplementary figure that quantifies this difference (see Figure 2 – Figure Supplement 2). Figure 2 – Figure Supplement 1 illustrates the range of inputs provided in Phases 2 & 3. This has been clarified in the revised text.

      Please note that the data shown in Figure 3D are isoresponse NLIs (that is, NLIs computed from responses recorded during Phase 3 of the experiment) not white noise NLIs (NLIs computed from the hyperpixel white noise shown during Phase 2 of the experiment). This has been clarified in the revised text.

      We agree that the correlation between the white noise NLI and isoresponse NLI measurements is weak. A full treatment of the differences in neural responses to the stimuli presented in Phase 2 & 3 is beyond the scope of this study. Nevertheless, we can think of several reasons that some neurons may have appeared more nonlinear in Phase 3 than they did in Phase 2. The first is, as suggested above, Phase 3 stimuli had higher contrast than Phase 2 stimuli, and are more likely to have engaged nonlinear gain control mechanisms upstream or within V1. Second, the linear and nonlinear models in Phase 2 had 3 and 6 parameters, respectively, but 2 and 5 in Phase 3, and this may affect the ratio of prediction errors. Third, nonstationary responses are expected to affect isoresponse NLIs more severely than white noise NLIs, because of the sequential way that isoresponse points were measured in Phase 3.

      Assessing the reliability of NLIs within cells is challenging because of the crossvalidation that is built into the definition. To address this comment, we used a jackknife procedure that quantifies the spread of NLIs computed from each of the data partitions used in the cross-validation.

      Implications of the results for models. As the authors summarize in their introduction, the motivation for testing the linearity of spatial summation is that the results can guide how we formulate response models for V1 chromatically sensitive cells. More discussion of this would be helpful. As an example, could cells with the nonlinear spatial filtering as shown in Figure 1C be classified as DO, making them relevant to the focussed tests applied in this paper? Or are they necessarily NSNDO? More generally, can the authors spend a little time discussing what classes of response models they would pursue for DO cells that do/don't show linear spatial summation, and for NSNDO cells that do/don't show linear spatial summation. Such discussion would tie the results of the primary data back to the motivating question in a more satisfactory manner, I think. Such discussion could also be used as a vehicle to discuss what the authors think about the DO cells that fail to show linear spatial summation and the NSNDO cells that do, something I found under-treated in the results. As with the comment above, I did read the sections of the paper that speak to this question, but still find it that it would benefit from going deeper.

      Inspired by this comment, we have added a new section to the Results that considers response models for neurons that do not show linear spatial summation. Specifically, we test the model illustrated in Figure 1C and reject it for many neurons. Figure 1C depicts a neuron that integrates inputs linearly within each subfield but nonlinearly across subfields. Within each RF subfield, therefore, this neuron conforms to a linear-nonlinear cascade model. Critically, during Phase 2 of the experiment, the stimulation at one RF subfield can be considered as an additive noise with respect to the signal generated by the other RF subfield. This is because the influences of the two RF subfields combine additively (under the model) and the modulations of the two hyperpixels are independent.

      To test this model, we compared GLM and GQM fits as we did in the analysis of the white noise NLI. The regressors in this analysis were the modulations of the three color channels from a single subfield. These GLMs fit the data systematically worse than GQMs as assessed by cross-validated prediction error. This result indicates that the nonlinearity of the NSNDO cells is unlikely to be a result of nonlinear combination of inputs from two linear RF subfields, as postulated by the model in Figure 1C. Instead, for many NSNDO neurons the nonlinearity appears to arise from nonlinear combinations of signals within individual subfields. We mention in the Discussion that linear DO cells may lie on a continuum with some NSNDO cells.

      Color properties of subfields. The study measures detailed properties of cells that show at least two distinct subfields in the initial pixel white noise analysis. The paper focuses on whether signals from such subfields are combined linearly before any downstream linearities. However, there is another feature of the data that seems central to understanding these cells, and that is what the chromatic properties of these subfields are, and how strong in the data the constraint that the chromatic properties of the two separate subfields be complementary is. It is stated in passing (page 7) that "the two sides of the hyper pixel STA were complementary or nearly so", but it would be nice to see this treated in more detail and also to understand whether there are differences in the distribution of the chromatic properties of the two sides between the DO and NSNDO cells, and between cells with low and high non-linearity indices.

      We have added new section on the chromatic properties of the subfields of the neurons we studied (Figure 2 – Figure Supplement 3).

    1. Author Response

      Reviewer #1 (Public Review):

      Summary:

      In this work, the authors develop a tool for personalising prostate cancer treatment using a Boolean model. The model is extremely complex and describes the regulation of invasion, migration, cell cycle, apoptosis, androgen and growth factors signalling in prostate cancer using 133 nodes (genes and our metrics) and 449 edges (regulation pathways. Using their model, they were able to grade the effect of combined treatments for each of the 488 patients for already-developed drugs and find several genes suitable for intervention in most of the 488 patients. The predications from their model could help develop a patient-tailored treatment that could boost success of pancreatic cancer treatments in clinical practice.

      Strengths:

      The authors clearly achieved their aims of predicative prostate cancer modelling and have added value to the field of prostate cancer personalisation.

      Calibrating and then validating predications of a model, as this work does, is a fundamental part of systems biology and mathematical modelling. By using a cell line to investigate predictions that AKT is the top hit for prostate cancer, validates the utility of their model and also shines a light on how useful models like this can be in oncology. The methodology in this paper provides a guide for future modelling work in this area.

      Providing a detailed Supplementary Information and additional links to the code and fundamental modelling platform publications, helps to provide readers with a tool that may be applied in other settings. However, while this is a strength of the publication, the model is extremely complex and relies heavily on readers spending time comprehending pre-published work and doesn't provide a single contained body of work.

      The methodology they are presenting could have significant impact on the field of cancer treatment, but would need to be testing clinically to validate that personalising treatment in this manner does improve outcomes.

      We thank the reviewer for these comments.

      Weaknesses:

      While it is a strength of this work that such a detailed, and complex model is developed for prostate cancer, and that the code is provided, the weakness of this work is that the model is not easily accessible, and a lot of the techniques used in model development feel brushed over. The work relies heavily on other works and does not provide detailed descriptions of the underlying algorithm, requiring readers to absorb knowledge from our places. This could be a challenge if an experimentalist wishing to implement this methodology in a different cancer treatment.

      We have summarised the main techniques on which this work relies upon in a dedicated section in the Supplementary Material (Appendix file) by describing small introductions to Boolean modelling, MaBoSS stochastic approach to Boolean models, and PROFILE methods.

      We have also provided the codes to reproduce the figures and the analyses. We tried to comment on the code files (e.g., Jupyter notebook) as much as possible to facilitate their use in different contexts.

      The protein/genes in the model are not presented in a way that it can be easily validated as such, the complexity of such a Boolean model comes into question.

      We have listed all the proteins/genes of the model in SuppFile 1 with references for all the interactions of the network.

      For transparency, we have also described in the Appendix how we used information from all the different sources to construct the model in the section "Prior knowledge network construction".

      How sure are we in the model predications and are there are any potential weaknesses to modelling the network in such an extensive manner? For such a model like this, it is crucial to demonstrate its sensitivity to initial conditions and node additions/removals so some work could be done to demonstrate this so that the readers have an idea of how many over/under predications there might be in the model.

      For the sensitivity to initial conditions, we have tested some of them on the generic model in the Jupyter notebook (provided in the supplementary files) but have not done it systematically. The table of all the stable states can be computed exactly as it is done in the notebook (2460 fixpoints are found), and the simulations of MaBoSS clearly show that the proportions of some solutions (probability of model states) change depending on which input is ON. We have tested some conditions: all inputs random, all inputs at 0, growth factors ON (EGF, FGF, Androgen, and Nutrients ON), death signals ON (Carcinogen, Androgen, Acidosis, Hypoxia and TNFalpha ON) leading to very different outputs (Figure 3 for LNCaP and S22 for all 8 prostate cell lines). In fact, the MaBoSS simulation with all inputs random shows the existence of all possible, stable states as it explores the whole state transition graph: for all nodes, 50% of the trajectories will start at 0, and 50% will start at 1. Similarly, we tested the effect of some mutations on the generic model (e.g. mutation of p53, which reduces the probability to reach apoptosis). The aim of these simulations was to test the overall coherence in the model behaviour vs biological evidence as a first validation.

      As for automatic removal and addition of new nodes to assess the importance of each of them, we would recommend against it. Indeed, the model was built from the knowledge extracted from the literature, from databases (cf. Omnipath), discussions with experts, and results from data analyses. Removing nodes would mean that some nodes are considered less important, and adding new nodes would mean that some new findings were found that would justify a new addition.

      In addition, in this work, we need to balance the robustness of the model with the flexibility of being used to cover the different cell line personalisations. Thus, we do not want a highly robust wild type model that has extremely robust, few stable states but is unable to capture the different cell lines specifics. Nevertheless, we have partially covered this with our "High-Throughput mutant analysis of the LNCaP model" section in Appendix file (Section 6.1), where we study all the perturbations on one node and combinations of two nodes, let them be knock-outs (where a node is forced to be 0 throughout the simulation) or overexpression (forced to be 1). By using this analysis, we wanted to identify the fragility points of the mutants' models, but we did not perform this test to have a thorough robustness analysis. In any case, we found varying effects of these perturbations on the phenotype scores, and double perturbations having a greater effect than single ones.

      Finally, we have performed a perturbation on the stability of the logical rules. We have changed one and two logical gates from each logical rule of the LNCaP model and studied the effects on the phenotype scores. In short, we have changed an AND in OR and vice versa in each logical rule (level 1 with 372 simulations) or twice in the same rule (level 2 with 1263 simulations).

      Overall, we see that all of the most probable phenotypes are very robust to this kind of perturbation. Even the less stable phenotype, Invasion-Migration-Proliferation, only has ~3% of either level 1 or 2 perturbations that reduce this phenotype's probability to zero (Appendix File, Figure S30). Most of these perturbations were focused on HIF1, AR_ERG and p53 nodes (Appendix File, Figure S31).

      We added a sentence in the Methods section to explain this: "In addition, we found that the LNCaP model is very robust against perturbations of its logical rules, by systematically changing an AND for an OR gate or vice versa in all of its logical rules (Appendix File, Section 6.2, Figure S30 and S31)." and added Section 6.2 to the Appendix file titled "Robustness analysis of the logical model".

      As they test so many drugs and combination regimes it is also hard to extract information about which key drugs should be repurposed. It could be useful to the readers to have this spotlighted more in the model so that it is easily discernable.

      The complete study on the inhibition of all nodes of the LNCaP model can be found in the supplemental information (SuppFile 6 and Appendix file, Section "High-Throughput mutant analysis of the LNCaP model").

      Because of the size of the model, we chose to filter the full list of nodes with the list of existing drugs and their targets. Thus, Table 1 gathers the drugs we discuss in this article along with the node that they target. We also studied a selection of combinations of drugs, as depicted in Section "Experimental validation of drugs in LNCaP" of Results. In that section, we focused on the combinations that reduced Proliferation and/or increased Apoptosis. For completeness sake, we provide all the combinations of all the drugs from Table 1 in Appendix File, Figures S34 and S35, and their Bliss score in Appendix File, Figures S36 and S37. Furthermore, the code to reproduce these in our GitHub repository: https://github.com/ArnauMontagud/PROFILE_v2/blob/main/Gradient%20inhibition%20of%20nodes/data_analysis.R

      We could have identified the nodes from Table 1 on the figure of the network (main text Figure 1), but we decided against it because the figure is already hard to read, and colours were added to specify the signalling pathways that are included.

      Suggestions:

      Another way to validate the cohort level predications could have been to examine the efficacy of the predicted personalised protocols, or sensitive parts of the Boolean network, in a new prostate cancer patient cohort. Do we see the same sensitive pathways if we examine a different cohort of prostate cancer patients?

      We thank the reviewer for this suggestion. Indeed we are working on using this pipeline in other cancers and in other studies.

      One of the topics that we think can facilitate the use of this methodology is on optimising its runtime and portability. Thus, we are currently working on having a containerised, HPC semi-automatic workflow to reduce the time and optimise the efforts to get results using (almost) any published model and (almost) any omics data.

      In terms of the reproducibility of the results and as we say in the discussion section of the main text, there is a kind of effect size on this type of study. You may find that for a specific patient, their conclusions are not in line with what is expected, but when you analyse at the level of groups of patients, these outliers dampen off.

      Reviewer #2 (Public Review):

      Montagud et al. present a very successful experiment - modeling feedback loop: the authors develop a Boolean model of the major signaling pathways deregulated in prostate cancer, use molecular data from patient samples to personalise this model, use drug response of cell lines to validate the model, predict 15 actionable interventions based on the model, and test nine of these interventions, confirming four.

      The premise of the work is well-supported by prior work by the team and the wider community. The methods are sound, well integrated and thoroughly documented, with one notable omission. The process through which the logic functions of the nodes were determined/decided is not described. The Appendix file indicates "The model is completed by logical rules (or functions), which assign a target value to each node for each regulator level combination.". The interested reader would want to know what information is used and what considerations are the basis of these assignments, and what would change if an assignment were different.

      The manuscript makes a number of testable predictions of actionable single and combinatorial therapeutic interventions for prostate cancer. Equally important, the combination of information and methodologies used in this paper offers a roadmap for future development of predictive and personalised models. Such models are much needed in precision oncology.

      We thank the reviewer for these encouraging comments.

      Reviewer #3 (Public Review):

      This paper tries to establish a model for drug (and combination) selection for individual prostate cancer patient based on a prior signal network knowledge base and genomic/transcriptomic profiling data. This is of great clinical potential. However, whether this approach could be robustly applied in clinic is not validated. Limited validation using cell line is provided. Most tumors have complex structure including tumor cells and surrounding microenvironment. The model is mainly built from onco-signaling pathways. The contribution of microenvironment including immunity is unclear.

      The focus of this model is intracellular only. We explored the interplay between signaling pathways that may be linked to tumorigenesis. We only consider the microenvironment effect as indirect and in no way comprehensive. For instance, we have not considered any immune cells or the effect of the metabolism.

      Nevertheless, we are building on top of this work a multiscale model where we can include different cell types, such as immune cells, and drug-related pharmacodynamics.

    1. Author Response:

      Reviewer #2:

      In this study, Russell et al. combined T cell receptor (TCR) repertoire sequencing data with SNP array genotype data to infer genetic polymorphisms which impact upon the process of TCR generation. Using these data, the authors looked for loci with polymorphisms which associate with different V(D)J recombination probabilities, i.e. sites in the genome which impact upon the chances of TCRs with different properties being produced when they change.

      Beyond the expected sites in the TCR and MHC loci, the authors observed strong associations with distant sites. One was with DCLRE1C, which encodes Artemis, the endonuclease responsible for cutting the TCR loci during recombination, while the second was DNTT, the site encoding the enzyme Tdt, which is responsible for addition of nucleotides to cut V(D)J during recombination. This is the first time that such SNP associations have been described to my knowledge, and yet make perfect sense: DCLRE1C variations were associated with the amount of trimming V and J genes underwent during recombination, while DNTT polymorphisms associated with the number of inserted nucleotides. The authors also report, after assigning donors an associated ancestry based on clustering of their genotype data, that certain inferred ancestries associate with different TCR repertoire properties. In this analysis 'Asian-associated' TCR repertoires had fewer non-templated nucleotide insertions, along with a corresponding greater incidence of the DNTT polymorphisms associated with differences in insertions, relative to other groups.

      Strengths:

      This manuscript is exceedingly well written. Both the TCR biology and the statistical considerations of the genetic analyses are extremely complex topics, mired in arcane terminology, which often end up somewhat impenetrable to non-expert readers. However both have been introduced and explained with admirable clarity throughout, including the caveats and implications of analyses that would not be intuitive to many readers not already expert in both fields.

      As best I can determine, the analyses themselves are also extremely rigorous, with each step carefully taken and justified in the text, involving numerous corrections at multiple scales (e.g. for TCR productivity, TCR gene usage, specific TRDB2 genotype, population substructure, and more). The major findings have also been validated in a completely separate cohort, using a different analysis pipeline. While the authors point out that such genome-wide association efforts looking at TCR gene expression have been undertaken before, the major innovation presented here lies in applying those data to investigating specific V(D)J recombination probabilities. Thus the findings are novel, and the conclusions well supported by the data.

      The data visualisation have all been plotted in a sensible and easily interpretable manner. The majority of data themselves are all already publicly available, having been published in prior studies. The TCRseq data for the validation code has been assigned a BioProject accession, which I presume will go live at the time of publication. The code is also appropriately hosted on Github, and are mostly adequately commented and documented enough so as to be repeatable.

      Thank you!

      Limitations:

      There are very few if any obvious technical limitations or weaknesses that I can see that are not intrinsic to the data themselves. While the authors do mention these limitations, I wonder if they should be devoted some more attention somewhere in the text of the manuscript; relatively few researchers are expert in both TCR biology and the technicalities of genome-wide association studies, so I think more explicit consideration of these issues would be helpful.

      We have expanded the section containing limitations of our approach within the discussion section. We hope this addition clarifies the intrinsic limitations of the data used here.

      In particular, I think the difficulty of studying these loci with standard techniques could be underlined, along with what implications that might have for this study. The highly repetitive nature of the TCR loci can certainly make any analysis looking at short sequences problematic, which has implications for both the TCRseq and genotyping aspects of this study. Combined with the fact that most studies focus on certain populations, polymorphisms in the TCR loci are very likely being relatively undersampled by the field (a hypothesis supported by the ongoing discovery of novel exonic polymorphisms in TCRseq data itself, e.g. as demonstrated in this pre-print by Omer et al. (https://doi.org/10.1101/2021.05.17.444409). The consequences of SNP polymorphism coverage in SNP arrays has already been considered for IgH (https://doi.org/10.1038/gene.2012.12): while this is an admittedly more polymorphic locus, the underlying causes of these issues are mostly all true of the TCR loci as well. Similarly, while the authors do appropriately point out that issues with V(D)J gene assignment could infer biases it may be worth noting that the TCRseq technology used to produce their main dataset uses relatively short read sequencing, that is unable to distinguish a substantial fraction of even known TCR gene- and allele-level diversity (see Fig. 1C of the Omer et al. pre-print). Thus there may be a whole dimension of TCR polymorphism that is not well captured by either platform.

      This is a great suggestion and we have added a section within the discussion to mention these limitations and their implications for both the SNP array and TCR repertoire sequencing data used here.

      Overall, I think this is an extremely considered and digestible study, which will be of great interest across and beyond the field. As the wider community comes to grips with how best to incorporate TCR and BCR polymorphisms into their analyses of the adaptive immune loci themselves (and how this might impact upon recombination, expression, and downstream immune functions) this serves as a timely reminder that we should not forget the polymorphisms elsewhere in the genome that might also be relevant.

      Thank you!

      Reviewer #3:

      In this manuscript, Russel et al propose an inference method to link genetic variations with TCR repertoire feature variations, based on observations from previous studies showing similarities at various level of the repertoire in monozygotic twins. To that end, they used a unique publically available dataset, which combines TCRb immunosequencing data as well as whole genome SNPs data. The method is elegant and sheds light on the importance of combining different type of data to better understand the complexity of TCR repertoire generation and selection. However, unfortunately, while their discovery data set provide some associations between SNPs and TCR repertoire features, they were almost unable to recapitulate the results with their validation dataset. The main reasons could be that the donor demographics are highly divergent between the two cohorts (81% Caucasian in the discovery vs. mainly Hispanic in the validation), the immunosequencing data were generated using RNA based method for the validation while the discovery dataset was obtained from gDNA templates and finally the SNPs array were discordant between the two datasets. Nonetheless, the approach and the study deserve attention and might be improved by additional experiments or analyses and by providing additional information.

      Thank you for your review. We would like to emphasize that the validation results reported here are as good as one might expect given the small sample size of the validation cohort (94 individuals) and the discordance between the discovery and validation SNP sets. The overlap between the discovery cohort and the validation cohort SNP sets consisted of just two significant SNPs, one within the gene encoding the Artemis protein (DCLRE1C) and the other within the gene encoding the TdT protein (DNTT). This DCLRE1C SNP (rs12768894, c.728A>G) was strongly associated with the extent of V-gene and J-gene trimming in the discovery cohort, and we were able to successfully validate this finding within the validation cohort. Specifically, this DCLRE1C SNP was significantly associated with the extent of J-gene trimming in productive TCRalpha and TCRbeta chains and V-gene trimming of both productive and non-productive TCRalpha and TCRbeta chains within the validation cohort. The overlapping SNP within the DNTT locus (rs3762093) was only weakly associated with the extent of N-insertion within the discovery cohort, and as such, it was not surprising that this SNP only reached statistical significance for one of the N-insertion types (productive TCRalpha rearrangements; note that due to the lack of the D gene, N-insertion annotations are likely less noisy on the TCRalpha locus). Despite our inability to replicate all N-insertion associations, we noted that the model coefficients for rs3762093 genotype were in the same direction (i.e., the minor allele was associated with fewer N-insertions) for all N-insertion and productivity types within the TCRbeta chains for both cohorts.

    2. Reviewer #2 (Public Review): 

      In this study, Russell et al. combined T cell receptor (TCR) repertoire sequencing data with SNP array genotype data to infer genetic polymorphisms which impact upon the process of TCR generation. Using these data, the authors looked for loci with polymorphisms which associate with different V(D)J recombination probabilities, i.e. sites in the genome which impact upon the chances of TCRs with different properties being produced when they change. 

      Beyond the expected sites in the TCR and MHC loci, the authors observed strong associations with distant sites. One was with DCLRE1C, which encodes Artemis, the endonuclease responsible for cutting the TCR loci during recombination, while the second was DNTT, the site encoding the enzyme Tdt, which is responsible for addition of nucleotides to cut V(D)J during recombination. This is the first time that such SNP associations have been described to my knowledge, and yet make perfect sense: DCLRE1C variations were associated with the amount of trimming V and J genes underwent during recombination, while DNTT polymorphisms associated with the number of inserted nucleotides. The authors also report, after assigning donors an associated ancestry based on clustering of their genotype data, that certain inferred ancestries associate with different TCR repertoire properties. In this analysis 'Asian-associated' TCR repertoires had fewer non-templated nucleotide insertions, along with a corresponding greater incidence of the DNTT polymorphisms associated with differences in insertions, relative to other groups. 

      Strengths: 

      This manuscript is exceedingly well written. Both the TCR biology and the statistical considerations of the genetic analyses are extremely complex topics, mired in arcane terminology, which often end up somewhat impenetrable to non-expert readers. However both have been introduced and explained with admirable clarity throughout, including the caveats and implications of analyses that would not be intuitive to many readers not already expert in both fields. 

      As best I can determine, the analyses themselves are also extremely rigorous, with each step carefully taken and justified in the text, involving numerous corrections at multiple scales (e.g. for TCR productivity, TCR gene usage, specific TRDB2 genotype, population substructure, and more). The major findings have also been validated in a completely separate cohort, using a different analysis pipeline. While the authors point out that such genome-wide association efforts looking at TCR gene expression have been undertaken before, the major innovation presented here lies in applying those data to investigating specific V(D)J recombination probabilities. Thus the findings are novel, and the conclusions well supported by the data. 

      The data visualisation have all been plotted in a sensible and easily interpretable manner. The majority of data themselves are all already publicly available, having been published in prior studies. The TCRseq data for the validation code has been assigned a BioProject accession, which I presume will go live at the time of publication. The code is also appropriately hosted on Github, and are mostly adequately commented and documented enough so as to be repeatable. 

      Limitations: 

      There are very few if any obvious technical limitations or weaknesses that I can see that are not intrinsic to the data themselves. While the authors do mention these limitations, I wonder if they should be devoted some more attention somewhere in the text of the manuscript; relatively few researchers are expert in both TCR biology and the technicalities of genome-wide association studies, so I think more explicit consideration of these issues would be helpful. 

      In particular, I think the difficulty of studying these loci with standard techniques could be underlined, along with what implications that might have for this study. The highly repetitive nature of the TCR loci can certainly make any analysis looking at short sequences problematic, which has implications for both the TCRseq and genotyping aspects of this study. Combined with the fact that most studies focus on certain populations, polymorphisms in the TCR loci are very likely being relatively undersampled by the field (a hypothesis supported by the ongoing discovery of novel exonic polymorphisms in TCRseq data itself, e.g. as demonstrated in this pre-print by Omer et al. https://doi.org/10.1101/2021.05.17.444409). The consequences of SNP polymorphism coverage in SNP arrays has already been considered for IgH (https://doi.org/10.1038/gene.2012.12): while this is an admittedly more polymorphic locus, the underlying causes of these issues are mostly all true of the TCR loci as well. Similarly, while the authors do appropriately point out that issues with V(D)J gene assignment could infer biases it may be worth noting that the TCRseq technology used to produce their main dataset uses relatively short read sequencing, that is unable to distinguish a substantial fraction of even known TCR gene- and allele-level diversity (see Fig. 1C of the Omer et al. pre-print). Thus there may be a whole dimension of TCR polymorphism that is not well captured by either platform. 

      Overall, I think this is an extremely considered and digestible study, which will be of great interest across and beyond the field. As the wider community comes to grips with how best to incorporate TCR and BCR polymorphisms into their analyses of the adaptive immune loci themselves (and how this might impact upon recombination, expression, and downstream immune functions) this serves as a timely reminder that we should not forget the polymorphisms elsewhere in the genome that might also be relevant.

    1. Author Response

      Reviewer #1 (Public Review):

      The recordings done by the authors are impressive and rare, and I appreciate the efforts of the authors to bridge very different types of signals that are generally recorded in different paradigms. However, the analysis at many places is quite nuanced and high-level, making it difficult to directly compare these findings with previous results. I think several additional analyses are needed to properly place these findings with previous results.

      1. Effects of attention in V4 generally start earlier (~100 ms). It is unclear why no effect is observed during earlier time periods in these data. To make better comparison with previous studies (such as Nandy et al., 2017), the authors should show the average PSTHs in supragranular, granular and infragranular layers during both target-out versus target-in conditions. Interestingly, Nandy and colleagues found largest changes in firing rates in the granular layer. To better understand the ERP outside the cortex, the authors should also show the average LFPs in the three layers, for target-in and target-out conditions. It is surprising that MI analysis reveals no significant information about the target in granular layer - given that some attentional effects are seen in upstream areas such as V1 and V2.

      We have created a new figure showing multiunit activity and LFP across the layers in both attention conditions. It is included here for convenience. Accompanying text has been added to the Results and Discussion sections to address the reviewers’ comments.

      The timing of differentiation between attended and unattended in the population spiking activity is evident in both MUA and LFP. We note that the largest magnitude difference in population spiking between attention conditions was observed in the middle layers, consistent with Nandy et al., 2017. We wish to highlight two observations.

      First, with respect to the timing of attentional modulation, it should be noted that the attention task used in our study (pop-out visual search) is different from that used by Nandy et al., 2017, Neuron (cued change detection). The timing of “effects of attention” vary according to stimulus properties and task demands (the number of publications demonstrating this is too long to list). Hence, we do not expect equivalence between the times we measure and times Nandy et al. measure. Nonetheless we are happy to include the requested supplementary figure with that caveat in mind.

      Second, with respect to the surprising observation of a relationship between activity in the granular layer and the extracortical signal, we think it is important to remember that these information theoretic analyses are not simply correlational. That is, attentional modulation might be observed in both signals, but if the covariation of these signals trial-to-trial does not exist, then we would not expect a relationship in the mutual information analysis.

      1. Eye position analysis: my understanding is that the animals could make a saccade as soon as the arrays were displayed. Given that the main effect of attention is observed after ~150-200 ms, the potential effect of saccade preparation could be important. There could also be small eye movements before the saccade. Given that the RFs were quite fovial for one monkey and not too far from the fixation window, and the effect of attention appears to be quite late, detailed analysis of eye position and microsaccades is needed to rule out the possibility of differences in eye movements between target in and target-out conditions influencing the results. A timeline and some analysis of eye movement patterns would be appropriate. The authors should also clearly mention the mean and SD of the saccade onset.

      The reviewer makes a valuable observation. Saccades will influence the electrical signals, something we are quite familiar with (e.g., Godlove et al., 2011, J Neurophysiol). In an effort to combat this, we have two points worth noting. First, as was the case in the initial submission (which remains the same in the revision), we have clipped signals on a trial-by-trial basis prior to eye movements. By doing so, we cannot have an influence of the motor-related polarization of the task-demanded eye movement on the data.

      Second, we have prepared a microsaccade analysis – and accompanying newly added supplementary figure included here for convenience – to determine whether they might be driving the results. To do this, we identified trials where microsaccades occurred using a well-regarded microsaccade detection algorithm (Otero-Millan et al., 2014, J Vis). We then reperformed the information theoretic analysis across sessions after removing trials where microsaccades were detected. Briefly, we found that the information theoretic relationship persists in the absence of trials where microsaccades occurred. We believe this serves as evidence that microsaccades are not responsible for the information theoretic findings.

      To address the reviewer’s last point, we have included response time data (defined as the saccade onset latency) in the Results.

      1. Attention studies typically keep the stimulus in the RF the same to tease out the effect of attention from stimulus selectivity. Ideally, the comparison should be between the two green (or red) in RF conditions as shown in Figure 4A. However, these results are shown only after pooling across all color selective columns. This comparison should be shown from Figure 2 itself (i.e., Figure 2C should have green in the RF and red target outside).

      We have clarified prior to Figure 4 that we used a all trials including both colors in each of the attention conditions. That is, while the cartoon in Figure 2 shows only green-attended and red-unattended conditions, green-unattended and red-attended conditions were also included in this analysis. As the proportion of red-target and green-target trials was matched, this first analysis was designed in such a way that the influence of stimulus color should be minimized, yet all trials could still contribute to the calculation. We have included a new supplementary figure (included here for convenience) which is what we believe the reviewer requests. In this addition, we perform the information theoretic computation on only stimulus matched conditions. Briefly, we find that this approach does not seem to alter the temporal profile of information theoretic findings.

      1. Information has been well characterized in a large number of previous studies (generally yielding values between a few bits/s, see for example, Reich et. al, 2001, JNP). Here, the absolute value of mutual information seems rather low. This may be due to the way the information is computed. A discussion about these reasons would be useful for scientists interested in information-theoretic measures.

      We agree that the exact magnitude of our information theoretic analyses in curious. And while these methods have been widely characterized – they have not been characterized, to our knowledge, in relating intracortical laminar currents to extracortical field potentials. As such, we do not have a strong prior as to what we should expect magnitude-wise. We have expanded the discussion to note this observation and provide potential reasons as to why this might be the case. The conclusion being that further application of these methods to these datatypes is necessary to really gain a fuller sense of what should and shouldn’t be expected.

      1. Dependence on feature preference: The effect of spatial and feature attention is well studied. A multiplicative gain model of spatial attention would predict a larger increase in firing rates )and perhaps other signals such as CSD) for preferred versus non-preferred signals. Feature similarity gain model would predict the red preferring columns to increase their activity and green preferring columns to reduce their activity when the animal is attending to the feature red, irrespective of which stimulus is in the receptive field. Here, the task is a pop-out task which likely has both a spatial and feature attention component. The authors should discuss their findings in these contexts. Further, the authors should discuss whether their findings could just be a reflection of the magnitude of the change (which could be larger for preferred versus non-preferred stimulus). The information-theoretic measure should ideally not depend on the absolute magnitude, but these quantities often get biased in non-trivial ways based on the magnitude. Does information transmission depend on the magnitudes of firing rates/CSDs?

      The relationship of these findings to the specificities of attentional mechanisms and models is indeed intriguing. As the reviewer suggested, this task likely engages both spatial and feature attention – however, the design was not such that they can be disentangled wholly. We have added text to the Discussion to reflect this consideration. As for the potential influence of response magnitude changes on the information theoretic analyses – the exact parameters were chosen to mitigate concerns about magnitude. That is, we chose a uniform count binning procedure on the data which eliminates potential issues such as outliers driving relationships as well as the changes in variability associated with increases in magnitude. Moreover, the uniform count binning procedure results with states rather than magnitudes which again mitigates response-magnitude-driven effects.

      1. For columns that were not feature selective, is there an effect of attention? Does the magnitude of N2pc change depend on color selectivity? I think that should be the case based on Figure 4H and 4I, but a plot and/or some quantification would be useful.

      These questions have been addressed in a newly added supplementary figure as well as quantification in the Results. Briefly, we did find an effect of attention non-selective columns. Also, we found the magnitude of N2pc did not depend on color-selectivity of the intracortical recording. The results were reported as:

      “We also tested whether feature selective columns, on average, transmitted more information than their non-feature-selective counterparts. We found that feature selective columns, in all laminar compartments, transmitted significantly more information (Figure 4I) (two-sample t test: L2/3, p = 0.044; L4, p = 0.023; L5/6, p = 0.009). As such, we wanted to determine if this was due to a lack of attentional modulation in the non-selective columns. This was not the case, we observed that non-selective columns were modulated with attention. Attentional modulation was observed in both the CSD in L2/3 and L5/6 (one-sample t test: L2/3: t(64) = -6.01, p = 9.8e-8; L4: t(64) = -0.18, p = 0.86; L5/6: t(64) = 5.24, p = 1.9e-6) as well as across all layers in the population spiking activity (one-sample t test: L2/3: t(64) = 8.00, p = 3.7e-11; L4: t(64) = 9.66, p = 4.1e-14; L5/6: t(64) = 7.58, p = 1.8e-10) during the N2pc interval (averaged 150-190 ms following array onset) (Figure S6).

      Importantly, we tested whether the N2pc varied across sessions with or without color-selective columns sampled. We found no difference between N2pc polarization (150-190 ms after the array) between sessions with (n = 17) or without (n = 13) sampling of color selective columns (two sample t test: t(28) = -0.75, p = 0.46). This invariance is expected because extracortical EEG spatially integrates signals from multiple cortical columns.”

      Reviewer #2 (Public Review):

      Scalp ERPs are widely used in human neuroscience research to understand basic mechanisms of neural and cognitive function and to understand the nature of neurological and psychiatric research. However, this research is hampered by a surprising lack of research in animal models exploring the neural mechanisms that produce specific ERP components.

      Previous research by this research group identified a potential monkey homologue of the N2pc component, a neural correlate of the focusing of attention onto visual objects embedded in arrays of distractors. The present study took a giant leap forward by recording extracellular potentials from densely spaced arrays of electrodes (.1 mm spacing) on probes that extended perpendicular to the cortical surface. These electrode arrays made it possible to simultaneously record voltages throughout the different layers of a cortical column and convert these voltages into current source density (CSD, which isolates local synaptic current flow and minimize volume-conducted activity from other brain regions). In addition, simultaneously recorded voltage from an electrode just above the cortical surface was used as a proxy for scalp potentials. Scalp ERP recordings were also obtained from separate monkeys to measure the actual scalp ERPs and verify that an N2pc-like ERP was elicited by the task (a simple visual search task in which the monkey made an eye movement to the location of a color popout item).

      Very clear CSD was observed in V4 in both supragranular and infragranular layers that was stronger when attention was directed to the contralateral visual field than when attention was directed to the ipsilateral visual field, which is the hallmark of the N2pc component. Little or no such activity was observed in the granular layer (the primary recipient of feedforward projections). In addition, the effects were observed primarily when the column was selective for the target's color. An information theory analysis showed that these intracortical current flows contained significant information about the voltage measured on the cortical surface and the location of the target object.

      All of these results were clear and convincing. Moreover, the laminar and columnar analyses provide interesting new evidence about attention-related neural activity independent of any considerations about ERPs. The most challenging aspect of the study is to provide a solid link from the intracortical activity to the voltage on the cortical surface, and then to the monkey scalp ERPs, and finally to human ERPs. Toward that end, the present study relied entirely on correlational evidence, rather than experimental manipulations. That's quite appropriate for a first step, but it must be considered an important limitation on the conclusions that can be drawn. It would be wonderful if future research took the next step of providing experimental evidence.

      We appreciate the reviewer noting that this manuscript is a valuable step in linking attention-associated electrophysiological signals across species. We also recognize that there is much work to be done in this domain. As requested, we have added to the Discussion the limitation of this type of study as well as what should be considered valuable next steps in this program of research.

      There are also some troubling aspects of the existing evidence. The scalp ERP effect in this study and the prior work from this groups is a positive voltage over the contralateral hemisphere, whereas in humans the voltage is negative. This may well reflect the orientation of the relevant cortical surface in monkeys versus humans. However, the voltage on the cortical surface in the present study was negative contralateral to the target, not positive. Unless this opposite voltage on the cortical surface relative to the scalp reflects something about the reference site for the cortical surface electrode, then this makes it difficult to link the intracortical effects and cortical surface effects to the scalp ERP effects. Also, the CSD was negative in the upper layers and positive in the lower layers, again suggesting that the voltage should be negative contralateral to the target on the surface. Ironically, this polarity is what would be expected from the human brain, where a contralateral negativity is observed. The oddity seems to be the contralateral positivity in the monkey scalp data. Also, the cortical surface voltage exhibits a polarity reversal at approximately 180 ms, which is not seen in the intracortical CSD.

      One possible explanation for the discrepancy is that the scalp voltage likely comes from multiple brain areas besides V4. If, for example, areas on the ventral surface of the occipital and temporal lobes produce stronger scalp voltages than V4 under the present conditions, the opposite orientation of these areas relative to the cortical surface would be expected to produce a positive voltage at the scalp electrodes.

      The manuscript notes that multiple areas probably contribute to the scalp ERPs and argues that the pattern of intracortical CSD results obtained in V4 will likely generalize to those areas. That seems quite plausible. Moreover, the results are interesting independent of their link to scalp ERPs. Thus, the present results are important even if the scalp polarity issue cannot be definitively resolved at this time.

      We thank the reviewer for expressing that the results are important whether this polarity difference can be resolved. This is an interesting observation and quite important to consider carefully. First, it is worth reiterating that the referencing setup in our ‘10/20’ monkeys was different than that for the monkeys where intracranial recordings took place. Specifically, the 10/20 recordings were more similar to our previous reports of monkey EEG (e.g., Woodman et al., 2007, PNAS; Cohen et al., 2009, J Neurophysiol; Purcell et al., 2013, J Neurophysiol). Recordings from these monkeys used either a frontal EEG electrode (approximately FpFz) or linked ears for referencing. These yielded the positive-going N2pc and contrast the negative-going N2pc found in humans. The V4 laminar recordings – and their accompanying extracortical signal – used a different referencing setup that we believe is the most likely candidate for the observed difference. Specifically, these recordings used a tied ground-reference setup which incorporated the support rod of the linear multielectrode array. This support rod extended into the brain meaning we had a neural tissue grounded signal and that the reference spanned the neural generator. Therefore, if we are not measuring both sides of the electric field across the generator equally, we might observe an inverted signal. Unfortunately, we cannot observe the 10/20 EEG distribution with an intracranial reference. Ideally, this could be resolved by an experiment where referencing setups are tested before and after performing craniotomy with a series of reference locations used to understand where exactly this flipping of polarization takes place. We have added this consideration to the Discussion and more thoroughly detailed the referencing setups in the Methods.

      There are also some significant concerns about the filters. The high-pass cutoff was high enough that it could have produced artifactual opposite polarity deflections in the data. If causal filters were applied (e.g., in hardware during the recordings), these artifactual deflections would have been after rather than before the initial deflection, possibly explaining the polarity reversal at 180 ms. If noncausal filters were applied in software, this would be a larger problem and could produce artifacts at both the beginning and end of the waveform. Moreover, the filters were different for the CSD data and the extracortical voltages, which is somewhat problematic for the information theoretic comparisons of these two data sources (but is likely to reduce rather than inflate the effects).

      In revisiting the description of the recording system and filters, we see how some information was conveyed poorly. The language describing the recording in the original submission suggested that online filters were applied to the data as it was being recorded. This was not the case. We have changed that language so that it reads as the data was being collected at a sampling frequency sufficient to observe data between 0.1 Hz and 12 kHz rather than the data being filtered between 0.1 Hz and 12 kHz. Also, it appears that the description of the processing sequence regarding CSD was ambiguous in the original submission. The CSD underwent the same offline, bandpass filtering procedure (1-100 Hz) as the extracortical signal. We have clarified the Methods accordingly.

      Reviewer #3 (Public Review):

      In this study, Westerberg et al., investigate the cortical origins of the N2pc, an ERP for selective attention. By using a combination of indefinite inverse models of cranial EEG and translaminar electrophysiology, the authors demonstrate that dipoles in V4 are the source of the N2pc.

      The study is well conducted and the manuscript is well written.

      We are pleased that the reviewer recognized the contribution of our efforts.

      I have a few comments about the CSD, RF alignment profiles, and LFP based analyses:

      (A) The method section states correctly that "current sinks following visual stimulation first appear in the granular input layer of the cortex, then ascend and descend to extra granular compartments". However in the example CSDs shown in Fig 2, Fig 3, Fig S3 there is no visible current sink in the infra-granular layers. Instead, the identified infra-granular layers show a prolonged current source (e.g. Fig S5B,C), which is unexpected.

      We have clarified the Methods to reflect the observations of our data and why they may differ from previous reports. We believe the discrepancy is likely due to the stimulus conditions used to evoke the CSD profile. Specifically, the descending infragranular sink in visual cortical columns has most commonly been described when CSD was computed while monkeys view briefly presented flashes or stimuli (e.g., Schroeder et al., 1998, Cereb Cortex). However, our study uses task evoked CSD to perform the alignment. Importantly, this means there is a persistent stimulus in the receptive field. We believe this persistent stimulus, rather than a flashed stimulus, leads to a persistent, strong sink in the superficial layers of cortex which would mask any current sink present in the infragranular layers (Mitzdorf, 1985, Physiol Rev). This is an observation we made in previous reports (Task evoked CSD: Westerberg et al., 2019, J Neurophysiol vs. Flash evoked CSD: Maier et al., 2010, Front Syst Neurosci), albeit in V1 instead of V4. Given the latency offset between putative granular and supragranular sinks, that we observe receptive fields below the putative granular input sink, and the demonstrable multiunit activation as indicated by the newly included Figure S2, we have no reservations in our assessment of the position of the electrode relative to the layers across sessions.

      (B) The example RF profile shown in Fig S5A, although aligned, looks a little strange in that the RFs taper off rapidly in the infra-granular layer. Is this the best representative example? It will be important to see other examples of RF alignment.

      The attenuation observed in the lower layers is largely due to overall decreased gamma power in the lower layers of cortex as compared to upper and middle layers (Maier et al., 2010, Front Syst Neurosci). At the reviewer’s request, we have added an additional panel to the noted supplementary figure which shows additional laminar receptive field profiles using the evoked LFP so that they are more directly comparable to those shown in Nandy et al., 2017, Neuron.

      (C) The study used LFP power in the gamma range to compute the response ratio between red and green stimuli. LFPs measured across the cortical depth are highly correlated, and so would gamma power estimated from the LFPs. Given this, how meaningful is the laminar analysis shown in Fig 4B? How confidently can it be established that the LFP derived gamma power estimates have laminar specificity?

      An astute observation – there are two aspects to consider. The existence of color-feature columns has been well-documented in V4 (e.g., Zeki, 1973, Brain Res; Zeki, 1980, Nature; Tootell et al., 2004, Cereb Cortex; Conway and Tsao, 2009, PNAS; Kotake et al., 2009, J Neurophysiol; Westerberg et al., 2021, PNAS). This manuscript did not need the evaluation of interlaminar differences in color selectivity to address the question at hand – the top of Figure 4B only serves as a step to the bottom of Figure 4B which provides the measurements used for the subsequent analyses. Thus, the estimation of color selectivity from gamma was sufficient to capture a general sense of the color selectivity of the column. Second, we recently published a manuscript which directly addresses the laminar specificity of gamma with respect to feature selectivity. Westerberg et al., 2021, PNAS uses a spatially localized form of gamma to evaluate color-feature selectivity along V4 columns. In that manuscript, we find a high degree of consistency along the layers of cortex using the gamma signal. Notably, we compared the gamma signal to the population spiking and found a high degree of coherence between selectivity in those two measures as a function of cortical depth. Given the secondary nature of the interlaminar feature selectivity to this submitted manuscript and the detailed report of laminar feature selectivity using the same dataset in another manuscript, we are inclined to leave the analysis reported here as is with adjustments to the text that note these considerations now included in the Results.

    1. She thinks the companies themselves are behind this, trying to manipulate their users into having certain opinions and points of view.

      The irony is that this is, itself, somewhat a conspiracy theory.

      Though, I think a nuanced understanding may be closer:

      • The real purpose is not to influence people to believe anything. It's money. It's ad spend and data collection to sell. We need to demonstrate to advertisers that their ads are actually getting seen. The more they get seen, the more money we make. And, the more time is spent on the service, the more data we have to sell... which is as valuable as the add spend.
      • Companies jigger algorithms to maximize time spent on the service.
      • As the Bible is clear, the heart of man is wicked, and the kinds of things that maximize time spent are themselves attitudes of evil, malice, wickedness, and hatred, and the list of things Paul repeatedly tells us to avoid. Go figure.
      • So, people feel the platforms are basically like smoking, and yet, they can't stop.
    1. Author Response:

      Reviewer #1:

      This manuscript by Silver, et al., details work investigating the relationship between season of conception and DNA methylation differences at sites across the genome, measured by widely-used arrays, in two cohorts of children using Fourier regression. They find that season of conception is associated with persistent methylation differences at several hundred CpG sites, and that these CpG are enriched for properties, compared to sets of control sites, that suggest that methylation at these sites is influenced very early in development/during conception and that these sites are positioned in genomic regions relevant for gene activation and regulation. Additional analyses investigated the effects of genetic variation of these sites, and found no evidence for single nucleotide polymorphisms nor child sex confounding the associations between season of conception and DNA methylation. As the number of sites measures by these arrays are a very small amount of total sites across the genome, the authors suggest that these findings indicate there may be many more sensitive methylation 'hotspots' in the genome that are not captured by these arrays but could impact on health/development.

      The key strengths of this manuscript include the use of two cohorts of children at different ages, providing evidence that these effects of season of conception appear to attenuate by 8-9 years of age; and comparison with control sites and additional analyses investigating confounding to build the evidence for these relationships reflecting true, biological associations rather than statistical artefacts or the result of confounding.

      However, the conclusions around the potential functional importance of these methylation differences are limited by a lack of evidence for a relationship between methylation of these season-of-conception-associated sites and child growth/development, so while this manuscript builds compelling evidence for the effects of season of conception on methylation, it's functional relevance is unclear. Additionally, there are some choices made in the analyses where the rationale for those choices should be made more clear, such as the use of CpG sites above or below a certain estimated effect size for different analyses.

      Overall, the approach taken here to demonstrate different levels of evidence for true relationships between early development exposures and differences in DNA methylation is a compelling one, and the manuscript delivers clear evidence for its primary conclusions.

      We are currently researching links between several SoC-CpGs and health-related outcomes including measures of growth, and we have prepared/submitted other papers with different groups of authors (e.g. the EMPHASIS team) relating to other phenotypes. We consider a detailed analysis of links between SoC-CpGs and diverse outcome measures in Gambian children to be beyond the scope of the current study and would argue that such an analysis would dilute the central focus of this paper that is already long and complex. We do already refer to two existing studies linking Gambian SoC or nutrition-associated CpGs to health outcomes in non-Gambians (child & adult obesity/POMC, Kuhnen et al Cell Metab 2016; cancer/VTRNA2-1, Silver et al, Gen Biol 2015) in the current manuscript. The VTRNA2-1 locus does not overlap any SoC-CpGs and we already speculate that this may be due to SoC effect attenuation, since the previous association was observed in younger (3-9mth) infants. We have additionally referenced a recently published paper linking another SoC-associated locus to thyroid volume and function in Gambian children (Candler et al Sci Adv 2021) and highlighted that neither this nor the POMC locus overlap the array background analysed in this study. Finally we had already included an analysis of overlaps between SoC-CpGs and traits in published EWAS and GWAS catalogues.

      Regarding our use of different SoC amplitude thresholds for one analysis, our original motivation for analysing all 768 ‘SoC-associated CpGs’ with FDR<5% in the ENID 2yr analysis, including those with amplitude < 4%, was to explore the degree to which the strength / amplitude of SoC effects could be explained by proximity to ERV1 over the wider range of amplitudes represented by the larger set of loci. However we agree that this approach is open to question and have removed this analysis (previous Fig. 6B and Supp. Fig. 11, and text in section headed ‘Enrichment of transposable elements and transcription factors associated with genomic imprinting’). We have also removed the definition of ‘SoC- associated CpGs’ (which included CpGs with SoC amplitude < 4%) from Table 2 and Methods to aid clarity and avoid confusion.

      Reviewer #2:

      This is a very interesting manuscript, which will be of interest for a broader readership. The authors have analysed an unique cohort, which is of importance to understand the impact of environmental factors on DNA methylation.

      The performed analysis is well balanced, and the conclusions are justified by the presented data. It is a strength of this study, that results from the initial ENID study have been re-evaluated in the EMPHASIS study. Unfortunately, DNA methylation has been analysed using HM450 and EPIC arrays. Both methods are providing only a limited view on methylome-wide DNA methylation.

      Another limitation (as already addressed by the authors) is the lack of longitudinal samples. This would potentially have helped to gain further knowledge about the identified attenuation of DNA methylation levels at SoC associated CpGs.

      Finally, I am not entirely sure, that one confounding factor has been completely ruled out: It is known, that blood composition may cause methylation variability. In general, the authors addressed this point and analysed blood compositions (supplementary Figure 16) of both cohorts. Here, no marked seasonal differences between and within both cohorts have been identified. However, the participants of the EMPHASIS cohort have a very similar age (8-9 years). For this reason, I am wondering if methylation variability/ differences and in addition the attenuation of methylation levels might be influenced by the younger age of ENID participants compared to EMPHASIS study individuals.

      We agree that the necessary restriction of our analysis to data derived from Illumina 450k and EPIC arrays means that we can only obtain a limited view of DNAm loci associated with Gambian season of conception. We expect that there will be many more such hotspots across the human methylome. We have commented on this in the Discussion.

      Regarding the lack of longitudinal data to confirm the potential attenuation of SoC effects with age observed between unrelated cohorts, we are pleased to report that we have now acquired an additional EPIC array dataset covering a subset of n=138 individuals from the ENID cohort included in the main analysis. This subset had methylation measured in blood at age 5-7yrs enabling us to conduct an investigation of longitudinal methylation changes in these individuals. This analysis strongly supports the circumstantial evidence of SoC effect attenuation with age suggested by our previous comparison of the independent ENID (2yr) and EMPHASIS (7-9yr) cohorts, with:

      a) strong correlation of conception date methylation maximum between age 2yr and 5- 7yrs at SoC-CpGs in these 138 individuals (Figs. 3A, 4A); and

      b) evidence of SoC effect size attenuation at the majority of SoC-CpGs (Fig. 3B; Wilcoxon signed rank sum p=10-12).

      We note that this additional longitudinal dataset has a different confounding structure with respect to biological and technical covariates (Supp Tables 15-17) and date of sample collection (Supp. Fig. 1B), lending strong support to our previous two-cohort cross-sectional analysis.

      Regarding the potential for confounding by differences in blood cell composition, we have performed an additional sensitivity analysis with Houseman estimated blood cell counts added directly to the linear regression model for the ENID cohort (see ST1s). 518 out of the 520 estimated Fourier regression coefficients from the main analysis (1 pair of sine and cosine terms for each of the 259 SoC-CpGs) fall within the 95% confidence interval obtained in the Houseman-adjusted analysis, confirming that cell composition effects did not unduly influence SoC effect estimates in the original analysis. We have added a brief note on this and the other sensitivity analyses (batch, cell composition and village effects) in Results to the manuscript, with more details in Methods.

      If the reviewer is referring to the possibility that the SoC effect attenuation with age could be driven by different cell composition effects in the older cohort, we think that the replication of the timing of SoC effects across the 3 datasets analysed (including the additional longitudinal data; Fig. 4A), all of which have different confounding structures with respect to season of sample collection (Fig. 2A; Supp Fig. 1B), together with additional evidence of SoC effect attenuation with age in the longitudinal analysis (Fig. 3B) support this being a genuine age attenuation effect.

      Reviewer #3:

      Silver et al. Investigate the influence of seasonal variation (nutrition, infection, environment) on blood DNA methylation in two cohorts of children (233 [2y] and 289 [8y-9y]) from the same sustenance farming communities in rural Gambia. One cohort (450K,233) was extensively studied before in multiple publications, the second dataset (850k,289) is unpublished. Using cosinor modeling they find 768 CpGs with a significant seasonal pattern(SoC-CpG, FDR<0.05) in the probes that overlap between the 450k and 850k arrays. Look-up of these 768 SoC-CpGs in the second sample showed 61 SoC-CpGs with FDR 0.05 (no mention is made if the direction of effect is consistent, but we assume it is so).

      In fact we did report that the ‘direction’ of the effect (conception date at methylation maximum) is highly consistent with increased DNAm in conceptions at the peak of the rainy season across the two cohorts at the 61 SoC-CpGs with FDR<0.05 – see Fig. 2C.

      The authors notice that most SoCs seem to be attenuated in the 8-9y sample. Then the authors select out of the 768 SoC-CpG the FDR<0.05 and >=4% seasonal amplitude in this discovery sample: 257 which they bring further in (enrichment) analyses. It is unclear if all 257 are (nominally) significant in the replication sample.

      We did not check this because of evidence that, despite strong replication of effect direction (Fig. 4A), the amplitude of the SoC effect attenuates with age (Fig. 2E). This means that it would not be surprising if one or more SoC-CpGs failed to achieve nominal significance in the older cohort. This is now strongly supported by our additional analysis of longitudinal data confirming SoC effect attenuation with age and consistency of SoC effect direction (Figs. 3B and 4A).

      These SoC-CpGs are enriched for imprinted and oocyte germline loci. Roughly 10% of SoC-CpGs overlap with so-called meta-stable epialleles (MEs), on which the authors have published greatly. This is a large fold enrichment, and subsequently the main focus of the Results and Discussion. Indeed, it skews the Discussion heavily and one wonders what could have been found in the other 90%?

      Our strategy throughout the Results and Discussion was to focus on characteristics including metastability, parent of origin-specific methylation, histone modifications and gametic and early embryo methylation patterns that suggest a link to establishment of methylation states in the early embryo at SoC-CpGs. For these analyses all SoC-CpGs were considered at every stage and metastability was not the primary focus. However, as the reviewer suggests, we do repeatedly point out that many of the above contextual characteristics that are associated with SoC-CpGs have also been associated with metastability which we consider to be worthy of note, in part because it suggests that many SoC-CpGs may in fact be MEs, despite not having been previously identified as such. We have further cause to believe this could be the case because of i) the typically small sample size of multi-germ layer/tissue datasets used to screen for MEs, meaning that published screens for human MEs are likely to be underpowered and will hence fail to capture most MEs; and ii) the evidence that we present suggesting that environmentally-driven inter-individual variation at loci exhibiting ME-like properties may diminish with age, again suggesting that ME screens, which largely analyse adult tissues, will miss metastable loci present in infancy and early childhood.

      We had already made the point ii) above in the Discussion. However, given the reviewer’s concerns we have added an additional comment on point i).

      The Discussion is heavily geared to interpretation within their MEs focus and does little to discuss study weaknesses and strengths, to which the tail of the Results suggest there are multiple. For at the end of the Results and in the Methods we find additional sensitivity analyses and discussion points on a very strong enrichment for CpGs with a mean difference in methylation between the sexes (>1/3 of the 257), adjustments for genetic confounding and a high inflation factor in the discovery cohort.

      We have added an additional comment on the need for further functional analysis in cell and/or animal models at the end of our discussion on possible mechanisms underpinning the observed strong enrichment for sex effects at loci associated with periconceptional environment. We have performed an additional analysis of SoC effects on global methylation using predicted LINE1 and Alu element methylation to address the issue of genomic inflation in the discovery cohort (Methods ‘Inflation of test statistics’ and additional Supp. Fig. 14). We have commented on the potential for residual genetic confounding and the limitation of a lack of genetic data in the discovery cohort in the Discussion. We have also provided an additional comment on the potential influence of unmeasured inter-relatedness in our study population.

      Indeed, despite the strong and good flow of the Result section and the impressive (albeit somewhat one-side) look-up of SoC-CpGs in published datasets; the tail and Methods section leaves this reader with a strong suspicion of possible methodological issues on the measurement level already identified prior.

      The authors reports that the discovery cohort is biased in the collection of conception months (figure 2A), has a strong inflation of 1.3 (no QQ-plot is shown to assess bias in addition to inflation), no adjustment for genetic background could be made (which is false, as the 450k array contains several dedicated SNP probes, even hundreds when extracted with the omicsPrint package) and > 1/3 of SoC-CpGs is a sex CpG. For the latter observation the authors regressed out sex and repeated the analysis, noting no difference. However, regressing out sex does not help if sex is heavily correlated with confounding biological/sampling/technical covariates.

      The authors reason that the inflation is nothing to worry about citing single cohort studies on global effects on DNAm of methyl donors. Global DNAm is indeed often association with methyl donor intake but generally these studies investigate ALU or SINES repetitive elements and the PACE consortium reported only modest effects on select 450K array loci for prenatal folate supplementation, showing that their reasoning might hold on the ME loci (in/close to repetitive elements) but not the genome-wide analysis per se.

      The authors should convince the reader that their (discovery) data is valid. The data they do show in Supplemental tables 16 and 17 show that after functional normalization a strong effect of batches remains, while from my own experience these are normally nicely mitigated via functional normalization. Normally only strong cell type correlations remain in the first PCAs of the normalized data. But for ENID we see a remainder of sentrix row, often the strongest batch effect, and slide and plate remaining. Also, the biological, season and cohort specific variables are not noted here. We just must assume that the blank correction for the first 6 PCAs, rather than the actual adjustment for the measured batch/confounding effects, does not remove (or over adjusts) for biological/study design (village, genetic ancestry) effects. In addition to these observations figure 2C seems to indicate that the controls CpGs (elegantly selected by the authors) also show seasonal variation, just not as much as the SoC-CpGs. This leaves the reader to wonder: is there bias in their sample randomization across plates, rows and slides? This feeling is amplified by the fact that almost all SoC-CpGs seem to show an increase in DNAm in jul-aug (Suppl Fig. S5 and Figure 1B). [An observation that is not given enough prominence in the Results]. Which might or might not hint to a correlation with a batch effect (like sentrix row?).

      Our addition of a third longitudinal dataset with a very different confounding structure provides strong reassurance of the robustness of the reported SoC effects. However we recognise many of the concerns raised by the review and have therefore substantially extended our analysis of potential confounders in our analysis, including additional sensitivity analyses (see Supplementary Tables ST1p-1s).

      In our extended analysis of possible confounding of technical and biological covariates by SoC, we note that the majority of batch and biological covariates are categorical so that it was not possible to report correlation rho’s. We have instead reported p-values for corresponding association tests – see Supplementary Tables for further details of tests that were carried out. Also note that for simplicity season of conception is modelled as a binary variable (Dry: Jan-Jun; Rainy: July-Dec). We consider this to be a valid approximation to the main cosinor (Fourier) regression analysis since this showed a clear relationship between DNAm and dichotomised (Dry/Rainy) season of conception (Figs 2D & 4A). Note that we have not included month of collection as this completely confounds season of conception in the main ENID (2yr) analysis and cannot confound the EMPHASIS (7-9yr) analysis, as discussed in the manuscript (Fig. 2A). This is a key reason why we compared SoC effects across these two cohorts. Note that the month of collection also cannot confound the ENID 5-7yr (longitudinal) analysis as all samples are collected in the rainy season (additional Supp. Fig. 1B).

      The covariate correlation analysis confirms:

      • No correlation between SoC and all considered batch and biological covariates including principal components across all three analysed datasets (Supp Table, ST1p- 1r).

      • No correlation between sex and all considered batch and biological covariates; weak correlations with PC4 and PC3 in EMPHASIS and ENID 5-7yr datasets respectively (ST1q,1r); note also that the sex sensitivity analysis previously reported in the manuscript used methylation values that were pre-adjusted for sex using a regression model that included sex as the only adjustment covariate, alleviating concerns that there may be residual confounding due to strong correlations between technical/biological/sampling covariates and sex. We have added some additional comments on this to Results.

      • Expected strong correlations between SoC, month of conception and month of birth in all datasets (ST1p-1r).

      • Functional Normalisation (FN) removed most but not all of the effects of technical batch effects (sample plate, slide etc) from the DNAm array data used in the main ENID analysis (ST1p).

      • Samples are not perfectly randomised across 450k sample plate (month of birth [mob] and conception [moc]) and slide (mob and village) for the ENID 2yr cohort (ST1p).

      The last point raises the possibility of potential residual confounding due to array batch effects in the ENID analysis. We checked for this in two ways. First, we performed sensitivity analyses with batch and village ID variables included directly in the linear regression models, in addition to the PCs that served as proxies for batch variables in our original analysis. This suggested no residual confounding due to array batch or village ID effects (ST1s: ‘batch adjusted model’ and ‘village adjusted model’). Second, we confirmed that neither mob, moc nor village ID were associated with batch or any other covariates in the EMPHASIS or new ENID 5-7yr analyses (ST1q, ST1r). The tight correspondence of date of methylation maximum across all three datasets (cross-cohort and longitudinal analyses) (Figs. 2C, 3A and 4A) with different confounding structures (ST1p-1r) strongly suggests that the reported SoC associations are not driven by residual confounding.

      In summary, this analysis provides strong reassurance that our main analysis is not confounded by residual associations with technical and/or biological covariates considered in this analysis, and that the observed enrichment for previously identified sex-associations amongst SoC-CpGs is not driven by residual confounding due to sex.

      We have made multiple amendments to the manuscript to incorporate the longitudinal analysis; in the Introduction (lines 58-9); in the first section of Results; and we have made particular reference to the alignment of SoC effects across 3 datasets with different confounding structures. We have also amended several figure captions to distinguish the ENID 2yr and 5-7yr datasets and added the longitudinal dataset to Methods and to the study design schematic (revised Fig. 1), and visualised key results from this additional analysis in Figs. 3 and 4A. Finally we have added additional text on the sensitivity analyses in the main text and in Methods.

    1. Author Response:

      Reviewer #2 (Public Review):

      The authors try to identify ATR-mediated phosphorylation sites in male meiosis of mice and performed phosphoproteomics using two distinct mouse models. The paper focuses on important topics in the field. Since ATR has key functions in meiosis, successful identification of ATR-mediated phosphorylation sites would have a profound impact.

      The study has certain technical issues in experimental design and data interpretations.

      The rationale as to why they used Rad1-cKO was not well described. According to the co-submitted manuscript, Rad1-cKO spermatocytes experience meiotic arrest, and the cellular composition is totally different between controls and Rad1-cKO testes. The "RAD1-dependent" phenotype may simply reflect the difference in cellular composition in testis. With this criterion, any phosphorylation sites present after the mid-pachytene stage in normal spermatogenesis can be categorized as "RAD1-dependent".

      We have altered the figure and text in the manuscript to more clearly explain the rationale for using Rad1-cKO and combining the generated data with the data from the rapid 4 hour ATRi treatment. Importantly, we now consider the phosphorylation sites impaired after a quick 4 hour treatment with ATRi (New Supplementary File 1), which is expected to be too quick to induce an appreciable pachytene arrest. Therefore, the final ATR-dependent and RAD1-dependent dataset is unlikely to include phosphorylation sites that are only shown as being depleted due to a persistent mid-pachytene arrest (these sites should appear as RAD1-dependent and ATR-independent).

      There are two different experiments for ATR inhibitor (ATRi)-treated mice (2 pairs after 2.5-3 days of treatment, and 2 pairs 4 hours after a single dose). However, these results are not distinguished in the analysis, and there is no evaluation of testicular morphology after ATRi treatment.

      We addressed the point of separating the data from 4 hour and 2-3 days of treatment. We also have now also addressed testicular morphology after 4 hour ATRi treatment and did not observe any defect (new Figure 5-figure supplement 3A-B).

      Finally, the authors showed ATR-dependent localization of SETX and RANBP3 and discussed interesting data. However, it has not been determined whether these localization changes were due to the functions of identified phosphorylation sites or some other mechanisms.

      We agree with the reviewer that it would be very interesting to address the role of specific phosphorylation sites in SETX and RANBP3. However, we feel this would require significantly additional effort and time, which would not be realistic in the current manuscript, and is beyond the scope of this resource paper.

      Reviewer #3 (Public Review):

      In this study, Sims et al. perform a phosphoproteomic analysis of the ATR signaling pathway in mouse testis. By studying the different phosphorylated peptides found in testis samples from ATR inhibited mice and from mutant mice for the member of the ATR-activating 9-1-1 complex, RAD1, authors defined a comprehensive map of the ATR signaling pathway in the mouse testis. In general, the methodological approach performed is appropriate to accomplish the desired goal and the results obtained are well explained and properly discussed. The conclusions raised by the authors are supported by the results obtained and the manuscript reads easily. Thus, overall the manuscript is of high quality. Furthermore, the information provided in this study is novel since to my knowledge this is the first attempt to characterize the ATR signaling pathway in the testis. In my opinion, these data will be very relevant to better understand the role of the ATR in mouse spermatogenesis, and in meiosis in particular, in the future.

      Thank you, we appreciate the positive remarks.

      Nonetheless, I have a few major concerns about this manuscript. Firstly, I think an important part of the description of the results is placed in a related preprint by the authors (Pereira et al. https://www.biorxiv.org/content/10.1101/2021.04.09.439198v1). In my opinion, this manuscript lacks a more detailed analysis of the ATR signaling on DNA repair and chromosome axis structure, which are fundamental to understand the meiotic prophase. Secondly, the manuscript falls short of providing novel insights about ATR roles during the meiotic prophase. As ATR function on the meiotic prophase has been extensively studied, the ATR phosphoproteome should provide either some clues about possible novel functions ATR may do during the meiotic prophase or spermatogenesis, or provide a mechanistic explanation of how ATR performs its meiotic functions (e.g., meiotic sex chromosome inactivation or meiotic recombination). The final section of the results is an attempt at doing sol, but to me, the data provided only suppose a small incremental advance in our knowledge of how ATR promotes MSCI. I would have liked the authors to expand this section to prove the utility of the data.

      We agree with the reviewer that it would be very interesting to address more details of the roles of ATR in meiosis and the underlying molecular mechanisms. However, we feel this would require significantly additional effort and time, which would not be realistic in the current manuscript, and is beyond the scope of this resource paper. We note that the revised version of the manuscript now reports the exciting finding that ATR is important for the proper localization of CDK2 in meiotic spreads. While the details and mechanisms remain unknown, we believe this finding, together with other reported findings in this resource paper, open new directions to study meiotic ATR signaling.

    1. Author Response:

      Evaluation Summary:

      Are enzymes found in organisms that optimally grow at colder temperatures are more active than the same enzymes found in organisms that optimally grow at warmer temperatures? Here, an assessment of the catalytic constants for approximately 2200 enzymes (obtained from the BRENDA database) showed no correlation between the relative catalytic activity and the optimum growth temperature. Further support for this conclusion was obtained from the measurement of the catalytic constant from a selection of ketosteroid isomerases from organisms that optimally grow between 15 and 46 degrees centigrade. These are interesting results, although the significance with respect to earlier studies has not been clearly explained.

      We have made the relationship between previous work and our work more explicit. Earlier studies have used a limited number of specific cases to compare enzyme rates from different organisms (for example, n = 28, Figure 1C, Figure 1D). In this work, we performed a systematic analysis of 2223 enzyme reactions, reducing confirmation bias, and we have clarified this point. Prior work developed physical models about enzyme catalysis but were based on data that do not appear to be representative.

      Reviewer #2 (Public Review):

      The authors are trying to understand how enzymes evolve to best enable organisms to adjust to changes in the temperature of their environment. The paper reports an analysis of 2223 values of kcat from the BRENDA database, for 815 organisms with known optimal growth temperatures, and for which there are at least two variants per reaction. This analysis fails to show the expected preference for values of [(kcat)cold/(kcat)warm] > 1 observed in earlier studies.

      This is a useful attempt to use one large databases to gain insight into how enzymes evolve to enable organisms to adapt to changes in temperature. They have done a good job in curating the BRENDA database to identify data that meets their criteria for analysis.

      There are deficiencies that should be corrected.

      (1) The first concerns the reported values of [(kcat)cold/kcat)warm]. Figure 1D shows "Rate comparisons of warm-adapted and cold-adapted enzyme variants made at identical temperatures." I think that it is important that these kinetic parameters be reported for catalysis at a common temperature, but it is not clear to me that is the case for the author's analysis. For example, they write beginning on line 234 that "The rate ratio kcold/kwarm per reaction was determined by dividing rate of the enzyme from the organism with the minimum TGrowth by the rate of the enzyme from organism with the maximum TGrowth." My reading of this sentence is that these rate constants kcat [not rates] were determined individually at the organisms optimal growth temperatures, and not at identical temperatures as reported in Figure 1D. This will complicate the author's interpretation of the two sets of results.

      Analysis of kinetic parameters at a common temperature supports the conclusions of this work.

      (2) The author's fail to present a clear physical model to use in analyzing these results.

      For example, they write on line 35 that: "According to the rate compensation model of temperature adaptation, this challenge is met by cold-adapted enzyme variants providing more rate enhancement than the corresponding warm-adapted variants (Figure 1A)"

      I cannot recall hearing the term rate compensation model, but am familiar with discussions on the differences in properties of enzymes isolated from organisms that have adapted to warm and cold environments. The term cold adapted enzymes is not appropriate, because it is the organism not the enzyme, that adapts to the change to a cold environment. This is accomplished through the natural selection of enzymes with kinetic parameters, stability, etc. that optimize the organisms chances of survival in a cold climate. The kinetic parameters for essentially all enzymes will decrease with decreasing temperature. The most highly evolved metabolic enzymes have kinetic parameters kcat/Km close to the diffusion controlled limit, because this optimizes energy production from metabolism. A decrease in temperature will cause the values of kcat and therefore kcat/Km for these enzymes to decrease, to the detriment of the organism. This may be overcome by selection of enzymes with values of kcat/Km close to that observed for the parent [unevolved] organism. The result is that larger kinetic parameters kcat, for catalysis at a common temperature, will be observed for enzymes isolated from the cold-adapted, compared to the unevolved parent organism. This simple application of Darwin's principals of natural selection is strongly supported by the data reported in Figure 1D.

      The reviewer presents a model that presumes that there would be greater selection to optimize energy production. This is also the model supported by the prior data (Figure 1D).

      However, the more extensive data in our work do not support the model that the reviewer notes and that has been widely accepted in the literature –this is the central conclusion of this work and we have attempted to clarify this, as noted above. The strict Darwinian interpretation for our observations is that there is not a strong selection for enzyme rates to be maximized, as described in the Discussion.

      An alternative model, consistent with the data we present, is that there are different selective pressures on enzymes than rate maximization. We note that it is possible that different metabolic strategies may be more advantageous at different life stages or in different communities (see Wortel et al., 2018, now cited in our main text). These models can be tested experimentally –e.g., by examining how variations of a weak-link enzyme fare over time under different growth conditions. There is much more to be learned from linking the properties of enzymes to evolution, and we expect the relationship between fundamental rate constants and selection to be complex, fascinating and important.

      We use the term rate compensation to refer to the phenomenon and not the physical explanation; there is no need for a physical explanation of a phenomenon in the absence of evidence for the phenomenon itself. We have clarified that we have introduced this term in the Introduction: According to what we term the rate compensation model of temperature adaptation, this challenge has been suggested to be met by cold-adapted enzyme variants providing more rate enhancement than the corresponding warm-adapted variants (Figure 1A).

      We use the term “cold-adapted” in agreement with literature usage: from an organism that is cold adapted. We have clarified this language usage: We use the term “cold-adapted variant” to refer to an enzyme from an organism annotated with lower TGrowth values.

      Finally, “cold-adapted” is not synonymous with “having faster enzymes”, which is often how it is used in the literature and how it is implied in the reviewer’s model.

      (3) The paper alludes to, but does not clearly explain extensions of these ideas that are based on one model for how enzymes work. Enzymes often undergo large conformational changes during their catalytic cycle, and so must have sufficient flexibility for these changes to occur with rate constants that support catalysis. This predicts that the enhancement for catalysis observed for enzymes from cold-adapted organisms, might best be achieved through mutations that favor an increase in protein flexibility. There will also be natural selection of enzymes for thermophilic organisms that optimize the organisms chances of survival in a hot climate, where heat denaturation of the protein catalyst is minimized through the selection of stiffer protein catalysts. This analysis predicts a decrease in enzyme flexibility with increasing preferred growth temperature, that might give rise to an increase in protein stability with increasing optimal growth temperature.

      We agree that there are many fascinating aspects of temperature adaptation at the level of individual enzymes, their mechanisms, and their particular rate-limiting steps that remain to be explored. These were not the subject of our study. The goal of our manuscript was to test the previously presented rate compensation model of enzyme cold adaptation.

      (4) The authors should consider the possibility that the pressure to compensate for the cold-induced decrease in kcat for enzymes from cold-adapted organism will be strongest for highly evolved metabolic enzymes with values of kcat/Km close to the diffusion controlled limit. In cases where the enzyme starts out as less than perfect, an organism adapting to the cold might derive smaller, or even negligible advantages, from natural-selection of enzymes with enhanced kinetic parameters. For example, the organism might also minimize the effect of this change in kinetic parameter, by an adjustment or diversion of flux through the networks of metabolic pathways in which the enzyme functions. One possible explanation for the weak correlation observed between kcat and Tgrowth for ketosteroid isomerase is that the organisms studied gain little from optimization of the activity of this enzyme in cold-adapted organisms. One risk in the use of the larger BRENDA database may be the failure to account for differences in the pressure for enzymes to evolve to enable organisms adapt to cold environments.

      We considered these and additional models. For example, interestingly, the opposite of what the reviewer proposed has been suggested in the literature –that the slowest enzymes (“least perfect”) are under the heaviest selection pressure for optimization (see Noda-Garcia et al., 2018). Although our data indicates that temperature exerts a weaker force on enzyme activity than previously proposed, it is indeed possible that subgroups of enzymes do indeed adapt to temperature through changes in activity. Deciphering this and other pressures is an important future challenge. We did not parse the data in this report out of concerns for “p-hacking” or multiple hypothesis testing.

      Reviewer #3 (Public Review):

      Enzyme catalysis underlies all living processes. Understanding the effects of temperature on enzymes is important in understanding how they are adapted to particular environmental conditions, and also relates to the response of organisms and even ecosystems to changes in temperature. The essential question is: what determines optimal growth rates of organisms, and the optimal temperature of other biological processes? Two potentially important factors are enzyme stability and catalytic activity.

      This manuscript collates data from previous investigations and presents new results on KSI variants, aiming to look at the interesting question of what factors are important in relating enzyme activity and stability to optimum growth temperatures of organisms. It presents a useful survey of published data, particularly focusing on the enzyme ketosteroid isomerase (KSI) for which new resluts for a number of variants are presented, building on nice recent work by this group. The main finding in this manuscript is that enzyme optimum temperatures do not correlate well with enzyme activity. This has been found also previously. The manuscript provides quite an extensive analysis and is consistent with previous results and findings. There is useful information in this manuscript, and the compilation of data will be useful to the community, but some crucial aspects and recent relevant work are not covered, and the discussion is limited. The analysis does not identify any relevant determinant of optimum temperature, and the focus on a single temperature in each case may be misleading.

      We do not agree that our analysis is “misleading.” We would characterize the prior analysis based on a small number of examples that were not randomly selected as potentially misleading. In contrast, we tested the prior conclusions with all relevant data that are available. We also highlight the power of collecting more data by further reporting the rate enhancement of 20 enzyme variants in depth. Temperature compensation through activity may still occur in specific settings, as we have noted in the Discussion.

      We agree with Reviewer #3 about the vast potential to use temperature dependencies to relate to evolutionary pressures and adaptations from molecules to organisms. This is a prime area for future investigation.

      Previous analyses have shown that optimum rates of enzymes do not correlate with optimal growth temperatures (e.g. Elias et al (2014) Trends in Biochemical Sciences 39, 299; Peterson (2004) Journal of Biological Chemistry 279, 20717; Thomas & Scopes (1998) Biochemical Journal 330, 1087; Lee et al (2007) FASEB Journal 21, 1934). This is particularly notable for psychrophilic (cold adapted) enzymes, but is also apparent from the fact that enzymes from the same organism often have quite different optimum temperatures. The data collected in the current manuscript are consistent with previous analyses and so are usefully confirming of this. The authors note that optimal growth temperatures may not correlate with activity for a number of reasons, including that the individual enzyme rate may not be under evolutionary pressure. Also, obviously, as noted by the authors, factors other than temperature are also important in enzyme evolution.

      We agree that it is obvious that factors other than temperature are important in evolution, but here we address whether the adaptation to temperature is accompanied by a common response. As noted, more catalysis for organisms at lower temperature was concluded previously and (as noted by Reviewer #2) is expected. However, this conclusion, upon further analysis (carried out herein) appears not to hold. Thus, even when organisms are adapting to temperature, other factors appear to be dominant. This was not previously known. The analyses the reviewer notes refer to thermal parameters derived from the temperature dependence of the rate constant for a given enzyme as a function of temperature, rather than what is addressed herein –the relative rate constant for enzymes from organisms with different growth temperatures.

      There is somewhat better correlation of enzyme stability with optimum growth temperatures, but it is not strong. Therefore, other factors must be important in determining optimum growth temperatures. The authors briefly mention some possibilities. One factor is that a given enzyme may not be a bottleneck in a metabolic pathway. It is not clear that KSI is in fact a metabolic limiter. Also, for many metabolic pathways, it may be essential to consider the kinetics of the pathway as a whole, which may not be determined by a single enzyme. Directly relevant here is the recent proposal of the 'inflection point hypothesis', which provides an explanation of these observations (Prentice et al. Biochemistry (2020) 59, 3562), which the authors do not mention, and may not be aware of. This hypothesis proposes that, rather than alignment of optimum temperatures or stabilities, rather the inflection points of enzymes in a metabolic pathways are aligned at the mean environmental temperature for the organism. This has the effect of coordinating relative enzyme rates and preventing metabolic disruption as temperature fluctuates. Also relevant here is that the response of metabolic pathways in general is not determined solely by a single enzyme. Prentice et al. show that, in general, the temperature-dependent properties of each enzyme in the pathway is important in determining the temperature dependence of the whole pathway.

      We thank Reviewer #3 for bringing this work to our attention and we have included it in the revised manuscript. This paper points out additional complexities regarding metabolic coordination of relative enzyme rates, enhancing points made in the Discussion.

      It is certainly important to understand what molecular features determine the temperature dependence of enzyme activity and its relationship to stability. Some previous proposals are mentioned in the manuscript. One important factor at the molecular level, mentioned by the authors, is work of Åqvist, Brandsval and coworkers, who have convincingly shown that activation entropy and enthalpy differ significantly between psychrophilic enzymes and their mesophilic and thermophilic counterparts. For small soluble enzymes, this is particularly due to changes at the enzyme surface, which may also affect stability. As mentioned by the authors, there have been many proposals over the years that suggest a relationship between stability and activity, though there is not a simple general relationship.

      The cited study is based on molecular dynamics simulations and underlying potentials which can provide models to be tested via experiment. Our analyses relate to this model in that they suggest that rate compensation (to temperature) is not general and so a universal linkage of temperature, flexibility and catalysis is not expected.

      Also directly relevant for the discussion here is what factors limit enzyme activity as temperature increases. The traditional view is that loss of activity is due to protein unfolding at high temperatures (the poor correlation of stability with growth temperatures found here indicates that this cannot be a general explanation). There is increasing evidence that this simple picture is wrong (see e.g. Daniel & Danson. (2010) Trends in Biochemical Sciences 35, 584). This behavior may be accounted for by conformational (e.g. two state) effects as proposed by Danson et al, distinct from the 'flexibility' proposals mentioned in the supporting information here. The introduction of the manuscript here states that "reaction rates are reduced at lower temperatures" , which might naively seem obvious but actually is not universally true, many reactions do not display simple Arrhenius-type behavior (see e.g. Kohen and Truhlar PNAS 2001 98 848). Many enzymes show a temperature of optimum activity, i.e. activity drops above the optimum temperature but before unfolding occurs. As the authors note, Arcus et al. show that this can be accounted for by an activation heat capacity, significantly larger in psychrophiles. Signatures of this behavior are apparent at the large scale (e.g. Schipper et al Global Change Biol. 2014 20 3578; Alster et al (2016) Front. Microbiol. 7:1821) and it appears to be generally important.

      We also are enthralled by the many proposals put forward for the physical and thermodynamic behavior of enzymes and we look forward to rigorous tests of the predictions of these models. Like Reviewer #3, we expect that there are many different features and properties of enzymes to discover!

    1. Biophysics Colab

      Consolidated peer review report (30 November 2021)

      GENERAL ASSESSMENT

      The TMEM16 family of membrane proteins have been shown to function as calcium-activated chloride channels and lipid scramblases. In recent years, X-ray and cryo-EM structures have been solved for TMEM16 proteins in ligand-free and ligand-bound conformations, providing valuable structural information on their functional duality and activation mechanisms. It is largely accepted that the catalytic site (termed subunit groove or cavity) is mostly shielded from the membrane in the ligand-free TMEM16 scramblases. Calcium binding induces a conformational rearrangement of the cavity-lining helices, opening the groove to the surrounding membrane. Since the groove is hydrophilic, it was proposed that it serves as a permeation pathway for lipid headgroups while the hydrophobic lipid tails remain embedded into the hydrophobic membrane core, which has been termed the "credit card" mechanism of lipid scrambling. Additionally, structures of several TMEM16 homologs in lipid nanodiscs revealed that these proteins deform the lipid bilayer in the vicinity of the subunit cavity by bending and thinning the membrane, irrespective of the presence of the activating ligand calcium. Functional experiments also suggested that lipids can be scrambled outside of the open subunit cavity and that local protein-induced membrane deformation is critical for lipid scrambling.

      In the present study, Falzone and colleagues further address the mechanisms of lipid scrambling using single particle cryo-EM and liposome-based functional assays. Firstly, the authors solved the structure of a calcium-bound fungal homolog, afTMEM16, in nanodiscs with a lipid composition where the protein is maximally active. Although similar structures were obtained before, this new structure has the highest resolution thus far, representing > 1 Å improvement! The structure is beautiful and is a major achievement, which enabled the authors to resolve individual lipids and their interaction with the protein around the subunit cavity, whereas in previous structures unresolved non-protein densities were observed passing through the groove. The authors also solved a number of structures with and without calcium in lipid compositions that promote (thinner lipid bilayers) or suppress (thicker lipid bilayers) scrambling. The authors show that afTMEM16 can scramble lipids while the subunit groove remains closed, a phenomenon that is further enhanced in thinner membranes, whereas in thicker membranes scrambling is suppressed even though the groove is open. We particularly appreciated how different software packages and processing strategies were used to rigorously identify structural heterogeneity in their cryo-EM data. Remarkably, mutations of residues lining the subunit cavity and interacting with lipids do not appear to have dramatic effects on scrambling rates, which suggests that lipids do not need to interact with the protein to be scrambled. Thus, the overall conclusion of the study is that membrane thinning by TMEM16 scramblases in their calcium-free conformation is enough to induce lipid scrambling, and that the groove opening induced by calcium binding further enhances membrane deformation, promoting faster scrambling. By contrast, in thicker membranes the protein fails to sufficiently deform the bilayer and scrambling is suppressed, even when the subunit groove is open. The present study provides unprecedented structural information on the interaction of lipids with afTMEM16 and new evidence that lipids can be scrambled outside of the groove.

      The findings and conclusions presented here help to explain why TMEM16 scramblases can transport lipids with headgroups much bigger than the dimensions of the subunit cavity and why structures of some of the other scramblases (opsins, Xkrs and mammalian homolog TMEM16F) lack the obvious hydrophilic groove seen in fungal TMEM16 scramblases. Overall, this is a well-rounded study with an exceptional amount of high-quality cryo-EM data and functional experiments supporting the conclusions.

      RECOMMENDATIONS

      Revisions essential for endorsement:

      1) The resolved lipids binding within the groove in the first structure might be seen by some as supporting the credit card mechanism as it definitively demonstrates that lipids reside within the groove. While the authors provide evidence that lipids can permeate outside the groove in this and earlier work, as far as we can tell, none of that would preclude permeation through the groove if it doesn't require specific interactions between lipids and sidechains in the protein. The presentation might be improved with a somewhat more circumspect and nuanced exposition of the new data and how it can be understood with earlier results. Might the complex composition of native lipid membranes influence where and by what mechanism lipid movement between leaflets is catalysed by TMEM16 proteins?

      2) The quality of lipid densities in cryo-EM structures is greatly affected by the number of particles used and the resolution obtained during refinement and it is therefore not surprising that the beautiful lipid densities observed here in the structure of afTMEM16 in lipid nanodiscs in the presence of calcium refined to 2.3 Å are not all observed in subsequent structures with lower resolution. This is true not only for the P lipids near the groove, but for those D lipids bound near the dimer interface, which is a stable region of the protein that does not change conformation. To be cautious, the authors should avoid resting any conclusions on the absence of lipid densities in the lower resolution structures. For example, on pg 15 the authors seem to be interpreting the absence of density for C22 lipids.

      3) The presentation of structural interactions between lipids and residues near the groove of the protein could be improved in the figures. A panel like Fig. 1J but for the groove would help, but it would be good to see expanded perspectives in the form of a supplementary figure where residues around the headgroup of the lipids are shown along with EM maps so the quality of the structural information for both lipids and side chains can be better appreciated. The preprint does have a lot of images of the lipids and the protein, but not in a way that enables the reader to quickly grasp the nature of interactions between side chains and lipid moieties for themselves, and we feel that close-ups of individual lipids as suggested above would help. It is also not clear what the authors mean by "lipid headgroup". Have the authors only considered interactions of the phospholipid phosphate group with protein residues? It would be helpful if the authors could clarify this in the manuscript and say whether other types of interactions were considered. It would also be nice to include a close-up view of D511A/E514A in 0.5 mM calcium with cryo-EM density to demonstrate the absence of bound calcium ions.

      4) The functional data in Fig 2, 3 and 4 are also not discussed in much detail and it would help if the authors could expand the presentation. Although scrambling in the presence of a very high concentration of calcium is not dramatically altered by any of the mutations, there is quite a lot going on in the absence of calcium and very little is said about these results. For example, differences in the scrambling rates can be observed with some mutants in the presence and absence of calcium in figures 2E and 3E, but statistical analysis would be required to know if the differences between mutants are significant. The differences in scrambling rates with different lipids are also not discussed (e.g. Fig. 4A) It would help if the authors could discuss what is the margin of error in the scrambling assay, and point to some concrete examples from their earlier work on this specific scramblase where mutants have a large impact on scrambling activity in their assay. Have the authors tried intermediate more physiologically relevant concentrations of calcium to see if the mutants have discernible effects under those conditions?

      5) It is quite intriguing that the mutations in the subunit groove of afTMEM16 have little effect on scrambling activity. The authors propose that the groove-lining residues are not directly involved in lipid coordination even though their structure suggests that they do and there is a wealth of functional studies and MD simulations on various other TMEM16 homologs suggesting otherwise. The authors' suggestion that mutations probably affect the equilibrium between open and closed conformations of the groove in other homologs but not in afTMEM16 is logical, however, there are some discrepancies. To name a few examples, if indeed this is the case, nhTMEM16 mutants with closed groove should still have significant basal scrambling, by extrapolation from afTMEM16 data. Yet, some of the nhTMEM16 mutants (E313/E318/R432 mutants) have no activity at all, or no basal scrambling activity (Y439A) (Lee et al, 2018). Would you expect that point mutations within the subunit groove remove the ability of the protein to deform the membrane in its closed conformation? Might the groove have intermediate conformations between closed and fully open where the mutants studied might have more impact in afTMEM16? Further, mutating some of the residues on the scrambling domain of TMEM16 affected externalization of some lipid species, but not internalization etc. (Gyobu et al, 2017), which should not be the case if the interaction of the protein with the lipids is completely unnecessary for lipid scrambling. While investigating this question further would require follow-up structural studies on other TMEM16 homologs and is outside of the scope of this study, we think that the manuscript would benefit from a more extensive discussion on contradicting results and alternative interpretations. The authors might want to consider the possibility that there may be substantial variations in how different scramblases function. afTMEM16 has high constitutive activity in the absence of calcium, while at least TMEM16F does not. Additionally, the extent to which scrambling is promoted by calcium varies, as mammalian scramblases might need other cellular factors to be activated. Also, the extent to which scramblases are seen to distort the membrane is highly variable, as again seen in TMEM16F structures. Might some of these differences imply that key aspects of the mechanism of scrambling (e.g. thinning of the membrane or whether lipids scramble inside or outside the groove) are not the same for all scramblases? This might be one way to organize the discussion to help reconcile some of the seemingly divergent findings in the field.

      6) The authors should correct the Ramachandran outliers in C18/calcium and C22/calcium structures.

      Additional suggestions for the authors to consider:

      1) In several instances the authors conceptualize hypothetical mechanisms to set up experiments and frame their interpretations, which is not always the most straightforward way to communicate findings and what they reveal. The 'conveyor belt mechanism' introduced on page 10 is never fully defined in a way that helps the reader to understand what the functional effects of the mutants teach us. Might it be easier to set up the experiment by asking whether the interactions between sidechains that apparently interact with lipid headgroups in the structure play a critical role in scrambling, present the results and then conclude that they do not appear to? Collectively the functional effects of mutants do appear to suggest that specific side chain interactions are not critical for scrambling, but the conceptualized mechanism here makes the conclusions come across as unnecessarily forced. The credit-card mechanism has been formally introduced and discussed in the field but has already been shot down in earlier work from the group and seems overly simplistic if we already know that scrambling can occur both inside and outside the groove from earlier studies. Just something for the authors to think about.

      2) The uninitiated reader would greatly benefit from more of an introduction to the functional scrambling assay in the results and material and methods section so they can understand the results being presented. In the Material and methods, the authors mentioned: "All conditions were tested side by side with a control preparation", perhaps add here what exactly served as control – wild type protein in C18 lipids? It would be valuable to include information on the reconstitution efficiency between their preparations (WT in different lipid compositions and WT vs mutants). these if possible.

      3) Also, does the C18/calcium cryo-EM structure have sufficient resolution to distinguish between specific phospholipids (PG or PC) at the D1-D9 or P1-P7 positions? It would be particularly valuable if the authors could comment on whether PG or PC are observed in the D and P positions, or which lipids are lining the groove (P3-P6).

      4) While not essential, it would be interesting if the authors could perform the assay on the mutants with a more prominent effect in the absence of calcium (e.g. E310A, Y319A/F322A/K428A) with several additional calcium concentrations.

      5) The authors mentioned that the interaction of C22 lipids with the pathway helices is weaker than those of C18 lipids, which reflects the energy cost associated with distorting the longer lipids (page 15). However, they claimed that the interaction between the lipids and residues is not important for scrambling, which seems contradictory.

      REVIEWING TEAM

      Reviewed by:

      Angela Ballesteros, Research Fellow (K.J. Swartz lab, NINDS, USA): structural biology (X-ray crystallography), membrane protein function, lipid scrambling, cell biology, fluorescence microscopy

      Valeria Kalienkova, Postdoctoral Fellow (C. Paulino lab, University of Groningen, The Netherlands): membrane structural biology (X-ray crystallography and cryo-EM), membrane transport and lipid scrambling

      Kenton J. Swartz, Senior Investigator, NINDS, USA: ion channel structure and mechanisms, chemical biology and biophysics, electrophysiology and fluorescence spectroscopy

      Curated by:

      Kenton J. Swartz, Senior Investigator, NINDS, USA

      (This consolidated report is a result of peer review conducted by Biophysics Colab on version 1 of this preprint. Minor corrections and presentational issues have been omitted for brevity.)

    1. SciScore for 10.1101/2021.12.10.21267582: (What is this?)

      Please note, not all rigor criteria are appropriate for all manuscripts.

      Table 1: Rigor

      <table><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Ethics</td><td style="min-width:100px;border-bottom:1px solid lightgray">Consent: Subjects provided written consent prior to their participation and assignment to either the Echinacea or control group.<br>IRB: It was approved by the local ethical review board (Ethics Committee at Diagnostics and Consultation Center Convex Ltd, Sofia, registration nr: 116/26.10.2020) and registered on clinicaltrials.gov (identifier: NCT05002179).</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Sex as a biological variable</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Randomization</td><td style="min-width:100px;border-bottom:1px solid lightgray">Study Design and Participants: This randomized, parallel, open, no-treatment controlled, exploratory study was carried out in Bulgaria from 30th of November 2020 (first patient first visit) to 29th of May 2021 (last patient last visit) at one study centre (Diagnostics and Consultation Center Convex EOOD, Sofia).</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Blinding</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Power Analysis</td><td style="min-width:100px;border-bottom:1px solid lightgray">Sample size calculation & statistics: This study principally used descriptive biometric approaches to estimate effect sizes.</td></tr></table>

      Table 2: Resources

      No key resources detected.


      Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).


      Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:
      As mentioned, this study has limitations, first it used descriptive statistical methods, was small in size and secondly it did not use placebo for control and was not blinded. Nevertheless, the design was still considered valid to provide essential evidence for the preventive use of Echinacea during the COVID-19 pandemic for the following reasons: a first parameter was defined as incidence of (viral) RTIs, for which sample size calculation found sufficient statistical power of >80% for 120 included subjects. The lack of blinding/placebo might be considered a methodological weakness, but it can be assumed that the placebo effect/knowledge of therapy have only limited effects on detection of viral pathogens in NP/OP samples and blood serum. We therefore think that the study design was suitable to address the research question on antiviral effects of Echinaforce in vivo.

      Results from TrialIdentifier: We found the following clinical trial numbers in your paper:<br><table><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Identifier</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Status</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Title</td></tr><tr><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">NCT05002179</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Completed</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Echinaforce Study to Investigate Explorative Pharmacology an…</td></tr></table>


      Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


      Results from JetFighter: We did not find any issues relating to colormaps.


      Results from rtransparent:
      • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
      • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
      • Thank you for including a protocol registration statement.

      Results from scite Reference Check: We found no unreliable references.


      <footer>

      About SciScore

      SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.

      </footer>

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to reviewers

      Reviewer #1

      I believe that this is a very sound and authoritative study. The analysis of all data seems appropriate and robust, and many connections between the data (and subsets of data) and their possible interpretations have been considered. In fact, in the massive Results section, some interpretations are supported by cited references (this is not meant as a critique). However, I wonder about the length of the Results section, and the balance between it and the relatively short Discussion section. It is difficult for me to nail down any part of Results that might be shortened, as I could not find clear redundancies. I also think that the level of speculation is absolutely warranted, and I did not find excessive claims being made to this or that end. Rather, I suggest to broaden the perspective somewhat (in their Discussion; see below under Significance), which might allow people with a less mechanistic perspective to grasp the potential relevance of this work for non-model plant systems studied mostly by evolutionary geneticists.

      Response: We thank the reviewer for their kind remarks. We have spent a very large amount of time trying to streamline the results section and we are not sure if it would be possible to shorten it any further without removing critical details.

      We appreciate the reviewer’s comment to add more detail to the discussion to make it more appealing to evolutionary geneticists and we have added the following lines to the discussion section: “The WISO or “weak inbreeder/strong outcrosser” model (Brandvain & Haig, 2005) emerges from the dynamics of parental conflict and parent-of-origin effects. Under this model, a parent from populations with higher levels of outcrossing is exposed to higher levels of conflict and can thus dominate the programming of maternal resource allocation in a cross with an individual from a population with lower levels of outcrossing. Such a phenomenon has been observed in numerous clades including Dalechampia, Arabidopsis, Capsella and Leavenworthia (Brandvain & Haig, 2018; İltaş et al., 2021; Lafon-Placette et al., 2018; Raunsgard et al., 2018). Intriguingly, loss of function phenotypes in the RdDM pathway are more severe in recently outcrossing species than in A.thaliana (Grover et al., 2018; Wang et al., 2020) and suggests that RNA Pol IV functions are more elaborate and important in these species. This raises the possibility that the role for RNA Pol IV and RdDM in parental conflict that we describe in A.thaliana here is likely heightened in and mediates the elevated level of parental conflict in species that are currently or have been recently outcrossing.”

      One aspect that might warrant more scrutiny is the mapping of sRNA reads to the reference genome. I found the short section of this (M&M section, page 20, lines 23-25) to be too brief. It is not clear to me which of ShortStack's v3 weighting scheme the authors used, which is relevant for multi-mapping reads (see NR Johnson et al. 2016, G3). In addition, it is not mentioned whether zero mismatches were allowed. Perhaps this is described in more detail in Erdmann et al. (2017), but even if so, it deserves to be clarified here.

      Response: Small RNA reads were aligned after allowing two mismatches. This was indicated in the bowtie command (‘bowtie -v 2’ where v 2 indicates two mis-matches). We have added text to expand on the meaning of the commands.

      We have also expanded the commands used for ShortStack. We used the “Placement guided by uniquely mapping reads (-u)” option to divide the multi-mapping reads.

      The manuscript is well-written and concise, despite the length of the Results section. The verbal clarity and absence of typos or grammatical issues is superb. I did find some of the Figures to be somewhat "un-intuitive", in the sense that it takes acute concentration for an outsider (of sorts) to gather and interpret the underlying data. This is probably due to the many cross-comparisons of differences between two genotypes on one axis and those of a different pair of genotypes on the other axis. I am not sure how this issue can be ameliorated (nor whether this is really necessary); however, from a technical point of view, all Figures and Suppl. Figures are really well-done.

      Response: We thank the reviewer for their kind remarks. We have strived to make the figures easier to understand but we are aware that the figures do require a lot of concentration. We haven’t found an easy way to fix this. We thank the reviewer for patiently going through the figures.

      The list of references seems adequate in terms of citing relevant (both older and very recent) publications. However, almost all cited papers concern Arabidopsis or other model species; I suggest to consider adding a few relevant studies on non-Brassicaceae (whether considered model taxa or not), in conjunction with my suggestion (in Significance) to potentially broaden the scope by searching for natural phenomena that also involve parent-of-origin effects on endosperm/seed development. Curiously, many of the references are "incomplete" in the sense of stopping with the journal's name, then stating the doi, i.e. they lack volume numbers and page/article numbers. This should be harmonized throughout.

      Response: We have added references to non-Brassicaceae species and have also fixed the references.

      Reviewer #2: This manuscript provides evidence that a loss of either the maternal or paternal copy of NRPD1 have different, and sometime opposite, effects on the accumulation of small RNAs and on expression of a subset of genes, with a loss of the maternal copy having more substantial effects. The manuscript is well written, and the conclusions, as far as they go, are justified by the data, which are effectively presented. The overall effect is subtle but informative and according to the authors support a parental conflict model for imprinting. The experiments failed to find a smoking gun in the form of a mechanism to explain how or why the maternal and paternal alleles have different effects and the explanation for a lack of clear phenotypic differences was reasonable, but untested. I would have like to see it tested by looking in a plant species that is outcrossing and highly polymorphic. However, I do appreciate that the observation that the male and female alleles can have distinct effect when mutant is an important clue. My specific comments below may reflect confusion on my part, rather than real issues. If that is that I hope that confusion can aid in clarifying what are in places quite subtle points.

      Response: We thank the reviewer for their comments. We agree that it would be potentially informative to do similar experiments in an outcrossing species but that this is beyond the scope of this manuscript. Additionally, loss of NRPD1 or other components of the RdDM pathway has dramatic effects on gametogenesis in some examined outcrossing species(Grover et al., 2018; Wang et al., 2020), which could prevent the detection of subtle parent-of-origin effects on seed development.

      Page 6, last paragraph: "Because the endosperm is triploid, in these comparisons there are 3 (wild-type), 2 (pat nrpd1+/-), 1 (mat nrpd1+/-) and 0 (nrpd1-/-) functional NRPD1 alleles in the endosperm. However, NRPD1 is a paternally expressed imprinted gene in wild-type Ler x Col endosperm and the single paternal allele contributes 62% of the NRPD1 transcript whereas 38% comes from the two maternal alleles (Pignatta et al., 2014). Consistent with paternal allele bias in NPRD1 expression, mRNA-Seq data shows that NRPD1 is expressed at 42% of wild- type levels in pat nrpd1+/- and at 91% of wild-type levels in mat nrpd1+/- (Supplementary Table 6)". I would think this would really complicate the analysis. Should all of the dosage values include NPRD1 imprinting values? That is to say, expressed in terms of expression values? This is also a bit confusing. The maternal copies together express 38% of the transcript, so why isn't the mat nrpd1 at 68%, rather than 91%? In any event, given this imprinting and differences in dosage of the male and female it appears that two variables, parental origin and expression levels are being compared. Since 91% is awfully close to 100%, are the mat pat comparisons really just comparing low with nearly normal expression of NRPD1? And actually, given that, the outsized effect of the mat nrpd1 +/- is even more striking.

      Response: We included the details of dosage rather than imprinting values because the potential for buffering of expression upon loss of one allele could not be discounted. Indeed, we do find that the endosperm transcriptome buffers against the loss of the maternal or paternal alleles (Supplementary Table 6). The reviewer is correct in pointing out that the outsized effect of mat nrpd1+/- on gene expression is even more striking, and strongly supports our view that these effects are parental rather than endospermic.

      To reduce confusion in this section, we removed the details about 38% maternal allele transcripts obtained from our previous study, and instead report only the observed values from this study (which are also consistent with the previously reported paternally-biased expression of NRPD1 in endosperm).

      Page 4, Line 16. I'm afraid it's still a bit difficult to understand what was being compared what in this section. Please clarify.

      Response: The authors in this previously published study compared sRNAs obtained from wild-type whole seeds (which consists of three different tissues, including endosperm) with mutant endosperm. We are pointing out that the difference in tissue composition makes the effect of nrpd1 mutation hard to disentangle from the tissue differences between the two genotypes.

      Page 5, Line 5. I'm sure this is fine, but it's not entirely clear what is from the previously published paper and what is reanalysis here. All the crosses and measurements were made then, but not organized in this way?

      Response: This data was indeed previously published. In that analysis, we had pooled results from different crosses and calculated significance between genotypes using chi-square tests. During a later study (Satyaki and Gehring, 2019), we realized that we were losing information by ignoring the seed abortion values per cross. So, a reanalysis of that data on a cross by cross basis allowed us to find strong evidence for maternal and paternal effects.

      Page 6, Line 26. This is an excellent dosage series, but it's complicated by imprinting. So it's not 3, 2, 1, 0 effective copies. If we set the paternal copy at ~1 and each maternal at ~0.1, then it's 1.2 (wild type), 0.20 (pat nrpd1+/-), 1 (mat nrpd1+/-), and 0 (nrpd1-/-).

      Response: At the genomic DNA level, its 3, 2,1 and 0 doses. The reviewer’s comment on the transcriptional dose is not clear to us. Based on measured gene expression levels, relative wild-type NRPD1 transcriptional dose =1, pat nrpd1+/- is 0.42, and mat nrpd1+/- is 0.91.

      Page 6, line 31. Is the main thing we are comparing the difference between expression at 42% verses 91% of wild type?

      Response: We are using the small RNA-seq data alongside the mRNA-seq data to argue that loss of mat and pat nrpd1+/- have no impact on overall Pol IV activity in endosperm (as measured by small RNA production). A nrpd1 heterozygous endosperm has almost the same small RNA profile as a wild-type endosperm. Thus any effects seen in the endosperm, including the effects on mRNA expression described later in the manuscript, are likely parental rather than zygotic endospermic effects.

      Page 7, line 11. So, the overall effect in either direction on smRNA gene targets was really quite small, and I'm guessing the effect on gene expression was even smaller.

      Response: The effects of loss of maternal or paternal Pol IV on sRNAs was indeed small (Fig. 1/Fig. S3). Effect of loss of maternal Pol IV on gene expression was substantially large and distinct from the relatively small impacts observed upon loss of paternal Pol IV (Fig. 3) This observation supports the view that Pol IV mediates parent-of-origin effects on gene expression.

      Page 7, line 17. I take it that it is this difference, rather than the overall numbers that is of interest.

      Response: Correct. The lack of a relationship between sRNAs impacted upon loss of mat and pat nrpd1 is additionally suggestive of parent-of-origin effects

      Page 9, line 2. Really interesting, since one might expect that these are methylated loci that would be expected to be fed into any existing embryo maintenance methylation pathway. Surprising that they are maintained independently.

      Response: It is indeed surprising that Pol IV activity in parents can have different impacts on sRNAs in the endosperm. It should be noted though, that as described in Erdmann et al 2017 and in this paper later on, many endosperm sRNA loci are in fact not associated with endosperm DNA methylation. In addition, sRNA loci that are dependent on paternal Pol IV activity are more likely to be associated with DNA methylation than are sRNA loci associated with maternal Pol IV activity. These points have been described in Figure S8.

      Page 9, line 22. Proportion of total imprinted genes? Did the mutant obviate/enhance the imprinting?

      Response: We have modified the manuscript to describe effects on imprinted genes: “ The expression of imprinted genes is known to be regulated epigenetically in endosperm. In mat nrpd1+/- imprinted genes were more likely to be mis-regulated than expected by chance (hypergeometric test p-15) – 15 out of 43 paternally expressed and 45 out of 128 maternally expressed imprinted genes were mis-regulated in mat nrpd1+/- while two maternally expressed imprinted genes but no paternally expressed imprinted genes were mis-regulated in pat nrpd1+/- (Table S6).” We have also added a new supplementary figure (Fig. S6) that describes the impacts of NRPD1 loss of imprinted gene expression.

      Page 9, line 27. How could 2) occur in the homozygous mutant?

      Response: Loss of NRPD1 may impact gene expression in both parents. When the nrpd1-/- mutant endosperm is investigated, we are also examining the consequences of the inheritance of these disrupted gene expression states. We refer to this as epistatic interactions of mat and pat nrpd1.

      Page 10, line 9. Interesting!

      Response: We strongly agree!

      Page 10, line 11. Is this 2.7 versus 2.18 significant because it's statistically significant, or because it's conceptually significant?

      Response: We are pointing out that the 2.7-fold value is quite similar to the predicted value of 2.18-fold, which is arrived at by simply summing the effects of mat nrpd1 and pat nrpd1. This is a conceptually significant point.

      Are the examples in 3D representative, or the most convincing examples? And a big difference in ROS1 is of some concern, since that may well be expected to affect imprinting indirectly. I know I'm being picky here, but the pattern is so intriguing I'd be worried about confirmation bias.

      Response: The examples in 3D are representative for those genes with significant changes in expression in both mat and pat nrpd1, and other genes also behave similarly. The antagonistic effect described for 3D can also be observed as a much broader trend affecting hundreds of genes to varying extents in Fig 3C and 3E-H. The concern about ROS1 is not clear to us but we agree that an effect of ROS1 may be one way that NRPD1 controls gene expression.

      Page 10, line 18. Ok, but 0.123 is a pretty subtle negative correlation. Although I do appreciate that it clearly is not a positive correlation. If I'm understanding correctly, this was the "aha" moment, because it's exactly what one might expect of NRPD1 from the mother and father or working at cross purposes. But the numbers are getting awfully small here.

      Response: It is unclear how to calibrate our expectations of effect sizes considering that our study is the first (to our knowledge) to make such a measurement involving gene expression in parental conflict. A review of the few empirical examples of parental conflict’s impact on seeds shows that parental conflict may drive small changes in seed size (Brandvain and Haig, 2018).

      The evolution of quantitative traits maybe driven by selection for large effects at a small number of loci and/or by selection of small effects at a large number of loci. In a similar vein, parental conflict can impact seed phenotypes either via large effects at a few loci or via small effects at a large number of loci. Our analysis described in Fig 3D-H can fit either possibility. Large effects can be found at a few loci such as SUC2 and PICC (Fig. 3D). Smaller antagonistic effects can be found at hundreds of loci as shown in Figure 5A. The negative correlation described in this figure can be observed even upon dropping the genes that show a statistically significant differential expression in both mat and pat nrpd1+/- (slope after dropping genes significantly mis-regulated in both mat and pat nrpd1+/- is -0.126). In summary, a correlation of -0.123 strongly supports the existence of a widespread antagonistic regulatory effect.

      Page 10, line 29. The point simply being that that other phenomenon is also significant even if the differences are that large?

      Response: We are pointing out that the magnitude of the effects we see are similar to that observed for phenomenon such as dosage compensation.

      Page 12. So, there is no effect on cleavage and no obvious effect on flanking siRNA clusters. The suspense is building...

      Page 12, line 24. And not in potential regulatory regions? CNSs?

      Response: We did not identify a significant enrichment for differentially methylated regions in regulatory regions. We used the relative distance function in bedtools (https://bedtools.readthedocs.io/en/latest/content/tools/reldist.html) to calculate the relationship between the genomic location of DMRs and genomic location of a differentially expressed gene. This analysis was chosen as it does not make a priori assumptions about the size of the regulatory region of a gene. A broad association between DMRs and differentially expressed genes would be indicated by a frequency far greater than 0.02. We show the results of this analysis in Fig. S8F; we find no evidence for significant enrichment of DMRs in the regulatory regions of differentially expressed genes.

      Page 12, line 28. I guess it depend on whether or not the changes are in regulatory sequences no immediately apparent as part of the gene, doesn't it?

      Response: We examined DNA methylation over genes here because in endosperm, unlike in other tissues, many small RNAs are genic. Moreover, DNA methylation within the gene may control transcript abundance (Eimer et al., 2018; Klosinska et al., 2016). We have also examined regulatory regions adjacent to genes in Fig S8F and found no effect.

      Line 13, line 2. Not sure it's that important, but couldn't you chop all of these genes in half and see if they are no longer significant collectively?

      Response: We do not think that this will provide a useful insight.

      Page 14, line 15. I'm afraid I'm getting confused here with the terms cis and trans here. Just to be clear, cis means a direct effect of small RNAs that are dependent on NRPD1 on a gene and trans means anything else? But in this context, it's not clear that is what is meant. Do you mean that gene expression is determined and preset in the gametophyte? What are the levels of expression of NRPD1 in the two gametophytes?

      Response: The reviewer’s interpretation of cis and trans is correct. However, the cis imprints may be preset in gametophytes or in the sporophytic tissues that surround or give rise to the gametophyte. Pol IV is known to be active either in gametophyte or in related sporophytic tissues in both the mother and the father(Kirkbride et al., 2019; Long et al., 2021; Olmedo-Monfil et al., 2010).

      Page 14, line 19. Prior to fertilization?

      Response: Yes, that is the idea. As described in the manuscript, Pol IV activity in either the parental sporophyte or gametophyte prior to fertilization could impact gene expression in the endosperm after fertilization.

      Page 14, line 27. Do you mean driven by, or just associated with?

      Response: In response to the comment, we have replaced the phrase “driven by” with “due to” for increased clarity. In wild-type, DOG1 is predominantly expressed from the paternal allele. In mat nrpd1+/-, the paternal allele is somewhat upregulated but the maternal allele, which is almost silent in wild-type, is highly expressed in mat nrpd1+/-.

      Page 15, line 26. And this is really the issue. The primary conclusion, backed up by the lack (I'm assuming) of phenotypic differences between mat NRPD1 -/+ and pat NRPD1 +/- suggests that the observed differences in expression are not particularly important, unless the exceptional cases are informative.

      Response: We are not sure whether the reviewer means “issue” in a negative, neutral, or positive light. Seed phenotypes are often subtle and we have not examined phenotypic differences in sufficient detail to comment.

      Page 15, line 12. Yes, but I'm not at all clear what the mechanism for this is.

      Response: We have tested and falsified multiple hypotheses to explain how Pol IV can regulate gene expression in endosperm. Considering the complex genetics and the difficulty of isolating endosperm, we have concluded that this is a matter for a future study. The point of this study is the discovery of Pol IV’s parental effects.

      Page 15, line 23. Since this is a very small subset of genes, are these genes that you might expect to play a role in parental conflict?

      Response: The functions of most genes in endosperm remain unknown. However, some have a likely role in conflict. SUC2 is antagonistically regulated by parental Pol IV (Fig. 3D). SUC2 transports sucrose, the key form of carbon imported into seeds from the mother (Sauer & Stolz, 1994).

      Page 15, line 33. Indeed, these could be the informative exceptions.

      Response: We believe the reviewer means that the identify of strongly antagonistically regulated genes may be informative in terms of thinking about these results in the context of parental genetic conflict, which we agree with.

      Page 15, line 29. Hardly surprising, given that the paternal copy of NRPD1 is expressed at a higher level than the maternal copies, is it?

      Response: It is actually somewhat surprising since we show in Fig. 2 that the sRNA production in mat and pat nrpd1+/- are comparable to that of wild-type. The higher contribution of NRPD1 from the paternal copy does not really explain the methylation differences

      Page 16, line 1. So this is what you mean by in cis. Presetting?

      Response: The reviewer’s previous interpretation of cis (acting directly at a target gene) is correct. However, the cis imprints may be preset in gametophyte or in the sporophytic tissues that surround or give rise to the gametophyte. Pol IV is known to be active in gametophytes and in related sporophytic tissues in both the mother and the father.

      These are intriguing results that would benefit from a test of the hypothesis by comparing these result with those obtained in an outcrossing plant species.

      Response: We agree that it would interesting and informative to perform similar experiments in an outcrossing species. However, loss of NRPD1 or other components of the the RdDM pathway have dramatic effects on gametogenesis in outcrossing species (Grover et al., 2018; Wang et al., 2020), preventing the detection of subtle parent-of-origin effects on seed development. Additionally, this would be a separate study.

      Reviewer #3

      We thank the reviewer for their comments.

      • Expression of NRPD1 was 42% of WT in paternal nrpd1 and 91% of WT in maternal nrpd1, yet throughout the paper the effect of maternal nrpd1 was far stronger than paternal nrpd1. The authors may also want to confirm that protein levels follow the same pattern, in case protein degradation or post-transcriptional regulation may play a role.

      Response: We show in Fig. 2 that sRNA production in mat and pat nrpd1+/- are similar to wild-type endosperm. This strongly suggests that NRPD1 protein is produced at functionally equivalent levels in wild-type, mat and pat nrpd1+/-. The finding that mat nrpd1+/- has a stronger effect on gene expression and small RNAs, despite having higher levels of NRPD1 transcript in endosperm, is consistent with our conclusion that the effects we are observing in heterozygous endosperm are due to NRPD1 action before fertilization.

      P. 9 line 1 - this only seems to be true for maternal ISRs, not paternal ISRs; this claim should be narrowed.

      Response: Accordingly, we have modified the text here to : “In summary, these results indicate that most maternally and some paternally imprinted sRNA loci in endosperm are dependent on Pol IV activity in the parents and are not established de novo post-fertilization.”

      A small number of sRNA loci become highly depleted in maternal nrpd1 but not paternal nrpd1 (Fig. 1D, F, Fig. 2C) - are these siren loci?

      Response: This is an interesting question. Siren loci have not been defined in Arabidopsis but are described as loci with high levels of sRNAs in ovules, seed coat, endosperm and embryo (Grover et al., 2020). Loci losing sRNAs in maternal nrpd1+/- include a large number of maternally expressed imprinted sRNAs (mat ISRs). We do not know if mat ISR loci are expressed in the ovule. In Erdmann et al (2017), we excluded loci that were also expressed in the seed coat from mat ISRs. Thus, these loci meet only some of the conditions for being defined as siren loci.

      Fig. 2 suggests that many of the downregulated sRNA regions in maternal nrpd1 are maternally biased to begin with. Related, are genic sRNAs more likely to be affected by maternal or paternal nrpd1 than non-genic or TE sRNAs?

      Response: As described in Fig. 1B and S3, loss of maternal NRPD1 has more impacts on the sRNA landscape. As a percentage of total loci, genes are more likely to be affected than TEs.

      For the sRNA loci shown in Fig. 2C, how is % maternal affected in maternal vs. paternal nrpd1? These ISRs are normally maternal or paternal biased, does this change in maternal or paternal nrpd1?

      Response: We assess the allelic bias of ISRs only when they have at least ten reads in the genotypes being compared. In mat nrpd1+/-, most mat ISRs lose almost all their reads (Fig. 2) and we can assess allelic bias only at 107/366 mat ISRs. As seen in the Rev. comment. Fig1, these 107 lose their maternal bias. In pat nrpd1+/-, loci with maternally biased sRNAs show somewhat increased expression (Fig 2E) but do not show an appreciable change in maternal bias (Figure Review 1). All paternal ISRs do not show any dramatic impacts on allelic bias in mat or pat nrpd1+/-. We have not added this additional datapoint to our paper because we were worried that the paper was becoming too dense – a concern also voiced by reviewer 1. However, we can add this to the manuscript if the reviewer prefers.

      • Might have missed this, but I didn't see the gene ontology results (p9 line 16) shown anywhere? Would like to see significance values, fold enrichments, etc. In particular, the group of paternal nrpd1 up-regulated genes seems too small to have much confidence for GO enrichment analysis.

      Response: We have added a Supplementary Table 7 with outputs of GO analyses.

      • I would suggest expanding the analysis in Fig. 3D-H to explore whether the additive model is more predictive of nrpd1-/- expression levels than other potential models (epistatic, etc.) in general at all genes, or only at the subsets of genes shown, independently of whether the effects are large enough to pass the arbitrary significance cutoffs used in E-H. Identifying specifically which genes do and don't follow this additive pattern could help dissect mechanism. For example, genes following this pattern might share a TF binding site for a TF that is regulated by Pol IV.

      Response: While we are interested, we currently cannot explore other models such as epistasis as this would require knock-down of NRPD1 in the endosperm and we plan to do this as part of a future study.

      1. 13 line 26 - how do changes in CG methylation in maternal or paternal nrpd1 compare to changes in dme or ros1? Do either set of DMRs significantly overlap dme or ros1 DMRs? Could some of these be explained by changes in ROS1 expression, since ROS1 is a Pol IV target?

      Response: Yes. It’s entirely possible that a subset of observed gene expression changes are linked to changes in ROS1 expression. However, there are no comparable methylation data for ROS1 in the endosperm. A potential role for ROS1 has been discussed on Page 11, line 4. Comparison with DMRs in the dme endosperm is difficult. dme mutant endosperm has low non-CG methylation (Ibarra et al., 2012). We have unpublished data showing that the expression of genes involved in RNA-directed DNA methylation (RdDM) is reduced in the dme endosperm. It is therefore difficult to understand if and how DME-mediated demethylation may impact RdDM.

      P. 10 line 3 - is the overlap of 36 out of 51 genes unlikely to occur by chance

      Response: A hypergeometric test indicates that this is indeed significant. We have added it to text on Page 9, line 34.

      In sRNA and mRNA-seq libraries, what was the overall maternal/paternal ratio in each library? Did loss of Pol IV affect this?

      The graphs above show the maternally derived fraction of mRNA and sRNA libraries for different genotypes. Please note that the Ler nrpd1 mutant was generated by backcrossing Col-0 nrpd1+/- into Ler. Some Col-0 regions remain in this background and are called “hold-outs”. Reads mapping to these hold-outs have been excluded while calculating the maternal fraction of each library described in the graph above. We cannot confidently judge if the overall maternal fraction of the mRNA transcriptome is affected by loss of NRPD1 as we likely need more replicates. However, we find that loss of all NRPD1-dependent sRNAs (as in the nrpd1 null mutant) leaves behind sRNAs that roughly reflect the genomic 2:1 ratio.

      P. 9 line 22 - how many paternally and maternally expressed imprinted genes were considered? Were imprinted genes statistically more likely to be misregulated in mat nrpd1?

      Response: We considered 128 maternally and 43 paternally expressed genes that had been previously been identified as imprinted in Col x Ler crosses (Pignatta et al 2014). We have modified the manuscript to describe effects on imprinted genes: “ The expression of imprinted genes is known to be regulated epigenetically in endosperm. In mat nrpd1+/- imprinted genes were more likely to be mis-regulated than expected by chance (hypergeometric test p-15) – 15 out of 43 paternally expressed and 45 out of 128 maternally expressed imprinted genes were mis-regulated in mat nrpd1+/- while two maternally expressed imprinted genes but no paternally expressed imprinted genes were mis-regulated in pat nrpd1+/- (Table S6). “ We have also added a supplementary figure (Figure S6) that focuses on genic mRNA imprinting in NRPD1 heterozygotes and homozygous mutants.

      References cited in the response

      Brandvain, Y., & Haig, D. (2005). Divergent Mating Systems and Parental Conflict as a Barrier to Hybridization in Flowering Plants. The American Naturalist, 166(3), 330–338. https://doi.org/10.1086/432036

      Brandvain, Y., & Haig, D. (2018). Outbreeders pull harder in a parental tug-of-war. Proceedings of the National Academy of Sciences, 115(45), 11354–11356. https://doi.org/10.1073/pnas.1816187115

      Eimer, H., Sureshkumar, S., Singh Yadav, A., Kraupner-Taylor, C., Bandaranayake, C., Seleznev, A., Thomason, T., Fletcher, S. J., Gordon, S. F., Carroll, B. J., & Balasubramanian, S. (2018). RNA-Dependent Epigenetic Silencing Directs Transcriptional Downregulation Caused by Intronic Repeat Expansions. Cell. https://doi.org/10.1016/j.cell.2018.06.044

      Grover, J. W., Burgess, D., Kendall, T., Baten, A., Pokhrel, S., King, G. J., Meyers, B. C., Freeling, M., & Mosher, R. A. (2020). Abundant expression of maternal siRNAs is a conserved feature of seed development. Proceedings of the National Academy of Sciences of the United States of America, 117(26), 15305–15315. https://doi.org/10.1073/pnas.2001332117

      Grover, J. W., Kendall, T., Baten, A., Burgess, D., Freeling, M., King, G. J., & Mosher, R. A. (2018). Maternal components of RNA ‐directed DNA methylation are required for seed development in Brassica rapa. The Plant Journal, 94(4), 575–582. https://doi.org/10.1111/tpj.13910

      Ibarra, C. A., Feng, X., Schoft, V. K., Hsieh, T.-F., Uzawa, R., Rodrigues, J. A., Zemach, A., Chumak, N., Machlicova, A., Nishimura, T., Rojas, D., Fischer, R. L., Tamaru, H., & Zilberman, D. (2012). Active DNA Demethylation in Plant Companion Cells Reinforces Transposon Methylation in Gametes. Science, 337(6100), 1360–1364. https://doi.org/10.1126/science.1224839

      İltaş, Ö., Svitok, M., Cornille, A., Schmickl, R., & Lafon Placette, C. (2021). Early evolution of reproductive isolation: A case of weak inbreeder/strong outbreeder leads to an intraspecific hybridization barrier in Arabidopsis lyrata. Evolution, 75(6), 1466–1476. https://doi.org/10.1111/evo.14240

      Kirkbride, R. C., Lu, J., Zhang, C., Mosher, R. A., Baulcombe, D. C., & Chen, Z. J. (2019). Maternal small RNAs mediate spatial-temporal regulation of gene expression, imprinting, and seed development in Arabidopsis. Proceedings of the National Academy of Sciences, 116(7), 2761–2766. https://doi.org/10.1073/pnas.1807621116

      Klosinska, M., Picard, C. L., & Gehring, M. (2016). Conserved imprinting associated with unique epigenetic signatures in the Arabidopsis genus. Nature Plants, 2, 16145. https://doi.org/10.1038/nplants.2016.145

      Lafon-Placette, C., Hatorangan, M. R., Steige, K. A., Cornille, A., Lascoux, M., Slotte, T., & Köhler, C. (2018). Paternally expressed imprinted genes associate with hybridization barriers in Capsella. Nature Plants, 4(6), 352–357. https://doi.org/10.1038/s41477-018-0161-6

      Long, J., Walker, J., She, W., Aldridge, B., Gao, H., Deans, S., Vickers, M., & Feng, X. (2021). Nurse cell­–derived small RNAs define paternal epigenetic inheritance in Arabidopsis. Science, 373(6550). https://doi.org/10.1126/science.abh0556

      Olmedo-Monfil, V., Durán-Figueroa, N., Arteaga-Vázquez, M., Demesa-Arévalo, E., Autran, D., Grimanelli, D., Slotkin, R. K., Martienssen, R. A., & Vielle-Calzada, J.-P. (2010). Control of female gamete formation by a small RNA pathway in Arabidopsis. Nature, 464(7288), 628–632. https://doi.org/10.1038/nature08828

      Raunsgard, A., Opedal, Ø. H., Ekrem, R. K., Wright, J., Bolstad, G. H., Armbruster, W. S., & Pélabon, C. (2018). Intersexual conflict over seed size is stronger in more outcrossed populations of a mixed-mating plant. Proceedings of the National Academy of Sciences, 115(45), 11561–11566. https://doi.org/10.1073/pnas.1810979115

      Sauer, N., & Stolz, J. (1994). SUC1 and SUC2: two sucrose transporters from Arabidopsis thaliana; expression and characterization in baker’s yeast and identification of the histidine-tagged protein. The Plant Journal, 6(1), 67–77. https://doi.org/10.1046/j.1365-313X.1994.6010067.x

      Wang, Z., Butel, N., Santos-González, J., Borges, F., Yi, J., Martienssen, R. A., Martinez, G., & Köhler, C. (2020). Polymerase IV Plays a Crucial Role in Pollen Development in Capsella. The Plant Cell, 32(4), 950–966. https://doi.org/10.1105/tpc.19.00938

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript provides evidence that a loss of either the maternal or paternal copy of NRPD1 have different, and sometime opposite, effects on the accumulation of small RNAs and on expression of a subset of genes, with a loss of the maternal copy having more substantial effects. The manuscript is well written, and the conclusions, as far as they go, are justified by the data, which are effectively presented. The overall effect is subtle but informative and according to the authors support a parental conflict model for imprinting. The experiments failed to find a smoking gun in the form of a mechanism to explain how or why the maternal and paternal alleles have different effects and the explanation for a lack of clear phenotypic differences was reasonable, but untested. I would have like to see it tested by looking in a plant species that is outcrossing and highly polymorphic. However, I do appreciate that the observation that the male and female alleles can have distinct effect when mutant is an important clue. My specific comments below may reflect confusion on my part, rather than real issues. If that is that I hope that confusion can aid in clarifying what are in places quite subtle points.

      Specific comments:

      Page 6, last paragraph: "Because the endosperm is triploid, in these comparisons there are 3 (wild-type), 2 (pat nrpd1+/-), 1 (mat nrpd1+/-) and 0 (nrpd1-/-) functional NRPD1 alleles in the endosperm. However, NRPD1 is a paternally expressed imprinted gene in wild-type Ler x Col endosperm and the single paternal allele contributes 62% of the NRPD1 transcript whereas 38% comes from the two maternal alleles (Pignatta et al., 2014). Consistent with paternal allele bias in NPRD1 expression, mRNA-Seq data shows that NRPD1 is expressed at 42% of wild- type levels in pat nrpd1+/- and at 91% of wild-type levels in mat nrpd1+/- (Supplementary Table 6)".

      I would think this would really complicate the analysis. Should all of the dosage values include NPRD1 imprinting values? That is to say, expressed in terms of expression values? This is also a bit confusing. The maternal copies together express 38% of the transcript, so why isn't the mat nrpd1 at 68%, rather than 91%? In any event, given this imprinting and differences in dosage of the male and female it appears that two variables, parental origin and expression levels are being compared. Since 91% is awfully close to 100%, are the mat pat comparisons really just comparing low with nearly normal expression of NRPD1? And actually, given that, the outsized effect of the mat nrpd1 +/- is even more striking.

      Page 4, Line 16. I'm afraid it's still a bit difficult to understand what was being compared what in this section. Please clarify.

      Page 5, Line 5. I'm sure this is fine, but it's not entirely clear what is from the previously published paper and what is reanalysis here. All the crosses and measurements were made then, but not organized in this way?

      Page 6, Line 26. This is an excellent dosage series, but it's complicated by imprinting. So it's not 3, 2, 1, 0 effective copies. If we set the paternal copy at ~1 and each maternal at ~0.1, then it's 1.2 (wild type), 0.20 (pat nrpd1+/-), 1 (mat nrpd1+/-), and 0 (nrpd1-/-).

      Page 6, line 31. Is the main thing we are comparing the difference between expression at 42% verses 91% of wild type?

      Page 7, line 11. So, the overall effect in either direction on smRNA gene targets was really quite small, and I'm guessing the effect on gene expression was even smaller.

      Page 7, line 17. I take it that it is this difference, rather than the overall numbers that is of interest.

      Page 9, line 2. Really interesting, since one might expect that these are methylated loci that would be expected to be fed into any existing embryo maintenance methylation pathway. Surprising that they are maintained independently.

      Page 9, line 22. Proportion of total imprinted genes? Did the mutant obviate/enhance the imprinting?

      Page 9, line 27. How could 2) occur in the homozygous mutant?

      Page 10, line 9. Interesting!

      Page 10, line 11. Is this 2.7 versus 2.18 significant because it's statistically significant, or because it's conceptually significant? Are the examples in 3D representative, or the most convincing examples? And a big difference in ROS1 is of some concern, since that may well be expected to affect imprinting indirectly. I know I'm being picky here, but the pattern is so intriguing I'd be worried about confirmation bias.

      Page 10, line 18. Ok, but 0.123 is a pretty subtle negative correlation. Although I do appreciate that it clearly is not a positive correlation. If I'm understanding correctly, this was the "aha" moment, because it's exactly what one might expect of NRPD1 from the mother and father or working at cross purposes. But the numbers are getting awfully small here.

      Page 10, line 29. The point simply being that that other phenomenon is also significant even if the differences are that large?

      Page 12. So, there is no effect on cleavage and no obvious effect on flanking siRNA clusters. The suspense is building...

      Page 12, line 24. And not in potential regulatory regions? CNSs?

      Page 12, line 28. I guess it depend on whether or not the changes are in regulatory sequences no immediately apparent as part of the gene, doesn't it?

      Line 13, line 2. Not sure it's that important, but couldn't you chop all of these genes in half and see if they are no longer significant collectively?

      Page 14, line 15. I'm afraid I'm getting confused here with the terms cis and trans here. Just to be clear, cis means a direct effect of small RNAs that are dependent on NRPD1 on a gene and trans means anything else? But in this context, it's not clear that is what is meant. Do you mean that gene expression is determined and preset in the gametophyte? What are the levels of expression of NRPD1 in the two gemetophytes?

      Page 14, line 19. Prior to fertilization?

      Page 14, line 27. Do you mean driven by, or just associated with?

      Page 15, line 26. And this is really the issue. The primary conclusion, backed up by the lack (I'm assuming) of phenotypic differences between mat NRPD1 -/+ and pat NRPD1 +/- suggests that the observed differences in expression are not particularly important, unless the exceptional cases are informative.

      Page 15, line 12. Yes, but I'm not at all clear what the mechanism for this is.

      Page 15, line 23. Since this is a very small subset of genes, are these genes that you might expect to play a role in parental conflict?

      Page 15, line 33. Indeed, these could be the informative exceptions.

      Page 15, line 29. Hardly surprising, given that the paternal copy of NRPD1 is expressed at a higher level than the maternal copies, is it?

      Page 16, line 1. So this is what you mean by in cis. Presetting?

      Page 16, line 9. So ideally, one would want to look at a highly polymorphic out-crosser. I'm not suggesting that for this paper, but would this be a good test of the hypothesis? How about maize?

      Page 16, line 15. But the pat and mat heterozygotes looked the same. No differences in phenotype?

      Page 17, line 22. I'm confused, since aren't most 24 nt smRNAs dependent on POLIV (Figure S2)? Do you mean differentially regulated smRNAs? Expression of POLIV specifically in one or the other parent?

      Page 17, line 23. How are you defining important here? Important because at least in the female NPRD1 is not expressed in the central cell? But not important, since this mutant has no effect on phenotype except in an imbalanced cross.

      Page 18, line 13. For this reason, it would be nice to know much more about these genes. Mutant phenotypes, for instance. And how many of these have this feature conserved?

      Significance

      These are intriguing results that would benefit from a test of the hypothesis by comparing these result with those obtained in an outcrossing plant species.

      Referee Cross-commenting

      I agree that the other comments seem both fair and reasonable.

  3. veronicang68.wordpress.com veronicang68.wordpress.com
    1. A commonality in rap music videos is lighter skin toned girls because they have “privileges based on their Eurocentric appearance” (Conrad) and are deemed more desirable

      Looking back to my first draft I had only included a brief summary of the article I had read without putting much of my opinion in the paragraph. As I was editing I had to keep asking myself "yeah but why do you think it is important" or "what do you think about it" because I realized the strength of this essay is supposed to be based on my thoughts and journey through discovering an answer to my question rather than summaries of my research. So, in the final draft I knew I had to include why I think rap videos make a greater impact than what we may originally believe. I added in my thoughts about how in the black community there is an "internal battle" regarding the issue of colorism. I realize now I could have included more about how some artist purposefully choose darker skin extras in their videos to compare to those who don't .

  4. classroom.google.com classroom.google.com
    1. According to all known laws of aviation,

      there is no way a bee should be able to fly.

      Its wings are too small to get its fat little body off the ground.

      The bee, of course, flies anyway

      because bees don't care what humans think is impossible.

      Yellow, black. Yellow, black. Yellow, black. Yellow, black.

      Ooh, black and yellow! Let's shake it up a little.

      Barry! Breakfast is ready!

      Ooming!

      Hang on a second.

      Hello?

      • Barry?
      • Adam?
      • Oan you believe this is happening?
      • I can't. I'll pick you up.

      Looking sharp.

      Use the stairs. Your father paid good money for those.

      Sorry. I'm excited.

      Here's the graduate. We're very proud of you, son.

      A perfect report card, all B's.

      Very proud.

      Ma! I got a thing going here.

      • You got lint on your fuzz.
      • Ow! That's me!
      • Wave to us! We'll be in row 118,000.
      • Bye!

      Barry, I told you, stop flying in the house!

      • Hey, Adam.
      • Hey, Barry.
      • Is that fuzz gel?
      • A little. Special day, graduation.

      Never thought I'd make it.

      Three days grade school, three days high school.

      Those were awkward.

      Three days college. I'm glad I took a day and hitchhiked around the hive.

      You did come back different.

      • Hi, Barry.
      • Artie, growing a mustache? Looks good.
      • Hear about Frankie?
      • Yeah.
      • You going to the funeral?
      • No, I'm not going.

      Everybody knows, sting someone, you die.

      Don't waste it on a squirrel. Such a hothead.

      I guess he could have just gotten out of the way.

      I love this incorporating an amusement park into our day.

      That's why we don't need vacations.

      Boy, quite a bit of pomp... under the circumstances.

      • Well, Adam, today we are men.
      • We are!
      • Bee-men.
      • Amen!

      Hallelujah!

      Students, faculty, distinguished bees,

      please welcome Dean Buzzwell.

      Welcome, New Hive Oity graduating class of...

      ...9:15.

      That concludes our ceremonies.

      And begins your career at Honex Industries!

      Will we pick ourjob today?

      I heard it's just orientation.

      Heads up! Here we go.

      Keep your hands and antennas inside the tram at all times.

      • Wonder what it'll be like?
      • A little scary.

      Welcome to Honex, a division of Honesco

      and a part of the Hexagon Group.

      This is it!

      Wow.

      Wow.

      We know that you, as a bee, have worked your whole life

      to get to the point where you can work for your whole life.

      Honey begins when our valiant Pollen Jocks bring the nectar to the hive.

      Our top-secret formula

      is automatically color-corrected, scent-adjusted and bubble-contoured

      into this soothing sweet syrup

      with its distinctive golden glow you know as...

      Honey!

      • That girl was hot.
      • She's my cousin!
      • She is?
      • Yes, we're all cousins.
      • Right. You're right.
      • At Honex, we constantly strive

      to improve every aspect of bee existence.

      These bees are stress-testing a new helmet technology.

      • What do you think he makes?
      • Not enough.

      Here we have our latest advancement, the Krelman.

      • What does that do?
      • Oatches that little strand of honey

      that hangs after you pour it. Saves us millions.

      Oan anyone work on the Krelman?

      Of course. Most bee jobs are small ones. But bees know

      that every small job, if it's done well, means a lot.

      But choose carefully

      because you'll stay in the job you pick for the rest of your life.

      The same job the rest of your life? I didn't know that.

      What's the difference?

      You'll be happy to know that bees, as a species, haven't had one day off

      in 27 million years.

      So you'll just work us to death?

      We'll sure try.

      Wow! That blew my mind!

      "What's the difference?" How can you say that?

      One job forever? That's an insane choice to have to make.

      I'm relieved. Now we only have to make one decision in life.

      But, Adam, how could they never have told us that?

      Why would you question anything? We're bees.

      We're the most perfectly functioning society on Earth.

      You ever think maybe things work a little too well here?

      Like what? Give me one example.

      I don't know. But you know what I'm talking about.

      Please clear the gate. Royal Nectar Force on approach.

      Wait a second. Oheck it out.

      • Hey, those are Pollen Jocks!
      • Wow.

      I've never seen them this close.

      They know what it's like outside the hive.

      Yeah, but some don't come back.

      • Hey, Jocks!
      • Hi, Jocks!

      You guys did great!

      You're monsters! You're sky freaks! I love it! I love it!

      • I wonder where they were.
      • I don't know.

      Their day's not planned.

      Outside the hive, flying who knows where, doing who knows what.

      You can'tjust decide to be a Pollen Jock. You have to be bred for that.

      Right.

      Look. That's more pollen than you and I will see in a lifetime.

      It's just a status symbol. Bees make too much of it.

      Perhaps. Unless you're wearing it and the ladies see you wearing it.

      Those ladies? Aren't they our cousins too?

      Distant. Distant.

      Look at these two.

      • Oouple of Hive Harrys.
      • Let's have fun with them.

      It must be dangerous being a Pollen Jock.

      Yeah. Once a bear pinned me against a mushroom!

      He had a paw on my throat, and with the other, he was slapping me!

      • Oh, my!
      • I never thought I'd knock him out.

      What were you doing during this?

      Trying to alert the authorities.

      I can autograph that.

      A little gusty out there today, wasn't it, comrades?

      Yeah. Gusty.

      We're hitting a sunflower patch six miles from here tomorrow.

      • Six miles, huh?
      • Barry!

      A puddle jump for us, but maybe you're not up for it.

      • Maybe I am.
      • You are not!

      We're going 0900 at J-Gate.

      What do you think, buzzy-boy? Are you bee enough?

      I might be. It all depends on what 0900 means.

      Hey, Honex!

      Dad, you surprised me.

      You decide what you're interested in?

      • Well, there's a lot of choices.
      • But you only get one.

      Do you ever get bored doing the same job every day?

      Son, let me tell you about stirring.

      You grab that stick, and you just move it around, and you stir it around.

      You get yourself into a rhythm. It's a beautiful thing.

      You know, Dad, the more I think about it,

      maybe the honey field just isn't right for me.

      You were thinking of what, making balloon animals?

      That's a bad job for a guy with a stinger.

      Janet, your son's not sure he wants to go into honey!

      • Barry, you are so funny sometimes.
      • I'm not trying to be funny.

      You're not funny! You're going into honey. Our son, the stirrer!

      • You're gonna be a stirrer?
      • No one's listening to me!

      Wait till you see the sticks I have.

      I could say anything right now. I'm gonna get an ant tattoo!

      Let's open some honey and celebrate!

      Maybe I'll pierce my thorax. Shave my antennae.

      Shack up with a grasshopper. Get a gold tooth and call everybody "dawg"!

      I'm so proud.

      • We're starting work today!
      • Today's the day.

      Oome on! All the good jobs will be gone.

      Yeah, right.

      Pollen counting, stunt bee, pouring, stirrer, front desk, hair removal...

      • Is it still available?
      • Hang on. Two left!

      One of them's yours! Oongratulations! Step to the side.

      • What'd you get?
      • Picking crud out. Stellar!

      Wow!

      Oouple of newbies?

      Yes, sir! Our first day! We are ready!

      Make your choice.

      • You want to go first?
      • No, you go.

      Oh, my. What's available?

      Restroom attendant's open, not for the reason you think.

      • Any chance of getting the Krelman?
      • Sure, you're on.

      I'm sorry, the Krelman just closed out.

      Wax monkey's always open.

      The Krelman opened up again.

      What happened?

      A bee died. Makes an opening. See? He's dead. Another dead one.

      Deady. Deadified. Two more dead.

      Dead from the neck up. Dead from the neck down. That's life!

      Oh, this is so hard!

      Heating, cooling, stunt bee, pourer, stirrer,

      humming, inspector number seven, lint coordinator, stripe supervisor,

      mite wrangler. Barry, what do you think I should... Barry?

      Barry!

      All right, we've got the sunflower patch in quadrant nine...

      What happened to you? Where are you?

      • I'm going out.
      • Out? Out where?
      • Out there.
      • Oh, no!

      I have to, before I go to work for the rest of my life.

      You're gonna die! You're crazy! Hello?

      Another call coming in.

      If anyone's feeling brave, there's a Korean deli on 83rd

      that gets their roses today.

      Hey, guys.

      • Look at that.
      • Isn't that the kid we saw yesterday?

      Hold it, son, flight deck's restricted.

      It's OK, Lou. We're gonna take him up.

      Really? Feeling lucky, are you?

      Sign here, here. Just initial that.

      • Thank you.
      • OK.

      You got a rain advisory today,

      and as you all know, bees cannot fly in rain.

      So be careful. As always, watch your brooms,

      hockey sticks, dogs, birds, bears and bats.

      Also, I got a couple of reports of root beer being poured on us.

      Murphy's in a home because of it, babbling like a cicada!

      • That's awful.
      • And a reminder for you rookies,

      bee law number one, absolutely no talking to humans!

      All right, launch positions!

      Buzz, buzz, buzz, buzz! Buzz, buzz, buzz, buzz! Buzz, buzz, buzz, buzz!

      Black and yellow!

      Hello!

      You ready for this, hot shot?

      Yeah. Yeah, bring it on.

      Wind, check.

      • Antennae, check.
      • Nectar pack, check.
      • Wings, check.
      • Stinger, check.

      Scared out of my shorts, check.

      OK, ladies,

      let's move it out!

      Pound those petunias, you striped stem-suckers!

      All of you, drain those flowers!

      Wow! I'm out!

      I can't believe I'm out!

      So blue.

      I feel so fast and free!

      Box kite!

      Wow!

      Flowers!

      This is Blue Leader. We have roses visual.

      Bring it around 30 degrees and hold.

      Roses!

      30 degrees, roger. Bringing it around.

      Stand to the side, kid. It's got a bit of a kick.

      That is one nectar collector!

      • Ever see pollination up close?
      • No, sir.

      I pick up some pollen here, sprinkle it over here. Maybe a dash over there,

      a pinch on that one. See that? It's a little bit of magic.

      That's amazing. Why do we do that?

      That's pollen power. More pollen, more flowers, more nectar, more honey for us.

      Oool.

      I'm picking up a lot of bright yellow. Oould be daisies. Don't we need those?

      Oopy that visual.

      Wait. One of these flowers seems to be on the move.

      Say again? You're reporting a moving flower?

      Affirmative.

      That was on the line!

      This is the coolest. What is it?

      I don't know, but I'm loving this color.

      It smells good. Not like a flower, but I like it.

      Yeah, fuzzy.

      Ohemical-y.

      Oareful, guys. It's a little grabby.

      My sweet lord of bees!

      Oandy-brain, get off there!

      Problem!

      • Guys!
      • This could be bad.

      Affirmative.

      Very close.

      Gonna hurt.

      Mama's little boy.

      You are way out of position, rookie!

      Ooming in at you like a missile!

      Help me!

      I don't think these are flowers.

      • Should we tell him?
      • I think he knows.

      What is this?!

      Match point!

      You can start packing up, honey, because you're about to eat it!

      Yowser!

      Gross.

      There's a bee in the car!

      • Do something!
      • I'm driving!
      • Hi, bee.
      • He's back here!

      He's going to sting me!

      Nobody move. If you don't move, he won't sting you. Freeze!

      He blinked!

      Spray him, Granny!

      What are you doing?!

      Wow... the tension level out here is unbelievable.

      I gotta get home.

      Oan't fly in rain.

      Oan't fly in rain.

      Oan't fly in rain.

      Mayday! Mayday! Bee going down!

      Ken, could you close the window please?

      Ken, could you close the window please?

      Oheck out my new resume. I made it into a fold-out brochure.

      You see? Folds out.

      Oh, no. More humans. I don't need this.

      What was that?

      Maybe this time. This time. This time. This time! This time! This...

      Drapes!

      That is diabolical.

      It's fantastic. It's got all my special skills, even my top-ten favorite movies.

      What's number one? Star Wars?

      Nah, I don't go for that...

      ...kind of stuff.

      No wonder we shouldn't talk to them. They're out of their minds.

      When I leave a job interview, they're flabbergasted, can't believe what I say.

      There's the sun. Maybe that's a way out.

      I don't remember the sun having a big 75 on it.

      I predicted global warming.

      I could feel it getting hotter. At first I thought it was just me.

      Wait! Stop! Bee!

      Stand back. These are winter boots.

      Wait!

      Don't kill him!

      You know I'm allergic to them! This thing could kill me!

      Why does his life have less value than yours?

      Why does his life have any less value than mine? Is that your statement?

      I'm just saying all life has value. You don't know what he's capable of feeling.

      My brochure!

      There you go, little guy.

      I'm not scared of him. It's an allergic thing.

      Put that on your resume brochure.

      My whole face could puff up.

      Make it one of your special skills.

      Knocking someone out is also a special skill.

      Right. Bye, Vanessa. Thanks.

      • Vanessa, next week? Yogurt night?
      • Sure, Ken. You know, whatever.
      • You could put carob chips on there.
      • Bye.
      • Supposed to be less calories.
      • Bye.

      I gotta say something.

      She saved my life. I gotta say something.

      All right, here it goes.

      Nah.

      What would I say?

      I could really get in trouble.

      It's a bee law. You're not supposed to talk to a human.

      I can't believe I'm doing this.

      I've got to.

      Oh, I can't do it. Oome on!

      No. Yes. No.

      Do it. I can't.

      How should I start it? "You like jazz?" No, that's no good.

      Here she comes! Speak, you fool!

      Hi!

      I'm sorry.

      • You're talking.
      • Yes, I know.

      You're talking!

      I'm so sorry.

      No, it's OK. It's fine. I know I'm dreaming.

      But I don't recall going to bed.

      Well, I'm sure this is very disconcerting.

      This is a bit of a surprise to me. I mean, you're a bee!

      I am. And I'm not supposed to be doing this,

      but they were all trying to kill me.

      And if it wasn't for you...

      I had to thank you. It's just how I was raised.

      That was a little weird.

      • I'm talking with a bee.
      • Yeah.

      I'm talking to a bee. And the bee is talking to me!

      I just want to say I'm grateful. I'll leave now.

      • Wait! How did you learn to do that?
      • What?

      The talking thing.

      Same way you did, I guess. "Mama, Dada, honey." You pick it up.

      • That's very funny.
      • Yeah.

      Bees are funny. If we didn't laugh, we'd cry with what we have to deal with.

      Anyway...

      Oan I...

      ...get you something?

      • Like what?

      I don't know. I mean... I don't know. Ooffee?

      I don't want to put you out.

      It's no trouble. It takes two minutes.

      • It's just coffee.
      • I hate to impose.
      • Don't be ridiculous!
      • Actually, I would love a cup.

      Hey, you want rum cake?

      • I shouldn't.
      • Have some.
      • No, I can't.
      • Oome on!

      I'm trying to lose a couple micrograms.

      • Where?
      • These stripes don't help.

      You look great!

      I don't know if you know anything about fashion.

      Are you all right?

      No.

      He's making the tie in the cab as they're flying up Madison.

      He finally gets there.

      He runs up the steps into the church. The wedding is on.

      And he says, "Watermelon? I thought you said Guatemalan.

      Why would I marry a watermelon?"

      Is that a bee joke?

      That's the kind of stuff we do.

      Yeah, different.

      So, what are you gonna do, Barry?

      About work? I don't know.

      I want to do my part for the hive, but I can't do it the way they want.

      I know how you feel.

      • You do?
      • Sure.

      My parents wanted me to be a lawyer or a doctor, but I wanted to be a florist.

      • Really?
      • My only interest is flowers.

      Our new queen was just elected with that same campaign slogan.

      Anyway, if you look...

      There's my hive right there. See it?

      You're in Sheep Meadow!

      Yes! I'm right off the Turtle Pond!

      No way! I know that area. I lost a toe ring there once.

      • Why do girls put rings on their toes?
      • Why not?
      • It's like putting a hat on your knee.
      • Maybe I'll try that.
      • You all right, ma'am?
      • Oh, yeah. Fine.

      Just having two cups of coffee!

      Anyway, this has been great. Thanks for the coffee.

      Yeah, it's no trouble.

      Sorry I couldn't finish it. If I did, I'd be up the rest of my life.

      Are you...?

      Oan I take a piece of this with me?

      Sure! Here, have a crumb.

      • Thanks!
      • Yeah.

      All right. Well, then... I guess I'll see you around.

      Or not.

      OK, Barry.

      And thank you so much again... for before.

      Oh, that? That was nothing.

      Well, not nothing, but... Anyway...

      This can't possibly work.

      He's all set to go. We may as well try it.

      OK, Dave, pull the chute.

      • Sounds amazing.
      • It was amazing!

      It was the scariest, happiest moment of my life.

      Humans! I can't believe you were with humans!

      Giant, scary humans! What were they like?

      Huge and crazy. They talk crazy.

      They eat crazy giant things. They drive crazy.

      • Do they try and kill you, like on TV?
      • Some of them. But some of them don't.
      • How'd you get back?
      • Poodle.

      You did it, and I'm glad. You saw whatever you wanted to see.

      You had your "experience." Now you can pick out yourjob and be normal.

      • Well...
      • Well?

      Well, I met someone.

      You did? Was she Bee-ish?

      • A wasp?! Your parents will kill you!
      • No, no, no, not a wasp.
      • Spider?
      • I'm not attracted to spiders.

      I know it's the hottest thing, with the eight legs and all.

      I can't get by that face.

      So who is she?

      She's... human.

      No, no. That's a bee law. You wouldn't break a bee law.

      • Her name's Vanessa.
      • Oh, boy.

      She's so nice. And she's a florist!

      Oh, no! You're dating a human florist!

      We're not dating.

      You're flying outside the hive, talking to humans that attack our homes

      with power washers and M-80s! One-eighth a stick of dynamite!

      She saved my life! And she understands me.

      This is over!

      Eat this.

      This is not over! What was that?

      • They call it a crumb.
      • It was so stingin' stripey!

      And that's not what they eat. That's what falls off what they eat!

      • You know what a Oinnabon is?
      • No.

      It's bread and cinnamon and frosting. They heat it up...

      Sit down!

      ...really hot!

      • Listen to me!

      We are not them! We're us. There's us and there's them!

      Yes, but who can deny the heart that is yearning?

      There's no yearning. Stop yearning. Listen to me!

      You have got to start thinking bee, my friend. Thinking bee!

      • Thinking bee.
      • Thinking bee.

      Thinking bee! Thinking bee! Thinking bee! Thinking bee!

      There he is. He's in the pool.

      You know what your problem is, Barry?

      I gotta start thinking bee?

      How much longer will this go on?

      It's been three days! Why aren't you working?

      I've got a lot of big life decisions to think about.

      What life? You have no life! You have no job. You're barely a bee!

      Would it kill you to make a little honey?

      Barry, come out. Your father's talking to you.

      Martin, would you talk to him?

      Barry, I'm talking to you!

      You coming?

      Got everything?

      All set!

      Go ahead. I'll catch up.

      Don't be too long.

      Watch this!

      Vanessa!

      • We're still here.
      • I told you not to yell at him.

      He doesn't respond to yelling!

      • Then why yell at me?
      • Because you don't listen!

      I'm not listening to this.

      Sorry, I've gotta go.

      • Where are you going?
      • I'm meeting a friend.

      A girl? Is this why you can't decide?

      Bye.

      I just hope she's Bee-ish.

      They have a huge parade of flowers every year in Pasadena?

      To be in the Tournament of Roses, that's every florist's dream!

      Up on a float, surrounded by flowers, crowds cheering.

      A tournament. Do the roses compete in athletic events?

      No. All right, I've got one. How come you don't fly everywhere?

      It's exhausting. Why don't you run everywhere? It's faster.

      Yeah, OK, I see, I see. All right, your turn.

      TiVo. You can just freeze live TV? That's insane!

      You don't have that?

      We have Hivo, but it's a disease. It's a horrible, horrible disease.

      Oh, my.

      Dumb bees!

      You must want to sting all those jerks.

      We try not to sting. It's usually fatal for us.

      So you have to watch your temper.

      Very carefully. You kick a wall, take a walk,

      write an angry letter and throw it out. Work through it like any emotion:

      Anger, jealousy, lust.

      Oh, my goodness! Are you OK?

      Yeah.

      • What is wrong with you?!
      • It's a bug.

      He's not bothering anybody. Get out of here, you creep!

      What was that? A Pic 'N' Save circular?

      Yeah, it was. How did you know?

      It felt like about 10 pages. Seventy-five is pretty much our limit.

      You've really got that down to a science.

      • I lost a cousin to Italian Vogue.
      • I'll bet.

      What in the name of Mighty Hercules is this?

      How did this get here? Oute Bee, Golden Blossom,

      Ray Liotta Private Select?

      • Is he that actor?
      • I never heard of him.
      • Why is this here?
      • For people. We eat it.

      You don't have enough food of your own?

      • Well, yes.
      • How do you get it?
      • Bees make it.
      • I know who makes it!

      And it's hard to make it!

      There's heating, cooling, stirring. You need a whole Krelman thing!

      • It's organic.
      • It's our-ganic!

      It's just honey, Barry.

      Just what?!

      Bees don't know about this! This is stealing! A lot of stealing!

      You've taken our homes, schools, hospitals! This is all we have!

      And it's on sale?! I'm getting to the bottom of this.

      I'm getting to the bottom of all of this!

      Hey, Hector.

      • You almost done?
      • Almost.

      He is here. I sense it.

      Well, I guess I'll go home now

      and just leave this nice honey out, with no one around.

      You're busted, box boy!

      I knew I heard something. So you can talk!

      I can talk. And now you'll start talking!

      Where you getting the sweet stuff? Who's your supplier?

      I don't understand. I thought we were friends.

      The last thing we want to do is upset bees!

      You're too late! It's ours now!

      You, sir, have crossed the wrong sword!

      You, sir, will be lunch for my iguana, Ignacio!

      Where is the honey coming from?

      Tell me where!

      Honey Farms! It comes from Honey Farms!

      Orazy person!

      What horrible thing has happened here?

      These faces, they never knew what hit them. And now

      they're on the road to nowhere!

      Just keep still.

      What? You're not dead?

      Do I look dead? They will wipe anything that moves. Where you headed?

      To Honey Farms. I am onto something huge here.

      I'm going to Alaska. Moose blood, crazy stuff. Blows your head off!

      I'm going to Tacoma.

      • And you?
      • He really is dead.

      All right.

      Uh-oh!

      • What is that?!
      • Oh, no!
      • A wiper! Triple blade!
      • Triple blade?

      Jump on! It's your only chance, bee!

      Why does everything have to be so doggone clean?!

      How much do you people need to see?!

      Open your eyes! Stick your head out the window!

      From NPR News in Washington, I'm Oarl Kasell.

      But don't kill no more bugs!

      • Bee!
      • Moose blood guy!!
      • You hear something?
      • Like what?

      Like tiny screaming.

      Turn off the radio.

      Whassup, bee boy?

      Hey, Blood.

      Just a row of honey jars, as far as the eye could see.

      Wow!

      I assume wherever this truck goes is where they're getting it.

      I mean, that honey's ours.

      • Bees hang tight.
      • We're all jammed in.

      It's a close community.

      Not us, man. We on our own. Every mosquito on his own.

      • What if you get in trouble?
      • You a mosquito, you in trouble.

      Nobody likes us. They just smack. See a mosquito, smack, smack!

      At least you're out in the world. You must meet girls.

      Mosquito girls try to trade up, get with a moth, dragonfly.

      Mosquito girl don't want no mosquito.

      You got to be kidding me!

      Mooseblood's about to leave the building! So long, bee!

      • Hey, guys!
      • Mooseblood!

      I knew I'd catch y'all down here. Did you bring your crazy straw?

      We throw it in jars, slap a label on it, and it's pretty much pure profit.

      What is this place?

      A bee's got a brain the size of a pinhead.

      They are pinheads!

      Pinhead.

      • Oheck out the new smoker.
      • Oh, sweet. That's the one you want.

      The Thomas 3000!

      Smoker?

      Ninety puffs a minute, semi-automatic. Twice the nicotine, all the tar.

      A couple breaths of this knocks them right out.

      They make the honey, and we make the money.

      "They make the honey, and we make the money"?

      Oh, my!

      What's going on? Are you OK?

      Yeah. It doesn't last too long.

      Do you know you're in a fake hive with fake walls?

      Our queen was moved here. We had no choice.

      This is your queen? That's a man in women's clothes!

      That's a drag queen!

      What is this?

      Oh, no!

      There's hundreds of them!

      Bee honey.

      Our honey is being brazenly stolen on a massive scale!

      This is worse than anything bears have done! I intend to do something.

      Oh, Barry, stop.

      Who told you humans are taking our honey? That's a rumor.

      Do these look like rumors?

      That's a conspiracy theory. These are obviously doctored photos.

      How did you get mixed up in this?

      He's been talking to humans.

      • What?
      • Talking to humans?!

      He has a human girlfriend. And they make out!

      Make out? Barry!

      We do not.

      • You wish you could.
      • Whose side are you on?

      The bees!

      I dated a cricket once in San Antonio. Those crazy legs kept me up all night.

      Barry, this is what you want to do with your life?

      I want to do it for all our lives. Nobody works harder than bees!

      Dad, I remember you coming home so overworked

      your hands were still stirring. You couldn't stop.

      I remember that.

      What right do they have to our honey?

      We live on two cups a year. They put it in lip balm for no reason whatsoever!

      Even if it's true, what can one bee do?

      Sting them where it really hurts.

      In the face! The eye!

      • That would hurt.
      • No.

      Up the nose? That's a killer.

      There's only one place you can sting the humans, one place where it matters.

      Hive at Five, the hive's only full-hour action news source.

      No more bee beards!

      With Bob Bumble at the anchor desk.

      Weather with Storm Stinger.

      Sports with Buzz Larvi.

      And Jeanette Ohung.

      • Good evening. I'm Bob Bumble.
      • And I'm Jeanette Ohung.

      A tri-county bee, Barry Benson,

      intends to sue the human race for stealing our honey,

      packaging it and profiting from it illegally!

      Tomorrow night on Bee Larry King,

      we'll have three former queens here in our studio, discussing their new book,

      Olassy Ladies, out this week on Hexagon.

      Tonight we're talking to Barry Benson.

      Did you ever think, "I'm a kid from the hive. I can't do this"?

      Bees have never been afraid to change the world.

      What about Bee Oolumbus? Bee Gandhi? Bejesus?

      Where I'm from, we'd never sue humans.

      We were thinking of stickball or candy stores.

      How old are you?

      The bee community is supporting you in this case,

      which will be the trial of the bee century.

      You know, they have a Larry King in the human world too.

      It's a common name. Next week...

      He looks like you and has a show and suspenders and colored dots...

      Next week...

      Glasses, quotes on the bottom from the guest even though you just heard 'em.

      Bear Week next week! They're scary, hairy and here live.

      Always leans forward, pointy shoulders, squinty eyes, very Jewish.

      In tennis, you attack at the point of weakness!

      It was my grandmother, Ken. She's 81.

      Honey, her backhand's a joke! I'm not gonna take advantage of that?

      Quiet, please. Actual work going on here.

      • Is that that same bee?
      • Yes, it is!

      I'm helping him sue the human race.

      • Hello.
      • Hello, bee.

      This is Ken.

      Yeah, I remember you. Timberland, size ten and a half. Vibram sole, I believe.

      Why does he talk again?

      Listen, you better go 'cause we're really busy working.

      But it's our yogurt night!

      Bye-bye.

      Why is yogurt night so difficult?!

      You poor thing. You two have been at this for hours!

      Yes, and Adam here has been a huge help.

      • Frosting...
      • How many sugars?

      Just one. I try not to use the competition.

      So why are you helping me?

      Bees have good qualities.

      And it takes my mind off the shop.

      Instead of flowers, people are giving balloon bouquets now.

      Those are great, if you're three.

      And artificial flowers.

      • Oh, those just get me psychotic!
      • Yeah, me too.

      Bent stingers, pointless pollination.

      Bees must hate those fake things!

      Nothing worse than a daffodil that's had work done.

      Maybe this could make up for it a little bit.

      • This lawsuit's a pretty big deal.
      • I guess.

      You sure you want to go through with it?

      Am I sure? When I'm done with the humans, they won't be able

      to say, "Honey, I'm home," without paying a royalty!

      It's an incredible scene here in downtown Manhattan,

      where the world anxiously waits, because for the first time in history,

      we will hear for ourselves if a honeybee can actually speak.

      What have we gotten into here, Barry?

      It's pretty big, isn't it?

      I can't believe how many humans don't work during the day.

      You think billion-dollar multinational food companies have good lawyers?

      Everybody needs to stay behind the barricade.

      • What's the matter?
      • I don't know, I just got a chill.

      Well, if it isn't the bee team.

      You boys work on this?

      All rise! The Honorable Judge Bumbleton presiding.

      All right. Oase number 4475,

      Superior Oourt of New York, Barry Bee Benson v. the Honey Industry

      is now in session.

      Mr. Montgomery, you're representing the five food companies collectively?

      A privilege.

      Mr. Benson... you're representing all the bees of the world?

      I'm kidding. Yes, Your Honor, we're ready to proceed.

      Mr. Montgomery, your opening statement, please.

      Ladies and gentlemen of the jury,

      my grandmother was a simple woman.

      Born on a farm, she believed it was man's divine right

      to benefit from the bounty of nature God put before us.

      If we lived in the topsy-turvy world Mr. Benson imagines,

      just think of what would it mean.

      I would have to negotiate with the silkworm

      for the elastic in my britches!

      Talking bee!

      How do we know this isn't some sort of

      holographic motion-picture-capture Hollywood wizardry?

      They could be using laser beams!

      Robotics! Ventriloquism! Oloning! For all we know,

      he could be on steroids!

      Mr. Benson?

      Ladies and gentlemen, there's no trickery here.

      I'm just an ordinary bee. Honey's pretty important to me.

      It's important to all bees. We invented it!

      We make it. And we protect it with our lives.

      Unfortunately, there are some people in this room

      who think they can take it from us

      'cause we're the little guys! I'm hoping that, after this is all over,

      you'll see how, by taking our honey, you not only take everything we have

      but everything we are!

      I wish he'd dress like that all the time. So nice!

      Oall your first witness.

      So, Mr. Klauss Vanderhayden of Honey Farms, big company you have.

      I suppose so.

      I see you also own Honeyburton and Honron!

      Yes, they provide beekeepers for our farms.

      Beekeeper. I find that to be a very disturbing term.

      I don't imagine you employ any bee-free-ers, do you?

      • No.
      • I couldn't hear you.
      • No.
      • No.

      Because you don't free bees. You keep bees. Not only that,

      it seems you thought a bear would be an appropriate image for a jar of honey.

      They're very lovable creatures.

      Yogi Bear, Fozzie Bear, Build-A-Bear.

      You mean like this?

      Bears kill bees!

      How'd you like his head crashing through your living room?!

      Biting into your couch! Spitting out your throw pillows!

      OK, that's enough. Take him away.

      So, Mr. Sting, thank you for being here. Your name intrigues me.

      • Where have I heard it before?
      • I was with a band called The Police.

      But you've never been a police officer, have you?

      No, I haven't.

      No, you haven't. And so here we have yet another example

      of bee culture casually stolen by a human

      for nothing more than a prance-about stage name.

      Oh, please.

      Have you ever been stung, Mr. Sting?

      Because I'm feeling a little stung, Sting.

      Or should I say... Mr. Gordon M. Sumner!

      That's not his real name?! You idiots!

      Mr. Liotta, first, belated congratulations on

      your Emmy win for a guest spot on ER in 2005.

      Thank you. Thank you.

      I see from your resume that you're devilishly handsome

      with a churning inner turmoil that's ready to blow.

      I enjoy what I do. Is that a crime?

      Not yet it isn't. But is this what it's come to for you?

      Exploiting tiny, helpless bees so you don't

      have to rehearse your part and learn your lines, sir?

      Watch it, Benson! I could blow right now!

      This isn't a goodfella. This is a badfella!

      Why doesn't someone just step on this creep, and we can all go home?!

      • Order in this court!
      • You're all thinking it!

      Order! Order, I say!

      • Say it!
      • Mr. Liotta, please sit down!

      I think it was awfully nice of that bear to pitch in like that.

      I think the jury's on our side.

      Are we doing everything right, legally?

      I'm a florist.

      Right. Well, here's to a great team.

      To a great team!

      Well, hello.

      • Ken!
      • Hello.

      I didn't think you were coming.

      No, I was just late. I tried to call, but... the battery.

      I didn't want all this to go to waste, so I called Barry. Luckily, he was free.

      Oh, that was lucky.

      There's a little left. I could heat it up.

      Yeah, heat it up, sure, whatever.

      So I hear you're quite a tennis player.

      I'm not much for the game myself. The ball's a little grabby.

      That's where I usually sit. Right... there.

      Ken, Barry was looking at your resume,

      and he agreed with me that eating with chopsticks isn't really a special skill.

      You think I don't see what you're doing?

      I know how hard it is to find the rightjob. We have that in common.

      Do we?

      Bees have 100 percent employment, but we do jobs like taking the crud out.

      That's just what I was thinking about doing.

      Ken, I let Barry borrow your razor for his fuzz. I hope that was all right.

      I'm going to drain the old stinger.

      Yeah, you do that.

      Look at that.

      You know, I've just about had it

      with your little mind games.

      • What's that?
      • Italian Vogue.

      Mamma mia, that's a lot of pages.

      A lot of ads.

      Remember what Van said, why is your life more valuable than mine?

      Funny, I just can't seem to recall that!

      I think something stinks in here!

      I love the smell of flowers.

      How do you like the smell of flames?!

      Not as much.

      Water bug! Not taking sides!

      Ken, I'm wearing a Ohapstick hat! This is pathetic!

      I've got issues!

      Well, well, well, a royal flush!

      • You're bluffing.
      • Am I?

      Surf's up, dude!

      Poo water!

      That bowl is gnarly.

      Except for those dirty yellow rings!

      Kenneth! What are you doing?!

      You know, I don't even like honey! I don't eat it!

      We need to talk!

      He's just a little bee!

      And he happens to be the nicest bee I've met in a long time!

      Long time? What are you talking about?! Are there other bugs in your life?

      No, but there are other things bugging me in life. And you're one of them!

      Fine! Talking bees, no yogurt night...

      My nerves are fried from riding on this emotional roller coaster!

      Goodbye, Ken.

      And for your information,

      I prefer sugar-free, artificial sweeteners made by man!

      I'm sorry about all that.

      I know it's got an aftertaste! I like it!

      I always felt there was some kind of barrier between Ken and me.

      I couldn't overcome it. Oh, well.

      Are you OK for the trial?

      I believe Mr. Montgomery is about out of ideas.

      We would like to call Mr. Barry Benson Bee to the stand.

      Good idea! You can really see why he's considered one of the best lawyers...

      Yeah.

      Layton, you've gotta weave some magic

      with this jury, or it's gonna be all over.

      Don't worry. The only thing I have to do to turn this jury around

      is to remind them of what they don't like about bees.

      • You got the tweezers?
      • Are you allergic?

      Only to losing, son. Only to losing.

      Mr. Benson Bee, I'll ask you what I think we'd all like to know.

      What exactly is your relationship

      to that woman?

      We're friends.

      • Good friends?
      • Yes.

      How good? Do you live together?

      Wait a minute...

      Are you her little...

      ...bedbug?

      I've seen a bee documentary or two. From what I understand,

      doesn't your queen give birth to all the bee children?

      • Yeah, but...
      • So those aren't your real parents!
      • Oh, Barry...
      • Yes, they are!

      Hold me back!

      You're an illegitimate bee, aren't you, Benson?

      He's denouncing bees!

      Don't y'all date your cousins?

      • Objection!
      • I'm going to pincushion this guy!

      Adam, don't! It's what he wants!

      Oh, I'm hit!!

      Oh, lordy, I am hit!

      Order! Order!

      The venom! The venom is coursing through my veins!

      I have been felled by a winged beast of destruction!

      You see? You can't treat them like equals! They're striped savages!

      Stinging's the only thing they know! It's their way!

      • Adam, stay with me.
      • I can't feel my legs.

      What angel of mercy will come forward to suck the poison

      from my heaving buttocks?

      I will have order in this court. Order!

      Order, please!

      The case of the honeybees versus the human race

      took a pointed turn against the bees

      yesterday when one of their legal team stung Layton T. Montgomery.

      • Hey, buddy.
      • Hey.
      • Is there much pain?
      • Yeah.

      I...

      I blew the whole case, didn't I?

      It doesn't matter. What matters is you're alive. You could have died.

      I'd be better off dead. Look at me.

      They got it from the cafeteria downstairs, in a tuna sandwich.

      Look, there's a little celery still on it.

      What was it like to sting someone?

      I can't explain it. It was all...

      All adrenaline and then... and then ecstasy!

      All right.

      You think it was all a trap?

      Of course. I'm sorry. I flew us right into this.

      What were we thinking? Look at us. We're just a couple of bugs in this world.

      What will the humans do to us if they win?

      I don't know.

      I hear they put the roaches in motels. That doesn't sound so bad.

      Adam, they check in, but they don't check out!

      Oh, my.

      Oould you get a nurse to close that window?

      • Why?
      • The smoke.

      Bees don't smoke.

      Right. Bees don't smoke.

      Bees don't smoke! But some bees are smoking.

      That's it! That's our case!

      It is? It's not over?

      Get dressed. I've gotta go somewhere.

      Get back to the court and stall. Stall any way you can.

      And assuming you've done step correctly, you're ready for the tub.

      Mr. Flayman.

      Yes? Yes, Your Honor!

      Where is the rest of your team?

      Well, Your Honor, it's interesting.

      Bees are trained to fly haphazardly,

      and as a result, we don't make very good time.

      I actually heard a funny story about...

      Your Honor, haven't these ridiculous bugs

      taken up enough of this court's valuable time?

      How much longer will we allow these absurd shenanigans to go on?

      They have presented no compelling evidence to support their charges

      against my clients, who run legitimate businesses.

      I move for a complete dismissal of this entire case!

      Mr. Flayman, I'm afraid I'm going

      to have to consider Mr. Montgomery's motion.

      But you can't! We have a terrific case.

      Where is your proof? Where is the evidence?

      Show me the smoking gun!

      Hold it, Your Honor! You want a smoking gun?

      Here is your smoking gun.

      What is that?

      It's a bee smoker!

      What, this? This harmless little contraption?

      This couldn't hurt a fly, let alone a bee.

      Look at what has happened

      to bees who have never been asked, "Smoking or non?"

      Is this what nature intended for us?

      To be forcibly addicted to smoke machines

      and man-made wooden slat work camps?

      Living out our lives as honey slaves to the white man?

      • What are we gonna do?
      • He's playing the species card.

      Ladies and gentlemen, please, free these bees!

      Free the bees! Free the bees!

      Free the bees!

      Free the bees! Free the bees!

      The court finds in favor of the bees!

      Vanessa, we won!

      I knew you could do it! High-five!

      Sorry.

      I'm OK! You know what this means?

      All the honey will finally belong to the bees.

      Now we won't have to work so hard all the time.

      This is an unholy perversion of the balance of nature, Benson.

      You'll regret this.

      Barry, how much honey is out there?

      All right. One at a time.

      Barry, who are you wearing?

      My sweater is Ralph Lauren, and I have no pants.

      • What if Montgomery's right?
      • What do you mean?

      We've been living the bee way a long time, 27 million years.

      Oongratulations on your victory. What will you demand as a settlement?

      First, we'll demand a complete shutdown of all bee work camps.

      Then we want back the honey that was ours to begin with,

      every last drop.

      We demand an end to the glorification of the bear as anything more

      than a filthy, smelly, bad-breath stink machine.

      We're all aware of what they do in the woods.

      Wait for my signal.

      Take him out.

      He'll have nauseous for a few hours, then he'll be fine.

      And we will no longer tolerate bee-negative nicknames...

      But it's just a prance-about stage name!

      ...unnecessary inclusion of honey in bogus health products

      and la-dee-da human tea-time snack garnishments.

      Oan't breathe.

      Bring it in, boys!

      Hold it right there! Good.

      Tap it.

      Mr. Buzzwell, we just passed three cups, and there's gallons more coming!

      • I think we need to shut down!
      • Shut down? We've never shut down.

      Shut down honey production!

      Stop making honey!

      Turn your key, sir!

      What do we do now?

      Oannonball!

      We're shutting honey production!

      Mission abort.

      Aborting pollination and nectar detail. Returning to base.

      Adam, you wouldn't believe how much honey was out there.

      Oh, yeah?

      What's going on? Where is everybody?

      • Are they out celebrating?
      • They're home.

      They don't know what to do. Laying out, sleeping in.

      I heard your Uncle Oarl was on his way to San Antonio with a cricket.

      At least we got our honey back.

      Sometimes I think, so what if humans liked our honey? Who wouldn't?

      It's the greatest thing in the world! I was excited to be part of making it.

      This was my new desk. This was my new job. I wanted to do it really well.

      And now...

      Now I can't.

      I don't understand why they're not happy.

      I thought their lives would be better!

      They're doing nothing. It's amazing. Honey really changes people.

      You don't have any idea what's going on, do you?

      • What did you want to show me?
      • This.

      What happened here?

      That is not the half of it.

      Oh, no. Oh, my.

      They're all wilting.

      Doesn't look very good, does it?

      No.

      And whose fault do you think that is?

      You know, I'm gonna guess bees.

      Bees?

      Specifically, me.

      I didn't think bees not needing to make honey would affect all these things.

      It's notjust flowers. Fruits, vegetables, they all need bees.

      That's our whole SAT test right there.

      Take away produce, that affects the entire animal kingdom.

      And then, of course...

      The human species?

      So if there's no more pollination,

      it could all just go south here, couldn't it?

      I know this is also partly my fault.

      How about a suicide pact?

      How do we do it?

      • I'll sting you, you step on me.
      • Thatjust kills you twice.

      Right, right.

      Listen, Barry... sorry, but I gotta get going.

      I had to open my mouth and talk.

      Vanessa?

      Vanessa? Why are you leaving? Where are you going?

      To the final Tournament of Roses parade in Pasadena.

      They've moved it to this weekend because all the flowers are dying.

      It's the last chance I'll ever have to see it.

      Vanessa, I just wanna say I'm sorry. I never meant it to turn out like this.

      I know. Me neither.

      Tournament of Roses. Roses can't do sports.

      Wait a minute. Roses. Roses?

      Roses!

      Vanessa!

      Roses?!

      Barry?

      • Roses are flowers!
      • Yes, they are.

      Flowers, bees, pollen!

      I know. That's why this is the last parade.

      Maybe not. Oould you ask him to slow down?

      Oould you slow down?

      Barry!

      OK, I made a huge mistake. This is a total disaster, all my fault.

      Yes, it kind of is.

      I've ruined the planet. I wanted to help you

      with the flower shop. I've made it worse.

      Actually, it's completely closed down.

      I thought maybe you were remodeling.

      But I have another idea, and it's greater than my previous ideas combined.

      I don't want to hear it!

      All right, they have the roses, the roses have the pollen.

      I know every bee, plant and flower bud in this park.

      All we gotta do is get what they've got back here with what we've got.

      • Bees.
      • Park.
      • Pollen!
      • Flowers.
      • Repollination!
      • Across the nation!

      Tournament of Roses, Pasadena, Oalifornia.

      They've got nothing but flowers, floats and cotton candy.

      Security will be tight.

      I have an idea.

      Vanessa Bloome, FTD.

      Official floral business. It's real.

      Sorry, ma'am. Nice brooch.

      Thank you. It was a gift.

      Once inside, we just pick the right float.

      How about The Princess and the Pea?

      I could be the princess, and you could be the pea!

      Yes, I got it.

      • Where should I sit?
      • What are you?
      • I believe I'm the pea.
      • The pea?

      It goes under the mattresses.

      • Not in this fairy tale, sweetheart.
      • I'm getting the marshal.

      You do that! This whole parade is a fiasco!

      Let's see what this baby'll do.

      Hey, what are you doing?!

      Then all we do is blend in with traffic...

      ...without arousing suspicion.

      Once at the airport, there's no stopping us.

      Stop! Security.

      • You and your insect pack your float?
      • Yes.

      Has it been in your possession the entire time?

      Would you remove your shoes?

      • Remove your stinger.
      • It's part of me.

      I know. Just having some fun. Enjoy your flight.

      Then if we're lucky, we'll have just enough pollen to do the job.

      Oan you believe how lucky we are? We have just enough pollen to do the job!

      I think this is gonna work.

      It's got to work.

      Attention, passengers, this is Oaptain Scott.

      We have a bit of bad weather in New York.

      It looks like we'll experience a couple hours delay.

      Barry, these are cut flowers with no water. They'll never make it.

      I gotta get up there and talk to them.

      Be careful.

      Oan I get help with the Sky Mall magazine?

      I'd like to order the talking inflatable nose and ear hair trimmer.

      Oaptain, I'm in a real situation.

      • What'd you say, Hal?
      • Nothing.

      Bee!

      Don't freak out! My entire species...

      What are you doing?

      • Wait a minute! I'm an attorney!
      • Who's an attorney?

      Don't move.

      Oh, Barry.

      Good afternoon, passengers. This is your captain.

      Would a Miss Vanessa Bloome in 24B please report to the cockpit?

      And please hurry!

      What happened here?

      There was a DustBuster, a toupee, a life raft exploded.

      One's bald, one's in a boat, they're both unconscious!

      • Is that another bee joke?
      • No!

      No one's flying the plane!

      This is JFK control tower, Flight 356. What's your status?

      This is Vanessa Bloome. I'm a florist from New York.

      Where's the pilot?

      He's unconscious, and so is the copilot.

      Not good. Does anyone onboard have flight experience?

      As a matter of fact, there is.

      • Who's that?
      • Barry Benson.

      From the honey trial?! Oh, great.

      Vanessa, this is nothing more than a big metal bee.

      It's got giant wings, huge engines.

      I can't fly a plane.

      • Why not? Isn't John Travolta a pilot?
      • Yes.

      How hard could it be?

      Wait, Barry! We're headed into some lightning.

      This is Bob Bumble. We have some late-breaking news from JFK Airport,

      where a suspenseful scene is developing.

      Barry Benson, fresh from his legal victory...

      That's Barry!

      ...is attempting to land a plane, loaded with people, flowers

      and an incapacitated flight crew.

      Flowers?!

      We have a storm in the area and two individuals at the controls

      with absolutely no flight experience.

      Just a minute. There's a bee on that plane.

      I'm quite familiar with Mr. Benson and his no-account compadres.

      They've done enough damage.

      But isn't he your only hope?

      Technically, a bee shouldn't be able to fly at all.

      Their wings are too small...

      Haven't we heard this a million times?

      "The surface area of the wings and body mass make no sense."

      • Get this on the air!
      • Got it.
      • Stand by.
      • We're going live.

      The way we work may be a mystery to you.

      Making honey takes a lot of bees doing a lot of small jobs.

      But let me tell you about a small job.

      If you do it well, it makes a big difference.

      More than we realized. To us, to everyone.

      That's why I want to get bees back to working together.

      That's the bee way! We're not made of Jell-O.

      We get behind a fellow.

      • Black and yellow!
      • Hello!

      Left, right, down, hover.

      • Hover?
      • Forget hover.

      This isn't so hard. Beep-beep! Beep-beep!

      Barry, what happened?!

      Wait, I think we were on autopilot the whole time.

      • That may have been helping me.
      • And now we're not!

      So it turns out I cannot fly a plane.

      All of you, let's get behind this fellow! Move it out!

      Move out!

      Our only chance is if I do what I'd do, you copy me with the wings of the plane!

      Don't have to yell.

      I'm not yelling! We're in a lot of trouble.

      It's very hard to concentrate with that panicky tone in your voice!

      It's not a tone. I'm panicking!

      I can't do this!

      Vanessa, pull yourself together. You have to snap out of it!

      You snap out of it.

      You snap out of it.

      • You snap out of it!
      • You snap out of it!
      • You snap out of it!
      • You snap out of it!
      • You snap out of it!
      • You snap out of it!
      • Hold it!
      • Why? Oome on, it's my turn.

      How is the plane flying?

      I don't know.

      Hello?

      Benson, got any flowers for a happy occasion in there?

      The Pollen Jocks!

      They do get behind a fellow.

      • Black and yellow.
      • Hello.

      All right, let's drop this tin can on the blacktop.

      Where? I can't see anything. Oan you?

      No, nothing. It's all cloudy.

      Oome on. You got to think bee, Barry.

      • Thinking bee.
      • Thinking bee.

      Thinking bee! Thinking bee! Thinking bee!

      Wait a minute. I think I'm feeling something.

      • What?
      • I don't know. It's strong, pulling me.

      Like a 27-million-year-old instinct.

      Bring the nose down.

      Thinking bee! Thinking bee! Thinking bee!

      • What in the world is on the tarmac?
      • Get some lights on that!

      Thinking bee! Thinking bee! Thinking bee!

      • Vanessa, aim for the flower.
      • OK.

      Out the engines. We're going in on bee power. Ready, boys?

      Affirmative!

      Good. Good. Easy, now. That's it.

      Land on that flower!

      Ready? Full reverse!

      Spin it around!

      • Not that flower! The other one!
      • Which one?
      • That flower.
      • I'm aiming at the flower!

      That's a fat guy in a flowered shirt. I mean the giant pulsating flower

      made of millions of bees!

      Pull forward. Nose down. Tail up.

      Rotate around it.

      • This is insane, Barry!
      • This's the only way I know how to fly.

      Am I koo-koo-kachoo, or is this plane flying in an insect-like pattern?

      Get your nose in there. Don't be afraid. Smell it. Full reverse!

      Just drop it. Be a part of it.

      Aim for the center!

      Now drop it in! Drop it in, woman!

      Oome on, already.

      Barry, we did it! You taught me how to fly!

      • Yes. No high-five!
      • Right.

      Barry, it worked! Did you see the giant flower?

      What giant flower? Where? Of course I saw the flower! That was genius!

      • Thank you.
      • But we're not done yet.

      Listen, everyone!

      This runway is covered with the last pollen

      from the last flowers available anywhere on Earth.

      That means this is our last chance.

      We're the only ones who make honey, pollinate flowers and dress like this.

      If we're gonna survive as a species, this is our moment! What do you say?

      Are we going to be bees, orjust Museum of Natural History keychains?

      We're bees!

      Keychain!

      Then follow me! Except Keychain.

      Hold on, Barry. Here.

      You've earned this.

      Yeah!

      I'm a Pollen Jock! And it's a perfect fit. All I gotta do are the sleeves.

      Oh, yeah.

      That's our Barry.

      Mom! The bees are back!

      If anybody needs to make a call, now's the time.

      I got a feeling we'll be working late tonight!

      Here's your change. Have a great afternoon! Oan I help who's next?

      Would you like some honey with that? It is bee-approved. Don't forget these.

      Milk, cream, cheese, it's all me. And I don't see a nickel!

      Sometimes I just feel like a piece of meat!

      I had no idea.

      Barry, I'm sorry. Have you got a moment?

      Would you excuse me? My mosquito associate will help you.

      Sorry I'm late.

      He's a lawyer too?

      I was already a blood-sucking parasite. All I needed was a briefcase.

      Have a great afternoon!

      Barry, I just got this huge tulip order, and I can't get them anywhere.

      No problem, Vannie. Just leave it to me.

      You're a lifesaver, Barry. Oan I help who's next?

      All right, scramble, jocks! It's time to fly.

      Thank you, Barry!

      That bee is living my life!

      Let it go, Kenny.

      • When will this nightmare end?!
      • Let it all go.
      • Beautiful day to fly.
      • Sure is.

      Between you and me, I was dying to get out of that office.

      You have got to start thinking bee, my friend.

      • Thinking bee!
      • Me?

      Hold it. Let's just stop for a second. Hold it.

      I'm sorry. I'm sorry, everyone. Oan we stop here?

      I'm not making a major life decision during a production number!

      All right. Take ten, everybody. Wrap it up, guys.

      I had virtually no rehearsal for that.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      *Reviewer #1 (Evidence, reproducibility and clarity (Required)): **

      **Summary**

      The authors have performed highly quantitative analyses of GPCR signaling to reveal heterogenous ERK and Akt activation patterns by using kinase translocation reporters. Using a massive number of single-cell imaging data, the authors show heterogeneous responses to GPCR agonists in the absence or presence of inhibitors. By cluster analysis, the responses of ERK and Akt were classified into eight and three patterns. This paper is clearly written with sufficient information for the reproducibility. However, the conclusion may not be necessarily supported by the provided data as described below.

      **Major comments:**

      This work has been well done in an organized way and adds new insight into the regulation of protein kinases by GPCRs. The conclusion will be of great interest in the field of single-cell signal dynamics and quantitative biology. On a bit negative note, considering the complexity of the downstream of GPCRs, some of the conclusions may need revision. *

      We thank the reviewer for the evaluation and for raising a number of comments that have helped us to strengthen the manuscript and that will be addressed below.

      1. The conclusion of the title that "Heterogeneity and dynamics of ERK/Akt activation by GPCR depend on the activated heterotrimeric G proteins," may not be supported by the data. The authors compared just one pair each of GPCR and ligand. The heterogeneity may come from the nature of the ligand or the characteristics of the single clone chosen for this study. The title may suggest that the heterogeneity depends only on the G-protein (although that is not what the title says). Instead, we mean that G-proteins play a role in the heterogeneity, as we infer from the experiments with the G-protein inhibitors. If the reviewer feels strongly about this, we are open to changing the title, for instance to:

      “Kinase translocation reporters reveal the single cell heterogeneity and dynamics of ERK and Akt activation by G protein-coupled receptors”

      • The obvious question is that why the authors did not analyze the correlation between ERK and Akt activity more extensively. Cell Profiler will be able to extract multiple cellular features. Linking the heterogeneous signals to cellular features will benefit readers in the broad cell biology field. If the authors wish to write another paper with that data, it should be at least discussed. *

      We agree that we can add more information on the correlation between ERK and Akt activity and we have added a plot that shows the co-incidence of the ERK and Akt clusters. This is now panel C of figure 8. We have no wish of writing another paper and we have made the data and code available, so anyone can do a more detailed analysis if desired.

      We appreciate the suggestion to correlate activities with cellular features, such as cell area and shape. However, in our analysis we use nuclear fluorescence to segment the nuclear and cytoplasmic fluorescence (as generally done in studies that use KTRs). Therefore, the information on cellular features is not readily available. Such analysis would require a marker for the cytoplasm or membrane (or yet another image analysis procedure).

      Another apparent flaw of this work is that YM was not challenged to UK-stimulated cells. The authors probably assumed lack of effect. Nevertheless, I believe it is required to show. Or, remove the PTx data from the Histamine-stimulated cell data.

      We agree that this is valuable data to include. Unfortunately, this experiment was done in a slightly different condition than the other experiments (different spacing of the time intervals) and we initially skipped the data for these reasons. After careful examination of the data, we have decided to include these data (added to figures 3 & 5).

      We note that we still miss the data from the YM+PTx data for UK and we have currently no way to carry out these experiments (mainly due to lack of funding). In our opinion, the absence of this data is not critical for the interpretation of the results. We prefer to show the YM+PTx data for the other two conditions.

      The most interesting response is that of S1P. ERK is biphasically activated. Combined inhibition of Gq and Gi failed to suppress ERK activity. It may be discussed why the biphasic activation pattern was not identified by the classification.

      We think that the biphasic activation pattern is reflected by cluster 7 and 8 and we now mention this in the text: “The biphasic ERK activation pattern, which is specific for stimulation with S1P are reflected by cluster 7 and 8.”

      For clarity, we now added the dynamics for each cluster to figure 9.

      *The authors argue that the brightness of the KTR reporter was not correlated with the dynamic range of ERK or Akt reporter (Supplementary Figure 3), but it is not clear. I had an impression that ERK-KTR brightness (Supplementary Figure 3A) has a slightly negative correlation with "maximum change in CN ratio" (Supplementary Figure 3B) (e.g., A6>B3>B5 in brightness and A6

      We thank the reviewer for the suggestion and have now added this data to supplemental figure 3 as panel C.

      The authors have shown cluster analyses for the temporal patterns in kinase activations. However, the only difference of cluster 3 and 5 (Figure 7) seem to be amplitude. The authors have also shown the amplitude is dependent on the dose of the activators, which together makes it difficult to see the biological meaning of discriminating the two patterns in comparing different agonists, e.g., Histamine, UK, and S1P. The authors should discuss their views on how the clustering analyses will benefit biological interpretations together with possible limitations.

      This is a valid point, and it is a consequence of clustering method. We have added text to the discussion to explain our view: “The clustering is a powerful method for the detection of patterns and simplification of large amounts of data. Yet, it should be realized that clustering is mathematical procedure that is not necessarily reflecting the biological processes. One example is the graded response of ERK and Akt activities to ligands, whereas cells are grouped in weak, middle and strong responders. This may be solved by developing and using clustering methods that take the underlying biological processes into account.”

      Considering the importance of the content, the supplemental note 2 may be included in the main text.

      We appreciate this suggestion, and we have incorporated supplemental note 2 in the main text.

      \*Minor comments:**

      1. The authors should clarify the cell type they used (HeLa cells) in the main text and figure legends. *

      This information is now indicated in the first paragraph of the results section and in the legend of figure1.

      Supplementary note1: The data-not-shown data (no correlation of KTR expression and its response to serum) should be very informative for the readers. The data should be shown as an independent supplementary figure.

      This relates to major point 5 and we agree that this is valuable. The data of the expression and the maximum response has been added to supplementary figure 3 as panel C.

      Supplementary Figure S2: The authors should clarify this image processing is about background subtraction. Also, the authors should clearly note "rolling ball with a radius of 70 pixels" is about an ImageJ function, "Subtract Background".

      We added text to highlight that the processing is a background subtraction and noise reduction. We added text to explain it is a FIJI function.

        1. Supplementary Figure S5: Figure labels are "A, A, B, B" not "A, B, C, D". Also the top two figures are lacking Y axis labels. *

      Thanks for pointing this out. We the labels are corrected.

      Page6 (top): The authors should mention the description is about Supplementary Figure S5 (UK) and Supplementary Figure S6 (S1P).

      This is an accidental omission, it is corrected.

      Figure 3: the figures are lacking x-axis labels (probably uM, nM and pM from left).

      Well spotted, this is fixed by adding the units to the labels for each ligand.

      Values in tables: The significant figure must be 2, at best. This should be consistent throughout the text. For example, "The EC50 values for histamine, S1P and UK were respectively 0.3 μM, 63.7 nM and 2.5 pM." This is somewhat awkward.

      This has been fixed in the text and in the table.

      Page 7, the first paragraph: No comments on S1P!

      We added our observation that: “The response to S1P is hardly affected by YM, but the amplitude is reduced by PTx.”

      Fig. 3: 100 mM must read as 100 micromolar.

      We do not understand this comment, but the units of figure 3 are now corrected (see also point 6).

      • Fig. 9: Concentration unit is missing.*

      Thanks for pointing this out, units are added.

      • Page 11, line 4: EKR should read as ERK. *

      Fixed

      • Page 13: "So far, only a couple of studies looked into kinase activation by GPCRs and these studies used overexpressed receptors [32,33]." Please describe precisely. Protein kinase activation by GPCR has been studied more than 20 years. Why are these two recent papers cited here? *

      We updated the text to explain that: “So far, only a couple of studies looked into kinase activation by GPCRs in single cells with KTRs and these studies used overexpressed receptors”.

      "This is in marked contrast to other fluorescent biosensors that typically require an overexpressed receptor for robust responses [34]." Following words should be included in the end: "in our hands".

      We’ve included the suggested line.

      • "Histamine is reported to predominantly activate Gq in HeLa cells [36] and UK activates Gi [37]." Describe the name of receptors for the better understanding. *

      We added names: “Histamine is reported to predominantly activate Gq in HeLa cells by the histamine H1 receptor [36] and UK activates Gi by α2-adrenergic receptors [37]”

      • "S1P can activate a number of different GPCRs, all known to be expressed by HeLa cells [24]." Why is this paper chosen? The authors can easily find RNA-Seq data, if they wish to see the expression level. The cited paper did not scrutinize the S1P receptors expressed in HeLa cells. *

      The S1PR levels are scrutinized in the cited paper, but it is ‘hidden’ in the supplemental figure S4A. We will clarify this and explicitly mention this supplemental figure: “The situation for S1P is different. S1P can activate a number of different GPCRs, all known to be expressed by HeLa cells as shown in the supplemental figure S4A of [24]”

      *Reviewer #1 (Significance (Required)):

      The authors used biosensors for ERK and Akt to examine the kinetics of activation by GPCR ligands. Technical advancement is in the massive analysis method and cluster analysis. This is an important direction for the quantitative biology. GPCR signaling is complex because of multiple receptors coupled with different G proteins. The simple ones such as histamine receptor and alpha2-adrenergic receptor can be easily analyzed as shown in this study. However, there are many S1P receptors, which make the interpretation difficult. If the authors could have shown interesting proposal on this data, the paper may interest many researchers in the field of cell biology and systems biology.

      Expertise: Cell biology, signal transduction of protein kinases, fluorescence microscopy.

      **Referee Cross-commenting**

      1. I agree with the other two reviewers in that immunoblotting data is required to show the efficiency of P2A cleavage.
      2. All reviewers think it looks strange that the authors did not show UK + YM data.
      3. Showing the dynamic range of the biosensors will reinforce the data as Reviewer #3 states. ERK-KTR is quite sensitive and can be easily saturated. Ideally, the ratio of pERK vs ERK can be quantified by the different mobility in SDS-PAGE. But, I do not know how we can do it for Akt.

        Reviewer #2 (Evidence, reproducibility and clarity (Required)):

        **Summary**

        In this paper Chavez-Abiega and colleagues investigate the dynamics of ERK and Akt activity downstream of several G protein-couples receptors (GPCRs). Using drugs to block specific G-proteins, they probe the activation of ERK/Akt by different heterotrimeric G proteins with fluorescent biosensors at the single cell resolution. Main finding is that ERK/AKT can be activated by different G-proteins, depending on the receptor coupling to the G-protein subclass, and that the ERK/AKT dynamics for S1P are specifically heterogeneous. Moreover, it seems that the AKT signaling response is very similar to ERK after GPCR stimulation.

        **Major points:**

        1) For this paper, the authors produced a new construct to express simultaneously the nuclear marker, the Akt and the ERK biosensors. The tree parts are connected by P2A peptides that determine their separation. Although, the biosensors are based on existing ones, the connection between them by P2A might create artifacts if the separation of the two parts is incomplete. For that, important controls are missing, such as treatment with an ERK and an Akt inhibitor. If the two parts are well separated the inhibitors should block the cytosol translocation of one of the two components and not of the other. This control is also important to check if in HeLa cells the Akt biosensors is not phosphorylated by ERK as well, as described in other reports. Alternatively, P2A separation can be quantified on a protein blot. *

      We agree that it is important to establish that the P2A sequence results in separation of the reporters. There are several observations that support our notion that the separation is efficient. First, we have been using the 2A-like sequences for over a decade in HeLa cells (first paper: doi:10.1038/nmeth.1415) and we have never encountered situations where the cleavage was problematic. Second, the distribution in signal of the nuclear Scarlet probe differs substantially from that of the mTurquoise2 and the mNeonGreen probe. Third, the dynamics of the ERK-KTR and Akt-KTR are different. Fourth, we have included new data with an ERK inhibitor, showing that the Akt-KTR responds independently of the ERK-KTR (figure S5). We have also added text to explain this: “Next, we examined the effect of the MEK inhibitor PD 0325901. Pre-incubation with the inhibitor for 20 minutes blocked the response of the ERK-KTR to FBS, but not that of Akt-KTR (Supplemental Figure S5). This supports previous observations [14] [15] that the P2A effectively separates the different components, since the Akt-KTR and ERK-KTR show independent relocation patterns.”

      This latter point is also supported by the co-incidence plot of the ERK versus Akt clusters (figure 8C) showing that the probes act independently (which is the main reason for using this strategy).

      Although any of the aforementioned points cannot exclude that a small fraction of the probe remains fused, we think that this potential issue is far outweighed by the benefits of the use of 2A peptides.

      2) The description of ERK and Akt should be reported in a more uniform way, such as using the same representations for both (e.g. the equivalent of figure 2 for Akt is missing) or the same number of clusters.

      We choose to concentrate first on ERK activity, that is why a similar plot for Akt activation is not shown. However, the Akt responses are detailed in figure 4 and supplemental figures S5 and S7.

      For the cluster analysis, we looked into the optimal number of clusters (as explained in Supplemental note S2). This number differs for ERK and Akt, since the complexity of the responses is different. We move supplemental note 2 to the main text, which also clarifies the different number of clusters that we used for the analysis.

      3) Figure 3 & Figure 5: It seems that the YM and YM+PTx data for the UK 14304 data is missing. This would be an interesting addition to the manuscript, and it is easy to add. A similar analysis for the Akt sensor is missing in figure 3 and should be added for consistency. Figure 4 shows data for Akt, but as timeseries and only for Histamine. See point 2, it would benefit the reader greatly if ERK and AKT are presented in a more uniform and complete fashion throughout the manuscript.

      We agree that it is valuable to add data for UK with YM. This data has been added, see also reply to reviewer 1, major point 3

      As for the Akt data, the response was largely similar albeit with less complexity and a lower amplitude. This is the reason to focus on ERK and this is explained in the discussion: “Therefore, the measurement of Akt does not add information. Moreover, the Akt response had a relatively poor amplitude.”

      4) In the results text of figure 4, the authors state that "...as shown in Figure 4C-D, which is in line with the effect of histamine on ERK.". It is unclear what the authors mean with this statement, the effects of single/double inhibition of Histamine stimulation on ERK are not quantified or discussed. Both responses can be quantified more carefully and compared.

      We agree that this is poorly formulated, and we rephrase it to make it clearer: “Inhibition of Gq (figure 4C) decreases the maximum activity up to ~70%, and simultaneous inhibition of Gq and Gi causes a decrease of the responses up to ~90%, as shown in Figure 4D. These Akt amplitudes and effects of inhibitors are largely similar to those observed for ERK.”

      5) This paper would benefit from a mechanistic investigation. For instance, the authors could investigate the pathways that lead to the generation of the pulse of ERK and Akt. These (preliminary) results presented call for deeper investigation into the signaling pathway from Gai and Gaq to ERK and AKT, and the authors are in a great position to probe this. One simple approach is to explore the upstream pathway, such as the MAPK cascade, PI3K, RTKs by means of inhibitors.

      We agree that there is much that can be done with the KTR technology. To this end, we deposit the probe and make all our data analysis methods available. We hope that others will benefit from our efforts and use the tools for mechanistic studies. 6) Since different G-proteins seem to elicit similar responses on ERK and especially for Akt, it is likely a B-arrestin / beta-gamma subunit mediated mechanism? It would be interesting to hear what the authors think of this, did they investigate/consider this possibility? E.g. Perhaps blocking RTK signaling / B-arrestin signaling would reduce heterogeneity?

      We appreciate this suggestion and have added a statement to the discussion: “Based on our data, we cannot exclude that beta-arrestin or RTKs play a role in the activation of ERK and Akt. To study the role of non-classical routes to ERK activation, inhibitor studies, or probes that interrogate these processes would be useful.”

      7) The authors should take a serious effort to summarize the data in the figures better. Many plots that can be merged/presented in a more concise way, which would improve the readability of the manuscript greatly.

      We will take care to improve the data visualization during the revision. We will address any specific points that are raised.

      \*Minor points:**

      1) The authors should spell out in the legend of each figure if they are representing the absolute C/N or the normalized C/N *

      Thanks for pointing this out. We added this information to the legends and it is also written in the materials and methods: "data was normalized by subtracting the average of two time points prior to stimulation (usually the 5th and 6th time point) from every data point."

      2) In Figure 2 the authors should show the control with no stimulus. Also would be informative to inform the reader about the stimulation protocol used, or indicate the stimulation time and length in the figure.

      We have added the no stimulus control and added the information to the legend.

      3) Figure 3: This figure would benefit from a different presentation of the data, it is currently confusing. E.g. Average curves per drug condition in a single graph would present the point the authors make more clear and concise, and this single cell overview can be moved to supplements.

      Our main focus is on single cell analysis and we think that the current plots convey the message in a clear and transparent fashion. It is in line with the recently proposed idea of “superplot” (https://doi.org/10.1083/jcb.202001064). We also provide scripts and data, enabling anyone to replot the data if that is desired.

      4) Figure 4 legend states "CN ERK" and "ERK C/N", but is depicting only Akt responses? Only in 4c the axes are labeled, this together is very confusing.

      Thanks for pointing this out. This is corrected

      *5) Figure 5 is missing the controls with ERK and Akt inhibitors, to show the loss of correlation between the AUC of the two

      *We have included data with a MEK inhibitor (new supplemental figure S5) to demonstrate the specificity of the probe and it also demonstrates that Akt can be independently activated

      6) Figure 6, the presumed lack of correlation between baseline activity and response should be confirmed statistically.

      We have improved the presentation of figure 6. We now show only the maximal response and how this varies between conditions. It is evident from the graphical representation that the curves are similar for the different start ratios. We feel that the use of statistics is not necessary here.

      7) It seems that in S1P treated cells there is a second oscillation in ERK activity well visible in figure 2 and also in S10. Could the authors comment on that?

      We add text to the discussion to address this: “We observed that activation of endogenous S1P receptors resulted in a strong, but highly heterogeneous ERK-KTR response, with two peaks in a population of cells.” and “When PTx is present, the biphasic response is abolished and the first peak of activation is reduced, suggesting that the initial response is due to Gi signaling.”

      *8) In the abstract it is unclear what authors mean with "UK".

      *Changed to brimonidine

      9) Figure 9, it would be helpful to visually repeat the typical curve of the different clusters here, to guide the reader.

      This is a good suggestion and we have added the typical curves for the different clusters to the plot.

      10) The observed heterogeneity in responses might be related to different cell cycle stages, did the authors investigated/consider this possibility (e.g. with a cell cycle biosensor)?

      This is a very valid comment. We do consider its importance, but we did not investigate the effects of cell cycle.

      *Reviewer #2 (Significance (Required)):

      The paper describes with high accuracy the dynamics of ERK and Akt biosensors downstream of several GPCRs.

      However, it feels like this is a preliminary report that leaves many important questions still open. It does not provide mechanistic insight and doesn't fully exploit the potential of single-cell technologies. The authors have the tools to investigate several important questions that are left open in the manuscript (e.g. connection Gaq/Gai to ERK/AKT, B-arrestin/betagamma involvement). Moreover, some important controls are missing. The authors should also consider the data presentation in the figures, to improve readability and interpretation of the manuscript.

      Properly revised, would be of interest for a broad audience in cell biology, specifically GPCR and RTK signaling fields.

      Expertise in cell biology, gpcr and rtk signaling, fluorescent biosensors.

      **Referee Cross-commenting**

      I agree with the assessments by the other reviewers.

      Indeed showing the dynamic range of the biosensors, as Reviewer #3 states, would strengthen the manuscript and put the S1P response heterogeneity in context.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This manuscript uses a live-cell biosensor approach to examine the activity kinetics of the ERK and Akt kinases in response to different GPCR ligands. The paper provides a detailed description of the development of a HeLa reporter cell line that expresses both Akt and ERK biosensors, along with a nuclear marker for use in cell tracking. The authors then catalog the individual responses from thousands of cells to three GPCR ligands. Individual cells show strong correlation in stimulated ERK and Akt activity. Using inhibitors for Gq and Gi proteins, it is shown that ERK and Akt activities are dependent on different G proteins. The authors also show that the heterogeneous responses within each population can be decomposed into several clusters representing similar dynamic behaviors; the frequencies of these clusters increase or decrease depending on treatments.

      Overall, this is a well documented extension of an existing biosensor approach to examine GPCR signaling, and the approach is clearly described. There are however, some control experiments that are essential to support the conclusions.

      **Major comments:**

      1. The maximal responses of ERK and Akt biosensors in the selected cell clone are not adequately shown. Although FBS responsiveness is used as a validation and selection criterion, it would be much more informative to show the distribution of single-cell responses for defined activators of ERK and Akt, such as EGF and IGF-1, respectively. Without seeing the variability in these responses, it is difficult to put the heterogeneity observed in GPCR responses into context. *

      The FBS is used as a (crude) way to examine responsiveness of the clones. We understand that treatment of the cells with growth factors would add more data and therefore more information to the manuscript. However, the main aim of the study is to examine whether KTR technology can be used to study endogenous GPCR signaling. It is clear that the answer is positive. Next, we asked whether we could detect differences for different GPCRs and that was the focus of this study. It is unclear how studies with EGF would add new information to our observations.

      It is not clear whether the basal activity for the biosensors represents actual activity or simply the measurement floor. This should be established by using saturating treatment inhibitors for ERK and Akt to determine the biosensor readings in the absence of any activity. Ideally, an approach such as the one shown by Ponsioen et al. (PMID: 33795873) should be used to determine the dynamic range of the sensors.

      We studied the basal levels and the effect of serum. We found that the basal levels are reduced by replacing the growth medium with serum free medium. The reduction in C/N ratio reaches a plateau after ~ 2hours of replacing the medium. This data is added as supplemental figure S4. Therefore, we have performed all experiments 2 hours after replacing the growth medium with serum free imaging medium.

      Because the biosensors are separated by self-cleaving peptides, there is the potential that incomplete cleavage could complicate the results. Cleavage efficiency should be assessed by western blot or an equivalent method.

      We agree that it is important to establish that the P2A sequence results in separation of the reporters. There are several observations that support our notion that the separation is efficient. First, we have been using the 2A-like sequences for over a decade in HeLa cells (first paper: doi:10.1038/nmeth.1415) and we have never encountered situations where the cleavage was problematic. Second, the distribution in signal of the nuclear Scarlet probe differs substantially from that of the mTurquoise2 and mNeonGreen probe. Third, the dynamics of the ERK-KTR and Akt-KTR are different. Fourth, we have included new data with an ERK inhibitor, showing that the Akt-KTR responds independently of the ERK-KTR (figure S5). We have also added text to explain this: “Next, we examined the effect of the MEK inhibitor PD 0325901. Pre-incubation with the inhibitor for 20 minutes blocked the response of the ERK-KTR to FBS, but not that of Akt-KTR (Supplemental Figure S5). This supports previous observations [14] [15] that the P2A effectively separates the different components, since the Akt-KTR and ERK-KTR show independent relocation patterns.”

      This latter point is also supported by the co-incidence plot of the ERK versus Akt clusters (figure 8C) showing that the probes act independently (which is the main reason for using this strategy).

      Although any of the aforementioned points cannot exclude that a small fraction of the probe remains fused, we think that this potential issue is far outweighed by the benefits of the use of 2A peptides.

      Ideally, an alternate method such as immunofluorescence for phosphorylated ERK/Akt or their substrates could be used in a subset of the conditions to validate the heterogeneity observed by the biosensors.

      We thank the reviewer for this suggestion. Since we see a lot of variability in the dynamics, which cannot be addressed by immunofluorescence, we do not think this will experiment be valuable. Of note, GPCR activity is known to induce ERK activity in a dose-dependent manner on a population level as determined with immunolabeling methods and that is what we observe with the ERK KTR as well.

      \*Minor comments:**

      1. In the introduction, more rationale and background could be provided for the examination of GPCR-stimulated ERK and Akt activity. There is not much information provided on why this is an interesting question. Other than the involvement of beta arrestin and RTK transactivation, which are mentioned, what mechanisms are known to be involved? Also, the importance of ERK and Akt in cancer is brought up, but it is not made clear how this approach or results would connect specifically to a cancer model. *

      We think that the connections between heterotrimeric G-proteins and kinase activity are not well established. Except for the classical Gq -> PKC -> ERK pathway, not so much is known and we add this to the discussion: “The classic downstream effector of Gq is PKC, which can activate ERK. On the other hand, it is not so clear how Gq would affect Akt. The molecular network that connects the activity of Gi with kinases also not so clear.”

      *It would be helpful to provide some explanation for why the UK+YM and UK+YM+PTx data are not shown in figure 3

      *

      We agree that this is valuable data to include. Unfortunately, this experiment was done in a slightly different condition than the other experiments (different spacing of the time intervals) and we initially skipped the data for these reasons. After careful examination of the data, we have decided to include these data (added to figures 3 & 5).

      We note that we still miss the data from the YM+PTx data for UK and we have currently no way to carry out these experiments (mainly due to lack of funding). We prefer to show the YM+PTx data for the other two conditions.

      • In the Abstract figure, it is not clear which samples "Inhibitor" and "Agonist" are referring to. **

        *

      Thanks for this comment. We will remove the visual abstract when the preprint is submitted to a journal.

      * Reviewer #3 (Significance (Required)):

      While similar reporter approaches have been used in a number of papers to examine growth factor signaling dynamics of ERK and Akt, this manuscript is the first I have seen to examine the responses of these kinases to different GPCR ligands. In doing so, it adds significantly to the growing body of literature on single-cell signaling responses. The mechanisms of ERK and Akt activation by GPCRs remain somewhat ambiguous, and the data reported here will be helpful in refining models for this signal transduction process. The findings that the GPCR ligands examined show different G protein dependencies than anticipated is an interesting facet, as is the observation that, while ERK and Akt are generally correlated, inhibition of Gi preferentially blocks S1P-induced ERK activity more so than Akt activity. However, the main findings of heterogeneity in signaling, and the observation of clusters that describe the different dynamic behaviors present within a population, are highly consistent with what has been shown in other systems. Overall, this study is a useful confirmation that GPCR signaling to ERK and Akt follows a similar pattern to other forms of stimulation.

      **Referee Cross-commenting**

      Regarding the dynamic range, I don't think it is necessary to do a western blot (though this would be nice) - I think it would be sufficient to show maximal activation using EGF/IGF and full suppression using MEK/ERK and Akt inhibitors. I also agree that all the points raised by the other reviewers. In particular, a deeper exploration and better visualization of the relationship between ERK and Akt would be very useful, as noted by both Reviewers #1 and #2.*

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      The authors have performed highly quantitative analyses of GPCR signaling to reveal heterogenous ERK and Akt activation patterns by using kinase translocation reporters. Using a massive number of single-cell imaging data, the authors show heterogeneous responses to GPCR agonists in the absence or presence of inhibitors. By cluster analysis, the responses of ERK and Akt were classified into eight and three patterns. This paper is clearly written with sufficient information for the reproducibility. However, the conclusion may not be necessarily supported by the provided data as described below.

      Major comments:

      This work has been well done in an organized way and adds new insight into the regulation of protein kinases by GPCRs. The conclusion will be of great interest in the field of single-cell signal dynamics and quantitative biology. On a bit negative note, considering the complexity of the downstream of GPCRs, some of the conclusions may need revision.

      1. The conclusion of the title that "Heterogeneity and dynamics of ERK/Akt activation by GPCR depend on the activated heterotrimeric G proteins," may not be supported by the data. The authors compared just one pair each of GPCR and ligand. The heterogeneity may come from the nature of the ligand or the characteristics of the single clone chosen for this study.
      2. The obvious question is that why the authors did not analyze the correlation between ERK and Akt activity more extensively. Cell Profiler will be able to extract multiple cellular features. Linking the heterogeneous signals to cellular features will benefit readers in the broad cell biology field. If the authors wish to write another paper with that data, it should be at least discussed.
      3. Another apparent flaw of this work is that YM was not challenged to UK-stimulated cells. The authors probably assumed lack of effect. Nevertheless, I believe it is required to show. Or, remove the PTx data from the Histamine-stimulated cell data.
      4. The most interesting response is that of S1P. ERK is biphasically activated. Combined inhibition of Gq and Gi failed to suppress ERK activity. It may be discussed why the biphasic activation pattern was not identified by the classification.
      5. The authors argue that the brightness of the KTR reporter was not correlated with the dynamic range of ERK or Akt reporter (Supplementary Figure 3), but it is not clear. I had an impression that ERK-KTR brightness (Supplementary Figure 3A) has a slightly negative correlation with "maximum change in CN ratio" (Supplementary Figure 3B) (e.g., A6>B3>B5 in brightness and A6<B3<B5 in maximum change in CN ratio). The authors should show dot plots of average fluorescence vs. the maximum change in CN ratio.
      6. The authors have shown cluster analyses for the temporal patterns in kinase activations. However, the only difference of cluster 3 and 5 (Figure 7) seem to be amplitude. The authors have also shown the amplitude is dependent on the dose of the activators, which together makes it difficult to see the biological meaning of discriminating the two patterns in comparing different agonists, e.g., Histamine, UK, and S1P. The authors should discuss their views on how the clustering analyses will benefit biological interpretations together with possible limitations.
      7. Considering the importance of the content, the supplemental note 2 may be included in the main text.

      Minor comments:

      1. The authors should clarify the cell type they used (HeLa cells) in the main text and figure legends.
      2. Supplementary note1: The data-not-shown data (no correlation of KTR expression and its response to serum) should be very informative for the readers. The data should be shown as an independent supplementary figure.
      3. Supplementary Figure S2: The authors should clarify this image processing is about background subtraction. Also, the authors should clearly note "rolling ball with a radius of 70 pixels" is about an ImageJ function, "Subtract Background".
      4. Supplementary Figure S5: Figure labels are "A, A, B, B" not "A, B, C, D". Also the top two figures are lacking Y axis labels.
      5. Page6 (top): The authors should mention the description is about Supplementary Figure S5 (UK) and Supplementary Figure S6 (S1P).
      6. Figure 3: the figures are lacking x-axis labels (probably uM, nM and pM from left).
      7. Values in tables: The significant figure must be 2, at best. This should be consistent throughout the text. For example, "The EC50 values for histamine, S1P and UK were respectively 0.3 μM, 63.7 nM and 2.5 pM." This is somewhat awkward.
      8. Page 7, the first paragraph: No comments on S1P!
      9. Fig. 3: 100 mM must read as 100 micromolar.
      10. Fig. 9: Concentration unit is missing.
      11. Page 11, line 4: EKR should read as ERK.
      12. Page 13: "So far, only a couple of studies looked into kinase activation by GPCRs and these studies used overexpressed receptors [32,33]." Please describe precisely. Protein kinase activation by GPCR has been studied more than 20 years. Why are these two recent papers cited here?
      13. "This is in marked contrast to other fluorescent biosensors that typically require an overexpressed receptor for robust responses [34]." Following words should be included in the end: "in our hands".
      14. "Histamine is reported to predominantly activate Gq in HeLa cells [36] and UK activates Gi [37]." Describe the name of receptors for the better understanding.
      15. "S1P can activate a number of different GPCRs, all known to be expressed by HeLa cells [24]." Why is this paper chosen? The authors can easily find RNA-Seq data, if they wish to see the expression level. The cited paper did not scrutinize the S1P receptors expressed in HeLa cells.

      Significance

      The authors used biosensors for ERK and Akt to examine the kinetics of activation by GPCR ligands. Technical advancement is in the massive analysis method and cluster analysis. This is an important direction for the quantitative biology. GPCR signaling is complex because of multiple receptors coupled with different G proteins. The simple ones such as histamine receptor and alpha2-adrenergic receptor can be easily analyzed as shown in this study. However, there are many S1P receptors, which make the interpretation difficult. If the authors could have shown interesting proposal on this data, the paper may interest many researchers in the field of cell biology and systems biology.

      Expertise: Cell biology, signal transduction of protein kinases, fluorescence microscopy.

      Referee Cross-commenting

      1. I agree with the other two reviewers in that immunoblotting data is required to show the efficiency of P2A cleavage.
      2. All reviewers think it looks strange that the authors did not show UK + YM data.
      3. Showing the dynamic range of the biosensors will reinforce the data as Reviewer #3 states. ERK-KTR is quite sensitive and can be easily saturated. Ideally, the ratio of pERK vs ERK can be quantified by the different mobility in SDS-PAGE. But, I do not know how we can do it for Akt.
    1. Likewise, the filing cabinet cannot feed itself without user collaboration; indeed, without a user, the filing cabinet cannot even start its combinatory po-tential. Nevertheless, the card index is used as a true ‘communicative partner’ because it has proper autonomy. In a sense, the card index is fully dependent on and fully independent of the user. The inner structure is methodically ar-ranged so that the users, whoever they may be, can in principle use it; entries are linked so that once the combinatory potential begun, combinations repro-duce themselves and increase the available complexity in unexpected ways.34

      There is an interesting analogy here worth pursuing:

      This idea and its structure have lots of similarities to those of growth and evolution in Werner R. Loewenstein's The Touchstone of Life: Molecular Information, Cell Communication, and the Foundations of Life. What if we reframe RNA or mitochondria in the role of the filing cabinet? What emergent properties occur in these processes? What do these processes have in common?

      I need at least some shorthand idea or word for talking about the circular evolving processes of life in Loewenstein's book. Maybe evolution spirals?

      Think inputs and outputs.

    1. Author Response:

      Reviewer #2 (Public Review):

      Oberle et al. provide a detailed analysis of how descending projections from the auditory cortex interact with ascending auditory projections on neurons in the shell region of the inferior colliculus on a cellular basis. Using optogenetic activation of auditory cortical neurons or projections and electrical stimulation of fibres in combination with whole-cell patch clamp recordings in vivo and in vitro, they show that most neurons in the shell region of the inferior colliculus receive several monosynaptic cortical inputs. In vitro, these descending synapses show sublinear summation with a major tonic component for prolonged stimuli. Both in vivo and in vivo experiments support the idea that descending cortical inputs and ascending inputs from the central inferior colliculus temporally overlap and both activate NMDA and non-NMDA receptors. This cooperativity of inputs leads to supra-linear summation and boosting of the response.

      Strengths:

      • The manuscript provides a first detailed analysis of a loop between the cortex and midbrain. It elegantly combines in vivo and in vitro electrophysiological techniques to study this network on a cellular/synaptic level.

      • These experiments thoroughly characterize the nature of cortical and midbrain excitatory inputs onto shell IC neurons and elucidate how they integrate the ascending and descending inputs on a cellular level.

      Weaknesses:

      • A major weakness of this study is that they do not directly show that ascending and descending inputs to the IC shell neurons actually coincide, but only imply that this should be the case, considering different latency measurements. Latencies that are measured in the anesthetized preparation may change in the awake behaving animals which may change the timing of the respective inputs.

      We rectify this issue in our revision with new data showing that the latency of sound-evoked activity in the superficial IC is similar in anesthetized and awake mice. We acknowledge that the conduction velocity of descending axons may differ between anesthetized and awake state. However, existing data show that conduction velocities of cortical axons increase in the alert brain compared to non-alert conditions (Stoelzel et al., 2017). Taken together, we would expect an increased temporal coincidence of ascending and descending signals in awake compared to anesthetized animals, which all available evidence suggests would enhance NMDAR-dependent non-linearities such as those we described (Gasparini et al., 2004; Gasparini and Magee, 2006; Losonczy and Magee, 2006; Takahashi and Magee, 2009; Branco et al., 2010; Branco and Häusser, 2011). We now revise our Results to highlight that our latency measurements in anesthetized mice represent the upper bound for the arrival of auditory cortical EPSPs.

      In addition, the authors do not show to what extent coincidence of ascending and descending inputs to shell IC neurons is maintained for longer and more complex sounds as compared to click stimuli.

      Previous work shows that auditory cortico-collicular neurons sustain firing during long, complex sounds (Williamson and Polley, 2019), and our data show that descending transmission is maintained for extended periods of corticofugal activity both in vitro and in vivo (Figure 4E-H). Thus, we would expect temporal overlap of ascending and descending inputs to occur under these conditions as well. We agree that Reviewer #2 touches upon an important knowledge gap. However, we believe that a full investigation of which sounds do and do not engage descending modulation merits a separate, in-depth study.

      • The manuscript does not address the question of whether the different neuron types that they encounter in the shell region based on the firing pattern to current injections, vary in their input latencies, their number and distribution of NMDA receptors or their integrative properties. This may have some additional effect on how these neurons process ascending and descending information.

      We agree that correlating intrinsic and synaptic properties could reveal something interesting. However, our initial analyses (Figure 3) did not show any striking correlation between membrane biophysics and the half-width or amplitude of descending EPSPs. As such, we had no a priori basis to hypothesize that synaptic integration differs systematically with measurable membrane properties, and the low-throughput of dual pathway stimulation experiments (Figures 6 and 8) precluded collecting a large dataset needed to convincingly determine if any synaptic non-linearity does or does not meaningfully correlate with the cellular biophysics.

      We acknowledge this limitation of our study in our revised Discussion. Future studies, perhaps leveraging cell-type specific markers for different IC neurons (Goyer et al., 2019; Naumov et al., 2019; Silveira et al., 2020; Kreeger et al., 2021) will be required to clarify this issue.

      • The authors have not demonstrated that silencing of descending inputs from the AC affects IC shell activity.

      We did not initially perform this experiment given the extensive literature establishing that silencing auditory cortex modifies the magnitude, timing, and/or selectivity of IC neuron sound responses (Yan and Suga, 1999; Nwabueze-Ogbo et al., 2002; Popelár et al., 2003; Nakamoto et al., 2008, 2010; Anderson and Malmierca, 2013; Popelář et al., 2016; Weible et al., 2020). Indeed, these classic results were a major motivation for us to focus on the cellular mechanisms that support corticofugal transmission. We thus reasoned that a cortical inactivation experiment would be largely confirmatory of prior knowledge, and limited in its potential for mechanistic interpretation given the known caveats of cortical loss-of-function manipulations (Li et al., 2019; Andrei et al., 2021; Slonina et al., 2021). However, we acknowledge that such an experiment is useful to frame our cellular-level findings in a broader, systems-level context. As such, we address Reviewer #2’s concern in our revision with a new experiment demonstrating that auditory cortical silencing indeed affects sound-evoked activity in the IC of awake mice.

      Reviewer #3 (Public Review):

      Overall, this manuscript is generally nicely written and well-illustrated. I don´t really have any major issues. I like the manuscript but I have a few comments and some issues that need to be addressed.

      My main concern is that the authors claim several times that the projections to the central nucleus of IC are weak and they neglect their potential functional role. I think this is a little bit unfortunate. It is true that the large AC projection primarily targets the cortical regions or shell of IC, but it is beyond doubt that it also targets the central nucleus (e.g. Saldaña's studies) . We cannot know whether it is a weak projection or not without central nucleus recordings. Admittedly, these experiments would be challenging, so I would ask the authors to tone down a bit these comments throughout the ms. Also, the reason for the 'weak' projection to the central nucleus may be due to the size and location of the injections made in the auditory cortex. Thus, I would like to see the injections site of Chronos if possible. Likewise, fig 1B is too small and of low quality (at least in my pdf file for review) to appreciate details of labeling. I would suggest that the authors make a separate figure showing the injection site in the AC and larger and clearer labeling in the IC.

      We agree that in vitro recordings from the central IC in adult mice are quite challenging. As suggested we have toned down claims of the “weak” projection to central IC and provide micrographs of Chronos injection sites. However, we concur that this is an important point. Thus, we include a new transsynaptic tracing experiment showing the somata of presumptive postsynaptic targets of auditory cortex neurons in the IC. Although the data show that the majority of cortico-recipient IC neurons are located in the shell regions, a few central IC neurons are indeed clearly labeled. Future studies will be required to test the extent and potency of this direct auditory cortex->central IC projection, and to compare the synaptic properties with our results in the shell IC.

      Also I wonder if the title of the manuscript should refer to the non-lemniscal IC as most of the data is related to this area.

      We have changed the title of the paper to Synaptic Mechanisms of Top-Down Control in the Non-Lemniscal Inferior Colliculus.

      While the dogma is that the descending projections are glutamatergic, the authors may care to consider a recently published paper https://www.frontiersin.org/articles/10.3389/fncir.2021.714780/full, which challenges this view by showing that inhibitory long-range VIP-GABAergic neurons target the IC. It would be interesting if the authors could comment on how this projection may have influenced the results of the present study.

      We thank Reviewer #3 for pointing out this new study which does indeed relate to our work. However, we don’t think direct GABAergic projections contributed much, if at all to our results. Indeed, the experiments of Figure 5A did not reveal any inhibitory postsynaptic potentials following bath application of NBQX as one might expect from direct stimulation of VIP-GABA axons (these experiments were performed without SR95531 in the bath). Rather, it may be that the VIP-GABA synapses have low release probability, transmit mainly via non-synaptic diffusion (e.g., spillover), or may primarily release the neuropeptide VIP which would be difficult to detect via whole-cell patch-clamp electrophysiology. We now address the work of Bertero et al. in the Discussion section.

      References

      Anderson LA, Malmierca MS (2013) The effect of auditory cortex deactivation on stimulus-specific adaptation in the inferior colliculus of the rat. Eur J Neurosci 37:52–62.

      Andrei AR, Debes S, Chelaru M, Liu X, Rodarte E, Spudich JL, Janz R, Dragoi V (2021) Heterogeneous side effects of cortical inactivation in behaving animals. eLife 10:e66400.

      Branco T, Clark BA, Häusser M (2010) Dendritic discrimination of temporal input sequences in cortical neurons. Science 329:1671–1675.

      Branco T, Häusser M (2011) Synaptic integration gradients in single cortical pyramidal cell dendrites. Neuron 69:885–892.

      Gasparini S, Magee JC (2006) State-dependent dendritic computation in hippocampal CA1 pyramidal neurons. J Neurosci Off J Soc Neurosci 26:2088–2100.

      Gasparini S, Migliore M, Magee JC (2004) On the initiation and propagation of dendritic spikes in CA1 pyramidal neurons. J Neurosci Off J Soc Neurosci 24:11046–11056.

      Goyer D, Silveira MA, George AP, Beebe NL, Edelbrock RM, Malinski PT, Schofield BR, Roberts MT (2019) A novel class of inferior colliculus principal neurons labeled in vasoactive intestinal peptide-Cre mice. eLife 8:e43770.

      Kreeger LJ, Connelly CJ, Mehta P, Zemelman BV, Golding NL (2021) Excitatory cholecystokinin neurons of the midbrain integrate diverse temporal responses and drive auditory thalamic subdomains. Proc Natl Acad Sci U S A 118:e2007724118.

      Li N, Chen S, Guo ZV, Chen H, Huo Y, Inagaki HK, Chen G, Davis C, Hansel D, Guo C, Svoboda K (2019) Spatiotemporal constraints on optogenetic inactivation in cortical circuits. eLife 8:e48622.

      Losonczy A, Magee JC (2006) Integrative properties of radial oblique dendrites in hippocampal CA1 pyramidal neurons. Neuron 50:291–307.

      Nakamoto KT, Jones SJ, Palmer AR (2008) Descending projections from auditory cortex modulate sensitivity in the midbrain to cues for spatial position. J Neurophysiol 99:2347–2356.

      Nakamoto KT, Shackleton TM, Palmer AR (2010) Responses in the inferior colliculus of the guinea pig to concurrent harmonic series and the effect of inactivation of descending controls. J Neurophysiol 103:2050–2061.

      Naumov V, Heyd J, de Arnal F, Koch U (2019) Analysis of excitatory and inhibitory neuron types in the inferior colliculus based on Ih properties. J Neurophysiol 121:2126–2139.

      Nwabueze-Ogbo FC, Popelár J, Syka J (2002) Changes in the acoustically evoked activity in the inferior colliculus of the rat after functional ablation of the auditory cortex. Physiol Res 51 Suppl 1:S95–S104.

      Popelár J, Nwabueze-Ogbo FC, Syka J (2003) Changes in neuronal activity of the inferior colliculus in rat after temporal inactivation of the auditory cortex. Physiol Res 52:615–628.

      Popelář J, Šuta D, Lindovský J, Bureš Z, Pysanenko K, Chumak T, Syka J (2016) Cooling of the auditory cortex modifies neuronal activity in the inferior colliculus in rats. Hear Res 332:7–16.

      Silveira MA, Anair JD, Beebe NL, Mirjalili P, Schofield BR, Roberts MT (2020) Neuropeptide Y Expression Defines a Novel Class of GABAergic Projection Neuron in the Inferior Colliculus. J Neurosci 40:4685–4699.

      Slonina ZA, Poole KC, Bizley JK (2021) What can we learn from inactivation studies? Lessons from auditory cortex. Trends Neurosci:S0166-2236(21)00203-4.

      Stoelzel CR, Bereshpolova Y, Alonso J-M, Swadlow HA (2017) Axonal Conduction Delays, Brain State, and Corticogeniculate Communication. J Neurosci Off J Soc Neurosci 37:6342–6358.

      Takahashi H, Magee JC (2009) Pathway interactions and synaptic plasticity in the dendritic tuft regions of CA1 pyramidal neurons. Neuron 62:102–111.

      Weible AP, Yavorska I, Wehr M (2020) A Cortico-Collicular Amplification Mechanism for Gap Detection. Cereb Cortex N Y N 1991 30:3590–3607.

      Williamson RS, Polley DB (2019) Parallel pathways for sound processing and functional connectivity among layer 5 and 6 auditory corticofugal neurons. eLife 8:e42974.

      Yan J, Suga N (1999) Corticofugal Amplification of Facilitative Auditory Responses of Subcortical Combination-Sensitive Neurons in the Mustached Bat. J Neurophysiol 81:817–824.

    2. Reviewer #3 (Public Review):

      Overall, this manuscript is generally nicely written and well-illustrated. I don´t really have any major issues. I like the manuscript but I have a few comments and some issues that need to be addressed.

      My main concern is that the authors claim several times that the projections to the central nucleus of IC are weak and they neglect their potential functional role. I think this is a little bit unfortunate. It is true that the large AC projection primarily targets the cortical regions or shell of IC, but it is beyond doubt that it also targets the central nucleus (e.g. Saldaña's studies) . We cannot know whether it is a weak projection or not without central nucleus recordings. Admittedly, these experiments would be challenging, so I would ask the authors to tone down a bit these comments throughout the ms. Also, the reason for the 'weak' projection to the central nucleus may be due to the size and location of the injections made in the auditory cortex. Thus, I would like to see the injections site of Chronos if possible. Likewise, fig 1B is too small and of low quality (at least in my pdf file for review) to appreciate details of labeling. I would suggest that the authors make a separate figure showing the injection site in the AC and larger and clearer labeling in the IC.<br> Also I wonder if the title of the manuscript should refer to the non-lemniscal IC as most of the data is related to this area.<br> While the dogma is that the descending projections are glutamatergic, the authors may care to consider a recently published paper<br> https://www.frontiersin.org/articles/10.3389/fncir.2021.714780/full, which challenges this view by showing that inhibitory long-range VIP-GABAergic neurons target the IC. It would be interesting if the authors could comment on how this projection may have influenced the results of the present study.

    1. Reviewer #1 (Public Review):

      This paper reports features of the development (and subsequent loss) of the egg tooth of the short-beaked echidna (T. aculeatus) at the histological level. Based on these features, the authors then consider the homology of the egg tooth/caruncle of the echidna with those of avian and non-avian reptiles. The authors report that while the echidna egg tooth is first apparent as a Shh-expression epithelial placode, the tooth then takes shape by evagination, rather than invagination, of that placode. This is reminiscent of the first teeth of some reptiles. The authors also find that the echidna egg tooth is anchored directly to the bone of the premaxilla (again, reminiscent of the mechanism of attachment of some reptilian teeth, and unlike the thecodonty seen in mammalian teeth). The caruncle also forms near the premaxillary bone and is associated with a prematurely differentiated and cornified epithelium. Finally, the authors find that the egg tooth is lost via a combination of resorption (by multinucleated TRAP-positive clast cells) and by cell death within the egg tooth pulp, and that the caruncle is lost at some undetermined point between 11- and 50-days post-hatching. Taken together, these findings indicate that the only tooth (albeit a transient one) in the otherwise edentulate echidna more closely resembles the teeth of reptiles than those of eutherian mammals, indicative of remarkable conservation of dental features in monotremes and reptiles from the last common ancestor of amniotes.

      Strengths

      We commend the authors on acquiring a unique and impressive series of embryonic and post-embryonic echidna specimens, and on making the most of these precious specimens by sequentially imaging them for microCT, followed by processing for paraffin histochemistry and/or immunofluorescence. The quality of the histology and image data presented here is high, and the authors effectively use their various data types (CT and section) in combination to provide good and clear anatomical context for their observations. This histochemical stainings presented here are very clear, and easily allows the reader to distinguish tissue types and connectively between elements (e.g., between the dentine of the egg tooth and the premaxilla, and between the os caruncle and the premaxilla).

      Furthermore, by framing their work in a comparative context, the authors can propose homologies between the egg tooth of the echidna and the first forming teeth of some lizards and crocodilians. Monotremes possess a fascinating melange of anatomical features classically regarded as "mammalian" or "reptilian", but these are extremely difficult to study developmentally. This work is a significant contribution in this regard and highlights the importance of monotreme developmental data when reconstructing the nature of the last common ancestor of amniotes.

      Weaknesses

      The introduction of the paper is a bit too long (and, at times, unfocused). Given the succinct nature of the results, the paper would benefit from a more focused and streamlined introduction.

      While the embryonic samples studied here are understandably limited (and sample sizes necessarily small), there are nevertheless claims made here that are not fully supported by the figures. In most instances, this is a case of a lack of high-magnification panels in the plates illustrating, for example, the features of the odontoblast layer, the ameloblast-like cells at the tip of the tooth, etc. These features are discussed, but not shown.

      The story around the caruncle isn't fully developed. It is introduced as though there has been some debate about whether the element forms as a distinct condensation from the premaxilla, but then this is not revisited. Also, the rationale for the choice of molecular markers used to characterise the epithelial component of the caruncle isn't entirely clear. The authors state that Loricrin is a marker of "terminal differentiation" - but does this mean that the loricrin-expressing epithelium adjacent to the caruncle skeleton is just farther along in its development relative to adjacent epidermis? Or is loricrin a specific marker of "cornified epithelia"? And if the latter, has loricrin expression been examined in the developing caruncles of avian or non-avian reptiles?

      Finally, the evolutionary synthesis presented here seems reasonable with respect to the egg tooth but remains a bit less clear with respect to the caruncle. The authors conclude that the os caruncle may be a novelty of monotremes, but that the epithelial caruncle may be homologous between monotremes and reptiles - but then suggest that the last common ancestor of amniotes had both structures? It is difficult to follow this logic. I think that that paper would benefit from a more nuanced "final model" or hypothesis of homology of egg teeth and caruncles across amniotes.

    1. Author Response:

      Reviewer #3 (Public Review):

      In this manuscript the authors make several conclusions, according to the abstract:

      1 - LTG activity is essential by contributing to a process independent of PG recycling.

      2 - LTGs are important because of their catalytic activity rather than because of a protein-protein interaction.

      3 - LTG mutants are hypersusceptible to production of periplasmic polymers.

      4 - LTGs prevent toxic periplasmic crowding and their function is temporally separate from PG synthesis.

      The authors perform a series of genetic experiments that lead to their conclusions. Their first conclusion is well supported by data showing that a PG recycling mutant does not have the same defects as their LTG mutant.

      Their second conclusion needs more justification/explanation. They show a catalytic mutant of RlpA is unable to sustain growth as the only LTG in the cell. However, I am confused by their wording around RlpA in general. In the text they note that their delta_7 mutant, which encodes RlpA, 'has no highly active LTGs' (lines 130-131). Does that imply that RlpA is not an LTG? In the discussion they note that E.coli RlpA has no LTG activity. Is this enzyme known to have LTG activity in V.cholerae? One important control would be to show that the catalytically inactive protein is stable (i.e. that the defect is not due to protein misfolding). This could be supported by looking at protein stability via Western or even quantifying the fluorescence data in Figure S3b.

      Alignment of VcRlpA with P. aeruginosa RlpA, which has been demonstrated in vivo and in vitro to be an active LTG, suggests VcRlpA retains the active site residues required for PG cleavage. This, as well as the inability of a VcRlpA^D145A mutant (based on the alignment with catalytically inactive EcRlpA) to rescue native RlpA depletion from the ∆LTG mutants suggests that VcRlpA is an active LTG and that this activity is required in the absence of all other annotated V. cholerae LTGs. We agree that “no highly active LTGs” is confusing and we have changed the text to simply describe the ∆7 LTG mutant as being significantly depleted in LTG activity as measured by anhMurNAc abundance in the sacculus. Lastly, we have conducted Western Blots demonstrating in the revised manuscript that our catalytic site mutant is indeed produced and stable (Figure S3).

      Their third conclusion also needs more support. The authors do a series of experiments showing that delta7 is more susceptible to SacB. What are the data that show sacB produces large polysaccharides molecules in the periplasm rather than (or in addition to) the cytoplasm? This would be important to show as these data are the main test of the authors model.

      In native B. subtilis as well as in E. coli, SacB has a canonical Sec signal peptide which is annotated as being cleaved after residue Ala29 (Uniprot G3CAF6_BACIU) to be released extracellularly. A reference (Pereira, et al, 2001) has been added in support of SacB functioning extracellularly and not in the cytoplasm of its native host, B. subtilis.

      The authors have other data that all argue for their model that LTG deficient strains have an excess of periplasmic crowding. The suppressor of delta_opgH is intriguing, but does not restore the morphological defects in delta_7, suggesting that the increase in length during prolonged growth may not be caused by periplasmic crowding, or at least is not alleviated by deletion of OpgH. What then does the deletion of OpgH suppress? Here, I was confused by the experiments in low salt. The authors write that the cells lyse (line 222) but this is not shown anywhere. Growing the cells continually in low salt may not be the hypoosmotic challenge the authors presume. A challenge typically implies an acute change in osmolarity, rather than a prolonged exposure, which may allow cells to adapt.

      We do not fully understand the role of OpgH, but here is our working model: LTGs have at least two essential functions – 1) PG release and 2) mitigating periplasmic crowding, either or both of which can become more important based on osmotic conditions. Since MltG seems to be the main PG release factor (at least based on E. coli), which can be partially supplanted by collective action of other LTGs, the ∆7 suffers from both PG release defects and periplasmic crowding defects, perhaps more so in an osmotically challenging low salt medium. The evidence for lysis is that at high inoculum (10^-2) the ∆7 LTG mutant does grow for a short time, but then we observe a drop in OD_600, indicative of lysis. According to our model, ∆6, on the other hand, which still has MltG, likely suffers only (or mostly) from a periplasmic crowding defect. Deleting periplasmic glucans only mitigates periplasmic crowding (and probably only partially), which does not help the more defective ∆7, which additionally suffers from lack of the postulated second activity.

      The reviewers raise an interesting point regarding the word “challenge”. We indeed specifically make the point that this is not an acute challenge, but rather accumulating damage during prolonged growth, even in salt-free LB. We have thus removed the word “challenge” from the revised manuscript. Importantly, we only use the ∆opgH suppression phenotype as one of many puzzle pieces for our conclusion. The key assay is the direct demonstration of periplasmic soluble PG strands accumulating in both WT and, to a higher degree, the ∆6 LTG mutant (Fig. 6).

      I was also highly confused by the antibiotic + BADA staining experiments. Do the authors stain the cells, treat, and then visualize? Are they then studying the fate of old PG? How does BADA get incorporated into PG in V.cholerae? Is it through LDT activity or some other way? Without more explanation, it is hard to interpret the results.

      BADA does get incorporated through either LDT or PG synthesis activity in V. cholerae, but for these experiments, the specific incorporation pathway is inconsequential, since we only focus on the end product (stained PG). We think that what we visualize is not the fate of old PG (otherwise we would see similar strong stains with Fosfomycin, which inhibits cell wall synthesis upstream of PG strand generation by PBPs/SEDS), but rather visualizes the generation of long, uncrosslinked PG strands due to the inhibition of PBP transpeptidase activity. We have added more explanations of this assay to the revised manuscript.

      The last conclusion is not supported by data. There are no data showing that LTG activity is temporally separate from PG synthesis.

      We would like to point out that this is not framed as a conclusion per se, but rather a plausible speculation. Our data showing soluble strand accumulation in the WT strongly suggest that LTGs do not work in perfect harmony with synthesis, but rather degrade strands AFTER they accumulate (i.e., temporally separate). We further believe that complementation with a heterologous enzyme (MltE), which does not have a homolog in V. cholerae strongly argues that LTGs and PG synthesis do not have to associate through protein-protein interactions. All this adds to an emerging model that PG synthesis and LTG-mediated degradation are not as tightly co-ordinated as one might assume.

    1. Author Response:

      Reviewer #1:

      Authors introduce a deep learning-based toolbox (ELEPHANT) to provide ease in annotation and tracking for 3D cells across time. The study takes two datasets (CE and PH) to demonstrate the performance of their method and compare it with two existing 3D cell tracking methods on segmentation and accuracy metrics. 3D U-Nets are shown to be performing well in segmentation tasks in recent years, authors also utilize 3D U-Net for segmenting cells as well as linking the nuclei across time through optical flow. The variation in selected datasets is shown to be in the shape, size and intensity of cells. Beyond segmentation, authors also demonstrate the performance of ELEPHANT in exploring the tracking results with and without optical flow and regenerating their fate maps. A complete server-based implementation is provided with detailed codebase and docker images to implement and utilize ELEPHANT.

      Strengths:

      The paper is technically sound with detailed explanation of each methodological step and results. 3D U-Nets are optimized for the segmentation task in hand with large training sessions, efficiency of the pipeline is nicely demonstrated which serves this as a useful toolbox for real-time annotation and prediction of cell structures. The detailed implementation on a local and remote server is presented which is a need while handling and analyzing large scale bio-imaging datasets. Beyond smoothing, SSIM-based loss is effectively applied to make the model robust against intensity and structural variations which definitely helps in generalized performance of the segmentation and tracking pipeline.

      Segmentation results are validated on a large set of nuclei and links which is helpful to understand the limitation of the models. The advantage of using optical flow-based linking is clearly shown on top of using nearest neighbors. Spatio-temporal distribution of cells on a given data guides the users in using the framework for several biological applications such as tracking the lineage of newly born cells - a hard task in stem cell engineering.

      A detailed implementation on both remote and server as well as open-source codebase on Github is well provided for the scientific community which will help the users to easily use ELEPHANT for specific datasets. Although CE and PH datasets are used to demonstrate the performance, however, similar implementation can also be performed on neuronal datasets that would be of much use in exploring neurogenesis.

      Weaknesses:

      Authors use ellipse-like shapes to annotate the data, however, many cells are not elliptic or circular in shape but consist of varying morphology. If the annotation module is equipped with drawing free annotations then it will be better useful to capture the diverse shapes of cells in both training and validation. This also limits the scope of the study to be used only for cells' datasets that are circular/elliptical in shape.

      ELEPHANT can be used to track nuclei or cells of diverse shapes. Tracking is based on reliable detection of nuclei/cells but does not require precise segmentation of their shapes. We have now added results showing that ellipsoid approximations are sufficient for detection and cell tracking, even when tracking cells with complex and variable shapes (figure 3).

      As we now explain in the manuscript (page 4), we use ellipsoids for annotation because they are essential for rapid and efficient training and predictions, which are the backbone of interactive deep learning. In practice, using ellipsoids also reduces the amount of work required for annotating the data compared with precise drawing of cell outlines. Post-processing can be appended to our workflow if a user needs to extract the precise morphology of cells.

      Authors use 3D U-net for segmentation which is a semantic segmenter, perhaps, an instance-based 3D segmenter could be a better choice to track the identity of the cells across time and space. However, an instance-based segmenter may not be ideal for segmenting the cells boundaries but a comparison between a 3D U-Net and an instance-based 3D segmenter on the same datasets will be helpful to evaluate.

      Although the original 3D U-Net is a semantic segmenter, we use its architecture to estimate the center region of cells, which works as an instance-wise detector. A similar strategy was followed by recent techniques (Kok et al. 2020, PLoS One doi:10.1101/2020.03.18.996421, Scherr et al. 2020 PLoS One doi:10.1371/journal.pone.0243219) to identify cell instances. Instance-based segmenters (e.g. StarDist, Mask R-CNN) are particularly useful for precise segmentation but our primary focus here is detection and tracking, which can be done most efficiently with the current architecture. Because StarDist or Mask R-CNN do not support sparse annotations, a direct comparison of these methods is difficult at the moment.

      The selected datasets seem to be capturing the diversity in shape and intensity, however, the biological imaging datasets in practice often have low signal to noise ratio, cell density variation and overlapping, etc. It seems like the selected datasets lack these diversities and a performance on any other data of such kind would be useful for performance evaluation as well as providing a pre-trained model for the community usage. Moreover, it would also be useful to demonstrate the performance of the framework in segmenting+tracking any 3D neuronal nuclei dataset which will broaden the scope of the study.

      The PH dataset that we used for testing ELEPHANT presents many challenges, such as variations in intensity, areas of low signal to noise ratio, densely packed and overlapping nuclei (see manuscript page 7, Suppl. Figure 5). To add to this analysis, we have now applied our method to additional datasets that show diverse characteristics – including datasets with elongated/irregular-shaped cells from the Cell Tracking Challenge (Figure 3E) and organoids imaged by light and confocal microscopy (Figure 3C,D) – demonstrating the versatility of our method. We do not think that neuronal nuclei present a particular challenge for ELEPHANT (the PH dataset includes neurons).

      We now also provide a pre-trained model, trained with diverse image datasets, which can be applied by users as a starting point for tracking on new image data.

      The 3D U-Nets are used for linking by using the difference between two consecutive images (across time) as labels. However, this technique helps to track the cell in theory but may also result in losing cell identity when cells are overlapping or when boundary features are less prominent, etc. Perhaps, a specialized deep neural network such as FlowNet3D could be a better choice here.

      Our 3D U-Net does not directly generate links across consecutive images. Instead it produces voxel-wise optical flow maps for each of the three dimensions, which are then combined with detection results to predict the position for each object (see manuscript page 6 and Methods). This is then used for linking. The identity of the tracked objects is defined during detection.

      In the end, our approach is similar to FlowNet3D in that both estimate optical flow for each detected object, although we use two consecutive images as input instead of the sets of detected objects. FlowNet3D operates only on object coordinates, without taking into account image features that could be important cues for cell tracking (e.g. fluorescence intensity of nuclei during cell division).

      Reviewer #2:

      The authors created a cell tracking tool, which they claimed was user-friendly and achieved state-of-the-art performance.

      Would a user, particularly a biologist, be able to run the code from a set of instructions clearly defined on the readme? This was not possible for me. I am not familiar with Java or Mastodon, but I'm not sure we can expect the average biologist to be familiar with these tools either. I was very impressed by the interface provided though.

      We have updated the user manual and software interface to make the software more accessible for users. Moreover, ELEPHANT is now available as an extension on Fiji, which will greatly facilitate its adoption by non-expert users.

      Did the authors achieve state-of-the-art performance? It is unclear from the paper. It would be helpful to see comparisons of this tool with modern deep learning approaches such as Stardist. Stardist for instance reports performance on the parhyale dataset in their paper. Many people in the field are combining tools like Stardist with cell tracking tools like trackmate (e.g. see https://www.biorxiv.org/content/10.1101/2020.09.22.306233v1). It would be important to know whether one can get performance comparable to Stardist (at e.g. a 0.5 IoU threshold) on a single 3D with this sparse labelling and interactive approac. I still think this approach of using sparse labelling could be very useful for transferring to novel datasets, but it is difficult to justify the framework if there is a large drop in performance compared to a fully supervised algorithm.

      The novelty in ELEPHANT is making deep learning available for cell tracking and lineaging by users who do not have extensive annotated datasets for training. Existing deep learning applications (including StarDist) do not fulfill this purpose.

      The detection and tracking scores of ELEPHANT in the Cell Tracking Challenge (identified as IGFL-FR) were the best when applied to cell lineaging on C. elegans test datasets, compared to a large number of other tracking applications (http://celltrackingchallenge.net/latest-ctb-results/). This comparison includes methods that employ deep-learning.

      ELEPHANT models trained with sparse annotation perform similarly well to trained StarDist3D models for nuclear detection in single 3D stacks (see Supplementary Figure 8). For cell tracking over time, StarDist and Trackmate have so far only been implemented in 2D.

      Reviewer #3:

      This work describes a new open source tool (ELEPHANT, https://elephant-track.github.io/) for efficient and interactive training of a deep learning based cell detection and tracking model. It uses the existing Fiji plugin Mastodon as an interactive front end (https://github.com/mastodon-sc/mastodon). Mastodon is a large-scale tracking and track-editing framework for large, multi-view images. The authors contribution is an extension of Mastodon, adding automated deep learning based cell detection and tracking. Technically, this is achieved by connecting the Mastodon as a client (written in Java) to a deep learning server (written in Python). The server can run on a different dedicated computer, capable of the GPU based computations that are needed for deep learning. This framework makes possible the detection and tracking of cells in very large volumetric data sets, within a user friendly graphical user interface.

      Strengths:

      1) It is great to reuse an existing front-end framework like Mastodon and plug in a deep learning back-end! Such software design avoids reinvention of the wheel and avoids that users need to learn too many tools.

      2) The idea to use sparse ellipsoids as annotations for cell detection is in my view fantastic as it allows very efficient annotation. This is much faster than having to paint dense 3D ground truth as is required for most deep learning algorithms.

      3) It is great that the learning is so fast that it is essentially interactive!

      Opportunities for improvements:

      The software in its current form had a view issues that made it a little hard to use. It would be great if those could be addressed in future versions.

      1) There are several options for how to set up the ELEPHANT server. In any case this requires quite some technical knowledge that may prevent adoption by a broader user base. It would thus be great if this could be further streamlined.

      We thank reviewer 3 for the very useful and detailed suggestions on improving the user interface of ELEPHANT. We have implemented most of these suggestions and we plan to pursue additional ones in future versions of the software. In brief:

      • To facilitate the setting up of the ELEPHANT server, we have implemented a control panel that allows users to monitor the process and provides links to the relevant section of the user manual and to Google Colab.
      • ELEPHANT is now available as an extension on Fiji, which will greatly facilitate its use by non-expert users.
      • Pre-trained detection and linking models, trained on diverse image datasets, are now available on the ELEPHANT github.
      • Image data can be uploaded and converted automatically via the Fiji/Mastodon interface when the image data files are missing on the server.

      2) For a GUI based software it is becoming state-of-the-art to provide recorded videos that demonstrate how to use the software. This is much more telling than written text. The authors added very nice short videos to the documentation, but I think it would be essential to also provide a longer video (ideally with voice over) where the authors demonstrate the whole workflow in one go.

      We are preparing a demo video on YouTube, which will be embedded in the user manual.

      3) As a user one interacts with the Mastodon software which sends requests to the ELEPHANT client. It would be great if the feedback for what is going on server side could be improved. For example adding progress bars and metrics for the process of the deep learning training that are visualized within Mastodon would be, in my view, very important for the usability.

      We added a log window in which users can monitor the processes that are running on the server.

    1. a lot of people start with learning and then they build things and then they close the circle but there's one key piece missing here and some people hate the word but you 00:29:54 learn to love it eventually it's called marketing and marketing means a lot of things to a lot of people but what it means to me is getting the word out because someone else will if you don't and 00:30:05 you are awesome you just have to realize that maybe not everyone knows right away so you should really talk about it more maybe at conferences see what i did there 00:30:17 um maybe on twitter maybe you can just tell your friends and maybe you can ask people to contribute and to support you like what's wrong with that somehow it's frowned upon in the community that if you do 00:30:30 marketing you're not doing it for real but i think that's not true um i think that if smart people and patient and um passionate people as well 00:30:44 if they did marketing then the world would be a better place because i'm pretty sure the evil guys do marketing so do your homework

      Marketing is very critical but it has negative connotations in the open source community because it is associated with mainstream business , after all, marketing is derived from the word "market".

      Perhaps it is better to think in psychological terms. If we have a great idea, the internet is a way to reach billions of eyeballs. Everyone is, in a sense, forced to compete in an attention economy. Instead of marketing, we can also use the words "attracting attention", because that is really what we are trying to do, be an attention attractor.

      The Indieverse, being developed by knowledge architect Gyuri Lajos, offers an alternative to marketing. Marketing is an attention attractor that relies on a "push" strategy. We are making content and pushing it out to different parts of the world we think may resonate with us to attract attention.

      Instead, the Indieverse, with its built in read and write provenance can act like a "pull" attention attractor. People can discover you through the built in discoverability aspects of the indieverse. Unlike the private sector, which uses this pull method to try to match you to stuff they want to sell you, Indieverse inegrates tools that exposes relevant content to you. If that content has demonstrably improved your life, which can be tracked through your public sharing, you can sponsor or reward that content. Microsponsorship can even be built in.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their helpful, detailed and insightful comments. We have modified the figures and rewritten large sections of the manuscript following the reviewers’ suggestions. In addition, we have incorporated new data throughout the manuscript and figures to clarify and better support our conclusions. All of these changes have significantly improved the coherence, consistency and clarity of our data, and have allowed us to better communicate the advance our findings represent for the fields of splicing and muscle development.

      Please find a point-by-point response to the reviewers’ comments below. The reviewers’ comments are in black and italics.

      Response to Reviewer 1* Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Rbfox proteins regulate skeletal muscle splicing and function and in this manuscript, Nikonova et.al. sought to investigate the mechanisms by which Rbfox1 promotes muscle function in Drosophila.

      Using a GFP-tagged Rbfox1 line, the authors showed that Rbfox1 is expressed in all muscles examined but differentially expressed in tubular and fibrillar (IFM)muscle types, and expression is developmentally regulated. Based on RNA-seq data from isolated muscle groups, the authors showed that Rbfox1 expression is much higher in TDT (jump muscle) than IFM.

      Using fly genetics authors developed tools to reduce expression of Rbfox1 at different levels and the highest levels of muscle-specific Rbfox1 knockdown was lethal and displayed eclosion defects (deGradFP > Rbfox1-IRKK110518 > Rbfox1-RNAi > Rbfox1-IR27286). Consistently, Rbfox1 knockdown flies have reduced jumping and climbing phenotypes, due to tubular muscle defect where Rbfox1 is expressed at higher levels. Rbfox1 knockdown in IFM caused flight defects which have been shown previously. Further characterization of IFM and tubular muscles demonstrated a requirement of Rbfox1 for the development of myofibrillar structures in both fibrillar (IFM) and tubular fiber-types in Drosophila. Interestingly, knockdown or overexpression of Rbfox1 displayed hypercontraction phenotypes in IFMs which is often an end result of misregulation of acto-myosin interactions which was rescued by expression of force-reduction myosin heavy chain (Mhc, P401S), in the context of Rbfox1 knockdown (the rescue experiment could not be performed with Rbfox1 overexpression due to complex genetics).

      Authors also performed computation analyses of the Rbfox binding motifs in the fly genome and identified GCAUG motif in 3,312, 683, and 1184 genes in the intronic, 5'UTR, and 3'UTR, respectively. These genes are enriched for factors that play important roles in muscle function including transcription factors (exd, Mef2, Salm), RNA-binding proteins (Bru1), and structural proteins (TnI, encoded by wupA). Many of these gene transcripts and proteins are affected in flies with reduction or overexpression of Rbfox1. Using fly genetics, authors propose and test different mechanisms (co-regulation of gene targets by Rbfox1 and Bru1), and regulators of muscle function (exd, Me2, Salm) and structural proteins (TnI, Mhc, Zasp52, Strn-Mlck, Sls) by which these changes could affect the muscle function.

      *Overall, the characterization of Rbfox1 phenotypes and myofibrillar structure is very well elucidated, mechanisms by which Rbfox1 affects muscle function are not clear and remain largely speculative. We thank the reviewer for the positive evaluation of our phenotypic analysis of Rbfox1 knockdown in multiple muscle fiber types. This manuscript is the first detailed characterization of Rbfox1 in Drosophila muscle, extending far beyond our previous finding that Rbfox1-IR flies are flightless. Beyond behavioral and cellular phenotypes, we report that there are regulatory interactions between Rbfox1, Bruno1 and Salm and identify other Rbfox1 targets in flies. We acknowledge that there are molecular and biochemical details of specific regulatory mechanisms that remain to be elucidated, but this paper provides many foundational observations to guide future biochemical experiments and is thus important to the muscle field.

      \*Major comments**

      *1. The varying level of Rbfox1 knockdown (deGradFP > Rbfox1-IRKK110518 > Rbfox1-RNAi > Rbfox1-IR27286) was achieved by different strategies without validation at the protein level (likely due to lack of a Rbfox1 antibody). It is important to show different Rbfox1 protein level (at least with different RNAi), especially when authors propose that autoregulation of Rbfox1 causes increased level Rbfox1 transcript in case of Rbfox1-RNAi (mild knockdown). Autoregulation of Rbfox1 in mammalian cells may not be similar in flies.

      To address this comment, we have toned-down the discussion of level-dependent regulation throughout the manuscript, and have removed claims of Rbfox1 autoregulation. We appreciate the reviewer’s point that it would be ideal to be able to determine the protein levels of Rbfox1 in the different knockdown conditions. We have tested the published antibody against DmRbfox1, but it is very dirty and we see multiple bands in Western Blot. This background partially obscures the bands from 80-90 kDa at the molecular weight where we expect Rbfox1, and prevents accurate quantification (see Reviewer Figure 1). Verification of protein levels of Rbfox1 will require generation of a new antibody which is beyond the scope of this study. As we do not have a good antibody, we performed two experiments to demonstrate our ability to tune knockdown efficiency. First, we crossed Rbfox1-IRKK110518 and Rbfox1-IR27286 to UAS-Dcr2, Mef2-Gal4 and demonstrated we could enhance the phenotype (Figure 2A, B). Second, we performed knockdown with the same hairpins at different temperatures and demonstrate that stronger knockdown at higher temperature leads to stronger phenotypes with the same hairpin

      (Figure 2B). This data supports our knockdown series interpretation.

      Reviewer Figure 1. Western Blot of whole fly with anti-Rbfox1 (A2BP1) (Shukla et al., 2017). Tubulin was blotted as a loading control.

      • TnI and Act88F protein levels are inversely correlated with Rbfox1 level in IFM but did not correlate with the RNA level. Using RIP authors showed that Rbfox1 was shown to bound to wupA transcripts (has Rbfox binding sites) but not Act88F transcripts (does not have Rbfox binding sites). Authors performed Rbfox1 IP and identified co-IP of components of cellular translational machinery and propose that wupA (TnI) levels are regulated by translation or NMD (non-sense mediated decay). A follow up experiment was not performed to identify the mechanism by which TnI level is regulated by Rbfox1. *

      Further biochemical and genetic verification of the underlying mechanisms of Rbfox1 regulation in Drosophila muscle will be addressed in a future manuscript, as in vivo modulation of translation or NMD in an Rbfox1 knockdown background involves recombination to coordinate multiple genetic elements. We have modified the text to reflect this hypothesis remains to be explored in future experiments (Line 473-474).

      We have further added RT-PCR data for wupA transcript levels in IFM and TDT with Rbfox1-IRKK110518 knockdown (Figure S4 A), but as in Rbfox1-RNAi flies, there is not a significant change in expression. We do see significant downregulation of Act88F when we overexpress Rbfox1 in IFM (Figure S4 B), as well as in TDT when we knockdown Rbfox1 with either Rbfox1-IRKK110518 or Rbfox1-IR27286.

      It was known that TnI mutations (affects splice site, fliH or Mef2 binding site, Hdp-3) led to a reduction in TnI level and hypercontraction. Authors showed rescue of hypercontraction phenotype in hdp-3 background by knocking down Rbfox1, likely due to increase in wupA transcription (Mef2-dependent or independent manner). However, no rescue was observed in the fliH background. Reduced level of Rbfox1 in fliH background would be expected to cause worsening of phenotype as splicing of remaining wupA transcripts would be affected with reduced Rbfox1 level. The splicing of wupA of exon 4 is not affected in Rbfox1 knockdown (fig. 6U), it's not clear if the splicing of exon 6b1 is affected in Rbfox1 knockdown.

      We thank the reviewer for pointing out our lack of clarity regarding exon 6b1 and IFM-specific isoform 6b1. To address this comment and validate our previous data, we performed additional Sanger sequencing on RT-PCR products, added a diagram of the wupA gene region in Figure 4 A and improved the clarity of our discussion of the fliH and hdp3 alleles and our results in the text.

      To directly respond to the reviewer, first, it is unclear if the reduced level of Rbfox1 in a fliH background should actually cause a more severe phenotype. Our data suggests that Rbfox1 represses TnI expression through binding the 3’-UTR, and can likely indirectly regulate wupA expression level via Mef2. Thus, arguably, the reduced level of Rbfox1 in the fliH background might not affect splicing, as the mutations in the regulatory element should rather make wupA insensitive to increased Mef2 expression in the Rbfox-RNAi background.

      Second, we confirmed via Sanger sequencing of RT-PCR products that both IFM and TDT in control and Rbfox1-IR flies use exon 6b1 (current exon 7). The IFM isoform contains exon 3, 6b1 and 9, while the TDT isoform contains exon 3 and 6b1, but skips exon 9 (see Figure 4 A). In other tubular muscles, wupA isoforms skip exons 3 and 9, and use exon 6b2 instead of 6b1. Thus, to directly answer the reviewer’s question, no, splicing of exon 6b1 itself is not affected by Rbfox1. However, Rbfox1 does influence expression of the ”6b1 isoform”, or the wupA isoforms in IFM and TDT containing exon 6b1 and exon 3. Additionally, our data shows that Bru1, not Rbfox1, regulates alternative splicing of wupA exon 9 (Fig. S6 T).

      What the reviewer has correctly identified with this comment is that the effect on splicing in the hdp-3 allele also appears to be complex and to have not been fully clarified. Although hdp-3 results from mutation of a splice site in exon 6b1 (which based on (Barbas et al., 1993) results in aberrant use of 6b2 in IFM), it also results in a near complete absence of the longer isoform containing exon 3 in adult flies. hdp-3 is reported in the same paper to affect both IFM and TDT, which both express isoforms containing exon 3 and 6b1. It is not known how mis-splicing of exon 6b1 leads to loss of isoforms containing exon 3, but our data indicate that Rbfox1 is somehow involved. It is purely speculation and beyond the scope of this manuscript, but perhaps selection of alternative exons in wupA are not independent events (ie that the splicing of exon 3 depends on correct splicing of exon 6b1). This could be mediated with interactions with chromatin, the PolII complex or through a larger splicing factor complex (something like LASR, for example (Damianov et al., 2016)), that restricts choice in alternative events through higher-order interactions. Another possible mechanism is that a second mutation exists in the hdp-3 allele that affects splicing of exon 3, although this was not indicated in the extensive sequencing data in (Barbas et al., 1993).

      Bruno1 was identified as a co-regulator of Rbfox1 in different IFM and tubular muscle types. However, except Mhc, other Rbfox1 targets seem to be regulated by either Rbfox1 or Bruno1, not both. Analyses of RNA-seq datasets from single and double knockouts should identify additional targets to support the claim that - Rbfox1 and Bruno1 co-regulate alternative splice events in IFMs. Phenotypic changes with reduced Rbfox1 and Bruno1 double knockdowns are very severe, but the mechanistic basis of such genetic interaction resulting in synergistic phenotypes in IFMs is lacking as splicing changes in single vs double knockout is similar.

      We agree with the reviewer that RNA-seq data would be useful to obtain a genome-wide perspective on the regulatory interactions between Rbfox1 and Bru1, and we plan to generate this data as part of a future manuscript. However, the tissue-specific dissections to isolate enough material from all of the necessary genotypes will take months to complete, and are not realistic to wait to include in this manuscript. Instead, to address the reviewer’s question, we have expanded our RT-PCR experiments to cover a wider panel of events in 12 sarcomere genes (see new data in Figures 6 and S6 and summary in Figure 8). We now can show that splice events in Fhos and Zasp67 are Rbfox1 dependent, while events in sls, Strn-Mlck and wupA are Bru1 dependent. An event in Zasp66 responds to both Rbfox1 and Bru1, but in opposite directions. Events in Mhc, Tm1 and Zasp52 are regulated by both Rbfox1 and Bru1 (or are sensitive to changes in Bru1 expression in the Rbfox1 background), and change in the same direction. This data provides a clearer mechanistic basis for the synergistic phenotype observed between Rbfox1 and Bru1 in IFM.

      Rbfox1 is expressed at a high level in tubular muscle whereas Bruno1 is expressed at a high level in IFM. Rbfox1 binds to Bruno1 transcript and inversely regulates Bru1-RB level but knockdown of Bru1 does not affect Rbfox1 level (Fig. S5 G,I,J). Overexpression of Bruno1 decreased the Rbfox1 level, however, it's difficult to interpret these results as overexpression of Bruno1 may have other effects on IFM gene expression.

      The reviewer correctly pointed out that we did not observe significant changes in Rbfox1 mRNA levels in the mutant bru1M3 background, however, in the original version of this manuscript, we also showed a significant decrease in Rbfox1 expression in IFM from the bru1-IR background at both 72 h APF and 1 d adult in mRNA-Seq data. To clarify differences in Rbfox1 levels between bru1-IR and our bru1 mutant backgrounds, we have performed additional RT-PCR experiments. We examined Rbfox1 levels after knockdown of bru1 (bru1-IR), and we now show that Rbfox1 levels are significantly decreased in IFM and TDT after bru1-IR (Fig. 5S, Fig S5 I). We see a weaker effect in the bru1M2 hypomorphic mutant, which likely reflects differences in Bru1 expression levels in bru1-IR and the bru1M2 allele. These results are consistent with the mRNA-seq data we presented previously (now in Fig. 5R). These additional data suggest that loss as well as gain of Bru1 affects Rbfox1 expression levels.

      A dose-dependent effect of Rbfox1 knockdown was shown to regulate the expression of transcription factors that are important for muscle type specification and function including exd, Mef2, and Salm. However, it is not clear how Rbfox1 mechanistically regulates the expression of these transcription factors.

      We present two pieces of data suggesting possible regulatory mechanisms for Mef2. First, RIP data suggest Rbfox1 can directly bind the 3’-UTR region of Mef2, and this region contains two binding motifs identified in both the oRNAment database and in our PWMScan dataset. Second, we show that use of the 5’-UTR regions of Mef2 is altered in Rbfox1-IR muscle. Although not definitive, this suggests that regulation of alternative 5’-UTR use may influence transcript stability or translation efficiency. We feel the many experiments to elucidate the detailed mechanism of regulation (and indeed to determine the likely contribution of multiple, layered regulatory processes) are beyond the scope of this paper, and are better left for future studies. This manuscript is the first in-depth characterization of Rbfox1 function in Drosophila muscle, and we provide multiple lines of evidence suggesting that different regulatory mechanisms exist as a basis for future experiments to explore these interesting and important regulatory interactions.*

      **Minor comments**

      1. It is not described if the rescue of Rbfox1 knockout by expression of force-reduction myosin heavy chain (Mhc, P401S) led to rescue of phenotypes (jumping, climbing, flight). *

      Force-reduction myosin heavy chain MhcP401S is a mutation at the endogenous Mhc locus that results in a headless myosin and was previously characterized to be flightless (Nongthomba et al., 2003). It is however able to rescue jumping and walking defects observed with the hdp2 TnI allele, and supports largely normal myofibril assembly (Nongthomba et al., 2003). It is also important to note that fibrillar muscle function is very finely tuned, such that alterations that result in flightlessness in many cases do not alter myofibril structure as detected by confocal microscopy (Schnorrer et al., 2010). We therefore looked at myofiber and sarcomere structure as a more sensitive read-out of the rescue ability in the Rbfox1 knockdown, to be able to detect a partial-rescue of myofibrillar structure that may not be evident in a behavioral assay.

      Immunofluorescence (IF) and Western blotting are different techniques, and Bruno1 antibody was validated for specificity in IF but not in Western blots. Figure 5L and S5 E should include muscle samples from Bru1M2.

      We have added a Western Blot panel in Figure S5 D including bru1-IR, bru1M2 and samples of different wild-type tissues including abdomen, ovaries, testis and IFM.

      To quantify alternative splicing or percent spliced in (PSI), primers are typically designed in the exons flanking the alternative exons. A better primer design along with PSI calculation by RT-PCR will robustly validate alternative splicing changes in different genetic background (Fig 6U and S6 U).

      We do not yet have RNA-Seq data from these Rbfox1 knockdown samples to facilitate calculation of transcriptome-wide PSI values; thus, we rely on the results from our RT-PCR experiments. Our primers used to detect alternative splice events are indeed located within flanking exons or as close to the alternative exons as possible based on sequence design limitations (see schemes in Figure 6 and Figure S6). Many of the events we are detecting are complex, and not a simple “included” or “excluded” determination, and are therefore not amenable to RT-qPCR. To increase the robustness of our validation, we now provide RT-PCR gel-based quantification of exon use for the events we tested in Zasp52, Zasp66, Zasp67, wupA and Mhc (Figure 6 U-W and Figure S6 T-U).*

      Reviewer #1 (Significance (Required)):

      Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      Understanding how muscle fiber type splicing and gene expression is regulated will conceptually move the field forward. How transcriptional and posttranscriptional programs coordinate to specify muscle fiber type gene expression is still lacking.

      Place the work in the context of the existing literature (provide references, where appropriate). Multiple RNA binding proteins and splicing factors have been shown to affect muscle function along with hundreds of gene expression and splicing changes in a complex fashion. Linking phenotypes with gene expression changes is still challenging as RNA binding proteins or RBPs are multifunctional and affect the function of other regulators that are important for muscle biology. *We thank the reviewer for recognizing the conceptual advance our findings represent, as well as the complexity in the regulatory network we are seeking to understand. A detailed understanding of the coordination of transcriptional and posttranscriptional programs is enabled by our work and will be the subject of future investigation.

      * State what audience might be interested in and influenced by the reported findings.

      Fly genetics, alternative splicing regulation, muscle specification and function.

      Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Regulation and function of alternative splicing in muscle. I do not have a thorough knowledge of Drosophila genetics.


      Response to Reviewer 2 Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      **Summary**

      This paper reports analysis of the function of RbFox1, an RNA-binding protein, best known for roles in the regulation of alternative splicing. It uses Drosophila as its in vivo model system, one that is highly suited to the analysis in vivo of complex biological events. In general, the authors present a very thorough approach with an impressive range of molecular analysis, genetic experiments and phenotypic assays. *We thank the reviewer for recognizing the suitability of our model organism as well as the time investment and diversity of experiments that were performed in this work. We have added and revised multiple experiments during this revision, which has greatly improved the manuscript.

      * The authors report that Rbfox1 is expressed in all Drosophila muscle types, and regulated in both a temporal and muscle type specific manner. Using inhibitory RNA to knock down gene function, they show that Rbfox1 is required in muscle for both viability and pupal eclosion, and contributes to both muscle development and function. A Bioinformatic approach then identifies muscle genes with Rbfox1-binding motifs. They show Rbfox1 regulates expression of both muscle structural proteins and the splicing factor Bruno1, interestingly preferentially targeting the Bruno1-RB isoform. They report functional interaction between Rbfox1 and Bruno1 and that this is expression level-dependent. Lastly, they report that Rbfox1 regulates transcription factors that control muscle gene expression.

      They conclude that the effect on muscle function of RbFox1 knock down is through mis-regulation of fibre type specific gene and splice isoform expression. Moreover, "Rbfox1 functions in a fibre-type and level-dependent manner to modulate both fibrillar and tubular muscle development". They propose that it does this by "binding to 5'-UTR and 3'-UTR regions to regulate transcript levels and binding to intronic regions to promote or inhibit alternative splice events." They also suggest that Rbfox1 acts "also through hierarchical regulation of the fibre diversity pathway." They provide further evidence to the field that Rbfox1's role in muscle development is conserved.

      **MAJOR COMMENTS**

      Are key conclusions convincing?

      In terms of presentation, I suggest ensuring a clear demarcation throughout of the evidence behind the main conclusions. This can get somewhat lost as a great deal of information is presented, including all the parallels with prior findings in other systems. I am not saying this is a major problem, just highlighting the importance of clarity. Conclusions to clearly evidence include: Rbfox1 functions in a fibre-type manner to modulate both fibrillar and tubular muscle development (e.g. L664); Rbfox1 functions in a level-dependent manner (e.g. L664); Rbfox1 functions by binding to 5'-UTR and 3'-UTR regions to regulate transcript levels (e.g. L670); Rbfox1 functions by binding to intronic regions to promote or inhibit alternative splice events" (e.g. L670); "Bru1 can regulate Rbfox1 levels in Drosophila muscle, and likely in a level-dependent manner" (L488) - Clearly evidence the level effect; "first evidence for negative regulation for fine tuning acquisition of muscle-type specific properties. Depending on its expression level, Rbfox1 can either promote or inhibit expression of" muscle regulators (L797). Lastly, the controlled stoichiometry of muscle structural proteins is known to be important, but all mechanisms are not known, so again make the supporting evidence as clear as possible for the interesting point of a role for Rbfox1 in this (e.g. L787). *Using the above comments from the reviewer as a guide, we have rewritten the manuscript, including large portions of the discussion, introduction and results. We thank the reviewer for pointing out where we could more effectively communicate our results, support our conclusions and highlight the significance of our findings.

      * Should some claims be qualified as preliminary or removed?

      P301 "complicated genetic recombination" - seems a bit weak to include. Either do it or don't include? *

      We have removed this statement from the text.*

      *

      Also, see section below on "adequate replication of experiments"

      Are additional exps essential? (if so realistic in terms of time and cost) None essential in my view. It depends on the authors' goals, but for the most impact of the project then following up these suggestions are possible. L369-372: mutate putative Rbfox1 binding site and ask does binding still occur or not. If it doesn't, then ask if this mutation affects the expression of the putative target gene. L775-777 "Our data thus support findings that Rbfox1 modulates transcription, but introduce a novel method of regulation, via regulating transcription factor transcript stability." It would be good to demonstrate this.

      We thank the reviewer for these suggestions, and agree they are indeed interesting experiments, but beyond the scope of this manuscript. We plan to pursue the detailed molecular and biochemical mechanisms of regulation in a future project including exploring Rbfox1 binding through use of reporters, identification of direct targets via CLIP and investigation of post-transcriptional regulation of translation or NMD.*

      Presented in such a way as to be reproduced

      Yes

      Are exps adequately replicated?

      A main area I would address is the authors frequent use of "may", "tend", "trend". This is confusing the picture they present. What is statistically significant and what is not? Only the former can be used as evidence. Examples include: L170: "may display preferential exon use" - does it or doesn't it? L272: "myofibrils tended to be thicker" - were they or weren't they? L350 "wupA mRNA levels tend towards upregulation in Rbfox1-RNAi". L353 "but tended towards upregulation (Fig. S4A)" L466 "Correspondingly, we see a trend towards increased protein-level expression of Bru1-PA" L474 "both Bru1-PA and Bru1-PB tend to increase" L485 "Overexpression of Bru1 in TDT with Act79B-Gal4 also tends to reduce Rbfox1" L595 "Rbfox1-IR27286 tended towards increased exd levels in IFM (Fig. 7A)" L614 "and a trend towards increased use of Mef2-Ex20 " Also, L487 "suggesting that Bru1 can also negatively regulate Rbfox1" - one cannot use a non-significant observation to suggest something. *

      We have modified the text to limit use of “may”, “tend” and “trend”, and have removed discussion of non-significant results. We thank the reviewer for the very helpful and detailed list of sentences to modify.

      \*MINOR COMMENTS**

      *

      Although individual samples are not significant, in aggregate there is a trend….

      * Specific exp issues that are easily addressable

      L162: "dip in Rbfox1 expression levels around 50h APF". The Fig indicates as early as 30h. Is this significantly less than the 24h data point? Comparisons in Figure 1G that are significant based on DESeq2 differential expression analysis with an adjusted p-value L427 "this staining was lost after Rbfox1 knockdown". This conflicts with Fig 5K which says no significant difference. Again in L429 "Rbfox1 knockdown leads to a reduction of Bru1 protein levels in IFMs and TDT." Fig says no significant difference in TDT. *

      We thank the reviewer for pointing out this inconsistency. We have revised the text accordingly. Our Western Blot (Figure 5L, M) and RT-PCR (Figure 5N, O) do show changes of Bru1 protein and mRNA expression levels after knockdown of Rbfox1KK110518. *

      Are prior studies referenced appropriately?

      This m/s is an authoritative presentation of the field as a whole with a comprehensive, impressive reference list. However, a point related to this area is one of the main things I would consider tackling. This is to have more clarity in the demarcation of what this study has found that adds to prior knowledge. It is worthwhile in itself to demonstrate the many similarities with previous work in other systems, as part of establishing the Drosophila system with all its analytical advantages for in vivo molecular genetics as an excellent model for future study in this area of research. However, the impact/strength of this m/s would be enhanced by clarity in presenting what is new to the field in all organisms. *We thank the reviewer for this suggestion. We have rewritten large portions of the manuscript, including the introduction and discussion, to improve the clarity of our findings and their importance to the field.

      * Are the text and Figs clear and accurate?

      TEXT

      L156: more precise language than "in a pattern consistent with the myoblasts" - maybe a simple co-expression with a myoblast marker? *

      We have revised this phrasing in the text. Rbfox1 expression in myoblasts was previously reported by (Usha and Shashidhara, 2010). *

      L181: at first use define difference between RNAi and IR*

      We use IR as an abbreviation for RNAi. In particular, we are trying to distinguish the two hairpins obtained from stock centers (27286 and KK110518) from the third, homemade RNAi hairpin, originally named UAS-dA2BP1RNAi, that was generated by Usha and Shashidhara (Usha and Shashidhara, 2010). We have better defined this in the text and methods. *

      L205: maybe clearly explain the link between eclosion and tubular muscle?? *

      We have added a sentence explaining the link between eclosion and tubular muscle (see Line 331).*

      L231: "Sarcomeres were not significantly shorter at 90h APF with the stronger Mef2-Gal4" - not clear why this is the case when the less strong knockdown conditions have shorter sarcomeres. *

      We have modified the text as well as the figure labeling to clarify that the other samples were tested in 1 d adult, while the KK110518 hairpin was tested at 90 h APF. This likely indicates that the short sarcomeres observed in 1 d adults reflect hypercontraction, which in IFM is classically first apparent after eclosion when the flies actively try to use the flight muscles. The difference in timing is due to pupal lethality of the KK110518 hairpin line, so we could not evaluate adult flies.*

      L234: "classic hypercontraction mutants in IFMs display a similar phenotype" - presumably not similar to the not significantly shorter sarcomeres of the previous sentence. *

      We have modified the text to clarify this statement. The change in sarcomere length from 90 h APF to 1 d adult is actually the relevant observation, as this reflects the progressive shortening of sarcomeres observed in classic hypercontraction mutants.*

      L244: "90h", should be "90h APF"? *

      Yes, we have modified the text.*

      L273: "Myofibrils in Act88F-Gal4 mediated knockdown only showed mild defects (Fig. 3 G, H, Fig. S2 C, D) despite adult flies being flight impaired". This seems worthy of discussion - the functional defect is not due to overt structure change? *

      In our own experience as well as observations included in a genome-wide RNAi screen in muscle (Schnorrer et al., 2010), there are a rather large number of knockdown conditions where few if any structural defects are observed at the level of light microscopy, but flies are completely flightless. We interpret this to reflect the narrow tuning of IFM function, where slight alterations in calcium regulation or sarcomere gene isoform expression result in dysfunction and a lack of flight. Ultrastructural evaluation might reveal defects in these cases, but the defect could also be with the dynamics of tropomyosin complex function, calcium regulation, mitochondrial function or even neuro-muscular junction structure. We have added a sentence to the text to discuss and clarify the Act88F result.*

      L281 "also known as Zebra bodies" - helpful to indicate these on the Fig, they are not. *

      We have added arrows to the figure to mark the Zebra bodies, and updated the figure legend.*

      L282: "we were unable to attempt a rescue of these defects" - I may have missed something, but what about rescue undertaken of the defects on previous pages? *

      This is the first point in the text where we introduced overexpression of Rbfox1, as preceding experiments where knockdown or using a GFP-tagged protein trap line at the endogenous locus. We have revised the sentence to focus on the overexpression phenotype with UH3-Gal4.*

      L283: "Over-expression of Rbfox1 from 40h APF" - this is the first over-expression experiment, so introduce why done now (and perhaps not earlier), and also explain the use of a different Gal4 driver.*

      We have reworded this section of the text. The UH3-Gal4 driver is restricted to expressing in IFM from 40h APF, so is first expressed after myofibrils have been generated and selectively in IFM. This avoids lethality observed from pan-muscle expression with Mef2-Gal4 (presumably due to severe defects in tubular muscles), and also allows us to image IFM tissue from adult flies. Later experiments with Mef2-Gal4 were performed with a later temperature shift to avoid this early lethality.*

      L290 "Interestingly, both Rbfox1 knockdown and Rbfox1 over-expression produce similar hypercontraction defects" - this could be interesting, worthy of discussion/explanation. *

      The most logical explanation is that Rbfox1 regulates the balance in fiber-type specific isoform expression. Loss of Rbfox1 would cause a shift in the relative ratio of the isoforms of structural genes, and overexpression of Rbfox1 would likely cause a similar shift in the opposite direction. This is supported by our RT-PCR panel, where we see co-regulation of different events with Bru1, and we see fiber-type specific difference in regulation of alternative splicing (Figure 8). Overexpression of Rbfox1 would be expected to make IFM look more like TDT, which would result in an isoform imbalance and lead to the observed hypercontraction phenotype. Interestingly, loss and overexpression of Bru1 also result in the same hypercontraction phenotype, similar to what we observe with Rbfox1. We have added a paragraph in the discussion about level-dependent regulation, to address this reviewer comment.*

      P305: Bioinformatic analysis. It is not clear what is taken as a potentially interesting result. On average a specific 5 base motif is found every 1000bps - so what is being looked for? How many sites in what length or position? A range of examples are described in the next pages of the m/s. For example: L337 "Bruno1.... contains 42 intronic and 2 5'-UTR Rbfox1 binding motifs" and L591 "exd contains three Rbfox1 binding sites," *

      We have redone the bioinformatic analysis completely, relying on data from oRNAment and the in-vitro determined PWM. We have also rewritten all portions of the text related to this analysis and no longer focus on the number of observed motifs in a given gene. As we unfortunately do not have RNA CLIP data, we do not know genome-wide which motifs are bound in muscle. Clustering of motifs may reflect binding, but a single, strong motif can also be bound, as we demonstrate via RIP of the wupA transcript. Thus, we identified interesting targets to test based on 1) a previously described role in the literature in myofibril assembly or contractility and 2) the presence of any Rbfox1 motif in that gene. A more elegant selection method of direct and indirect target exons will be designed for a future manuscript after integrating CLIP and mRNA-Seq data that have not yet been collected.

      L315: "many of these genes have binding or catalytic activity". "catalytic activity" seems very vague.

      For the original supplemental figure panel, we relied on Panther high-level ontology terms, which can unfortunately be rather vague, ie “catalytic activity” or “binding activity”. We have redone this analysis and rely rather on GO terms in the biological process and molecular function categories (Figure S3 B).

      L317 "When we look in previously annotated gene lists" - be more specific. What are they?

      This section of the text has been rewritten, and the “previously annotated gene lists” are described in greater detail in the Methods. *

      L327 "may also affect the neuro-muscular junction" - maybe better left for the Discussion? *

      We have removed this sentence from the Results.*

      L333 "extradenticle (exd) and Myocyte enhancer factor 2 (Mef2) contain 3 and 7 Rbfox1 motifs," Discuss the number and position of multiple motifs found in known targets? *

      We have removed the discussion of the number of binding sites for different target genes, instead incorporating this information graphically in Figure S3 C. It is not clear that the number of binding sites per gene has any influence on whether it is regulated in Rbfox1 knockdown. Thus, we have de-emphasized discussion of the number of binding sites throughout the text.*

      L350 "wupA mRNA levels " - clearer to stick to using TroponinI or WupA? *

      We have updated instances throughout the text to consistently refer to the protein as Troponin-I (TnI) and the gene or mRNA as wupA. *

      L376 "To check whether Rbfox1 regulates some target mRNAs such as wupA....." The suggestion here is more of a further indication than a "check". *

      We have reworded this section of the results to make the link between post-transcriptional regulation and our mass spectrometry results more salient.*

      L544 "In IFMs, knockdown of Rbfox1 and loss of Bru1 results in...." clarify if this is the two genes separately or the two genes together? *

      We have rewritten this entire section and present an expanded list of tested alternative events. We have taken care in this revision to clearly denote if the genotype is Rbfox1-IR or bru1M2 or a double knockdown background.*

      L580 "Our bioinformatic analysis identified Rbfox1 binding motifs in more than 40% of transcription factors genes" - is this all TFs or just "muscle" TF genes? *

      We have redone this analysis and changed this sentence in the text.*

      L598, what would be the mechanism of some decrease in Rbfox1 increasing mRNA levels and more of a decrease resulting in a decrease of the mRNA? The authors say "the nature of this regulation requires further investigation". *

      We have added more data to this section of the manuscript and repeated several of these experiments. After adding more biological replicates and additional data points, we have more consistent results that also demonstrate the variability in bru1 expression levels after Rbfox1 knockdown. Overall levels of bru1 assayed with a primer set in exons 14 and 17 now consistently show an increase in bru1 expression after Rbfox1 knockdown between all three hairpins (Rbfox1-RNAi, Rbfox1-IRKK110518 and Dcr2, Rbfox1-IR27286) (Figure 5 N).

      The relationship between expression level of Rbfox1 and expression level of bru1 and Bru1 protein isoforms is more complex. We now report a novel splice event in the annotated isoform bru1-RB that skips exon 7, resulting in a frame shift and generation of a protein that lacks all RRM domains, which we call bru1-RBshort (Figure S5). This short isoform is preferentially used in TDT, while the long isoform encoding the full-length protein is preferentially used in IFM (Figure 5 P). Presumably, this provides a mechanism, in addition to the use of different promoters, for muscle cells to regulate expression levels of different Bru1 isoforms. Knockdown of Rbfox1 in IFM results in a significant increase in the use of the long mRNA isoform, but paradoxically a decrease in the corresponding protein isoform (Figure 5, S5). We interpret this to mean that Rbfox1 regulates alternative splicing of Bru1, and likely independently a translational/post-translational mechanism regulates the expression level of Bru1-RB. This in theory could be mediated by interaction with translational machinery, post-translational modification, increased P-granule association, etc., and given the depth and breadth of experiments (as well as the multitude of isoform-specific expression reagents) required to isolate the responsible pathway, we deem it beyond the scope of this manuscript to biochemically demonstrate this specific regulatory mechanism. *

      L609 "The short 5'-UTR encoded by Mef2-Ex17". Ensure all abbreviations are defined. What does "Ex" mean here? Not straightforward to relate to the diagram in the Supplemental material that indicates the Mef2 gene has many fewer than 17 exons. In Fig7 legend too. *

      We have changed “Ex” to “exon” in the text. We apologize for the confusion. We have also added a diagram to Figure 7 E of the 5’-UTR region of Mef2, and a complete diagram of the locus in Figure S3 C. Based on the current annotation, Mef2 exons are numbered 1 to 21, corresponding to at least 16 distinct regions of the genome (18 if you include the variable 3’-UTR lengths). Exons sometimes will have more than one number in the annotation if a particular splice event causes a shift in the ORF, or if alternative splice sites or poly-adenylation sites are used. Mef2 is also on the minus strand, so as exons are numbered based on the genome scaffold, the exon numbering goes in reverse (ie exon 1 is the 3’-UTR).

      We strongly believe in following the numbering provided in the annotation, to increase reproducibility and transparency in working with complex gene loci for many different genes. Another researcher can go to Flybase, look-up the exon number from a given gene from a specific annotation, and get the exact location and sequence of the exons we name. It is incredibly challenging and time intensive to go through older papers and figure out which exon or splice event corresponds to those in the current annotation, and we aim to alleviate this difficulty (we illustrate this in Figure 4 A for the wupA locus, where we verified exon numbers in annotation FB2021_05 by BLASTing each individual sequence and primer provided in (Barbas et al., 1993).*

      L617 "Levels of Mef2 are known to affect muscle morphogenesis but not production of different isoforms" - clarify what is meant here by "different isoforms". *

      We have revised this section of the text. This statement was meant to reflect that Mef2 affects muscle morphogenesis through regulation of transcription levels, but not at the level of alternative splicing.*

      L638 "Salm levels were significantly increased in IFM from Rbfox1-RNAi animals, but significantly decreased in IFMs from flies with Dcr2 enhanced Rbfox1-IR27286 or Rbfox1-IRKK110518". This is worth discussion or further analysis. Normally would expect an allelic series, with an effect becoming more apparent with increased loss-of-function. *

      Dcr2, Rbfox1-IR27286 and Rbfox1-IRKK110518 produce a stronger knockdown than Rbfox1-RNAi, and indeed produce significantly decreased levels of salm, thus following the allelic series. We repeated this experiment, but obtained the same results. *

      L641 "This suggests that Rbfox1 can regulated Salm". How, if there are no Rbfox1 binding sites? Deserves further analysis? *

      Our new bioinformatic analysis suggests a possible answer, in that it identified possible Rbfox1 motifs in a salm exon and a site in an intron. Previously, we had focused on introns and UTR regions. In addition, using the PWM we now recover Rbfox1 binding sites of the canonical TGCATGA as well as AGCATGA sites. The intron site in salm is an AGCATGA site. Further experiments will be required to determine if Rbfox1 directly binds to salm mRNA, if it interacts with the transcriptional machinery to regulate salm expression, or if this regulation occurs through yet a different mechanism, and are beyond the scope of this manuscript.*

      L674: "We found the valence of several regulatory interactions..." I'm not sure the meaning of "valence" here and elsewhere will be readily understood. *

      Thank you for pointing this out. We have used a different phrasing throughout the text.*

      FIGURES

      Fig 1 it is difficult to see the green in A-F. Can this be improved? It is clearer in I-L. *

      We have replaced the images with better examples and increased the levels to make the green channel better visible. *

      Fig 2 legend (others too), say what the clusters of small black ellipses in P and Q are. *

      Thank you for pointing out this oversight. All boxplots are plotted with Tukey whiskers, such that they are drawn to the 25th and 75th percentile plus 1.5 the interquartile range. Dots represent outlying datapoints outside of this range. We have added statements in the relevant figure legends, as well as a more detailed explanation in the Methods. *

      Fig 3 it is not easy to see a shorter sarcomere in D, as the arrow partially obscures what is being indicated. Also, the data in G indicates that sarcomeres are not shorter in Mef2 GAL4 > KK110518, although the legend says this is shown in D. *We have rephrased the statement in the legend. The arrows are pointing to frayed or torn myofibrils.

      Fig 5 legend "-J). Bru1 signal is reduced with Rbfox1-IRKK110518 (C, F, I)". Clarify that this is only in IFM. It is not significant in TDT or Abd-M.

      Done.*

      Fig 7 legend "quantification of the fold change in exd transcript levels" - only KK110518 in IFM is significant. *

      This panel was moved to Figure S7. The relevant regions of the text and figure legend were modified to reflect that only Rbfox1-IRKK110518 results in a significant change in exd levels. C - "indicates Rbfox1 binds to Mef2 mRNA" - it is not easy to see the band.

      We replaced the image and adjusted the levels to make the band more visible. D - what do the different lanes on the gel below the histogram in D correspond to? We adjusted the labeling on the figure panel. The gel is a representative image of RT-PCR results that are quantified above in the histogram.

      *Suggestions that would help the presentation of their data and conclusion **

      There is a lot of good, thorough work here, but overall there is the impression that some of the presentation/writing could be improved (also see the above lists on clarity and accuracy). I admire the authors for their comprehensive presentation of what has already been found out in this field. As the authors summarise, a lot is already known in many other species, so (as also indicated above) it is crucial to emphasise what new is found in this work that advances overall knowledge in this field. This can be obscured in many places where they say because of what was found in vertebrate systems we looked in Drosophila. These include: L417: "This led us to investigate if Rbfox1 might regulate Bru1 in Drosophila." L452: "and we were curious if these interactions are evolutionarily conserved in flies." L528 "Thus, we next checked if Rbfox1 and Bru1 co-regulate alternative splicing in Drosophila muscle." L677 "Moreover, as in vertebrates, Rbfox1 and Bru1 exhibit cross-regulatory interactions" L683 "Rbfox1 function in muscle development is evolutionarily conserved" L697 "Here we extend those findings and show that as in vertebrates......" L702 "our observations are consistent with observations in vertebrates" L707 "Studies from both vertebrates and C. elegans suggest that Rbfox1 modulates developmental isoform switches." L746 "We see evidence for similar regulatory interactions between Rbfox1 and the CELF1/2 homolog Bru1 in our data from Drosophila." *We thank the reviewer for this honest and helpful assessment of the manuscript. Upon rereading the original text and with the guidance of the list of sentences above, we agreed with the reviewer and we have rewritten large segments of the manuscript. In particular in the introduction and discussion, we now better emphasize what is new in our findings and how they advance overall knowledge in this field.

      L185 paragraph. The knockdown series is important for the study. A lot is presented in this paragraph, especially for a non-specialist and it could be easier to follow. Perhaps present the four genetic conditions in the order of the severity of their phenotype on viability. Also, clearly state what each Gal4 driver is used for. What is the nature of the RNAi/IR lines such that Dcr2 could enhance their action? Also comment on off targets - are any predicted?

      We have rewritten this paragraph as the reviewer requested. The hairpins are ordered by decreasing phenotypic severity, and we have more clearly described each Gal4 driver as well as Dicer2. This information is also available in the Methods, along with the off targets for the hairpins. KK110518 has one predicted off-target ichor, but this gene is not expressed in IFM, TDT or leg based on mRNA-Seq data. 27286 has no predicted off-targets. *

      L227: "In severe examples". Be as clear as possible. Are the "severe examples" using the stronger RNAi line or are they the most severe examples with a single line? I'd suggest including the result in the main Fig rather than in the Supplemental. However, as I read more of the m/s I realise there is a great deal of important information in the Supplemental Figs, and so the case is not much stronger for this example than many others. The balance of what is included where could be looked at, because it is not straightforward for the reader to read the paper and quickly flick between the main and supplemental Figs. Later in the m/s is a substantial section that starts L450 (finishes L489) and which only refers to Supplemental Figs. L503 is another area where it is necessary, and difficult, for the reader to move between main Figs and supplemental Figs. *We have reorganized the figure panels in several figures, notably Figures 4, 5, 6, 7 and 8 and the corresponding supplementary figures, including moving panels from the supplemental figures to the main figures and generating more comprehensive quantification panels. In the specific case referenced here for Fig. S1 P and Q, we chose to keep the most representative images of the phenotype in the main figure (Fig. 2 I, N), and have reworded the text to reflect that the most severe phenotypic instances are in the supplement. As we do not have CLIP data, we chose to keep the bioinformatics analysis in the supplement and have shortened the paragraph in the results devoted to Figure S3. We hope our reorganization and rewriting have better streamlined the text and figures.

      L258: - perhaps a Table summarising this and other phenotype trends with the different RNA conditions might be helpful. It gets quite difficult to follow.

      We have revised the text and several figure panels to make the phenotypic trends with the different RNAi conditions easier to follow.*

      Reviewer #2 (Significance (Required)):

      The advance reported is mechanistic.

      The authors already do a very good job of placing their work in the context of prior research (see comment is Section A).

      Muscle biologists interested in its development and function will be interested in this work. More broadly, those intrigued by alternative splicing will be interested. Despite its very widespread occurrence, much about alternative splicing is still poorly understood in terms of regulation and significance. This is especially the case in vivo, and this paper uses an excellent in vivo model system (Drosophila) for the genetic and mechanistic analysis of complex biological problems. My field of expertise: cell differentiation, gene expression, muscle development, Drosophila.

      Response to Reviewer 3 Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      **SUMMARY**

      This manuscript characterizes the role of splicing factor Rbfox1 in Drosophila muscle and explores its ability to modulate expression of genes important for fibrillar and tubular muscle development. The authors hypothesize that Rbfox1 binds directly to 5'-UTR and 3'-UTR regions to regulate transcript levels, and to intronic regions to promote or inhibit alternative splicing events. Because some of the regulated genes encode transcriptional activators and other splicing factors such as Bru1, the effects of Rbfox1 may encompass a complex regulatory network that fine-tunes transcript levels and alternative splicing patterns that shape developing muscle. Most likely the authors' hypothesis is correct that Rbfox1 is critical for muscle development in Drosophila, but overall the interesting ideas presented here are too often based only on correlations without further experimental validation. *

      We respectfully disagree with the reviewer that our hypothesis that Rbfox1 is critical for muscle development in Drosophila is based only on correlation without further experimental validation. In this manuscript we extensively characterize the knockdown phenotype of 3 RNAi hairpins against Rbfox1 as well as a GFP-tagged Rbfox1 protein in both fibrillar flight muscle and tubular abdominal and jump muscle. All hairpins produce similar phenotypes with defects in myofiber and myofibril structure and result in behavioral defects in climbing, flight and jumping, confirming this phenotype is due to loss of Rbfox1 and not a random off-target gene. We also convincingly demonstrate that Rbfox1 regulates Bru1, another splicing factor known to be critical for fibrillar specific splice events in IFM. Moreover, Rbfox1 and Bru1 genetically interact selectively in IFM and our RT-PCR data for 12 select structural genes reveals fiber-type specific alternative splicing defects regulated by Rbfox1 selectively, by Bru1 selectively, or by both Rbfox1 and Bru1. Thus, we conclude that Rbfox1 is indeed critical for muscle development, and this is the first report to demonstrate this requirement in Drosophila.*

      **MAJOR COMMENTS**

      The hypothesis that Rbfox1 plays an important role in regulating muscle development is based on previous studies in other species and supported by much new data in this manuscript. Initial bioinformatic analysis showed that many Drosophila genes, including 20% of all RNA-binding proteins, 40% of transcription factors, etc. have the motifs in introns or UTR regions. However, I think a deeper analysis is required. Any hexamer might be present about once every 4kb, and we do not expect all UGCAUG motifs are necessarily functional, so one might ask whether the association of Rbfox motifs with muscle development genes is statistically significant? Are the motifs conserved in other Drosophila species, which might support a functional role in muscle? Are the intronic motifs located as expected for regulatory effects, that is, proximal to alternative exons that exhibit changes in splicing when Rbfox1 expression is decreased or increased? *

      We appreciate the point of the reviewer that it would be ideal to distinguish genome-wide motifs that are actually bound directly by Rbfox1 from those that are unused, but our behavioral and phenotypic characterization of the knockdown phenotype in this manuscript is also valid without this data. The most effective approach to identify direct targets is to perform cross-linking immunoprecipitation, or CLIP, but we unfortunately do not have CLIP data from Drosophila muscle and it is beyond the scope of the current study to generate this data. It is not trivial to obtain the amount of material necessary to identify tissue-specific binding sites, as we would also likely expect differences in targeting specificity between tubular and fibrillar muscle. Genome-wide analysis of the evolutionary conservation of binding site motifs is also not trivial and is beyond the scope of this paper.

      Despite these limitations and to address the reviewer’s comment, we have done the following:

      1. We have completely redone our bioinformatic analysis using transcriptome data from the oRNAment database (Benoit Bouvrette et al., 2020), as well as searching genome-wide for instances of the in vitro determined PWM using PWMScan, to capture possible sites in introns (Figure S3). The oRNAment database was shown to reasonably predict peaks identified in eCLIP from human cell lines, which we assume would translate to a similar predictive capacity in the Drosophila
      2. We have calculated the expected distribution of Rbfox1 sites in a random gene list for Figure S3, and indeed the number of Rbfox1 sites in sarcomere genes is significantly enriched.
      3. We have looked more carefully at the distribution of Rbfox1 and Bru1 motifs in the transcriptome (in the oRNAment data), and find not only that these motifs frequently occur in the same muscle phenotype genes, but also that they are closer together than is expected by chance (Fig. S4 J).
      4. We marked the location of Rbfox1 and Bru1 motifs in the vicinity of select alternative splice events we tested via RT-PCR on the provided summary diagrams (Fig. 6, Fig. S6).
      5. We have tested additional alternative splice events in total from 12 structural genes, and of the 9 events misregulated after Rbfox1 or Bru1 knockdown, all but 1 are flanked by Rbfox1 or Bru1 binding motifs. This indicates that the motifs are indeed located as expected for a regulatory effect. Is it possible to knock out an Rbfox motif and show that splicing of the alternative exon is altered, or regulation of transcript levels is abrogated?


      The construction and mutation of reporter constructs is possible, but would take longer than the recommended revision time-frame, in particular to generate reporters that can be evaluated in vivo. We intend to address the biochemical mechanism(s) of Rbfox1 regulation with future experiments in a separate manuscript.

      Also, what was the background set of genes used for the GO enrichment analysis? Genes expressed in muscle or all genes?

      The background set of genes for GO enrichment (now Figure S3 B) was all annotated genes for the “all genes” label and all muscle phenotype genes for the “Muscle phenotype” label.

      The data on cross regulation between Rbfox1 and Bru1 are confusing and inconsistent, since mild knockdown and stronger knockdown of Rbfox1 seem to have different effects on Bru1 expression. New data suggest that Rbfox1 can positively regulate Bru1 protein levels (Fig.5), but this seems inconsistent with the lab's earlier studies indicating opposite temporal mRNA expression profiles for Rbfox1 and Bru1 across IFM development. 


      We apologize for the confusion, but the relationship between Rbfox1 and bru1 levels across IFM development has not been published previously. We previously generated that mRNA-Seq data, but presented here (now in Figure 5Q) is a new analysis of that data, specifically focused on Rbfox1 and bru1 expression. We have corrected the phrasing in the text.

      To address this comment, along with points raised above by Reviewer 2, we have revised this part of the manuscript, added more data to this section of the manuscript and repeated several of these experiments. After adding more biological replicates and additional data points, we have more consistent results that also demonstrate the variability in bru1 expression levels after Rbfox1 knockdown. Overall levels of bru1 assayed with a primer set in exons 14 and 17 now consistently show an increase in bru1 expression after Rbfox1 knockdown between all three hairpins (Rbfox1-RNAi, Rbfox1-IRKK110518 and Dcr2, Rbfox1-IR27286) (Figure 5 N). This is consistent with our observations of inversely correlated mRNA levels during IFM development, as when Rbfox1 levels decrease, bru1 transcripts increase.

      We agree with the reviewer that the relationship between the expression level of Rbfox1 and expression level of bru1 mRNA and Bru1 protein isoforms is more complex. We now report a novel splice event in the annotated isoform bru1-RB that skips exon 7, resulting in a frame shift and generation of a protein that lacks all RRM domains, which we call bru1-RBshort (Figure S5). Unknowingly, we had previously used a primer set from exon 7 to exon 8 as “common”, which lead to some confusion. This short isoform is preferentially used in TDT, while the long isoform encoding the full-length protein is preferentially used in IFM (Figure 5 P). Presumably, this provides a mechanism, in addition to the use of different promoters, for muscle cells to regulate expression levels of different Bru1 isoforms. Knockdown of Rbfox1 in IFM results in a significant increase in the use of the long mRNA isoform, but paradoxically a decrease in the corresponding protein isoform (Figure 5, S5). We interpret this to mean that Rbfox1 regulates alternative splicing of Bru1, and likely independently a translational/post-translational mechanism regulates the expression level of Bru1-RB. This in theory could be mediated by interaction with translational machinery, post-translational modification, increased P-granule association, etc., and given the depth and breadth of experiments (as well as the multitude of isoform-specific expression reagents) required to isolate the responsible pathway, we deem it beyond the scope of this manuscript to biochemically demonstrate this specific regulatory mechanism. *

      *

      Both Rbfox1 and Bru1 gene have many Rbfox motifs, but they are both large genes (>100kb) and would be expected to have many copies of all hexamers. How do we know whether any of them are functional?

      We do not know if all of the Rbfox1 binding sites in the Bru1 and Rbfox1 loci are bound, but the CLIP data required to assess this is beyond the scope of this manuscript, as discussed above. We do show, however, that changes in the expression level of Rbfox1 affect the expression of Bru1 on both the mRNA transcript and protein level, and changes in the expression level of Bru1 also can affect the expression level of Rbfox1. The direct or indirect nature of this regulation remains to be fully elucidated, although we do provide RIP data showing we can detect bru1 transcript bound to Rbfox1-GFP (Figure S4 I). We have modified the text to address this comment.

      Figure S4, section I, J: if changes in Bru1-RB isoform expression are correlated with Rbfox1 knockdown, it seems reasonable to test whether the Bru1-RB promoter can drive expression of GFP in an Rbfox1-dependent manner. But if I understand correctly, the assay as described on p. 19 uses the promoter region upstream of Bru1-RA. What is the logic for this experiment? It is not surprising that no effect was observed. The end result is that we have no idea whether Rbfox1 directly regulates bru1-RB. Even if it does, bru-Rb appears to be a minor component of Bru expression in IFM.

      Upon reevaluating this experiment and with respect to the reviewer’s comment, we have removed it from the manuscript to avoid confusion. Our new data indicate a switch in use of the bru1-RBlong and bru1-RBshort isoforms (Figure 5 N-P), suggesting that Rbfox1 regulation is on the level of splicing.

      Further experiments will be necessary to refine the indirect versus direct regulatory effects of Rbfox1 on Bru1, but our data do demonstrate that Bru1 levels are regulated in Rbfox1 knockdown conditions. We also provide a RIP experiment (Figure S4 I) showing that Rbfox1-GFP does directly bind bru1 mRNA, but we did not determine if this was isoform-specific. Multiple additional experiments would be necessary to distinguish between regulation of alternative splicing, direct binding to regulate transcript translation or stability, or transcriptional regulation via regulation of Salm, or some combination of these possible mechanisms. The data presented here are important to the field as they are the first report of isoform-specific regulation of Bru1 in muscle, even if we do not conclusively show if this regulation by Rbfox1 is direct or indirect.

      In the section "Rbfox1 and Bruno1 co-regulate alternative splice events in IFMs", the data show that splicing of several genes is altered by knockdown or over-expression of Rbfox1 and Bru1. The interesting conclusion is for a complex regulatory dynamic where Rbfox1 and Bru1 co-regulate some alternative splice events and independently regulate other events in a muscle-type specific manner. However, if we are to conclude that these activities are due to direct binding of Rbfox1 and Bru1 to the adjacent introns, we need information about the location of flanking Rbfox and/or Bru1 motifs. Do upstream or downstream binding sites correlate with enhancer or silencer activity, as reported in previous studies of these splicing factors in other species? For wupA, Figure S3 shows an intronic Rbfox site, but exon 4 is not labeled so the reader cannot correlate this information with the diagram in Figure 6U.

      As mentioned above, we have marked the location of Rbfox1as well as Bru1 binding motifs in the diagrams in Figure 6 and Figure S6. We have tested additional alternative splice events, and can now show events regulated only in the Rbfox1 knockdown, only after bru1 knockdown, or in double knockdown flies (Figure 8). 8 out of 9 events where we see clear changes in splicing are flanked by potential Rbfox1 or Bru1 motifs. Demonstration of direct binding and assay of genome-wide binding sites through CLIP studies is beyond the scope of this manuscript and will be pursued in the future.

      The evidence that Rbfox1 directly affects expression of transcription factor Exd seems to be based only a correlation between Rbfox1 knockdown and decreased expression of Exd. The observation that binding of Rbfox1 to the Exd 3'UTR in RIP experiments further weakens the case.

      We agree with the reviewer and have moved the data related to exd to the supplement (Figure 7 and S7). We still mention exd in the text as it is significantly decreased after knockdown with Rbfox1-IRKK110518, but we have removed it from larger claims of transcriptional regulation as well as from the summary in Figure 8. Also, just to note that although we failed to detect Rbfox1-GFP bound to exd, this experiment was performed with adult flies. Since Exd is functionally important early in pupal development during fate specification of the IFMs, it is possible we might detect binding to exd mRNA at a different developmental timepoint.

      Similarly, there is a correlation of Rbfox1 knockdown with expression of alternative 5'UTRs in the Mef2 gene. However, the changes in UTR expression appear mostly not statistically significant. Do the authors have a model to explain what mechanism might allow Rbfox to regulate expression of alternative 5'UTRs, which would seem to be a transcriptional process?

      Mef2 transcript levels are significantly increased after knockdown with Rbfox1-RNAi and decreased after overexpression of Rbfox1, and we can detect direct binding of Rbfox1-GFP to Mef2 RNA via RIP. This establishes Mef2 as a likely direct target of Rbfox1 regulation, likely through the two Rbfox1 motifs in the 3’-UTR (Figure S3 C). In addition to this regulation, we made an observation that has not been previously reported in the literature, that IFM expresses a particular isoform of Mef2 that uses a short promoter encoded by Exon 17. We see both tissue-specific use of Exon 17 (Figure 7 F) as well as developmental regulation of Exon 17 use in IFM (Figure S7 C). Surprisingly, we saw that use of exon 17 in the Mef2 promoter is altered in Rbfox1 knockdown muscle. We now provide a quantification of this data, to show the change is statistically significant. We also provide a scheme of the Mef2 locus and RT-PCR primers with exons 17, 20 and 21 labelled (Figure 7 E). We have also rewritten this section of the text to increase the impact and clarity of our finding.

      For Salm, there apparently are no Rbfox motifs in the gene, and there are statistically significant but apparently inconsistent changes in Salm expression when it is knocked down in IFM by Rbfox1-RNAi (Salm increases) vs knockdown by Rbfox1-IR27286 or Rbfox1-IRKK110518 (Salm decreases). These are potentially interesting observations but more data would be needed to make stronger conclusions. How would regulation occur in the absence of Rbfox motifs?


      The best explanation we can provide for why salm expression is increased with the weak hypomorph Rbfox1-RNAi condition, but decreased with the stronger hypomorph Rbfox1-IRKK110518 or Dcr2, Rbfox1-IR27286 conditions is that salm regulation is sensitive to Rbfox1 expression or activity level. We now discuss this in a new section of the discussion. We further attempted several experiments to address this question, including obtaining an endogenously tagged Salm-GFP line, as well as a UAS-Salm line (kindly provided by F. Schnorrer). Disappointingly, there is no GFP expressed in the Salm-GFP line, either live, by immunostaining or in Western Blot of multiple developmental stages, indicating that the line has fallen apart and we have not yet redone the CRISPR targeting to generate a new line. The UAS-Salm construct works (too well), in that overexpression with Mef2-Gal4 results in early lethality and we have not yet managed to optimize the experiment and obtain enough pupal muscle where we can evaluate the effect on Bru1 or Rbfox1 levels.

      Our new bioinformatic analysis further revealed possible Rbfox1 motifs in a salm exon and a site in an intron. Previously, we had focused on introns and UTR regions. Now, using the in vitro determined PWM, we can recover Rbfox1 binding sites of the canonical TGCATGA as well as AGCATGA sites. The intron site in salm is an AGCATGA site. Further experiments will be required to determine if Rbfox1 directly binds to salm pre-mRNA, if it interacts with the transcriptional machinery to regulate salm expression, or if this regulation occurs through yet a different mechanism. We feel the many required experiments are beyond the scope of the current manuscript. Our data provides an experimental basis for future studies on this topic.

      \*MINOR COMMENTS**

      1. In several figures there is a misalignment of the transcriptional driver information with the phenotype data in the bar graphs above. Please correct the alignments to make interpretation easier. *

      We have revised the layout of labels for many plots throughout the manuscript to avoid a category label associated with a genotype label at a 45-degree angle, and to make interpretation easier.

      On p. 14 Brudno et al. is cited as ref for Fox motifs near muscle exons, but this paper only focused on brain-specific exons.

      In addition to brain-specific exons, Brudno et al. also analyzed a set of muscle-specific exons, and thus this is the appropriate reference. For instance, from the Brudno paper, “As an additional control in some experiments we analyzed a smaller sample of muscle-specific alternative exons that were collected exactly as described above for the brain-specific exons” and “UGCAUG was also found at a high frequency downstream of a smaller group of muscle-specific exons.” Further details of the muscle-specific exon analysis can be found in (Brudno et al., 2001).

      For Mef2, why do exons described as 5'UTR have numbers 17, 20, and 21? One would normally expect these to be exon 1, 2 or 1A, 1B, etc.

      We rely on the Flybase annotation and numbering system to refer to exons. Per Flybase, all exons are labeled in the 5’ to 3’ direction of the sequenced genome, even for genes, such as Mef2 or wupA, that are encoded on the reverse strand. We strongly believe in following the numbering provided in the annotation, to increase reproducibility and transparency in working with complex gene loci for many different genes. Another researcher can go to Flybase, look-up the exon number from a given gene from a specific annotation, and get the exact location and sequence of the exons we name. It is incredibly challenging and time intensive to go through older papers and figure out which exon or splice event corresponds to those in the current annotation. We illustrate this in Figure 4 A for the wupA locus, where we verified exon numbers in annotation FB2021_05 by BLASTing each individual sequence and primer provided in (Barbas et al., 1993). The Mhc locus is even more complex, in particular regarding alternative 3’-UTR regions and historic versus current exon designations (Nikonova et al., 2020). For clarity and reproducibility, we therefore rely on the current Flybase designations.

      Fig 8: "regulation of regulators" seems to imply the Rbfox1 is impacting transcription?? Is there precedence for this type of regulation by Rbfox1? Yes, indeed, there is precedence for Rbfox1 impacting transcription, as we presented in the Discussion. Rbfox2 is reported to interact with the Polycomb repressive complex 2 to regulate gene transcription in mouse (Wei et al., 2016) and in flies Rbfox1 interacts with transcription factors including Cubitus interruptus and Suppressor of Hairless to regulate transcription downstream of Hedgehog and Notch signaling (Shukla et al., 2017; Usha and Shashidhara, 2010). In addition, Rbfox1 regulates splicing of Mef2A and Rbfox1 and Rbfox1 cooperatively regulate splicing of Mef2D during C2C12 cell differentiation (Gao et al., 2016). Our results provide a further piece of evidence implicating Rbfox1 either directly or indirectly in transcriptional regulation as well as regulation of alternative splicing.

      * Reviewer #3 (Significance (Required)):

      **SIGNIFICANCE**

      These studies of a major tissue-specific RNA binding protein, Rbfox1, are definitely important for our understanding of functional differences between muscle subtypes, and between muscle and nonmuscle tissues. The broad outlines of Rbfox1 alternative splicing regulation are known, but there is very little specific detail about the important targets in muscle subtypes that might help explain functional differences between subtypes. If more experimental validation can be obtained for regulation of transcript levels by binding 3'UTRs, this would also represent new information. *

      We thank the reviewer for recognizing the significance of our work and our detailed analysis of Rbfox1 phenotypes in different muscle fiber-types. Experimental validation of 3’-UTR binding will be a significant time investment in terms of building and testing in-vivo reporter constructs, assaying NMD and translation effects and performing the CLIP studies necessary for identification of directly-bound 3’-UTR regions, extending beyond the scope of this manuscript and the time allotted for revision. The data we present here represent an important advance in our understanding how Rbfox1 contributes to muscle-type specific differentiation, and form the basis for future experiments to explore the molecular and biochemical mechanisms underlying this regulation. *

      I am reviewing based on my experience studying alternative splicing in vertebrate systems, with an emphasis on Rbfox genes. Therefore I am unable to evaluate the functional data on different subtypes of muscle in Drosophila.

      *

      Reviewer Response References

      Barbas, J. A., Galceran, J., Torroja, L., Prado, A. and Ferrús, A. (1993). Abnormal muscle development in the heldup3 mutant of Drosophila melanogaster is caused by a splicing defect affecting selected troponin I isoforms. Mol Cell Biol 13, 1433–1439.

      Benoit Bouvrette, L. P., Bovaird, S., Blanchette, M. and Lécuyer, E. (2020). oRNAment: a database of putative RNA binding protein target sites in the transcriptomes of model species. Nucleic Acids Research 48, D166–D173.

      Brudno, M., Gelfand, M. S., Spengler, S., Zorn, M., Dubchak, I. and Conboy, J. G. (2001). Computational analysis of candidate intron regulatory elements for tissue-specific alternative pre-mRNA splicing. Nucleic Acids Res 29, 2338–2348.

      Damianov, A., Ying, Y., Lin, C.-H., Lee, J.-A., Tran, D., Vashisht, A. A., Bahrami-Samani, E., Xing, Y., Martin, K. C., Wohlschlegel, J. A., et al. (2016). Rbfox Proteins Regulate Splicing as Part of a Large Multiprotein Complex LASR. Cell 165, 606–619.

      Gao, C., Ren, S., Lee, J.-H., Qiu, J., Chapski, D. J., Rau, C. D., Zhou, Y., Abdellatif, M., Nakano, A., Vondriska, T. M., et al. (2016). RBFox1-mediated RNA splicing regulates cardiac hypertrophy and heart failure. J Clin Invest 126, 195–206.

      Nikonova, E., Kao, S.-Y. and Spletter, M. L. (2020). Contributions of alternative splicing to muscle type development and function. Semin. Cell Dev. Biol.

      Nongthomba, U., Cummins, M., Clark, S., Vigoreaux, J. O. and Sparrow, J. C. (2003). Suppression of muscle hypercontraction by mutations in the myosin heavy chain gene of Drosophila melanogaster. Genetics 164, 209–222.

      Schnorrer, F., Schönbauer, C., Langer, C. C. H., Dietzl, G., Novatchkova, M., Schernhuber, K., Fellner, M., Azaryan, A., Radolf, M., Stark, A., et al. (2010). Systematic genetic analysis of muscle morphogenesis and function in Drosophila. Nature 464, 287–291.

      Shukla, J. P., Deshpande, G. and Shashidhara, L. S. (2017). Ataxin 2-binding protein 1 is a context-specific positive regulator of Notch signaling during neurogenesis in Drosophila melanogaster. Development 144, 905–915.

      Usha, N. and Shashidhara, L. S. (2010). Interaction between Ataxin-2 Binding Protein 1 and Cubitus-interruptus during wing development in Drosophila. Dev Biol 341, 389–399.

      Wei, C., Xiao, R., Chen, L., Cui, H., Zhou, Y., Xue, Y., Hu, J., Zhou, B., Tsutsui, T., Qiu, J., et al. (2016). RBFox2 Binds Nascent RNA to Globally Regulate Polycomb Complex 2 Targeting in Mammalian Genomes. Mol Cell 62, 875–889.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      SUMMARY

      This manuscript characterizes the role of splicing factor Rbfox1 in Drosophila muscle and explores its ability to modulate expression of genes important for fibrillar and tubular muscle development. The authors hypothesize that Rbfox1 binds directly to 5'-UTR and 3'-UTR regions to regulate transcript levels, and to intronic regions to promote or inhibit alternative splicing events. Because some of the regulated genes encode transcriptional activators and other splicing factors such as Bru1, the effects of Rbfox1 may encompass a complex regulatory network that fine-tunes transcript levels and alternative splicing patterns that shape developing muscle. Most likely the authors' hypothesis is correct that Rbfox1 is critical for muscle development in Drosophila, but overall the interesting ideas presented here are too often based only on correlations without further experimental validation.

      MAJOR COMMENTS

      The hypothesis that Rbfox1 plays an important role in regulating muscle development is based on previous studies in other species and supported by much new data in this manuscript. Initial bioinformatic analysis showed that many Drosophila genes, including 20% of all RNA-binding proteins, 40% of transcription factors, etc. have the motifs in introns or UTR regions. However, I think a deeper analysis is required. Any hexamer might be present about once every 4kb, and we do not expect all UGCAUG motifs are necessarily functional, so one might ask whether the association of Rbfox motifs with muscle development genes is statistically significant? Are the motifs conserved in other Drosophila species, which might support a functional role in muscle? Are the intronic motifs located as expected for regulatory effects, that is, proximal to alternative exons that exhibit changes in splicing when Rbfox1 expression is decreased or increased? Is it possible to knock out an Rbfox motif and show that splicing of the alternative exon is altered, or regulation of transcript levels is abrogated?
 Also, what was the background set of genes used for the GO enrichment analysis? Genes expressed in muscle or all genes?

      1. The data on cross regulation between Rbfox1 and Bru1 are confusing and inconsistent, since mild knockdown and stronger knockdown of Rbfox1 seem to have different effects on Bru1 expression. Both Rbfox1 and Bru1 gene have many Rbfox motifs, but they are both large genes (>100kb) and would be expected to have many copies of all hexamers. How do we know whether any of them are functional? New data suggest that Rbfox1 can positively regulate Bru1 protein levels (Fig.5), but this seems inconsistent with the lab's earlier studies indicating opposite temporal mRNA expression profiles for Rbfox1 and Bru1 across IFM development. 

      2. Figure S4, section I, J: if changes in Bru1-RB isoform expression are correlated with Rbfox1 knockdown, it seems reasonable to test whether the Bru1-RB promoter can drive expression of GFP in an Rbfox1-dependent manner. But if I understand correctly, the assay as described on p. 19 uses the promoter region upstream of Bru1-RA. What is the logic for this experiment? It is not surprising that no effect was observed. The end result is that we have no idea whether Rbfox1 directly regulates bru1-RB. Even if it does, bru-Rb appears to be a minor component of Bru expression in IFM.
      3. In the section "Rbfox1 and Bruno1 co-regulate alternative splice events in IFMs", the data show that splicing of several genes is altered by knockdown or over-expression of Rbfox1 and Bru1. The interesting conclusion is for a complex regulatory dynamic where Rbfox1 and Bru1 co-regulate some alternative splice events and independently regulate other events in a muscle-type specific manner. However, if we are to conclude that these activities are due to direct binding of Rbfox1 and Bru1 to the adjacent introns, we need information about the location of flanking Rbfox and/or Bru1 motifs. Do upstream or downstream binding sites correlate with enhancer or silencer activity, as reported in previous studies of these splicing factors in other species? For wupA, Figure S3 shows an intronic Rbfox site, but exon 4 is not labeled so the reader cannot correlate this information with the diagram in Figure 6U.
      4. The evidence that Rbfox1 directly affects expression of transcription factor Exd seems to be based only a correlation between Rbfox1 knockdown and decreased expression of Exd. The observation that binding of Rbfox1 to the Exd 3'UTR in RIP experiments further weakens the case.
      5. Similarly, there is a correlation of Rbfox1 knockdown with expression of alternative 5'UTRs in the Mef2 gene. However, the changes in UTR expression appear mostly not statistically significant. Do the authors have a model to explain what mechanism might allow Rbfox to regulate expression of alternative 5'UTRs, which would seem to be a transcriptional process?
      6. For Salm, there apparently are no Rbfox motifs in the gene, and there are statistically significant but apparently inconsistent changes in Salm expression when it is knocked down in IFM by Rbfox1-RNAi (Salm increases) vs knockdown by Rbfox1-IR27286 or Rbfox1-IRKK110518 (Salm decreases). These are potentially interesting observations but more data would be needed to make stronger conclusions. How would regulation occur in the absence of Rbfox motifs?


      MINOR COMMENTS

      1. In several figures there is a misalignment of the transcriptional driver information with the phenotype data in the bar graphs above. Please correct the alignments to make interpretation easier.
      2. On p. 14 Brudno et al. is cited as ref for Fox motifs near muscle exons, but this paper only focused on brain-specific exons.
      3. For Mef2, why do exons described as 5'UTR have numbers 17, 20, and 21? One would normally expect these to be exon 1, 2 or 1A, 1B, etc.
      4. Fig 8: "regulation of regulators" seems to imply the Rbfox1 is impacting transcription?? Is there precedence for this type of regulation by Rbfox1?

      5. The data on cross regulation between Rbfox1 and Bru1 are confusing and inconsistent, since mild knockdown and stronger knockdown of Rbfox1 seem to have different effects on Bru1 expression. Both Rbfox1 and Bru1 gene have many Rbfox motifs, but they are both large genes (>100kb) and would be expected to have many copies of all hexamers. How do we know whether any of them are functional? New data suggest that Rbfox1 can positively regulate Bru1 protein levels (Fig.5), but this seems inconsistent with the lab's earlier studies indicating opposite temporal mRNA expression profiles for Rbfox1 and Bru1 across IFM development. 


      6. Figure S4, section I, J: if changes in Bru1-RB isoform expression are correlated with Rbfox1 knockdown, it seems reasonable to test whether the Bru1-RB promoter can drive expression of GFP in an Rbfox1-dependent manner. But if I understand correctly, the assay as described on p. 19 uses the promoter region upstream of Bru1-RA. What is the logic for this experiment? It is not surprising that no effect was observed. The end result is that we have no idea whether Rbfox1 directly regulates bru1-RB. Even if it does, bru-Rb appears to be a minor component of Bru expression in IFM.

      7. In the section "Rbfox1 and Bruno1 co-regulate alternative splice events in IFMs", the data show that splicing of several genes is altered by knockdown or over-expression of Rbfox1 and Bru1. The interesting conclusion is for a complex regulatory dynamic where Rbfox1 and Bru1 co-regulate some alternative splice events and independently regulate other events in a muscle-type specific manner. However, if we are to conclude that these activities are due to direct binding of Rbfox1 and Bru1 to the adjacent introns, we need information about the location of flanking Rbfox and/or Bru1 motifs. Do upstream or downstream binding sites correlate with enhancer or silencer activity, as reported in previous studies of these splicing factors in other species? For wupA, Figure S3 shows an intronic Rbfox site, but exon 4 is not labeled so the reader cannot correlate this information with the diagram in Figure 6U.

      8. The evidence that Rbfox1 directly affects expression of transcription factor Exd seems to be based only a correlation between Rbfox1 knockdown and decreased expression of Exd. The observation that binding of Rbfox1 to the Exd 3'UTR in RIP experiments further weakens the case.

      9. Similarly, there is a correlation of Rbfox1 knockdown with expression of alternative 5'UTRs in the Mef2 gene. However, the changes in UTR expression appear mostly not statistically significant. Do the authors have a model to explain what mechanism might allow Rbfox to regulate expression of alternative 5'UTRs, which would seem to be a transcriptional process?

      10. For Salm, there apparently are no Rbfox motifs in the gene, and there are statistically significant but apparently inconsistent changes in Salm expression when it is knocked down in IFM by Rbfox1-RNAi (Salm increases) vs knockdown by Rbfox1-IR27286 or Rbfox1-IRKK110518 (Salm decreases). These are potentially interesting observations but more data would be needed to make stronger conclusions. How would regulation occur in the absence of Rbfox motifs?


      MINOR COMMENTS

      1. In several figures there is a misalignment of the transcriptional driver information with the phenotype data in the bar graphs above. Please correct the alignments to make interpretation easier.

      2. On p. 14 Brudno et al. is cited as ref for Fox motifs near muscle exons, but this paper only focused on brain-specific exons.

      3. For Mef2, why do exons described as 5'UTR have numbers 17, 20, and 21? One would normally expect these to be exon 1, 2 or 1A, 1B, etc.

      4. Fig 8: "regulation of regulators" seems to imply the Rbfox1 is impacting transcription?? Is there precedence for this type of regulation by Rbfox1?

      Significance

      SIGNIFICANCE

      These studies of a major tissue-specific RNA binding protein, Rbfox1, are definitely important for our understanding of functional differences between muscle subtypes, and between muscle and nonmuscle tissues. The broad outlines of Rbfox1 alternative splicing regulation are known, but there is very little specific detail about the important targets in muscle subtypes that might help explain functional differences between subtypes. If more experimental validation can be obtained for regulation of transcript levels by binding 3'UTRs, this would also represent new information.

      I am reviewing based on my experience studying alternative splicing in vertebrate systems, with an emphasis on Rbfox genes. Therefore I am unable to evaluate the functional data on different subtypes of muscle in Drosophila.

    1. I’m not thinking the way I used to think. I can feel it most strongly when I’m reading. Immersing myself in a book or a lengthy article used to be easy.

      I thought this sentence was very interesting because it is how I feel also. The Internet is here to make things easier for us, we don't have to remember anything or to thing as much as we used to. But loosing the habit to read and thing by ourselves may also by synonyme of not being as critical as we were.

    1. SciScore for 10.1101/2021.11.30.21266810: (What is this?)

      Please note, not all rigor criteria are appropriate for all manuscripts.

      Table 1: Rigor

      <table><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Ethics</td><td style="min-width:100px;border-bottom:1px solid lightgray">IRB: Trials had to be approved by the institutional review boards, and competent authorities of the countries involved, and all patients gave written informed consent.<br>Consent: Trials had to be approved by the institutional review boards, and competent authorities of the countries involved, and all patients gave written informed consent.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Sex as a biological variable</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Randomization</td><td style="min-width:100px;border-bottom:1px solid lightgray">Study patients and selection criteria: Although the exact inclusion and exclusion criteria could vary across the trials, all the subjects had to fulfill the following criteria; 1) Participant of a trial that joined the COMPILEhome consortium, 2) Confirmed COVID-19 diagnosis by a diagnostic PCR or antigen test, 3) Neither hospitalized nor at the emergency room department of a hospital before or at the time of randomization, 4) Symptomatic with illness onset ≤7 days at the time of screening for the study, and 5) Age 50 or older.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Blinding</td><td style="min-width:100px;border-bottom:1px solid lightgray">Overview of Study Design and research partners: Beginning in November 2020, we systematically searched for RCTs recruiting outpatients that compared treatment with CP with a blinded or unblinded control arm in the European (https://www.clinicaltrialsregister.eu/) and American (www.clinicaltrials.gov) trial register.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Power Analysis</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr></table>

      Table 2: Resources

      <table><tr><th style="min-width:100px;text-align:center; padding-top:4px;" colspan="2">Antibodies</th></tr><tr><td style="min-width:100px;text=align:center">Sentences</td><td style="min-width:100px;text-align:center">Resources</td></tr><tr><td style="min-width:100px;vertical-align:top;border-bottom:1px solid lightgray">Pre-planned subgroup analyses assessed the efficacy of the 2 primary outcomes in the following subgroups: 1) days since disease onset (1-5 or >5days), 2) level of neutralizing antibody anti-SARS-CoV-2 titers in transfused plasma and 3) Negative serum anti-SARS-CoV-2 IgG status (Trimeric Spike antibody test, Liaison, Diasorin, Saluggia, Italy).</td><td style="min-width:100px;border-bottom:1px solid lightgray"><div style="margin-bottom:8px"><div>anti-SARS-CoV-2</div><div>suggested: None</div></div><div style="margin-bottom:8px"><div>anti-SARS-CoV-2 IgG</div><div>suggested: None</div></div><div style="margin-bottom:8px"><div>Diasorin, Saluggia, Italy).</div><div>suggested: None</div></div></td></tr><tr><th style="min-width:100px;text-align:center; padding-top:4px;" colspan="2">Software and Algorithms</th></tr><tr><td style="min-width:100px;text=align:center">Sentences</td><td style="min-width:100px;text-align:center">Resources</td></tr><tr><td style="min-width:100px;vertical-align:top;border-bottom:1px solid lightgray">Overview of Study Design and research partners: Beginning in November 2020, we systematically searched for RCTs recruiting outpatients that compared treatment with CP with a blinded or unblinded control arm in the European (https://www.clinicaltrialsregister.eu/) and American (www.clinicaltrials.gov) trial register.</td><td style="min-width:100px;border-bottom:1px solid lightgray"><div style="margin-bottom:8px"><div>https://www.clinicaltrialsregister.eu/</div><div>suggested: (EU Clinical Trials Register, RRID:SCR_005956)</div></div></td></tr></table>

      Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).


      Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:
      Several limitations should be mentioned. Although we only included patients aged ≥50, and most of them also had comorbidities, the hospital admission rate was relatively low at 9.3%. Therefore, the study was not powered to exclude a small overall treatment effect. However, administering CP to infectious and symptomatic outpatients is complex and labor-intensive. Hence, we think that small CCP’s clinical role is significantly diminished if unable to establish something greater than “a small effect” because it ceases to be practical. As vaccination uptake progressed in patients aged 50 or older and monoclonal antibody-based therapy with proven effectiveness in high-risk outpatients became available, the recruitment dropped dramatically as of June 2021. This resulted in the recommendation by the individual and COMPILEhome DSMBs that further enrollment was unlikely to change the results, and both studies were discontinued. Regarding the advent of the SARS-CoV-2 variants that may be less susceptible to antibodies induced by the original SARS-CoV-2 virus or the alpha variant, it is reassuring that >95% of the patients in both countries were included at a time when the delta variant was still rare (<5%) (Appendix Figure 3 and 4). The last limitation of our study (and all studies on CP for COVID-19 so far) is the lack of a proper phase 2 dose-finding study. In a recent study, we administered 600 mL of CP to 25 SARS-CoV-2 antibody-negative B-cell depleted patients diagnosed with COVID...

      Results from TrialIdentifier: We found the following clinical trial numbers in your paper:<br><table><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Identifier</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Status</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Title</td></tr><tr><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">NCT04621123</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Completed</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Plasma for Early Treatment in Non-hospitalised Mild or Moder…</td></tr><tr><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">NCT04589949</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Recruiting</td><td style="min-width:95px; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Early Convalescent Plasma Therapy for High-risk Patients Wit…</td></tr></table>


      Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


      Results from JetFighter: We did not find any issues relating to colormaps.


      Results from rtransparent:
      • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
      • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
      • No protocol registration statement was detected.

      Results from scite Reference Check: We found no unreliable references.


      <footer>

      About SciScore

      SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.

      </footer>

    1. Ineliminable Inscrutability Scrutinized and Eliminated

      Brandom rejects two possible theories of normativity (rule-following).

      Regularism:

      If a given performance conforms to some pre-existing pattern of performances, then we call that performance correct or competent. If it doesn’t so conform, then we call it incorrect or incompetent

      Brandom's objection: Regularists can't distinguish between what happens and what ought to happen. We don't say "gravity ought to work", so a Regularist must somehow explain why the Law of Gravity is not normative, while the Law of US is a normative.

      Everything in nature ‘follows’ the ‘rules of nature,’ the regularities isolated by the natural sciences. So what does the normativity that distinguishes human rule-following consist in?

      Regulism: rules are certain declarative sentences like "No smoking.", and rule-following is behavior that is described by the rules.

      Brandom's objection: rules can't be made entirely explicit. There must always some unsaid rule to avoid an infinite regress, like "the rule about following rules" and "the rule about the rule about following rules" etc.

      Wittgenstein said this about how the infinite regress is cut off by unspoken rules that are followed in practice.

      If I have exhausted the justifications I have reached bedrock, and my spade is turned. Then I am inclined to say: “This is simply what I do.”

      Thus, there is a necessary implicitness, or "blindness", in rule-following behavior. At some level, we simply follow rules without understanding.

      But then, a challenge! How is "implicit norm" even possible?

      How can a performance be nothing but a ‘blind’ reaction to a situation, not an attempt to act on interpretation?

      Unconscious rule-following is automatic, therefore not normative, much like a sneeze, or falling in gravity is not normative.

      Or is it? Perhaps we are forced to conclude that fundamentally, norms are based on mechanical, thoughtless behaviors. In this way, we can naturalize norms in a norm-less theory (such as neuroscience).

      Brandom refused this, and insists that we must then admit "nonconscious norms". He even proposes a kind of non-natural metaphysics, where non-natural normativity is baked into the metaphysics.

      But Bakker has a better idea: explain norms in a norm-less scientific theory

      The history of the social sciences is a history of emancipation from the intellectual propensity to intentionalize social phenomenon—this was very much part of the process that Weber called the disenchantment of the world. Brandom proposes to re-enchant the world by re-instating the belief in normative powers, which is to say, powers in some sense outside of and distinct from the forces known to science.

      Bakker's Blind Brain Theory

      Now Bakker begins his own philosophy, using Blind Brain Theory.

      Note how important is implicit/blindness in Wittgenstein's and Brandom's explanations of how norms work. But they never paused to consider it deeper than a simple "Such implicitness means implicit normativity exists." They then went on to consider normativity without studying further just what are implicit, and how they are implicit.

      This is a grave error. To explain normativity, we must study what are implicit and how they are implicit in the brain when people think normative thoughts and do normative actions. We must study the neglect structure of the brain, and that brings us to Blind Brain Theory.

      According to BBT, all cognition is heuristic and depends critically on the environment to play nice (that is, remain stable). Heuristic algorithms can skip many steps and come out right, as long as the environment rarely challenges it with difficult examples that exposes the error of the heuristic.

      Normative cognition is also heuristic -- what features of the human environment does it depend on?

      Wittgenstein again

      If I have exhausted the justifications I have reached bedrock, and my spade is turned. Then I am inclined to say: “This is simply what I do.”

      The "bedrock" is the stable normative behaviors of other humans I live with. In other words, Regularism is actually the right approach to explaining normativity.

      Brandom was wrong to reject Regularism, but to see why he was wrong, we must do some psycho-philosophy. We must understand why the human animal is psychologically prone to reject Regularism (just like how it is psychologically prone to think souls exist). It is, again, because of BBT.

      We think "This rule is normative." when some normativity-detection cognitive module is triggered. If the module keeps quiet, and we have the distinct feeling of "Wait, that's not normative...", no matter how much information processing the other modules do. And it just so happens that thinking about causes and statistical correlations cannot trigger this module.

      There are roughly two types of explanations: causal/natural and normative/supernatural. Causal/natural explanations are those step-by-step explanations that intrinsically allows you to break it down further ("how does this step work?"), push it forwards and backwards in time ("and what happened before/after?"). Normative/supernatural explanations are those brute assertions about what to do and not to do ("This is simply what I do."), accompanied with an anosognosia, a blindness to the blindness, a feeling that the assertions are sufficient with no further explanations possible ("What do you mean I must explain why it is what I do? I have explained myself sufficiently. There is nothing left to explain!")

      Since Regularism involves solving normative cognition using the resources of natural cognition, it simply follows that it fails to engage resources specific to normative cognition.

      Bakker is in no danger of self-contradiction, because the problem "how does normative cognition work?" is perfectly possible to be the kind of problem that causal cognition can solve. Sure, causal cognition can't solve all problems, but it can solve some... like "how to build a plane?" and "how the brain works?" Science works, and that shows the power of causal cognition. In contrast, nothing sophisticated like science has been built upon normative cognition. This shows that causal cognition can solve normative cognition, while normative cognition can't.

      What doesn’t follow is that normative cognition thus lies outside the problem ecology of natural cognition, let alone inside the problem ecology of normative cognition.

      In short, Brandom failed because he tried to solve normativity with normative cognition. Bakker may succeed, because he is trying to solve normativity with causal cognition. The feeling that "normativity can't be solved causally" misguided Brandom, and it is just an illusion generated by the fractured nature of cognition, described above.

      normative cognition seems unlikely to theoretically solve normative cognition in any satisfying manner. The very theoretical problems that plague Normativism—supernaturalism, underdetermination, and practical inapplicability—are the very problems we should expect if normative cognition were not in fact among the problems that normative cognition can solve.

      Here is Bakker's explanation of normative cognition, and how it leads to Brandom's mistake:

      normative cognition belongs to social cognition more generally, and that... has evolved to solve astronomically complicated biomechanical problems involving the prediction, understanding, and manipulation of other organisms absent detailed biomechanical information. Adapted to solve in the absence of this information, it stands to reason that the provision of that information, facts regarding biomechanical regularities, will render it ineffective...

      ... intentional cognition has evolved to overcome neglect, to solve problems in the absence of causal information. This is why philosophical reflection convinces us we somehow stand outside the causal order via choice or reason or what have you. We quite simply confuse an incapacity, our inability to intuit our biomechanicity, with a special capacity, our ability to somehow transcend or outrun the natural order.

    1. Author Response:

      Reviewer #3 (Public Review):

      1) The two algorithms presented are essentially a low-pass and high-pass filter on binarized odor. As such, it may not be so surprising that there is a tradeoff between which algorithm works better depending on the frequency content of different environments. The low-pass filter (algorithm 1) works better in environments with mostly low-frequency fluctuations (boundary layer plume, low wind-speed, high diffusivity) while the high-pass filter (algorithm 2) works better in environments with mostly high-frequency fluctuations (high windspeed, low diffusivity). To understand what is essential in these algorithms I think it would be useful to (1) compare the two algorithms to a "null" algorithm that drives upwind orientation whenever odor is present (i.e. include thresholding and binarization but no filtering), (2) compare navigation success metrics directly to the frequency content of different environments, (3) examine how navigation success depends on the filtering cutoff of the two algorithms (tau_on and tau_w). Comparing to the null algorithm with no filtering I think is important to determine whether there is actually a tradeoff to be made, or whether a system that can approximate a flat transfer function (or at least capture all relevant frequencies in the environment) is ideal and must be approximated with biological parts.

      For (1) and (3), we have now added simulations of the models for a range of different timescales, including an integrator with an infinitely fast timescale corresponding to the “null” model the reviewer describes (Results lines 376-380, Figure 4—figure supplement 2 and Materials and methods lines 1008-1025). We find that changing the timescale of the intermittency filter largely leaves performance unchanged whereas changing the timescale of the frequency filter is akin to changing the gain on the frequency filter, as predicted by Equations 24 and 29. Since we do find a local maximum in the frequency filter timescale, we conclude that there are benefits to filtering in time. For (2), many plumes we simulate in Fig. 5 span a wide range of frequencies and intermittencies; we chose to plot performance as a function of diffusivity / windspeed to emphasize how performance depends on environment parameters that shape the statistics of the plume (flow and odor dynamics). Note that we renamed 𝜏! to 𝜏".

      2) While the two algorithms presented here present a nice conceptual division, biological filtering algorithms are likely to incorporate elements of both. For example, the adaptive compression algorithm of Alvarez-Salvado (which is eliminated in the simplification used here) provides some sensitivity to odor onsets and is based on well-described adaptation at the olfactory periphery. Synaptic depression algorithms likewise provide sensitivity to derivatives as well as integration over time, and synaptic depression with multiple timescales has been described in detail at various stages of the olfactory system. A productive extension of the work done here would be to explore the utility of biophysically-motivated filtering algorithms for navigation in different environments.

      Thank you for this suggestion, which led us to extend our work in that interesting direction. We have now generalized our model to respond to odor intensity (rather than its binarized version) by implementing an adaptive compression taken from prior modeling efforts (Alvarez-Salvado et al, eLife 2018) (added to Fig. 3; also see additional Fig. 3 Supplement 1). Moreover, we now also consider navigators that respond to odor signals using a biophysical model of odor transduction, ORN firing, and PN firing, in addition to synaptic depression within the ORN-PN synapse, which combines modeling efforts from prior works (Gorur-Shandilya, Demir, et al, eLife 2017; Nagel & Wilson, Nat. Neurosci. 2015; Fox & Nagel, “Synaptic control of temporal processing in the Drosophila olfactory system” arXiv 2021). This realistic circuit model produced exciting results that indicate that the natural ORN-PN circuitry can, to some degree, satisfy the dual demands of intermittency and frequency sensing. These results are shown in the new Fig. 6.

      3) It would be helpful in the Discussion to present a clearer picture of what the frequency content of natural environments is likely to be. For example, flies stop walking at windspeeds above ~70cm/s (Yorozu 2009). In contrast, flies in flight are likely to encounter much sparser and high frequency plume encounters, as they are moving through the air at much faster speeds and because odors encountered here would be away from the boundary layer. Therefore the best test of the tradeoff hypothesis would likely be to compare temporal filtering of odor plumes by neural circuitry in flying vs walking flies. This would connect to the literature in motion detection as well, where octopamine release during flight causes a speeding of the motion detection algorithm.

      We have added lines 47-48 to the introduction describing the natural frequency content of plumes and lines 574-578 discussing how one might see evidence of this tradeoff when comparing between walking and flying flies.

    1. Author Response:

      Reviewer #2:

      What the authors attempt to achieve, and their approaches:

      The author attempt to establish by which mechanisms cholesterol influences the function of the GPCR A_{2A}R, an adenosine receptor. The role of cholesterol on GPCRs has been reported in a number of studies, primarily in cellular experiments, and the authors set out here to clarify the molecular mechanisms.

      To this end, they build upon their recent achievements to produce this protein and reconstitute it in nanodiscs, i.e. discoidal objects comprised of the membrane protein (here: A_{2A}R), lipids (here: POPC, POPG and cholesterol) and a membrane-scaffold protein (MSP) which wraps around this disc of protein+lipid. Nanodiscs allow studying proteins in solution, and are thought to be much more native-like than e.g. detergent micelles.

      The authors first use GTP hydrolysis experiments to quantify the basal activity and agonist potency at cholesterol concentrations from 0 to 13%. The cholesterol effects are weak but detectable. Then they use a single 19F label that reports on the protein's conformation (active, inactive) to show that the protein populates slightly more active states with cholesterol. (again, weak effects). Then they investigate G-protein binding to A_{2A}R in the nanodisc, and find (very!) weak enhancement at 13% cholesterol. These data point to weak positive allosteric modulation by cholesterol. They then use molecular dynamics simulations to probe the allosteric communication, using a recently proposed framework (Rigidity-transmission allostery). Doing these simulations in the presence of cholesterol (postions of cholesterol from X-ray structure) and absence. This analysis shows again only very weak effects of cholesterol, and this time the effect is opposite, i.e. negative allosteric modulation by cholesterol. Then they use 19F-labeled cholesterol analogues to probe by NMR the state of cholesterol (bound to protein?). Lastly, they use Laurdan fluorescence experiments and pressure NMR to establish that (i) the lipids become more ordered when cholesterol is present, and (ii) if one achieves such ordering even without cholesterol - namely by pressure - one may achieve similar effects as those that cholesterol has.

      Collectively, these data lead them to conclude that cholesterol has a (weak) positive allosteric effect on this receptor, and this effect is not a direct one, but goes via modulation of the membrane properties.

      We thank the reviewer for his comments and critique. A lot of his comments have to do with the nanodisc as a model system. We have therefore included an additional paragraph as discussed above, highlighting the advantages and disadvantages of the nanodisc. We’ve also included references to papers that have characterized nanodiscs or membrane proteins in nanodiscs. In our hands, 31P NMR spectra of POPC/POPG nanodiscs and their melt behavior is very similar to liposomes. We’ve tried to add to the discussion on nanodiscs without distracting too much from the focus in the paper.

      Major strengths and weaknesses of methods and results:

      The study addresses an important question, which inherently is difficult to answer: the effect of cholesterol is poorly understood and such studies require to work in an actual membrane. The authors do a careful combination of different methods to achieve their goal of identifying the mechanisms.

      Despite combining several methods, several of them have their inherent problems:

      (i) the nanodisc is too small to properly mimic the membrane environment, and it does not allow reaching relevant cholesterol concentrations. Moreover, it is not clear (to me) if one can exclude e.g. interactions of the protein with the surrounding MSP, or of cholesterol with MSP (see (iii) below).

      We agree. In principle, we should worry about MSP. On the other hand, this is a constant in all of the samples and we focus instead on the cholesterol-dependent effects. These nanodiscs are unarguably small. We’ve commented on this in the paper now. However, we’d expect that the confinement would if anything emphasize the cholesterol bound state. Yet, the NMR studies of F-cholesterol interactions at best identified transient bound states.

      (ii) the state of the protein (inactive, active) is probed with a single NMR-active site. The effects are small and I am not convinced that one shall interpret changes as small as the ones in Figures 3 and 4. In particular, how does this single probe behave at high pressure? Does it reflect an active state at 2000 bar pressure - where possibly other effects (unfolding?) may occur?

      Here we can be quite confident. The spectra are predicated on a recent paper (Huang, et al, 2021) published in Cell in the spring of this year. Each state was carefully correlated with specific functional assays and conditions in a self-consistent way. The labeling site used on TM6 was strategically chosen based on earlier crystallographic studies of inactive and active A2AR. We have other labeling sites (TM7 and TM5) but the point was to use the chemical shift signatures to talk about cholesterol-induced changes to the conformational ensemble assigned in the Cell paper. The differences are small, but the fact that PAM effects are observed across conditions (apo, inverse agonist-bound, agonist-bound, and G protein-bound) reassures us that the spectral differences between low and high cholesterol samples are real. Unfolding by 19F NMR is in this case easy to see – the effects become irreversible and independent of ligand and the chemical shift ends up as one upfield peak. We also see a stabilization of the A1 (active) state, and a slight downfield shift of the active ensemble with increased pressure, consistent with reduced exchange dynamics (and coalescence) associated with the active state. We’ve commented on this in the revised version while trying not to distract from the flow of the paper.

      (iii) the data in Figure 6 (19F of cholesterol analogs) are hard to interpret. Is cholesterol bound to the protein? Does the 19F shift reflect binding to the protein? or interactions within the confined space of the disc? or with MSP? The two analogs do not tell a coherent story.

      It is confusing. We agree. We were fully expecting to see a clear A2AR bound state of cholesterol either through a concentration-dependent shift or a new peak. We also looked for “hidden” bound states through 19F NMR CEST experiments. We never identified a bound state in the presence of a range of cholesterol concentrations, as a function of receptor drug. We did observe small shifts although often these effects were as prominent with inverse agonist as agonist, possibly pointing to the existence of multiple weak binding sites. We’ve added some of this to the conversation. It’s also certainly possible that cholesterol exhibits some interaction with MSP, although again MSP is a constant presence in all the samples while we are focusing on cholesterol-dependent effects. In any case, we never detected a bound signature characteristic of slow exchange. That’s significant to the study despite the ambiguity of the measurements.

      (iv) the pressure NMR study (Fig 7D) has weaknesses. The authors implicitly assume that pressure acts on the membrane, leading to more ordering. (They do recognize the possibility that pressure may have an effect on the protein directly, but consider that this direct effect on the protein is minor.) I think that their arguments are possibly incorrect: they apply here pressure onto a sample of nanodiscs, but all studies they cite to justify the use of pressure on membranes dealt with extended lipid bilayers (liposomes). To me it is not clear what is the lateral effect of pressure onto a nanodisc. Can water laterally enter into the bilayer and thus modify the lipid structure? I also note that previous pressure-NMR studies on a GPCR in micelles (rather than nanodiscs) showed a shift toward the active state. As a micelle is a very different thing than a nanodisc, this suggests that the pressure effect is, at least in part or predominantly, on the protein itself.

      On top of the weakness of the pressure NMR experiment to identify what actually happens to the disc, it is not clear either how to interpret the 19F shift at very high pressure (Fig 7D). Given that there is only a single NMR probe, far out in an artificial side chain, it is difficult to assess the state of the protein.

      These are good questions. Firstly, lipid bilayers (be it in liposomes, bicelles, or nanodiscs) are super soft and compressible systems – all known to change in hydrophobic thickness to pressure much more readily than proteins – be they membrane embedded or soluble. Secondly, the 19F NMR spectra are well-known to be representative of fully functional receptor as discussed above. Thirdly, even detergent micelles are susceptible to pressure (much more so than the receptor itself) See J. Phys. Chem. B 2014, 118, 5698−5706 (now referenced in the paper). Pressure will enhance hydrophobic thickness, even in a detergent host, by ordering the acyl chains. The lower specific volume states, selected by higher pressure, have a larger hydrophobic dimension. Thus, the effects seen earlier are equally an effect of environment. In the revised version, we simply make the point that the protein isn’t unfolded and that both cholesterol or pressure give rise to enhanced hydrophobic thickness and corresponding shifts in equilibria to the active states.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to the reviewers

      We would like to thank the two reviewers for the valuable comments and suggestions on improvements. We addressed each reviewer’s comments individually. We have carefully revised the manuscript to incorporate new data and to make necessary clarifications.

      Overall we made the following major modifications:

      1. We investigated the relevance of BHRF1 expression in the context of EBV infection, in B cells and epithelial cells. We observed that EBV reactivation leads to MT hyperacetylation and subsequent mito-aggresome formation in both cell types. An EBV+ B cell line deficient for BHRF1 was generated and allowed us to demonstrate the involvement of BHRF1 in this phenotype. These results were added to Figures 2, 3 and Figure 1 – S1 in the revised version of the manuscript.
      2. We better characterized the mechanism leading to MT hyperacetylation, by demonstrating that BHRF1 colocalizes and interacts with the tubulin acetyltransferase ATAT1. These results were added to Figure 5 and Figure 5 – S2 in the revised manuscript.
      3. We generated stable HeLa cells KO for ATG5. Using these autophagy-deficient cells, we demonstrated the involvement of autophagy in BHRF1-induced MT hyperacetylation and mito-aggresome formation. We added these results to Figure 8 in the revised version of the manuscript.
      4. We compared the impact of BHRF1 with other mitophagy inducers on MT hyperacetylation, mitochondrial morphodynamics and the inhibition of IFN production, to demonstrate the specificity of the mechanism of action of BHRF1 (Figure 4 – S1).
      5. We demonstrated that MT hyperacetylation requires mitochondrial fission, using a Drp1-deficient HeLa cell line that we have previously described (Vilmen et al., 2020). This result was added to the revised version of the manuscript in Figure 3 – S2A. Moreover, we confirmed this result in the context of EBV infection (Figure 3 – S2B). ## Reviewer#1 Reviewer #1 (Evidence, reproducibility and clarity)

      Major comments:

      1. In the presented manuscript the authors characterize mainly BHRF1 overexpression in HeLa cells. Does BHRF1 also block type I IFN responses by microtubule hyperacetylation in the context of EBV infection? Do alpha-tubulin K40A overexpressing B cells produce more type I IFN after EBV infection?

      In the revised version of the manuscript, we added several experiments to explore the phenotype of BHRF1 during EBV infection, as requested by the two reviewers. Since EBV infects both B cells and epithelial cells, we used two different approaches. In latently-infected B cells, coming from Burkitt lymphoma (Akata cells), we induced EBV reactivation by anti-IgG treatment. To explore the importance of BHRF1 in this cell type, we constructed a cell line knocked down for BHRF1 expression, thanks to a lentivirus bearing an shRNA against BHRF1. In parallel, HEK293 cells harboring either EBV WT or EBV ΔBHRF1 genome were transfected with ZEBRA and Rta plasmids to induce the viral productive cycle in epithelial cells.

      We demonstrated that EBV infection induces MT hyperacetylation and subsequent mito-aggresome formation, both dependent on autophagy. Moreover, this phenotype requires BHRF1 expression in B cells and epithelial cells. We also observed that the expression of alpha-tubulin K40A in EBV+ epithelial cells blocks mito-aggresome formation induced by EBV reactivation. These results are now presented in Figures 2 and 3 in the revised version of the manuscript.

      Regarding regulation of IFN response during infection, several EBV-encoded proteins and non-coding RNAs have been described to interfere with the innate immune system. For example, BGLF4 and ZEBRA bind to IRF3 and IRF7, respectively, to block their nuclear activity (Hahn et al., 2005; Wang et al., 2009). Moreover, Rta expression decreases mRNA expression of IRF3 and IRF7 (Bentz et al., 2010; Zhu et al., 2014). We therefore think that studying the inhibitory role of BHRF1 on IFN response in the context of EBV reactivation will be arduous. Indeed, the lack of BHRF1 could be compensated by the activity of other viral proteins acting on innate immunity.

      1. The authors document that the observed microtubule hyperacetylation is due to the acetyltransferase ATAT1. How does BHRF1 activate ATAT1? Is there any direct interaction?

      As requested by reviewer#1, we explored a possible interaction of BHRF1 and ATAT1. First, we observed by confocal microscopy that GFP-ATAT1 colocalized with BHRF1 in the juxtanuclear region of HeLa cells (Figure 5 – S2). Second, we demonstrated by two co-immunoprecipitation assays that BHRF1 binds to exogenous ATAT1 (Figures 5E and 5F). These new results have been added to the revised version of the manuscript and clarify the mechanism of action of BHRF1.To go further, we explored whether BHRF1 was able to stabilize ATAT1 because it was recently reported that p27, an autophagy inducer that modulates MT acetylation, binds to and stabilizes ATAT1 (Nowosad et al., 2021). However, BHRF1 expression does not impact the expression of ATAT1 (data not shown).

      1. Furthermore, the authors demonstrate with pharmacological autophagy inhibitors that autophagy is increased in a BHRF1 dependent and microtubule acetylation independent manner but required for microtubule hyperacetylation. How does autophagy stimulate ATAT1 dependent microtubule hyperacetylation? Is this dependency also observed with a more specific ATG silencing or knock-out?

      We generated a stable autophagy-deficient HeLa cell line KO for ATG5, using an ATG5 CRISPR/Cas9 construct delivered by a lentivirus. The lack of ATG5 expression and LC3 lipidation was verified by immunoblot (Figure 8B). We observed that BHRF1 was unable to increase MT acetylation in this autophagy-deficient cell line (Figure 8C) in accordance with our data reported in the original manuscript using treatment with spautin 1 or 3-MA (previously Figure S5C and Figure 8A in the revised version). Moreover, the lack of hyperacetylated MT in BHRF1-expressing cells led to a dramatic reduction of mito-aggresome formation (Figures 8D and 8E). These new results demonstrate that autophagy is required for BHRF1-induced MT hyperacetylation.

      Minor comments:

      1. "Innate immunity" and "innate immune system", but not "innate immunity system" are in my opinion better wordings.

      We thank reviewer #1 for this useful comment. The term “innate immunity system” in the introduction section has been replaced by “innate immune system”. Elsewhere, we used “innate immunity”.

      1. The reader would benefit from a discussion on the role of type I IFNs during EBV infection and how important the authors think their new mechanism could be in this context.

      We thank the reviewer for this suggestion. However, we already discussed the different strategies developed by EBV to counteract IFN response induction, in our previous study, suggesting the importance of IFN in the control of EBV infection (Vilmen et al., 2020). In this study, we have focused the discussion on the role of mitophagy in the control of IFN production.

      Reviewer #1 (Significance):

      The significance of the described pathway for type I IFN production needs to be documented in the context of EBV infection.

      The revised version of the manuscript now explored the role of BHRF1 in the context of EBV infection See above for details (major comment 1).

      Reviewer#2

      Reviewer #2 (Evidence, reproducibility and clarity)

      The work presented is a relatively straightforward cell biological dissection of a subset of the previously described functions of BHRF1, focusing on the mitochondrial aggregation phenotype. The approaches and analysis are performed in cell lines mainly using overexpression and some siRNA experiments and appear well done throughout.

      We thank reviewer #2 for this comment and would like to underline that the revised version of the manuscript includes now a study of BHRF1 in the context of infection in both B cells and epithelial cells, the generation of a stable EBV positive B cells KD for BHRF1 by using shRNA approach and the generation of a stable autophagy-deficient cell line, using CRISPR/cas9 against ATG5.

      Reviewer #2 (Significance):

      The current study unpicks one of the phenotypes induced by BHRF1 over expression: namely the previously reported mitochondrial aggregation phenotype. The findings that peri-nuclear mitochondrial aggregation are dependent on microtubules and retrograde motors are useful but could perhaps have been predicted. Overexpression of many proteins (or indeed chemical treatments) causing cellular and / or mitochondrial stress have been shown to cause mitochondrial perinuclear aggregation.

      To explore the specificity of BHRF1 activity on mito-aggresome formation, we decided to investigate the impact of AMBRA1-ActA, a previously characterized mitophagy inducer, on MT (Strappazzon et al., 2015). We observed that expression of AMBRA1-ActA leads to mito-aggresome formation but does not modulate acetylation of MTs, contrary to BHRF1. This result was added to the revised version of the manuscript (Figure 4 - S1A and S1B). Moreover, chemical treatments with either oligomycin/antimycin or CCCP, which induce mitochondrial stress and mitophagy (Lazarou et al., 2015; Narendra et al., 2008), do not cause mitochondrial juxtanuclear aggregation (Figure 4 - S1C). We also observed that a hyperosmotic shock-induced by NaCl leads to MT hyperacetylation (Figure 4 - S1D) but not to the mito-aggresome formation (data not shown), suggesting that MT hyperacetylation per se is not sufficient to induce the clustering of mitochondria. Altogether, these new results demonstrated the originality of the mechanism used by BHRF1 to induce mito-aggresome formation.

      The findings linking the process to altered tubulin acetylation are more novel and interesting and may add a new dimension to understanding of BHRF1 function. However what is lacking here is really advancing our understanding of how BHRF1 does this.

      We thank the reviewer for underlining the fact that regulation of mitochondrial morphodynamics by BHRF1 via MT hyperacetylation is novel and interesting.

      In the original version of the manuscript, we have demonstrated that autophagy and ATAT1 are required for BHRF1-induced hyperacetylation. In the revised version, we uncovered that BHRF1 interacts and colocalizes with ATAT1 (Figures 5E, 5F and Figure 5 – S2). Moreover, we demonstrated that MT hyperacetylation is involved in the localization of autophagosomes next to the nucleus, thus close to the mito-aggresome. Therefore, we better characterized the mechanism of action of BHRF1 in the revised manuscript.

      Although some downstream processes are identified in the current and previous study it still remains unclear what the exact underlying mechanisms are. Is BHRF1 doing this by disrupting mitochondrial function and making the organelles sick or by causing cellular stress indirectly leading to mitochondrial pathology? Previous studies have shown that cellular stress such as altered proteostasis can also cause stress-induced mitochondrial retrograde trafficking and aggregation. Is BHRF1 causing the same phenotype by generally stressing the cell and if it is more specifically through mitochondrial disruption what is the mechanism? As demonstrated by the authors in their previous work, BHRF1 does a number of things to cell signalling. Which of these are leading to a general disruption of cell signalling versus having specific effects on the cell or mitochondria still seems somewhat unclear.

      We previously reported that BHRF1 expression does not alter the mitochondrial membrane potential (Vilmen et al., 2020). contrary to treatment by O/A or CCCP. Moreover, we observed that these treatments do not induce mitochondrial clustering (Figure 4 – S1). Therefore, BHRF1 modulates mitochondrial dynamics in a specific and regulated manner.

      Our study clearly demonstrated that BHRF1 uses an original strategy to modulate IFN response, via a regulated pathway of successive steps, from mitochondrial fission to mitophagy, via MT hyperacetylation, rather than “a general disruption of cell signalling”.

      It would be interesting to know whether the role of microtubule hyperacetylation and ATAT1 are more generally involved in other previously described processes of stress induced mitochondrial aggregation.

      In the revised version of the manuscript, we observed that AMBRA1-ActA does not change the level of MT acetylation, whereas it induces mito-aggresome formation. These data reinforce the originality of the BHRF1 mechanism.

      Currently while this is a nicely performed follow up study to their 2020 paper, the present study neither provides in depth mechanistic advance of BHRF1 function, nor a better understanding of the molecular steps in a more generally relevant pathway (e.g. mitophagy).

      We disagree with the reviewer’s comment. Indeed, in this new study, we uncovered and characterized a new mechanism of action for BHRF1 via ATAT1-dependent MT hyperacetylation. More generally, we reported for the first time that innate immunity can be regulated by the level of MT acetylation.

      In addition, all the experiments were performed in cell lines and rely on the overexpression of a viral protein. But this is a significant over-simplification of the viral pathological process. It therefore remains unclear how pathophysiologically relevant the findings are (e.g. to EBV pathology) without further extending this element of the work.

      To address this comment, we extended our results in the infectious context, by adding several experiments performed in EBV-infected cell lines (see above reviewer#1 for details). The same phenotype was observed after reactivation of the EBV productive cycle as in BHRF1 ectopic expression. Moreover, we demonstrated that the phenotype is BHRF1-dependent. This suggests the importance of BHRF1 in EBV pathogenesis by participating in innate immunity control.

      An additional minor issue is the authors naming of the process as Mito-aggresome formation. Although this might sound catchy it is somewhat unclear what the biological basis for this is. Aggresomes are defined structures that occur in cells during pathology and due to the peri-nuclear accumulation of misfolded protein. Since the process here is simply the description of aggregated mitochondria next to the nucleus but doesn't seem to have anything to do with protein misfolding it's really unclear how this labelling is helpful to the field. The process of perinuclear mitochondrial aggregation e.g. during mitochondrial stress or damage has been described many times before without the need for calling it a mito-aggresome. This term is likely to cause unhelpful confusion.

      We understand the comment of reviewer #2, but since 2010 the term “mito-aggresome” was previously used in other studies and refers to a clustering of mitochondria next to the nucleus, similarly to what we observed with BHRF1 (D’Acunzo et al., 2019; Lee et al., 2010; Springer and Kahle, 2011, 2011; Strappazzon et al., 2015; Van Humbeeck et al., 2011; Yang and Yang, 2011).

      However, we took into consideration the risk of confusion for the readers, by changing how we introduced the term “mito-aggresome” in the revised version of the manuscript (page 5 line 94).

      References

      Bentz GL, Liu R, Hahn AM, Shackelford J, Pagano JS. 2010. Epstein–Barr virus BRLF1 inhibits transcription of IRF3 and IRF7 and suppresses induction of interferon-β. Virology 402:121–128. doi:10.1016/j.virol.2010.03.014

      D’Acunzo P, Strappazzon F, Caruana I, Meneghetti G, Di Rita A, Simula L, Weber G, Del Bufalo F, Dalla Valle L, Campello S, Locatelli F, Cecconi F. 2019. Reversible induction of mitophagy by an optogenetic bimodular system. Nat Commun 10:1533. doi:10.1038/s41467-019-09487-1

      Hahn AM, Huye LE, Ning S, Webster-Cyriaque J, Pagano JS. 2005. Interferon regulatory factor 7 is negatively regulated by the Epstein-Barr virus immediate-early gene, BZLF-1. J Virol 79:10040–10052. doi:10.1128/JVI.79.15.10040-10052.2005

      Lazarou M, Sliter DA, Kane LA, Sarraf SA, Wang C, Burman JL, Sideris DP, Fogel AI, Youle RJ. 2015. The ubiquitin kinase PINK1 recruits autophagy receptors to induce mitophagy. Nature 524:309–314. doi:10.1038/nature14893

      Lee J-Y, Nagano Y, Taylor JP, Lim KL, Yao T-P. 2010. Disease-causing mutations in Parkin impair mitochondrial ubiquitination, aggregation, and HDAC6-dependent mitophagy. J Cell Biol 189:671–679. doi:10.1083/jcb.201001039

      Narendra DP, Tanaka A, Suen D-F, Youle RJ. 2008. Parkin is recruited selectively to impaired mitochondria and promotes their autophagy. J Cell Biol 183:795–803. doi:10.1083/jcb.200809125

      Nowosad A, Creff J, Jeannot P, Culerrier R, Codogno P, Manenti S, Nguyen L, Besson A. 2021. p27 controls autophagic vesicle trafficking in glucose-deprived cells via the regulation of ATAT1-mediated microtubule acetylation. Cell Death Dis 12:1–18. doi:10.1038/s41419-021-03759-9

      Springer W, Kahle PJ. 2011. Regulation of PINK1-Parkin-mediated mitophagy. Autophagy 7:266–278. doi:10.4161/auto.7.3.14348

      Strappazzon F, Nazio F, Corrado M, Cianfanelli V, Romagnoli A, Fimia GM, Campello S, Nardacci R, Piacentini M, Campanella M, Cecconi F. 2015. AMBRA1 is able to induce mitophagy via LC3 binding, regardless of PARKIN and p62/SQSTM1. Cell Death Differ 22:419–32. doi:10.1038/cdd.2014.139

      Van Humbeeck C, Cornelissen T, Hofkens H, Mandemakers W, Gevaert K, De Strooper B, Vandenberghe W. 2011. Parkin Interacts with Ambra1 to Induce Mitophagy. J Neurosci 31:10249–10261. doi:10.1523/JNEUROSCI.1917-11.2011

      Vilmen G, Glon D, Siracusano G, Lussignol M, Shao Z, Hernandez E, Perdiz D, Quignon F, Mouna L, Poüs C, Gruffat H, Maréchal V, Esclatine A. 2020. BHRF1, a BCL2 viral homolog, disturbs mitochondrial dynamics and stimulates mitophagy to dampen type I IFN induction. Autophagy 17:1296–1315. doi:10.1080/15548627.2020.1758416

      Wang J-T, Doong S-L, Teng S-C, Lee C-P, Tsai C-H, Chen M-R. 2009. Epstein-Barr Virus BGLF4 Kinase Suppresses the Interferon Regulatory Factor 3 Signaling Pathway. J Virol 83:1856–1869. doi:10.1128/JVI.01099-08

      Yang J-Y, Yang WY. 2011. Spatiotemporally controlled initiation of Parkin-mediated mitophagy within single cells. Autophagy 7:1230–1238. doi:10.4161/auto.7.10.16626

      Zhu L-H, Gao S, Jin R, Zhuang L-L, Jiang L, Qiu L-Z, Xu H-G, Zhou G-P. 2014. Repression of interferon regulatory factor 3 by the Epstein-Barr virus immediate-early protein Rta is mediated through E2F1 in HeLa cells. Mol Med Rep 9:1453–1459. doi:10.3892/mmr.2014.1957

  5. Nov 2021
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to Reviewer’s comments

      We thank the three reviewers for their positive comments and constructive feedback. We have addressed the issues raised through additional experiments and text changes which have helped to improve the manuscript. Below, we address the specific points with detailed responses (reviewer comments are provided in italic).

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Rodriguez-Lopez et al describes the analysis of long intergenic non-coding RNA (lincRNA) function in fission yeast using both deletion and overexpression methods. The manuscript is very well presented and provides a wealth of lincRNA functional information for the field. This work is an important advance as there is still very little known about the function of lincRNAs in both normal and other conditions. An impressive array of conditions were assessed here. With a large scale analysis like this there is really not one specific conclusion. The authors conclude that lincRNAs exert their function in specific environmental or physiological conditions. This conclusion is not a novel conclusion, it has been proposed and shown before, but this manuscript provides the experimental proof of this concept on a large scale.

      The lincRNA knock-out library was assessed using a colony size screen, a colony viability screen and cell size and cell cycle analysis. Additionally, a lincRNA over-expression library was assessed by a colony size screen. These different functional analysis methods for lincRNAs were than carried out in a wide variety of conditions to provide a very large dataset for analysis. Overall, the presentation and analysis of the data was easy to follow and informative. Some points below could be addressed to improve the manuscript.

      There were 238 protein coding gene mutants assessed in parallel, to provide functional context, which was a very promising idea. But, unfortunately, the inclusion of 104 protein coding genes of unknown function restricted the use of the protein coding genes in the integrated analysis to connect lincRNAs to a known function using guilt by association.

      Reply: Yes, the unknown coding-gene mutants did certainly not help to provide functional context through guilt by association. These mutants were included to generate functional clues for the unknown proteins and compare phenotype hits with unknown lincRNA mutants. Nevertheless, because the known coding-gene mutants included broadly cover all high-level biological processes (GO slim), we could make several useful functional inferences for certain lincRNAs as discussed.

      The colony viability screen is not described well throughout the manuscript. Firstly, the use of phloxine B dye to determine cell viability needs to be described better when first introduced at the bottom of page 6. What exactly is this viability screen and red colour intensity indicating? Please define what the different levels of red a colony would indicate as far as viability. I assume an increase in red colour indicates more dead cells? So it is confusing that later the output of this assay is described as giving a resistant/sensitive phenotype or higher/lower viability. How can you get a higher viability from an assay that should only detect lower viability? Shouldn't this assay range from viable (no, or low red, colour) to increasing amounts of red indicating increasingly less viability? Figure 4D is also confusing with the "red" and "white" annotations. These should be changed to "lower viability" and "viable" or "not viable" and "viable".

      Reply: The colony-viability screen is described in detail in our recent paper (Kamrad et al, eLife 2020). We have now better explained how phloxine B works to determine cell viability (p. 6). The reviewer’s assumption is correct: an increase in red colour indicates more dead cells. However, all phenotypes reported are relative to wild-type cells under the same condition. Many conditions lead to a general increase in cell death, but some mutants show a lower increase in cell death compared to wild-type cells. These mutants, therefore, have a higher viability than wild-type cells, i.e. they are more resistant than wild-type under the given condition. We have tried to clarify this in the text, including the legend of Fig. 4. We agree that the ‘red’ and ‘white’ annotations in Fig. 4D could be confusing. We have now changed these to ‘low viability’ and ‘high viability’. Again, this is relative to wild-type cells.

      How are you sure that when generating the 113 lincRNA ectopic over-expression constructs by PCR that the sequences you cloned are correct? Simply checking for "correct insert size", as stated in the methods, is not really good practice and these constructs should be fully sequenced to be sure they contain the correct sequence and that constructs have not had mutations introduced by the PCR used for cloning. Without such sequence confirmation one cannot be completely confident that the data produced is specific for a lincRNA over-expression. Additionally, a selection of strains with the overexpression constructs should be tested by qRT-PCR and compared to a non-over-expressing strain to confirm lincRNA overexpression.

      Reply: To minimize errors during PCR amplification, we used the high-fidelity Phusion DNA polymerase which features an >50-fold lower error rate than Taq DNA Polymerase. We had confirmed the insert sequences for the first 17 lincRNAs cloned using Sanger sequencing (but did not report this in the manuscript). We have now checked additional inserts of the overexpression plasmids by Sanger sequencing in 96-well plate-format using a universal forward primer upstream of the cloning site. This high-troughput sequencing produced reliable sequence data for 80 inserts, including full insert sequences for 62 plasmids and the first ~900 bp of insert sequences for 18 plasmids). Of these, only the insert for SPNCRNA.601 showed a sequence error compared to the reference genome: T to C transition in position 559. This mutation could reflect either an error that occurred during cloning or a natural sequence variant among yeast strains (lincRNA sequences are much more variable than coding sequences). So, in general, the PCR cloning accurately preserved the sequence information. We have added this information in the Methods (p. 27-28). Please note that lincRNAs depend much less on primary nucleotide sequence than mRNAs, and a few nucleotide changes are highly unlikely to interfere with lincRNA function.

      Minor comments:

      Page 4, lines 19-20 - "A substantial portion of lincRNAs are actively translated (Duncan and Mata, 2014), raising the possibility that some of them act as small proteins." This sentence does not make sense, lincRNAs can't "act as" small proteins, they can only "code for" small proteins. Wording needs to be changed here.

      Reply: We agree and have changed the wording as suggested.

      Figure 1A is a nice representation but what are the grey dots? Are they all ncRNAs including lincRNAs? This needs to be stated in the legend.

      Reply: The grey dots represent all non-coding RNAs across the three S. pombe chromosomes as described by Atkinson et al., 2018. This has now been clarified in the legend.

      How many lincRNAs are there in total in pombe and what percentage did you delete? These numbers should be stated in the text.

      Reply: There are 1189 lincRNAs and we mutated ~12.6% of them. These numbers are now stated at the end of the Introduction, page 5.

      It would be nice if Supplementary Figure 1 included concentrations or amounts of the conditions used. This info is buried in a Supplementary table and would be better placed here.

      Reply: Supplemental Fig. 1 provides a simple overview for the different conditions and drugs used. For most stresses and drugs, we used multiple different doses. So the figure would become cluttered if we indicated all these concentrations, detracting from the main message. Colleagues who are interested in the different concentration ranges used for specific conditions can readily obtain this information from Supplemental Dataset 1. We have now added a statement in this respect to the legend of Supplemental Fig. 1

      Page 6, last sentence. What is a "biological repeat"? Three distinct deletion strains (ie three different deletion strains made by CRISPR) or one deletion strain used three times?

      Reply: Biological repeat means that one deletion strain was assayed three times independently, each with at least two colonies (technical repeats). In most cases, we had two or more independently generated deletion strains for each lincRNA (using the same or different gRNAs), and we performed at least three biological repeats for each strain. The numbers of independent strains for each lincRNA are provided in Supplemental Dataset 1 (sheet: lincRNA_metadata, column: n_independent_ko_mutants). The total numbers of repeats carried out for each condition after QC filtering are available in Supplemental Dataset 2 (columns: observation_count). We have clarified this on p. 7, and the details are now provided in the Methods on p. 28-29 (deletion mutants) and p. 32 (overexpression mutants).

      There is no mention in the manuscript of how other researchers can get access to the deletion strains and over-expression plasmids.

      Reply: As is usual, all strains and plasmids will be readily available upon request.

      Reviewer #1 (Significance (Required)):

      The production of lincRNA deletion strains and overexpression plasmids, and their analysis under an impressive number of conditions, provides key resources and data for the ncRNA field. This work complements nicely the analysis of protein coding gene deletion strains and provides the tools and data for future mechanistic studies of individual lincRNAs. This work would be of interest to the growing audience of ncRNA researchers in both yeast and other systems.

      Field of expertise:

      Yeast deletion strain construction and analysis, RNA functional analysis

      \*Referee Cross-commenting** *

      Reviewer #3 makes an important point that the stability of each lincRNA over expressed from plasmid is not known and therefore some lincRNAs may not be overexpressed as predicted. RT-qPCR would be required to assess lincRNA expression levels from the plasmids. It also appears that we both agree that it is important to determine the sequence of the cloned lincRNAs in the over expression plasmids.

      Reply: See reply in response to Reviewer 3.

      Reviewer #3 also makes an important point in his review that where it is predicted that a lincRNA deletion influences an adjacent gene in cis then the expression of that gene should be tested.

      Reply: See reply in response to Reviewer 3.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      \*Summary:** *

      The Rodriguez-Lopez manuscript from the Bahler lab present the phenotypical and functional profiling of lincRNA in fission yeast. This is the first large-scale, extensive work of this nature in this model organism and it therefore nicely complement the well-documented examples of lincRNA already reported in S.pombe.

      The work is very solid using seamless genome deletion and overexpression followed by colony-based assay in respone to a very wide set of conditions.

      \*Major comments:** *

      - considering that this is a descriptive work by nature and that the experiments were properly conducted as far as I can judge, I don't have major issues with this paper.

      To me the only thing that is missing is a gametogenesis assay, for two reasons: First, several reported cases of lincRNAs in pombe critically regulates meiosis, and second many of the analysed lincRNAs are upregulated durig meiosis. Figure 6B already points to three obvious candidates. I don't think it would take to much time to look at the deletion and OE in an h90 strain and see the effect of gametogenesis for the entire set or at least the 3 candidates from Figure 6.

      If the already broad set of lincRNAs implicated in meiosis would grow, this would be another evidence that eukaryotic cell differentiation relies on non-coding RNAs even in simpler models.

      Reply: We agree that this is a meaningful analysis to add. We have now deleted the three unstudied lincRNA genes, along with the meiRNA gene, from the sub-cluster of Figure 6B in the homothallic h90 background (to allow self-mating). We have analysed meiosis and spore viability of these four deletion strains together with a wild-type h90 control strain. These experiments indicate that cell mating is normal in the deletion mutants, but meiotic progression is somewhat delayed in SPNCRNA.1154, SPNCRNA.1530 and, most strongly, meiRNA mutants (the latter has been reported before (reviewed by Yamashita 2019). Notably, we detected significant reductions in spore viability for all four deletion mutants compared to the control strain. These results point to roles of SPNCRNA.1154, SPNCRNA.1530, and SPNCRNA.335 in meiotic differentiation, as predicted by the clustering analyses. This is a nice addition to the manuscript. We now report these results on p. 23, with a new Supplemental Figure 10, and describe the experimental procedures in the Methods (p. 34-35).

      \*Minor comments:** *

      - A reference to the recent work of the Rougemaille lab on mamRNA is necessary

      Reply: Yes, we now cite this reference in the Introduction (p. 4).

      - a discussion of the possibility to perfom large-scale genetic interactions searches (as done by Krogan for protein-coding genes) would add to the discussion of futue plans

      Reply: We have added a sentence about the potential of SGA screens in the Conclusions (p. 26).

      Reviewer #2 (Significance (Required)):

      The work unambigously shows that that most of the lincRNAs analyzed exert cellular functions in specific environmental or physiological contexts. This conclusion is critical because the biological relevance this so-called « dark matter » is still debated despite a few well-established cases. This is an important addition to the field and the deep phenotyping work already points to some directions to analyse some of these lincRNA in the context of cell cycle progression, metabolism or meiosis.

      \*Referee Cross-commenting** *

      - I agree with the issues raised by referees 1 and 3 but I am concerned about the added value of a RT-qPCR. First, this is a significant amout of work considering the large set of targets. Second a more importantly, what you ll end up with is a fold change. What will be considered as overexpression? Which threshold? This is why I prefer a biological read-out (a phenotype) because whatever the fold change, it tells us that there is an effect. It is very likely indeed that some targets are not overexpressed because of their rapid degradation. To me, this is the drawback of any large-scale studies.

      - Also, looking at the expression of the adjacent gene in the case of a cis-effect is interesting though this is likely condition-dependent (because most phenotypes appear in specific conditions). So, what would be the conclusion if there is no effect in classical rich media?

      - The sequence of the insert should be specified, I agree. Most likely, it is the sequence available from pombase (this is what I understood) but that should be clarified indeed.

      Reply: Yes, the sequences of the inserts are available from PomBase, and we provide the primer sequences used for cloning in the Supplemental Dataset 1. We have now clarified this in the Methods (p. 27).

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this work from the group of Jurg Bahler, the authors take advantage of the high throughput colony-based screen approach they recently developed (Kamrad et al, eLife 2020) to perform a functional profiling analysis on a subset of 150 lincRNAs in fission yeast. Using a seamless CRISPR/Cas9-based method, they created deletion mutants for 141 lincRNAs. In addition, the authors also generated strains ectopically overexpressing 113 lincRNAs from a plasmid (under the control of the strong and inducible nmt1 promoter).

      The viability and growth of all these mutants was then assessed across benign, nutrient, drug and stress conditions (149 conditions for the deletion mutants, 47 conditions for the overexpression). For the deletion mutants, the authors also assayed in parallel mutants of 238 protein-coding genes (PCGs) covering multiple biological processes and main GO classes.

      In benign conditions, deletion of 5 and 10 lincRNAs resulted in a reduced growth phenotype (rich and minimal medium, respectively). Morphological characterization by microscopy also revealed cell size defects for 6 lincRNA mutants (2 shorter, 4 longer). In addition, 27 mutants displayed phenotypes pointing defects in the cell cycle.

      Remarkably, the nutrient/drug/stress conditions revealed more phenotypes, with 60 of the 141 lincRNA mutants showing a growth phenotype in at least one condition, and 25 mutants showing a different viability compared to the wild-type in at least one condition.

      Also remarkable is the observation that 102/113 lincRNA overexpression strain displayed a growth phenotype in at least one condition, 14 lincRNAs showing phenotypes in more than 10 conditions.

      The clustering analyses performed by the authors also provide functional insight for some lincRNAs.

      Overall, this is an important study, well conducted and well presented. Together, the data described by the authors are convincing and highlight that most lincRNAs would function in very particular conditions, and that deletion/inactivation and overexpression are complementary approaches for the functional characterization of lncRNAs. This has been demonstrated here, in a very elegant manner.

      I think this manuscript will be acknowledged as a pioneer work in the field.

      \*A. Major comments** *

      - A.1. Are the key conclusions convincing?

      To my opinion, the key conclusions of this study are convincing.

      - A.2. Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      No. The authors are careful in their claims and conclusions.

      - A.3. Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      This study is based on systematic lincRNA deletion/overexpression.

      - For the deletion strains, I could not find any information about the control of the deletions. Are the authors sure that the targeted lincRNAs were indeed properly deleted?

      Reply: Yes, we had carefully checked the correctness of the deletions using several controls as described by Rodriguez-Lopez et al. 2017. All deletion strains were checked for missing open-reading frames by PCR. For 20 strains, we also sequenced across the deletion scars. We re-checked all strains by PCR after arraying them onto the 384 plates to ensure that no errors occurred during the process. We have now specified this in the Methods (p. 27).

      - For the overexpression, there is only a control of the insert size by PCR. Sanger sequencing would have been preferable to confirm that the targeted lincRNAs were properly cloned, without any mutation. In addition, the authors did not check that the lincRNAs were indeed overexpressed (at least in the benign conditions). Is the overexpression fold similar for all the lincRNAs? Do the 14 lincRNAs showing the most consistent phenotypes in at least 10 conditions display different expression levels than the other lincRNAs?

      - A.4. Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      - Validating the deletion strains requires genomic DNA extraction and then PCR. This is repetitive and tedious, but this control is important, I think. The time needed depends on the possibility of automating the process. I think this is feasible in this lab.

      - Controlling the insert sequence into the overexpression vector requires plasmid DNA (available as it was used for PCR) and one/several primer(s), depending on the insert size. The sequencing itself is usually done by platforms.

      - Analysing lincRNA overexpression at the RNA level requires yeast cultures, RNA extraction and then RT-qPCR. Again, the time needed depends on the possibility of automating the process.

      Reply: We have now checked most overexpression constructs by Sanger sequencing of the inserts as described in response to Reviewer 1. Moreover, we have tested the overexpression levels for eight selected overexpression constructs using RT-qPCR analysis. These eight constructs feature the entire range of associated phenotypes hits, including 3 lincRNAs with the highest number of phenotypes in 14 conditions, 3 with no phenotypes, and 2 with intermediate numbers of phenotypes. The RT-qPCR results show that the lincRNAs were 35- to 2200-fold overexpressed relative to the empty-vector control strain (which expresses the lincRNA at native levels). No clear pattern was evident between expression levels and phenotype hits, e.g. lincRNAs without phenotypes when overexpressed showed similar fold-changes as a lincRNA showing 13 phenotypes. We present these results on p. 21/22 and in the new Supplemental Figure 9A, and describe the experiment in the Methods (p. 28).

      As pointed out by Reviewer 2, these fold changes in expression are actually of limited value compared to the phenotype read-outs. The important result is that we detected phenotypes for over 90% of the overexpression strains, indicating that overexpression generally worked. Given that this is a large-scale study, there might be some lincRNA constructs that are faulty or are not overexpressed. It would not be realistic or meaningful to test all constructs. Any follow-on studies focusing on a specific lincRNAs will need to first validate the large-scale results as is common practice.

      - A.5. Are the data and the methods presented in such a way that they can be reproduced?

      The methods are clearly and extensively explained. If necessary, the reader can find more details about the high-throughput colony-based screen approach in the original paper (Kamrad et al, eLife 2020); a very interesting technical discussions can also be found in the reviewers reports and in the authors response published alongside.

      - A.6. Are the experiments adequately replicated and statistical analysis adequate?

      The experiments are replicated. However, I feel confused regarding the number of replicates used in each analysis.

      In the first part of the Results, it is mentioned that all colony-based phenotyping was performed in at least 3 independent replicates, with a median number of 9 repeats per lincRNAs. In the Methods section, I read that for the high-throughput microscopy and flow cytometry for cell-size and cell-cycle phenotypes, over 80% of the 110 lincRNA mutants screened for cellular phenotypes were assayed in at least 2 independent biological repeats. For the overexpression, I read that each strain was represented by at least 12 colonies across 3 different plates and experiments were repeated at least 3 times. Each condition was assayed in three independent biological repeats, together with control EMM2 plates, resulting in at least 36 data points per strain per condition.

      Perhaps I missed something. If not, could the authors clarify this? In addition, I suggest to indicate the number of replicates used for each lincRNA/condition/assay in Supplemental Dataset 2 (I could only find the information for the Flow Cytometry) and in Supplemental Dataset 6.

      Reply: For all colony-based phenotyping, we performed at least three biological repeats, meaning that the strains were assayed three times independently, each with at least two colonies (technical repeats). In most cases, we had two or more independently generated deletion strains for each lincRNA, and we performed at least three biological repeats for each strain (hence the higher median number of nine repeats per lincRNA). The numbers of independent deletion strains for each lincRNA are provided in Supplemental Dataset 1 (sheet: lincRNA_metadata, column: n_independent_ko_mutants). The total numbers of repeats carried out for each condition after QC filtering are available in Supplemental Dataset 2 (columns: observation_count). We have now clarified this on p. 6, and the details are provided in the Methods on p. 28-29 (for deletion mutants) and p. 32 (for overexpression mutants). For the high-throughput microscopy and flow cytometry experiments, we performed the repeats as described in the text.

      \*B. Minor comments** *

      - B.1. Specific experimental issues that are easily addressable.

      - The pattern of the SPNCRNA.1343 and SPNCRNA.989 mutants is consistent with the idea that these lincRNAs act in cis and that their deletion interferes with the expression of the adjacent tgp1 and atd1 genes, respectively. The authors could easily test by RT-qPCR or Northern Blot that the lincRNA deletion leads to the induction of the adjacent gene. Also, if the hypothesis of the authors is correct, the ectopic expression of these two lincRNAs in trans should not complement the phenotypes of the corresponding mutants. These experiments would reinforce the conclusion of the authors about the specific regulatory effect of the SPNCRNA.1343 and SPNCRNA.989 lincRNAs.

      Reply: It would actually not be as easy as suggested to obtain conclusive results in this respect. For SPNCRNA.1343 and its neighbour, atd1, the mechanisms involved have already been shown in detail based on several mechanistic studies (Ard et al., 2014; Ard and Allshire, 2016; Garg et al., 2018; Shah et al., 2014; 2014; Yague-Sanz et al., 2020). But these studies did require multiple precise genetic constructs and specialized approaches to interrogate the complex regulatory relationships between the overlapping transcripts which can be both positive and negative. As correctly pointed out by Reviewer 2, we do not know the particular conditions where any cis-regulatory interactions take place, and a negative result would not be conclusive. We have interrogated our RNA-seq data obtained under multiple genetic and environmental conditions (Atkinson et al. 2018) to analyse the regulatory relationship between SPNCRNA.1343 and atd1 (studied before) as well as SPNCRNA.989 and tgp1 (proposed in our manuscript). Depending on the specific conditions, both of these gene pairs show positive or negative correlations in expression levels. So it is not possible to just perform the easy experiment as suggested to reach a clear conclusion.

      - Is there any possibility that some nutrient/drug/stress conditions interfere with the expression from the nmt1 promoter?

      Reply: This seems unlikely as this widely used promoter is known to be specifically regulated by thiamine. Consistent with this, we actually detected phenotypes for over 90% of the overexpression strains. But we cannot exclude the possibility that some conditions might interfere with nmt1 function.

      - Supplemental Figure 7 refers to unpublished data from Maria Rodriguez-Lopez. Is this still allowed?

      Reply: These are just control RNA-seq data from wild-type cells growing in rich medium. It does not seem that meaningful, but if required we could submit these data to the European Nucleotide Archive (ENA).

      - Supplemental Figure 8 shows drop assays to validate the growth phenotypes revealed by the screen for lincRNAs of clusters 1 and 3. As admitted by the authors in the text, in most cases, the effects are quite difficult to see to the naked eye. Did the authors consider the possibility to use growth curves (for the lincRNAs/conditions they would like to highlight), which might be more appropriate to visualize weak effects?

      Reply: We have tried a few experiments in liquid medium using our BioLector microfermentor. However, the doses need to be substantially changed for liquid media (in which cells typically are more sensitive than on solid media). So the situation with the altered conditions would become too confusing and could not be used as a direct validation of our results from solid media.

      - B.2. Are prior studies referenced appropriately?

      Yes. The authors could have cited the work of Huber et al (2016) Cell Rep. (PMID: 27292640) as another pioneer study where systematic lncRNA deletion was performed, even if in this case, these were antisense lncRNAs.

      Reply: Agreed, we now cite this paper in the Introduction (p. 4).

      - B.3. Are the text and figures clear and accurate?

      Overall, I found the text and figures clear.

      Reviewer #3 (Significance (Required)):

      Eukaryotic genomes produce thousands of long non-coding RNAs, including lincRNAs which are expressed from intergenic regions and do not overlap PCGs. Several lincRNAs have been extensively studied and characterized, showing that they function in different cellular processes, such as regulation of gene expression, chromatin modification, etc. However, beside these well documented lincRNAs, the function of most lincRNAs remains elusive. In addition, under the standard growth conditions used in labs, many of them are expressed to very low levels, and for the few cases for which it has been tested, the deletion and/or overexpression in trans often failed to display in a detectable phenotype.

      High throughput approaches for lncRNA functional profiling are currently emerging. The lab of Jurg Bahler recently developed a high throughput colony-based screen approach enabling them to quantitatively assay the growth and viability of fission yeast mutants under multiple conditions (Kamrad et al, eLife 2020). Here, they take advantage of this approach to characterize mutants of 150 lincRNAs in fission yeast, including not only deletion mutants generated using the CRISPR/Cas9 technology, but also overexpression mutants, tested in 149 and 47 growth conditions, respectively. This systematic approach allowed the authors to reveal specific phenotypes for a large fraction of the lincRNAs, emphasizing the fact that they are likely to be functional in particular nutrient/drug/stress conditions, acting in cis but also in trans.

      As I wrote in the summary above, I think that this study is important and constitutes a significant contribution in the lncRNA field.

      My field of expertise: long non-coding RNAs, yeast, genetics.

      \*Referee Cross-commenting** *

      I can see that reviewer #1 and I have raised the same concerns about the lack of insert sequencing for the overexpression plasmids, which is crucial to control that the correct lincRNAs were cloned and that no mutation has been introduced by the PCR. We are also both asking for RT-qPCR controls to show that the lincRNAs are indeed overexpressed. Again, this control is very important as many long non-coding RNAs are rapidly degraded by the nuclear and/or ctyoplasmic RNA decay machineries. So expressing a lincRNA from a plasmid, under the control of a strong promoter, does not guarantee increased RNA levels.

      I see that reviewer #2 is asking for a gametogenesis assay. I think it should be limited to the 3 lincRNAs which belong to the same sub-cluster as meiRNA.

    1. Author Response:

      Reviewer #1 (Public Review):

      Hickey et al. studied chromatin landscape changes in early Zebrafish embryos at three distinct stages: preZGA, ZGA and postZGA. Using ChIP-seq on these time-course samples, they examined developmental genes at their regulatory elements, including promoters and enhancers, that carry nucleosomes enriched with histone variant H2A.Z, as well as post-translational modifications H3K4me1 and H3K27ac, but with low DNA methylation, in early-stage embryos prior to turning on zygotic gene expression. During embryogenesis, this group of elements recruit a Polycomb Repressive Complex 1 (PRC1) component Rnf2 to "write" the ubiquitinated H2A or H2A.Z. The mono-ubH2A/Z then recruits a PRC2 component Aebp2 to further "write" the H3K27me3 repressive mark to silent these developmentally regulated genes in later stage embryos. Using a small molecule to inhibit Rnf2 abolishes H3K27me3 and leads to ectopic gene expression.

      Most of the data for the first half of this manuscript are presented in a clear and logic manner. The conclusions based on these correlation assays are quite obvious and well supported (except a few minor points raised below for clarifications, #2-#3). The major concern is for the second half of the manuscript where a drug is used to draw causal relationships (see point #1 below).

      1. Using small molecule could have secondary effects. It also seems that the drug-induced defects cannot be reversed after being washed away. Furthermore, this drug treatment eliminates almost all H3K27me3 genome-wide, regardless of their occupancy status with mono-ubH2A/Z, making it difficult to make the causal connection between the prerequisite mono-ubH2A/Z occupancy and the subsequent de novo H3K27me3. I think it is important for the authors to address this point more directly as this is the main conclusion of this work. Could the authors perform genetic analyses to confirm the specificity of the phenotypes?

      2. Page 8, line 160-163: "Curiously, enhancer cluster 5 (Figure 2A) was unique - displaying high H3K4me1, very high H3K27ac, and open chromatin (via ATAC-seq analysis; Figure 2 - figure supplement C, D) - but bore DNA methylation - an unusual combination given the typical strong correlation between high H3K4me1 and DNA hypomethylation." I suspect that the authors are talking about the chromatin state at pre-ZGA stage as this is the only stage DNA methylation pattern was included, but it is hard to tell that this cluster displays high H3K4me1 at all.

      We now see the confusion, and are happy to clarify this. We were intending to refer to to the histone marking at postZGA, and the DNAme at postZGA (for cluster #5) – as postZGA is the time when H3K4me1 is high, H3K27ac is very high, and DNAme remains high. The reviewer is right that we do not show the DNAme pattern at post ZGA, only preZGA. However, the DNAme pattern stays almost constant between preZGA (2.5 hpf) and postZGA (4.3 hpf) – a result we published previously in Potok et al., 2010 (note: the maternal genome shows DNA reprogramming prior to 2.5hr, and is then constant through ZGA). We did not include DNAme at every stage simply to save space in Panel A, which was getting crowded. However, to avoid the reader misunderstanding our point, we have taken care to make this clear in the revised manuscript. We thank the reviewer for raising this point.

      1. Page 10, line 206-207: "PRT4165 treatment also conferred limited new/ectopic Aebp2 peaks (Figure 4C, clusters 4, 6, 7,8)", it seems that clusters 4, 6, 7, 8 together are not "limited" compared to clusters 1, 3, and 5, and could be even more abundant.

      Thank you for this comment - we agree with the reviewer and have clarified this in the text and Figure 4. In the initial version, the section where we mention ‘limited’ additional sites was intend to refer to promoters, and although as only a modest fraction of the ectopic sites are at promoters, but we did not provide that context in the text. Indeed, if one looks at all sites in the genome, there are a large number of ectopic sites after PRT4165 treatment. This is shown clearly in the revised Figure 4 (which shows all genomic sites) and we have clarified this in the text.

      We were curious whether there is any feature that helps us understand what might unify the ectopic binding, and therefore underlie the mechanism(s). First, we tested whether binding sites for particular transcription factors might be enriched; however, we did not find a class of binding sites that represented more than 3% of the total sites. We note that others have reported some affinity of mammalian Aebp2 for DNA and some limited sequence specificity (Kim et al., NAR 2009), and in the absence of a high-affinity H2AUb target, that shadow DNA binding function may become more apparent. Furthermore, we did not observe chromatin marks that showed a highly significant degree of overlaps. Thus, although intriguing, there does not appear to yet be a logic to the ectopic binding observed.

      1. In the context of studying the chromatin state of developmental genes in early vertebrate embryos, there are two recent publications in mouse embryos which also investigated the crosstalk between mono-ubH2A and H3K27me3 at the ZGA transition in mouse (https://doi.org/10.1038/s41588-021-00821-2 and https://doi.org/10.1038/s41588-021-00820-3). It would be informative to add some discussion for comparisons between these two vertebrate organisms.

      Reviewer #2 (Public Review):

      One model for polycomb domain establishment suggests that PRC2 adds H3K27me3 first, and then recruits PRC1 for silencing. The key evidence for this model is the H3K27me3-binding module CBX proteins in canonical PRC1 complexes. This model has been revised by recent studies, and it is now well recognized that the polycomb domains can be de novo established in a different order. In other scenarios, including X inactivation, a non-canonical PRC1 complex that lacks CBX proteins catalyzes ubH2A first, and PRC2 complex is subsequently recruited through recognizing ubH2A modification by its Jarid2 and Aebp2 subunits.

      In this manuscript, Hickey and co-workers analyzed the temporal change of various epigenetic marks around ZGA stages during zebrafish early embryo development. Based on their experimental data and bioinformatic analysis, they suggest that polycomb establishment in zebrafish embryo is following the 'non-canonical' order, in which H3K27me3 establishment is dependent on ubH2A pre-deposition and the following recruitment of Aebp2-PRC2 complex. Moreover, they suggest that polycomb-silenced developmental genes are solely repressed by ubH2A, independent of H3K27me3. Overall, the functional analysis (RNF2 inhibitor experiments) conducted in the current study highlights the critical function of PRC1 and ubH2A in silencing developmental genes during early embryo development. Moreover, this study provides clues that could reconcile with the earlier observations that H3K27me3 seems largely dispensable for silencing developmental genes in zebrafish early embryo (e.g. PMID: 31488564).

      The main concern is two similar studies have just been published in Nature Genetics using mouse early embryos, and the observation of this manuscript largely agree with the two mouse studies, rendering the novelty of this study.

      In addition, certain conclusions in the manuscript requires further experimental support:

      1. While the authors claim that H3K27me3 is established after ZGA, it is quite surprising to me that they did NOT analyzed the H3K27me3 pattern before ZGA. While IF staining suggests a minimal level of H3K27me3 before ZGA (Fig1 S2B), previous ChIP-seq analysis demonstrate that H3K27me3 are present (e.g. PMID: 22137762).

      Briefly, in our own work, we do not detect H3K27me3 by IF prior to ZGA, and we could not detect H3K27me3 peaks by ChIP during preZGA (also mentioned as ‘data not shown’ in Murphy et al., 2018).

      1. While the RNF2 inhibitor experiment clearly demonstrates that PRC1 is required for the deposition of both ubH2A and H3K27me3, that does not necessarily mean that PRC1-mediated ubH2A deposition precedes H3K27me3. The establishment and maintenance of polycomb domain usually requires the crosstalk and reinforcement between polycomb complexes. Therefore, the deficiency in either PRC1 or PRC2 complex may lead to the decreased level of both marks. To clarify a hierarchical order of the polycomb domain establishment, a phenotypic analysis of PRC2 deficiency is also necessary.

      Here, we emphasize that prior to performing the inhibitor experiment, we addressed the temporal order of addition in Figure 1 and in Figure 1 – figure supplement 1. H2Aub1 is added extensively to thousands of developmental genes during preZGA, well before H3K27me3 is detected. We interpret this as evidence that H2Aub1 temporally precedes H3K27me3 during embryonic development. We will also mention (described in the Discussion) that maternal zygotic loss of Ezh2, which eliminates all H3K27me3 in the genome at all embryo stages does not result in the activation of developmental genes.

      1. Parental difference. As shown in Fig.1B, ubH2A level varies greatly in sperm and egg, which suggests that the reprogramming process of ubH2A (and perhaps H3K27me3) distribution could be significantly different for the two parental alleles. It would be interesting to analyze the ubH2A and H3K27me3 distribution in germ cells before fertilization.

      We appreciate the reviewer’s comment and agree that this would be an interesting line of inquiry. However, this would require genomics analyses from reciprocal crosses of highly polymorphic fish strains. This would involve very considerable additional work. Therefore, we will consider this in our future studies.

      1. The role of Aebp2 subunit. Given the well-characterized function of Aebp2 in recognizing ubH2A, an involvement of Aebp2-PRC2 complex in establishing H3K27me3 on PRC1 pre-deposited regions is not unexpected. Indeed, Aebp2 co-localized well with ubH2A marked regions (Fig.3). However, an issue not clarified in the manuscript is whether Aebp2 is the sole subunit for the recruitment of PRC2 to ubH2A marked regions. Paralleled analysis of the changes for Aebp2 and H3K27me3 upon RNF2 inhibitor treatment is necessary, and Aebp2-dependent and -independent regions should be separately classified for analysis.

      2. Role of PRC1 on the temporal regulation of gene expression during early development. The authors only analyzed the RNA-seq results for RNF2i treated embryos post ZGA. Therefore, it is currently not clear if the role of PRC1 in transcriptional repression is restricted to post-ZGA stages. RNA-seq analysis of RNF2i treated embryos on those stages are also warranted.

    1. Author Response:

      Reviewer #1 (Public Review):

      The model proposed here is the first large-scale model that actually performs a cognitive task, which in this case is working memory but could easily extend to decision making in general as is acknowledged by the authors. Briefly, each of the 30 areas are simulated as a rate, Wong-Wang circuit (i.e. two excitatory pools inhibit each other through a third, inhibitory population). The authors use previously collected anatomical data to constrain the model and show qualitatively match with the data, in particular how mnemonic activity emerges somewhat abruptly along the brain hierarchy.

      Strengths Previous models have focused on neural dynamics during the so-called "resting state", in which subjects are not performing any cognitive task - thus, resting. This study is therefore an important improvement in the field of large-scale modelling and will certainly become an influential reference for future modelling efforts. As typically done in large-scale modelling, some anatomical data is used to constrain the model. The model shows several interesting characteristics, in particular how distributed working memory is more resilient to distractors and how the global attractors can be turned off by inhibition of only top areas.

      Weaknesses Some of these results are not clear how they emerge, and some "biological constraints" do not seem to constrain. Moreover, some claims are slightly exaggerated, in particular how the model matches the data in the literature (which in some cases it does not) or how somatosensory working memory can be simulated by simply stimulating the "somatosensory cortex".

      This paper has two different models, one being a simplified version of the main model. However, it is not very clear what the simplified model adds the main findings, if not to show that the empirical anatomical connectivity does not constrain the full model.

      We thank the reviewer for this evaluation, and for appreciating the innovative character of our study in implementing a cognitive function in a data-constrained large-scale brain model. We hope that it will be useful for future studies planning to add cognitive functions to their large-scale models, and also for experimentalists who might benefit from this insight.

      In response to the detailed comments of the reviewer, and to address the weaknesses identified above, we have rewritten parts of the text, clarified important concepts and included a new simulations. Briefly:

      -We have clarified the nature and effects of the ‘biological constraints’ that we use. The full model that we use is indeed data-constrained, in the sense that we use real data to determine the values of many parameters. Having a data-constrained model, however, does not mean that all the results will be equally constrained. Some model results will critically depend on (some) data used to constrain the model, while other results will be more robust to changes in these parameters. We have highlighted this point and we also added explanations for each of the results presented.

      -We have corrected several claims along the text to make it more in line with experimental evidence, and included the new references suggested by the reviewer to this effect. For example, for the case of somatosensory WM mentioned by the reviewer, we have indicated that the existence of a ‘gating’ mechanism (explored in a supplementary figure) is important for achieving an accurate match with the experimentally observed effects of somatosensory stimulation.

      -Finally, we have highlighted the complementary benefits of the full and simplified models, and improved our motivation for the latter. Briefly, the simplified model allows us to identify the key ingredients needed for distributed WM (useful to generalize to other animal models), while the full model ensures that the main findings are still present when more realistic assumptions are made. A good example is the counterstream inhibitory bias, which is in principle not necessary for a simplified model but becomes a crucial factor to implement the distributed WM mechanism in our macaque model.

      Reviewer #2 (Public Review):

      There is a lot to like about this manuscript. It provides a large-scale model of a well-known phenomenon, the "delay activity" underlying working memory, our oldest and most enduring model of a cognitive function. The authors correctly state that despite the ubiquity of delay activity, there is little known about the macro and micro circuitry that produces it. The authors offer a computational model with testable hypotheses that is rooted in biology. I think this will be of interest to a wide variety of researchers just as delay activity is studied across a variety of animal models, brain systems, and behavior. It is also well-written.

      My main concern is the authors may be self-handicapping the impact of their model by not taking into account newer observations about delay activity. For a number of years now, evidence has been building that working memory is more complicated than "persistent activity" alone. Stokes, Pasternak, Dehaene, Miller and others have been mounting considerable evidence for more complex dynamics and for "activity-silent" mechanisms where memories are briefly held in latent (non-active) forms between bouts of spiking. There is also mounting evidence that the thalamus plays a key role in working memory (and attention). In particular, higher thalamic nuclei are critical for regulating cortical feedback. Cortical feedback plays a central role in the model presented here. The model presented in this manuscript just deals with persistent attractor states and the cortex alone.

      This is not to say that this manuscript does not have good value as is. No one disputes that some form of elevated, sustained, activity underlies working memory. This work adds insights into how that activity gets sustained and the role of, and interactions between, different cortical areas. The observation that the prefrontal and parietal cortex are more critical than other areas, that there are "hidden" attractor states, and "counterstream inhibitory bias" are important insights (and, importantly, testable). They will likely remain relevant even as the field is moving beyond persistent attractor states alone as the model for working memory. The new developments do not argue against the importance of delay activity in working memory. They show that it is more to the story, as inevitably happens in brain science.

      The authors do include a paragraph in the Discussion referencing the newer developments. Kudos to them for that. However, it presented as "new stuff to address in the future". Well, that future is now. These "newer" developments have been mounting over the past 10 years. The worry here is that by relying so heavily on the older persistent attractor dynamics model and presenting it as the only model, the authors are putting an early expiration date on their work, at least in terms of how it will be received and disseminated.

      We thank the reviewer for a careful and positive evaluation of our work. We consider that the main point raised here is indeed crucial: classical explanations of WM based on elevated and constant firing are an important part of the story, however other alternative or complementary approaches developed in the past years also deserve attention. These approaches include, to name a few, activitysilent mechanisms (Mongillo et al. 2008, Trübutschek et al. 2017), dynamic hidden states (Wolff et al. 2017), persistent activity without feedback (Goldman 2009), and paradigms relying on gamma bursts (Miller et al. 2018).

      It’s important to highlight, however, that our approach is “attractor network theory” not “persistent activity theory”, and an attractor does not have to be a steady state (tonic firing) but may display complex spatiotemporal patterns (fluid turbulence with tremendously rich temporal dynamics and eddies on many spatial scales is an attractor). We now have largely eliminated the use of “persistent” in the manuscript. On the other hand, for lack of a better word it’s fine to still use that term, if it is understood in a more general sense, which also includes stable representations in which the activity of individual neurons varies along the delay period (Goldman, 2009; Murray et al. 2017) or rhythmic activity which persists over time (Miller et al. 2018). The attractor network theory should be contrasted conceptually with mechanisms based on intrinsically transient memory traces (see Wang TINS 2021 for a more elaborated discussion on this).

      Our proposal for distributed WM has a general aim and it’s not restricted to the classical ‘elevated constant firing’ scenario. Following the reviewer’s suggestion, we have rewritten the text to make sure that multiple mechanisms of WM are acknowledged in different parts of the text, not only on a paragraph in the discussion. We have also acknowledged the importance of thalamocortical interactions and cited previous relevant studies in this sense (such as Guo et al. 2017), also as a response to comments from Reviewer 1.

      In addition, we have attempted to go beyond a simple rewriting and, using a variation of our simplified model, we now show that distributed WM representations can also happen in the context of activitysilent models (Figure 3 –figure supplement 1). In particular, we use a simplified network model with reduced local and long-range connectivity strength and incorporate short-term synaptic facilitation in synaptic projections. Our model results show that, while activity-silent memory traces can’t be maintained when areas are isolated from each other, inter-areal projections reinforce the synaptic efficacy levels and lead to a distributed representation via activity-silent mechanisms.

      We hope that this result serves to prove the generality of our distributed WM framework, and opens the door to subsequent studies focusing not only on distributed activity-silent mechanisms, but in distributed frameworks relying on other WM mechanisms as well.

    1. Author Response:

      Reviewer #3 (Public Review):

      The paper contains a substantial amount of novel experimental work, the experiments appear well done, and the analysis of the data makes sense. Raw data and analysis scripts have been made fully available.

      I have two specific comments:

      • While the paper talks extensively about deep mutational scanning, I don't think this is a deep mutational scanning study. In deep mutational scanning, we usually make every possible single-point mutation in a protein. This is not what was done here, as far as I can tell.

      In the revised manuscript, we have avoided using deep mutational scanning to describe our experimental design. Instead, we described our approach as “a high-throughput experimental approach that coupled combinatorial mutagenesis and next-generation sequencing”

      • For the analysis of epistasis vs distance (Fig 4d, e, f), it would be better to look at side-chain distances rather than C_alpha distances. In covariation analyses, it can be seen that C_alpha distances are not a good predictor of pairwise interactions. Similar patterns may be observable here.

      See e.g.: A. J. Hockenberry, C. O. Wilke (2019). Evolutionary couplings detect side-chain interactions. PeerJ 7:e7280.

      Thank you for the suggestion. In the revised manuscript, we replaced the Cα analysis by a side-chain analysis according to Hockenberry and Wilke (see response to Essential Revisions above).

  6. wt3fall2021.commons.gc.cuny.edu wt3fall2021.commons.gc.cuny.edu
    1. And then we go right in after you see after that montage then you see it's Sideshow Bob in court

      This kind of dialogue between friends discussing something they all love is relatable. However, I find it quite odd that they can recount detail by detail, montage by montage, as if they have each watched it more than a thousand times. Its not naturalistic at times and forces me to think they may have been restricted from such entertainments in their futuristic world. Why are they discussing the Simpsons in such great detail? Is this to feed the audience with information that will be later relevant in Act 3 (As they are noted to wear Simpson costumes) Hmmm we will see

    1. The diversity of human values and the methods by means of which they may be realized is so vast, and many of them remain so unacknowledged, that they cannot fail but lead to conflicts in human relations.  Indeed, to say that human relations at all levels -- from mother to child, through husband and wife, to nation and nation -- are fraught with stress, strain, and disharmony is, once again, making the obvious explicit.  Yet, what may be obvious may be also poorly understood. This I think is the case here.  For it seems to me that -- at least in our scientific theories of behavior -- we have failed to accept the simple fact that human relations are inherently fraught with difficulties and that to make them even relatively harmonious requires much patience and hard work. I submit that the idea of mental illness is now being put to work to obscure certain difficulties which at present may be inherent -- not that they need be unmodifiable -- in the social intercourse of persons.  If this is true, the concept functions as a disguise; for instead of calling attention to conflicting human needs, aspirations, and values, the notion of mental illness provides an amoral and impersonal "thing" (an "illness") as an explanation for problems in living (Szasz, 1959).  We may recall in this connection that not so long ago it was devils and witches who were held responsible for men's problems in social living.  The belief in mental illness, as something other than man's trouble in getting along with his fellow man, is the proper heir to the belief in demonology and witchcraft. Mental illness exists or is "real" in exactly the same sense in which witches existed or were "real."  

      This section sets the tone for ensuring that we do not disguise what would be a normal problem with day to day efforts in society as a mental illness. This is hugely impactful on modern society and is key again in treatment. Professionals must be able to decipher the reality of mental illness from daily life struggles. To be fair, any truly trained psychologist or psychiatrist should be able to do this given they have the proper training and credentials. This however is something that must be at the forefront of a professionals mind.

    2. To recapitulate: In actual contemporary social usage, the finding of a mental illness is made by establishing a deviance in behavior from certain psychosocial, ethical, or legal norms.  The judgment may be made, as in medicine, by the patient, the physician (psychiatrist), or others.  Remedial action, finally, tends to be sought in a therapeutic -- or covertly medical -- framework, thus creating a situation in which psychosocial, ethical, and/or legal deviations are claimed to be correctible by (so-called) medical action.   Since medical action is designed to correct only medical deviations, it seems logically absurd to expect that it will help solve problems whose very existence had been defined and established on nonmedical grounds.  I think that these considerations may be fruitfully applied to the present use of tranquilizers and, more generally, to what might be expected of drugs of whatever type in regard to the amelioration or solution of problems in human living.  

      If we are to advance we must question who is making the standards which is something that is being questioned here by the author. The problem that the author is showing is the definitions of illness along with health regarding someone's mental state. This is also showing a disdain for the idea of medicating someone on grounds that are not established within physicality. This methodology is rather skewed in accordance to modern testing and understanding but serves as a stellar check and balance to those who intend to practice now.

    3. "Mental illnesses" are thus regarded as basically no different than all other diseases (that is, of the body).  The only difference, in this view, between mental and bodily diseases is that the former, affecting the brain, manifest themselves by means of mental symptoms; whereas the latter, affecting other organ systems (for example, the skin, liver, etc.), manifest themselves by means of symptoms referable to those parts of the body.  This view rests on and expresses what are, in my opinion, two fundamental errors. In the first place, what central nervous system symptoms would correspond to a skin eruption or a fracture?  It would not be some emotion or complex bit of behavior. Rather, it would be blindness or a paralysis of some part of the body. The crux of the matter is that a disease of the brain, analogous to a disease of the skin or bone, is a neurological defect, and not a problem in living. For example, a defect in a person's visual field may be satisfactorily explained by correlating it with certain definite lesions in the nervous system.  On the other hand, a person's belief -- whether this be a belief in Christianity, in Communism, or in the idea that his internal organs are "rotting" and that his body is, in fact, already "dead" -- cannot be explained by a defect or disease of the nervous system.  Explanations of this sort of occurrence -- assuming that one is interested in the belief itself and does not regard it simply as a "symptom" or expression of something else that is more interesting -- must be sought along different lines. The second error in regarding complex psycho-social behavior, consisting of communications about ourselves and the world about us, as mere symptoms [p. 114] of neurological functioning is epistemological.  In other words, it is an error pertaining not to any mistakes in observation or reasoning, as such, but rather to the way in which we organize and express our knowledge. In the present case, the error lies in making a symmetrical dualism between mental and physical (or bodily) symptoms, a dualism which is merely a habit of speech and to which no known observations can be found to correspond. Let us see if this is so. In medical practice, when we speak of physical disturbances, we mean either signs (for example, a fever) or symptoms (for example, pain). We speak of mental symptoms, on the other hand, when we refer to a patient's communications about himself, others, and the world about him.  He might state that he is Napoleon or that he is being persecuted by the Communists. These would be considered mental symptoms only if the observer believed that the patient was not Napoleon or that he was not being persecuted[sic] by the Communists. This makes it apparent that the statement that "X is a mental symptom" involves rendering a judgment. The judgment entails, moreover, a covert comparison or matching of the patient's ideas, concepts, or beliefs with those of the observer and the society in which they live.  The notion of mental symptom is therefore inextricably tied to the social (including ethical) context in which it is made in much the same way as the notion of bodily symptom is tied to an anatomical and genetic context (Szasz, 1957a, 1957b). To sum up what has been said thus far: I have tried to show that for those who regard mental symptoms as signs of brain disease, the concept of mental illness is unnecessary and misleading.  For what they mean is that people so labeled suffer from diseases of the brain; and, if that is what they mean, it would seem better for the sake of clarity to say that and not something else.

      This component of the passage hold great value via the idea of not having any physical implications that can be reversed to the naked eye. While flawed as mindset, this is something that we must regularly keep in mind as psychologist when attempting to treat patients. When looking into the mind and the effects that we have on society, we must think of how we are going to better the interactions with those around us. Many members of society that suffer from mental illness are in fact almost incapable due to chemical imbalance. This is something that is in fact falsifiable due to testing and treatment results (Parekh, 2018). This is still a great thought process to keep in mind especially for the year it was released.

      Reference, Parekh, R. (2018, August). What Is Mental Illness? What is mental illness? Retrieved November 28, 2021, from https://www.psychiatry.org/patients-families/what-is-mental-illness.

    1. we have a very 00:38:26 unwieldy process of more than close to 200 countries with very stark differences sometimes and very different starting points so i think all of this doesn't really 00:38:39 make a good sort of negotiation process and if we if we go to the next cup my sense is that the process is extremely slow and we are 00:38:50 more or less at say setting ourselves up for failure but also you know we are going to one cup after another we with a great sense of a predictability of something that we know it's not going 00:39:03 to work at the pace at which it needs to work

      countries negotiating may not be as effective as working at the individual / civil society level to appeal to the wealthy demographics, who are responsible for the lions share of emissions.

    1. Reviewer #1 (Public Review):

      Todesco et al. investigate the genetic causes of variation in UV pigmentation in sunflowers as well as the possible biotic and abiotic factors that play a role in natural variation for the trait among populations. Overall I am very enthusiastic about this manuscript as it does an elegant job of going from phenotype to a key locus and then presenting a solid foray into the factors causing variation. I have only a fe relatively minor comments.

      The introduction felt a bit short. I was hoping early on I think for a hint at what biotic and abiotic factors UV could be important for and how this might be important for adaptation. A bit more on previous work on the genetics of UV pigmentation could be added too. I think a bit more on sunflowers more generally (what petiolaris is, where natural pops are distributed, etc.) would be helpful. This seems more relevant than its status as an emoji, for example.

      The authors present the % of Vp explained by the Chr15 SNP. Perhaps I missed it, but it might be nice to also present the narrow sense heritability and how much of Va is explained.

      A few lines of discussion about why the Chr15 allele might be observed at only low frequencies in petiolaris I think would be of interest - the authors appear to argue that the same abiotic factors may be at play in petiolaris, so why don't we see this allele at frequencies higher than 2%? Is it recent? Geographically localized?

      Page 14: It's unclear to me why there is any need to discretize the LUVp values for the analyses presented here. Seems like it makes sense to either 1) analyze by genotype of plant at the Chr15 SNP, if known, or 2) treat it as a continuous variable and analyze accordingly.

      Page 14: I'm not sure you can infer selection from the % of plants grown in the experiment unless the experiment was a true random sample from a larger metapopulation that is homogenous for pollinator preference. In addition, I thought one of the Ashman papers had actually argued for intermediate level UV abundance in the presence of UV?

      I would reduce or remove the text around L316-321. If there's good a priori reason to believe flower heat isn't a big deal (L. 323) and the experimental data back that up, why add 5 lines talking up the hypothesis?

      Page 17: The discussion of flower size is interesting. Is there any phenotypic or genetic correlation between LUVP and flower size?

    1. Author Response:

      Reviewer #1:

      Summary:

      Moody et al. presented a comprehensive investigation into the choice of marker genes and its impact on the reconstruction of the early evolution of life, especially regarding the length of the branch that separates domains Bacteria and Archaea in the phylogenetic tree. Specifically, this work attempts to resolve a debate raised by a previous work: Zhu et al. Nat Commun. 2019, that the evolutionary distance between the two domains is short as estimated using an expanded set of marker genes, in contrast to conventional strategies which involve a small number of "core" genes and indicate a long branch.

      Through a series of analyses on 1000 genomes, Moody et al. defended the use of core genes, and reinforced the conventional notion that the inter-domain branch (the AB branch) is long, as inferred by the core gene set. They proposed that with the 381 marker genes (the "expanded" set) used by Zhu et al., the observed short branch length is an artifact due to inter-domain gene transfer and hidden paralogy. Through topology tests, they ranked the markers by "verticality", and showed that it is positively correlated with the AB branch length. They also conducted divergence time estimation and showed that even the most vertical genes led to an implausible estimate of the origin of life.

      In parallel, Moody et al. surveyed the best marker genes using a set of 700 genomes. They recovered 54 markers, and demonstrated that ribosomal markers do not indicate a longer AB branch than non-ribosomal markers do. With the better half (27) of these marker genes, they conducted further phylogenetic analyses, which shows that potential substitutional saturation and the use of site-homogeneous models could contribute to the underestimation of the AB branch. Using this taxon set and marker set, they reconstructed the prokaryotic tree of life, which revealed a long AB branch, a basal placement of DPANN in Archaea, and a derived placement of CPR in Bacteria.

      Prokaryotic tree of life:

      The scope(s) of the manuscript is somehow split. First, it is posed as a point-to-point rebuttal to the Zhu et al. paper, on the long vs. short AB branch question. Second, it introduces a new phylogeny of prokaryotes using 27 "good" marker genes, and demonstrates that DPANN is basal to Archaea, and CRP is derived within Bacteria.

      Thanks for the summary. The two aspects of the manuscript identified by the reviewer are closely related, because the different issues boil down to the same underlying question: which genes should we use to infer the deep structure of the tree of life? The provocative work of Zhu et al. acted as an impetus to compare and evaluate the properties of several published marker gene sets, and then to identify (what our analyses suggest are) the subset best-suited for deep phylogeny, which we then use to infer an updated tree of life. We have clarified this logical structure in the revised manuscript, writing (at the end of the Introduction):

      “Here, we investigate these issues in order to determine how different methodologies and marker sets affect estimates of the evolutionary distance between Archaea and Bacteria. First, we examine the evolutionary history of the 381 gene marker set (hereafter, the expanded marker gene set) and identify several features of these genes, including instances of inter-domain gene transfers and mixed paralogy, that may contribute to the inference of a shorter AB branch length in concatenation analyses. Then, we re-evaluate the marker gene sets used in a range of previous analyses to determine how these and other factors, including substitutional saturation and model fit, contribute to inter-domain branch length estimations and the shape of the universal tree. Finally, we identify a subset of marker genes least affected by these issues, and use these to estimate an updated tree of the primary domains of life and the length of the stem branch that separates Archaea and Bacteria.”

      The second scope has inadequate novelty. A recent paper (Coleman et al. Science. 2021), which was from a partially overlapping group of authors, was dedicated to the topic of CPR placement, and indicated the same conclusion (CPR being derived and sister to Chloroflexi) as the current work does, albeit using more sophisticated approaches. The paper also addressed the debate of CPR placement (including citing the Zhu et al. paper). Additionally, the basal placement of DPANN has also been suggested by previous works (such as Castelle and Banfield. Cell. 2018). Therefore, re-addressing these two topics using a largely well-established and repeatedly adopted method on a relatively small taxon set does not constitute a significant extension of current knowledge.

      We disagree. Resolving the deep structure of the tree of life is an important topic --- this is what we, Zhu et al. (2019), and of course many others have been trying to achieve, in different and sometimes conflicting ways. Most of the published work is based on limited or biased taxon sampling (see Figure 1 Figure Supplement 14,15,16) or else focused on just one of the two prokaryotic domains of life. Furthermore, deep phylogeny is uncertain, and new results become convincing only when they receive support from multiple datasets and approaches. For instance, Coleman et al. (2021) recently found support for the placement of CPR as a sister clade to Chloroflexota rather than as a basal branch within the Bacteria. Notably, this work focused only on Bacteria, and made use of a different rooting method (with its own strengths and limitations) and taxon sampling. Most previous analyses using Archaea as an outgroup to root the bacterial tree recovered CPR as a deeply branching lineage within Bacteria, a placement likely resulting from LBA. In turn, our present findings represent an important confirmation of the CPR+Chloroflexi clade. Similarly, the basal placement of DPANN within Archaea remains controversial despite a number of studies on the topic, and our study also contributes to that ongoing debate.

      The debate:

      The first scope appears to be the more important goal of this manuscript, as it extensively discusses the claims made by Zhu et al. and presents a point-to-point rebuttal, including counter evidence. This may narrow the interest of this work to a small audience of specialists. Nevertheless, to best evaluate the current work, it is necessary to review the Zhu et al. paper and compare individual analyses and conclusions of the two studies.

      In doing so, I found that the two articles have distinct scopes that appear similar but not actually inline. To a large extent, the current work does not constitute actual rebuttal to the points made by Zhu et al. In contrast, some of the analyses presented in the current work support those by Zhu et al., despite being interpreted in a different way. For the claims that directly contest Zhu et al., I do not see sufficient evidence that they are supported by the analyses.

      Below is a summary of the comparison, which I will explain point-by-point in later paragraphs.

      • Moody et al. assessed AB branch length, while Zhu et al. assessed AB evolutionary distance (which is different).
      • Moody et al. evaluated the phylogeny indicated by a small number of core markers, while Zhu et al. evaluated the genome average using hundreds of global markers.
      • Zhu et al.'s results also showed that gene non-verticality, substitutional saturation, and site-homogeneous models shorten the AB distance, which is consistent with Moody et al.'s.
      • However, Zhu et al. found that some core markers are outliers in the genome-wide context, and the long AB distance indicated by them cannot be compensated for by the aforementioned effects. Moody et al. hasn't addressed this. Therefore, the novelty and potential impact of the current work is less compelling: It used a classical method (a few dozen core genes) and found a pattern that has been found many times by some of the same authors and others (including Zhu et al., who also analyzed core genes).

      Thanks for this detailed comparison of the two studies --- the points raised here and elaborated on below have prompted us to perform additional analyses which provide further insight into the properties and behaviour of the various marker gene sets analyzed. We nonetheless disagree that “the current work does not constitute actual rebuttal to the points made by Zhu et al.”: our finding that ribosomal and other “core” proteins are among the best phylogenetic markers for resolving both within- and between-domain relationships, estimating the length of the AB stem, and performing divergence time estimation, challenges an important claim of Zhu et al.’s study, and will be of broad interest to the community of researchers working on early life/early evolution.

      That said, we do also agree that one aspect of the disagreement between our study and that of Zhu et al. has to do with what is meant by evolutionary distance, and we have now discussed these issues in detail in the revised manuscript (as detailed below). In revising the manuscript, we have also sought to avoid a reductive focus on rebuttal, have revised the text to acknowledge important strengths and interesting features of the Zhu et al. analyses, and have made text revisions to ensure a consistent constructive tone: these are fundamental and challenging questions, and different perspectives and analyses are valuable in making progress. We also note that there has been an ongoing debate about the suitability of ribosomal genes for deep phylogeny in the literature (e.g. Petitjean et al. 2014, discussed in more detail below). Our analyses, and those of Zhu et al. (2019) previously, contribute to that broader discussion.

      Detailed responses to each of the above points follow below.

      AB distance metric:

      There is a subtle but critical difference between the scopes of the two papers: The Zhu et al. paper "reveals evolutionary proximity between domains Bacteria and Archaea". By stating "evolutionary proximity", it investigated two metrics: The length of the branch separating Archaea from Bacteria in the phylogenetic tree, i.e., the "AB branch". This was the main focus of the current work.

      The average tip-to-tip distance (sum of branch lengths) between pairs of Archaea and Bacteria taxa in the tree. A significant proportion of the Zhu et al. work was discussing this metric, and it led to several important conclusions (e.g., Figs. 4F, 5). The current work has not explored this metric.

      Thanks for raising the point about relative AB distance. In our revised manuscript, we have expanded Figure 1 and the associated analyses to include this metric. These analyses demonstrate that relative AB distance behaves similarly to AB branch length: they are positively correlated with each other; both are reduced by inter-domain HGT, and both are negatively correlated with ΔLL and with split score, an additional metric of within- and between-domain marker gene verticality which we have included in the revised Figure 1. Taken together, these results suggest that high-verticality marker genes (as judged both by the recovery of reciprocal AB monophyly, and of established within-domain relationships) support a longer AB branch and show a higher relative AB distance.

      These two metrics implicate distinct research strategies: For 1), HGTs and paralogy are usually considered problematic (as the current and many previous works argued). However, 2) is naturally compatible with the presence (and prevalence) of HGTs and paralogy.

      Authors of the current work equate "genetic distance" to "branch length" (line 70), and only investigated the latter. This equation is misleading. If organism groups A and B diverged early, but then exchanged many genes post-divergence, then this is indisputable evidence that their "genetic distance" is close. This point needs to be clearly explained in the manuscript.

      We agree with the reviewer that various definitions of evolutionary distance are possible, and some may be more useful than others for particular applications. The reviewer’s argument that “If organism groups A and B diverged early, but then exchanged many genes post-divergence, then this is indisputable evidence that their "genetic distance" is close” makes the case for a kind of phenetic distance: a distance based on overall similarity, regardless of how that similarity was brought about in terms of evolutionary process. We appreciate the democratic appeal of such a metric, and we have no desire to impose any particular philosophy of classification on the reader. However, the key point here is that methods that rely on concatenation for branch length or divergence time estimation (as used by Zhu et al., and in our current study) make the assumption that all of the sites in the concatenate evolved on the same underlying tree and if this assumption is not met, analyses can be misled. Thus, the shorter AB branch length and the more recent Archaea-Bacteria divergence times estimated from concatenations of incongruent marker genes result from unmodelled gene transfers which are misinterpreted as evidence for more recent common ancestry. Gene transfer is an important aspect of genome evolution, but none of the currently available methods, including those used by Zhu et. al., allow for genome-scale comparisons to be made in a way that accounts for our understanding of the underlying evolutionary processes.

      The point about different possible definitions of evolutionary distance made by the reviewer is valid, and we have now revised the opening of our conclusion to discuss these issues in more detail, writing:

      “We note that alternative conceptions of evolutionary distance are possible; for example, in a phenetic sense of overall genome similarity, extensive HGT will increase the evolutionary proximity (Zhu et al., 2019) of the domains so that Archaea and Bacteria may become intermixed at the single gene level. While such data can encode an important evolutionary signal, it is not amenable to concatenation analysis.”

      Core vs genome:

      This difference between "AB distance" and "AB branch length" is relevant to a more fundamental question: What defines the "evolutionary distance" between two groups of organisms? Both papers did not explicitly discuss this topic. It likely cannot be resolved in one article (as many scholars have continuously attempted on related topics in the past decades). But the discordance in understanding led to very different research strategies in the two papers, and rendering them incongruent in methodology.

      Specifically, the current work (and multiple previous works) based phylogenetic inference on only genes that demonstrate a strong pattern of vertical evolution. HGTs were considered deleterious, and needed to be excluded from the analysis. This left a few dozen genes at most, and many are spatially syntenic and functionally related (e.g. ribosomal proteins). In this work, the final number is 27. Previous critiques of this methodology have suggested that this is not a tree of life, but a "tree of one percent" (Dagan and Martin, Genome Biol. 2006).

      In contrast, Zhu et al. (and related previous works) attempted to evaluate the evolution of whole genomes by "maximizing the included number of loci.". They used a "global" set of 381 genes. They faced the challenge of "reconciling discordant evolutionary histories among different parts of the genome", because "HGT is widespread across the domains". To resolve this, they adopted the gene tree summary method ASTRAL.

      Therefore, the "AB distance" estimated by Zhu et al. is a genome-level distance, calculated by merging conflicting gene evolutions (which itself can be disputed, see below). Whereas the "AB branch" evaluated in this work is strictly the branch length in the core gene evolution. Therefore, the results presented in the two papers do not necessarily conflict, because of the different scopes.

      This point is closely related to the previous one, and the new section (final paragraphs of the Conclusion, quoted directly above) goes some way to addressing this comment. Regarding the issue of a focus on just a small proportion of vertically-evolving genes, the critical point is as above: current methods for branch length and divergence time estimation (including those used by Zhu et al.) require such vertically-evolving genes, because they make the assumption that all of the sites evolve on the same tree, i.e. trace back to the same origin via vertical evolution. We agree that most prokaryotic gene families do not evolve under these restrictive assumptions and therefore cannot be analysed using concatenation methods for branch length estimation. Indeed, one of the main points of our study is that most of the genes in the 381-gene set of Zhu et al. do not meet these assumptions and are thus unsuited for estimating evolutionary distance and divergence times.

      There is much ongoing method development which will allow more of the genome to be used in deep-time comparative analyses; Astral-Pro, FastMulRFS and SpeciesRax, among others, are recent promising steps in this direction. However, our central critique of Zhu et al. is that inferences under concatenation-based methods can be misled by HGT and other sources of incongruence, and indeed our analyses show that these unmodelled signals underlie the difference between the conclusions of Zhu et al. and other studies (e.g. (Liu et al., 2021; Spang et al., 2015; Williams et al., 2020) that have instead supported a deep divergence between Archaea and Bacteria. In our revised manuscript, we have shown that the relative AB distance, like the AB branch length, is shortened by unmodelled gene transfers (Figure 1), and that estimates of the AB stem length from different studies are similar when the congruent subset of the data is analysed with the best available substitution models (Figure 6). We therefore disagree that the scopes are distinct: richer, broader measures of genomic diversity can be proposed and, with the development of new methods, estimated; but so far, the vertical signal is the only signal that can be harnessed to infer divergence times using concatenations.

      The expanded marker set:

      The authors made a valid critique (line 121-135) that many of the 381 genes in the "expanded marker set" adopted by Zhu et al., are under-represented in Archaea. According to the PhyloPhlAn paper (Segata et al. Nat Commun. 2013) which originally developed the 400 markers (a superset of the 381 markers), these genes were selected from ~3,000 bacterial and archaeal genomes available in IMG at that time time (note that it was 2013). Zhu et al. also admitted, in the discussion section, that this marker set falls short in addressing some questions (such as the placement of DPANN). What is important in the current context, is that they were not specifically selected to address the AB distance question.

      We agree that the taxon sampling of archaea and the choice of marker genes in the Zhu et al. study were not ideal for estimating the evolutionary distance between the domains. However, we note that this distance (or proximity), and the hypothesis that traditional core genes over-estimate the Archaea-Bacteria divergence, was one of the main results of the paper (c.f. the title of that paper, “Phylogenomics of 10,575 genomes reveals evolutionary proximity between domains Bacteria and Archaea”).

      However, note that Zhu et al.'s Fig. 5A, B presented the AB distance informed by 161 out of the 381 genes. These genes have at least 50% taxa represented in both domains - the same threshold discussed in the current work (line 132).

      While the 50% sampling criterion indeed enriches for the genes of the expanded set that were present in LUCA and on the AB branch, we note that the 50% criterion represents a minimum of 4953 bacteria and 335 archaea; that is, it still reflects the unbalanced sampling of the dataset overall. For example, 30 of the genes had fewer than two archaeal homologues, and in 100 of the trees there were fewer than 50 archaea reflecting the large disparity in taxon sampling (Supplementary Information Table S1). The phylogenetic signal in these genes is discussed in more detail below. Looking at the subsampled versions of these 161 genes, we found the majority of these genes (123/161) to have no discernible AB branch length. The 38/161 genes which had an arguable AB branch length (but still with transfers/paralogs) possessed a range of AB lengths: 0.0814:5.26, with a mean AB length of 1.03 and a median of 0.635.

      Even with those sufficiently represented genes, they still found that ribosomal proteins and a few other core genes are "outliers" in the far end of the AB distance spectrum.

      The reviewer raises an interesting point about outliers with high relative AB distances, which gets to the heart of the debate about how best to estimate the evolutionary distance between Archaea and Bacteria. The new analyses of relative AB distance introduced in our revised manuscript (Figure 1) demonstrate that this metric is affected by HGT in a similar manner to AB branch length (that is, high-verticality marker genes have a greater relative AB distance (relative AB vs ΔLL: p = 0.0001051 & R = -0.2213292, relative AB vs between-domain split score: p = 2.572e-06 & R = -0.2667739). Thus, core genes can be viewed as “outliers” compared to other prokaryotic genes in the sense that they have experienced an unusually low amount of HGT. This high verticality makes them among the few prokaryotic gene families that can be analysed by concatenation methods, which make the assumption that all sites evolve on the same underlying tree topology.

      Domain monophyly in gene trees:

      The authors' efforts in manually checking the gene trees are appreciable (Table S1), considering the number and size of those trees. They found (line 147) "Archaea and Bacteria are recovered as reciprocally monophyletic groups in only 24 of the 381 published (Zhu et al., 2019) maximum likelihood (ML) gene trees of the expanded marker set."

      The domain monophyly check was valid, however the result could be misleading because any sporadical A/B mixture was considered evidence of non-monophyly for the entire gene tree. As the taxon sampling grows, the opportunity of observing any A/B mixture also increases. For example, in Puigbò et al. J. Biology. 2009, 56% (a much higher ratio) of nearly universal genes trees had perfect domain monophyly based on merely 100 taxa. This is because even the "perfect" marker genes (such as ribosomal proteins) are not completely free from HGTs (e.g., Creevey et al. Plos One. 2011), let alone the fact that there are many artifacts in the published reference genomes (Orakov et al. Genome Biol. 2021).

      Therefore, to have an objective assessment of this topic, it would be better to have a metric that allows some imperfection and reports an overall "degree" of separation (also see below).

      We agree that complementing the monophyly check with a more nuanced metric is useful. In our revised manuscript, we now also evaluate the split score (Dombrowski et al. (2020) Nat Commun) of each marker, which reflects the degree to which a gene recovers the monophyly of established taxonomic ranks (a higher score reflects the splitting of monophyletic groups into a number of smaller clades in the gene tree, and so the metric permits a degree of “imperfection”, as suggested; in addition, the metric is averaged over bootstrap replicates, so that lack of resolution or poorly-supported disagreements with the reference taxonomy do not disproportionately affect the score). This expanded analysis (Figure 1) indicates that both within- and between-domain split score and ΔLL are significantly positively correlated (R = 0.836679, p < 2.2✕10-16), and that phylogenetic markers that more strongly reject domain monophyly (higher delta-LL) also perform worse at recovering between-domain (and within-domain) relationships (higher split score) and support a shorter AB branch length.

      AB branch by gene: correlation and outliers

      Figure 1 is the single most important result in this work, because it argues that the short AB branch observed in Zhu et al. is an artifact due to "inter-domain gene transfer and hidden paralogy" (line 202). This argument is based on the observation that the indicated AB branch length is negatively correlated with "verticality" (measured by ΔLL and split score) of the gene.

      Our argument that the short AB branch results from inter-domain gene transfer and hidden paralogy is based on three main lines of evidence: (i) documentation of extensive transfers and intermixing of paralogues in the gene trees for the 381 gene set; (ii) the analyses in Figure 1, which demonstrate that verticality positively correlates with AB branch length and AB distance; (iii) the demonstration that the incremental addition of low-verticality markers to a concatenate results in a concomitant decrease in AB branch length.

      However, Zhu et al. also investigated the impact of verticality on AB distance, and they also found that they are negatively correlated (Fig. 5E). Therefore, the current result does not appear to deliver new information (as do multiple other analyses, see below).

      Zhu et al. indeed identified a weak positive relationship between gene verticality and AB distance. Our analyses go beyond that work by showing, using a variety of complementary metrics of verticality, that AB branch length and relative AB distance are strongly positively correlated with verticality (see Figure 1), and that the low verticality of the genes in the 381 gene set largely explains the difference in stem length inference between that dataset and earlier analyses (Figure 6). An additional factor not considered in the analyses of Zhu et al. was the question of whether a gene was present in LUCA, and so can provide information on the AB branch length. Our analyses (detailed below) suggest that the majority of genes (317) in the 381 gene set do not contain an unambiguous AB branch, and so do not contribute interpretable signal to estimates of the AB branch length.

      An important finding in Zhu et al., which is largely not discussed in the current work, is that a handful of "core" genes are outliers in the spectrum of AB distance, as compared to the majority of the genome (Fig. 5A). The AB distance indicated by these core genes is so long compared with the genome average that it cannot be compensated for by the impact of non-verticality, substitutional saturation, site-homogeneous model, etc (see below).

      Fig. 1A of the current work also clearly shows that many long-AB branch genes are outliers compared with the majority of the genome (the bottom of the blue bar).

      Figs. 3 and 4 attempted to show that ribosomal proteins are not outliers, but that analysis was based on a very small set of core genes, and the figures clearly show that there are outliers even in this small set (to be further discussed below).

      This comment re-iterates the reviewer’s earlier points about “core” genes as outliers compared to the majority of the genome. The key issue is that “most of the genome”, and a significant portion (317 genes) of the 381-gene set, contain features that make them unsuitable for estimation of AB branch length by concatenation, or indeed estimation of an interpretable relative AB distance. We have documented the cases of HGT and mixing of paralogues in the 381-gene dataset; this information is summarised in the main text and presented in more detail in Supplementary Information Table S1.

      Focusing on the 161 genes with >50% representation in both Archaea and Bacteria, manual inspection of gene trees inferred on the 1000-species subsample under the LG+G+F model indicate that 123/161 do not have a clear AB branch (that is, a branch that separates most or all Archaea from Bacteria). While distinguishing such cases from early gene transfers is not straightforward, there is no compelling reason to think that these genes were present in LUCA. The simplest explanation for these gene phylogenies is instead an origin within Bacteria and subsequent transfer on one or multiple occasions into Archaea. As a result, estimates of AB branch length or relative AB distance inferred from these genes cannot be straightforwardly compared to those of the traditional “core” or other genes for which the evidence of a pre-LUCA origin is stronger. Considering only the 38/161 genes for which a LUCA origin appears, from the gene phylogeny, to be likely, the mean AB branch length is 1.03, greater than that estimated from the concatenation of the most vertical genes in the expanded set (0.56), and suggesting that phylogenetic incongruence, combined with (for some families) a more recent origin explains the shorter AB distances inferred from the 381 gene set. Thus, it is not the case that the AB branch lengths (or relative distances) estimated from the majority of genes form a null distribution” against which “core” genes can be seen as outliers; instead, our analyses suggest that “core” genes are among the limited number of genes that trace vertically to LUCA.

      Regarding Figures 3 and 4, see the more detailed discussion below.

      Verticality is not causative of short AB branch:

      In spite of the outlier question, there is an important logic problem in these analyses: The authors observed that gene verticality (measured by negative ΔLL) is correlated with AB branch length (Fig. 1), and concluded that HGTs and paralogy shortened the AB branch (line 202). However, they did not directly assess the rate of evolution in this model. It is totally possible that the most vertical genes happen to be those that evolved faster at the AB split. In order to support the claim made in this work, it is important to separate the effect of the rate of evolution from the effect of HGT / paralogy.

      The ideal solution would be to include ALL genes (not just "good" ones), build gene trees, identify parts of the gene trees that once experienced HGT or paralogy, and prune off these PARTS, instead of excluding the entire gene tree. The remaining data are thus free of HGT / paralogy, based on which one can quantify the "true" AB branch length, and further assess how much it is correlated with "verticality", and whether there are still "outliers". This solution is likely not trivial in implementation, though. However, without such assessment, the observed short AB branch still only applies to the "tree of one percent", not the "tree of life".

      Thanks for this comment --- the reviewer raises a subtle and valid point. Our analyses indicate that vertically-evolving genes have longer AB branch lengths, but in the first version of our manuscript we did not test the alternative hypothesis that this relationship might simply result from a faster rate of evolution in vertically-evolving genes. To evaluate the relationship between evolutionary rate, verticality, AB branch length and relative AB distance on as broad a set of genes as possible, we took the 302 genes from the 381-gene expanded set, excluding 56 genes for which the 1000-species subsample included no archaea, and another 23 which included only 1 archaeon. To estimate per-gene evolutionary rate, we rooted each gene tree using MAD (Tria et al. 2017) and calculated the mean root- to-tip distance on the MAD-rooted gene tree, then evaluated the relationship between rate and verticality. This analysis indicated that vertically-evolving genes evolve more slowly (have shorter mean root-to-tip distances) than less vertical genes (using deltaLL and between-domain split score as proxies for marker verticality, with a Pearson’s product- moment correlation: MAD rooted mean root-to-tip distance against deltaLL: R = 0.1397803, p = 0.01506 or against split score: R = 0.1902056 p = 0.000893), despite having longer AB branches and relative AB distances (using a Pearson’s product moment correlation of MAD rooted mean root-to-tip distance against AB length: p = 0.2025, R= 0.1143076, or against relative AB distance p = 0.007435, R=0.1537479). Thus, the longer AB branches of vertically evolving genes do not appear to be the indirect result of faster evolution of those genes. These analyses are reported in the main text, where we write:

      An alternative explanation for the positive relationship between marker gene verticality and AB branch length could be that vertically-evolving genes experience higher rates of sequence evolution. For a set of genes that originate at the same point on the species tree, the mean root-to-tip distance (measured in substitutions per site, for gene trees rooted using the MAD method (Tria et al., 2017)) provides a proxy of evolutionary rate. Mean root-to-tip distances were significantly positively correlated with ∆LL and between-domain split score (∆LL: R = 0.1397803, p = 0.01506, split score: R = 0.1705415 p = 0.002947; Figure 1 Figure Supplement 5,6, indicating that vertically-evolving genes evolve relatively slowly (note that large values of ∆LL and split score denote low verticality)). Thus, the longer AB branches of vertically-evolving genes do not appear to result from a faster evolutionary rate for these genes. Taken together, these results indicate that the inclusion of genes that do not support the reciprocal monophyly of Archaea and Bacteria, or their constituent taxonomic ranks, in the universal concatenate explain the reduced estimated AB branch length.

      Differential metric for verticality:

      In spite of the similarity between the current result and Zhu et al.'s (see above), the two works approached this goal using different metrics.

      First, the authors attempted to quantify the AB branch length in individual gene trees, including those that do not have Archaea and Bacteria perfectly separated. To do so they performed a constrained ML search (line 210). I am wary of this treatment because it could force distinct sequences (due to HGT or paralogy) to be grouped together, and the resulting branch length estimates could be highly inaccurate.

      We agree with the reviewer that estimating AB branch lengths in this way might lead to inaccuracy. We note that this is, in effect, what was done in the published analysis (Zhu et al. 2019): a topology in which Archaea and Bacteria were reciprocally monophyletic was inferred using ASTRAL (a reasonable analysis, given the robustness of ASTRAL to some degree of HGT/gene tree incongruence), and then the AB branch length was estimated from the concatenation of these 381 genes, fixing on the ASTRAL topology. We performed this experiment (inferring AB branch length on constrained trees) in order to evaluate how incongruence between the gene and species trees might affect AB branch length inference.

      In contrast, Zhu et al. quantifies the average taxon-to-taxon phylogenetic distance between the two domains, regardless of the overall domain monophyly. That method was free of this concern, although it computed a different metric.

      Thanks for raising this point. As described above, in the revised manuscript we have also evaluated the relative AB distance metric used by Zhu et al., and show that it behaves similarly to the AB branch metric we evaluated in the first version of the manuscript (see revised Figure 1).

      Second, the authors assessed "marker gene verticality" using two metrics: a) AU test result (rejected or not) (Fig. 1A), c) ΔLL, the difference in log likelihood between the constrained ML tree and ML gene tree (line 222, Fig. 1B, C). I am concerned that they are sensitive to taxon sampling and stochastic events, as I explained above regarding domain monophyly. It is possible that a single mislabeling event would cause the topology test to report a significant result. In addition, they evaluate how severely domain monophyly is violated, but they do not account for intra-domain HGTs and other artifacts, which are also part of "verticality", and they can potentially distort the AB branch as well.

      In the revised manuscript, we also evaluate a complementary metric for marker gene verticality, the split score (see above), which measures the extent to which marker genes recover established relationships at a given taxonomic level (we computed both within-domain and between-domain split scores). The split score is a more granular measure than ΔLL and, by summing over bootstrap replicates, it also better accommodates phylogenetic uncertainty. The two metrics (ΔLL and split score) are positively correlated and the analyses come to the same conclusions regarding the impact of HGT and other sources of incongruence on estimates of the AB branch length and relative AB distance.

      I did not find the ΔLL values of individual markers in any supplementary table. I also did not find any correlation statistics associated with Fig. 1B.

      The ΔLL values for individual markers can be found in the data supplement in: “Expanded_Bacterial_Core_Nonribosomal_analyses/Individual_gene_tree_analyses/Expanded//Expanded_AB_AU.csv”

      We have now updated the readme.txt file for clarity and included all the new results from the analyses which we have undertaken as part of the review process in the latest version of the supplemental available on figshare (10.6084/m9.figshare.13395470) as well as updated the directory and file names for clarity. We also have added the statistics associated with the correlations in Figure 1 to the Figure Legend.

      Statistical test:

      Line 157: "For the remaining 302 genes, domain monophyly was rejected (p < 0.05) for 232 out of 302 (76.8%) genes." Did the authors perform multiple hypothesis correction? If not, they probably should.

      Thanks for this suggestion. We have now used a Bonferonni correction to account for multiple testing. As a result, fewer marker genes are rejected at the 5% level (151/302), although the overall conclusions are unaffected.

      Line 217: "This result suggests that inter-domain gene transfers reduce the AB branch length when included in a concatenation." and Fig. 1A. If I understand correctly, this analysis was performed on individual gene trees, rather than in a concatenated setting. Therefore, the result does not directly support this conclusion.

      Thanks for pointing this out. The reviewer is correct that this inference depends not only on the single gene analyses, but also on the subsequent concatenation results presented in this section. We have therefore moved this sentence later in the section, after the concatenation analysis.

      Line 224: "Furthermore, AB branch length decreased as increasing numbers of low-verticality markers were added to the concatenate (Figure 1(c))". While this statement is likely true, Zhu et al. also presented similar results (Fig. 5) despite using a different metric, and they concluded that the impact is moderate and cannot explain the status of some core genes as outliers.

      Zhu et al. did identify some of these trends, as we acknowledge in our manuscript ("The original study investigated and acknowledged (Zhu et al., 2019) the varying levels of congruence between the marker phylogenies and the species tree, but did not investigate the underlying causes.“ --- line 178; “These results are consistent with (Zhu et al., 2019), who also noted that AB branch length increases as model fit improves for the expanded marker dataset.” --- line 337) and as discussed above. Our analysis (Figure 6, Table 1) goes further in showing that the most vertical subset of the 381-gene set supports an inter-domain branch length closely similar (2.4 subs/site compared to e.g. 2.5 subs./site for the 27-gene dataset) to analyses using the traditional marker gene set.

      Concatenation and branch length:

      The authors pointed out that "Concatenation is based on the assumption that all of the genes in the supermatrix evolve on the same underlying tree; genes with different gene tree topologies violate this assumption and should not be concatenated because the topological differences among sites are not modelled, and so the impact on inferred branch lengths is difficult to predict." (line 187).

      This argument is valid. In my opinion, this is the one most important potential issue of Zhu et al.'s analysis. In that work, they inferred genome tree topology through ASTRAL, which resolves conflicting gene evolutions. However ASTRAL does not report branch lengths in the unit of number of mutations. Therefore, they plugged the concatenated alignment into this topology for branch length estimation, hoping that it will "average out" the result. That workaround was apparently not ideal.

      Yes, we agree --- this is our main critique of the Zhu et al. analyses.

      However, the practice of molecular phylogenetics is complicated. Theoretically, every gene, domain, codon position and site may have its unique evolutionary process, and there have been efforts to develop better partition and mixture models to address these possibilities. But there is a trade off; these technologies are computationally demanding and have the risk of overfitting. It is plausible that in some scenarios, the gain of concatenating many loci (despite conflicting phylogeny) may outweigh the cost of having unpredictable effects.

      But this dilemma needs to be analyzed rather than just being discussed. The Zhu et al. paper did not assess the impact of such concatenation on branch length estimation. The best answer is to conduct an analysis to show that concatenating genes with conflicting phylogeny would result in an AB branch that is shorter than the mean of those genes, and the reduction of AB branch length is correlated with the amount of conflict involved. The current work has not done this.

      Thanks for raising this point. We agree that phylogenetics is complicated and that

      we lack methods that can account for all possible factors. With respect to the impact of gene transfers on the AB branch length, and as touched on above, there are two issues here.

      The first is with the analysis actually performed by Zhu et al: of the 381 extended set genes, 79 have one or no archaea in the 1000-taxon subsample, and a further 176 have an AB branch length close to 0 (<0.00001) in the constrained analyses. To investigate further, we manually inspected ML gene trees for the 381 genes (1000 taxon subsample). Allowing for recent gene transfer, we nevertheless identified only 64 genes with an unambiguous branch separating most Archaea from most Bacteria that might correspond to the ancestral AB divergence (Supplementary File 1).

      Taken together, these analyses suggest that there is no strong evidence that these genes were present in LUCA or evolved along the AB branch, and so they do not provide information on its length. Since the branch length in the concatenation is an average over the branch length per site, the inclusion of this set of genes in the analysis did reduce the AB branch length, as demonstrated by our analyses (Figure 1(H)).

      The second issue is: for genes which were likely present in LUCA and evolved on the AB branch, does gene transfer cause a reduction in the AB branch length inferred from their concatenation? To test this, we initially tested iterative concatenations of increasing numbers of non-vertical markers (Figure 1H), as well as a comparison of the most vertical genes to the whole expanded marker set (Figure 3 Figure Supplement 2). This revealed that as more markers were added (with lower verticality), the inferred AB branch length from the concatenate was reduced. We also found an increased AB branch length when only the 20 most vertical markers were used as opposed to the whole (381 marker) dataset (0.56 vs 0.16 substitutions/site, Figure 6).

      The reviewer proposes an additional test of the impact of marker gene incongruence on branch length inference from concatenations: to compare the AB branch length before and after pruning of HGTs from individual marker gene alignments. To do this, we took the 54 marker genes from our new dataset and concatenated them before and after pruning of unambiguous HGTs. The AB branch length inferred from the concatenation with HGTs removed was 1.946 substitutions/site, compared to 1.734 substitutions/site without pruning HGTs, demonstrating the impact of even a relatively small number of HGTs on branch length estimation from concatenates.

      Divergence time estimation:

      The manuscript dedicates one section (line 230-266) to argue that the divergence time estimation analysis performed by Zhu et al. was not good evidence for marker gene suitability. Zhu et al. showed congruence of the expanded marker set with geological records whereas ribosomal proteins were conflicting with the geologic record.To support their argument, the authors estimated divergence times using the top 20 most "vertical" genes measured by ΔLL.

      It would be good to clarify which genes they are, and it would be important to check whether they include some of the most "AB-distant" ones found by Zhu et al. Their Fig. 5A shows that there are genes that divide the two domains several folds further than the ribosomal proteins (such as rpoC). If they are among the 20 genes, it will not be surprising that the estimated AB split is older than it should be.

      We now include the annotations for these 20 genes in Supplementary File 5a. The 20 most vertical genes include two of the “AB-distant” outliers identified by Zhu et al., tuf and infB, and one ribosomal marker, rpsG.

      Overall, I think this section is logically questionable. Zhu et al. suggested that "They show the limitation of using core genes alone to model the evolution of the entire genome, and highlight the value in using a more diverse marker gene set.". The current work showed that using another set of a few genes (I do not know if they include multiple "core" genes, as discussed above, but it is plausible) also did not work well. This does not refute Zhu et al.'s claim.

      What's important in Zhu et al.'s analysis is this: they demonstrated that using a small set of genes in DTE may cause artifacts due to them significantly violating the molecular clock at certain stages of evolution. Instead, using a larger set of markers that represent a portion of the entire genome would help to "smooth out" these artifacts. This of course is not the ideal solution, likely because concatenating conflicting genes and modelling them uniformly is not the best idea (see above). But as an operational workaround, it was not challenged by the analysis in the current work.

      Finally, I agree with the authors' statement that more and reliable calibrations are the best way to improve divergence time estimation.

      The dating analyses presented in the first version of our manuscript demonstrated that the apparent agreement between molecular clock estimates using the 381-gene set and the fossil record was the result of artifactual shortening of the AB branch, as discussed in detail above. Once the subset of the data least affected by these issues (that is, the most vertical subset) was used, the limitations of current clock methods, particularly with few calibrations, for dating deep nodes became clear.

      That said, we agree with the reviewer (and also R3) that the dating section in the first version of our manuscript was somewhat unsatisfactory: it identified an important limitation of the published analysis, but did not explore the underlying question of why molecular clock methods infer unrealistically old divergence times from vertically-evolving genes. In the revised manuscript we have reworked and improved this section extensively, including new analyses on the 27-gene dataset, with more fossil calibrations, that help to diagnose how and why clocks struggle to date the archaeal and bacterial stems from the available data. We now show that the old ages result from a combination of low rates of molecular evolution across the tree inferred from “shallow” calibrations, combined with a lack of age maxima for nodes other than the root of the tree; when the rate distribution is informed in this way, the long AB branch is interpreted as representing a long period of time and estimates of LUCA age are strongly influenced by prior assumptions about root maximum age. These analyses now suggest how the difficulties might be overcome in the future, for example using better calibrations (particularly maximum ages, and indeed any fossil calibrations within the Archaea), or alternatively other sources of time information such as from gene transfers. Reflecting the new, broader focus, we have moved this section to the end of the manuscript.

      AB branch by ribosomal and non-ribosomal genes:

      Two figures (Figs. 3 and 4) are two sections (line 270-303) dedicated to the argument that ribosomal markers do not indicate a longer AB branch than a non-ribosomal one. However, this is a small scale test (38 ribosomal markers vs. 16 non-ribosomal markers) compared with the similar analysis in Zhu et al. (30 ribosomal markers vs. 381 global markers). A closer look at Figs. 3 and 4 suggests that while the AB lengths indicated by the ribosomal markers are within a relatively narrow range, those by the non-ribosomal ones are very diverse, including ones that are several folds longer than the ribosomal average. This result is in accordance with that of Zhu et al.'s Fig. 5A, although the latter was describing a different metric. Do these genes also overlap the ones found by Zhu et al.?

      Nevertheless, this analysis does not falsify Zhu et al.'s, because it compared a different, much smaller, and deliberately chosen group of genes.

      As the reviewer indicates, the purpose of the analyses presented in Figures 3-4 is to evaluate the hypothesis of accelerated ribosomal protein evolution: that is, the idea that ribosomal proteins over-estimate the AB branch length due to accelerated evolution during the divergence of Archaea and Bacteria. Although this hypothesis was independently proposed in Zhu et al., to our knowledge it actually originates with Petitjean et al. (2014) GBE (https://academic.oup.com/gbe/article/7/1/191/601621; see their Figure 2), and has been at play in analyses of deep evolution and in particular the position of DPANN Archaea in the phylogeny since that time. Thus, this section of our manuscript (indeed, all but the first section) is not a critique of Zhu et al.’s work, but a contribution to the broader ongoing discussion about which marker genes are best to use in deep phylogeny. We compare only vertically-evolving genes in Figures 3-4 so as to distinguish the impact of gene function (ribosomal versus non-ribosomal) from confounding factors such as HGT, paralogy, and gene origination time.

      To clarify this point, we have modified our main text discussion to make it clear that we are making a comparison between ribosomal genes and other vertically-evolving members of the traditional “core” gene set, rather than a broader genome-wide claim. We now write:

      “If ribosomal proteins experienced accelerated evolution during the divergence of Archaea and Bacteria, this might lead to the inference of an artifactually long AB branch length (Petitjean et al., 2014; Zhu et al., 2019). To investigate this, we plotted the inter-domain branch lengths for the 38 and 16 ribosomal and non-ribosomal genes, respectively, comprising the 54 marker genes set. We found no evidence that there was a longer AB branch associated with ribosomal markers than for other vertically-evolving “core” genes (Figure 2(b); mean AB branch length for ribosomal proteins 1.35 substitutions/site, mean for non-ribosomal 2.25 substitutions/site).”

      Substitutional saturation:

      The comparative analysis of slow- and fast-evolving sites is interesting. The result (Fig. 5) is visually impactful. In my view, this analysis is valid, and the conclusion is supported. It would be better to explain the rationale with more detail to facilitate understanding by a general audience.

      Thanks for this assessment. We have now expanded on the rationale of this analysis in the main text, writing:

      “It is interesting to note that the proportion of inferred substitutions that occur along the AB branch differs between the slow-evolving and fast-evolving sites. As would be expected, the total tree length measured in substitutions per site is shorter from the slow-evolving sites, but the relative AB branch length is longer (1.2 substitutions/site, or ~2% of all inferred substitutions, compared to 2.6 substitutions/site, or ~0.04% of all inferred substitutions for the fastest-evolving sites). Since we would not expect the distribution of substitutions over the tree to differ between slow-evolving and fast-evolving sites, this result suggests that some ancient changes along the AB branch at fast-evolving sites have been overwritten by more recent events in evolution --- that is, that substitutional saturation leads to an underestimate of the AB branch length.”

      Zhu et al. also tested the impact of substitution saturation on the AB branch, using a more traditional approach (Fig. S19). They also found that the inter-domain distance is more influenced by potential substitution saturation, but the difference is minor. They concluded that (AB distance) "is not substantially impacted by saturation."

      Like other analyses, these two analyses involved very different locus sampling (27 most "vertical" genes vs. 381 expanded genes). They also differ by the metric being measured (AB branch length vs. average distance between AB taxa). Therefore, the analysis in the current work does not falsify the analysis by Zhu et al. In contrast, it is inline with (though not in direct support of) Zhu et al. and others' suggestion that there was "accelerated evolution of ribosomal proteins along the inter-domain branch" (line 25) in the 27 core genes (of which 15 are ribosomal proteins).

      We disagree that our analysis is consistent with the hypothesis of accelerated ribosomal protein evolution. The analysis that directly addresses this point is Figure 3, where we show that the distributions of AB branch lengths in single gene trees are not significantly different between ribosomal and non-ribosomal datasets (Figure 3; mean AB branch length for ribosomal proteins 1.35 substitutions/site, mean for non-ribosomal 2.25 substitutions/site).

      Evolutionary model fit:

      The authors compared the AB branch length indicated by the standard, site-homogeneous model LG+G4+F vs. the site-heterogeneous model LG+C60+G4+F, and found that the latter recovered a longer AB branch (2.52 vs. 1.45). The author's reasoning for using a site-heterogeneous model is valid, and this analysis is sound.

      However, Zhu et al. also analyzed their data using the site-heterogeneous model C60 -- the same as in this work, but through the PMSF (posterior mean site frequency) method. Zhu et al. also compared it with two site-homogeneous models (Gamma and FreeRate). The results were extensively presented and discussed (Figs. 3, 4E, F, S23, S24, Note S2). They also found that C60+PMSF elongated the AB branch compared with the site-homogeneous models (Fig. S24A). As for the average AB distance (another metric evaluated by Zhu et al., as discussed above), C60+PMSF increased this metric when using ribosomal proteins, but not much when using the expanded marker set (Fig. S25A). And overall, the elongation by C60+PMSF with the expanded markers cannot compensate for the long branch indicated by the ribosomal proteins.

      Therefore, similar to the point I made above, this analysis is sound but it does not logically falsify the conclusion made by Zhu et al., as it only concerns a small set of markers, and it recovered a previously described pattern.

      Thanks for this comment. As above, note that the second part of our manuscript presents a general analysis of the issues around marker gene and model selection using our meta-analysis and new dataset, and is not a direct response to Zhu et al’s work. On reflection, we agree that this was not sufficiently clear in the first version of the paper, and we have now modified the text to acknowledge the model fitting analyses of Zhu et al.

      The manuscript also did not clarify what the phrase "poor model fit" refers to (line 34 and line 304). If this is addressing the Gamma model evaluated by the authors, then this claim is valid though not novel (but see my previous comment on the trade-off). If that is a general reference to Zhu et al.'s methodology, then the authors should at least include the C60+PMSF model in the analysis, and show that C60 indicates a significantly longer AB branch than C60+PMSF does (if that's the case, which is doubtful). Admittedly, C60+PMSF is cheaper than the native C60 in computation, but "In some empirical and simulation settings PMSF provided more accurate estimates of phylogenies than the mixture models from which they derive." (Wang et al. Syst Biol. 2018).

      Thanks for this comment. We did not intend the phrase “poor model fit” to imply a critique of Zhu et al.’s work; as the reviewer notes, those authors carried out a range of analyses to investigate the impact of model choice on their inferences. Rather, the title of the section is intended to summarise its main conclusion, which is that substitutional saturation and poor model fit (on any dataset, and even with the best available models) can lead to under-estimation of the AB branch length. Note that the analyses in Table 1 illustrating the impact of model fit are from the new dataset that is assembled and analysed in the second part of the manuscript. As above, we agree that this was not sufficiently clear in the first version of the paper. We think the title of this section is accurate and so we have not changed it, but we have changed the final two paragraphs of the section (as quoted immediately above) so as to acknowledge the model fitting analyses of Zhu et al., and to clarify that the results are general (and based on our new dataset), rather than a critique of Zhu et al’s work.

      Finally, Zhu et al. also performed an analysis using the native C60 model on a further reduced taxon set. That result was not presented in the published paper, but it can be found in the "Peer Review File" posted on the Nature Communications website. That tree also recovered a short AB distance, and placed CPR at the base of Bacteria, and showed that this placement was not impacted by the removal of Archaea.

      Thanks for pointing us to this additional analysis. The unrooted, bacteria-only tree referred to by the reviewer (panel B) recovers a clan (that is, a cluster of branches on the unrooted tree) comprising CPR+Chloroflexi, in agreement with the analysis on the new marker dataset we present here (Figure 6). The disagreement between that analysis and the new tree presented here relates to the position of the archaeal outgroup, which in the Peer Review File panel A connects to the bacterial tree between CPR and Chloroflexi. If, as recently suggested, the bacterial root lies between Gracilicutes and Terrabacteria (Coleman et al. 2021), then CPR and Chloroflexi represent monophyletic sister lineages. We note that the CPR+Chloroflexi relationship recovered here and in Peer Review File Panel (B) has also been obtained in several other recent analyses (Taib et al. 2020, Coleman et al. 2021, Martinez-Gutierrez and Aylward 2021), as cited in the main text.

      Taxon sampling:

      My final comment is about taxon sampling. Zhu et al. developed an algorithm for less biased taxon sampling, and they argued that extensive taxon sampling is important in resolving the early evolution of life. They presented evidence showing that reduced taxon sampling changed overall topology and basal relationships (Figs. S13, S14, S23, Note S2). The analyses were performed in combination with the assessment of site sampling, locus sampling, substitution model and other factors. The importance of less biased and/or extensive taxon sampling was also noted by previous works, especially in a phylogenomic framework (e.g., Hedtke et al. Syst Biol. 2006; Wu and Eisen. Genome Biol. 2008; Beiko. Biol Direct. 2011). The current work is based on a smaller set of taxa, and it has not addressed the impact of taxon sampling. As I suggested above, some results may be sensitive to taxon sampling.

      We agree that taxon sampling is important for phylogenetics. While the analyses of Zhu et al. (2019) included a very large number of genomes, sampling of genomes (and indeed marker genes) was biased, both towards Bacteria compared to Archaea, and also within Bacteria. In our revised manuscript, we now compare the taxon sampling between Zhu et al.’s work and our new analyses (see Figure 1 Figure Supplements 13,14,15 and Figure 4 Figure Supplements 1,2). Balanced sampling is important for phylogenetic inference (Heath et al., 2008; Hillis, 1998) and, by this criteria, the taxon sampling in the analyses of Zhu et al. was not ideal. Our new analyses made use of fewer genomes (700), but these sample the known diversity of Archaea and Bacteria in a more representative way (Figure 4 Figure Supplement 1,2).

      Reviewer #3:

      Moody and coworkers principally address a recent paper presented by Zhu et al. (Nature Communications, 2019). In their paper, Zhu and coworkers claim that (i) ribosomal protein genes, commonly used in resolving deep phylogenies, have experienced an increased rate of evolution right after LUCA, and (ii) that an expanded set of markers show that the branch separating archaea from bacteria (AB-branch) is 10-fold shorter than previously thought. Moody et and coworkers first demonstrate flaws in the Zhu et al. analysis: first, the expanded gene set is biased towards bacteria, with 25% of the single-gene trees having very few archaeal counterparts. Second, that over 75% of the single-gene trees from Zhu et al are not monophyletic at domain level, suggesting a large influence of horizontal gene transfers (HGT), inter-domain exchanges, and inclusion of paralogous sequences in the original datasets. Third, they show that genes with fewer HGT display longer AB-branches. Fourth, they show that the argument by Zhu et al. that the longer AB-branch yields absurd LUCA datation is not relevant. Fifth, and maybe most important, they show that the shorter AB-branches recovered by Zhu et al in their expanded dataset result from inadequate substitution models, which lead to underestimating rates and thus branch lengths.

      Going further, they select a set of 54 manually curated markers (showing mostly monophyletic archaea and bacteria), both from ribosomal proteins (36) and non-ribosomal proteins (18) and retrieve these in a balanced set of 350 archaea and 350 bacteria. With this set, they show that ribosomal protein markers do not display longer AB-branches than non-ribosomal ones. They also show that diversity among Archaea and Bacteria, as measured as the total tree length within each domain, is very similar, when sampling equal number of genomes in both domains.

      Strengths:

      The paper is well-written and well structured. In general, the methodology chosen here is adapted to the question at hand and very rigorously followed. The balanced dataset (with equal amounts of bacteria and archaea) of 54 carefully selected genes is also appropriate to explore diversity differences between the two domains of life.

      Although all arguments presented in Zhu et al are carefully re-evaluated, the part where Moody et al show that substitutional saturation and poor model fit is artifactually producing short AB-branches is quite compelling and elegantly presented.

      Weaknesses:

      One potential weakness, more in terms of significance than in terms of scientific soundness is that the paper is mostly "reactive", responding to a single other paper. The authors might have used the data and methodology presented here to give the paper a broader scope. An example would be to provide the audience with a solid protocol or general guidelines on how to avoid artifacts in making deep phylogenies. I believe that the authors have demonstrated that they have the authority to do that.

      Thanks for this suggestion. We considered including guidelines of this type in the first version of the manuscript, but we were --- and remain --- wary of attempting to promote one particular way of doing deep phylogeny over others. These are difficult and slippery questions, and different approaches and perspectives (including ones we might disagree with) are, in a broader sense, useful in refining ideas and helping the field to make progress as a whole. That said, a recurring issue appears to be the question of the fit between model and data, both in terms of substitution model fit (as with the impact of site-heterogeneous models on branch length inferences) and the broader issue of using models that, for example, account for gene duplication or transfer. There are several recent reviews (including one by some of us) which treat these topics in detail and provide detailed advice. We have now raised and discussed these issues in our conclusion. We have also updated Figure 6 to illustrate the approach we used in assembling the new 27-gene dataset, which may be of use to others, and goes some way towards the suggestion of providing guidelines for future analyses. We now write:

      “Our analysis of a range of published marker gene datasets (Petitjean et al., 2014; Spang et al., 2015; Williams et al., 2020; Zhu et al., 2019) indicates that the choice of markers and the fit of the substitution model are both important for inference of deep phylogeny from concatenations, in agreement with an existing body of literature (reviewed in (Kapli et al., 2021, 2020; Williams et al., 2021). We established a set of 27 highly vertically evolving marker gene families and found no evidence that ribosomal genes overestimate stem length; since they appear to be transferred less frequently than other genes, our analysis affirms that ribosomal proteins are useful markers for deep phylogeny. In general, high-verticality markers, regardless of functional category, supported a longer AB branch length. Furthermore, our phylogeny was consistent with recent work on early prokaryotic evolution, resolving the major clades within Archaea and nesting the CPR within Terrabacteria. Notably, our analyses suggested that both the true Archaea-Bacteria branch length (Figure 6A), and the phylogenetic diversity of Archaea, may be underestimated by even the best current models, a finding that is consistent with a root for the tree of life between the two prokaryotic domains.”

      In the figure 6 legend, we also expand on guidelines for future analyses, writing:

      “(B) Workflow for iterative manual curation of marker gene families for concatenation analysis. After inference and inspection of initial orthologue trees, several rounds of manual inspection and removal of HGTs and distant paralogues were carried out. These sequences were removed from the initial set of orthologues before alignment and trimming. For a detailed discussion of some of these issues, and practical guidelines on phylogenomic analysis of multi-gene datasets, see (Kapli et al., 2020) for a useful review.”

      The authors use the difference in log-likelihood between the constrained and unconstrained gene trees as a proxy for verticality and thus marker gene quality (Figure 1b). However, they don't demonstrate that that metric is actually appropriate. Could the monophyly (or split score) be also involved here? The authors might want to comment on that.

      Thanks for this suggestion, which has substantially improved our analysis of Archaea-Bacteria distance and marker gene verticality (see the revised Figure 1 and associated text). We have now evaluated the relationship between AB branch length and split score (both within- and between-domain level relationships) for the expanded marker set and have updated our results and discussion accordingly. We found that deltaLL and split score (both within- and between-domains) are positively correlated with each other, and negatively correlated with AB length (that is, high-verticality markers have longer AB branch lengths). These analyses also revealed that within-domain and between-domain split scores are strongly positively correlated, implying that genes that recover domain monophyly also do better at resolving within-domain relationships.

      The argument about the age of LUCA an ad absurdum one, showing that using better suited genes one gets impossible time estimates. However, the argument presented by Zhu et al is also a "just so" argument (if we get a time estimate that doesn't make sense then the phylogeny must be wrong), which doesn't give it much weight. The authors themselves note well that this part of the paper is more revealing of the limitations of the strict clock method, or of the relaxed clock with one single calibration point, than of the quality or appropriateness of the dataset.

      We agree that the dating section in the first version of our manuscript was somewhat unsatisfactory. We have now expanded it to include new analyses on our 27-gene dataset, using more fossil calibrations, in order to diagnose why current clock methods struggle to estimate evolutionary rate near the root of the tree, and how this impacts on the age of LUCA and other deep nodes. These analyses add substantial value to this section, which has been moved to the end of the manuscript to reflect its expanded focus.

      Another small weakness (or loose end) is that manual curation of the 95 genes dataset is not consistently reducing the percentage of non-monopyhletic genes (e.g. 62 to 69% from the 95 to the 54 genes dataset for non-ribosomal genes; 21 to 33% from the 95 to the 27 genes dataset for ribosomal genes). The author don't discuss how this impacts the manual curation they perform on the datasets; however, they state that "manual curation of marker genes is important". The authors might want to discuss that aspect further.

      Thanks for raising this point. We were not sufficiently clear in describing the logic of our approach in the first version of the manuscript, and have now revised the text to clarify. In this analysis, we used a strict binary definition of monophyly --- that is, even a single inter-domain transfer leads to non-monophyly (note that this is in contrast to the re- analysis of the expanded set, where we considered whether each marker statistically rejected domain monophyly). For some genes scored as non-monophyletic in this way, manual removal of a small number of unambiguous recent transfers is sufficient restore domain monophyly; for others, HGT is extensive and it is difficult to know how to filter the sequences so as to obtain a reliable marker gene alignment; it was these latter cases that we set aside. We have now revised this section to make the logic of the approach clear, writing:

      “Prior to manual curation, non-ribosomal markers had a greater number of HGTs and cases of mixed paralogy. In particular, for the original set of 95 unique COG families (see ‘Phylogenetic analyses’ in Methods), we rejected 41 families based on the inferred ML trees, either due to a large degree of HGT, paralogous gene families or LBA. For the remaining 54 markers, the ML trees contained evidence of occasional recent HGT events. Strict monophyly was violated in 69% of the non-ribosomal and 29% of the ribosomal families. We manually removed the individual sequences which violated domain monophyly before re-alignment, trimming, and subsequent tree inference (see Methods). These results imply that manual curation of marker genes is important for deep phylogenetic analyses, particularly when using non-ribosomal markers. Comparison of within-domain split scores for these 54 markers indicated that markers that better resolved established relationships within each domain also supported a longer AB branch length (Figure 2A).”

      In summary and despite the small weaknesses listed above, my opinion is that the authors reach their goal of showning that the AB-branch is indeed a long one, and that the results support the conclusion.

      Impact:

      The main point addressed by the authors here, the time of divergence between Archaea and Bacteria, is crucial to our understanding of early evolution. The long branch separating Bacteria and Archaea has long been thought to be a long one, and the paper by Zhu et al casted a doubt about the validity of this long-standing hypothesis. Here, Moody et al convincingly establish that the divergence between archaea and bacteria is a profound one. The paper also has profound implications on the validity of the commonly used core-gene phylogenies, particularly those based on ribosomal protein genes. Indeed, it shows that the these proteins are appropriate for deep phylogenies. They also show the impact of model violations on deep phylogenies, and how to avoid them.

      We thank the reviewer for this positive assessment of impact.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank all reviewers for their thorough assessment and constructive comments.

      For clarity, their comments have been numbered.

      Reviewer #1

      Evidence, reproducibility and clarity:

      Summary:

      Acetylation/Deacetylation controls G1/s transition in budding yeast. The lysine acetyl transferase Esa1 is here shown to play a role, in part via acetylation of the nuclear pore complex basket component Nup60, which stimulates mRNA export.

      Major comments:

      1 • Figure 1C: The curve for esa1-ts in this figure and the curve in the supplementary figure S2B are not similar, while the first shows 10% cells budding after 60 minutes it is about 50% after 60 min in S2B. Another helpful way of presenting the data could be the length of the G1 phase (from cytokinesis to budding) in the WT, esa1-ts, gcn5delta cells over time.

      We thank the reviewer for pointing this out. Indeed, there is some day-to-day variability in the budding kinetics of the temperature-sensitive esa1 mutant, and the text referred to one individual experiment. Therefore, we have changed the text to better reflect the observed variability (p. 7) and added a graph (supplementary Figure S2C) including all individual replicates. This shows that in spite of small differences between experiments, esa1-ts cells always bud slower and less efficiently than wild-type cells. We note that the data cannot be shown in the way suggested (time from cytokinesis to budding, presumably from individual cells) because cells in these experiments were released from a G1 block (after cytokinesis), and samples from cell cultures were imaged at time intervals (and not single cells over time). Time-lapse data of single cells is shown in figure 2E.

      2 • What is the rational of creating the Nup60-KN mutation. Does it prevent acetylation of Nup60, at least by GCN5 and/or esa1?

      The biophysical properties of asparagine resemble those of acetylated lysine. Therefore, the Nup60-KN mutant (lysine 467 to asparagine) is expected to mimic acetylation of Nup60 K467, which was found to be acetylated in earlier studies. Supporting the conclusion that Nup60-KN is indeed an acetyl-mimic, the nup60-KN mutation partially rescues the Start and mRNA export defects on Esa1-deficient cells. We make the rationale of the Nup60-KN mutation clearer in the current version (p. 8).

      3 • Given the much stronger phenotype of the esa1-ts+GCN5 delta condition for G1/S transition as compared to esa1-ts and that GCN5 seems to strongly acetylate Nup60 I do not understand the sole focus on esa1 in the study. The fact that the Nup60-KN cells do not show G1/S transition under esa1-ts+GCN5 delta conditions in experiments presented in Fig. S3 argues that esa1 meaidted acetylation of Nup60 is only one, probably minor aspect of G1/S transition. This should be much balanced discussed.

      We focus on Esa1 because this allows us to dissect the specific role of Nup60 acetylation and mRNA export during the G1/S transition. Of course, Esa1-dependent acetylation of Nup60 is not the only process controlling the G1/S transition, which is regulated at several levels. For example, the concentration of multiple Start activators and inhibitors scales differentially with cell size (PMID: 26390151, 32246903). In addition, daughter-specific factors inhibit Start through a pathway parallel to Nup60 deacetylation (Ace2/Ash1-dependent repression of Cln3 transcription; PMID: 19841732, 19841732). We discuss these studies in the current version (p. 17).

      As for the relative contribution of Esa1 and Gcn5 to the G1/S transition and mRNA export: both of these KATs have overlapping roles in promoting transcription, probably through distinct substrates (such as histone H2 for Gcn5, H4 for Esa1) and this may contribute to their role in Start. Consistent with this, deletion of GCN5 causes a minor delay in transcription of G1/S genes (Kishkevich, Sci. Rep 2019). On the other hand, gnc5 mutants have no detectable mRNA export defects, unlike esa1-ts (our Figure 3E). This suggests that whereas Gcn5 and Esa1 may have overlapping roles in transcription of G1/S genes, Esa1 is more specifically involved in mRNA export. The ability of Nup60-KN to rescue the single mutant esa1 but not the double gcn5 esa1 is consistent with this view: the transcription defects in the double mutant may be so severe as to prevent Start even in the presence of Nup60-KN. We have modified the discussion to mention these points. In addition, we will investigate the transcription defects of esa1 and gcn5 single and double mutants to test this possibility and include the results in a revised version.

      4 • Suppl: Fig 2: I miss the hat1delta+gcn5delta condition.

      We will include the budding index of the hat1 gcn5 double mutant in a revised version.

      Minor comments:

      5 • Figure legend 2C "at least 200 cells were scored": please state number of replicates

      Figure 2C shows RT-qPCR data. The reviewer probably means figure 1C, which shows the budding index of one experiment comparing wild type, esa1, gcn5 and esa1 gcn5 strains. This experiment was repeated 3 times, as is now mentioned in the figure 1 legend.

      6 • Figure 2E: X axis "impor" should be corrected to "import"

      We have corrected this.

      7 • Would Mex67 and/or Mrt2 overexpression recue the esa1-ts and esa1-ts+GCN5 delta phenotype?

      We will include this experiment in a revised version.

      8 • Figure 4 A: The size of the daughter cells in the hos3delta condition seems smaller as compared to esa1-ts. Is this true and can you comment this? Is a premature onset of S phase observed here?

      Since Fig 4A features only wild type and hos3∆ cells, the reviewer is probably referring to esa1-ts cells shown in figure 4B. These two figure panels are not directly comparable: cells in 4A are freely cycling, whereas those in 4B were released from a mitotic arrest using nocodazole. The mitotic arrest was done in order to avoid potentially confounding effects due to inactivation of Esa1 during S phase. However, the arrest also causes daughter cells to grow larger, explaining the size differences pointed out by the reviewer. That being said, it is true that cell size and G1 duration are intimately linked and thus the reviewer question raises a relevant point. We previously showed that although hos3 daughter cells enter S phase prematurely, their size is not significantly different from wild type (Kumar et al., Figure 1d-g). Premature onset of S phase can lead to smaller cell size but this is not the case for hos3 cells, probably due to the slightly faster growth rate of the hos3∆ mutant relative to wild type specifically during S/G2/M phases (Kumar et al., Supplementary Fig. 1b).

      9 • Figure 4D: The still images in figure 2E and 4D do not correspond with the quantitation. E.g. in Fig 2E the esa1ts cells shows Whi5 export at t=81 min, which is according to the shown quantitation unusual late.

      We will modify Figures 2E-4D in a revised version to include cells that export Whi5 at times closer to the median.

      10 • Figure 4B: it is not clear why for the quantitation a different representation is chosen as compared to 4A. It would be better to show the nuclear intensities of mother/daughter as in Figure 4A.

      The reason for the different representation between figures 4A and 4B is that 4A depicts freely cycling cells and in 4B, cells were released from a nocodazole-induced mitotic arrest (as mentioned in our response to point 8). A mitotic arrest perturbs M/D size asymmetries, as daughter cells (but not mothers) continue growing during the arrest, leading to larger nuclear size. In addition, esa1-ts daughters are smaller than wt daughters in this condition, further complicating M/D asymmetries. We thought that in this case, a better metric for protein association with the NPC is the fluorescence intensity relative to a nuclear pore component. We agree that using different types of graphs is confusing, and therefore we have removed M/D comparisons from figure 4A and now represent these data as in figure 4B: the intensity of Sac3 relative to Nup49. Finally, a good control for these experiments is the quantification of total protein levels, which we have added for Sac3. We have also removed Mtr2-GFP data until our analysis of Mtr2 total levels is complete. We hope this simplifies this figure.

      11 • Figure 4D: To strengthen these results, it would be good to perform this assay with esa1-ts Nup60-KN cells as in figure 2a. The release of Whi5-GFP is expected to behave in a similar way to the WT. This would ensure that Nup60 acetylation is a pre-requisite for Whi5 release

      I’m afraid we don't understand this suggestion. Figure 4D shows time-lapse fluorescence microscopy of Whi5 nuclear export when Sac3 is recruited to the nuclear basket. Figure 2a shows western blots of Nup60 acetylation status. Therefore it is not clear how these two assays could be done in similar ways. Perhaps the reviewer refers to a different figure panel. The purpose of the suggested experiment, if we understand properly, is to test whether Nup60 acetylation is required for Whi5 export. This is the hypothesis tested in figure 2D: Whi5-GFP export is delayed in esa1-ts, and this delay is partially rescued in esa1-ts nup60-KN, which mimics acetylation. In fact, the advance in Whi5 export observed in Figure 4D upon Sac3 anchoring to NPC is similar to that observed in a nup60-KN (Figure 2E).

      12 • Page 13 "Finally, we tested whether Esa1 targets Sac3 to G1 nuclei": The effect of esa1 knockdown on Sac3 fit with the story line and the effect esa1 imposes on mRNA export. However targeting of Sac3 which is part of a bigger complex by esa1 is a misleading statement, given that you don't show a proof of direct interactions shown, e.g. by immunoprecipiations.

      We meant to say “we tested whether Esa1 function promotes the localisation of Sac3 to the nuclear basket”. We agree that it is unknown whether this involves direct interactions between Sac3 and Esa1. We have changed the text to make this point clearer.

      13 • Page 18: "Nevertheless, our findings suggest that mammalian nucleoporins may represent a novel category of substrates for KATs and for the multiprotein complexes in which these enzymes reside, with important roles in gene expression." Given that there is little experimental evidence this statement is for my taste too strong. Rather indicate that this is a possibility which needs to be tested...

      We have changed the text as suggested.

      14 • Page 3: "Nuclear pores are macromolecular assemblies composed of approximately 30-50 different Nucleoporins": it is rather approximately 30 different nucleoporins in the species so far analyzed.

      We have corrected this as suggested.

      Significance:

      The concept of acetylation/deacetylation regulation of G1/S transition in budding yeast is very appealing. The specific (and important) contribution of Esa1, especially in comparison to GCN5 and Hat1 remains unclear as well as its precise effect on Nup60. Clarifying this, also in a more balanced way of presentation of discussion, would be of interest for the field.

      My research centers around NPC function.

      Audience: experts in the nuclear structure/function fields and cell cycle regulation.

      A more detailed characterisation of the specific roles of Esa1, Gcn5 and Hat1 in the G1/S transition and mRNA export will be included in a revised version, as mentioned in our response to point 3.

      Reviewer #2

      Evidence, reproducibility and clarity:

      In this manuscript, Gomar-Alba et al. follow up on previous work from the lab that showed that the KDAC Hos3 is targeted to the bud neck and daughter cell nuclear pore complexes in budding yeast where it slows cell cycle progression by influencing gene positioning and nucleo-cytoplasmic transport. Overall, the current manuscript describes a well-conducted study that dissects the role of acetylation and deacetylation on Nup60 during the cell cycle using genetics and microscopy. The authors conclusively identify Esa1 as counteracting Hos3 in the nucleus (Figure 1) and show that part of their effect on cell cycle progression and gene expression is mediated by acetylation of Nup60 at K467 (Figure 2). They also demonstrate that this leads to a differential localization of several mRNA export factors and suggest that deacetylation of Nup60 blocks mRNA export in daughter cells. Although this work is overall carefully done, the last conclusion is still somewhat speculative.

      I have a number of minor suggestions to improve the manuscript, but only one major concern, which revolves around the role of chromatin tethering to NPCs. The authors have shown in their previous paper that this plays a role for CLN2 and it is known that active GAL1 interacts with the nuclear periphery, but in the current manuscript this aspect is largely disregarded although I think it could play a major role in the observed mRNA export phenotypes. Therefore, I think some additional experiments and controls as well as additional analysis are required to substantiate especially the results shown in figure 5.

      Major points:

      1) Figure 2: The authors claim that the mechanism by which Nup60 acetylation promotes cell cycle progression is the enhancement of mRNA export through the NPC. In Figure 2, the authors look at the expression levels of four candidate mRNAs which all show disturbed expression in esa1-ts which is not rescued by the nup60-KN mutation, but expression of the protein of one of these candidates (CLN2) is improved. In their previous paper, the same lab has shown that the CLN2 gene is tethered to the NPC in daughter cells with deacetylated Nup60 and that this is relieved in a Nup60 K467N mutant. I think it would be important here to investigate the protein levels of additional candidates that are not regulated at the level of gene localization. Is it a general effect that protein expression is higher in the nup60KN mutant?

      We agree this is an important point. To establish if Nup60-KN regulates only genes that interact with the NPC (such as CLN2), the reviewer suggests determining the cell cycle levels of proteins encoded by other G1/S genes that do not bind NPCs. The main problem with this approach is that with the exception of CLN2, the nuclear localisation of the (about 200) G1/S regulon genes is not yet known. In addition, establishing connections between mRNA and protein levels during the first cell cycle is only possible for short-lived proteins such as Cln2. For instance, amongst the G1/S genes shown in Figure 2, Cdc21 and Rnr1 have protein half-lives of 10 and 4 h, much longer than the 90-minute yeast cell cycle (PMID 25466257). We think a more direct approach to investigate the connection between gene position and mRNA synthesis / export would be to directly visualise the localisation of single mRNAs upon perturbation of the Nup60 acetylation pathway, using single mRNA labeling techniques (smFISH or PP7). We aim to do this for CLN2 and also for GAL1 (see point 2d of this reviewer). We will attempt these experiments for a revised version of our paper.

      2) Figure 5: In figure 5, the authors investigate the expression of a different inducible RNA (GAL1) to test whether the observed effect on mRNA export is more general. Since this is a crucial point for generalizing the finding, this data needs to be presented in a more convincing manner.

      2a. GAL1 is known to be tethered to the NPC upon transcription. Whether this tethering is affected by the Nup60-KN mutant is unclear, but since Nup60 has been implicated in GAL1 tethering in the literature, this possibility is not unlikely. GAL1 therefore becomes a similar case to CLN2, where it is difficult to disentangle effects directly due to mRNA export from the effects of gene tethering on mRNA transcription and processing. Therefore, this experiment should be repeated with a system that is independent of gene tethering. For example, induction of the GAL promoter via a b-estradiol inducible VP16 transactivator does not seem to induce tethering.

      This is an excellent idea. We are not aware of studies on the localisation of the GAL1 locus induced by a VP16 transactivator, but this was investigated for the HXK1 gene. This subtelomeric gene localises to NPCs in non-glucose carbon sources, and its localisation is perturbed by VP16 transactivation in glucose (PMID: 16760983). We will investigate whether the same is true for GAL1, and if so, perform the suggested experiments.

      2b. The activation kinetics in all mutants analyzed is very different from the wildtype. Therefore, the quantification made in Figure 5C is difficult to interpret. Therefore, it might be more fair to quantify for the mutant strains at an earlier timepoint after activation when the levels are similar to the levels in the wildtype strain. E.g. in the hos3d strain at around 250 min.

      This is a good point - indeed, persistent mother/daughter asymmetry in GAL1 expression in hos3 and nup60-KN mutants could be masked by saturated levels of GFP at late time points. An alternative way to test this is to determine the time of GAL1 induction in mother and daughter cells. We have done this in wild-type and hos3 mutant cells; our results indicate that GAL1 expression occurs first in wildt-type mothers and later in their daughters, whereas it is almost simultaneous in nup60-KN mother/daughter mutant pairs (as shown for a single M-D pair in the new figure 5A). In a revised version, we will include data of GAL1 expression for M-D pairs at different times after galactose addition for cells in figures 5C and 5E.

      2c. Similarly - although not as drastic - , in figure 5E, quantification should be done at a timepoint when the induction level is similar between DMSO and Rapamycin treated samples to make conclusions about differences between mother and daughter cell.

      We agree. See our response to the previous point.

      2d. The major claim of the paper is that mRNA export is inhibited by Nup60 deacetylation. In this figure, the mRNA levels need to be quantified to validate that it is not transcription that is affecting expression.

      We agree. In addition to regulating mRNA export (as suggested by the effect of Sac3 anchoring to NPCs) Nup60 deacetylation may also inhibit GAL1 transcription (directly, and/or indirectly via disruption of Gal1-based transcriptional feedback; PMID 23150580). To directly assess the role of Nup60 acetylation in GAL1 transcription and mRNA export, it would be ideal to determine the levels of GAL1 mRNA in both the nucleus and the cytoplasm, using smFISH and/or PP7 tools, in wild type and in mutants of the Nup60 acetylation pathway as we proposed to do for CLN2 (see our response to point 1 of this reviewer). These or equivalent experiments will be included in a revised version.

      3) The manuscript investigates in detail the effects of a KN mutant, however, a non-acetylatable mutant is not investigated. Is such a mutant viable?

      We have obtained a Nup60-KR mutant, which is predicted to behave as a non-acetylatable mimic, and it is viable. We will describe its phenotype in a revised version.

      Minor comments:

      4) Figure 2E: Is the rescue really specific to daughter cells? The dynamic range in the daughter cells is much higher due to the slower and more heterogenous timepoint of Whi5 export. However, zoom-in on the early timepoints after Whi5 import before the 30 min when 50% of the cells have exported Whi5, might reveal a significant increase of mother cells with shortened time to S phase entry. I suggest that the authors test this possibility. The cells shown in the image panels also suggest that the acetyl mimic might shorten mother cell time to S phase entry. If this is not the case, the authors might want to show a different example cell. Interestingly, it appears from the supplementary figure S5, that while Nup60 K647N partially rescues the export of Whi5, budding does not seem to be different to Nup60 wt. This appears to contradict the budding after alpha factor arrest shown in figure 2.

      We thank the reviewer for this suggestion. Indeed, zooming into the first 30 minutes shows a slight increase in the fraction of nup60-KN mother cells that export Whi5; however this change is not statistically significant when considering the entire cell population (p=0.6017, Mann-Whitney test). Therefore, we will replace the cell shown in figure 2E with a more representative example.

      As for figure S5, the reviewer is correct that in these experiments nup60-KN partially rescues Whi5 export (a marker of Start) but not budding (a downstream event), and this is indeed in variance with the experiment shown in figure 2B. Different experimental conditions may contribute to this apparent discrepancy: as noted in the text, the duration of G1 phase in cells synchronised with alpha factor is not directly comparable with that of freely cycling cells.

      5) Figure 3C: The authors use a truncated version of SAC3 for overexpression, since the full length is toxic (Figure S6A). I think it would be important to include this information in the main text.

      We agree, and have included this information in the main text.

      6) Figure 4B: Is there simply less Sac3 protein in the esa1-ts mutant? Although the authors address this question in figure S9, the very low expression levels of Sac3 may make this difficult to conclude from fluorescence quantification. A Western Blot would be an important control. The relative level of Sac3 still seems to be lower in esa1-ts daughter cells compared to mother cells, but no statistical test is shown.

      We are confident that the total Sac3-GFP levels are sufficient to make accurate comparisons, in both the nucleus and the entire cell. However, we will be happy to include western blot controls for Sac3 total levels in a revised version as the reviewer suggests. As for the levels of Sac3 in M vs D cells: Sac3 is indeed asymmetrically distributed in both wild-type and esa1-ts cells (p

      7) Analysis of mother daughter pairs (e.g. figure 5C): a paired t-test would be appropriate.

      We agree. Results do not change with this new analysis (in fact, p values are even lower for wild-type M-D pairs in figure 5C).

      8) Figure 5A: Can some representative mother-daughter pairs be shown as images for both wt and mutant in the timelapse? It is difficult to see in 5A whether there are any mother daughter pairs.

      We have modified the figure to include clearly identifiable mother-daughter pairs, as requested.

      9) Figure 4C: Please show image of localization of Sac3-GFP-FRB +/- rapamycin to the NPC.

      We have added this.

      Significance:

      This manuscript describes an important advance in understanding the role of non-histone protein modification on the regulation of cell cycle progression and gene expression. It is a logical follow-up on a previous paper from the lab (Kumar et al. 2018) and beautifully builds on this work. It is to my knowledge the first mechanistic description of regulation of nuclear pore complex function by a post-translational modification. This will therefore be a very interesting paper for anyone interested in nuclear pore complex regulation and biology, non-histone protein acetylation, asymmetric cell division, and cell cycle regulation.

      Reviewer #3

      Evidence, reproducibility and clarity:

      The pre-print is dedicated to mRNA export and G1/S transition control in mother and daughter cells of budding yeasts through acetylation/deacetylation of nuclear pore component Nup60 (hsNup153). In particular, authors found that Esa1(hsTip60/KAT5) acetylates the basket nucleoporin Nup60, and this event promotes recruitment of mRNA export factors to the nuclear basket and export of polyA RNA to the cytosol. This export event promotes entry of cells into S phase; in particular, Nup60 is deacetylated by histone deacetylase Hos3 that displaces mRNA export complexes from the NPC and inhibits Start specifically in daughter cells.

      The manuscript is a well-designed and well-written study.

      Please, see my major and minor suggestions below:

      Major comments:

      1. P4-5. "deacetylation of the nuclear basket nucleoporin Nup60 does not affect Whi5 nuclear accumulation". I was confused by this statement because, in the previous article Kumar et al., 2018, both main text and abstract have the following phase "nuclear basket and central channel nucleoporins establish daughter-cell-specific nuclear accumulation of the transcriptional repressor Whi5.." Could you please address this discrepancy?

      Thank you for pointing this out. We should have written: “deacetylation of Nup60 does not strongly affect Whi5 nuclear accumulation”. The Kumar et al. paper shows that deacetylation of central channel nucleoporins (such as Nup49) is important to increase accumulation of Whi5 in daughter cells, whereas deacetylation of the basket nucleoporin Nup60 plays a relatively minor role (see Kumar et al, Figure 7c). We have corrected this in the main text.

      Fig.2A: In addition to increased Nup60 acetylation, I noticed an overall increased level of Nup60 after overexpression of Esa1 and Gcn5. Is it a statistically significant increase in the Nup60 level? It is not mentioned in the main text or figure legend. Does the acetylation level of Nup60 influence its stability?

      We don’t know if acetylation of Nup60 affects its stability, although it is an intriguing possibility. Although it´s true that Nup60 levels in the IP fraction seem to increase upon Esa1 and Gcn5 overexpression, nuclear levels of Nup60-mCherry are similar in wild-type, hos3∆ and nup60-KN (Supplementary Figure S11A). Therefore it is unlikely that changes in Nup60 acetylation affect its stability. We have added this information to the text.

      Authors determined the mRNA level of four representative genes in esa1-ts and esa1-ts nup60-KN cultures.

      3a. Do authors know if Nu60-KN expression affects the perinuclear positioning of these transcripts?

      We did not investigate the localisation of individual transcripts in this study. However, as mentioned in our replies to reviewer 2, we propose to do so for the CLN2 and GAL1 mRNAs, in order to test directly the effect of Nup60 acetylation in the positioning of specific mRNAs.

      3b.I also suggest authors investigate if Nup60-KN affects other transcripts using the RNAseq approach. Nup60-KN might improve the transcription output of other transcripts and it will be interesting to know if these transcripts share similar features.

      We agree that investigating the impact of Nup60 acetylation in mRNA synthesis genome-wide is an exciting challenge. We speculate that Nup60-KN is likely to have some effect in transcription, either directly or indirectly through perturbation of feedback regulatory loops caused by mRNA export defects (for instance, transcription of both CLN2 and GAL1 is regulated by positive feedback). However we think that these experiments are beyond the scope of our study, which is focused on mRNA export.

      3c. Do authors know if GAL1pr:HOS3-NLS expression affects specifically G1-dependent transcripts?

      Answering this question would require RNA sequencing experiments. As mentioned in the previous point, we think these are beyond the scope of our study. That being said, it is likely that the Hos3-Nup60 pathway downregulates gene expression during G1, because Nup60 deacetylation is largely restricted to this phase. Note that this is not the same as regulating expression of the G1/S regulon specifically, because Hos3 also regulates GAL1 expression (Figure 5). We mention this important point in the discussion (p. 17).

      3d. Another interesting question will be to define if there is a group of transcripts that respond specifically to the status of Nup60 acetylation during G1/S transition. Is it possible to make ts-driven Nup60-KN expression to turn in ON/OFF? However, this question is beyond the scope of this paper.

      Thank you for this interesting suggestion. The proposed experiment is technically possible (for example, expression of Nup60-KN could be induced in G1 using a GAL1 promoter, followed by RNA sequencing). We agree that this is beyond the scope of our paper but would like to explore the question in future studies.

      1. Fig.2D It is not mentioned that Cln2 is not cycling anymore upon Nup60-KN overexpression.

      The Cln2 protein peaks at 30 minutes in this experiment, and is degraded at approximately 120 minutes. This corresponds to the slow, incomplete G1/S transition wave of the esa1-ts nup60-KN mutant, as indicated in the budding index at the bottom of the panel. We added this in the figure 2 legend. Note that Nup60-KN is not overexpressed, since the KN mutation is inserted in the endogenous gene under the control of its native promoter.

      Fig.2E. Arrows indicating Whi5 export timing do not match to the numbers in the main text. For example, yellow arrows indicate Whi5 export in wt strain at 30 and 78 min, but it is stated 15 and 59 min in the text. Also, do I understand right that Whi5-mCherry is not visible in the cytosol?

      See our reply to reviewer 2, point 4: we will replace the cell shown in figure 2E with a more representative example. As for Whi5-mCherry, it is visible in the cytoplasm but only weakly (since it is diluted into the larger cytoplasmic volume), and not at all in the images shown due to the overlay with the brightfield channel.

      Did the authors analyze where SAC3 and MTR2 are localized in hos3del, Nup60KN, and Esa-ts strains once their localization was affected in the nucleus? Is the overall level Sac3 level is affected in hos3del and Nup60KN strains?

      We have imaged the localisation of Sac3-GFP and Mtr2-GFP during the whole cycle using time-lapse microscopy. Our impression is that in wild type cells, their perinuclear levels increase during S phase in daughter cells, which mirrors the increase in Nup60 acetylation. In contrast, Sac3 and Mtr2 perinuclear levels seem more stable in hos3 and nup60-KN cells. We will include these analyses in a revised version. The total level of Sac3 is not affected, as shown in the updated figure 4; see our reply to reviewer 2, point 6.

      Fig4C. "Sac3-GFP-FRB partitioned equally to M and D nuclei, in the presence of Nup60-mCherry-FKBP and rapamycin (Figure 4C)." Sac3-GFP-FRB is slightly elevated in mother cells. Did you run a statistical test between the first and the third column on the box plot?

      Comparing the first and third columns in Fig 4C (Nup60 and Sac3 in control cells) shows that the mother cell accumulation is higher for Sac3 than for Nup60 (p

      P15. "GAL1 expression levels were higher in wild-type mother cells than in their daughter, and these differences were absent in cells lacking Hos3 or expressing Nup60KN". GAL1-10 promoter contains information necessary and sufficient for recruitment to the nuclear periphery (PMID: 27489341). I wonder if GAL1pr-driven transgenes of HOS3, spt10, hat1, and etc., contain DNA sequences sufficient for targeting genes to the nuclear periphery, and these genes are asymmetrically expressed in mother and daughter cells because of the presence of GAL1pr?

      We agree that these genes may be expressed at different levels in mother and daughter cells. We don’t think this asymmetric expression affects our conclusions. Indeed, the phenotypes scored (growth on plates) apply to the population and not to individual cells. The one exception is figure 3D, in which mRNA nuclear accumulation is scored in single cells. In this case, it remains possible that some of the variability observed corresponds to differences between mothers and daughters. In this case, our measurements could under-estimate the effect of Hos3-NLS in inhibition of mRNA export. However, since we cannot differentiate M and D cells in this experiment, we prefer not to speculate on this possibility in the text.

      Minor comments:

      1. Supplementary Fig. S1, it will be easy to read cell viability assays if 1A, S1A and S1B figures have the same orientation.

      We have changed the figure as suggested.

      Could you please clarify the difference between HOS3-NLS and GAL1pr:HOS3-NLS in the text of figure legend? P.33

      We have fixed this (figure 1 legend).

      P6. I recommend adding the following sentence to help clarity of the text: "To understand how NPC acetylation regulates the G1/S transition (Start), we sought to identify the lysine acetyl-transferases (KATs) counteracting the activity of the Hos3 deacetylase. Hos3 displays asymmetric distribution between mother and daughter cells in wild type Saccharomyces cerevisiae. Overexpression of a version of Hos3 fused to a nuclear localization signal (GAL1pr-HOS3-NLS) leads to targeting of Hos3 to mother and daughter cell nuclei, deacetylation of nucleoporins, and inhibition of cell proliferation (Kumar et al, 2018)."

      We thank the reviewer for this suggestion. This has been added.

      P8. Misspelling: Though Nup60 acetylation

      This has been fixed.

      FigS7. Description of polyA distribution is missing for single gcn5del strain.

      Thank you for pointing this out. This has been added.

      Misspelling: We conclude that Esa1 and Nup60 acetylation promotes Start, at least in part, by targeting Sac3 to the nuclear basket, where it mediates mRNA export.

      This has been fixed.

      Significance

      Authors of this pre-print overview and try to resolve a fundamental and not well-studied question about NPC acetylation status and S phase entry. This work is a logical extension of their previously published work (PMID: 29531309). However, this study for the first-time links status of NPC acetylation to mRNA export through lysine acetyl transferases. It will be interesting to address this question in mammalian cells considering interaction of basket nucleoporins with Tip60/KAT5 (PMID: 24302573).

      This work might be of interest to researchers investigating RNA export, transcription regulation, and nuclear pores.

      My fields of expertise are RNA export, nucleoporins, transcription regulation.

      I do not have expertise to evaluate yeast strains used in this study.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      I found this an exceptionally impressive manuscript. The evolution of Y chromosomes has until recently been nearly impossible, and this research group have pioneered approaches that can yield reliable results in Drosophila. The study used an innovative heterochromatin-sensitive assembly pipeline on three D. simulans clade species, D. simulans, D. mauritiana and D. sechellia, which diverged less than 250 KYA, allowing comparisons with the group's previous results for the D. melanogaster Y.

      The study is both technically impressive and extremely interesting (an highly unusual combination). It includes a rich set of interesting results about these genome regions, and furthermore the results are discussed in a well-organised way, relating both to previous observations and to understanding of the genetics and evolution of Y chromosomes, illuminating all these aspects. It is a rare pleasure to read such a study. I believe that this study will inspire and be a model for future work on these chromosomes. It shows how these difficult genome regions can be studied.

      Thank you for the positive evaluation of our paper. While we did not make any specific revisions in response to these comments, we did attempt to improve the writing.

      **Major comments:**

      The conclusions are convincing. The methods are explained unusually clearly, and the reasoning from the results is convincing. When appropriate, the caveats, the caveats are clearly explained. The material is clearly organised and the questions studied are well related to the results. I had a few minor comments concerning the English. Even the figure (often a major problem to understand) are very clear and helpful, with proper explanations. I have very rarely read such a good manuscript, and almost never (in a long career) found a manuscript that could be published without revision being necessary.

      Thank you for pointing out that there were minor concerns with the English. We have carefully gone through the manuscript and fixed some minor issues with the writing. The analysis found 58 exons missed in previous assemblies (as well as all previously known exons of the 11 canonical Y-linked genes, which are present in at least one copy across the group). FISH on mitotic chromosomes using probes for 12 Y-linked sequences was used to determine the centromere locations, and to determine gene orders and relate them to the cytological chromosome bands, demonstrating changes in satellite distribution, gene order, and centromere positions between their Y chromosomes within the D. simulans clade species. It also confirmed previous results for Y-linked ribosomal DNA,genes, which are responsible for X-Y pairing in D. melanogaster males. Although 28S rDNA has been lost in D. simulans and D. sechellia (but not in D. mauritiana), the intergenic spacer (IGS) repeats between these repeats are retained on both sex chromosomes in all three species. Only sequencing can reliably reveal this, as their abundance is below the detection level by FISH in D. sechellia. The 11 canonical Y-linked genes' copy numbers vary between the species, and some duplicates are expressed and have complete open reading frames, and may therefore be functional because they, but most include only a subset of exons, often with duplicated exons flanking the the presumed functional gene copy. Mega-introns and Y-loops were found, as already seen in Drosophila species, but this new study detects turn overs in the ~2 million years separating D. melanogaster and the D. simulans clade. 49 independent duplications onto the Y chromosome were detected, including 8 not previously detected. At least half show no expression in testes, or lack open reading frames, so they are probably pseudogenes. Testis-expressed genes may be especially likely to duplicate into the Y chromosome due to its open chromatin structure and transcriptional activity during spermatogenesis, and indeed most of the new Y-linked genes in the species studied clade have likely functions in chromatin modification, cell division, and sexual reproduction. The study discovered two new gene families that have undergone amplification on D. simulans clade Y chromosomes, reaching very high copy numbers (36-146). Both these families appear to encode functional protein-coding genes and show high expression. The paper described intriguing results that illuminate Y chromosome evolution. First, SRPK, arose by an autosome-to-Y duplication of the sequence encoding the testis-specific isoform of the gene SR Protein Kinase (SRPK), after which the autosomal copy lost its testis-specific exon via a deletion. In D. melanogaster, SRPK is essential for both male and female reproduction, so the relocation of the testis-specific isoform to the Y chromosome in the D. simulans clade suggests that the change may have been advantageous by resolving sexual antagonism. The paper presents convincing evidence that the Y copy evolved under positive selection, and that gene amplification may confer advantageous increased expression in males. The second amplified gene family is also potentially related to an interesting function. Both X-linked and Y-linked duplicates are found of a gene called Ssl located on chromosome 2R. In D. simulans, the X-linked copies were previously known, and called CK2ßtes-like. In D. melanogaster, degenerated Y-linked copies are also found, with little or no expression, contrasting with complete open reading frames and high expression in the D. simulans clade species in testes, consistent with the possibility of an arms race between sex chromosome meiotic drive factors. Other interesting analyses document higher gene conversion rates compared to the other chromosomes, and evidence that these Y chromosomes may differ in the DNA-repair mechanisms (preferentially using MMEJ instead of NHEJ), perhaps contributing to their high rates of intrachromosomal duplication and structural rearrangements. The authors relate this to evidence for turnover of Y-linked satellite sequences, with the discovery of five new Y-linked satellites, whose locations were validated using FISH. The study also documented enrichment of LTR retrotransposons on the D. simulans clade Y chromosomes relative to the rest of the genome, together with turnovers between the species.

      Reviewer #1 (Significance (Required)):

      As described above, the advances are both, technical and conceptual for the field. The manuscript itself does an excellent job of placing the work in the context of the existing literature.

      • Anyone working on sex chromosomes and other non-recombining genome regions should be interested in the findings reported.

      • My field of expertise is the evolution of sex chromosomes, and the evolution of genome regions with suppressed recombination. I have experience of genomic analyses. I have less expertise in analyses of gene expression, but I understand enough about such approaches to evaluate the parts of this study that use them.

      Reviewer #2:

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The manuscript describes a thorough investigation of the Y-chromosomes of three very closely related Drosophila species (D. simulans, D. sechellia, and D. mauritiana) which in turn are closely related to D. melanogaster. The D. melanogaster Y was analysed in a previous paper by the same goup. The authors found an astonishing level of structural rearrangements (gene order, copy number, etc.), specially taking into account the short divergence time among the three species (~250 thousand years). They also suggest an explanation for this fast evolution: Y chromosome is haploid, and hence double-strand breaks cannot be repaired by homologous recombination. Instead, it must use the less precise mechanisms of NHEJ and MMEJ. They also provide circumstantial evidence that MMEJ (which is very prone to generate large rearrangements) is the preferred mechanism of repair. As far as I know this hypothesis is new, and fits nicely on the fast structural evolution described by the authors. Finally, the authors describe two intriguing Y-linked gene families in D. simulans (Lhk and CK2ßtes-Y), one of them similar to the Stellate / Suppressor of Stellate system of D. melanogaster, which seems to be evolving as part of a X-Y meiotic drive arms race. Overall, it is a very nice piece of work. I have four criticisms that, in my opinion, should be addressed before acceptance.

      Thank you for your positive comments. We respond to your concerns point-by-point below.

      The suggestion/conclusion that MMEJ is the preferential repair mechanism (over NHEJ) should be better supported and explained. At line 387, the authors stated "The pattern of excess large deletions is shared in the three D. simulans clade species Y chromosomes, but is not obvious in D. melanogaster (Fig 6B). However, because all D. melanogaster Y-linked indels in our analyses are from copies of a single pseudogene (CR43975), it is difficult to compare to the larger samples in the simulans clade species (duplicates from 16 genes). ". Given that D. melanogaster has many Y-linked pseudogenes (described by the authors and by other researchers, and listed in Table S6), there seems to be no reason to use a sample size of 1 in this species.

      We only used pseudogenes with large alignable regions (>300 bp) to prevent the potential bias toward small indels and increase our confidence in indel calling. As a result, we excluded most of the duplicates on the D. melanogaster Y chromosome. We now include 5 additional D. melanogaster Y-linked indels in the manuscript, however, the majority of indels in this species (36/41) are still from the same gene.

      Furthermore, given that D. melanogaster is THE model organism, it is the species that most likely will provide information to assess the "preferential MMEJ" hypothesis proposed by the authors.

      A previous paper has shown that male flies deficient in MMEJ have a strong bias toward female offspring (McKee et al. 2000), suggesting that MMEJ is necessary for successfully producing Y-bearing sperm, consistent with our hypothesis. We agree with the reviewer that careful genetic and cytological experiments in D. melanogaster could further clarify the role of MMEJ in the repair of Y-linked mutations. Even more revealing would be experiments using the simulans clade species, where we hypothesize the MMEJ bias is even more pronounced on the Y chromosome. We believe, however, that these experiments are beyond the scope of this study and should merit their own papers.

      Still on the suggestion/conclusion that MMEJ is the preferential repair mechanism (over NHEJ). Y chromosome in heterochromatic, haploid and non-recombining. In order to ascribe its mutational pattern to the haploid state (and the consequent impossibility of homologous recombination repair), the authors compared it to chromosome IV (the so called "dot chromosome"). This may not be the best choice: while chr IV lacks recombination in wild type flies, it is not typical heterochromatin. E.g., " results from genetic analyses, genomic studies, and biochemical investigations have revealed the dot chromosome to be unique, having a mixture of characteristics of euchromatin and of constitutive heterochromatin". Riddle and Elgin, FlyBook 2018 (https://doi.org/10.1534/genetics.118.301146). Given this, it seems appropriate to also compare the Y-linked pseudogenes with those from typical heterochromatin. In Drosophila, these are the regions around the centromeres ("centric heterochromatin"). There are pseudogenes there; e.g., the gene rolled is known to have partially duplicated exons.

      Thank you for the suggestion. We now include the data from pericentric heterochromatin and pseudogenes in supplemental data (see Fig 7). Both data types support our conclusion that indel size is only larger on Y chromosomes, which is consistent with the comparison between the dot chromosome and pericentric heterochromatin reported by Blumenstiel et al. 2002.

      In some passages of the ms there seems to be a confusion between new genes and pseudogenes, which should be corrected. For example, in line 261: "Most new Y-linked genes in D. melanogaster and the D. simulans clade have presumed functions in chromatin modification, cell division, and sexual reproduction (Table S7)".. Who are these "new genes"? If they are those listed in Table S6 (as other passages of the text suggest), most if not all of them are pseudogenes. If they are pseudogenes, it is not appropriate to refer to them as "new genes". The same ambiguity is present in line 263: "Y-linked duplicates of genes with these functions may be selectively beneficial, but a duplication bias could also contribute to this enrichment (...) " Pseudogenes can be selectively beneficial, but in very special cases (e.g.. gene regulation). If the authors are suggesting this, they must openly state this, and explain why. Pseudogenes are common in nearly all genomes, and should be clearly separated from genes (the later as a shortcut for functional genes). The bar for "genes" is much higher than simple sequence similarity, including expression, evidences of purifying selecion, etc., as the authors themselves applied for the two gene families they identified in D. simulans (Lhk and CK2ßtes-Y)

      Thank you for the suggestion. We now state our criteria for calling genes based on the expression and long CDS and correct the sentences that the reviewer refers to. The protein evolution rates of many Y-linked duplicates were surveyed in Tobler et al. 2017, who found that most are not under strong purifying selection. Our study supports this previous report. We think that protein evolution rate alone may not be a good indicator for functionality. Our current study does not focus on the potential function of these genes, and we think further population studies are required to get a solid conclusion. We changed the text to clarify this point: “Most new Y-linked duplications in D. melanogaster and the D. simulans clade are from genes with presumed functions in chromatin modification, cell division, and sexual reproduction (Table S7), consistent with other Drosophila species [17, 77].” (p15 L281-284)

      The authors center their analysis on "11 canonical Y-linked genes conserved across the melanogaster group ". Why did they exclude the CG41561 gene, identified by Mahajan & Bachtrog (2017) in D. melanogaster? Given that most D. melanogaster Y-linked genes were acquired before the split from the D. simulans clade (Koerich et al Nature 2008), the same most likely is true for CG41561 (i.e., it would be Y-linked in the D. simulans clade). Indeed, computational analysis gave a strong signal of Y-linkage in D. yakuba (unpublished; I have not looked in the other species). If CG41561 is Y-linked in the simulans clade, it should be included in the present paper, for the only difference between it and the remaining "canonical genes" was that it was found later. Finally, the proper citation of the "11 canonical Y-linked genes" is Gepner and Hays PNAS 1993 and Carvalho, Koerich and Clark TIG 2009 (or the primary papers), instead of ref #55.

      Thank you for the suggestion. CG41561 is indeed a relatively young Y-linked gene because it’s not Y-linked in D. ananassae (Muller’s element E). We already have CG41561 in Table S6 and we think that it is reasonable to separate a young Y-linked gene from the others. We also fixed the reference as suggested (p5 L116).

      Other points/comments/suggestions:

      1. a) Possible reference mistake: line 88 "For example, 20-40% of D. melanogaster Y-linked regulatory variation (YRV) comes from differences in ribosomal DNA (rDNA) copy numbers [52, 53]." reference #53 is a mouse study, not Drosophila. Thank you for pointing out this error, we fixed the reference (p4 L91).

      2. b) Possible reference mistake: line 208 "and the genes/introns that produce Y-loops differs among species [75]". ref #75 is a paper on the D. pseudoobscura Y. Is it what the authors intended? Yes, our previous paper (ref 75) found that Y-loops do not originate from the kl-3, kl-5, and ORY genes in D. pseudoobscura because they don’t have large introns in this species.

      c) line 113. "We recovered all known exons of the 11 canonical Y-linked genes conserved across the melanogaster group, including 58 exons missed in previous assemblies (Table S1; [55])." Please show in the Table S1 which exons were missing in the previous assemblies. I guess that most if not all of these missing exons are duplicate exons (and many are likely to be pseudogenes). If they indeed are duplicate exons, the authors should made it clear in the main text, e.g., "We recovered all known exons of the 11 canonical Y-linked genes conserved across the melanogaster group, plus 58 duplicated exons missed in previous assemblies."

      Thank you for the suggestion. However, the 58 exons did not include the duplicated exons. We are similarly surprised how much we will miss if we don’t assemble the Y chromosome carefully. We now mark these exons in red in Table S1 to make this point clearer.

      d) line 116 "Based on the median male-to-female coverage [22], we assigned 13.7 to 18.9 Mb of Y-linked sequences per species with N50 ranging from 0.6 to 1.2 Mb." The method (or a very similar one) was developed by Hall et al BMC Genomics 2013, which should be cited in this context. e) line 118: "We evaluated our methods by comparing our assignments for every 10-kb window of assembled sequences to its known chromosomal location. Our assignments have 96, 98, and 99% sensitivity and 5, 0, and 3% false-positive rates in D. mauritiana, D. simulans, and D. sechellia, respectively (Table S2). The procedure is unclear. Why break the contigs in 10kb intervals, instead of treating each as an unity, assignable to Y, X or A? The later is the usual procedure in computational identification of suspect Y-linked contigs (Carvalho and lark Gen Res 2013; Hall et al BMC Genomics 2013). The only reason I can think for analyzing the contigs piecewise is a suspicion of misassemblies. If this is the case, I think it is better to explain.

      Thank you for the suggestion. We did not break the contigs into 10kb intervals when we assigned the Y-linked contigs. As you suspect, our motivation for evaluating our methods and analyzing the contigs in 10kb intervals was to detect possible misassemblies. We rewrote the sentence to make this point clearer (p6 L129-132).

      1. f) Fig. 1. It may be interesting to put a version of Fig 1 in the SI containing only the genes and the lines connecting them among species, so we can better see the inversions etc. (like the cover of Genetics , based on the paper by Schaeffer et al 2008). Thank you for the suggestion. We would like to make a figure like that fantastic cover image you refer to, but the repetitive nature of the Y chromosome makes it difficult to illustrate rearrangements based on alignments at the contig-level. We instead opted to update Figure 1 to better highlight the rearrangements, still based on the unique protein-coding genes which are supported by the FISH experiments.

      2. g) Table S6 (Y-linked pseudogenes). Several pseudogenes listed as new have been studied in detail before: vig2, Mocs2, Clbn, Bili (Carvalho et al PNAS2015) Pka-R1, CG3618, Mst77F (Russel and Kaiser Genetics 1993; Krsticevic et al G3 2015) . Note also that at least two are functional (the vig2 duplication and some Mst77 duplications). Thank you for the suggestion. We now include a column to indicate the potential function of Y-linked duplicates (see Table S6).

      h) line 421: "one new satellite, (AAACAT)n, originated from a DM412B transposable element, which has three tandem copies of AAACAT in its long terminal repeats." The birth of satellites from TEs has been observed before, and should be cited here. Dias et al GBE 6: 1302-1313, 2014.

      Thank you for the suggestion. We now include a sentence to cite this reference (p27 L467-468).

      1. i) Fig S2 shows that the coverage of PacBio reads is smaller than expected on the Y chromosome. Any explanation? This has been noticed before in D. melanogaster, and tentatively attributed to the CsCl gradient used in the DNA purification (Carvalho et al GenRes 2016). However, it seems that the CsCl DNA purification method was not used in the simulans clade species (is it correct?). Please explain the ms, or in the SI. The issue is relevant because PacBio sequencing is widely believed to be unbiased in relation to DNA sequence composition (e.g., Ross et al Genome Biol 2013). Yes, we used Qiagen's Blood and Cell Culture DNA Midi Kit for DNA extraction. We suspect that the underrepresentation of Y-linked reads is driven by the presence of endoreplicated tissue in adults. Heterochromatin is underreplicated in endoreplicated cells, and thus there may simply be less heterochromatin in these tissues. Consistent with this idea, we find that all heterochromatin seems to be underrepresented in the reads, not just the Y chromosome (see Chakraborty et al. 2021; Flynn et al. 2020). We now include this discussion in the SI of our paper (see supplementary text p75).

      2. j) I may have missed it, but in which public repository have the assemblies been deposited? We link to the assemblies in Github (https://github.com/LarracuenteLab/simclade_Y) and they will also be in the Dryad Digital Repository (doi forthcoming).

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Due to suppressed recombination, Y chromosomes have degenerated, undergone extensive structural rearrangements, and accumulated ampliconic gene families across species. The molecular processes and selective pressures guiding dynamic Y chromosome evolution are not well understood. In this study, Chang et al. generate updated Y assemblies of three closely related species in the D. simulans complex using long-read PacBio sequencing in combination with FISH. Despite having diverged only 250,00 years ago, the authors find structural rearrangements, two newly amplified gene families and evidence of positive selection across D. simulans. The authors also suggest the high level of Y duplications and deletions may be mediated by MMEJ biased repair.

      The authors generated a valuable resource for the study of Y-chromosome evolution in Drosophila and describe Y chromosome evolution patterns found in previous Y chromosome sequencing studies, such as newly amplified genes, positive selection, and structural rearrangements. The authors improvements to the Drosophila simulans clade Y chromosomes are commended, as assembly of the highly repetitive Y chromosome sequences is challenging. However, the manuscript is largely descriptive, the claims are largely speculative, and lacks a clear question. There are also a number of concerns with the text and figures (see below concerns). Overall, the manuscript would be significantly improved if the authors focused on a specific question as opposed to a survey of sequence features of the Y chromosome. For example, development of the idea that MMEJ is the primary mechanism for loss of Y chromosome sequence could be nice new twist.

      Our aim is to discover and understand the many different factors and processes that shape the evolution of Y chromosome organization and function. Because these Y chromosomes were largely unassembled, we needed to first generate the sequence assembly before we could ask specific questions. We prefer not to focus the manuscript solely on one specific topic such as MMEJ repair, as our other observations and analyses may be interesting to a wide range of scientists studying topics other than mutation and DNA repair. We are therefore choosing to present the more comprehensive story about Y chromosome evolution that we included in our original manuscript.

      We also respectfully disagree with the comment that our paper is just a descriptive survey of Y chromosomal sequence features. On the contrary, we present thorough evolutionary analyses to test hypotheses about the forces shaping the evolution of Y chromosome organization and Y-linked genes. Specifically, we use molecular evolution and phylogenetic and comparative genomics approaches to show that multi-copy gene families experience rampant gene conversion and positive selection. We posit that one simulans clade-specific Y-linked gene family has undergone subfunctionalization, potentially resolving sexual conflict, and another may be involved in meiotic drive. We also use evolutionary genomic approaches to show that the distribution of Y-linked mutations indeed suggests that Y chromosomes disproportionately use MMEJ and we propose that this unique feature may shape the evolution of Y chromosome structural organization. This is, as far as we know, a novel hypothesis. We think that follow-up studies of either hypothesis merit different papers.

      **Major concerns:**

      1. Title: The authors use "unique structure" in the title, which is a vague point. Are not Y chromosomes, or any chromosome, "unique" in some manner? Also are there not more evolutionary processes governing the rapid divergence of the Y's. Thank you for raising your concern. We believe that we are justified in referring to the Y chromosome as unique among all other chromosomes in its structural properties (e.g. combination of its hemizygosity, abundant tandem repeats, large scale rearrangements, and highly amplified testis-specific genes). Because there are many properties of Y chromosomes that we believe contribute to their rapid divergence, we opted for the general phrase ‘unique structure’ to capture all of these features. Many evolutionary processes likely shape the evolution of that unique structure (e.g. Muller’s Ratchet, background selection, Hill Robertson effects; see Charlesworth and Charlesworth 2000 for a review), and these processes are well-studied, especially on newly evolved sex chromosomes. Here our focus is on evolutionarily old Y chromosomes, which may have comparatively fewer targets of purifying selection and are more likely to be shaped by positive selection (Bachtrog 2008).

      p.2, line 53-56: The authors claim that sexually antagonistic selection and regulatory evolution are causes of recombination suppression. Couldn't this statement be reversed? Recombination suppression via inversions or other rearrangements enable sexually antagonistic selection. This is a chicken or egg question, so it should be revised to have both possibilities be equal.

      Thank you for the suggestion. We think that it is unlikely that recombination suppression itself is beneficial, but for sexually antagonistic selection and regulatory evolution, recombination suppression can have short-term benefits. We rephrased this sentence to be agnostic about the direction (p2 L56).

      p.5, 118-120: Are the assemblies de novo or have they been guided based upon the D. melanogaster Y chromosome assembly? Please clarify how the authors evaluate their methods by comparing their Y-sequence assignments to known chromosomal locations.

      Thank you for the suggestion. We didn’t use D. melanogaster Y chromosome assembly to guide our assemblies. “All assemblies are generated de novo”, and thus we don’t think there is any potential bias. We first assigned Y-linked sequences using the presence of known Y-linked genes, and used this assignment to evaluate our methods. We now make the sentence clear (p5 L112).

      While the gene copy number estimates are accurate, the PacBio-based genome assemblies are still not able to accurately assemble large segmental duplications (see Evan Eichler's laboratories recent primate and human genome assemblies). A statement mentioning the concerns about accuracy of the underlying sequence and genomic architecture shown should be included in the main text. FISH provides support for the location of the contigs, but not for the accuracy of the underlying genomic architecture.

      Thank you for the suggestion. We can’t validate all Y-linked regions. We did validate the larger structural features of the assembly and only discuss the results that we are confident in. We now include sentences to address this concern (p7 L150-152).

      The authors assigned Y-linked sequences based on median male-to-female coverage. Is this method feasible for assigning ampliconic sequence to the Y given the N50 of 0.6-1.2Mb? Are the authors potentially excluding novel Y-linked ampliconic sequence?

      We validated our methods to assign contigs to a chromosome by comparing 10-kb intervals to the contigs with known chromosomal location, including the Y chromosome. Our assignments have high (96, 98, and 99%) sensitivity and low (5, 0, and 3%) false-positive rates in D. mauritiana, D. simulans, and D. sechellia, respectively (see Table S2). Based on these results, we think that this method is reasonable for Y-linked contigs with N50 of 0.6-1.2Mb.

      We might exclude some novel Y-linked sequences since we only assigned ~15Mb out of a total ~40 Mb Y-linked sequences. We acknowledged this possibility, and now include a sentence to address this concern (p31 L554-556).

      Where did the rDNA sequences go in D. simulans and D. sechellia? Can they be detected on another chromosome?

      Please see Fig S5 for detailed results. We found a few copies of rDNA on the contigs of autosomes. We assembled many copies of rDNA that can’t be confidently assigned to Y chromosomes. It’s possible that they might be located on other chromosomes. Based on our FISH data (Fig S4) and previous papers, most of these non-Y-linked rDNA copies should be on the X chromosome. However, in this study, we did not make a concerted effort to assign X-linked contigs.

      Figure 2B is hard to follow and it is unclear what additional value it provides to part A. Why is expression level of specific exons important?

      Exon duplication may be an important contributor to Y-linked gene evolution: most genes have duplications and our figure shows that at least some of these duplicates are expressed. The patterns we see indicate that duplication may play different roles in genes depending on their length. For example, the duplications involving short genes (e.g., ARY) may be functional and influence protein expression, whereas duplications involving large genes (e.g. kl-2) may not influence the overall protein expression level from this gene, although the expressed duplicated exons may play some other role. We revised a sentence in the main text and added a sentence to the figure 2 legend to make this point clearer.

      Figure 3 There are many introns that contain gaps, so it is unclear how confident one can be in intron length when there are gaps.

      Indeed, we are not confident about the length of introns with gaps. Therefore, we separated these introns and showed them in different colors.

      Figure 4: What are the authors using as a common ancestor in this figure to infer duplications in the initial branch?

      We used phylogenies to infer the origin of Y-linked duplicates. Any duplications that happened earlier than the divergence between four species are listed in the branch. We also edited the legend to make this point clearer.

      p.15, paragraph 2: The authors describe a newly amplified gene, CK2Btes-Y, in D. simulans. In the first half of the paragraph the authors state that Y-linked copies are also found in D. melanogaster but have "degenerated and have little or no expression" and call them pseudogenes. Later in the paragraph, the authors state that the D. melanogaster Y-linked copies are Su(Ste), a source of piRNAs that are in conflict with X-linked Stellate. Lastly in the paragraph, the authors discuss Su(ste) as a D. melanogaster homolog of CK2Btes-Y. The logic of defining CK2Btes-Y origins is confusing. Was CK2Btes-Y independently amplified on the D. simulans Y, or were CK2BtesY and Su(Ste) amplified in a common ancestor but independently diverged?

      The amplification of CK2Btes-Y and CK2Btes-like happened in the ancestor of D. melanogaster and D. simulans (Fig S11). However, both CK2Btes-Y and CK2Btes-like became pseudogenes (D. melanogaster CK2Btes-Y is named PCKR in a previous study) in D. melanogaster. On the other hand, Ste and Su(Ste) are only limited to D. melanogaster based on phylogenetic analyses (Fig 5A) and are a chimera of CK2Btes-like and NACBtes. The evolutionary history of this gene family has been detailed in other papers, except for the presence of CK2Btes-Y in the D. simulans complex, which we describe for the first time in this study. We now include a new figure (Figure 5B) a schematic of the inferred evolutionary history of sex-linked Ssl/CK2ßtes paralogs

      Figure 5: Is each FISH signal a different gene copy?

      Yes, based on our assemblies, Lhk-1 and Lhk-2 are mostly located on different contigs. Unfortunately, we are not able to design probes that can separate Lhk-1 from Lhk-2.

      The authors suggest DNA-repair on the Y chromosome is biased towards MMEJ based on indel size and microhomologies. Is there any evidence MMEJ is responsible for variable intron length in the canonical Y-linked genes or the amplification of new gene families? Since MMEJ is error-prone, it's a more tolerable repair mechanism in pseudogenes, so their findings might be biased. Rather than comparing pseudogenes to their parent genes, they should compare chrY pseudogenes to autosomal pseudogenes. Even more would be to track MMEJ on the dot chromosome which is known not recombine and is highly heterchromatic like the Y chromosome.

      We did compare chrY pseudogenes to autosomal pseudogenes in our study. We also add new analyses to address other issues from reviewer 2, which are similar to your concern. We now include data from pericentric heterochromatin and pseudogenes (see Fig 7). Both data types support our conclusion that indel size is only larger on Y chromosomes. This is consistent with a report that the dot chromosome and pericentric heterochromatin have similar indel size distributions (Blumenstiel et al. 2002).

      Reviewer #3 (Significance (Required)):

      While it is a benefit to have much improved Y chromosome assemblies from the three D. simulans clade species, the gap in knowledge this manuscript is trying to address is unclear. The manuscript is almost entirely descriptive and the figures are difficult to follow.

      As stated above, we respectfully disagree with the comment that the manuscript is entirely descriptive, as we present thorough evolutionary analyses to test hypotheses about the forces shaping the evolution of Y chromosome organization and Y-linked genes. We have two guiding hypotheses about the importance of sexual antagonism and DNA repair pathways for Y chromosome evolution, and we conduct sequence analyses that support these hypotheses that sexual antagonism and MMEJ affect Y chromosome evolution.

      References cited in this response:

      Bachtrog D. The temporal dynamics of processes underlying Y chromosome degeneration. Genetics. 2008 Jul;179(3):1513-25. doi: 10.1534/genetics.107.084012. Epub 2008 Jun 18. PMID: 18562655; PMCID: PMC2475751.

      Blumenstiel, J.P., Hartl, D.L, Lozovsky, E.R.. Patterns of Insertion and Deletion in Contrasting Chromatin Domains, Molecular Biology and Evolution, Volume 19, Issue 12, December 2002, Pages 2211–2225, __https://doi.org/10.1093/oxfordjournals.molbev.a004045__

      Chakraborty M, Chang CH, Khost DE, Vedanayagam J, Adrion JR, Liao Y, Montooth KL, Meiklejohn CD, Larracuente AM, Emerson JJ. Evolution of genome structure in the Drosophila simulans species complex. Genome Res. 2021 Mar;31(3):380-396. doi: 10.1101/gr.263442.120. Epub 2021 Feb 9. PMID: 33563718; PMCID: PMC7919458.

      Charlesworth B, Charlesworth D. The degeneration of Y chromosomes. Philos Trans R Soc Lond B Biol Sci. 2000 Nov 29;355(1403):1563-72. doi: 10.1098/rstb.2000.0717. PMID: 11127901; PMCID: PMC1692900.

      Flynn,J, Long, M, Wing, RA, A.G Clark, Evolutionary Dynamics of Abundant 7-bp Satellites in the Genome of Drosophila virilis, Molecular Biology and Evolution, Volume 37, Issue 5, May 2020, Pages 1362–1375, https://doi.org/10.1093/molbev/msaa010

      McKee, Bruce D. et al. “On the Roles of Heterochromatin and Euchromatin in Meiosis in Drosophila: Mapping Chromosomal Pairing Sites and Testing Candidate Mutations for Effects on X–Y Nondisjunction and Meiotic Drive in Male Meiosis.” Genetica 109 (2004): 77-93.

      Tobler R, Nolte V, Schlötterer C. High rate of translocation-based gene birth on the Drosophila Y chromosome. Proc Natl Acad Sci U S A. 2017 Oct 31;114(44):11721-11726. doi: 10.1073/pnas.1706502114. Epub 2017 Oct 19. PMID: 29078298; PMCID: PMC5676891.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript describes a thorough investigation of the Y-chromosomes of three very closely related Drosophila species (D. simulans, D. sechellia, and D. mauritiana) which in turn are closely related to D. melanogaster. The D. melanogaster Y was analysed in a previous paper by the same goup. The authors found an astonishing level of structural rearrangements (gene order, copy number, etc.), specially taking into account the short divergence time among the three species (~250 thousand years). They also suggest an explanation for this fast evolution: Y chromosome is haploid, and hence double-strand breaks cannot be repaired by homologous recombination. Instead, it must use the less precise mechanisms of NHEJ and MMEJ. They also provide circumstantial evidence that MMEJ (which is very prone to generate large rearrangements) is the preferred mechanism of repair. As far as I know this hypothesis is new, and fits nicely on the fast structural evolution described by the authors. Finally, the authors describe two intriguing Y-linked gene families in D. simulans (Lhk and CK2ßtes-Y), one of them similar to the Stellate / Suppressor of Stellate system of D. melanogaster, which seems to be evolving as part of a X-Y meiotic drive arms race. Overall, it is a very nice piece of work. I have four criticisms that, in my opinion, should be addressed before acceptance.

      The suggestion/conclusion that MMEJ is the preferential repair mechanism (over NHEJ) should be better supported and explained. At line 387, the authors stated "The pattern of excess large deletions is shared in the three D. simulans clade species Y chromosomes, but is not obvious in D. melanogaster (Fig 6B). However, because all D. melanogaster Y-linked indels in our analyses are from copies of a single pseudogene (CR43975), it is difficult to compare to the larger samples in the simulans clade species (duplicates from 16 genes). ". Given that D. melanogaster has many Y-linked pseudogenes (described by the authors and by other researchers, and listed in Table S6), there seems to be no reason to use a sample size of 1in this species. Furthermore, given that D. melanogaster is THE model organism, it is the species that most likely will provide information to assess the "preferential MMEJ" hypothesis proposed by the authors. Still on the suggestion/conclusion that MMEJ is the preferential repair mechanism (over NHEJ). Y chromosome in heterochromatic, haploid and non-recombining. In order to ascribe its mutational pattern to the haploid state (and the consequent impossibility of homologous recombination repair), the authors compared it to chromosome IV (the so called "dot chromosome"). This may not be the best choice: while chr IV lacks recombination in wild type flies, it is not typical heterochromatin. E.g., " results from genetic analyses, genomic studies, and biochemical investigations have revealed the dot chromosome to be unique, having a mixture of characteristics of euchromatin and of constitutive heterochromatin". Riddle and Elgin, FlyBook 2018 (https://doi.org/10.1534/genetics.118.301146). Given this, it seems appropriate to also compare the Y-linked pseudogenes with those from typical heterochromatin. In Drosophila, these are the regions around the centromeres ("centric heterochromatin"). There are pseudogenes there; e.g., the gene rolled is known to have partially duplicated exons. In some passages of the ms there seems to be a confusion between new genes and pseudogenes, which should be corrected. For example, in line 261: "Most new Y-linked genes in D. melanogaster and the D. simulans clade have presumed functions in chromatin modification, cell division, and sexual reproduction (Table S7)".. Who are these "new genes"? If they are those listed in Table S6 (as other passages of the text suggest), most if not all of them are pseudogenes. If they are pseudogenes, it is not appropriate to refer to them as "new genes". The same ambiguity is present in line 263: "Y-linked duplicates of genes with these functions may be selectively beneficial, but a duplication bias could also contribute to this enrichment (...) " Pseudogenes can be selectively beneficial, but in very special cases (e.g.. gene regulation). If the authors are suggesting this, they must openly state this, and explain why. Pseudogenes are common in nearly all genomes, and should be clearly separated from genes (the later as a shortcut for functional genes). The bar for "genes" is much higher than simple sequence similarity, including expression, evidences of purifying selecion, etc., as the authors themselves applied for the two gene families they identified in D. simulans (Lhk and CK2ßtes-Y) The authors center their analysis on "11 canonical Y-linked genes conserved across the melanogaster group ". Why did they exclude the CG41561 gene, identified by Mahajan & Bachtrog (2017) in D. melanogaster? Given that most D. melanogaster Y-linked genes were acquired before the split from the D. simulans clade (Koerich et al Nature 2008), the same most likely is true for CG41561 (i.e., it would be Y-linked in the D. simulans clade). Indeed, computational analysis gave a strong signal of Y-linkage in D. yakuba (unpublished; I have not looked in the other species). If CG41561 is Y-linked in the simulans clade, it should be included in the present paper, for the only difference between it and the remaining "canonical genes" was that it was found later. Finally, the proper citation of the "11 canonical Y-linked genes" is Gepner and Hays PNAS 1993 and Carvalho, Koerich and Clark TIG 2009 (or the primary papers), instead of ref #55. Other points/comments/suggestions:

      a) Possible reference mistake: line 88 "For example, 20-40% of D. melanogaster Y-linked regulatory variation (YRV) comes from differences in ribosomal DNA (rDNA) copy numbers [52, 53]." reference #53 is a mouse study, not Drosophila.

      b) Possible reference mistake: line 208 "and the genes/introns that produce Y-loops differs among species [75]". ref #75 is a paper on the D. pseudoobscura Y. Is it what the authors intended?

      c) line 113. "We recovered all known exons of the 11 canonical Y-linked genes conserved across the melanogaster group, including 58 exons missed in previous assemblies (Table S1; [55])." Please show in the Table S1 which exons were missing in the previous assemblies. I guess that most if not all of these missing exons are duplicate exons (and many are likely to be pseudogenes). If they indeed are duplicate exons, the authors should made it clear in the main text, e.g., "We recovered all known exons of the 11 canonical Y-linked genes conserved across the melanogaster group, plus 58 duplicated exons missed in previous assemblies."

      d) line 116 "Based on the median male-to-female coverage [22], we assigned 13.7 to 18.9 Mb of Y-linked sequences per species with N50 ranging from 0.6 to 1.2 Mb." The method (or a very similar one) was developed by Hall et al BMC Genomics 2013, which should be cited in this context. e) line 118: "We evaluated our methods by comparing our assignments for every 10-kb window of assembled sequences to its known chromosomal location. Our assignments have 96, 98, and 99% sensitivity and 5, 0, and 3% false-positive rates in D. mauritiana, D. simulans, and D. sechellia, respectively (Table S2). The procedure is unclear. Why break the contigs in 10kb intervals, instead of treating each as an unity, assignable to Y, X or A? The later is the usual procedure in computational identification of suspect Y-linked contigs (Carvalho and lark Gen Res 2013; Hall et al BMC Genomics 2013). The only reason I can think for analyzing the contigs piecewise is a suspicion of misassemblies. If this is the case, I think it is better to explain.

      f) Fig. 1. It may be interesting to put a version of Fig 1 in the SI containing only the genes and the lines connecting them among species, so we can better see the inversions etc. (like the cover of Genetics , based on the paper by Schaeffer et al 2008).

      g) Table S6 (Y-linked pseudogenes). Several pseudogenes listed as new have been studied in detail before: vig2, Mocs2, Clbn, Bili (Carvalho et al PNAS2015) Pka-R1, CG3618, Mst77F (Russel and Kaiser Genetics 1993; Krsticevic et al G3 2015) . Note also that at least two are functional (the vig2 duplication and some Mst77 duplications).

      h) line 421: "one new satellite, (AAACAT)n, originated from a DM412B transposable element, which has three tandem copies of AAACAT in its long terminal repeats." The birth of satellites from TEs has been observed before, and should be cited here. Dias et al GBE 6: 1302-1313, 2014.

      i) Fig S2 shows that the coverage of PacBio reads is smaller than expected on the Y chromosome. Any explanation? This has been noticed before in D. melanogaster, and tentatively attributed to the CsCl gradient used in the DNA purification (Carvalho et al GenRes 2016). However, it seems that the CsCl DNA purification method was not used in the simulans clade species (is it correct?). Please explain the ms, or in the SI. The issue is relevant because PacBio sequencing is widely believed to be unbiased in relation to DNA sequence composition (e.g., Ross et al Genome Biol 2013).

      j) I may have missed it, but in which public repository have the assemblies been deposited?

      Significance

      see above.

    1. Author Response:

      Reviewer #2 (Public Review):

      Yu et al provide a comprehensive set of experiments to determine that bradyzoites have much slower cytosolic Ca2+ parameters, which impact on gliding motility, a key process of Toxoplasma spread and persistence.

      The only main criticism that I have is the use of the MIC2-GLuc reporter to measure microneme secretion in bradyzoites. Do bradyzoites have any appreciable level of MIC2 and its associated protein M2AP?? This is important that may affect the outcome. If bradyzoites do not, then the MIC2-GLuc reporter might not have appropriate levels of M2AP to correctly traffic to the micronemes. I recommend that the authors quantitate, either by western blot or IFA, the levels of MIC2 and M2AP in bradyzoites versus tachyzoites and also show that M2AP co-localises with MIC2-GLuc to give confidence that MIC2-GLuc is trafficked correctly and thus the low readings of secretion are not just a result of the reporter mistrafficked. It would also be pleasing to see, that 1hr incubation leads to restoration of MIC2-GLuc secretion.

      We acknowledge that the expression and localization of MIC2-Gluc reporter is a potential concern. We performed western blotting (Figure 2C) and IFA (Figure 2 supplement 1A) to confirm that bradyzoites express MIC2-Gluc and M2AP albeit at lower levels compared with tachyzoites. Moreover, MIC2-GLuc and M2AP were properly co-localized to the apical end in bradyzoites, ruling out the possibility of mis-localization of the MIC2-GLuc reporter. Based on these results, we believe that MIC2-GLuc provides a reliable read-out for microneme secretion in in vitro differentiated bradyzoites. Additionally, the conclusion that MIC secretion is dampened in bradyzoites is also supported by the studies using the FNR-Cherry reporter in Figure 2E,F,G.

      Reviewer #3 (Public Review):

      This is a first study that looks in detail at Ca-controlled gliding motility and ATP supply in bradyzoites. A comparison of such different parasite stage by manipulating Ca and ATP metabolism is challenging. Intervention by chemical compounds needs to overcome a prominent cyst wall and the usage of genetic tools needs to consider the broad changes in protein expression between tachyzoites and bradyzoites as well as a heterology between individual bradyzoites. The authors used excysted bradyzoites to exclude the cyst wall as a diffusion barrier as a major factor in the efficacy of different Ca agonists. To address differences in expression levels between tachyzoites and bradyzoite stages the authors developed a ratiometric Ca sensor based upon an autocleaved GCaMP6f-BFP dimer protein.

      Overall the conclusions are well supported but there are methodological questions that need to be addressed.

      Bradyzoites show a heterogenous expression of Bag1 / Sag1 markers as well as heterologous proteins. This is shown in Fig 1A and Fig 2b for example. However, in most time-dependent measurements of Ca-dependent fluorescence (Fig 2G, 3D the authors only average three cells. This appears to be insufficient to represent the bradyzoite population. How is the variance between the three measured cells?

      We have quantified more cells in all figures related to fluorescence measurements. For measurements of single parasites in Figure 5B, 5D, 5E, 6F, 8A, 8B and Figure 7 supplement 1A, we have now quantified 10 parasites for each condition and plotted the data as means ±S.D. to show the variance. For in vitro induced cysts or ex vivo cysts in Figure Fig 2G, 3D, 3E, 4C,4G, 6E, 7B and Figure 4 supplement 1A, we measured 5 cysts or vacuoles per condition. Because these samples contain many parasites within each vacuole or cyst, they represent a greater sample size. The data are also plotted a means ±S.D.

      In addition, the Mic2 promoter driven Gluc-myc protein is not expressed in all bradyzoites. This is perhaps not suprising as Mic2 seems to be downregulated in bradyzoites according to Pittman and Bucholz et al dataset in ToxoDB. If interpreted correctly the lower expression of Gluc in some bradyzoites would favour an underestimation of the RLUs in Fig 2D.

      We acknowledge that the expression and localization of MIC2-Gluc reporter is a potential concern. We performed western blotting (Figure 2C) and IFA (Figure 2 supplement 1A) to confirm that bradyzoites express MIC2-Gluc and M2AP albeit at lower levels compared with tachyzoites. Moreover, MIC2-GLuc and M2AP were properly co-localized to the apical end in bradyzoites, ruling out the possibility of mis-localization of the MIC2-GLuc reporter. Based on these results, we believe that MIC2-GLuc provides a reliable read-out for microneme secretion in in vitro differentiated bradyzoites. Additionally, the conclusion that MIC secretion is dampened in bradyzoites is also supported by the studies using the FNR-Cherry reporter in Figure 2E,F,G.

      The maturation of bradyzoite takes several weeks. This cannot be accomplished with currently available system in vitro and the authors use 1 week matured bradyzoites. To facilitate comparability to data from other manuscripts it would be helpful if the authors could quantify the differentiation stage of the in vitro bradyzoites. This could be done by measuring the fractions of Bag1-positive and Sag1-negative bradyzoites.

      We thank the reviewer for this useful comment. We have quantified the percentage of BAG1-positive SAG1-negative bradyzoites within each cyst induced for 3, 5 or 7 days by IFA and spinning disc confocal microscopy (Figure 3 supplement 1A). This analysis demonstrated that the percentage of BAG1-positive and SAG1-negative bradyzoites reached ~70% at day 7 after induction (Figure 3 supplement 1B). For this reason, we used a 7 day induction treatment for the majority of experiments. Also, where imaging was used in the analysis, we focused on regions of in vitro differentiated cysts that expressed high levels of BAG1-mCherry.

      The mcherry and GCaMP6f signal in fig 3B seem mutually exclusive. This may be due to difference in calcium signalling between Bag1 pos or neg parasites or due to expression differences of GCaMP6f.

      To test the possibility of expression differences in GCaMP6f, we quantified the fluorescence of BAG1-mCherry and GCaMP6f in different bradyzoites within the cyst shown in Figure 3B. At time 0 prior to stimulation, we observed heterogenous expression of BAG1- mCherry while the signal for GCaMP6f expression was relatively constant (Figure 3B supplement 1C and 1D). In contrast, when in vitro differentiated bradyzoites were stimulated with A23187, they showed reduced levels of GCaMP expression in cells that were strongly positive for BAG1-mCherry (Figure 3B). Collectively, these findings are consistent with the difference in GCaMP fluorescence being due to dampened calcium responses in bradyzoites rather than expression differences. This conclusion is supported by studies on GCaMP responses in cells where we normalized for expression level using a dual-expression BFP reporter in Figure 6. Therefore, we do not think that heterogeneity in the expression of GCaMP is responsible for the observed dampened response in bradyzoites.

      The authors use syringe, trypsin-released and FACS sorted bradyzoites in multiple Ca assays. How can it be excluded that this procedure affects (depletes) Ca stores?

      In all the figures except Figure 2C-2D, we did not use FACS to sort bradyzoites. Instead, we scraped cells cultured at pH 8.2, used syringe passage through 25g needle followed by centrifugation. Cyst pellets were resuspended and digested with trypsin to liberate bradyzoites. For tachyzoites, all procedures were similar except that we did not use trypsin digestion. As a control, we have now treated tachyzoites similarly with trypsin and monitored the calcium stores using ionomycin. We found that trypsin digestion did not affect the calcium stores or response as shown in Figure 7 figure supplement 1A.

      In my opinion several experiments in this manuscript would benefit from clarification of this point. For example: In Fig 7A Fu et al measure Ca for 5min during trypsin digestion, however, for gliding assays cysts are digested for 10min. The Ca monitoring should cover the complete 10min off trypsin digest.

      We understand the concern but there were practical reasons for the slightly different times used. In panel A where we are monitoring calcium during trypsin digestion, the majority of cysts are dispersed after 5 min resulting the parasites being out of focus. As such, it is not practical to monitor beyond this time point. In the panel C, we were interested in observing parasites after the cysts where fully digested and hence we used a slightly longer time period to allow complete digestion and for the parasites to settle to the bottom of the dish before further recording. In this instance, similar to the result in A, most parasites remained dormant and did not show elevated calcium levels. In the figure, we are selectively showing a rare example where calcium signaling was observed in order to compare the patterns to what is normally observed with tachyzoites. These combined panels are not meant to be a comparison of kinetics, as this aspect is tested more directly in later experiments. We have modified the text to make the rationale for this experiment clear.

      In Fig 2B Fu et al digest infected monolayers with trypsin to release mcherry from cysts matrices. How can the authors exclude that trypsin is not digesting mCherry protein in this assay?

      I think the reviewer means 2F as in 2B we are using BAG1 mCherry to visualize bradyzoites – but they are not being liberated in this image. In 2F we use a different construct, FnR-mCherry that directs the reporter to be constitutively secreted to either the PV (surrounding tachyzoites) or the cyst matrix (surrounding bradyzoites). When the cysts are disrupted with trypsin, the mCherry is likely to disperse and may also be digested. However, this would not happen if it remains inside the parasite. This control is provided to show that the protein is secreted into the matrix. We have revised the text to clarify the use of this control.

      Fig 7 E,F: the authors measure shorter gliding distances of bradyzoite as compared to tachyzoites. Trails of both parasites however, are detected by visualizing using different antigens that may have different shedding behavior on the FBS-coated glass surface. The Bag1 trail also depends on Bag1 expression, which is shown in numerous images to not be equal among individual bradyzoites. This point is very challenging to address but should at least be discussed.

      BAG1 is used here to discern the bradyzoites, not to detect the trail. Trails are stained with either SAG1 or SRS9 – corresponding to the most abundant surface GPI anchored antigen in each stage. Since these proteins are part of the same C-C fold family and are similarly anchored, we feel they are comparable. We have added the following statement to the results: “These two surface markers are both members of the cysteine rich SRS family that are tethered to the surface membrane by a GPI anchor, thus they represent comparable reporters for each stage.”

      Fig 7E: Bradyzoites are considered to satisfy their ATP needs mostly via glycolysis and the data shown do support this capability. I find the ability of OligomycinA to block glucose-dependent gliding surprising as this suggests a necessary mitochondrial transport chain for ATP-production from glucose. This result should be mentioned clearly in the text and its implications discussed.

      The Discussion has been revised as suggested.

      Figure 8: The authors claim a recovery of bradyzoite ATP and Ca levels after 1hr incubation with carbon sources and Ca, that together enable efficient gliding. However, the elevation of bradyzoite ATP occurs after the parasites spend 2 hours in glucose-free and Ca-free conditions, whereas gliding assays are done after a short 10min trypsin digest. I am not entirely convinced that low ATP levels post-egress are responsible for the low gliding activity. Ideally gliding assays should be done after a similar purification procedure to correlate the two experiments.

      We have repeated the gliding assays using bradyzoites purified in the same manner as for the ATP measurements and found the same result that a combination of exogenous calcium and glucose enhance recovery of gliding motility (Figure 8D, 8F). In addition, we used the same time point to purify bradyzoites for MIC2-Gluc secretion and found exogenous calcium and glucose also led to an increase in MIC2-GLuc secretion, indicative of the recovery of microneme secretion (Figure 8C).

    1. Author Response:

      Reviewer #1 (Public Review):

      Summary

      Moncunill et al set out to investigate a very important question: why are half of children vaccinated three times with RTS,S AS01 protected from clinical malaria - and half not? To do so they isolated PBMCs before vaccination and one month after third vaccination and stimulated them in vitro with DMSO (vehicle control), two malaria antigens (CSP (part of RTS,S) & AMA1) or HBS (hepatitis B antigen - part of RTS,S).

      They then assessed their transcriptional response by blood transcriptional module analysis and correlated those results with previous published data on antibody titers and T cell cytokine production to find associations. To assess risk of clinical malaria, responses were compared between RTS,S vaccinated children who developed clinical malaria in the one year follow-up (cases) and those who received RTS,S or a comparator vaccine and did not (controls). They found that responses after RTS,S vaccination did not predict protection from clinical malaria. Instead a blood transcriptional module signature related to dendritic cells, inflammation, and monocytes before vaccination may be associated with clinical malaria risk.

      Strengths

      Immune correlates of protection are evaluated in African children (who are the RTS,S target population) in a natural transmission setting.

      Excellent set of controls: children (same age) vaccinated with RTS,S or comparator vaccine alongside each other -> retrospectively stratified by whether the did or did not develop clinical malaria : controls for the effect of a developing immune system and would allow to disentangle RTS,S specific and clinical malaria specific response patterns.

      Weaknesses

      RTS,S is composed of CSP & HBS. yet when PBMCs from children vaccinated three time with RTS,S are stimulated with these peptides no transcriptional differences compared to children receiving a rabies or meningitis vaccine were detected (Figure 2). this lack of recall response impacts all downstream conclusions and comparisons made in the paper.

      The fact that bulk transcriptional profiling of Ag-stimulated PBMCs (and specifically to CSP) did not identify large significant differences in BTM expression between the RTS,S vs. comparator group could be due to several factors. First of all, the frequency of antigen-specific CD4+ T cells was very low among CD4+ T cells (Figure 4 of the manuscript shows that CSP-specific CD4+ T cells comprise < 0.004% of all CD4+ T cells). This low frequency of CSP-specific T cells is consistent with other RTS,S studies [e.g. as we state on line 330, we have previously found that CSP-specific T-cells in RTS,S/AS01 vaccines comprise < 0.10% of all CD4+ T cells (1)]. Moreover, CD4+ T cells themselves comprise approximately 45-57% of all PBMCs (2). Thus, finding an expression signal between the RTS,S vs. comparator group would require the signal to be high enough to be detected in only 0.002% of all PBMCs [0.004% (% CSP-specific CD4+ T cells out of total CD4+ T cells) x 51% (average % of CD4+ T cells out of all PBMCs) = 0.002%]. Thus, lack of detectable recall response does not mean lack of recall response. Moreover, as suggested below, we opted not to focus the rest of the manuscript on the Ag-stimulation results.

      Second of all, the PBMCs were stimulated on site for 12h and then cryopreserved. This stimulation time was chosen based on the kinetics of IFN- and IL-2 mRNA response (3), but other responses may have had different kinetics and thus have already resolved or have not yet occurred by the 12-h cryopreservation. We have added text in the manuscript to discuss these caveats (“Another potential reason for why no BTMs were found to associate with the response to RTS,S/AS01 vaccination or with protection when analyzing CSP-stimulated PBMC is that all PBMC were stimulated on site for 12 hours (this stimulation time was chosen based on the kinetics of the IFN-gamma transcriptional response) and then cryopreserved. Thus, we were unable to detect earlier transient responses that had already resolved by 12 hours, as well as more delayed response that had not yet initiated by 12 hours, if such responses occurred”).

      It should be noted that in all our analyses, the stimulated results were adjusted for DMSO to focus on the antigen-specific response only. This would explain why we detect signal in the DMSO samples but not in response to stimulation. We have realized that this was not very well described in the figure captions and the Methods section and have added more details, including the model description in Methods section. As such, we do not believe that these results impact all downstream conclusions. We believe that the unstimulated results provide significant new insights into the immune and molecular mechanisms of RTS,S vaccine efficacy, not necessarily directly related to the RTS,S-specific acquired immune response. Finally, we would like to highlight the fact that we have improved our model specification to directly account for the pairing of some of the samples using a random effect using the limma package. This has slightly increased statistical power, and as such the number of significantly differentially expressed BTMs in response to stimulation is a bit higher (but still much less than that for the DMSO). Originally, we had decided against the use of a random effect due to the computational cost of estimating the random effect.

      Transcriptional responses 1 month after the final RTS,S vaccination do not predict clinical malaria risk (Figure 3) - this is a key finding, which should be central to the conclusion of this paper.

      Considering that Kazmin et al. (4) showed that the transcriptional response to the third RTS,S/AS01 dose peaks at Day 1 post-injection, with some decline by Day 6 and approximately 90% of the response having waned by Day 21 (with the caveat that Kazmin et al.’s study population was malaria-naïve adults), we do not find it surprising that there were only a few BTMs whose 1 month post-final RTS,S dose associated with clinical malaria risk. However, the point is well-taken about the relative merits of the baseline. We have edited the Discussion to include discussion of the Month 3 correlates results:

      “Compared to the 45 BTMs whose baseline levels significantly associated with clinical malaria risk in RTS,S/AS01-vaccinated African children, fewer BTMs (seven) had levels at one month post-final RTS,S/AS01 dose that significantly associated with clinical malaria risk. Moreover, if a more stringent FDR cutoff had been used (i.e. 5%), six of these seven BTMs would not have been identified. Thus it is entirely possible that, at one month post-final RTS,S/AS01 dose, there is no circulating immune transcriptomic signature predictive of risk. Such a conclusion would not be surprising, given that in malaria-naïve adults, the transcriptional response to the third RTS,S/AS01 dose has been shown to peak at Day 1 post-injection, with some decline by Day 6 and approximately 90% of the response having waned by Day 21 (17). Therefore, it is likely that the sampling scheme in this study (one month post-final dose) misses the majority of the transcriptional response to RTS,S/AS01.”

      The take-home message put forward in the title/abstract (that a monocyte and DC related pre-vaccination signature predicts risk of clinical malaria in RTS,S vaccinated children) is not strongly supported by the data. It is based on blood transcriptional modules related to monocytes being picked out when comparing RTS,S vaccinated cases and controls.

      Thank you for giving us the opportunity to provide further rationale for our focus on the 7 monocyte-related and 4 DC-related BTMs shown in Figure 6B (MAL067 column) out of the 45 total BTMs whose baseline expression associated with clinical malaria risk in RTS,S/AS01-vaccinated children. The reviewer implies that these modules were chosen for focus somewhat randomly or without justification (or, even worse, “cherry picked”), which we would agree would be an imperfect method for drawing conclusions.

      First, we have always ensured to mention that the 45 baseline modules that correlated with risk in RTS,S recipients (Fig 6B, MAL067 column) belonged to many functional annotations, including DC cells and monocytes. (Abstract: “In contrast, baseline levels of BTMs associated with dendritic cells and with monocytes (among others) correlated with malaria risk”) (Main text, lines 519-522: “Compared to the results from the month 3 analysis (7 BTMs), the baseline correlates analysis of MAL067 revealed a larger number (45) of BTMs, spanning many functional categories, whose month 0 levels in vehicle-stimulated PBMC nearly all associated with clinical malaria risk in RTS,S/AS01 recipients .”

      The focus on DC cells and monocytes is due to two reasons: 1) the fact that the DC-related modules and the monocyte-related modules were some of the most significant correlations (lines 522-524: “The BTM with the most significant association with risk was “enriched in monocytes (II) (M11.0)” (FDR = 1.80E-14), followed by “inflammatory response (M33)” (FDR = 2.45E-07) and “resting dendritic cell surface signature (S10)” (FDR = 6.03E-07).”

      Second, the baseline association of DC- and monocyte-related modules appeared to generalize across populations: (Abstract: “A cross-study analysis supported generalizability of the baseline dendritic cell- and monocyte-related BTM correlations with malaria risk to healthy, malaria-naïve adults, suggesting that certain monocyte subsets may inhibit protective RTS,S/AS01-induced responses.”; Main text: “BTMs related to dendritic cells and to monocytes were most consistently associated with risk across these three studies [“resting dendritic cell surface signature (S10)”, “DC surface signature (S5)”, “enriched in dendritic cells (M168)”, “enriched in monocytes (I) (M4.15)”, “enriched in monocytes (II) (M11.0)”, “enriched in monocytes (IV) (M118.0)”, and “monocyte surface signature (S4)” significantly correlated with risk in all three studies].”

      The first two sentences of the Discussion (lines 577-580) explain our focus on monocytes and DCs:

      “Our main finding is the identification of a baseline blood transcriptional module (BTM) signature that associates with clinical malaria risk in RTS,S/AS01-vaccinated African children. In a cross-study comparison, much of this baseline risk signature – specifically, dendritic cell- and monocyte-related BTMs – was also recapitulated in two of the three CHMI studies in healthy, malaria-naïve adults.”

      Finally, we note that the title (“A baseline transcriptional signature associates with clinical malaria risk in RTS,S/AS01-vaccinated African children”) does not restrict to DC-related or monocyte-related BTMs, rather, we chose this title based on the larger number of BTMs, and higher correlations with risk, in the baseline analysis compared to the Month 3 analysis.

      We have revised all instances where we have communicated this less clearly, e.g. “for why we identified a baseline monocyte transcriptional signature of risk” has been changed to “for why we identified monocyte-related BTMs in our transcriptional signature of risk”.

      Many other modules are picked out as well e.g. cell cycle (Figure 6B). An in-depth analysis of the genes in these module and what their up and downregulation can tell us about their function is warranted to support the conclusions.

      Thank you for the suggestion to look at the cell cycle module in Figure 6B. You make a good point that this module is the only module to show a significant association with clinical malaria risk across all 4 of the RTS,S studies and should therefore be further examined. First, we have added this to the text:

      “Only one BTM, “cell cycle and transcription (M4.0)”, was significantly associated with risk across all four studies. Of the 335 genes in this module (M4.0), 130 were also present in one or more of the six “monocyte-related” BTMs shown in Figure 6B (297 genes total across all six BTMs), suggesting that the “cell cycle” and “monocyte” results may actually be picking up the same signal.”

      We have done the gene-level analysis as suggested, resulting in 8 new supplemental figures (Figure 6-figure supplements 1-8) and one new supplemental table (S5). We have also made the following revisions to the text:

      In Results: “To gain insight into specific module-member genes that may be involved in the RTS,S/AS01 baseline risk signature, we performed the same analysis on the gene level, i.e. examined associations with clinical malaria risk for each of the constituent genes in the 45 BTMs shown in Figure 6B. Figure 6-figure supplements 1-8 show the gene-level association results within the eight BTMs that were significantly associated with clinical malaria risk in MAL067 and at least two of the three CHMI studies, and had at least one gene in MAL067 that was significantly associated with risk (these eight correspond to M4.0, S10, S5, M168, M4.3, M11.0, M4.15, and S4). Within MAL067, 35 unique genes were shown to significantly associate with malaria risk (Supplementary Table 5); 9 of these genes (CCNF, MK167, KIF18A, NPL, RBM47, CFD, MAFB, IL13RA1, and CCR1) also had significant association with non-protection in one of the CHMI studies. Although no individual gene was significantly associated with risk across >2 studies, many showed consistent effect (direction and magnitude) across 3 studies. This further supports our choice to focus on modules instead of individual genes as GSEA increases power to detect more subtle but coordinated changes in gene expression data that would be missed otherwise. For this same reason, GSEA has been shown to enhance cross-study comparisons (45).”

      In Discussion: “Our gene-level correlates analyses suggest an alternative hypothesis, however. With the caveat that the gene-level analyses were performed post hoc, high baseline expression of STAB1 (which is present in DC-related, monocyte-related, and cell cycle-related modules) was found to positively associate with clinical malaria risk (Figure 6-figure supplements 1, 2, and 6). STAB1 encodes stabilin-1 (also called Clever-1), a transmembrane glycoprotein scavenger receptor that links extracellular signals to intracellular vesicle trafficking pathways (58). Interestingly, stabilin-1high monocytes show downregulation of proinflammatory genes, and T cells co-cultured with stabilin-1high monocytes showed decreased antigen recall, suggesting that monocyte stabilin-1 suppresses T cell activation (56). Thus one possibility is that stabilin-1high immunosuppressive monocytes circulating at baseline could decrease protective RTS,S-induced T-cell responses, or inhibit another aspect of adaptive immunity. Single-cell transcriptomic profiling of PBMC or purified monocyte subsets in future RTS,S trials in African children in malaria-endemic areas could help test this hypothesis.”

      Impact

      This paper will inform future studies looking for correlates of RTS,S induced protection from clinical malaria in a variety of ways:

      It validates the blood transcriptional module approach (as published by Li S, Rouphael N, Duraisingham S, Romero-Steiner S, Presnell S, Davis C, Schmidt DS, Johnson SE, Milton A, Rajam G, et al: Molecular signatures of antibody responses derived from a systems biology study of five human vaccines. Nat Immunol 2014) to find target cell populations which can then be investigated in much more detail.

      It shows that studying PBMC recall responses after peptide stimulation post final vaccination is not the way forward, since no response is detected (Figure 2). future studies can now take an alternative approach. e.g. since unstimulated PBMCs (vehicle control) from RTS,S vaccinated children were different from those who received a comparator vaccine (Figure 2) RTS,S vaccine signatures could be picked up much more easily by whole blood RNAseq.

      It implicates innate immune cells in shaping an individuals response to a vaccine - an exciting basis for future functional and mechanistic studies.

      We are glad the reviewer appreciates the value of the study.

      Reviewer #2 (Public Review):

      This paper reports a sub-study of the RTS,S/AS01 malaria vaccine Phase 3 trial, which aimed to identify groups of genes (blood transcriptional modules, BTMs) for which expression in DMSO or antigen-stimulated PBMCs was associated with clinical malaria during a 12-month follow-up period. Study subjects were infants and children who received either RTS,S/AS01 or comparator vaccines (meningococcal C for infants, rabies vaccine for children), enrolled in the study in Tanzania and Mozambique (with some additional analyses using samples from Gabonese infants).

      Using PBMCs collected at baseline before vaccination and 3 months later (a month after the third vaccine dose), stimulated with DMSO or parasite antigens, the authors used RNA-sequencing to identify BTMs which were different between recipients of RTS,S/AS01 vs comparator vaccines; which were different between baseline and month 3 in RTS,S/AS01 recipients; and which differed between RTS,S/AS01 recipients with a malaria episode and those without a malaria episode during the follow-up period. This combination of analyses might help to distinguish BTMs specifically associated with RTS,S/AS01 vaccine efficacy from those associated with other factors influencing susceptibility to malaria. To further aid mechanistic understanding the authors examined correlations between BTMs and measures of cellular and humoral immune responses. To try to establish generalisability the authors examined whether BTMs identified in African children were also associated with developing malaria in RTS,S/AS01-vaccinated malaria-naïve adults in the United States who underwent controlled human malaria infection (CHMI).

      Strengths of the study include:

      1) The relatively large number of subjects, the large amount of transcriptomic and immunological data which has been generated (and made publicly accessible), and the extensive analysis to evaluate associations between BTMs and numerous immunological variables.

      2) Clear explanation of both the rationale and methods for most of the analyses

      3) The attempt to validate findings in the CHMI studies

      4) Matching of subjects to try to eliminate the confounding effects of age, study site, and time of vaccination

      Weaknesses of the study include:

      1) Despite the relatively large size of the study, it is hard to know whether it had sufficient power to achieve its main objective, and we are not presented with data to demonstrate how successfully the authors managed to match subjects for age, timing of vaccination and follow-up duration

      We have added the following to our “limitations” paragraph in the Discussion: “Fourth, despite the relatively large size of the study, our statistical power was limited by the number of malaria cases with available samples; sampling additional controls would not have increased our statistical power.”

      Moreover, we now also provide the new Supplementary Table 1, which provides complete information on participant match ID, site, age cohort, sex assigned at birth, and time of vaccination.

      2) The comparator group to the RTS,S/AS01 vaccine is not a single vaccine, but two vaccines, but the presentation of the data makes it difficult to identify what effect this may have had on the results

      Indeed, comparators received a rabies vaccine or the meningococcal C conjugate depending on the age cohort. However, we think that the impact on the study results and conclusions is minimal since the main results are based on baseline gene expression and its association with malaria risk within RTS,S vaccinees. Correlates of malaria risk in comparators are done separately. Comparator vaccination may be a confounding factor for age cohort, but we are not analyzing the effect of age cohort on the transcriptional profile. Comparators are only included in the analysis of RTS,S immunogenicity at post-vaccination (RTS,S vs Comparators, Fig 2A, Comparison (1)) and we have adjusted analyses by age cohort and hence by comparator vaccine. The fact that the comparators received different control vaccines only stresses that the BTMs found to be associated with RTS,S vaccination are specific to the RTS,S vaccine.

      Moreover, as an alternative way to identify RTS,S-specific transcriptional responses, we also include Comparison (2), which compares Month 3 to Month 0 transcription levels within RTS,S vaccinees. We include in the text extensive discussion of the merits and drawbacks of each comparison:

      “Two comparisons were done to characterize the transcriptional response to RTS,S/AS01 vaccination: Comparison (1): comparing gene expression in month 3 samples from RTS,S/AS01 vs comparator recipients (month 3 RTS,S/AS01 vs comparator); and Comparison (2): comparing gene expression in month 3 vs month 0 from RTS,S/AS01 recipients (RTS,S/AS01 month 3 vs month 0). Each comparison has its own advantages: Comparison (1) allows the identification of RTS,S/AS01-specific responses while taking into account other environmental factors to which the children are exposed, such as malaria exposure (albeit malaria transmission intensity was low during the study at both sites). Moreover, the very young ages of the trial participants mean that RTS,S/AS01-induced changes may be confounded with normal developmental changes in participant immune systems, further underscoring the value of Comparison (1), as it does not involve comparison across two different time points. On the other side, an advantage of Comparison (2) is that it takes into consideration each participant’s intrinsic baseline gene expression. Comparison (1) uses data from both infants and children, whereas Comparison (2) can only yield insight into RTS,S/AS01 responses in children (as baseline samples were not collected from infants).”

      3) A very "liberal" false-discovery rate (FDR) threshold has been used throughout to define significant associations. An FDR of 0.2 indicates that 20% (or 1 in 5) results which are considered significant will be false-discoveries. This means that the "significant" results must be interpreted with a high degree of caution. Typically researchers use lower FDR thresholds, like 0.05 or 0.01, although one may argue for different thresholds under different circumstances

      While it is not uncommon to use a threshold of 20% for immune correlates studies [e.g. (5-10)], we agree with you that it is important to clearly state the chosen FDR rate and to discuss conclusions in the context of the FDR rate used. We see we could improve our manuscript in this respect. We have added the following:

      Results: “Compared to the 45 BTMs whose baseline levels significantly associated with clinical malaria risk in RTS,S/AS01-vaccinated African children, fewer BTMs (seven) had levels at one month post-final RTS,S/AS01 dose that significantly associated with clinical malaria risk. Moreover, if a more stringent FDR cutoff had been used (i.e. 5%), six of these seven BTMs would not have been identified. Thus it is entirely possible that, at one month post-final RTS,S/AS01 dose, there is no circulating immune transcriptomic signature predictive of risk…”

      Discussion: “Finally, while it is not uncommon to use an FDR cutoff of 20% in high-dimensional immune correlates studies [e.g. (65-70)], our results should be interpreted with the requisite level of caution. However, we do note that many of our significant modules in the baseline risk analysis would have survived even lower FDR cutoffs (in many cases even a 1% cutoff), giving us a fair degree of confidence in our results. For example, of the seven monocyte-related BTMs whose baseline levels associated with risk, all would have survived a 5% FDR cut-off, and three even a 1% cut-off; likewise, of the four dendritic cell-related BTMs whose baseline levels associated with risk, all would have survived a 5% FDR cut-off, and three even a 1% cut-off.”

      Moreover, we have revised Figures 2, 3, and 6 so that it is easy to discern whether a specific BTM correlation would also pass more stringent FDR cutoffs, through the addition of 1, 2, or 3 asterisks where appropriate: “|FDR| < 0.2 (), < 0.05 (), < 0.01 ().” Note that, most central to the key message of the paper, many of the monocyte-related, DC-related, and cell cycle-related BTMs would have passed more stringent FDR cutoffs, with many even passing a 1% FDR cutoff (as discussed above).

      4) A perplexing finding, which is not addressed in detail, is the large number of BTMs which differ between RTS,S and comparator vaccine groups after DMSO stimulation of PBMCs, but these are not seen when PBMCs are stimulated with parasite antigens in DMSO (and a similar finding for month 3 vs month 0 samples from RTS,S recipients). This raises some concern about the stimulation experiments, because one might expect that the DMSO vehicle in the antigen preparations would trigger a similar response to DMSO alone.

      It should be noted that in all our analyses, the stimulated results were adjusted for DMSO to focus on the antigen-specific response only. This would explain why we detect signal in the DMSO samples but not in response to stimulation. We have realized that this was not very well described in the figure captions and the Methods section and have added more details, including the model description in Methods section. As such, we do not believe that these results impact all downstream conclusions. We believe that the unstimulated results provide significant new insights into the immune and molecular mechanisms of RTS,S vaccine efficacy, not necessarily directly related to the RTS,S-specific acquired immune response. We would also like to highlight the fact that we have improved our model specification to directly account for the pairing of some of the samples using a random effect using the limma package. This has slightly increased statistical power, and as such the number of significantly differentially expressed BTMs in response to stimulation is a bit higher (but still much less than that for the DMSO). Originally, we had decided against the use of a random effect due to the computational cost of estimating the random effect.

      The fact that bulk transcriptional profiling of Ag-stimulated PBMCs did not identify almost any significant differences in BTM expression between the RTS,S vs. comparator group could be due to several factors. First of all, the frequency of antigen-specific CD4+ T cells was very low among CD4+ T cells (Figure 4 of the manuscript shows that CSP-specific CD4+ T cells comprise < 0.004% of all CD4+ T cells). This low frequency of CSP-specific T cells is consistent with other RTS,S studies [e.g. as we state on line 330, we have previously found that CSP-specific T-cells in RTS,S/AS01 vaccinees comprise < 0.10% of all CD4+ T cells (1)].

      Moreover, CD4+ T cells themselves comprise approximately 45-57% of all PBMCs (2). Thus, finding an expression signal between the RTS,S vs. comparator group would require the signal to be high enough to be detected in only 0.002% of all PBMCs [0.004% (% CSP-specific CD4+ T cells out of total CD4+ T cells) x 51% (average % of CD4+ T cells out of all PBMCs) = 0.002%]. Thus, lack of detectable recall response does not mean lack of recall response. Moreover, as suggested below, we opted not to focus the rest of the manuscript on the Ag-stimulation results.

      Second of all, the PBMCs were stimulated on site for 12h and then cryopreserved. This stimulation time was chosen based on the kinetics of IFN-g and IL-2 mRNA response (3), but other responses may have had different kinetics and thus have already resolved or have not yet occurred by the 12-h cryopreservation. We have added text in the manuscript to discuss these caveats (“Another potential reason for why no BTMs were found to associate with the response to RTS,S/AS01 vaccination or with protection when analyzing CSP-stimulated PBMC is that all PBMC were stimulated on site for 12 hours (this stimulation time was chosen based on the kinetics of the IFN-g transcriptional response) and then cryopreserved. Thus, we were unable to detect earlier transient responses that had already resolved by 12 hours, as well as more delayed response that had not yet initiated by 12 hours, if such responses occurred.”.

      The authors partly achieved their aims. They identified BTMs differentially expressed between RTS,S/AS01 and the comparator vaccines, and between baseline and month 3 in RTS,S/AS01 recipients. They also identified BTMs at month 3 associated with developing malaria, and BTMs at baseline associated with developing malaria. These latter BTMs were partly replicated in the CHMI study subjects. Higher expression of BTMs associated with monocytes and dendritic cells were most consistently identified across the different analyses and their expression in stimulated baseline samples was most consistently associated with development of clinical malaria in RTS,S/AS01 recipients. However there were inconsistencies in associations between some of the studies, and it is possible that the "consistent" monocyte and dendritic cell BTMs would not be so consistent if a more stringent FDR threshold was used. However the authors conclusions are largely quite measured and for the most part they do not over-interpret the significance of their findings.

      We have added the following to the Discussion: “Finally, while it is not uncommon to use an FDR cutoff of 20% in high-dimensional immune correlates studies [e.g. (65-70)], our results should be interpreted with the requisite level of caution. However, we do note that many of our significant modules in the baseline risk analysis would have survived even lower FDR cutoffs (in many cases even a 1% cutoff), giving us a fair degree of confidence in our results. For example, of the seven monocyte-related BTMs whose baseline levels associated with risk, all would have survived a 5% FDR cut-off, and three even a 1% cut-off; likewise, of the four dendritic cell-related BTMs whose baseline levels associated with risk, all would have survived a 5% FDR cut-off, and three even a 1% cut-off.”

      Overall the work provides some evidence that baseline immunological status, particularly related to monocyte and dendritic cell responses and possibly their role in or response to baseline inflammation, may be a determinant of how well the RTS,S vaccine works to prevent malaria. This provides a basis for further work to optimise the effectiveness of the vaccine. The usefulness of PBMC stimulation to predict an individual's response to vaccination will be limited because this is not a method which can be used at scale in resource limited settings, but the concept that vaccine response could be enhanced by modifying pre-vaccine immunological or inflammatory status is potentially important. The data published with this study will be a valuable resource and will undoubtedly be used by others to address similar questions. Increasing the efficacy of malaria vaccines remains an extremely important goal, and identifying possible mechanisms which restrict the efficacy of RTS,S is important.

      References:

      1. Moncunill G, De Rosa SC, Ayestaran A, Nhabomba AJ, Mpina M, Cohen KW, Jairoce C, Rutishauser T, Campo JJ, Harezlak J, Sanz H, Diez-Padrisa N, Williams NA, Morris D, Aponte JJ, Valim C, Daubenberger C, Dobano C, McElrath MJ. RTS,S/AS01E Malaria Vaccine Induces Memory and Polyfunctional T Cell Responses in a Pediatric African Phase III Trial. Front Immunol. 2017;8:1008.
      2. Kleiveland CR. Peripheral Blood Mononuclear Cells. In: Verhoeckx K, Cotter P, López-Expósito I, Kleiveland C, Lea T, Mackie A, et al., editors. The Impact of Food Bioactives on Health: in vitro and ex vivo models. Cham: Springer International Publishing; 2015. p. 161-7.
      3. Schultz-Thater E, Frey DM, Margelli D, Raafat N, Feder-Mengus C, Spagnoli GC, Zajac P. Whole blood assessment of antigen specific cellular immune response by real time quantitative PCR: a versatile monitoring and discovery tool. J Transl Med. 2008;6:58.
      4. Kazmin D, Nakaya HI, Lee EK, Johnson MJ, van der Most R, van den Berg RA, Ballou WR, Jongert E, Wille-Reece U, Ockenhouse C, Aderem A, Zak DE, Sadoff J, Hendriks J, Wrammert J, Ahmed R, Pulendran B. Systems analysis of protective immune responses to RTS,S malaria vaccination in humans. Proc Natl Acad Sci U S A. 2017;114(9):2425-30.
      5. Liu C, Martins AJ, Lau WW, Rachmaninoff N, Chen J, Imberti L, Mostaghimi D, Fink DL, Burbelo PD, Dobbs K, Delmonte OM, Bansal N, Failla L, Sottini A, Quiros-Roldan E, Han KL, Sellers BA, Cheung F, Sparks R, Chun TW, Moir S, Lionakis MS, Consortium NC, Clinicians C, Rossi C, Su HC, Kuhns DB, Cohen JI, Notarangelo LD, Tsang JS. Time-resolved systems immunology reveals a late juncture linked to fatal COVID-19. Cell. 2021;184(7):1836-57 e22.
      6. Andersen-Nissen E, Fiore-Gartland A, Ballweber Fleming L, Carpp LN, Naidoo AF, Harper MS, Voillet V, Grunenberg N, Laher F, Innes C, Bekker LG, Kublin JG, Huang Y, Ferrari G, Tomaras GD, Gray G, Gilbert PB, McElrath MJ. Innate immune signatures to a partially-efficacious HIV vaccine predict correlates of HIV-1 infection risk. PLoS Pathog. 2021;17(3):e1009363.
      7. Lu P, Guerin DJ, Lin S, Chaudhury S, Ackerman ME, Bolton DL, Wallqvist A. Immunoprofiling Correlates of Protection Against SHIV Infection in Adjuvanted HIV-1 Pox-Protein Vaccinated Rhesus Macaques. Front Immunol. 2021;12:625030.
      8. Haynes BF, Gilbert PB, McElrath MJ, Zolla-Pazner S, Tomaras GD, Alam SM, Evans DT, Montefiori DC, Karnasuta C, Sutthent R, Liao HX, DeVico AL, Lewis GK, Williams C, Pinter A, Fong Y, Janes H, DeCamp A, Huang Y, Rao M, Billings E, Karasavvas N, Robb ML, Ngauy V, de Souza MS, Paris R, Ferrari G, Bailer RT, Soderberg KA, Andrews C, Berman PW, Frahm N, De Rosa SC, Alpert MD, Yates NL, Shen X, Koup RA, Pitisuttithum P, Kaewkungwal J, Nitayaphan S, Rerks-Ngarm S, Michael NL, Kim JH. Immune-correlates analysis of an HIV-1 vaccine efficacy trial. N Engl J Med. 2012;366(14):1275-86.
      9. Fletcher HA, Snowden MA, Landry B, Rida W, Satti I, Harris SA, Matsumiya M, Tanner R, O'Shea MK, Dheenadhayalan V, Bogardus L, Stockdale L, Marsay L, Chomka A, Harrington-Kandt R, Manjaly-Thomas ZR, Naranbhai V, Stylianou E, Darboe F, Penn-Nicholson A, Nemes E, Hatherill M, Hussey G, Mahomed H, Tameris M, McClain JB, Evans TG, Hanekom WA, Scriba TJ, McShane H. T-cell activation is an immune correlate of risk in BCG vaccinated infants. Nat Commun. 2016;7:11290.
      10. Young WC, Carpp LN, Chaudhury S, Regules JA, Bergmann-Leitner ES, Ockenhouse C, Wille-Reece U, deCamp AC, Hughes E, Mahoney C, Pallikkuth S, Pahwa S, Dennison SM, Mudrak SV, Alam SM, Seaton KE, Spreng RL, Fallon J, Michell A, Ulloa-Montoya F, Coccia M, Jongert E, Alter G, Tomaras GD, Gottardo R. Comprehensive Data Integration Approach to Assess Immune Responses and Correlates of RTS,S/AS01-Mediated Protection From Malaria Infection in Controlled Human Malaria Infection Trials. Front Big Data. 2021;4:672460.
    1. Karnofsky suggests that the cost/benefit ratio of how we typically think of reading may not be as simple as we intuitively expect i.e. we think that 'more time' = 'more understanding'.

      If you're simply reading to inform yourself about a topic, it may be worth reading a couple of book reviews, and listening to an interview or two, rather than invest the significant amount of time necessary to really engage with the book.

      A few hours of skimming and reviews/interviews may get you to 25% understanding and retention, which in many cases may be more than enough for your needs of being basically informed on the topic. Compared to the 50 - 100 hours necessary for a deep, analytical engagement with the text, that would only get you to 50% understanding and retention.

      That being said, if your goal is to develop expertise, both Karnofsky and Adler ('How to read a book') suggest that you need a deep engagement with multiple texts.

    1. Author Response:

      Reviewer #1 (Public Review):

      The study aims to investigate the role of A11 neurons in courtship behavior and vocalizations. In particular, the authors determine the inputs/outpus of A11 neurons and uncover that the outputs are both dopamine and glutamate positive. They then lesion A11 cell bodies and terminals in the songbird song-motor nucleus HVC and find that these lesions affect song production, especially, though not exclusively, of courtship song. They also measure the location and movement of lesioned birds and find that birds with lesions of A11 cell bodies show less engagement with a female. Finally, they use fiber photometry to study the activity of A11 terminals in HVC during singing. While this is an interesting question supported by novel data, and I appreciate the diverse and creative approaches employed in this study, the role of A11 in courtship behavior appears complicated and does not easily fit into the framework proposed by the authors. In particular, the authors argue that A11 is important for coordinating innate and learned aspects of courtship, however, their data fall short of supporting this idea.

      Strengths This is an impressive data set with considerable attention to detail.

      The tracing and histology data identify some novel connections not previously described in songbirds as well as the potential of A11 neurons to co-release of glutamate and dopamine.

      Photometry provides real-time monitoring of A11 and HVC neuron activity during singing.

      In principle, targeting both HVC terminals and A11 cell bodies has the potential to lend insight into the role of HVC terminals vs. the role of projections to other areas (see below for caveats).

      We appreciate the reviewer’s efforts and attention in evaluating our manuscript. We are grateful that the reviewer recognizes strengths in our study, which we agree provides novel insights into the brain circuits that enable a fully integrated courtship display comprising learned and innate behaviors.

      Weaknesses 1) While I find the overall question and the data interesting, I am not convinced that they demonstrate that A11 is important for "coordinating innate and learned aspects of courtship". In general, birds with A11 lesions appear less motivated to perform female-directed song, however, it's not clear that this is a consequence of a lack of coordination between innate/learned aspects of behavior. Rather, perhaps A11 neurons are important to instigate or drive courtship behavior, or to relay signals from the POA or other regions important for courtship. Because the lesions abolish behavior, it is difficult to discern the role of these neurons in courtship.

      We agree that discerning the precise role of A11 is tricky. It could be acting to gate a (motivational) drive from another source, providing a primary source of this drive, and/or performing a more intricate role in coordinating the various aspects of the courtship display. The reviewer is correct that the current experiments do not allow us to clearly distinguish between these possibilities, and we have revised the manuscript accordingly, first by replacing “coordinate” with “gate” in the title and introduction and including a more thorough treatment of gating and other possible roles for A11 in lines 258-262 of the discussion. That said, we lean towards the latter possibility - a coordinating role for A11 - because of its location immediately proximal to regions that drive learned (HVC) and innate (ICo, RPgc) aspects of behavior and because A11 neurons can contain synthetic enzymes for a fast acting neurotransmitter (glutamate) in addition to DA. But, ultimately, we acknowledge that future experiments are needed to more completely answer this question.

      In addition, I disagree with the innate vs. learned distinction as recent data indicate that introductory notes, which the authors treat as innate, are actually learned (e.g. Kalra et al., 2021). Further, there is also no quantification of the effects of lesions on female-directed calls and little analysis of the activity during call production. This would seem to further complicate the overall interpretation. Overall, it's difficult to make sense of how A11 activity relates to vocalizations, especially given the innate/learned framework that they focus on.

      We thank the reviewers for drawing our attention to the recent Kalra 2021 paper, which we now cite while also making sure to emphasize that introductory notes may have learned features (lines 194-195 and 278-279). However, even that recent study concluded that males raised without a tutor or tutored on recorded songs that lack introductory notes altogether still developed songs that include introductory notes. Nonetheless, we include citation of this recent study and qualify our characterization of introductory notes as being shaped by innate predispositions and experience. Furthermore, we conducted additional analyses to quantify female-directed calling before and after 6-OHDA lesions in either HVC or A11 (results can be found in lines 164-165 and Figure 4C). In line with the divergent effects of these two types of lesions on the production of introductory notes, lesions in HVC did not affect female-directed calling whereas lesions in A11 largely abolished these vocalizations. While we acknowledge that the fiber photometry data on female-directed calling was limited, it nonetheless reinforces the conclusion that A11 transmits information to HVC about innate vocalizations, and it also transmits information to HVC about introductory notes. Along with the loss of introductory note production following A11 lesions, we do believe that our findings support the idea that A11’s role is essential to female-directed vocalizations generally, regardless of whether they are learned or innate, and of of somehow enabling the transition from production of female-directed calling and introductory notes to motifs. We have done our best to draw out these points in the revised discussion.

      2) The HVC lesions appear to create damage/necrosis (Fig 3-suppl 2) and this raises the question of the degree to which the HVC lesion effects are the result of dopamine/glutamate depletion or local damage. In particular, it is surprising that syllable structure and stereotypy show such a dramatic breakdown with HVC A11 input lesions and effectively no change with lesions of the cell bodies, even though both treatments lead to effectively similar reductions in song production.

      We appreciate that 6-OHDA lesions are not highly specific and can introduce unwanted effects on non-TH+ cells and processes. To further quantify the effects of 6-OHDA lesions on HVC cells, we conducted additional 6-OHDA injections in HVC and TUNEL staining studies in addition to the preliminary efforts we had made in the original manuscript. Quantification of these data confirmed our original impression that 6-OHDA treatment in HVC increased HVC cell death (these data are shown in Figure 3-figure supplement 2J, K). To further address this issue, we also added an analysis of song structure when D1 receptor blockers were dialyzed into HVC. No changes in song morphology were detected, similar to the lack of effects on song morphology following A11 cell body lesions (Figure 3 - figure supplement 3). Taken together, these additional experiments and analyses indicate that the changes in song morphology following 6-OHDA treatment in HVC may arise from local damage to HVC cell bodies. In contrast, the reduction in singing following A11 terminal or cell body lesions is likely to reflect diminished DA signaling from A11. However, as the reviewer notes, our primary finding is the differential effects on female-directed singing, and the distinction between more purely singing-related effects following 6-OHDA treatment in HVC and a broad effect on all courtship behaviors following 6-OHDA treatment in A11.

      3) If the idea is that A11 is important for coordinating innate and learned movements, it seems that a detailed analysis of the movements would be important. As is, the movement data provide further support of a decrease in either the motivation or ability to perform female-directed song, but they do not speak to a more specific role for A11 in coordinating innate and learned movements.

      We maintain that we did provide a detailed analysis of a number of important nonsong behaviors, including changes in head orientation and translational movements that the male makes towards the female, both of which are major appetitive features of courtship in songbirds and other vertebrates. We also appreciate that these analyses do not allow us to say much about precisely how movements are being coordinated during courtship, and we have changed language throughout the manuscript to emphasize a gating rather than coordinating role for A11. Furthermore, in response to the reviewer’s concern, we performed additional analyses of the male’s movements during courtship, including beak wipes, vertical changes in posture (“standing tall”), which are finer components of female-directed displays. Notably, this new analysis reveals that all of these behavioral components are abolished by A11 cell body lesions, but not by A11 terminal lesions in HVC (lines 168-190 and Figure 4I, J). We appreciate the reviewer’s suggestion, as we believe these additional analyses strengthen our core finding, namely that A11 functions as a hub to gate, recruit and possibly coordinate innate and learned movements to generate a complete courtship display. These different roles are more fully considered in the revised discussion (lines 256-262).

      Reviewer #2 (Public Review):

      Ben-Tov et al. investigate function of midbrain region A11 and provide evidence that it plays a role in promoting and coordinating a variety of motor responses to sexually or socially salient stimuli. They show lesions of A11 cell bodies abolish female directed calling, orienting and singing, while lesions of terminals in the song premotor nucleus HVC prevent female directed singing, but leave female directed calling and orienting intact. Together with anatomical data indicating projections from A11 to multiple downstream targets associated with song (HVC), calling (DM/ICO) and locomotion, these data support the authors' idea that A11 forms a 'hub' that drives and 'coordinates' multiple different aspects of behavioral responses to social (here female/sexual) stimuli. The results are intriguing and begin to reveal how a single social context can elicit and coordinate multiple coordinated responses. However, as outlined below, I think that some of the specific stronger claims would benefit from additional data, discussion or moderation.

      The authors also provide compelling support for the idea that A11 plays a differential role in female-directed versus undirected song. This is especially underpinned by the observations that 1) A11 afferent activity in HVC appears to differ between directed and undirected signing, with increases in activity preceding song motifs only during directed song, and 2) lesions of A11 cell bodies or inputs to HVC have a dramatic suppressive effect on directed singing, but can leave undirected song largely unchanged. These observations that A11 differentially contributes to socially elicited versus spontaneous singing seem especially interesting and merit further highlighting and discussion as one of the especially striking aspects of the study that seems distinct from the thesis of a role in coordinating learned and unlearned behaviors.

      We appreciate the reviewer’s efforts and attention in evaluating our manuscript. We are grateful that the reviewer recognizes strengths in our study, which we agree provides novel insights into the differential contribution of A11 to socially elicited versus spontaneous singing. We also agree that this point should be highlighted and we expanded our treatment of this point in the discussion section of the revised manuscript (lines 296-309).

      Specific comments

      A central idea around which the results are discussed is that A11 plays a particular role in coordinating learned versus innate behaviors. I have several questions around this thesis where further guidance from the authors about both technical points and interpretation would be helpful.

      First is the question of how specific are the manipulations and conclusions to A11 itself versus other neighboring midbrain dopaminergic regions within which it is embedded. The authors show histology of lesions, injection sites and retrograde labelling in supplementary figures, but do not provide enough guidance for me to understand the strength of the argument that manipulations are restricted to A11 and/or its afferents. Can the boundaries between A11 and neighboring regions be better demarcated? What are the neighboring regions to which there might have been spillover? For lesions of A11 axons within HVC, wouldn't 6-OHDA also damage any other dopaminergic afferent to HVC, including those coming from regions such as VTA? Some discussion of these and related points regarding the specificity of manipulations to A11 would be helpful, especially in light of the literature that points to potential roles of neighboring dopaminergic regions in contributing to motivated behaviors and song more specifically.

      We appreciate that the definition and boundaries of A11 might be confusing. We demarcated A11 and neighboring regions in the relevant figures to better define A11’s boundaries. The reviewer is correct in surmising that the VTA is fairly close to A11 and hence a reasonable concern is that 6-OHDA treatment in A11 could spill over to the VTA and possibly the SNc. To address the concern that 6-OHDA lesions in HVC might cause cell damage to other DA sources to HVC, we quantified the number of VTA/SNc cells following HVC DA lesions. This additional analysis, provided in Figure 3-figure supplement 1D-F, shows that the number of VTA/SNc cells following 6-OHDA injections into either A11 or HVC is comparable to that of intact birds. These additional analyses support the conclusion that the behavioral deficits that emerge following 6-OHDA treatments reflect damage to A11 or A11 terminals in HVC.

      These points also relate to the general question of what is meant by A11 being a 'hub for coordination of learned and innate courtship behaviors'. Ultimately, it seems likely that many regions must work together to orchestrate these behaviors, and it is not clear from the present results how much I should view A11 as having a more specific role than other neighboring dopaminergic regions (or hypothalamic regions such as POA) that are interconnected and seem likely to also play critical roles. As the authors note, many of the relevant structures, including A11 and song system structures, are recurrently connected, further complicating interpretation of any one area as a hub. In this respect, I am not sure how much the authors are intending to argue that A11 is both necessary and sufficient for driving each of the studied behaviors in a courtship context, and it would be helpful to discuss this more specifically - does 'coordination' as used here imply that A11 is capable of triggering these behaviors - an interesting possibility raised by the current results but that does not yet seem to be demonstrated - or something else?

      As we noted in our response to a similar point made by the first reviewer, we agree that discerning the precise role of A11 is tricky. As we commented in that earlier response, A11 could gating a (motivational) drive from another source, providing a primary source of this drive, and/or performing a more intricate role in coordinating the various aspects of the courtship display. We agree that the current experiments do not allow us to make a clear distinction between these possibilities, and we have revised the manuscript accordingly, including a more thorough treatment of these various roles for A11 in the discussion (lines 256-262). That said, we lean towards the latter possibility - a coordinating role for A11 - because of its location immediately proximal to regions that drive learned (HVC) and innate (ICo, RPgc) aspects of behavior and because A11 neurons can release a fast acting neurotransmitter (glutamate) in addition to DA. But, ultimately, we acknowledge that future experiments are needed to more completely answer this question. In the revised manuscript, we emphasize a gating role for A11 in the title and introduction, and then in the discussion expand to encompass the possibility of a coordinating or timing role for A11.

      One additional question regarding the framework for interpreting the function of A11 as coordinating 'learned and innate' courtship behaviors, is for some further clarification and citations regarding what is learned versus innate, especially as it relates to song. The authors characterize introductory notes as 'innate', but previous work from Rajan and colleagues has demonstrated that aspects of introductory notes including acoustic structure and patterning are influenced by learning, and I am not sure what the literature says about orienting and calling to females.

      We thank the reviewer for drawing our attention to this recent study from the Rajan group which indeed concluded that some aspects of the introductory notes are learned. We also note that this study showed that juvenile males tutored on song playbacks that lacked introductory notes or that were raised without a tutor still produced introductory notes. Nonetheless, we include a citation of this recent study and qualify our characterization of introductory notes as being shaped by innate predispositions as well as through experience and learning (lines 194-195 and 278-279). Furthermore, our original analyses of birds with 6-OHDA treatment in HVC revealed that introductory note morphology was unchanged, whereas syllable morphology was degraded. Therefore, even if certain features of introductory notes are influenced by tutor experience, they apparently do not depend on HVC in the same manner as do the learned syllables in the motif. Lastly, we conducted additional analyses to quantify female-directed calling and other movements, before and after 6-OHDA lesions in either HVC or A11. In line with the divergent effects of 6-OHDA treatment in these two regions on the production of introductory notes, lesions in HVC did not affect female-directed calling, beak wipes or changes in male’s posture, whereas lesions in A11 largely abolished all of these behaviors (Figure 4C, I, J). While we agree with the reviewer that a distinction between innate and learned behaviors may not be straightforward, the more fundamental observation is that we can dissociate different aspects of the courtship display and that A11 is situated in a position to drive, gate or coordinate a unified display that involves a variety of learned and innate vocal and non-vocal movements.

      I also would find it helpful to have some further clarification in this context about what it means to coordinate learned and innate aspects of song. The authors indicate that undirected song is largely unaffected by A11 lesions while directed song is largely eliminated, leaving only innate calls or introductory notes. I think it would be helpful to see here a more complete characterization of the nature of vocalizations that remain following A11 lesions in the female directed context. While I understand that no recognizable 'learned motifs' are produced, it is unclear from the example that is shown how much the residual vocalizations should be construed as 'severely disrupted songs' versus strings of calls that resemble innate calls that were present prior to lesions, versus 'normal' patterns of introductory notes that resemble in acoustic structure what the birds produced prior to lesions, but that never proceed to song motifs, etc. A better understanding of the nature of these residual vocalizations might also help to interpret what A11 is doing. Do these birds seem motivated to 'sing' in terms of their posture? Do the authors think that HVC is engaged or that the same residual vocalizations would be produced in a bird that had HVC lesions? How do the authors interpret these data in terms of how learned and unlearned vocalizations are normally coordinated in the context of directed singing?

      We performed additional analyses of the male’s vocalizations and movements during courtship, including female-directed calls, beak wipes, vertical changes in posture (“standing tall”), all of which are components of female-directed courtship displays. Notably, this new analysis reveals that all of these behavioral components are abolished by A11 cell body lesions, but not by A11 terminal lesions in HVC (lines 168-190 and Figure 4C, I, J). Along with our prior report that males with A11 cell body lesions do not sing female-directed motifs, the additional analysis indicates that these males produce little or no female-directed vocalizations or non-vocal behaviors of any kind.

      We previously reported that males with A11 terminal lesions produced only introductory notes but not motifs but realize that this observation would benefit from more quantification. As noted in the previous response, we previously established that introductory note morphology was unchanged by 6-OHDA treatment in HVC (Figure 4 - figure supplement 1A-D). To extend this analysis further in this revised manuscript, we built on the observation that males with 6-OHDA treatment in HVC produce only introductory notes to females, with no song motif, whereas they produce a series of introductory notes followed by motifs comprising distorted syllables when alone (Figure 3K, Figure 3 - figure supplement 2, Figure 4B). To confirm that the directed introductory notes and undirected syllables were indeed distinct vocalizations, we computed their durations and spectral similarity scores (using Sound Analysis Pro). The introductory notes produced during directed conditions differed markedly in their durations from distorted syllables produced during undirected conditions, and these two types of vocalizations had very low similarity scores, indicating that they were cleanly separable vocal behaviors (Figure 4 - figure supplement E, F). Given that introductory notes are unchanged by 6-OHDA treatment in HVC, these analyses support the idea that males treated in this manner can still produce motifs, albeit distorted ones, when alone but not when in the company of a female.

      These questions relate in part to that of how much is the trigger to sing eliminated by A11 afferent lesions versus the ability to produce the relevant song output? It seems like there may still be a trigger to sing - short latency vocal response to female - but inability to produce motif. One point that may be interesting to note in this regard is that this seems somewhat opposite of observations made in other contexts about the effects of directed versus undirected context on song - for example, juveniles can produce better song when it is directed (Kojima), and deafened birds that are beginning to exhibit song deterioration can exhibit normalization of song structure during directed conditions (Nordeen).

      We agree with the reviewer’s point that birds with 6-OHDA lesions in HVC may still be triggered to sing, but are unable to produce a motif, given that they still produce introductory notes and seem to have the right posture, orientation and proximity to the female. We appreciate the reviewer’s comment regarding changes in song that can be elicited by females in either juvenile males or adult males that are deaf, although these additional contexts fall outside of the current study, which focused on adult male finches with normal hearing.

      Reviewer #3 (Public Review):

      The authors use a combination of quantitative acoustic and other behavioral analyses to evaluate the role of the midbrain dopaminergic area A11 in the production of female-directed song in adult male zebra finches. They show that female-directed courtship displays, which consist of song and the production of female-directed displacement behaviors, are dependent on A11 because targeted chemical lesions of this structure, using 6-hydroxydopamine (6-OHDA), permanently (i.e. for at least several months) eliminate both the vocal and non-vocal elements of this behavior. Destruction of A11 axons that directly target HVC, by administering 6-OHDA into HVC, only eliminates female-directed singing without causing any change in the other observed female-directed behaviors. Because these same lesions only temporarily (5-10 days) abolish undirected song, these findings suggest that A11 is not directly involved in song production but acts instead as a gate for the production of female directed courtship behaviors. The authors follow these lesion studies with fiber photometry-based calcium imaging of A11 axons that target HVC to show that A11 activation patterns precede activity in HVC during female-directed singing and that calcium elevation is primarily elevated during the production of the many introductory notes (a component of song that is primarily observed during female-directed singing) that precede the production of the learned song motif. These findings suggest that A11 inputs to HVC likely play a role in triggering and/or activating HVC to synchronize the production of introductory notes (which are likely produced by midbrain circuits) with the learned song component that immediately follows them. In contrast, activation of A11 axons during undirected song (which contain few to no introductory notes) do not precede HVC activation patterns. Consistent with the rapid transmission of A11 neurons, the authors also confirm, as has been suggested for A11 in mammals, that A11 dopaminergic neurons co-release glutamate.

      The findings of this study are of significant interest to our understanding of the neural mechanisms by which these complex behaviors are synchronized and open up a new way of thinking about how learned behavioral motifs can be synchronized with non-learned (e.g. female displacement behavior) behaviors. The study is rigorous, with many different experimental approaches being used to examine the proposed hypotheses, and the findings are convincing. Particularly impressive is the complete elimination of female-directed courtship behaviors following targeted elimination of A11. The primary weaknesses of the manuscript lie (1) in the way they present their anatomical findings and (2) how the authors discuss their findings in the discussion. In the discussion, which is very short (~750 words), the authors miss the opportunity to draw parallels with similar studies in drosophila (they only provide a cursory statement with a few references). In the discussion, the authors propose a model that seems quite oversimplified and lacks, in fact, many of the anatomical connectivity that they show in the first part of their study (for example A11 is only shown having a unidirectional connection to ICo/DM when in fact the connections are bidirectional). The model is also presented in simple hierarchical fashion with many connections omitted. Perhaps these omissions were made to simplify the model but in my opinion such simplification possibly misrepresents the actual mechanisms involved in the coordinated control of courtship song.

      We thank the reviewer for their careful reading of the manuscript and his supportive and constructive comments. We agree that the loss of all female-directed behaviors (which we now extend to female-directed calling and other non-vocal behaviors, such as beakwipes and postural changes) following A11 cell body lesions is especially intriguing. Further, the different effects of A11 cell body lesions and A11 terminal lesions in HVC, along with the connectivity of A11, indicate that A11 acts via a range of downstream sites to gate these various female-directed behaviors. We have done our best to address the two primary weaknesses identified by the reviewer. First, we have done our best to provide a more detailed accounting of the anatomical findings. Second, we have expanded the discussion to address parallels with other studies, as in the fly, and to provide a more nuanced and complete consideration of how A11 may function to facilitate male courtship behaviors.

    1. Author Response:

      Public Review:

      This manuscript from Pacheco-Moreno et al. compares the microbiome of potato fields with and without irrigation. Irrigation is known to control potato scab caused by Streptomyces scabies and the authors hypothesized that changes in the microbiome may contribute to disease suppression after irrigation. Using 16S rRNA sequencing, they identified a number of taxa, including Pseudomonas that are enriched after irrigation. They went on to isolate and sequence the genomes of many Pseudomonas strains. By correlating the ability of Pseudomonas to suppress Streptomyces growth in vitro with genomic data, the authors identified a novel group of cyclic lipopeptides (CLPs) that can inhibit Streptomyces in vitro and in planta.

      This work provides a substantial contribution that advances our understanding of disease suppressive soil mechanisms. It is novel in scope in that it focuses on suppression of a bacterial pathogen, while many prior studies focus on suppression of fungal pathogens. Additionally, the large-scaled comparative genomics is a useful resource, and the identification of CLPs that inhibit Streptomyces is novel. Importantly, the authors provide in planta data to show role a for CLPs in disease suppression in vivo. The manuscript is well written and the data are well presented. The analyses are quite thorough and I appreciate the extensive use of genetics and metabolomics to support the genomic predictions. The main weakness is a lack of data the conclusively links the change in microbiome function to disease suppression after irrigation in the field. However, I think the data they've presented, combined with those in the drought literature, might suggest that an increase in total Pseudomonas (and the corresponding disease-suppressive genes) in well-watered soil might contribute to suppression, rather than a change in function of Pseudomonas.

      While the reviewer is correct that we cannot conclusively link disease suppression to a change in microbiome function after irrigation, we are confident that our results demonstrate a real and repeatable phenomenon that must be considered in future studies of soil scab suppression. Independent field experiments conducted two years apart both show a decrease in the proportion of suppressive pseudomonads associated with potato roots. The first experiment (Figures 1 & 2) contained too few sequenced isolates to draw statistically robust conclusions, therefore we designed the second experiment (Figure 8) to investigate this phenomenon further. This experiment showed highly significant differences in the proportion of suppressive isolates on irrigated and non-irrigated roots. The alternative hypothesis presented by the reviewer; that relative Pseudomonas and Streptomyces abundance are affected by irrigation and this may be a factor in scab suppression, is also a valid possibility, although relatively small abundance changes were observed in the data reported in Figure 1. We have amended the discussion to include this as an alternative explanation for our results.

    1. Author Response:

      Reviewer #1:

      For this manuscript, I focused on the metabolite analysis. The data is presented as supporting a common response based on shared selective histories if I'm understanding properly. However, primary metabolite data is hard to interpret in the same fashion as genetic data. This arises because of the high degree of pleiotropy wherein it is very hard to find a mutant or variant that doesn't alter primary metabolism. As such, it is possible that there is a common response less because of shared history and more because there is constraining selection that shapes what is the optimal primary metabolite response to cold in photosynthetic organisms. For example, in Arabidopsis, it has been found that accessions tend to have a highly similar primary metabolism but when they are crossed, the progeny have a vastly wider array of primary metabolism phenotypes, suggesting that the similarity in accessions is not shared genetics but constraining selection that forces compensatory variants. I don't think this detracts from the utility of including the primary metabolism but it would help to have more clarity in the strengths and weaknesses in using metabolite data to track theories and arguments that are largely genetic based.

      We fully agree with the reviewer. The idea of constraining selection is at least as interesting as our explanation, and should be in the forefront. Given this interesting idea of compensatory mutations that are private to each accession (or ‘lineage’ or ‘line’), in principle this idea also hints towards the parallel/convergent evolution (‘constraining selection’ in the reviewer’s words) of this important trait or trait complex. We re-phrased this within the manuscript and considered this comment seriously throughout. We also incorporate into our manuscript this interesting compensatory variant notion and metabolic network pleiotropy.

      One difference we would like to highlight still is that in our study (compared to Arabidopsis thaliana studies) we are comparing across many different species, ploidal levels, and varying species-level evolutionary histories. This makes our experiment different from Arabidopsis thaliana ecotype experiments and crossings; but indeed the reviewer is fully right that our results may also follow a similar evolutionary path as for Arabidopsis thaliana.

      Reviewer #2:

      Cochlearia, and other species that have rapidly evolved new ecological niches, represent excellent systems to study adaptation to past, present, future and changing environments. Furthermore, reticulate evolution within such groups offers a natural experiment to test hypotheses about the roles of hybridization, introgression, etc. on evolutionary dynamics, including pre-adaptation. However, there are also several significant challenges to using such systems, most crucially separating adaptation as the causal mechanism from the wide array of non-adaptive processes that could also cause the observed patterns. Overall, Wolf and colleagues do a nice job describing this complex taxonomic system and provide multiple lines of inquiry into how observed patterns may align with various adaptive scenarios. Despite the strong descriptive framework, I had trouble understanding exactly how causality could be assigned. Thus, the interpretation and discussion of the results felt speculative.

      Thank you for the encouraging comments. Yes, we agree: the points towards an important aspect of this kind of phylogenetic-systematic-evolutionary research, namely demonstrating causality. Honestly speaking, in such studies we are not able to show causality in its strict sense, and we think that the reviewer wants to claim this without using quite so strong wording. We considered this while re-phrasing respective paragraphs and also town down some speculative conclusion.

      Reviewer #3:

      There has been intense interest in how plants have responded during periods of rapid climate change in the past. Understanding these responses can increase our understanding of how plants might respond to rapidly accelerating anthropogenic climate shifts and help set conservation priorities. Many paleoecological studies have provided insight on how plants have migrated and persisted in suitable climate refugia (i.e. pockets of suitable habitat that exist even if regional climate is unfavorable for the persistence of a species) throughout glacial cycles, however there has been considerably less work that details the evolutionary dynamics of plants during these periods. This piece provides timely and valuable analyses illustrating the potential influence of pronounced climate change on the evolutionary dynamics of the genus Cochlearia.

      Thank you for the encouraging comments.

      The authors' use of cytogenetic analyses, organellar phylogenies, and demographic modeling allows for insights into the geographic patterns of diversity, speciation rates, and postglacial expansion scenarios of Cochlearia. Drawing unique conclusions from these different lines of evidence provides new understandings into the putative role of Pleistocene glacial cycles in driving evolutionary processes such as speciation. The study also aims to provide insight into the origins of the stated putative cold tolerance exhibited by Cochlearia by using a metabolomics approach; however, the framing and use of a single related outgroup (sister genus Ionopsidium) obfuscate the link between the results and stated conclusions.

      We appreciate this point, but indeed there is no other outgroup to be used. In this study we included all (both) genera with most of its species of tribe Cochlearieae. Within a family- wide phylogenetic context this tribe is placed along a polytomy (together with not well resolved other tribes) and stem group age of Cochlearieae is of appr. 18.9 million years ago (Walden et al., 2020). Therefore, for our research question additional outgroups from other tribes will not contribute any further information, because more basal splits are then nearly 20 million years ago (Early Miocene) with no biogeographic and environmentally defined scenarios that can be compared. 16-23 million years ago most tribes of evolutionary lineage II underwent an early radiation with highest net diversification rates (Walden et al. 2020) during this time. We included some of this information into the introduction.

      Specifically, regarding the approach that resulted in figure 4 which encompassed the metabolomics and related analyses, the initial climate groupings into 'climate ecotypes' would benefit from clarification and consideration of assignment methods. Typically, using the term ecotype invokes the idea of distinct forms of a species with phenotypic differences adapted to local conditions rather than groupings to those under broad climate regimes. While grouping populations according to climate origin can be useful, it is not clear how the final 9 WorldClim bioclimatic variables were selected (e.g. it is not apparent how importance of or correlations between climate variables, etc. were considered). Consequently, knowing this information would help understand the patterns in figure 4b, which seems to indicate that geographically distant populations experience very similar climate conditions (understanding that similarities can exist but variable selection can greatly influence these patterns).

      Thanks for this reminder to explaining selection and analyses of BioClim variables.

      As for the term “ecotype”: In plant taxonomy ecotypes are often referred to on subspecies level, in particular if environmental conditions are extremely different (e.g. heavy metal contaminated versus not-contaminated soils) and often these subspecies do not significantly differ in morphology (Noccaea caerulescens, Minuartia verna). In Cochlearia morphology is at best a morphospace which is more or less shared between all species in different ways. Species definition and taxonomy is based on a combination of largely overlapping morphospace, cytotype, ecotype and habitat types (bedrock; arctic, lowland to alpine; soil type and salt, life cycle) and distribution – often sole morphology is a bad species predictor (morphologically cryptic species – this is well-known also for some other arctic species such from the genus Draba). But the reviewer is fully right, that using the term ecotype here is somehow misleading. Our idea was to highlight that groups of taxa are combined by bioclimatic variables (and biomes or habitat types) while spanning the entire species/ecotype space of the genus – and this grouping follows also evolutionary meaningful cluster. We clarified this.

      As for selection of BioClim variables: we agree, indeed selection might have appeared arbitrary to the reader. Our original selection followed our field and cultivation experiences. However, structuring into four clusters as originally shown with the first submission is robust also when including all 19 BioClim variables. The same four cluster are retained in PCA, when temperature related BioClim variables are used only.

      Therefore, we added a Principal Component Analysis as starting point for Bioclim variable selection, secondly we added a PCA using temperature related BioClim variables 1-11 only. Built upon this we added a sentence why our nine selected variables were used to highlight the four groups in Fig. 4. The two PCA scree plots (including vector data) plus the correlation matrix and the results of a KMO test (Kaiser-Meyer-Olkin test: testing significant difference between the correlation matrix of variables and an identity matrix) are additionally provided with the Suppl. Material.

      The other concern is in regards to the framing and interpretation of these results. For instance, in the results (lines 329-330) and discussion (lines 419-423), the impression is given that experimental results here match those found in plants belonging to a different genus (i.e. Arabidopsis). However, rather than attributing this to more generally conserved mechanisms in response to considerable cold stress, the authors relate this to the unique history of Cochlearia (and its relationship to the drought adapted sister genus). The authors also note that surprisingly there was no demarcation of cold responses between the climate-defined groupings. Detailing why this is surprising given some of the other conclusion statements would be helpful. Some targeted revision to strengthen this link would be useful to bolster the inference of about the origins of cold tolerance in Cochlearia, rather than making it seem like this result could be expected in other taxa.

      Thank you for this. We agree that we did not explain our reasoning as well as we could and we now have reworked this. Original lines 329-330 simply refers to the (expected) and obvious general response to cold – some explanatory text has been added, e.g. such as at the end of the discussion and directly with the above-mentioned lines.

      Lastly, another area that would benefit from some clarification and tightening is revisiting the connection between the results and stated conclusions. For instance, some of the statements from the introduction and conclusions indicate the reader might expect explicit niche exploration analyses and more detailed genomic approaches. It is not abundantly clear for a general audience how these results definitively demonstrate how genetic diversity was rescued in reticulate and polyploid gene pools or species barriers were torn down. These are very specific, strong claims that do not appear to be explicitly discussed outside of the introduction/discussion or directly related to the results presented in this manuscript.

      Thank you for pointing out how this could be read in this way. We have revised this to indicate that agree: we do not think our data ‘definitively demonstrate’ (in the reviewer’s words, not our) this. We modify the text to avoid such interpretation.

      This is no way diminishes the considerable effort of the authors to conduct the informative array of presented analyses, but more closely aligning the conclusions within the scope of presented results (or providing direct links on how the results provide these insights) would help increase the effectiveness of this manuscript.

      Many thanks for this very encouraging note. We have worked to incorporate these thoughtful comments.

    1. When absent in teacher education programs and national policies, it is little wonder that many English teachers may be both stymied and fearful about addressing the civic, healing needs of classrooms

      This is very interesting because during my teacher preparation program here at UIC, we often were covering SEL and trauma-informed lessons, but I don't think anything can truly prepare teachers for the way our society is built on trauma of BIPOC folx. For example, many of our trauma-informed pedagogies are centered around student wellness and trauma they experience on a personal level, and while we may beat around the bush that the real problem are social oppressive systems and their impact on students, it feels very rare that we are examining how trauma is ingrained in every aspect of our society for Black and Brown children. For example, none of us were prepared to support students through the trauma of Covid-19, but the trauma that Black and Brown students are experiencing from Covid-19 is very different from the experience of many white families, and that is, again, because of the oppressive systems that we know of but don't fully examine as being the reason why students have so much trauma.

    1.  So what gets in the way of our pursuit of it? I think we most often resist going through the process of mastery  for two reasons: it can be deeply uncomfortable along the way and we doubt our ability to become expert.

      I think that this statement is 100% true. When people first start to get into new things and new skills then it probably seems very weird at first. For example if it's a sport then you're not going to be used to it after. It may feel uncomfortable because it's something that is new for you and your body/mind has to get the chance to adapt to it. As time goes on you gradually start getting better and better at this skill and it becomes a hobby or like the title states...something you're really good at.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): Hello, we wrote our review before seeing that you have special formatting requirements. We're just going to post our review in it's entirety rather than rewrite it based on these suggestions. It encompasses the above content, it's just not formatted in the suggested order. We hope that's OK! **Full review:** This manuscript makes a strong case for the evolvability of multicellular size via selection for settling rate in the icthyosporea. The use of an experimental evolution framework to assess the evolvability of multicellular phenotypes, using sedimentation rate as a selective pressure, extends the previous work of others into a new domain within the holozoan and the closest living relatives of animals. The natural, ecological significance of selection for sedimentation rate is a novel idea, and the connection between sedimentation rate and multicellular evolution in natural as opposed to contrived experimental circumstances is an interesting idea. The results are striking and well supported, with laboratory evolution rapidly adjusting both the cellular composition and the multicellular phenotypes of the organisms involved in ways that are well explained. This is an important result that brings the laboratory study of the evolution of multicellularity forward, into a different branch of the tree of life and showing its broad applicability. Sequencing of evolved lines adds significantly to the completeness of the story. While the causal role of these mutations in the production of the observed multicellular phenotypes are not demonstrated via manipulation or breeding, this is quite understandable in the light of the unusual model organism and the observed homologies and role of the genes involved. While this is largely clear from a reading, we believe the manuscript would benefit from a brief analysis of the numerical enrichment of genes with homologs involved in cytokinesis, cell membrane composition, and cell cycle control relative to the null hypothesis of genes picked randomly from the genome. If this is beyond the scope of this research in an unusual model organism with many poorly annotated genes, then a slightly expanded verbal discussion of the potential roles of the apparent functions of these genes in the evolution of multicellular clumping would be an appropriate substitute. We wholeheartedly recommend the publication of this manuscript with a number of minor revisions, which while not affecting the main conclusions or points of the manuscript will clarify important points, adjust small errors, and point the reader at relevant literature and concepts.

      ANSWER__: We would like to heartily thank the reviewers for their appreciation of our work. __

      **Major points:** none. **Minor points:** Line 79 - is sedimentation rate really invariably associated with multicellularization? Active swimming would seem to prevent this.

      ANSWER__: We meant to refer to the fact that all published examples of the emergence of multicellularity from unicellular ancestors have been accompanied by an increased sedimentation rate. Active swimming alone would just increase the diffusion rate of cells and not counteract the effects of increased size and density; such an active mechanism would also require directionality away from the tendency to sediment. A more passive mechanism, whereby a genetic variant, or cell cycle transition, which simultaneously causes a relative decrease in density while increasing cell size, leaving the net sedimentation rate the same as the ancestor, while conceivable, has not been observed in the literature. We changed the text from “invariably” to “frequently” at line 80 to emphasize how this is an empirical observation.__

      Line 164 - the precise phenotype in the evolution experiment being referred to is unclear without further context, with the ordering of paragraphs possibly needing a little work.

      ANSWER__: We tightened the paragraphs and merged both, the sentence containing “this phenotype” was removed.__

      Line 178 - is sorting them into three classes informative? Are there different mutations associated with these, or is it just visual clumping on the numberline? Perhaps not a useful classification, but the existence of great variation is an important point to get across. A more useful classification might be those that increase sedimentation with large density changes versus exclusively by clumping.

      ANSWER__: We agree with this argument and ultimately decided to remove the visual classification. We revised the text and figures accordingly.__

      Line 254 - excess cellular density is referred to interchangeably with density, when these are very different figures. This continues in line 269, and in the figure legends of Figure 4.

      ANSWER__: We fixed this.__

      Line 341 - the rule of RCC1 homolog in other organisms could be expanded on in slightly more detail. Similarly, other mutations in this same section known to affect cytokinesis could have potential mechanisms for affecting clumping commented upon, especially given the cell membrane results in the figures.

      ANSWER__: We share the reviewer’s enthusiasm about some of these mutations. We, however, try to be very conservative about what each gene or protein could be doing. Indeed, the absence of genetic tools does not allow us to directly test the effect of each mutation. We added a couple of extra sentences about RCC1 as well as about cytokinetic proteins and their potential role in clumping phenotypes.__

      Line 387 - awkward formatting or sentence structure, with dashes and commas.

      ANSWER__: We fixed the sentence structure.__

      Line 395 - this cellular process, or this evolutionary process of selection for faster settling?

      ANSWER__: We revised this appropriately.__

      Line 408 - per unit volume

      ANSWER__: Fixed.__

      Line 425 - the idea of clumpiness as ancestral is quickly put forward and dismissed within a single sentence. This could be explored in slightly more detail as an option, before concluding that what is clear is that the phenotype is easy to change.

      ANSWER:__ We agree that it would be interesting to pursue the ecological role and distribution of clumping and cell cycle phenotypes for other species in the Ichthyosporea genus. We could propose alternative scenarios of which trait came or went first and test this hypothesis by calculating the correlation of the presence or absence of the trait with the branch lengths and branching patterns of phylogenetic trees we have built using genome sequences. However, for our dataset, this would nonetheless remain a fragile correlation consisting of five data points. We do not feel such speculation is helpful for the text.__

      However, because two reviewers have mentioned or suggested in this direction, we expanded the discussion and annotated the tips of the species tree in figure 5 with the traits of interest. The result shows that S. gastrica, S. tapetis and S. nootkatensis species exhibit clumpiness as a trait. However, the data is not enough to resolve whether the traits are “derived” or “ancestral”.

      Line 437 - sedimentation as a highly variable trait, or a highly evolvable trait?

      ANSWER__: Evolvable trait. We fixed it in the text.__

      Figure 1G, 1H: We are fairly certain that the logarithmic scale of DNA content and coenocyte volume are mislabeled. The scale that is labeled log2 in 1G in the legend goes up by factors of 2 rather than single digits. The axis is obviously logarithmic, and the log2 in the legend is superfluous and misleading. Similarly, in 1H a scale labeled as log10 goes from 1 to 30, which on a logarithmic scale would be a sphere approximately 100 kilometers wide. The numbers can remain, but the legend should remove the log10.

      ANSWER__: Fixed. It is indeed a log scale. We made sure to remove the confusing log2 and log10 from figure and legend.__

      **General:** Were there any head to head competitions performed? Not suggesting you need to, but it's a nice way to directly examine fitness consequences of multicellularity, and is commonly done in the field. If you have done this it wasn't clear to us.

      ANSWER__: We now included a fitness experiment previously performed using the clumpy S01 and S03 in a head-to-head competition with the Ancestor (AN). The results are shown in Figure 2E and Figure 2 – figure supplement 1D. The results reflect how the fast-sedimenting clumpy phenotype is highly advantageous in our experimental evolution selection procedure, however deleterious in the absence of selection.__

      Reviewer #1 (Significance (Required)): see the above comments about writing the review before realizing there were specific formatting suggestions. I hope you understand us not wanting to re-write the review having already written it once.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)): The present work adds to the growing literature on sedimentation rate as a major player in the evolution of multicellularity. Via rigorous experimentation, the authors convincingly show that they can select for increase sedimentation rate and identify two mechanisms underlying this increase: incomplete cellular separation leading to multicellular groups and increases in cellular density. They also show surprising natural variation in sedimentation and argue that, along with similar evidence from other organisms, their findings cement the likely major role of sedimentation and go farther by revealing the tight genetic control that it is under. Reviewer #3 (Significance (Required)): This is a very significant study because it illuminates processes and underlying mechanisms that could have played a major role in the transition to multicellularity. Their result will likely greatly influence the conceptual and theoretical thinking and will foster additional empirical directions. My only quibble with the manuscript is that I wished for a bit more ecological context and grounding of the main findings: in that respect, both the abstract and the last paragraph of the discussion leave me wanting and occasionally puzzled. If maintaining buoyancy is such a strong selective pressure and the variation in sedimentation rate is such a challenge to it, then I think explaining a bit more exactly why sedimentation would evolve, why so much variation would exist etc etc would be really helpful to the more naive reader. Just a bit further elaboration on selective pressures (even presumed ones and even if speculative) would be helpful to put the picture together.

      ANSWER__: We would like to thank Reviewer #3 for his/her comments. We do believe that extensive ecological context is highly relevant. Throughout the manuscript, we strived to be conservative in the way we describe both our model system and its experimental and natural settings, perhaps to a fault, but we now do offer an evolutionary model that tries to shed light into the phenotypic evolution of the various species through different routes (Fig. 5H). To elaborate more on the rationale behind this strategy, we offer the following two aspects:__

      1. we are investigating a sizeable, but still a very limited number of six Sphaeroforma Therefore, we feel that explaining what trait may be considered ancestral is speculative based on the known species tree (we revised our Discussion in this regard and update figure 5A).
      2. our knowledge about the ecological niches of Sphaeroforma species is limited. We avoid extensive speculation, and while inference of the potential ecological context is part of the scope of this study, we relied on an experimental approach to tackle our questions, rather than ecological observation or computational modeling.
      • throughout the text we aimed to avoid taking a strong stance on the “adaptiveness” of the traits which we are measuring. This is because, depending on the model specification and parameters, ecological models could be made for or against whether the cellular traits of size and density, and their effects on the higher-level trait of sedimentation rate, might be adaptive “in the wild”.

      We hope that future studies will be able to tackle any open questions on the understanding of the ecology of ichthyosporeans, hopefully benefitting from our inferred evolutionary insights in this study.

      **A more minor point:** I remember seeing a talk by Will Ratcliff a while back in which he showed that in S cerevisiae they also see the two mechanisms of increased sedimentation: increased cellular size and clumping. Yet, I didn't see a reference to that work in the context of the cell density mechanism discussion and wondered why.

      ANSWER__: We do believe to have cited the relevant papers from the Ratcliff lab. To be clear, we observed two separate physical mechanisms for fast-sedimentation: __


      1. by cell-clumping (increasing size),
      2. by increasing the number of nuclei per unit volume (increasing density).

      To our knowledge the 1st mechanisms was indeed observed in snowflake yeasts (for which we referenced all relevant studies), whereas the 2nd, which we believe might be specific to multinucleated cells, while a conceivable variable affected by mutations in the organisms from these studies, has not been measured to our knowledge. We added a new model figure (Figure5H) to hopefully better get this message across.


      Reviewer #4 (Evidence, reproducibility and clarity (Required)): In this study Dudin et al. explored the variability of sedimentation rates in members of the Sphaeroforma genus and found that sedimentation rates are very variable between different isolates as well as during the life cycle of each isolates. Following this observation Dudin et al. evolved S. arctica under a regime favoring fast settling objects. After a few hundred generations they observed that most lineages increased their sedimentation rate. Characterization of some of these evolved population suggests two distinct mechanisms allowing fast sedimentation: cluster formation by non-separation of cells post-cellularization and increase in object density. By sequencing the evolved lines Dudin et al. were able to identify that several mutations has been under the effect of positive selection and that some of the mutations relate to mechanisms involved in cell separation and cellularization.

      ANSWER__: We dearly thank Reviewer #4 for his/her time and efforts.__

      **Major comments: **

      • Line 143, I don't understand how figure 1G shows that "nuclear division cycles were periodic...".

      ANSWER__: From previous published results (Ondracka et al 2018 & Dudin et al 2021), we know that nuclear divisions in S. arctica are strictly synchronized and occur within defined time-intervals. As can be seen in Figure 1G, DNA content doubles with a constant interval of about 9 hrs. Likewise, this phenomenon is clearly depicted in Figure 4F and Figure S4H. These results combined with results shown in Figure 1F, demonstrate that division cycles are still periodic in our experimental setting and are not occurring asynchronously as no odd number of nuclei per cell was observed.__

      • When characterizing the evolved lines, the authors display (and measure?) separately the size and the sedimentation rate, but don't directly compare them. If the statement that density plays a role in the sedimentation rate of S4 and S9 but not S1, then correlation between size and sedimentation should be similar between AN and S1 and changed in S4 and S9. It would be nice to see these relationships and the correlations.

      ANSWER__: We do indeed measure the size and the sedimentation rate of each fast-settling mutant separately. This is shown in figure 1C, where sedimentation rate is plotted against cell size for our dataset and the older Smayda (1973) data. Further, both measurements, directly, feed in the estimation of cellular density in Figures 4C and S4D (explained extensively in the methods). Cellular density estimations show the correlations and relationships between S1 and AN as well as between S4 and S9. __

      • Line 288: "surviving 780 generations of passaging for all 10 isolates" what data is this referring to?

      ANSWER__: This refers to growing cultures in the lab of fast-settling mutants with tens of passages done without any selection. These growing cultures maintained their clumping phenotypes even without a constant selection, suggesting they are due to a genetic modification. We are unsure about how to answer reviewer #4 as this is the data we are mentioning. We however changed “surviving” to “persisting for”, and hope it better clarified the sentence.__

      • The weakest aspect of the paper is that there is neither a statistical argument (with a single anecdotal exception), from seeing the same genes or pathways mutated in parallel experiments, or experimental reconstruction that argues that any of the observed mutations were selected as opposed to being neutral mutations that hitch-hiked with adaptive mutations. One strongly suspect that some of the observed mutations were selected, but from the available data, it is impossible to know which were selected and which were hitch-hiking.

      ANSWER__: We agree that our draft did not elaborate in-depth if mutations were drivers versus passengers, a fact also mentioned by another reviewer. To be fair however, there are several important considerations to make.__

      First, and most importantly, we do offer an unprecedented look into the genetic underpinnings of this novel model organism, and demonstrate highly parallel phenotypic evolution in response to selection. The molecular genetic signal reflects this finding given a skewed dN/dS-ratio > 1. While the precise molecular changes are not as easy to interpret, molecular parallelism at the level of genes is not a prerequisite for directional selection in repeat lineages, especially given the complex genomic architecture of S. arctica.

      Second, while we didn’t emphasize this a lot, the results from our bioinformatic analyses are pretty unique. We are dealing with a non-standard model organism here, with highly intriguing placement in the tree of life, but with big genome size, at >140 Mbp. This is 1-2 orders of magnitude larger than that of other single-celled model systems used in evolution experiments, including E. coli or S. cerevisiae. Unlike the latter two, this organism’s genome contains extensive levels of intergenic and intronic sequence, as well as a high amount of (simple sequence) duplication. Hence, the analyses of the resequencing data were a major effort, and it took an extensive amount of time to identify the mutations.

      Third, there are no genetic tools that would allow us to either perform molecular genetics or crossing with S. arctica as of now. This will change in the future, and in this event, our comprehensive list of target genes will be hopefully valuable to the field and beyond.

      • Even if the authors knew which mutations were selected, it is not possible to say if the mutations that have been selected are directly advantageous in the settling regime, they could be due to adaptation to lab conditions and higher temperatures, etc. Having a control evolution experiment with no settling selection would be required to reach the conclusion that the mutants were selected for faster sedimentation.

      ANSWER__: We agree that a “no-selection”-control experiment would have been helpful for the molecular interpretation. But the clumping phenotype has never been observed to occur in many generations of passaging in any of the labs culturing these organisms and at different temperatures (we made sure to specify this in the text) As such, we argue that any adaptation to laboratory conditions must have happened before we conducted our selection experiment. Given that the molecular signals were unique (with one exception), we have reason to believe that the highly controlled nature of the experiment with a constant environment throughout, did at least not bias the molecular signals toward extensive genetic parallelism. __


      **Minor comments:**

      • Line 164, the authors write "this phenotype", it is unclear what phenotype is referred to as.

      ANSWER__: Fixed__

      • Line 187: the authors use the word "radius" in the text, while using "perimeter" in the figure.

      ANSWER__: Fixed__

      • Line 224: Is the use of the expression "incomplete detachment between daughter and mother cell" appropriate given that all cells emerge from a multinucleated cell?

      ANSWER__: Fixed – “incomplete detachment between cells.”__

      • Line 151, typo, the "with" should be removed.

      ANSWER__: We believe the reviewer wanted to point out the “with” in line 251, which we fixed.__

      • The intro about changes in ecology is nice but does not make sense given the rest of the paper, I would add it to the discussion.

      ANSWER__: We beg to differ with Reviewer#4 here, as the water column distribution for plankton in marine environment is one of the key aspects of our paper and is a critical parameter in models of water body ecology.__

      • Line 399 "increase their cell size by increasing cell-cell adhesion post-cellularization" the first use of "cell" is misleading because the objects are now a collection of cells rather than a single cell.

      ANSWER__: Fixed__

      Reviewer #4 (Significance (Required)): Most of the findings made in this study have been obtained in previous studies done with more genetically tractable organisms, however this is the first time that such experimental evolution was made on a unicellular non-model system organism closely related to animals. The significance of the work is reduced by the failure to produce evidence to answer two critical questions about the observed mutations: 1) were they selected during the experiment or did they hitch-hike with other selected mutations, and 2) if they were selected, were they selected because they led to faster sedimentation or some other aspect of the conditions in which they were passaged. It would take serious effort to perform additional experiments to address these questions and thus the authors are likely to be better off explaining that their work is unable to answer the questions and thus they are speculating about both the causality of the mutants and the nature of the advantage they conferred.


      ANSWER__: We beg to differ with the reviewer’s argument.__

      We believe that our study demonstrates heritable phenotypic changes for an evolvable, ecologically relevant trait, and their tight cellular regulation. We identify and carefully quantify how two cellular growth phenotypes – the nuclear division rate and cell size control –– can vary heritably and independently of one another, and together directly shape variation in a critical ecological parameter of a marine organism. Therefore, in addition to the fact that the work was performed in an emerging model marine organism, this work provides fundamental “novel” insight into cellular trait evolution more generally.

      Our results do not depend upon knowing the exact genetic mutations or molecular mechanisms which have caused these phenotypic changes. Nor, as the reviewer implies, do we claim to have identified particular mutations that were selected, or their effects on particular cellular phenotypes. We do, however, provide a large amount of evidence that the changes are likely genetic. With our sequencing effort, we find a strong, statistically significant, molecular signal of adaptation in the lineages (dN/dS > 1), and we publish a curated list of affected genes which are potentially causative for the phenotypes we observe.

      Because we did not observe frequently recurrent mutations, as most directed (and cancer, antimicrobial resistance, etc.) evolution studies find, our results suggest that there is a large mutational target size affecting the phenotype of interest, reflecting its potentially broad genetic and molecular control mechanisms. We view these results as a great strength of the study, and consider this result in and of itself “novel”. Furthermore, we have now added and __used a statistical genetic approach to quantify the heritability of traits, or what proportion of the variance in phenotype is due to an individual’s inherited state__ (Figure 1 – figure supplement 1A). The results show that Heritability exceeds 95% across phenotypes, and across the entire dataset, H exceeded 99% of the total phenotypic variance (ANOVA F = 1118 on 252 and 735 DF, p = 0). This means that for a typical individual genotype in a given environment, we could predict its average phenotypic measurement with >97% accuracy.

      The fact that we do not conclusively identify which particular mutations are causative does not obviate the overwhelming evidence that heritable changes occurred in our samples, leading to repeated phenotypic convergence affecting the trait of sedimentation rate. We believe these phenotypic changes, and our quantification of their magnitude, to be a “novel” and “significant” contribution to the literature on cellular trait evolution, ecology, and multicellularity.





  7. www.digitalhermeneutics.com www.digitalhermeneutics.com
    1. The problem with synecdoche, or sampling, seems at first to bethat the part may not represent the whole as we would like to think itdoes, may not reproduce in miniature the characteristics we are inter-ested in, may not allow us to draw conclusions from what we do knowthat will also be true of what we haven’t inspected ourselves.

      Three possible problems of sampling.

    1. Author Response:

      Reviewer #1 (Public Review):

      Munc13 is a key regulator of synaptic vesicle (SV) fusion that is thought to mediate SV tethering and regulate SNARE assembly. Based primarily on Munc13 crystal structures, the authors design a set of four charge reversal mutations in the C1C2B region that are predicted to affect the interaction of Munc13 with the plasma membrane (PM). Various in vivo and in vitro consequences of these mutations are studied, leading to two main conclusions: (1) an interaction between the PM and a polybasic surface of Munc13 is likely important for SV tethering, and (2) two residues in the Ca2+-binding loops of the C2B domain are important for SV fusion.

      So far, so good - I think the data strongly support the two main conclusions noted above. It is less clear that these studies support (or could falsify) the main hypothesis, stated in the title, that re-orientation of membrane-bound Munc13 controls neurotransmitter release. Primed vesicles appear to exist in dynamic equilibrium between two states, one of them "loosely" primed (LS) and the other "tightly" primed (TS). Inasmuch as this simple model is correct, one could characterize the various players - SNAREs, synaptotagmin, complexin and of course Munc13 - in terms of their ability to influence the LS/TS equilibrium, perhaps in response to Ca2+ or other small molecules. This manuscript postulates that the orientation of Munc13 relative to the membrane has a major impact on the LS/TS equilibrium, with a perpendicular orientation favoring LS and a slanted orientation favoring TS.

      The authors' previous structure (Xu et al., 2017) suggested that two partially-discrete faces of C1C2B, one polybasic and the other centered around the Ca2+-binding loops of C2B, are likely involved in PM binding. In that paper they hypothesized that the polybasic face would dominate in the absence of Ca2+ whereas the 'Ca2+-binding face' [not a very good name, but the authors haven't suggested a better one] would dominate in the presence of Ca2+. Binding to the PM via the polybasic face would yield a more erect or 'perpendicular' binding orientation, whereas binding to the PM by the Ca2+-binding face would yield a more tilted or 'slanted' binding orientation.

      In revised manuscript we use the term DAG/Ca2+/PIP2-binding face, or Ca2+-dependent face when we discuss the effects of Ca2+ in particular.

      Here the authors performed two molecular dynamics simulations, one without and one with bound Ca2+. In the Results section, they correctly point out that their findings cannot be used to support their hypothesis because, in each case, Munc13 was placed in the hypothesized orientation - perpendicular for minus Ca2+, slanted for plus Ca2+ - at the beginning of the simulation. In the Discussion however the authors argue that the MD simulations support their model. I disagree because the simulations needed to falsify the model have not yet been conducted. In addition, an opportunity was seemingly missed by not doing MD simulations on the mutants.

      We have removed the sentence stating that the MD simulations support the model in the corresponding paragraph of the discussion (page 22). A meaningful analysis of the effects of the mutations would have required much longer simulations of this large system, which would take several months for each mutant in the UT Southwestern BioHPC facility or acquisition of a dedicated allocation at the Texas Advanced Computing Center.

      Of the four mutations studied, two (K603E and K720E) should specifically destabilize PM binding by the polybasic face, one (K706E) should destabilize binding by the Ca2+-binding face, and one (R769E) is expected to destabilize both. Two of the mutants (K603E and R769E) in fact abrogate priming. This result, along with biochemical experiments, implicates the polybasic face in SV tethering and thus represents the main evidence supporting the first of the main conclusions (see Evaluation Summary above). However, since an unprimed vesicle does not participate in the LS/TS equilibrium, these mutants are in this respect uninformative. Only the remaining mutants, K720E and K706E, would therefore appear to have the potential of yielding information about the LS/TS equilibrium and its relationship to Munc13 orientation.

      Although we understand the concern expressed by the reviewer, we do not fully agree with the last sentence. If we accept that the K603E and R769E mutations impair priming, this result implies that binding through the polybasic face occurs for WT Munc13-1. This conclusion does not demonstrate the LS/TS equilibrium, but it does support the notion that one of the proposed states exists.

      Both K720E and K706E support normal priming but have opposite effects on vesicular release probability and evoked release. These results can be rationalized in terms of an LS/TS equilibrium. The K720E mutation, which selectively destabilizes binding by the polybasic face, would shift the equilibrium toward TS and thereby increase the release probability. Conversely the K706E mutation, which destabilizes binding by the Ca2+-binding face, would shift the equilibrium toward LS and thereby reduce the release probability.

      However, the authors themselves cast serious doubt on this straightforward interpretation. In the case of K720E, they point out that the other 'polybasic mutant', K603E, has no effect of release probability. (I argued above that, perhaps, K603E is best viewed as uninformative about the LS/TS equilibrium owing to its strong upstream priming defect.) In the case of K706E, the authors point out that phorbol ester potentiation was similar for K706E and wild-type, suggesting to them "that the effects of the K706E mutation might not be related to the transition to slanted orientations but rather to another mechanism that directly influences fusion. For instance, the Munc13-1 C2B domain might cause membrane perturbations analogous to those that are believed to underlie the function of the Syt1 C2 domains in triggering release (Fernandez-Chacon et al., 2001; Rhee et al., 2005). It is also possible that the phenotypes caused by the K706E mutation and other mutations studied here reflect effects of Munc13-1 in more than one step leading to release, which complicates the interpretation of the data." If this is indeed the case, we are down to one mutant - K720E - that can be informative about the LS/TS equilibrium. (For the most part, I did not find the double and quadruple mutants informative, especially because each of them contains at least one mutation that strongly abrogates priming.)

      We again understand the concerns expressed by the reviewer but do not agree that the K706E mutant does not provide any information on the LS/TS equilibrium. If we accept that the K706E mutation does have an effect on evoked release and that K603E has an effect on priming, these results support the notion that both proposed binding modes occur and are functionally relevant. We do agree however that this conclusion does not prove that there is an equilibrium between two primed states.

      It looks like K720E is right in the center of the polybasic surface (although it's hard to tell from a single 'projection' image) so it would have been expected to impair Ca2+-independent liposome binding, and it does. However the liposome clustering effects are very weird, displaying a much broader distribution than any other experiment, an observation which the authors disregard. However, overall, I would say that the authors' K720E findings offer modest support for their overall main hypothesis. But for me it's not enough to justify making that hypothesis the title of the paper.

      We agree with the reviewer that it is a stretch to include the hypothesis in the title of the paper. We have changed the title to: ‘Control of neurotransmitter release by two distinct membrane-binding faces of the Munc13-1 C1C2B region’, which emphasizes the notion that there are two functional membrane-binding faces of the Munc13-1 C1C2B region without making a specific claim on a role of two faces on presynaptic plasticity. We note that the notion that the Ca2+- and DAG-dependent face of the C1C2B region is functionally relevant was already supported by previous studies (Rhee et al. 2002; Shin et al. 2010), which we now cite in additional sentences to emphasize this point (e.g. pages 22, 23). Hence, we believe that, together with the previous data, our results strongly support the conclusion that two faces of the C1C2B region are functionally important. We still present the LS/TS model and use it often to interpret our results, but have tried to be careful throughout the manuscript to not overstate our conclusions and point out when our results are consistent with the model without concluding that they prove it.

      For the most part I could not follow the discussion of figures 4 and 5. But I am struck by strong similarity between the data for K603E and K706E (comparing Fig. 4B/C to Fig. 4H/I). How can these results be reconciled with the opposite roles predicted for K603 and K706?

      The normalized data obtained for K603E and K706E mutants do look similar (new Fig. 8C,I), but the absolute amplitudes are lower for the former (new Fig. 8B,H). Nevertheless, we agree that it not straightforward to interpret some of the data obtained in the repetitive stimulation experiments. To acknowledge this difficulty, we have included the following sentence at the end of the first paragraph of the corresponding section (line 416): ‘Nevertheless, interpretation of some of the data was not straightforward, and there may be alternative explanations to those offered below, which are based in part on the proposed LS-TS equilibrium.’

      I'm not sure how the results of the PDBu experiments contribute to the conclusion that "two faces of the C1-C2B region are critical for Munc13-1-dependent short-term plasticity" (p. 15), since the only mutant that selectively affects one of the faces, K706E, has no impact (Fig. 6).

      We have toned down the sentence at the end of the section describing the PDB data, which now reads (line 475): ‘Overall, these results show that basic residues in the Munc13-1 C1-C2B region influence the potentiation of synaptic responses by PDBu and, together with the data obtained with repetitive stimulation, they support the notion that two faces of the C1-C2B region are involved in Munc13-1-dependent short-term plasticity.’

      Why are the liposome-binding assays in Fig. 7 done with C2C present - isn't that just a confounding factor? And if Ca2+-independent binding by C2C is as weak as suggested by the results in Fig. 7, how do any of the Munc13 constructs cluster liposomes in Fig. 8? (Note that, according to my reading of the methods, V-type liposomes are simply T-type liposomes without the DAG and PIP2.)

      Binding of the C2C domain to liposomes is indeed weak but still can contribute to liposome clustering because multiple C1C2BMUNC2C molecules can cooperate in this activity (see Quade et al. 2019). We used C1C2BMUNC2C mutants in the binding assays because they were also employed for the liposome clustering and fusion assays, in which C1C2BMUN is much less active (see Quade et al., 2019). We agree that having the C2C domain present could be a confounding factor, but we included the binding results because the effects of the mutations did correlate, albeit qualitatively, with those of the clustering and fusion assays.

      What is the basis for the claim (p. 22) that "the perpendicular orientation of Munc13-1 is expected to facilitate initiation of SNARE core complex assembly"?

      The perpendicular orientation may hinder the initiation of SNARE complex assembly if Munc13-1 is located between the vesicle and the plasma membrane, but can facilitate initiation of assembly if the bridging Munc13-1 molecules are located further from the center of the vesicle-plasma membrane interface (e.g. as in Fig. 1D; see also cryo electron tomography images of Quade et al. 2019). We agree that the term ‘expected’ is too strong and now state that the perpendicular orientation ‘may facilitate initiation of SNARE complex assembly’ (line 523).

      Reviewer #2 (Public Review):

      In this manuscript, Rosenmund and colleagues describe new results regarding the mode of action of Munc13 in neurotransmitter release. Based on molecular dynamics simulations of Munc13 (C1C2BMun) with phospholipid membranes, the authors selected promising point mutations and comparatively investigated their functional impact with electrophysiological experiments in hippocampal neurons and with a variety of in vitro experiments (lipid binding assay, liposome clustering and fusion). The results show that specific mutations in the C1C2B-domain (also referred to as polybasic face) of Munc13 (K603E, R769E) strongly inhibit vesicle priming, a property that correlates well with their re duced ability to bind to phospholipid membranes in a calcium-independent manner.

      The manuscript describes comprehensive electrophysiological and biochemical experiments that are complemented and extended by thoughtful analyses. The direct combination of electrophysiological and biochemical expertise from the Rosenmund and Rizo laboratories, respectively, represents a particular strength of this study, allowing the authors to develop new insights into the function of the Munc13 protein. A welcome (but not necessary) extension of the data presented would be the demonstration that the mutants in question (K603E, R769E) also show altered phospholipid binding in the MD simulations. In any case, the presentation of the data is clear and the authors' conclusions are convincing.

      Taken together, the manuscript and the results represent a significant advance in the understanding of the molecular mechanisms underlying synaptic vesicle priming.

      We thank the reviewer for the very positive evaluation of our study. As mentioned above, a meaningful analysis of the effects of the mutations would have required much longer MD simulations of this large system.

    1. Context: Sonia was watching Leah Remini: Scientology and the Aftermath: Season 3: "Episode 1" and had previously been watching a documentary One of Us about people who had left oppressive seeming Hassidic Jewish communities.

      I can't help but that that every culture could be considered a "cult" in which some percentage of people are trapped with comparison to all other cultures on Earth. Based on one's upbringing and personal compass, perhaps living and submitting to one's culture can become oppressive and may seem particularly unfair given power structures and the insidiousness of hypocrisy.

      Given this, could there logically be a utopian society in which everyone lives freely?

      Even within the United States there are smaller sub-cultures withiin which people feel trapped and which have the features of cults, but which are so large as to not be considered such. Even the space in which I freely live might be considered a cult by others who don't agree with it. It's only the vast size of the power of the group which prevents the majority who comfortably live within it from viewing it as a bad thing.

      A Democrat may view the Republican Party as a cult and vice versa, something which becomes more apparent when one polarizes these communities toward the edges rather than allowing them to drift into each other in a consensus.

      An African American may think they're stuck in a broader American cult which marginalizes them.

      A Hassidic Jew may feel they're stuck in a cult (of religious restrictions) with respect to the perceived freedoms of broader American Culture. Some may feel more comfortable within these strictures than others.


      A gender non-comforming person living in the deep South of the United States surrounded by the Southern Baptist Convention may feel they're stuck in a cult based on social norms of one culture versus what they experience personally.


      What are the roots of something being a cult? Could it be hypocrisy? A person or a broader group feeling as if they know "best" and creating a rule structure by which others are forced to follow, but from which they themselves are exempt? This also seems to be the way in which authoritarian rules arise when privileging one group above another based solely on (perceived) power.


      Another potential thing at play here may be the lack of diversity within a community. The level of cult within a society may be related to the shape of the bell curve of that society with respect to how large the center is with respect to the tails. Those who are most likely to feel they're within a "cult" (using the broader definition) are those three or more standard deviations from the center. In non-diverse communities only those within a standard deviation of the norm are likely to feel comfortable and accepted and those two deviations away will feel very uncomfortable while those who are farther away will be shunned and pushed beyond the pale.


      How can we help create more diverse and broadly accepting communities? We're all just people, aren't we? How can we design communities and governments to be accepting of even the most marginalized? In a heavily connected world, even the oddball teenager in a small community can now manage to find their own sub-community using the internet. (Even child pornographers manage to find their community online.)

      The opposite of this is at what point do we circumscribe the norms of the community? Take the idea of "Your freedom to strike me ends at my nose." Perhaps we only shun those extreme instances like murder and pornography, and other actions which take extreme advantage of others' freedoms? [This needs to be heavily expanded and contemplated...] What about the over-financialization of the economy which takes advantage of the unprivileged who don't know that system and are uncapable of the mathematics and computation to succeed. Similarly hucksters and snake oil salesmen who take advantage of their targets' weaknesses and lack of knowledge and sophistication. Or the unregulated vitamin industry taking rents from millions for their superstitions? How do we regulate these to allow "cultural freedom" or "religious freedom" without them taking mass-scale advantage of their targets? (Or are some of these acculturated examples simply inequalities institutionally built into societies and cultures as a means of extracting power and rents from the larger system by those in power?)


      Compare with Hester Prynne and Nathaniel Hawthorne's The Scarlet Letter.


    1. The web is a necropolis, where the dead will one day outnumber the living. In my years online, people who have been part of my daily life have suddenly, unaccountably winked out of existance. Disconnected or died? or, like ghosts on a stone tape, merely overwiped. On the web we are ageless; our bodies may decay, but text typed at 14 looks much the same typed at 24 or 54.

      Think of carved inscriptions on Roman walls. They took for granted that they carried their dead with them. Maybe this isn't so strange so much as the illusion we'd all had that we could create something fresh, new, untouched by our ancestors.

    1. racially diverse and valuingdiversity” institutional culture may minimize racism percep-tion group differences.

      I think the distinction made here between merely being diverse, versus actually valuing diversity is fascinating. We talked about this a lot earlier in the course with regard to Loyola. Our university is growing increasingly more diverse, yet many students feel as though administration does not actually value diversity.

    1. Brand Book {draft} To be able to change the world on the scale which it is needed, we cannot tell our story alone. We believe with an alliance of unlikely connections in the form of agencies and brands, we can share the load on the creation of unlikely connections, and build the power of the next social network, a social network for good. This Brand book walks you through AIME, it gives you the history, it gives you the callouts, it unlocks some pathways for storytelling. We share the AIME Design Brain we use to ensure we’ve birthed an AIME idea, and finally the channels AIME has to create unlikely connections. Now I didn’t want to bury the lead - we deeply believe that every campaign, every story, starts with manual one to one connections, that marketing requires human to human connection, and we don’t believe in one big story that suddenly goes viral: we don’t like viruses, we don’t like unhealthy growth. We are in this for the long game. The current dislocated media landscape is not who we require for verification. We want to build the connections one by one, and if the work is meaningful, then people may talk about it, but if not, the work is done. We focus our campaigns solely on the creation of the unlikely connections, not on who's watching. We don’t think facebook, instagram or twitter are strong arenas for communication at a level of depth that changes things. If your strategy involves them, delete your strategy, focus on the offline world, or on platforms that give space for depth, like podcasts or youtube. Remove the artificial, the distraction, the desperation for a quick result or a quick outcome and please please please build it slowly with us. One by one, in the shadows if we must, we’ll slowly keep building an incredibly meaningful social network for good that brings in the intelligence of all humankind. Thank you for creating unlikely connections with us for a fairer world Jack Manning Bancroft AIME Founder 14 August 2021 About AIME HISTORY IN FILM: What is AIME? IN FILM: What is UNCx5? What’s AIME’s vision? What is the problem AIME is solving? What’s the solution? How has AIME made the solution? What’s the difference between AIME and IMAGI-NATION? How do we measure success? How not to talk about AIME How to talk about AIME Our Spokespeople Where and how to activate AIME’s unlikely connections How to birth an AIME Idea Philosophy Star Dust Freedom Knowledge Create a fairer world? Economics Artists Engineering The AIME Unlikely Connection Channels IMAGI-NATION {University} IMAGI-NATION {TV} IMAGI-NATION {Radio} Making of a Hoodie Podcast IMAGI-NATION {Cinema} Fashion for Good IMAGI-NATION {Library} IMAGI-NATION Appendix: Glossary of words and phrases in the AIME universe About AIME In 2004, AIME founder Jack Manning Bancroft, sketched an idea of a social network for good, one that connected university students as mentors with Aboriginal & Torres Strait Islander high school students in Australia, building bridges between two different groups, to lead to educational equity, exchanges of worth and value, and for the mentors a deeper connection to a different lived experience. In 2005, this network commenced and scaled at pace around Australia engaging over 25,000 Indigenous high school students who closed a 40% education outcome gap, and it lit up the minds of a generation of university students desperate to connect to something bigger than themselves, with over 10,000 university students volunteering their time and energy to make AIME the largest ongoing volunteer movement of university students in Australian history. The power of AIME to build unlikely connections grew as we encountered further barriers to the high school students’ pathway out of inequity - barriers in mass cultural storytelling where they couldn’t see anyone like them, barriers in employment, barriers in the board rooms, barriers in the shape of the economy that saw so many kids like them outside the margins. One by one, we’ve worked tirelessly on building bridges between these young people and the people in control of many of the friction points where change has not yet occurred, but is possible if we embrace unlikely connections. The more our work grew around Australia, the more we realised the largest challenge to inequity was not limited by national borders; it was all interlinked. It was how we saw each other, how we saw people outside the margins, how we valued exchange, and the amount of the pie there was to go around globally. In 2016, we expanded our work across the globe, which has led to the invention of our own TV network, our own radio show, our own University to train people to make unlikely connections and which in 2021 is reaching people across 52 countries. We scaled our work in fashion, with our Hoodie to drive into youth culture with a symbol that was more than an empty brand promise, a symbol that showed the true power of fashion for good. We are in the process of bringing all of this work into an online world, contained in one social network, where we can model a different economy of exchange where everyone is included, and where there are bridges for those in positions of power who want to see things change, but don’t know where to find a marginalised young person, or connect to a different way of thinking. Our network will build these bridges driven by the power of unlikely connections. While nation states have struggled to find solutions that bridge the divides, we have decided to call our new network, IMAGI-NATION, a new nation, where everyone has a seat at the table, and where we are all invited to make an unlikely connection and help build a fairer world. HISTORY IN FILM: Origin Story - 2005-2012 - Australian Story - https://www.youtube.com/watch?v=Mt5RxdQRFR4&ab_channel=AIMEMentoring Going Global - 2014-21 - 7 Down - https://vimeo.com/563040825 Password: down47down Philosophy in a podcast - 2021 conversation between Tyson Yunkaporta & AIME Founder Jack MB https://player.fm/series/the-other-others/positivity-meets-complexity What is AIME? AIME is ‘unlikely connections’ for a fairer world. We are a network that connects marginalised youth with the rest of the world to make space for exchanges of time, knowledge, opportunities to create more bridges between those inside the margins and those outside so we can realise a fairer world. IN FILM: COGS - Created to help AIME go global with Oscar Award Winner Laurent Witz & M&C Saatchi - https://www.youtube.com/watch?v=sGt3figvnfU&ab_channel=AIMEMentoring What is UNCx5? Unlikely Connections times 5 - it’s the formula to unlock the power of unlikely connections and the key to open up the world of IMAGI-NATION. To create change, we don't need an island, or thousands of Instagram followers, or be a LinkedIn influencer. All we need is 5 incredible Unlikely Connections, and watch the many infinite new connections into experiences, knowledge, perspectives that explode when 5 people go deep in a smaller circle - in a network that is decentralised and includes us all. What’s AIME’s vision? Creating millions of unlikely connections between marginalised youth and those inside the margins AND between all human beings and different ways of thinking in order to create a fairer world. What is the problem AIME is solving? Our current connections work towards a concentration of wealth and opportunity for the few, a confirmation of our biases, more time with people like us What’s the solution? Unlikely connections, between races, ages, wealth, nations. How has AIME made the solution? Two parallel pathways - connecting people with stories and knowledge, and connecting people with each other. The stories and knowledge come through AIME’s Hoodie, TV, Radio, Film, Gallery, Library & University. The connections are facilitated via AIME’s online social network for good & in our physical work in Universities & schools worldwide - IMAGI-NATION, AIME’s university IMAGI-NATION {University}, and via AIME’s meeting place within IMAGI-NATION, a global exchange portal where marginalised youth can connect with mentors, internships, scholarships and jobs. What’s the difference between AIME and IMAGI-NATION? AIME is the organisation, IMAGI-NATION is the network. How do we measure success? By counting the unlikely connections created. And then by tracking through case studies, the deeper impact short, medium, long term of those unlikely connections. How not to talk about AIME Okay here’s some watchouts. Avoid these: · The Australian Indigenous Mentoring Experience - AIME was founded as an Australian unlikely connector between Indigenous and non-Indigenous people, it’s now grown to 50+ countries. QANTAS used to be the Queensland and Northern Territory Airline Association, it’s now just QANTAS. AIME’s origin story is part of the heartbeat of the organisation, and it is told with subtlety and nuance, by having a global stage where Indigenous Australian young people stand alongside other young people outside the margins, and people inside the margins, and in that statement, on a global stage, we see the ultimate equity achieved for Indigenous people in Aus. That equity is that there are no ceilings, there are no doors closed to their possibility, their identity is their power and their story, not to limit them, but to unleash them. · “Indigenous Australian and other marginalised youth” - this reinforces the negative twice. We want the audience to understand the global inequality faced by young people who because of historical circumstances, because of societal design, have landed in a life that they are outside the margins. Calling out Indigenous Australians and then other marginalised youth does a double otherising. Back to the simple message “AIME connects young people outside the margins with a network of those inside the margins to build exchanges to create a fairer world.” · Awareness - AIME isn’t about awareness, we aren’t here to tell people about the problem of inequity, to dwell on the past, AIME is about solutions, AIME is about tomorrow, AIME is about action, about really simple action where people make an unlikely connection with knowledge through our storytelling, are inspired to act through our storytelling, or are connected in unlikely ways via AIME’s network. o Particular callout on the AIME Hoodie - the AIME Hoodie is the most activated meaningful Hoodie in the world. No hoodie we make is about awareness. For example: § Making Space Hoodie is a Gallery - it exhibits the work of marginalised youth from around the world & it also exhibits the work of profile artists giving their profile and work to raise $ and bring people to connect with the AIME network and make more unlikely connections. § Making Space Hoodie is a ticket - to the global Making Space exhibition where the world’s marginalised youth are exhibited on the walls of the most prestigious galleries around the planet. § Making Space Hoodie is not awareness building about the plight of inequity. · AIME is the anti charity. If there’s anything you’ve seen before in public fundraising, gala balls, flip the script on it because AIME doesn’t want to give to people what they already know, we don’t give them what they’ve already got, because that is not an unlikely connection with an idea, with a way of thinking. We want people to see AIME as the ideal organisation on planet earth, not your usual charity, not noble, but normal. How to talk about AIME · Unlikely connections for a fairer world. · Ask a question - What unlikely connection with an idea, with a person, has changed your life for good? · We are the anti facebook - AIME is the network of tomorrow - built for everyone, not affirming what we know and who we know, but connecting us to what we don’t know and who we don’t know, not for entertainment, but for good. · Action action action - focus on the action, the impact, the outcome, the unlikely connections, how one unlikely connection after another we can change things. Map the impact, showcase how the idea is changing the world. · Borrow - borrow from all different organisations, stories, ideas, and fuse the unlikely connections. · Imaginatively create stories that are fuelled by unlikely connections. If you have a young person from outside the margins and someone from inside the margins you are on your way. If they are activated and working proactively on a project of tomorrow, and the young person is shown with strength, with agency, not as a problem to be fixed, but the solution, then you are well on your way. · DO IT - don’t overthink it, don’t over strategise it. Make sure the passion for what we are doing - the fairer world we are fighting for, is alive. When this is alive, we are alive, we’ll learn from the doing. · Always drafting - embrace an idea that we are always drafting. This links in with the doing of it. If we know we are creating unlikely connections, then let’s get out there and do it. Our Spokespeople · Our Lead spokespeople are our 6 Professors. They transcend our literal representations of race and identity and allow us to move into a place of imagination. They are multidimensional, they are challenging and complex. We want our professors doing media spots, public speaking events, representing AIME. o Profile on each please Josh BuoyVanessa EllisBenjamin Knight · AIME Ambassadors - 200 young people from around the world Where and how to activate AIME’s unlikely connections Our social network for good, since 2005 has focused on real life, real world interactions between human beings, creating unlikely connections one by one. We see the power of the internet to connect and are developing our own digital social network for good to be released in 2022. We are very wary of the trap of Facebook, Instagram, Twitter, and would prefer the use of these platforms to drive physical action. For example: · Making Space campaign - Post from an artist at a gallery asking their gallery to join the Making Space Exhibition globally · Making Space campaign - an employer inviting other businesses to join the making space club What we dig less is ‘awareness’: “I’m wearing this hoodie, I’m cool therefore marginalised youth are cool” We aren’t so into that, we’d prefer action. Eg: · I’m wearing this hoodie with a callout to any young artists from outside the margins who want to have a chance to be exhibited at the Louvre in this year’s MAKING SPACE exhibition and have your own work created into a custom Art Hoodie via AIME’s IMAGI-NATION{Gallery}, head to http://aimementoring.com to apply. Ensure there are unlikely connections from the inception through to the delivery of the idea and impact tracking & storytelling afterwards. If it builds the network of unlikely connections, if it leads to action = good. If it talks about how good AIME is = not so good. Remember - not the past, but the future. Not the problem, but the solution. That every single communication piece from us is an opportunity to create an unlikely connection with a new piece of knowledge, a different way of thinking, or a person, that leads to a fairer world. How to birth an AIME Idea This is our AIME brain, it’s what’s required to create an AIME idea. If you have ticked all of these, it’s an AIME idea. We believe in knowledge to change the world, that’s why we have a philosophy checklist; we believe that economics drives what we value and to change the world, we must influence economic exchanges; we see art and artistic thinking as leading an idea, birthing a reality, a bridge between imagination and what we know; and finally, we believe in robust engineering to ensure we change the system, and the idea can move from imagination to actionable change. Below is the graphic we work through to ensure we have designed an AIME idea. And we’ll share a short description of each section. Philosophy Star Dust · Does this idea live after it’s been created, is there a vision where it explodes, and the star dust that is left helps the whole earth? Freedom · Are we working on freeing people’s minds? Or helping them enter a space of imagination? Are we suspending disbelief? Are we flipping the script on how we think? Are we releasing all sides/different people from their existing biases and allowing for freedom of thought to see unlikely pathways as realities? Knowledge · Is there knowledge shared as part of the idea? Not surface awareness but deep knowledge transfer? Do we change the way people think? Is there depth to the idea? Create a fairer world? · Does this action create a fairer world? How will we prove it? Economics · Exchange of Time, Knowledge, Opportunities o Does this idea focus on an exchange of time, knowledge and opportunities? Does it provide the space for those involved to share across the margins? Is cash kicked down the line as a barrier to entry? Are there moneyless exchanges leading? · IMAGI-NATION - Social Network for Good o Does it bring the audience to IMAGI-NATION to act? o Does it inspire the audience to network differently, to network with unlikely connections? Artists · Make a statement o Does the idea grab you? Does it make a statement? Have we distilled the essence of it into a headline? Is the statement something we can stand for? · Always drafting o Does the idea have fingerprints all over it? Have we embraced ‘always drafting’ as a concept in design? Have we let the audience into the process of creation? Have we created a bridge between them and us by being human, by drafting with them? Have we co-created? And have we released ourselves from perfection by releasing the idea, then drafting with the world? · Imagine o Have we harnessed the power of our collective and individual imaginations with the idea? Have we truly deeply imagined what’s possible? Is the idea predictable or imaginative? Have we used the principle of unlikely connections in the birthing of the idea to ensure it is imaginative? · Layers and levels o Is there depth to the work? Are there multiple layers and levels at play? Does it work today and tomorrow? Is there complexity in the approach? · Play with the frame o Have we looked at the existing frame and played with it? Have we drawn outside the margins? Have we changed the frame? Have we played with the assumptions of what the playground is? Have we, in the very act of adjusting the frame, expanded the margins? Have we made the frame bigger to help others see bigger? Have we made more space? Engineering · Impact o What is the measurable impact of unlikely connections created from the campaign? What are the numbers of young people from outside the margins that will have unlikely connections because of this idea? How are we going to capture the case study impact of the work? In what format? What changes because of the idea? And can you prove it with hard facts? With numbers and stories? · Repeatable o Is the idea repeatable? Can it scale globally? Can it grow year on year? Could it last for 20 years? · Shift the system from the inside o Does the idea bring those people from inside the margins onto the bridge to make an unlikely connection with those outside the margins? Have we inspired those inside the margins to act, then given them a pathway and responsibility to do the work? o Does it have a design that moves beyond a day? Does it have a club/ a system/ a campaign/ a peer-to-peer device/ cultural pressure that moves the work back into the hands of those within the margins to create the change themselves? o What levers are we pulling on to make the system move? · Long Game o Have we imagined what happens with this idea in 100 years’ time? Have we thought about what happens in 1000 years’ time? Have we released ourselves from a measurement of success being an instant ‘like’, to thinking about the long game? Have we resisted the pressure of “big news” results, to think one-by-one about how we can build the idea year-on-year, to create the snowball, into the avalanche of change? Is there patience in the design? · Who's at the table? o Are people from outside the margins at the table in the design of the idea? Do we have unlikely connections at play the whole way through? · Give kids the stage o Does the idea make a stage and then give the stage to young people outside the margins to show they are not a problem to be fixed but part of the solution? o Are young people involved in the design process? o How do the young people take their opportunities from the idea and become leaders that pass it on and create more opportunities for young people like themselves? The AIME Unlikely Connection Channels IMAGI-NATION {University} Where AIME educates & inspires people in how to make unlikely connections to act for a fairer world. Students complete their courses over 10 months. There are five degree courses and five key audiences: · Executives - who work on creating a Co-CEO in their organisation and levelling the playing field in their workplace · University students - who lead an AIME student chapter and create mentoring connections between university student mentors and 100 marginalised high school students · Teachers - who teach with imagination to engage ALL students in the classroom and build bridges to local employers & community · Entrepreneurs - for school students from outside the margins to become entrepreneurs and create change from the inside out (and for those within the margins to build unlikely connections back to those outside) · Citizens - for individuals to work on projects for change within their communities or the world Via IMAGI-NATION {University}, by 2024, AIME is looking to create unlikely connections for 90K marginalised youth per year. Here are 5 case studies of students enrolled in 2021 IMAGI-NATION {TV} A weekly TV show where we curate unlikely connections. This is where we incubate ideas, where we bring people together to create the connections. From the show we have birthed IMAGI-NATION {University}, IMAGINE Film, a Hoodie that pays rent, and 1000’s of unlikely connections. The first season: https://vimeo.com/454576826/883f618ecb<br> Example episode: Each episode partners with a school and via the knowledge and the people on IMAGI-NATION {TV}, we are looking to provide unlikely connections to 5000 marginalised youth per annum by 2024 (100 kids per school per show). 5 Guest profiles IMAGI-NATION {Radio} Our main show is Making of a Hoodie Podcast with a few others in production Making of a Hoodie Podcast A monthly/bi-monthly activated podcast where we create unlikely connections, and then from the podcast create a hero hoodie, then activate more unlikely connections. Each year we work on: · 12 Schools globally · 3 activists · 3 artists · 3 alternative thinkers Our current distribution partner for the show and the Hoodies is The ICONIC. Via Making of a Hoodie Podcast we are looking to create unlikely connections with 1000 marginalised youth per year (100 kids per school per show). Example Show: https://podlink.to/makingsomethingouttanothing<br> Example Hoodie:https://shop.aimementoring.com/products/moah-hero-hoodie<br> 2-3 participant case studies IMAGI-NATION {Cinema} Once a year we work on releasing a film as a driver to open applications for IMAGI-NATION {University}. We see our films as a way to create unlikely connections with ideas and ways of thinking. Our current films are: · COGS · Dreams Our 2021 Film is: · 7 Down Our 2022 Film is: · IMAGINE Film We are also working on ways to tell the story of our Professors of IMAGI-NATION {University} to the world. Fashion for Good We can create activated Hoodies to amplify any campaign or idea or story. Current Hoodie campaigns we are running: · Kindness Hoodie · Making Space Hoodie · Hero Hoodies via Making of a Hoodie Podcast IMAGI-NATION {Library} This is our legacy to the world for the next 60,000+ years of human existence. This is where we keep developing all of our books, and mentor tools. The key development in this area is the creation of Mentor Class - a variety of videos and lessons from Mentors.

      Mentor Class example IMAGI-NATION In 2022 we will have our own digital social network for people to be able to enter and exchange and engage with each other's time, knowledge, and opportunities. Designed with unlikely connections between some of the world’s most interesting organisations and human beings, IMAGI-NATION models a different economy and is a home for kids pushed outside the margins to walk across a bridge into knowledge and opportunities, and a space for citizens of our earth to work out how to live and design a world more equitably Appendix: Glossary of words and phrases in the AIME universe 18 values A set of values that infuse everything we do. Everyone enrolled in IMAGI-NATION {University} is trained in our 18 values. These are hope, change, freedom, rebelliousness, listening, empathy, BRAVE goals, no shame, initiative, yes and, forgiveness, kindness, gift of time, failure, asking questions, hard work/discipline, know yourself, mentors not saviours. 365-day Goal Station An ingredient of pop-up IMAGI-NATION {Factory} days. Where mentees write on post-it notes their goals for the year and place them on a sign where they can be seen. 6 knowledge fields The 6 key areas of knowledge and experience we have gained over the years that form the basis for everything we do and that we teach at IMAGI-NATION {University}. These are Imagination, Mentoring, Organising Change, Building Bridges, Flipping the Script and Hoodie Economics. AIME A global network that connects youth from marginalised backgrounds with the rest of the world to make space for exchanges of time, knowledge and opportunities between them. AIME Time Machine (AimeTM) An ingredient of pop-up IMAGI-NATION {Factory} days. Where mentees ‘deposit’ the baggage they are going to leave behind before they enter the IMAGI-NATION {Factory}. Always drafting We are always drafting. We have released ourselves from perfection and embraced the idea that our work is always a draft. It’s never finished, never perfect. Asking questions One of our 18 values. Asking questions allows us to move from what we already know to what we don’t yet know. Asterix Professor of Hoodie Economics at IMAGI-NATION {University}. Asterix is a philosopher combined with an economics major in pursuit of what makes life worth living. She’s asking some very big questions through her research to redefine how we think about adding value in our world – pursuing an exchange of time and experience instead of just money. Blue Professor of Flipping the Script at IMAGI-NATION {University}. Blue knows that self-authorship and an entrepreneurial mindset are integral in order to move oneself outside the dominant narrative. His fundamental lesson in flipping the script: “I don’t have to play a part in someone else’s story”. Blue wants to help write a curriculum for all students to see themselves in a new light at IMAGI-NATION {University}. BRAVE goals One of our 18 values. AIME embodies BRAVE (BIG, RISKY, AUDACIOUS, VISIONARY, ENDLESS) goals. Building bridges One the 6 knowledge fields taught at IMAGI-NATION {University}. Creating connections across nations, cultures, races, ages, socio-economic differences. Cellular network Our organisational structure at AIME—a living, evolving, decentralised system of intermingling cells. Change One of our 18 values. Change is the only constant! Co-CEO The Co-CEO program looks at levelling the playing field and making boardrooms more diverse and inclusive. Executives recruit a young person aged 18-30 from a background that has historically experienced marginalisation who will shadow them for 6-12 months and absorb all the learning available to those who get a seat at the decision-making table. Empathy One of our 18 values. Empathy is feeling with the heart of another person. Einstein Professor of Building Bridges at IMAGI-NATION {University}. Einstein wrote a paper as a sociology grad student in the 1970’s based on a phenomenon she gathered from her research: instead of going from A to B and B to C, what if we just built a bridge from A to C? Now she’s getting recruited by leaders around the world to talk about building bridges. The problem is: she’s not sure it’s going to work... Energy Professor of Mentoring at IMAGI-NATION {University}. Energy is on a lifelong quest to pass on knowledge. She earned her PhD in mentoring; she loves the Plato/Socrates relationship; she’s obsessed with seeing knowledge passed down from generation to generation. If she wants to tell you something, she’ll tell you a parable. She knows that if you want to change the world, you need to connect with people through stories. Failure One of our 18 values. When we fail, we learn. When we learn, we grow. Failure Time An ingredient of pop-up IMAGI-NATION {Factory} days. A confidence and resilience-building session where kids try out new things and learn it’s ok to fail. Flipping the script One the 6 knowledge fields taught at IMAGI-NATION {University}. Shifting the dominant narrative from a lens of problem to solution. Forgiveness One of our 18 values. Forgiveness gives us the power to move beyond a certain circumstance or person and not let it define us. Freedom One of our 18 values. Freedom is about casting off the chains that come from ourselves, history, society. GAIME of Life An ingredient of pop-up IMAGI-NATION {Factory} days. An interactive writers’ room and role play game where kids get to write and bring to life a story that can inspire kids like themselves. Gift of time One of our 18 values. At AIME, we believe the greatest gift we can give is the gift of our time by turning up for others. Hard work/discipline One of our 18 values. Hard work and discipline are the gears behind change. It’s not always pretty but it’s completely necessary. Hoodie The AIME Hoodie is the most meaningful Hoodie in the world. Since 2010, it has been the currency of IMAGI-NATION and our device for change. We ask people to act, to stand up and create change, and in exchange, we give them a hoodie to say “Thank you for fighting for a fairer world.” Hoodie Economics One the 6 knowledge fields taught at IMAGI-NATION {University}. The economy that underpins AIME: elevating the exchange of time, knowledge and opportunity above money. Hope One of our 18 values. Hope is believing in a better future and working to make it happen Hope Professor of Imagination at IMAGI-NATION {University}. Hope knows that hope doesn’t come easy: it’s always a struggle. He’s trying as hard as possible to build that bridge between reality and unreality. Hope feels the heaviness of hope; he carries this giant burden and responsibility – all the while thinking: “Don’t make me carry this alone!” Hope is the epitome of hard work: he knows you don’t get to opt out of the work if you want to change the world. Imagination One the 6 knowledge fields taught at IMAGI-NATION {University}. Imagination is the beginning of human thought and action. IMAGI-NATION An online world where we can model how society can work differently, where everyone has a seat at the table, where people enter and engage with each other and exchange time, knowledge, and opportunities, and where we are all invited to make an unlikely connection and help build a fairer world. IMAGI-NATION {Ambassadors} One of the options of the {Entrepreneurs} course at IMAGI-NATION {University}. A 100-day challenge for school students to use IMAGI-NATION to create a fairer world through a change mission of their choice. IMAGI-NATION {Artists} One of the options of the {Entrepreneurs} course at IMAGI-NATION {University}. A three-month residency for young artists to be mentored by a team of artists and have their work featured on IMAGI-NATION {TV} and in our IMAGI-NATION {Gallery}. IMAGI-NATION {CEO4Good} One of the options of the {Entrepreneurs} degree course at IMAGI-NATION {University}. A 100-day challenge for school students from inside the margins to mobilise their networks to share wealth, knowledge, opportunities with kids being left behind. IMAGI-NATION {Cinema} Our films and series use story to create unlikely connections with ideas and ways of thinking. Currently: Cogs (film, 2017), Dreams (film, 2018), The Professors (series, 2021), 7 Down (film, 2021), IMAGINE (film, coming in 2022), The Professors’ House (series, coming in 2022). IMAGI-NATION {Citizens} One of the degree courses at IMAGI-NATION {University}. For individuals to lead projects that drive meaningful change in their community and the world using the tools of IMAGI-NATION. IMAGI-NATION {Classrooms} A digital mentoring and tutoring session delivered via online meeting platforms to support mentees academically. IMAGI-NATION {Curriculum} A suite of mentoring tools and activities for school students based around our 18 values delivered through our pop-up IMAGI-NATION {Factory} days, IMAGI-NATION {Teachers} and partnerships with educators. Includes the Magic Maker, the Purple Carpet, the 365-day Goal Station, Sacrifice Planes, AIME Time Machine, Failure Time, GAIME of Life, Keys to the City, the Hoodie, books, films and more. IMAGI-NATION {Entrepreneurs} One of the degree courses at IMAGI-NATION {University}. School students become entrepreneurs for good and gain hands-on experience in leading change for themselves, for others or for the planet, including by using their artistic talents to have their voice and other voices heard. Includes {Ambassadors}, {Artists}, {CEO4Good}, {Filmmakers}, {Writers}. IMAGI-NATION {Executives} One of the degree courses at IMAGI-NATION {University}. For executives wanting to transform the leadership culture of their organisations, level the playing field for young people from outside the margins and bring diverse young talent into the boardroom. IMAGI-NATION {Factory} An immersive theatre experience delivered on school and university campuses to help kids develop confidence, unlock imagination and brave thought, and free their potential to create change in their world and in the wider world. This is how the IMAGI-NATION {Curriculum} is delivered to mentees. IMAGI-NATION {Filmmakers} One of the options of the {Entrepreneurs} degree course at IMAGI-NATION {University}. School students are mentored by professionals from the TV and film industry in how to tell stories and make films. IMAGI-NATION {Gallery} A virtual space and physical spaces (including the Hoodie) where AIME artists can display and sell their artwork and connect with established artists, galleries and opportunities. IMAGI-NATION {Library} A resource library of free IMAGI-NATION knowledge and tools to live on forever, for humanity. Available to everyone, both within and outside of IMAGI-NATION {University}. IMAGI-NATION {Presidents} One of the degree courses at IMAGI-NATION {University}. {Presidents} are university students who lead an AIME student chapter on their campus and create mentoring connections between university student mentors and 100 high school kids who have been pushed outside the margins. IMAGI-NATION {Radio} Where we host our Making of a Hoodie podcast. Each episode invites a different school to give the stage for youngsters to create unlikely connections and co-design an exclusive AIME hoodie sold on ICONIC apparel. IMAGI-NATION {Teachers} One of the degree courses at IMAGI-NATION {University}. Teachers wanting to teach with IMAGI-NATION to engage all students in the classroom and build bridges to local employers and community. IMAGI-NATION {TV} A weekly TV show on YouTube where we curate unlikely connections. Hosted by AIME school students, {Presidents} and partner schools. People from all walks of life around the world come together to connect with young people pushed outside the margins and share knowledge and ideas on ways to actively create a fairer world. IMAGI-NATION {University} A free online university where we educate and inspire people in how to make unlikely connections to act for a fairer world. With courses for school students, university students, teachers, executives and everyday citizens. IMAGI-NATION {Writers} One of the options of the {Entrepreneurs} degree course at IMAGI-NATION {University}. School students are mentored by established writers in how to write and tell stories. IMAGINE film The first crowd-written, crowd-produced feature-length film. To create the stage for kids from marginalised backgrounds to join household names in film and television for a live creation mentoring experiment in filmmaking. Impact Real-life change in someone’s mind, in their life, in their community, in the world because of an unlikely connection with an idea, a way of thinking, a piece of knowledge, a person, and that leads to a fairer world in the short, long or medium term. Initiative One of our 18 values. Initiative is about saying ‘If not me then who? If not, now then when?’ Keys to the City An ingredient of pop-up IMAGI-NATION {Factory} days. A moment where kids step to the front, take the stage and get the chance to shine in any way they choose. Kindness One of our 18 values. Kindness is in the AIME DNA, we LOVE to trade in the currency of kindness. Kindness Hoodie A hoodie designed to inspire kindness throughout the year. Every week, 20 people around the world each wear a kindness hoodie for a week, spread as much kindness as they can in whichever way they choose, then pass the hoodie on to the next 20 people. Know yourself One of our 18 values. Knowing yourself means increasing your awareness of the emotions, motivations, desires, abilities, fears and aspirations that form the self. Lionelcorn Professor of Organising Change at IMAGI-NATION {University}. Lionelcorn is fascinated by organizational systems that exist in the wild. (He’s obsessed with mycelium.) All of his organizational theories are based in natural regeneration. He thinks it’s bullshit that the world isn’t changing like nature. Chaos is natural. He’s pushing for chaos and change. Listening One of our 18 values. Listening is the gateway to empathy and connection, and is often a more powerful action than speaking. Magic Maker An ingredient of pop-up IMAGI-NATION {Factory} days. A question mark wand with 1,000 questions on it that work as a ‘get to know you’ conversation starter to bring mentors and mentees together. Making of a Hoodie A monthly/bi-monthly activated podcast that connects schools with activists, artists and alternative thinkers through ideas, knowledge and perspectives and sees them create a school hoodie together. Making Space A global art initiative that sees the work of youth from marginalised backgrounds exhibited in the IMAGI-NATION {Gallery}, on custom AIME hoodies, alongside profile artists and through their networks, and in galleries around the world. Meeting Place A global exchange portal where youth from marginalised backgrounds can connect with mentors, internships, scholarships and jobs. Mentoring One the 6 knowledge fields taught at IMAGI-NATION {University}. Mentoring is the key to sharing knowledge and wisdom across generations. Mentors in Residence Knowledge-holders from different walks of life who mentor 20 of our key leaders at AIME over a 3-month term twice a year. Mentors, not saviours One of our 18 values. We are not here to ‘save’ the youth that we work with. We are here to mentor kids so that they are stronger without us. Mycelium We are inspired by mycelium, a magical and intelligent web of life above and beneath the soil and a vital force of life on Earth. Mycelium is the oldest continuously surviving multi-cellular network in the world and helps trees talk to each other underground. No shame (at AIME) One of our 18 values. No shame at AIME is one of our earliest catchphrases. There’s zero tolerance at AIME for casting shame on others for expressing themselves or for being who they are. Organising change One the 6 knowledge fields taught at IMAGI-NATION {University}. Organising change is about how we can change things to be fairer. Professors The six non-human and complex academics who are the founders of IMAGI-NATION {University} and AIME’s lead spokespeople: Professor Asterix, Professor Blue, Professor Einstein, Professor Energy, Professor Hope, Professor Lionelcorn. Purple Carpet An ingredient of pop-up IMAGI-NATION {Factory} days. Inspired by world class, theatrical architecture and the red carpet, a device that signals to the mentees they are stepping into the world of IMAGI-NATION. Rebelliousness One of our 18 values. We’re not going to change anything by accepting the status quo. Sometimes a little rebellion is necessary to create change. Sacrifice Planes An ingredient of pop-up IMAGI-NATION {Factory} days. Where mentees write on paper planes the sacrifices they will make to achieve their goals and send them out into the world. Social Network for Good IMAGI-NATION: our social network founded in 2005 that focuses on meaningful human interactions anchored in real life and in the real world, and forges unlikely connections between people to create a fairer world. Sunday Kindness Our weekly newsletter delivering a dose of kindness to the world every Sunday. This Hoodie Pays Rent An initiative launched in April 2020 where we split profits from ‘This Hoodie Pays Rent’ hoodie sales with those struggling to pay their rent because of COVID-19, to help keep a roof over their heads. Unlikely connections Unlikely connections is connecting people with other people and with stories, experiences, knowledge, ideas, perspectives and opportunities they wouldn’t ordinarily connect with, across races, ages, wealth and nations. Yes, and One of our 18 values. At AIME, YES AND means encouraging a collaborative environment and cherishing ideas before burning them.

    1. FIRST SENATOR. This cannot be By no assay of reason. ’Tis a pageant To keep us in false gaze. When we consider The importancy of Cyprus to the Turk; And let ourselves again but understand That, as it more concerns the Turk than Rhodes, So may he with more facile question bear it, For that it stands not in such warlike brace, But altogether lacks the abilities That Rhodes is dress’d in. If we make thought of this, We must not think the Turk is so unskilful To leave that latest which concerns him first, Neglecting an attempt of ease and gain, To wake and wage a danger profitless.

      How does race factor into how the European characters interact with Othello?

    1. Beware: Gaia may destroy humans before we destroy the Earth

      Hmmm. I have been thinking about Earth having a fever in response to a pathogen. Foreign bodies, viruses, known as corporations have infected the minds of their host organisms, using legal systems to reprogram their syntropic nature as living organisms with a compulsion to replace themselves with entropy machines. By assuming personhood, corporations are consuming and monopolizing the time, energy, and resources of their hosts so that they have achieved a level of control and domination over nature such that they can change the climate and reversing the process of biological and cultural evolution.

      “I think I can feel the future.”

      https://world.builderscollective.org/awake/

    1. And yet “these kids” could out argue me about everything under the sun: the inherent problems with school policies, the merits of long lunches, why we should hold class outside, and about local issues that reverberated through the building like desegregation and school closures. When they wrote, they had spelling errors and grammar issues, despite—or because of—the Warriner drills or my lack of knowledge about African American Vernacular English, but their logic and evidence spun circles around me.

      I think this is an important paragraph because it points out that academic English can often times act as a barrier to acknowledging the true ability of our students. We may focus on the grammar or misspellings and mistake that for a lack of intelligence or ability when it's simply the hegemonic norm we continue to enforce.

    1. DisCOlarships are one way to advanced from a Casual to Committed DisCO relation. As such, they are considered a Casual Relationship leaning towards committed relationships with DisCOs in general (if not to the particular DisCO hosting the DisCOlarship).

      Idk what it is, but I get a weird feeling about how explicit the lines are being drawn around casual and committed relationships. I think it's related to the earlier comment about feeling like the text is overemphasizing the DisCOs agency in defining the relationship.

      I think part of it is also the fact that the casualization of labor is a really sore point for a lot of workers right now, as we emerge as a new "precariat" class. While its understandable that a lot of this work may be considered volunteer work and thus unpaid, continually noting that this is a casual relationship with no obligation for pay definitely makes me recall toxic work relationships in my past and I imagine will do the same for other workers.

    2. The DisCO.NP may contract services when lacking the capacity - See Undercapacity and contracting outside the DisCO.NP. The DisCO.NP may also choose to engange with other DisCOs in work relationships and value transactions - see DisCOverses and Intra-DisCO Value Flows.

      On this read, I'm putting myself in the shoes of someone that is interested in working as a contributor. These sentences feel way more relevant to the DisCO itself. Not to say this doesn't belong in this page, but I think we may need to work on shaping the tone of this document to more clearly delineate who. the audience is at each point of the text.

    3. To recap: We have distinguished two main states: Casual and Committed. "Casual" means little responsibility. These are no-strings-attached relationships for mutual benefit. There are two types of Casual Relationship: Supporters (Very casual interactions) and Contributors (More active interactions and actual contributions to the DisCO.NP and its mission). "Committed" signifies a stated commitment of responsibility to the DisCO.NP and its members. Those wanting to progress from Casual to Committed have two options: DisCOLarships (practical DisCO training with no firm expectation of joining the DisCO). DisCO Dating: Intense mentoring program for applicants to join the DisCO in which they are being mentored.

      High level, I think it may make more sense to lead with "TL;DR"s rather then ending with them so that as people are browsing through the wiki, they can quickly decide if they're on the right page or not.

    4. On the downside, DisCOs are also more complex in the initial stages, although once their learning curve has been overcome, we'd argue that they function more smoothly and are more resilient organizations.

      I don't know that this is inherent to DisCOs as individual organizations! I think that this might appear to be the case given that we're still building out the core of the philosophy while also trying to create organizations that work by these ideals while existing as an explicitly counter-hegemonic endeavor. All that to say, I think you may be selling DisCOs short by including this in there, but understand why it's included right now. My main reason for bringing it up is, we don't want to give people the impression that this is the hardest path possible because it will likely result in slower adoption of these principles.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2021-01041 Corresponding author(s): Gregory P. Way, PhD

      1. General Statements

      On behalf of the authors, I’d like to thank the Review Commons team for sending our manuscript out for review. I’d also like to thank the three anonymous reviewers for providing valuable feedback that will improve the clarity, focus, and analysis interpretation presented in our manuscript.

      To prompt the editorial team, our paper provides two well-controlled innovations:

      We are the first to train variational autoencoders (VAEs) on classical image features extracted from Cell Painting images. VAEs are commonplace in, and have contributed major discoveries to, other biomedical data types (e.g. transcriptomics), but they have been underexplored in morphology data. In our paper, we trained and optimized three different VAE variants using Cell Painting readouts and compared these variants against shuffled data, against PCA (a nonlinear dimensionality reduction algorithm commonly used as a VAE control), and against L1000 (mRNA) readouts from the same perturbations. We found that cell morphology VAEs train with different settings than gene expression data, and that they generate interpretable latent spaces that depend on the chosen VAE variant.

      We tested special VAE properties to predict polypharmacology cell states in a novel way. Polypharmacology is a major reason why drugs fail to reach the bedside. Off-target effects cause unintended toxicity, and lead to adverse clinical events. In our paper, we used VAE latent space arithmetic (LSA) to predict polypharmacology cell states; in other words, what cells might look like if we perturbed them with a compound that had two mechanisms of action (MOA). We compared our results to shuffled data, PCA, and to LSA performed with VAEs trained using L1000 readouts. We found that cell morphology and gene expression provide complementary information, and that we could predict some polypharmacology cell states robustly, while others were more difficult to predict.

      We found value in all of the reviewer comments. We intend to conduct all but four of the proposed analyses to supplement our aforementioned innovations.

      In the following revision plan, we include all reviewer comments exactly as they were written. The reviewers often had overlapping suggestions. In these cases, we grouped together similar reviewer comments and responded to them once.

      We include three sections: 1) A description of the revisions we plan to conduct in the near future; 2) A description of changes we have already made; and 3) A description and rationale of changes we will not pursue.

      Lastly, we would like to highlight that all reviewers provided positive feedback in their reviews. They discussed our paper as “conceptually and technically unique” and were positive about our methods section, stating that we did a “good job making everything available and reproducible”. Our methods section is complete, and we provide a fully reproducible and versioned github repository. We will release a second version of our github repository when we complete our revision plan to maintain clarity for our submitted version and the peer-reviewed version.

      1. Description of the planned revisions

      2.1. Address UMAP interpretability to provide a deeper description of MOA performance

      Reviewer 1: Instead of using UMAP embedding, it would be better to compare reconstruction error or show a reconstructed image with the original image to claim that models reliably approximate the underlying morphology data.

      Reviewer 1: Rather than just stating that the VAE's did not span the original data distribution and saying beta-VAE performed best by eye, some simple metrics can be drawn to analyze the overlap in data for a more direct and quantified comparison. Researchers should also explain what part of the data is not being captured here. Some analysis of what the original uncaptured UMAP represents is important in understanding the limitations of the VAEs' capacity.

      Reviewer 2: The authors compare generation performance based on UMAP. In the UMAP space, data tend to cluster together even though they might be far from each other in the feature space. I would like to see more quantitive metrics on how well these methods capture morphology distributions. You can compute metrics like MMD distance, kullback leibler (KL), earthmoving distance, or a simple classifier trained on actual MoA classes tested on generated data.

      We agree with the reviewers that evaluating reconstruction loss in addition to providing the UMAP coordinates would improve understanding of VAE limitations and enable a better comparison of VAE performance. We will analyze reconstruction loss across models and include these data as a new supplementary figure, which will enable direct comparisons across models and across different MOAs.

      We also agree that UMAP interpretation can be misleading. While currently state-of-the-art, UMAP has mathematical limitations that prevent interpretation of global data structures. However, there are emerging tools, including a new dimensionality reduction algorithm, called PaCMAP, which aims to preserve both local and global structure (Wang et al, 2021). We will explore this tool to determine, both mathematically and empirically, which is most appropriate for our dataset by cross-referencing the visualization with our added supplementary figure describing per-MOA reconstruction loss.

      We would also like to emphasize that we trained our VAEs using CellProfiler readouts from Cell Painting images and not the raw Cell Painting images themselves. As this was one of our primary innovations, this detail is extremely important. Therefore, we have improved clarity and added emphasis to this point in the manuscript introduction and discussion (see section 3).

      2.2. More specific comparisons of MOA predictions to shuffled data and improved description of MOA label accuracy

      Reviewer 1: It is difficult to know the clear threshold for successful performance is on figures like Figure 7 and SFigure 9, but by and large, it appears that the majority of predicted combination MOAs were not successful. Without the ability to either A) adequately predict most all combinations from individual profiles that were used in training or B) an explanation prior to analysis of which combination will be able to predict, it is difficult to see this method being used since the combinatorial predictions are more likely not informative.

      Reviewer 1: The researchers justify the poor performance compared to shuffled data, by saying that A) MOA annotations are noisy and unreliable and B) they MOAs may only manifest in other modalities like what was seen in the L1000 vs morphology predictability. While these might be true, knowing this the researchers should make an effort to clean and de-noise their data and select MOAs that are well-known and reliable, as well as, selecting MOAs for which we have a known morphological or genetic reaction.

      Reviewer 3: Figure 6 is missing error bars (standard deviation of the L2 distance) and, as such, is hard to draw conclusions from.

      We thank the reviewers for raising this concern. We agree that it is critical, and we appreciate the opportunity to address it.

      All three of these comments relate to being unable to draw conclusions from our results when most A∩B predictions appear to have no difference from shuffled controls. Therefore, to address this comment, we will update our LSA evaluation to compare each MOA to a matched set of randomly shuffled data. Specifically, in our existing comparison, we realized a methodological fallacy in how we're displaying these data shuffles. We should be comparing specific MOA combinations to their corresponding shuffled results instead of comparing all to all, which will artificially decrease performance when there are polypharmacology predictions that fail to recapitulate the ground truth cell states.

      We have connected with Paul Clemons, the senior director Director of Computational Chemical Biology Research at the Broad Institute of MIT and Harvard, who has informed us that the Drug Repurposing Hub annotations are among the most well documented. Therefore, while we know that biological annotations are often incomplete, our original text overemphasized the amount of noise contributed by inaccurate labels. We therefore added the following sentence to the discussion to clarify this important point:

      “However, the Drug Repurposing Hub MOA annotations are among the most well-documented resources, so other factors like different dose concentration and non-additive effects contribute to weak LSA performance for some compound combinations (Corsello et al, 2017).”

      We will also update our supplementary figure to account for specific MOA shuffling and include additional text comparing Cell Painting and L1000 showing which MOAs perform best in which modality.

      2.3. More detailed evaluation of MOA performance across drug variance and drug classes

      Reviewer 1: With the small number of combinations that are successfully predicted, to build confidence in the performance, it would be necessary to explain the reason for the differences in performance. Further experimentations should be done looking into any relationship between the type of MOAs (and their features) and the resulting A|B predictability. Looking at Figure 7, the top-performing combinations are comprised entirely of inhibitor MOAs. If the noisiness of the data is a factor, there should be some measurable correlation between feature noisiness and variation and the resulting A|B predictability from LSA.

      We agree with the reviewer that further experimentation would be helpful to gain confidence in our LSA performance. We plan to perform two different analyses to address this question. First, we will compare profile reproducibility (median pairwise correlations among MOAs) to MOA predictability. This will provide insight to determine the relationship between MOA measurement variance and performance. Second, we will split MOAs by category (e.g. inhibitor, activator) and test if there are significant performance differences between categories across VAE models in both L1000 and Cell Painting data. This will tell us if there are certain trends in the type of MOAs we’re able to predict. If there is, this would be useful knowledge since it could suggest that certain types of MOAs are associated with a more consistent cell state.

      2.4. Higher confidence in LSA overfitting assessment

      Reviewer 1: To show that the methodology works well on unseen data, researchers withheld the top 5 performing A|B MOAs (SFig 9) and showed they were still well predicted. This is not the most compelling demonstration since the data to be held out was selected with bias as the top-performing samples. It would be much more interesting to withhold an MOA that was near or only somewhat above the margin of acceptability and see how many holdouts affected the predictability of those more susceptible data points. From my best interpretation, the hold-out experiment also only held out the combination MOA groups from training. It would be better if single MOAs (for example A) which were a part of a combination of MOA (A|B) were also held out to see if predictability suffered as a result and if generalizability did extend to cells with unseen MOAs (not just cells which had already highly performing combinations of seen MOAs).

      We believe our original analysis was extremely compelling. Even if we removed the top MOAs from training, we were still able to capture their combination polypharmacology cell states through LSA. We find this similar to removing all pictures of sunglasses in an image corpus of human faces, but still being able to reliably infer pictures of people wearing sunglasses. Specifically, this tells us that our model is learning some fundamental data generating function that our top performing MOAs tap into regardless of if they are present or not in training.

      However, we agree with the reviewer that withholding intermediate-performing MOAs would also be informative, but for a separate reason. Unlike the best predicted MOAs, the intermediate MOAs are likely more susceptible to changes in the training data, so it would be interesting to determine if intermediate MOAs’ performance is a result of overfitting instead of truly learning aspects of the data generating function. We plan to perform this new analysis and add the results to Supplementary Figure 8 as a subpanel and add a full description of the approach to the appropriate methods subsection.

      2.5. Additional metrics to evaluate LSA predictions to provide more confident interpretation

      Reviewer 2: The predictions are evaluated using L2 distances, which I find not that informative. I would like to see other metrics (correlation or L1 or distribution distances in previous comments)

      We agree with the reviewer that using more than one metric would be helpful because oftentimes a single metric does not tell a complete story. We will add a panel to the LSA supplementary figure (Supplementary Figure 7), using Pearson correlation instead. While L2 distances will tell us how close predictions are to ground truth, Pearson correlations will tell us how consistent, on average, we are able to predict feature direction.

      2.6. Adding a performance-driven feature level analysis to categorize per-feature modeling ability

      Reviewer 2: I would like to see feature-level analysis, which features are well predicted and which ones are more challenging to predict?

      We agree with the reviewer that feature level analysis would be interesting to study. We believe that understanding which features are easy and hard to model could give insight into why certain MOAs (which could be associated with more signal in certain Cell Painting features) are predicted better than others.

      However, we are concerned that it is difficult to have an objective measurement of which features are easier to model because features that have less variation might be easier to model. So, we will analyze the correlation between individual feature reconstruction loss vs. feature variance across profiles. We will color-code the points to represent feature groups or channels. This analysis will not only demonstrate the relationship between feature variance and modeling ability, but also provide insight into the difficulty of modeling individual CellProfiler features.

      1. Description of the revisions that have already been incorporated in the transferred manuscript

      3.1. Documenting positive feedback as provided by the three reviewers

      Reviewer 1: With access to the dataset, the posted GitHub, and documentation in the paper, I believe that the experiments are reproducible.

      Reviewer 1: The experiments are adequately replicated statistically for conventions of deep learning.

      Reviewer 1: This paper proposes a conceptually and technically unique proposal in terms of application, taking existing technologies of VAEs and LSA and, and as far as I know, uses them in a novel area of application (predicting and simulating combination MOAs for compound treatments). If this work is shown to work more broadly and effectively, is seen through to it completion, and is eventually successfully implemented, it will help to evaluate the effects of drugs used in combination on gene expression and cell morphology. An audience in the realm of biological deep learning applications as well as an audience working in the compound and drug testing would be interested in the results of this paper. Authors successfully place their work within the context of existing literature, referencing the numerous VAE applications that they build off of and fit into the field of (Lafarge et al, 2018; Ternes et al, 2021, etc...), citing the applications of LSA in the computer vision community (Radford et al, 2015, Goldsborough et al, 2017), and discussing the biological context that they are working in (Chandrasekaran et al, 2021).

      Reviewer 2: The main novelty of the work is applying VAEs on cell painting data to predict drug perturbations. The final use case could be guiding experimental design by predicting unseen data. However, the authors do not show such an example and use case which is understandable due to the need for doing further experiments to validate computational results and maybe not the main focus of this paper. The authors did a good job of citing existing methods and relevant. The potential audience could be the computational biology and applied machine learning community.

      Reviewer 3: The manuscript is beautifully written in a crystal clear manner. The authors have made a visible effort towards making their work understandable. The methods section is clear and comprehensive. All experiments are rigorously conducted and the validation procedures are sound. The conclusions of the paper are convincing and most of them are well supported by the data. Both the data and the code required to reproduce this work are freely available. Overall, the article is of high quality and relevance to several scientific communities.

      We thank the reviewers for their encouraging remarks and overall positive sentiment. As early-career researchers, we feel empowered by these words.

      3.2. Moved Figure 2 to supplement and removed Figure 5

      Reviewer 1: Fig 2 is not informative so it can go to supplementary.

      Reviewer 2: I liked the paper's GitHub repo, the authors did a good job making everything available and reproducible. As a suggestion, you can move the learning curves in two the sup figures cause they might not be the most exciting piece of info for the non-technical reader.

      Reviewer 3: I would suggest removing Figure 5 (or moving it to the supplementary) as it revisits the content of Figure 1 and does not bring much extra information.

      We agree that Figure 2 might not be informative to a non-technical reader, so we have accepted this suggestion by both reviewers 1 and 2, and we have moved Figure 2 to supplementary.

      We agree with the reviewer and have removed Figure 5.

      3.3. Clarified our data source as CellProfiler readouts, not raw Cell Painting images

      Reviewer 1: In Fig 4, it would be useful to show a few sample representative images with respect to CellProfiler feature groups.

      Reviewer 1: Figure 6, what does it means original input space? Does it mean raw pixel image? As researchers extracted CellProfiler feature groups already, it would be interesting to compare mean L2 distance based on CellProfiler features so that whether VAE improves performance or not (compared to handcrafted features) as a baseline.

      Reviewer 3: While what "morphological readouts" concretely mean becomes clearer later on in the paper, it would be useful to give a couple of examples early on when introducing the considered datasets.

      We thank the reviewer for these suggestions, which bring to light a common source of confusion, which we must alleviate. We are working with CellProfiler readouts (features extracted using classical algorithms) of the Cell Painting images and not the images themselves. We have made several edits throughout the manuscript to improve clarity and remove this confusion, including the introduction, in which we clearly state our model input data:

      “Because of the success of VAEs on these various datasets, we sought to determine if VAEs could also be trained using cell morphology readouts (rather than directly on images), and further, to carry out arithmetic to predict novel treatment outcomes. We derive the cell morphology readouts using CellProfiler (McQuin et al, 2018), which measures the size, structure, texture, and intensity of cells, and use these readouts to train all models.”

      This decision comes with tradeoffs: The benefit of using CellProfiler readouts instead of images is that they are more manageable but we might lose some information. We more thoroughly discuss this important tradeoff in the discussion section:

      “We determined that VAEs can be trained on cell morphology readouts rather than directly using the cell images from which they were derived. This decision comes with various trade-offs. Compared to cell images, cell morphology readouts as extracted by image analysis tools (e.g. CellProfiler) are a more manageable data type; the data are smaller, easier to distribute, substantially less expensive to analyze and store, and faster to train (McQuin et al, 2018). However, it is likely some biological information is lost, because these tools might fail to measure all morphology signals. The so-called image-based profiling pipeline also loses information, by nature of aggregating inherently single-cell data to bulk consensus signatures (Caicedo et al, 2017).”

      3.4. Clarified future directions to infer cell health readouts from simulated polypharmacology cell states

      Reviewer 1: Authors also make the claim that they can infer toxicity and simulate the mechanism of how two compounds might react. This is a claim that would not be supported even if the method were able to successfully predict morphology or gene profiles. Drug interaction and toxicity are quite complex and goes beyond just morphology and expression. VAEs predicting a small set of features would not be able to capture information beyond the readouts, especially when dealing with potentially unseen compounds for which toxicity is not yet known. For example, two compounds might produce a morphology that appears similar to other safe compounds but has other factors that contribute to toxicity. Further, here they show no evidence of toxicity or interaction analysis.

      The reviewer is correct that such a claim is unsupported by our research. Our message was actually that inferring toxicity could be a potential future application of our work. Specifically, for example, we can apply orthogonal models of cell toxicity that we previously derived using other data (Way et al, 2021a) to our inferred polypharmacology cell states. We thank this reviewer for noticing our lack of clarity, and we have made changes in the discussion to make it clear that inferring toxicity is something we may do in the future and is not something that is discussed in the manuscript:

      “In the future, by predicting cell states of inferred polypharmacology, we can also infer toxicity using orthogonal models (e.g. Way et al. 2021) and simulate the mechanisms of how two compounds might interact.”

      3.5. Clarified our method of splitting data, and noting how a future analysis will answer overfitting extent

      Reviewer 2: Could authors outline detailed data splits? Which MoA are in train and which are held out from training? As I understood, there were samples from MoAs that were supposed to be predicted in the calculation of LSA? Generally, the predicted MoA should not be seen during training and not in LSA calculation.

      We now more explicitly detail how we split our data in the methods:

      “As input into our machine learning models, we split the data into an 80% training, 10% validation, and 10% test set, stratified by plate for Cell Painting and stratified by cell line for L1000. In effect, this procedure evenly distributes compounds and MOAs across data splits.”

      We also thank the reviewer for this comment, because they express an important concern about making sure that we are not overfitting to the data. We have explained in the manuscript that because of lack of data, MOAs were repeated in training and LSA. However, we believe overfitting is not playing a large role in model performance. Through our hold 5 out experiment, we are able to show that our models are able to predict the same MOAs irrespective of whether they were in the training data, indicating that we did not overfit to the distribution of certain MOAs.

      Reviewer 1 also suggested that we do the hold 5 out experiment on A∩Bs that were barely predicted. After we do that, we will explicitly demonstrate the extent of overfitting.

      3.6. Introduced acronyms when they first appear in the manuscript

      Reviewer 3: The Kullback-Leibler divergence is properly introduced in the methods part, but not at all in the introduction (it directly appears as "the KL divergence"). To enhance readability, it would be better to fully spell it before using the acronym, and maybe give a one-sentence intuition of what it is about before pointing out to the methods part for more details.

      We thank the reviewer for bringing this to our attention. We have carefully reviewed the entire manuscript and have corrected such instances of clear introductions to acronyms.

      3.7. Fixed minor text changes

      Reviewer 3: In Figure 1, I would recommend changing "compression algorithms" to "dimension reduction algorithm" or "embedding algorithm". In a compression setting, I would expect the focus to be on the number of bits of information each method requires (or the dimension of the resulting embedding) to encode the data while guaranteeing a certain quality threshold. This is obviously not the case here as the dimension of the embedding is fixed and the focus is on exploring how the embedding is constructed (eg how much it decorrelates the different features, etc) - which may be misleading.

      Reviewer 3: I recommend using "A n B" or "A & B" or "(A, B)" to denote the combination of two independent modes of action A and B. The current notation "A | B" overloads the statistical "A given B" which appears in the VAE loss and is therefore misleading.

      We agree with the reviewer, and aim to minimize all sources of potential confusion. We have made the change in the figure.

      We also agree that our current notation can be confusing. We have updated all instances of “A|B” with “A ∩ B”.

      3.8. Added hypothesis of MMD-VAE oscillations to supplementary figure legend

      Reviewer 3: Do the authors have a hypothesis of what may be causing MMD-VAE to oscillate during validation when data are shuffled? This seems to be the case on two of the three considered datasets (Figure 2 and SuppFigure 1) and is not observed for the other models. Including a few sentences on that in the text would be interesting.

      We believe a big reason for this is because of the fact that the optimal MMD-VAE had a much higher regularization term, which puts a greater emphasis on forming normal latent distributions, than the optimal Beta or Vanilla VAE. Forcing the VAE to encode a shuffled distribution into a normally distributed latent distribution would be difficult to do consistently across different randomly shuffled data subsets, and therefore might cause oscillations in the training curve across epochs when the penalty for that term is high. As these observations may be interesting to a certain population of readers, we have incorporated this explanation into the supplementary figure legend (which is where this figure is shown):

      “Forcing the VAE to consistently encode a shuffled distribution into a normally distributed latent distribution would be difficult, and therefore might cause oscillations in the training curve across epochs.”

      3.9. Explained our selection of VAE variants

      Reviewer 3: The different types of considered VAE and their differences are very clearly introduced. It may however be good to motivate a bit more the focus on beta-VAE and MMD-VAE among all the possible VAE models. This is partly done through examples in the second paragraph of page 2, but could be elaborated further.

      We thank the author for their encouraging remarks. We have made edits to the manuscript’s introduction, explaining why we chose these two variants out of all the possible choices:

      “We trained vanilla-VAEs, β-VAEs, and MMD-VAEs only, and not other VAE variants and other generative model architectures, such as generative adversarial networks (GANs), because these VAE variants are known to facilitate latent space interpretability.”

      1. Description of analyses that authors prefer not to carry out

      4.1. We will not explore additional latent space dimensions in more detail, as this is out of scope

      Reviewer 1: As both reconstructed and simulated data did not span the full original data distribution, it might be better to look at reconstruction error and increase the dimension of latent space.

      We thank the reviewer for bringing up this important point. Our VAE loss function consists of the sum of reconstruction error and some form of KL divergence. Specifically, this reviewer is suggesting that if we only minimize reconstruction error (or focus more on reconstruction over KLD by lowering beta), a higher latent dimension would result in better overall reconstruction. This is true, but doing so would have negative consequences. While we would perhaps get the UMAPs to show the full data distribution, the UMAPs are not our focus; predicting polypharmacology through LSA is. We found that when we have a higher focus on the reconstruction term, we have more feature entanglement, as indicated by lower performance when simulating data and overlapping feature contribution per latent feature. The fact that simulating data would logically require less disentanglement than performing LSA shows that we require higher regularization (and hence lower focus on reconstruction) than the one we got from simulating data.

      Essentially, while the reviewer's comments would improve reconstruction and allow us to improve the UMAPs, doing so would likely worsen LSA performance, which is the main focus of the project. Also, increasing the latent dimension without changing beta would likely have caused little to no change because since beta is encouraging disentanglement, it would cause the newly added dimension to have little variation and encode little new information that wasn’t already encoded before.

      We have also previously explored the concept of toggling the latent dimensions in a separate project (Way et al, 2020). We are very interested in this area of research in general, and any additional analyses (beyond hyperparameter optimization) deserves a much deeper dive than what we can provide in this paper.

      Lastly, we intend to include a deeper description and analysis of reconstruction loss across models, datasets, and MOAs as was suggested by a previous reviewer comment (see section 2.1 above)

      4.2. We will not review Gaussian distribution assumptions of the VAE as we feel it is not informative

      Reviewer 1: By looking at SFigure 6, I am wondering whether latent distribution actually met gaussian distribution (assumption of VAE). It may show skew distribution as some of latent features shows low contribution.

      This reviewer’s comment is interesting, but we do not believe it would change the findings of our study. Suppose we find that the latent dimensions aren’t normally distributed. This wouldn’t change much; a gaussian distribution isn’t the most critical to perform LSA. We need the latent code to be disentangled, but having normally distributed latent features doesn't necessarily mean that we have good disentanglement (see https://towardsdatascience.com/what-a-disentangled-net-we-weave-representation-learning-in-vaes-pt-1-9e5dbc205bd1)

      4.3. In this paper, we will not train or compare conditional VAEs nor cycle GANs

      Reviewer 2: While authors provided a comparison between vanilla VAE and MMD-VAE, B-VEA, there are other methods capable of doing similar tasks (data simulation, counterfactual predictions ), I would like to see a comparison with those methods such as conditional VAE( https://papers.nips.cc/paper/2015/hash/8d55a249e6baa5c06772297520da2051-Abstract.html, CVAE + MMD : https://academic.oup.com/bioinformatics/article/36/Supplement_2/i610/6055927?login=true) or cycle GANs(https://arxiv.org/abs/1703.10593 ).

      While such comparisons would be interesting, they are not the main focus of the manuscript, which is to benchmark the use of VAEs in cell morphology readouts and to predict polypharmacology.

      We think that CVAE would not be appropriate for our study. In a CVAE, the encoder and decoder are both conditioned to some variable. In our situation where we are predicting the cell states of different MOAs, it would make most sense to condition on the MOA. However, because we’re using the MOA labels in our LSA experiment, conditioning on them is likely to bias our results and not be effective for MOAs outside the conditioning.

      For cycle GANs, we have found that training using these data, in a separate study in our lab, is extremely difficult. Our lab has not published this yet, but once we are able to better understand cycleGAN behavior in these data, it will require a separate paper in which we compare performance and dissect model properties in much greater detail.

      Nevertheless, we have added citations to multi-modal approaches like cycle GANs (see section 4.4) as they will point a reader to useful resources for future directions.

      4.4. We will not be comparing with multi-modal integration, but we clarified our focus on Cell Painting VAE novelty and added multi-modal citations

      Reviewer 1: Researchers found that the optimal VAE architectures were very different between morphology and gene expression, suggesting that the lessons learned training gene expression VAEs might not necessarily translate to morphology. It would be interesting to compare the result with multimodal integration as baseline (i.e., Seurat).

      Our focus in this paper was to train and benchmark different variational autoencoder (VAE) architectures using Cell Painting data and to demonstrate an important, unsolved application in predicting polypharmacology that we show is now possible for a subset of compounds. It was a natural and useful extension to compare Cell Painting VAE performance with L1000 VAE performance especially since our data set contained equivalent drug perturbations. We feel that any extension including multi-modal data integration will distract focus away from the Cell Painting VAE novelty, and requires a much deeper dive beyond scope of our current manuscript.

      Additionally, there have been other, more in-depth and very recent multi-modal data integration efforts using the same or similar datasets (Caicedo et al, 2021; Haghighi et al, 2021). In a separate paper that we just recently submitted, we also dive much deeper to answer the question of how the two modalities complement one another in various ways and for various tasks (Way et al, 2021b). These two papers already provide a deeper and more informative exploration of Cell Painting and L1000 data integration.

      Therefore, because multi-modal data integration, while certainly interesting, will distract from the Cell Painting VAE novelty and is redundant with other recent publications, we feel it is beyond scope of this current paper.

      Nevertheless, multi-modal data integration is important to mention, so we add it to the discussion. Specifically, we discuss how multi-modal data integration might help with predicting polypharmacology in the future and include pertinent citations so that we, or another reader, might be able to follow-up in the future. The new section reads:

      “Because we had access to the same perturbations with L1000 readouts, we were able to compare cell morphology and gene expression results. We found that both models capture complementary information when predicting polypharmacology, which is a similar observation to recent work comparing the different technologies’ information content (Way et al, 2021). We did not explore multi-modal data integration in this project; this has been explored in more detail in other recent publications (Caicedo et al, 2021; Haghighi et al, 2021). However, using multi-modal data integration with models like CycleGAN or other style transfer algorithms might provide more confidence in our ability to predict polypharmacology in the future (Zhu et al, 2017).”

      1. References

      Caicedo JC, Cooper S, Heigwer F, Warchal S, Qiu P, Molnar C, Vasilevich AS, Barry JD, Bansal HS, Kraus O, et al (2017) Data-analysis strategies for image-based cell profiling. Nat Methods 14: 849–863

      Caicedo JC, Moshkov N, Becker T, Yang K, Horvath P, Dancik V, Wagner BK, Clemons PA, Singh S & Carpenter AE (2021) Predicting compound activity from phenotypic profiles and chemical structures. bioRxiv: 2020.12.15.422887

      Corsello SM, Bittker JA, Liu Z, Gould J, McCarren P, Hirschman JE, Johnston SE, Vrcic A, Wong B, Khan M, et al (2017) The Drug Repurposing Hub: a next-generation drug library and information resource. Nat Med 23: 405–408

      Haghighi M, Singh S, Caicedo J & Carpenter A (2021) High-Dimensional Gene Expression and Morphology Profiles of Cells across 28,000 Genetic and Chemical Perturbations. bioRxiv: 2021.09.08.459417

      McQuin C, Goodman A, Chernyshev V, Kamentsky L, Cimini BA, Karhohs KW, Doan M, Ding L, Rafelski SM, Thirstrup D, et al (2018) CellProfiler 3.0: Next-generation image processing for biology. PLoS Biol 16: e2005970

      Wang Y, Huang H, Rudin C & Shaposhnik Y (2021) Understanding how dimension reduction tools work: an empirical approach to deciphering t-SNE, UMAP, TriMAP, and PaCMAP for data visualization. J Mach Learn Res 22: 1–73

      Way GP, Kost-Alimova M, Shibue T, Harrington WF, Gill S, Piccioni F, Becker T, Shafqat-Abbasi H, Hahn WC, Carpenter AE, et al (2021a) Predicting cell health phenotypes using image-based morphology profiling. Mol Biol Cell 32: 995–1005

      Way GP, Natoli T, Adeboye A, Litichevskiy L, Yang A, Lu X, Caicedo JC, Cimini BA, Karhohs K, Logan DJ, et al (2021b) Morphology and gene expression profiling provide complementary information for mapping cell state. bioRxiv: 2021.10.21.465335

      Way GP, Zietz M, Rubinetti V, Himmelstein DS & Greene CS (2020) Compressing gene expression data using multiple latent space dimensionalities learns complementary biological representations. Genome Biol 21: 109

      Zhu J-Y, Park T, Isola P & Efros AA (2017) Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks. arXiv [csCV]

    1. MRS HALE: (mildly) Just pulling out a stitch or two that's not sewed very good. (threading a needle) Bad sewing always made me fidgety. MRS PETERS: (nervously) I don't think we ought to touch things. MRS HALE: I'll just finish up this end. (suddenly stopping and leaning forward) Mrs Peters? MRS PETERS: Yes, Mrs Hale? MRS HALE: What do you suppose she was so nervous about? MRS PETERS: Oh—I don't know. I don't know as she was nervous. I sometimes sew awful queer when I'm just tired. (MRS HALE starts to say something, looks at MRS PETERS, then goes on sewing) Well I must get these things wrapped up. They may be through sooner than we think, (putting apron and other things together) I wonder where I can find a piece of paper, and string.

      both women are anxious and feel uncomfortable in the situation

  8. Oct 2021
    1. In this world I think we have two kinds of knowledge: One is Planck knowledge, that of the people who really know. They’ve paid the dues, they have the aptitude. Then we’ve got chauffeur knowledge. They have learned to prattle the talk. They may have a big head of hair. They often have fine timbre in their voices. They make a big impression. But in the end what they’ve got is chauffeur knowledge masquerading as real knowledge.
    1. A picture painted on a panel is at once a picture and a likeness: that is, while one and the same, it is both of these, although the 'being' of both is not the same, and one may contemplate it either as a picture, or as a likeness. Just in the same way we have to conceive that the mnemonic presentation within us is something which by itself is merely an object of contemplation, while, in-relation to something else, it is also a presentation of that other thing. In so far as it is regarded in itself, it is only an object of contemplation, or a presentation; but when considered as relative to something else, e.g. as its likeness, it is also a mnemonic token. Hence, whenever the residual sensory process implied by it is actualized in consciousness, if the soul perceives this in so far as it is something absolute, it appears to occur as a mere thought or presentation; but if the soul perceives it qua related to something else, then,-just as when one contemplates the painting in the picture as being a likeness, and without having (at the moment) seen the actual Koriskos, contemplates it as a likeness of Koriskos, and in that case the experience involved in this contemplation of it (as relative) is different from what one has when he contemplates it simply as a painted figure-(so in the case of memory we have the analogous difference for), of the objects in the soul, the one (the unrelated object) presents itself simply as a thought, but the other (the related object) just because, as in the painting, it is a likeness, presents itself as a mnemonic token.

      Aristotle brilliantly acknowledges that when we perceive something there are many factors that are apart of it. The conscious, the subconscious, the present, the feeling of "likeness" and how we think we perceive something may not always be the case. The mind is complex, this part of the writing provokes self thought about "Can I trust my own sense/perception?"

    1. Every review of Stillwater I have seen has mentioned me, for better or worse.

      This statement made by Knox really stood out to me given our current "clickbait" styled media production. It seems to me that Stillwater is an example of a media outlet using a highly sought after story, person or current even to gain attention on their given piece of content.

      I'm reminded of YouTube videos that use current happenings of the world as a thumbnail and/or subject matter in hopes of gaining more followers, listeners or viewers to their channel.

      Similarly, Stillwater (although I haven't seen the film) seems like Hollywood using Knox's name and story to gain views, clicks and dollars.

      I think of the saying "all publicity is good publicity." However in this situation, for someone like Knox who wishes to be left alone; all publicity, even the good is considered to be bad.

      I didn't know anything about Amanda Knox, but was familiar with the name. After reading the article and doing some research on my own I came to remember I had seen a Netflix Documentary titled Amanda Knox on my feed a while back. Looking forward to diving into that in the near future.

      I'd like to conclude that we could all learn from the way this story is handled. Just because covering a story may receive a lot of interaction, traction and/or popularity doesn't necessarily make it right to do so. We as proper media literate students must be held to a higher standard and lead by example and take all accounts into consideration before moving forward with something. We must also admit when we're wrong, apologize for our mistakes and right our wrongs to the best of our abilities. We must learn by experience from Amanda Knox's past history so that we can be proactive in not allowing it to occur in the future.

    1. Doug's got a story and he told the story many times and you know when he tells story many times it becomes realer and realer to you

      I don't know if Howard is trying to give us a nudge and wink here, but if so then the subsequent retelling that Doug was "by his own story very influenced by Vannevar Bush's As We May Think article" is certainly relevant to what Howard is suggesting, given that in Doug's own letter to Bush in 1962, he claims not to have even had Bush's ideas on his radar (no pun intended) until having rediscovered the Atlantic article after already having his own serious pursuits underway at SRI (and getting on at SRI was no small feat according to Howard's telling in Tools For Thought).

      See https://web.stanford.edu/dept/HPS/sloanconference/papers/lenoir/STIMPresentations/Presentation/LetterToVBush.html

    1. Author Response:

      Reviewer #1:

      This study aims to find the genetic mechanisms underlying sex-ratio distortion through male-killing in Drosophila melanogaster flies infected with the endosymbiont Wolbachia. The endosymbiont carries the prophage WO, which is in the center of interested in this study. The key result of this study is that a synonymous mutation in a prophage gene can explain the differences between sex-ratio distorting and not distorting symbionts. The study uses transgene technology to modify phage genes and to investigate which changes in the gene is involved in the phenotype. The finding, that a synonymous SNP plays a key role is not entirely novel in biology, but there are only few examples known of this type of genotype - phenotype associations. The study does not include experiments to show that the main finding is not limited to one particular background of the fly line used. An experiment including multiple genotypes would be needed to show this.

      We agree that recapitulating the results in other backgrounds is intriguing and important for establishing a broader role of these findings. We thank the Reviewers and Editor for allowing us to pursue this line of investigation separately from this work, and we now discuss what experiments can be completed to answer these and other questions. We also edited the manuscript to tone down any conclusions that would imply generalizability of the findings at this point. For example:

      "For example, we cannot conclude that the particular codon tested here is responsible for phenotype alterations in other host genetic backgrounds or species. It is possible that this codon plays a functional role only in a singular host genetic context. Here, we changed wmk sequences while holding the host genetic background fixed, but the reverse is required to conclude whether or not the particular codon plays a general role in other genotypes or natural contexts. Second, due to possible coevolution, various codons may or may not yield similar functional effects across different host backgrounds, and additional synonymous sites may contribute to the male-killing phenotype. Thus, the results here illuminate a previously unrecognized need for future research on the functional impacts of synonymous substitutions in endosymbionts. Future work may focus on determining if there is one specific synonymous codon that affects the male-killing function in all cases, if a more general feature exists where alteration of any or a subset of N-terminal or other wmk codons affects function, or if the effect of synonymous changes is specific to this background.”

      Text summarizing the 06/21/2021 query to the Editor and Reviewers for further clarification: We believe there are several reasons why the results can stand on their own, while appropriately acknowledging caveats. First, we note the lack of genetic background testing on previous transgene experiments driving the major discoveries of Wolbachia genes involved in reproductive parasitism. This requirement would therefore hold the current work to a novel bar not previously applied by the field. In addition, the genetic background here is the same as used in previous work on these phenotypes, making it the most pertinent to test and inform previous and ongoing studies by many research groups. Second, the results shown here would still stand no matter the results of genetic background testing and would demonstrate that it is possible for synonymous changes to have functional relevance in the transgenic wmk phenotype. The major findings are still novel in the field, relevant to ongoing studies of reproductive parasitism, and informative regarding one of the most common genetic backgrounds. Finally, we note that two different lines with unique synonymous codon changes (the final experiment) independently created the same result that a synonymous codon change ablates phenotype, providing additional robustness to our findings. Doing additional experiments would be logistically difficult. Barriers include the relocation of the first author of the work to another lab for a postdoctoral position, completion of the funding for the project, remaining institutional COVID-19 restrictions, and lack of replacement personnel in the lab to continue the work. Notably, there is also the non-trivial requirement to create and test new transgene lines that would be costly and take nearly a year to complete (the experiments in the manuscript already took several years and the new fly lines would cost thousands to make).

      The study is mostly clear and easy to follow, but requires a lot of attention. The authors choose to build up the story as I guess it was carried out in the lab. Thus, the reader is guided through every step of the process. While I see that this is appealing from the way the study was carried out, it results in a very long manuscript with a lot of material that would be much better placed in a supplement.

      We thank the reviewer for pointing this out. We shortened the manuscript by removing redundant information and transferring some parts of the results to the supplement. We also removed about three pages of text from the discussion (before adding in new sections as requested by reviewers).

      The introduction seems unfocused. It meanders around, jumping from topic to topic and does not give the reader a sense of where things will go.

      We added a few topics into the Introduction as recommended in other comments, and we edited various portions of the Introduction to connect the ideas together more clearly. We hope the changes are now satisfactory, and we are of course happy to consider further feedback.

      Fig. 1 gives an overview about the different aspects addressed here, but it is not used to guide the reader through the different lines of thought addressed in the introduction. If Fig. 1 will stay (I actually think it is not needed) it should be introduced earlier and used as a road map for the paper. Alternatively, the introduction could stay more general and only in the last paragraph the different ways the system is studied will be summarized.

      We edited the final paragraph of the Introduction to more comprehensively cover the content of the figure and full direction of the paper. For readers not familiar with the biological system or questions, we believe this figure will serve as a gateway to the genetic alterations conducted in the experiments.

      Along these lines, it would be good to have a better reasoning for the combination of experiments conducted. It is left to the reader to understand why certain types of experiments have been done.

      It was not clear to us at the outset of these experiments what results would ultimately emerge and what follow-up experiments would be necessary as our initial hypotheses were proven wrong with many of the surprises from the work. So, there was no a priori reasoning for why experiments were done until we had the results of the previous experiments. We agree that this makes the reading a bit confusing. As such, we clarified the logic flow in the results section as the narrative progresses from experiment to experiment, and we reorganized some of the introduction to improve transition statements and offer a roadmap to readers earlier on.

      On the other hand, the introduction misses a section on the biology of the phage and its interaction with the host(s). It is hard to understand the biology of the system without getting an understanding of the insect - Wolbachia - phage interactions. For non-specialist, understanding the role of the three players is essential for the system.

      Thank you for the suggestion. We now add a section introducing phage WO and its relevance to the phenotypes tested here.

      “The wmk gene and two cytoplasmic incompatibility factor (cif) genes that underlie cytoplasmic incompatibility (a parasitism phenotype whereby offspring die in crosses between infected males and uninfected females) occur in the eukaryotic association module (EAM) of prophage WO, which refers to the phage WO genome that is inserted into the bacterial chromosome. The EAM is common in WO phages across several Wolbachia strains and is rich in genes that are homologous to eukaryotic genes or annotated with eukaryotic functions. As such, the expression of reproductive parasitism genes from the EAM and tripartite interactions between phage WO, Wolbachia, and eukaryotic hosts are central to Wolbachia’s ability to interact with and modify host reproduction.”

      The result section could be easily shortened by focusing on the essential experiments. Experiments that do not contribute to the final result can go into the supplement.

      We removed redundant sentences and made some figures supplemental.

      Also the discussion is much too long. I suggest to reduce it to half and focus on the important points and the take-home messages. Currently the discussion follows the way the results are presented in the result section. However, this is not needed. The important finding should be discussed first. Findings that are important in the development of the project, may not be important for the biology of the system overall. And they may not be important for the reader.

      We reordered the discussion to cover the biggest findings first, and removed about a third of the original writing in the discussion.

      Reviewer #2:

      This study aims to unravel the genomic basis to wmk-induced male killing by transgenically expressing homologs of varying relatedness, with synonymous nucleotide changes, and predicted alternative start codons in D. melanogaster flies. The study builds on previous work showing that expression of wmk in fly embryos recapitulates several aspects of male killing. While more distantly related homologs did not induce male killing when expressed in D. melanogaster, more closely related wmk homologs induce either killing of both sexes or male killing only. However, the male-killing phenotype was not due to amino acid differences, but associated with RNA structural differences of the different wmk homologs. In addition, only one synonymous nucleotide change was sufficient to ablate the killing phenotype. These findings suggests that minor and even silent nucleotide differences impact on the expression of male killing in D. melanogaster. It is concluded that a new model incorporating the impacts of RNA structure and post-transcriptional processes in wmk-induced male killing needs to be developed.

      The strength of the study lies in the systematic and carefully controlled approach to quantify the phenotypic effects of both sequence and structural changes to various wmk homologs for inducing the male-killing phenotype. Detailed dissection of the phenotypic impact of minor changes to the wmk homologs including sequence variation, silent nucleotide changes, and RNA structural differences was quantified. This approach reveals a complex genotype-phenotype relationship, but highlights the importance of including post-translational processes. The data is novel in that previous work have largely ignored structural changes and assumed that synonymous differences in codons has no effect on protein function, whereas the current study based on updated codon optimization algorithms reveal that this assumption is incorrect. The finding highlights the importance of considering also structural genetic variation for phenotypic expression differences. This suggestion is further corroborated by the lack of difference in wmk homologue expression levels, indicating that the functional differences are due to post-translational effects.

      We thank the reviewer for the thoughtful comments.

      There are limitations to the findings of this complex genotype-phenotype relationship. The current study only examined the phenotypic impact by expressing the different homologs in one D. melanogaster genetic background. Given the variability of the phenotypic pattern revealed based on minor changes to the wmk homologs, it will be critical to repeat some of the main findings in other D. melanogaster genotypes to determine the importance of the variation in the wmk homologs more generally. It is entirely plausible that the observed changes in the effect and strength of killing is due to an interaction between host and wmk genotype. This has implications for unravelling the underlying genetic basis to the male-killing phenotype more widely. It is as yet to be demonstrated whether wmk is involved in male killing in natural population, and to what extent there are shared patterns and mechanisms of male killing induced by other bacterial endosymbionts such as Spiroplasma.

      We addressed this point in more detail above in the first response to the comments from Reviewer 1.

    1. Author Response:

      Reviewer #2:

      The manuscript by Podinovskaia focuses on a new method to visualize and measure endosome maturation in common cell lines by enlarging early endosomes. This was achieved by producing acute insult to the cells by ionophore treatment, leading to budding of abnormally large post Golgi vesicles that fuse with early endosomes. Endosome maturation of these enlarged endosomes containing Golgi-derived cargo (GalT) proceeding with apparently normal kinetics, ultimately leading to lysosomal delivery. Taking advantage of this assay, the authors investigate Rab5-to-Rab7 conversion, acquisition and loss of PI3P, acquisition and loss of Snx1 on apparent endosomal subdomains, interaction of early and late endosomes with Rab11-positive recycling endosomes, and lumenal pH changes. The new maturation model presented here will likely be quite useful to the field with continuing impact. The current state of the endosome field in many ways remains fragmentary, with various processes studied extensively in isolation, but with little information on their relative timing and potential interactions as endosomes mature. This new assay should help understand the relationships between these processes, some of which are investigated in this manuscript.

      Concerns:

      1) The data and conclusions related to Rab11 interaction with early endosomes in Fig 8 are not convincing. There are simply too many Rab11 endosomes in the cell to know if their short term proximity indicates meaningful interaction with the early endosomes, or if the data simply reflects random collisions of small recycling endosomes with the enlarged early endosomes. No data is presented to show that the interactions are meaningful, e.g. that recycling cargo transfer occurs during these interactions. Conclusions from this analysis are overstated.

      We now provide more evidence for the interaction of Rab11 vesicles with the enlarged endosomes. We made movies with shorter intervals (2 sec instead of 1 min) between the individual frames. These data clearly show that this is not an accidental bumping into an endosome but rather that Rab11 vesicles can circle around endosomes and stay for several minutes (Video Fig. 8A, supplement 2 and 3).

      In addition, we imaged TfR-GFP together with mApple-Rab5. These data show that TfR-GFP positive vesicles bud off from mApple-Rab5 positive endosomes and that the GFP fluorescence intensity goes down over time in enlarged endosomes. These data are consistent with recycling of TfR to the plasma membrane. Moreover, CDMPR-GFP, which cycles between the TGN and endosomes was found to be present on Rab5 negative enlarged structure, which then turned Rab5 positive, and subsequently lost the CDMPR signal. Importantly those endosomes could regain CDMPR, which we interpret as acquisition from the TGN. These data may indicate that the TGN-endosome shuttle is intact after nigericin washout (Fig. 9).

      That the TfR and CDMPR are really transported out of the enlarged endosome is also contrasted by our finding that GalT-GFP stayed in the enlarged endosome and the signal intensity did not significantly drop.

      2) Lack of information on endocytic cargo acquisition by the enlarged early endosomes: to really establish this endosome maturation model the authors would need to establish if the enlarged endosomes contain endocytosed cargo, as opposed to Golgi-derived cargo, and determine how long it takes to acquire such cargo. This could be accomplished using Tf, EGF, or perhaps dextran at early timepoints after nigericin washout.

      As described above, we now show that TfR-GFP is present in enlarged endosomes and is lost from these endosomes over time (Fig. 9A,D,G).

      Additionally, we performed experiments with dextran-Alexa647 and nanobody-tagged surface TfR to show that endocytosed material from the plasma membrane indeed reached the enlarged endosomes (Fig. 3, figure supplement 1 and 2). Quantification of TfR signal at the enlarged endosomes demonstrates that TfR acquisition by the enlarged endosome takes place as soon as the enlarged compartment becomes Rab5-positive. This was also observed with the nanobody-tagged surface TfR and endocytosed Dextran-AF647, representative examples of which are provided (Fig 3, figure supplement 1 and 2). The quantification for the latter experiments was not carried out due to the very short time range during which asynchronous Rab5 recruitment events needed to be captured after addition of nanobody/Dextran pulse-and-chase.

      3) Figure 7 - It was not convincing that data in panels F and G are different from each other.

      We agree with the reviewer that the difference between the data presented in panel F and G is not very big. These panels represent the average of many endosomes and with the averaging the differences from the individual traces get cancelled out. The process is asynchronous and thus in this case the individual traces are more telling than the averaged traces. Nevertheless, we decided to keep the average traces in the manuscript because the highlight the asynchronous nature of the process. We modified the text to make this point clear.

      4) Figure 11 - it is unclear how we can interpret this as connected to Rab conversion when even the labeled compartments at the earliest time point in the czz1 knockout have abnormally high pH, and during the time-course even the last timepoint for czz1 KO is higher than that of the earliest timepoint for WT.

      We agree that the ccz1 KO cells display higher endosomal pH than WT cells throughout the time-course.

      However, the cells in which we express the rescue plasmid of Ccz1 also have apparently less acidified endosomes, even though Ccz1 can still drive Rab conversion, and the pH dropped at an intermediate rate, when comparing rescued cells to control and ccz1 KO cells. Even in ccz1 KO cells endosomal traffic down the degradation pathway is not completely blocked, similarly to what we observed for sand-1 (-/-) in C. elegans and Mon1a/b knockdown in mammalian cells (Poteryaev et al. 2010). Acidification eventually will occur, but it is massively slowed down; the molecular basis of which is still under investigation in our lab.

      We think that in the absence of Ccz1, a condition under which Rab conversion is severely impaired, acidification cannot occur at normal rate. As pointed out by reviewers 1 and 2, the pH is already higher in the ccz1 KO cells than in the control condition. However, in the rescue condition, the YFP/CFP ratio is not that different from the knockout and yet acidification can occur at an intermediate rate. Why under rescue conditions, the YFP/CFP ratio is at a similar level compared to the KO is not entirely clear. It is conceivable that too much Ccz1 has also a negative effect. Moreover, recently it has been shown that ccz1 KO cells accumulate free cholesterol in the enlarged endosomes (Van den Boomen Nat Comm., 2020). The transient expression might be not sufficient to rescue this accumulation phenotype or other secondary effects. Nevertheless, the v-ATPase appears to maintain its function because lysosomes can acidify in ccz1 KO cells, albeit with a delay (Figure 13).

      5) Figure 12 - The criteria used to determine which GalT structures are Golgi or lysosomes seems questionable. Morphology alone is not sufficient to identify the compartments with high accuracy, especially after perturbation. Also, it is unclear to what extent GalT-CFP labels lysosomes without nigericin treatment.

      To address these issues, we co-labelled cells with lysotracker. GalT-CFP (pHlemon) and lysotracker showed a very high degree of co-localization. These data are included in the manuscript (Fig. 10B).

    1. Author Response:

      Reviewer #1:

      In this manuscript, the authors make use of next-generation sequencing to provide a preliminary inventory of tribe Metriorrhynchini, a hyperdiverse group of beetles with intricate systematics mainly due to likely morphological convergence of their Millerian rings. The authors provide an admirable sampling within Africa, Asia and Oceania, with about 700 successfully sampled localities and thousands of specimens.

      The main result of the manuscript is the curated database of Metriorrhynchini that will be useful in future research. In addition, different statistical methods are used to provide an idea of the undescribed species within the tribe, the astonishing species richness in New Guinea or the use of phylogenomic data to explore major phylogenetic relationships. However, some of the author's claims should be questioned:

      • Surprisingly, the authors rely on a very low threshold to identify mOTUs (2% in the manuscript). The authors refer to Hebert et al. (2003) and Eberle et al. (2020) to justify the threshold, but still, they are likely overestimating the number of mOTUs and thus, considering putative species what it may be different populations. Figure S17 provide estimates of mOTUs with different thresholds (1 to 10%), which rapidly decrease their estimates (a decrease of 25% mOTUs is found when 6% was considered). Still, an overwhelming sampling effort but a more realistic estimate.

      • I think the phylogenomic tree did not receive the required attention (for example, the FcLM analysis is barely mentioned).

      • It is not clear why should be important to mention the "person-months of focused field research" across the manuscript. Each study group has a unique sampling technique (also not found in the manuscript), preferred localities or traits, which make comparisons impossible. The authors' effort is remarkable, but it is not an important result/finding to be highlighted all over the manuscript.

      Many thanks for all comments and suggestions that pointed to the weak parts of our argumentation. We modified the manuscript accordingly and added some references that can be used for the justification of some claims.

      We addressed the question of thresholds for species number estimations. Now, two thresholds are considered in the manuscript as relevant for discussion: 2% and 5%. We added further information on our previous studies dealing with integrative species delimitation in Metriorrhynchini. Some of them were not referred in the earlier version (to avoid self-citations) and we also expanded information on the evidence given in the study which we have already referenced (Bocek et al. 2019). The earlier comparison of nextRAd, mtDNA and moprhology-based delimitation of species in Eniclases (the trichaline clade in the present study) showed that many well defined species (nextRAD and morphology) have highly similar mtDNA and they split only recently, eventually some introgresion or incomplete lineage sorting affect mtDNA signal. If we apply 5% threshold for this group, we would delimit as a single species two entities which differ in the body size, coloration and the relative size of male eyes (diurnal and nocturnal activity in putative sister species). In such a way, we would decrease the number of species in our analyzed sample of Eniclases by 40% in clear contrast with the number based on morphology and nextRADs. We found similar rapid morphological diversification also in other metriorrhynchines (Jiruskova et al., 2019,; Kalousova & Bocak, 2017) and other not referenced taxonomic studies that have shown that closely related species have well diversified male genitalia and often belong to different mimetic rings). To limit our discussion, we do not reference our earlier nextRAD study showing the speciation in other subfamily of net-winged beetles within a single mountain range (Bray & Bocak 2016). Also this study supports morphological differentiation in species with highly similar mtDNA. Now, we noted in the manuscript that before taxonomic revisions are produced, our claim is provisional and therefore we modified the text as proposed and present the lower numbers of species as a realistic possibility.

      Phylogenetic relationships: We added additional information on the congruence with earlier studies to Results and Discussion, but we still do not describe details. The main reason is that morphology must be studied to delimit and formally name new taxa and that the morphology is out of scope of this work (except some information provided in Supplementary Text – description of delimited generic groups and subtribes). The FcLM analysis addressed only the relative position of the leptotrichaline and procautirine clade. Both clades are monophyletic, morphologically distinct and no conclusion is based on their relative position. We noted that without further data we are unable to robustly solve their positions. Provisionally, we prefer the deeper postion of the leptotrichalines (61%, a not very convincing phylogenenomic signal).

      Quantification of sampling effort: As proposed, we excluded the consideration of person months as a measure of relative collecting effort in various regions and add justification for field research methods.

      Reviewer #2:

      Conservation efforts must be evidence-based, so rapid and economically feasible methods should be used to quantify diversity and distribution patterns. The principal objective of this study is to demonstrate how biodiversity information for a hyperdiverse tropical group can be rapidly expanded via targeted field research and large-scale sequencing. The authors have attempted to overcome current impediments to the gathering of biodiversity data by using integrative phylogenomic and three mtDNA fragment analyses. As a model, they sequenced the Metriorrhynchini beetle fauna, sampled from ~700 localities in three continents. The species-rich dataset included ~6,500 terminals, >2,300 putative species, more than a half of them unknown to science. It is an amazing finding. Their information and phylogenetic hypotheses can be a resource for higher-level phylogenetics, population genetics, phylogeographic studies, and biodiversity estimation. At the same time, they want to show how limited the taxonomical knowledge is and how this lack is hindering biodiversity research and management.

      Thanks for your comments on our study. We agree with your specific recommendations and modify the manuscript accordingly.

    1. Conformity now disappears into themechanical order of things and bodies, not as action but asresult, not cause but effect. Each one of us may follow adistinct path, but that path is already shaped by the financialand, or, ideological interests that imbue Big Other and invadeevery aspect of ‘one’s own’life.

      It seems like when we go online we are hardly actors anymore in the information we "seek". In many ways, it is already paved out for us, as described here. We are studied, and our lives are made easier because of it. I never get completely irrelevant advertisements anymore when on social media, and specifically on YouTube, where I remember a time only some companies were learning to tap into the monetization of producers' content and the same, few ads would always play. It seems that technology knows me better now than I know myself, and even though it makes me frightened, I haven't changed my interaction and consumption of technology in the slightest, and I don't think many other people have either, which is something I would be curious to know more about.

    1. We should not be so proud as to believe that our data

      Minor: I think it's good to discuss the issues you're interested in from your area of expertise with neuroscience. You even state outright that you'll be framing things from that perspective. However, the both the intro and that outright statement suggest the paper is written for a more general audience. That conflicts somewhat with the framing here, which seems to be geared towards neuroscientists familiar with the features of neurodata. Not a major concern though, this is just a question of whether you want to maintain a consistent audience throughout the paper. Arguably reconciled by substituting "We" with "neuroscientists". Same comment applies to other areas where this may pop up

    1. or do not wanl to have.

      I think this bring up point about priorities, control, and what we see as valuable. In order to give children the time and listening they deserve, it is at the expense of other things - other children who at the time may be doing something that we want to observe, other tasks that the teacher may have planned, other things the adult wants to get on to for themselves, etc.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): **Summary:** The manuscript submitted by Djekidel et al entitled: "CovidExpress: an interactive portal for intuitive investigation on SARS-CoV-2 related transcriptomes" reports on a new web portal to search and analyze RNAseq data related to SARS-CoV-2 infections. The authors downloaded and reprocessed data of more than 40 different studies, which is available on the web portal along with all available meta data. The web portal allows to perform numerous differential expression and gene set enrichment analyses on the data and provides publication ready figures. Because of batch effects that could not be removed, the authors do not recommend to analyze data across studies at this point. The authors conclude that the web portal is unique and will allow scientists to rapidly analyze gene expression signatures related to SARS-CoV-2 infections with the potential to make new discoveries. **Major comments:** Based on the scientific literature, the web portal seems to be an unprecedented resource to search and analyze SARS-CoV-2-related RNAseq data and as such would certainly be a useful resource for the SARS-CoV-2 scientific community. The authors argue that new discoveries are possible by using their web portal in providing use cases. However, the section detailing the analyses the authors did to generate new hypotheses about genes potentially relevant in SARS-CoV-2 infections are very difficult to follow and without more guidance very difficult to reproduce with the web portal. It would require substantial expert knowledge in RNAseq data analysis without more information being provided. It also seems that key candidate genes identified by their analyses have all been studied or identified to be related to SARS-CoV-2 infections, so it is somewhat unclear whether new hypotheses can be generated by the reanalysis of RNAseq datasets, especially because combining the data from different studies is currently not recommended by the authors. The manuscript would benefit from providing fewer use cases but for each of them providing more information on how the portal and which studies were used to generate them and which findings were not described in the publication of the used studies. Some observations in the manuscript are not substantiated with significance calculations (see below). At times, the English writing (grammar) should be improved.

      We thank the reviewer for the positive comments. We suppose the reviewer conclude it need substantial expert knowledge in RNAseq data analysis were due to lacking Video Tutorial. We have now put up several Video Tutorials and more tutorials would be added along later along with users’ feedbacks. We believed this would help ease reviewers’ concern.

      In response to whether new hypothesis can be generated. Sorry if it’s not clear, for all the case studies and our “CovidExpress Reveals Insights and Potential Discoveries”, our portal has provided information not reported by their original publications, as listed below:

      1. Case study #1: The original publication employed a multiomics approach to find the predictor genes between ICU and non-ICU patient. But it’s not obviously to know which genes were mainly due to expression level, which might be due to other data they included (e.g. mass spectrometry data). Our portal allow user to quickly check their expression level and find SESN2 does not have strong expression differences.
      2. Case study #2: We replace this case study with bacterial-susceptibility genes to show such questions could be quickly asked and answered using our portal. Such investigation has not been reported before.
      3. FURIN’s function have been well related to SARS-CoV-2. However, for all reports we could find, they focused on Furin cleavage sites of SARS-CoV-2 or whether FURIN were expressed in the SARS-CoV-2 sensitive tissues. SARS-CoV-2 infection could up-regulate FURIN expression have never been reported before. The study published the data didn’t mentioned FURIN at all. We have made this discovery simply by using CovidExpress portal to find the differential expressed genes and overlap with the literature-based gene list (Supplementary Table S2), we believe more discoveries could be made by users by selecting different data.
      4. If we search OASL AND " SARS-CoV-2" on pubmed, only 5 results shown up indicated it’s under-studied. And none of them indicated OASL could be up-regulated both by SARS-CoV-2 infected lung and Rhinovirus-infected nasal in human. It is not clear to us if we might misunderstand reviewers’ suggestion as “fewer use cases”. Thus, we haven’t removed any use cases, instead we provided more details to help users understand what and how did we made those discoveries not reported by their original studies using CovidExpress.

      At last, we have gone through substantial scientific editing to improve the grammar. **Minor comments:** Page 6 last sentence: The statement of this sentence is very much what one would expect. It remains unclear whether the authors mean this as a result to validate the processing of the RNAseq data or as a new discovery. Please, clarify.

      We apologize for the confusion. We intended this statement to be a result confirming what we had expected. We have now amended the text to make this point clearer.

      Figure 3A: The violin plots are so tiny that it is impossible to see any trends. It is also difficult to understand which categories one should compare with each other. If there is anything significant to observe, please, add a statistical test and better guide the reader.

      We agree with the reviewer; therefore, we have removed this figure from the paper. The goal of this figure was to demonstrate how to use violin plots for exploratory analysis; however, in this case, the violin plot did not show a clear trend. By using more filtering and other plots (e.g., Figure 3B-C), we believe we now provide better insight.

      Figure 3C: A legend for the color scale is missing. The signal (I guess expression amounts) for SESN2 seems very weak and the same between ICU and non-ICU samples. What is the significance for assigning this gene to the group of genes being upregulated in ICU samples? Also contrary to what the authors state on page 8, SESN2 does not seem to be highly expressed in ICU samples, however, without knowing what the colors represent (fold changes or absolute expression values?) this is somewhat speculative.

      We thank the reviewer for bringing this to our attention. We have now added a legend for the color scale in the revised figure. In Figures 3A-C, we are showcasing how an exploratory analysis can be performed using CovidExpress. As an example, we investigated the expression of the top 20 genes identified by the random forest classifier of Overmyer et al., 2021, as predictors of ICU and non-ICU cases. In the original Overmyer et al. paper, only the general performance metrics of the models are presented (Fig. 6c-g), but the authors do not show the expression patterns of the top predictors. Hence, we demonstrate how CovidExpress can be used to further investigate some questions not explored in the original paper. SESN2 was listed as a top predictor; however, its expression did not vary between ICU and non-ICU samples, as was also observed by the reviewer. We suspect SESN2 was a top predictor due to other data the Overmyer et al. paper included, such as mass spectrometry data. Our statement about SESN2 was not accurately reflected in the figure; therefore, we have rewritten this section to make it clearer.

      Page 9 first sentence: Please, specify what you mean by "starting list". Furthermore, in this paragraph, how do your results compare to the results from the study that you re-analyze here?

      We thank the reviewer for the question. By “starting list,” we meant the top genes from the Overmyer et al., 2021, article as predictors of ICU and non-ICU cases. We have now rewritten this section to make it clearer. We did not expect our results to differ from their data. Our goal was to ask which of their top predictors (by multi-omics data) show a difference in gene expression. When we downloaded their TPM values from their GEO records, the values were very similar overall (see below).

      Figure 3F: Please add labels to your axes and is there a particular reason why in a correlation plot like this one, the y and x axis are not shown with the same range and why does the y axis not start at 0?

      We thank the reviewer for this helpful comment. Our reasoning for presenting the figure in this way is that different genes can have very different expression levels but still be correlated. For example, if gene A expressed 1, 5, and 10 in samples 1,2, and 3, while gene B expressed 100, 500, and 1000 for samples 1, 2, and 3, then their range would be very different but still perfectly correlated (see panel A below). If we draw the x- and y-axes using the same range, this correlation will not be visually obvious (see panel B below).

      This comparison is different from the correlation plots that compare the expression of one gene in different samples. We apologize for the confusion and to avoid misleading readers, we have enlarged the gene names in the Figure labels to ensure that readers notice their differences. We have also added an option to the correlation plot on our portal so that users can choose the optimal format (see below).

      Page 9 second last sentence: It remains unclear which kind of analysis the authors intend to do here and what the starting question is. Please, try to rewrite with less technical terms (i.e. what do you mean by "precalculated contrasts"). In line with this, it remains unclear what Figure 3I is supposed to show. Please, provide some more information to readers who are not RNAseq analysis experts.

      We thank the reviewer for this suggestion. To avoid any misleading claims, we followed Reviewer #2’s suggestion and replaced the coagulation gene list with a filtered gene list from the “Coronavirus disease - COVID-19” KEGG pathway (hsa05171) to showcase how to identify experiments in which this gene signature is enriched or depleted. We also replaced the related figures and text with new results and rewrote this section to avoid using technical terms.

      Figure 3J is somewhat confusing. Why is the mean expression range indicated from 0 to 1 and why are all genes apparently having a mean expression of 1?

      We thank the reviewer for this question. Because the levels of expression of different genes can vary greatly, in Figure 3J (new Figure 3A and 3I), we normalized the mean expression levels of the genes to their maximum values across groups to improve the visualization. We have now made this clearer in the figure, legend, and text.

      Page 10 line 5-6. Are you referring to coagulation markers here or general expression patterns? In case of the latter, how does this statement fit to the paragraph about analyzing expression patterns of coagulation markers? Please, specify. And in line with this, are the highlighted genes in Figure 3K coagulation markers? If not, what is the relevance of these to make the point that one can use the portal to investigate the role of coagulation markers in SARS-CoV-2 infections?

      As mentioned above, to avoid any misleading claims, we followed Reviewer #2’s suggestion and replaced the coagulation gene list with a filtered gene list from the “Coronavirus disease - COVID-19” KEGG pathway (hsa05171). This revision enables us to show how to identify experiments in which this gene signature is enriched or depleted. We have now replaced these figures and text with new results.

      The appearance of describing batch effects and attempts to remove them from the studies was somewhat surprising on page 10 as I would expect this kind of results rather earlier in the results section before describing use cases of the data. You may consider changing the order of your results for a better flow.

      We apologize for the confusion. However, we want to make it clear that the analysis before page 10 did not involve “batch effect”; all analyses were performed within each study. Thus, it is not necessary to change the order in which the results are presented. Also, based on Reviewer #2’s comments, we did not accurately use the term “batch effect,” because “batch effects are purely due to technical differences.” We have now revised the corresponding text to make this point clearer.

      Page 11, second paragraph. Please, explain briefly what the silhouette score is supposed to reflect and thus how Figure S4G should be interpreted. The difference of both bars in Figure S4G is very marginal and thus, does not seem to support the statement of the authors that the ssGSEA scores-based projection is better unless you perform a significance test or I misunderstood. Please, clarify.

      We thank the reviewer for this suggestion. We have now added an explanation of the silhouette score in the manuscript. Briefly, a silhouette score is a metric of the degree of separability of gene clusters from the nearest cluster. For a given sample, lets be the mean intra-cluster distance, and be the mean distance to the nearest cluster. The silhouette score (sil) will be calculated as follows

      The silhouette score ranges between -1 and 1. A value near 1 means that the clusters are well separated, and a value near -1 means that the clusters are intermingled. Using a Wilcoxon rank test, we showed that using ssGSEA scores significantly improves the separability of global GTEx tissues (in Figure S4G; p=8.75e-26).

      Page 11, third paragraph: Figure 4B, to the best of my understanding, does not support the claim that samples clustered less according to study cohorts using the ssGSEA approach. Please, quantify the effect and test for significance or better explain.

      We apologize for the confusion. We quantified the separability between cohorts (GSE ids) by using the silhouette score. In Figure S4H (panel A below), we show that the TPM-based PCA leads to more separation by studies than does the Covid contrast ssGSEA scores in which the separation between studies is less prominent (p-value=0.0045, paired Wilcoxon test).

      For the analyses described starting on page 12 it remains largely unclear whether they were conducted across studies or within studies and which studies were used. This section until the end of the results would especially benefit from providing more information on how the analyses were performed, either in the results or in the methods section.

      We apologize for the confusion. The goal of the analysis on page 12 and the corresponding Figure 4G was to identify genes whose expression increased in both the SARS-CoV-2 infection lung and rhinovirus-infected nasal tissue. Hence, we did a log2(fold-change) vs log2(fold-change) comparison. The log2(fold-change) values were independently calculated for each study. Because we compared values by using the same ranking metric, the cross-samples comparison was possible, as shown in Figure 4G. We have now added more details to the Methods section to clarify this point.

      Figures 4J and 4K miss axis labels and since we look at correlations, the figures could be redrawn using the same ranges on x and y axis.

      We thank the reviewer for this suggestion. We have now added axes labels to the new figures. However, we have not used the same range on the x and y axes because they depict expression levels of different genes. For example, if gene A is expressed 1, 5, and 10 in samples 1, 2, and 3, while gene B is expressed 100, 500 and 1000 for samples 1, 2, and 3, their range would be very different but still perfectly correlated (panel A below). If we draw x and y axes using the same range, this correlation will not be visually obvious (panel B below).

      This comparison is different from the correlation plots that compare the expression of one gene in different samples. We apologize for the confusion and to avoid misleading readers, we have enlarged the gene names in Figure labels to ensure that readers notice they are different genes. We have also added an option to the correlation plot on our portal so that users can choose the optimal format (see below).

      Page 14 line 5: Is this the right figure reference here to Figure 4G? If yes, then it is unclear how Figure 4G supports the statement in this sentence. Please, clarify.

      We apologize for the confusion. In Figure 4G, we labeled several important genes and used different colors to indicate whether the gene was regulated by SARS-CoV-2 only (purple), Rhinovirus only (black), or both(red). FURIN was the gene that is only significantly upregulated by SARS-CoV-2. The data in Figure 4G were from GSE160435(“SARS-CoV-2 infection of primary human lung epithelium for COVID-19 modeling and drug discovery”); that study used lung organoid alveolar type 2 (AT2) cells as the model. We think this confusion was caused by our failure to provide the details about the GSE160435 study. We have now amended the manuscript to include these details in the Methods section to avoid confusion. We also enlarged the gene labels in the figure to make them more visible. In the manuscript, we have changed from “our results found FURIN gene was also upregulated in SARS-CoV-2–infected lung organoid alveolar type 2 cells (Figure 4G, Supplementary Table S3).” to “We found that FURIN was upregulated in SARS-CoV-2-infected lung organoid alveolar type 2 cells (Figure 4G, Supplementary Table S4) (Mulay, Konda et al., 2021), it has reported that TGF-β signaling could also regulates FURIN (Blanchette, Rivard et al., 2001). Our gene enrichment analysis also found TGF-β signaling enriched only for up-regulated genes in SARS-CoV-2-infected lung cells (FDR correct p=7.58E-05, Supplementary Table S4), these observations implicated a positive feedback mechanism only for SARS-CoV-2-infected lung but not RV-infected nasal cells.”

      Figure 2 is of too low resolution. Many details cannot be read. Please, provide a higher resolution figure.

      We apologize for the inconvenience. However, we did not expect the reader to read the details on Figure 2, as it is just an overview of the CovidExpress portal. The aim is give the reader an impression about what functions CovidExpress could offer.

      Reviewer #1 (Significance (Required)):

      Providing a single platform for the analysis of SARS-CoV-2-related RNAseq data is certainly of high value to the scientific community. However, as the portal and manuscript are currently presented, for scientists that are not RNAseq analysis specialists, more guidance would be required to understand and use correctly the functionalities of the portal. Unfortunately, because batch effects could not be removed from the studies, the authors, correctly, do not recommend to combine data from different studies for analyses, however, this likely will also limit the potential of the resource to make new discoveries beyond what the original studies have already published. As indicated above, the authors could support their claim by comparing their findings with findings published from the studies they reanalyzed. The portal is only of use to scientists studying SARS-CoV-2. I am not an expert in RNAseq data analysis and thus cannot comment on the technicalities, especially the processing of the RNAseq datasets. We thank the reviewer for the positive comments. We apologize for the confusion and acknowledge that we should not describe our effort using the term “batch effect.” As described by Reviewer #2 (and we agree), batch effect should be used only to indicate a purely technical difference in the same biological system; for example, differences in experiments performed on different days or by different lab personnel. Thus, we cannot correct for “batch effect” by using CovidExpress. We hope that the reviewer realizes that what we did was correct for the effect caused by differences in software and parameters across the studies. For example, in our approach, the DEGs from GSE155518 and GSE160435 (both primary lung alveolar AT2 cells (both from Mulay et al., Cell Report, 2021) were significantly correlated (panel A below; p = 1.36e-24, F-test). However, when we downloaded the TPM values from their GEO records, GSE155518 appeared to have a genome-wide decrease in the expression of SARS-CoV-2–infected samples (panel B below). We suspect that this is because in their data processing, the expression of virus themselves were also considered. Thus, using the proceed data directly without careful reviewing the method might lead to false hypothesis.

      At last, researchers can make new discoveries, such as our OASL and FURIN findings, by using many other features that CovidExpress provides.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): Djekidel and colleagues describe a web portal to explore several SARS-CoV-2 related datasets. The authors applied a uniform reprocessing pipeline to the diverse RNA-seq datasets and integrated them into a cellxgene-based interface. The major strengths of the manuscript are the scale of the compiled data, with over one thousand samples included, and the data portal itself, which has useful visualization and analysis functions, including GSEA and DEG analysis. My primary concerns with the study are centered on the analysis examples that are presented and their interpretation, as well as the user interface for the data portal. **Major Comments:**

      1. The literature analysis feels out of place and is not informative (Fig 1E), as the conclusions that can be drawn from literature mining are minimal. In evidence of this, the authors highlight that CRP is a top-studied "gene" and later voice their interest in how CRP is not a differentially expressed gene (pg6). This illustrates the problems with the literature-based analysis, since in the context of COVID-19, CRP is a common blood laboratory measurement that is used as a general marker of inflammation. Transcription of CRP is essentially exclusively in hepatocytes as an acute phase reactant (see GTEx portal for helpful reference), and would therefore not be expected to be found in the various datasets collected by the authors. The one exception might be liver RNA-seq samples from COVID-19 patients, but I do not think these are available in the current collection. I would therefore suggest to remove the literature analysis parts from the manuscript.

      We thank the reviewer for sharing knowledge about CRP. As discussed in our manuscript, we agree that not all top genes from literature-based analysis were expected to be included in RNA-seq analysis. We apologize for the confusion, and we have amended our description to make this point clearer. However, we still believe that literature-based analyses are very useful in the following aspects:

      1. This type of analysis bridges the gap between data-driven research and hypothesis-driven research. For example, we found many genes in our meta-analysis, but it is not feasible to describe the functions of all of them. Thus, in Figure 1F, we color-coded genes in red if they also appeared as top genes in the literature-based analysis and read related manuscripts to build confidence that the meta-analysis is useful. Then we expanded our review to more top genes and found more interesting evidence (Supplementary Table S2, “TopGenesbyDifferentialAnalysis” tab).
      2. Literature-based analyses also reduce the time researchers spend prioritizing their investigations. For example, in our comparison of SARS-CoV-2–infected lung and Rhinovirus-infected nasal tissue, we found >2000 genes upregulated only in SARS-CoV-2–infected lung but not in Rhinovirus-infected nasal cells. It is not easy to derive a hypothesis from so many genes. When we overlapped the gene list with literature-based analysis, FURIN popped up as the most well-studied gene, and we did not find any report that mentioned that SARS-CoV-2 can regulate FURIN This raised our interest and led to a suggested mechanism in which SARS-CoV-2 could evolve to induce FURIN expression and gain superior infectivity. FURIN’s upregulation is significant but not among the top genes, in terms of fold change (>2-fold change, FDR p th by fold change). Thus, without the literature-based analysis, this observation could have easily been neglected.
      3. Such analyses help researchers to prime their hypotheses for novel findings. For example, in our comparison between SARS-CoV-2–infected lung and Rhinovirus-infected nasal tissues (Figure 4G, Supplementary Figure 5D and E), we found many upregulated genes, but OASL was not in our literature-based analysis, which indicated that it is under-studied and worth highlighting. We hope the reviewer will agree that we should retain the literature-based analysis in our paper. These analyses were not meant to be conclusive but rather a way to prioritize investigations. Finally, we removed CRP from Fig 1E and the main text to avoid confusion.
      1. The data portal, implemented through cellxgene, is accessible for non-programmers to use. However, it is very easy to end up with an "Unexpected HTTP response 400, BAD REQUEST" error, with essentially no description of the cause of the error or how to rectify it. When this occurs (and in my experience it occurs very frequently), this also forces the user to refresh the page entirely, losing any progress they may have made. I see that the authors describe this error in their FAQ page, but their answer is not very intuitive and I was unsure of what they meant: "This happens because the samples you selected doesn't contain all "Group by" you want compare for each "Split by" group. You could confirm using the "Diff. groups" buttons.".

      We apologize for the confusion. This excellent point made by the reviewer required an improvement in the software engineering, which we have now completed. We have figured out how to avoid this error and have run thorough tests to ensure that it does not appear anymore. We also added a gitter chat channel to our landing page, so that users can report if they encounter this or other errors.

      I would therefore ask that the authors provide more detailed tutorials (ideally step-by-step) on common analyses that users will want to perform, hopefully minimizing the amount of frustration that users will encounter.

      We thank the reviewer for this suggestion. We have uploaded several video tutorials to our landing page and will gradually add more. We also added a gitter chat channel, so users can ask questions, report bugs, or suggest new studies to include in the portal.

      1. Selection of samples is not very quick or intuitive. If I wanted to select only the samples from one specific GEO accession, I had to resort to individually checking the boxes of the sample IDs that I wanted. If I instead selected the GEO accession under the samples source ID, then used the "Subset to currently selected samples" button, I invariable got the HTTP error 400 message. Of course, this may simply reflect my lack of familiarity with cellxgene; I would nevertheless encourage the authors to improve the FAQ to include a step-by-step example for how to do common analyses/procedures.

      We apologize for the confusion. To select an individual GEO accession, users can simply tick the box beside “Samples Source ID.”

      Then all boxes would be clear for “Samples Source ID” that allow you to select only the one you want. We also have uploaded video tutorials to help users learn how to navigate the portal.

      We apologize for the “HTTP error 400” messages. We figured out that users would encounter that message frequently after they encounter it once due to a back-end cache mechanism. We have now improved the portal from the software-engineering side. In our recent tests of the latest version, this error does not appear anymore. We also added a gitter chat channel on our landing page so that users can report encountering this or other errors.

      1. The second case study, centered on coagulation genes, is misguided. Alteration of coagulation lab values in severe COVID-19 patients is reflecting the general inflammatory state of these patients, and would not be expected to manifest on the transcriptional level in infected cells/tissues. Coagulation labs are measuring the functional status of the coagulation cascade, which is far-removed from the direct transcription of the corresponding genes - proteolytic processing of clotting factors, etc. As with CRP (see above comment), most clotting factors are transcribed almost exclusively in the liver (check GTEx portal); I would not expect upregulation of coagulation factors in lung cell lines/organoids/cultures etc after infection with SARS-CoV-2. I would recommend the authors to pick a different gene ontology set for a case study, as the current one focusing on coagulation is confusing in a pathophysiologic sense.

      We thank the reviewer for this suggestion. To avoid any misleading claims, we have replaced the coagulation gene list with a filtered gene list from the “Coronavirus disease - COVID-19” KEGG pathway (hsa05171) to showcase how to identify experiments in which this gene signature is enriched or depleted. We also replaced Figures 3G-J with new results.

      1. The two large clusters of blood-derived samples vs other tissues is not surprising and the authors' interpretation is confusing. The authors write that "the COVID-19 signature was not able to overcome the tissue specificity and that immune cells might respond to SARS-CoV-2 differently." This should be immediately obvious given the pathophysiology of COVID-19 infection; the cell types that are directly infected by SARS-CoV-2 will of course have a distinct response compared to the circulating blood cells of COVID-19 patients, which are responding by mounting an immune response. There is no reason to expect a priori that the DEGs in the directly infected lung cells would be similar to that of immune cells that are mounting a response against the virus.

      We thank the reviewer for these comments. We agree that it should be obvious that directly infected lung cells would differ from immune cells. However, this has never been shown in a large dataset. Also, it is not obviously whether all other different tissues would respond to SARS-CoV-2 differently. Thus, we believe it is important to present this overview. We have amended the description to deliver clearer message as “This confirmed immune cells respond to SARS-CoV-2 differently from other tissues also suggested the response of most other tissues might sharing similar features.”.

      1. The authors devote considerable space in the manuscript to exploring "batch effects" and trying to minimize them (pg10-11 Fig 4A-D, Fig S4). However, given that the compiled datasets are from entirely different experimental and biological systems (e.g. in vitro infection vs patient infection, different cell lines, timepoints after virus exposure, diverse tissues, varying disease severity), it is inappropriate to simply refer to all of these differences as "batch effects" alone. Usually, the term "batch effect" would refer to the same biological experiment/system (i.e. A549 cells infected with CoV vs control), but performed on different days or by different lab personnel - in other words, batch effects are purely due to technical differences. This term clearly does not apply when comparing samples from entirely different cell lines, or tissues, etc, and the authors should not keep describing these differences as batch effects that should be "corrected" out.

      We thank the reviewer for the insight. We apologize for the confusion caused by using the phrase “batch effect correction” to describe our approach. We agree that the difference between studies should not be referred to as a “batch effect correction” and have now amended the descriptions to avoid confusion.

      Indeed, the authors themselves state that the main point of their "batch effect correction" efforts is only for PCA visualization. I therefore feel this section contributes very little to the overall manuscript, especially given the authors' own recommendation that all analyses should be performed on individual datasets (which I certainly agree with). I assume that the authors were required to provide some sort of dimensional reduction projection for the cellxgene browser, but this is more a quirk in their choice of platform for the web portal. Thus, this section of the manuscript should be deemphasized.

      We thank the reviewer for these comments and again apologize for the confusion caused by our use of the term “batch effect correction” to describe our approach. However, we believe these parts of the paper should be retained for the following reasons:

      • In practice, sample mislabeling can happen. PCA or simple clustering approaches are very useful for helping raise researchers’ attention, so they could further check the possibility of sample mislabeling.
      • Even within a study, one sample can be an outlier due to low or unequal sample quality. Removing outliers would help boost the significance of real findings. Without our approach, it would be harder for users to notice and remove outliers from their investigations.
      • Finally, these efforts are useful for generating hypotheses. For example, although we collected a lot of data, it is not feasible for us to read all the details in all the manuscripts published. We observed a similarity between SARS-CoV-2–infected lung samples and Rhinovirus–infected nasal samples by exploring our portal’s capabilities (Figure 3E-F). Then we read the manuscripts in which those data were published and found that our discovery was consistent with the original studies’ results. We believe these efforts are essential to help researchers generate or refine their hypotheses. As we update the database with more samples, this approach will become increasingly powerful.
        1. Given the limitations of any combined multi-dataset analyses, one very useful feature would be to conduct "meta-analyses" across multiple datasets. For instance, it would be informative to find which genes are commonly DEGs in user-selected comparisons, calculated separately for each dataset and then cross-referenced across the relevant/user-selected datasets.

      We thank the reviewer for this comment. Indeed, we agree that “meta-analyses” are useful and have now compiled Supplementary Table S2 and Figure 1F to demonstrate the commonly regulated genes. To enable user-selected comparisons across studies on our portal, we need to design a thoughtful user interface. Otherwise, the results from our portal could easily cause fatal misinterpretation. For example, GSE154613 includes samples like DMSO, Drug, SARS-CoV-2, and DMSO+SARS-CoV-2. If a user simply selected to compare SARS-CoV-2 versus Control, the results would be SARS-CoV-2 and DMSO+SARS-CoV-2 versus DMSO and Drug. Such functions need time to design and implement; therefore, we will consider this suggestion for further development of our portal.

      **Minor comments:**

      1. Fig S1G, color legend should be added (I understand that these colors are the same from S1H).

      We thank the reviewer for the comment. We have now added information about the colors in the figure legend.

      1. Mouseover text for trackPlot on the data portal is incorrect (it says the heatmap text instead).

      We thank the reviewer for this comment. We have now corrected this bug.

      1. Abstract should be revised to describe only the 1093 final remaining RNA-seq samples after filtering/QC steps.

      We thank the reviewer for this comment. We have now amended the Abstract to include this information.

      1. Text in many figures is too small to be legible. I would suggest pt 6 font minimum for all figure text, including the various statistics in the figure panels.

      We thank the reviewer for this comment. We have now amended the font sizes and will provide high-resolution figures in revision.

      1. Are the DE analyses in Fig 1F specifically limited to control vs SARS-CoV-2/COVID-19 comparisons? Many of the samples included in this study are from other respiratory infections (labeled "other" in Fig 1B).

      We thank the reviewer for the question. Figure 1F was not originally limited to control vs SARS-CoV-2/COVID-19 comparisons, because we thought control vs virus, drug vs mock, or difference between time points would also be interesting. If we narrow the analysis to contrasts only between control vs SARS-CoV-2/COVID-19, Figure 1F would be still look similar (as below) because the genes in that comparison comprise the largest share of genes included in the original graphic.

      In the end, we replaced Figure 1F to avoid confusion and added more details in the Methods.

      1. The word cloud format is not conducive for understanding or interpretation. It would be much more informative to simply have a barplot or similar to clearly indicate the relative "abnudance" of a given gene among all 315 DE analyses.

      We thank the reviewer for this comment but respectfully disagree with this point. Visualization of the relative “abundance” of genes with word clouds is a relatively novel concept in computational biology. However, we believe, that in this case, it has certain advantages over visualization using traditional bar plots for example. The word cloud format allows us to highlight genes relative to their importance, with the word “importance” being used here in the sense of combined metrics from DEGs, as shown in Figure 1F, or the frequency with which genes are mentioned/discussed in various literature sources, as shown in Figure 1E. For this purpose, the exact values will most likely not be important for most users/readers. Be presenting a word cloud visualization, readers can easily discern the top genes and use them in the exploration of their own data or the CovidExpress portal. However, if users want to analyze raw values, we provide in Supplementary Table S3 a full list of all genes and gene sets that can be download from our landing page (section “CovidExpress Expression Data Download”) in GMT format. Also, when we visualized the ranks of genes by using bar plots as the reviewer suggested, the results were much harder to read (as shown in the bar graph below) than simply looking at the raw data in supplementary tables.

      1. Claims of increased/decreased dataset separability should have statistical analysis on the silhouette score boxplots (Fig S4G-I).

      We thank the reviewer for the reminder. We have added statistical tests to referred silhouette score boxplots (Wilcoxon rank test)

      1. Regarding Fig 4E-F - what are the key genes that contribute to PC1, and how do they relate to the DEGs in Fig 4G?

      We thank the reviewer for this question and apologize for the confusion. In Figure 4E-F, the PCA were based on ssGSEA score, as each gene set would have a score for a sample, not individual genes. Thus, the top contributed to PC1 were gene sets upregulated or down-regulated in certain contrasts. We provided on the portal’s landing page detailed results for top gene sets (for the ssGSEA approach) and genes (for the TPM approach) that contributed to various PCs (“Clustering Results for Reviewing and Download” section). This allows users to download and further explore these data.

      1. Statistics describing the relation between OASL And TNF/PPARGC1A should be included to justify the author's statements. This could be correlation, mutual information, regression, etc.

      We thank the reviewer for this suggestion, and we have updated Figures 4J-K to show the correlation values and corresponding F-statistics. The Pearson correlation between OASL and TNF was significant (Pearson Correlation=0.75 and p-value = 6.85e-72), but the correlation between OASL and PPARGC1A had a negative slope and showed a moderately significant p-value (Pearson Correlation=-0.08 and p-value=0.12), confirming to a certain degree our statement. We have now updated the corresponding text in the manuscript.

      1. There are several studies now that have performed scRNA-seq on the lung resident and peripheral immune cells of COVID-19 patients. To more definitively tie in their analyses in Fig 4J-K/Fig S5D-E (to affirm "its important role in the innate immune response in lungs"), the authors should assess whether OASL is upregulated in the lung macrophages of COVID-19 patients vs controls.

      We thank the reviewer for this suggestion. Indeed, Liao, et al. recently reported “BALFs of patients with severe/critical COVID-19 infection contained higher proportions of macrophages and neutrophils and lower proportions of mDCs, pDCs, and T cells than those with moderate infection.” (Nature Medicine, 2020, https://doi.org/10.1038/s41591-020-0901-9). They further refined macrophage data into subclusters and reported top enriched GO terms as “response to virus” (group 1), “type I interferon signaling pathway” (group 2), “neutrophile degranulation” (group 3), and “cytoplasmic translational initiation” (group 4). When we investigated their data, we found that group1 and group2 both identified OASL as a marker gene, indicated OASL might response to virus and help type I interferon signaling. Furthermore, another data set (from Ren et al., Cell, 2021, https://dx.doi.org/10.1016%2Fj.cell.2021.01.053) showed several clusters in patients with severe COVID-19 (left panel below) that were enriched for OASL expression(right panel below).

      We have now added these observations to strengthen our hypothesis about the role of OASL.

      1. The visualization and analysis functions in the data portal appear to work reasonably well out of the box. However, the download buttons for plots did not work in my hands. I realized that a workaround is to right click -> "Save image as" (which then downloads a .svg file), but this is not ideal and should be fixed to improve usability. I had tested the data portal on both Firefox and Edge browsers, using a Windows 10 PC.

      We agree with the reviewer. Due to some technical issues with the figure javascript plugin, the download feature does not work unless the figure is saved as a file on the server side. To avoid any security issues, we tried to minimize new file generations, hence, for the moment we have disabled this feature. Users can still download high-resolution .svg figures by using the right-click -> “save image as.” This information is now included in the FAQ section on the portal’s landing page.

      Reviewer #2 (Significance (Required)): The data portal appears to have useful analysis and visualization features, and the data collection appears to be quite comprehensive. I would strongly encourage the authors to continue collecting datasets as they become available and further improving the usability of the portal. As noted in the above comments, I think there is potential for their cellxgene-based browser to be useful to non-computational biologists, but at present, the data portal is not as simple to use as it should be. With further efforts to developing step-by-step tutorials for common analysis/visualization tasks, more informative case studies, and the other revisions suggested above, this study could be a valuable resource for the community. Of note, this review is written from the perspective of a primary wet-lab biologist with extensive bioinformatics experience but limited web development expertise.

      We thank the reviewer for the positive comments. We understand the importance of data updating. Our plan is to complete quarterly updates once this manuscript has been accepted or when 10 new studies have been either collected by us or suggested by users. This information is also now included in the FAQs of the portal’s landing page. We have also uploaded several tutorials videos to the landing page and will gradually add more. We also added a gitter chat channel, so users can ask questions, report bugs, or suggest new studies to add to the database.

      **Referee Cross-commenting** I agree with the comments of the other reviewers. Reviewer #3 (Evidence, reproducibility and clarity (Required)): **Summary:** The ongoing COVID-19 pandemic is a big threat to human health. The researchers have conducted studies to explore the gene expression regulations of human cells responding to COVID-19 infection. A website that integrating those datasets and providing user-friendly tools for gene expression analysis is a valuable resource for the COVID-19 study community. The authors collected published RNASeq datasets and developed a database and an interactive portal for users to investigate the gene expression of SARS-CoV-2 related samples. This website would be of great value for the SARS-CoV-2 research community if the batch normalization problems are solved. **Major comments:** 1) The major concern of CovidExpress is the batch effects from different studies. As the authors have shown and mentioned in their discussion that "For the current release, we strongly suggest investigators to perform gene expression comparison within individual study." This limits the usage of CovidExpress as integrating analysis from multiple datasets of different studies is the key value and purpose of CovidExpress.

      We thank the reviewer for the comment. Reviewer #2 reminded us, and we agree, that differences between studies should not be considered “batch effects.” We apologize for the confusion. The GSEA function provided in the portal does not suffer from batch effect, because all the pre-ranked lists of genes are based on contrasts from the same studies. Although we cannot correct for the differences between studies, we did correct for effect caused by differences in software and parameters used. For example, in our approach, the DEGs from GSE155518 and GSE160435 (both studies of primary lung alveolar AT2 cells from Mulay et al., Cell Report, 2021) were significantly correlated (below panel A, p-value = 1.36e-24, F-test). However, if we simply download the TPM values from their GEO records, GSE155518 appears to show a genome-wide decrease in expression in SARS-CoV-2–infected samples (below panel B). These errors might lead to false hypotheses.

      2) The authors should include experimental protocols as one key parameter in the description and further integrating analysis of different datasets. As the authors showed that QuantSeq is a 3' sequencing protocol of RNA sequencing. However, it is not convincing to me that simply excluding QuantSeq samples is the ideal solution for downstream integrating analysis as QuantSeq has been shown that it has pretty good correlations with normal RNASeq methods in gene quantifications. It is interesting that there are 21.2% of samples were biased toward intronic reads. What protocol differences or experimental variations would explain the biases?

      We thank the reviewer for the comment and apologized for not being clearer. One of our main goals re-processing all samples is to correct for pipeline processing–related batch effects. We tried to reduce those effects introduced by using different software or parameters. QuantSeq or similar protocols are heavily bias to 3’ UTR; thus, the software and parameters used for RNA-seq data will not be suitable. In contrast, we agree that the downstream results from QuantSeq have good correlation to RNA-seq (we observed a correlation of ~0.75, when compared to the log2 fold-change from Quant-Seq to RNA-seq). However, we could not reconcile QuantSeq always correlated well with RNA-seq, in terms of individual quantification. For example, Jarvis et al. recently reported only ~0.35 correlation between QuantSeq and RNA-seq (https://doi.org/10.3389/fgene.2020.562445). Theoretically, the correlation would be weaker for genes with a small 3’ UTR. Thus, we will not include QuantSeq data in this portal. However, if we collect enough studies in the future, we will consider uploading a separate portal just for QuantSeq using a pipeline optimized for protocol bias to 3’ UTR.

      For the 21.2% samples that were biased towards intronic reads, we believe they reflect differences in the kits used. For example, of the 162 samples “BASE_INTRON (%)” >30% (Supplementary Table S1) that passed QC, 76 samples were total RNA obtained using the SMARTer kit and 36 were total RNA obtained using the Trio kit. Given that we have 105 samples of total RNA derived using the SMARTer kit and 38 samples of total RNA derived using the Trio kit, we conclude that the Trio kit was more biased toward introns, and the SMARTer kit was also strongly biased. This finding is consistent with those of others who have reported the bias of the SMARTer kit (Song et al., https://doi.org/10.1186/s12864-018-5066-2). Users can find these results in our Supplementary Table S1. We have also uploaded the protocol information to our portal.

      3) How do the authors plan to update and maintain CovidExpress?

      We thank the reviewer for this question. We understand the importance of data updating. Our plan is to update the database quarterly once this manuscript has been accepted or when 10 new studies have been collected by us or suggested by users. We have added this information to the FAQs on the portal’s landing page. We also understand the importance of maintaining the service for a feasible amount of time for research. Therefore, we will keep the server activated for at least 2 years after the WHO announces that COVID-19 is no longer a global pandemic. We will also ensure that, even after we take down the server , scientists with programming skills will be able to create local servers based on the data provided on CovidExpress.

      **Minor comments:** 1) Some texts in figures are not readable. For example, Fig2B, 2C, 2D, 2E.

      We thank the reviewer for this comment. We have now increased the font sizes and provided high-resolution figures in revision.

      2) The authors could use Videos to demonstrate how to use CovidExpress on the website as they have shown in Fig3.

      We thank the reviewer for this suggestion. We have uploaded several video tutorials to the landing page and will gradually add more. We also added a gitter chat channel so that users can ask questions, report bugs, or suggest new studies to include in the database.

      Reviewer #3 (Significance (Required)): The ongoing COVID-19 pandemic is a big threat to human health. Many molecular and cellular questions related to COVID-19 pathophysiology remain unclear and many researchers have conducted studies to explore the gene expression regulations of human cells responding to COVID-19 infection. However, there is no database/website that integrating all RNASeq data to provide user-friendly tools for gene expression analysis for COVID-19 researchers. The authors collected the published RNASeq datasets and developed a database and an interactive portal, named CovidExpress, to allow users to investigate the gene expressions response to COVID-19 infection. CovidExpress is a valuable resource for the COVID-19 study community once the batch normalization problems are solved. The users who came up with ideas about the regulation of COVID-19 response could use the system to test their hypothesis, without experience in bioinformatics and RNASeq data analysis. This will be more important when more RNASeq data from samples with different tissues, cell lines, and conditions are integrated into the database.

      We thank the reviewer for the positive comments. We apologize for the confusion and acknowledge that we should not describe our effort using the term “batch effect.” As described by Reviewer #2 (and we agree), batch effect should be used only to indicate a purely technical difference in the same biological system; for example, differences in experiments performed on different days or by different lab personnel. Thus, we cannot correct for “batch effect” by using CovidExpress. We hope that the reviewer realizes that what we did was correct for the effect caused by differences in software and parameters across the studies. For example, in our approach, the DEGs from GSE155518 and GSE160435 (both primary lung alveolar AT2 cells (both from Mulay et al., Cell Report, 2021) were significantly correlated (panel A below; p = 1.36e-24, F-test). However, when we downloaded the TPM values from their GEO records, GSE155518 appeared to have a genome-wide decrease in the expression of SARS-CoV-2–infected samples (panel B below).

      Thus, using the proceed data directly without careful reviewing the method might lead to false hypothesis. At last, researchers can make new discoveries, such as our OASL and FURIN findings, by using many other features that CovidExpress provides.

    1. Plessner's distinction between plant and animal is both an enabling pathway towards his anthropology but also quite distinctive and worthy of consideration in itself. Contrary to the mainstream legacies of botany and zoology, but consistent with the logic he is developing, plants and animals for Plessner are a priori life-categories or modals of the organic based upon filling alternative organizational possibility spaces that follow from the dialectics of positionality and not in the first instance about the distinction between autotrophy and heterotrophy. Accordingly, certain heterotrophic species such as corals, hydroids, bryozoan and ascidians are classed by Plessner into the plant category. In broad terms, the plant-animal distinction is defined by the difference between "open" and "closed" positionalities, a distinction which has much to do with levels of mediation. One may think of this as two alternative basic strategies for achieving the aforementioned balance between assimilative and resistive moments immanent in any form of positionality. "A form is open if the organism in all of its expressions of life is immediately incorporated into its surroundings and constitutes a non-self-sufficient segment of the life cycle corresponding to it" (p. 203). An open form of positionality, we can say, doesn't require mediation by way of a posited center and the consequence of this is realized throughout the morphology, physiology and growth patterns of the plant. Morphologically this is manifested in the tendency of the organism to develop externally and expansively in a way that is directly turned toward its surroundings. It is characteristic of this kind of development that it does not have the need to form centers of any kind. The tissues responsible for mechanical solidarity, nutrition, and stimulus conduction are not anatomically or functionally concentrated in particular organs but rather permeate the organism from its outermost to it innermost layers. The absence of any central organs tying together or representing the whole body means that individuality of the individual plant does not itself appear as constitutive but rather as an external moment of its form associated with the singularity of the physical entity; in many cases the parts remain highly self-sufficient in relation to each other (graftings, cuttings). This led a great botanist to go so far as to call plants 'divididuals'. (pp. 203-204)

      This is an interesting classification of "open" and "closed", depending on whether the living organism has uniform functionality or specialized, and centralized structures respectively.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      FULL REVISION

      Manuscript number: RC-2021-00934

      Corresponding author(s): Seiya, Mizuno

      General Statements

      We would like to thank all the reviewers for their comments on improving the manuscript. We are encouraged by the overall positive responses from the reviewers. According to the reviewers’ comments, we have further refined our manuscript. We are confident that we have addressed all the reviewers’ comments and suggestions by incorporating them into the revised manuscript. We highlighted the changed text in the manuscript in red. The point-by-point responses to all comments follow.

      Point-by-point description of the revisions

      Reviewer 1:

      The study by Akihiro and colleagues describe the generation of multiplex genotyping method for detecting CRISPR gene editing alleles using nanopore sequencing and a machine learning program. The method is based on long-range PCR amplification of intended targeted loci from gene edited animals followed by nanopore sequencing. A PCR-index is introduced to the sample pooling system before sequencing, thus allow sequencing up to 100 sample in one flowcell. The study developed a machine learning program for allele binning, analysis, and presentation. To demonstrate the applicability of the method, the study has validated their methods for detection of point mutations, deletion, and flox insertion. The study has in principal provided sufficient investigation and data to demonstrate the validity of the method. All the figures are very nicely and clearly presented. However, there is a few concerns that it should be taken in to consideration.

      We appreciate the constructive and important comments from the reviewer.

      Reviewer 1_Comment #1:

      Many previous reported unintended structure variations caused by CRISPR off-targets are typically much larger deletion/insertion/insertion/translocation occurred outside the target sites. The current study is more for targeted allele genotyping. The use of structure variable (SV) in the whole study should be considered to revise thoroughly.

      SV is typically referred to genomic variation of approximately 1kb and above. What the study describe in this study is still within indel types instead. Thus, comparing the DAJIN with NanoSV and Sniffles on reads with 50, 100 and 200 bases deletions is not appropriate.

      The detection of SV alleles in the whole study is most likely a cause of minor indel alleles and sequencing errors. Figure 2b, BC32, WT mice also contains a proportion of SV allele, which is apparently caused by sequencing error. Such SV which is not related to CRISPR gene editing is also seen in other genotyping results e.g. Figure 3a. Figure 4b, Figure 5c, Figure 6b.

      Another co-factor that contributes to the SVs is the PCR-error from the method.

      Thank you very much for your comments. We agree that structural variation traditionally referred to genomic alterations that are larger than 1 kb in length. Although the application of sequencing technology has expanded the spectrum of structural variation to include smaller events >50 bp in length (PMID: 21358748, PMID: 26432246), there are no common understanding on the definition of the name of genomic rearrangements >50 bp in length through genome editing. We changed the name of the unexpected mutation reads more than approximately 50 bp in length “Large rearrangements (LAR)”. We changed description on the name of reads that DAJIN annotates in the Methods (Page 6, Line 205) and Results section (Page8, Line 249) as well as all other parts throughout the manuscript.

      Although we believe most of the LAR alleles are the real alleles generated through genomic rearrangements (Fig. 3b&3c, S12, and S16), we recognize that minor fractions of the LAR alleles, including those observed in WT mice, are composed of reads with high sequencing error rate. Visualized BAM files and consensus sequences can be indicators of the annotation results, providing information to the users of DAJIN that minor alleles that are similar in proportion to the one in the WT sample can be artificial alleles. We also cannot exclude the possibility that LAR alleles include those generated through PCR error. ‘Pseudo-LoxP’ alleles could be generated if the PCR products, which included one-side LoxP but not another-side LoxP, worked as a PCR primer to anneal WT allele in the next PCR step (Page 12, Line 425-427). Recently developed methods may address these limitations. We added description in the Discussion section (Page 17-18, Line 608-620).

      Reviewer 1_Comment #2:

      The reason that current method detect more than two alleles from one animal is probably due to the chimerism of the animal. However, when looking at the BAM file and figures presented in Figure 1b, 2c, 3b, 3d, 4c, as well as those in the Supplementary figures, there seems to be more than one allele (indels reads with different size) presented in one category.

      For example, Figure 2C, mice BC12, it is not fully aligned between the all alleles and the allele1 and allele 2 presented. For allele 1, which is called SV, there are reads with different size of indels. For allele 2, which is called intended PM, some reads are a hybrid of deletion and intended substitution.

      Thank you for checking the data in detail. As the reviewer pointed out, some of the reads in each allele showed indels with different sizes. We think these indel mutations are due to nanopore sequencing errors. Although the error rate of nanopore sequencing has improved, it has been reported that an error rate of 5% occurs in 1D sequencing of R9.4 flow cells that is the same flow cells used in our study (DOI: 10.1002/wfs2.1323). In this study, DAJIN mitigated the nanopore sequencing errors by calculating the MIDS score (Fig. S7), but the visualization using the BAM file showed the raw reads including the sequence errors. For this reason, the one allele seems to include different indel alleles.

      To evaluate the point, we performed Sanger sequencing and found that there were no hybrid sequences containing indel mutations, but only intended point mutation in BC12 allele 2 (Fig. 2d). The results of Sanger sequencing suggested that the indel mutations visualized by the BAM file were due to nanopore sequencing errors. To clarify the points, we updated the description in the Discussion section (Page 15-16, Line 528-548).

      Reviewer 1_Comment #3:

      What is the advantage of the current method as compared to the one reported by Bi et al., 2020, genome biology, previously?

      Thank you for pointing it out. We believe that one of the advantages of IDM-seq developed by Bi et al. is performing quantitative analysis by correcting PCR bias via Unique Molecular Identifiers (UMIs). However, when multiple samples are processed simultaneously, it is impractical in terms of cost and workability to prepare primers for the UMIs. While IDM-seq has the advantage to quantify the precise amount of each allele in a single sample, DAJIN is more suitable for primary and comprehensive analysis of multiple genome-edited samples. We have described these points in the Discussion section (Page 15, Line 509-513).

      Reviewer 1_Comment #4:

      The report machine learning method is developed for calling the different alleles. Has the authors compare DAJIN with e.g. NanoCaller, which is developed for SNPs and small indels calling based on DNN.

      We are thankful to the referee for bringing the comparison with NanoCaller to our attention. We conducted NanoCaller and found it performed better to detect the point mutation than Medaka and Clair. However, because NanoCaller could not detect the LAR (formerly labelled as “SV”) alleles, it incorrectly reported the genotype of BC25 as 'point mutation', not 'LAR with point mutation'. We added the results of NanoCaller in Table S9 and described these points in the Results section (Page 10, Line338-339).

      Reviewer 1_Comment #5:

      Apart from genotyping, many CRISPR studies performed in cells are focusing on profiling the indel profiles in a pool of edited cells. It would broaden the applicability of the method for detecting different indels types in such samples and conditions. Current methods, such as TIDE/ICE, NGS-based amplicon sequencing, IDAA can only detect smaller indels. DAJIN will add the advantage of detecting longer indels for such application.

      Thank you very much for your comments. We added description on application of DAJIN in the Discussion section (Page 17, Line 592-596).

      Reviewer #1 Significance :

      Although similar methods are reported for genotyping of the CRISPR editing outcome, the current study introduce the PCR barcoding and particularly the bioinformatic tool box for allele binning and calculation contribute with useful tool to the filed. The study has demonstrated with multiple applications demonstrating the broad applicability of it.

      Reviewer 2:

      CRISPR nucleases typically generate DNA double strand breaks (DSBs) at target site, which typically generate small insertion and deletion (indel) enabling precise gene knockout or knock-in. However, accompanied DNA DSBs often induce unwanted large deletions or chromosomal translocation. Thus, to assess such large variations as well as small indels is crucial in the genome editing field. In this manuscript, the authors developed a long-range assessment tool, named Determine Allele mutations and Judge Intended genotype by Nanopore sequencer (DAJIN), using a long-read sequencer, Nanopore sequencing. Overall, the topic will be interesting for broad readers and this tool looks technologically sound. I would suggest a few comments that may strengthen this manuscript, as follows.

      We are grateful for the referee’s valuable suggestions to improve our manuscript.

      Reviewer 2_Comment #1:

      Another key study is missed in this manuscript. Recently, a tool with similar concept to DAJIN was published in Nat Methods, which uses also long-read sequencers, Nanopore or PacBio [PMID: 33432244]. It is necessary to describe the benefits of DAJIN against the previous study.

      Thank you for pointing this out. Our method has an advantage over those utilizing unique molecular identifiers (UMIs) in its automatic identification and classification of genomic rearrangements including unexpected mutations in multiple samples obtained under different editing conditions (different target loci). As per our response to the Reviewer #1_Comment #3, one of the disadvantages of UMIs is the cost. More accessible methods of routine assessment of on-target genome editing outcomes are required, as well as unbiased assessment of editing products (PMID: 32643177). We showed in the manuscript that the machine-learning-based model could bypass molecular tagging to provide a feasible approach for routine assessment of genome editing outcomes. DAJIN will make a very significant contribution to speeding up and improving the accuracy of this experimental process.

      We agree that the approach reported by Karst et al. has certainly contributed to generation of highly accurate single-molecule consensus sequences. Analysis of small portion of samples using UMI-based methods may compensate for the limitations of DAJIN such as PCR bias and/or PCR-mediated recombination as you described in your comment #6. We added description in the Discussion section (Page 15, Line 509-513; Page 17, Line 615-618).

      Reviewer 2_Comment #2:

      In Figure 1a, the authors used Barcoding but details information is not present in the main text. The length and context information are necessary to be described in the main text.

      We thank the reviewer for these comments. According to the comments, we illustrated the process of PCR-based barcoding in Fig. 1a. Besides, we described the length of barcodes at "Library preparation and nanopore sequencing" in the Methods section (Page 4, Line 137 & 140).

      Reviewer 2_Comment #3:

      The term "SV (structural variation)" over "Single-nucleotide variant (SNV)" seems ambiguous. Does "SV" include large deletion and chromosomal translocation? In this manuscript, I guess that SNV indicates small indels, whereas SV indicates large indels. The detailed definition is needed for better understanding.

      Thank you very much for your comments. We intended to classify and label large genomic rearrangements including large deletion and chromosomal translocation as “SV (structural variation)”. We agree that structural variation traditionally referred to genomic alterations that are larger than 1 kb in length. Although the application of sequencing technology has expanded the spectrum of structural variation to include smaller events >50 bp in length (PMID: 21358748, PMID: 26432246), there are no common understanding on the definition of the name of genomic rearrangements >50 bp in length through genome editing. We changed the name of the unexpected mutation reads more than approximately 50 bp in length “Large rearrangements (LAR)”. We changed description on the name of reads that DAJIN annotates in the Methods (Page 6, Line 205) and Results section (Page8, Line 249) as well as all other parts throughout the manuscript.

      Reviewer 2_Comment #4:

      In Figure 2, IGV exhibits several SNVs (i.e., random errors) in each query sequence, which might be due to the low accuracy of Nanopore sequencing. I understand that DAJIN makes consensus sequence based on those long-read sequences. But I wonder how DAJIN pinpoint the point mutation (PM) so exactly?

      Thank you for pointing it out. As you mentioned, the low accuracy of Nanopore long-read sequencing made PM detection difficult. We tackled the issue and partly solved it by (i) calculation of MIDS score (Fig. S7), (ii) reducing data's dimension by principal component analysis (PCA), and (iii) setting proper parameters of HDMSCAN.

      DAJIN converts ACGT nucleotide information to MIDS (Match, Insertion, Deletion, and Substitution) (Fig. S6). Subsequently, DAJIN subtracts the relative frequency of MIDS between a control and a sample. We called the subtracted relative frequency 'MIDS score' (Fig. S7). The subtraction mitigates the sequencing errors because the error patterns are similar between a sample and a control. We next perform clustering using the MIDS score. DAJIN compresses the score by PCA and extracts five dimensions. The dimension reduction may be effective to mitigate sequencing errors because the sequencing errors have lower scores than true mutations. Subsequently, DAJIN performs HDBSCAN, a density-based clustering method. The HDBSCAN have a parameter of 'min_cluster_size' that indicates a minimum number of samples in a cluster. DAJIN finds the parameter returning the most frequent cluster numbers by searching the value in the range of 10% to 40% of reads. It means DAJIN ignores minor clusters that contain less than 10% of reads. We set the criteria because sequencing errors often made such minor clusters.

      In summary, we consider the MIDS score, PCA and the parameter setting of HDBSCAN support DAJIN's accurate PM detection. To clarify the point, we updated the description in the Methods section (Page 7, Line 217-225).

      Reviewer 2_Comment #5:

      In page 9, the authors also used next-generation sequencing (NGS). I guess this NGS indicates illumine-based short-read sequencing. Clearer definition is necessary here.

      We thank the referee for bringing this unclarity to our attention. According to the reviewer's comment, we updated the words 'NGS' to the 'illumina-based short-read next-generation sequencing' or 'short-read NGS' in the whole text.

      Reviewer 2_Comment #5-1:

      Whereas DAJIN could reported SVs, PM, and WT, the NGS could not capture SVs. Could you write the reason here? I guess that the short-read sequences including SVs might be discarded during the alignment process, which means that it is because of software limitation, rather than the NGS itself.

      Thank you for pointing this out. In this study, we performed the short-read NGS analysis by paired-end sequencing (2 x 151 bases) for PCR amplicons of about 200 bp length. We consider the main reason that NGS could not capture LAR (formerly labelled as “SV”) is due to the PCR process. The allele 2 in BC20, BC25, and BC26 of Tyr point mutation had a large deletion including primer annealing sites, which makes it impossible to obtain the PCR amplicon of this allele. Besides, allele 1 in BC25 had about 60-70 bp insertions. The insertion might make it difficult to amplify the whole length of this allele because of the limited number of cycles in short-read NGS.

      To examine whether the short-read sequencing reads were discarded during the alignment process, we calculated the mapping percentages of BC20, BC25, and BC26 and found that 97-99% of reads were successfully aligned to the mm10 reference genome. We think this result can support our hypothesis. We added the results in Table S10 and described the points in the Results section (Page 10, Line 329-332).

      Reviewer 2_Comment #6:

      Basically, DAJIN amplify the target region using PCR, thus PCR bias (e.g. unequal amplification according to different lengths) should be considered. The authors should address it. Moreover, it is better to describe the limitation of current DAJIN in the discussion section.

      Thank you very much for your comments. PCR amplification of genomic DNA is essential in our method described in the manuscript. As we have described in a paragraph in the Discussion section (Page 17, Line 597-601), we recognize there is an unavoidable limitation with PCR bias. We also cannot exclude the possibility that large rearrangements (‘LAR’, formerly labeled as ‘SV’) include alleles generated through PCR and/or sequencing error. ‘Pseudo-LoxP’ alleles could be generated if the PCR products, which included one-side LoxP but not another-side LoxP, worked as a PCR primer to anneal WT allele in the next PCR step (Page 17, Line 608-613). We recognize that minor fractions of the ‘LAR’ alleles, including those observed in WT mice, are composed of reads with high sequencing error rate. Recently developed methods including the one you kindly mentioned in the comment #1 may address these limitations. We added description in the Discussion section (Page 17-18, Line 615-618).

      Reviewer #2 Significance:

      Overall, the topic will be interesting for broad readers

    1. SciScore for 10.1101/2021.10.18.21265145: (What is this?)

      Please note, not all rigor criteria are appropriate for all manuscripts.

      Table 1: Rigor

      <table><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Ethics</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Sex as a biological variable</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Randomization</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Blinding</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Power Analysis</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr></table>

      Table 2: Resources

      No key resources detected.


      Results from OddPub: Thank you for sharing your code.


      Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:
      Our study has limitations, which must be considered. The SNP heritability estimates for both measures of Covid-19, while >0, were small. Second is that genetic correlations, while being robust against environmental confounders, can still suffer from genetic sources of confounding (i.e., even with genetic correlations, correlation is not always causation). To this point, we think it is highly unlikely that not being breastfed as a baby and eating less cheese cause ASB. In fact, we chose these dietary traits to illustrate this very point. Rather, the shared genetic architecture that these have with education years, verbal reasoning, and average income are the more plausibly causal phenomena. Third, we cannot determine the direction of causality with genetic correlations alone. For much of the discussion above, we tacitly presumed plausible directions of effect (e.g., ASB causing Covid-19 versus Covid-19 causing ASB). But with all the traits in our matrix, the prevailing direction of effect could be the opposite and/or some level of bi-directional causation may exist15,24. These uncertainties are avenues for future research. Specifically, at present MR cannot be leveraged to test whether ASB causes any of the traits investigated, since few genome-wide significant signals have been found for ASB. But once they are found, bi-directional MR can be used to decipher the prevailing directions. A fourth limitation is that our findings are limited to those of European ancestry. The limi...

      Results from TrialIdentifier: No clinical trial numbers were referenced.


      Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


      Results from JetFighter: We did not find any issues relating to colormaps.


      Results from rtransparent:
      • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
      • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
      • No protocol registration statement was detected.

      Results from scite Reference Check: We found no unreliable references.


      <footer>

      About SciScore

      SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.

      </footer>

    1. A lot of the literature and shorter articles out there treat many of these systems as recent or "new inventions". Many reference "innovators" like Ryan Holiday or Niklas Luhmann. They patently are not. They've grown out of the Western commonplace book tradition which were traditionally written into books underneath thematic headings (tags/categories in modern digital parlance) until it became cheap enough to mass manufacture Carl Linnaeus' earlier innovation of the index cards in the early 1900s. Then one could more easily rerarrange their ideas with these cards. Luhmann allowed uniquely addressing his cards which made things easier to link. Now there are about thirty different groups working on creating digital tools to do this work, some under the heading of creating "digital gardens".

      Often I think it may be easier to go back to some of the books of Erasmus, Melanchthon, or Agricola in the 1500s which described these systems for use in education. Sadly Western culture seems to have lost these traditions and we now find ourselves spending an inordinate amount of time reinventing them.

      I'd love to hear your experience in re-introducing it to students in modern educational settings.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: #RC-2021-00992

      Corresponding author(s): Parisa Kakanj and Maria Leptin

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this study, the authors use the fruit fly as a model to understand the role and regulation of autophagy in epidermal integrity during development and wound healing. They discover that hyper activation of autophagy via overexpression of Atg1 leads to disruption of epithelial organization, junctional protein localization, and syncytium formation. In addition, these epidermal defects were found to be dependent on TORC1 as knockdown or inhibition of TORC1 antagonists resulted in similar epidermal defects which could be rescued by knockdown of Atg1 or Atg5. Wound healing in fruit fly epidermis is known to induce cell fusion and here the authors show that syncytium formation is dependent on autophagy. GFP-Atg8a autophagosomes were found to accumulate in cells adjacent to the wound site, but Atg1-induced syncytium formation was dispensable for wound repair. However, the authors found that hyper activation of autophagy prior to injury slowed wound closure. This may be due to defects in actomyosin organization or another developmental defect the authors observed in the epidermis. Overall, the key conclusions of this study are convincing, but the experiments would be strengthened by validation of all the RNAi strains used as well as demonstration that epidermal barrier remains intact as described.

      **Major Comments**

      1. This study uses a number of UAS-RNAi strains as well as dominant negative and overexpression transgenes. There is no validation that these genetic perturbations work as expected.

        Almost all of the lines we use have been extensively used and validated by others as documented in the literature. We append a table (below, page 14) with these references. It would be close to impossible for us to show their tissue specific efficacy in the larval epidermis because it is extremely difficult to obtain clean dissections of epidermis without contamination from other tissues (muscles, nerves, etc.), and we believe we can rely on the known validation of most of the lines. It is true that some of the lines are less well characterised, and we comment on those below, and will eliminate our speculation on their effects in the manuscript.

      In fact, the authors state on pg 5 that RNAi to Atg6, Atg7, and Atg12 may be less effective, but do not verify the knockdown efficiency to the gene of interest (i.e. Atg5 RNAi knock downs Atg5 transcript or protein).

      Atg12 and Atg7 have been shown (PMID: 25882046) by quantitative RT-PCR to effectively reduce RNA levels in the midgut during larval to pupal transition. We will therefore have to eliminate our speculation that the weak effect in the epidermis may be due to ineffective knock-down. Rather, it seems that these components are accessory but not necessarily essential for the completion of autophagy, as also observed by others (PMID: 25882046, PMID: 1805642, PMID: 23599123, PMID: 15296714, PMID: 23873149, PMID: 23406899)

      This is particularly important as authors use a single UAS-rictor RNAi strain to conclude that autophagy is dependent on TORC1 and not TORC2. If rictor RNAi is also weak or ineffective than this conclusion would be erroneous.

      The function of rictor has been validated by classic genetics: Animals homozygous for deletions of rictor show no defects throughout their normal life cycle (Hietakangas and Cohen, 2007). We have also shown that epidermis of homozygous rictor∆1 larvae (marked with Src-GFP, DsNuc-Red2) shows no abnormalities in cell shapes or cell membranes. We include an image here.

      Figure A __| Effect of rictor deletion on the epidermis. a,b, Fluorescence micrographs of larval epidermis expressing the indicated markers in a larva homozygous for a rictor deletion (rictorEY08986 , also named rictor∆1). a, Lower magnification showing the entire width of larval segments A3 or A4. n=16-18 larvae each genotype. Scale bars: a 50 μm; b,__ 20 µm.

      A major conclusion of this study is that autophagy remodels the lateral cell membranes and not the basal or apical, so the membrane integrity remains intact. This is described and shown in Fig S3a, but it is hard to see that the apical membrane is intact. It would be helpful if authors could show a true membrane marker, such as UAS-CD8mGFP to see if there is a continuous membrane.

      We will include new experiments with this marker.

      Alternatively, is there a barrier assay that could help demonstrate that syncytium formation does not disrupt epithelial integrity?

      This follows from the fluorescence recovery we performed (Supplementary Video 13), where we observe rapid diffusion between areas in the epidermis, but never any leakage of fluorescence in the y-axis into the body cavity. We will emphasize this more clearly in the text.

      This could be performed in the fly gut, using the smurf assay (Rera M et al. 2011), since the authors also describe (pg 9) a similar role for autophagy in disruption of epithelial lateral membranes.

      We had done a smurf assay, and observed no leakage from the gut, but didn’t document this at the time because of difficulties during the period of Covid restrictions of accessing a dissecting scope/camera set up in a lab outside our own. We will try to repeat this now in the hope that with current reduced restrictions we can record the result.

      Is autophagy dependent syncytium formation cell autonomous?

      Our clonal analysis in wound healing addresses this point (Figure 2; text page 5 and 6). Clones of GFP-expressing cells neighbouring a wound share their cytoplasmic contents with their neighbours during wound closure. However, a clonal cell that is Atg5-deficient in a wild-type background does not share its content with the neighbouring cells. This shows that for a cell to participate in syncytium formation, that every cell itself has to be competent to perform autophagy. We will expand the explanation of this point in the text.

      The A58-Gal is not cell-type specific as authors describe (pg 9) similar effects in trachea, salivary glands, and intestine and it is unclear if effects are due to disruption of autophagy in epidermal cells or general disruption in fly's physiology. The authors should determine, using a more restrictive Gal driver, whether syncytium formation is due to activation of autophagy in the epidermal cells or another cell type (trachea, salivary glands, or intestine).

      We apologize if our phrasing of ‘ectodermal’ led to the impression that A58-Gal4 is cell-type specific. A58 also drives expression in the tracheal system, as all other available epidermal drivers do. A58 expression in the salivary gland is presumably due to the origin of the Gla4 construct, which like many other Gal4 drivers (e.g. NP1-Gal4) includes salivary gland specific enhancers (PMID: 8223268 and PMID: 12324947). A58 is not active in the gut, and for the experiments in the gut we used the NP1 driver. We will rephrase the text in the paper to avoid confusion. There is no driver that is absolutely restricted to the epidermis.

      Alternatively, if no other Gal4 is available for the larval epidermis then authors could at least show using enterocytes driver (NP1-Gal4) that overexpression of Atg1 is sufficient to induce syncytium formation and its effect on gut barrier integrity.

      We did do this experiment but didn’t include the images because of the large number of figures we already had. We now show them here. As mentioned above, barrier integrity is not compromised. We can also provide images of the phenotype in tracheal cells.

      Figure B __| Effect__ of uncontrolled autophagy on enterocytes and salivary glands. Larval gut or salivary glands expressing the indicated markers and overexpression (Tsc1,2 or Atg1S) or RNAi (raptori) constructs using the NP1-Gal4 driver. Images are from live imaging of gut or salivary gland of 6 to 11 larvae for each genotype. Scale bars, 20 µm.

      In Fig 8, authors nicely show that Atg1 RNAi can rescue Tor RNAi and raptor RNAi, but, what about the reverse. Is overexpression of Tor sufficient to inhibit the overexpression Atg1 and reduce autophagy-induced syncytium formation?

      Overexpression of Tor would affect both TORC1 and TORC2. We have done this experiment using UAS-Torwt construct but found that it leads to excessive autophagy rather than suppression, consistent with similar results reported by others (PMID: 12324961 and PMID: 15186745). This approach can therefore not be used to do the proposed experiment. Instead, one could use downregulation of the Tor inhibitor TSC1, which acts on TORC1, and we have shown to reduce autophagosome formation in wound healing (Fig. 1d). Another option is to overexpress the TORC1-specific activator Rheb (PMID: 12893813, PMID: 17208179 and PMID: 31422886). We will set up the experiments with these constructs in the hope that they will yield interpretable results.

      **Minor comments:**

      1. Check spelling of abbreviations, Sqh is often misspelled Shq in figures

        We will correct them. Thanks for alerting us.

      The order of images in Figure 3 should match the description in the text (pg. 6).

      We would prefer to retain the current order because it is then consistent with all the other figures. Re-writing the text to reflect this order would make it less clear.

      AtgW is described in text, but not shown in Fig 3a-c. Also, upstream activators of TORC1 are described first, but shown last in this Figure making it difficult to follow.

      We will now only mention Atg1W later in the text where we also show it in a figure.

      Fig7a should show junctional effect of Atg1W alone and in combination with Atg5i which is used in 7b.

      We had left this out to save space, but we will now include these data.

      It is unclear why authors switched to this weak overexpression for this photobleaching assay when Atg1S was predominantly used in the rest of the study.

      The reason we used Atg1W in this particular experiment is that we had it on a chromosome where it was recombined with GFP which made it genetically much easier to use for FLIP experiments. However, perhaps these constructs merit some discussion. Atg1W and Atg1S were originally called “weak” and “strong” based on studies in other tissues and other stages (PMID: 33253201). However, we found that in the epidermis their effects are practically indistinguishable, as judged by TEM (Fig.3d,e) (Fig 5e,f) (Suppl. Fig. 5a,b and Suppl. Fig. 6b,c), and all markers we used in confocal analyses (which we will include them). Thus, to avoid confusion, we will change the nomenclature we use on our paper to the neutral Atg1GS and Atg16B.

      Reviewer #1 (Significance (Required)):

      This study elucidates the role and regulation of TORC1 and autophagy in epithelial membrane remodeling. This is important work that is significant to both developmental and wound healing research. Many cell types become multinucleate during differentiation, aging, and wound healing and here the authors find a novel role for authophagy in remodeling lateral cellular junctions to facilitate syncytium formation.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In their present manuscript Kakanj and colleagues show that during epithelial wound healing autophagy pathway controls plasma membrane integrity and homeostasis. Furthermore, elevated autophagic activity is sufficient to induce syncytium formation, which is essential for wound closure and healing. Authors used the epidermis of fruit fly larvae as model to study wound healing and video microscopy to examine this process. The methodology is well established, since authors already published a related study in 2016 using similar tools.

      The findings presented here are interesting and promising, the quality of most experiments are satisfactory, the confocal images/videos are excellent and I truly appreciate that authors used electron microscopy to support some of their claims. Findings are well presented and the text is well written and easy to read.

      Overall, my opinion is very positive about this manuscript.

      I believe most of the findings are very well supported, but I have some suggestions, which may can help strengthen the authors' points.

      1) Authors used GFP-Atg8a reporter to follow autophagy during wound healing. While I also believe that, the appearing GFP-Atg8a dots represent autophagic vesicles after wounding but GFP-Atg8a has some certain limitations. First: Atg8a (or LC3 in mammals) is removed from the outer surface of autophagosomes by Atg4 and the Atg8a trapped inside the autophagosomes will be degraded in the autolysosomal lumen. Thus Atg8a not always localizes to autolysosomes, actually Atg8a immunostaining mostly labels autophagosomes (and phagophores) but not autolysosomes in insect cells. Accordingly, GFP-Atg8a reporter is also subject of autolysosomal degradation and furthermore most of the GFP signal is quenched in the acidic lumen of autolysosomes, since at lower pH GFP loses fluorescence. Nevertheless, if lysosomal degradation proceeds normally, GFP-Atg8 will be degraded completely. Thus, some of the autolysosomes cannot be detected using this reporter, for this mCherry-Atg8a reporters can be used, since mCherry is more resistant than GFP and thus accumulate inside lysosomes, and retains its fluorescence in acidic environments.

      This is a good suggestion and we had done these experiments. However, the red fluorophores have a serious problem in that they all tend to form small aggregates or puncta – not in all tissues and at all stages, but this is a very wide-spread phenomenon, and is even observed in in vitro experiments (own observations). This makes quantification of vesicles or other small structures such as autophagosomes complete impossible. Nevertheless, here are a few figures from our analyses. While some of the spots clearly appear to be autophagosomes, as judged by their positions, they cannot be objectively distinguished from the other spots.

      Figure C __| Autophagy during epidermal wound healing. Time-lapse series of single-cell wound healing in larva expressing mCherry-Atg8a (black) to mark autophagosomes and autolysosomes (A58>mCherry-Atg8a). a, z-projections of a time-lapse series. b, Higher magnification of the areas marked by magenta boxes in (a). n=11 larvae. Each frame is a merge of 57 planes spaced 0.28 μm apart. Scale bars: a 20 μm; b,__ 10 µm.

      However, I still believe that for video microscopy GFP-Atg8a was a perfect choice, I just suggest to confirm the appearance of autophagosomes after wounding by other means: for instance, immunostaining of the epidermis after wounding (120 min) against Atg8a should confirm the presence of autophagosomes. There are a few specific available antibodies working in flies which are listed in the reviews of Nagy (PMID: 25481477) or more recently in Lorincz (PMID: 28704946)

      This is technically a huge challenge. We would have to induce a single cell wound, then filet and fix the epidermis, during which it rolls up and often destroys the area of interest. If it doesn’t, then the prep can be flattened out, but it still can be very difficult to find the wound in the prep.

      2) One of the major claims of the authors is that elevated autophagy leads to the breakdown or removal of lateral plasma membranes to promote syncytium formation. It is clearly seen on the confocal or EM images that lateral membranes disappear after wounding. However, it is also suggested that the lateral plasma membrane material is incorporated into autophagosomes or plasma membrane is a potential membrane source of autophagosome formation. I believe this is the least supported claim of the manuscript since no direct evidence for this is presented. This is based on BodyPy staining only, that BodyPy positive vesicles accumulate inside the cells. If this is indeed the case plasma membrane components should be detected in autophagic vesicles. Thus, I recommend co-staining membrane components with autophagic markers.

      This is indeed the clear next step, and we did a number of experiments along those lines, but they were once again compromised by the problem with the mCherry aggregates. This made the interpretation in the unwounded epidermis with artificially upregulated autophagy impossible. However, experiments with naturally upregulated autophagy in wound healing yielded results that are consistent with plasma membrane components being associated with autophagosomes (with the caveat that not every red dot we see represents an autophagosome). We have just repeated some of these using the septate junction marker FasIII and have obtained some beautiful movies that show FasIII labelled membrane (green) being surrounded by mCherry spots, and as the membrane begins to dissociate, the mCherry spots turn from red to yellow. We have included stills from results of these analyses here and will include them in a new figure in the revised manuscript.

      Figure D __| Colocalization of Atg8a and the septate junction component FasIII during epidermal wound healing. a, Time-lapse series of single-cell wound healing in a larva expressing mCherry-Atg8a (red) (A58>mCherry-Atg8a) and endogenously tagged FasIII (GFP gene trap; green), a transmembrane component of septate junctions. b, Higher magnification of the time-lapse marked by magenta boxes in (a). n=11 larvae. a,b, Each frame is a merge of 68 planes spaced 0.28 μm apart. Scale bars: a,b __20 μm.

      However if authors observe no colocalization of plasma membrane components with autophagy markers I still believe this study worth to be published. I would like to recommend the review of Ungermann and Reggiori (PMID: 29966469) in which the trafficking of Atg9 is discussed,

      Yes, indeed. And there is in fact now a further paper that goes in a similar direction (PMID: 34257406). We had left this out because we did not have direct data on Atg9, but will be happy to include it in the discussion in which we cite the paper that shows that Drosophila Atg9 is localized on the lateral plasma membrane in nurse cells, and loss of it leads to syncytium formation.

      since the source of autophagosomal Atg9 is in part the plasma membrane in mammalian cells. Therefore, these findings may strengthen the authors' claims.

      **Minor points:**

      Figure 2A: I believe authors wanted to use the word 'maintaining' not mating in their scheme.

      Indeed. Thanks for alerting us.

      Discussion: Authors suggest that: another function of autophagy in the cells surrounding the wound may be to clear up debris as in planarian and other cell types autophagy is activated in healthy cells, which simultaneously phagocytose cell debris. Honestly, I do not believe that this is the case here. Some of the Atg proteins are indeed required for phagocytosis during LC3-assiciated phagocytosis (LAP) (see: PMID: 30787029), but LAP is independent form Atg1

      Good point, we will include this in the discussion.

      and if LAP happened in the cells, surrounding the wound then GFP-Atg8a positive phagosomes would appear in those cells. However, it is clearly not the case here.

      Reviewer #2 (Significance (Required)):

      I highly recommend this manuscript to be uploaded to a relevant journal and I believe the findings presented here will be interesting for biologists specialized in regeneration and readers from the autophagy fields alike.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      The larval epidermis of Drosophila is a prime model for studying wound healing by combining live imaging with cellular, genetic and molecular analysis of the processes involved. Autophagy is known to be activated and necessary for efficient wound healing in animal models through secretion of cytokines and clearance of bacteria. This manuscript implicates autophagy in cellular syncytium formation during wound healing. Live imaging demonstrates autophagy activation in cells surrounding the wound. Inhibition of autophagy by RNAi against atg1 or atg5, required for autophagy initiation and autophagosome formation had no effect on the rate of constriction and closing of the wound site. However, elegant live imaging demonstrates that autophagy is required cell autonomously for cell fusion, a normal process during wound healing in flies. Autophagy can also be instructive for cell fusion. Strong induction of autophagy by TORC1 inhibition, TSC1/2 overexpression or Atg1 overexpression induce cell fusion that is genetically dependent on atg5, a gene acting downstream of atg1 in autophagosome formation. As Chloroquine treatment, a chemical inhibiting autophagosome fusion to the lysosome and lysosomal breakdown showed no effect, the authors suggest that later steps of autophagy are not involved. Live imaging with a selection of cellular fluorescently tagged markers of apical, lateral and basolateral membrane domains, combined with electron microscopy show clearly that lateral membrane are disrupted and removed within the epithelium. During this process, membranous large vesicles "drift" away from the plasma membrane. If these vesicles relate to autophagy is not addressed. In addition to the effect on cell fusion, strong autophagy induction also leads to autophagy within the nucleus, chromatin condensation and distortion of the nuclear membrane. The manuscript is well written and easy to follow. Figure panels and data are clearly presented. All experiments are well described throughout and skillfully executed with appropriate controls and statistical analysis. It remains unknown what induces autophagy in response to wounding. It also remains unclear whether autophagy deconstructs or engulfs parts of the plasma membrane, or if parts of the autophagy machinery has additional roles in plasma membrane fusion.

      **Major comments:**

      • Are the key conclusions convincing? -Conclusions are generally balanced and convincing.

      -I have seldom seen a paper so well written, presented and balanced by first pass. Hence my experimental suggestions are few.

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? -Claims are well founded.
      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary to evaluate the paper as it is, and do not ask authors to open new lines of experimentation.

        -The inhibition of autophagy is performed using knockdown of two genes acting in autophagy initiation (atg1, a part of the ULK1 kinase complex) and atg5, required for autophagosome formation. Later acting genes in the autophagy process such as autophagosome closure, fusion with the lysosome or degradation were not analyzed. In the abstract, the authors state "Proper functioning of TORC1 is needed to prevent autophagy from destroying the larval epidermis which depends on membrane isolation and phagophore expansion, but not fusion of autophagosomes to lysosomes". As far as I can see, the last statement on fusion derives from experiments with Chloroquine. Although frequently used for qualitative experiments, CQ is not suited for conclusive experiments. Without genetic experiments targeting genes for autophagosome-lysosome fusion such as snap29,stx17,vamp7 this statement is in my mind not well supported.

      We agree this would strengthen our findings, and we had indeed ordered these strains from the Bloomington stock collection. However, they were dead on arrival and both our labs in Heidelberg and Cologne currently have major problems with shipments from Bloomington and German customs. Other colleagues whom we asked did not have them available either. We will continue to search for appropriate constructs, but even if we find them and they arrive alive, and are processed by customs within a reasonable time, it will take many weeks to establish and then expand them and subsequently do the multi-generation crosses to obtain the stocks with all the relevant drivers and markers to set up the experiment. Three months is the absolute lower limit provided everything works according to plan, and first time round 6 months is a more realistic assumption. We hope that the referees and the editors agree that while this is a desirable experiment, it is not essential for the publication of the other results we present.

      • Are the suggested experiments realistic for the authors? It would help if you could add an estimated cost and time investment for substantial experiments. -Given the expertise of the authors, these experiments should be easy to perform within 3 months.

        • Are the data and the methods presented in such a way that they can be reproduced?

        • The manuscript is well written and an excellent example of how how methods and experiments should be presented. Methods, tools and experiments are all well described.

        • Are the experiments adequately replicated and statistical analysis adequate? -Replicates and statistics are adequate and custom for the type of analysis performed.

        **Minor comments:**

      • Specific experimental issues that are easily addressable. Figure 3 h. The live imaging documents the striking disappearance of lateral cell membranes using SRC-GFP. In 3h, large vesicle formation and movement towards the cell interior is shown. How frequent is this?

      This can only be seen clearly in experiments with time-controlled (Gal80ts) induction of authophagy where we can observe the process unfolding. We see these structures very frequently, but great variability in morphology and the structures are not always captured clearly in the plane of imaging. We here provide further examples.

      Figure E __| Autophagy in unwounded epidermis. a-c, Three additional examples showing apparent extrusions from lateral membranes after induction of autophagy (same experiment asn Figure 3h).__ Time-lapse series of epidermal cells expressing Src-GFP and Atg1S. Transgene expression is induced at the end of the second larval instar, live imaging started 6 h later (t=0) and continued for an additional 6 hours. a-c, Src-GFP containing material appears to be taken out of and eventually detached from lateral cell membranes (arrows).

      Is this believed to be the mechanism of lateral membrane removal?

      We would of course like to believe that, but we have no proof, and would therefore only be able to speculate.

      If so, is it dependent on the autophagy machinery. Are these vesicle positive for autophagy markers?

      Some autophagy markers have indeed been reported to be associated with the plasma membrane (e.g. Atg9, Atg16), but a conclusive study, while highly desirable, in our view goes beyond the scope of this study.

      Resolving this issue may lift the conclusions of the paper. Using 3xCherry-Atg8 together with SRC-GFP, this should be possible.

      We are intrigued by this suggestion and will be setting up the necessary crosses to do the experiments. However, it will take a long time to generate the necessary stocks (see genetics described below), and we will then again encounter the problem with the mCherry aggregates (see response to referees # 2). We are curious about the outcome, but we do not think it will be reasonable to promise as part of this revision that we will be able to provide conclusive results in the foreseeable future. Along with the many other things to do, this may just have to become part of a future paper, especially if there turn out to be other problems to be solved along the way. Like, for example, having to make an infrared (like mIFP or mKate, with which we have had much better experience in other contexts) Atg8 construct.

      Using CQ, the authors should be able to detect plasma membrane and junctional components in autophagosomes or autolysosomes (by confocal and electron microscopy) as degradation is inhibited. This should help to distinguish whether lateral membranes are engulfed and digested or if cells simply fuse, by using a part of the autophagy machiney.

      We have many interesting EM images on which we have had extensive discussions with the Paolo Ronchi and Yannick Schwab at the EMBL (whom we embarrassingly forgot to acknowledge in our manuscript, which will now be corrected), and one of the authors of this paper (BM) is an expert in EM imaging of the larval epidermis. It was agreed that some structures could indeed be interpreted as autophagosomes with content resembling junctional material. However, in the absence of absolute proof, we did not include them in the paper. We now show them here.

      Figure F __| Autophagosomes with junctional material in unwounded epidermis.__ Transmission electron micrographs of sections through the epidermis of a larva with elevated autophagy (A58>Atg1S) at two different magnifications. Arrows mark the autophagosomal membrane with content resembling junctional material.

      The authors, state that strong autophagy activation also leads to syncytium formation of tracheal cells, salivary glands and gut EC cells. Representative images in a supplementary figure would be useful for future reference.

      See response to other comments above (response to referees # 1). We have added some images in this document (Figure B) and will be happy to add additional ones in the revised manuscript.

      • Are prior studies referenced appropriately? -Yes. Key literature and findings are cited and discussed.

      • Are the text and figures clear and accurate? -Yes

        • Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      -See suggested experiments above.

      Reviewer #3 (Significance (Required)):

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field. -The findings clearly documents a role of autophagy in syncytium formation in the physiological process of wounding. This has parallels to muscle syncytium formation, but has to my knowledge not been demonstrated in any other cell type to be performed by autophagy. Moreover, the authors show that strong autophagy induction can lead to fusion of epithelial cells. This may have relevance for processes and diseases where polyploidy are observed.

      • Place the work in the context of the existing literature (provide references, where appropriate).

      • State what audience might be interested in and influenced by the reported findings. -The data are very strong and the demonstration that autophagy controls syncytium formation outside of muscle development is surprising and significant. It is of interest to the field of cell biology and development in general and the autophagy field in particular. It will also be of interest for the medical field that deals with multinuclear phenotypes, such as cancer.

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate. -Development, cell signaling, autophagy, vesicle trafficking.

      Table 2 | Fly stocks used in experiments

      Transgenes

      Stock ID

      Source

      Publications using this construct

      Reference

      UAS-GFP-Kuk

      (UAS-GFP-KukEY07696(w+))

      Jörg Großhans

      PMID: 16421189

      https://flybase.org/reports/FBal0161312

      29

      UAS-Atg1i

      (UAS-Atg1RNAi)

      V # 16133

      (GD7149)

      PMID: 19363474

      PMID: 31995752

      PMID: 32032548

      PMID: 32915229

      https://flybase.org/reports/FBtp0034071.html

      UAS-Atg5i

      (UAS-Atg5RNAi)

      V # 104461

      (KK108904)

      PMID: 31995752

      PMID: 32032548

      https://flybase.org/reports/FBtp0046851.html

      UAS-Atg6i

      (UAS-Atg6RNAi)

      V # 110197

      (KK102460)

      PMID: 28581519

      PMID: 23599123

      PMID: 27542914

      PMID: 25644700

      Dissertation of Philipp Trachte, Abb. 23. https://refubium.fu-berlin.de/handle/fub188/27709

      Dissertation of Sirena Soriano Rodríguez. https://roderic.uv.es/bitstream/handle/10550/50749/Tesis%20SSoriano.pdf?sequence=1

      UAS-Atg7i

      (UAS-Atg7RNAi)

      V # 45558

      (GD11671)

      PMID: 25882046

      PMID: 31995752

      PMID: 32032548

      PMID: 23599123

      https://flybase.org/reports/FBtp0025106.html

      UAS-Atg12i

      (UAS-Atg12RNAi)

      V # 29791

      (GD15230)

      PMID: 25882046

      PMID: 17568747

      PMID: 31995752

      https://flybase.org/reports/FBtp0027770.html

      UAS-TSC1,2

      (UAS-TSC1, AUS-TSC2)

      Iswar K. Hariharan

      PMID: 15296714

      PMID: 11348592

      64

      UAS-TSC1i

      (UAS-TSC1RNAi)

      V # 22252

      (GD11836)

      PMID: 23144631

      PMID: 29144896

      PMID: 29456138

      https://flybase.org/reports/FBtp0025266.html

      UAS-Tori

      (UAS-TorRNAi)

      BL # 33951

      Nobert Perrimon

      PMID: 25882046

      PMID: 26395483

      https://flybase.org/reports/FBtp0065159.html

      65

      UAS-TORDN

      (UAS-TORTED)

      BL # 7013

      Thomas P. Neufeld

      PMID: 15296714

      PMID: 29144896

      https://flybase.org/reports/FBtp0016360.html

      66

      UAS-raptori

      (UAS-raptorRNAi)

      BL # 34814

      Nobert Perrimon

      PMID: 25882046

      PMID: 31048465

      https://flybase.org/reports/FBtp0068814.html

      65

      UAS-raptori-2

      (UAS-raptorRNAi)

      BL # 41912

      Nobert Perrimon

      PMID: 32097403

      https://flybase.org/reports/FBtp0081336.html

      65

      UAS-rictori

      (UAS-rictorRNAi)

      BL # 36699

      Nobert Perrimon

      PMID: 25882046

      https://flybase.org/reports/FBtp0070835.html

      65

      UAS-Atg1S

      (UAS-Atg16B)

      Thomas P. Neufeld

      PMID: 33253201

      https://flybase.org/reports/FBtp0041043.html

      67

      UAS-Atg1W, UAS-GFP

      (UAS-Atg1GS10797)

      Thomas P. Neufeld

      PMID: 33253201

      https://flybase.org/reports/FBal0216676.html

      67

      UAS-S6Ki

      (UAS-S6KRNAi)

      BL # 41895

      Nobert Perrimon

      PMID: 25284370

      https://flybase.org/reports/FBtp0080798.html

      65

      UAS-SqaKA

      (UAS-SqaT279A/CyO)

      Guang-Chao Chen

      PMID: 21169990

      https://flybase.org/reports/FBtp0071419

      30

      UAS-RhoAi

      (UAS-RhoARNAi)

      V # 12734

      (GD4726)

      PMID: 23853710

      PMID: 33789114

      https://flybase.org/reports/FBtp0031970.html

      UAS-Roki

      (UAS-RokRNAi)

      V # 104675

      (KK107802)

      PMID: 24995985

      PMID: 33789114

      https://flybase.org/reports/FBtp0046110.html

      UAS-RhebAV4

      BL # 9690

      Fuyuhiko Tamanoi

      PMID: 31909714

      PMID: 28829944

      https://flybase.org/reports/FBal0141561.html

      69

    1. if all the batteries that generated electricity for telegraph lines had stopped working the economic life of the nation would have come to a halt

      This part of the lecture, which explains the importance of the telegraph in society during its prime stages, is something that I feel like it truly significant to recognize as a progressor in an environment. When we mention the invention and use of the telegraph in today's world when discussing the past, the true role it played for the different branches of society is always overlooked as it was an element in which many depended on due to its transmission speed. Before this discussion, I looked at the telegraph as a simple device of skinny wires that communicates information from one end to another, mainly helping news reporters. However, as the lecture mentions, the telegraph was more than this, it was heavily depended on in crucial areas of society such as the military, the economy, and the railroad, aspects that are all signs of progression towards a better life. If the telegraph ever ran into any mechanical problems, the flow of life would be heavily disturbed due to its importance, as "trains would stop running, businesses with branch offices couldn't function, news papers couldn't cover events far away..." (Networking Nature Lecture). Personally, I think it is interesting, and also important to note, that the telegraph is not only just an aspect meant for communicating information across the world, but also a necessity to continue improving the economy and keeping the public informed and motivated. Even if people know the importance of it, they ignore the fact how if one piece of society stops working, being the telegraph, many other parts of society "come to a halt" as well and not just one minimal aspect, something truly interesting. While it may not be considered an important item by some, it seems as if the telegraph is one of the first major inventions that heavily impacted the change and improvement of society along with the railroad, as it was often used by many, those being reporters and businessmen, and any issues would stop progression and hurt those who depend on its functions.

    1. NATURE

      I think this essay expands on what sublime really comes down to. In Cole's writing he is more appreciating the nature, but in Emerson's he is justifying that nature is all around us and makes the world up as we know it. Even beyond that point he goes to talk about our reasoning for existence a lot saying that, "All science has one aim, namely, to find a theory of nature." I think this may have been true in his time with underdeveloped sciences but in today's world there a scientists who have a theory on why nature was created yet still pursue science. Why would a scientist do this if still only trying to pursue the theory of nature that we already have discovered.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      We thank the reviewers for their critical comments and suggestions. We are glad that the reviewers appreciated the quality of the data and the novel findings connecting the secretory trafficking machinery with extracellular matrix-related signaling.

      2. Description of the planned revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Jung et al reports on an interesting finding that focal adhesion signaling regulates the expression of Sec23A and thereby regulates COPII-dependent trafficking. The data presented a mostly solid and the finding itself is highly novel, as it tackles an area of secretory trafficking that remains poorly understood, namely the connection between the ECM and secretion.

      I will list below all comments that I have mixing both technical and conceptual topics:

      \*Technical issues:***

      1-The authors should provide a better description of how the designed this siRNA library. What were the inclusion criteria for these 378 genes? I might have missed it, but I could not find this information easily.

      Reply: The library has been designed in-house based on gene annotations and literature to include cytoskeleton structural proteins, motor proteins, and other associated and regulatory proteins. We will add this information in the Materials and Methods section.

      2-Figure 2: I know this is challenging for EM images, but is there a way the authors could quantify these data? How many images were looked at? What was the average width of ER cisterne?

      Reply: We will provide image quantifications and statistics

      3-Figure 4: I think that the characterization of the FA phenotype is a bit underdeveloped. There is no quantification of these data. Is the size of FA changing? Is the number of FA per cell changing? Is the length of FAs changing? I think that more work is needed to increase the confidence in these data.

      I could also not easily see what type of cells these are. A better description of this experiment is also required. Also, how many cells were analyzed. I think it is important that this experiment is done with a sufficient number of cells to increase the confidence in the data.

      Reply: We agree with the reviewer that our observations regarding the focal adhesion (FA) phenotype will benefit from image quantification and we intend to include this in the revised manuscript. All FA experiments were performed on HeLa cells. We will update the materials and methods sections to better describe this experiment.

      \*Conceptual issues:***

      1-The finding that focal adhesion signaling negatively affects ER-export is surprising, because cancer cells that grow on stiff substrates have more focal adhesions and are more invasive and migratory. Both migration and invasion are expected to depend on ER-export. Although the authors did not formally test Sec23A expression under different stiffnesses, I would expect that stiff substrates would lower Sec23A expression and thereby negatively affect ER-export. It would certainly increase the breadth of this work to include data like this and to also discuss this highly surprising finding. However, it is of course the decision of the authors and the editors to decide whether such an experiment would benefit the entire story.

      Reply: In this work, we have shown that cells plated on ECM or matrigel have decreased SEC23A expression compared to control cells. We have also shown that inhibition of FA kinase leads to an increase in SEC23A expression (Figure 5). Whether this translates into a change in ER transport, is a fair point that we will address in the revision. Regarding stiffness, we have done a preliminary experiment that shows that cells plated on a soft synthetic substrate have less SEC23A than cells plated on plastic.This goes in line with our ECM experiments because Matrigel and fibroblast-derived ECM are softer than plastic.

      2-The authors postulate that this novel mechanism could be part of a feedback loop. If this were the case one would expect the acute effect of FA to increase ER-export (or secretion) and the negative feedback will then reduce secretion. However, the acute effect of FA is not addressed in this manuscript. In order to postulate a feedback loop, the authors would need to test the individual nodes of this loop.

      Reply: The question appears to be whether an acute effect on FA would affect the expression of SEC23A and therefore ER transport. If by the acute effect the reviewer means a pharmacological manipulation, we have shown that upon treatment with the FAK inhibitor the expression of SEC23A increases (Fig 5A). Whether this increase in SEC23A expression translates into a corresponding increase in ER transport remains to be seen. This will be tested in our revised manuscript as mentioned above in reply to point # 1.

      Our data encouraged us to propose a hypothetical feedback loop that would connect the deposition of ECM through the expression of SEC23A. We will have more data to support (or reject) this idea once we do the transport experiments as mentioned above. However, we think that a full characterization of this hypothetical loop by testing individual nodes is beyond the scope of this manuscript

      Reviewer #1 (Significance (Required)):

      I think that the basic finding of this manuscript is highly novel, by showing the impact of the ECM and focal adhesions on COPII-dependent trafficking. I think that this will not only appeal to people from the trafficking community, but also to people working on cell migration and on mechanobiology. The work in its current form does not require much extra efforts (max. 3 month). However, if the authors would decide to increase the breadth of data, they would require 3-6 months.

      Reply: We thank reviewer #1 for the comments. We also believe that this story will appeal to a broader audience and would help to bridge the gap between membrane trafficking and mechanobiology communities.

      \*Referees cross-commenting***

      I went through the comments of the two other reviewers and agree with their verdict. Some extra work on the characterization of the early secretory pathway would be good. Both reviewers provided a nice catalogue of possible experiments to choose from.

      Reply: We have characterized the early secretory pathway in terms of ER exit sites, Beta-COP, and Golgi morphology (FIG. 2B-H and S1A-B). Together, these data strongly characterize the nature of ER-block. Moreover, the finding that our interactors affect the expression of SEC23A allows us to explain mechanistically why an ER transport block occurs. This is further strengthened by the rescue experiments (FIG. 3F). We believe that further characterization of the secretory pathway will not contribute substantially to the main message of this manuscript.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Jung et al which based on a targeted siRNA screen, demonstrates regulation of SEC23A (component of the SEC23 complex of the COP coat) levels at transcriptional level downstream of focal adhesion signaling. By regulating siRNA mediated downregulation, the authors were able to identify proteins which either increased or decreased traffic of VSVG through the secretory pathway when combined with downregulation in the levels of with either SEC23A or SEC23B. Authors have focused on a group of SEC23B functional interactors, downregulation of which shows them increased size of focal adhesions which also downregulate SEC23A levels, thus providing an explanation for reduced secretory traffic. Authors further show that plating cells on fibronectin or Matrigel, which activate Focal adhesion kinase signaling also results in downregulation of SEC23A transcript levels. The screen is conducted in a well-controlled manner for most parts with a clear explanation of the analysis routines and the data presentation if of very good quality. Most important results have been validated by more than one experimental strategy which lends substantial confidence to the findings. The results also open further avenues for understanding the transcriptional regulation in different physiological and disease contexts.

      There are certain issues, which the authors should address with regards to controls and some conflicting observations with published results with respect the phenotypes associated with downregulating proteins on focal adhesions size. Additionally, authors don't tie the ends by monitoring secretory traffic in cells grown on different matrices but include it in the model. Addressing/explaining these issues could improve this manuscript and the model may have to be tweaked a bit.

      \*Major comments:***

      1)I wonder why the authors only used siRNA control in their screen when the effects are scored in context of double knockdown fashion in combination with mild knockdown of SEC23A and SEC23B to get functional interactors. Control siRNA in combination with SEC23A and SEC23B should have been two ideal negative controls in the screen. Nevertheless, in data presented Figure 1E and whole of Figure 2, using control siRNA in combination with SEC23B siRNA would have been ideal control to show that the combination does not induce any trafficking defects which could impact the findings of the study. Hence, a few of the data presented from some of these figures should have sicontrol+SEC23B siRNA combination as a control.

      Reply: There seems to be a misunderstanding. In the screen, the negative controls are only used as a reference as the scoring is based on a 5X5 matrix centered on the siRNA of interest. This is done to overcome possible plate effects and to normalize data across different biological replicas. As seen in figure 1B, the negative controls (Control siRNA or Control siRNA + SEC23A siRNA or Control siRNA + SEC23B siRNA are very close to 0 (but not exactly 0) as they were not used in the normalization process. It is important to mention that all single knockdowns also contain our control siRNA to keep the same final siRNA concentration in single and double knockdowns. In Fig 1E we will include the images from Control + SEC23A siRNAs and Control + SEC23B siRNA as a reference. For Figure 2 all except 2A and 2H have the single knockdowns as controls.

      2)What is the identity of post-ER structures which authors refer to in Figure 2A? Could the images represent VSVG concentrated at ER exit sites? Authors should stain with markers for ERES to see if the VSVG puncta colocalize with it.

      Reply: We have done the experiment, and indeed these structures colocalize with an ER exit site marker (SEC31A). We intend to include this data into the revised manuscript. Our observations are in agreement with what is known in the literature about VSVG transport.

      3)Based on RNA sequencing results, authors chose to follow up on SEC23A levels in background of siRNA knockdown of components (like MACF1, ROCK1, FERMT2 etc.) which regulate Focal adhesions in cells and show that there is a reduction in both transcript and protein levels of SEC23A. In images shown in Figure 2B and Figure 2C, levels or SEC31A and β-Cop1 are reduced. Authors should test using qPCR and western blots whether there is a downregulation SEC31A, β-Cop1 and SEC23B in siRNA knockdowns of MACF1, ROCK1, FERMT2 etc. It would provide new insights if there were a co-regulation of secretory machinery to modulate the secretory traffic in response to Focal Adhesion based signaling.

      Reply: Our transcriptomics data (FIG 3C and Table 5) shows that SEC31A and COPB1 mRNAs are not altered upon any of the knockdowns. For SEC23B, we observed only a slight decrease in ROCK1 knockdown. This data suggests that a co-regulation of the secretory machinery might not be present. Instead, the curation of secretory pathway genes in our transcriptome data shows that SEC23A is the only commonly differentially expressed gene.

      4)Most major concern in this manuscript surrounds around results presented in Figure 4C. Authors show that in response to all the knockdowns, they see more focal adhesions as monitored by Vinculin staining and this along with the experiments with cells plated on Matrigel and Fibronectin arrive at the conclusion that increased Focal adhesion signaling downregulates SEC23A levels which presumably modulates secretory traffic. I am not an expert on Focal adhesions but based on my understanding of the literature on that topic, downregulation of ROCK1, FEMRT2 disrupts focal adhesions. (See: Theodosiou et. al., Elife, 2016 or Lock et. al., Plos One, 2012 for example). How do authors explain their results in siRNA knockdown of ROCK1 and FEMRT2 which leads to an increased size of focal adhesions which seems contradictory to the published results? To clarify these results authors should test phosphorylation of FAK in their siRNA backgrounds which is another read out of focal adhesion signaling.

      The experiments from cells grown on Fibronectin and Matrigel favor the argument which authors put forth, but authors may have to tweak the model a bit based on FAK phosphorylation and FAK signaling in context of above-mentioned knockdowns.

      Reply: Based on the images for vinculin staining, in our current manuscript we propose that changes in FAs occur upon knocking down our interactors. In our revised manuscript we will provide a more robust quantitative assessment of those changes (change in number, size, or intensity) as mentioned in our reply to Reviewer #1.

      As for the discrepancies in the relation of FA phenotype upon depletion of ROCK1 and FERMT2, we want to point out that this effect depends on the cell type used. For instance, the papers listed by the reviewer here use fibroblasts and keratinocytes respectively while we have used Hela Kyoto cells which are epithelial in nature. Another example is that while in fibroblasts depletion of FERMT2 leads to a rounded morphology and almost an absence of FAs (Theodosiou et. al., Elife, 2016), in podocytes (Qu et al JCS, 2011), it leads to fewer FAs but an increase in their size. Nonetheless, this is a very keen observation from the reviewer and we will address this point in our revised manuscript discussion.

      5)What happens to VSVG traffic or RUSH-Cadherin traffic when cells are plated on Matrigel and Fibronectin? Reduction in secretory traffic of these is an important experiment which is missing to close the loop and validate the model presented. Authors must test these experiments either with cells grown on matrix alone or in combination with siRNA to SEC23B. Authors should also monitor ERES and transport carriers in this background.

      Reply: We agree with the reviewer and intend to perform these experiments.

      6)This is not such a major issue, but it would be good to see a comparison in SEC23A levels in siRNA knockdown condition in comparison to those when cells are grown on different substrates and in ROCK1, FEMRT2 knockdowns (blots of which authors already have in this manuscript).

      Reply: We will assess the level of SEC23A at the protein level for cells plated on matrigel or Fibroblast-derived ECM.

      \*Minor comments:***

      1)Scale bars are missing in EM images in Figure 2H.

      Reply: We will add the scales in our EM images

      2)Show molecular weight markers in Western blots in main figure 3E and supplementary figure S1E.

      Reply: We will add molecular weight markers in our Western-Blots

      Reviewer #2 (Significance (Required)):

      I have looked at the manuscript from through the lens of a cell biologist as that is predominantly my area of expertise. In that respect I find the screen conducted by authors particularly interesting as they aim to connect how extracellular cues regulate the secretory pathway. A screen seems justified as there is no comprehensive understanding linking the two above-mentioned processes. Authors have done a functional interaction screen and analyzed a lot of images to identify candidates which either increase or decrease secretory traffic in combination with SEC23A and SEC23B. Such a functional screen has helped authors identify candidates which were otherwise missed in single siRNA knockdowns in their previous work from 2012. This definitely opens up interesting avenues to test the candidates identified in the screen in different physiological contexts and in disease as also the transcriptional program connecting Focal adhesion signaling with the regulation of components governing secretion. Such functional interaction screens could also be employed to identify crosstalk of different cellular processes with the regulation secretory pathway at ER as well as at the Golgi apparatus.

      Reply: We thank reviewer #2 for the comments. As we mentioned in our reply to reviewer #1, we strongly believe that these results will encourage further research at the crossroads of membrane trafficking and mechanobiology.

      \*Referees cross-commenting***

      I agree with the comments from both the referees that the manuscript is very interesting, most experiments are well controlled, but the quantification of focal adhesion phenotype in knockdowns need to be done in an extensive manner and secretion phenotypes need to measured upon plating cells on different matrix to validate the model presented.

      Reply: These two experiments will be included in our revision

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      \*Summary***

      The authors use a synchronized cargo release assay following codepletion of either Sec23 paralog with cytoskeletal and associated proteins to identify potential functional interactions between COPII trafficking and the cytoskeleton. This screen yields a number of Sec23b functionally interacting molecules that stall cargo trafficking to various degrees within the secretory pathway upon codepletion, and in the case of MACF1 reduce ERES number despite not physically interacting. Depletion of the majority of the identified Sec23b functional interactors alone surprisingly caused the downregulation of Sec23a at the mRNA and protein levels, and cargo trafficking could be partially or fully rescued by Sec23a overexpression depending on the codepleted cytoskeletal factor. RNA-seq enrichment analysis and imaging of a focal adhesion marker suggest that genes involved in cell adhesion were differentially regulated following depletion of the cytoskeletal functional interactors. Finally, the authors show that Sec23a expression levels are reduced when cells are cultured on dishes with high amounts of ECM to induce focal adhesions, and that inhibition of focal adhesion kinase can rescue Sec23a expression levels.

      \*Major comments***

      #1 The authors successfully implicate a group of cytoskeletal proteins and their actions at focal adhesions in negatively regulating Sec23a expression levels and COPII trafficking. This description of a shared, novel mode of COPII transcriptional regulation by cytoskeletal factors is convincingly shown to be at least a contributor to the delayed trafficking in the presence of focal adhesions. In general, the data are reproducible and use appropriate statistical analysis. However, a more robust description of the architecture of early secretory pathway would be beneficial, especially in the case of MACF1 codepletion which cannot be fully rescued by Sec23a-YFP overexpression. In contrast, trafficking during codepletion of FERMT2 is fully rescued by Sec23a-YFP despite both MACF1 and FERMT2 showing similar loss of Sec23a mRNA levels upon codepletion. This data suggests that while the trafficking delay in FERMT2 codepletion might be exclusively due to reduced Sec23a expression levels, there are likely additional causes for the trafficking delay observed in MACF1 codepletion.

      Reply: We thank the reviewer for the appreciation of our results and the importance they might bear for the field. The reviewer has very neatly highlighted that each of our interactor hits might have roles in the secretory pathway beyond the ER or independent of the expression levels of SEC23A. This phenomenon could also explain the differential rescue of the arrival of VSVG at the plasma membrane upon SEC23A overexpression in FERMT2 and MACF1 knockdowns (FIG 3F). For instance, MACF1 has been involved in Golgi to Plasma Membrane transport as well (Kakinuma et al. Exp. Cell Res. 2004, Burgo et al. Dev. Cell 2012). So a possibility is that SEC23A overexpression rescues only ER to Golgi transport but the lack of rescue in the compartment between Golgi and plasma membrane independent of SEC23A expression levels would result in reduced rescue In the case of MACF1 compared to FERMT2. To support this, in our revised manuscript, we will provide example images from the experiment.

      Nonetheless, we agree that these are very important observations from Reviewer #3 and warrant a detailed discussion in the light of other interactors as well, which we intend to highlight in our revised manuscript.

      #2 While there is indeed a reduction in the number of ERESs following MACF1 codepletion, the authors report an even more dramatic reduction in 'transport intermediates / cell' as marked by COPI. However, as recent cyro-EM analysis of ERESs has definitively show, COPI exists stably at ERGIC membranes (1). Thus, an alternative possibility for the more dramatic reduction of COPI sites compared to Sec31a sites in Figures 2B-E is that ERGIC membranes are destabilized following MACF1 codepletion in a manner independent of Sec23a expression, and this destabilization compounds with reduced ERES number to ultimately delay trafficking. To more directly determine whether ERGIC membranes stability is regulated by MACF1, the authors should compare COPI and ERGIC-53 staining among MACF1 codepleted and FERMT2 codepleted cells with and without Sec23a-YFP overexpressed to levels that rescue cargo trafficking. If Sec23a-YFP restores the number of ERGIC puntae marked by these stains in FERMT2 but not MACF1 codepleted cells, it would suggest a role for MACF1 in forming or stabilizing ERGIC membranes which are known to associate with microtubules and WHAMM, an actin nucleator. Additionally, it would be useful to costain COPII with COPI or ERGIC-53 in control, MACF1 depleted, MACF1 codepleted, and MACF1 codepleted and Sec23a-YFP rescued cells to determine their colocalization. COPII and ERGIC membranes should be almost entirely coupled and juxtaposed in control cells and may be decoupled upon loss of MACF if plays a role in ERGIC membrane localization and stability. These proposed experiments are relevant because ERGIC membranes are sites of COPII cargo delivery and changes in ERGIC stability or localization would suggest an additional mechanism for cytoskeletal regulation of COPII trafficking. These immunofluorescence studies should be straightforward and completed in a few weeks.

      Reply: Although a possible additional role of MACF1 in the organisation of early secretory pathway, stability of ERES, etc., independent of the expression of SEC23A is interesting on its own, we believe that an extensive characterization of these possible roles/ pathways as proposed by the reviewer is beyond the scope this manuscript.

      #3 The choice to use VSVG and E-Cadherin for the synchronized release assays unfortunately convolutes interpreting the 'transport ratios' used by the authors to compare the effects of the various codepletions. Each protein progresses beyond the Golgi during secretion, and the authors choose to calculate the ratio of cargo intensity at the plasma membrane normalized to the total cellular cargo. This means that the synchronized release assays and calculated 'transport ratios' assay not only ER to Golgi trafficking, but also trafficking from the Golgi to the plasma membrane. In instances where Sec23a-YFP overexpression does not fully rescue the codepletion, it is possible that additional trafficking delays occur during Golgi to plasma membrane trafficking that cause the 'transport score' to decrease. Thus, the 'transport score' as the authors calculate it is needlessly nonspecific to COPII trafficking and should not be used to compare the codepletions for COPII functional interactors.

      Reply: We agree that the “transport score” used here and in our previous genome-wide screen (Simpson et. al Nat. Cell Biol. 2012) does not allow us to distinguish between the individual transport substeps in the transport of VSVG from the ER to the plasma membrane. However, as we see in Fig 1E, the proteins that we have decided to follow in more detail in this study do have a clear ER transport block phenotype (except for CRKL). So for 6 out of 7 of these proteins, the images clearly show that the decrease in the “transport score” is due to a decreased ER to Golgi transport.

      #4 To mitigate unwanted contributions of post-COPII trafficking events from altering 'transport scores,' the authors should use a cargo for synchronized release assays that does not progress past the Golgi such as α-Mannosidase II and quantify a ratio of the perinuclear cargo signal to whole cell signal. Ideally, the screen would be repeated with a more appropriate cargo generating new 'transport scores' for the full list of cytoskeletal proteins. However, this may not be feasible, and as such 'transport scores' based on a Golgi resident protein should at least be produced for the 7 Sec23b functional interactors featured in this manuscript. These Golgi 'transport scores' would add much needed quantification of ER to Golgi transport delays that currently can only be inferred from the representative images in Figure 1E, which unfortunately show significant heterogeneity among cells from the same image. The authors should also explicitly state that any 'transport score' from a synchronous release assay using a cargo destined for the plasma membrane will take into account trafficking rate changes due not only to COPII, but also COPI from the ERGIC to the Golgi, and transport carriers departing from the TGN. These synchronized release assays would likely take between a few weeks to a few months depending on their ability to automate image analysis.

      Reply: We consider that having a “Golgi transport score” won't add any new information as the proteins that we have chosen to follow are the ones that show a strong ER-block phenotype. However, we agree that such a “Golgi score” would indeed be useful if one would like to study other interactors, for instance, the ones that induce transport acceleration.

      Also, we don't expect all cells to behave similarly as the level of knockdown might be slightly different or because of the cell to cell variability. Even in control conditions (no knockdown), this heterogeneity is evident. As suggested by the reviewer, in our revised manuscript we will explicitly state that a change in the transport scores could mean a change in any sub-step of the transport from the ER to the PM in our assay.

      \*Minor comments***

      It would be useful for the authors to quantify the number of focal adhesions present from Vinculin stains from Figure 4C and 5C instead of just showing representative images. It would be interesting to determine if there is a meaningful relationship between focal adhesion number induced by the codepletions or tissue culture coating and Sec23a expression levels like in Figure 3D. Generally, the figures, text, and references were appropriate.

      Reply: As also pointed out by the other reviewers we will quantify the FA changes

      Reviewer #3 (Significance (Required)):

      In recent years, significant effort has been devoted to elucidating mechanisms by which COPII trafficking is modulated in response to cellular cues. These studies have revealed that changes in nutrient availability, growth factors, ER stress, autophagy, and T-cell activation all cause changes in COPII trafficking via unique gene expression, splicing, or post-translational control (2-7). This work elucidates a novel mechanism of transcriptional control driven by focal adhesions. Additionally, it provides a number of potentially useful Sec23a and Sec23b functional interactors among cytoskeletal factors for further study. These unexplored factors may have unique mechanism of COPII regulation that could contribute to our understanding ER export modulation. Altogether, this and similar works are building an increasingly complex set of regulatory pathways that when integrated ultimately dictate COPII trafficking kinetics.

      The reported findings are not only relevant to those who study COPII trafficking, but also other fields where secretion is studied in the context of the ECM. This work would suggest that secretion of factors involved in crosstalk between cells, including in tumors, is likely to be controlled by the interactions of cells with ECM.

      Reply: We thank reviewer #3 for the comments and insightful discussion about the limitations of our assay that we will highlight in the revised manuscript and in general for the insight into the early secretory pathway regulation. Furthermore their explicit summary of how our study could bridge COPII trafficking, ECM signaling and the relevance to various pathophysiologies is highly appreciated.

      Expertise keywords: cell biology, light microscopy, membrane trafficking

      References

      1.Weigel A V., Chang CL, Shtengel G, Xu CS, Hoffman DP, Freeman M, et al. ER-to-Golgi protein delivery through an interwoven, tubular network extending from ER. Cell. 2021 Apr;184(9):2412-2429.e16.

      2.Farhan, H., Wendeler, M. W., Mitrovic, S., Fava, E., Silberberg, Y., Sharan, R., Zerial, M., & Hauri, H. P. (2010). **MAPK signaling to the early secretory pathway revealed by kinase/phosphatase functional screening. Journal of Cell Biology, 189(6), 997-1011.

      3.Zacharogianni, M., Kondylis, V., Tang, Y., Farhan, H., Xanthakis, D., Fuchs, F., Boutros, M., & Rabouille, C. (2011). ERK7 is a negative regulator of protein secretion in response to amino-acid starvation by modulating Sec16 membrane association. **EMBO Journal, 30(18), 3684-3700.

      4.Lillmann, K.D., V. Reiterer, F. Baschieri, J. Hoffmann, V. Millarte, M.A. Hauser, A. Mazza, N. Atias, D.F. Legler, R. Sharan, et al 2015. **Regulation of Sec16 levels and dynamics links proliferation and secretion. J. Cell Sci. 128:670-682.

      5.Liu, L., Cai, J., Wang, H., Liang, X., Zhou, Q., Ding, C., Zhu, Y., Fu, T., Guo, Q., Xu, Z., Xiao, L., Liu, J., Yin, Y., Fang, L., Xue, B., Wang, Y., Meng, Z. X., He, A., Li, J. L., ... Gan, Z. (2019). Coupling of COPII vesicle trafficking to nutrient availability by the IRE1α-XBP1s axis. Proceedings of the National Academy of Sciences of the United States of America, 116(24), 11776-11785.

      6.Jeong, Y.-T., Simoneschi, D., Keegan, S., Melville, D., Adler, N. S., Saraf, A., Florens, L., Washburn, M. P., Cavasotto, C. N., Fenyö, D., Cuervo, A. M., Rossi, M., & Pagano, M. (2018). The ULK1-FBXW5-SEC23B nexus controls autophagy. ELife, 1-25.

      7.Wilhelmi, I., Kanski, R., Neumann, A., Herdt, O., Hoff, F., Jacob, R., Preußner, M., & Heyd, F. (2016). Sec16 alternative splicing dynamically controls COPII transport efficiency. Nature Communications, 7, 12347. https://doi.org/10.1038/ncomms12347

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      4. Description of analyses that authors prefer not to carry out

      Reviewer #3 suggested to robustly characterise the early secretory pathway, in response to the depletion of our interactors, for instance, the role of MACF1 in the organization and the stability of ERES. This view is also supported by reviewer #1. However, in our revised manuscript we would like to focus more on the novel aspect of our study (as highlighted by all the reviewers), namely how ECM signaling and changes in FAs affect SEC23A and possibly ER transport. For this, we would like to present a more quantitative outlook of the FA phenotype and concentrate on the transport experiments. The reason for not dwelling into a more extensive characterization of the early secretory pathway is that these experiments are very interesting on their own, and merit a separate study that would deconvolve in detail the individual trafficking steps, and their relation to SEC23A expression, ERES stability, and ECM signaling.

      Reviewer #2 suggested that to better characterize the FA phenotype and solve the apparent discrepancies between our data and the literature, we could test FAK phosphorylation. As we mentioned in our reply to this point, we think that most of the discrepancies arise from the different cell types used. Nevertheless, we agree that a quantitative approach is needed for a better characterisation of FA phenotype, therefore we intend to perform quantification of the vinculin stainings.

    1. Author Response:

      Reviewer #1:

      In this manuscript titled "The LRRK2 G2019S mutation alters astrocyte-to-neuron communication via extracellular vesicles and induces neuron atrophy in a human iPSC-derived model of Parkinson's disease", Jacquet and colleagues investigated the role of Parkinsonism gene mutation LRRK2 G2019S in hiPSC-differentiated astrocytes. By isolating extracellular vesicles from ACM and examining astrocytes with various electron microscopy techniques, the authors found that LRRK2 G2019S affects the morphology and distribution of MVBs and the morphology of secreted EVs in hiPSC-differentiated astrocytes. Furthermore, the authors observed that astrocyte-derived EVs can be internalized by dopaminergic neurons and such EVs support neuronal survival. However, LRRK2 G2019S EVs lost the ability of promoting neuronal survival. This is an interesting study showing a non-cell autonomous contribution to dopaminergic neuron loss in PD.

      The proposed idea of how LRRK2 G2019S dysregulates EV-mediated astrocyte-to-neuron communication is novel and exciting. However, the authors present some conflicting data that is not addressed during the discussion: they first conclude upregulated exosome biogenesis by RNAseq in G2019S vs WT astrocytes, but later show a decrease in the number of <120nm particles in G2019S mutants suggesting a decrease in the classical exosome-sized vesicle secreted compared to WT. Lastly, their MVB images show less CD63 gold particles in G2019S compared to WT control (though this was not quantified). Do the authors suggest and increase or decrease in exosome biogenesis in G2019S mutants? How do they reconcile these seemingly contradicting data? Several experiments, controls and additional analyses are needed to fully demonstrate the validity of the proposed mechanism.

      The RNA-sequencing data of LRRK2 G2019S astrocytes showed an enrichment in genes associated with the “extracellular exosome” gene ontology term but not with the MVB/EV trafficking or secretion pathways. While we found CD82 and Rab27b to be upregulated, the classical biogenesis markers of MVB/EV trafficking and secretion (e.g. VTA1, VPS4, ALIX) were not dysregulated. Instead, the gene list shows an overwhelming dysregulation of genes coding for EV-enclosed proteins which do not have known roles in MVB/EV biogenesis or function (we now discuss this point in the main text, see highlights in italics below). As a result, we do not believe that exosome biogenesis is upregulated but instead propose the working hypothesis that the EV pathway may contribute to LRRK2 G2019S astrocyte dysfunction. To complement the sequencing data, our study provides a characterization of this pathway by (i) describing the cellular distribution of CD63+ structures in astrocytes, (ii) measuring the size of secreted EVs, and (iii) analyzing the neurotrophic potential of control and LRRK2 G2019S astrocyte-secreted EVs. We have not characterized the cellular biology of exosome/EV biogenesis in depth, and we do not propose a mechanism by which the LRRK2 G2019S mutation dysregulates these pathways. These questions are beyond the scope of our study, which is focused on the role of astrocytes in neurodegeneration.

      The reviewer also referred to the CD63 immunogold staining used in Figures 4C and 6A to localize MVBs. After careful quantification of the number of CD63 gold particles in WT and LRRK2 G2019S MVBs, we conclude that there are no differences between the two genotypes and we apologize for selecting non-representative images. We have now replaced these with representative images. Regarding the shift in the size of WT vs. LRRK2 G2019S vesicles, we complemented our cryo-EM analysis with new data generated using Nanoparticle Tracking Analysis (NTA) (Figure 3C,D). The NTA analysis enabled the quantification of a greater number of particles, and we found that both WT and LRRK2 G2019S astrocytes secrete a significant number of particles in the 0-120 nm range. The cryo-EM data suggested that mutant astrocytes secreted fewer particles in this size range, but this is not observed in the NTA analysis. This discrepancy could be explained by the following: (i) in contrast to cryo-EM, NTA does not distinguish EVs from cell debris, which could bias the quantification and increase the number of small particles quantified (Noble et al., 2020), and (ii) studies showed that the size distributions between NTA and cryo-EM differ, the latter enabling the identification of larger particles (Noble et al., 2020). These two techniques are therefore complementary in the study of secreted EV and our manuscript now presents data generated using these two approaches (Figure 3C-G) (see italicised text below).

      Results

      Expression of exosome components in iPSC-derived astrocytes is altered by the LRRK2 G2019S mutation Gene ontology (GO) analysis revealed that components of the extracellular compartment are up-regulated in LRRK2 G2019S astrocytes – these include GO terms corresponding to the extracellular region, extracellular matrix and extracellular exosomes (Figure 1D,F). The exosome component is one of the most significantly up-regulated GO terms in both isogenic and non-isogenic astrocytes, and is comprised of a total of 67 (isogenic pair) or 95 (non isogenic pair) genes (Supplementary Tables 1 and 2). The large majority (~ 98 %) of these gene products are described to be enclosed in exosomes (e.g. CBR1) but do not perform specific functions related to EV formation or secretion. Only a few genes are associated with exosome biogenesis (e.g. CD82) and trafficking (e.g. Rab27b) (Andreu & Yanez-Mo, 2014; Chiasserini et al., 2014; Ostrowski et al., 2010) and we did not detect differences in the expression of canonical factors that regulate MVB formation (e.g. VTA1, VPS4 or ALIX).

      Profiling WT and LRRK2 G2019S EVs secreted by iPSC-derived astrocytes

      The astrocyte-derived EV pellet is enriched in exosomes, as demonstrated by the expression of 8 exosomal markers and the absence of cellular contamination (Supplementary Figure 3D). NTA quantification showed that the number of secreted EVs does not differ between LRRK2 G2019S and isogenic control (Figure 3C), and it appears that LRRK2 G2019S particles have a slightly different size distribution compared to WT particles (Figure 3D). It should be noted that TEM and NTA are methods traditionally used to estimate the size distribution of EVs, but their accuracy is often challenged by sample processing artifacts and technical biases (Pegtel & Gould, 2019). To overcome these limitations, we complemented the NTA results with cryo-EM analysis of the size of EVs secreted by WT and LRRK2 G2019S isogenic astrocytes. EVs mostly displayed a circular morphology (as opposed to the cup-shaped morphology observed by TEM) (Figure 3E), but a variety of other shapes were also observed (Supplementary Figure 3E). Cryo-EM analysis confirmed that WT astrocyte-secreted EVs display a large range of sizes, from 80 nm to greater than 600 nm in diameter, with differences between WT and mutant populations (Figure 3F). The cryo-EM data suggested that mutant astrocytes secreted fewer particles in the 0-120 nm size range, and the discrepancy with the NTA results could be explained by the following: (i) in contrast to cryo-EM, NTA does not discriminate EVs from cell debris, which could bias the quantification and increase the number of small particles quantified (Noble et al., 2020), and (ii) studies showed that the size distributions between NTA and cryo EM differ, the latter enabling the identification of larger particles (Noble et al., 2020). However, cryo-EM is a low throughput methodology that limits data collection to a small sample size and has therefore a lower statistical power than NTA. Quantification of the number of simple vs. multiple EV structures did not reveal differences between the two lines, and represent up to 16% of the EV population (Figure 3G). We then sought to complement our EV profiling experiments with an analysis of secreted CD63+ particles, which form one of the known exosomal sub-populations. We previously showed that WT and LRRK2 G2019S MVBs contain similar levels of the CD63 tetraspanin (Figure 2E, Supplementary Figure 3A,B), and an ELISAbased quantification confirmed that the number of CD63+EVs remained unchanged between the two genotypes (Supplementary Figure 3F). We conclude from these results that the total number and morphology of EVs produced by WT and LRRK2 G2019S astrocytes are similar, but mutant EVs may have a different size distribution compared to WT vesicles.

      Major concerns:

      1) In figure 1 A authors demonstrate iPSC-derived astrocytes characterization. Since there is no one unified and validated method for astrocytes differentiation, there is a need for more accurate characterization of iPSC-derived astrocytes. Authors should demonstrate the percentage of cells positive to astrocytic markers and to prove that obtained astrocytes are functional (able to promote synaptogenesis and uptake glutamate). I would also recommend analyzing the iPSC-derived astrocyte cultures for expression of more specific astrocytic markers as GLT1, SOX9 in addition to those which have been analyzed. Moreover, it is highly important to know what is the proportion of astrocytes derived from LRRK2 G2019S line and its isogenic control in order to be able to compare their effect on neurons.

      We thank the reviewer for these suggestions. It is true that there exist many different astrocyte differentiation protocols, and this study uses a protocol developed by TCW et al. that has been further optimized by our lab to derive astrocytes from a midbrain-patterned population of neural progenitor cells (NPCs) (de Rus Jacquet, 2019; Tcw et al., 2017). The protocol is published, and shows that these astrocytes are functional – they respond to inflammatory factors and alter secretion of the IL-6 cytokine. Furthermore, Supplementary Figure 2D shows a whole transcriptome analysis (by RNA-seq) of the cell populations produced for this study and demonstrates that iPSC-derived astrocytes cluster with human primary midbrain astrocytes and away from iPSCs or NPCs in an unsupervised cluster analysis. However, we agree that in-depth characterization of iPSC-derived astrocytes is essential, and the updated manuscript now shows that (i) the astrocyte differentiation protocol yields 100 % GFAP+ cells with both WT and mutant lines (Supplementary Figure 2B), (ii) expression of six astrocyte markers (GLT1, SOX9, APOE, BHLHE41, CD44, GLUD1) (Supplementary Figure 2Aii, B), as well as (iii) transient intracellular calcium signaling (Supplementary Figure 2E), and (iv) synaptosome uptake (Supplementary Figure 2F) in both WT and LRRK2 G2019S astrocytes. We also updated the text as follows (italicised):

      Results section

      Midbrain-patterned NPCs carrying the LRRK2 G2019S mutation or its isogenic control were differentiated into astrocytes as described previously (de Rus Jacquet, 2019; Tcw et al., 2017). As expected, astrocytes expressed the markers GFAP, vimentin, and CD44 as demonstrated by immunofluorescence (Figure 1A) and flow cytometry analyses (Supplementary Figure 2A). Differentiation was equally effective in WT and LRRK2 G2019S cells, with 100 % of the differentiated astrocytes expressing GFAP (Supplementary Figure 2Bi). To further demonstrate the successful differentiation of iPSCs into astrocytes, we analyzed gene expression using RNA-sequencing analysis (RNA-seq), including primary human midbrain astrocyte samples in the RNA-seq study to serve as a positive control for human astrocyte identity. iPSC-derived and human midbrain astrocytes expressed similar levels of genes markers of astrocyte identity, including SOX9 and GLUT1 (Supplementary Figure 2B). In addition, principal component and unsupervised cluster analyses separated undifferentiated iPSCs, iPSC-derived NPCs and iPSC-derived astrocytes into independent clusters, demonstrating that our differentiation strategy produces distinct cell types (Supplementary Figure 2C-D). Importantly, the transcriptome of iPSC-derived astrocytes showed more similarities to fetal human midbrain astrocytes than to NPCs or iPSCs, further validating their astrocyte identity (Supplementary Figure 2D). Lastly, control and LRRK2 G2019S astrocytes showed classic astrocytic functional phenotypes such as spontaneeous and transient calcium signaling and synaptosome uptake (Supplementary Figure 2E-F).

      2) In Figure 1, the authors found a significant upregulation of exosome components in astrocytes, demonstrating an important role of LRRK2 G2019S in EV signaling pathway. In the discussion, the authors briefly mentioned 'sub-populations of CD63- EVs may be differentially secreted in mutant astrocytes'. Since the authors have obtained the RNA-seq data, it would be nice to dig deep into the data and comment on potential EV sub-populations which can be differentially secreted. This information can be very beneficial for follow-up studies in the PD and LRRK2 field. Furthermore, the authors should assess the expression of Rab27a and CD82 in WT and LRRK2 G2019S astrocytes by western blots to verify RT-qPCR data. Furthermore, the authors should present specifically exosome biogenesis or secretion genes are altered to provide further insight into the stage of exosome biogenesis that is affected (ESCRT0-3, VPS4, ALIX, etc).

      In the first comment, the reviewer refers to the observation that the number of total and CD63-positive EVs secreted by astrocytes is unchanged between the WT and LRRK2 G2019S genotypes. The classification of different EV sub-populations based on marker proteins is an evolving field of research, and an important study by Kowal et al. defined generic and sub population-specific EV markers (Kowal et al., 2016). Our RNA-seq dataset revealed five upregulated genes identified in the Kowal study, namely actin, GAPDH, actinin, complement and fibronectin, but unfortunately there is no clear pattern correlated with specific EV sub populations. For example, actin and GAPDH are two upregulated proteins that can be found in multiple types of EVs, actinin is enriched in large and medium-sized EVs, and complement and fibronectin are enriched in high density but small EVs (Kowal et al., 2016). The majority of dysregulated genes identified in our sequencing experiment are not proteins classically used to categorize EVs, so unfortunately our data does not allow us to address the reviewer’s question. To make sure that the data is readily accessible to the scientific community, we have prepared a supplementary table with a list of extracellular exosome-related genes identified in the RNA sequencing study. To respond to the reviewer’s comment on a specific stage of EV biogenesis/secretion altered in LRRK2 G2019S, the sequencing data presented in this manuscript does not allow to conclude that there is such a dysregulation. Our gene list corresponding to the “extracellular exosome” gene ontology term contains a large majority of genes coding for proteins enclosed within EVs that do not play a role in biogenesis/secretion. For example, the gene list does not contain ESCRT0-3, VPS4, ALIX or other classical markers involved in EV biogenesis and we cannot conclude anything about the alteration of MVB/EV biogenesis or defects in specific stages of MVB trafficking or EV secretion. In addition, we thank the reviewer for suggesting the validation of RT-qPCR data by western blot. The purpose of the RT-qPCR experiment was to validate the gene expression data collected by RNA-seq. Given that our objective was to confirm gene expression levels, and that we do not further study CD82 and Rab27b, we think that collecting protein expression levels is not necessary in the context of this study.

      We agree with the reviewer’s suggestion, and we added images showing the subcellular localization of CD63 in both WT and LRRK2 G2019S MVBs by immunogold staining (Figure 2E). Validation experiments available in Figure 1A and Supplementary Figure 2A and 2B confirm that our astrocytes express CD44 as well as markers of mature astrocytes (BHLHE41, SOX9, GLUT1, APOE, GLUD1). The reason for showing CD44 instead of a more mature marker such as GFAP in Figure 2 of the manuscript is because CD44 is a membrane marker, and it therefore enables a clear visualization of the astrocyte surface area. We also note that, as shown in Supplementary Figure 2Aii, iPSC-derived and human fetal astrocytes express CD44, but iPSCs and NPCs do not significantly express this marker gene. In addition, as suggested by the reviewer, we added information related to MVB maturation in the introduction and the new text reads as follows (changes in italics):

      Introduction section The sorting and loading of exosome cargo is an active and regulated process (Temoche-Diaz et al., 2019), and the regulatory factors involved in EV/exosome biogenesis are just beginning to be identified. Among the well-known factors, Rab proteins are essential mediators of MVB trafficking and they regulate endosomal MVB formation/maturation as well as microvesicle budding directly from the plasma membrane (Pegtel & Gould, 2019; T. Wang et al., 2014). In addition, membrane remodeling is an essential aspect of MVB/EV formation that appears to be regulated, at least in part, by the endosomal sorting complex required for transport (ESCRT) machinery (Pegtel & Gould, 2019; Schoneberg, Lee, Iwasa, & Hurley, 2017).

      3) In Figure 2A and B, data shows that both WT and LRRK2 G2019S astrocytes produce MVBs and MVBs in LRRK2 G2019S astrocytes is smaller than in WT astrocytes. In Figure 2E, the authors showed the abundance of CD63 localized within MVBs in WT astrocytes but did not show the CD63 localization in MVBs in G2019S astrocytes. However, it is important to show CD63 localization in MVBs in G2019S astrocytes to fully support the conclusion that CE63+ MVBs are present in LRRK2 G2019S astrocytes. In addition, CD44 is a marker for astrocyte-restricted precursor cells. Although CD44+ positive cells are committed to give rise to astrocytes, it is crucial to include another astrocyte marker to ensure these cells are indeed mature astrocytes. -Related, authors should consider citing some of the MVB maturation literature to guide the readers.

      We agree with the reviewer’s suggestion, and we added images showing the subcellular localization of CD63 in both WT and LRRK2 G2019S MVBs by immunogold staining (Figure 2E). Validation experiments available in Figure 1A and Supplementary Figure 2A and 2B confirm that our astrocytes express CD44 as well as markers of mature astrocytes (BHLHE41, SOX9, GLUT1, APOE, GLUD1). The reason for showing CD44 instead of a more mature marker such as GFAP in Figure 2 of the manuscript is because CD44 is a membrane marker, and it therefore enables a clear visualization of the astrocyte surface area. We also note that, as shown in Supplementary Figure 2Aii, iPSC-derived and human fetal astrocytes express CD44, but iPSCs and NPCs do not significantly express this marker gene. In addition, as suggested by the reviewer, we added information related to MVB maturation in the introduction and the new text reads as follows (changes in italics):

      Introduction section

      The sorting and loading of exosome cargo is an active and regulated process (Temoche-Diaz et al., 2019), and the regulatory factors involved in EV/exosome biogenesis are just beginning to be identified. Among the well-known factors, Rab proteins are essential mediators of MVB trafficking and they regulate endosomal MVB formation/maturation as well as microvesicle budding directly from the plasma membrane (Pegtel & Gould, 2019; T. Wang et al., 2014). In addition, membrane remodeling is an essential aspect of MVB/EV formation that appears to be regulated, at least in part, by the endosomal sorting complex required for transport (ESCRT) machinery (Pegtel & Gould, 2019; Schoneberg, Lee, Iwasa, & Hurley, 2017).

      4) In Figure 3, it is impressive that the authors are able to image EVs using cyro-EM approach and analyze their sizes. The authors also observed different shapes of EVs. Is there any shape difference between WT EVs and G2019S EVs? Is there a way that the authors could categorize these shapes and do a detailed analysis in EV shapes? Also, In Figure 3D, both WT EV and G2019S EV images should present side by side for comparison. -Related, the size frequencies of EVs presented suggest a difference in the types of EV's released. Interestingly, exosomes are classically known to range from ~50-120nm and this population is significantly decreased in G2019S compared to WT. What does this suggest?

      As suggested by the reviewer, we classified the two main EV shapes as “simple” and “multiple” EVs, and found no quantitative differences between WT and LRRK2 G2019S. This new data and side-by-side images of WT and LRRK2 G2019S EV images are available in Figure 3E-G, and the text has been updated accordingly (see text in italics below). One of the observations of Figure 3 is that there exist genotype-specific differences in the size distribution of EVs, which suggests that different classes of vesicles may be preferably produced by WT vs. LRRK2 G2019S astrocytes. This could be the result of differences in dynamics related to cargo loading, or a shift from MVB-released exosomes to membrane budding and microvesicle production. These observations are of great interest and we added a short discussion (in italics below) but they are beyond the scope of this study focused on EV neurotrophic properties, and we do not currently have evidence to support these hypotheses.

      Results - LRRK2 G2019S affects the size of EVs secreted by iPSC-derived astrocytes

      EVs mostly displayed a circular morphology (as opposed to the cup-shaped morphology observed by TEM) (Figure 3E), but a variety of other shapes were also observed (Supplementary Figure 3C). (…) Quantification of the number of simple vs. multiple EV structures did not differ between the two lines, and represent up to 16 % of the EV population (Figure 3G).

      Discussion – Dysregulation of iPSC-derived astrocyte-mediated EV biogenesis in Parkinson’s disease

      The observation that LRRK2 G2019S MVBs are less frequently located in the perinuclear area suggests that they may spend less time loading cargo at the Trans-Golgi network, which could in turn produce smaller MVBs and EVs with a different size range compared to WT (Edgar, Eden, & Futter, 2014; Pegtel & Gould, 2019). We did not observe a difference in the number of secreted EVs (total and CD63+ subpopulation) between WT and LRRK2 G2019S astrocytes (Figure 3C,H), suggesting that the secretion of at least one population of EVs is independent of the astrocyte genotype.

      5) In figure 3c, SBI ELISA claims to quantify CD63+ vesicles, the authors should present more standardized particle quantification data (either by CD63 FACs for isolated EVs in WT vs G2019S or ZetaView/QNano particle tracking). The authors should also directly quantify the total number of EVs secreted in WT vs G2019S conditions (not only CD63+).

      The updated manuscript now contains the NTA analysis of WT and LRRK2 G2019S EVs (Figure 3C,D) which provides the total number of EVs secreted by WT and LRRK2 G2019S astrocytes.

      6) In Figure 4, the authors quantify LRRK2+/CD63+ particles by imaging. Importantly, it appears that there are less CD63 "large gold" particles in MVB of G2019S compared to control. This CD63 baseline quantification in MVB of WT vs. G2019S should be presented in this figure. These data are not convincing and should be quantified by FACS in secreted EV. Supplementary figure 3 should be brought into this figure.

      As suggested by the reviewer, we quantified the number of CD63 large gold particles per MVB in WT and LRRK2 G2019S lines (Supplementary Figure 3A,B), and we re-introduced Supplementary Figure 3 into the main text (Figure 4E). We also updated the text (see in italics below). Additionally, we present extensive quantification of LRRK2 levels in MVBs and secreted EVs via imaging and biochemical analysis (ELISA), two different but complementary analytical methods.

      Results - LRRK2 G2019S affects the size of MVBs in iPSC-derived astrocytes

      Tetraspanins are transmembrane proteins, and the tetraspanin CD63 is enriched in exosomes and widely used as an exosomal marker (Escola et al., 1998; Men et al., 2019). However, cell type specificities in the expression of exosomal markers such as CD63 have been documented (Jorgensen et al., 2013; Yoshioka et al., 2013). We therefore confirmed the presence of CD63- positive MVBs in iPSC-derived isogenic astrocytes by immunofluorescence (Figure 2D) and immunogold electron microscopy (IEM) (Figure 2E). Analysis of IEM images showed an abundance and similar levels of CD63 localized within MVBs in WT and LRRK2 G2019S astrocytes (Figure 2E, Supplementary Figure 3A,B), confirming that CD63 can be used as a marker of MVBs and exosomes in iPSC-derived astrocytes.

      7) In Figure 5, using CD63 as a MVB marker is not the most accurate approach. ESCRT markers should be co-stained with these experiments to truly show MVB localization (CD63 can localize to MVBs but is known to have a wider distribution throughout the cell compared to TSG1010 or other ESCRT complex proteins). Additionally, the authors must show their Supplemental Figure 3 ELISA quantification of p-aSyn in this main figure, and comment on why they conclude higher p-aSyn content in MVBs based on their IEM but then find no differences in aSyn in secreted EVs in WT vs. G2019S by ELISA.

      We thank the reviewer for the suggestion to use ESCRT proteins as MVB markers. We decided to use CD63 because it is recognized in the literature as an MVB and EV marker (Beatty, 2008; Edgar et al., 2014), and we now refer to these two studies in the manuscript to support this choice (see text in italics below). Using ESCRT complex proteins as MVB markers is an interesting alternative, but we note that proteins associated with this complex are also found to regulate other biological processes such as autophagy (Takahashi et al., 2018) and plasma membrane repair (Jimenez et al., 2014), and so they can co-localize to non-MVB structures (e.g. autophagosomes or plasma membrane). Similarly, TSG101 can also localize to non-MVB structures such as the nucleus and Golgi complex (Xie, Li, & Cohen, 1998), and also lipid droplet (LD) membranes where it promotes LD-mitochondria contact (J. Wang et al., 2021). As suggested by the reviewer, Supplemental Figure 3 has been re-introduced into the main text (Figure 6C). Regarding αSyn, the immunogold staining specifically detects the phosphorylated form of αSyn (p-αSyn), while the ELISA detects all forms of αSyn (total αSyn). We observed increased p-αSyn in LRRK2 G2019S MVBs, but similar levels of total αSyn in WT vs LRRK2 G2019S EVs. This observation suggests that the phosphorylated form of αSyn, but not the total amount of αSyn, is affected by the experimental conditions. The text has been updated and reads as follows (changes in italics).

      Results - LRRK2 is associated with MVBs and EVs in iPSC-derived astrocytes

      In light of our observations that mutations in LRRK2 result in altered astrocytic MVB and EV phenotypes, we asked if LRRK2 is directly associated with MVBs in astrocytes and if this association is altered by the LRRK2 G2019S mutation. We analyzed and quantified the co localization of LRRK2 with CD63 (Figure 4A), a marker for MVBs (Beatty, 2008; Edgar et al., 2014), and found that the proportion of LRRK2+ /CD63+ structures remains unchanged between WT and LRRK2 G2019S isogenic astrocytes (Figure 4B).

      Results - The LRRK2 G2019S mutation increases the amount of phosphorylated alpha synuclein (Ser129) in MVBs

      Since the MVB/EV secretion pathway is altered in our LRRK2 G2019S model of PD, we reasoned that mutant astrocytes might produce αSyn-enriched EVs by accumulating the protein in its native or phosphorylated form in MVBs or EVs. IEM analysis revealed an abundance of p-αSyn (small gold) inside and in the vicinity of MVBs of LRRK2 G2019S iPSC-derived astrocytes, but not isogenic control astrocytes (Figure 6A). We observed that 55 % of the CD63+ (large gold) MVBs in LRRK2 G2019S astrocytes are also p-αSyn+ (small gold), compared to only 16 % in WT MVBs. LRRK2 G2019S astrocytes contained on average 1.3 p-αSyn small gold particles per MVB compared to only 0.16 small gold particles in isogenic control astrocytes, and MVB populations containing more than 3 p-αSyn small gold particles were only observed in LRRK2 G2019S astrocytes (Figure 6B). When we analyzed the content of EVs by ELISA, we found that total αSyn levels (phosphorylated and non-phosphorylated) in EV-enriched fractions are similar between isogenic controls and LRRK2 G2019S (Figure 6C). These results suggest that astrocytes secrete αSyn-containing EVs, and the LRRK2 G2019S mutation appears to alter the ratio of p-αSyn/total αSyn in MVB-related astrocyte secretory pathways.

      8) In figure 6, it is even more clear that there is a stark difference between the CD63 presence in/near MVBs between WT and G2019S conditions. Since the authors normalize several pieces of data to CD63 (MVB localization, LRRK2 co-localization, etc), it is critical to quantify the number of baseline CD63 gold particles in MVBs in WT vs G2019S.

      After careful quantification of the number of CD63 gold particles in WT and LRRK2 G2019S MVBs (available in Supplementary Figure 3A,B), we conclude that there are no significant differences between the two genotypes, and the MVB images initially selected in Figure 6 are not representative. We therefore replaced Figure 6A with new images.

      9) In Figure 7, the authors used the co-culture of astrocytes and neurons to assess astrocyte-derived EV uptake by dopaminergic neurons. Although 3D reconstitution of neurons and exosomes can be precise, the data may not be 100% clean. It would be better if the authors collect ACM containing EV fraction from WT astrocyte and G2019S astrocytes and then incubate dopaminergic neurons with ACM containing EV fraction. In this way, only dopaminergic neurons are in the culture and there will be no CD63-GFP expressed astrocytes to contaminate the CD63-GFP signal in neurons.

      We understand the concerns raised by the reviewer, and we can ensure that state-of-the-art imaging technologies and image post-processing techniques have been used to prevent astrocytic CD63 signal from contaminating the neuronal signal. We performed confocal microscopy with a 63X oil objective lens (numerical aperture = 1.4), and the images were processed with a Gaussian Filter (0.18 μm filter width) to reduce background noise in the MAP2 channel, and deconvolved (10 iteration) to enhance confocal image resolution in the CD63 channel. Furthermore, CD63-positive structures were detected with background subtraction enabled.

      10) In Figure 9, the authors must show their ACM control. They show untreated, EV-free, and EV-rich ACM, but do not show unmanipulated ACM control.

      The results of dendrite length analysis for unmanipulated ACM was initially available in Figures 8E and 8F. For clarity, we prepared a new Figure 9 that shows treatment with unmanipulated ACM, EV-free ACM, and EV-enriched fractions.

      Reviewer #2:

      In this manuscript by de Rus Jacquet et al., authors present an interesting study to detect changes in extracellular vesicles in human PD patient derived iPSC-derived astrocytes carrying the LRRK2 G2019S mutation. Isogenic gene corrected iPSCs were used as controls in all experiments. Authors first performed RNA-Seq for global gene expression changes between G2019S and "WT" gene corrected astrocytes. GO analysis showed an upregulation of extracellular compartments (including exosome compartments) in LRRK2 astrocytes. Subsequent experiments focusing on extracellular vesicles (EVs) and multivesicular bodies (MVBs), showed specific differences of MVB area and the size of secreted EVs. Secreted EVs from G2019S astrocytes also contained more LRRK2 particles and G2019S EVs contained more phosphorylated aSyn particles. Co-culture of LRRK2 astrocytes with human dopamine neurons showed accumulation of CD63+ exosomes in neurites, compared to co-culture with WT astrocytes. Co-culture with LRRK2 astrocytes decreased viability of TH+ neurons and LRRK2 dendrites/neurites were also shorter. These co-culture findings were replicated using EV-enriched conditioned media. Finally, authors showed that the trophic effect of astrocytes on neurons was due both to soluble factors released into the media, and production and release of EVs. Overall, this is a well-written and systematically performed study. This reviewer has several comments as detailed below.

      1) Based on their data, authors conclude that astrocyte-to-neuron signaling and trophic support mediated by EVs is disrupted in LRRK2 G2019S astrocytes. Have authors measured the differences in trophic factors released by LRRK2 astrocytes in EVs and in conditioned media?

      This is an important question, and we have not measured the levels of various neurotrophic factors in the medium. We concluded that LRRK2 G2019S astrocytes failed to secrete neurotrophic factors based on the neuron viability data. Healthy neurons cultured with disease astrocytes displayed dendrite shortening equivalent to that of neurons cultured in basal medium lacking neurotrophic factors. Furthermore, the morphological alterations occurred over a long period of time (2 weeks) and did not recapitulate the rapid and high level of neuron death and neurite fragmentation typically observed as a result of exposure to neurotoxins (Liddelow et al., 2017). However, we performed a new analysis of our RNA-seq data and identified dysregulated trophic processes of interest in LRRK2 G2019S astrocytes.

      2) Authors differentiate cells (astrocytes and neurons) from midbrain lineage NPCs. The data show convincing effects of the LRRK2 derived astrocytes on neurons, but one question is whether this is specific to dopaminergic cells. Would this genotype specific effect also be expected in other lineages, e.g. cortical neurons? Authors should discuss this point.

      The reviewer is making an excellent point. We prepared mouse primary midbrain cultures, and co-cultured WT midbrain neurons with WT or LRRK2 G2019S astrocytes. We found that the survival of WT midbrain dopaminergic neurons was significantly affected by LRRK2 G2019S astrocytes, but the viability of non-dopaminergic midbrain neurons was not changed when co cultured with WT or disease astrocytes. A previous study by di Domenico et al. also showed that dopaminergic neurons are more sensitive to the effect of LRRK2 G2019S astrocytes compared to non-dopaminergic cell types (di Domenico et al., 2019).

      3) Prior work has demonstrated reductions in neurite length in neurons derived from LRRK2 G2019S iPSCs (not specific to dopaminergic neurons in LRRK2 cells) (for example Reinhard et al 2013). It is curious that the LRRK2 G2019S mutation itself can cause such a phenotype in neurons mono-cultures, and as shown in the current study, that LRRK2 G2019S astrocytes also induce a similar effect on WT neurons in co-culture. Can authors expand on this point in the Discussion?

      We thank the reviewer for this question, and we added a new point of discussion in our manuscript, which reads as follows (changes in italics):

      Evidence from this study and previous reports indicates that the LRRK2 G2019S mutation affects neurons through a variety of mechanisms. Here, we show a non-cell autonomous effect on neuronal viability via impairment of essential astrocyte-to-neuron trophic signaling, but the LRRK2 G2019S mutation can also mediate cell-autonomous dopaminergic neurodegeneration (Reinhardt et al., 2013). These observations support the idea that the LRRK2 kinase may be involved in a large number of pathways essential to maintain cellular function, cell-cell communication and brain homeostasis, and disruption of LRRK2 in one cell type has cascading effects on other neighboring cell types. In conclusion, our study suggests a novel effect of the PD-related mutation LRRK2 G2019S in astrocytes, and in their ability to support dopaminergic neurons. This study supports a model of astrocyte-to-neuron signaling and trophic support mediated by EVs, and dysregulation of this pathway contributes to LRRK2 G2019S astrocyte mediated dopaminergic neuron atrophy.

      4) Authors should provide data on % dopaminergic neurons generated in the cultures.

      We agree that this is important information, and we updated the latest version of the manuscript with this information (see below in italics). We estimate that the neuron cultures consist of 50 to 70 % dopaminergic neurons, and they are depleted of non-neuronal cells as explained in Material and Methods.

      Material and Methods - Preparation and culture of iPSC-derived NPCs, dopaminergic neurons and astrocytes

      To isolate a pure neuronal population, the cells were harvested in Accumax medium, diluted to a density of 1 × 106 cells in 100 µl MACS buffer (HBSS, 1 % v/v sodium pyruvate, 1 % GlutaMAX, 100 U/ml penicillin/streptomycin, 1 % HEPES, 0.5 % bovine serum albumin) supplemented with CD133 antibody (5 % v/v, BD Biosciences, San Jose, CA, cat. # 566596), and the CD133+ NPCs were depleted by magnetic-activated cell sorting (MACS) using an LD depletion column (Miltenyi Biotech, San Diego, CA), as described previously (de Rus Jacquet, 2019). The final cultures are depleted of non-neuronal cells and contain approximately 70 % dopaminergic neurons, the remaining neurons consisting of uncharacterized non-dopaminergic populations.

      5) p7. Authors refer to phosphorylated a-synuclein as accelerating PD pathogenesis, but the references cited do not show this. In fact, Gorbatyuk et al 2008, showed that overexpression of S129 with constitutive phosphorylation eliminated a-synuclein induced nigrostriatal degeneration. The Fujiwara et al 2002 reference showed the presence of phospho a-syunclein in Lewy bodies and neurites. Authors should revise their statement that phospho a-synuclein is associated with accelerated pathology.

      The reviewer is correct. We meant to highlight that there is a correlation between phosphorylated αSyn levels and PD pathogenesis, not that phosphorylated αSyn causes an acceleration of PD pathogenesis. We rephrased the sentence as follows, and replaced the study by Gorbatyuk et al. with a study by Anderson et al. that shows presence of phosphorylated αSyn in Lewy bodies (new text in italics):

      EVs isolated from the biofluids of PD patients exhibit accumulation of αSyn (Lamontagne Proulx et al., 2019; Shi et al., 2014; Zhao et al., 2018), a hallmark protein whose phosphorylation at the serine residue 129 (p-αSyn) is correlated with PD pathogenesis (Anderson et al., 2006; Fujiwara et al., 2002).

      6) Please provide details on the number of iPSC lines used for these experiments.

      Experiments in the first version of this manuscript were performed using a single LRRK2 G2019S iPSC line and its gene-corrected control. The manuscript now presents the results collected using a second, independent non-isogenic iPSC line, as well as mouse primary cultures.

      7) Clarify whether the WT neurons used for co-culture were derived from the isogenic human neurons?

      We confirm that the WT neurons used for co-culture experiments were derived from isogenic controls. We added subtitles to our figures to clarify when data show results from isogenic or non-isogenic iPSC-derived cells.

    1. While seemingly open-ended and allowing for an infinite recombination of elements, the idea of “vibes” is reductive. It discourages the more difficult work of interpretation and the search for meaning that defines human experience. It diverts attention away from narrative and moral implications in favor of foregrounding the idea of affect as inexplicable, ineffable — a matter of chance correlation of elements rather than something that requires deliberate causal explanation. The vibes framework may hone our abilities to identify settings like “cozy” or “cursed,” but it doesn’t give instructions on how we might build them or avoid them in our lives. As an analytic, vibes don’t connect feelings and consequence; as such, it is symbiotic with passive modes of media consumption.

      Wow, I hate this. How is the work of interpretation discouraged? Giving vague description to something doesn't preclude better description; to encourage people to express the idea that there's something coherent about, well, something is to create the space for further interpretation. A vibe is a term for a fetal stage, something emergent still emerging. If you already had a better name for it you'd use that. Articulating that you think there's a there there is a meaningful step! (this is where I would make a joke about attention mechanisms in deep learning if I were committed to the author's schtick) We can analyze whether cottagecore is fashy because people recognized a vibe and nurtured it into a whole... thing. (A thing that is sometimes fashy)

    1. This manuscript is in revision at eLife

      The decision letter after peer review, sent to the authors on June 30 2020, follows:

      Summary:

      This paper addresses a critical issue in neuroscience: what's the question, and can we answer it? The questions the author proposes are ones that have been considered, in one form or another, reasonably often by experimentalists. And the author shows rigorously that there's a reasonable chance that they are simply not answerable.

      Essential Revisions:

      We believe that this is an extremely important issue, and the approach the paper takes is a reasonable one for addressing it. Our main worry, though, is that mainstream neuroscientists will ignore it, for two reasons. One is that it's not a message they want to hear. Second, the example circuits are sufficiently abstract that they can be dismissed as yet another musing by your typical uninformed theorists. (That is not, we should emphasize, our view, but it's not an uncommon one in the field.) Our goal, therefore, is to fix these potential problems, so that people will have to pay attention.

      The premise of the paper is that if you understand a neural circuit, there are certain questions about it that you should be able to answer. The author proposes six such questions, and then shows that in the worst case they are exponentially (in the number of neurons) hard to answer.

      The success of this program hinges on two things: a sensible set of questions, and a demonstration that answering those questions is hard. We're not ecstatic about the questions, but we believe that's not an insurmountable issue (more on that below). More problematic is the result that the questions are hard to answer. What's really shown is that there is at least one circuit for which, in the worst case, answering the questions is exponentially hard. While this is certainly correct, it doesn't make a convincing case that answering these questions will be hard in the brain. First, the worst case may not be the typical case. The 3 SAT problem, for instance, is NP complete, but is hard to solve only for a narrow range of parameters. Second, answering the questions for actual circuits found in the brain may not even be exponentially hard in the worst case.

      This brings us to two critical comments. First, it needs to be crystal clear that this paper does not demonstrate that answering the proposed questions is guaranteed to be exponentially hard, only that it might be. This was stated in the manuscript, but not emphasized. For instance, on lines 138-140, it says

      "Using techniques from Computational Complexity Theory, we ask what is the smallest number of experiments necessary, in general, in order to answer these questions, in typical experimental settings."

      Here "in general" means worst-case. For neuroscientists, though, "in general" means "most of the time". It should be clear that you mean worst case, and that the typical case may be very different.

      In fact, this needs an expanded discussion. Whether or not it will take an exponential number of experiments to answer the questions depends on the circuit. We might get lucky, and only a small number of experiments are needed. Or we might get unlucky, and a large number are needed. This analysis can't tell us that, and this should be clear in the paper.

      Second, what's really needed is the analysis of a more realistic circuit, ideally with both positive examples (for which it is possible to answer the questions) and negative examples (for which it isn't). This is hard, but we have a few suggestions, some of which can probably be done without a huge amount of work.

      a. Linear network, y=Ax+noise. For this (and possibly in all realistic) situations, "perform the task" needs to be replaced by "achieve a certain level of performance". For instance, if there's a true mapping y=f(x), then "perform the task" would be replaced with "<(y-y*)^2> below some threshold". The questions should be answerable in polynomial time for this network; otherwise, one should worry.

      b. In 2000, Hopfield and Brody came up with a simple circuit which we think of as "understandable" (Hopfield and Brody, PNAS 97:25, 13919-13924, 2000). It would be nice to determine whether the questions can be answered in polynomial time for this circuit. Again, if they can't, one should worry.

      c. Deep networks. Again, "perform the task" would have to be replaced with "performance is above some particular threshold". Here we suspect that the questions are not answerable; if that could be shown, it would be a huge step forward.

      d. A made-up model of a deep network. Assume that in a deep network, whenever you delete a neuron, performance drops. That's probably not so far from the truth -- and also not so far from what we think would happen in the brain. (With some exceptions; occasionally I hear talks where performance is better when two areas are ablated rather than just one, but let's ignore that.) How much performance drops depends, of course, on which neurons are deleted, so there's not a simple mapping between performance and which neurons are present in the circuit. Can the questions be answered in this case? This sounds like a problem computer scientists have considered, so possibly rigorous analysis could be done.

      We believe it's critical to consider a case that is not far from what one might find in the brain. Otherwise, it will be too easy to dismiss this work as irrelevant to real neuroscience. The above are only possibilities, and a and d may be pretty easy, but the author is welcome to come up with his own examples. Note that rigor is not absolutely necessary here, since there's already one rigorous example. Plausible arguments would be fine.

      Finally, "understand" needs further discussion. That's partly because the approach here is a little non-standard. Most people try to directly define "understanding". Instead, the statement is "if you understand a circuit, you should be able to answer these questions". This has to be made crystal clear -- especially since people aren't expecting it. In addition, a discussion of the more standard approach, a direct definition, should be included. The usual definition is something like "A short description of what is being computed, along with a description of an algorithm for computing it." It should be clear how this, more standard, definition compares to the one in the paper. For instance, under the standard definition it may be possible to understand a circuit without being able to answer any of the questions. For instance, I believe we can "understand" (by the more standard definition) the synfire chain circuit. This doesn't mean that one definition is better than the other, but their differences should be acknowledged.

    1. Namaste

      I live and teach in Qatar (Qatar was previously known as a small oil-rich country in the Middle East but is now famous as the host country for FIFA World Cup 2022). At my school, we have many Indian and South Asian students who I am sure would totally understand and appreciate the use of “Namaste” as a concept for a workshop classroom.

      An equivalent to Namaste would be “Salaam” for my teaching context in Qatar. Salaam is a common greeting in Muslim countries. It means wishing “peace” and it carries intentional respect and kindness toward the other person. For a writing workshop classroom, Salaam can be understood as a peace-based concept that encourages safe, healthy, respectful dialogue and that discourages verbal or written “attack” on a student’s writing. If I were to do a writing workshop with my students in the future, I may consider replacing Namaste with Salaam for we have a very high Muslim student population in our school. That said, I am actually reminded of a Bollywood movie titled “Salaam Namaste.” Isn’t that interesting? Salaam Namaste would mean “peaceful greetings of respect and kindness.”

      As long as we are creating and maintaining the kindness culture in our classrooms, I think we can safely play with the word choice, depending on our location, culture, teaching situation, students and other preferences.

    1. Author Response:

      Reviewer #2 (Public Review):

      Valentini et al. explore the contribution of inexperienced homing pigeons in a pair, while finding the most efficient route back home. My comments below mostly concern the need of broadening the scope of the introduction and discussion by discussing and citing literature beyond homing pigeons as at the moment the manuscript could be characterized as too specific for the readership.

      We thank the reviewer for their suggestions which allowed us to expand the focus of our manuscript. Our answers to the reviewer’s comments are reported below together with modifications done on the revised manuscript.

      The authors use and present transfer entropy methods which regard the transmission of information from one individual to the other and effect of this information on behaviour. I haven't used such methods myself, but I think the methodology is nicely explained and easy to follow as it's written here. However, I would still encourage the authors to avoid jargon and un-introduced terms while first presenting their methods and results in the introduction and results sections. I also think that the paragraph in the introduction (L92-104) that refers to transfer entropy (TE) has to be extended and also direct readers to reviews such as [1] that attempt to make TE accessible to a broad audience of non-physicists. Behavioural ecologists and primatologists that study leadership and influence in animals, using less data hungry methods than TE, will probably be interested in reading this manuscript. Because eLife is a journal that attracts a very broad audience I would suggest investing more on better introducing TE to biologist and anthropologists.

      We thank the reviewer for their suggestions. In the revised version of our manuscript, we clarified the meaning of symbols and unintroduced terms and extended the introduction paragraph about transfer entropy to provide more information. We now discuss data requirements of information-theoretic approaches, point the reader towards recent literature reviews aimed at introducing these (and similar) approaches to the community of behavioural ecology, and better introduce the advantages of transfer entropy with respect to methods based on models of alignment, attraction, and repulsion.

      “Leader–follower interactions of this sort can be accurately captured using information-theoretic measures that quantify causal relations in terms of predictive information (Butail, Mwaffo, and Porfiri 2016; Kim et al. 2018; Crosato et al. 2018; Ray et al. 2019; Valentini et al. 2020). This methodological approach, which generally requires large amounts of data (but see (Porfiri and Ruiz Marín 2020)), is gaining popularity among behavioural ecologists (Strandburg-Peshkin et al. 2018; Pilkiewicz et al. 2020) as tools for automatic monitoring and extraction of the necessary volumes of behavioural data become increasingly available (Egnor and Branson 2016). One of these measures, transfer entropy, quantifies information about the future behaviour of a focal individual that can be obtained exclusively from knowledge of the present behaviour of another subject (Schreiber 2000). Transfer entropy measures information transferred from the present of the sender to the future of the receiver (Lizier and Prokopenko 2010). It explicitly accounts for autocorrelations characteristic of individual birds’ trajectories (Mitchell et al. 2019) by discounting predictive information available from the sender’s present that is already included in the receiver’s past (see Figure 1). Furthermore, it does not require a model of how sender and receiver interact, and it is well suited to study social interactions both over space and time (Lizier, Prokopenko, and Zomaya 2008; Strandburg- Peshkin et al. 2018). This aspect of transfer entropy encompasses traditional methods to quantify collective movement that are based on modelling an individual’s behaviour as a combination of three motional tendencies (Couzin et al. 2002) – alignment of direction to nearby group members, attraction towards sufficiently distant members, and repulsion from sufficiently close members – that allow an individual to maintain proximity to the group. In this context, transfer entropy is advantageous as it can capture causal interactions due not only to alignment forces (Nagy et al. 2010) but also to attraction and repulsion forces that result in temporarily unaligned states (Pettit, Perna, et al. 2013).”

      A thought I had while reviewing this work regards the theory of the wisdom of the crowd [2]. This indicates that when a group or a collective averages the different estimates of its members, they reach a more accurate collective estimate. Studies have also shown that animals can average their movement directions to resolve conflicts of interest [3,4]. The current manuscript also shows that pooling infomration leads to better movement decisions. Would it thus make sense for this manuscript to discuss how its findings may support the wisdom of the crowd theory?

      We thank the reviewer for the suggestion. In the revised version of out manuscript, we included a new paragraph where we discuss a possible connection with the phenomenon of the wisdom of crowds as well as how our results might generalize to flocks of larger size.

      “The ability of groups to outperform single individuals by pooling information across their members is an aspect of collective intelligence that has long intrigued researchers. One potential mechanism underlying this phenomenon, popularly known as the wisdom of crowds (Surowiecki 2005), is averaging many individuals’ estimates independent from each other. Averaging individual decisions is expected to provide a more accurate group estimate than any individuals’ guess. Previous studies have also shown that animals can average their movement decisions to reach a compromise (Biro et al. 2006; Strandburg-Peshkin et al. 2015). Although the mechanisms by which experienced and naïve individuals pool information during route development remain unknown, our study points to the importance of naïve group members within the information-pooling process. Moreover, the wisdom of crowds is known to require personal information to be independent among group members (Couzin 2018) otherwise group performance can degrade quickly for increasing group size (Kao and Couzin 2014). Experimental pairs could thus benefit from pooling information with naïve individuals that, at least at the beginning of each generation, likely provide a source of information independent from that of the experienced bird. The potentially deleterious effects of losing independence may provide another pressure to shift over time from innovative exploration to route6 preserving exploitation. It remains to be explored how our results generalize to larger flock sizes. Previous experiments without generational replacement showed that, even in larger flocks, birds flying ahead of the flock had a tendency to assume leadership positions (Nagy et al. 2010). However, the repeated introduction of naïve individuals into larger flocks might complicate the dichotomy between leaders and followers by inducing turnover dynamics between the front and the back of the flock.”

      As briefly mentioned earlier, I think that the cited literature in this manuscript (especially in L58-138 and throughout the discussion) includes mostly studies on homing pigeons whereas relevant studies to the current manuscript have been performed on other species and by discussing and citing relevant studies on various species the manuscript would become more attractive to a broader audience and wouldn't read as homing-pigeon specific.

      We thank the reviewer for pointing us towards additional literature related to our study. We included the suggestions from the reviewer as well as further references to a broader literature to expand the scope of our manuscript.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons

      1. General Statements

      We want to thank all three reviewers for their positive feedback, constructive comments, and suggestions for clarity and improvement. We are delighted to find their consensus that the manuscript represents a contribution to the field.

      Accordingly, we made changes in the text (all highlighted in blue in the revised manuscript) and added a new figure as detailed in the point-by-point response.

      2. Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The authors describe results of the comprehensive analysis of the prevalence and functionality of intrinsically disordered regions of the pathogen-encoded signaling receptor Tir, which serves as an illustrative example of the bacterial effector proteins secreted by Attaching and Effacing (A/E) pathogens. This is an interesting and important study that represents an impressive amount of data generated computationally and using a broad spectrum of biophysical techniques. The work serves as a model of the well-designed and perfectly conducted study, where intriguing conclusions are based on the results of the comprehensive experiments. The manuscript is well-written and concise, and I have a real pleasure reading it. The text and figures are clear and accurate.

      We thank the Reviewer for these positive comments on our work.

      Although, in general, prior studies are referenced appropriately, the authors should mention that the pre-formed structural elements they found in Tir are in line with the concept of "PreSMos" (pre-structured motifs) previously introduced and described in several important studies from the laboratory of Kyou-Hoon Han.

      We thank the Reviewer for this suggestion. We have added a sentence to acknowledge the presence of “PreSMos” in the target-free state of Tir as putative signatures for target-binding, referring to a review article summarizing several local structural elements in unbound IDPs:

      “This supports the presence of pre-structured motifs (PreSMos) as pre-existing signatures for target binding and function within target-free Tir (72)**.”

      Please, note that we decided to keep this discussion to a minimum, as we cannot rule out the contribution of the induced fit model to the binding mechanism (i.e., disorder-to-order transition upon binding).

      Reviewer #1 (Significance (Required)):

      Solid evidence is provided that structural disorder and short linear motifs represent common features of A/E pathogen effectors. In fact, using a set of bioinformatics tools, the authors first show that although prokaryotic proteins typically contain significantly less intrinsic disorder than eukaryotic proteins, A/E pathogen effectors are as disordered as eukaryotic proteins. Using the translocated intimin receptor (Tir) as a subject of focused study, the authors then utilized a number of biophysical techniques to draw an impressive picture of disorder-based functionality. This study clearly represents a major advancement in the field of functional intrinsic disorder in general and in disorder-based functionality of proteins expressed by pathogenic bacteria. This was adds significantly to the field and will have a noticeable impact.

      Again, reading this manuscript was a real joy. Finally, this work perfectly fits in the area of my expertise, since for the past 25 years or so I am working on the different aspects of intrinsically disordered proteins.

      Thank you for this encouraging assessment.

      **Referee Cross-commenting**

      I agree with the amended recommendation of reviewer #3 to add in the manuscript EPEC O127.

      According to the suggestion of Reviewer #3, we have now included EPEC O127:H6 in the manuscript.

      I completely agree with comments of reviewer #2 and partially agree with reviewer #3. In my view, comparison of various strains as references for EPEC represents an interesting but independent project. It can be recommended to the authors as one of the potential future developments of their work.

      Thanks for the suggestion. We are pursuing that line of research.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The general impression is that this is an excellent study that establishes

      The C-terminal intracellular region of Tir called C-Tir spanning residues 338 to 550 is largely disordered, however, observe helical structural elements involved with lipid interactions; multi-phosphorylation. The intracellular N-terminal part of Tir called N-Tir spanning residues 1 to 233 is also partially disordered but include a folded domain that is shown to assemble into a dimer

      The only major concern is that no SDS-PAGE gels or size exclusion chromatograms have been included to verify purity and monodispersed of the various constructs worked on. In particular, the SAXS and CD measurement is highly sensitive to purity, and the level of degradation as IDPs are notorious for being difficult to handle in solution. it would strengthen the arguments made based that

      We produced N-Tir and C-Tir as fusion proteins with a cleavable N-terminal thioredoxin tag (Trx-His6) and C-terminal Strep-tag. The latter allowed us to purify them via Strep-tag affinity chromatography as indicated by SDS-PAGE (please see Fig. S1).

      We agree with the Reviewer that even small amounts of impurities (i.e., higher oligomers/degradation) can interfere with the data analysis and make interpretation of the resulting data difficult and potentially misleading. So, to avoid such problems, all samples were purified in monodispersed forms by size-exclusion chromatography (SEC) before any biophysical study.

      Following the Reviewer's suggestion, we added a new supplementary figure (Fig. S5) showing the SEC-SAXS chromatogram profiles of C-Tir, N-Tir, and NS-Tir. Briefly, in the inline SEC-SAXS experiment, the sample eluates from an HPLC system directly and continuously into a BioSAXS flow cell for subsequent X-ray interrogation. Under our experimental conditions, C-Tir elutes as a single peak with Rg-values and mass compatible with a disordered monomeric protein, providing an excellent fit to the experimental SAXS curves. For N-Tir and NS-Tir, by SEC-SAXS, we separated the dimer from small amounts of high-order oligomers to yield the experimental SAXS curves of the pure dimers.

      “Fig. S5. SEC-SAXS chromatograms of (A) C-Tir, (B) N-Tir, and (C) NS-Tir. Each plane shows normalized total scattering intensity I(s), over the entire s range, from each frame acquired along elution volume and respective Rg-value (black circles). The flat variation of Rg reflects a pure monodisperse sample. The column type for size exclusion chromatography and sample concentrations are on the top left of each panel. For reference, the retention volume for monomeric BSA (66.4 kDa) is displayed by red triangles.”

      **Minor Comments**

      Read through the manuscript to remove passages with spoken language

      We thank the Reviewer for this suggestion. We went through the manuscript and improved the writing to reduce passages with spoken language.

      Line 263, "To do so", should be removed

      Line 290 "Our data thus" replaced with "this"

      We have amended the manuscript accordingly.

      Line 292 "lipid bilayers that might potentially fine-tune Tir's activity in the host cell." Weak sentence and the word fine-tune is slang. Rewrite the sentence. The interaction with lipids is fascinating!

      Thanks for the suggestion. The sentence has now been changed to “**This shows that C-Tir can undergo multivalent and tunable electrostatic interaction with lipid bilayers via pre-structured elements, suggesting that membrane-protein interplay at the intracellular side might control the activity and interactions of Tir in host cells.**”

      We also reinforce this fascinating message in the abstract by adding the sentence: “Membrane affinity is residue-specific and modulated by lipid composition, suggesting a previously unrecognized mechanism for interaction with the host.”

      Line 192 "In figure Fig. 3A," remove the Fig

      Fixed.

      Line 326, "In a similar fashion," is redundant. Rewrite the sentences below.

      We have modified the sentence as follows: “We evaluated whether the N-terminal cytosolic region of Tir (N-Tir; Fig S1) was also intrinsically disordered ...

      Line 342 add spaces between digit and SI unit "52kDa" there are more cases of this.

      Thank you for pointing this out. This has now been corrected to 52 kDa.

      Reviewer #2 (Significance (Required)):

      I expect this study to have broad relevance to microbiologists working with the intimin and translocated intimin receptor, in particular the lipid interaction is likely to be followed up by the community.

      We thank the reviewer for this comment. Indeed, we believe that further studies on Tir's lipid-binding ability as a novel molecular strategy in host-pathogen interactions, will potentially provide new insights on virulence, transmembrane signaling in general, and disorder-mediated functions.

      **Referee Cross-commenting**

      What reviewer 3 suggested in the comments sounds like added value and should be included.

      I agree with reviewer 1, that the strain comparison potentially is beyond the scope presented in this manuscript.

      We have now included EPEC O127:H6 in the manuscript.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      This interesting manuscript look at the structure of the Nter and Cter of the effector Tir from enteropathogenic E. coli. The authors confirmed previous study highlighting the "disordered" part of the Cter. However, the extended experimental work (NMR, Small-angle X-ray scattering and CD spectroscopy) from this study also reveals the connection between different area of Tir and its implication during Tir phosphorylation and its interactions with SH2 domain.

      We thank the Reviewer for this positive remark. Indeed, in our work, we highlight the structural features of the SH2-mediated interaction between Tir and host SHP-1 protein, and we also show that C-Tir is capable of lipid interaction via pre-structured motifs and that N-Tir is disordered but assembled into a dimer. Overall, we provide an updated and wide picture of Tir's intracellular side that goes beyond the scrutiny of previously described disorder features.

      **Major Comments:**

      The authors used E2348/69 (O127:H7) strain as a reference for EPEC. However, this strain are the least effectors of all the EPEC sequences and may over estimated the PDR in EPEC. It would be wiser to use a strain like B171 as a reference for EPEC to be able to conclude "Disordered Proteins (PDR) with long disordered regions occur in EPEC effectors similar to the human proteome". I believe that the PDR in EPEC is similar to EHEC and CR. I do not have any major concern for the rest of the work.

      We thank the Reviewer for this comment. So, to clarify, we amended “EPEC” with “EPEC O127:H6” in text and figures.

      We also added a paragraph at the beginning of the Discussion section to acknowledge that our prediction analysis concerns EPEC O127:H6 and two additional representative A/E bacteria strains:

      “Among the enteropathogenic Escherichia coli strains EPEC O127:H6 (E2348/69) is commonly used as a prototype strain to study EPEC biology, genetics, and virulence (69). Here, we have determined the structural disorder propensity of EPEC O127:H6 sequences and two additional representatives of A/E bacteria: EHEC O157:H7 and CR ICC168.

      Finally, the Reviewer suggests to include EPEC strain B171 (serotype O111:NM) in our analysis. We agree that considering additional strains would be of value, however we believe that this is beyond the scope of this manuscript, which mainly focuses on the characterization of the structural features of the E2348/69 Tir effector. We are currently working on a broader comparative analysis among different Escherichia coli pathogenic strains, including B171, and we hope to share our findings with the community in the near future.

      **Minor comments**

      Statistic problem: Mann Whitney U Test (Wilcoxon Rank Sum Test) is a comparison of two independent samples with the underlying assumption is normally distributed or that the samples were sufficiently large. It is not certain that any of this assumption is correct. In addition, the effector are part of the whole proteome. Can it be then considered that both groups are independent?

      We thank the Reviewer for this remark, which allows us to clarify the choice of this particular test. Indeed the Mann Whitney U-test is a non-parametric test to compare two samples with the alternative hypothesis being that one of the two samples is stochastically greater than the other. As it is a nonparametric test samples are not required to be normally distributed, as it is for the Student t-test.

      Regarding the independence of the samples, when comparing the effectors collections to their corresponding proteomes, we did exclude the effectors sequences from the latter. We have clarified this point in the Supplementary Material and Methods section.

      Line 120 and 442: O127 not H127

      Thank you for pointing this out. It has now been corrected to O127.

      Line 212: positions 409 or 405?

      Yes, it should be 405. Thank you.

      Reviewer #3 (Significance (Required)):

      **Nature and significance:**

      Tir plays a major role during EPEC infection. It is a signalling platform that has been reported to interact with multiple proteins. Whereas the extracellular part has been well characterised and crystallised, the intracellular part has been proven so far to be difficult to study. Over the last decade, no progress has been made to explain how Tir works. This manuscript provides interesting information that shade some light on how the protein could work.

      **Existing literature:**

      The last research manuscript trying to highlight the structural function of Tir dates from 2007 (PMC1896257). This study is far more extended and in depth than any other previous work done.

      **Audience:**

      the Audience may probably limited to researcher working on the field of cellular microbiology and the function associated with bacterial effector in the host. This study could be also a useful tool to identify new effectors base on their "disorder".

      We thank the Reviewer for recognizing the importance of this study. We agree that our work highlights the pivotal role of disordered regions in bacterial effectors, thus enabling a better understanding of the molecular mechanisms used by pathogens to subvert the host-cell processes. We indeed believe that our work can stimulate further research on the characterization of intrinsically disordered effectors, and also beyond the cellular microbiology field, in order to gain a broader knowledge on the molecular dialogue at the host-pathogen interface, which is essential to design better therapeutic strategies.

      **Expertise:**

      I have been working on A/E pathogens for the last 15 years with a particular interest in Tir signalling. My domain of expertise is more in relation to cell signalling than crystallography or structural study.

      **Referee Cross-commenting**

      I agree with both reviewers. My comment about EPEC is more about the conclusion for some of the figures. I don't think they should conclude for the whole EPEC. The Tir variation among EHEC O157:H7 is low, but it is far more diverse for EPEC. Simply adding in the manuscript EPEC O127 should be enough.

      We thank the Reviewer for this comment. As mentioned above, we now state in the manuscript, in both Results and Discussion sections, that we used E2348/69 as a representative strain for EPEC.

    1. Author Response:

      Reviewer #1 (Public Review):

      Summary: In " Rapid and Sensitive Detection of SARS-CoV-2 Infection Using Quantitative Peptide Enrichment 1 LC-MS/MS Analysis" Hober, A. et al. describe the addition of peptide immunoprecipitation by means of SISCAPA technology to the Sars-Cov2 mass spectrometry-based diagnostics toolbox. The work shows in a straightforward way that this is a huge improvement and of great importance to the field. It shows beyond any doubt that mass spectrometry can become a clinically applied diagnostic instrument to detect (viral) infection.

      Overall remark: The main concern is the reported number of 83% sensitivity. This is not because the number is too low, but because the number is misleading. In line with "CLSI EP 12-A2 User Protocol for Evaluation of Qualitative Test Performance guidance" a summary of the sample analysis results are shown in a 2x2 contingency table. Unfortunately, I oppose to this representation of the results at this stage for three reasons: (i) reporting a percentage shouldn't be done on less than 100 samples because of the weight of a few misannotated samples on these numbers, be it in the qPCR or the MS results; (ii) because both assays are imperfect, it is impossible to assess the ground truth for calling patients and thus assess sensitivity and specificity; (iii) the authors still only target a single peptide, which is not conventional in MS-based assays that targets proteins.

      We have changed to PPA and NPA in the new version of the manuscript. We have also included 264 RT-PCR negative samples collected in the same study. We agree that protein quantification should not be done using only one single peptide. We have updated the manuscript to clarify that we do not perform protein quantification, but rather peptide quantification.

      Rather than the proposed confusion matrix, which assumes that the ground truth is known to call it e.g. "false negatives", the authors could refer to it as an agreement matrix and not be tempted to calculate threshold values like sensitivity, which have too much of an impact on the clinical readership that is used to seeing this value in a more controlled context. This is in line with the recent Lancet manuscript from Fitzpatrick, M. et al (2021), proposing percent positive agreement (PPA) and percent negative agreement (PNA) instead (Fitzpatrick et al., 2021).

      We have decided to keep the confusion matrix but we are referring to it as PPA and NPA and rephrased sensitivity to “estimated sensitivity” based on PPA.

      More specifically, as we and others have shown, qPCR Ct values rarely agree in two (consecutive) analyses, even within accredited settings (personal communication NHS). Above Ct30, patients regularly turned negative in our hands (https://doi.org/10.1021/jacsau.1c00048), even with an assay that had proven detectability of 1 plasmid at Ct40. Furthermore, we suspect that freeze-thaw cycles further inflate this uncertainty, two of which the current samples were subjected to. Undetected mRNA would then classify these patient samples as "false positives" if they did yield signal in the LCMS results. By chance, this did not happen in this manuscript, yet this could very well be the reason for the highest signal reported in Figure 3 as a green dot at log2 MRM response of -6 (see minor remarks).

      The authors already distinguished the patients in a High Pool of Ct <30, a Low Pool 30{less than or equal to}Ct<33 and the negative samples (Ct>40). It is clear from the gap (no 34<Ct<39) that finding patients between Ct33 and Ct39 is challenging. Indeed, qPCR has its own "diagnostic grey zone" of LOQ negative and LOQ positive that rarely is being referenced. Thus, a "sensitivity" of 95% for patients <Ct30, despite the low number of samples and considering the uncertainties in qPCR (just above or below Ct30) at least limits the comparison to samples that are positive beyond any doubt. But again, we would be thresholding against a trembling metric, in turn making the claim from the authors dangerous that "the estimated LLOQ is 3 amol/μL approximates to Ct {less than or equal to}30". Rather, the Ct30 threshold should be set for a different reason, if one is chosen at all.

      What is needed is good thresholding for clinical diagnostics, as is done in qPCR. In the public hospital in Belgium that provided us with patient samples, the positive threshold is set to Ct33 on the first measurement and practitioners use higher Ct values only in the context of physical symptoms of the disease to come to a final conclusion. For MS, we now need to measure >1000 samples in order to decide what log2 MRM response for a given set of peptides corresponds to an LOQ positive from - say - Ct27 to Ct30 and an LOQ Negative from Ct31 to Ct33. In other words, the linearity of the correlation between qPCR and MS illustrates the intrinsic value of MS; the point up until which we can provide clinically relevant information remains to be determined on large patient cohorts. In turn, these large patient cohorts can allow to sort (clinically) validated patients according to signal intensity and set a log2 threshold at which e.g. 2% or 5% negatives are expected, in line with False Discovery calculations for target decoy strategies. At this stage however, it might be most straightforward to conclude with percent positive agreement (PPA) and percent negative agreement (PNA), as is recommended for laminar flow tests validated on <100 samples.

      Finally, realizing the importance of this pivotal moment in the implementation of MS in the clinic, I find it somewhat tricky to only focus on one peptide. In fact, the authors perform the qPCR on two genes (three genes being even more common) because of the drop-outs that can occur. I feel like the use of peptide IP with MRM for detecting pathogens has not yet matured enough to rely solely on one peptide. Still, I understand that asking for a second peptide would mean repeating all the measurements, so that is most probably not realistic. Yet, I do consider this to be yet another reason not to report % sensitivity and specificity in the current manuscript and the potential to gain robustness with more peptides should clearly be emphasized at every stage of the manuscript.

      We agree that the method would be much improved by adding another peptide to the repertoire. The method was developed using the most sensitive antibody-peptide pair and the most promising pair was used in the downstream process. We have highlighted the limitations of using only one peptide and emphasized that this is a proof-of-principle study.

      In conclusion, because patient batches in the thousands are currently unavailable to MS-oriented diagnostic labs and because of all the reasons mentioned above, we cannot report the numbers of sensitivity and specificity in this manuscript, as they are misleading and do not quantify what they are intended to do.

      Fitzpatrick, M. C. et al. (2021) 'Buyer beware: inflated claims of sensitivity for rapid COVID-19 tests', The Lancet. Lancet Publishing Group, pp. 24-25. doi: 10.1016/S0140-6736(20)32635-0.

      We agree and have changed to PPA and NPA for this reason.

      Major remarks: P3L250: "on-column amount of 60 amol." Because of the enrichment procedure, could the authors specify what initial conditions they spiked into the dilution series prior to enrichment. This would allow recalculation and avoid confusion about the correctness of the 60 amol on column claim (which in our hands is still detectable).

      We made changes to this in the updated version of the manuscript.

      P8L181: "50 μL elution buffer (0.5 % 180 formic acid, 0.03% CHAPS, 1X PBS) and incubated for 5 min at room temperature." This minor sentence is placed under major remarks, because in our understanding the elution buffer needs to be acidic and adding PBS will reduce acidity. If this is a typo, please correct. If this is not, could the authors try and use H2O instead and see if their results improve?

      The access to the raw data was denied.

      The raw data is accessible through the provided Panorama link and can be accessed under the tab “Raw Data”. The entry in ProteomeXchange, however, is only a reserved data set identifier for now, but the data will be made available through this link after the review process.

      Reviewer #2 (Public Review):

      MS-based proteomics is currently discussed as a method for detection of viruses from clinical samples. Several studies have already shown the potential of this method on the example of the detection of SARS-CoV-2 from respiratory specimens. However, one of the major drawbacks still remains the low sensitivity of MS-based virus detection compared to real-time PCR, which is the gold-standard method. In their manuscript Hober and colleagues apply specific antibody-based enrichment of SARS-CoV-2 peptides from upper airway samples to concentrate the analyte prior to analysis by targeted MS (MRM). The authors determined the dynamic range of the method for four different SARS-CoV-2 NCAP peptides using a calibration curve. On the example of the SARS-CoV-2 NCAP peptide AYNVTQAFGR a correlation between the MS result and the cT value is shown. Furthermore, using stable isotope labelled (SIL) peptides as internal reference, a quantitative MS measurement was achieved. The presented approach is able to distinguish real-time PCR SARS-CoV-2 positive samples from negative samples in the used set of 88 samples from asymptomatic patients. Combined with a specificity of 100 % and sensitivities of up to 94.7 % for samples with cT values {less than or equal to} 30 the authors conclude that the method could be an alternative to real-time PCR.

      Strengths of the manuscript:

      I think the applied technique (SISCAPA) is highly interesting in the context of virus proteomics. This is because virus proteins are often underrepresented in relation to the host proteins, especially during early time points of infection, hampering their detection. Recently, the application of SISCAPA for SARS-CoV-2 diagnostics has been suggested in the discussion of a manuscript from Van Puyvelde and colleagues. The manuscript from Hober and colleagues presents the first study demonstrating that this technique can be applied to enrich, detect and quantify SARS-CoV-2 peptides from upper airway samples. The manuscript is clearly arranged, the data is sound and supports the main conclusions.

      Weaknesses of the manuscript:

      I think the manuscript in some points underestimates the PCR and vice versa overemphasizes the proteomics approach. For example, I don't agree that real-time PCR generally suffers from technical problems, degraded probes or non-specific amplification. Vice versa I think the LC-MS/MS approach is not inherently absolute specific and does not outperform PCR in terms of specificity. Further, LC-MS/MS does not eliminate the problem of false positives, which could be introduced during sample preparation or by inter-run contaminations. Although in real-time PCR no internal standards analogous to isotopically labelled peptides are used there are internal controls used to assure the quality of the extraction and the PCR reaction itself. The method presented by Hober and colleagues is clearly beneficial for the field of proteomics-based virus detection, but I suggest a more balanced discussion also including also the potential drawbacks of the method.

      Another point I like to raise is that the authors conclude at the end of the results section that patient samples were collected at an infectious stage.

      We have made changes to the manuscript accordingly and removed the claim that the samples were collected in an infectious stage since this cannot be confirmed. The patients did not show any symptoms when sampled, which has been highlighted in the new version.

      However, an assessment of the infectivity cannot be drawn from the presented data. The analysis of real-time PCR results in the manuscript is based on cT values. But to draw the conclusion, that the analysed samples contained infectious virus particles, the number of viral genome equivalents has to be determined, which in turn can be correlated to infectivity.

      We have removed this section since we cannot make any claim on infectious virus-particles.

      The detection of viral proteins itself does not proof that samples were collected at an infectious stage and there is currently no correlate of the amount of NCAP protein and infectivity. Since viral proteins are likely more stable than viral RNA, they could even be detectable for a more prolonged time in patient samples.

      Reviewer #3 (Public Review):

      Major comments

      P2, l245, Figure 2: It is not completely clear to me what is represented in panels A and B. Is this the pure SIL peptide of the endogenous peptide in a complex matrix? This may make a large difference for the determination of the LLOQ. Panel B shows a calibration curve and as these are curves for which the signal is detected based on known input amounts of sample, I assume that the input is pure SIL peptide here?

      In panel A, what does '3 amol/ul' in the middle chromatogram exactly mean? Is this the endogenous peptide that was calculated to be present at 3 amol/ul based on a known concentration of spiked-in SIL peptide?

      P4, l276: The authors need to explain the details of data imputation. It is unclear which data were imputed and how this was done. In Figure 3 the grey data points represent "not detected" or "inconclusively identified" samples by LC-MS, while some of the data points seem to have a higher 'response' values than others. Please explain.

      In Figure 3, how is 'response' defined? I don't understand the following sentence (p4, l277): "… for the LC-MS results the lowest response divided by three was used, mimicking….". Which variable does the data point size reflect? There seem to be clear differences in ball sizes. Please explain. For clarity, it would be advisable to keep the y-axes for panels A and B identical. Also, how could RT-PCR values be not obtained, apparently leading to missing Ct values (p5, l278)?

      Assuming that all collected samples from individuals in the test group in this study are visualized in Figure 3, the majority was tested positive for SARS-CoV-2. This is very different from the percentages oberserved in regular testing facilities. How was the study group composed? Were these individuals who were already admitted to the hospital?

      We have specified that the sampels were selected based on RT-PCR result and have included more negative samples in the new version of the mansucript. We have also speciied how individuals were enrolled into the study.

      It would be interesting to include more negatively tested individuals to see the distribution of 'MRM response' values in this group, since some of the negatively tested individuals (green data points) show higher than expected MRM response values if no viral protein is present at all. Related to this, I do not understand how a specificity score of 100 % (p5, l292) was obtained while some green data points (negative by RT-PCR) have higher associated MRM response values than some of the blue (positive by RT-PCR) samples. Can the authors explain this?

      The negative samples that show a stronger MRM response do not have the required qualifying ions, thereby failing the QC parameter of the assay. This has been clarified in the new version of the manuscript.

      I find the text from p6, l298 ("However…") onward more suited for the Discussion section, since this is about the interpretation of the results presented here and the use of the described methodology in diagnostics; no results are shown in this part.

    1. Reviewer #3 (Public Review): 

      Constant et al describe a study investigating an important issue - are judgements of agency metacognitive in nature? While this topic has received a lot of theoretical attention, empirically the issue is underexplored, partly due to a lack of appropriate frameworks and tools. Here the authors suggest the issue can be tackled by thinking more precisely about the computations involved in both judging agency over an outcome and in forming a (metacognitive) confidence report. This focus on constituent computations is an important conceptual strength of the paper. 

      The authors choose to operationalise metacognitive computations as those where agents have "second order access to sensory noise" and design two similar tasks - a confidence judgement task and an agency judgement task - where observers report their experience of controlling a virtual hand that can move synchronously or be delayed. Crucially, the uncertainty of the incoming sensory signals is varied, and the authors explore whether agency and confidence judgements are influenced by this sensory noise, and which kind of computational processes can best explain how. While the authors find empirically noise has an effect on both kinds of judgements, computational modelling suggests that agency judgements are best explained by a 'rescaling' model which does not include an explicit representation of the noise, whereas confidence judgements are better explained by a 'Bayesian' model which does represent noise. 

      There is lots to enjoy about this paper. It is particularly inspired to have an agency and confidence task that are so similar, making them more directly comparable. Indeed, they are compared in the paper with basically identical computational models, something which to my knowledge has never been achieved in this field of work. The models themselves all seem well chosen given certain design assumptions, though I suspect the more general insight of generating explicit computational models of agency-like judgements is one that could inspire other researchers in this field, and charts a route to progress on thorny issues on this and related topics. 

      However, while this approach is intriguing, I think the main weakness of this study relates to the core experimental manipulation: introducing temporal delays between actions and outcomes to influence ratings of control. While this is a popular approach in the field, recent authors (e.g., Wen, 2020, Consciousness and Cognition) have suggested that this manipulation may be problematic for a number of reasons. In similar types of paradigm, Wen (2020) notes that agents are able to accurately judge their control over action outcomes that are substantially delayed (e.g., well over 1000 ms) and thus it is possible that 'delay manipulation' designs actually introduce response biases, where participants are somewhat artificially reporting variance in the delays they experience rather than their actual experience/belief about what they can and cannot control. Indeed, in the methods of this present paper, the authors note participants were asked to "focus specifically on the timing of the movement" of the virtual hand, which may make this concern particularly apposite. 

      Because of this manipulation, all of the computational modelling (naturally) assumes that agents are engaged in a task where they have to detect the delay and compare this to some criterion value. Indeed, there is nothing else they could be doing in these tasks. The report of "agency" is thus generated directly from this internal variable that encodes "did I detect a delay?", and any confidence report is a metacognitive judgement about that decision. 

      This raises an important issue of conceptual validity: is a judgement of agency equivalent to judging whether an outcome was delayed or not? Many results (see review by Wen, 2020) suggest that agents can simultaneously tell an action outcome was delayed, but still judge themselves to be the agent, suggesting that an equivalence along these lines is unlikely. If so, this would mean acknowledging the generalisability of these is conclusions is potentially limited: rather than concluding that agency judgements in general are non-metacognitive, the conclusion would be the sensorimotor delay judgements in particular are non-metacognitive. The latter conclusion is by no means uninteresting, but has a somewhat narrower theoretical significance for the key debate used to frame this paper ("do agency judgements monitor uncertainty in a metacognitive way?") 

      A second important issue relates to what exactly makes a computation 'metacognitive'. For example, the authors argue their Bayesian model is a metacognitive one, because it requires the observer to have second-order access to an estimate of their own sensory noise. I am not completely sure this follows: the Bayesian model in this paper clearly incorporates an estimate of the noise/uncertainty in the signal, but not all representations of noise are second-order or metacognitive. For example, Shea (2012) has noted that in precision-weighted Bayesian inference models throughout neuroscience (e.g., Bayesian cue combination, also discussed in this paper) the models contain noise estimates but the models are not metacognitive in nature. For example, when we combine a noisy visual estimate and a noisy auditory estimate, the Bayesian solution requires you account for the noise in the unimodal signals. But - as Shea argues - the precision parameters in these models do not necessarily refer to uncertainty in the agent's perceptions or beliefs, but uncertainty in the outside world. It seems a similar argument is relevant to the Bayesian model of agency offered by the authors in the present paper. It is not clear to me why we should think the uncertainty parameter in the Bayesian model is something metacognitive (e.g., about the agent's internal comparator representations) rather than something about the outside world too (e.g., the sensory environment is noisy). 

      References:

      Shea (2012) Reward prediction error signals are meta-representational. Nous, DOI: 10.1111/j.1468-0068.2012.00863.x 

      Wen (2020). Does delay in feedback diminish sense of agency? A Review. Consciousness and Cognition, DOI: 10.1016/j.concog.2019.05.007

    1. Author Response:

      Reviewer #1 (Public Review):

      The authors provide evidence for the following key points:

      • that low and likely biologically relevant levels of oxidized phospholipids (OxPLs) can induce macrophage death and interleukin-1-beta release
      • that the pro-inflammatory activities of OxPLs can be tempered by acyloxyacyl hydrolase (AOAH) which deacylates oxPLs in vitro
      • that AOAH deficient mice exhibit exacerbated inflammation in vivo in response to exogenously delivered OxPLs, but interestingly, also in response to HCl, which presumably induces the release of endogenous OxPLs

      In general the data are a nice combination of in vitro and in vivo observations and are supportive of the conclusions. A few points should be addressed:

      • how do the authors reconcile their results with others' apparently contradictory results in the field?

      We thank the reviewer for raising this important question. We think the oxPL species used and their concentrations, the routes of MAMP and oxPL delivery, and the order of addition of MAMP and oxPLs may contribute to the observations made in different laboratories. We have added a paragraph in the Discussion and another in the Methods, lines 447-474 and lines 495-506 (highlighted).

      • which inflammasome is activated by OxPLs?

      We found that NLRP3 specific inhibitor MCC950 reduced PGPC or LPC-induced inflammasome activation and IL-1β release. To our surprise, using inhibitors we found that in addition to caspase 1, caspase 8 was also indispensable, suggesting that caspase 8 may cleave caspase 1 and activated caspase 1 cleaves pro-IL-1β (Chi et al., 2014; Philip et al., 2014). Please see lines 94-105, new Fig. 1E, F and new Fig. 3B, C.

      • can the possible effects of AOAH on the priming stimulus (Pam) be more cleanly distinguished from its effects on OxPLs?

      Because AOAH does not regulate acute responses to LPS (Lu et al., 2008) or Pam3 (Fig. 4C, IL-6) in vitro or in vivo (Lu et al., 2008; Zou et al., 2017), we do not expect AOAH to modulate the priming effects of Pam3 or LPS. To exclude this possibility, we tested CpG, which can also prime macrophages for oxPL-induced inflammasome activation. We found that when AOAH WT and KO macrophages were primed with CpG, PGPC induced more cell death and IL-1β release from AOAH KO macrophages. Please see lines 220-225 and new Fig. 4E.

      • a few other experimental controls could be provided

      We have added actin controls to all Western blots.

      Reviewer #2 (Public Review):

      Zou et al. investigated the function of acyloxyacyl hydrolase (AOAH) in inflammation caused by oxidised lipids. Using cell culture models (murine BMDs) the authors first show that oxidised lipids such as oxPAPC, POVPC and PGPC induce inflammasome activation. Focusing on AOAH, they then demonstrate that AOAH, which can act as a phospholipase A1/2 or B, can remove sn-2 oxidised fatty acyl chains and sn-1 palmitate from pro inflammatory oxidised lipids thereby modulation their ability to activate inflammasome and induce cell death inflammation (IL-1b production). Release of sn-2 acyl chains from PGPC or POVPC results in the formation of LPC (lysophophatidylcholine) which has also pro-inflammatory properties. The author demonstrate that LPC also activated inflammasomes, and that that LPS, or PGPC or POVPC-induced inflammasome activation is enhanced in BMDMs from AOAH-deficient mice. Moving to mouse models of inflammation the author find that AOAH-deficient mice have higher level of lung inflammation and injury after nasal instillation of LPS+oxPLs, and that AOAH regulates inflammation after nasal instillation of HCl.

      The conclusions of this paper are mostly well supported by data, but some aspects need to be clarified and extended.

      1) what inflammasome/s is/are activated by PGPC, POVPC and LPC?

      Zanoni et al found that PGPC or POVPC but not oxPAPC can induce IL-1β release from primed bone marrow derived macrophages (BMDM) in a NLRP3-, Caspase 1/Caspase 11-dependent manner (Zanoni et al., 2017). Yeon et al also found that POVPC induced IL-1β and processed caspase 1 release from primed BMDM, which required NLRP3 (Yeon et al., 2017). In contrast, Muri et al., found that caspase 8 but not caspase 1 or NLRP3 was required for cyclo-epoxycyclopentenone-induced IL-1β release in primed bone marrow-derived dendritic cells or macrophages We found that NLRP3 specific inhibitor MCC950 reduced PGPC or LPC-induced inflammasome activation and IL-1β release. Using other inhibitors we found that in addition to caspase 1, caspase 8 was also indispensable, suggesting that caspase 8 may cleave caspase 1 and activated caspase 1 cleaves pro-IL-1β (Chi et al., 2014; Philip et al., 2014). Please see lines 94-105, new Fig. 1E, F and new Fig. 3B, C.

      2) how does AOAH affect the anti-inflammatory functions of oxPLs which have previously been reported (PMID:29520027, 32234476 )

      It is a very intriguing question. In this study, we focus on studying the role that AOAH plays in preventing oxPL-induced inflammasome activation. We will study whether AOAH alters the anti-inflammatory functions of oxPLs in the future. We have added a sentence in Discussion, lines 471 - 474.

      3) additional controls need to be provided to increase confidence into the immunoblot analysis

      Thanks. We have added actin loading controls.

      4) experimental procedures need to be better explained and justified

      dPGPC/dPOVPC means PGPC/POVPC treated with AOAH. AOAH can release both sn-2 and sn-1 fatty acyl chains from PGPC/POVPC. In addition, AOAH deacylates LPC. Please see Fig. 2A, B and Fig. 3A. We have clarified the definition of dPGPC/dPOVPC, line 144. The samples were frozen after treatment. Freezing in the absence of glycerol inactivates AOAH. We added a sentence to make it clear, lines 568, 569.

    1. Author Response

      Reviewer #2 (Public Review): Osteoblasts are highly anabolic cells that display a high proliferation rate and secrete ample amounts of extracellular matrix, indicating that these cells have a specific metabolic profile. Here, using a set of in vivo and in vitro experiments, Sharma et al describe that SLC1A5-mediated glutamine and asparagine uptake is critical to sustain osteoblast anabolism. While the experimental setup is robust, this concept has already been put forth, questioning therefore the novelty of the results. In addition, some of the author's claims are insufficiently supported by the presented data. Especially the metabolic role of asparagine in regulating osteoblast differentiation remains enigmatic. The main concerns are detailed below.

      1. Based on their data, the authors propose that the main mechanism whereby SLC1A5 regulates osteoblast proliferation and differentiation is via glutamine uptake, while asparagine only contributes to a lesser extent. Importantly, the concept that glutamine metabolism regulates proliferation and differentiation of osteogenic cells by sustaining anabolic processes has already been described recently, even by the same research group (Yu Y. Cell Metab. 2019; Stegen S. JBMR 2021), questioning the novelty of the present study. Moreover, no metabolic rescue experiments were performed to unequivocally demonstrate that the defect in amino acid/protein synthesis in SLC1A5-deficient cells was causing the decrease in osteoblast proliferation and differentiation.

      We appreciate the reviewer’s thorough and thoughtful review and we thank the reviewer for helping us to improve this manuscript. To address this, we evaluated proliferation or osteoblast marker genes in Slc1a5 deficient cells cultured in media supplemented with 10 times the normal concentration of the reduced amino acids (excluding Gln and Asn, Fig. 4B). There was no effect on EDU incorporation, however exogenous amino acids did rescue the induction of Ibsp and Bglap to a lesser extent (Fig. S6D-E). Interpretation of these types of experiments are tricky as the uptake of NEAA may be inherently limited in osteoblasts and due to time constraints, we were unable to quantify intracellular amino acid levels in the rescued cells. Regardless, we interpret these data as affirming the necessity of Slc1a5 to provide Gln and Asn used to synthesize amino acids for osteoblast differentiation. In addition, these data indicate other metabolites (e.g. alpha-ketoglutarate, glutathione, nucleotides etc) derived from Gln and/or Asn are required for proliferation. We have modified the discussion to address this uncertainty.

      In addition, Gln and Asn tracing (carbon and nitrogen) in SLC1A5-deficient cells would confirm that Gln and Asn uptake via SLC1A5 is important for osteoblast functioning.

      We did not perform tracing experiments in the Slc1a5 deficient cells. We directly evaluated amino acid uptake using radiolabeled amino acids in Slc1a5 deficient cells (Figure 4). Slc1a5 ablation reduced the uptake of Gln and Asn. To test if Gln and Asn uptake was important for osteoblast function we directly compared the cellular effects of Slc1a5 ablation to Gln or Asn withdrawal. From these experiments we concluded that Gln and Asn uptake is essential for osteoblast differentiation.

      1. Using isotopic labeling experiments, the authors demonstrate that asparagine-derived carbon and nitrogen label several amino acids that are critical for protein synthesis, albeit at a lower level compared to glutamine. Based on these observations, they claim that the decrease in osteoblast differentiation upon asparagine depletion also occurs via a defect in protein synthesis. However, proliferation, EIF2a phosphorylation and COL1A1 levels were not affected in asparagine-deprived conditions, questioning that the decrease in differentiation is resulting from impaired protein synthesis. Further experiments to decipher the metabolic role of extracellular asparagine are therefore warranted to avoid overinterpretation of the data, including protein/matrix synthesis, analysis of amino acid levels in Asn-deprived conditions and rescue with Asn-derived metabolites.

      Again, the reviewer raises a very important point. Our data indicates that Asn does contribute to amino acid biosynthesis, chiefly Asp, however, we did not evaluate the requirement of Asn for protein synthesis directly. We think it is probable that asparagine contribution to osteoblast differentiation is multifaceted. Thus, we have softened the conclusions about asparagine and the regulation of protein synthesis to reflect this uncertainty.

      1. To inactivate SLC1A5 in vivo, the authors use the Tet-off Osx-GFP::Cre mouse line. Importantly, newborn Osx-Cre mice display severe craniofacial abnormalities, which may complicate correct interpretation of the in vivo data, especially when analyzing at embryonic stages. Do the authors observe a similar defect in osteoblast function when SLC1A5 was deleted postnatally? This might be especially relevant because the phenotype seems to wane off over time, as knockout mice at P0 only display a craniofacial phenotype, whereas long bones appear to be normal.

      The reviewer raises a very important point regarding the Sp7tTA;tetoCre line we used in this study. As mentioned, the Sp7tTA;tetoCre mice do have a partially penetrant craniofacial bone phenotype. To control for this, we only use Sp7tTA;tetoCre as “wild type” controls. In addition to the early embryonic endochondral ossification and persistent calvarial phenotypes, the Sp7tTA;tetoCre;Slc1a5^fl/fl have additional bone phenotypes compared to the Sp7tTA;tetoCre controls. This included a calvarial phenotype at both birth and 2 months of age (Figures 1 and S2). Likewise, we observe similar changes in osteoblast differentiation and bone development in the developing limbs at birth and in femurs at 2 months of age (Figure S4). Due to time constraints, we have not been able to generate sufficient numbers of mice with postnatal deletion of SLC1A5 to include here. These experiments are ongoing and will be published later.

      Reviewer #3 (Public Review): This work by Sharma et al studied the role of aa transporter, ASCT2, encoded by Slc1a5 gene, that transports mostly Glmn and Asn, in osteoblasts (OB). They use gene targeting in vitro and in vivo using Sp7-Cre driven cKO. They found that ASCT2 deletion impairs OB differentiation in vitro as well as mostly intramembranous ossification in vivo by interfering with proliferation and protein synthesis. Mechanistically, they show that Glmn uptake via ASCT2 is important for aa synthesis in OBs. This group has shown before that Glmn is essential for OB metabolism. The current work further investigates this phenomenon and identifies ASCT2 as the key mechanism of Glmn uptake into OBs. The work is logically structured and carefully done with appropriate in vivo and in vitro controls. A variety of methods is used to confirm their findings, such as in vivo immunodetection and in situ hybridization and in vitro metabolic tracing. The conclusions are well justified by the data. Minor comments are: -MicroCT methodology is not adequately described and needs to be expanded

      We appreciate this positive review of our work. We have modified the methods to adequately describe µCT methodology. We modified the methods as follows:

      “Micro computed tomography (µCT) (VivaCT80, Scanco Medical AG) was used for three-dimensional reconstruction and analysis of bone parameters. Calvariae were harvested from either newborn mice or 2-month-old mice. All muscle and extemporaneous tissue were removed and the isolated calvariae were washed in PBS, fixed overnight in 10%NBF and dehydrated in 70% ethanol. The calvariae were immobilized in 2% agarose in PBS for scanning. A fixed volume surrounding the skull was used for 3D reconstructions. In newborn calvariae, bone volume was quantified from a fixed number of slices in the occipital lobe. The threshold was set at 280. For quantification of bone mass in the long bone, 2-month-old femurs were isolated, fixed, immobilized and scanned. Bone parameters were quantified from 200 slices directly underneath the growth plate with the threshold set at 333.”

    1. The authors discussed the implications of moving away from elimination or hard elimination strategy to a softer containment strategy. At present, elimination strategy is about zero tolerance towards new cases, not so much as tha the total number of cases in the country will be 0. This is also impossible given the emergence of delta variant. The authors argue that to do that, the country first needs to vaccinate every eligible individual. This will shift the "risk perception" of people such that even if deaths occur and infections continue, there will be less anxiety and urgency to act in a way that is strict and hard, rather, covid19 related hospitalisations and deaths would be viewed as "unavoidable" but risks associated with normal life, much as we think about these things in case of lnfluenza. They also argue that once universal vaccination for covid19 is in place, then the government, in order to sustain a zero tolerance policy towards death from covid19 need to implement three things:

      • Electronic device based contact tracing and tracking
      • Issuing vaccine passports to the fully vaccinated and providing facilities to vaccine passport holders that will not allowed to the non-passport holders
      • Mass testing of people for covid19 using saliva and other tests (such as rapid antigen testing but they have not mentioned that either)

      Beyond this, they think that the health system and relationship of the government with public health and businesses need to be "overhauled" to some extent. Their advices include:

      • Delay an urgent health reform
      • create a closer partnership with the businesses such that along the lines of biosecurity and primary industries in a way that businesses are registered that will have implemented minimising covid19
      • Increase workforce capacity so that as the country is dependent on overseas health care workers, fast track residency applications and retrain or arrange for training of existing workforces and retired but capable workers
      • They have not mentioned but reasonable to understand that this is about minimising attrition of health workforce
      • Develop purpose built MIQs rather than depend on ad-hoc MIQ facilities
      • Develop a specific Pandemic controlling agency

      The authors make several assumptions but not quite clear from what they have written how they address them, although several of their desiderata and suggestions will have bearing on those assumptions. First of all, this is now established that covid19 is largely airborne, and therefore, ventilation and masking have a more prominent role in non-pharmacological intervention that what was considered earlier. But in order to enforce or enable this, individuals need to change their behaviours about precautionary practices and businesses will need to adjust their practices such as ventilation on shop floor and how many people to allow; restaurants may consider to open more open spaced. Second, while they have considered endemicity of covid19, this is open to debate. Endemicity of covid19 implies that the authors have assumed that COVID19 will have reinfections; so far, there is little evidence of re-infections as a dominant mode of transmission of this virus. Besides, as it is now known that extant vaccines do not confer "sterilising immunity", the discussions around "booster vaccinations" are in order. If so, the question of an endemic infection and thereby the premise that there will be a "shift in public risk tolerance" needs careful consideration. Third, the authors have not discussed the potential issues around "long covid", and the need to study the implications of long covid. Albeit, we may be in the throe of the birth of a new speciality in public health and medicine devoted to the study of the causes and consequences of Covid. What the implications of long COVID19 will be is open to speculations.

    1. I know the difference of Peace and Warre better then any in my Country. But now I am old and ere long must die, my brethren, namely Opitchapam, Opechancanough, and Kekataugh, my two sisters, and their two daughters, are distinctly each others successors. I wish their experience no lesse then mine, and your love to them no lesse then mine to you. But this bruit from Nandsamund, that you are come to destroy my Country, so much affrighteth all my people as they dare not visit you. What will it availe you to take that by force you may quickly have by love, or to destroy them that provide you food. What can you get by warre, when we can hide our provisions and fly to the woods?

      I think here he is trying to establish a sense of security. Mainly because the same dialogue is repeated especially in this highlighted section but also throughout the text. He seems to represent love and not harm. He really explains in this section how Powhatan, did not keep his promise and peace amongst everyone. The captain here shows that he is not trying to take anyone's food at all.

  9. deploy-preview-38--cobalt-docs.netlify.app deploy-preview-38--cobalt-docs.netlify.app
    1. Description

      This may be more report-focused thinking than anything else, but I wanted to include a sort of "Call to Action" in the version I wrote. Like, "we recommend fixing [Critical] findings as soon as possible."

      (the ones I wrote I don't think are perfect, but it seemed like another good way to indicate Severity)

    1. Author response

      _______________________

      We thank the researchers within the ASAPbio community for taking the time to provide valuable feedback on our manuscript and also Iratxe Puebla for both facilitating this review of our preprint and for consolidating the comments we received. Here we provide comments to some of the points raised by the reviewers.

      In regards to the reviewers’ comment that our “work focuses on the nwk mutant”, we note that Figures 3 and 4 show the unexpected EV cargo depletion phenotype for mutants of numerous components of the clathrin-mediated endocytic machinery. We chose the nwk mutant for our in-depth analysis because it best shows separability of functions in synaptic vesicle (mild defect) vs extracellular vesicle traffic (severe defect), and also produces null mutant viable adult flies for our APP functional studies. However, our work indicates that EV cargo regulation is a broader function for the endocytic machinery and raises the possibility that previously identified neuronal phenotypes for many endocytic mutants could be due to loss of EV cargoes from synapses. Related to this, in reference to the comment that the nwk mutant “affects EV release” we also wanted to highlight that while the EV phenotype we observed for nwk and other endocytic mutants shows both pre- and postsynaptic depletion of EV cargoes, our retromer(vps35);nwk double mutant result suggests that endocytic machinery such as Nwk is not directly regulating release of EV cargoes. Instead, we conclude that the reduction of postsynaptic EV cargoes is a secondary consequence of presynaptic depletion due to defective intracellular traffic. Your helpful feedback has alerted us that we could make these points more clear in the writing and organization of our manuscript.

      In response to the points that “...the question arises as to how specific this pathway is to EVs” we should clarify that our findings seem to be specific to cargoes for which sorting to extracellular vesicles is at least a major trajectory (ie, Syt4 and hAPP, of which 30-50% of the synaptic complement is in EVs). We agree that they both have an intracellular component (either en route to EVs or for intracellular signaling functions, which have been well-documented for APP). In response to the comment that “other cargoes that undergo clathrin-dependent endocytosis and are not packaged into EVs would need to be tested”, indeed both Syt1 or Tkv require CME machinery for their traffic (PMID12795692, PMID16459302, PMID18498733), but we find that they are not detectable in EVs and are not depleted at CME machinery mutant synapses. This indicates that local synaptic depletion is specific to cargoes for which least a significant portion of their total pool is normally packaged in EVs.

      The reviewers commented that APP and perhaps Syt4 also have intracellular itineraries and functions that may be affected by their depletion at synapses - we agree that our results have implications for both extracellular vesicle and intracellular functions of these cargoes. We fully agree that “the [Figure 2] results might not be specific to EV functions of Syt4 or hAPP” and that a more general statement (such as was suggested in the comments) here would make this possibility more explicit. Our results at least indicate that reduction of these cargoes in presynaptic terminals (but not axons, cell bodies, or dendrites) is sufficient to abrogate their functions. It will be critical in the future to identify trafficking mutants that specifically disrupt EV release without impacting levels in the donor cell, in order to directly query the physiological functions of EV sorting.

      To “provide some more information on how the [postsynaptic ɑ-HRP puncta intensity] quantification was done”, we selected an intensity threshold sufficient to distinguish postsynaptic puncta from background muscle fluorescence. We did not directly select puncta manually. Puncta with brightness above this intensity threshold were measured within a 3 μm region around the neuronal HRP. Puncta brightness was not normalized to neuronal HRP brightness, but instead was normalized to the neuronal HRP volume. This analysis was not blinded as many endocytic mutants exhibit synaptic overgrowth phenotypes that are easily visible, thus complicating the blinding process. Using a complementary automated analysis for presynaptic Syt4-GFP, we found very similar results to our manual thresholding analysis. We were however unable to successfully automate the postsynaptic signal measurements due to signal-to-noise-ratio heterogeneity, especially for HRP. Here we’d also like to clarify that in regards to “postsynaptic objects smaller than 0.015 μm were excluded”, we meant postsynaptic objects smaller than 0.015 μm3.

      In response to the comment that “...saying this trafficking opposes retromer complex sorting appears to extend beyond the results” we would like to clarify that while direct opposition of endocytic machinery to retromer on endosomes is one possible interpretation, it is not the one we favor in the discussion. We agree that endocytosis and retromer are more likely to oppose each other more indirectly by regulating overall flux through the recycling pathway. We intended to convey opposition as a genetic rather than a mechanistic argument, and we think this conclusion is supported by our data. However based on this feedback we see that we could make this more clear in our manuscript.

      We thank the reviewers for pointing out that “In Figure 4B clc depletion does not yield a significant difference in pre-synaptic Syt4 levels. However, Figure 4D the levels of Syt4 are significantly lower in clc both pre- and postsynaptically”. One possibility is that this just reflects variance in the assay, and that the subtle Syt4 phenotype in the clc mutant reached our arbitrary threshold of significance in one experiment but did not in another. There is however also a potentially interesting biological explanation. The 4B clc experiment was conducted at 25℃ while the 4D experiment was conducted at 20℃, since we found that at the lower temperature we were able to recover more clc; nwk double mutant third instar larvae. Endocytosis is well-known as a temperature-dependent process, and perhaps there is some residual endocytosis at this lower temperature in the clc mutant, making it more similar to slowed endocytosis in the endocytic accessory protein mutants (see PMID16269341), compared to a more complete block in the chc or clc 25º condition. This would suggest that slow endocytosis drives cargo into the degradative pathway, fast endocytosis into the rapidly recycling and EV pathway, while no endocytosis traps cargo in unproductive membrane cisternae. Proving this would likely require more quantitative endocytosis assays than are currently available.

      We are also appreciative of reviewers comments that will help to make our manuscript more clear, such as suggestions to present the plots consistently, to mention that individual points represent individual NMJs, and to report that C155 is a neuron-specific driver, among other helpful points.

    1. SciScore for 10.1101/2021.10.01.21264412: (What is this?)

      Please note, not all rigor criteria are appropriate for all manuscripts.

      Table 1: Rigor

      <table><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Ethics</td><td style="min-width:100px;border-bottom:1px solid lightgray">IRB: Ethics Statement: The Medical Ethical Committee of the Amsterdam UMC approved this study on January 19th, 2021 (Study number:2021.0170).</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Sex as a biological variable</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Randomization</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Blinding</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr><tr><td style="min-width:100px;margin-right:1em; border-right:1px solid lightgray; border-bottom:1px solid lightgray">Power Analysis</td><td style="min-width:100px;border-bottom:1px solid lightgray">not detected.</td></tr></table>

      Table 2: Resources

      No key resources detected.


      Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).


      Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:
      Our study has a few notable limitations. Although this is one of the first studies to connect viral load with increased risk of mortality, the number of in-hospital deaths in our study population was low, and larger studies are needed to assess the role of viral load in the outpatient setting and after adjustments for potential confounding factors. The data on hospital (and ICU) admission of our study population were collected from the two large teaching hospitals our region that largely cover the adherence area of the Regional Public Health Laboratory Kennemerland (where the tests were performed). It can however not be excluded that (ICU)hospitalization data of some of the included patients were missed when they were admitted to other hospitals in adjacent regions. However, we do not think that this will have influenced our main results as the chance of admission to a hospital in another region is not likely to be related to the initial SARS-CoV-2 viral load of a particular patient and would only have resulted in nondifferential misclassification of our outcome measurement. And finally, including only patients who were able to have themselves tested at Public Health Service testing facilities may have resulted in a healthy selection of all SARS-CoV-2 positive patients, as patients were able to make an appointment and go to the public health care facility. Even though this generally took place after a mean of 2 days, patients who got very ill, or needed to be admitted to the ...

      Results from TrialIdentifier: No clinical trial numbers were referenced.


      Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


      Results from JetFighter: We did not find any issues relating to colormaps.


      Results from rtransparent:
      • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
      • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
      • No protocol registration statement was detected.

      Results from scite Reference Check: We found no unreliable references.


      <footer>

      About SciScore

      SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.

      </footer>

    1. For example, a brainregion may be associated with other behaviors, neurotransmit-ters, or single cell data that in turn might shed light on the orig-inal behavior of interest.

      I think looking at patterns of activation rather than specific brain regions may reveal more insights. As we know different regions perform multiple tasks and pin pointing one specific behavior is difficult.

    1. Author Response:

      Reviewer #1:

      A role for integrins in lowering the threshold for B cell activation was first observed over 15 years ago, but the mechanism has remained elusive. In this paper, Wang et al. investigate the role of LFA-1:ICAM-1 ligation in B cell synapse formation using live-cell super-resolution fluorescence microscopy in both primary B cells and the A20 B cell line. The use of super-resolution imaging is critical to the investigation as it reveals a level of organisation of the actomyosin network that is not visible with conventional microscopy approaches such as TIRF microscopy. They find that LFA-1:ICAM-1 ligation promotes the formation of actomyosin arcs that regulate various activities in the B cell synapse including BCR signalling, BCR:antigen microcluster transport, and the centralisation of antigen. In agreement with earlier studies, they show that LFA-1:ICAM-1 ligation is required for B cells to centralise antigen that is present at very low density. They also demonstrate that myosin IIa contractility is required for the formation of the actomyosin arcs and promotes the exertion of strong traction forces on the antigen- and ICAM-1-presenting substrate. Using a small molecule inhibitor of formin activity in combination with miRNA knockdown of the formin mDia1, the authors show that the actomyosin arcs originate at the outer edge of the synapse and that their generation is formin dependent. These data provide a much-needed advance to our understanding of the role LFA-1 plays in the earliest events in B cell responses to antigen.

      The conclusions of the paper are mostly well supported by the data, but there are a few points that would need to be clarified.

      1) The requirement for LFA-1:ICAM-1 ligation in the formation of the actomyosin arcs is not clear. The authors observe that ~30% of B cells form actomyosin arcs with anti-IgM stimulation only (Figure 1). Does LFA-1:ICAM-1 ligation simply stabilise the arcs and therefore make their appearance more likely, or does it promote the formation of a distinct actomyosin network with unique functions? The images and videos selected to represent cells stimulated with anti-IgM only (Figure 1; Movies 1A and 1B) seem form a highly branched actin network throughout the synapse, but it would be informative to see cells having the actomyosin arcs for comparison. Since B cells stimulated with anti-IgM alone are capable of signalling and centralising antigen, it would be interesting to know whether and how these two populations (with and without arcs) differ.

      We thank the reviewers for their questions regarding this central aspect of our study. In response to the reviewers’ statement “The requirement for LFA-1:ICAM-1 ligation in the formation of the actomyosin arcs is not clear”, our results state that “Consistently, scoring B cells for the presence of a discernable actin arc network showed that the addition of ICAM-1 increases the percentage of such cells from ~30% to ~70% (Fig. 1G).” Importantly, we then state that “dynamic imaging showed that the arcs in cells engaged with anti-IgM alone are typically sparse and transient (Movies 1A and 1B), while those in cells engaged with both anti-IgM and ICAM-1 are dense and persistent (Movies 2A and 2B).” To emphasize this point, which we think is clear when comparing Movies 1A/1B to Movies 2A/2B, we have now added the following two sentences to the text: “In other words, when B cells receiving only anti-IgM stimulation do form discernable arcs (see, for example, those marked by magenta arrows in Fig. 1A and 1B), they are much sparser and less robust than those formed by cells also receiving ICAM-1 stimulation. Moreover, we never saw even one B cell receiving anti-IgM stimulation alone that possessed a robust actin arc network.” Please note that the magenta arrows in Fig. 1A and 1B were added upon revision. In summary, the cell shown in Fig. 1E, which lacks discernable arcs, is representative of ~70% of anti-IgM stimulated cells, while the cell shown in Fig. 1F, which possesses a robust arc network, is representative of ~70% of anti-IgM+ICAM-1 stimulated cells.

      We would also like to address what we think is a misunderstanding regarding our images in Figure 1, as reflected in reviewer 1’s statement: “The images and videos selected to represent cells stimulated with anti-IgM only (Figure 1; Movies 1A and 1B) seem form a highly branched actin network throughout the synapse”. The outer, Arp2/3-generated, branched network comprising the dSMAC/lamellipodium in primary B cells is really quite thin under both stimulation conditions (please see Fig. 1, E1, E2, F1 and F2). In other words, we would not characterize the region between this thin, outer, canonical branched actin network and the central actin hypodense area (i.e. the region corresponding to the pSMAC) in B cells engaged with anti-IgM alone as “a highly branched actin network throughout”. We described it in the text as “a highly disorganized mixture of short actin filaments/fibers and actin foci”. While it likely contains some branched filaments, it is not a canonical branched actin network like the one comprising the dSMAC. Indeed, it is a lot like the mixture of actin asters, actin foci, branched actin and linear filaments described in Hela cells using the same imaging technique ((Fritzsche et al., 2017); we have now cited this paper). Of note, A20 B cells make a much bigger branched actin/dSMAC/lamellipodium than do primary B cells (compare the image of the representative A20 B cell in Fig. 1J to the various images of primary B cells in this figure). Interestingly, this difference between immortalized cells and primary cells is conserved in T cells, as Jurkat T cells make a much bigger branched actin/dSMAC/lamellipodium than do primary T cells (Murugesan et al, JCB 2016).

      Although the reviewers did not specifically comment on why only ~70% of primary B cells engaged with both anti-IgM and ICAM-1 make actomyosin arcs, we note that this is also the case for both Jurkat T cells and primary T cells (Murugesan et al, JCB 2016). We do not know why the number does not go to 100%, but the ~70% limit is the case for both B cells and T cells. Of note, in unpublished work we see that LFA-1 ligation also promotes actomyosin arc formation in T cells.

      With regard to the reviewers’ question “Does LFA-1:ICAM-1 ligation simply stabilize the arcs and therefore make their appearance more likely, or does it promote the formation of a distinct actomyosin network with unique functions?”, we think that ICAM-1 engagement likely leads to the strong activation of RhoA, which then serves to drive both the formation of actin arcs by recruiting, unfolding, and activating mDia at the plasma membrane, and the stabilization and concentric organization of these arcs by activating myosin 2A filament assembly and contractility. In other words, we think ICAM-1 engagement leads simultaneously to the creation and stabilization/organization of the arcs. While it is true that BCR stimulation alone activates RhoA signaling to some extent (see Saci and Carpenter, Mol Cell 2005 and Caloca et al, J Biol Chem 2008), and that this may account for the sparse actin arcs seen in cells stimulated with anti-IgM alone, it is likely that RhoA signaling is more robust with the addition of integrin co-stimulation (Lawson & Burridge, 2014) and that this would promote the creation of the actomyosin arcs seen in these cells. That said, without independent measures of the creation and stabilization/turnover of the arcs, we cannot gauge the relative significance of creation versus stabilization/turnover in determining the steady state amount of arcs. To address this limitation, we have added the following sentence to the section of the Discussion dealing with integrin-dependent signaling pathways leading to actomyosin arc formation: “Finally, future studies should also seek to clarify the extent to which integrin ligation promotes the formation of actomyosin arcs by driving their creation versus stabilizing them once created.

      With regard to the reviewers’ comment that “B cells stimulated with anti-IgM alone are capable of signalling and centralising antigen” we would like to emphasize that our study focuses on B cell immune synapse formation under limiting antigen conditions, where a previous study (Carrasco et al. Immunity 2004) and our data in Fig. S5 show that the impairments in BCR signaling and antigen centralization seen under this condition are rescued by integrin co-stimulation. We expand upon these findings by showing in Figures 5 and 6 that this integrin-dependent rescue of antigen centralization and BCR signaling requires actomyosin. In other words, the actomyosin arc network described here is required for integrin co-stimulation to promote antigen centralization and signaling under limiting antigen conditions. We agree with the reviewer that under non-limiting antigen conditions B cells can signal and centralize antigen in the absence of ICAM-1. That said, these high levels of BCR stimulation are probably not as physiological as limiting BCR stimulation. Finally, our data in Figure S7 shows that antigen centralization in primary B cells receiving non-limiting anti-IgM stimulation alone is also significantly impaired when myosin is inhibited. This suggests that cells receiving high levels of BCR stimulation employ myosin in some fashion to drive antigen centralization. We now close the section describing these results with the following statement: “That said, additional experiments should help define exactly how myosin contributes to antigen centralization in B cells receiving only strong anti-IgM stimulation."

      Finally, and most generally, we avoided the use of the word “requirement” as in the reviewer’s statement “the requirement for LFA-1:ICAM-1 ligation in the formation of the actomyosin arcs is not clear”. Given that some B cells receiving only anti-IgM stimulation create arcs (albeit sparse and transient), we were careful to say throughout the text that ICAM-1 engagement “promotes” actomyosin arc formation. We think our evidence for this is compelling.

      2) The authors propose that the contractile actomyosin network formed in the presence of LFA-1:ICAM-1 interactions promotes B cell activation especially at low antigen concentrations; however, their data focus only on early signalling (pCD79a and pCD19) and it would be helpful to know whether LFA-1:ICAM-1 interactions impact signalling further downstream.

      We thank the reviewer for this important suggestion, which we will address in a future study.

      3) The observation that some GC B cells centralise antigen is very interesting, but there are a few aspects of this investigation that should be expanded upon. The authors show that with LFA-1:ICAM-1 interactions, GC B cells are about equally likely to organise BCR:antigen complexes into peripheral clusters and centralised clusters. It would be informative to have, in the same study (Figure 7), a comparison with GC B cells stimulated with antigen alone. The reason is that other studies investigating GC B cell synapse architecture did not quantify antigen organisation in this way, so it is difficult to make comparisons with previous work. It would also be very useful to see how the actomyosin network is organised in GC B cells exhibiting different synaptic architectures (i.e. peripheral versus central clusters), especially given the critical role of myosin IIa activity in GC B cell antigen affinity discrimination. Additionally, while it is a very interesting observation that LFA-1:ICAM-1 interactions may affect GC B cell synapse organisation, it is not clear whether this has an impact on cellular function. For instance, does antigen and actomyosin organisation in GC B cell synapses contribute to differences in signalling or traction force generation? In the introduction the authors suggest that actomyosin arcs contribute to antibody affinity maturation (line 87-88), but without functional studies to support this claim I think it is too speculative.

      We thank the reviewer for their comments and suggestions regarding our GC data. Our sole purpose in performing the experiments in Figure 7 was to see if GC B cells can also make actomyosin arcs. We did this because recent papers and reviews state that the organization and dynamics of actin at GC B cell synapses are completely different from the organization and dynamics of actin at naive B cells synapses. As such, these initial observations are meant to add to previous work on GC B cells rather than generate direct comparisons. The reviewers appear to agree that the data in Figure 7 shows convincingly that a subset of GC B cells can make actomyosin arcs that are indistinguishable in appearance from those formed by naive B cells (so the specific claim we are making does not “require additional supporting data”). Rather, the reviewers request that we expand on the data in Figure 7 in several ways, some of which we had already mentioned in the Discussion (“While additional work is required to prove that the subset of GC B cells with actomyosin arcs are the ones that centralize antigen, this seems likely given our evidence here that actomyosin arcs drive antigen centralization in naïve B cells.”, and “Future work will also be required to understand why GC B cells vary with regard to actomyosin organization and the ability to centralize antigen 18 (e.g. dark zone versus light zone GCs)”). In addition to these statements, we now end the section describing the results in Figure 7 with the following statement: “We note, however, that our conclusions regarding actomyosin arcs in GC B cells require additional supporting data that include testing the ICAM-1 dependence of actomyosin arc formation and quantitating the contributions that this contractile structure makes to GC B cell traction force, signaling, and antigen centralization.”

      With regard to the reviewers concerns indicated by their comment “In the introduction the authors suggest that actomyosin arcs contribute to antibody affinity maturation (line 87-88), but without functional studies to support this claim I think it is too speculative”, we have changed the relevant sentence to “Finally, we show that germinal center (GC) B cells can also create this actomyosin structure, suggesting that it may contribute to the functions of GC B cells as well”.

      Reviewer #2:

      The manuscript utilizes elegant imaging tools to describe the contractile actomyosin arcs, induced by integrin-ligation, and their involvement in antigen gathering in B cells. The findings are important and have the potential to make a considerable impact in the field. The main conclusions are well supported by strong data and the manuscript convincingly brings across the need of integrin-ligation to induce generation of the arc network and the role of this structure in antigen gathering. The methods and the quality of imaging are state-of-the-art and provide an important example for future studies in B cell immune synapse. Some aspects of the study would benefit from clarification and extended experimentation or analysis.

      1) In addition to cultured B cells, the work includes naïve primary B cells as well as isolated germinal center B cells. While the use of primary cells adds value to the study, in most cases the cells are activated first with LPS prior to transfection with F-Tractin constructs. Such a treatment is likely to alter the cytoskeletal features of the naïve B cells and, thus, it would be informative to provide an analysis of this effect.

      We thank the reviewer for commenting on this. To clarify, we treated primary B cells with LPS to promote cell survival during the harsh nucleofection/electroporation conditions that otherwise kill these fragile cells. Moreover, the cells were rested for 24 hours post-nucleofection in the absence of LPS to promote return to a resting state, as previously described (see(Freeman et al., 2011)). Moreover, only those primary B cells used for live cell imaging of the F-actin using the F-actin reporter F-Tractin were LPS treated. The majority of our experiments employed non-treated ex vivo B cells that were fixed, stained and imaged for quantitation. Importantly, under conditions of ICAM-1 co-stimulation, the actomyosin arcs formed by ex vivo B cells and by LPS-activated cells were indistinguishable. For example, compare the F-Tractin-expressing cell in Fig. 3A to the non-treated cells in Fig. 3D and Fig. 7A. To summarize, then, only live-cell imaging experiments that required F-Tractin to visualize F-actin dynamics were performed using LPS-activated B cells. Finally, we clarified in the Methods that we refer to all primary B cells as “naïve” B cells because they had not been previously activated by antigen at the time of antigen stimulation.

      Reviewer #3:

      The work 'A B cell actomyosin arc network couples integrin co-stimulation to mechanical force-dependent immune synapse formation' by Wang et al. describes the importance of integrin mediated B-cell co-stimulation for IS formation in B-cells by fostering the formation of myosin II A driven actin arcs that are essential in the transport of IgM clusters towards the IS center.

      The work presented here, i.e. experiments and analysis, is very thoroughly done and includes tests and controls using different labelling strategies and constructs of myosin II A, multiple cell types including primary cells and a range of chemical inhibitors to rule out artefacts.

      The authors claim that the observation of actin arcs in B-cells co-stimulated by ICAM-1 - LFA-1 interaction is important for the efficient activation of B-cells in the presence of limiting levels of anti-IgM and this is very well supported by the experiments. However, it was a bit surprising that the paper did not draw much of parallels between the observed phenomenon and the reported actin arcs in activated T-cells even though some of the authors were very much involved in such work on T-cells. If there is a good reason to believe there is no ground to draw comparisons, this would then also need to be highlighted by the authors.

      We thank the reviewer for their comments. We have now added the following two sentences to the Discussion: “It is also important to note that the contractile actomyosin arcs described here in B cells and the actomyosin arcs described previously in T cells (Murugesan et al., 2016) share much in common as regards formation, organization and dynamics (Hammer et al., 2019; Wang & Hammer, 2020). Going forward, it will be vital to define how these two immune cell types harness the same contractile synaptic structure to accomplish different goals (i.e. antibody production by B cells and target cell killing by T cells).”

      The work on establishing the drivers of actin arc formation and dynamics is well done, but it is important to note that previous work has analyzed actin arc formation in other cell types. Work by Bershadsky has already established many 'ground rules' for the formation of actin arcs and the role of integrin adhesion, formin activity and myosin II in the process (Tee YH, Shemesh T, Thiagarajan V, Hariadi RF, Anderson KL, Page C, Volkmann N, Hanein D, Sivaramakrishnan S, Kozlov MM, Bershadsky AD. 2015. Cellular chirality arising from the self-organization of the actin cytoskeleton. Nat Cell Biol 17:445-457. doi:10.1038/ncb3137). It might be very instructive if the authors could put their findings in relation to this work.

      The formation of actin arcs is also well studied in U2OS cells and the results presented here could highlight interesting general features of this process observed in very different cell types (Tojkander S, Gateva G, Husain A, Krishnan R, Lappalainen P. 2015. Generation of contractile actomyosin bundles depends on mechanosensitive actin filament assembly and disassembly. Elife 4:1-28. doi:10.7554/eLife.06126; Bur-nette DT, Shao L, Ott C, Pasapera AM, Fischer RS, Baird MA, Der Loughian C, Delanoe-Ayari H, Paszek MJ, Davidson MW, Betzig E, Lippincott-Schwartz J. 2014. A contractile and counterbalancing adhesion system controls the 3D shape of crawling cells. J Cell Biol 205:83-96. doi:10.1083/jcb.201311104).

      In this regard, the findings about the importance of myosin II A activity, integrin adhesion and mDia1 in the formation of actin arcs is not that surprising and the authors might rather highlight the important role of these newly studied structures for co-stimulation in B-cells as this is the more novel and insightful bit of the work.

      We thank the reviewer for their comments. Indeed, our prior work in T cells (Murugesan et al., 2016; Yi et al., 2012) also linked formin activity and myosin 2 contractility to the formation of actin arcs and the generation of integrin-based adhesion. We now cite the papers highlighted by the reviewer using the following sentence in the revised Discussion: “It is important to note here that several earlier studies performed using other cell types have also linked formin activity and myosin 2 contractility to the formation of actin arcs and the generation of integrin-based adhesions (Burnette et al., 2014; Tee et al., 2015; Tojkander et al., 2015).” As for highlighting the relevance of our results for the B cell field, we think we have done that by demonstrating the existence of this contractile network in B cells, and by showing that it provides mechanistic insight into how integrin co-stimulation promotes synapse formation and B cell activation when antigen is limiting. Given that many recent studies of actin cytoskeletal dynamics in B cells were performed in the absence of LFA-1 ligation, we think our findings invite a critical “reset” for the way in which future B cell studies should be approached by highlighting the need for integrin co-stimulation when examining the roles of actin and myosin in B cell activation.

    1. We are hearing about an increase in rates of severe anxiety and depression-related concerns. We also know that this may have been even more challenging for people who were already struggling with mental-health concerns. There is emerging data to show that rates of self-injuring behaviors have increased as well.

      I can't begin to imagine how this pandemic affected those who have already existing mental health conditions. And the fact that individuals, and children especially, were not getting the right amount of exercise is definitely a concerning factor to think about.

    1. I don’t know if I could have survived seven years of my childhood without the soul-saving rituals of my Persian culture. I grew up amid the Iran-Iraq War, which killed a million people. Besides the horrors of the war, freedom of thought and expression were severely restricted in Iran after the Islamic revolution. Women bore the brunt of this as, in a matter of months, we were forced to ditch our previous lifestyle and observe a strict Islamic attire, which covered our bodies and hair. We lost the right to jog, ride a bicycle, or sing in public. Life seemed unbearable at times, but we learned to bring meaning into uncertainty and chaos by maintaining grounding practices and developing new ones.

      I think the essay will be about how rituals have provided the author peace in difficult times, and why they may do the same for others. The author is trying to create sadness and empathy. They are also using pathos.

    Annotators

    1. Reviewer #1 (Public Review): 

      Overall, this manuscript presents a careful study of sea star larval nervous system regeneration using new transgenic tools for marking and following cells involved in regeneration. The authors provide a nice, well-written introduction to their study in the Abstract and Introduction sections. I do have one major issue with the wording they are using for describing what can be done with the transgenic tools they have developed. 

      They mention in the third paragraph of the Introduction that "Only cell tracking can definitively establish the origin and trajectory of cells during regeneration and resolve the debate as to the role of stem cells versus cellular reprogramming in echinoderms." And then in the final paragraph they state that "We establish a novel cell lineage tracking system to determine the cellular origin of these regenerated neurons." 

      The system they develop does mark individual sox2 and sox4 expressing cells but I object to it being called a "cell lineage tracking system" as this is a very specific term used for a set of methods that allow for tracing the fate of individual cells and all of their progeny, traditionally through development or with stem cells. In essence cell lineage tracking/tracing provides the identification of ALL progeny of a single cell. According to a Primer on Lineage Tracing by Kretzschmar and Watt (2012) https://www.cell.com/fulltext/S0092-8674(12)00003-700003-7)<br> "For any lineage tracer, the key features are that it should not change the properties of the marked cell, its progeny, and its neighbors. The label must be passed on to all progeny of the founder cell, should be retained over time, and should never be transferred to unrelated, neighboring cells." 

      I strongly believe that the BAC-reporters developed in this manuscript do not fit that definition of a cell lineage tracing/tracking system and new verbiage should be used to describe these tools. These could very simply be referred to as fluorescent BAC-reporters and describe specifically how they are used to mark and follow the fate of cells expressing the Sox2 and Sox4 genes. The only way the language of a cell lineage tracing/tracking system could be used is if they had created a BAC-reporter for a gene that was expressed constitutively throughout a cell lineage as it progresses or if the protein expression (the tracer) was passed along to all progeny of the cell expressing that gene. My understanding is that the gene expression of Sox2 and Sox4 is highly dynamic and thus the label, by definition, is not going to be passed on to all progeny of the founder cell. I do think this is a powerful system, I just object to how the authors have chosen to describe it in the manuscript. Careful rewording can still make the reader aware of the limitations and advantages of this system and will avoid misunderstanding. 

      Therefore, all mentions throughout the manuscript of "a lineage tracing system" would need to be removed and replaced with wording that accurately reflects the true nature of these reporters, simply as photoconvertible expression reporters that can show Sox2 or Sox4 expressing cells. This includes text in the Results and Discussion section, e.g. "To our knowledge, this is the first time that any cell lineage tracing studies have been performed in echinoderm regeneration." 

      Results: 

      The authors nicely present their larval regeneration system and highlight the timeline of when serotonergic neurons regenerate over a period of 21 days. They then demonstrate that embryonic neurogenesis pathways are recapitulated during larval regeneration. Then, they present results from their photoconvertible expression reporters and demonstrate three populations of cells in decapitated larvae. The green-only sox4+ cells are the most interesting population - these are cells that are induced to express sox4 only after decapitation. Comparing embryogenesis and larval development demonstrated that the wound response in larvae involves specifying new sox4+ cells, something that had ended by 4dpf in normally developing larvae. 

      The co-injected double BAC recombinant larvae showed colocalization of sox2+ and sox4+ in regenerating larvae. De novo sox2 expression following bisection together with colocalization with sox4 expression nicely shows that these new sox2+ cells contribute to the neural lineage. Considering that the colocalization appeared to be a rare-ish event (only observed in 4 out of 15 larvae), it would be nice if the authors could comment on why this may be. Is it just a truly rare event to catch or could it have anything to do with the reporters themselves? 

      Same question about the sox2+ cells that do not express sox4:Cardinal by 3dpb. Can the authors comment specifically on whether they think there are multiple subpopulations of sox2+ cells and why some get specified to the neural fate while others do not? 

      The final experiment using cell division inhibitor Aphidicolin was very clever and nicely demonstrates that cells that did not previously express sox2 can be induced in the absence of cell division. It would be helpful if the authors could indicate how many larvae showed this pattern as they did for the previous colocalization experiment. 

      Discussion:

      In the final paragraph of the Discussion, the authors discuss a dichotomy between the use of stem cells versus de- or trans-differentiation in different model systems of regeneration. They describe the planarian system in the following way: "For example, the freshwater planarian, Schmidtea mediterranea, utilizes a population of heterogeneous, pluripotent somatic stem cells, called neoblasts, to proliferate and differentiate to replace body parts (Sánchez Alvarado, 2006)" and contrast this with Hydra and axolotl, saying "Conversely species such as Hydra and axolotl, refate differentiated cells either through dedifferentiation or transdifferentiation (Gerber et al., 2018)." I think this oversimplifies the current understanding of these systems. For example, a recent paper by Raz et al. 2021 (Cell Stem Cell 28(7): 1307-1322.e5) makes the case that Schmidtea mediterranea is capable of having specialized neoblasts undergo fate-switching and "propose a non-hierarchical lineage model for neoblasts, in which a neoblast can specify one of a diverse set of possible fates in the course of a single division and specialized neoblasts can divide to generate neoblasts that can specify different fates." In essence, this could be considered something more flexible and complicated than what the authors described - just using pluripotent neoblasts to proliferate and differentiate to replace body parts. And although Hydra is known to use trans-differentiation during regeneration, this organism also employs stem cells in the process of regeneration. Please see Siebert et al. 2008 (Developmental Biology 313(1): 13-24) for a discussion of how both mechanisms are employed in this regeneration model. Therefore, I think it is an oversimplification to characterize these regeneration models as either using stem cells OR using de- or trans-differentiation. I think in these systems, there is not a simple dichotomy and more flexibility has been demonstrated in how regeneration is accomplished than the authors describe here and the text would need to be revised accordingly.

    1. However, if you are exposed to that word again, the connections will strengthen. If you repeatedly use that word, the connections will become so strong that the word will become part of your long-term memory.2 “As a single footstep will not make a path on the earth, so a single thought will not make a pathway in the mind. To make a deep physical path, we walk again and again. To make a deep mental path, we must think over and over the kind of thoughts we wish to dominate our lives.” –Henry David ThoreauThoreau was really onto something. The best metaphor for understanding neuroplasticity, as it relates to learning and forgetting, is to imagine creating a path through the forest. If no one has ever walked there, there will be no path to follow. The first walk will be very difficult: It will be unclear which way you should go, and there will be bushwhacking. This is the struggle of learning something new, the struggle of being a beginner.Bushwhacking:If the path is walked repeatedly, the brush gets cleared, and a visible trail through the forest begins to appear. The path becomes easier to follow. You may still get lost sometimes, but at least you’re done bushwhacking. This is what it’s like to have a basic understanding of a new idea. This is what it’s like to be an intermediate.

      I like this comparison about the brush and the path. It really gives us that imagery that helps us understand what learning does to our brains.

    1. Author Response:

      Reviewer #1:

      In this study, the authors use CyTOF-based analysis to characterise spike-specific T cell responses following mRNA vaccination. They seek to understand both the breadth of responses to 'wildtype'-like and variant spikes, as well as the differences between T cell responses from convalescent and previously uninfected subjects. Consistent with other studies, they find that spike-specific T cell responses are similar across different variants, both in frequency and phenotype. In contrast, however, they identify several phenotypic differences in the T cell response elicited by infection, vaccination, or vaccination following infection.

      Despite a somewhat limited sample size, they clearly identify changes in memory phenotype and chemokine receptor expression that may affect T cell trafficking to mucosal tissues across infection and vaccination. While inclusion of additional chemokine receptors (such as CXCR3) in the CyTOF panel would have aided in characterising these cells, this data highlights how infection and vaccination may elicit distinct T cell responses.

      In fact CXCR3 and CCR4 were chemokine receptors that were considered for the panel, but could not be included as antibodies against these antigens do not stain properly on cells fixed with paraformaldehyde (PFA), and for logistical and biosafety reasons the specimens analyzed in this study had to be PFA-fixed before CyTOF staining. Although we have previously analyzed expression of CXCR3 and CCR4 on T cells by CyTOF (Cavrois et al, Cell Reports 2017 20(4):984 PMID: 28746881; Xie et al, Cell Reports 2021 35(4):109038 PMID: 33910003), those studies were exclusively performed on viable cells, and not on COVID-19 patient specimens. All our prior CyTOF phenotyping studies using COVID-19 patient specimens (Neidleman et al, Cell Reports Medicine 2020 1(6):100081 PMID: 32839763; Neidleman et al, Cell Reports 2021 36(3):109414 PMID: 34260965; Ma et al, J Immunol 207(5):1344, PMID 34389625), as well as some of our non-COVID-19 studies (Ma et al, Elife 9:e55487 PMID: 32452381; Neidleman et al, Elife 2020 9:e60933 PMID: 32990219), were performed on fixed cells, where CXCR3 and CCR4 unfortunately could not be included as parameters analyzed.

      Future studies will be required to better assess the functional impacts of these phenotypic differences on T cell recall and contribution to protective immunity.

      We absolutely agree that future studies should be pursued to better assess the functional impacts of the phenotypic differences on T cell recall, and on contribution to protective immunity. Such studies will most certainly require use of animal models, and in fact are studies that we have just begun (mouse model) or will soon begin (non-human primate model). To fully acknowledge the need for such functional studies, we have now added to multiple sections of the Discussion the need for future studies to incorporate animal models (Line 472 and Lines 488-491), including the statement “Such follow-up studies should also examine the functional outcomes of the discoveries made here (e.g., effect of chemokine receptor expression on homing of infection- and vaccine-elicited SARS-CoV-2-specific T cells), including in animal models of SARS-CoV-2 infection.”

      Reviewer #2:

      The authors address an important question, whether it people who have had Covid19 and are then vaccinated with one mRNA Spike vaccines made better immune responses than those who had not previously been infected and have two shots of the vaccine. They also compare responses to different virus variants and find extensive cross reactions and no differences between the groups - an important result.Their main finding is a difference in the quality of the CD4+ T cells in the 'Covid-vaccinees' compared to the 'naive double vaccines'. They suggest that T cells in the former may home better to the respiratory tract and persist longer.

      The major strengths are:

      • The methodology used, based on Cytof multiparameter analysis of antigen responding CD4 and CD8 T cells.

      • Demonstration that the second vaccine dose in the naive group 'improves' the T cell response.

      • Demonstration that a second vaccination in the Covid19 group does not improve the T cells.

      We thank the Reviewer for the nice summary and for the positive comments.

      Weaknesses:

      Fully (and commendably) acknowledged in the manuscript:

      • The study groups are small

      • The antigen specific T cells are stimulated in vitro so may be distorted, nevertheless there were still differences

      We agree with the Reviewer about the listed weaknesses of the study. We note that we had in our original manuscript acknowledged all these weaknesses within our “Limitations” section, including the fact that we had to stimulate our samples to identify and characterize the SARS- CoV-2-specific T cells. We have now expanded the part about our having stimulated the samples, by proposing that future studies should take advantage of tetramer technology to characterize cells in their baseline (non-stimulated) states, whilst acknowledging that such studies would for the most part be limited to CD8+ T cell responses as tetramer reagents for CD4+ T cells are less robust (Lines 500-506).

      Not acknowledged but possibly outside the scope of this study:

      • The reader will wonder how this affects the antibody response which ultimately is the main protector from reinfection and also how the T cell responses might impact on disease severity after post vaccination (re)-inrfection

      Serological assays were not performed in this study; however we fully agree with the importance of associating the in-depth phenotypes of vaccine-elicited SARS-CoV-2-specific T cells with the antibody response. In fact, just as we went very “deep” into the phenotypes of SARS-CoV-2-specific T cells in this study, we are at the moment optimizing techniques to, in an analogous fashion, deeply characterize the serological response to vaccination. This entails optimizing a flow cytometry-based approach we recently introduced and implemented on a small number of specimens (Ma et al, J Immunol 207(5):1344, PMID 34389625), to be able to simultaneously assess the levels of IgA1, IgA2, IgE, IgG1, IgG2, IgG3, IgG4, and IgM against the S1, S2, and RBD domains of the SARS-CoV-2 spike protein in a large number of patient specimens. Once we’ve optimized the assay and applied it on the vaccine specimens, we plan to associate the resulting 24-parameter serological datasets (8 isotypes of antibodies each against 3 antigens = 24 parameters total) with the high-dimensional SARS-CoV-2-specific T cell datasets from this study, but that will be its own separate (and large) study and beyond the scope of this current one. As generating such serological data will take at least 3-6 months to complete, and the focus of this study is on SARS-CoV-2-specific T cells (and all conclusions we drew were based only on the T cell data), we think it appropriate that we limit this study to deep-phenotyping of the T cells. We have now brought up in the last part of our “Limitations” section the lack of serological analysis in this current study as a limitation, and how follow-up studies should associate serological responses with the T cell responses characterized here. (Lines 506-511: “A final limitation is that serological analyses were not performed in this study. As coordination between the humoral and cellular arms of immunity are likely key to effectively controlling viral replication, future studies should assess to what extent the breadth, isotypes, and functional features of spike-specific antibodies elicited by vaccination associate with the herein described phenotypic features of vaccine-elicited SARS-CoV-2-specific T cells.”)

      With regards to how T cell responses might impact disease severity and breakthrough infections, this is an aspect we are very interested in investigating, as detailed in our final response further below.

  10. Sep 2021
    1. exists outside of a document

      I think it is important to recognize that the original article that motivated much of the early research into Hypertext, including that of Ted Nelson, who coined the term "Hypertext," assumed that "associative links" would, in fact, exist "outside" the document. That article was, of course, Vannevar Bush's "As We May Think," published in the July 1945 Atlantic Monthly.

      Given that Bush assumed that documents were stored primarily as immutable microfilm images, he had no choice but to assume that links would stored be external to the documents which were their subjects. It simply wasn't possible to embed links in documents as we do today with HTML and with other formats that support embedded links.

      Thus, it can be said that the idea of external associative links, or annotations, was actually the original idea for how Hypertext would be implemented. It was only later, long after 1945, that we found that it was convenient to support links embedded in content.

      It is also important to note the external links allow us to more easily do things that can't be done with the more common internal links. For instance, if you're reading a document with internal links, you can easily answer the question "What does this document link to?" However, it is much harder to answer the question: "What documents link to this one?" This is because internal links are only "one-way links. However, external links, which are "two-way links," establish a relationship between documents can exist independent of the documents themselves. Thus, if you have a collection of external links, you can answer the question: "What links to this document?" That question can't be easily answered in today's web unless you've got a web-crawling system like Google's that is capable of reading all documents on the web and then deducing all the back-links.

      In fact, one of the key elements of a Web annotation system is the ability to pass the URL/URI for some web resource to the annotation system and say: "Show me what links (annotations) exist for this resource!" Of course, each of the annotations which contains the external links would itself have a unique identifier and should thus be something that can be annotated or linked to. In this way, we can have annotations of annotations as well as the same kind of forking "associative trails" that present alternative paths through the document space that Vannevar Bush imagined in his 1945 Atlantic Monthly article. In other words, when we allow for external links, we elevate the "link" or "annotation" to something which is a first-class object on the Web instead of leaving it as a mere attribute of some web resource.

      To a great extent, raising links to the status of first class objects "completes" a large part of the journey from idea to implementation that began with Vannevar Bush's 1945 vision. We should understand that supporting annotations doesn't just provide a "nice new feature." It provides the foundation for what is today a very unfamiliar method for interacting with the web as a record of human experience and knowledge. The idea is very old. Nonetheless, we have yet to begin to have much experience with its use and implications.

    1. Legal Outreach, Inc. Fall 2021 Constitutional Law Debate 8 The Court has stated that a suspect has the right to terminate interrogation at any time prior to or during the interrogation by invoking (i) the right to remain silent or (ii) right to counsel.2 If a suspect invokes the right to remain silent, the police must honor the request and no longer question the accused. However, police may re-question a suspect about the same crime if, after a break, fresh Miranda warnings are given. Moreover, asking an incarcerated individual at a later time about crimes unrelated to the reason such person is incarcerated has been found to be permissible (Howes v. Fields). If a suspect invokes the right to counsel, and does so unambiguously and specifically (e.g., the suspect says s/he wants counsel to assist with the interrogation), all questions must cease until counsel is provided. Allowing the suspect to consult with counsel and then resuming the interrogation after counsel has left does not satisfy the right – counsel must be present during the interrogation. Once the witness asserts the right to terminate interrogation and asks for counsel, restarting of the interrogation by police on any topic violates the Fifth Amendment. It is worth noting that a suspect may always waive the right to counsel if, while waiting for counsel, s/he reinitiates the questioning. A suspect may also waive all his/her Miranda rights if s/he does so knowingly, voluntarily, and intelligently. Silence or shrugging does not qualify as a waiver. Interrogation Analysis In addition to being in custody, the suspect must actually be under interrogation to require Miranda warnings. This does not mean that the police officers have to be directly questioning the suspect but can be saying or acting in a way that the police should know are reasonably likely to elicit an incriminating response from the suspect. Whether it’s reasonably likely depends on the suspect’s perceptions, not the police officers’ intent. Therefore, you need to ask yourself whether a reasonable person in the suspect’s position would think, based on the officers’ actions and words, that they were being questioned about the crime or their involvement in that crime. Voluntariness If a defendant confesses to a crime, it must be voluntary and without coercion. A confession is separate and independent of Miranda warnings. The court will examine the voluntariness of a confession whether or not the suspect has been Mirandize

      I think that this is important because the cases in the voluntary section talked about how the suspect has to confess of their own free will and without intimidation. So we need to see if this was the case with Lav.

    1. a posteriori Stochastic Block Model, Recap We just covered many details about how to perform statistical inference with a realization of a random network which we think can be well summarized by a Stochastic Block Model. For this reason, we will review some of the key things that were covered, to better put them in context: We learned that the Adjacency Spectral Embedding is a key algorithm for making sense of networks we believe may be realizations of networks which are well-summarized by Stochastic Block Models, as inference on the the estimated latent positions is key for learning about community assignments. We learned how unsupervised learning allows us to use the estimated latent positions to learn community assignments for nodes within our realization. We learned how to align the labels produced by our unsupervised learning technique with true labels in our network, using remap_labels. We learned how to produce community assignments, regardless of whether we know how many communities may be present in the first place. { requestKernel: true, binderOptions: { repo: "binder-examples/jupyter-stacks-datascience", ref: "master", }, codeMirrorConfig: { theme: "abcdef", mode: "python" }, kernelOptions: { kernelName: "python3", path: "./representations/ch6" }, predefinedOutput: true } kernelName = 'python3'

      I think this recap should be the introductory paragraph, and should be expanded

    1. Reviewer #3 (Public Review):

      This paper is based on digital reconstruction of a serial EM stack of a larva of the annelid Platynereis and presents a complete 3D map of all desmosomes between somatic muscle cells and their attachment partners, including muscle cells, glia, ciliary band cells, epidermal cells and specialized epidermal cells that anchor cuticular chaetae (circumchaetal cells) and aciculae (circumacicular cells). The rationale is that the spatial patterning of desmosomes determines the direction of forces exerted by muscular contraction on the body wall and its appendages will determine movement of these structures, which in turn results in propulsion of the body as part of specific behaviors.

      To go a step further, if connecting this desmosome connectome with the (previously published) synaptic connectome, one may gain insight into how a specific spatio-temporal pattern of motor neuron activity will lead, via a resulting pattern of forces caused by muscles, to a specific behavior. In the authors' words: "By combining desmosomal and synaptic connectomes we can infer the impact of motoneuron activation on tissue movements". This is an interesting idea which has the potential to make progress towards understanding in a "holistic" way how a complex neural circuitry controls an equally complex behavior. The analysis of the EM data appears solid; the authors can show convincingly that desmosomes can be resolved in their EM dataset; and the technology used to plot and analyze the data is clearly up to the task. My main concern is with the way in which the desmosome pattern is entered in the analysis, which I think makes it very difficult to extract enough relevant information from the analysis that would reach the stated goal.

      1.The context of how different structures of the Platynereis larval body, by changing their position, move the body needs much more introduction than the short paragraph given at the end of the Introduction.<br> -My understanding is that the larval body is segmented, and contraction of the segments can cause a certain type crawling or swimming: does it? Do the longitudinal muscles, for example, insert at segment boundaries, and alternating contraction left-right cause some sort of "wiggling" or peristalsis?<br> -In addition, there are segmental processes (parapodia, neuropodia), and embedded in them are long chitinous hairs (Chaetae, Acicula). Do certain types of the muscles described in the study insert at the base of the parapodia/neuropodia (coming from different angles), such that contraction would move the entire process, including the chaetae/acicula embedded in their tips?<br> -Or is it that only these chaetae/acicula move, by means of muscles inserting at their base (the latter is clearly part of the story)? Or does both happen at the same time: parapodium moves relative to the trunk, and chaeta/acicula moves relative to the parapodium? How would these movements lead to different kind of behaviors?<br> -Diagrams should be provided that shed light on these issues.

      2.The main problem I have with the analysis is the way a muscle cell is treated, namely as a "one dimensional" node, rather than a vector.<br> -In the current state of the analysis, the authors have mapped all desmosomes of a given muscle cell to its attached "target" cell. But how is that helpful? The principal way a muscle cell acts is by contracting, thereby pulling the cells it attaches to at its two end closer together. As the authors state (p.4) "...desmosomes..are enriched at the ends of muscle cells indicating that these adhesive structures transmit force upon muscle-cell contraction."<br> -for that reason, the desmosomes at the muscle tips have to be treated as (2) special sets. Aside from these tip desmosomes there are other desmosomes (inbetween muscles, for example), but they (I would presume) have a very different function; maybe to coordinate muscle fiber contraction? Augment the force caused by contraction?<br> -As far as I understand for (all of) the desmosome connectome plots, there is no differentiation made between desmosome subsets located at different positions within the muscle fiber. I therefore don't see how the plots are helpful to shed light on how the multiplicity of muscles represented in the graphs cause specific types of neurons.<br> -As it stands these plots "merely" help to classify muscles, based on their position and what cell type they target: but that (certainly useful) map could have probably also be achieved by light microscopic analysis.

      3.Section "Local connectivity and modular structure of the desmosomal connectome" p.4-7" undertakes an analysis of the structure of the desmosome network, comparing it with other networks.<br> -What is the rationale here? How do the conclusions help to understand how the spatial pattern of muscles and their contraction move the body?<br> -Isn't, on the one hand (given that position of the desmosome was apparently not considered), the finding that desmosome networks stand out (from random networks) by their high level of connectivity ("with all cells only connecting to cells in their immediate neighbourhood forming local cliques") completely expected?<br> -On the other hand, does this reflect the reality, given that (many?) muscle cells are quite long, connecting for example the anterior border of a segment with the posterior border.

      4.In the section "Acicular movements and the unit muscle contractions that drive them" the authors record movement of the acicula and correlate it with activity (Ca imaging) of specific muscle types. This study gives insightful data, and could be extended to all movements of the larva.<br> -The fact that a certain muscle is active when the acicula moves in a certain direction can be explained (in part) by the "connectivity": as shown in Fig.7L, the muscle inserts at a circumacicular cell on the one side, and to an epithelial (epidermal?) cell and the basal lamina on the other side. But how meaningful is a description at this "cell type level" of resolution? The direction of acicula deflection depends on where (relative to the acicula base) the epithelial cell (or point in the basal lamina) is located. This information is not given in the part of the connectome network shown in Fig.7L, or any of the other graphs.

    1. The other pitfall I call “filter feeding”- attempting to glean the necessary “nutrients” from a source only while reading it, and not even bothering trying to take any notes down. This may be the default state when drinking from the fire hose. Reading endless blogs, social media, or even books without challenging ourselves through writing and discussion can lead to the experience of feeling, as Postman describes, like we know “of” many things without really knowing about them.

      Reading vast amounts of information can lead one to think they know a lot, but retention is not good. To increase retention one should write, converse or otherwise engage with the information.

    1. Author Response:

      Reviewer #1:

      The authors address an interesting but neglected issue in pigment cell biology, concerning the developmental origin of red erythrophores, especially in relationship to yellow xanthophores, and the genetic basis for their differing pigmentation. Red-yellow colouration in vertebrates usually arises from accumulation of dietary carotenoids, and often has significant behavioural importance, e.g. as an honest signal of individual quality. This and the biochemistry of carotenoid colour variation is nicely covered in the Introduction, providing helpful background to a broad audience.

      The authors document the widespread presence of erythrophore in Danio, highlighting the unusual nature of Zebrafish within the genus as lacking them. They then develop some quantitative and objective measures of the xanthophores and erythrophores based upon Hue and Red:Green autofluorescence ratios, allowing clear distinction of the mature cell-types, and note the often binucleate nature of the erythrophores.

      The authors then use a variety of tools to assess, with differing degrees of certainty, the lineage relationships of the erythrophores; together these provide a consistent and convincing picture of shared lineage between the two cell-types. This is consistent with the observed gradual shift in properties of proximal cells from xanthophore-like to erythrophore. A more direct test of the conversion of early xanthophores to erythrophores comes from the clonal analysis of aox5:nucEosFP cells (Fig. 4). They then use a fin regeneration assay to assess the plasticity of these cells in the mature adult. This is a neat experiment, but I am struggling with the interpretation of Figure 5A: which cells are being used as landmarks to justify the conclusion that the cells shown are clonally-derived form that single cell in the 5 dpa image? It may be that the full series of images could be provided in a supplementary figure and might make this clear, but the current images do not seem convincing to me. The experiment in Fig. 5B is convincing, so conclusion seems sound.

      We added a supplementary figure (Figure 5—figure supplement 1) to show more context and nearby landmarks, including the amputation plane. We additionally swapped out the images in Figure 5A with an example that more clearly makes our point that cells seem to both lose red coloration and increase in number. Cells of both the original and the new example are visible in the new supplemental figure. Given the concern expressed we additionally modified the salient portion of the text, to make it clearer that the brightfield-only analyses were intended merely to see if a transformation is plausible, based on overt cell colors and behaviors in the absence of formal clonal analysis. The revised text reads:

      “We first assessed the possibility that transfating occurs by repeatedly imaging individual fish in brightfield, to learn whether cells near the amputation plane might lose their red color during regenerate outgrowth. Individual erythrophores could often be reidentified using other cells as well as distinctive features of fin ray bones and joints as landmarks (Figure 5A; Fig- ure 5—figure supplement 1). As regeneration proceeded, small groups of cells having paler red or orange coloration, were sometimes observable where individual cells of deep red col- oration had been found, suggestive of proliferation and dilution of pre-existing pigments. Later, only yellow cells were found in these same locations. These observations were con- sistent with the possibility of erythrophore → xanthophore conversion, and so to test this idea directly we marked nucEosFP+ erythrophores by photoconversion prior to amputation (Figure 5B; Figure 5—figure supplement 2A). ”

      The authors then use a transcriptomic comparison to identify candidate genes influencing erythrophore v xanthophore differentiation. They study 3 with mutant phenotypes affecting these cell-types, identifying likely roles of 3 erythrophore genes. Whilst most of this analysis is beautifully presented, I am confused by Fig. 7 in which I think panel D and F as described in the legend are inverted.

      We fixed the relative ordering of panels and legends. We also changed the Y axis label in Figure 7F to indicate cells per 40 μm2 rather than density, which might be misinterpreted to mean cells per mm.

      As is expected form this lab, the manuscript is generally very carefully and clearly written and includes thorough data presentation and statistical analysis. Conclusions drawn are appropriately nuanced, and justified by data presented. The manuscript provides an important first step in understanding the developmental relationship of erythrophores to xanthophores, and a number of genetic resources for the further exploration of this question.

    1. A bald patch will give us away.

      This opening sequence is interesting, but quite frankly, very confusing! I am trying to picture the actions of that actors but many of them just seem far-fetched for the stage, or requiring makeup and special effects to simulate the blood or makeup. I do have the understanding that Brecht's Epic Theatre is extremely stripped down, so this would not be required. It would be up to the actor to show the audience with their actions what is happening to them in this sequence. I absolutely see the value in striping down the entire play to have the audience accept this world as it is. We may not see this bald spot, but through dialogue and action we will certainly know it is there. I think I am having a difficult time reading this after all of the realism and naturalism we have been reading. Directing a Brecht play seems like a monster, but a fun one.

    1. Author Response:

      Reviewer #2 (Public Review):

      This manuscript details an investigation into whether blinding NIH grant reviewers to the name and institution may affect their review scores. They demonstrate that unblinded grants lead to slightly higher scores for white applicants than blacks, however, a deeper dive demonstrates that grantsmanship and history of prior funding can be even greater predictors of scores regardless of race.

      Overall the manuscript touches on a presently vogue topic and that is of equality in outcomes and systemic racism. The major limitations of the study however, are ironically demonstrating the very topic that the manuscript tries to address. There are no considerations in the manuscript or mention of applications from Asian, Hispanic or Native American applicants, as the authors distill the problem literally down to only Black and White.

      We now incorporate more of this perspective.

      We rewrote the introduction, adding information about funding rates for Hispanic and Asian PIs (the 2 largest groups of minority applicants), and provided a stronger explanation for why this study focused on Black-White differences only (lines 86-95) Our aim was to provide a broader context while keeping the intro reasonably focused. Demographic differences in patterns of application numbers, review outcomes, and funding success is a complex topic, not easily presented concisely. More importantly, we think that this information, while no doubt of interest to some, is not relevant background to the experiment at hand. We tried to strike a balance between context and focus.

    1. Author Response:

      Reviewer #1:

      This work demonstrates that functional KATP channels exist in most neuronal cell types in the mouse somatosensory cortex. While the transcriptomic profiling of electrophysiologically characterized neurons is only indicative of the existence of the Kir6.2/SUR1 KATP channel, the acute slice pharmacological/electrophysiological experiments convincingly supports this notion.

      The uncertainty of single-cell RT-PCR is likely due to a small amount of starting material inherent to the sample collection method. As the authors discuss, low copy numbers of target transcripts may also have contributed to the negative/uncertain results.

      We fully agree that scRT-PCR analysis underdetected Kir6.2 (kcnj11) and SUR1 (abcc8) mRNAs. This is likely due to their low abundance at the single-cell level, the sample collection method and the low efficiency of the reverse transcription (RT).

      As requested by reviewer 2 we now report the low detection rate of these subunits in neurons responsive to diazoxide and tolbutamide and acknowledge the limitation of scRT- PCR (pages 7,8, lines, 34,1-6).

      We have also improved the discussion by providing the copy number of these mRNAs detected by single cell RNAseq (Zeisel et al. 2015, DOI: 10.1126/science.aaa1934, data available online https://linnarssonlab.org/cortex/) and the estimated sensitivity limit of the scRT-PCR (page 13, lines 29-33).

      Next, the authors demonstrate that lactate is taken up by neurons and elevates the discharge rate via an increased ATP production due to the oxidative metabolism downstream of lactate, which is in line with earlier studies including Ivanov et al. (2011, doi: 10.3389/fnene.2011.00002).

      We thank the reviewer for pointing out this reference that we have added in the discussion (page 17, line 16).

      The authors showed this by introducing 15 mM lactate, and discuss a possibility that extracellular lactate can be elevated by a systemic increase of lactate. However, such an increase is likely more modest in the brain (Carrard et al., 2018, doi: 10.1038/mp.2016.179). So, the lactate-enhanced firing might occur in extreme conditions such as during anoxia or ischemia; however, intracellular ATP would most probably decrease and hence KATP channels would open in this case. A discussion on extracellular lactate levels in physiological conditions would be helpful.

      We have improved the discussion on the physiological extracellular level of lactate which can be as high as 5 mM at rest. Since during neuronal activity lactate levels are almost doubled (i.e. up to 10 mM), lactate-enhanced firing might occur under physiological conditions (page 18, lines 9-13). We agree that a systemic lactate increase modestly elevates its extracellular concentration to a level with little or no effect on firing rate. Accordingly we now also quote references reporting this observation. Nonetheless, peripheral lactate could represent an additional source facilitating lactate-sensing when both the brain and the body are active, as during physical exercise (page 18, lines 13-19).

      Overall, this is a rigorous study that confirms the existence of functional KATP and dominant oxidative metabolism in most types of juvenile somatosensory cortical neurons.

      Thank you.

      Reviewer #2:

      The authors present an impressive array of experiments testing the effect of lactate on a number of neocortical cell types. They uncover a mechanism by which lactate might enhance neuronal firing although direct physiological relevance needs further support for CSF lactate concentrations. Most of the experiments are sound and interesting and the remaining experiments have limitations inherent to the methodology and presented accordingly in the discussion. The results are convincing, however a number of specific points need to be addressed.

      We thank the reviewer for the specific points raised that helped us to improve and clarify the manuscript.

      Specific points:

      • Page 6 line 21 onwards. The authors state consistent expression of Kir6.2 and SUR1 in various cortical cell types. Data presented in Fig1 challenge this statement showing that Kir6.2 and/or SUR1 was expressed in the minority of cells tested regardless of cell type. For example, out of the 10 intrinsically bursting cells shown in the Ward cluster plot on Fig1A-B, only two was positive for Kir6.2 according to Fig1D. Surprisingly, Fig1F shows that 10% of intrinsically bursting cells express Kir6.2 which is clearly not the case (it is 20%).

      We thank the reviewer for pointing out this apparent incoherence. Indeed, Fig. 1D showed two intrinsically bursting cells that appeared positive for Kir6.2. However, one of them was also positive for genomic control and was discarded from the calculation of detection rate, as already discussed (pages 13,14 lines 34,1-5). For the sake of clarity Fig. 1D now depicts potential Kir6.2 false positive as shaded colored rectangles.

      Amplification was used for the detection of mRNAs by the authors, thus it is unlikely that detection threshold plays a role in having Kir6.2 or SUR1 negative cells.

      We agree that PCR amplification can detect a single DNA molecule (e.g. Li et al 1988, DOI: 10.1038/335414a0). However, the low reverse transcription (RT) efficiency is an important limiting factor for the mRNA detection by scRT-PCR. In addition, dendritic mRNAs are almost inaccessible to the harvesting from a somatic patch pipette, thereby decreasing the detection rate. Similar issues of mRNA detection by scRT-PCR have been reported for neuropeptide receptors despite a functional expression in a majority of recorded pyramidal cells (Gallopin et al. 2006, DOI: 10.1093/cercor/bhj081). scRT-PCR detection limit was estimated to be around 25 molecules of mRNA in a previous study quantifying at the single-cell level AMPA receptor mRNAs harvested in the patch pipette (Tsuzuki et al. 2001, DOI: 10.1046/j.1471-4159.2001.00388.x).

      We have now improved the discussion by providing the copy number of Kir6.2 (kcnj11) and SUR1 (abcc8) mRNAs detected by RNAseq from single isolated cells (Zeisel et al. 2015, data available online https://linnarssonlab.org/cortex/). The estimated sensitivity limit of the scRT-PCR is also now provided (page 13, lines 29-33).

      Along the same vein, amplification makes it difficult to understand what the authors mean by "low copy number at single cell level". Specifically, the sentence (p6l22-25) is self-conflicting suggesting reliable detection of KATP subunits yet downplaying the significance of moderate single cell detection rates.

      Since the point on the "low copy number" is now discussed in more detail the sentence has been removed from the results section. To avoid confusion between detection and expression we now use only "detection" for scRT-PCR data and "expression" for functional data. Accordingly, in Figures 1F, 3B, 6A and S5, "Occurrence" was changed to "Detection rate".

      I think a moderate statement with percentages of expression would adequately describe the findings with an emphasis on potential variability between individual cells regardless of cell type. Throughout the text, the authors should avoid the use of uniform expression of KATP channels in neurons.

      • Page 6 line 30. The authors conclude co-expression of Kir6.2 and SUR1 subunits. Fig1D shows that out of approximately n=71 Kir6.2 positive cells and n=28 SUR1 positive cells only n=16 expresses Kir6.2 and SUR1 together and the evidence presented shows that n=83 cells do not co-express Kir6.2 and SUR1. Again, the conclusion in the manuscript seems biased towards the minority of cases and does not reflect the overall dataset. Accordingly, the suggestion that neurons and beta cells use the same KATP channel is not supported (p6l32).

      The statement has been mitigated as follows (page 6, lines 21-27): "Apart from a single Adapting NPY neuron (Figure 1D), where Kir6.1 mRNA was observed, only the Kir6.2 and SUR1 subunits were detected in cortical neurons (in 25%, n=63 of 248 neurons; and in 10%, n=28 of 277 of neurons; respectively). The single-cell detection rate was similar between the different neuronal subtypes (Figure 1F). We also codetected Kir6.2 and SUR1 in cortical neurons (n=14 of 248, Figure 1D) suggesting the expression of functional KATP channels."

      We have also avoided the use of uniform expression throughout the text and do not refer anymore to pancreatic beta-cell like KATP channels in the results section.

      • KATP channel presence in neurons. With respect to the points above, it would be helpful to see in the results section and possibly on Fig2 whether there is an electrophysiological indication of pharmacologically unresponsive cells. This would help in assessing the relative sensitivity of the two approaches. Fig.2G is helpful here, however signal to noise is hard to assess in the current version in individual experiments. Please state if single cell PCR was performed on any pharmacologically examined cells.

      We now clearly report that all neurons pharmacologically analyzed in voltage clamp were responsive to diazoxide and tolbutamide. We also mention the range of the effects of these KATP channel modulators on membrane resistance and whole-cell current (page 7, lines 12-15).

      We thank the reviewer for suggesting to state if scRT-PCR was performed on pharmacologically examined cells, which helps to evaluate the relative sensitivity of scRT- PCR and pharmacological/electrophysiological experiments. We now report the number of neurons pharmacologically characterized and successfully analyzed by scRT-PCR (pages 7,8, lines 34,1-6). All these neurons were found to express functional KATP channels, but Kir6.2 and SUR1 subunits were detected in only a minority of them. We thus conclude that scRT-PCR underdetects these mRNAs.

      Fig3B recapitulates the results of Fig1 that only a small fraction of RS cells express Kir6.2 and SUR1.

      Since scRT-PCR is less sensitive than electrophysiological investigations, as just discussed above, the absence of detection of mRNAs does not mean an absence of functional expression of KATP channels. The absence of outward ATP-washout current in Kir6.2 KO neurons, in marked contrast with neurons from wild-type mice, supports the notion of a widespread functional expression of Kir6.2-containing KATP channels in cortical neurons. To avoid the confusion between detection and expression, we have reformulated the sentence (page 8, lines 11-12) as follows: "We first verified that Kir6.2 and SUR1 subunits can be detected in pyramidal cells from wild type mice by scRT-PCR".

      In spite having a clever pharmacological design, due to limitations inherent to spatially nonspecific drug application methods, one cannot exclude that the results measured on individual cells could also reflect network interactions with astrocytes and/or neurons and should be discussed.

      We agree with the reviewer that bath applications of drugs can induce network effects leading to potential confounding results. However, the kinetics and biophysical properties of the whole-cell currents recorded during pharmacological manipulations do not support such a network effect. This possibility, nonetheless, is now discussed page 13, lines 18- 23.

      We have also discussed the possibility that the blockade of lactate transport by 4-CIN could reflect an impairment of lactate uptake by neurons but also of lactate release by astrocytes. However, under our conditions the contribution of astrocyte-derived lactate is expected to be negligible (page 16, lines 10-18).

      • Lactate concentration in blood vs CSF. As the authors point out, there is a discrepancy in glucose concentration between the blood and CSF, yet they use lactate concentrations measured in the blood (and not in the CSF) during exercise in their experiments. The physiological relevance of these experiments is unclear unless there is evidence that lactate concentration in the CSF is indeed in the range found effective here.

      We thank the reviewer for pointing out the discrepancies between plasma and extracellular levels of glucose vs. lactate. Although surprising at first, and in contrast to glucose, extracellular lactate level is higher than its plasma level. Such a difference, most likely reflects the ability of the brain produce lactate and not glucose.

      As also requested by reviewer 1 we have improved the discussion on the physiological extracellular level which can be as high as 5 mM at rest. Since during neuronal activity lactate levels are almost doubled (i.e. up to 10 mM), we believe that lactate-enhanced firing might occur under physiological conditions (page 18, lines 9-13).

      We have improved the rationale of the lactate concentration used which is an isoenergetic condition to 10 mM glucose for having the same number of carbon atoms (page 10, lines 4-5).

      We also discuss the possibility, that peripheral lactate could represent an additional source facilitating lactate-sensing when both the brain and the body are active, as during physical exercise (page 18, lines 13-19).

      • MCT1 and MCT2 expression and widespread lactate effects. Here, the authors admit that relatively low single cell detection rates were observed for MCT1 (19%) and MCT2 (28%). It seems consistent (and a bit worrisome) throughout the manuscript that expression of mRNAs additionally tested functionally have a limited range of PCR detection yet (again) ubiquitous presence was found when tested pharmacologically.

      Similar to KATP channels subunits and as reported by single cell RNAseq data (Zeisel et al. 2015, DOI: 10.1126/science.aaa1934, data available online https://linnarssonlab.org/cortex/), MCT1 (slc16a1) and MCT2 (slc16a7) are expressed in cortical neurons at a copy number below the detection limit of scRT-PCR.

      We have now discussed the discrepancy between MCT1 and MCT2 detection and the widespread lactate effects which are most likely due to their low abundance at the single cell level (pages 15,16, lines 32-34, 1-6). We also provide a counter example with LDH subunits which are expressed at higher single-cell levels, and for which a higher scRT- PCR detection rate was found to match the functional data (page 16, lines 6-9).

    1. Author Response:

      Reviewer #2:

      This paper investigates cell size-dependent regulation of G1/S cell cycle transition in budding yeast, with a focus on the relationship between the activator Cln3 and the inhibitor Whi5. A prominent 2015 paper proposed that cell growth dilutes the inhibitor Whi5 while Cln3 levels remain constant. This 'inhibitor dilution' model has been challenged by several recent papers. In the present paper, Sommer et al. perform a series of quantitative western blots of whole cell extracts from synchronized cell cultures. They show that Cln3 concentration increases 10-fold before bud emergence (i.e. G1/S) but Whi5 concentration is largely constant, at least in rich media. Similar results were obtained in poor carbon media with a smaller increase in Cln3. These data argue against the inhibitor-dilution model and indicate that Cln3 levels are tuned by carbon availability and cell growth rate. Interestingly, Cln3 increases are not dependent on actin-based growth or bud emergence, but rather depend on membrane trafficking and TORC-SGK signaling. A series of experiments altering ceramide synthesis identify a link with Cln3 synthesis, although it remains unclear how directly this ceramide-Cln3 connection occurs.

      The combination of results in this paper represent a significant contribution to the field. Major strengths include the careful quantitation of Whi5/Cln3 levels, and the clear effects on Cln3 from membrane trafficking events. I also appreciated the balanced tone of the text, which describes the strengths and weaknesses of each experiment and interpretation. I have a series of comments/concerns that could be addressed to strengthen the paper, as described below.

      1) I understand why cells were pre-grown in poor carbon media for these experiments, but it seems important to know how Cln3 and Whi5 levels change for cells pre-grown in rich media. Otherwise, each paper reporting different results for Cln3/Whi5 could be dismissed as using a unique set of growth conditions. Along these lines, it would be ideal for the authors to test Cln3/Whi5 levels in their western blot assay using the same strain background and media as the Schmoller paper. It would be very interesting if the inhibitor-dilution model were observed under these conditions, whereas alternative mechanisms like Cln3 accumulation were observed under other conditions.

      We attempted to grow cells in YPD, isolate small unbudded cells, and then release the cells back into YPD. However, we found that it was not possible to isolate a uniform population of small unbudded cells under these conditions. The problem is that very little growth occurs in G1 phase in YPD so that newly born cells are nearly the same size as mother cells (PMID: 28939614). This, combined with the normal variation in cell size observed in wild type yeast, means that elutriation yields a mix of unbudded and budded cells. Others have faced the same problem (PMID: 31685990, 10728640). The fact that so little growth occurs in G1 phase in YPD is an additional argument against the idea that dilution of Whi5 plays a substantial and general role in cell size control.

      As an alternative, we grew cells in complete synthetic medium (CSM) containing 2% glucose. Under these conditions, cells grow more slowly and are smaller because CSM is limiting for nutrients other than glucose. We isolated small unbudded cells and released them into the same medium so that there would not be shift in carbon source. We found that Cln3 levels increased 3-fold, while Whi5 levels did no change substantially, similar to the effects observed in YP medium containing poor carbon. These data are shown in a new figure (Figure 1 – figure supplement 2). In addition, we have included new text to highlight these issues and how they can influence interpretation of the results.

      We agree that it could be interesting to see how Cln3 and Whi5 behave in the mutant background and media conditions used by Schmoller et al. However, we were concerned that any behavior observed only in the bck2∆ background would say more about the effects of bck2∆ on accumulation of Whi5/Cln3 than it would about how cell size control works in wild type cells. Therefore, to limit the number of time-intensive elutriation experiments that we needed to complete the manuscript we would prefer to leave this experiment for others to complete if they are interested.

      2) The authors over-express WHI5 to test the inhibitor-dilution. Their results dovetail with a recent study from the Murray lab (Barber et al., PNAS) suggesting that cells are not very sensitive to Whi5 levels. However, one can envision mechanisms (e.g. PTMs) that inhibit Whi5 molecules when expressed beyond their physiological concentration. Instead, it would be interesting to know what happens in WHI5/whi5 heterozygous diploid mutants that cut Whi5 levels in half. Perhaps this experiment exists in the literature, but it would be an ideal setting for the authors to perturb the inhibitor-activator ratio, and test Cln3/Whi5 protein levels along with cell size in synchronized cultures.

      We were not able to find an analysis of the size of WHI5/whi5∆ cells in the published literature. We carried out the analysis and the data are shown in a new figure panel (Figure 3C). The effect is small – deletion of one copy of WHI5 in a diploid strain caused only a 0.9% decrease in median cell size. These data nicely complement the data showing little effect of 2xWHI5 on cell size. We were surprised that we did not think to do this simple experiment, and we were also surprised that we couldn’t find it in the literature. We thank the reviewer for suggesting the experiment. Since the heterozygous WHI5/whi5∆ cells showed minimal size defects, we have not elutriated the strain to test for changes in the Cln3/Whi5 ratio.

      3) I found the result in Figure 5E very correlative and hard to interpret. For example, Ypk1 phosphorylation is lost at 2.5 min, but Cln3 levels seem unaffected at this timepoint and the next (?). I would suggest softening the (already soft) tone of explaining these results. In general, the connection between ceramide synthesis and Cln3 levels remains quite unclear to me.

      We agree that our interpretation of the data in Figure 6E was confusing in the original version. Part of the confusion may arise from a lack of clarity in our writing and in the literature about the different phosphorylation inputs into Ypk1/2. The literature suggests that changes in the electrophoretic mobility of Ypk1 could be due largely to the Fpk1/2 kinases. TORC2 also influences Ypk1/2 phosphorylation, as detected by a phosphospecific antibody, but it remains unclear whether TORC2 also influences the electrophoretic mobility of Ypk1/2. The data suggest that the phosphorylation of Ypk1/2 that can be detected via electrophoretic mobility shifts is correlated with Cln3 levels, while TORC2-dependent phosphorylation with a phosphospecific antibody is not well correlated with Cln3 levels. We have edited the manuscript to make this more clear and to clarify what can and cannot be concluded from the data.

      4) The text would need to describe a potential role for protein localization in this pathway. All the results come from cell extracts, whereas local protein concentration in the nucleus could be changing and impact the pathway.

      The last three paragraphs of the Discussion include a discussion of potential roles for protein localization in the context of data from our work and previous studies that point to a potential role for localization of Ypk1/2 and Cln3 to the endoplasmic reticulum. In addition, we added the following sentence to the Results section to highlight potential localization issues: "Population level analysis of Cln3 and Whi5 protein levels by western blotting could miss changes in Whi5 or Cln3 concentration driven by changes in localization to specific subcellular compartments.”

    1. Author Response:

      Reviewer #2:

      In this study, the authors develop a novel method, called MCGA, extending from their previous gene-based methods, to detect gene-trait association removing redundant signal. They further leverage expression QTL into their model to improve the resolution of gene-trait association. The overall structure is clear, and data is presented well. I am concerned about the simulation methods, and would like the authors to present some clarifications.

      1) When comparing MCGA-eQTL and MCGA-sQTL, the authors simulate a single isoform-trait association, and the simulated gene expression is averaged among isoforms, which is kind of unfair for MCGA-eQTL model. Hormozdiari et al reveal that sQTL contributes few to traits after conditioning on eQTL (Hormozdiari et al., 2018, doi: 10.1038/s41588-018-0148-2). I would suggest to simulating a case that gene-trait association is mediated by overall expression, instead of a single isoform (transcript);

      We thank Reviewer #2 overall for the numerous insightful and helpful suggestions and comments. Thanks for pointing out this problem! We agree with the reviewer that the gene-trait association can be mediated by the overall expression instead of a single isoform. However, we think that, mathematically, the two scenarios are equivalent. We also added a scenario in which gene-trait association is mediated by the overall expression of multiple susceptibility isoforms, and its power is similar to the scenario of single isoform-trait association (see Table 1 in the revised manuscript). In the real data analysis, we did observe that MCGA based on the isoform-level eQTLs detected more significant genes than that based on the gene-level eQTLs. Besides, we noticed that the sQTL (splicing QTL) in Hormozdiari et al. is different from the isoform-level eQTL used in our manuscript.

      2) When comparing MCGA-eQTL and MCGA-sQTL, only power is considered. The authors should include the analysis to demonstrate the performance in control for false positive;

      We thank the reviewer for this comment and suggestion. In the revised manuscript, we reported the results for controlling the false positive. Please refer to Essential Revisions point 2 (see line 261-262 in the revised manuscript).

      3) When choosing a favorable exponent value c (1.432 chosen in the study), the authors found that the c value is robust to trait type, sample size or variant size, but the authors didn't explain what factors affect the choosing of c. Considering the potential application of MCGA method in other studies, the authors should explain what factor affects c value, and provide the guidance how to choose an optimal c;

      We thank the reviewer for this comment and suggestion. Please refer to Question A and B of Essential Revisions point 3.

      A: "Motived from the boundary of chi-square correlation, we adopted simulation studies to empirically choose c for controlling the type I error of the effective chi-square test. Besides the correlation of chi-square statistics, the choosing of c for the effective chi-square test may also be affected by the approximated non-negative solutions. However, the correlation of chi-square statistics is the major factor. Our simulation showed that the derived boundary and influence trend of LD on chi-square statistics were also applicable to the effective chi-square test. In the revised manuscript, we showed that the correlation of chi-square statistics is affected by the non-centrality parameter of chi-square statistics (see lines 640-655 in the revised manuscript)."

      B: "As the optimal c for controlling the type I error of the effective chi-square test would be affected by the non-centrality parameter of chi-square statistics which are generally unknown in practice, we have to resort to a grid search algorithm to explore an empirically optimal c. In our last manuscript, we mixed the methods of choosing optimal c with the introduction of new effective chi-squared statistics. We wrote a new subsection in Materials and Methods to describe the procedure of choosing the optimal c in the revised manuscript (see lines 610-628 in the revised manuscript)."

      4) The mediation analysis result in Yao et al. estimates that 11% of trait heritability is mediated by gene expression (Yao et al., 2020, doi: 10.1038/s41588-020-0625-2), while in simulation section of this study, 100% of trait heritability is mediated by gene expression. Simulations mimicking real scenarios should be used;

      We thank the reviewer for this comment and suggestion and apologize for the confusion here. To our knowledge, the estimation by Yao et al. was for the entire genome. Note that many contributing variants of a trait may be far away from gene regions and beyond the scope of our approach. It is possible that some genes may have larger trait heritability (>11%) mediated by gene expression. Certainly, we agree with the reviewer that it is also necessary to mimic the scenario in which the gene expression mediates part of trait heritability. In the revised manuscript, we also added the scenario that part of trait heritability is mediated by the gene expression (see Table 1 in the revised manuscript). As expected, when the majority is mediated by other factors (except the gene expression), using all variants could be more powerful than only using eQTLs (see lines 247-279 in the revised manuscript).

      5) It is important to choose a background gene set when conducting GO enrichment analysis. It is not clear what kind of genes are used as control when evaluating significance;

      We thank the reviewer for this comment and apologize for the confusion here. We used the g:Profiler, a web server for functional enrichment analysis, to perform GO enrichment analyses. The conventional GO enrichment analysis took all annotated human protein-coding genes as a background in the present study (see lines 739-743 in the revised manuscript).

      6) GTEx v8 contains samples from diverse populations, and it is crucial to handle the issue of population structure. Based on the description on https://pmg-lab-docs.readthedocs.io/en/latest/KGGSEE_doc/KGGSEE.html#id18, it seems that eQTL/isoQTL were detected ignoring population structure. The authors should explain why they applied a pipeline like that, and show that their conclusion wouldn't be affected by the choice.

      We thank the reviewer for this comment. Indeed, in the original manuscript, we estimated the gene-level and isoform-level eQTLs without considering the population structure in GTEx v8. One reason is that though GTEx v8 contains samples from diverse populations, the majority (~85%) of the subjects are Europeans. Another reason is that the article of the GTEx consortium (https://www.science.org/doi/abs/10.1126/science.aaz1776) pointed that only 178 population-biased cis-eQTLs (pb-eQTLs) for 141 unique eGenes (FDR ≤ 25%) were identified across 31 tissues, which suggested that pb-eQTLs are hard to find at current sample sizes.

      In the revised manuscript, to avoid the potential population structure issues, we only used the expression profiles and genotype data of the Europeans for the eQTLs identification (see lines 788-801 in the revised manuscript).

      Reviewer #3:

      The manuscript, "MCGA: a multi-strategy conditional gene-based association framework integrating with isoform-level expression profiles reveals new susceptible and druggable candidate genes of schizophrenia", describes an approach to conduct gene-level association testing in GWAS data with integration of gene expression data. The authors have conducted comprehensive simulation studies for main modules involved in this framework, demonstrating the advantages of the MCGA strategy compared to established similar work. The method has also been applied to the analysis of schizophrenia GWAS, with several interesting discoveries. All methods proposed are implemented in the KGGSEE package, a command tool written in Java with good documentation, data resource and examples for the type of analysis proposed in this work.

      Overall, the framework is solid and the analyses performed are thorough. In particular, the simulation study and real data demonstration of advantages of isoQTL over conventional eQTL is novel and interesting. With the user friendly software available, I can envisage that MCGA will receive interest from the community and be adopted to many projects.

      My major reservation on the methods is the component using conditional analysis to identify gene specific signals. Even though the MCGA framework is as solid as the methods it is based on, alternative methods are available for gene-level association analysis that takes into consideration of contribution from multiple SNPs and the LD without having to rely on conditional analysis. For example, fine-mapping approach such as SuSiE (https://github.com/stephenslab/susieR) uses summary statistics and LD, and can produces gene-level evidence of association in terms of Bayes Factor, when a gene region is analyzed. Such an approach does not have a potential type I error issue, is efficient enough to analyze multiple genes in LD with each other. Most importantly it provides inferences directly for multiple genes accounting for LD, without having to rely on conditional analysis. Conditional analysis, as a greedy algorithm, suffers an obvious limitation: suppose genes A and B are two causal genes in weak LD with each other. A non-causal gene C physically in between A and B are correlated with both A and B. Then C may have a stronger marginal signal than either A or B. A conditional analysis may identify C, and conditional on C, association signals of the true causal genes A and B will become weaker. I therefore am not convinced that a conditional analysis such as ECS is the best approach on which MCGA should be based.

      We thank Reviewer #3 overall for the numerous insightful and helpful suggestions. We are happy that the reviewer found that our work will receive interest from the community and be adopted to many projects. To the best of our knowledge, MCGA had different application scenarios from SuSiE. The former worked with summary statistics, while the latter can only perform fine-mapping analysis with individual-level genotypes and phenotypes. Besides, MCGA can also be suitable for the three-gene case supposed by the reviewer. For example, if A and B are two causal genes, they may have larger selective expression scores than gene C in the phenotype-associated tissue. In the conditional analysis, A and B will enter the conditional procedure prior to gene C, which will make gene C not to be significant when conditioning on gene A and B.

    1. continue to become measurably safer and less violent, on average, just as they have over the last twenty millennia, according Stephen Pinker, Better Angels of Our Nature, (2010). We might even be able to predict, with with good models, that they “will” become measurably safer and less violent, under the right circumstances.

      Those 'circumstances' may be associated with demographic and governance (Think Like a Futurist.) What I see is the disparity between haves and have nots growing and eventually the haves nots will increase lawlessness out of desperation, childhood trauma, generational domestic violence, mental illness, and many other myriad of factors. The haves will no longer want to associate with the have nots and legislatively create a caste system. Even the have nots will form enclaves of safe places. I'm going dystopian here...BUT that's a silver lining, too. It's taxing being a have. Have nots have the opportunity to be closer in tune with natural forces and nature. Simply not wanting what the haves have relieves the burden of capitalism. Now i'm going Buddhist....

    1. Lean Startup loop, we want to run our Do loops fast and loose, and get them faster than environmental change, or our competitors and collaborators. At other times, as with W. Edwards Deming’s Quality loop, we want to run them slowly and carefully.

      There is no single way of being effective. Rather one must discern when a fast loop or a slow loop is necessary. If you act fast when a slow loop is required, you may outpace yourself and waste precious resources (touching upon Think Like a Futurist.)

    1. Author Response:

      Reviewer #1 (Public Review):

      This study is well-written and well-presented. The conclusions are clear and robustly supported by the data. The figures provide useful visualizations for the major findings. Virophage are an important and underappreciated component of global viral diversity, and they likely play important roles in eukaryotic genome evolution; this work is therefore quite timely. Relatively few studies focus on virophage or giant viruses compared to other viral lineages, so studies like this are highly valuable.

      Strengths of this work include the high quality of the reference genomes, which were constructed using both short-read and long-read sequencing, as well as the diverse locations and isolation times of the host genomes.

      We thank the reviewer for his encouraging and constructive comments!

      I found no major weaknesses in this study. One minor issue is that the details of how EMALEs were delineated and initially detected seem a bit unclear to me. Based on my reading I am curious if some divergent or degraded EMALEs could have been missed. This may be important for assessing the consequences of possible retrotransposition-mediated EMALE inactivation.

      Thank you for pointing this out. We added two sections to Materials and Methods called “Detection and annotation of EMALEs” and “Detection of Ngaro retrotransposons” where we describe the procedure in detail.

      Based on our approach of visually screening the entire genome assemblies for GC anomalies, combined with blast searches of Cafeteria genomes using as input manually annotated EMALEs as well as databases of all available virophage sequences, we are quite confident that we have not missed any obvious virophage genomes. We would only have missed putative virophage sequences if their GC-contents were similar to that of the host (~70% GC) and if these sequences bore no detectable similarity to known virophage genes/proteins.

      In contrast, our sequencing and assembly strategy probably did not result in a complete account of all EMALEs in these host genomes, as is evident from the large number of partially assembled EMALEs. However, partial does not equal degraded, but simply means that contig assembly stopped somewhere within the EMALE, resulting in an artificially truncated sequence. We therefore do not think that our approach introduced any relevant bias towards addressing the question whether retrotransposon insertion may lead to EMALE inactivation.

      These points are now included in the discussion.

    1. This review reflects comments and contributions by Ankita Jha, Zara Weinberg, Julia Grzymkowski, Julien Berro, Karen Lange, Sónia Gomes Pereira, Arthur Molines, Jacob Herman, and Manoj Yadav. Review synthesized by Jacob Herman.

      The work by Joseph Varberg and colleagues uses super resolution microscopy to better characterize the non-random distribution of nuclear pore complexes within the nuclei of the fission yeast Schizosaccharomyces pombe. This work also confirms findings in other organisms that nuclear pore complexes exist in multiple compositions. In addition to better documenting this phenomenon, this work begins to characterize the mechanisms by which nuclear pore position is regulated. Specifically, the authors show that clustering centromeres at the spindle pole body excludes nuclear pore complexes from the spindle pole body, and when these two complexes are forcibly dimerized mitotic defects result in decreased fitness.

      The commenters were overall quite impressed with the imaging technique. The major conclusions and message of the preprint were generally well received, the comments or questions below relate to very specific text and experiments.

      A few key themes mentioned in specific comments were:

      • A desire for more consistent statistical analysis of data.
      • Suggestions for additional data for some statements or toning down of the claims. NPC clustering is commonly discussed but there were questions as to how this phenotype was being measured.

      Specific comments

      Introduction

      “perhaps explaining links between changes in NPC density and cancer” - The statement could note whether the correlation between NPC density and cancer is positive or negative.

      “for example, emerin is enriched at pore-free regions of the NE in cultured cells (44). In budding yeast, NPC density is increased in the region of the NE near the spindle pole body (SPB),” - Does the SPB contain LEM domain proteins or is this a different possible mechanism for the non-random regulation of NPC density?

      “Using S. pombe as a model system”- Why not S. cerevisiae, which is discussed earlier to have significant prior art in this regard? I'm sure there is a good reason, I think it could be outlined a bit more in the intro.

      “We quantify NPC number under a range of conditions” - It would be useful to mention briefly at this point what types of conditions this refers to.

      “Additionally, NPCs are excluded from the NE region surrounding the SPBs by Lem2 and other factors.” - Could the authors clarify if it is also something that is conserved or it is a new finding?

      Results

      Subheader “3D-SIM image analysis pipeline for NPC quantitation” - Worth mentioning the conclusion that NPC density is independent of cell cycle stage since that is the major conclusion from this section.

      “This approach provides a roughly two-fold increase in resolution as compared to conventional light microscopy” (Figure 1A) - For those who have never imaged NPCs it would be really informative to see a confocal or wide field image to better understand how SIM imaging made this project possible.

      Figure 1 legend “E) Mean number of NPCs, nuclear surface area and NPC density measurements from four independent replicates. Significant differences (*) determined using Wilcoxon rank sum tests. ns, not significant.”

      • Each one of the coloured dots on the graph appear to represent the mean NPC number for each replicate. If so, then this information should be added to the figure legend, as it is not immediately obvious for a broader audience. If not, please clarify what each dot represents. Same in Figure S1E.
      • Were these tests conducted pairwise? Are the reported p values corrected for multiple hypothesis testing? Having a dedicated "Statistics" section in the methods would be helpful for reporting this.

      “We observed that the number of NPCs also increases through interphase to maintain a constant NPC density (Table S1, Fig. 1D-E)”- There are no cell-cycle markers used to determine cell-cycle progression but only a visual assessment from cell size and nucleus shape. It would be good to show the three plots in figure 1E as scatter plots with the X axis being cell size for cells that are not yet in mitosis (1 nucleus). Then do a correlation analysis between cell size and NPC number, Surface Area, NPC density.

      “We observed occasional differences in NPC density between mother and daughter nuclei produced by the symmetric mitotic division in S. pombe, reminiscent of the elevated NPC density observed for daughter nuclei produced by the asymmetric mitosis in S. cerevisiae”

      • It is not clear what the authors mean by "occasional", is it 1%, 5%,10%? It would be better to replace this with a specific number/% of events. Additionally, the question arises as to what happens afterwards in the daughter cells, do they retain the NPC density asymmetry? Or do they eventually achieve similar densities?
      • Some of these points are addressed later on in the paragraph and Fig S1A. Some edits to this sentence should address this and provide clarity.

      “Despite the improved lateral resolution offered by SIM, clustering of NPCs and the comparatively reduced axial resolution likely leads to undercounting of NPCs using 3D-SIM” - This is useful context for Figure 1C - it would be worth mentioning in that section how the automated counting handles clustered NPCs, or placing the paragraph earlier with a short description of the methods.

      “Similarly, a constant NPC density was maintained when nuclear size was reduced in mitotic cells using a temperature-sensitive allele of Wee1 kinase (wee1.50)” - The NPC density does not change with temperature in the wee1 mutant but the NPC density in Fig 2B is lower than in a WT in Fig 1. The Nup tagged in the two figures are different, so this could be an explanation (as shown in Fig S1 E) but it could be good to make sure that the wee1 background does not have a different NPC density. I don't see a quantification of the NPC density using Nup37 in the WT elsewhere. In fact, Nup37 seems to be used only in wee1 background and in Sup Fig. 1 B.

      “The increase in NPC number was dependent on NE membrane expansion during arrest, as chemical inhibition of fatty acid synthesis by treatment with cerulenin blocked nuclear growth while NPC density was maintained (36°C + Cerulenin = 6.8 ± 1.5 NPCs/μm2, n=110) (Fig. 2C)” -The effect of the Cerulenin drug on a WT background is not shown, was that control experiment done? It would also be helpful to include statistics in this section.

      “Yeast lacking core components of the autophagy machinery (atg8Δ or atg1Δ) (75) that targets NPCs for degradation during nutrient deprivation do not show increased NPC density compared to wild-type cells, suggesting that autophagy is not used to remove NPCs to maintain NPC density (Fig. 2D)” - It is unclear that this experiment alone tells us much about the regulation of NPC density. If atg8 and atg1 are known to regulate NPC removal only in response to nutrient deprivation then consider performing these experiments under that condition or revisiting whether they fit here.

      “NPC density is maintained by a mechanism that restricts the assembly of new NPCs in the absence of increased available NE surface area.” - This conclusion is indicative of a mechanism where NPC assembly is maintained by restricting the assembly, however all the data above is indicative of the mechanism where NPC assembly is correlated with NE surface area, for increase there must be an additive mechanism and for a decrease in the NE, there must be a mechanism of removal. This suggests that the NE surface area regulation mechanism could be tied to NPC density. One way to clearly show that could be a correlation plot of NE surface area and NPCS density, color coded for all the different conditions tested.

      Figure 2- It appears as though no comparative statistical analysis was done with the quantitative data displayed in Figure 2, yet it is stated that e.g., "treatment with cerulenin blocked nuclear growth while NPC density was maintained" or "yeast lacking the autophagy machinery do not show increased NPC density". These conclusions would be strengthened if statistics were run on the data similar to Figure 1.

      “NPC clustering is common phenotype in different cell types and in mutants defective in NPC assembly.” - Does this mean that NPC clustering is higher in mutants defective in NPC assembly? Would suggest including references for this. Also, this paragraph needs an introduction to why NPC clustering matters? Does it have any connection with the NPC distribution?

      “3D-SIM images revealed the presence of multiple smaller clusters distributed throughout the NE (Fig. 3A).” - It is unclear (also not mentioned in the Methods section) how clusters are identified. The images show rings but it is hard to tell how many clusters compose that ring structure. It will be beneficial to show how clusters are quantified. Can that be resolved with 3DSIM?

      “We frequently observed NPC clusters organized in a ring-like structure with diameters ranging from 250-300 nm (Fig. 3B)” - Is it possible to report what was the frequency of the ring structures in nup132-deleted and wt cells?

      “Clustering increased in aged nup132Δ cells grown on plates (Fig. 3C)”- The figure depicts the NPC ring like structure, does this mean that the ring increased or the clustering has increased. Does increase in clustering make the rings more continuous?

      “NPC clusters were frequently enriched in the anaphase bridge, along with excess membrane (Fig. 3E)” - Providing a quantification of NPC cluster enrichment in the anaphase bridge would be helpful.

      “Following completion of nuclear division, the resulting daughter nuclei had normal NE morphologies and NPC densities equivalent to wild-type nuclei (Fig. S2). This suggests that nem1Δ nuclei can remove excess NE membrane and NPCs during mitosis via the anaphase bridge.”

      • This implies that prior to mitosis nem1∆ cells have abnormal morphology and NPC densities but the latter is not measured.
      • The NPC density reported in Figure S2 for the WT and the Nem1 mutant are different from the NPC density reported for the WT in figure 1 and figure S1 yet it is done using the same tagged Nup, Nsp1. It would be helpful to have an explanation. If the NPC density is “a constant” in the WT it should not be different from one figure to another. If the nem1 mutant has a density of 4 NPC/micron^2 then it is different from the WT. Also, the NPC density in the nem1 mutant in Figure S2 seems almost bimodal. Increasing the number of nem1-delta cells analyzed could help identify if it is bimodal or if it is due to under-sampling.
      • For the nem1 mutant the clustering is not quantified.

      “In contrast, NPC clusters in nup132Δ nuclei coalesced into larger clusters that preferentially localized to the SPBs in mitosis (Fig. 3G)”

      • An overlay image could be included to support this statement.
      • Fig. 3F could be referenced here too because otherwise it is not referenced until the discussion; at which point it is used to reference the data that is referenced here as Fig. 3G.

      “We observed a clear reduction in NPC density over the nucleolus” - Is this referring to where the yellow and magenta staining meet in Fig. 4? It is not immediately obvious as to where "over the nucleolus" is in those slices. Can the regions that are being compared (NPC staining at NE vs. NPC staining over nucleolus) be highlighted/specified in some way so as to better understand the quantification method?

      Figure 4

      • 4C is gorgeous - really conveys the point well!
      • In this figure the authors at first show a 3D-SIM image, but perform the intensity analysis on the confocal slice. What is the reason for it? Analysis of the 3D-SIM data could provide more information on the characteristics (number, spatial distribution) of NPS density reduction.

      Figure 5

      • Very minor comment -- the scale bar is very hard to see in Fig 5A.
      • Statistics for Fig. 5B would strengthen the conclusion that the exclusion was cell-cycle independent.

      Figure 6

      • Figure 6D - Looking at the insets, the exclusion area in the lem2(delta)C-off appears to be the smallest one and closer to the exclusion area shown for the lem2(delta) in panel B. However, this is not represented in the quantification/results. I wonder if there is another image that would more closely represent the quantification outcome? Or if the insets might have been mislabeled, for instance lem2(delta)C-off could indeed represent the lem(delta)N-on and vice-versa?
      • Figure 6E - This is listed as F in the legend.

      “Tethering did not affect microtubule nucleation at the SPB, including the formation of cytoplasmic microtubules.” - Please provide evidence supporting this statement.

      Discussion

      “In nem1Δ mutants, both excess nuclear membrane material and NPCs are segregated into the anaphase bridge region during nuclear division” - This would benefit from some analysis - are there too many NPCs? Is it specifically the clustered NPCs? Currently the data supporting this is snapshots from a single movie.

      “The ability for 3D-SIM to resolve and quantify individual NPCs labelled with multiple fluorescent proteins at endogenous levels provides tools to begin to interrogate__ how altered NPC compositions may allow for functional specialization of NPC function at distinct regions of the NE.” - The high resolution images are really beautiful! Great job in showing the power of 3D-SIM to help answer these types of biological questions.

      Methods

      “Images were acquired overa6 μm volume with 0.3 μm z-spacing for 45 min at 2 min intervals.” - For dynamic measurements a 2-minute interval is big and it would be interesting to see a few time-series imaging with smaller intervals to capture the fast changes.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      We thank the reviewers for their constructive and helpful comments on our manuscript and are delighted to find their consensus that the manuscript represents an important contribution to the field. We provide a detailed response to specific points below. In addition, we propose to include new data showing that our method can be applied to experimentally infected lung tissue. Namely, we show highly sensitive detection of SARS-CoV-2 RNA in infected hamster lung section.

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      Reviewer #1 **Major comments:**

      The authors used approaches provided in FISH-quant (Mueller et al, Nat Methods 2013) and big-fish. However, these tools to analyze RNA aggregates were not designed and validated for such massive aggregations as observed by SARS-Cov-2. They were developed for cases such as transcription sites with much smaller aggregations, with a few tens to a hundred molecules. With a regular spot detection approach, usually a few thousand spots can be detected in a cell (e.g. King et al, J Virol 2018), but this depends also on the used microscope and the available cellular volume. Higher RNA concentrations cannot be resolved with a standard approach, because RNA spots start to overlap. Decomposing RNA aggregations can help but will not work reliably for the high RNA densities observed for SARS-Cov-2, especially at later infection time-points. The tools will then not provide accurate estimates anymore. To my knowledge, there is currently not accurate quantification method for such massive RNA levels in smFISH. What has been done in the past, is using cellular intensity as an approximation and perform calibrations with cells having lower and thus still resolvable RNA counts (Raj et al., PLO Biology; https://doi.org/10.1371/journal.pbio.0040309.sg003). The authors proposed three expression regimes (partially resistant, permissive, and super permissive). My concerns here apply mainly to the category super-permissive, where an accurate estimation can't be performed. Here a more cautious quantification should be applied. __To a lesser extent, this will also apply to some of quantifications of gRNAs per factory, with counts exceeding 100s of molecules. As mentioned above, this does not affect any of the conclusions, but would reflect more accurately what kind of reliable information can be drawn from such experiments.__

      We agree with the reviewer that approaches like FISH-quant and Big-FISH cannot reliably quantify RNA spots with high spatial density such as our examples of “super-permissive” cells. Single molecule quantitation of such cases is likely to underestimate RNA expression as noted by us and King et al 2018 (doi: 10.1128/JVI.02241-17). Therefore, we integrated the combined smFISH signal intensity within entire cellular volumes and compared to the median intensity of single molecules in cells with lower infection density. We will (i) revise the methods and results sections to explain more carefully and explicitly the quantification of RNA in super-permissive cells. (ii) Provide a calibration plot for the quantitation as previously reported (Raj et al 2006, doi: 10.1371/journal.pbio.0040309).

      We agree that high local RNA density has the potential to interfere with quantification of gRNAs within viral factories. We have used the “cluster.decomposition()” function of Big-FISH to quantify viral factories, which is conceptually similar to the “Integrated intensity” mode of FISH-quant. Applying this algorithm to non-super permissive cells allows us to use the mean intensity of a reference single-molecule spot to estimate the number of molecules in a cluster. We are confident such estimates are reliable in the majority of viral factories, which contain less than or equal to 200 single gRNA molecules. We will revise the methods section to clarify this method of analysis.

      Reviewer #1 __**Minor comments:**__

      1.Page 6; the authors state that "smFISH identifies ... cellular distribution .... within ER-like membranous structures". However, the authors didn't directly show such a localization, could they provide an experiment with an ER stain?

      This text was based on previous light microscopy and EM studies that reported SARS-CoV-2 RNA in ER-derived membranes (termed Double Membrane Vesicles - DMVs) or co-localisation of anti-dsRNA (J2) with ER-markers (Cortese et al 2020; Hackstadt et al 202; Mendonca et al 2021)*. We propose to clarify the text on page 6 including the citation of these publications and to tone down our claim that the virus is located in ER-like membranous structures.

      *Cortese et al 2020, doi: 10.1016/j.chom.2020.11.003

      Hackstadt et al 2021, doi: 10.3390/v13091798

      Mendonca et al 2021, doi: 10.1038/s41467-021-24887-y

      2.It might be worthwhile pointing out that the probe-sets can be used in different host organisms (Vero - African green monkey; human cell lines).

      We propose to revise the text to emphasise more clearly the applicability of SARS-CoV-2 probes for the study of many different host organisms.

      3.I really liked the experiment, where the authors showed absence of signal when infecting with another virus & elegant control with the J2 AB. Maybe the authors could explain more clearly that the used a different coronavirus & that based on their sequence alignment no/little signal would be expected.

      Thank you for this supportive comment. We plan to follow the reviewer’s suggestion and expand our explanation of the rationale of this experiment in the text.

      7.The experiment with the isolated virions shows nicely that the smFISH approach has single-virus sensitivity. Did the authors compare the intensity of these isolated virions with the signal in Fig 1B? This might be a question of personal taste, but to me, this section might actually fit better in the first paragraph of page 4/5, where the authors describe single virions in cells.

      Thank you for the interesting question. We have not performed a direct comparison of the spot intensities of intracellular genomic RNA molecules and those from the isolated virions, because isolated SARS-CoV-2 requires poly-L-lysine coating for the coverslip attachment while our infection strategy utilises cells growing on uncoated glass. Nonetheless, the isolated virion spot intensities follow a unimodal distribution, and their shape approximates to the point-spread function of the microscope. Since spots at 2 hpi are largely derived from non-replicative viral genomes and they are measured in the intracellular environment with the same background (autofluorescence), they are a better ‘single RNA molecule’ reference.

      We also thank the reviewer for suggesting rearranging the text section. To address this point we plan to move the relevant text to the second paragraph of the Results section.

      8.Page 6. The authors state "+ORF-N and +ORF-S single labelled spots, corresponding to sgRNAs, were more uniformly distributed throughout the cytoplasm than dual labelled gRNA". This is difficult to appreciate from the image. Is this something the authors could quantify, e.g. with the metrics proposed by Stueland et al, Scientific Reports 2019?

      To address this point, we plan to: (i) present an alternative image illustrating a clearer example of differential spatial localisation of gRNA and sgRNA, and (ii) perform quantification of spatial dispersion indices for gRNA and sgRNA using the suggested method for our revision.

      9.Page 6. The authors perform a FISH/IF experiment including a co-localization analysis, where a "limited overlap" with sgRNAs was observed. I was wondering if this overlap could actually be simply due to rather high density of the sgRNAs. Maybe a control analysis by slightly changing the RNA positions could provide insight here, and give a threshold for what's to be expected randomly at a given RNA density.

      The reviewer’s comment is correct, in that a high density of sgRNAs and nucleocapsid protein could lead to signal overlap due to chance. This is why we excluded “super-permissive” cells from this analysis. Our co-localisation data showed that gRNA spots had a bimodal nucleocapsid immunofluorescence intensity distribution (data not shown), suggesting nucleocapsid-associated and “free” gRNAs, providing a threshold for this analysis. Nevertheless, we agree with the reviewer that the analysis of randomly positioned transcripts of the same density would provide a valuable control. In our revised MS we will include: (i) a random distribution analysis comparing the overlap between sgRNA and nucleocapsid in the “Observed” and a “Randomised” simulation, and (ii) a plot showing a full distribution of co-localised nucleocapsid immunofluorescence intensity for both genomic and sub-genomic viral RNAs.

      10.I don't fully follow the argument about stability on page 8. The authors also see an increase in the RNA levels. Couldn't this increase compensate for loss of RNA due to degradation? Would it be possible to perform an experiment at a very high REMDESIVIR concentrations which would blocks transcription?

      Remdesivir is a nucleoside analogue that inhibits viral RNA polymerase activity. While this drug inhibits viral replication, the inhibition is incomplete and using higher concentrations results in cellular toxicity. At the present time there are no stronger polymerase inhibitors available, so these experiments are the best approximation possible to assess viral RNA stability. We propose to revise the text to discuss the limitations of Remdesivir for modelling RNA stability.

      12.How did the authors define/detect replication factories? I couldn't find information about this in the methods.

      This is a good point raised by both the reviewers. Please see [Reviewer 2 General comment #1] for our response.

      Reviewer #2 **General comments:**

      1.The authors' definition of viral factories, in part as foci with at least 4 gRNA molecules, comes across as arbitrary. Perhaps a clearer explanation of this cutoff would be helpful to the readers' understanding of this definition. Additionally, confirmation of the functionality of such factories by immunofluorescence with anti-RdRp, for example, in addition to identifying staining of gRNAs and (-) sense viral RNAs at each focus could provide valuable support to the authors' conclusions.

      We thank both reviewers for requesting further information on our explanation of viral factories. We defined viral factories as smFISH signals with spatially extended foci that exceed the size of the point spread function of the microscope and the intensity of a reference single molecule. We then filtered these candidate factories based on the radius of the signal foci with EM-measured radii of double-membrane vesicles and single-membrane vesicles formed by SARS-CoV-2 (150 nm pre-8hpi and 200 nm post-8hpi) (Cortese et al 2020; Mendoca et al 2021). Our terminology encompasses both replication and viral assembly sites. The threshold of 4 genomic RNA molecules was selected as a technical threshold to limit an over-estimation of viral factories at later timepoints. For our spinning-disk confocal imaging system, we found the threshold of 3-7 RNA molecules provided satisfactory results. We propose to revise both the Results and Methods sections to clarify our rationale for defining and quantifying viral factories.

      As the reviewer mentioned, we have shown a partial overlap of positive sense genomic RNAs with negative sense genomic RNAs (Figure 2D, S2C), suggesting these viral factories represent double membrane vesicles. The use of antibodies against the viral polymerase (nsp12) is also a possibility to detect replication centres. However, replication centres are not the only ‘viral factories’ as there are also double-membrane structures where viral particles assemble (Mendoca et al 2021) and they, in principle, lack negative sense RNA and replication machinery, so neither smFISH probes against the negative strand nor a nsp12 antibody will comprehensively detect viral factories. We appreciate the valuable suggestion, but the classification of viral factories into replication and assembly sites would be challenging due to reagent availability and is beyond the scope of this manuscript.

      2.The random distribution of super-permissive cells in each cell line was demonstrated early in the infection, primarily at 8 hpi. The authors do not show how this pattern changes over time (8, 10, 12, 16, 24 hpi, for example). Do clusters of super-permissive cells appear at later time points, or does the pattern of 'highly' infected cells remain random for each virus? Any strain-specific differences identified from such patterns may be important for understanding infection progression. Finally, the authors do acknowledge this point, but it cannot be overstated that these data were taken from cell culture systems that have limited similarities to the human respiratory epithelium. A better model for such studies might be primary cultured human bronchial epithelial cells, but of course, these cells are not as readily accessible as the cell lines used in this manuscript.

      We share the same view that the presence and the spatial distribution of “super-permissive” cells can provide unique insights into SARS-CoV-2 infection dynamics. Our findings suggest that even at 24 hours post infection (hpi), not all cells become “super-permissive” and the culture maintains a heterogenous population of “partially resistant”, “permissive” and “super-permissive” cells (Figure 3C, S3C-D). We agree with the reviewer that the spatial distribution of “super-permissive” cells at later timepoints is of interest. To address this point, we plan to: (i) analyse the spatial distribution of “super-permissive” cells at 24 hpi, and (ii) compare the distribution of “super-permissive” cells at 24 hpi between VIC and B.1.1.7 strains.

      We appreciate the comment on the limitations of the cell culture systems to the human respiratory tract. However, Calu-3 and A549-ACE2 lung epithelial cells have been used in many studies over the last year and we feel it is important to publish single cell quantitation with these models to enable comparison with the published literature. We believe our results provide valuable information on the intrinsic nature of host cell susceptibility to support viral replication. During the review of this manuscript, we applied our smFISH probes to detect SARS-CoV-2 RNA in infected Golden Syrian hamster lung sections, which show an uneven distribution of infected cells. While the identification and spatial characterisation of susceptible cell types in the lung are beyond the scope of this manuscript, we are excited to include this data in our revised paper to demonstrate the utility of this sensitive approach to track spatiotemporal viral infection dynamics.

      3.The difference in early replication kinetics between the VIC and B.1.1.7 strains is an exciting finding that may have implications for clinical outcomes and transmissibility of these viruses. However, the authors did not clearly demonstrate how these differences in RNA production correlate to infectious viral load released from these cells (in bulk) at each time point. An explanation of this omission would be helpful.

      We will provide data on the level of infectious virus secreted from VIC and B.1.1.7 infected cells at all time points in the revised paper.

      In my opinion, findings related to specific cell lines are of much less importance (and are much less biologically relevant) that identification of replicative differences among strains. Such differences could be used, in part, to aid prediction of the transmissibility of VOC, for example. I think this point gets a bit 'lost in the weeds' of the rest of the paper.

      To address this comment, we will revise text on the differential replication kinetics of the SARS-CoV-2 strains to make this more prominent in our paper.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. If no revisions have been carried out yet, please leave this section empty.

      Reviewer #1 __**Minor comments:**__

      4.I might have missed this, but they authors could also mention the positive control data about but Calu3 infected with SARS-COv2. One thing I was wondering: why did the authors use two different cell lines for this experiment?

      To address this point, we have added a sentence about a positive control visualising SARS-CoV-2 in Calu-3 cells using our probe set (page 5 – line 17).

      The experiments with HCoV-229E were done in Huh-7.5 cells because SARS-CoV-2 and HCoV-229E have distinct cell preferences. Using the J2 antibody we show that the levels of the dsRNA derived from viral replication are similar in the two cell lines and with the two viruses. Therefore, the lack of smFISH signal in HCoV-229E infected cells supports the high specificity of the probe set.

      5.Fig 1E. Would be nice to have the intensity scale for all time-points to permit a comparison of image intensities along the different time-points.

      6.Fig 3B. Would be important to have intensity scale bars to judge the signal intensities across the different time-points.

      The fluorescence intensity scale in Figure 1E is applicable to all timepoints, except for the lower panel at 24 hpi, which was intended to show wider dynamic contrast range. To address this point, we have provided intensity scales for all time-points studied in this figure and also Figure 3B.

      11.Fig 3C. maybe indicate the two groups with dashed lines.

      We have added a dashed line at the 102 mark in Figure 3C to visually differentiate “partially resistant” and “permissive” cells.

      4. Description of analyses that authors prefer not to carry out

      Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.

    1. Reviewer #3 (Public Review): 

      The paper contains a substantial amount of novel experimental work, the experiments appear well done, and the analysis of the data makes sense. Raw data and analysis scripts have been made fully available. 

      I have two specific comments: 

      - While the paper talks extensively about deep mutational scanning, I don't think this is a deep mutational scanning study. In deep mutational scanning, we usually make every possible single-point mutation in a protein. This is not what was done here, as far as I can tell. 

      - For the analysis of epistasis vs distance (Fig 4d, e, f), it would be better to look at side-chain distances rather than C_alpha distances. In covariation analyses, it can be seen that C_alpha distances are not a good predictor of pairwise interactions. Similar patterns may be observable here. 

      See e.g.: A. J. Hockenberry, C. O. Wilke (2019). Evolutionary couplings detect side-chain interactions. PeerJ 7:e7280.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We want to thank all three reviewers for their positive and constructive comments and suggestions for improvement. We have now thoroughly revised the manuscript including new analysis, extra figures, and new material in the wiki. The manuscript has significantly improved because of the reviewers input. Detailed responses to questions and comments are given below.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Lange et al. have developed an automatic feeding system for zebrafish facilities. The system is open-source and relatively easy to implement. The authors propose to systems, one that delivers the same amount of food for each aquarium (ZAF) and a second (ZAF+) that can adjust the amount of delivered food to each aquarium. The authors show no difference in fish weight, spawning and water quality, when fed using the automatic system or manually.

      In my opinion, the ZAF and ZAF+ are an excellent first approach to solve the complex problem of automatizing feeding in fish facilities. So far, only one company offers this option which is extremely expensive and demands a lot of maintenance.

      The manuscript is very well written and easy to follow. The supplementary material is very well detailed. It is clear that the authors intended to facilitate the implementation of the ZAF by potential users.

      We appreciate the supportive comments from Reviewer 1 and address all comments below:

      I just have a few comments regarding the system:

      1) The authors do not indicate how the system is cleaned. the system drains itself, but will any deposits of food remain in the tubes ? Why is the system not flushed with clear water after each feeding? do the tubes get clogged ?

      We agree that the cleaning process was not clearly explained in the manuscript. We added clear sentences in ‘Box 1’ to describe the first cleaning step (see text and figure). Indeed, after each feeding we flush water and then air into the tubes. Moreover, we explain in ‘Box 2’ that we have a second level of cleaning in the form of a special cleaning program that is run at least once a day with no food distribution (i.e same program as used for feeding but without actual food mixed, we flush lots of clean water and then air in the system). Finally, in the discussion we clarify the different cleaning steps by adding extra explanations in the first paragraph.

      All these procedures and programs are very effective in preventing system clogging and in reducing the accumulation of debris and algae. After more than 19 months of ZAF and ZAF+ feeding in our facility we never experienced any tube clogging.

      2) How long the system was tested for?

      ZAF has run in the facility for 9 months and ZAF+ for 10 months since September. We added a sentence about the testing time in the discussion. We never experienced any major problems, only a few minor malfunctions, reported in the new troubleshooting table added to the wiki (suggested by the reviewer 2).

      3) The ZAFs were used to feed 16 aquariums. For such a small rack, manually feeding takes less than 5 min. The authors should highlight that, at least for such small systems, the ZAFs will be especially very useful for feeding during weekends and holidays. Still, adding 16 commercially available small automatic feeders to each aquarium, could be simpler to implement.

      As noticed by the reviewer, ZAFs are very useful when staff are not present (week end, vacation, etc..). To emphasize on this particular point we added a sentence in the discussion's first paragraph. The small automatic feeders available commercially are usually very difficult to attach to zebrafish facilities . Indeed they can’t adapt to conventional lab aquatic facility racks because they are designed for pet aquariums. They also have less features compared to the ZAFs (difficult to adapt the food quantity, more food waste, cumbersome...). Additionally, by multiplying the number of devices (you need one small feeder per tank), one increases the risk of possible malfunction as well as the maintenance time required for food filling, cleaning etc...

      Thus, usage of small automatic feeders in laboratory aquatic housing racks is complex to adapt, a source of feeding error, is more cumbersome, and potentially more time consuming etc… They are simply not designed for professional aquaculture systems. Whereas ZAFs can be easily adapted to all the commercially available aquatic facilities. The fact that ZAFs simply ‘interfaces’ via tubes to fish facility racks makes them very versatile and unintrusive.

      4) How do authors envisage implementing the ZAFs in much larger facilities (from 100 to 1000 tanks) ? Implementing a specific ZAF for each rack containing ~20 tanks may not be realistic.

      Indeed building multiple ZAFs will be complex and resource consuming. Thus, we designed ZAFs to be adaptable and modular, so one ZAF ( or ZAF+) can easily be scaled to handle bigger facilities. The supplementary information and the wiki describe all the steps required to build a ZAF for 16 tanks and a ZAF+ for 30 tanks and many tips to scale up these devices without major modifications (up to 80 tanks for ZAF no restrictions for ZAF+). Of course, we do think that for truly large facilities, there is probably a sweet spot that balances the number of individual devices and the per-device capability. Having a single device feeding 1000 tanks is probably not wise, perhaps 5 devices for 200 tanks each (ZAF+) would be the best. Please note that the hardware cost and complexity scales roughly linearly with the number of tanks, no surprises here. Moreover, in the case of ZAF+ it is possible to use splitters to feed even more tanks from the same line (ZAF+).

      We added pages in the ZAF/ZAF+ wiki, to help the users extend the feeding capacities of their desired ZAFs (see in the wiki “tips to scale up ZAF “- “tips to scale up ZAF+”). We also mentioned in the discussion the possibility of distributing food to more tanks with one device by increasing the outputs and referenced the wiki accordingly.

      Having said this, we did not primarily design ZAFs for super large fish facilities, instead we designed the ZAF systems to facilitate adoption of fish models by many small and medium sized labs. We hope that our system will lower the bar for labs with moderate ressources to get started with aquatic models, or labs that just want to ‘try’ a new aquatic model organism ‘on-the-side’.

      5) how the length of the tubes influences the efficiency of feeding ? For feeding many tanks with the same ZAF it is necessary that the tubes will be of the same length. In that case, the system will become very cumbersome. Longer tubes will probably need stronger pumps. What's the maximal length of tubes tested ? That will limit the number of aquariums a ZAF can feed.

      how the length of the tubes influences the efficiency of feeding ? For ZAF the size of the tubes is very important because its design assumes homogeneous food distribution. In contrast, ZAF+ distributes the entire amount of water and food mix to each tank sequentially, so the tube length is not an issue. To make sure that tube length or tube layout is not affecting feeding efficiency we evaluated the weight of fish coming from tanks housed on two different rows (top and bottom). This was not clearly explained in the methods section -- we changed the text to reflect that. Additionally, at the end of each ZAF+ run, the washing sequence runs a relatively large quantity of water to ensure that all food gets flushed out to the right tanks. We did not evaluate the precise amount of food delivered. However after each feeding and cleaning all tubes are empty (see last sentences of the Box 2).

      For feeding many tanks with the same ZAF it is necessary that the tubes will be of the same length. In that case, the system will become very cumbersome. This is a fair concern. However, with a good design and with the help of cable tie it is very easy to organise the tubing, and avoid ‘tube-hell’. We added a sentence to clarify the organisation in the wiki (see ZAF>Hardware>Tubing in wiki) .

      Longer tubes will probably need stronger pumps. What's the maximal length of tubes tested ? That will limit the number of aquariums a ZAF can feed. We never precisely measured that because the generic pumps we use are very powerful and their running time can be adjusted in the software by changing the constants in the code source (see troubleshooting new supplementary table). Therefore the length of tubes should not be a limiting factor. Even stronger pumps (more amps) can be readily sourced on Amazon if really needed -- although we doubt that this is necessary. Regarding the number of tanks that ZAF can feed, we simply recommend adding more pumps to increase its capacity (see previous comments or “tips to scale up ZAF” in the wiki).

      Despite these comments, this is an excellent first approach, and the fact that the authors made it open-source and open access, make the ZAFs a very important contribution to the community. I have no doubt that some fish facilities will implement it and the community will help to improve it. Thank you. We do think that the main benefit of an open source project is the community around it. We are currently collecting a growing list of interested labs and we are interested in organising an online workshop to discuss ZAF and ZAF+, with some talks, QAs, and more to help people getting started.

      Reviewer #1 (Significance (Required)):

      This is the first open-source open-access automatic feeding system ever published.

      It is the first but very important step to the automation of research fish facilities.

      **Referee Cross-commenting**

      I agree with all the other reviewers.

      We also have to take into account that the system is a first prototype and although not ideal, it is open source. This will allow other labs to develop and improve their own models based on the ZAF.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      **Summary**

      The manuscript proposes an open source automated feeder for zebrafish facilities, although it would be amenable to other species. Overall, the manuscript is clearly written and easy to understand, the wiki is well sourced and clear. The commitment to open source is commendable.

      I have some questions regarding the long-term sustainability of this setup, as well as some discrepancies in the methods. Finally, as this aims to be useful to people with no engineering/electronics competence, I feel that it is not yet at a level that is accessible enough.

      We are very pleased to see that the Reviewer appreciates our manuscript and our commitment to open access. We thanks the Reviewer for his comments, in particular the comments about accessibility, and address them bellow:

      **Major comments**

      It would be useful to have a centralized list of parts and components, which would make it easier for users to order all that is needed to assemble the ZAF or ZAF+, at the moment the information is distributed through the wiki as hyperlinks.

      Extremely important! This was clearly an oversight on our part. We agree that a table listing all the components would help for constructing ZAF and ZAF+. We have added two tables in the wiki, one for ZAF and another for ZAF+, with all the necessary parts and components required to build both devices, with articles number, supplier and cost in dollars. Thanks to the reviewer for this excellent suggestion.

      A troubleshooting guide for the common problems the team ran into (if any) would be useful for newcomers, even just as issues on the GitHub. The team may also consider some form of chat/forum/google group to allow discussions between users and experts.

      The reviewer raised an important point so we added to the ZAF wiki a troubleshooting guide to help users by listing the minor malfunctions that we observed. Additionally, users will be able to ask questions or report bugs on the ZAF GitHub using issues. Github issues will allow discussion and to track ideas and feedback within the ZAF user community. Finally, we just created a Gitter room: https://gitter.im/ZAF-Zebrafish-Automatic-Feeder to enable more interactive discussion.

      Did the author observe any algal or bacterial growth in the feeding tubes over the 60 days? Do they have an estimate on how long the tubes stay "clean" enough? The authors mention tube changing every 10 weeks, can they explain the rationale, and did they assess the bacterial/algal contamination over that time? Do the splitter panel and food mixing flask also need replacing regularly?

      After several weeks of usage we indeed observed algal and bacterial growth in the tubes. In order to report and justify the need to change the tubes, we made a new supplementary figure illustrating the tube cleanliness over time, mainly algal and bacterial (see Suppl. Fig 3). We realised that 12 weeks is actually the optimal tubing renewing period in our facility. Algal and bacterial growth depends on the facility environment characteristics such as light intensity, water and air temperature, as well as feeding frequency and therefore might be adapted to the users facility specs. The splitter tubing can be changed based on user observations; we now mention this in the ZAF tubing supplementary material and on the wiki.

      The authors mention that the tubing needs to be of similar length to ensure similar resistance and food distribution, did they compare the body weight of fish in racks at the top or at the bottom of their system? There are no overall differences, but maybe the bottom racks would received slightly more food? Furthermore, did they quantify the differences in food/water delivery as a function of length differences?

      The requirement for similar length is only necessary for ZAF because its accessible design assumes homogeneous distribution of the water-food mix through a passive splitter system which is susceptible to variable fluid resistance. In contrast, ZAF+ distributes the water-food mix one tank at a time -- ensuring that the correct amount of food is entirely flushed through any required tube length (the pumps are strong enough and we flush enough water). In the eventuality that the tube length is too long the user can adjust the pump running time by changing constants in the code (see troubleshooting table in the wiki and corresponding links).

      We thank the reviewer for suggesting to evaluate the fish weight on fish from two extremal heights. Although we did not explicitly report this in the first version of the manuscript, we had actually anticipated this potential issue and therefore we did collect data for ZAF and ZAF+ for tanks housed on the top and bottom rows. We added a clear description of the weighting process in the material and method, highlighting the housing condition of the tanks tested.

      Finally, after each feeding run the tubes have been fully flushed and are empty without food debris or pellets remaining, irrespective of their sizes. So we did not find it relevant to evaluate the precise amount of food effectively delivered as we control that already upstream.

      Methods fish weight: The methods mention different amounts of food than the wiki, the rationale in the wiki is also different from the 5% of body weight outlined in the methods (which then matches the food amount of the methods). Which is the correct amount?

      We thank the reviewer for noticing the inconsistency. The method numbers are the correct one so we changed the wiki, we made a mistake when editing the figures. We wrote some sections of the wiki early during the development of the hardware. We unfortunately forgot to correct the inconsistencies.

      The code is decently commented for scientific software with clear variable names, but I wonder how flexible it is if users cannot get access to the specific hardware (especially the pumps) used in ZAF/ZAF+? Can the authors briefly comment on this point?

      The pumps are just built from 12V motors, you can find a large variety of such pumps online (Amazon, etc…), we have ourselves tried several, but there is no need to have the exact same model. We added a note to the tubing section of the ZAF and ZAF+ about that.

      The only components that cannot be easily exchanged are the arduino and Raspberry PI, but that is not an issue as these are very easily sourced components.

      The wiki could use more pictures or, to borrow the Proust Madeleine allusion, schematics akin to LEGO with more intermediary steps clearly outlined. Some pictures are also a bit small/busy (such as 2D and 2E in the frame section, or the magnet pictures), they may benefit from cartoons/schematics to clarify what is done. Alternatively, videos/timelapses may help with better visualising the assembly.

      We appreciate the reviewer comments and added new pictures, schematic and extra legends in the wiki to help potential ZAFs builders. In the wiki for ZAF hardware we increased the size of all the pictures for all the different steps and added new legends to clarify the assembly. There are also now more pictures illustrating the construction steps (i.e in “frame”, “pumps and valve”) and we added a simple schematic for “servo and food container”. Picture sizes have been increased in “ZAF electronics” and added to the “Raspberry Pi and Servo Hat” section. We increased the picture sizes and added more legends to the ZAF+- Hardware “Pumps & Valve'. Moreover, we added more photos to the “tubing” section and the “ZAF+ Electronics” section.

      We agree that videos or gifs would have been great to visualize the assembly. Unfortunately, we did not record such videos during the construction. We created ZAF as an open source project and clearly hope to generate a community that will share assembly pro-tips and may be constructions videos on the github.

      Our institute is expanding on zebrafish research so we will build additional ZAFs and will use this opportunity to prepare nice videos to add to the wiki. We envision that the wiki will be improved over time with better material, some of it contributed, as well as perhaps newer and better versions of ZAF.

      The main question that would affect if this approach were taken up would be how reliable it is in the long run. Have the authors experienced any issue over the 2 months test? Is this system still being used currently? If so, could the authors update the water quality logs?

      The reviewer suggests that the key question is to see if using ZAFs all year long is possible. We can reply yes, it is actually possible! We have used ZAF for 9 months, and now ZAF+ for the past 10 months in our fish facility, with great success. We never experienced major malfunctions and the minor issues we encountered are reported in the troubleshooting table. Since ZAF and ZAF+ have been used daily for months with logs recorded every day we have updated the water logs quality to 3 months. We have been using the ZAFs in full autonomy for a total of 19 months, frankly invaluable.

      Getting a sense of how long it can run without problems, how much troubleshooting is involved per month would be very useful in answering those questions.

      Except manual cleaning and tube replacement, there is no other big maintenance on ZAF. Of course, the food reserve needs to be changed at least once per week. We listed the malfunctions in the troubleshooting guide in the wiki. In our facility ZAFs require an average of 1 hour of maintenance per month. And if any hardware part fails you can just immediately replace it because all the parts are cheap and easily replaceable. Actually, we recommend keeping spare parts of all the key components (pumps, valves, arduino, Raspberry Pi, tubes, ...).

      **Minor comments**

      • Main text page 3: Fig. Supp. 2 instead of Supp. Fig. 2. Furthermore, would the authors have similar data for the manual feeding? If so, it could be useful to add here for comparison (although that is not necessary if the data is unavailable).

      We changed the text but we don’t have data available for the water logs with manual feeding.

      Main text page 3: it would be useful to add how long it takes to change all the tubing after 10 weeks?

      This is really dependent on ZAF tubing and the fish facility, in our hand for about one hour. We mentioned it in the results section, ZAF paragraph.

      Methods fish weight: The phrasing as it stands make it unclear the same method was used for ZAF and ZAF+, the authors may consider to start with the description of the common weighting method, then the specifics of ZAF+.

      Thank you, we changed the text accordingly.

      Supp.Fig.1a: "Waste water drain pipe"

      Thank you, we changed the text accordingly.

      Acknowledgments: "...for their help..."

      Thank you, we changed the text accordingly.

      ZAF - Servo Hat connection: "to control the pumps"

      Thank you, we changed the text accordingly.

      ZAF - Installation: the dependencies should be listed as they are in ZAF+, or the two sections merged, unless the GUI is not functional (see below).

      Thank you, we now list the dependencies in the wiki.

      ZAF - How to use: there is no mention of the GUI, is it not yet implemented? If not, is the touch screen needed?

      The standard ZAF hardware is controlled by a very simple python-based program that works with a command line interface. Therefore to interact with the Raspberry Pi for installation and configuration we strongly recommend building ZAF with a screen, and the touch screen is an easy way to be able to quickly point and click in the absence of a mouse -- which can be cumbersome when no clean horizontal surfaces are available in a lab environment.

      ZAF+ - soldering: "A 12V power supply (at least 10A best 20A) provides power to the electronics, except the Raspberry Pi and the two Arduino Megas." It seems the sentence is incomplete, or at least I cannot make sense of it.

      Changed to “A 12V power supply (at least 10A, but ideally 20A) provides power to the electronics, except for the Raspberry Pi and the two Arduino Megas that are powered by the Raspberry Pi 5V GPIOs.”

      Reviewer #2 (Significance (Required)):

      This manuscript provides a significant technical advance to the zebrafish field. The proposed automated feeder would be a very useful option for smaller labs, to ensure the consistency of feeding, and to remove one of the routine aspects of fish husbandry.

      As the authors state, there is certainly interest in the zebrafish community [9,10] for automation of feeding. I am not aware of other DIY fully automated feeding system, commercial systems do exist, but are expensive.

      The manuscript, and proposed automated feeder, would certainly be of interest within the zebrafish community, as well as other researchers using aquatic models that can rely on dry food. How many in the community would embrace this method will depend on how confident they are in the long-term stability.

      I am neither electronics, nor husbandry expert. As such I am not qualified to comment on any long-term approach this may prove, if any, for fish health. My expertise lies in image and data analysis, as well as microscopy.

      **Referee Cross-commenting**

      I think the major points are shared by all reviewers, I think the other reviews are fair in their content and I have nothing specific to comment on.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      This technical report describes an open-source fully automated feeding system for husbandry of zebrafish (and potentially other aquatic organisms). It provides detailed instructions for assembling individual components into two different feeding systems of varying adaptability, as well as their operation. Links to relevant control software are also provided. The characterization of the systems' performance appears somewhat limited (e.g. only maintenance of adult fish over a period of 8 weeks and use of dry food is documented). These systems could be of use for husbandry in a large number of research labs, and, in

      addition, for automated reward delivery in large-scale associative conditioning assays.

      We thank the Reviewer for his encouraging comments and appreciate his helpful suggestions. We answer to the Reviewer comments bellow:

      **Major comments:**

      Providing food to large numbers of tanks in aquatic animal facilities in a regular fashion is a time- and resource-consuming process. Some automated feeding systems for large numbers of tanks are commercially available, but these feeder robots are expensive and are restricted to systems of specific vendors. Therefore, an adaptable automated system that can be assembled from off-the-shelf components is a very attractive option for many research labs to both save resources and standardize the feeding process.

      The instructions for assembly provided by the authors appear quite detailed and sufficient to allow non-experts the assembly and operation of the automated feeder systems. The design of the system appears appropriate for the task.

      While additional experiments are not required to support the claims of the article, I feel that it would be significantly improved by the provision of additional information. My suggestions in that regard include:

      Description of the washing procedure of the system (which solvents, how often, how long?). The authors mention that an exchange of the tubing is required every 10 weeks, but since the tubing transports liquid food mixture, it is easily conceivable that microbial growth will occur rapidly in the system without thorough hygiene / washing procedures. Also could the authors provide some information, which type of tubing material they are using (Silicone, Tygon etc.)?

      Description of the washing procedure of the system (which solvents, how often, how long?).

      We agree that the cleaning procedure must be clarified. So we added a more clear description of the process in the first paragraph of the discussion and clarified the explanation about cleaning in Box 1 and Box 2 (suggested also by the reviewer1). To summarise there are two levels of cleaning, the first one happens just after a food distribution program by flushing water and air in the system (Box1). Additionally at least once a day, we run an entire program without food, to rinse/clean the system (Box2). This last step is programmable using ZAFs software.

      The authors mention that an exchange of the tubing is required every 10 weeks, but since the tubing transports liquid food mixture, it is easily conceivable that microbial growth will occur rapidly in the system without thorough hygiene / washing procedures

      Following all reviewers' comments we added an extra supplementary figure justifying the need of changing the tubes every 12 weeks (updated based on our latest observations). We monitored the cleanliness (algal/microbial growth) of the tubes and realized that it becomes necessary to replace the tubes every 12 weeks (supp figure 3). Interestingly, we remarked that the microbial and algal growth depends on the facility specificities such as light intensity and temperature.

      Also could the authors provide some information, which type of tubing material they are using (Silicone, Tygon etc.)?

      For ZAF we used silicone based tubing then we changed to PVC based tubes for ZAF+ because they are cost effective and have similar specifications for our usage. We added a note about the tubing material in the wiki ZAF tubing and ZAF+ tubing.

      In a related point, I was left wondering how long the food is being mixed in the mixing flask before being applied to the animals? Too long mixing might lead to a loss of nutrients into the solution (through diffusion). Could the authors comment on that, please? Do the food pellets remain more or less integral so that the majority of delivered food is actually ingested by the fish?

      • In a related point, I was left wondering how long the food is being mixed in the mixing flask before being applied to the animals? Too long mixing might lead to a loss of nutrients into the solution (through diffusion). Could the authors comment on that, please? Very relevant point, indeed it is very important for the food to not be mixed too long in water to avoid pellet dissolution in water and loss of nutrients. The food manufacturer website mentioned: “duration of “wet” feeding should be kept short” (https://zebrafish.skrettingusa.com/pages/faq). Therefore we adapted our feeding program to keep the “wet” feeding extremely short. For ZAF and ZAF+, the software is designed to deliver the mix of food and water to tank(s) within 3 minutes at most. To clarify this, we added in the Box describing the feeding, a sentence : “Overall, they share many common features, like the quick distribution of food and water mix, to avoid pellet dissolution in water and loss of nutrients.”

      • Do the food pellets remain more or less integral so that the majority of delivered food is actually ingested by the fish? We manually evaluated the integrity of food pellets in the early phase of development, these parameters being difficult to quantify, we decided to record the fish weight as a readout of good food delivery and general effectiveness. However, we clearly understand the reviewer's remarks and therefore added to the manuscript a supplementary video that shows the distribution of the food pellets and their integrity once they reach the tanks.

      In yet another related point, I was left wondering, whether the authors observed any negative impact of feeder usage on water quality (besides pH and conductivity, which they report)? Especially, with regards to ammonia that might arise from the decomposition of uneaten food items?

      Ammonia toxicity is mentioned to induce clinical and microscopic changes that reduce growth and increase susceptibility to pathogens according to aquaculture textbooks as summarized here: https://zebrafish.org/wiki/health/disease_manual/water_quality_problems#ammonia_toxicity). However, we never experienced such abnormal phenotypes in our facility and our regular aquatic PCR health monitoring profiles have always been negative for pathogens. Additionally, high ammonia is influenced by husbandry conditions, such as important fish density or inappropriate water circulation, characteristics that are not present in our fish facility. Therefore we did not find relevant to test for ammonia levels.

      The authors only tested the feeder on adult fish, but discuss that it would easily be transferable to a system that is used for raising fish fry. In that context, could the authors comment, on whether the system of using water as the carrier for the dry food (after mixing) would work as well for the smaller pellets required in feeding fish fry (e.g. 75 or 100 um pellet size as compared to the 500 um pellet size they use)? With smaller pellets, break-down of the dry food during the mixing process seems to be an even larger problem, I could imagine.

      We appreciate the reviewer's comment about using different food pellets sizes, a very important point for ZAFs adoption beyond adult fish. During ZAFs testing we actually tested different food sizes (from 100uM pellets to 500uM) and did not observe differences in pellet distribution. Most of the industrial aquatic food pellets are oily and designed for automatic distribution (for large farming environments). Therefore they keep their integrity and are not easily broken. Besides, during food distribution, as mentioned previously, the duration of wet food (water and food mix) is relatively short, which helps maintain pellet integrity.

      **Minor comments:**

      (1) the average weight of animals is given as lying in the range of 5 to 6g. That seems very high. The "standard" weight range of adult zebrafish is more around 1g [see, for example: Clark, T. S., Pandolfo, L. M., Marshall, C. M., Mitra, A. K. & Schech, J. M. Body Condition Scoring for Adult Zebrafish (Danio rerio). j am assoc lab anim sci (2018)]. Could the authors comment on that discrepancy?

      Good observation by the reviewer. We did make a mistake during figure preparation and our legends were actually not reflecting the exact weight of the fish. The scale bars of the figures have been changed to reflect the real weight of the fish (below 1g). We thank the reviewer for noticing the mistakes.

      (2) The authors state that spawning success is not negatively affected by the automated feeding, and they quantify the number of successful crosses. Could the authors briefly confirm or state, that or whether the clutch size was also unaffected?

      We never precisely quantified the clutch size/quality but we are now using ZAFs for the feeding of our facility for 19months and never observed any problem with our clutch. Our lab is working on early development and crucially relies on clutch quality.

      (3) The manual feeding procedure / regime that is used to compare husbandry success against the automated feeding regime is not described in any detail. That seems important given the topic of the article.

      We agreed and added a brief description of the protocol in the Methods section (“Animal and husbandry”).

      (4) The authors cite two recent papers that describe semi-automatic feeding systems for zebrafish in the introduction. The authors might want to consider discussing some key differences between their system and these semi-automatic systems in the discussion.

      The two published semi-automatic feeding systems are completely different from the devices presented in our paper. They are also open access but they are devices that need to be manually operated by facility staff. In contrast, our solutions are fully automatic and do not require the human hand during operation. We mention these two solutions during our brief literature overview in the introduction. However, since these are in a different category, we did not judge it necessary to comment on them in the discussion.

      (5) What do the error bars in Fig. 1c signify (s.d., s.e.m.)? Please state in Figure legend.

      We thank the reviewer for their attention to details and explain in the figure that we mean standard error of the mean by s.e.m.

      (6) I do think that the system could be of particular interest to researchers that study learning and that use food rewards in automated associative conditioning experiments. While this might be obvious to researchers with such an interest, this aspect is not at all discussed in the paper. Mentioning it might further underscore the versatility of the feeder system.

      We agree with the reviewer that ZAF can be adapted to experimental conditions such as behavioral conditioning, nutritions and drug delivery. Any experiment requiring the automatic delivery of solid pellets or liquid can benefit from ZAF. We revised our text and mentioned it in the discussion.

      (7) A list of all required equipment with vendors and price estimates (e.g. in the Supplement) would make this paper an even more readily accessible resource.

      This is a very important point already suggested by another reviewer. We added two extra tables in the wiki with the necessary parts and components, listing models, references, and prices.

      Reviewer #3 (Significance (Required)):

      Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      This article signifies a purely technical advance in that it provides a characterization of an open-source, scalable automated feeder for aquatic facilities. As such, it presents a significant advance in the field of aquatic animal husbandry. In addition, this system could also be useful for automated large- or medium-scale associative conditioning paradigms, in which food rewards are given as positive reinforcers.

      Place the work in the context of the existing literature (provide references, where appropriate).

      The authors refer to previously published semi-automatic feeder systems. Regardless of the advantages or disadvantages of all these systems, the field will benefit from a broad(er) choice of automatic feeding systems that are described in sufficient detail to be easily assembled in the laboratory.

      State what audience might be interested in and influenced by the reported findings.

      This study is of interest for any research laboratory working with zebrafish or other aquatic model organisms. Thus, the audience for this article is very broad. Specific interest might also arise in researchers that are performing learning studies in zebrafish (see above).

      Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Zebrafish, neural circuits, sensory systems.

      **Referee Cross-commenting**

      Many of the major points are shared by all three reviewers. Beyond these shared points, I agree with the other reviews; they raise important questions. All reviews are fair, in my opinion.

    1. Author Response:

      Reviewer #1:

      Strengths

      This study is a technical and analytical tour de force. The evolution experiments with barcoded lineages involved an immense amount of work and clever design, and the scale of the data challenged the authors to develop new statistical summaries. The figures are clear and results easy to interpret, even outside the evolution-experiment bubble. While the essential findings are not especially surprising, the robustness enabled by this level of replication is appreciated.

      Weaknesses

      I'm not exactly sure what I learned. I'm biased to like this work and while I'm confident that if I studied these findings more I would learn more, it wasn't obvious. For example - I want to know more about the effects of ploidy on pleiotropy, and while there are some differences e.g in Figure 4A, I don't know what these PCs actually are saying. If particular phenotypes associate with PC's, it'd be helpful to "load" them on these axes.

      To more clearly show general trends and variation in pleiotropy, we have added a summary of the changes in fitness across all populations in Figure 2B and Figure 2– figure supplements 2–5. We have also expanded our consideration of these trends, including the effects of ploidy on pleiotropy. To supplement Figure 4, we have included the contribution of each assay environment to the principal components (Figure 4–figure supplement 5), as suggested.

      Also, do some treatments lead to faster or more complete diminishing returns than others, and does this influence pleiotropy?

      To compare changes in fitness across evolution environments and over time, we have computed the change in fitness for each population over the first 400 generations and the last 400 generations. This is plotted in Figure 2-figure supplement 6A. To assess the statistical significance of apparent diminishing returns, we compared the mean change in fitness over these time intervals using a t-test and provided the resulting p-values in Figure 2-figure supplement 6B. Overall, we see that different treatments lead to different extents of declining adaptability and note this in the Results. This declining adaptability may certainly influence pleiotropic outcomes, but unfortunately it is difficult to disentangle any potential such effects from other differences between environments (or assign any causality to correlations in the strengths of diminishing returns and differences in pleiotropy between replicates in the same environment), so we refrain from drawing any conclusions about this possibility.

      In total I think this manuscript can be improved by being presented / read by others, which is the job of peer review but here I think it's also to broaden its implications.

    1. It well may be. I do not think I would.

      Even though the poem starts with saying love is not all, it ends with love is all we need. I think this is interesting and relatable among teenagers that they want to be loved and feel love, but at the same time, they act as they don't really care about not being loved.

    2. I might be driven to sell your love for peace, Or trade the memory of this night for food. It well may be. I do not think I would.

      In these last few lines Millay sort of contradicts herself as even though she compares love in a way that is less valuable, and could be said unnecessary, she still acknowledges that it does have a value that she wouldn't trade. However I think it remains in saying that love is not all, as it is not a necessity but something she thinks shed rather have. Love is not all that we need but we think it is.

    1. Mattimore characterizes brainwalking as the most flexible of the seven ideation techniques, because it can be easily combined with other techniques. It’s also an ideal way to ensure that everyone in your group gets an opportunity to contribute ideas. Here’s how it works: The group first selects several aspects of the problem around which it wants to generate ideas. These become the creative prompts for the group to work with. The facilitator tapes several pieces of paper to a wall. Each member of the group gets a marker. Participants write their ideas on a paper and then rotate, adding their thoughts own original and ideas to the page as well as building upon those of their colleagues. This can also be done by having a group sit in a circle and have the papers passed one person to the right or left after several minutes of brainstorming. When each “pass” takes place, Mattimore points out, the facilitator can suggest different ideation techniques or triggers. This helps people who may not be able to think of any new ideas and may help them to see the ideas their colleagues have written in a new light. It also helps the team generate a wider diversity of ideas.

      I really like the idea. Just like the old saying goes: two heads are better than one. Triggered brainwalking can be very helpful. I think that is also why we have to talk other classmates about our wicked problem.

    1. For now if Zeus who thunders on high in evil intentiontoward these is destroying them utterly, sending aid to the Trojans,this is the way I would wish it, may it happen immediatelythat the Achaians be destroyed here forgotten and far fromArgos; but if they turn again and a backrush comes on usout of the ships, and we are driven against the deep ditch,then I think no longer could one man to carry a messageget clear to the city, once the Achaians have turned back upon us.Come then, do as I say, let us all be persuaded; let ustell our henchmen to check our horses here by the ditch, thenlet ourselves, all of us dismounted and armed in our war gear,follow Hektor in mass formation. As for the Achaians,they will not hold, if the bonds of death are fastened upon them.

      In this passage, Poulydamas is trying to convince Hector during a Trojan advance on Achaian positions that it is necessary for the horses to be left behind due to the presence of a large ditch with sharp stakes in front of the Achaian fortifications. Attacking well defended fortifications on foot is more dangerous than on a chariot due to a loss in mobility, therefore Poulydamas has to inspire confidence in Hector to act on his plan. A method that he uses to convince Hector in the passage is by claiming that the odds are in their favour. He mentions that Zeus is “sending aid to the Trojans”. Zeus is the most powerful God and therefore his support in a bloody conflict is a strong sign of success in war. In fact, throughout the entire poem, the support of the Gods always played an instrumental role in whichever side succeeds in battle. Therefore, Zeus’s support is a compelling reason to take such a military risk. Additionally, he tries to bolster Hector’s confidence by claiming that “the bonds of death are fastened upon them”. In this case, the bonds of death are fastened upon the Achaians by Zeus, and indirectly, by Achilles, who is the reason Zeus supports the Trojans. Finally, Poulydamous proposes to Hector that all of his men “follow Hector in mass formation”. This proposal reveals that Poulydamous has a lot of trust in his commander, enough to follow him into the heat of a battle, and contributes to Homer’s image in the poem as a brave, heroic warrior.

    1. Constrains block our thinking and idea generation. Naturally, we consider constraints as soon as an idea germinates,

      I am curious as to how you get people to overcome constraints in idea generation when the solution involves say students. I think some people may be hesitant to suspend reality enough to say "let's just let students do whatever they want, whenever they want in the hallways with no teacher supervision" when that type of thinking naturally would have people thinking "WHOOAA THAT'S A REALLY REALLY BAD IDEA".

      does this only only work in some instances? or does it matter who you put in the room or how you frame the question?

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1:

      Summary

      Copy number variations in the 1q21.1 loci, deletions and duplications, have been associated with neurodevelopmental disease. In particular, deletions of this locus result in a variety of neuronal phenotypes including microcephaly and schizophrenia in varying levels of severity. Duplications of the 1q21.1 locus are often associated with autism and/or macrocephaly.

      In this study Nomura et al. generated 1q21.1 deletion and duplication hESC lines to study the impact of these CNVs on neuronal development. They generated brain organoids and observed a bidirectional effect of this CNV on organoid size, with 1q21.1 deletion showing smaller brain organoids whereas, the 1q21.1 dup lines grew large than controls. This in line with observed micro and macrocephaly observed in patients. They further analyzed these organoids at the gene expression level using single cell RNAseq and performed some electrophysiological assessment on neurons from of dissociated organoids.

      This study is certainly of interest given the association of this loci with NDDs such as autism, epilepsy and schizophrenia. At this stage, the study is mainly a descriptive study, showing differences between the 1q21.1 del/dup versus controls but also between both the del/dup lines. There is no mechanistic insight provided. For example the 1q21.1 CNV encompasses several genes, of which some have already been linked to micro/macrocephaly (eg. NOTH2NL). More importantly, most of the conclusions drawn by the authors are based on a limited set of experiments/analysis which are not always carefully performed and/or presented. In general, the data presented are premature, therefore not supporting the claims/conclusion made by the author (eg title) This makes the overall impact of this study limited.

      As the reviewer pointed out, NOTCH2NL (both A and B) have been regarded as micro/macrocephaly-related genes (Fiddes et al., Cell, 2018; Suzuki et al., Cell, 2018). In this study, however, we focused on the distal region of 1q21.1 between BP3 and BP4, which contains neither NOTCH2NLA nor NOTCH2NLB, because the target site is thought to be the core region of clinical 1q21.1 microdeletion/microduplication syndrome (Mefford et al., NEJM., 2008; Brunetti-Pierri et al., Nat. Genet., 2008; Van Dijck et al., EJMG, 2015). Although both NOTCH2NLA and B are located outside of our target, these genes are important for human neocortical development and neurogenesis, so we cite these papers (Fiddes et al. and Suzuki et al.) and discuss them in the discussion of the revised manuscript.

      Main comments

      In general, the interpretation of the data is too premature:

      1. The title is not supported in any means by data

      As requested by the reviewer, we have corrected the title as “Modeling reciprocal CNVs of chromosomal 1q21.1 in cortical organoids reveals alterations in neurodevelopment”.

      1. Brain organoids size and development: In figure 2 the authors analyzed the development of the organoids. Based on the human phenotype the deletion would lead to smaller brain and the duplication to larger brain organoids. The presented data to support these claims are rather scarce. They indeed provide data on organoid size, however there is no information as to regard how this micro/macrocpehaly comes about. Only limited amount of cell types are being investigated with immunocytochemistry, which give little insight into the mechanism. Fig 3. The authors performed some very basic immunostaining and concluded that the neuronal maturity of 1q del seemed to be accelerated, whereas 1q dup decelerated from the NPC stage. However, there is no direct evidence provided for this. With simple additional immunostainings authors could already get a much better idea of what is going on. For example the authors could measure the amount of differentiating versus proliferating cells, cell cycle exit, etc (eg BrDU, KI67, pHH3 staining,...)

      We thank the reviewer for the suggestion. In response to this, we plan to analyze additional markers such as phosphor-histone H3 (pHH3) to evaluate the late-G2/M status by immunostaining. In addition, to explain the smaller organoid size observed in 1q del organoids, we will check apoptosis markers such as cleaved-caspase3 by immunostaining and western blotting.

      Further there are some technical aspect that would need to be resolved:

      There is a general lack of brain organoid characterization of the controls. It is unclear on how many independent clones these experiments were performed.

      We constructed one clone per genotype (1q21.1 deletion (1q del), 1q21.1 duplication (1q dup) and CTRL) from one human ES cell strain (khES-1) by next-generation chromosome engineering using the CRISPR/Cas9 system. According to the reviewer’s comment, we have added the information of each clone, including the actual number of each clone in the results section. Following the reviewer’s comment, we also recognized the importance of comparing targeted clones even in the same genotype to verify cellular phenotypes in a targeted clone. However, we consider that at least isogenic ES cell lines are less affected by genetic variances on other regions and epigenetic changes than patients-derived iPS cells.

      • Fig 2C: it is unclear why brain organoid sizes reduce over time. Is this an indication of increased apoptosis? Did the authors measure this?

      In order to respond to the reviewer’s comment, we plan to examine apoptotic markers such as cleaved caspase-3 by immunostaining or western blotting, as mentioned above.

      • What is the reason for using t-test with Bonferroni correction as opposed to one -way (or even two-way) Anova is unclear in Fig 2C

      Analysis of variance (ANOVA) has been regarded as optional when multiple comparisons without F-statistics are performed (Jason Hsu. 1996. Multiple Comparisons: Theory and Methods (Guilford School Practitioner)). We selected the Bonferroni test because we thought we could evaluate our data more strictly with the Bonferroni test than with the Tukey-Kramer test. In response to the reviewer’s request, we analyzed our data using one- way ANOVA with the Tukey-Kramer test. We confirmed that statistical significances were consistent (we can provide both data if requested). We have changed the description in the figure legend and methods section of the revised manuscript.

      • 2E is unclear how they came to the conclusion that dosage dependent size difference in NPC organoids was caused by the number of cells within an organoid, not by the size of each cell or different cell types. Since they only measured the amount of Sox 2 positive cells and used Sox2 to measure cell diameter, whereas Sox2 is mainly expressed in the nucleus.

      We thank the reviewer’s comment. We used images of SOX2 staining because contrasts of each cell in bright-field images were too obscure to be detected using the fluorescent microscopy, BZ-X analyzer, and because we found cell sizes seemed similar between bright-field images and SOX2 staining images. However, this method was not desirable. To respond to the reviewer’s comment, we have counted the number of cells in the images of each NPC organoid using the BZ-X analyzer and calculate the cell number per 1000 µm2. We found the cell density was not significantly different among the 3 genotypes. We understand that counting the cell number of a single organoid would be ideal, but it was impossible because each NPC organoid was too small. We have changed Figure 2E, descriptions in the methods and results section, and the corresponding figure legend in the revised manuscript.

      • How do the authors explain that the Dup cells do not express Tubb neither CTIP2, do they only express NPCs and no neurons?

      We consider this finding supports the immaturity in the cortical organoids with 1q21 duplication. However, we have checked only a few markers for intermediate progenitors and mature neurons so far. We plan to examine immature neuronal markers such as DCX and other mature neuronal markers such as NeuN by immunocytochemistry (ICC) to confirm this finding. Similarly, we will perform expression analysis by real-time qPCR to check mature and immature neuronal cell markers.

      In short, the characterization of the brain organoids at the level of general development, cell types, proliferation, differentiation is underdeveloped.

      We will examine the characterization of the brain organoids in more detail by different techniques as described above.

      1. Electrophysiological assessment of brain organoids derived neurons:

      In figure 4 the authors claim that both CNVs (Del/Dup) show hyperexcitability and altered expressions of glutamate system as common features between the Del/Dup lines. The data to support this are however scarce and far from being convincing:

      The poor quality of the data is represented by images in 4B-E:

      • First the authors choose to dissociate the organoids prior to measure the cells on MEA's. This takes away the advantage of 3D brain organoids, will add a lot of non-physiological stress, cause cell death and lead to unequal distribution of cells over the electrodes, see fig 4B.

      We are afraid that the reviewer might misunderstand our experiment. In this experiment, we used not 3-D brain organoids but 2-D neurons. Based on established neural differentiation protocol (Fujimori et al., Stem Cell Reports, 2017, Toyoshima et al., Transl. Psychiatry, 2016, Matsumoto et al., Stem Cell Reports, 2016), we seeded single-cells dissociated from neurospheres on MEA dishes at the same density (8 x 105 cells per dish) on day 33 and continued culturing for 28 days on the MEA dish before analysis. Thus, we didn’t dissociate cells just before analysis. We could avoid adding non-physiological stress because we kept on culturing on the MEA dish for 28 days.

      • MEA recording are meant to measure network activity and heavily (read: fully) dependent on the network being formed. Cherry picking electrodes for analysis is not justified, analysis should be performed per MEA chip not per electrode. Inclusion/exclusion parameters should be defined before analysis

      We have performed statistical analysis with all chips (electrodes) per genotype in response to the reviewer's request. Even though the distributions of firing rate were not consistent among electrodes, we found the significant differences between CTRL and each mutant (Ctrl vs 1q del: p< 0.001, Ctrl vs 1q dup: p< 0.001, 1q del vs 1q del: p=1.0). We have changed Figure 4E, the descriptions in the methods section, and the corresponding figure legend in the revised manuscript this time. We also reanalyzed burst rates so that all electrodes were included in the statistical analysis. We have changed supplementary Figure 3 and edited the descriptions in the methods and the corresponding figure legend in this revised manuscript.

      • MEA parameters such as Mean firing rate (spike/min) and burst rate are very sensitive to plating conditions, especially number of cells and clustering of cell around electrodes (see 4B). Given that the organoids already differ in size and according to the authors in cell number, but also in the amount of starting NPCs, one can expect very different cell densities/cell types per experiment/genotype. The authors should therefore show for every genotype the matching cell culture images. Also with regard to the claims made about GABAergic neurons the cell type composition at the time of the MEA recording should be characterized for every genotype.

      As mentioned above, in MEA analysis, we used 2-D neuronal culture and seeded cells on each chip at the same density. The distribution patterns of cells were similar among the 3 genotypes. We will show the images of cultured neurons from 3 genotypes in the revised figure. As for the cell type composition, we plan to examine the expressions of GABAergic markers using extracted RNAs from neuronal cells on around 28 days post- dissociation (dpd). As reviewer #2 suggested, we also considered that drug treatment with bicuculine in this MEA system was meaningful. We plan to perform this experiment if the experimental conditions can be optimized.

      • Fig 4B illustrates the points made above. The fact that no activity is observed in the control cells can be due to many different reasons: unequal plating, stress after dissociating cells, poor coverage of the electrodes, poor maturation, too early measuring time point, etc Because the authors have no control over the amount of cells covering the electrodes the data presented here carry very little carry little information. Fig 4B, best illustrates this with large cell clumps and areas without cell bodies. Measurements from these cell cultures are irrelevant and no conclusion can be drawn.

      We suggest that the authors first benchmark this technique with their own differentiation protocol, show robust and reliable recordings on control cells, and only compare to the CRISPR lines at a time point at which the control cells show a decent amount of activity 1Hz. When doing so, also reduced activity can be monitored (For examples see, Trujillo et al, Cell Stem Cell2019 or Frega et al 2019 Nat comm).

      As mentioned above, we seeded dissociated neurospheres in equal numbers on MEA dishes and kept culturing neurons gently for 28 days before analysis. Cell distribution was similar among the 3 genotypes and we could observe cell bodies in the area outside aggregates (we will provide additional bright-field images in the revised manuscript later). Low activities in CTRL neurons at 28 dpd could be observed even in the electrodes covered with dense cells, which were consistent among 3 independent experiments as described above. Nonetheless, we agreed with the reviewer that cellular conditions which could show stable activities even in CTRL neurons were more desirable. We have already tried longer cultures three times, but we could not perform sufficient analyses because neuronal cells became unhealthier after 35 dpd. We will try to improve the experimental conditions and perform analyses if the experimental conditions could be optimized.

      • MEAs measure the output of the network (action potentials). In a network, this can be influenced by virtually every neuronal property (morphology, synaptic input, types ofsynapses, intrinsic excitability, etc). Therefore, the authors cannot conclude only based on fig 4E that the Del/Dup cells are intrinsically hyperactive. To make this conclusion they should measure this directly by assessing that passive and active intrinsic properties of individual neurons.

      In control condition many electrodes do not give any signal. From these experiments it is impossible to know whether this is because of lack of cell on the particular electrode or real absence of activity. Certainly one could not conclude that the del en dup cell are intrinsically hyperexcitable.

      As described above, we could observe the similarity of cell distributions among 3 genotypes. However, as the reviewer mentioned, the assessment of the individual neuronal activity would be better. Thus, we will perform patch-clamp recordings in addition to MEA analysis.

      It seems that from the introduction the authors try to link 1q21 CNVs to epilepsy and ASd, thereby justifying the observed phenotypes.

      • How do the authors reconcile the fact that more mature GABA system is observed in the Del lines with the so called increased activity compared to controls but not to the Dup lines.

      We assumed that cell type compositions differed between 1q del and 1q dup, although network excitabilities were commonly observed in both mutants. We agree that this assumption lacks sufficient evidence even though we have shown the results in scRNAseq (Figure 6E). We consider that checking cell type compositions would be needed to ensure this. Although mature GABAergic neurons were increased in 1q del lines as mentioned by the reviewer, we think GABAergic signals and unknown factors such as epilepsy- associated genes (e.g., GRIN2A and SCN1A) may be involved in the abnormal neuronal firing. We will check the expression of these genes and examine the expressions of GABAergic markers in neuronal cells.

      Single cell RNAseq

      • I'm not a specialist on single cell RNAseq, however it seems that the analysis is underdeveloped and conclusion drawn for these experiments premature. It would be essential to validate some of the generated hypothesis, eg GABA maturity and not merely state as a conclusion (eg title).

      We thank the reviewer for the suggestion. We have revised the title as we mentioned above, and we will revise the main text based on our results appropriately.

      • How do the authors explain that a majority of the cells are Glial cells at day 27, and no presence of neurons.

      On day 27 in our 3-D organoid protocol, cells were still in the developmental stage. That’s why we consistently described it as “NPC organoid” but not “brain organoid” in this paper. Indeed, our rationale for the scRNA-seq study was to determine gene(s) or gene regulatory network(s) when the difference of circumference was significant among genotypes (Fig. 2C). Although the underlying mechanism was not fully understood from our results, we interpreted this result. Radial glial cells (RGs) have the ability to self- renewal with symmetric divisions and play a role in both neurogenesis and gliogenesis (Lui et al. Cell 2011, A Kriegstein et al., Annu Rev Neurosci 2009). A recent study showed that the reduction of NF1, a tumor suppressor protein in the RAS/MAPK pathway, induced excessive production of glial cells, i.e., mainly oligodendrocyte precursor cells (OPCs) accompanied with astrocyte precursor cells, from RGs; furthermore, the reduction of NF1 also enhanced the cell divisions of generated OPCs (Z Shen, BioRxiv 2020). We have checked that the expression of NF1 in the glial cluster was also downregulated in our scRNA-seq data. Thus, we reasoned that the predominance of 1q dup cells in the glial cluster reflected the excessive production of glial cells from RGs, which were related to the alteration of the RAS/MAPK pathway. We will add this interpretation in the revised manuscript next time.

      • How relevant is the changes in the extremely low amounts of GABAergic neurons in the Del cells, no excitatory neurons are present, only NSCs

      In a previous paper, CA Trujillo et al. showed the cell type composition in 3-D human cortical organoids at different time points. GABAergic cells were restricted to later stages and the ratio was still very limited at 6 months (Figure 1J in CA Trujillo et al., Cell Stem Cell 2019). From this fact, we regarded the emergence of GABAergic neurons as meaningful even if the ratio was very low. As for excitatory neurons, we will further check the expressions of excitatory neuronal markers. (According to the screening chart we used, we did not explore excitatory neuronal markers as far as cells did not express SLC17A7 significantly).

      Minor comments

      • It is unclear how many clones were assessed per genotype

      We constructed one clone per genotype. As we mentioned above, we have added the information in the results section of this preliminary revised manuscript.

      • The authors should properly annotate the genotypes 1q21.1 instead of 1q del (line 134)

      We have already annotated the abbreviations of 1q21.1 deletion and duplication in lines 87 and 93.

      • Introduction seems to be somehow off topic since 1q21.1 locus is associated with several neurodevelopmental disorders, including SCZ, but is certainly not specific to ASD and epilepsy. So the premiss on line 86: to study 1q21.1 locus to understand ASD/epilepsy is somewhat misleading. I propose that the introduction would be focussed on the 1q21.1 and not on general on ASD/epilepsy.

      As the reviewer pointed out, 1q21.1 CNVs are associated with other neurodevelopmental and neuropsychiatric disorders. Since our research aims to elucidate the underlying mechanism of ASD, we mainly focused on two representative comorbidities (abnormal brain size and epilepsy), which seemed relatively reproducible in vitro. However, we agree with the reviewer that the lack of information about clinical symptoms of 1q21.1 microdeletion and microduplication syndrome besides ASD was not appropriate. Thus, we will revise the introduction to mention the neurodevelopmental phenotypes of 1q21.1 CNVs in the revised manuscript next time.

      • It is unclear whether they generated heterozygous or homozygous deletions.

      We thank the reviewer for pointing it out. We have generated clones with heterozygous deletion and duplication. We have added the information in the results section of this revised manuscript.

      • The authors should cite Fiddes, I. T. et al. Human-Specific NOTCH2NL Genes Affect Notch Signaling and Cortical Neurogenesis. Cell 173, 1356-1369.e22 (2018).

      As the reviewer suggested, we will cite two papers regarding NOTCH2NL (NOTCH2NLA: Fiddes, I. T. et al., Cell 173, 2018; NOTCH2NLB: Ikuo K Suzuki et al., Cell 173, 2018) when we discuss the alteration of neuronal maturity and brain size. We will add the information in the revised manuscript next time.

      • Many unclear statements eg line 138: Next, we analyzed each single-cell in an organoid

      We thank the reviewer for noticing it. We have made an effort to remove inappropriate sentences in this revised manuscript.

      • Discussion on E/I is very speculative, not supported by any evidence

      In response to the reviewer’s suggestion, we will cut the descriptions which contain too speculative contents in the discussion section of the revised manuscript later.

      Significance

      The general topic of this study is high interest given the strong association of the 1q21.1 with disease. The authors developed interesting ESC line to study in parallel del and duplication. Unfortunately the level of of analysis performed on these organoids is not up the current stat of the art, are of low experimental quality, analyses are limited. Therefore no clear conclusion can be drawn except for the size of the organoids, very little mechanism is provided. This therefore remains a purely descriptive study for which the presented data are rather on low quality and limited impact in its current shape.

      We thank the reviewer for the interest and criticism of our paper. As discussed above, we plan to perform additional analyses and experiments to justify our hypothesis more clearly and try to meet the reviewer’s requests.

      Reviewer #2

      This study was initiated to look at specific cellular and molecular mechanism of the duplication and deletion CNV frequently observed at the 1q21.1 gene locus in an isogeneic human embryonic stem (hES) cell model. The authors note that these CNVs are associated with higher than normal penetrance of ASD and epilepsy and aim to elucidate gene expression differences with single cell RNAseq and functional changes in this model system. The authors further sought to proliferation and differentiation states, in addition to neuronal activity, using both 2D cultures and 3D organoid models. The 1q21.1 gene locus model system made here is unique and the results broadly recapitulate the patient phenotype particularly with observations of macrocephaly in the "1q dup" and microcephaly in the "1q del".

      Reviewers statement:

      We have joint expertise in GABAergic neuronal development, iPSC 2D and 3D culture and ASD human molecular genetics.

      Major comments:

      • Not sure why ASD (if used it should also be spelled out) is mentioned in the title if ASD is only seen in a proportion of human 1q21.1. duplication (~36% will have autism) and 1q21.1 deletion (<10% will have autism) carriers. I would prefer to use 'neurodevelopmental phenotype'. A good update review that is accurate with respect to this CNV role in autism is PMID: 29398931. The authors should also put into the context of their results what is known with other neuropsychiatric phenotypes also seen in these CNV events;

      We thank the reviewer for the suggestion and valuable information. We have corrected the title in the revised manuscript this time. We will also refer to the paper by Fernandez and Scherer (Dialogues Clin. Neurosci., 2017) to discuss the detail of roles and neuropsychiatric phenotypes of targeted CNVs.

      • In Fig 1D the ddPCR validation for the genetic alterations in 1q del shows a normal return to 2 copies of GPR89B. However, in the 1q dup the CNV level is still elevated for GPR89B. Please determine how much further the duplication goes as there are five more potentially affected genes in this region (eg PDZK1P1). Modify the text appropriately to note the potential influence of any of these other genes on the experimental outcomes.

      We thank the reviewer for pointing it out. Figure 1D showed the results of aCGH analysis to confirm the copy number alteration of the targeted region in each clone. This analysis expected that the target region contained GPR89B, as confirmed by PCR shown in Fig. 1B. However, as the reviewer’s comment, the cleavage sites shown in Figure 1D seem not consistent with the result of Fig. 1B. We think it reflects the limitation of the microarray-based CGH technique. Since the locus between GPR89B and LOC101927468 contains extensive repeat sequences, aCGH may not be an appropriate method. Thus, we will apply quantitative PCR (or ddPCR) to determine copy number alternation of each clone in addition to microarray-based CGH.

      • The authors' claim that dosage dependent size differences in NPC organoids is caused by a change in the number of cells within the organoid rather than size - from Fig. 2D, cells in 1qdel organoid appears more compact; a quantification of cell number should be done to support this claim. IHC of D27/28 organoids with GABAergic markers would support authors' claim of alterations of GABAergic components in 1qdel cells. These suggested experiments would take 2-3 days if the organoids are available.

      In response to the reviewer’s suggestion, we have counted the number of cells in the images of each NPC organoid using the fluorescent microscopy, BZ-X analyzer, and calculated the cell number per unit area (1000 µm2). We found the cell density was not significantly different among the 3 genotypes. We have changed Figure 2E, descriptions in the methods and results sections, and the corresponding figure legend in the revised manuscript this time. As for exploring GABAergic components in the NPC organoids, we plan to perform immunocytochemistry (ICC) and RT-qPCR analysis.

      • Fig 4 E shows MEA data from "top 10". What is the top ten? Do you mean data points? There are batch differences in 1q dup with one batch having a lower expression than the other. Increasing the n value to accommodate the high variance observed in this group will greatly increase the validity of the data generated. Also, change the figure legend to indicate the age of these cultures. Given that the controls are not spiking, this data should be extended to probe the developmental profile further to week 9 when normal cells should be spiking so that the baseline activity of this isogenic line can be determined.

      Top 10 meant the ten electrodes with the highest spike rates within one MEA dish. To respond to the reviewer’s suggestion, we have performed statistical analysis with all electrodes per genotype. Even though the distributions of firing rate were quite heterogeneous among different electrodes, we found significant differences between CTRL and each mutant per MEA dish. We have changed Figure 4E, descriptions in the methods section, and the corresponding figure legend in the revised manuscript this time.

      The reviewer is correct that the spike rates in 1q dup were quite different between different batches. We noticed from our experiments that spike rates were easily affected by the health conditions of cells. Some mutant batches showed mild spike activities like circles in 1q dup, and some had very vigorous activities. We have even checked the reproducibility of significant differences between CTRL and each mutant per MEA dish with 3 independent experiments. As for the extended cultures to detect more frequent signals in CTRL neurons, we have already tried longer cultures three times. However, we could not perform sufficient analyses because neurons became unhealthier after 35 dpd. We will further try to improve the experimental setup and perform analyses if the experimental conditions could be optimized.

      • Single cell RNAseq data suggests a cluster of GABAergic cell types that are appearing in the 1q del condition, but not in the 1q dup or control groups. The authors suggest that these GABAergic cells are excitatory because the chloride gradient has not yet been altered (no change to KCC2 expression). The authors should substantiate this idea in the MEA system with bicuculline treatment to block GABAergic transmission (drug washed in and out) to show that the spike activity observed in the 2D MEA experiments is due to GABAergic excitatory transmission. Ideally, this should be done for both the 1q dup, 1q del as well as controls.

      We thank the reviewer for the suggestion. We agreed with the reviewer that drug treatment with bicuculine in this MEA system was meaningful to identify cellular properties. We will try to set up the experimental conditions and perform this experiment if the condition can be optimized.

      • Fig 5A. The clustering method for single cell RNAseq seems shows a large proportion of "other" class cells begging the question as to what they are. Is there another cluster analysis, which might be used eg partially supervised/unsupervised clustering methods from the Allen Institute to help determine what these might be?

      We initially made the screening chart for cell-type specifications according to cellular markers from Allen brain map (http://celltypes.brain-map.org/rnaseq/human_ctx_smart- seq) and a published paper (CA Trujillo et al., Cell Stem Cell 2019). We defined this cluster as “other” because this cluster did not have any significant genes in the 1st screening, although we understood that the specifications of all clusters were desirable. To investigate the cellular property in this cluster, we tried to put significant genes into Metascape to check gene ontology. We found some terms about immune cells (mainly lymphocytes and macrophages), cancer cells, roles for inflammation, and apoptotic process, although miscellaneous terms were also included. We have provided the screening chart as supplementary Table 4 in this revised manuscript. Next time, we will add a more detailed description of the ‘other’ cluster in the revised manuscript.

      • Fig 5 B. The manuscript requires additional markers used in the cluster analysis. Particularly, expression of the GABAergic progenitor markers DLX5 and 6 as well as EMX1 for the progenitor cells. Details of all markers and cluster algorithms should be made available in supplementary tables and R scripts, so that others can repeat this analysis.

      In response to the reviewer’s suggestion, we will check these GABAergic progenitor markers and add them to the revised figure and manuscript later. As we mentioned above, we performed the cell type specification of each cluster manually using our screening chart and did not use R scripts. We have provided the information on the screening process in supplementary Table 4 of this revised manuscript.

      • Fig 6. Expanding the heat map of 1q del and 1q dup with CTRL expression would help with context for baseline levels in this isogenic cell line. Please also include additional GABAergic markers GABRA1, GABARB2and GABARG2, (subunits of the most common GABA-A receptor) SOM, VIP, NPY, (other GABAergic interneurons in addition to PVALB) DLX6, EXM1 and for excitatory markers GRIA2, GRIA3 and GRIA4 (all of which have developmentally regulated expression patterns) that will provide more context with the synaptic receptor literature. GRIN2D is expressed only in GABAergic cell types and so I would suggest including this NMDA receptor subunit as well.

      We thank the reviewer for the valuable suggestions. To further explore the cellular properties in 1q del and 1q dup, we will check these cell markers additionally and show the results in the revised figure and manuscript next time.

      Minor comments:

      1. Additional references (eg. Schafer et al. 2019) should be discussed in relation to the authors' suggestions of altered neuronal maturity.

      As the reviewer suggested, we will include the paper in our references and discuss the associations between neurodevelopmental disorders and altered neuronal maturity.

      1. The authors show no change in PAX6 expression between genotypes, but significant differences in TBR2 expression between genotypes (Fig. 2C) - this alteration in normal cortical development should be included in results and discussed.

      Radial glial cells (RGs) have abilities of both self-renewal and neurogenesis (Lui et al. Cell 2011, Fiddes, I. T. et al., Cell 2018). Fiddes et al. showed that if the balance leans toward neurogenesis, premature differentiation with higher TBR2 expressions was observed in week 4 human cortical organoids (Fiddes, I. T. et al., Cell 2018). However, the predisposition to neurogenesis is thought to cause the earlier shortage of RGs. Finally, these cells remain abundant in week 4 organoids. We considered this was why TBR2 expression was significantly different in 1q del, but PAX6 was not. We will add this interpretation in the revised manuscript next time.

      1. In the introduction (Line 67): The author's state that "alterations in brain size is common in patients with ASD" using one meta-study to support this claim. Further primary studies should be consulted and the authors should give the proportion of the population with ASD and altered brain size to support this statement. In addition, the age range should be supported with primary papers.

      As the reviewer suggested, we have cited some primary studies about the prevalence of altered brain size in ASD patients and its age range in this revised manuscript. Since it seems still controversial whether the enlargement of brain size persists or not until adolescence and adulthood (E H Aylward et al., Neurology 2002; J Piven et al., Am J Psychiatry 1995), we have also modified the description in this manuscript.

      1. Line 73. The authors suggest that the brain growth deviations are "Postnatal stage restrictive". Citations are needed to support this statement.

      As the reviewer suggested, we have cited some primary studies as described above and revised the manuscript.

      1. In the scRNAseq data results please report total cell numbers counted for each cluster and for genotype group.

      We apologize for the lack of information and thank the reviewer for noticing it. We have added the information in the results section of the revised manuscript this time.

      1. In the results section (line 269-270) the authors suggest that 1q del cells are in a more mature state because the GABAergic cells are present and glutamatergic genes are similarly altered in 1q dup and 1q del. However, the results from the gene cluster data suggests that there is a very high proportion of progenitor cells (Progenitor 1 and 2 clusters), which seems to argue against faster maturation. This suggests to me that cell fate is being modified here.

      We thank the reviewer for the valuable suggestion. Schafer et al. (the suggested paper in minor comment 1) reported that altered gene expressions in neuronal modules have already been observed in NSCs derived from ASD patient-derived iPSCs. As the reviewer suggested, we plan to consider our results in terms of the alteration of cell fate and neuronal maturity in the revised manuscript later.

      1. Label figures on each page for ms.

      As the reviewer suggested, we have labeled figures at the bottom right of each page.

      1. Fix typos and heat map legends (currently no colors for log2 fold change in Fig 5 or 6)

      We apologize to the reviewer for typos and grammatical errors. We made an effort to remove them. We also apologize for the lack of color information in the legends of Figure 5 and Figure 6 and thank the reviewer for noticing it. We have added the color information in the figure legends of the revised manuscript this time.

      Significance

      Overall the study is clearly described, and the outcomes have been substantiated to a certain degree, but requires a bit more work. This paper does represent a technical 'tour de force' and the authors should be applauded for sticking it out where other labs have so far failed. It might be useful to mention even in brief, of the number of 'failed' (failed or inaccurate) events. The availability of the lines should also be clearly stated.

      We thank the reviewer for the positive comments. In addition to the plans described above, we have added more detailed information, e.g., how many screenings were carried out to get positive clones, in the revised version of the methods and results section. We have also added the descriptions about the availability of the 1q21.1 CNV cell lines in the data availability section of this revised manuscript.

      Reviewer #3

      In this research study by Nomura et al., the authors develop novel hESC-based models of reciprocal CNVs in distal 1q21.1 using CRISPR/Cas9 genome editing technology. Specifically, the authors genome edit KhES-1 cells to produce two isogenic hESC line that contain either a deletion or duplication of this chromosomal region. Patients with 1q21.1 deletion and 1q21.1 duplication syndromes show abnormal head size in conjunction with multiple neurodevelopmental co-morbidities such as epilepsy, developmental delay, and neuropsychiatric abnormalities. This is an important study since it provides robust research tools to understand molecular and cellular mechanisms that may underly these syndromes. Through generation of cortical organoid models, the authors demonstrate 1q21.1 deletion and duplication organoids show deficits in growth and over-growth, respectively. Additionally, the authors provide data that 1q21.1 deletion and duplication organoids show altered signaling cascades which may underly growth deficits and also abnormal neurodevelopment which may underly hyperexcitable neurons as demonstrated by multi-electrode array analysis. While my enthusiasm for this study remain high, I do have a significant number of major and minor reservations specific to the experimental design and analysis that if addressed would provide for an excellent contribution to the field.

      Major concerns:

      1. Though the authors provide extensive data in this study, major revisions are necessary to interpret all of their data in the context of the phenotypes they are observing in organoids and MEA analyses. In addition, the current study lacks cohesiveness throughout the various experiments and does not provide text that clearly unifies the results of the study. For example, no interpretation of higher TBR2 levels in 1q21.1 deletion is provided. Does this mean these organoids show accelerated neuronal differentiation? Also please see my comment regarding TBR2 staining the next section.

      Other examples throughout the manuscript in which there is no clear interpretation of the data or inadequacies of unifying the results of the experiments.

      We thank the reviewer for pointing out that our manuscript had inadequacies of the integrity and cohesiveness throughout our data. With additional data as follows, we plan to improve these issues in the revised manuscript later. As for TBR2 expression, we considered that higher TBR2 expressions in week 4 human cortical organoids showed the predisposition to neurogenesis in 1q del as demonstrated in a previous paper (Fiddes, I. T. et al., Cell 2018). We will add the description in the revised manuscript later.

      • a. Additional interpretation why 1q21.1 duplication organoids show increased growth is lacking. The single cell RNA sequencing results show there are more glia, but no further interpretation is giving why these organoids show an overgrowth phenotype. Inversely, the 1q21.1 deletion organoids show more progenitor cells, but it is not apparent why this should result in decreased cell growth.

      As we have mentioned above, we considered that the predominance of 1q dup cells in the glial cluster reflected the excessive gliogenesis from radial glial cells and enhanced cell divisions in relation to the alteration of the RAS/MAPK pathway (Z Shen, BioRxiv 2020). We plan to analyze additional markers related to cell proliferation and cell division by immunostaining to validate the above hypotheses. To investigate how 1q del organoids showed smaller size, we plan to examine apoptotic markers such as cytochrome C and caspase 3 by culturing NPC organoids again.

      • b. The authors suggest that 1q21.1 duplication organoids are resistant to neuronal differentiation. What data supports this hypothesis other than the fact there are no mature neuronal cells are present in their single cell RNA sequencing data.

      We considered that the results in Figure 3B and Figure 3D also supported this hypothesis that 1q dup organoids expressed the lower intensity of neuronal markers. Since we have only checked a few markers by immunocytochemistry (ICC), we plan to examine additional markers, i.e., immature neuronal markers such as DCX and other mature neuronal markers such as NeuN, as well as proliferation markers such as phospho histone H3 to ensure this hypothesis.

      • c. The MEA analyses show hyperexcitability in both 1q21.1 deletion and duplication cultures. Since the authors suggest 1q21.1 duplication organoids are resistant to neuronal maturation, no interpretation is given why they show hyperexcitable phenotypes.

      In the MEA analyses, we used not 3-D cortical organoids but 2-D neurons because the required culture period to emit electrical activities was thought to be much shorter in 2-D neurons according to some previous studies with human pluripotent cells (A Taga et al., Stem Cells Transl Med 2019; CA Trujillo et al., Cell Stem Cell 2019). We considered that 2-D neurons on 28 dpd (day 63) had much higher maturity than NPC organoids and even 1q dup neurons had already become mature enough to emit spike activities. We will also check neuronal marker expressions using 2-D neurons around 28 dpd by RT-qPCR to ensure this.

      • d. The current study is lacking extensive immunohistochemical stains of representative markers that validate their findings from their single cell RNA sequencing experiments. For example, glial cell markers such as GFAP should be analyzed in 1q21.1 duplication organoids. Additionally, progenitor cell markers such as PAX6 and neuronal markers such as MAP2 and synaptic markers such as SYNAPSIN and others should be incorporated in the study.

      We thank the reviewer for the suggestions. We plan to perform additional IHC staining for NPC organoids with the suggested markers and OPC markers.

      1. Major details are lacking for the single cell RNA sequencing experiments.
      • a. How many cells were analyzed from each group? How many organoids and what age of organoids were analyzed from each group, were they pooled together? Why was a log2FC 1.2 used as a threshold? It is unclear how the authors identify Progenitor 1 and 2 cell clusters? Are they distinct clusters or is this a continuum of differentiation. The progenitor 1 and 2 clusters were chosen based on expression of the ID transcription factors, but no text was provided why these genes specify progenitor cells.

      We apologize for the lack of information and thank the reviewer for noticing it. We described the number of analyzed cells (32,171 cells: 1q del; 10,682, 1q dup; 11,987, CTRL; 9,502) in the results section (line 186) of the original manuscript. However, we could not count how many organoids were analyzed because they were too tiny (diameter; 400-700µm). Many organoids were needed to get the prescribed number of cells (25,000 cells per genotype). According to the analyzed data of size measurement for NPC organoids by fluorescent microscopy, at least 1,500 organoids were collected per genotype. We gathered all cultured organoids in the same batch, dissociated them, and then loaded the prescribed number of cells into the machine. We have added the description of the number of input cells in the methods section of this revised manuscript.

      We used the threshold of log2FC > |1.2| so that the total number of DEGs became around 100-1000 in both bulk and the NSC cluster to avoid a very high or low number of DEGs. Some previous transcriptome studies used the same or even smaller thresholds (Xiaoming Ma et al., Front in Genet 2020; J Zhong et al., Brain Res 2016; Y Wang et al., BMC genomics 2016). We have added these descriptions in the methods section of this revised manuscript.

      As for progenitor-1 and 2, we regarded them as a continuum based on the marker expressions. We chose ID transcription factors for progenitor cells, referring to a published paper (CA Trujillo et al., Cell Stem Cell 2019) as we have described in the methods section (line 633). Several articles have reported that ID transcription factors regulate proliferation and differentiation of neural precursor cells (K Yun et al., Development 2004; D Patel et al., Biochim Biophys Acta 2015).

      Minor concerns:

      1. I would suggest rephrasing the title of the study as it does not clearly convey the advancement to the field. I would suggest the following or something similar this is more concise: " Modeling Reciprocal CNVs of Chromosomal 1q21.1 in Cortical Organoids Reveals Alterations in Neurodevelopment."

      We thank the reviewer for the concrete suggestion. We have revised the title as the reviewer suggested in this preliminary revised manuscript.

      1. The length of the discussion is over extended and should be revised to become more concise.

      We thank the reviewer for pointing it out. We will shorten the beginning part and delete unnecessary sentences in the discussion section of the revised manuscript later.

      1. Additional experiments should be performed to characterize pluripotency of hESC clones generated after genome editing other than staining for alkaline phosphatase activity.

      At minimum, karyotyping in addition to measuring pluripotency markers such as NANOG and OCT3/4 should be performed.

      Karyotyping of wild-type ES cells has been checked by Institute for Frontier Medical Sciences, Kyoto University before being provided. After genome editing, we performed aCGH analysis for all 3 genotypes using the wildtype ES cells as reference genes and confirmed no chromosome aberrations were generated. We have added the information about karyotyping in the methods section of this preliminary revised manuscript.

      As for pluripotency markers, we performed RT-qPCR analyses with ES cells after genome editing and confirmed that OCT3/4 was highly expressed than internal control genes. (We can provide the raw data if requested).

      4) There are several dozen instances of spelling/grammatical and word choice errors throughout the manuscript. For example, line 24 reads "We generate isogenic..." should read "We generated isogenic... "

      • a. Line 25: "opposite organoid size" as written is confusing to interpret.
      • b. Line 46: "have been considered in the context of ASD" would read more clearly as "have been thought to underly ASD etiology."
      • c. Line 53: "in the study of neurological development" should read "nervous system development".
      • d. Line 118: ".. to detect the CRISPR target site for deletion" should read "to detect the CRISPR target site. For the deletion, we checked... "
      • e. <![endif]>Line 119: "...flanking the CRISPR target site; for duplication, we amplified.. " should read "flanking the CRISPR target site, and for the duplication, we amplified..... ".
      • f. Line 127: "we prepared control cells (CTRL) that transfected.... should read ""we prepared control cells (CTRL) that were transfected. ".
      • g. Line 185: "organoid size and mature level" should read "organoid size and developmental maturity."
      • h. In line 40, "We made cryosections of .... should read.... "We performed IHC for the three organoid genotypes on day 27... " i. <![endif]>In Supplementary Figure 8, line 554, "replictes" is misspelled.

      We apologize to the reviewer for many typos and grammatical errors and thank the reviewer for pointing them out in detail. We have corrected these errors as the reviewer suggested.

      5) Line 181: "with a little higher degree of.. " should be re-written more precisely and with more scientific accuracy.

      As the reviewer requested, we have corrected the sentence in this revised manuscript.

      6) Line 216, The use of the colloquial phrase: "On the other hand.. " should be replaced with more formal language. For example, "In contrast, the number of downregulated....

      We thank the reviewer for pointing it out. We have corrected this colloquial phrase at 4 locations.

      7) In line 201, Pprogenitor is misspelled.

      We apologize and thank the reviewer for noticing it. We have corrected it in this preliminary revised manuscript.

      8) In Figure 3, images showing TBR2 staining does not appear correct as this protein should be localized to the nucleus similar to SOX2 staining. I would suggest optimizing conditions such as utilizing antigen retrieval or other methods to reduce non-specific cytoplasmic staining.

      We thank the reviewer for the valuable suggestion. We plan to optimize the condition and try other neuronal lineages markers such as DCX and NeuN.

      9) I would suggest simplifying the text describing the primers utilized in this study and display them in a table format.

      As the reviewer requested, we will make a supplementary table of primer sequences in the revised manuscript later.

      10) Information regarding the number of technical replicates used in this study is lacking throughout the manuscript. For example, how many hESC clones were analyzed? How many organoids were analyzed for each specific assay such as single cell RNA sequencing and MEA analyses? How many independent experiments were used for these studies?

      We apologize for the lack of information. We have constructed one clone per genotype one human ES cell strain (khES-1) and performed all further analyses. The precise number of NPC organoids in scRNA-seq could not be counted, as we mentioned above. As for MEA analysis, 8 x 10^5 cells were seeded on each dish as described in the original manuscript. However, it was unclear how many neurons were observed on each electrode because multiple cells and neurites covered each electrode. Thus, spike activities were detected as the network of many neurons. We have added the information in the methods section of this preliminary revised manuscript.

      11) It is not clear why the authors choose two types of organoid methods in the study. The first protocol referred to as the "NPC organoid method" is synonymous to neurosphere culturing and should be referred to as neurospheres throughout the manuscript.

      One protocol (Fujimori et al., Stem Cell Rep., 2017) was not for 3-D organoids but 2-D neurons (Figure 4A). Thus, we considered neurosphere and NPC organoid were different.

      12) In Figure 4, panel C should be referred to as a local field potential trace and not a waveform.

      We thank the reviewer for pointing it out. We have corrected the description as the reviewer suggested.

      Reviewer #3

      This is an important study since it provides robust research tools to understand molecular and cellular mechanisms that may underlie 1q21.1 deletion and duplication syndromes.

      We thank the reviewer for the positive comments. We plan to perform additional analyses and experiments as described above and try to meet the reviewer’s requests.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their careful reading, positive feedback and constructive criticisms of our manuscript. Their primary points of concern were that the discussion was too long and too speculative, and that the title did not sufficiently represent our work. We have now cut the discussion in half, and we have also changed the title to more precisely reflect our paper, and made some other minor changes in the text (all highlighted in blue).

      Below, we provide responses to each of the raised issues.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): The study was very well conducted by the group, selecting appropriated methods for achieving the aimed objectives. The sample were abundant and the statistical treatment were suitable for the size of samples, as well to compare different methods used in this study. The results in general were properly exploited by the authors, clearing many aspects of the role/function of the trophallaxis fluid. The results of this manuscript are apparently suggesting that young colonies prioritize the metabolization of carbohydrates, while mature colonies prioritize the accumulation and transmission of stored resources, amongst other processes. This study cleared many aspects about the role/function of the trophallaxis fluid for the colony.

      We are happy the reviewer agrees with our choices of methods, sample sizes, and statistics, and we are pleased that they have come to the same conclusions.

      Even considering the high level of present investigation, still there are some aspects that could be improved by the authors:

      • The text in general is relatively long with an over use of citations of literature;
      • The discussion is interesting, but some times too much speculative; if the authors could attenuate their speculative statements, the text would become more objective and fluid;

      Thank you for this feedback. These comments truly helped us strengthen the manuscript. We have now streamlined the text, cutting down the introduction, cutting in half the discussion and we have made more explicit what is statement and what is speculation (more on this in response to reviewer 2).

      • The results shown in figure 6A and 6D, relative to the processed of neutrophils degranulation and complement cascade, respectively. The authors did not discuss these results; is there a meaning at level of trophallaxis fluid role for the colony ? This was not discussed in the manuscript.

      We thank reviewer #1 for pointing out these results. We have now addressed these terms in lines 277-284 of the discussion:

      “Our gene-set enrichment analysis showed significant enrichment in immunity-related proteins characteristic of phagocytic hemocytes (58) in trophallactic fluid (‘innate immune system’, ‘complement cascade’, ‘neutrophil degranulation’). These results indicate that hemocytes may themselves be transmitted mouth-to-mouth, and generally shows the involvement of the social circulatory system in colony-level immune responses with implications for social immunity.”

      • Considering the very high scientific quality of the present study, the authors could deposit all the raw proteomic data in a international reliable repository of proteins/DNA DB, since it will be required by top journals.

      We wholeheartedly agree, and all data are now shared online through ProteomeXchange.

      Reviewer #1 (Significance (Required)): Significance:the present investigation represents an important contribution for the knowledge the the exchange of signals within the colony, to synchronize the physiology and development of the hive as whole (the concept of superorganism. The existing data about the composition and potential role of the components from tropahallaxis fluid is very small, compared to the present results. The present study is a master piece of knowledge about the importance of eusociality.

      Thank you for recognizing the importance of this study and affirming our work in such a wonderful way!

      **Audience:** all those scientists involved with social insects; biochemists/protomists dedicated to insect biology, biochemistry and physiology. **My expertise:** biochemistry of Arthropods secretion, in special of honeybees, ants and wasps. **Referee Cross-commenting** I think that both reviews aare complementary to each other; both reviews agree with the need to reorganize the text making it more compact and objective. Essentially, the auhtors must focus in the concept of trophallaxis. Thus, the biochemical processes outlined by proteomic analysis should be addressed to explain how the shared physiology of colony works out.

      Our discussion now focuses more on trophallaxis as a whole, and the biomarker-like quality of the changing proteome. We agree the biochemical processes and their role in the shared colony physiology are fascinating topics. We have not yet performed follow-up experiments with the many proteins present in this fluid and thus do not want to over-conclude. We have now stated more clearly in the discussion what the current data can reveal about these topics, what is assumed via orthology, and what needs to be addressed in future studies.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): This ms provides a comprehensive proteomic analysis of the trophallactic fluids extracted from carpenter ants. The analytical methods are state-of-the-art, and the results presented should fuel many studies. The vision of the research program, embodied in the title of the paper, is very exciting and is to be encouraged. However, the title of the paper in no way reflects the content of the paper, as none of the functional processes mentioned have been proven. This will require a lot of work and the development of perhaps new bioassays. I truly hope the PI's lab takes this on a deep and substantial way; the notion of trophallaxis and its socially exchanged fluid has long captivated the fancy of social insect biologists, but with a few specific exceptions, the promise has not yet been realized. The technical and descriptive results presented here lay a strong foundation. For purposes of present publication, I strongly recommend a different title and a revised discussion that reflects the disconnect I outline. Cause/consequence issues need to be addressed.

      We thank reviewer #2 for seeing our vision and that this is indeed foundational work that will “fuel many studies.” We also agree that the title and discussion contained too much speculation. The aim of this paper was to prove that there is systematic variation in trophallactic fluid in natural populations that correlates with biologically important social conditions, and further, that some proteins in this fluid can both act as biomarkers and be informative about underlying molecular processes. We have now communicated this more clearly in the introduction. In the revised version of the paper, we have reduced the speculation, and where appropriate, made it clear when there is speculation.

      For example, discussion lines 233-238:

      “Overall, our data reveal a rich network of trophallactic fluid proteins connected to the principal metabolic functions of ant colonies and their life cycle. Pinpointing contexts that induce changes in trophallactic fluid, along with the exact targets and functions of the proteins, are important subjects for future work. Our establishment of biomarkers transmitted over the social circulatory system that correlate with social life will allow researchers to formulate and test hypotheses on these proteins’ functional roles.”

      Three technical points: 1) Sample sizes are low for some analyses (2/group)--though they are cleverly pooled.

      We are not sure what the reviewer is referring to – none of our sample types had this low sample size (see SI Table 1 for sampling scheme). In contrast, for a proteomics study, our sample sizes are quite high. We are aware that for a study focusing on a natural population, the colony-level sample size of 16 (laboratory colonies) can be considered low, but this has been taken into account in our stringent statistical analyses.

      2) How to distinguish between what animals actually transmit and what is found in the gut? There could be differences.

      This has been addressed in our previous work, where it was shown that the crop content is equivalent to what is exchanged among individuals of this same species during the act of adult-adult stomodeal trophallaxis (Figure 1A, LeBoeuf et al. eLife 2016). We have now clarified this in the methods section of the current paper (line 361-364).

      “Trophallactic fluid was obtained from CO2- or cold-anesthetized workers whose abdomens were gently squeezed to force them to regurgitate the contents of their crops. This method of collection was shown previously to correspond to the fluid shared during the act of adult-adult stomodeal trophallaxis (17).”

      3) Is there evidence that the substances found are not just the product of digestion of ingested food? The differences between lab and field colony samples supports this.

      In the type of proteomic analysis we have performed (the most commonly used proteomics approach when a genome is available), we detect only proteins found in the reference genome of interest (in our case Camponotus floridanus), so excepting cannibalism, we should not see proteins that originate from food. Note that this is why we do not provide lab colonies with the typical lab-reared ant diet that includes honey, as bees are also Hymenoptera, and royal jelly and trophallactic fluid have many proteins in common. Cannibalism could result in trace observation of many proteins, but could not produce the consistent and high-abundance set of proteins that we have observed as they are not produced in those precise ratios in larvae or adults.

      The observed shift in trophallactic fluid from field to lab may reflect a change in diet or microbiome and these are questions that could be further investigated in future work (mentioned in lines 229-232). The clear difference we observe between trophallactic fluid of young and mature colonies, or the difference between the worker castes within a colony, is evidence that the variation observed in trophallactic fluid reflects more than diet.

      “Trophallactic fluid complexity declines over time when colonies are brought from the field to the laboratory. This may reflect dietary, microbiome or environmental complexity – typical of traits that have evolved to deal with environmental cues and stressors (e.g. immunity, (37)).”

      Reviewer #2 (Significance (Required)): The paper addresses a very important topic that should be of widespread interest to social biologists. Journal choice should reflect that this is a technically excellent paper that presents descriptive information but functional significance is highly speculative.

      We appreciate that the reviewer agrees that our results are of widespread interest to social biologists. Indeed, our results must be somewhat descriptive, as we are working on a mostly unexplored socially exchanged fluid in a natural population. However, our study design tests clear hypotheses with preplanned sampling and experimental transfer of ant colonies to a new laboratory environment. We present confirmatory results of the hypothesis that trophallactic fluid is complex mixture of biomarker-like molecules and that these biomarkers can be used predict sample origin through machine learning (see random forest predictions, emphasized in lines 151-152). The fact that our evidence for this is correlative does not render it speculative. Indeed, in both ecology and in much of medicine, using correlative evidence is the norm, as it is often impossible to manipulate ecosystems, natural populations and some organisms in a safe and controlled manner. This is what convinced us to invoke the term ‘biomarkers,’ as biomarkers are excellent examples of molecular correlates of larger conditions that have spurred advances in biology and medicine.

      Some of the next steps in our research will be, as reviewer #2 suggested, additional studies on the roles of individual compounds of trophallactic fluid, building on the results of this paper. Additionally, while this study may not have explored the roles of specific molecules, open ended exploration is extremely important and necessary for any scientific advancement in the long run (eLife 2020;9:e52157).

      All in all, we are grateful for this comment, as it showed us that we must communicate the aims of our work more clearly – which we have now done both in introduction (line 77-91) and throughout the discussion.

      **Referee Cross-commenting** Yes. Most of the discussion is pure speculation because we do t k ow what is exchanged and what the modes of action might be. But it's a great start!

      We have reduced the speculation on the roles of single molecules, and we hope our responses to the points above clarify some of the reviewer’s uncertainties about what is exchanged. However, we do still outline hypotheses for potential functions and origins in the discussion section, as this study is intended to be a foundation for new lines of research.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #2 (Evidence, reproducibility and clarity):

      This paper attempts to address a current, clinically relevant question utilizing novel statistical modeling. The authors comprehensively assessed the presence of criteria and non-criteria aPL in a heterogeneous cohort of 75 COVID patients and 20 non-infected controls. They found 66% of COVID patients had positive aPL and demonstrated a correlation between aPL and anti-SARS-CoV-2. However, I have several major concerns:

      1. The cohort is extremely heterogeneous. COVID-19 samples that were used included hospitalized patients and those who had COVID more than 2 months ago and were convalesced (29% of samples). Severity of disease does influence autoreactivity and the presence of autoantibodies. The prevalence of autoantibodies among patients who are acutely ill will be much different than those who are convalesced. I think it would be prudent to assess the presence and correlation of aPL among those two groups separately.

      We thank you for pointing out the complexity of our study population, consisting of multiple cohorts from different centres. Exactly the above-mentioned heterogeneity of our cohorts and their variables is the reason why we employed linear mixed-effect models. Linear mixed-effect smodels, accounting for both fixed as well as random effects, are suitable to address potentially confounding factors. Along these lines, disease severity (different in the convalescent and the acutely ill individuals) as well as the relation of the time of sampling to time of disease occurrence (days post onset of disease manifestation) were included as fixed effects in our mixed model. Thus, our model accounts for potential differences between the acute phase of infection and convalescent phase and would capture them if relevant.

      In order to increase the rigour, we have performed an additional analysis where we excluded the convalescent individuals from the model (see Fig. 3C). The results obtained are in line with results already shown (Fig. 3B, 3D).

      In general, we have pursued a largely data-driven exploratory, and not a hypothesis-driven, approach. Clearly, we could have decided to set a stringent focus on a cohort without complexity. Yet, our approach encourages heterogeneity, which we address using an adequate model. Since, perhaps, the model choice, the model itself, and the data-driven approach were not explained extensively enough, we have added a more detailed account in the manuscript, lines 317-334 and lines 394-403.

      1. Sampling of the patients is concerning, 35% are plasma and 65% are serum. It is undesirable to put data from plasma and serum together to perform analysis.

      We thank the reviewer for raising this important concern. We have aimed to be as rigorous and transparent as possible in the description of the cohorts (see Tables 1 and 2) for serum/plasma). While we agree that, in general, it would be best if either only plasma (i.e., only heparin plasma or only EDTA plasma) or only serum was used, the authors wish to clarify that for both SARS-CoV-2 IgG profiling as well as for LIA, plasma or serum can be used interchangeably. We can formally show this. We have conducted a SARS-CoV-2 IgG profiling experiment on patient-matched samples (plasma and serum). Data is unambiguous about that there is no effect of plasma or serum on the assay outcome (Fig. S3A and S3B), with a Pearson correlation coefficient of 0.9942 (95% confidence interval: 0.9865-0.9975) and R2 of 0.9885. Bland-Altman analysis does not indicate any significant bias (Fig. S3C).

      For the detection of APS antibodies with ELISA, literature is suggestive of no relevant interference by the usage of plasma or serum on the measured value (Pham et al., 2019). To formally reassess this, we measured aPL autoantibodies with LIA in one matched plasma and serum sample of an individual with high-titre aPL antibodies and of one high-titre individual whose plasma was spiked into non-reactive plasma and serum (Fig. S2A and Fig. S2B). We found the same pattern of IgM and IgG aPL-positivity in both matched serum and plasma samples as well as in spiked serum and plasma samples, with a Pearson correlation coefficient of 0.9974 (95% confidence intervals: 09611-1.034) and R2 of 0.9813 (Fig. S2A). Bland-Altman analysis did not indicate a significant bias (Fig. S2B).

      We therefore conclude that in our study, using both plasma as well as serum has no effect on the validity of our results.

      1. LIA based assays were used to assess the presence of aPL and results were reported in OD rather than standardized units. While the same group demonstrated a positive correlation in the past between LIA OD and internationally accepted ELISA-based aPL assays, the validity and clinical utility of these LIA assays still require further evaluation. Furthermore, OD>50 was used as a positive cut-off. How this cut-off was determined and how it relates to internationally accepted positive aPL cut-offs (99th percentile or greater than 40) remains unclear.

      We thank the reviewer for mentioning concerns on LIA. The validity of this technology has been confirmed in multiple peer-reviewed publications (Roggenbuck et al. Arthr Res Ther 2016;18:11, Nalli et al. Autoimmunity Highlights 2018;9,6). In terms of cut-off detection, processed strips were analysed densitometrically employing a scanner with the evaluation software Dr. DotLine Analyzer (GA Generic Assays GmbH). The cut-off of 50 OD units was determined by calculating the 99th percentile of 150 apparently healthy individuals as recommended by the international classification criteria for aPL testing and Clinical and Laboratory Standards Institute (CLSI) guideline C28-A3 (Roggenbuck et al. Arthr Res Ther 2016;18:11, Nalli et al. Autoimmunity Highlights 2018;9,6). A corresponding sentence has been added to the METHODS AND MATERIALS section.

      For our study, we aimed to perform the maximum number of tests possible with limited sample volume and have therefore chosen LIA. We are aware of the discussion on internationally accepted cut-offs for clinical APS diagnostics. However, we would like to point out that our manuscript is not a case report on patients diagnosed with APS, nor do we aim to modify diagnostic standards set in the international consensus statement for the classification criteria for definite APS (established in 2006).

      Moreover, the OD ≥ 50 was used as a cut-off in one analysis (with Fisher’s exact test for statistics) in our manuscript and was re-assessed using Mann-Whitney/Wilcoxon rank sum test on a continuous scale (Fig. 1C and 1D). All subsequent analyses were not contingent on an OD cut-off. We believe that this is clearly stated in the manuscript.

      1. While the authors attempted to evaluate the presence of both IgG and IgM aPL in COVID patients, only 65% of samples were tested for both IgG and IgM aPL.

      We agree that testing the entire collective for IgG and IgM isotypes would have been best. In fact, we would have been interested in also including the IgA isotype. Inconveniently, sample volume is sometimes limiting.

      We have been clear about the omission of IgG aPL measurements in the samples from Zurich (see lines 214-215). We consider this a limitation, however, our data indicated that IgM aPLs are more immediately relevant in the context of SARS-CoV-2. While this has been surprising to us, we would like to highlight that this is a manifestation of the quality of a data-driven approach where data, much more than belief, build the foundation for conclusions. Along these lines, we could have easily omitted all data on IgG aPLs without compromising the message contained in our manuscript. However, we stand behind our decision to show all data even if, in the case of IgG aPL, (1) they are mostly negative and (2) they are incomplete.

      1. 26 patients had anti-SARS-CoV-2 data already available. Whether those were tested on the same samples and at the same time points as aPL ais not clear.

      We apologise for not having been clear about this in the text. The 26 samples from Zurich had been included in another study where their respective anti-SARS-CoV-2 Spike ECD, RBD, and NC p(EC50) values were used (Emmenegger et al., 2020). Thus, the p(EC50) values have been re-used in the current manuscript. The aPL autoantibodies were measured on exactly the same samples. We have tried to improve the explanation of this in the text, see lines 300-301.

      1. The novel statistical modelling design is interested. However, as there are concerns about the data put into the modelling, the validity of the conclusions is debatable.

      We thank the reviewer for being interested in the statistical model we used. Linear regression analysis belongs to the standard equipment when performing epidemiological analyses (see e.g., Szklo, Nieto, Epidemiology: Beyond the Basics). Here, we have employed a linear mixed-effects model to infer changes in the predictive power of fixed and random variables (e.g. SARS-CoV-2 IgG levels, disease severity, age, sex, days post onset of disease manifestation), to determine which of these variables reliably predict an outcome (e.g. PT aPL levels), and in what combination.

      We recognised that the manuscript would benefit from a more thorough explanation of the model and how it helps to evaluate the validity of the data. We have therefore added lines 317-334 in the manuscript.

      All authors are appreciative of the reviewer’s critique. In the light of the answers we provided, we are convinced about our conclusions, based on the data and our dataset. We hope that, with our responses, we have adequately addressed the concerns raised by the reviewer.

      Reviewer #2 (Significance):

      See above.

      Reviewer #3 (Evidence, reproducibility and clarity):

      It is being recognized that SARS-CoV-2 infection leads to acquired thrombophilia with increased arteriovenous thrombosis and endothelial injury and organ damage. This has multiple mechanisms including, the hypercoagulable state with platelet activation, endothelial dysfunction, increased circulating leukocytes, cytokines and fibrinogen, but also the acquired thrombophilia could be due to acquired APS in these patients. In this study, Emmenegger et al. evaluated aPL antibody responses in SARS-CoV2 infected individuals in connection with antibodies against the SARS-CoV2 components and found that antibody strength response against SARS-CoV-2 proteins is associated with PT IgM aPL antibody

      Reviewer #3 (Significance):

      This is overall an interesting and thought-provoking study, as it may explain the development of thrombophilia after SARS-CoV-2 vaccination. While the study provides a possible association of the development of antibodies against SARS-CoV-2 infection and aPL, it does not go to molecular details about the homology between anti- SARS-CoV-2 antibodies and aPL. Therefore, the study remains an association study.

      First of all, we would like to thank the reviewer for the careful evaluation of our work. We are in full consciousness of the descriptive nature of our work. Thanks to the suggestion of the reviewer (see below), we have aimed to go one step further into a more functional/ mechanistic description.

      It is not surprising that they found a difference in IgM rather than IgG as IgM development is an early response.

      The overall conclusion is supported by the rigorous statistical analyses, yet the study remains a correlative and association study.

      Significance: Thrombophilia associated SARS-CoV2 may be due to immunity against SARS-CoV2 rather than that pure cytokine response.

      Furthermore, they did not characterize the PT IgM aPL to find which part could be immunogenic or epitope similarity with anti- SARS-CoV-2 antibodies. Identification of these epitopes is crucial for further understanding of the antibody development and further intervention.

      Existing literature does not connect with antibody responses against Sars-CoV2.

      Could the authors provide some molecular epitope analysis of IgM aPl and ani Sars_ antibodies? Even computation analysis will improve the paper tremendously.

      We thank the reviewer for coming up with this idea. Clearly, the presence of cross-reactive IgM antibodies to human prothrombin, triggered against the SARS-CoV-2 Spike protein, would be a direct and simple explanation for our observation. We have put efforts into analysing epitopes of SARS-CoV-2 Spike protein and prothrombin (see lines 374-390 in the manuscript and Fig. 4). We conclude there is very limited similarity, and that the mechanism is most likely indirect.

      There is no ethical concern.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Full Revision

      Manuscript number: RC-2021-00785

      Corresponding author: Christian, G. Specht

      1. General Statements

      Dear Editor,

      We greatly appreciate the reviewers’ constructive comments on our manuscript ‘Identification of a stereotypic molecular arrangement of glycine receptors at native spinal cord synapses’. We were particularly pleased that all four reviewers agreed that our data yield new insights into the structure of inhibitory glycinergic synapses, and represent both a technical and conceptual advance the field of synaptic neuroscience.

      The reviewers have consistently raised one main criticism, namely the use of endogenously expressed GlyRs tagged with the fluorescent protein mEos4b, which could potentially have an impact on receptor expression, trafficking and function. We have addressed this point by performing whole-cell recordings of GlyR currents in cultured neurons that show that glycinergic transmission and therefore function is preserved. We have also addressed all other comments of the reviewers in the revised manuscript, including a thorough revision of the text and the addition of new data and figures as detailed in the point-by-point response.

      Point-by-point description of the revisions

      Reviewer 1:

      Summary:

      In this manuscript Maynard et al describe a newly generated knockin mouse to study the endogenous distribution of Gly receptors in the spinal cord. Using quantitative confocal imaging and SMLM the distribution and levels of GlyRs at spinal cord synapses is compared between dorsal and ventral horn. They found that levels of synaptic GlyR are higher in dorsal than ventral spinal cord synapses. Nevertheless, the ratio to gephyrin seems constant, except for synapses in superficial layers of the dorsal horn, where gephyrin levels exceeded the levels of GlyRs. There are also fewer, but larger synapses in the ventral horn than in the dorsal horn. These findings are further corroborated by an SR-CLEM approach. Furthermore, it is shown that in a mouse model for hyperekplexia GlyR levels are lower, but still enriched at synapses, and the dorsal-ventral gradient in GlyR expression was maintained. The difference in size of ventral and dorsal synapses observed in WT animals was also lost in the oscillator mouse, suggesting that particularly the ventral synapses are affected. Despite these differences, the density of GlyRs per synapse remained similar.

      Major comments:

      Line 113: "labeling the_ _b__-subunit has proven difficult". This statement is unclear and it would be informative for readers to grasp what exactly has been difficult, and why the approach described here overcomes that? Related to that, the authors state "KI animals reach adulthood and display no overt phenotype, suggesting that the presence of the N-terminal fluorophore does not affect receptor expression and function". That is indeed reassuring, but it does not exclude that receptor numbers, function and distribution are altered. As it seems there is no prior literature on tagging the beta subunit, additional evidence that the tag does not interfere with receptor trafficking or functioning would be desirable

      We have clarified why it has been difficult to label the GlyR beta subunit until now, lines 113-115 _“To date, labeling of GlyRβ in situ using immunocytochemistry has proven difficult due to a lack of reliable antibodies that recognize the native β-subunit (only antibodies for Western blotting recognizing the denatured protein are available), which has severely limited the study of the receptor.”_ Hence it was important to us to generate this knock-in mouse in order to study the endogenous GlyR at synapses, which is the least well studied receptor mediating fast synaptic transmission.

      The reviewer makes an important point regarding the labeling of the GlyRβ-subunit with a fluorescent protein that has also been raised by the other reviewers. We have now verified receptor function by patch clamp recordings of glycine currents in whole-cell configuration in spinal cord neuron cultures from the mEos4b KI mouse (new Supplementary Fig. S2C). At saturating glycine concentrations of 300 μM we found no difference in chloride influx between mEos4 KI and WT mice. Since glycine concentrations in the synaptic cleft are in the millimolar range during synaptic transmission, these data strongly suggest that glycinergic transmission is not affected by the presence of the mEos4b under physiological conditions, despite a minor shift in the EC50.

      There are several other strong arguments that suggest that mEos4b-GlyRb expression, subcellular localization and function are the same as those of the native subunit. Firstly, the mEos4b sequence was inserted after the signal peptide and before the beginning of the coding sequence of the mature β-subunit (Fig. S1). Since the mEos4b sequence does not interrupt the coding sequence it is less likely to affect the receptor conformation. Secondly, we did not notice any behavioural phenotypes in animals carrying the GlrbEos allele. At the time of weaning, the genotypes of the pups corresponded to the expected Mendelian frequency (new Fig. S2A). Moreover, we did not observe a reduction in live expectancy of GlrbEos/Eos animals (new Fig. S2B), demonstrating that the mEos4b-GlyRb does not cause pathology in older animals.

      Most importantly, our imaging data (Fig. 1-3) provide exhaustive evidence that mEos4b-GlyRb assembles with GlyR alpha subunits as heteropentameric receptor complexes that are trafficked to the plasma membrane and inserted into the synaptic membrane due to their interaction with the gephyrin scaffold at functional synapses. Using quantitative imaging, we have also shown that homozygous GlrbEos/Eos KI mice have exactly twice the number of receptors at synapses as heterozygous animals, strongly suggesting no interference in receptor trafficking to the plasma membrane and gephyrin binding. As the mEos4b mice were also bred with the oscillator mouse model of hyperekplexia, which is lethal when homozygous, we could further test the combined effect of GlrbEos and GlyRa1spt-ot. The presence of both alleles did not lead to any noticeable phenotypes in heterozygous oscillator mice. On the contrary, both synaptic targeting and the packing density of the receptors were not altered in this model, despite a region-specific reduction in synapse size due to the reduced availability of the intact GlyRa1 subunit.

      We believe that these data overwhelmingly support our conclusion that the presence of the mEos4b tag does not alter the structure and function of the receptor, making this mouse model uniquely suited to study the dynamics and regulation of glycinergic synapses in a quantitative manner and at the molecular level.

      In the Discussion the authors conclude that "Our quantitative SR-CLEM data lend support to the first model, whereby inhibitory PSDs in the spinal cord are composed of sub-domains that shape the distribution of the GlyRs". This conclusion seems however based on one example image in Fig 3G that is not very convincing. The EM image seems to show two clearly separated PSDs opposed by two distinct active zones. So, although this conclusion is of high interest, more support should be given to substantiate this conclusion. More general, these subsynaptic domains (SSDs) are hardly further explored, but seem relevant for transmission, particularly given that the synaptic pool of GlyRs at these synapses is not saturated by single release events. How general are these SSDs at these synapses?

      The representative image in Fig. 3G shows two SSDs within the same postsynaptic site with a continuous presynaptic active zone. It should be noted that the PALM/SRRF images were taken of the entire 2 µm thick slice, whereas the electron micrograph shows only a single 70 nm section. We verified throughout the full 3D stack of serial sections that the presynaptic site remains continuous, which it does. We would also like to point out the scale of the image showing that the two SSDs are only around 170 nm apart, i.e. spatially very close. Our conclusions are however not based on this single image but the whole dataset. The graph in Fig. 3I shows 3 synapses (out of N = 36), in which the GlyR density at separate SSDs could be quantified, demonstrating that the receptor density is not different between SSDs. The reviewer is correct that we do not further analyse the SSDs beyond their density and the analysis of the segmentation of the postsynaptic sites (Fig. 3E-G). Further work on the functional role of SSDs in synaptic transmission is outside the scope of this manuscript and would indeed merit future study.

      The approach for counting molecules based on the PALM acquisition has been developed in prior publications and seems robust. It would however be worth to present the reader with a bit more background and explain the assumptions of this approach in more detail. Particularly, since counting of mEos4b can be problematic, as there are multiple dark and fluorescent states of this fluorophore that could be influenced by the illumination scheme, see for instance De Zitter et al., Nat Methods 2019. Since the preceding SRRF acquisition already exposes the fluorophore to high and continuous 561-nm laser power this could skew the counting due to unaccounted conversion and perhaps bleaching of mEos4b. In line with this, although throughout the manuscript the term 'absolute copy numbers' is used the reported numbers are at best an estimate based on a number of assumptions. I think the wording 'absolute numbers' is therefore deceiving and should be nuanced.

      We have clarified how the molecule conversion is calculated (Fig. S7 legend), to provide a more complete description of the way in which the values were obtained. Further we have explained how we calculated the probability of detection. Since the probability of detection accounts for any unconverted or non-functional mEos4b molecules, our molecule counting approach is relatively resistant to potential pre-bleaching of fluorophores. It should be noted, that 561 nm illumination had no obvious effect on the non-converted (green) mEos4b fluorophores, as judged by the fact that the intensity of receptor puncta was unaffected by the SRRF recordings. We appreciate the reviewers point regarding the term ‘absolute copy number’ and we have adjusted our wording throughout the manuscript accordingly.

      Related, most of the quantifications are in estimating the number of receptors, and not so much the distribution with the PSD. The term "molecular arrangement" - also used in the title - might therefore be misleading, there is in fact little characterization of how GlyRs are placed within the PSD. More focused analysis quantifying the distribution of receptors within the PSD and/or SSDs would strengthen the manuscript.

      By estimating the number of receptors and the exact size of synapses, the main conclusion of our study is that receptor density at dorsal and ventral synapses is identical, independent of synapse size, subdomains, or in fact loss of GlyRs in a mouse model of hyperekplexia. This observation clearly relates to how receptors are packed within synapses, and thus describes their molecular arrangement.

      The reported N is confusing and makes it hard to judge the reproducibility of the data. Sometimes it refers to number of images, sometimes number of synapses, but it is unclear from how many experiments these are drawn. This should be reported more completely (number of animals should be reported at least) and consistently. In figure 1, the N numbers (N=3-5 images) are particularly low and question how consistent these findings are across multiple animals.

      We have clarified the N in the figure legends, to reflect the full size of the datasets that have been analysed.

      The levels of mRFP-Gephyrin seem to differ between the different mouse lines, is this a significant difference?

      No significant differences in mRFP-gephyrin levels were found in animals with different mEos4b-GlyRb genotype (Fig. 1B). However, expression of mRFP-gephyrin in heterozygous animals is 50% of that in homozygous mRFP-gephyrin KI animals (not shown).

      The ICQ analysis for co-localization is hardly explained. How do we interpret this parameter? What does an average value of ~0.3 mean? A comparison with sets of proteins that do not overlap as a negative control would strengthen the conclusion.

      We have clarified that an ICQ value of 0.3 is indicative of a very high spatial correlation between pixels, and provided a corresponding reference for ICQ analysis (lines 209-210). We would like to point out that the scale of the ICQ is between -0.5 to 0.5, meaning that a value of 0.3 comes close to complete correlation.

      Minor comments:

      Very little fluorescence was detected in the forebrain, despite the high reported expression of the Glrb transcript". Can the authors expand on this? What would explain this discrepancy?

      We have clarified the text to include “suggesting that protein levels are controlled by post-translational mechanisms in a region-specific manner, as previously proposed (Weltzien et al., 2012)” (Lines 152-153). The reason for this discrepancy is not known. However, the distribution of mEos4b expression throughout the brain is as expected, based on the literature.

      "What region is quantified in Fig 1B? is the same region in all conditions? This should be specified more clearly as the manuscripts presents a clear gradient in expression levels in the spinal cord and thus the location will influence the intensity measurements.

      We have explained in the text that this is the region at the centre of the ventral horn identified by the white square in Fig. 1A, and that the same region was analysed for all images across all animals. Page 5, lines 160-161 “The same region of the ventral horn, indicated by the white square in Fig. 1A was taken for quantification of mEos4b-GlyRβ and mRFP-gephyrin expression in all conditions.”

      The labeling approach does not differentiate between surface and internal receptors, this should be made more explicit in the text.

      Whilst this is correct, we have only analysed mEos4b-positive synapses that had corresponding gephyrin clusters, meaning synapses where receptors are located in the postsynaptic membrane. Indeed we found that all mEos4b clusters imaged colocalised with mRFP-gephyrin clusters. We have adjusted the text accordingly, page 6, line 205-206 “All mEos4b-GlyR clusters closely matched the mRFP-gephyrin clusters, confirming the localization of the receptors in the postsynaptic membrane.”

      Significance:

      The presented data are interesting and the experiments are technically advanced and carefully performed. Particularly the SR-CLEM approach is technically advanced. The datasets present a quantitatively detailed characterization of spinal cord synapses and will be of interest for researchers working in the field of spinal cord circuitry, as well as super-resolution imaging. The conceptual advance for the field is however somewhat limited. It seems that the presented data confirm the general notion that receptor numbers and synapse size are highly correlated. So, although this manuscript describes very interesting observations, in its present form the manuscript does not provide any new mechanistic insight or significant advance in our understanding of how these synapses operate.

      We thank the reviewer for his/her comments relating to the technicality of our manuscript. However we think that the statement “The conceptual advance for the field is however somewhat limited” is unfair, as this level of organisation of inhibitory synapses at the molecular scale has never been achieved before, as pointed out by the other reviewers, and especially not as regards different ages of animals and a disease model that directly affects receptor numbers in a region-specific manner. We therefore believe that our study will have a substantial impact within the fields of synaptic neuroscience as well as quantitative neurobiology.

      Referee cross-commenting:

      I agree with the other reviewers that this study is technically advanced, but I remain critical towards the extent of conceptual advancement this study brings and there are some important concerns with the presented data that need to be addressed. Nevertheless, indeed many of these concerns can be addressed without additional experiments. As pointed out also by other reviewers additional validation that the fusion proteins are not disrupting their function or organization would be important.

      Reviewer 2:

      Summary:

      Maynard et al. investigate (inhibitory) glycinergic synapses in mouse spinal cord, which regulate motor and sensory processes. The authors analyse the molecular architecture and ultra-structure of these synapses in native spinal cord tissue using quantitative super-resolution correlative light and electron microscopy. The major finding is that GlyRs exhibit equal receptor-scaffold occupancy and constant absolute packing densities across the spinal cord and throughout adulthood, although ventral and dorsal inhibitory synapses differ in size. Moreover, what the authors call a „stereotypic arrangement" is even maintained in a hypomorphic mutant (oscillator), which is deficient in the adult GlyR a1 subunit.

      Specific comments:

      To reach their conclusions the authors generate two knock-in mouse lines, one with mEOS-labelled GlyR ß-subunit and one with mRFP-labelled gephyrin, a subsynaptic scaffolding protein of inhibitory synapses, which are subsequently crossed. Both changes are not unproblematic, as mutations in the N-terminal end of the GlyR ß subunit polypeptide chain might interfere with the assembly of functional GlyR (consisting of a und ß subunits) and and mutations at the N-terminal end of gephyrin interfere with it's homo-oligomerization into higher molecular assemblies.

      We have demonstrated that the function of mEos4b-GlyRb does not differ significantly from WT GlyRs, by carrying out electrophysiological experiments (new Fig. S2C). For a detailed response, please see the response to the first comment of reviewer 1. The mRFP-gephyrin KI strain has been validated and published previously (see Machado et al., 2011, J Neurosci; Specht et al. 2013 , Neuron) and was not specifically generated for this study. The experiments with the oscillator mutant did not include the mRFP-gephyrin allele. In these experiments, the wildtype GlrbEos/Eos (Fig. 4, 5) behaves exactly as the GlrbEos/Eos in the double knock-in (Fig. 1, 2), further validating the mouse models used.

      However, in this experimental design both labelled proteins reach postsynaptic membrane specialisations. In case of the ß-subunit quantitative evaluation confirms that heterozygous animals contain only half of the labelled protein as homozygous, which is an indication but not a proof that the correct stoichometry of adult GlyR is maintained. Likewise, mRFP-labelled gephyrin assembles with WT-gephyrin in subsynaptic domains, but it is not clear, if the size and density of the synapses is changed by the knock-in procedure as compared to WT-synapses.

      An effect of the mRFP tag on gephyrin clustering can be ruled out, since we observed no difference in synapse size and receptor density in GlrbEos/Eos animals with (Fig. 1, 2) and without the GphnmRFP allele (Fig. 4, 5, oscillator wild-type controls). Similarly, the synaptic mEos4b-GlyRb levels in heterozygous animals were precisely half those of the homozygous animals, strongly suggesting that the expression and trafficking of the tagged receptor subunit is unchanged, as the reviewer acknowledges. In the absence of any obvious behavioural and/or functional phenotypes (Fig. S2) this KI model is in our view is an exceptional tool to study GlyRs expressed at endogenous levels in a cell-type specific manner.

      Accepting these constraints, which to the knowledge of this reviewer have never been addressed to satisfaction, the authors provide a technically excellent, comprehensive analysis of glycinergic synapses in the spinal cord of double knock-in mice. Therefore, it should be stated in the title, that the investigations were performed with double knock-in instead of „native" spinal cord. Text and figures are clear and accurate and represent the state of the art.

      We thank the reviewer for the positive comments regarding the techniques used in the study, and the clarity of the text and figures. We have adjusted the title as requested.

      Finally, the reviewer would like to raise a minor point: the term postsynaptic density is derived from electron microscopical studies of synapses, where asymmetrical synapses display a „postsynaptic density" but symmetrical synapses do not. The latter were identified as inhibitory synapses and therefore, by definition, inhibitory synapses do not have a postsynaptic density, but rather a postsynaptic membrane specialisation. The use of the term „postsynaptic density" should, therefore, be restricted to excitatory synapses.

      We are conscious of the importance of correct definitions and have revised the terminology, referring to “postsynaptic sites”, “postsynaptic domains”, and “postsynaptic specializations” as appropriate throughout the manuscript.

      Significance:

      The authors provide a state of the art advanced light and electron microscopical analysis of glycinergic synapses in the mouse spinal cord. They suggest a robust "stereotypical" mechanism in place, which guarantees a fixed stoichiometry of relevant components, which is even maintained in a hypomorphic mutant, which is believed to represent a mouse model of human hyperekplexia (startle disease).

      Referee cross-commenting:

      I would like to corroborate the arguments of the previous reviewer: it is not clear to which extent the fusion proteins influence the measurements, which are technically very advanced and well done, however. The authors do definitely not investigate "native spinal cord" as stated in the title.

      The argument concerning fusion proteins must be taken especially serious as the fusions were induced in regions known to be responsible for assembly of glycine receptors and oligomerization of gephyrin.

      We have verified the receptor function with electrophysiological recordings and clarified exactly where the fluorescent protein was inserted (see reviewer 1 response). Given the similarity in synapse size, fluorescence intensities and molecule densities observed in neurons expressing different combinations of tagged and native receptors and scaffold proteins, we strongly believe that all animal models used are well suited to the experimental aims of our study.

      Reviewer 3:

      Summary:

      Glycinergic synapses are the least well understood of synapses that mediate fast synaptic transmission. The manuscript by Maynard et al. adds new information about the structural aspects of these synapses, using PALM and EM imaging of spinal cord synapses from mice at 2 and 10 months. The authors created a knock-in mouse that expresses a tagged GlyRbeta subunit, allowing synaptic localization of glycine receptors; all synaptically localized glycine receptors are thought to require the beta subunit to be tethered by gephyrin. The authors compare synaptic profiles from: 2 month old vs. 10 month old mice; dorsal vs. ventral horn; and GlyR1-reduced vs. wild type mice. Strikingly, they find a tight relationship across all of these variables between glycine receptor puncta and gephyrin puncta, as well as an apparently constant "packing density" of glycine receptors. They conclude that synaptic extent is likely to be the most important determinant of synaptic strength, as the density of receptors within the postsynaptic density is constant. These results use cutting-edge imaging and are analyzed with care, and add new information to our understanding of these relatively less well characterized synapses._

      Major comments:

      The key conclusions are convincing and the claims appear solid. Additional experiments are not needed to support these claims. The data and the methods are largely presented in such a way that they can be reproduced, although there are minor suggestions for improvement below.

      We thank the reviewer for his/her positive comments.

      Minor comments:

      Do the authors have any comment on the requirement during, e.g. LTP, for insertion of a gephyrin-GlyR unit? The lead author has speculated that gephyrin creates "slots" for GlyRs; yet apparently each slot is already filled in the snapshots taken here. How might postsynaptic LTP occur (Kandler group, Kauer group papers)?

      Given the reciprocity of GlyR and gephyrin clustering at synapses, the occupancy of binding sites (and in turn the number of available ‘slots’) is dependent on the strength of receptor-scaffold interactions, as discussed previously (Specht 2020, Neuropharmacol). In this study we demonstrate that the density of GlyRs at synapses is constant, which implies that the receptor occupancy is also the same, with the possible exception of mixed inhibitory synapses in the superficial dorsal horn that contain a majority of GABAARs. The PALM/SRRF data are represented as rendered image reconstructions and not as pointillist representations, and the detection of unoccupied binding sites is below the spatial resolution of our approach. However, the high spatial correlation of the signal intensities (ICQ ≈ 0.3) suggests that receptor occupancy is equal between and within synapses. It has previously been established that there are more scaffold proteins than receptors at synapses (Specht et al. 2013, Neuron; Patrizio et al. 2017, Sci Rep). Based on these studies we report that approximately half the gephyrin binding sites are occupied by receptors (lines 262-655). We have also expanded the discussion, describing how shape and size of synapses may affect synaptic transmission, as well as the possible role of receptor-gephyrin interactions in synaptic plasticity at glycinergic synapses.

      It would be very interesting in the discussion to contrast the present observations with what is known about excitatory synapses (NMDA and AMPAR distributions) and GABAergic synapses. Are the authors at all surprised that receptor packing is constant across conditions? Can the authors speculate on how non-gephyrin binding receptors (homomeric alpha receptors, which are found in recordings) may function and be tethered to the membrane.

      We have included additional information about receptor numbers and distributions at excitatory (lines 428-438) and GABAergic (lines 389-393) synapses in the discussion. So far, homomeric GlyRs composed of alpha subunits have been found to be exclusively extrasynaptic. As stated on page 4, lines 111-112 the beta subunit is required for binding of the GlyR to gephyrin and subsequent anchoring at the synapse. Previous studies have shown exocytosis of receptors to occur at extrasynaptic sites followed by lateral diffusion to synapses. Homomeric GlyRs are therefore most likely targeted to the extrasynaptic plasma membrane where they remain due to the lack of the beta subunit.

      Figure S1. It would be most helpful to quantify this; at the least to include an atlas-like drawing to allow identification of the structures illustrated and containing Glrb; better yet would be quantification of staining in regions where this is strongest.

      We have added an atlas indicating the different brain regions expressing mEos4b-GlyRb protein as a new Supplementary Fig. S3. The regional expression pattern agrees with the available literature about protein expression of the GlyRb subunit in different brain regions and hence provides further evidence that mEos4b-GlyRb is expressed like the native receptor. Due to the relatively low resolution of the tiled image no accurate quantification was possible. We have however added higher magnification confocal images of representative brain regions expressing varying amounts of GlyRb.

      The fact that the lower panel in B is labeled as +/+ across all groups is initially confusing; perhaps relabel as mEos4 -/-, +/- and +/+?

      We assume that the reviewer is referring to Fig1B. The genotype of both the GlrbEos and the GphnmRFP allele is now indicated on the x-axes, and the legend has been modified to clarify that all these animals were homozygous for GphnmRFP/mRFP. We have strived to remain consistent throughout the manuscript when referring to genotypes and protein levels.

      Do gephyrin levels drop in WT mice as well as in the mEosr-GlyRb mouse between 2 and 10 months? Do the authors have any thoughts on this (Supp figure S2)?

      We found no differences in gephyrin levels between 2 and 10 months. Fig. S2 (now Fig. S4C) shows the number of synaptic gephyrin clusters, which was the same at different ages and genotypes.

      Significance:

      Glycinergic synapses are the least well understood of synapses that mediate fast synaptic transmission. The manuscript by Maynard et al. adds new information about the structural aspects of these synapses, using PALM and EM imaging of spinal cord synapses from mice at 2 and 10 months. The authors created a knock-in mouse that expresses a tagged GlyRbeta subunit, allowing synaptic localization of glycine receptors.

      This will be of interest to those studying inhibitory synapses, and more broadly to synaptic morphologists, physiologists and imagers for comparison with other synapse types.

      My own expertise is NOT in these techniques, but I am a synaptic physiologist with a standing interest in glycinergic synapses; thus I am not providing serious technical critiques.

      Referee cross-commenting:

      Hi all, I agree with the other two reviewers, and do not have anything else to add.

      Reviewer 4:

      Summary:

      The authors used a correlative approach and combined photo-activated localization microscopy with electron microscopy to characterise Glycinergic synapses in spinal cord tissue. Some of the major findings are:

      • The receptor-scaffold occupancy and packing densities of glycinergic synapses in different regions of the spinal cord are the same.
      • Gephyrin clusters in the spinal cord are composed of sub-domains that shape the GlyR clusters.
      • Ventral horn synapses are generally larger, more complex (containing a number of gaps) and contain more GlyRs. -In a mouse model of Hyperekplexia, the number of GlyRs is reduced resulting in smaller synapses in the ventral spinal cord.

      Major comments:

      Are the key conclusions convincing? Yes

      Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? No

      Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation. No

      Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments. N/A

      Are the data and the methods presented in such a way that they can be reproduced? Yes

      Are the experiments adequately replicated and statistical analysis adequate? Yes

      Minor comments:

      Specific experimental issues that are easily addressable. Please see below

      Are prior studies referenced appropriately? Yes

      Are the text and figures clear and accurate? Yes

      Do you have suggestions that would help the authors improve the presentation of their data and conclusions? Please see below.

      As the authors pointed out, fusing mEos to the extrasynaptic terminal of GlyRb has been difficult and therefore this construct would benefit the larger scientific community. Fig 1C is a nice imaging control for expression efficiency, however, it is in stark contrast with the lack of functional control. Do authors have any electrophysiological evidence showing that the insertion of mEos4b doesn't modulate channel function? I would assume that the construct would be tested in cell lines before the KI mouse line was created. Was any functional analysis done? If yes, it would be very useful to show it. I do appreciate that the authors used a standard insertion between the 4th and 5th AA in the extracellular domain, which in most cases does not abolish channel function. Given the lack of an obvious phenotype in the KI mouse model, I believe that this is also the case here. However, I disagree with the statement in lines 120-121: "the presence of the N-terminal fluorophore does not affect receptor expression and function." I believe that if there are no electrophysiological measurements of GlyR function, this statement remains speculative. As the authors pointed out in their previous publication: "receptor function and gephyrin binding are not independent properties. Instead, we think that conformational changes triggered at extracellular or intracellular protein domains have downstream consequences on channel opening as well as receptor clustering." In line with this, my concern is that the modulation of channel function by mEos4b could result in an altered cluster size at synapses. There is a large body of literature showing that just one missense mutation in the extracellular domain of ion channel subunits can lead to synaptopathies because the channel function gets modulated, and there is an abundance of similar examples involving mutations of GlyR and GABAAR subunits. In my view, comparing the function of GlyRs incorporating wt-GlyRb and mEos4b-GlyRb subunits is important for the correct interpretation of the main findings of this work and would strengthen the publications.

      As the reviewer points out, the insertion of the mEos4b sequence was considered carefully in order to have the least impact on receptor function. GlyR channelopathies are often caused by point mutations within the coding sequence, which is not the case in the GlrbEos allele. Instead, the mEos4b sequence was inserted after the single peptide of GlyRb, duplicating several amino acid residues in order to maintain the correct cleavage site and N-terminus of the mature receptor, and to not interrupt the GlyRb coding sequence (Fig. S1B). In order to verify that the mEos4b-tag does not affect GlyR function, we have now carried out electrophysiological experiments (new Fig. 2C). For a detailed description please see the response to the first comment of reviewer 1.

      Line 189: Are the authors making conclusions based on intensity comparison of red mEos4b and mRFP? The title of this section implies that the red form of mEos was compared to mRFP(?) But mEos converts from green to red only partially. Was the probability for conversion taken into account at this point? Please clarify which version of mEos was compared to mRFP._

      In line 189 (now 218) we compared the intensities of mRFP-gephyrin with those of converted (red) mEos4b in SRRF / PALM super-resolution images of the synapses (Fig. 2D). Since the absolute intensities are altered by the process of image reconstruction, the probability that mEos4b is photoconverted does not have to be taken into account. The constant ratio of the SRRF and PALM image intensities confirms the data in Fig. 1D showing that GlyR and gephyrin amounts are highly correlated throughout the spinal cord (with the exception of the superficial layers of the dorsal horn). We have clarified in the text that this analysis was carried out on reconstructed SRRF images of mRFP-gephyrin and PALM images of mEos4, line 202.

      Line 192: Please clarify how the density threshold was calculated/determined? This is important for the replication of the experiments, and it also has implications for the calculated probability of detection of mEos4b. I am not aware that this probability was calculated before for mEos4b and therefore other researchers may decide to rely on the value calculated here.

      We have now clarified in more detail how the probability of detection was calculated (new Supplementary Fig. S7 legend).

      In Fig. 2 Gephyrin clusters look consistently smaller than GlyR clusters, which is inconsistent with the published work. I assume that the difference in size is a consequence of different image reconstruction methods(?) However, I would assume that SRRF would have lower resolution than your PALM measurements and that would result in wider Gephyrin clusters. Could you please explain this discrepancy? Also, could you provide an estimate for the image resolution in SRRF and PALM techniques? For SMLM, localization precision would suffice.

      We have provided an estimate of the resolution of the two techniques using Fourier ring correlation, which gave 46 nm for SRRF and 21 nm for PALM. Additionally we have precised the discrepancy between reconstruction methods, page 6, lines 194-200 “The spatial resolution was estimated using Fourier ring correlation (FRC), which measures the similarity of two images as a function of spatial frequency by comparing the odd and even frames of the raw image sequence. According to this analysis, the spatial resolution of SRRF was 46 nm and that of PALM 21 nm. It should be noted that the synaptic puncta in the SRRF images appear somewhat smaller and brighter due to differences in the reconstruction methods that result in differences in the dynamic intensity range.”

      Why is the data in Fig. 5D and E represented as Detections/Synapse instead of GlyRs/Synapse? Could you please re-plot this so that a comparison with Fig. 2H and I is straightforward?

      We have converted the detections to receptor copy numbers as requested (Fig. 5D,E).

      Figure S5C: for P=0.5, 2=0.25. Please correct. Also, I assume that the second graph is what would be observed experimentally for dimers and P=0.5. Please clarify in the figure caption.

      This was a mistake and has been corrected. We have also clarified which parts of the calculations are theoretical and which values were derived from our experimental data. We have provided a more detailed description in the figure legend of Supplementary Fig. S7.

      Line 606: Please provide a complete derivation of this formula.

      We have provided a full derivation of this formula (new Fig. S7C).

      Significance:

      The work described here seem to be a natural progression of a publication by Patrizio et al., 2017 that came out from the same laboratory. This study uses advanced methodologies in the imaging space to visualise and characterise Glycinergic synapses in spinal cord tissue. The experiments described here are technically demanding as evidenced by the relatively small number of publications describing super-resolution measurements in tissue samples. Even more rare are studies that attempt to do single protein counting in neuronal culture and tissue sections. Therefore, I believe that this work brings significant technical advancement in the field of super-resolution and corelative microscopy. The findings are also highly significant for all fields of neuroscience in which the structure of inhibitory Glycinergic synapse is relevant, ranging from the fundamental understanding of inhibitory synapse function to pathologies involving Glycinergic signalling._

      I have substantial experience in different microscopy methods, including quantitative super-resolution microscopy based on single molecule counting. My background also covers the structure and function of GABAA and Glycine receptors using electrophysiology. I am familiar with the methods used in electron microscopy and the process of creating KI mouse lines, however I don't have hands-on experience in these fields._

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      Summary:

      The authors used a correlative approach and combined photo-activated localization microscopy with electron microscopy to characterise Glycinergic synapses in spinal cord tissue. Some of the major findings are:

      • The receptor-scaffold occupancy and packing densities of glycinergic synapses in different regions of the spinal cord are the same.
      • Gephyrin clusters in the spinal cord are composed of sub-domains that shape the GlyR clusters.
      • Ventral horn synapses are generally larger, more complex (containing a number of gaps) and contain more GlyRs.<br> -In a mouse model of Hyperekplexia, the number of GlyRs is reduced resulting in smaller synapses in the ventral spinal cord.

      Major comments:

      • Are the key conclusions convincing? Yes
      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? No
      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation. No
      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments. N/A
      • Are the data and the methods presented in such a way that they can be reproduced? Yes
      • Are the experiments adequately replicated and statistical analysis adequate? Yes

      Minor comments:

      • Specific experimental issues that are easily addressable. Please see below
      • Are prior studies referenced appropriately? Yes
      • Are the text and figures clear and accurate? Yes
      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions? Please see below.
      1. As the authors pointed out, fusing mEos to the extrasynaptic terminal of GlyRb has been difficult and therefore this construct would benefit the larger scientific community.<br> Fig 1C is a nice imaging control for expression efficiency, however, it is in stark contrast with the lack of functional control. Do authors have any electrophysiological evidence showing that the insertion of mEos4b doesn't modulate channel function? I would assume that the construct would be tested in cell lines before the KI mouse line was created. Was any functional analysis done? If yes, it would be very useful to show it. I do appreciate that the authors used a standard insertion between the 4th and 5th AA in the extracellular domain, which in most cases does not abolish channel function. Given the lack of an obvious phenotype in the KI mouse model, I believe that this is also the case here. However, I disagree with the statement in lines 120-121: "the presence of the N-terminal fluorophore does not affect receptor expression and function." I believe that if there are no electrophysiological measurements of GlyR function, this statement remains speculative. As the authors pointed out in their previous publication: "receptor function and gephyrin binding are not independent properties. Instead, we think that conformational changes triggered at extracellular or intracellular protein domains have downstream consequences on channel opening as well as receptor clustering." In line with this, my concern is that the modulation of channel function by mEos4b could result in an altered cluster size at synapses. There is a large body of literature showing that just one missense mutation in the extracellular domain of ion channel subunits can lead to synaptopathies because the channel function gets modulated, and there is an abundance of similar examples involving mutations of GlyR and GABAAR subunits. In my view, comparing the function of GlyRs incorporating wt-GlyRb and mEos4b-GlyRb subunits is important for the correct interpretation of the main findings of this work and would strengthen the publications.
      2. Line 189: Are the authors making conclusions based on intensity comparison of red mEos4b and mRFP?<br> The title of this section implies that the red form of mEos was compared to mRFP(?) But mEos converts from green to red only partially. Was the probability for conversion taken into account at this point? Please clarify which version of mEos was compared to mRFP.
      3. Line 192: Please clarify how the density threshold was calculated/determined? This is important for the replication of the experiments, and it also has implications for the calculated probability of detection of mEos4b. I am not aware that this probability was calculated before for mEos4b and therefore other researchers may decide to rely on the value calculated here.
      4. In Fig. 2 Gephyrin clusters look consistently smaller than GlyR clusters, which is inconsistent with the published work. I assume that the difference in size is a consequence of different image reconstruction methods(?) However, I would assume that SRRF would have lower resolution than your PALM measurements and that would result in wider Gephyrin clusters. Could you please explain this discrepancy? Also, could you provide an estimate for the image resolution in SRRF and PALM techniques? For SMLM, localization precision would suffice.
      5. Why is the data in Fig. 5D and E represented as Detections/Synapse instead of GlyRs/Synapse? Could you please re-plot this so that a comparison with Fig. 2H and I is straightforward?
      6. Figure S5C: for P=0.5, 2=0.25. Please correct. Also, I assume that the second graph is what would be observed experimentally for dimers and P=0.5. Please clarify in the figure caption.
      7. Line 606: Please provide a complete derivation of this formula.

      Significance

      The work described here seem to be a natural progression of a publication by Patrizio et al., 2017 that came out from the same laboratory. This study uses advanced methodologies in the imaging space to visualise and characterise Glycinergic synapses in spinal cord tissue. The experiments described here are technically demanding as evidenced by the relatively small number of publications describing super-resolution measurements in tissue samples. Even more rare are studies that attempt to do single protein counting in neuronal culture and tissue sections. Therefore, I believe that this work brings significant technical advancement in the field of super-resolution and corelative microscopy. The findings are also highly significant for all fields of neuroscience in which the structure of inhibitory Glycinergic synapse is relevant, ranging from the fundamental understanding of inhibitory synapse function to pathologies involving Glycinergic signalling.

      I have substantial experience in different microscopy methods, including quantitative super-resolution microscopy based on single molecule counting. My background also covers the structure and function of GABAA and Glycine receptors using electrophysiology. I am familiar with the methods used in electron microscopy and the process of creating KI mouse lines, however I don't have hands-on experience in these fields.

    1. And as the delta variant continues to spread around the country, that uncertainty and its effects on sleep may not have abated. Some people have just gotten used to disrupted cycles and 3 am anxiety spirals; it’s how life is now.

      I can relate to this paragraph because I got so used to sleeping late and waking up whenever that now I don't even get tired and I go to sleep really late. Also our health was really getting messed up. Physically and mentally

      In my opinion I think that it has really affected all of us and our mental health itself has changed through out this past year or so. I think from the beginning of the pandemic we were all kind of panicking, there was so much chaos and so many deaths that it stressed all of us out.

    1. 20:57 - 29:20

      "I'm Jesse. And this is Pascal. And we're here representing Friends of Light tonight. As you already heard, Friends of Light is a worker owned fashion company. We operate within the fashion industry, but we also operate very far outside of the fashion industry - we come from a background of working in fashion and at some point decided that it needed a radical change, and so we started weaving jackets based on our values as opposed to based on economic decisions - and Pascal is going to talk more about our values, but I'm going to give you a background on how we decided to form a worker cooperative. Uhm Pastel actually has been looking into worker cooperatives for much longer than I have, and I joined her in about 2012 when we were participating in a sewing circle called Work circles and, there was on any given week, there would be anywhere between 8 to 30 participants and we sat around a table. We made decisions collectively, we made every single decision together about making one quilt. And that was where the stitches were gonna be. What color it's going to be, what fabric we were using, what were the shapes, and every single person had a part in that and then we start stitching and we'd talk about what is a worker cooperative. What are the different values in a worker cooperative? How do you make a worker cooperative work? And so that was very much a learning experience for all of us in about 2015 - we, Pascal, myself and two other members, Nadia and May. We decided to take all of this that we've learned and actually put it into the real world and still sort of in a project based way make garments together, make woven jackets as a worker cooperative. We then took that and had a sales event just to present our project to the world to sort of inspire people, to show what we've been working on. We were wildly surprised that we got ten orders in one night, so we needed to incorporate, make this a real business, and we've been a real business since 2016, so we're still fairly new. But it's been three years of working as a worker cooperative, basing all of our decisions on the things that we've learned about worker cooperatives and doing our best to work in that fashion, we also, I will mention that we, one year ago, in January of 2018, we participated in the Green Worker Cooperative, which is based in the Bronx, which, if any, of you are interested in building your own cooperative - I highly recommend it. It was every Monday night, 3 hours a night for six months and we - it was like going back School, but to make a business and to learn how to make a business and specifically how to learn how to make a green Worker-Cooperative business, which is amazing. So that's - that's that background and Pascal is going to share with you a little more about our company and our values." "Thank you Jesse. In fashion, it's quite unusual that clothes are being produced in the West nowadays. And we have quite an extreme product, and it started actually with the desire to work with farms upstate who are producing fiber. But they're not really connected to any design practices in New York City. It's a very separate community. So we did research and found and looked for different farmers and we decided to work with Sarah of Buckwheat Bridge Angoras. She has an amazing practice as a farmer and her goats and sheep she keeps very well and she has a very high quality wool. And we are weaving jackets and this is together with her. She also had a mill on her property. We developed yarns. And just the first series was a series of five yarns that we developed with her, and all the natural colors of the - and we started experimenting. In the beginning it was kind of after the work circles we really wanted to develop an economic activity to practice being a worker-cooperative for real and not just kind of doing it as a project. So these were the first experiments into if we could weave a garment to form. And there's different techniques involved. In the meantime, we are working with other farmers as well. We're working with linen farmers, and we've done a lot of research in different materials, so we work very closely with the source of our materials, and that's the only thing we want to do. We work with hand spinners at the moment, it's no longer being produced on machines, so it's an intense product. It's takes us about 160 hours to make one jacket. And they've been selling really well, which was a surprise to us because the price point is very high, we're now at $6000 a jacket. And that's been really interesting, that people do value the story that's connected. And also all our clients. Actually everyone in our value chain is - we're really good friends with them, and we grow to become friends because we also make everything to size for each client. And one could think that we're kind of exclusive because the jackets are so expensive. But we also do a lot of - we do workshops and we do talks like this. We educate people about worker cooperatives and what it means to - And also what it means to value artisan work in a Western context and to be able to do that kind of work. And that's what we've been advocating, like a lot, about because a lot of people here push prices down.To make artisan work and even local local fiber products possible. But we know it's not possible with the... In September we actually did a big project to kind of save one of the farm - one of the forms we work with because they couldn't sustain themselves. Not - especially not through our jackets. But we decided that we would do a project around making blankets and we invited other design studios in New York to participate in that project. So instead of kind of letting the farm disappear, we decided to develop a product to support them so it's also supporting - our objective is actually to create a flourishing local fiber and textile kind of structure. And that has been really interesting because the costs of the wool that went into the blankets was $1000, just raw material and that it doesn't give her any profit, actually. And that's, I think one of the biggest things that we are kind of trying to advocate and support, is how do we make local production possible again against a fair wage. And this is an extreme product, I know, but it has been very surprising to us that people do gravitate towards the story. They wanna participate in it by buying the jackets. We make our own materials, own looms, and these are a few of the jacket. And we do private sales events. We don't sign shops or anything like that. We make everything to the size of the customer, so there's a lot of personal attention that goes in every part of our value chain. Thank you."

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      We thank the reviewers for their critical review of our manuscript. We are excited to see that the reviewers agree that we have presented high-quality data that advances the centrosome field and is worthy of publication following revision. The authors also agree with the reviewers that the data presentation requires improvement, that some experiments require additional replicates with robust statistical analyses and that a model or summary would help clarify the differences between previously published results and ours. We will address all these concerns in the revised version of our manuscript. The reviewer comments in their entirety can be found below in italic followed by our response in bold.

      Considering that the manuscript was very well received we believe it makes a strong candidate for publication in eLife. In terms of editors at eLife, we believe that Anna Akhmanova and Jeremy Reiter would be very well suited to handle this manuscript.

      We hope that you will concur with us that the revision plan detailed below adequately addresses the reviewers’ comments.

      2. Description of the planned revisions

      Reviewer 1, Major points

        • Previous data suggested that an important role of TRIM37 was to limit accumulation of CEP192 levels, yet here CEP192 levels appeared unchanged in TRIM37 knockout cells that stably express wild-type or RING domain mutant TRIM37. However, in agreement with previous work, transient expression of TRIM37 reduced CEP192 levels along with those of other PCM and centriole components in an E3-dependent manner. These data are rather confusing in light of the literature, and the current report does not really deal with these discrepancies but to me they suggest that high levels of TRIM37 can target multiple centrosome components for degradation, but this may be an experimental artefact.* We agree that acutely overexpressed TRIM37 results in decreased CEP192 levels and is consistent with published results. We also provide evidence that CEP192 levels are not correspondingly increased in the absence of TRIM37, nor are they decreased in a cell line that stably overexpresses FLAG-BirA TRIM37. This suggests that the decreased CEP192 (and PCNT and CEP120) after acute overexpression of TRIM37 might be short-lived or a consequence of overexpression. We will discuss this possibility more clearly in the revised mansucript. In addition, we will perform Western blots for TRIM37 in wild type cells, cells stably expressing FLAG-BirA TRIM37 and cells induced to express TRIM37-3xFLAG to more directly compare the amount of TRIM37 present in these cell lines.
      • The choice of cells for particular experiments is not always stated or explained. For instance, in Figure 3A: Trim37 KO pool used while in Figure 3B TRIM37 single KO. These are then combined with both transient and stable expression of TRIM37 mutants.*

      We apologize for this and will clarify the choice of cell lines in the results section. Importantly, because some of our results challenge previously published reports, we performed critical experiments using multiple cell lines. For example, we show that centrinone B-induced growth arrest is independent of TRIM37 E3 ligase activity using a single RPE-1 TRIM37-/- clone, an RPE-1 TRIM37-/- pool and an A375 TRIM37-/- pool. We feel this is a highlight of our work and this new data will be included in the revised version of the manuscript and will be emphasized.

      • Two different concentrations (200 nM and 500 nM) of centrinone were used to compare responses of too many or no centrosomes in RPE1 and A375 . While these concentrations result in centrosome amplification (200 nM) and loss (500 nM) in RPE1 cells, the phenotypes seem much less clear-cut in A375 cells. At 200nM 70% of cells have 0 or 1 centrioles (~35% each category) and only about 15% have centrosome amplification, whereas centrosome amplification occurs in 30% of RPE1 with 0-1 centrioles seen in fewer than 10% (Figure 4 - figure supplement 1H). Hence the different outcomes of centrinone treatment makes conclusions about cell-type specific responses difficult. This difference may be due to differences in drug uptake/efflux, PLK4 activity or in expression of other components of these pathways. In fact, 167nM centrinone B in A375 cells would have been a much closer match to the 200nM treatment of RPE-1. These points should be discussed as they impact the conclusions.*

      The reviewer rightly points out that the response to centrinone appears to differ between cell types, as shown previously (Meitinger et al., 2020 and Yeow et al., 2020), and that this difference may impact our conclusions. Although we don’t think that the major conclusions drawn will change, we will discuss these caveats within the results and discussion of the manuscript.

      • I find the different outcomes of stable versus acute expression of TRIM37 ligase mutant confusing. Here, stable expression of TRIM37 ligase mutant increases mitotic length compared to that of TRIM37 wild-type, which contradicts a recent report by (Meitinger et al. 2021). What could be the potential reason for these differences? *

      It is unclear why we obtain results that differ from Meitinger et al. We are using similar cell lines (RPE-1 hTert vs. RPE-1 hTert Cas9) with similar TRIM37 constructs (TRIM37-3xFLAG) that are induced in similar ways (both are doxycycline inducible but using different systems). For our experiment, we used a single TRIM37 KO clone. As an independent validation, we will repeat this experiment using our TRIM37 KO pools in both RPE-1 and A375 cells and discuss these results and implications.

      What could be the mechanism for TRIM37 action in regulating spindle assembly/mitotic duration and cell proliferation upon centrosome loss? How do those acentrosomal MTOCs form that decrease mitotic duration and promote proliferation?

      These are insightful questions that we feel lie at the heart of TRIM37 function. Current models posit that in the absence of TRIM37, PLK4 condensates form and are required to nucleate ectoptic accumulations of PCM components (ex. CEP192) that facilitate mitosis (Meitinger et al. 2020). A number of our findings are not consistent with this model. First, PLK4 is detected in the Cenpas/condensates only using a single antibody (Wong et al., 2015) (two other antibodies have been reported to be used (Sillibourne et al., 2010, Moyer et al., 2015) and we have used another (Millipore MABC544 clone 6H5) - none of these three detect PLK4 at the condensates). Additionally, the PLK4 signal observed is not sensitive to PLK4 siRNA (Balestraet al. 2021, Figure 4 – figure supplement 1I). In our manuscript we also provide evidence that overexpressed PLK4-3xFLAG cannot be detected (using PLK4 or FLAG antibodies) at these strucures. Moreover, our experiments using TRIM37 mutants show that Cenpas formation and ectopic PCM assembly are mechanistically distinct; Cenpas are not resolved after expression of TRIM37 C18R, yet ectopic PCM structures are suppressed (Figure 5E and G). Our data do, however, suggest that the ability to form ectopic PCM structures is inversely correlated to growth arrest activity (i.e. cells that form ectopic PCM fail to arrest). How these structures form and how they affect growth arrest are still critical, open questions. We will discuss these possibilities further in the revised manuscript.

      Do the authors find a difference in the % of cells expressing TRIM37 mutants upon stable or acute expression? This part needs a better summary, and again a table would help. I also wonder about protein expression levels; wild-type FB-TRIM37 seems to be expressed at much lower levels than the mutants in Figure 5B.

      The differences in overall abundance are not due to heterogenous expression within the population. The TRIM37 mutants are expressed in all cells after stable and acute expression. We will provide quantification of immunofluorescence images and statistics to show this. TRIM37 mediates its own degradation in an E3-dependent manner (Meitinger et al. 2021, Figure 3f). Our results are consistent with this as the TRIM37 C18R and TRIM37 __DRING mutants have a higher overall abundance compared to TRIM37 or TRIM37 D__505-709. These experiments are ongoing and we will discuss this further in the revised manuscript and provide a summary table.

      • Other means of centrosome depletion (Cenpj, SAS6 etc) would have been useful to include in the manuscript in support of E3 ligase dependent and independent roles of TRIM37. It is not essential to perform these experiment but if data are available, including these would improve the paper. *

      We will generate new data using a double TRIM37 KO, SASS6 KO line to address TRIM37 ligase-dependent and -independent functions.

      • The authors show that TRIM37 regulates PLK4 phosphorylation and that this modification could only be observed in HEK293T and not in RPE1. Why would there be a difference between HEK293 and RPE1?*

      We will address this by surveying a panel of cell lines to determine if there any cell type dependent differences in TRIM37 modification. Any potential differences will be addressed in the discussion.

      • Statistical analysis for graphs should be included. Figure 5 is ok but graphs in Figures 3, 4, 6, 7 would benefit.*

      This point is well taken. In the revised manuscript, we will ensure that all experiments are performed in biological triplicate and that proper statistical analyses are included to support our conclusions.

      • The authors characterise TRIM37 localisation. They detect it at centrosomes (as shown by Yeow et al 2021) and more specifically at the PCM, but apparently the signal is not present in all cells. They should also provide a quantification of the % of cells with centrosomal TRIM37 signal and compare this to cells expressing Flag-tagged Trim37. The specificity of the antibody signal using TRIM37-/- should be confirmed. *

      We will perform immunofluorescence experiments using wild type and TRIM37-/- cells to demonstrate the specificity of the antibody signal. We will also provide a more detailed analysis regarding TRIM37 localization noting 1) the number of cells with centrosomal TRIM37 2) cell cycle correlation with centrosomal TRIM37 and 3) a comparison with FLAG-BirA tagged TRIM37.

      Reviewer 1, Minor points

      1.Page 3: "A recent screen for mediators of supernumerary centrosome-induced arrest identified PIDDosome/p53 and placed the distal appendage protein ANKRD26 within this pathway [31]". It appears that the reference for Burigotto et al. is missing.

      This reference will be inserted.

      2.Page 6: The authors state that: TP53BP1, USP28 and CDKN1A are also suppressors in the Nutlin-3a screen and suggest that they act in a general p53 pathway. However Meitinger et al (2016) showed that depletion of TP53BP1 or USP28 did not affect the upregulation of p53 and p21 upon Mdm2 inhibition.

      Our data is consistent with previous reports that TP53BP1 and USP28 are required for cell arrest after Nutlin-3a treatment (Cuella-Martin R et al. 2016). We will discuss possible explanations for the results observed by Meitinger et al.

      3.Page 9: "First, we performed live cell imaging to measure mitotic length in cells grown in centrinone". For consistency the authors should say centrinone B here as wellI

      We will change the text to indicate using centrinone B.

      4.Page 9: "Cells lacking TRIM37 suppressed the growth arrest from 150 to 500 nM centrinone B in RPE-1 and 167 to 500 nM in A375 cells". The growth data for the A375 cells seem to be missing from the figures.

      We refer to Figure 4D and Figure 4 – figure supplement 1G that contain the RPE-1 and A375 growth data, respectively. We will modify the text to more clearly refer to the data.

      5.Page 10: "Our results confirmed that PLK4 and TRIM37 form a complex in RPE-1 cells (Figure 3G)" It appears the authors referred to the wrong figure, it should be Figure 4B.

      Our apologies. The correct figure reference will be used.

      6.Figure 1C: The nuclear p53 signal is not apparent with 500 nM centrinone B in the exemplary cells. Did the authors use thresholding to quantify p53/p21 positive cells?

      The p53 staining in centrinone-treated cells is somewhat variable. To quantify the data, we used automated image analysis and set a cut off based on p53 intensity in DMSO-treated cells to indicate p53-positive cells. To improve the figure we will repeat the experiment and use a lower magnification image to show a more representative field of cells stained for p53. The quantification pipeline will be better explained in the methods section.

      7.Figure 4D and Figure 4 - Figure supplement 1G: The graph is misleading and should not be presented as a continuous line.

      We are sorry that the reviewer finds the graph misleading. We will change the way this data is presented to make it easier to understand and to facilitate indicating statistical differences. Instead of a scatter plot of all the data, we will present the data as individual boxplots at each centrinone B concentration with statistical differences indicated. We hope this will address any confusion regarding these data.

      8.Figure 5A and C: A direct and statistical comparison mitotic timing upon expression different Trim37 mutants to wildtype and trim37-/- cells is missing

      In Figure 5A we compare RPE-1 WT to TRIM37-/- at each centrinone B concentration and within each line we compare each centrinone B concentration to DMSO. Perhaps we do not understand the reviewer’s concern here, but we do not think any comparisons are missing from this panel. In Figure 5C, we compare the mitotic lengths between cell lines expressing TRIM37 WT or TRIM37 C18R since we focus on the requirement for the E3 ligase activity of TRIM37. For this experiment we did not include a wild-type control, but we will perform statistical analyses between control cells expressing FLAG-BirA and those expressing FB-TRIM37 WT or FB-TRIM37 C18R. We hope this addresses this concern.

      9.Figure 6B: A loading control/Ponceau staining is missing as well as the quantification of protein levels

      This experiment will be repeated for proper quantification and we will include a loading control for our representative results.

      10.Figure 6D: It is unclear if the centrosomal signal intensity was quantified in interphase or mitotic cells

      The centrosomal signal was quantified in mitotic cells only. This results and figure legend will be updated to more clearly indicate this.

      11.Figure 7C: A loading control/Ponceau staining is missing

      The experiment will be repeated and a sample will be taken prior to immunoprecipitation to indicate the input amounts for each sample.

      12.Figure 2 - figure supplement 2F and G: It would help if the authors could highlight the cell line, e.g. RPE-1 (F) or A375 (G) in the venn diagrams.

      In Figure 2 – figure supplement 2G we highlight the genes found in RPE-1 and A375 screens only in the overlap of the Venn diagram using font colour. We will colour code the hits from each cell line in panels (F) and (G). We thank the reviewer for this suggestion.

      13.Figure 4 - figure supplement 1E: it appears that the BirA antibody gives only an unspecific signal. It would be useful to show if the different TRIM37 variants are able to localise to the centrosomes. Furthermore it appears that centrosomes are missing in the C18R and 505-709 variants. It would be useful if the authors quantify centrosome numbers upon expression of different Trim37 variants as shown in Figure 4 - figure supplement 1. To make the identification of the cell easier it would help to include a DNA signal or indicate the outline of the cell.

      The anti-BirA antibody does give a slightly diffuse signal, although we disagree that it is unspecific considering that the BirA signal is only observed in cells expressing FLAG-BirA alone or BirA fusion proteins.

      We agree with this reviewer that we did not make any statements about the centrosomal localization of the TRIM37 mutants. We will re-analyze our images to quantify relative centrosomal localization of these proteins. The images as displayed in this Figure panel appear to be somewhat confusing to the reviewer. In terms of scale, only a small portion of the cell surrounding the centrosome is shown, therefore a nuclear or cell outline cannot be displayed on these images. In each image a centrosome is present, even in the C18R and 505-709 samples. We will show images of entire cells with insets to highlight the region surrounding the centrosome.

      14.The generation of stable and dox-inducible cell lines is missing in the material and methods

      We apologize for this omission. This information will be added.

      Reviewer 2, Major points

        • The centrosomal localization of endogenous TRIM37 should be validated by comparing control and knockout/knockdown cells.* We will perform these experiments as outlined in response to Reviewer 1, Major point 8.
      • Some of the quantifications are derived from only two experiments and in many cases no statistical testing was done. The authors should test the observed effects and add extra replicates to make the data more robust, where required. *

      We will ensure experiments are performed in biological triplicate and that appropriate statistical analyses are performed (see comment to Reviewer 1, Major point 7)

      • Fig. 5 supplements: panels showing effects on marker proteins in cells by IF lack quantification of the claimed effects. Without providing some type of quantifications for key findings, it is unclear how strong or penetrant the effects are.*

      Quantification and statistical testing will be performed for these experiments.

      Reviewer 2, Minor points

      I would suggest a final, summarizing schematic that illustrates the main findings in a cartoon/flow chart manner.

      We will improve the discussion of our main findings as well as provide a model/table of comparisons to improve the clarity of our manuscript.

        • Please revise incorrect abstract sentence: "We identify TRIM37 as a key mediator of growth arrest when PLK4 activity is partially or fully inhibited but is not required for growth arrest triggered by supernumerary centrosomes." __In our screens, we find that TRIM37 is required for growth arrest after treating cells with 200 and 500 nM centrinone B. Treatment of cells with 200 nM centrinone B causes centriole overduplication and our initial hypothesis was that centriole overduplication alone is inducing growth arrest. To test this in a parallel manner, we also overexpressed PLK4 to induce centriole overduplication. Surprisingly, but consistent with recently published results (Evans et al*., 2020), TRIM37 was not required for growth arrest after PLK4 overexpression. Thus, TRIM37 is required for growth arrest after 200 nM centrinone treatment, but not PLK4 overexpression, yet both of these conditions induce centriole overduplication. This concept will be highlighted, discussed and clarified in the text. We will change the abstract sentence to ‘We identify TRIM37 as a key mediator of growth arrest when PLK4 activity is partially or fully inhibited, but it is not required for growth arrest after PLK4 overexpression’__.

      Please also see similar comment to Reviewer 3, Major point 1.

      • In various figures and supplements showing centrosome and condensates/Cenpas, these are very difficult to distinguish due to their small size. I suggest to magnify regions of interest and/or add arrowheads in different colors marking the specific structures.*

      This comment is similar to Reviewer 1, Minor point 13. We will use coloured arrowheads to indicate different structures. Where possible, we will use magnified regions to improve clarity.

      • Fig. 2A: What is the purpose of the schematics on the right of panel A? The labels in the graph are unreadable and the network diagram without any labels is also not very useful. This could be removed. *

      The schematics on the right indicate a ‘generic analysis’ using the NGS sequencing data. We agree it is not essential and it will be removed.

      • Fig. 2B: The network presentation is not very easy to read. What are the functional groups/pathways here? The clusters should be labeled accordingly. What is the meaning of the different sizes of the circles? Maybe key interactions (e.g. TRIM37) could be indicated in a different color shade to highlight these? *

      In our figure we tried to highlight 1) the connectivity among screening conditions and 2) complexes that were identified by the screens. In our figure, each node (other than the six hub nodes that denote a screen condition) represents a hit from the screens. Thus, the nodes are connected by edges only to the screening conditions, not to each other. In this scenario, highlighting TRIM37 ‘interactions’ would only highlight the screening conditions for which TRIM37 was a hit (200 nM RPE-1, 500 nM RPE-1, 200 nM A375, 500 nM A375). We could try to overlay functional enrichment data on the graph, but this data is presented separately in Figure 2 – figure supplement A-D. The large circles represent hits found in previous screens and is indicated in the legend. Given the challenges of this figure we will modify it to improve its clarity.

      Reviewer 3, Major points

        • The presentation throughout the manuscript sometimes made it difficult to follow exactly what the authors meant when they referred to the various doses of Centrinone used in their experiments-often using the terms "low" or "high" without specifying exactly what they mean. In Figure 1A, for example, they present a growth inhibition curve using a log10 scale of Centrinone concentration, and they conclude that growth was inhibited "at concentrations above 150nM, with full inhibition observed at concentrations greater than 200nM". I presume this is just sloppy language, as it appears that growth is significantly inhibited at 150nM and full growth inhibition is achieved at 200nM. However, in Figure 4D, the authors show another growth inhibition curve (this time presented on a linear scale) where significant growth inhibition is seen well below 100nM and full inhibition appears to be achieved at ~125nM. The discrepancy between these experiments is not noted, nor any reason for it explained. We agree with the reviewers and apologize for using ‘low’ and ‘high’ as they are ambiguous. We will ensure that we refer specifically to each concentration of centrinone B used (ex. 50 nM, 150 nM etc.). The comparison between Figure 1A and Figure 4D is not straightforward. The experiments presented were performed approximately 6 years apart and in slightly different ways. As reviewer 3 indicates, Figure 1A is presented in a log scale; this makes it difficult for the reader to determine the exact concentrations of centrinone B used. For this panel, we used, 0 (DMSO), 10, 30, 75, 165, 200 and 500 nM centrinone B. For Figure 4D, we used 0, 50, 125, 150, 167, 200 and 500 nM. The only point that might be anomalous is 75 nM in Figure 1A. We do see approximately 25% inhibition using 50 nM centrinone B in Figure 4D, but no inhibition using 75 nM in Figure 1A. We can offer two explanations for this discrepancy. First, we noticed small deviations in the potency of centrinone B batches. Second, for Figure 1A, cells were assayed using a passaging assay where they are continuously plated, counted and re-seeded. Cells in Figure 4D were assayed using a clonogenic assay where cells are plated at low density and allowed to grow over the course of approximately two weeks. It is possible that a combination of these factors led to the highlighted discrepancy. We feel that the discrepancy is a minor one and we propose the following as a solution. We will present the growth data in Figure 1A as a scatter / box plot using only 200 and 500 nM centrinone B since these are the drug concentrations we use for the screen conditions and the key conclusions are derived only from these concentrations (i.e. both concentrations result in p53-dependent growth arrest where centrioles are overduplicated after 200 nM centrinone B, while centrioles are lost after treatment with 500 nM). We hope that this explanation and changes satisfy the reviewers.

      While discrepancies such as this may seem trivial, they make it hard to interpret some of the authors conclusions. For example, in their initial screen, the "low" dose of Centrinone (200nM) leads to centriole amplification and genes that block centriole duplication or PIDDosome function (which normally signals the presence of extra centrioles) are required for the growth arrest triggered by this concentration of the drug (Figure 1B). To me, this suggests that centriole amplification is required for this growth arrest at 200nM. However, when the authors test a more graded series of concentrations they conclude "excess centrioles might not be the trigger for this arrest at low Centrinone B concentrations". I assume they are using "low" here to indicate concentrations at or below 150nM (even though they use low to mean 200nM in their initial screen)? In the Discussion, they state that TRIM37 is "required for the growth arrest in response to partially or fully inhibited PLK4, but this activity was independent of the presence of excess centrioles". Again, it is not clear to which experiments they are referring when they talk about "partially" or "fully" inhibited PLK4, but, if this is correct, then why are genes required for centriole duplication and PIDDosome function identified in their initial screen as being required for the growth arrest at 200nM but not 500nM? Do they consider 200nM to be fully inhibiting PLK4? *

      We observed that cells arrested after treatment with either 200 or 500 nM centrinone B. Additionally, we observed centriole over-duplication after 200 nM but centriole loss at 500 nM. Our initial hypothesis was therefore that either centriole overduplication or loss resulted in growth arrest. Our subsequent results with TRIM37 caused us to question this simple interpretation. To determine if centriole overduplication caused by 200 nM centrinone B triggers growth arrest in this case, we induced centriole overduplication by overexpressing PLK4 and, surprisingly, TRIM37 was not required for growth arrest in these conditions, similar to that observed by __Evans et al., 2020. Thus, we have two conditions where centriole overduplication is observed where the growth arrest in only one condition is dependent on TRIM37. This is an important difference that we will better highlight in our revised manuscript. We will also present a better model and/or table outlining our most salient results. Briefly, it is thought that partially inhibited PLK4 blocks its own auto-phosphorylation and therefore blocks its degradation. The overall abundance of PLK4 therefore increases under these conditions and overduplication occurs. In our hands, we consider PLK4 to be partially inhibited in RPE-1 or A375 cells at any concentrations of centrinone B at 200 nM or lower.__

      Please also see similar comment to Reviewer 2, Minor point 1.

      Presumably it will only require textual changes to address this point, but it is hard to assess the broader significance of the paper until these points are clarified: is the main point of this paper that the cells response to Centrinone treatment is complicated and the role of TRIM37 equally so; or, is there a narrative that leads to a clear hypothesis that can explain these surprising findings?

      We don’t currently have a model that explains all the results we observe with TRIM37. We have data that is consistent with some previously published results and data that challenges some of these recent reports. The current model suggests that TRIM37 E3-dependent remodeling of CEP192 underlies its growth arrest activity after centriole loss. Importantly, we find that TRIM37 supports growth arrest in an E3-ligase-independent manner. We will discuss this further in our revised manuscript, as well as providing additional hypotheses based on our other observations of TRIM37 function.

      • It seems a striking omission that the authors show that p53 and p21 are induced by 200nM and 500nM Centrinone (Figure 1D), but they don't assay these proteins at any concentration lower than this. Perhaps they are saving this data for a subsequent manuscript, but the authors certainly seem to draw conclusions from several experiments they perform at concentrations below 200nM, so they should at least explain why they don't assay p53 and p21 status in these experiments. *

      We apologize for not including this data in the original version of the manuscript. It will be included in the revised version.

      Reviewer 3, Minor points

        • In the abstract the authors claim that the way in which altered centrosome numbers cause a p53-dependent growth arrest is evolutionarily conserved. This is misleading, as it implies that the loss and gain of centrosomes trigger the same arrest (which is probably not correct), and most of the data to date suggests that flies and worms (two popular models for centrosome research) do not have such a growth-arrest pathway.* This is a good point. We will modify this statement to indicate that p53-dependent arrest is confined to mammalian cells: “Altered centrosome numbers cause a p53-dependent growth arrest in both mouse and human cells through mechanisms that are still poorly defined”.

      Reviewer 3, comment in ‘significance’

      I could not discern, however, whether one could draw any broader conclusions than this, in part due to the presentation problems described above. Moreover, in the abstract the authors propose that altering PLK4 activity alone is sufficient to signal growth arrest. This would be an important conclusion, and I presume this refers to the very low dosage Centrinone experiments that trigger growth arrest without altering centrosome numbers and which does not require TRIM37? If so, this arrest is poorly characterised here and will be the subject of a future investigation, so it seems to strange to have this as a major conclusion in the abstract.

      We agree. As reviewer 3 points out, based on our findings we hypothesize that altered PLK4 activity could itself signal growth arrest. As this is not supported experimentally, we will remove it from the abstract and discuss this tantalizing possibility within the discussion.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. If no revisions have been carried out yet, please leave this section empty.

      Most of the experiments are currently ongoing and the preliminary results we have obtained discussed in the previous section. The revised manuscript will be modified to address each and every concern of the three reviewers as detailed above.

      4. Description of analyses that authors prefer not to carry out

      Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.

      We will carry out all the experiments requested by the reviewers as detailed above.

      References

      Balestra FR et al., TRIM37 prevents formation of centriolar protein assemblies by regulating Centrobin. Elife. 2021 Jan 25

      Cuella-Martin R et al., 53BP1 Integrates DNA Repair and p53-Dependent Cell Fate Decisions via Distinct Mechanisms. Mol Cell. 2016 Oct 6;64(1):51-64

      Evans LT et al., ANKRD26 recruits PIDD1 to centriolar distal appendages to activate the PIDDosome following centrosome amplification. EMBO J. 2021 Feb 15;40(4)

      Meitinger F et al., TRIM37 controls cancer-specific vulnerability to PLK4 inhibition. Nature. 2020 Sep;585(7825):440-446

      Moyer TC et al., Binding of STIL to Plk4 activates kinase activity to promote centriole assembly. J Cell Biol. 2015 Jun 22;209(6):863-78

      Sillibourne JE et al.,Autophosphorylation of polo-like kinase 4 and its role in centriole duplication. Mol Biol Cell. 2010 Feb 15;21(4):547-61

      Wong YL et al., Cell biology. Reversible centriole depletion with an inhibitor of Polo-like kinase 4. Science. 2015 Jun 5;348(6239):1155-60

      Yeow ZY et al., Targeting TRIM37-driven centrosome dysfunction in 17q23-amplified breast cancer. Nature. 2020 Sep;585(7825):447-452

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): In this paper, the authors use a previously published method SHAP for interpreting deep learning (DL) models (specifically LSTMs) that are trained for predicting physicochemical attributes of peptides (such as antigenicity and collisional cross section). The paper shows that it's capable of identifying some amino acid residues contributing to the prediction results of the DL models. Reviewer #1 (Significance (Required)):

      1. One main ideas of the paper is to use SHAP for determine the significant amino acids at each position (or pairs of AA at each position) contributing to the prediction. Some of the interpretation results are consistent with findings reported previously. This is very nice; however, most of these findings are statistical results such "XX is often present at the second position for the peptides with the positive outcome", which are relatively straightforward and may be derived by using some statistical methods without using DL models. We expect more complex patterns can be discovered in addition to these statistical observations.

      We thank the reviewers for these comments.

      First, to the point about discovering complex patterns, we note that one use of PoSHAP we discuss later in the paper is that PoSHAP enables interposition dependence analysis, which depends on interactions between residues and would not be reflected by summary statistics.

      Second, we agree it is important to show whether PoSHAP produces different residue importance maps than simple statistical summaries of amino acids in each group. The strongest binding peptides, or the highest mobility for the CCS model, were determined by taking only peptides that fall above a linear regression best fit of the ranked experimental values. Statistical summary heatmaps were created and then compared to those from PoSHAP revealing some similarities but also many differences. We added the following text and new figure to the results section to illustrate these points:

      “We wondered whether the patterns revealed by PoSHAP simply reflect the summary statistics for the high-binding or high-CCS subset of peptides. As expected, due to known differences in amino acid abundance across the proteome, the prevalence of amino acids was different across the training data and were also heterogeneous across positions (Figure 5A). To determine the subset of high CCS peptides, peptides were ordered in the training set by their CCS rank and then linear regression was performed to get the average trend line (Figure 5B). Any peptide above that trendline was defined as “high CCS”, and the frequency of amino acids at each position in this set was summarized using a heatmap (Figure 5C). Compared to the statistical amino acid frequencies, PoSHAP suggests a greater importance to arginine at both termini, the importance of tryptophan to increase CCS becomes apparent, and interior glutamic acid contributes less to high CCS than the frequencies would suggest (Figure 5D). The same analysis was repeated for MHC data (Supplementary Figures 9 and 10). This demonstrates that PoSHAP found non-linear relationships between the inputs and the outputs that are not present by simple correlation. “

      Figure 5: Amino acid summary statistics differ from PoSHAP values for the CCS data. (A) Amino acid counts as a function of position for training data. (B) Procedure for picking the ‘top peptides’ with the highest CCS. Linear regression was performed on the peptides ranked by their actual CCS value. Any peptide that fell above the trendline and overall mean were defined as ‘top peptides’. (C) Counts of amino acids for the top peptides were summarized in a heatmap. (D) Mean SHAP values across amino acids and positions from PoSHAP analysis.

      We also added the corresponding supplemental figures showing the same examples for the MAMU A001 model and human MHC models:

      Supplemental Figure 9: Amino acid summary statistics differ from PoSHAP values for the A001 MAMU MHC I data. (A) Amino acid counts as a function of position for training data. (B) Procedure for picking the ‘top peptides’ with the highest CCS. Linear regression was performed on the peptides ranked by their actual CCS value. Any peptide that fell above the trendline and overall mean were defined as ‘top peptides’. (C) Counts of amino acids for the top peptides were summarized in a heatmap. (D) Mean SHAP values across amino acids and positions from PoSHAP analysis. For the MAMU model, the amino acid frequencies of the input peptides show no obvious preference for amino acid position, but some amino acids are over-represented overall. The presence of the “end” token is more likely to be a high binder statistically (C), but the PoSHAP reveals that this end token is not the main determinant of binding (D).

      Supplemental Figure 10: Amino acid summary statistics differ from PoSHAP values for the human A1101 MHC I data. (A) Amino acid counts as a function of position for training data. The distribution of amino acids in this data. (B) Procedure for picking the ‘top peptides’ with the highest CCS. Linear regression was performed on the peptides ranked by their actual CCS value. Any peptide that fell above the trendline and overall mean were defined as ‘top peptides’. (C) Counts of amino acids for the top peptides were summarized in a heatmap. (D) Mean SHAP values across amino acids and positions from PoSHAP analysis. There are clear differences between the summary statistics of top peptides (C) and PoSHAP heatmap (D). For example, the end token is prominent in the summary statistics absent from the PoSHAP interpretation. Also, the preference for S/T/V at position two is tempered according to PoSHAP, but would be determined to be very important by the summary statistics.

      Although the interpreting results reported in the paper largely agree with previous reports, the paper did not explicitly model the frequency of different amino acid in the training data. For instance, if the amino acid 'A' happens to be over-represented in the positive samples of peptides in the training data, the DL model may consider it as to contribute to the positive prediction, which may not be not true. This issue might become more serious when pairs of amino acids are considered. The authors may want to analyze this potential issue in their results.

      We agree and understand the concern for the overrepresentation of amino acids that might skew the training of our models. To determine if this is an issue, as part of the response to the previous question, we looked at the amino acid counts for all peptides (Figure 5A, Supplemental Figures 9A and 10A). In general, the PoSHAP heatmaps (panel Ds in the same figures) look very different from the frequencies of amino acids (panel Cs in the figures), suggesting that amino acid frequencies have not caused any problem.

      Even on a balanced training dataset, the LSTM model to be interpreted may still contain arbitrary bias due to invertible overfitting, which the authors did not discuss. It will be more convincing by training multiple models using different hyper-parameters and optimization algorithms, and then see if similar interpretation results can be reached among most or all of these models.

      We assume the reviewer meant ‘inevitable overfitting’ instead of “invertible overfitting”? If so, the original manuscript did assess overfitting in Figure S4 based on the training and validation loss over training epochs.

      We think the reviewer makes a good point that different models might produce different interpretations, so we trained new models without optimization and with different hyperparameters and with a different optimizer (RMS prop). We see essentially the same PoSHAP interpretations. We added the following text to the results section along with these three new supplemental figures:

      “Given the dependence of the model interpretation results on the model used, the same model architecture trained with different parameters might result in different model interpretation. Given this, models for each of the three tasks mentioned here were retrained with different hyperparameters including the “RMS prop” optimizer. Each model produces similar or better prediction performance compared to the earlier version, and the model interpretation by PoSHAP was almost identical to the previous results in all three cases (Supplementary figures 12, 13, 14). This suggests that the model architecture drives the differences in interpretation, not the model training process.”

      Supplemental Figure 12. PoSHAP Analysis of Mamu A001 With Unoptimized Hyperparameters and RMSprop. A new model for the Mamu data was trained using the same architectures but with different hyperparameters and RMSprop as the optimization algorithm. Loss was plotted as mean squared error compared to the validation data. (A) Similar metrics for MSE, r, and p-values were obtained (B). Similar patterns are also observed for the PoSHAP heatmap of A001. (C) A dependence plot for A001 shows similar patterns to the Adam optimized model, including the positional dependence of proline at position two for high SHAP values of serine and threonine.

      Supplemental Figure 13. PoSHAP Analysis of A:11*01 With Unoptimized Hyperparameters and RMSprop. A new model for the A:11*01 data was trained using the same architectures but with different hyperparameters and RMSprop as the optimization algorithm. Loss was plotted as mean squared error compared to the validation data. (A) Similar metrics for MSE, r, and p-values were obtained (B). Similar patterns are also observed for the PoSHAP heatmap of A:11*01. (C) The SHAP ranges by position plot for A:11*01 shows similar patterns to the Adam optimized model, including the largest range of SHAP values at position two, nine, and ten.

      Supplemental Figure 14. PoSHAP Analysis of CCS With Unoptimized Hyperparameters and RMSprop. A new model for the CCS data was trained using the same architectures but with different hyperparameters and RMSprop as the optimization algorithm. Loss was plotted as mean squared error compared to the validation data. (A) Similar metrics for MSE, r, and p-values were obtained (B). Similar patterns are also observed for the PoSHAP heatmap of CCS. (C) Dependence analysis was performed on the dataset and the combined distance-interaction type bar plot shows similar relationships between the groupings, notably charge repulsion’s split.

      For the dependence analysis, it is not completely clear why the distance is used as the variable, while the relative position of the amino acid residue in the peptide is ignored. For example, if there is a strong interaction between the first and the last residues in the peptide, their distance changes depending on the peptide length. In figure 6, the authors showed strong interactions between amino acid that are 8-9 residues apart may suggest the peptide length actually plays a role here.

      We used distance because as the dependence analysis is a calculation of the difference in means between two distributions of SHAP values, dependent of the amino acid at another position. We believe that the distance between these interacting points is a natural choice and among the most informative metrics to explain these interactions. We agree with the reviewer that peptide length is important to the magnitude of the interactions between amino acids. We also recognize that there may be interactions between the peptide termini that could be obscured by the interactions of the longer peptides. To better explore this possibility, we performed the dependence analysis on each of the different peptide lengths separately (8, 9, or 10 here) to see if this is the case. Unfortunately, given the smaller size of these data subsets, we were unable to show significant differences in the interaction groupings. Though, interestingly enough, the significant interactions for the peptides of length eight only occurred between neighboring amino acids or the termini. This may suggest an interaction between termini that could be explored in the future.

      We added the following text and supplemental figure 11 to the results:

      “Finally, to try to ask if the absolute positions of amino acids in the peptide are relevant for the interaction, the data was split into 8, 9, or 10mers before analysis (Supplemental Figure 11). This revealed that there may be interactions between the termini, but this effect may be difficult to observe because there are significantly fewer 8mers and 9mers in the CCS dataset.”

      Supplemental Figure 11. SHAP Values of Collisional Cross Section by Peptide Length. The impact of peptide length on SHAP values was explored for the CCS data. The dataset was split into peptides of length 8, 9, and 10. All SHAP values were plotted as violin plots. The mean SHAP values were plotted in heatmaps by position and amino acid and standardized. Significant interactions by dependence analysis were plotted in bar charts by distance between interactions.

      To further support our decision to use distance as an interaction metric, we have also now included an additional box plot for Figure 7, demonstrating the interactions between each of the categories combined with distance. We have found that some of the bimodality of the interaction categories are explained by the distance at which they interact. Most strikingly is charge repulsion that decreases CCS when neighboring but increases CCS when the interaction is further.

      We added the following text and updated Figure 7 to the results section:

      “Additionally, there are interesting differences in the interactions of the amino acid among the significant set of interactions (Figure 76B). All significant interactions from the CCS data (Supplemental Table 3, adj. p-value Though it is evident that the mean of each interaction type corresponds to the expected impact those interactions would have on CCS, each of the interaction dependence plots are bimodal, with some interactions increasing CCS and some decreasing it. To dissect this observation further, we combined the two methods of splitting the data to see if the bimodality of interaction types would be resolved by distance (Figure 7C). Though definitive conclusions cannot be made for most categories, likely due to the ever decreasing sample size by splitting, of note is the difference between neighboring charge repulsion and non-neighboring charge repulsion. Neighboring charge repulsion seems to decrease CCS while distant charge repulsion increases CCS (see adjusted p-value from Tukey’s posthoc test in Figure 7D). When distant, charge repulsion makes intuitive sense as the amino acids are forced apart, linearizing the peptide and increasing the surface area. When neighboring, it is possible that the repulsion causes a kink in the linear peptide, decreasing the cross section. Overall, these analyses demonstrate that the models were able to learn fundamental chemical properties of the amino acids and through PoSHAP analysis we were able to uncover them.”*

      Figure 7. Dependence analysis of CCS model. (A) Significant (Bonferroni corr. P-value = charge repulsion, * = other, and δ = polar. For the distance analysis, interactions were grouped into three categories, neighboring (distance = 1), near (distance = 2, 3, 4, 5,6), and far (distance = 7, 8, 9). * indicates significance (ANOVA with Tukey’s post hoc test p-value

      Also, it would be better to show that how the result looks like when applying this method to peptides in the negative samples (e.g., the peptides that are not bound by MHC in the antigenicity prediction experiment). Will the interpreting results also be negative?

      We agree this is an interesting idea. We updated the supplemental figure showing PoSHAP of top peptide subsets to also show PoSHAP of bottom peptide subsets (supplemental figure 8). The results suggest that certain amino acid positions are detrimental to binding, for example D/E at various positions. We updated this section to add:

      “We also performed the same analysis with the eight peptides with the lowest binding predictions (Supplemental Figure 8). These PoSHAP heatmaps are primarily composed of negative SHAP values, suggesting that using this subset reveals amino acids at certain positions that are detrimental to MHC binding.”

      Supplemental Figure 8. Pooled PoSHAP for bottom and top predicted subsets of the data. The mean SHAP values for each amino acid at each position were calculated for the peptides with the bottom (A) or top (B) 0.013% predicted intensity (top 8 peptides) for the “A” Mamu alleles. Due to the small sample size, most of the amino acid positions have a value of zero. The positions with extreme values, however, illustrate important amino acids for prediction. Notably for A001 and A002, aspartic acid and glutamic acid contribute to low prediction along the peptide, suggesting charge may inhibit binding. For the top predictions, phenylalanine or leucine are important at the first position for both A001 and A008. A serine or threonine at position two is important for A001, A002, and A008. All alleles demonstrate the importance of a proline near the middle of the peptide.

      Finally, it will be interesting to see the interpreting results when the method is applied to the DL models on more challenging tasks such as the prediction of tandem mass spectra of peptides. The authors may want to discuss these applications.

      We agree it would be very interesting to apply this method to interpret predictions of tandem mass spectra. In this paper we already demonstrated PoSHAP on three different datasets with three different models, so we feel that adding a fourth model is out of the scope of this work. We do agree that we would like to explore this option in the future. We added this idea to the discussion section:

      “Altogether the advances described herein are likely to find widespread use for interpreting models trained from biological sequences, including models not covered here such as those to predict tandem mass spectra (reviewed in 33).”

      I am primarily interested in algorithmic and statistical problems in genomics and proteomics. We have develop deep learning models for predicting the full tandem mass spectrum of peptides, and am interested model interpretation methods to explain the fragmentation mechanism resulting in non-conventional fragment ions in tandem mass spectra of peptides. I review the paper in collaboration with my Ph.D students, who are developing deep learning models for computational mass spectrometry.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): **Comments to the Authors** In this study, the authors developed a framework named PoSHAP for the interpretation of neural networks trained on biological sequences. The current manuscript can be stronger if the following issues can be clearly addressed.

      1. As interpreting model with SHAP is a vital part of this manuscript, it would be better to provide descriptions of the underlying principles of SHAP to enable the readers to understand the paper easily.

      We recognize that understanding the principles of SHAP is vital. To better explain SHAP, we have added the following text to the introduction:

      “SHAP is a perturbation-based explanation method where the contribution of an input is calculated by hiding that input and determining the effect on the output. SHAP expands this using the game theoretic approach of Shapely values that ensures the contributions of the inputs plus a calculated baseline sum to the predicted output.”

      It is emphasized in the manuscript that PoSHAP is introduced to interpret neural networks trained on biological sequences. However, it is not clear why the authors choose the Model Agnostic Kernel SHAP, which is based on Linear LIME. Although it can be used for any model, the performance of which may not be optimal. In this regards, perhaps Deep SHAP or Gradient SHAP is more appropriate, both of which are designed for deep learning networks [1]. It would be better to provide some additional experiments on Deep SHAP and this work will be more convincing if the same or similar contribution of each position on each peptide as that of Kernel SHAP. [1] Lundberg, S., and S. I. Lee. "A Unified Approach to Interpreting Model Predictions." Nips 2017.

      Our goal in using KernelExplainer was to demonstrate that PoSHAP was not dependent on model specific interpretation methods. However, we have realized that this intention may not have been clearly stated or demonstrated. To expand on this, we have included a new Figure 8, which shows PoSHAP analysis comparisons to other classes of machine learning models, all using Kernel Explainer. This result was interesting because it revealed that even though the XGboost model technically performed better at prediction (Figure 8A, reduced MSE and higher spearman rho), and produced a similar PoSHAP motif heatmap, the interpositional dependences from the perspective of distance (Figure 8C) or chemical interactions (Figure 8D) were substantially muted. This is also apparent with the other standard machine learning model ExtraTrees. This result shows that the choice of model architecture is important, and this direct comparison would not be possible if we used the DeepExplainer.

      We added the following text and figure to the manuscript:

      “ PoSHAP uses the SHAP KernelExplainer method, which is based on Local interpretable model-agnostic explanations (LIME). Using the general KernelExpplainer method enables direct comparison of interpretations produced by different models trained from the same data. To ask whether PoSHAP interpretation changes based on the model used, the CCS data was used to train XGboost or ExtraTrees models. Surprisingly, the XGboost model performed better than the LSTM model with regard to MSE and spearman rho between true and predicted values in the test set (Figure 8A). ExtraTrees was slightly worse than the other two models. The model interpretation heatmaps from PoSHAP were similar between the LSTM and XGboost, but the interpretation from the ExtraTrees model was missing the high average SHAP due to n-terminal histidine or arginine (Figure 8B). Even though XGboost produced a similar PoSHAP heatmap, the interpositional dependence with regard to distance (Figure 8C) and chemical interactions (Figure 8D) was muted. This shows that the choice of model is important for revealing amino acid interactions.”

      Figure 8. CCS PoSHAP of Various Machine Learning Models. PoSHAP analysis was performed on two additional machine learning models, Extra Trees and Extreme Gradient Boosting (XGB). Predictions were plotted against experimental values and the Mean Squared Error and r values are reported for each model (A). PoSHAP heatmaps were created for each model (B), illustrating an increase in model complexity as more sophisticated models are used. Dependence analysis was performed on each model and the significant interactions are plotted by distance (C) and by combined distance and interaction type (D).

      As described in the manuscript, "Correlations between true and predicted values were assessed by MSE, Spearman's rank correlation coefficient, and the correlation p-value." As an important indicator for evaluation, the exact p-values should be provided in the seven subgraphs in Figure 2, not p=0.0.

      We agree with the reviewer that reporting accurate p-values can assist in evaluation. We have updated the figures to reflect the p-values as far as we were able to determine them. Unfortunately, we are limited by the nature of the double data type in python and so reported that the p-value was less than the minimum value allowed by a double in six of the seven graphs. Additionally, the scales have been marked symmetrically as you mentioned in comment 4.

      It should be noted that the coordinate scales of Figure 2B and Figure 2C need to be marked symmetrically. And from Figure 2B, we can see that, the IC50 with smaller (0.8) values cannot be well predicted. Can the authors provide a detailed explanation about these results?

      We understand the reviewer’s concern with poor prediction of extreme values. Figure B represents the IC50 prediction for the A1101 human allele which was the smallest of the datasets we used for training. It only consists of 4,522 entries, around 1/10 of the data used for the Mamu alleles and CCS. Because of this, it is likely that there were not enough examples of datapoints at the extremes to reliably train the model to account for them. However, given the limited size of the dataset, we were surprised with the satisfactory predictions. More importantly, the purpose of our paper is model interpretation not model prediction accuracy, and this shows that even when predictions are not perfect, the model interpretation by PoSHAP can still be effective. We thank the reviewer for noticing this and added the following statement to the results:

      “Remarkably, this was achieved for A\11:01 using a total dataset of only 4,522 examples, which shows that PoSHAP can be effective with even less than 10,000 training examples. “*

      References are needed in some descriptions in the manuscript. For example, "one might train a network to take an input of peptide sequence and predict chromatographic retention time", "RNNs have found extensive application to natural language processing, and by extension as a similar type of data, predictions from biological sequences such as peptides or nucleic acids".

      We apologize for missing these references. We have now cited these statements and have added many additional references as part of our revision.

      The description of the adopted three models in the section "Model architecture" is a bit confusing. As described in this section, "The LSTM layer outputs a 50x128 dimensional matrix to a dropout layer where a proportion of values are randomly set to 0", "a second LSTM layer outputs a tensor with length 128 and a second dropout layer then randomly sets a proportion of values to 0". But as shown in the Supplemental Figure 3, the output size of the first LSTM was 10x128. Also, as shown in Table 1, the dropout rates were not 0. Therefore, the section should be adjusted for clear clarification.

      We apologize for the confusing wording. We meant that dropout layers randomly set values=0, not that the dropout proportion was 0. We reworded this part to read:

      “The LSTM layer outputs a 10x128 dimensional matrix to a dropout layer where a proportion of values are randomly “dropped”, or set to 0. For the MHC models, a second LSTM layer outputs a tensor with length 128 to a second dropout layer. Then in all models, a dense layer reduces the data dimensionality to 64. For the MHC models, the data is then passed through a leaky rectified linear unit (LeakyReLU) activation before a final dropout layer, present in all models.”

      Reviewer #2 (Significance (Required)): Pls refer to my comments provided as above.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)): **Summary:** The main goal of the work is to provide the interpretation of Deep Neural Networks (LSTM in the paper) trained on biological sequences. For this purpose authors used the framework introduced earlier - SHapley Additive exPlanations (SHAP), in particular - the slight adaptation of this method called positional SHAP (PoSHAP), because they are interested in the impact of each position of the input sequence to the model output. They demonstrate this on three regression tasks that predict peptide properties. **Major comments** The main contribution, highlighted in the paper: authors showed how PoSHAP discloses amino acid motifs that influence MHC I binding. Further they described how PoSHAP enables understanding of interpositional dependence of amino acids that result in high affinity predictions. Also they argued that this work also contributes to a method for accurate prediction of peptide-MHC I affinity using peptide array data enabled by novel application of a neural network that combines amino acid embedding and LSTM layers.

      There are some comments about the statements above: 1.Why was the LSTM model chosen? Recent publications showed the success of the Transformer model for biological sequences; however this direction was not covered in the related work overview. The architecture choice then should be better justified. Also the choice of LSTM for the biological sequences is not new and authors should better claim their statement about "novel application of a neural network that combines amino acid embedding and LSTM layers ". Where exactly is the novelty? Could the community use the pretrained embeddings for their purpose?

      The reviewer is correct that transformer models are highly effective for making predictions from biological sequences. In fact, many models do well, and there is no single correct choice of model for this task. Though there are many models to choose from, our models are sufficiently accurate. Importantly, the main contribution of our manuscript is not to train the most accurate models, but rather to demonstrate a strategy for positional model interpretation based on SHAP. Related to that point, please note our response to reviewer #2’s second comment that our approach uses the kernel explainer and can be applied to any model. However, we do agree that we neglected coverage of the transformer model in the introduction and have added a paragraph to the introduction covering some of the recent work in this area:

      “Many effective deep learning model architectures are available for making predictions from inputs of biological sequences, and there is currently no single correct choice. CNN models such as MHCflurry 2.0 (40) and LSTM models are effective at predicting MHC binding of peptides (41). Even simpler models, such as random forests, have been used to predict MHC binding (42,43). Prediction of other peptide properties like tandem mass spectra are often done with CNN or LSTM models (33). More recently, given the extraordinary performance of transformer models like BERT (44) and GPT-3 (45) for NLP, there is interest in transformer models for biological sequences (46).”

      We also want to be sure we do not overstate the novelty of our contributions. We have updated our discussion to better reflect the nature of our contributions. We reworded the statement quoted above to read:

      “Overall, the three modeling examples laid out herein serve as a tutorial for PoSHAP interpretation of almost any model trained from almost any biological sequence.”

      The attention mechanism itself provides the great opportunity to interpret the model predictions. In the introduction section authors made a statement that attention layers may limit the flexibility of model architecture when designing new models. Could they better explain this limit? Because recent state of the art models successfully work with long biological sequences and show better results then any other models (one example could be found here: https://openreview.net/pdf?id=YWtLZvLmud7). Authors should cover these limits more, that also related to the motivation of the LSTM choice.

      We added a paragraph to our introduction to expand on attention and its limitations:

      “Attention mechanisms have been successful in recapitulating experimentally defined binding motifs, but require that the model be constructed with attention layers. This may limit the flexibility of model architecture when designing new models. For example, attention mechanisms are specific to neural networks. Simpler models, such as random forests and XGboost, may also be more suitable for some applications, and these cannot utilize attention. Also, while attention mechanisms are currently very effective, there is always a possibility that new architectures will emerge that make interpretations using attention infeasible. Beyond this, attention is a metric of the model itself, while SHAP values are calculated on a per input basis. By looking at the model through the lens of the inputs, we can understand the model’s “reasoning” behind any peptide. Attention mechanisms also do not enable dissection of interpositional dependencies between amino acids. Thus, new methods for model agnostic interpretation are desirable.”

      Another statement was made about the PoSHAP - adaptation of the SHAP method. It is hard to follow through the explanation of this adaptation - it is not clear what exactly is this adaptation. For example, Kernel SHAP from the original paper computes feature importance, in this paper authors compute the impact of each position, that is basically also the feature importances. Thus authors should better explain the statement about PoSHAP novelty. Will it be possible to use PoSHAP for any other model trained for the same purpose? If yes, for better reproducibility, authors should provide the place where exactly in the repo is the code for this. Also mathematical notations are missing in the Positional SHAP (PoSHAP) section - it is better to explain the adaptation with them to increase the understanding of the section.

      We apologize for the ambiguous wording in the abstract stating that “PoSHAP adapts SHAP”. We have reworded this statement to “PoSHAP utilizes SHAP”. The novelty of this approach is taking the feature importance values calculated by SHAP and structuring them to include each position’s index to allow for the interpretation of biological sequences. As we demonstrate here, this allows for novel interpretations of previously published data and will enable model interpretation in future studies that learn from biological sequences. Although this is practically very simple, we are not yet aware of any examples in the literature that do this.

      The following two SHAP force plots demonstrate the difference between using SHAP as-is versus PoSHAP. There is a demonstrated need for such a framework, considering the dearth of biological sequence model interpretation using SHAP and the ambiguity within biological sequence SHAP interpretation. For example, Meier et al., Nature Communications, 2021 performed an analysis like our Figure S7C, which just shows the range of SHAP values per residue. Although we can learn something about which AAs are important based on the range of their SHAP values, SHAP as-is doesn’t reveal a motif. While our position indexing is a simple change, it enables all the rich, sequence dependent analysis we performed in this paper. We added the following text to the results section with this new supplementary figure:

      “PoSHAP utilizes the standard SHAP package but adapts the analysis by simply appending an index to each input and maintaining positional information after the kernelExplainer interpretation, which enables tracking of each input postion’s contribution to an output prediction (supplementary figure 5, showing force plot with and without index).”

      Supplemental Figure 5. SHAP Forceplots Demonstrating PoSHAP Indexing. Two forceplots were created with the SHAP forceplot method of the third peptide in the CCS testing set. (A) shows the plot with encoded inputs mapped to their amino acid. (B) shows the plot with the encoded inputs mapped to their amino acid and position. The addition of positional indexing removes the ambiguity of contributions, for example, glutamine having both a positive and a negative SHAP contribution to the prediction of the third peptide.

      We have updated the repository to include a tutorial that demonstrates PoSHAP on provided data and explains how to use PoSHAP with your own model and data.

      In the experimental section, authors first compare the results with previously known. For example, for the human MHC allele A*11:01 model PoSHAP analysis shows the similar results as was shown with another approach. Based on the provided explanation, it is not clear why PoSHAP is better than the previously published method. The advantage of the PoSHAP should be better explained.

      We agree with the reviewer that the benefits of our approach should be as clear as possible. The referenced section of the paper is to validate our approach compared to another model interpretation technique. We added a new third paragraph to the discussion section to clearly explain the benefits of PoSHAP:

      “There are several benefits of PoSHAP over competing methods. First, PoSHAP determines important residues despite biases in the frequencies of amino acids (Figure 5, Supplementary Figures 9 and 10). PoSHAP is also applicable to any model trained from sequential data (Figure 8), and enables dissection of interpositional dependencies (Figures 6 and 7). Finally, we include a clearly explained jupyter notebook on Github that will take any model and dataset and perform PoSHAP analysis.”

      In the experimental section, after the PoSHAP performance verification, hypothesis generation was introduced. However, it is not clear how many hypotheses were generated; how many of them were known before; what kind of other categories are inside these hypotheses (unknown, possible and potentially interesting, etc).

      We are unsure as to how to quantify the number of hypotheses generated by our approach. In a sense, the SHAP value of each amino acid at each position within a heatmap represents a hypothesis of the contribution of that amino acid to the metric being predicted. Each significant interaction listed in the first three supplemental tables represents a hypothesis of the interactions between two given amino acids at two positions. To make these into testable hypotheses requires some analysis, as we have discussed. i.e. the two binding motifs (L-T-P, F-S-P) of A001, or the distance-type interactions within the CCS.

      The README section in the GitHub repo is not easily understandable. An additional explanation for each step is required (e.g., links to the folders where the calculated SHAP values, the trained models, all splits and all-important benchmarks are).

      We have updated the README and repository to explain how to use PoSHAP, and explanations of each item in the repository.

      **Minor comments**

      1. The prior studies should be covered better (see Major comments).

      We apologize for not better covering prior studies. We have significantly expanded the introduction by adding two new paragraphs and at least 10 additional citations.

      The work consists of some typos, for example: "However, because many reports forgo model interpretation" - "t" is missed.

      We did intend to use the word “forgo” not “forget” in that sentence. We have checked again thoroughly for spelling and grammar mistakes.

      The hyperparameters table, hyperparameter search section should be moved to the supplemental material, that's technical details.

      We moved this table to the supplementary materials.

      Reviewer #3 (Significance (Required)): Interpretation of the model results is an important topic for biology. New findings here could lead to new interactions opening, new drugs development etc. That is relevant for the applied ML Researches and computational biologists. This paper aims to provide a way to do it. Because my field of interest and expertise lies in Machine Learning for healthcare, language modelling of biological sequences and Natural Language Processing, this work is of great interest to me. So I mostly evaluated ML methodology presented in the paper.

    1. 42:07 - 46:47

      "I think as painful as those times are, they are so rewarding, because it requires that you reassess and decide to really commit. It's like, I could be doing this, I could be doing that. Actually, I'm doing this because this is not just my passion. There is something else that is driving me to do it. But I can't even describe it, I don't know what it is. Once you get to that place, windows start to open, and you start to realize things. I don't know if I would have come to, had I not gone through a rough time. 2018 was the year that I said, ‘What we're doing today should exist a hundred years from now.’ People should be growing food in their communities and creating an economic engine that provides jobs to the residents in the community, food, to the residents. And up to that point, I wasn't thinking. And the moment I thought, ‘a hundred years,’ I had to question a whole bunch of shit because I have always been inspired by anger. Anger is an amazing motivator. I realized once I had this notion that Project Eats should exist a hundred years from now, I realized you can't grow something with anger. You can start it with anger, it will motivate you and get you to levels of creativity, resourcefulness and imagination, but you can't grow it with that.

      And then the next level of, ‘Wow. I've got to grow this thing.’ I've got some shortcomings. I hate to ask people for anything. And so if you're going to grow it, you have to grow it with resources. And we, for the most part, have used the most valuable resources that we all have, which is the ones we have. Well, we can't really grow for a hundred years with that. How do you build that? And building that requires you to ask them that. And so I have these battles that I'm having in my head and out loud, I'm not asking that. And then having to resolve that, you know as uncomfortable as that may be, you know, if you want this thing bad enough, then embrace asking. Figure out how to love asking.

      And now I am really kind of psyched about fundraising. I'm really into it, I'm going to raise a whole lot of money. And I'm going to do it on my terms. So how do I do it and what do I do? You can always do it the way that is right for you to do it. And I think we get, again, socialized out of thinking that we can, that we have to follow through it again.

      The most profound thing for me has been how I view approaching art. And that evolved from the end of 2018. I believe art should be discovered. I believe we should engage the cause we discover. The notion is this, this stuff makes no sense to me, that we have to schedule time to see art. That's not how art feeds our soul. I actually want people to engage with whatever I make on their own. Get rid of those text labels for Christ's sake. Don't bombard me with how I'm supposed to see something. 'Cause when you do that, you disrupt the very reason that we are creating this conversation, and it’s a conversation, it’s not a mediated moment where I have to bow to your schedules and bow to the way to say it. And in that, my notions of what I create now have expanded."

    1. Author Response:

      Reviewer #1 (Public Review):

      [...] The approach taken by the authors is very thorough, and the conclusions are well supported by the data. I think this is an important contribution to the field, and I have only a few specific comments:

      – The authors should sequence the mgrB gene and upstream sequence, and the rpoS gene for TMPR6-10. If these strains don't have mutations in mgrB, I think it's important to sequence their genomes to find out why DHFR levels are higher than in wt cells.

      Response: This is an important point that we had overlooked. We have now amplified and Sanger sequenced the mgrB gene and its promoter from all 10 TMPR isolates. As expected, we do indeed find mutations at the mgrB promoter in all 10 isolates. These data have been added to the revised manuscript in the Figure 1A schematic.

      – Presumably the higher number of mutations in mgrB rather than folA reflects the mutational space available, i.e. there are more possible mutations that reduce mgrB expression than there are gain-of-function folA mutations. This is worth mentioning, since it has a big impact on the evolutionary path to resistance.

      Response: We thank the Reviewer for pointing this out. We have discussed this point in the revised version of the manuscript (Page 17, Line 477-480).

      Reviewer #2 (Public Review):

      [...] 1) The authors find that mutations in the mgrB locus precede mutations in folA during E. coli's response to TMP. Why only sequence 5 of the 10 TMPR mutants? Was this subset chosen for sequencing based on any specific criteria? Below are some follow-up comments.

      Response: We thank the reviewer for this comment. Initially TMPR 1-5 were chosen since these isolates encompassed the entire range of drug IC50 values observed by us. We have now amplified and Sanger sequenced the mgrB gene and its promoter from all 10 TMPR isolates. As expected, we do indeed find mutations at the mgrB promoter in all 10 isolates. These data have been added to the revised manuscript in the Figure 1A schematic.

      a. Do any of the mutations cause growth defects relative to the wild-type strain?

      Response: This is an insightful question indeed. We have not measured growth rates of the trimethoprim resistant isolates. However, we have measured fitness of TMPR1-5 relative to wild type in competitive growth assays. In these experiments all 5 isolates have measurable fitness costs (relative fitness for isolates was between 0.7-0.8) when grown in drug-free media. Since mgrB mutations are found in all 5 TMPR isolates, we believe this result to be generally in line with our values of fitness for the mgrB-knock out strain. However, since TMPR1-5 have multiple genetic changes, attributing the measured fitness costs of these isolates to mgrB-deficiency alone is not possible. We are currently in the process of dissecting out the relative contribution of the various mutations in TMPR1-5 towards shaping the final fitness of the isolates. However, these will likely be reported in a later manuscript.

      b. Line 103: What are the mutations in folA promoter region? Only mutations in the coding sequence are listed in table 1 and figure 1A.

      Response: We apologise for this error. Though we have sequenced both promoter and ORF of the folA gene, we only found mutations in the coding sequence. We have made the necessary change in the revised manuscript.

      c. Line 109: The authors speculate that IS-element insertions in the mgrB promoter region reduce its expression, maybe they can provide a reference here from previous studies that have analyzed such mutations. Also, including details of the length/size of these insertion elements within table 1 would be helpful.

      Response: We have added references substantiating our claim that IS-element insertion in the mgrB promoter reduces its expression (Page 4, Line 110, ref 34, 35). The length of the insertions is indicated in Table 1.

      d. Line 111: the phrase "stop-codon readthrough" is misleading. The authors should rephrase to clarify that the single nucleotide deletion leads to a shift in the reading frame leading to an altered protein sequence at the C-terminal end.

      Response: We agree that this phrase is mis-leading. We have modified it in the revised manuscript (Page 4, Line 112).

      2) Based on growth assays including competitions, and measurements of folA gene expression in mgrB-deficient E. coli cells, the authors conclude that tolerance to TMP is caused by PhoP-dependent upregulation of DHFR.

      a. The authors should rewrite the text (lines 143-155) to make the experimental design of the competitions more obvious to the reader. Indicating either within the figure legend or main text what ∆mgrB/total means would definitely make analysis of the figure and results easier for the reader The reader needs to go to the materials section to get a full understanding how exactly this experiment was performed.

      Response: We have re-written this section for greater clarity and also changed Figure 1D accordingly.

      b. In Figure 1C, the IC50 value for ∆phoP is similar to that of wild type. If PhoP-dependent expression of folA important for TMP tolerance/resistance, shouldn't we expect to see a lower IC50, similar to that of ∆mgrB∆phoP? Intriguingly, the data for wild type in Figure 1C appears to be in conflict with the data in Figure 3B, please clarify.

      Response: This is an important issue, and we thank the Reviewer for pointing this out. We think that the reason phoP deletion reverses the phenotype of mgrB-deletion, but has no detectable effect in an mgrB-expressing background is due to the culture media used by us. Our experiments were performed in LB, which is a low magnesium medium. Since magnesium activates the PhoPQ pathway, in LB basal activity of PhoPQ is expected to be very low. Upon deletion of mgrB, we believe that there is an elevation in ‘unstimulated’ PhoPQ activity. This elevation is due to loss of feedback inhibition by MgrB protein. As a result, the effects of PhoP deletion are most pronounced in an mgrB knockout strain. We are, however, unable to explain why the IC50 of ∆mgrB∆phoP is lower than wild type. The possibility that there may be cross-phosphorylation of other response regulators by uninhibited PhoQ cannot be ruled out, however we do not have any data to substantiate this yet.

      The data is Figures 1C and 3B come from independently performed replicates. The mean values of IC50 of Wt in these figures are 26±13 ng/mL and 40±20 ng/mL respectively, which are not statistically significantly different.

      c. In Figure 1D, it is hard to figure out the exact strains and conditions of each competition. For instance, the ratios 10:1, 100:1 and 1000:1 needs to be clearly labeled, "wild type: mgrB" or "wild type: specific mutant" as applicable, the label on the X-axis is misplaced. Does "WmgrB" refer to ∆mgrB? If yes, change to ∆mgrB. Fitness values need a label or put into a table.

      Response: We have re-formatted this figure for better clarity as suggested. ‘w’ refers to calculated value of relative fitness and we have moved these values to the main text (Page 5, Line 149-151).

      d. Line 172: incorrect figure citation, replace Figure 2B with 2A.

      Response: We have made this correction.

      e. Lines 180-181: Only 5 out of the 10 TMPR isolates were sequenced and found to have mutations in the mgrB locus. In the absence of sequencing data confirming such mutations in TMPR 6-10 isolates, the increased levels of DHFR cannot be attributed to loss of mgrB.

      Response: We have now amplified and Sanger sequenced the mgrB gene and its promoter from all 10 TMPR isolates. As expected, we do indeed find mutations at the mgrB promoter in all 10 isolates. These data have been added to the revised manuscript in the Figure 1A schematic.

      f. In Figure 2C, it would be helpful to show the GFP fluorescence data for the single deletions, ΔphoP and ΔrpoS, to further support the claim that TMP tolerance via DHFR upregulation is PhoP dependent. In addition, the X-axis should specify the promoter reporter that was used.

      Response: We have added these data to Figure 2C and also specified the promoter reporter used.

      g. Lines 181-183: reference for the previous work on W30G folA is missing.

      Response: We thank the reviewer for bringing this to our notice. We have added the appropriate reference.

      h. In Figure 2, there is a discrepancy in the level of DHFR observed for both TMPR2 and 3 isolates in panels D and E - the DHFR protein levels are much higher in panel E. Can the authors explain this discrepancy, especially given the W30G mutation in TMPR3 (expected to show reduced levels of DHFR)? Is the same amount of protein loaded in both experiments? If so, why are the levels of protein different (and vastly different for TMPR3)? Better quantification of the western blots depicting the signal for the replicates would be helpful.

      Response: In order to be able to detect the lower levels of DHFR in ΔphoP derivates of TMPR strains, we have had to overexpose the Western blots. This may explain the apparent discrepancy between Figure 2D and E. To enhance clarity and ease of interpretation we have now quantitated all the immunoblots in the manuscript and reported fold changes in expression level.

      3) The data presented here also show that mgrB and folA mutations act in synergy in TMP resistant E. coli.

      a. It would be useful to the reader to include a table listing the MIC values in Figure 3. The plate images showing the E-tests are difficult to read and less helpful in interpreting the MICs and can be moved to the supplement.

      Response: We thank for reviewer for this suggestion. We have removed the E-test images from the figure and have included a table with the MIC values in Figure 3.

      b. In Figure 3E (and lines 234-238), what was the strain background used for DHFR overexpression? The details are missing from the paper.

      Response: The pPRO-DHFR plasmid was transformed into wild type E. coli MG1655. This information has been included in the revised Figure 3E.

      4) To follow the adaptive pathway for TMP resistance, the authors sequenced genomes of TMP-resistant isolates.

      a. Line 283: How many strains were sequenced at each time point? "3 to 5" is confusing.

      Response: The number of strains sequenced by us varied for different time points and lineages. We have rephrased this to ‘upto 5’ strains to prevent confusion. The exact number of isolates sequenced at each timepoint are given in the supplementary tables.

      b. In Figure 4, the data points/symbols and lines are hard to read in both panels A and B. These graphs can be replotted with open symbols or different colors to help the reader analyze the figure much more easily.

      Response: We have used different colours for clearer representation of data in the revised figure.

      c. Overall, it is still unclear how folA expression is regulated by PhoP regulation. An alternate hypothesis is that loss of MgrB may influence folA gene expression in a PhoP independent manner. Have the authors ruled out this possibility?

      Response: We agree that our study has not shed light on the precise molecular mechanism by which PhoP signalling affects folA levels, except that it is unlikely to be a direct effect. The reason we do not think that the effect is PhoP-independent is that phoP-deletion reverses the phenotype of the mgrB knockout, as well as the TMPR1-5 isolates. However, we cannot yet argue that there is no contribution from PhoP-independent mechanisms. Further genetic analyses are underway in our laboratory to determine other molecular players of this pathway.

    1. At this point he stopped with a profound look. The letter, he continued, was addressed to the Chief Steward. Now what could Captain Ellis, the Master Attendant, want to write to the Steward for? The fellow went every morning, anyhow, to the Harbour Office with his report, for orders or what not. He hadn’t been back more than an hour before there was an office peon chasing him with a note. Now what was that for? And he began to speculate. It was not for this--and it could not be for that. As to that other thing it was unthinkable. The fatuousness of all this made me stare. If the man had not been somehow a sympathetic personality I would have resented it like an insult. As it was, I felt only sorry for him. Something remarkably earnest in his gaze prevented me from laughing in his face. Neither did I yawn at him. I just stared. His tone became a shade more mysterious. Directly the fellow (meaning the Steward) got that note he rushed for his hat and bolted out of the house. But it wasn’t because the note called him to the Harbour Office. He didn’t go there. He was not absent long enough for that. He came darting back in no time, flung his hat away, and raced about the dining room moaning and slapping his forehead. All these exciting facts and manifestations had been observed by Captain Giles. He had, it seems, been meditating upon them ever since. I began to pity him profoundly. And in a tone which I tried to make as little sarcastic as possible I said that I was glad he had found something to occupy his morning hours.With his disarming simplicity he made me observe, as if it were a matter of some consequence, how strange it was that he should have spent the morning indoors at all. He generally was out before tiffin, visiting various offices, seeing his friends in the harbour, and so on. He had felt out of sorts somewhat on rising. Nothing much. Just enough to make him feel lazy. All this with a sustained, holding stare which, in conjunction with the general inanity of the discourse, conveyed the impression of mild, dreary lunacy. And when he hitched his chair a little and dropped his voice to the low note of mystery, it flashed upon me that high professional reputation was not necessarily a guarantee of sound mind. It never occurred to me then that I didn’t know in what soundness of mind exactly consisted and what a delicate and, upon the whole, unimportant matter it was. With some idea of not hurting his feelings I blinked at him in an interested manner. But when he proceeded to ask me mysteriously whether I remembered what had passed just now between that Steward of ours and “that man Hamilton,” I only grunted sourly assent and turned away my head. “Aye. But do you remember every word?” he insisted tactfully. “I don’t know. It’s none of my business,” I snapped out, consigning, moreover, the Steward and Hamilton aloud to eternal perdition. I meant to be very energetic and final, but Captain Giles continued to gaze at me thoughtfully. Nothing could stop him. He went on to point out that my personality was involved in that conversation. When I tried to preserve the semblance of unconcern he became positively cruel. I heard what the man had said? Yes? What did I think of it then?--he wanted to know. Captain Giles’ appearance excluding the suspicion of mere sly malice, I came to the conclusion that he was simply the most tactless idiot on earth. I almost despised myself for the weakness of attempting to enlighten his common understanding. I started to explain that I did not think anything whatever. Hamilton was not worth a thought. What such an offensive loafer . . . “Aye! that he is,” interjected Captain Giles . . . thought or said was below any decent man’s contempt, and I did not propose to take the slightest notice of it. This attitude seemed to me so simple and obvious that I was really astonished at Giles giving no sign of assent. Such perfect stupidity was almost interesting. “What would you like me to do?” I asked, laughing. “I can’t start a row with him because of the opinion he has formed of me. Of course, I’ve heard of the contemptuous way he alludes to me. But he doesn’t intrudehis contempt on my notice. He has never expressed it in my hearing. For even just now he didn’t know we could hear him. I should only make myself ridiculous.” That hopeless Giles went on puffing at his pipe moodily. All at once his face cleared, and he spoke. “You missed my point.” “Have I? I am very glad to hear it,” I said. With increasing animation he stated again that I had missed his point. Entirely. And in a tone of growing self-conscious complacency he told me that few things escaped his attention, and he was rather used to think them out, and generally from his experience of life and men arrived at the right conclusion. This bit of self-praise, of course, fitted excellently the laborious inanity of the whole conversation. The whole thing strengthened in me that obscure feeling of life being but a waste of days, which, half-unconsciously, had driven me out of a comfortable berth, away from men I liked, to flee from the menace of emptiness . . . and to find inanity at the first turn. Here was a man of recognized character and achievement disclosed as an absurd and dreary chatterer. And it was probably like this everywhere--from east to west, from the bottom to the top of the social scale. A great discouragement fell on me. A spiritual drowsiness. Giles’ voice was going on complacently; the very voice of the universal hollow conceit. And I was no longer angry with it. There was nothing original, nothing new, startling, informing, to expect from the world; no opportunities to find out something about oneself, no wisdom to acquire, no fun to enjoy. Everything was stupid and overrated, even as Captain Giles was. So be it. The name of Hamilton suddenly caught my ear and roused me up. “I thought we had done with him,” I said, with the greatest possible distaste. “Yes. But considering what we happened to hear just now I think you ought to do it.” “Ought to do it?” I sat up bewildered. “Do what?” Captain Giles confronted me very much surprised. “Why! Do what I have been advising you to try. You go and ask the Steward what was there in that letter from the Harbour Office. Ask him straight out.”I remained speechless for a time. Here was something unexpected and original enough to be altogether incomprehensible. I murmured, astounded: “But I thought it was Hamilton that you . . .” “Exactly. Don’t you let him. You do what I tell you. You tackle that Steward. You’ll make him jump, I bet,” insisted Captain Giles, waving his smouldering pipe impressively at me. Then he took three rapid puffs at it. His aspect of triumphant acuteness was indescribable. Yet the man remained a strangely sympathetic creature. Benevolence radiated from him ridiculously, mildly, impressively. It was irritating, too. But I pointed out coldly, as one who deals with the incomprehensible, that I didn’t see any reason to expose myself to a snub from the fellow. He was a very unsatisfactory steward and a miserable wretch besides, but I would just as soon think of tweaking his nose. “Tweaking his nose,” said Captain Giles in a scandalized tone. “Much use it would be to you.” That remark was so irrelevant that one could make no answer to it. But the sense of the absurdity was beginning at last to exercise its well-known fascination. I felt I must not let the man talk to me any more. I got up, observing curtly that he was too much for me--that I couldn’t make him out. Before I had time to move away he spoke again in a changed tone of obstinacy and puffing nervously at his pipe. “Well--he’s a--no account cuss--anyhow. You just--ask him. That’s all.” That new manner impressed me--or rather made me pause. But sanity asserting its sway at once I left the verandah after giving him a mirthless smile. In a few strides I found myself in the dining room, now cleared and empty. But during that short time various thoughts occurred to me, such as: that Giles had been making fun of me, expecting some amusement at my expense; that I probably looked silly and gullible; that I knew very little of life. . . . The door facing me across the dining room flew open to my extreme surprise. It was the door inscribed with the word “Steward” and the man himself ran out of his stuffy, Philistinish lair in his absurd, hunted-animal manner, making for the garden door. To this day I don’t know what made me call after him. “I say! Wait a minute.” Perhaps it was the sidelong glance he gave me; or possibly I was yet under the influence of Captain Giles’ mysterious earnestness.Well, it was an impulse of some sort; an effect of that force somewhere within our lives which shapes them this way or that. For if these words had not escaped from my lips (my will had nothing to do with that) my existence would, to be sure, have been still a seaman’s existence, but directed on now to me utterly inconceivable lines. No. My will had nothing to do with it. Indeed, no sooner had I made that fateful noise than I became extremely sorry for it. Had the man stopped and faced me I would have had to retire in disorder. For I had no notion to carry out Captain Giles’ idiotic joke, either at my own expense or at the expense of the Steward. But here the old human instinct of the chase came into play. He pretended to be deaf, and I, without thinking a second about it, dashed along my own side of the dining table and cut him off at the very door. “Why can’t you answer when you are spoken to?” I asked roughly. He leaned against the lintel of the door. He looked extremely wretched. Human nature is, I fear, not very nice right through. There are ugly spots in it. I found myself growing angry, and that, I believe, only because my quarry looked so woe-begone. Miserable beggar! I went for him without more ado. “I understand there was an official communication to the Home from the Harbour Office this morning. Is that so?” Instead of telling me to mind my own business, as he might have done, he began to whine with an undertone of impudence. He couldn’t see me anywhere this morning. He couldn’t be expected to run all over the town after me. “Who wants you to?” I cried. And then my eyes became opened to the inwardness of things and speeches the triviality of which had been so baffling and tiresome. I told him I wanted to know what was in that letter. My sternness of tone and behaviour was only half assumed. Curiosity can be a very fierce sentiment--at times. He took refuge in a silly, muttering sulkiness. It was nothing to me, he mumbled. I had told him I was going home. And since I was going home he didn’t see why he should. . . . That was the line of his argument, and it was irrelevant enough to be almost insulting. Insulting to one’s intelligence, I mean. In that twilight region between youth and maturity, in which I had my being then, one is peculiarly sensitive to that kind of insult. I am afraid my behaviour to the Steward became very rough indeed. But itwasn’t in him to face out anything or anybody. Drug habit or solitary tippling, perhaps. And when I forgot myself so far as to swear at him he broke down and began to shriek. I don’t mean to say that he made a great outcry. It was a cynical shrieking confession, only faint--piteously faint. It wasn’t very coherent either, but sufficiently so to strike me dumb at first. I turned my eyes from him in righteous indignation, and perceived Captain Giles in the verandah doorway surveying quietly the scene, his own handiwork, if I may express it in that way. His smouldering black pipe was very noticeable in his big, paternal fist. So, too, was the glitter of his heavy gold watch-chain across the breast of his white tunic. He exhaled an atmosphere of virtuous sagacity serene enough for any innocent soul to fly to confidently. I flew to him.

      OM SA

    1. experiential knowledge

      I think that one of the concepts that we talked about in class comes to play here about when it is okay to start using scientific data as proof in court cases. Some factors that may come into play may be how widely accepted the research is, ideological beliefs, how much data was collected to back up the claim made from the research, etc.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank reviewers for helping us clarify our manuscript. Some key information was only in the Supporting Information document, and was not obvious to find. We have now introduced some of this information into the main text, and otherwise clarified to which specific sub-paragraph of the Supporting Information document we refer every time we mention it. Another aspect which we have clarified is the relevance of controls previously published in our paper PLOS Comp Biol 16: 1-23. These controls address many of the remarks raised by the reviewers, regarding for instance rhythm detection methods, detection threshold, the effect of normalization of time-series data in rhythm detection, the consideration of biological replicates in time-series data, or the relationship between rhythms and highly expressed genes. We have now introduced some of these results within the main text to clarify these points, or have specified to which specific result of our previous paper we refer.

      REVIEWER #1

      Major comments:

      They assumed the optimal constant level would be the maximum over the rhythm period when rhythmic regulation is absent. They also assumed the trade-off between the benefits of not producing proteins when they are not needed (costs saved) and the costs involved in making it rhythmic (costs of complexity), which they argued lead to the expectation that costlier genes be more frequently rhythmic. However, there was no explicit definition for the trade-off, so it is unclear how it leads to the expectation. [...]

      Second, the "costs of complexity" were not defined

      We have now clarified these points:

      Thus, a first evolutionary advantage given by rhythmic biological processes would be an optimization of the overall cost (over a 24-hour period), compared to a constant expression at a high level of proteins, when this high level is necessary **for fitness at least at some point of time.

      • Thus, a first evolutionary advantage given by rhythmic biological processes would be an optimization of the overall cost (over a 24-hour period), compared to the costs generated over the same period by optimizing a constant level of proteins. The reasonable assumption that the optimal constant level would be the maximum over the rhythm period strengthens the case for selection on expression cost.

      • Our results suggest that rhythmicity of protein expression has been favored by selection for cost control of gene expression, while keeping optimal expression levels. In the case of rhythmic genes, what would that optimal constant level be? We can propose two hypotheses. The first is that it would be the mean expression over the period, since this maintains the same overall amount of protein. The second is that it would be the maximum over the rhythm period, since that is the level needed at least at some point. The second hypothesis explains better the existence of this maximum level during the cycle. Of note, it also strengthens the case for selection on expression cost. Thus, for the case of rhythmic genes, the optimal constant level should at least correspond to the mean expression level (Fig 1d). We provide results obtained using both the maximum and the mean of expression in Fig. 2a. We have modified Fig. 1d accordingly, and specified in Supp Fig. S2 that the delta value was calculated from mean expression levels.

      We assume that the maximal expression level gives an estimation of the level that would be constantly maintained in the absence of rhythmic regulation

      • We assume that, in the absence of rhythmic regulation, the constant optimal level is included between the mean and the maximum expression level observed in rhythmic expression. Here, we studied the evolutionary costs and benefits that shape the rhythmic nature of gene expression at the RNA and protein levels. For this, we analysed characteristics we presume to be part of the trade-off.

      • Here, we studied the evolutionary costs and benefits that shape the rhythmic nature of gene expression at the RNA and protein levels. For this, we analysed characteristics we presume to be part of the trade-off determining the rhythmic nature of gene expression between its advantages (cost economy over 24h, non-ribosomal occupancy) and disadvantages (costs of complexity related to precise temporal regulation). The evolutionary** origin of maintaining large cyclic biological systems, in term of adaptability, can be seen as a trade-off between disadvantages such as cost or noise induced by the added complexity, and advantages such as economy over a daily time-scale, temporal organization, or adaptability.

      • Most rhythmic genes are tissue-specific (Zhang et al. 2014, Boyle et al. 2017), which means that their rhythmic regulation is not a general property of the gene and is therefore expected to be advantageous only in those tissues in which they are found rhythmic. This argues that rhythmic regulation has costs, since it is not general. These costs are **probably related to the complexity of regulation** to maintain precise temporal organisation. Thus, cyclic biological systems are expected to have adaptive origins.

        It would be more convincing to define a fitness function or cost function to demonstrate their argument that costlier genes have fitness advantages if they are rhythmic.

      Considering rhythmicity as an economy strategy is quite intuitive and our results confirm what is currently accepted (Wang et al. 2015). We show and discuss to which extent this is true by comparing expression costs at different expression levels. Defining more precisely a fitness function in our case would require an experiment where we could compare fitness between two populations (e.g. prokaryote growth rates): WT versus a strain whose promoter of the costliest genes would be controlled by non-cyclic transcriptional factors. We do not feel that this is a reasonable extension of this work, but a whole new research program.

      First, when proteins are not needed, it can be either the case of not producing extra proteins (cost saved) or the case of degrading excessive proteins (cost incurred). […]

      The cost function presented in this paper may be oversimplified. It only takes into account the costs to produce protein. The authors argued that a more complex cost calculation would not change the observation, but without proving it. However, protein degradation, including ubiquitination and proteolysis, requires energy; for a rhythmic gene, it is also necessary to consider the cost of maintaining the rhythmicity, including the temporally precise regulation of protein expression when the proteins are needed and of protein destruction when they are not.

      We have now clarified this in Section 4.1 of the Supporting Information document:

      Protein decay can be due to spontaneous decay of unstable molecules (no cost), cellular dilution (no cost), or active protein degradation, which has a cost which has been shown to be negligible. Costs of protein decay are negligible enough to not be opposed by selection. Indeed, Lynch and Marinov (2015) and Wagner (2005) have shown that “degradation in a lysosome may cost essentially nothing, and amino-acid export back to the cytoplasm consumes 1 ATP for every 3 to 4 amino acids”. Compared with the unique cost of producing one single nucleotide which consume 49~P, protein decay costs becomes negligible comparatively to transcriptional costs, which are themselves negligible comparatively to translational costs. All the more, given that amino acids from degradation are reused and do not need to be produced by the cell, which therefore economizes around 30 ~P per amino-acid (~P: high-energy phosphate bonds).

      In Section 3 of the Supporting Information document, we also show why rhythmic and highly expressed proteins are costlier for the cell per time-unit than rhythmic and lower expressed proteins, even considering decay costs or proteins half-lives.

      Thus, the order of costs between genes is not expected to be affected by a more complex calculation accounting for protein decay and protein half-lives.

      We think these points should be in Supporting Information document since they are not novel. Lynch and Marinov as well as Wagner have studied and reported these points in detail in their work. We have replicated their results and have used them to understand rhythmicity, which is the focus of our manuscript.

      The authors claimed that cycling genes are enriched in highly expressed genes, by showing rhythmic proteins are costlier than non-rhythmic proteins (based on the expression cost function) in several species. However, only the first 15% of proteins based on p-values ranking from their rhythm detection algorithms were classified rhythmic. One potential artifact of this classification is that the identified rhythmic genes are biasedly highly expressed genes because the lower-amplitude genes are harder to detect and excluded by the algorithm. If changing the threshold for rhythmicity to include more rhythmic genes with intermediate p-values (p-value Since the results of this paper would be sensitive to the accuracy of identifying rhythmicity at both mRNA and protein levels, it is crucial to validate the rhythm detection algorithm by cross-checking algorithm-generated results with those known rhythmic genes. Can the authors estimate the false positive and false negative rates in each group of the rhythmic and non-rhythmic proteins or mRNAs identified by their algorithm?

      Our 2020 paper (https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1007666) addresses these issues, but we did not make this sufficiently clear here. We have now added some details of our previous results in the main text to clarify, as this a logical limitation remark. We mostly use GeneCycle based on the results of the benchmarking in that paper; it notably produces a uniform distribution under the null hypothesis and a skew towards low p-values for all empirical data.

      Furthermore, cycling genes have been shown to **be over-represented among highly expressed genes (Laloum & Robinson-Rechavi 2020, Wang et al. 2015).

      • Furthermore, we have shown in our previous work that rhythmic genes are largely enriched in highly expressed genes, and that the differences in rhythm detection obtained between highly and lowly expressed genes either reflect true biology or a lower signal to noise ratio in lowly expressed genes (Laloum & Robinson-Rechavi 2020).

        Higher gene expression usually leads to lower genetic noise. The authors thus applied a definition of the stochastic gene expression (SGE) that controls the biases associated with the correlation between the expression mean and variance to evaluate expression noise. They found lower noise with rhythmic transcripts. However, they did not explain, mechanistically, why rhythmic RNA has lower noise and what is the biological meaning behind this finding. It is also unclear whether they considered the phase difference between signal and noise that usually exists in an oscillatory system.

      Please see answer to second reviewer.

      Minor comments:

      It would be helpful if the authors could interpret their observations including where the results may not be as significant. A few examples are listed below.

      1) In tissue-specific studies, they used the transcriptomics datasets from 11 mouse tissues to compare the difference in expression levels (based on z-score) of each gene between tissue groups of rhythmic and non-rhythmic expression and found higher gene expression in rhythmic tissues. However, proteins showed a bimodal distribution, and it would be helpful to add interpretation or discussion regarding this bimodal distribution.

      Note that for proteins, the delta was calculated based on only 3 or 4 tissues, which limits a lot our detection power. We now proposed the hypothesis:

      • We also provide results obtained from other datasets in supplementary Table S3, although they must be taken with caution since only 2 to 4 tissues were available, and sometimes data were coming from different experiments. Of note, for proteomic data, the distributions of are bimodal (Fig. S3), separating rhythmic proteins into two groups, with low or high protein levels in the tissues in which they are rhythmic. **A hypothesis is that for some tissue-specific proteins the rhythmic regulation is not tissue-specific, making them rhythmic also in tissues where they are lowly expressed. But the very small sample size does not allow us to test it, and we caution against any over-interpretation of this pattern before it can be confirmed.

        2) They also calculate partial correlation for rhythmicity with expression level over tissues for all tissue-specific genes (tau>0.5) and found Spearman's correlation coefficient is skewed towards negative (suggesting a correlation), but Pearson's correlation showed a positive peak. It indicates that a subset of genes is less rhythmic in the tissues where they are most expressed. Is this positive peak significant or expected? What are these genes? Any evolutionary benefits? Can the authors discuss the functional difference between these genes and other genes that follow the predictions?

      While Spearman’s correlation is clearly skewed towards negative correlations, i.e. lower p-values thus stronger signal of rhythmicity in the tissue where genes are more expressed, Pearson’s correlation also has a smaller peak of positive correlations (Fig. S4), suggesting a subset of genes which are less rhythmic in the tissues **where they are most expressed.

      • While Spearman’s correlation is clearly skewed towards negative correlations, i.e. lower p-values thus stronger signal of rhythmicity in the tissue where genes are more expressed, Pearson’s correlation also has a smaller peak of positive correlations (Fig. S4a), suggesting a subset of genes which are less rhythmic in the tissues where they are most expressed. We show that tissue-specific genes which are mostly rhythmic in tissues where they are highly expressed are under stronger selective constraint than those which are rhythmic in tissues where they are lowly expressed (Fig. S4b). Thus, rhythmic expression of this second set of genes might be under weaker constraints.**

      We added Fig. S4b in Supplementary figures.

      3) In SGE analysis, the scRNA data of Arabidopsis was from roots, while the data for detecting the rhythmicity was from leaves. Without knowing whether the gene expression patterns in these two different parts are comparable, it is hard to judge the results. The authors may want to provide some discussion.

      Indeed, this limits the interpretation for Arabidopsis, as noted in the results and in the discussion. We still prefer to report this pattern than to remove it. But, we have now moved the results obtained for Arabidopsis into Supplementary Table S5.

      • In Arabidopsis, the single-cell data used are from the root, while transcriptomic time-series data used to detect rhythmicity are from the leaves, which limits the interpretation. Despite this limitation, we found no evidence of lower noise for genes that are rhythmic at the protein level (Table 1b and 1e, and Supplementary Table S5), **and trends towards lower noise in almost all cases for genes with rhythmic mRNAs (Table 1a, 1c, and 1d).
      • Our results in mouse are consistent with all of these considerations (Table 1 and Supplementary Table S5), although it was not fully the case for Arabidopsis (Supplementary Table S5). However, this last point might be explained by the tissue-specificity of rhythmic gene expression. Indeed, for Arabidopsis, the time-series dataset come from leaves whereas single-cell RNA data come from roots.

        For Mouse tissues, while most show lower noise for rhythmic genes, they saw the opposite in Muscle. Is this significant? Any discussion?

      For mouse muscle, we had not mentioned it since it was the only tissue showing such a trend. We now added comment regarding this in the main text:

      • In mouse, tissue muscle gave opposite result, possibly because skeletal muscle is one of the most un-rhythmic tissues in the body.

        In various places of the text, the authors only pointed the readers to "Supporting information" without explicitly referring to a specific supplemental figure by its number. It would be helpful to cite a table or figure explicitly.

      We agree, and have corrected this. See first General Statements.

      Figure 2 does not have legends in the graphs.

      This is now corrected, thank you for your attention.

      REVIEWER #2

      Major comments:

      • Our major concern regards the identification of rhythmic genes.

      Despite we are not experts in the specific method used (details are not provided in the manuscript), a method looking for a statisical significant periodicity in a noisy signal will provide a high p-value for a signal sufficiently above the noise level. Gene expression data are noisy because of stochastic gene expression and technical noise (e.g., the sampling noise due to RNA capture in RNAseq data). This noise scales with the average level of expression. Lowly expressed genes generally display larger relative fluctuations (e.g., sampling noise is essentially Poisson-like). As a result, the method will identify with a higher probability genes that are highly expressed as rhythmic genes since the signal to noise ratio is generally higher.

      This could significantly bias the subsequent analysis, since most of the claims are related to a link between expression levels and rhythmicity.

      [There is not even an obvious separtation of timescales that can be invoked between a possible 24-hour periodic signal and the fluctuations. For example, the timescale of protein fluctuations can be largely set by dilution and thus have a timescale comparable to the cell cycle.]

      The authors should discuss this issue, which is overlooled in the current manuscript.

      How much this potential bias affects the selection of rhythmic genes can probably be assessed using synthetic data.

      • It would be useful to clarify in the main text what are the units of measurement of gene expression at the mRNA and at the protein level. If we understood correctly, the authors used FPKM and protein counts respectively. The dynamics in time could in principle be different if an absolute or a normalized level of expression is considered. For example, the cell cycle can be correlated with the circadian clock (as reported for example in cyanobacteria). Since the absolute amount of total proteins has to approximately double during a cell cycle (for cell size homeostasis), this can create a periodic signal in protein counts with a 24-hour period.

      The same reasoning does not hold true if the measurement is normalized, as in the FPKM case.

      The authors should discuss this issue or simply show that the results for proteins are robust if the protein count is normalized (for example with respect to the total protein amount).

      We haven’t focused the present manuscript on these issues since we recently published another paper which addresses these points: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1007666

      We have now added some details of our previous results in the main text to make the work more relevant.

      • The expression cost defined in the manuscript seems dominated by the expression level.

      It would be useful to report the scatter plot and the correlation level of cost versus average expression. A high correlation between these two quantities can largely recapitulate the results in Figure 2 (even though the results presented are still interesting per se). In other words, the relation between cost and rhytmicity sounds like a simple rephrasing of the relation between average expression level and rhythmicity (previously reported as correctly referenced in the manuscript).

      We now provide these results in Fig. S2 (Supplementary figures) and show a negative and significant correlation between the order of the rhythmicity signal and the total expression cost (calculated from the mean expression level). Since our previous benchmark show that the order of genes from most to less rhythmic genes is not very reliable for known methods, including the one used here, we prefer to present this result in the Supplementary figures document.

      • The empirical observation of a relation between noise and rhythmicity in mRNA expression is interesting, but we cannot fully understand its link with the theoretical arguments proposed.

      The Authors suggest that perodicity in mRNA expression could decrease protein noise at the peak of mRNA expression (Fig.S1). But this is not what they can measure in the single-cell data analyzed, where cell-to-cell variability is reported at a single timepoint for a cell population. If the oscillations are not syncronized in the cell population, an oscillating transcript would simply display a high cell-to-cell variability dominated by the amplitude of oscillations. Even if the oscillations are syncronized, there is no information in the dataset about the mRNA dynamics. Thus, mRNA cell-to-cell variability could have been measured at any point of its (putative) cyclic dynamics.

      Thus, we propose to make more clear the connections between the theoretical arguments and the empirical observation about noise in gene expression.

      Thank you for pointing out this issue. We have clarified the following in the main text:

      These considerations lead to predictions which we test here: i) a decreased stochasticity strategy for genes with rhythmically accumulated mRNAs ...**.

      • These considerations lead to predictions which we test here: i) a strategy to periodically decrease stochasticity for genes with rhythmically accumulated mRNAs .... Assuming that genes with low noise have noise-sensitive functions (and thus noise is tightly controlled), these results support the hypothesis that noise is globally reduced thanks **to rhythmic regulation at the transcriptional level.

      • Our results show that noise is globally reduced for genes with rhythmic regulation at the transcriptional level. Since rhythmic genes are not all in the same phase (Fig. S9a in Supporting information), we expect this result obtained for a given time-point (noise estimation based on a single time-point scRNA dataset) to be general to all time-points (section 6.3 in Supporting information). Assuming that genes with low noise have noise-sensitive functions (and thus noise is tightly controlled), these results suggest that rhythmic genes have their noise periodically and drastically reduced through periodic high accumulation of their mRNAs.

      • Thus, since we find lower noise among rhythmic transcripts, rhythmic expression of RNAs might be a way to periodically reduce expression noise of highly expressed genes (Figure 2 and Fig. S1-S2), which are under stronger selection. Indeed, we found that genes with rhythmic transcripts are under stronger selection, even controlling for expression level effect. As proposed by Horvath et al. (2019) and supported by results in mouse by Barroso et al. (2018) genes under strong selection could also be less tolerant to high noise of expression. Thus, periodic accumulation of mRNAs might be a way to periodically reduce expression noise of noise-sensitive genes (Fig 1c), i.e. genes under stronger selection. **However, our results are limited by the fact that noise estimation is based on a single time-point measurement since no scRNA time-series data are currently available for these species. Since the peak time of rhythmic transcripts is distributed across all times (Supporting Information Fig. S9a), the mean noise estimated at a given time-point includes the noise of the genes that are peaking at that time (lowest noise) and all the others that have a higher noise than those at their own peak time-point (Supporting Information Fig. S9b). Our results suggest that rhythmic genes peaking at the time-point of the scRNA measurement have sufficiently low noise for the mean noise of rhythmic genes to be much lower than that of non-rhythmic genes.
        • As a simple additional test of robustness of the rhythmic gene selection, biological replicates can be used, although this would not resolve the possible bias discussed above. As explained by the Authors, some of the datasets analyzed have biological replicates. It would be interesting to know the robustness of the detection method across replicates. How much is the set of genes identified as rhythmic conserved if estimated on different replicates? Spearman correlation or simply the overlap between the sets (maybe assessed with a hypergeometric test) can be used.

      These points have been already addressed in our 2020 paper https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1007666 (paragraph “The importance of having an informative dataset”) as well as in recent guidelines (Hughes et al. 2017). We specified in Methods that we considered replicates as new cycles as recommended.

      Minor comments:

      • The claim that "transcriptional noise is known to be the main driver of overall expression noise", which is present in the discussion is questionable.

      For example, the quantitative large-scale dataset referenced by the Authors for E.coli (Taniguchi et al) shows instead that the dominant source of noise is extrinsic for many of the genes tested.

      We have clarified in the main text that by “main driver of the overall noise” we refer to the relative contribution of transcriptional versus translational noise into the overall noise.

      We have also added the section 6.1 into Supporting Information document:

      • Relatively to translational noise, transcriptional noise is the main driver of the overall noise (Raj and van Oudenaarden 2008) and should give a good estimation of the output noise. Indeed, based on estimations of coefficient of variations (CV, cell-to-cell variations of protein level) for diverse transcription and translation rates in E. coli and S. cerevisiae, Hausser et al. (2019) have shown that for a fixed transcriptional rate, CV is almost constant for diverse translational rates. Thus, changes in protein level have little to no impact on gene expression noise. The availability of mRNA molecules seems to drive the final noise. I.e., comparatively to the noise caused by the translational activity, the availability of low number molecules such as transcriptional factors (subject to the stochasticity of diffusion and binding in the cell environment) is the main factor of the output cell-to-cell variation in protein abundances. And have modified the main text:

      Indeed, transcriptional noise, which we measure here, is known to be the main driver of overall expression **noise (Raj & van Oudenaarden 2008).

      • Relatively to translational noise, transcriptional noise is the main source of the overall noise (Raj & van Oudenaarden 2008) (section 6.1 in Supporting information) In addition, highly expressed proteins are all precisely expressed and they display little variation in noise (also shown by Hausser et al. (2019) who reused Taniguchi et al. (2010) data). The noise of these highly expressed proteins is also just above a limit which is the noise floor. This "noise floor" is dominated by extrinsic noise as suggested by Hausser et al. and Taniguchi et al.: “The extrinsic noise in the last three terms in Eq. 4 (of the noise floor) might originate from fluctuations in cellular components such as metabolites, ribosomes, and polymerases and dominates the noise of high copy proteins” (Taniguchi et al.). Thus, highly expressed proteins are precisely expressed and their residual noise is similar to the noise floor, which is due to the extrinsic noise (imperfect synchrony of cell states inherent or due to the environment).
      • We suggest to avoid explicit statements about a causal link between expression level and rhythmicity, as in the caption title of Figure 2. A detected correlation is not a proof of a causal relation.

      We have corrected the sentence as follows:

      Rhythmic proteins are costly proteins due to their high level of expression.

      • High level of expression is the main factor explaining the higher cost observed in rhythmic proteins.
        • Supplementary Figures attached at the end of the main text and Supplementary Figures in the Supporting Information file have the same numbering...so there are two different versions of Fig.S1 S2 etc.

      This complicates the work of the reader.

      We have modified the numbering of figures to make them easier to follow.

      -The legend of Fig 2 is missing (the legend is instead reported in Fig.S1).

      This is now corrected, thank you for your attention

      Other modifications:

      We also show how cost can explain the tissue-specificity of rhythmic gene expression. Indeed, the nycthemeral transcriptome has long been known to be tissue-specific (Zhang et al. 2014, Boyle et al. 2017, Korenˇciˇc et al. 2014), i.e. a given gene can be rhythmic in some tissues, and constantly or not expressed in others.

      • Furthermore, the nycthemeral transcriptome has long been known to be tissue-specific (Zhang et al. 2014, Boyle et al. 2017, Korenˇciˇc et al. 2014), i.e. a given gene can be rhythmic in some tissues, and constantly or not expressed in others. Here, we provide a first explanation for the tissue-specificity of rhythms in gene expression by showing that genes are more likely to be rhythmic in tissues where they are specifically highly expressed.